RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy18020
         (434 letters)



>gnl|CDD|203324 pfam05743, UEV, UEV domain.  This family includes the eukaryotic
           tumour susceptibility gene 101 protein (TSG101). Altered
           transcripts of this gene have been detected in sporadic
           breast cancers and many other human malignancies.
           However, the involvement of this gene in neoplastic
           transformation and tumorigenesis is still elusive.
           TSG101 is required for normal cell function of embryonic
           and adult tissues but that this gene is not a tumour
           suppressor for sporadic forms of breast cancer. This
           family is related to the ubiquitin conjugating enzymes.
          Length = 119

 Score =  111 bits (279), Expect = 1e-29
 Identities = 36/76 (47%), Positives = 48/76 (63%)

Query: 5   SGLVYHIPVCIWLMDTHPDHAPICYVRPTADMRINVSQNVDHTGKIYSPYLHTWTPQNSK 64
            G  Y+IP+ IWL DT+P   PIC+V+PT DM I V+ +VD  G++Y PYLH W   +S 
Sbjct: 44  RGNTYNIPILIWLPDTYPFSPPICFVKPTPDMVIKVNHHVDAQGRVYLPYLHNWNHPSSN 103

Query: 65  LEGLIKTLASAFSALP 80
           L  L++ LA  F   P
Sbjct: 104 LVDLVQELAQVFGEEP 119



 Score = 66.5 bits (163), Expect = 1e-13
 Identities = 22/51 (43%), Positives = 31/51 (60%)

Query: 101 IRPTADMRINVSQNVDHTGKIYSPYLHTWTAQNSKLEGLIKTLASAFSALP 151
           ++PT DM I V+ +VD  G++Y PYLH W   +S L  L++ LA  F   P
Sbjct: 69  VKPTPDMVIKVNHHVDAQGRVYLPYLHNWNHPSSNLVDLVQELAQVFGEEP 119


>gnl|CDD|117992 pfam09454, Vps23_core, Vps23 core domain.  ESCRT complexes form the
           main machinery driving protein sorting from endosomes to
           lysosomes. The core domain of the Vps23 subunit of the
           heterotrimeric ESCRT-I complex is a helical hairpin
           sandwiched in a fan-like formation between two other
           helical hairpins from Vps28 (pfam03997) and Vps37. Vps23
           gives ESCRT-I complex its stability.
          Length = 65

 Score = 96.2 bits (240), Expect = 8e-25
 Identities = 35/65 (53%), Positives = 45/65 (69%)

Query: 361 DEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFR 420
           DEA+    P+  Q+L    E+ A EDTIYY+  AL+RG IDLE YLK VR L+R+QF+ R
Sbjct: 1   DEAIVAEDPLLNQLLELVAEDHAIEDTIYYLERALKRGKIDLETYLKHVRELAREQFLKR 60

Query: 421 ALLFK 425
           AL+ K
Sbjct: 61  ALIQK 65


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 48.2 bits (115), Expect = 8e-06
 Identities = 26/122 (21%), Positives = 54/122 (44%), Gaps = 7/122 (5%)

Query: 276 SLKSAVEDKIRR--KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELER 333
           SL+   E   +   ++ E+  + +E+++ L++   EL +   +L    E ++ EK ELE 
Sbjct: 818 SLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELED 877

Query: 334 SISFLKEKETELDEILAVLT---EKTEVDVDEAVT--TTAPIYKQILNTFTEEAATEDTI 388
            +  L+E++ EL+E L  L     + + ++++            + L     E   E   
Sbjct: 878 ELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEE 937

Query: 389 YY 390
            Y
Sbjct: 938 EY 939



 Score = 47.8 bits (114), Expect = 8e-06
 Identities = 35/162 (21%), Positives = 68/162 (41%), Gaps = 22/162 (13%)

Query: 271 EHIKMSLKSAVEDKIR-----RKMNEQKVQAQEEIEILKQTENELN-------QGKFKLA 318
           + ++  L+   E+  R      ++ E+  +A++EIE LK    EL        +   +L 
Sbjct: 235 KELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELK 294

Query: 319 RMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEV---------DVDEAVTTTAP 369
              E ++ E + L   +  L+ +  EL+E L  L EK E           + E +     
Sbjct: 295 EEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLA 354

Query: 370 IYKQILNTFTEEAAT-EDTIYYMGEALRRGVIDLEVYLKQVR 410
             ++      E+ +   + +  + EALR  + +LE  L ++R
Sbjct: 355 ELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIR 396



 Score = 39.3 bits (92), Expect = 0.004
 Identities = 18/94 (19%), Positives = 43/94 (45%)

Query: 288 KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDE 347
           ++  +K + ++E++ L++ + EL +   +L      + +E  +L   +  L+ K   L+ 
Sbjct: 867 ELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEV 926

Query: 348 ILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEE 381
            L  L E+ E + ++ + T      + L    E 
Sbjct: 927 ELPELEEELEEEYEDTLETELEREIERLEEEIEA 960



 Score = 39.3 bits (92), Expect = 0.004
 Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 12/99 (12%)

Query: 271 EHIKMSLKSAVEDKIRRKM-----NEQKVQAQEEIEILKQTENELNQGKFKLARMF---- 321
           E ++  L+S  E   + K       E++   QEE+E L++   E  +    L R      
Sbjct: 761 EELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLE 820

Query: 322 ---ERIDKEKAELERSISFLKEKETELDEILAVLTEKTE 357
              ER+++E  ELE  I  L+EK  EL+E L  L ++ E
Sbjct: 821 QRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELE 859



 Score = 38.5 bits (90), Expect = 0.007
 Identities = 19/78 (24%), Positives = 40/78 (51%), Gaps = 3/78 (3%)

Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERID---KEKAELERSISFLK 339
           +++ R++ E K +     E L+Q ++ L + + +L  + E ++   +   ELE  +  L+
Sbjct: 712 EELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLE 771

Query: 340 EKETELDEILAVLTEKTE 357
           E   +L E +  L EK +
Sbjct: 772 EALAKLKEEIEELEEKRQ 789



 Score = 37.8 bits (88), Expect = 0.013
 Identities = 30/149 (20%), Positives = 61/149 (40%), Gaps = 22/149 (14%)

Query: 281 VEDKIR------RKMNEQKVQAQEEIEILKQ--------TENELNQGKFKLARMFERIDK 326
           +ED +        K+  Q  +A+   E+  +           +L + + +L  + E + +
Sbjct: 191 LEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKELEELEEELSR 250

Query: 327 EKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATED 386
            + ELE     L+E E E++E+ + L E  E +++E       + ++I          E 
Sbjct: 251 LEEELEELQEELEEAEKEIEELKSELEELRE-ELEELQEELLELKEEI-------EELEG 302

Query: 387 TIYYMGEALRRGVIDLEVYLKQVRSLSRK 415
            I  + E L     +LE   +++  L  K
Sbjct: 303 EISLLRERLEELENELEELEERLEELKEK 331



 Score = 37.0 bits (86), Expect = 0.019
 Identities = 21/90 (23%), Positives = 43/90 (47%), Gaps = 5/90 (5%)

Query: 273 IKMSLKSAVEDKIRRKMNEQKVQAQEEI--EILKQTENELNQGKFKLARMFERIDKEK-- 328
           ++  L+         K     ++ + E     L++ E EL + + +L  + ER+++ +  
Sbjct: 707 LRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEE 766

Query: 329 -AELERSISFLKEKETELDEILAVLTEKTE 357
              LE +++ LKE+  EL+E    L E+ E
Sbjct: 767 LESLEEALAKLKEEIEELEEKRQALQEELE 796



 Score = 36.6 bits (85), Expect = 0.025
 Identities = 24/143 (16%), Positives = 57/143 (39%), Gaps = 6/143 (4%)

Query: 271 EHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 330
           E ++  L+   E+    ++ E+  + +EE+E L++   +L +   +L    + + +E  E
Sbjct: 740 EELEEELEELEEE--LEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEE 797

Query: 331 LERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYY 390
           LE  +   + +   L+  L    E  E   +        + ++I     +    E+ +  
Sbjct: 798 LEEELEEAERRLDALEREL----ESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEE 853

Query: 391 MGEALRRGVIDLEVYLKQVRSLS 413
           + + L     +LE    +   L 
Sbjct: 854 LEKELEELKEELEELEAEKEELE 876



 Score = 35.1 bits (81), Expect = 0.082
 Identities = 28/160 (17%), Positives = 65/160 (40%), Gaps = 14/160 (8%)

Query: 270 DEHIKMSLK--SAVEDKIRRKMNEQKVQAQEEIEILKQ----TENELNQGKFKLARMFER 323
            E ++  L      ++++  K++    + +E  E L++     E EL + + +L  +   
Sbjct: 346 LEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKRE 405

Query: 324 IDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAA 383
           I+  +  LER    L++ + EL E+        E +++E  T    + +++     +   
Sbjct: 406 IESLEERLERLSERLEDLKEELKEL--------EAELEELQTELEELNEELEELEEQLEE 457

Query: 384 TEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQFMFRALL 423
             D +  +   L     +L+   K++ SL  +     A  
Sbjct: 458 LRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQ 497



 Score = 33.5 bits (77), Expect = 0.26
 Identities = 28/140 (20%), Positives = 61/140 (43%), Gaps = 7/140 (5%)

Query: 276 SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSI 335
            LK+ + +     +  +  + ++E+E L++  + L +   +L    E  +KE  EL+  +
Sbjct: 217 ELKAELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSEL 276

Query: 336 SFLKEKETELDEILAVLT---EKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMG 392
             L+E+  EL E L  L    E+ E ++         +  ++          ++ I    
Sbjct: 277 EELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKI---- 332

Query: 393 EALRRGVIDLEVYLKQVRSL 412
           EAL+  + + E  L+++  L
Sbjct: 333 EALKEELEERETLLEELEQL 352



 Score = 32.8 bits (75), Expect = 0.38
 Identities = 21/97 (21%), Positives = 45/97 (46%), Gaps = 11/97 (11%)

Query: 267  TITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKF-KLARMFERID 325
             + +E+   +L++ +E +I R         +EEIE L    N     ++ ++   +E + 
Sbjct: 934  ELEEEYE-DTLETELEREIER--------LEEEIEALGPV-NLRAIEEYEEVEERYEELK 983

Query: 326  KEKAELERSISFLKEKETELDEILAVLTEKTEVDVDE 362
             ++ +LE +   L E   ELD+      ++T   ++E
Sbjct: 984  SQREDLEEAKEKLLEVIEELDKEKRERFKETFDKINE 1020



 Score = 32.0 bits (73), Expect = 0.66
 Identities = 33/143 (23%), Positives = 62/143 (43%), Gaps = 12/143 (8%)

Query: 283 DKIRRKMNE---QKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLK 339
           D++  ++ E   +  + +EE+E L+  + EL     +L    E +++E  ELE  ++ LK
Sbjct: 845 DELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELK 904

Query: 340 EKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRR-G 398
           E+  +L E L  L  K E    E       + ++  +T   E   E  I  + E +   G
Sbjct: 905 EEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETEL--EREIERLEEEIEALG 962

Query: 399 VIDL------EVYLKQVRSLSRK 415
            ++L      E   ++   L  +
Sbjct: 963 PVNLRAIEEYEEVEERYEELKSQ 985



 Score = 30.8 bits (70), Expect = 1.9
 Identities = 23/134 (17%), Positives = 50/134 (37%), Gaps = 24/134 (17%)

Query: 286 RRKMNEQKVQAQEEIEILKQTENELNQGKFKLARM---FERIDKEKAELERSISFLKEKE 342
           RRK+        EE   + + +    + + KL R     ER++    ELE+ +  L+ + 
Sbjct: 157 RRKL-------IEEAAGVSKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQA 209

Query: 343 TELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDL 402
            + +    +  E  E+++         +  ++     E    E       E L R   +L
Sbjct: 210 EKAERYQELKAELRELELAL-------LLAKLKELRKELEELE-------EELSRLEEEL 255

Query: 403 EVYLKQVRSLSRKQ 416
           E   +++    ++ 
Sbjct: 256 EELQEELEEAEKEI 269



 Score = 30.5 bits (69), Expect = 2.1
 Identities = 18/79 (22%), Positives = 37/79 (46%), Gaps = 4/79 (5%)

Query: 279 SAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFL 338
            ++ED +     + + + + ++E LK+    L +   +L    E +++E  ELE  +  L
Sbjct: 698 RSLEDLLEELRRQLE-ELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEEL 756

Query: 339 KEKETELDEILAVLTEKTE 357
           +E    L+E+   L    E
Sbjct: 757 QE---RLEELEEELESLEE 772



 Score = 30.5 bits (69), Expect = 2.1
 Identities = 29/140 (20%), Positives = 64/140 (45%), Gaps = 11/140 (7%)

Query: 279 SAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERI---DKEKAELERSI 335
           +A+E+++ +  +  +   +EE+E L++   EL +   +L  + E +   ++  A+L+  I
Sbjct: 726 AALEEELEQLQSRLEEL-EEELEELEEELEELQE---RLEELEEELESLEEALAKLKEEI 781

Query: 336 SFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEAL 395
             L+EK   L E L  L E+ E    EA      + +++ +        E  I  + E +
Sbjct: 782 EELEEKRQALQEELEELEEELE----EAERRLDALERELESLEQRRERLEQEIEELEEEI 837

Query: 396 RRGVIDLEVYLKQVRSLSRK 415
                 L+   +++  L ++
Sbjct: 838 EELEEKLDELEEELEELEKE 857


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 45.5 bits (109), Expect = 3e-05
 Identities = 32/119 (26%), Positives = 55/119 (46%), Gaps = 4/119 (3%)

Query: 271 EHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 330
           E IK       +++I +  NE + + +E    L++ E  L Q +  L R  E ++K + E
Sbjct: 52  EAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEE 111

Query: 331 LERSISFLKEKETELDEILAVLTEKTEVDVDE----AVTTTAPIYKQILNTFTEEAATE 385
           LE+    L++K+ EL++    L E  E  + E    +  T     + +L    EEA  E
Sbjct: 112 LEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAEEAKEILLEKVEEEARHE 170


>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
          Length = 1560

 Score = 45.8 bits (108), Expect = 3e-05
 Identities = 42/187 (22%), Positives = 66/187 (35%), Gaps = 24/187 (12%)

Query: 151 PPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNV---NTPYPTYPPSA----- 202
           P   ++     P  NP  +    SN P     Y  +P SN    N P    PPS+     
Sbjct: 422 PGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSNAPPSSAKDHH 481

Query: 203 ---YPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYP----PGGSTPATPYPVPT 255
              +  Y   AA  P+ ++ P+   P A    G        P    P GS P       T
Sbjct: 482 SAYHAAYQHRAANQPAANL-PTANQPAANNFHGAAGNSVGNPFASRPFGSAPYGGNAATT 540

Query: 256 PSPA--------PAVTSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTE 307
             P         P   +     ++  + S++S+  +      NE   + +E   +LK+T 
Sbjct: 541 ADPNGIAKREDHPEGGTNRQKYEQSDEESVESSSSENSSENENEVTDKGEEIYSLLKKTI 600

Query: 308 NELNQGK 314
           N ++  K
Sbjct: 601 NRIDMNK 607



 Score = 44.7 bits (105), Expect = 1e-04
 Identities = 37/154 (24%), Positives = 50/154 (32%), Gaps = 20/154 (12%)

Query: 162 PATNPTYNGNMASNAPYPTQSYMPMPGSN---VNTPYPTYPPSAYPGYNPNAAAAPSGSI 218
             +NP YN    SN PY        P SN    N PY   P S  P  N   + AP  S 
Sbjct: 418 GNSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSNAPPSSA 477

Query: 219 --YPS-YPT----------PGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSG 265
             + S Y                P A   +    +   G++   P+       AP   + 
Sbjct: 478 KDHHSAYHAAYQHRAANQPAANLPTANQPAANNFHGAAGNSVGNPFASRPFGSAPYGGNA 537

Query: 266 GTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEE 299
            T  D     +  +  ED      N QK +  +E
Sbjct: 538 ATTADP----NGIAKREDHPEGGTNRQKYEQSDE 567



 Score = 36.2 bits (83), Expect = 0.034
 Identities = 29/101 (28%), Positives = 40/101 (39%), Gaps = 12/101 (11%)

Query: 162 PATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPS 221
           P +N +YN    SNA     +      S  N  Y + P ++ PGYN    +    +  P+
Sbjct: 383 PHSNASYNCAAYSNAAQSNAAQSNAGFS--NAGY-SNPGNSNPGYNNAPNSNTPYNNPPN 439

Query: 222 YPTPGATPGAGGYSPYPLYPPGGSTP--ATPYPVPTPSPAP 260
             TP + P      P    PP  + P   TPY     S AP
Sbjct: 440 SNTPYSNP------PNS-NPPYSNLPYSNTPYSNAPLSNAP 473


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 45.9 bits (109), Expect = 3e-05
 Identities = 19/94 (20%), Positives = 38/94 (40%), Gaps = 2/94 (2%)

Query: 266 GTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERID 325
           G    E  +  L    E ++     E+++  ++E   L++   EL +   +L      + 
Sbjct: 464 GQELPEEHEKELLELYELELEEL--EEELSREKEEAELREEIEELEKELRELEEELIELL 521

Query: 326 KEKAELERSISFLKEKETELDEILAVLTEKTEVD 359
           + +  L+  +    EK   L E L  L EK ++ 
Sbjct: 522 ELEEALKEELEEKLEKLENLLEELEELKEKLQLQ 555



 Score = 42.8 bits (101), Expect = 3e-04
 Identities = 28/146 (19%), Positives = 61/146 (41%), Gaps = 12/146 (8%)

Query: 269 TDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEK 328
             E +++      E+++R      + + +E+IE L++ E E+ +    L    E +    
Sbjct: 260 EIESLELEALKIREEELRELERLLE-ELEEKIERLEELEREIEE----LEEELEGLRALL 314

Query: 329 AELERSISFLKEKETELDEI---LAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATE 385
            ELE  +  LK  E  L+++   L  L  + E ++ E     A + ++ L    E     
Sbjct: 315 EELEELLEKLKSLEERLEKLEEKLEKLESELE-ELAEEKNELAKLLEERLKELEER---L 370

Query: 386 DTIYYMGEALRRGVIDLEVYLKQVRS 411
           + +    E     +  LE  +++++ 
Sbjct: 371 EELEKELEKALERLKQLEEAIQELKE 396



 Score = 39.7 bits (93), Expect = 0.003
 Identities = 30/141 (21%), Positives = 63/141 (44%), Gaps = 6/141 (4%)

Query: 276 SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSI 335
            L++ +E+    K   Q  Q +EE+  L+    EL +   +L  +  R ++ + EL   +
Sbjct: 537 KLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELE-ELRERL 595

Query: 336 SFLKEKETELDEILAVL-TEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEA 394
             LK+K  EL+E L+ L      +++ EA        ++       E    +    + E 
Sbjct: 596 KELKKKLKELEERLSQLEELLQSLELSEAENEL----EEAEEELESELEKLNLQAELEEL 651

Query: 395 LRRGVIDLEVYLKQVRSLSRK 415
           L+  + +LE  ++++ +  R+
Sbjct: 652 LQAALEELEEKVEELEAEIRR 672



 Score = 37.4 bits (87), Expect = 0.016
 Identities = 27/91 (29%), Positives = 43/91 (47%), Gaps = 10/91 (10%)

Query: 277 LKSAVEDKIRRKMNEQKVQA--QEEIEI-LKQTENELNQGKFKLARMFERIDKEKAE--- 330
           L+S +E+    K    K+     +E+E  L++ E EL +   +L ++ E I + K E   
Sbjct: 341 LESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAE 400

Query: 331 ----LERSISFLKEKETELDEILAVLTEKTE 357
               LE     L+E E EL+E+   L E  E
Sbjct: 401 LSAALEEIQEELEELEKELEELERELEELEE 431



 Score = 37.0 bits (86), Expect = 0.018
 Identities = 23/90 (25%), Positives = 42/90 (46%), Gaps = 11/90 (12%)

Query: 271 EHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEK-- 328
           E     L+  +E  +     E+  Q +E I+ LK+   EL +    L  + E +++ +  
Sbjct: 367 EERLEELEKELEKAL-----ERLKQLEEAIQELKE---ELAELSAALEEIQEELEELEKE 418

Query: 329 -AELERSISFLKEKETELDEILAVLTEKTE 357
             ELER +  L+E+  +L+E +  L  K  
Sbjct: 419 LEELERELEELEEEIKKLEEQINQLESKEL 448



 Score = 35.9 bits (83), Expect = 0.048
 Identities = 26/112 (23%), Positives = 52/112 (46%), Gaps = 22/112 (19%)

Query: 267 TITDEHIKMSLKSAVEDKIRRKMNEQKVQAQE-------EIEILKQTENELNQGKFKLAR 319
           +   E I+  L+ A+E++++     +++Q ++       EIE L++   EL + K +L  
Sbjct: 195 SELLEDIEDLLE-ALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEE 253

Query: 320 MFERIDKEK--------------AELERSISFLKEKETELDEILAVLTEKTE 357
           +  R+ + +               ELER +  L+EK   L+E+   + E  E
Sbjct: 254 LKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEE 305



 Score = 32.8 bits (75), Expect = 0.44
 Identities = 16/80 (20%), Positives = 37/80 (46%), Gaps = 2/80 (2%)

Query: 277 LKSAVEDKIRRKMNEQKVQA--QEEIEILKQTENELNQGKFKLARMFERIDKEKAELERS 334
           L+   E     K    ++++   E ++I ++   EL +   +L    ER+++ + E+E  
Sbjct: 244 LEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEEL 303

Query: 335 ISFLKEKETELDEILAVLTE 354
              L+     L+E+  +L +
Sbjct: 304 EEELEGLRALLEELEELLEK 323



 Score = 32.4 bits (74), Expect = 0.51
 Identities = 28/143 (19%), Positives = 56/143 (39%), Gaps = 9/143 (6%)

Query: 275 MSLKSAVEDKIRRKM-NEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELER 333
               S +E+ ++    +E + + +E  E L   E+EL +   +   + E +     ELE 
Sbjct: 606 EERLSQLEELLQSLELSEAENELEEAEEEL---ESELEKLNLQAE-LEELLQAALEELEE 661

Query: 334 SISFLKEKETELD-EILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMG 392
            +   +E E E+  E+  +  E+   +  E +       +Q+     E       I  + 
Sbjct: 662 KV---EELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQLI 718

Query: 393 EALRRGVIDLEVYLKQVRSLSRK 415
           E L     +LE   K++  L + 
Sbjct: 719 EELESRKAELEELKKELEKLEKA 741



 Score = 32.0 bits (73), Expect = 0.63
 Identities = 41/166 (24%), Positives = 70/166 (42%), Gaps = 29/166 (17%)

Query: 271 EHIKMSLKSAVEDKIRRKM--NEQKVQAQEEIEILKQTENELNQGKFKLA---------- 318
           E ++  ++  +E +IRR++   E + Q +E++E L+Q E EL Q + +L           
Sbjct: 657 EELEEKVE-ELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIE 715

Query: 319 RMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTF 378
           ++ E ++  KAELE     L++ E  L+  L  L EK       A      I + +L   
Sbjct: 716 QLIEELESRKAELEELKKELEKLEKALEL-LEELREKLGKAGLRAD-----ILRNLLAQI 769

Query: 379 TEEA----ATEDTIYYMGEAL-----RRGVIDLEVYLK-QVRSLSR 414
             EA    +      Y    L       G + + VY   +VR +  
Sbjct: 770 EAEANEILSKLSLNRYDLRRLTIRKDGNGGLVVVVYDGGEVRPIKT 815


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 45.8 bits (109), Expect = 4e-05
 Identities = 29/143 (20%), Positives = 60/143 (41%), Gaps = 10/143 (6%)

Query: 282 EDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEK 341
            DK   +  ++    +E++E LK+  NEL +   +L    +R+ +E A+L  +I+ ++ K
Sbjct: 376 VDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAK 435

Query: 342 ETELDEILAVLTEKTEVDVDEAVTTTA---PIYKQILNTFTEEAATEDTIYYMGEALRRG 398
             EL+E       + +    +     A      +++ +   E    E  +      L+R 
Sbjct: 436 INELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKEL----SKLQRE 491

Query: 399 VIDLEVYLKQVRSLSRKQFMFRA 421
           + + E    Q R+   +    RA
Sbjct: 492 LAEAE---AQARASEERVRGGRA 511



 Score = 40.8 bits (96), Expect = 0.001
 Identities = 26/128 (20%), Positives = 54/128 (42%), Gaps = 10/128 (7%)

Query: 284 KIRRKMNEQKVQA--------QEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSI 335
           K + K  E++++         +EE+E L+    +L      L +  + ++ +  ELER I
Sbjct: 846 KEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKI 905

Query: 336 SFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEAL 395
             L+ +  +  + L+ L  K E   +E ++       +      EE + ED +    + +
Sbjct: 906 EELEAQIEKKRKRLSELKAKLEALEEE-LSEIEDPKGEDEEIPEEELSLED-VQAELQRV 963

Query: 396 RRGVIDLE 403
              +  LE
Sbjct: 964 EEEIRALE 971



 Score = 37.7 bits (88), Expect = 0.012
 Identities = 19/78 (24%), Positives = 38/78 (48%), Gaps = 6/78 (7%)

Query: 276 SLKSAVEDKIRR--KMNEQKVQAQEEIEILK----QTENELNQGKFKLARMFERIDKEKA 329
           SL+ ++ +K R      E+  + + EI+ L     + E E+ + + +  ++ E   + K 
Sbjct: 305 SLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKE 364

Query: 330 ELERSISFLKEKETELDE 347
           ELE   + L+E + E  E
Sbjct: 365 ELEDLRAELEEVDKEFAE 382



 Score = 37.7 bits (88), Expect = 0.014
 Identities = 18/80 (22%), Positives = 40/80 (50%), Gaps = 8/80 (10%)

Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEK- 341
            +I  +++E   +  +    + + E E+ Q + +  ++ ER+     ELE  +S L+++ 
Sbjct: 698 RRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERL----EELEEDLSSLEQEI 753

Query: 342 ---ETELDEILAVLTEKTEV 358
              ++EL E+ A + E  E 
Sbjct: 754 ENVKSELKELEARIEELEED 773



 Score = 37.4 bits (87), Expect = 0.019
 Identities = 19/84 (22%), Positives = 36/84 (42%), Gaps = 9/84 (10%)

Query: 276 SLKSAVED---KIRRKMNEQKVQAQEE---IEILKQTENELNQGKFKLA---RMFERIDK 326
            L   + D   KI     E +   QEE    E L++ E +L+  + ++       + ++ 
Sbjct: 706 ELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEA 765

Query: 327 EKAELERSISFLKEKETELDEILA 350
              ELE  +  L+E   +L+  L+
Sbjct: 766 RIEELEEDLHKLEEALNDLEARLS 789



 Score = 35.8 bits (83), Expect = 0.051
 Identities = 19/90 (21%), Positives = 46/90 (51%), Gaps = 4/90 (4%)

Query: 276 SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSI 335
            L++ +   + R + E++ + ++  E L + E E+++   ++  +   I++E+   ++  
Sbjct: 298 ELEAEIA-SLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLT 356

Query: 336 SFLKEKETELDEILAVLTEKTEVDVDEAVT 365
               E + EL+++ A   E  EVD + A T
Sbjct: 357 EEYAELKEELEDLRA---ELEEVDKEFAET 383



 Score = 35.0 bits (81), Expect = 0.086
 Identities = 31/159 (19%), Positives = 64/159 (40%), Gaps = 24/159 (15%)

Query: 279 SAVEDKIRRKMN------EQKVQAQEEIEILKQTE-NELNQGKFKLARMFERIDKEKAEL 331
             + D+ R+++       E+  + Q  ++  ++ E  EL + K  L R  E I+++ A L
Sbjct: 190 DLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASL 249

Query: 332 ERSISFLK-----------EKETELDEILAVLTEKTEVDVDEAVTTTAPIYK---QILNT 377
           E  +  L            E E  L+E+   + +  E   +E +     I +   +I + 
Sbjct: 250 EEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGE---EEQLRVKEKIGELEAEIASL 306

Query: 378 FTEEAATEDTIYYMGEALRRGVIDLEVYLKQVRSLSRKQ 416
               A  E  +    E L +   +++  L ++  L R+ 
Sbjct: 307 ERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREI 345



 Score = 34.3 bits (79), Expect = 0.14
 Identities = 23/127 (18%), Positives = 57/127 (44%), Gaps = 6/127 (4%)

Query: 274 KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELER 333
           +++L+    +K  +++ EQ++  +E+I+ +++    LN  K +L    E ++    +LE 
Sbjct: 823 RLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLES 882

Query: 334 SISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAAT-EDTIYYMG 392
            +  LK++  EL+  L  L  K    ++E         ++ L+    +    E+ +  + 
Sbjct: 883 RLGDLKKERDELEAQLRELERK----IEEL-EAQIEKKRKRLSELKAKLEALEEELSEIE 937

Query: 393 EALRRGV 399
           +      
Sbjct: 938 DPKGEDE 944



 Score = 33.9 bits (78), Expect = 0.17
 Identities = 26/99 (26%), Positives = 45/99 (45%), Gaps = 14/99 (14%)

Query: 279 SAVEDKIRRKMNEQKVQAQEEIE-----------ILKQTENELNQGKFKLARMFERIDKE 327
             +  KI+    E++++ +E+I             + + E EL   + +LA++   IDK 
Sbjct: 275 EELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKL 334

Query: 328 KAE---LERSISFLKEKETELDEILAVLTEKTEVDVDEA 363
            AE   LER I   +++  +L E  A L E+ E    E 
Sbjct: 335 LAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAEL 373



 Score = 32.7 bits (75), Expect = 0.46
 Identities = 17/82 (20%), Positives = 37/82 (45%), Gaps = 7/82 (8%)

Query: 283 DKIRRKMNEQKV---QAQEEIEILKQTEN----ELNQGKFKLARMFERIDKEKAELERSI 335
           +++ R++ E++    +  EE   LK+       EL +   + A   + +   + +LE+  
Sbjct: 339 EELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLK 398

Query: 336 SFLKEKETELDEILAVLTEKTE 357
             + E + ELD +   L   +E
Sbjct: 399 REINELKRELDRLQEELQRLSE 420



 Score = 31.6 bits (72), Expect = 0.95
 Identities = 21/80 (26%), Positives = 36/80 (45%), Gaps = 8/80 (10%)

Query: 288 KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETEL-- 345
            +  +    Q E+  ++   +EL+Q     +R    I+KE  +LE+    LKE+  EL  
Sbjct: 685 GLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEE 744

Query: 346 ------DEILAVLTEKTEVD 359
                  EI  V +E  E++
Sbjct: 745 DLSSLEQEIENVKSELKELE 764



 Score = 31.6 bits (72), Expect = 1.2
 Identities = 21/88 (23%), Positives = 41/88 (46%), Gaps = 12/88 (13%)

Query: 268 ITDEHIKMSLK-SAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDK 326
           + +E  ++  +   +E K+ R   E K   ++EI+ L++   +L           E+I  
Sbjct: 803 LEEEVSRIEARLREIEQKLNRLTLE-KEYLEKEIQELQEQRIDLK----------EQIKS 851

Query: 327 EKAELERSISFLKEKETELDEILAVLTE 354
            + E+E      +E E EL+E+ A L +
Sbjct: 852 IEKEIENLNGKKEELEEELEELEAALRD 879



 Score = 30.0 bits (68), Expect = 3.0
 Identities = 14/46 (30%), Positives = 23/46 (50%)

Query: 312 QGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTE 357
               +L R+ ER++  K EL    S L+  E  LDE+   L++ + 
Sbjct: 671 SEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASR 716



 Score = 29.3 bits (66), Expect = 5.5
 Identities = 15/74 (20%), Positives = 35/74 (47%), Gaps = 1/74 (1%)

Query: 282 EDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEK 341
             +++R + E+    + E+  L+     +     +L++      ++  E+E+ I  L+++
Sbjct: 673 PAELQR-LRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQE 731

Query: 342 ETELDEILAVLTEK 355
           E +L E L  L E 
Sbjct: 732 EEKLKERLEELEED 745


>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
           binding.  This is the C-terminal conserved coiled coil
           region of a family of TATA element modulatory factor 1
           proteins conserved in eukaryotes. The proteins bind to
           the TATA element of some RNA polymerase II promoters and
           repress their activity. by competing with the binding of
           TATA binding protein. TMF1_TATA_bd is the most conserved
           part of the TMFs. TMFs are evolutionarily conserved
           golgins that bind Rab6, a ubiquitous ras-like
           GTP-binding Golgi protein, and contribute to Golgi
           organisation in animal and plant cells. The Rab6-binding
           domain appears to be the same region as this C-terminal
           family.
          Length = 121

 Score = 42.6 bits (101), Expect = 4e-05
 Identities = 20/73 (27%), Positives = 34/73 (46%), Gaps = 1/73 (1%)

Query: 285 IRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETE 344
           +RR   E     ++E+  L+   +E  Q   KL    E + + K E+E     L++ E  
Sbjct: 26  LRRLEGE-LASLKDELARLEAERDEARQEIVKLTEENEELRELKKEIEELEKELEDLEQR 84

Query: 345 LDEILAVLTEKTE 357
            +  L +L EK+E
Sbjct: 85  YETTLELLGEKSE 97


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 43.9 bits (104), Expect = 9e-05
 Identities = 29/133 (21%), Positives = 45/133 (33%), Gaps = 15/133 (11%)

Query: 160 QEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIY 219
           ++     T + N          S  P    + +   P++P       +P+ ++ P     
Sbjct: 159 EDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPP---A 215

Query: 220 PSYPTPGATPGAGGYSPYPLYPPGGSTPATPY---------PVPTPSPA---PAVTSGGT 267
           PS       P +      P  PPG + P  P            PTP  A   PA   G T
Sbjct: 216 PSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGIT 275

Query: 268 ITDEHIKMSLKSA 280
           + D+ I  + K A
Sbjct: 276 LDDDAIAKAQKHA 288


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 43.5 bits (103), Expect = 2e-04
 Identities = 31/132 (23%), Positives = 51/132 (38%), Gaps = 18/132 (13%)

Query: 286 RRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETEL 345
             ++ E+  + Q+E+  L    + L Q K  L      ++++  ELE  +  L+ K  EL
Sbjct: 276 VSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDEL 335

Query: 346 DEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVY 405
            E LA L EK E              + +     E  A         E L   + +LE  
Sbjct: 336 AEELAELEEKLEE--------LKEELESLEAELEELEAEL-------EELESRLEELE-- 378

Query: 406 LKQVRSLSRKQF 417
            +Q+ +L  K  
Sbjct: 379 -EQLETLRSKVA 389



 Score = 40.4 bits (95), Expect = 0.002
 Identities = 22/149 (14%), Positives = 54/149 (36%), Gaps = 6/149 (4%)

Query: 275 MSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERS 334
           +  +    +    ++  +  + +E++E L+    +L      L    ER++     LE  
Sbjct: 356 LEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDR 415

Query: 335 ISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEA 394
              L++      EI  +L +  E ++ E       + +++     E    E+ +  + E 
Sbjct: 416 RERLQQ------EIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREE 469

Query: 395 LRRGVIDLEVYLKQVRSLSRKQFMFRALL 423
           L      L+   +++  L  +      L 
Sbjct: 470 LEEAEQALDAAERELAQLQARLDSLERLQ 498



 Score = 38.1 bits (89), Expect = 0.009
 Identities = 17/85 (20%), Positives = 36/85 (42%), Gaps = 6/85 (7%)

Query: 277 LKSAVEDKIRRKMN--EQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERS 334
           L+  +E          E+  +A++ ++  ++   +L      L R+ E ++         
Sbjct: 452 LQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEG----FSEG 507

Query: 335 ISFLKEKETELDEILAVLTEKTEVD 359
           +  L + ++ L  IL VL+E   VD
Sbjct: 508 VKALLKNQSGLSGILGVLSELISVD 532



 Score = 37.3 bits (87), Expect = 0.017
 Identities = 24/131 (18%), Positives = 54/131 (41%), Gaps = 20/131 (15%)

Query: 276 SLKSAVEDKIRRKM---------NEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDK 326
           +L + +    ++K            Q  + + ++E L+   +EL     +LA +    ++
Sbjct: 292 ALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDEL---AEELAEL----EE 344

Query: 327 EKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATED 386
           +  EL+  +  L+ +  EL+  L  L E    +++E + T      Q+     + A+  +
Sbjct: 345 KLEELKEELESLEAELEELEAELEEL-ESRLEELEEQLETLRSKVAQLEL---QIASLNN 400

Query: 387 TIYYMGEALRR 397
            I  +   L R
Sbjct: 401 EIERLEARLER 411



 Score = 36.2 bits (84), Expect = 0.038
 Identities = 23/85 (27%), Positives = 43/85 (50%), Gaps = 3/85 (3%)

Query: 276 SLKSAVEDKIRR--KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELER 333
           SL+  +    RR   + EQ  +  E+IE L     EL +   +L    E +  E+A LE 
Sbjct: 828 SLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEE 887

Query: 334 SISFLKEKETELD-EILAVLTEKTE 357
           +++ L+ +  EL  E+  + ++++E
Sbjct: 888 ALALLRSELEELSEELRELESKRSE 912



 Score = 34.3 bits (79), Expect = 0.16
 Identities = 15/64 (23%), Positives = 31/64 (48%), Gaps = 7/64 (10%)

Query: 292 QKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAV 351
           +  + +EE+E L++   E  +         E +  E  ELE  +  L+ + +EL+E +  
Sbjct: 233 RLEELREELEELQEELKEAEE-------ELEELTAELQELEEKLEELRLEVSELEEEIEE 285

Query: 352 LTEK 355
           L ++
Sbjct: 286 LQKE 289



 Score = 31.2 bits (71), Expect = 1.4
 Identities = 22/132 (16%), Positives = 42/132 (31%), Gaps = 26/132 (19%)

Query: 280 AVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLK 339
              +++ R+++  +    +++  L+           +L     ++ KE  ELE       
Sbjct: 719 KELEELSRQISALR----KDLARLEAEVE-------QLEERIAQLSKELTELE------- 760

Query: 340 EKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGV 399
            +  EL+E L    E+      E     A I         EE           + LR  +
Sbjct: 761 AEIEELEERLEEAEEELAEAEAEIEELEAQI-----EQLKEELKALREAL---DELRAEL 812

Query: 400 IDLEVYLKQVRS 411
             L      +R 
Sbjct: 813 TLLNEEAANLRE 824



 Score = 30.4 bits (69), Expect = 2.1
 Identities = 18/106 (16%), Positives = 36/106 (33%), Gaps = 24/106 (22%)

Query: 276 SLKSAVEDKI---RRKMNEQKVQAQEEIEILKQTENELNQGKFKLA-------------- 318
              +  E +I     ++ + K + +   E L +   EL     + A              
Sbjct: 775 EELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIA 834

Query: 319 ---RMFERIDKEKAELERSISFL----KEKETELDEILAVLTEKTE 357
              R  E ++++  EL   I  L    +E E  ++E+ + L     
Sbjct: 835 ATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLN 880



 Score = 30.4 bits (69), Expect = 2.4
 Identities = 31/151 (20%), Positives = 63/151 (41%), Gaps = 24/151 (15%)

Query: 281 VED---KIRRKMN---EQKVQAQEEIEI---LKQTE-----NELNQGKFKLARMFERIDK 326
           +ED   ++ R++     Q  +A+   E+   L++ E       L + + +L  + E + +
Sbjct: 191 LEDILNELERQLKSLERQAEKAERYKELKAELRELELALLVLRLEELREELEELQEELKE 250

Query: 327 EKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAAT-E 385
            + ELE   + L+E E +L+E+   ++E     ++E +       ++ L     E +  E
Sbjct: 251 AEEELEELTAELQELEEKLEELRLEVSE-----LEEEIE----ELQKELYALANEISRLE 301

Query: 386 DTIYYMGEALRRGVIDLEVYLKQVRSLSRKQ 416
                + E L      LE    Q+  L  K 
Sbjct: 302 QQKQILRERLANLERQLEELEAQLEELESKL 332



 Score = 30.4 bits (69), Expect = 2.6
 Identities = 22/158 (13%), Positives = 56/158 (35%), Gaps = 20/158 (12%)

Query: 263  TSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFE 322
                 +  E   +  + A  ++    +  +  +  EE+  L+   +EL +   +L     
Sbjct: 866  ELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLA 925

Query: 323  RIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEA 382
            +++     L          E  +D +   L+E+  + ++EA      I         +E 
Sbjct: 926  QLELRLEGL----------EVRIDNLQERLSEEYSLTLEEAEALENKIE-------DDEE 968

Query: 383  ATEDTIYYMGEALRR-GVIDLEVYLKQVRSLS-RKQFM 418
                 +  +   ++  G ++L   +++   L  R  F+
Sbjct: 969  EARRRLKRLENKIKELGPVNLAA-IEEYEELKERYDFL 1005


>gnl|CDD|110537 pfam01540, Lipoprotein_7, Adhesin lipoprotein.  This family
           consists of the p50 and variable adherence-associated
           antigen (Vaa) adhesins from Mycoplasma hominis. M.
           hominis is a mycoplasma associated with human urogenital
           diseases, pneumonia, and septic arthritis. An adhesin is
           a cell surface molecule that mediates adhesion to other
           cells or to the surrounding surface or substrate. The
           Vaa antigen is a 50-kDa surface lipoprotein that has
           four tandem repetitive DNA sequences encoding a periodic
           peptide structure, and is highly immunogenic in the
           human host. p50 is also a 50-kDa lipoprotein, having
           three repeats A,B and C, that may be a tetramer of
           191-kDa in its native environment.
          Length = 353

 Score = 42.8 bits (100), Expect = 2e-04
 Identities = 33/136 (24%), Positives = 63/136 (46%), Gaps = 16/136 (11%)

Query: 260 PAVTSGGTITDEHIKMSLKSAVEDK---IRRKMNEQKVQAQEEIEILKQTENE------- 309
           PA+ S  +   E+ K + K A++DK   I +++ E+  + Q  IE LK+  NE       
Sbjct: 89  PAIISKLSAAVENAK-NEKKAIDDKNAQIAKELAERNAKIQSNIEELKKINNEAFELSKT 147

Query: 310 LNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAP 369
           +N+   ++ + F+     K +LE     L +K  ++DE   V + +    + E  +    
Sbjct: 148 VNKTIAEVEKKFKIPKDFKEQLENFADDLLDKSRQIDEFTTVTSTQEGFTLAELES---- 203

Query: 370 IYKQILNTFTEEAATE 385
            +K+I  T+     +E
Sbjct: 204 -FKEITTTWFNGMKSE 218


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
           family is found in eukaryotes. It is a coiled-coil
           domain of unknwon function.
          Length = 126

 Score = 40.7 bits (96), Expect = 2e-04
 Identities = 27/114 (23%), Positives = 47/114 (41%), Gaps = 23/114 (20%)

Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQG-----KF------KLARMFERIDKEKA-- 329
           D  R +   ++   ++  E L++ E EL +      KF      K  R  ++ ++EK   
Sbjct: 17  DAKREEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRRRAEKKAEEEKKLR 76

Query: 330 -ELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEA 382
            E E  I  LK +  EL   +    EK E  ++E        Y++ L +  E +
Sbjct: 77  KEKEEEIKELKAELEELKAEI----EKLEEKLEEYQ-----PYEEFLESVVERS 121


>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2).  DAZ
           associated protein 2 has a highly conserved sequence
           throughout evolution including a conserved polyproline
           region and several SH2/SH3 binding sites. It occurs as a
           single copy gene with a four-exon organisation and is
           located on chromosome 12. It encodes a ubiquitously
           expressed protein and binds to DAZ and DAZL1 through DAZ
           repeats.
          Length = 136

 Score = 39.1 bits (91), Expect = 8e-04
 Identities = 27/99 (27%), Positives = 32/99 (32%), Gaps = 2/99 (2%)

Query: 147 FSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGY 206
           +   PP YS   +      P      AS        Y+PM       P  ++PP AY   
Sbjct: 1   YPDAPPAYSELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPI 60

Query: 207 NPNAAAAPSGSIYPSYPT--PGATPGAGGYSPYPLYPPG 243
                  P GS          GA  G G  S  P  PPG
Sbjct: 61  GAPPPVYPPGSTVLVQGGFDAGARFGPGTGSSIPPPPPG 99



 Score = 29.0 bits (65), Expect = 2.3
 Identities = 16/65 (24%), Positives = 23/65 (35%), Gaps = 1/65 (1%)

Query: 195 YPTYPPSAYPGYNPNAAAAPSGSIYPSY-PTPGATPGAGGYSPYPLYPPGGSTPATPYPV 253
           YP  PP+    Y P+    P   +  +  P PG +          + P     P   YP+
Sbjct: 1   YPDAPPAYSELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPI 60

Query: 254 PTPSP 258
             P P
Sbjct: 61  GAPPP 65


>gnl|CDD|114219 pfam05483, SCP-1, Synaptonemal complex protein 1 (SCP-1).
           Synaptonemal complex protein 1 (SCP-1) is the major
           component of the transverse filaments of the
           synaptonemal complex. Synaptonemal complexes are
           structures that are formed between homologous
           chromosomes during meiotic prophase.
          Length = 787

 Score = 41.2 bits (96), Expect = 8e-04
 Identities = 34/126 (26%), Positives = 56/126 (44%), Gaps = 22/126 (17%)

Query: 298 EEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTE 357
           EE+   +Q   E N+ + K+  M   + K+ +ELE    F   KE EL+E+  +L E  +
Sbjct: 362 EELLRTEQQRLEKNEDQLKIITM--ELQKKSSELEEMTKFKNNKEVELEELKKILAEDEK 419

Query: 358 VDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEV-----------YL 406
           + +DE         KQ      E    E  + ++ +A  + + DLE+           YL
Sbjct: 420 L-LDEK--------KQFEKIAEELKGKEQELIFLLQAREKEIHDLEIQLTAIKTSEEHYL 470

Query: 407 KQVRSL 412
           K+V  L
Sbjct: 471 KEVEDL 476



 Score = 30.5 bits (68), Expect = 2.0
 Identities = 28/98 (28%), Positives = 49/98 (50%), Gaps = 10/98 (10%)

Query: 268 ITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKE 327
           IT++  KM   + + ++ R K N    Q +E+ ++  +   EL + K  L +  E I   
Sbjct: 249 ITEKENKMKDLTFLLEESRDKAN----QLEEKTKLQDENLKELIEKKDHLTKELEDI--- 301

Query: 328 KAELERSISFLKEKETELD---EILAVLTEKTEVDVDE 362
           K  L+RS+S  K  E +L    + +  LTE+ E  ++E
Sbjct: 302 KMSLQRSMSTQKALEEDLQIATKTICQLTEEKEAQMEE 339



 Score = 28.5 bits (63), Expect = 8.1
 Identities = 20/65 (30%), Positives = 35/65 (53%), Gaps = 5/65 (7%)

Query: 291 EQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILA 350
           E+ + A EE+ +  Q EN   +  FKL    E+I   + E ++ I+   +KE ++  +L 
Sbjct: 193 EKMILAFEELRV--QAENARLEMHFKLKEDHEKIQHLEEEYKKEIN---DKEKQVSLLLI 247

Query: 351 VLTEK 355
            +TEK
Sbjct: 248 QITEK 252


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 41.5 bits (98), Expect = 9e-04
 Identities = 34/158 (21%), Positives = 49/158 (31%), Gaps = 15/158 (9%)

Query: 144 ASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAY 203
           A A +A     ++                A+ AP       P P      P P     A 
Sbjct: 393 APAAAAPSAAAAAPAAAPAPAAAA-PAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAG 451

Query: 204 PGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTP-SPAPAV 262
              +P  AAAPS       P P         +P P   P  + PA P P   P +PA   
Sbjct: 452 GAPSPPPAAAPSAQ-----PAPAPAA-----APEPTAAPAPAPPAAPAPAAAPAAPAAPA 501

Query: 263 TSGGTITDEHIKMS---LKSAVEDKIRRKMNEQKVQAQ 297
              G      ++     + +AV  + R+       +A 
Sbjct: 502 APAGADDAATLRERWPEILAAVPKRSRKTWAILLPEAT 539



 Score = 40.7 bits (96), Expect = 0.001
 Identities = 19/97 (19%), Positives = 27/97 (27%), Gaps = 3/97 (3%)

Query: 170 GNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATP 229
             +A  A  P  +      +    P     P+A       A A  +       P P   P
Sbjct: 385 LGVAGGAGAPAAAAPSAAAA---APAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAP 441

Query: 230 GAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGG 266
            +   +      P     A P   P P+PA A     
Sbjct: 442 PSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTA 478



 Score = 35.7 bits (83), Expect = 0.057
 Identities = 26/116 (22%), Positives = 37/116 (31%), Gaps = 10/116 (8%)

Query: 147 FSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQ-SYMPMPGSNVNTPYPTYPPSAYPG 205
            S+ PP  ++R     A         A  A  P + S  P PG      +P +       
Sbjct: 604 ASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDAS 663

Query: 206 YNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPA 261
              +   A +G   P+ P P   P A         P   +  A   P P P+  P 
Sbjct: 664 DGGDGWPAKAGGAAPAAPPPAPAPAA---------PAAPAGAAPAQPAPAPAATPP 710



 Score = 34.6 bits (80), Expect = 0.11
 Identities = 16/75 (21%), Positives = 21/75 (28%), Gaps = 3/75 (4%)

Query: 185 PMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGG 244
           P        P P           P A AAP+    P+ P P     A   +     P   
Sbjct: 592 PGAAGGEGPPAPASSGPPEEAARPAAPAAPAA---PAAPAPAGAAAAPAEASAAPAPGVA 648

Query: 245 STPATPYPVPTPSPA 259
           +    P  V  P  +
Sbjct: 649 APEHHPKHVAVPDAS 663



 Score = 31.9 bits (73), Expect = 0.75
 Identities = 21/101 (20%), Positives = 26/101 (25%), Gaps = 6/101 (5%)

Query: 173 ASNAPYPTQSYMPMPGSNVNTP----YPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGAT 228
           A+    P       P      P     P  P +  P     A A  S +  P    P   
Sbjct: 594 AAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHH 653

Query: 229 PGAGGYSPYPLYPPGGSTPATPYPV--PTPSPAPAVTSGGT 267
           P             G    A       P P+PAPA  +   
Sbjct: 654 PKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPA 694



 Score = 30.7 bits (70), Expect = 1.6
 Identities = 23/89 (25%), Positives = 28/89 (31%), Gaps = 3/89 (3%)

Query: 173 ASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAG 232
           A   P P  +  P P    + P    PP A  G +  + AA      P  P     P   
Sbjct: 697 APAQPAPAPAATP-PAGQADDPAAQ-PPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGA 754

Query: 233 GYSPYP-LYPPGGSTPATPYPVPTPSPAP 260
              P P   P   + PA   P   PS   
Sbjct: 755 PAQPPPPPAPAPAAAPAAAPPPSPPSEEE 783



 Score = 30.3 bits (69), Expect = 2.3
 Identities = 20/98 (20%), Positives = 26/98 (26%), Gaps = 9/98 (9%)

Query: 176 APYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYS 235
              P         +    P    P    P       A  +       P       A   +
Sbjct: 675 GAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPA 734

Query: 236 --------PYPLYPPGGS-TPATPYPVPTPSPAPAVTS 264
                   P P  PP  +  PA P P P P+PA A  +
Sbjct: 735 ADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAA 772



 Score = 29.6 bits (67), Expect = 4.3
 Identities = 14/70 (20%), Positives = 14/70 (20%), Gaps = 2/70 (2%)

Query: 200 PSAYPGYNPNAAAA--PSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPS 257
             A  G  P AA    P        P   A P A      P  P      A P       
Sbjct: 584 VEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAP 643

Query: 258 PAPAVTSGGT 267
                     
Sbjct: 644 APGVAAPEHH 653


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 41.5 bits (97), Expect = 0.001
 Identities = 32/131 (24%), Positives = 44/131 (33%), Gaps = 6/131 (4%)

Query: 144  ASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPM---PGSNVNTPYPTYPP 200
            A+A  A P L ++         P   G  A  A  PT +  P    P +    P      
Sbjct: 2726 AAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR 2785

Query: 201  SAYPGYNPNAAAAPSGSIYPSYPTPGATPGAG---GYSPYPLYPPGGSTPATPYPVPTPS 257
             A    + +  + PS       P     P A      SP    PP  S   T  P P   
Sbjct: 2786 PAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGP 2845

Query: 258  PAPAVTSGGTI 268
            P P++  GG++
Sbjct: 2846 PPPSLPLGGSV 2856



 Score = 38.0 bits (88), Expect = 0.012
 Identities = 27/104 (25%), Positives = 35/104 (33%), Gaps = 7/104 (6%)

Query: 162  PATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPS 221
             A  PT  G++ S A  P     P P  +        PP           A+P+    P+
Sbjct: 2686 RAARPTV-GSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAA----RQASPALPAAPA 2740

Query: 222  YPTPGATPGAGGYSPYPLYPPGGSTPATPYP--VPTPSPAPAVT 263
             P   A P   G    P  PP  + P  P P   P   P   +T
Sbjct: 2741 PPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLT 2784



 Score = 37.2 bits (86), Expect = 0.022
 Identities = 34/131 (25%), Positives = 44/131 (33%), Gaps = 9/131 (6%)

Query: 142  TLASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPS 201
            T A   +  PP   +       T P       S    P+      P + V  P    PP+
Sbjct: 2762 TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPA 2821

Query: 202  AYP--GYNPNAAAAPSGSIYPSYPTPGATPGAGGYSP---YPLYPPGGS---TPATPYPV 253
            A P     P  +A P+    P  P P + P  G  +P       PP  S    PA P   
Sbjct: 2822 ASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARP 2881

Query: 254  PTPS-PAPAVT 263
            P      PAV+
Sbjct: 2882 PVRRLARPAVS 2892



 Score = 36.5 bits (84), Expect = 0.030
 Identities = 42/189 (22%), Positives = 53/189 (28%), Gaps = 4/189 (2%)

Query: 78   ALPPLYSSRTKQEPATNPTYNGNIRPTADMRINVSQNVDHTGKIYSPYLHTWTAQNSKLE 137
            A  P   S T       P       P A +            +  SP L    A  +   
Sbjct: 2687 AARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPA 2746

Query: 138  GLIKTLASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPT 197
            G       A  A PP  +      P   P          P    S      S  +   P 
Sbjct: 2747 GPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRP-AVASLSESRESLPSPWDPA 2805

Query: 198  YPPSAYPGYNP--NAAAAPSGSI-YPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVP 254
             PP+A          AA+P+G +  P+   P A P   G  P  L   G   P       
Sbjct: 2806 DPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRR 2865

Query: 255  TPSPAPAVT 263
             PS +PA  
Sbjct: 2866 PPSRSPAAK 2874



 Score = 36.5 bits (84), Expect = 0.031
 Identities = 26/112 (23%), Positives = 35/112 (31%), Gaps = 6/112 (5%)

Query: 161  EPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYP 220
            EPA +   +       P   +   P   +    P     P A PG  P   A P  +  P
Sbjct: 2709 EPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGP-ATPG-GPARPARPPTTAGP 2766

Query: 221  SYPTPGATPGAG---GYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGTIT 269
              P P A P AG     +   +     S  + P P   P+  PA        
Sbjct: 2767 PAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSP-WDPADPPAAVLAPAAA 2817



 Score = 35.3 bits (81), Expect = 0.073
 Identities = 37/132 (28%), Positives = 44/132 (33%), Gaps = 20/132 (15%)

Query: 128 TWTAQNSKLEGLIKTLASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMP 187
           TWT  +S LE L         A  P    R+ +  AT P   G    +   P     P+P
Sbjct: 358 TWTPPSS-LEDLSAGRHHPKRASLPTRKRRSARHAAT-PFARGPGGDDQTRPAA---PVP 412

Query: 188 GSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTP 247
            S    P P   P         A+A P     P+ P P A PG     P P        P
Sbjct: 413 AS---VPTPAPTPVP-------ASAPPP----PATPLPSAEPG-SDDGPAPPPERQPPAP 457

Query: 248 ATPYPVPTPSPA 259
           AT      P  A
Sbjct: 458 ATEPAPDDPDDA 469



 Score = 34.5 bits (79), Expect = 0.13
 Identities = 28/121 (23%), Positives = 34/121 (28%), Gaps = 18/121 (14%)

Query: 149  ALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNP 208
            ALPP    R  Q  A             P P Q     P      P P  PP   P   P
Sbjct: 2899 ALPPDQPERPPQPQA-------------PPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAP 2945

Query: 209  NAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSP-----APAVT 263
                A +G    + P P       G    P +      P+   P  +  P        V+
Sbjct: 2946 TTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVS 3005

Query: 264  S 264
            S
Sbjct: 3006 S 3006



 Score = 34.1 bits (78), Expect = 0.15
 Identities = 23/91 (25%), Positives = 27/91 (29%), Gaps = 6/91 (6%)

Query: 177  PYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSP 236
            P P  +    P  +V  P P   PS      P   +       P        P      P
Sbjct: 2554 PLPPAAPPAAPDRSVPPPRPAPRPS-----EPAVTSRARRPDAPPQSARPRAPVDDRGDP 2608

Query: 237  YPLYPPGGSTPATPYP-VPTPSPAPAVTSGG 266
                PP    P T  P  P PSP+PA     
Sbjct: 2609 RGPAPPSPLPPDTHAPDPPPPSPSPAANEPD 2639



 Score = 34.1 bits (78), Expect = 0.16
 Identities = 18/70 (25%), Positives = 25/70 (35%), Gaps = 3/70 (4%)

Query: 193 TPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPY- 251
             +   P +  PG   +    P+  +  S PTP  TP      P P  P   + P +   
Sbjct: 388 ARHAATPFARGPG--GDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDG 445

Query: 252 PVPTPSPAPA 261
           P P P   P 
Sbjct: 446 PAPPPERQPP 455



 Score = 33.8 bits (77), Expect = 0.24
 Identities = 31/126 (24%), Positives = 40/126 (31%), Gaps = 15/126 (11%)

Query: 148  SALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPM-----PGSNVNTPYPTYPPSA 202
            +ALPP  S      P T+           P P    +P+     PG +V    P+  P+A
Sbjct: 2816 AALPPAASPAGPLPPPTSAQPTA--PPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAA 2873

Query: 203  YPGYNPN--------AAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVP 254
             P              A + S   +   P     P      P P   P    P  P P P
Sbjct: 2874 KPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPP 2933

Query: 255  TPSPAP 260
             P P P
Sbjct: 2934 PPPPRP 2939



 Score = 33.4 bits (76), Expect = 0.31
 Identities = 34/133 (25%), Positives = 39/133 (29%), Gaps = 22/133 (16%)

Query: 151  PPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTP-------YPTYPPSAY 203
            PP  S      PA N        +  P       P PG  V+ P             S  
Sbjct: 2626 PPPPSPS----PAANEPDPHPPPTVPPPERPRDDPAPG-RVSRPRRARRLGRAAQASSPP 2680

Query: 204  PGYNPNAAAAPSGSIY----PSYPTPGATPGAGGYSPYPLYPPG-----GSTPATPY-PV 253
                  AA    GS+     P  P P   P           PPG      ++PA P  P 
Sbjct: 2681 QRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPA 2740

Query: 254  PTPSPAPAVTSGG 266
            P   PA   T GG
Sbjct: 2741 PPAVPAGPATPGG 2753



 Score = 33.0 bits (75), Expect = 0.38
 Identities = 20/95 (21%), Positives = 28/95 (29%), Gaps = 6/95 (6%)

Query: 173  ASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAG 232
            A + P P     P    + + P P   P       P+  A  S +  P  P   A P A 
Sbjct: 2548 AGDPPPPLPPAAPPAAPDRSVPPPRPAP------RPSEPAVTSRARRPDAPPQSARPRAP 2601

Query: 233  GYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGT 267
                     P   +P  P       P P+ +    
Sbjct: 2602 VDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAAN 2636



 Score = 32.6 bits (74), Expect = 0.55
 Identities = 22/68 (32%), Positives = 24/68 (35%), Gaps = 4/68 (5%)

Query: 199  PPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPY-PLYPPGGSTPATPYPVPTPS 257
            PP   P   P  A     S  P  P P A   A    P  P  P   + PATP   P   
Sbjct: 2701 PPP--PPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATP-GGPARP 2757

Query: 258  PAPAVTSG 265
              P  T+G
Sbjct: 2758 ARPPTTAG 2765



 Score = 31.8 bits (72), Expect = 0.96
 Identities = 30/129 (23%), Positives = 34/129 (26%), Gaps = 17/129 (13%)

Query: 147  FSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGY 206
                PP  S   K      P             T+S+   P      P P  PP   P  
Sbjct: 2862 VRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQP 2921

Query: 207  NPNAAAAPSGSIYPSYPTPG----------ATPGAGGYSPYPLYPPGGSTPATPYPVP-- 254
             P     P     P  P P              GAG  S     P  G+       VP  
Sbjct: 2922 QPPPPPQPQ----PPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRF 2977

Query: 255  -TPSPAPAV 262
              P PAP+ 
Sbjct: 2978 RVPQPAPSR 2986



 Score = 31.4 bits (71), Expect = 1.2
 Identities = 21/118 (17%), Positives = 27/118 (22%), Gaps = 4/118 (3%)

Query: 151  PPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPY----PTYPPSAYPGY 206
            PP          +  P   G  A+    P     P P +    P     P  P       
Sbjct: 2705 PPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA 2764

Query: 207  NPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTS 264
             P A A P+               A         P        P  V  P+ A    +
Sbjct: 2765 GPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAA 2822



 Score = 30.3 bits (68), Expect = 3.1
 Identities = 23/97 (23%), Positives = 25/97 (25%), Gaps = 8/97 (8%)

Query: 173  ASNAPYPTQSYMPMPGSNVNTPY---PTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGAT- 228
            A +   P     P P     T     P  PP +     P           P  P P  T 
Sbjct: 2563 APDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTH 2622

Query: 229  ---PGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAV 262
               P     SP    P     P  P P   P   PA 
Sbjct: 2623 APDPPPPSPSPAANEPDPHPPPTVP-PPERPRDDPAP 2658



 Score = 29.5 bits (66), Expect = 4.1
 Identities = 19/92 (20%), Positives = 26/92 (28%), Gaps = 3/92 (3%)

Query: 174  SNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGG 233
            S A  P     P P   +  P  +    ++    P+    P     P  P P   P    
Sbjct: 2870 SPAAKPAAPARP-PVRRLARPAVSRSTESFA-LPPDQPERPPQPQAPPPPQPQPQPPPPP 2927

Query: 234  YSPYPLYPPGGSTPATPYPVPTPSPAPAVTSG 265
              P P  PP         P   P+ A   +  
Sbjct: 2928 -QPQPPPPPPPRPQPPLAPTTDPAGAGEPSGA 2958



 Score = 29.5 bits (66), Expect = 4.9
 Identities = 22/97 (22%), Positives = 28/97 (28%), Gaps = 10/97 (10%)

Query: 174  SNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGG 233
            S +P   +     P +      P   P+      P  A     +   S P       A  
Sbjct: 2630 SPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRA-- 2687

Query: 234  YSPYPLYPPGGSTPAT---PYPVPTPSPAPAVTSGGT 267
                   P  GS  +    P P PTP PAP      T
Sbjct: 2688 -----ARPTVGSLTSLADPPPPPPTPEPAPHALVSAT 2719



 Score = 29.1 bits (65), Expect = 6.9
 Identities = 13/61 (21%), Positives = 15/61 (24%)

Query: 211 AAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGTITD 270
           AA P                     P P   P  ++   P   P PS  P    G     
Sbjct: 391 AATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPP 450

Query: 271 E 271
           E
Sbjct: 451 E 451


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 40.5 bits (95), Expect = 0.002
 Identities = 29/146 (19%), Positives = 59/146 (40%), Gaps = 23/146 (15%)

Query: 271 EHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 330
           E    +     ++ + R+ N+Q+ +A+ E E L++ E  L Q + +L    E++D  + +
Sbjct: 47  EEKLEAALLEAKELLLRERNQQRQEARREREELQREEERLVQKEEQLDARAEKLDNLENQ 106

Query: 331 LERSISFLKEKETELDE-------------------ILAVLTEKTEVDVDEAVTTTAPIY 371
           LE     L  +E EL+E                      +L +  + +++E         
Sbjct: 107 LEEREKALSARELELEELEKQLDNELYRVAGLTPEQARKLLLKLLDAELEEEKAQ---RV 163

Query: 372 KQILNTFTEEAATEDTIYYMGEALRR 397
           K+I      E A       + +A++R
Sbjct: 164 KKIEEEADLE-AERKAQNILAQAMQR 188


>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 624

 Score = 40.4 bits (94), Expect = 0.002
 Identities = 37/160 (23%), Positives = 50/160 (31%), Gaps = 24/160 (15%)

Query: 179 PTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSP-- 236
           P+ S    P S      PT      PG       AP+  + PS   P     +   SP  
Sbjct: 381 PSGSAAEGPASGGAATIPT------PGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSPRV 434

Query: 237 ----YPLYPPGGSTPATPYP-VPTPSPAP----AVTSGGTITDEHIKMSLKSAVEDKIRR 287
                P  PP    P  P P +P  SP P    +V S           S  +       R
Sbjct: 435 PWDDAPPAPPRSGIPPRPAPRMPEASPVPGAPDSVASASDAPPTLGDPSDTAEHTPSGPR 494

Query: 288 KMN-------EQKVQAQEEIEILKQTENELNQGKFKLARM 320
             +        +  Q      +L+Q   E   G+ +LA M
Sbjct: 495 TWDGFLEFCQGRNGQGGRLATVLRQATPEHADGRLRLATM 534


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 40.1 bits (94), Expect = 0.002
 Identities = 27/98 (27%), Positives = 36/98 (36%), Gaps = 1/98 (1%)

Query: 204 PGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAP-AV 262
               P+AAAA S S   S      +       P    P     P    PV  PS AP AV
Sbjct: 388 AAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAV 447

Query: 263 TSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEI 300
                  ++ I +S  S++     R + E+  QA   I
Sbjct: 448 RPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAEQATGNI 485



 Score = 32.8 bits (75), Expect = 0.37
 Identities = 19/104 (18%), Positives = 26/104 (25%), Gaps = 11/104 (10%)

Query: 162 PATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPS 221
           P           S A   + S    P  +     P+ P SA       A   P+ S+ P 
Sbjct: 381 PVFTQPAAAPQPSAAAAASPS----PSQSSAAAQPSAPQSA----TQPAGTPPTVSVDPP 432

Query: 222 YPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSG 265
              P   P      P  + P               S  P+    
Sbjct: 433 AAVPVNPPSTA---PQAVRPAQFKEEKKIPVSKVSSLGPSTLRP 473


>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 700

 Score = 39.9 bits (93), Expect = 0.002
 Identities = 20/75 (26%), Positives = 26/75 (34%), Gaps = 1/75 (1%)

Query: 191 VNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPA-T 249
              P P  PP+A       AAAA + +  P+  +P     A         P G   PA  
Sbjct: 394 AAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPA 453

Query: 250 PYPVPTPSPAPAVTS 264
           P   P  +  PA   
Sbjct: 454 PAAAPAAAARPAAAG 468



 Score = 34.1 bits (78), Expect = 0.16
 Identities = 25/117 (21%), Positives = 35/117 (29%), Gaps = 7/117 (5%)

Query: 144 ASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAY 203
             A +A  P  ++     P   P      A+ A     +      +           +  
Sbjct: 384 QPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARG 443

Query: 204 PGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAP 260
           PG  P  A AP      + P   A P A G  P        + PA   P   P+PA 
Sbjct: 444 PGGAPAPAPAP-----AAAPAAAARPAAAGPRPVA--AAAAAAPARAAPAAAPAPAD 493



 Score = 32.9 bits (75), Expect = 0.40
 Identities = 25/141 (17%), Positives = 40/141 (28%), Gaps = 21/141 (14%)

Query: 144 ASAFSALPPLYSSRTKQEPATNPTYNGNMASN-----APYPTQSYMPMPGSNVNTPYPTY 198
            +A +A   + ++  ++ PA         AS      AP P  +    P +         
Sbjct: 410 PAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGP 469

Query: 199 PPSAYPGYNPNAAAAPSGSIYPSYPT-------PGATPGAGGYSPYPLYP---------P 242
            P A       A AAP+ +  P+          P          P              P
Sbjct: 470 RPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDP 529

Query: 243 GGSTPATPYPVPTPSPAPAVT 263
             + P   +    P+PA A  
Sbjct: 530 ATADPDDAFETLAPAPAAAPA 550



 Score = 32.2 bits (73), Expect = 0.61
 Identities = 19/84 (22%), Positives = 28/84 (33%), Gaps = 7/84 (8%)

Query: 187 PGSNVNTPYPTYPPSAY-------PGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPL 239
           P +    P     P+A            P AA A + +   +     A P     +P  L
Sbjct: 374 PATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEAL 433

Query: 240 YPPGGSTPATPYPVPTPSPAPAVT 263
                ++   P   P P+PAPA  
Sbjct: 434 AAARQASARGPGGAPAPAPAPAAA 457



 Score = 30.6 bits (69), Expect = 1.7
 Identities = 24/121 (19%), Positives = 34/121 (28%), Gaps = 6/121 (4%)

Query: 144 ASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAY 203
           A+A +A     +       A  P      A+ A  P  +      +       +  P A 
Sbjct: 375 ATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPAR-RSPAPEAL 433

Query: 204 PGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVT 263
                 +A  P G+  P+ P P A P A           G    A          APA  
Sbjct: 434 AAARQASARGPGGAPAPA-PAPAAAPAAAARPA----AAGPRPVAAAAAAAPARAAPAAA 488

Query: 264 S 264
            
Sbjct: 489 P 489



 Score = 29.5 bits (66), Expect = 4.4
 Identities = 28/126 (22%), Positives = 39/126 (30%), Gaps = 4/126 (3%)

Query: 144 ASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAY 203
           A A +  P    +   +  A  P      A+ AP           ++ + P     P  +
Sbjct: 447 APAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEF 506

Query: 204 PGYNP-NAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTP---ATPYPVPTPSPA 259
               P    AAP+G +  S P P        +      P     P   A   PV  P P 
Sbjct: 507 ASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPP 566

Query: 260 PAVTSG 265
            A  SG
Sbjct: 567 RASASG 572


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 38.4 bits (90), Expect = 0.003
 Identities = 31/137 (22%), Positives = 56/137 (40%), Gaps = 30/137 (21%)

Query: 290 NEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKA----ELERS--ISFLKEKET 343
           ++   + +  +E LK+   EL Q   +L     +I+K K       ER+  +  LK+ E 
Sbjct: 61  SQALNKLKTRLEKLKKELEELKQRIAELQ---AQIEKLKKGREETEERTELLEELKQLEK 117

Query: 344 ELDEILAVLTEKTEVD---VDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVI 400
           EL ++ A L +  + D   +++    T  + K+  N +T      D I+           
Sbjct: 118 ELKKLKAELEKYEKNDPERIEKLKEET-KVAKEAANRWT------DNIF----------- 159

Query: 401 DLEVYLKQVRSLSRKQF 417
            L+ YL +   L   Q 
Sbjct: 160 SLKSYLSKKFGLDEAQL 176


>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
           unknown].
          Length = 984

 Score = 39.1 bits (91), Expect = 0.005
 Identities = 27/127 (21%), Positives = 47/127 (37%), Gaps = 13/127 (10%)

Query: 288 KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDE 347
           K + +  Q  + +E LKQ  NE+++ + + A   + ++  +AE  R    L E  +EL  
Sbjct: 175 KPSGRNPQINQLLEKLKQERNEIDEAEKEYATYHKLLESRRAEHAR----LAELRSELRA 230

Query: 348 ILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLK 407
               +    +              +Q L    EE AT           R GV+ LE    
Sbjct: 231 DRDHIRALRDAVELWPRLQEWKQLEQELTRRREELATFP---------RDGVLRLEKREA 281

Query: 408 QVRSLSR 414
            ++    
Sbjct: 282 HLQKTEA 288



 Score = 29.0 bits (65), Expect = 5.6
 Identities = 24/131 (18%), Positives = 49/131 (37%), Gaps = 9/131 (6%)

Query: 278 KSAVEDKIRR--KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSI 335
           ++ ++  +R   +++E K  A E  E L   E +L   +      FE    E+    RS+
Sbjct: 312 EAVLQALVRLHQQLSEIKASAFELTETLAGIEADLRDKEEAAGNGFE---AERVHDLRSL 368

Query: 336 SFLKE---KETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMG 392
             +      + EL +  A   ++ +              +++ +   E  A ++      
Sbjct: 369 ECMLRYQSSQRELKQTEAAYCKRLDEKRLFEDEAEEEARQRLADDEEEVRAGDEAREEKI 428

Query: 393 EALRRGVIDLE 403
            A  + VID E
Sbjct: 429 AANSQ-VIDKE 438


>gnl|CDD|218108 pfam04487, CITED, CITED.  CITED, CBP/p300-interacting
           transactivator with ED-rich tail, are characterized by a
           conserved 32-amino acid sequence at the C-terminus.
           CITED proteins do not bind DNA directly and are thought
           to function as transcriptional co-activators.
          Length = 206

 Score = 37.6 bits (87), Expect = 0.006
 Identities = 25/113 (22%), Positives = 37/113 (32%), Gaps = 5/113 (4%)

Query: 166 PTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSY-PT 224
           P    +      +P QS       N ++    +     P    +       + Y  +   
Sbjct: 59  PRSAMSGPMGGGHPHQSMPAYMMFNPSSKPQPFMLVPGPQLMASMQLQKLNTQYQGHAGA 118

Query: 225 PGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGTITDEHIKMSL 277
           P   PG GG      + PG   P     +P P+  P V     I DE + MSL
Sbjct: 119 PAGHPGGGGPQQ---FRPGAGQPPGMQHMPAPALPPNVIDTDLI-DEEVLMSL 167


>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP).  This family
           contains bacterial fibronectin-attachment proteins
           (FAP). Family members are rich in alanine and proline,
           are approximately 300 long, and seem to be restricted to
           mycobacteria. These proteins contain a
           fibronectin-binding motif that allows mycobacteria to
           bind to fibronectin in the extracellular matrix.
          Length = 297

 Score = 38.0 bits (88), Expect = 0.006
 Identities = 24/78 (30%), Positives = 30/78 (38%), Gaps = 6/78 (7%)

Query: 184 MPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPG 243
           + +P +    P P  PP         AAAAP+ +  P  P P A P      P    PP 
Sbjct: 31  IALPATANADPAPPPPPP------STAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPP 84

Query: 244 GSTPATPYPVPTPSPAPA 261
            + P  P P P    AP 
Sbjct: 85  PADPNAPPPPPVDPNAPP 102



 Score = 30.2 bits (68), Expect = 1.8
 Identities = 25/94 (26%), Positives = 31/94 (32%), Gaps = 2/94 (2%)

Query: 144 ASAFS-ALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSA 202
           ASA + ALP   ++     P    T     A  AP P       P    + P    PP  
Sbjct: 26  ASAVAIALPATANADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPP 85

Query: 203 YPGYNPNAAAAPSGSIYPSYPTPGATPGA-GGYS 235
                P        +  P  P PG    A GG+S
Sbjct: 86  ADPNAPPPPPVDPNAPPPPAPEPGRIDNAVGGFS 119



 Score = 28.7 bits (64), Expect = 6.2
 Identities = 23/87 (26%), Positives = 26/87 (29%), Gaps = 10/87 (11%)

Query: 173 ASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAG 232
           A  AP P              P P  PP+A P   P+   A         P P A P A 
Sbjct: 39  ADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAP-------PPPPADPNA- 90

Query: 233 GYSPYPLYPPGGSTPATPYPVPTPSPA 259
              P P   P    P  P P    +  
Sbjct: 91  --PPPPPVDPNAPPPPAPEPGRIDNAV 115


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 38.0 bits (89), Expect = 0.007
 Identities = 19/92 (20%), Positives = 40/92 (43%), Gaps = 8/92 (8%)

Query: 274 KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELER 333
           +  L+  +E++ +R+  E + + QE  ++ +  E    + + +     E+  K + E++ 
Sbjct: 73  RAVLQEQIEEREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREKQKKLREEIDE 132

Query: 334 SIS-------FLKEKETELDE-ILAVLTEKTE 357
                       KE+E E +  IL    EK E
Sbjct: 133 FNEERIERKEEEKEREREEELKILEYQREKAE 164



 Score = 34.1 bits (79), Expect = 0.14
 Identities = 20/73 (27%), Positives = 34/73 (46%), Gaps = 2/73 (2%)

Query: 270 DEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKA 329
           DE      +   E K R+K  E+  + + + + L++   E  Q + K  R+ E   +E+A
Sbjct: 202 DELRADLYQEEYERKERQKEKEEAEKRRRQKQELQRAREE--QIEEKEERLQEERAEEEA 259

Query: 330 ELERSISFLKEKE 342
           E ER +    E E
Sbjct: 260 ERERMLEKQAEDE 272



 Score = 33.3 bits (77), Expect = 0.23
 Identities = 19/93 (20%), Positives = 34/93 (36%), Gaps = 19/93 (20%)

Query: 282 EDKIRRKMNEQ------KVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSI 335
           E KI     E+      +   + E +  K+ E        +L    E  + E+ EL+   
Sbjct: 152 ELKILEYQREKAEREEEREAERRERKEEKEREVA------RLRAQQEEAEDEREELDELR 205

Query: 336 SFL-------KEKETELDEILAVLTEKTEVDVD 361
           + L       KE++ E +E      +K E+   
Sbjct: 206 ADLYQEEYERKERQKEKEEAEKRRRQKQELQRA 238



 Score = 32.6 bits (75), Expect = 0.37
 Identities = 20/80 (25%), Positives = 36/80 (45%), Gaps = 9/80 (11%)

Query: 277 LKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFER--IDKEKAELERS 334
           +    E++I RK   ++ + +E  E LK  E +  + + +  R  ER    +EK      
Sbjct: 130 IDEFNEERIERK---EEEKEREREEELKILEYQREKAEREEEREAERRERKEEKEREVAR 186

Query: 335 ISFLKEKET----ELDEILA 350
           +   +E+      ELDE+ A
Sbjct: 187 LRAQQEEAEDEREELDELRA 206


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 38.1 bits (88), Expect = 0.008
 Identities = 42/150 (28%), Positives = 51/150 (34%), Gaps = 11/150 (7%)

Query: 123 SPYLHTWTAQNSKLEGLIKTLASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQS 182
           +PY  TWT           +   A + LP  ++  T Q P   PT      +  P   Q 
Sbjct: 660 TPYKPTWTQIGHI--PYQPSPTGANTMLPIQWAPGTMQPPPRAPT-PMRPPAAPPGRAQR 716

Query: 183 YMPMPGSN---VNTPYPTYPPSAYPGYNPNAAAAPSGSIYPS-----YPTPGATPGAGGY 234
                G        P    PP+A PG     AAAP  +  P+        P A PGA   
Sbjct: 717 PAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTP 776

Query: 235 SPYPLYPPGGSTPATPYPVPTPSPAPAVTS 264
            P P  PP         P P P P    TS
Sbjct: 777 QPPPQAPPAPQQRPRGAPTPQPPPQAGPTS 806



 Score = 32.7 bits (74), Expect = 0.48
 Identities = 26/103 (25%), Positives = 33/103 (32%), Gaps = 7/103 (6%)

Query: 165 NPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPT 224
            P+  G             M  P      P P  PP+A PG     AAA   +  P+   
Sbjct: 675 QPSPTGANTMLPIQWAPGTMQPPPR---APTPMRPPAAPPGRAQRPAAATGRARPPAAAP 731

Query: 225 PGATPGAGGYSPYPLYPPGGSTPAT--PYPVPTPSPAPAVTSG 265
             A P A   +P    PP  +      P   P  +  PA   G
Sbjct: 732 GRARPPAA--APGRARPPAAAPGRARPPAAAPGRARPPAAAPG 772


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 38.1 bits (89), Expect = 0.009
 Identities = 21/73 (28%), Positives = 37/73 (50%), Gaps = 2/73 (2%)

Query: 282 EDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEK 341
           E+K++  + E K +A EE   L +   E      ++ +   R+++E   +E  I  L+EK
Sbjct: 279 EEKVKE-LKELKEKA-EEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEK 336

Query: 342 ETELDEILAVLTE 354
           E  L+E+   L E
Sbjct: 337 EERLEELKKKLKE 349



 Score = 36.6 bits (85), Expect = 0.026
 Identities = 32/104 (30%), Positives = 58/104 (55%), Gaps = 13/104 (12%)

Query: 280 AVEDKIRRKMNEQKV-QAQEEIEILKQTENELNQGKFK-LARMFERID---KEKAELERS 334
            +E+K++ K N +++ +  EE E LK+   +L +G+ K L +  E+++   K+ AELE+ 
Sbjct: 507 ELEEKLK-KYNLEELEKKAEEYEKLKEKLIKL-KGEIKSLKKELEKLEELKKKLAELEKK 564

Query: 335 ISFLKEKETELDEILAVLTE---KTEVDVDEAVTTTAPIYKQIL 375
              L E E EL E+L  L E   ++  +++E +    P Y + L
Sbjct: 565 ---LDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYL 605



 Score = 36.6 bits (85), Expect = 0.028
 Identities = 19/84 (22%), Positives = 38/84 (45%), Gaps = 9/84 (10%)

Query: 282 EDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLA--------RMFERIDKEKAELER 333
             ++ +++ E + +  E  E  K  + EL + K +L         +  E ++K K E+E 
Sbjct: 347 LKELEKRLEELEERH-ELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEE 405

Query: 334 SISFLKEKETELDEILAVLTEKTE 357
            IS +  +  EL + +  L +  E
Sbjct: 406 EISKITARIGELKKEIKELKKAIE 429



 Score = 36.2 bits (84), Expect = 0.035
 Identities = 27/117 (23%), Positives = 55/117 (47%), Gaps = 8/117 (6%)

Query: 270 DEHIKMS-LKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEK 328
           +E+IK+S       D++R  + ++  + +EEI  +++   EL + + +L    E + K+ 
Sbjct: 293 EEYIKLSEFYEEYLDELRE-IEKRLSRLEEEINGIEERIKELEEKEERL----EELKKKL 347

Query: 329 AELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATE 385
            ELE+ +  L+E+    +E  A   E   +       T   + K++     E+A  E
Sbjct: 348 KELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEE--LEKAKEE 402



 Score = 35.8 bits (83), Expect = 0.047
 Identities = 18/84 (21%), Positives = 44/84 (52%), Gaps = 7/84 (8%)

Query: 274 KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKA---E 330
           +++  S+   ++R ++ +     ++E++ L++ + E+ + + +L  +     K +    E
Sbjct: 208 EINEISSELPELREELEK----LEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRE 263

Query: 331 LERSISFLKEKETELDEILAVLTE 354
           LE  I  LK++  EL+E +  L E
Sbjct: 264 LEERIEELKKEIEELEEKVKELKE 287



 Score = 34.7 bits (80), Expect = 0.11
 Identities = 21/107 (19%), Positives = 47/107 (43%), Gaps = 12/107 (11%)

Query: 278 KSAVEDKIRRKMNEQKVQAQEEIEILKQTEN---ELNQGKFKLARMFERIDKEKAELERS 334
            + +  ++     E   + +E ++ L+   N   EL   + +L R  + + K + EL+++
Sbjct: 572 LAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKA 631

Query: 335 ISFLKEKETELDEILAVLTE-KTEVDVDEAVTTTAPIYKQILNTFTE 380
              L E E  L+E+   L E + +   +E        Y+++   + E
Sbjct: 632 FEELAETEKRLEELRKELEELEKKYSEEE--------YEELREEYLE 670



 Score = 33.5 bits (77), Expect = 0.25
 Identities = 17/82 (20%), Positives = 45/82 (54%), Gaps = 1/82 (1%)

Query: 267 TITDEHIKMSLKSAVEDKIRRKMNE-QKVQAQEEIEILKQTENELNQGKFKLARMFERID 325
            +     +++      +++R+++ E +K  ++EE E L++   EL++    L    E ++
Sbjct: 627 ELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELE 686

Query: 326 KEKAELERSISFLKEKETELDE 347
           K + E+++++  LKE+  E ++
Sbjct: 687 KRREEIKKTLEKLKEELEEREK 708



 Score = 32.3 bits (74), Expect = 0.64
 Identities = 20/66 (30%), Positives = 37/66 (56%), Gaps = 1/66 (1%)

Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKE 342
           +K+ +++ E + + +EEIE L++    L   K KL      +++   EL++ I  L+EK 
Sbjct: 224 EKLEKEVKELE-ELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKV 282

Query: 343 TELDEI 348
            EL E+
Sbjct: 283 KELKEL 288



 Score = 31.6 bits (72), Expect = 1.1
 Identities = 21/74 (28%), Positives = 41/74 (55%), Gaps = 4/74 (5%)

Query: 278 KSAVEDKIRRKMN-EQKVQAQEE--IEILKQTENELNQGKFKLARMFERIDKEKAELERS 334
              +E  I+R  N E+ ++ +E+   E+L++  NE++    +L    E+++KE  ELE  
Sbjct: 178 IERLEKFIKRTENIEELIKEKEKELEEVLREI-NEISSELPELREELEKLEKEVKELEEL 236

Query: 335 ISFLKEKETELDEI 348
              ++E E EL+ +
Sbjct: 237 KEEIEELEKELESL 250



 Score = 30.4 bits (69), Expect = 2.1
 Identities = 25/91 (27%), Positives = 43/91 (47%), Gaps = 5/91 (5%)

Query: 271 EHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERID---KE 327
           E+I+  L    E ++   + E    + E  E+ ++ E  L +   +L  + E I+   KE
Sbjct: 189 ENIE-ELIKEKEKELEEVLREINEISSELPELREELEK-LEKEVKELEELKEEIEELEKE 246

Query: 328 KAELERSISFLKEKETELDEILAVLTEKTEV 358
              LE S   L+EK  EL+E +  L ++ E 
Sbjct: 247 LESLEGSKRKLEEKIRELEERIEELKKEIEE 277



 Score = 30.0 bits (68), Expect = 2.8
 Identities = 19/80 (23%), Positives = 40/80 (50%), Gaps = 13/80 (16%)

Query: 284 KIRRKMNEQKVQAQEEIE-ILKQTEN--EL--NQGKFKLARMFERIDK---EKAELERSI 335
           ++ +++  +     E +E  +K+TEN  EL   + K +L  +   I++   E  EL   +
Sbjct: 169 EVIKEIKRRI----ERLEKFIKRTENIEELIKEKEK-ELEEVLREINEISSELPELREEL 223

Query: 336 SFLKEKETELDEILAVLTEK 355
             L+++  EL+E+   + E 
Sbjct: 224 EKLEKEVKELEELKEEIEEL 243



 Score = 29.3 bits (66), Expect = 5.2
 Identities = 31/144 (21%), Positives = 62/144 (43%), Gaps = 18/144 (12%)

Query: 275 MSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERS 334
            + K  +E   +R       + ++E+E L++ + E+ +   K+      + KE  EL+++
Sbjct: 368 KAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKA 427

Query: 335 ISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEE-AATEDTIYYMGE 393
           I  LK+ + +       LTE+                K++L  +T E    E  +  + E
Sbjct: 428 IEELKKAKGKCPVCGRELTEEHR--------------KELLEEYTAELKRIEKELKEIEE 473

Query: 394 A---LRRGVIDLEVYLKQVRSLSR 414
               LR+ + +LE  LK+   L +
Sbjct: 474 KERKLRKELRELEKVLKKESELIK 497


>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria, archaea and eukaryotes.
           This domain is about 200 amino acids in length. This
           domain is found associated with pfam00013, pfam01966.
           This domain has a single completely conserved residue A
           that may be functionally important.
          Length = 201

 Score = 37.1 bits (87), Expect = 0.009
 Identities = 28/112 (25%), Positives = 52/112 (46%), Gaps = 18/112 (16%)

Query: 294 VQAQEEIEILK-QTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVL 352
           ++A+EEI  L+ + E EL + + +L R  +R+ +++  L+R    L++KE  L+E    L
Sbjct: 56  LEAKEEIHKLRAEAERELKERRNELQRQEKRLLQKEETLDRKDESLEKKEESLEEKEKEL 115

Query: 353 T------EKTEVDVDEAVTT-----------TAPIYKQILNTFTEEAATEDT 387
                  E+ E +++E +             TA   K+IL    E     + 
Sbjct: 116 AARQQQLEEKEEELEELIEEQQQELERISGLTAEEAKEILLEEVEAELKHEA 167


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 37.6 bits (88), Expect = 0.010
 Identities = 29/118 (24%), Positives = 49/118 (41%), Gaps = 19/118 (16%)

Query: 281 VEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKE 340
            ++++ +   E + + +E    L++ E  L Q +  L R  E +DK++  LE+    L  
Sbjct: 56  AKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSN 115

Query: 341 KETELDEILAVLTEK-----------TEVDVDEAVTTTAPIYKQILNTFTEEAATEDT 387
           KE  LDE    L E            + +  +EA        K+IL    EE A  + 
Sbjct: 116 KEKNLDEKEEELEELIAEQREELERISGLTQEEA--------KEILLEEVEEEARHEA 165


>gnl|CDD|219342 pfam07227, DUF1423, Protein of unknown function (DUF1423).  This
           family represents a conserved region approximately 500
           residues long within a number of Arabidopsis thaliana
           proteins of unknown function.
          Length = 446

 Score = 37.4 bits (87), Expect = 0.011
 Identities = 28/155 (18%), Positives = 54/155 (34%), Gaps = 14/155 (9%)

Query: 218 IYPSYPTPGATPGAGGYSPYPLYPPGGSTP----ATPYPVPTPSPAPAVTSGGTITDE-- 271
           +  S      +      +P        +                    +    +  D   
Sbjct: 269 LDSSKSLENTSEPGRLIAPQEACRNRIAEVVQEAVRKMESVAEEKMRMLKKARSALDACD 328

Query: 272 -HIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARM----FERI-- 324
             ++   +   E K+ R+  + ++   E I  LKQ E ++ Q K   AR      +RI  
Sbjct: 329 RELEEKAREVSELKMERQRKKPQIDELESIVRLKQAEADMFQLKADEARREAERLQRIAL 388

Query: 325 -DKEKAELERSISFLKEKETELDEILAVLTEKTEV 358
              EK+E E +  +LK + +E +E    L E+ ++
Sbjct: 389 AKTEKSEEEYASKYLKLRLSEAEEERQYLFEELKL 423


>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
           inhibitor.
          Length = 670

 Score = 37.8 bits (87), Expect = 0.011
 Identities = 24/88 (27%), Positives = 38/88 (43%), Gaps = 7/88 (7%)

Query: 186 MPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPG-- 243
           +PG  V      Y P  + G   +  + P+GS   S  T  +   +   +P    P G  
Sbjct: 550 IPGKGV-----PYIPGLFAGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHL 604

Query: 244 GSTPATPYPVPTPSPAPAVTSGGTITDE 271
           GS PATP  + +PS +P  +  G+ +  
Sbjct: 605 GSPPATPSKIVSPSTSPPASHLGSPSTT 632



 Score = 33.5 bits (76), Expect = 0.25
 Identities = 23/88 (26%), Positives = 37/88 (42%), Gaps = 4/88 (4%)

Query: 177 PYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSP 236
           PY    +   PGS  +TP      SA       ++ +PS  + PS   P    G+   +P
Sbjct: 556 PYIPGLFAGNPGSTNSTP----TGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATP 611

Query: 237 YPLYPPGGSTPATPYPVPTPSPAPAVTS 264
             +  P  S PA+    P+ +P+   +S
Sbjct: 612 SKIVSPSTSPPASHLGSPSTTPSSPESS 639


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 37.3 bits (87), Expect = 0.012
 Identities = 17/76 (22%), Positives = 35/76 (46%), Gaps = 1/76 (1%)

Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQ-GKFKLARMFERIDKEKAELERSISFLKEK 341
           + I+ K+ ++    +EEI  L+Q  +ELN     +L +  + +     ++      L+E 
Sbjct: 174 NSIKPKLRKKLQALKEEIASLRQLADELNLCDPLELEKARQELRSLSVKISEKRKQLEEL 233

Query: 342 ETELDEILAVLTEKTE 357
           + EL E+   +   T 
Sbjct: 234 QQELQELTIAIEALTN 249


>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
          Length = 281

 Score = 37.3 bits (87), Expect = 0.012
 Identities = 22/111 (19%), Positives = 41/111 (36%), Gaps = 24/111 (21%)

Query: 269 TDEHIKMSLKSAVE--DKIRRKM--NEQKVQAQ---------EEIEILKQTENE--LNQG 313
             E ++ + + A E   K   +     +K+Q Q         EE+E L +        +G
Sbjct: 68  AFEIVEAAEEEAKEELLKKTDEASSIIEKLQMQIEREQEEWEEELERLIEEAKAEGYEEG 127

Query: 314 KFK-LARMFERIDKEKAELERSISFLKEK--------ETELDEILAVLTEK 355
             K      E + +   +L + I  L +K        E E+ E+   + +K
Sbjct: 128 YEKGREEGLEEVRELIEQLNKIIEKLIKKREEILESSEEEIVELALDIAKK 178



 Score = 28.8 bits (65), Expect = 5.2
 Identities = 13/64 (20%), Positives = 28/64 (43%), Gaps = 4/64 (6%)

Query: 282 EDKIRRKMNEQKVQAQEE-IEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKE 340
            ++      E    A+EE  E L +  +E +     + ++  +I++E+ E E  +  L E
Sbjct: 61  IEEAEEDAFEIVEAAEEEAKEELLKKTDEASS---IIEKLQMQIEREQEEWEEELERLIE 117

Query: 341 KETE 344
           +   
Sbjct: 118 EAKA 121


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 36.8 bits (85), Expect = 0.013
 Identities = 21/99 (21%), Positives = 48/99 (48%), Gaps = 5/99 (5%)

Query: 267 TITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDK 326
           T+      M LK   E+ ++ K+ E + + +E ++ L++ E E  + + +L    +R++ 
Sbjct: 123 TLQRVPEYMDLKEDYEE-LKEKLEELQKEKEELLKELEELEAEYEEVQERL----KRLEV 177

Query: 327 EKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVT 365
           E + LE  +  L  +  +L +    L    E+  +E ++
Sbjct: 178 ENSRLEEMLKKLPGEVYDLKKRWDELEPGVELPEEELIS 216


>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein.  Taxilin
           contains an extraordinarily long coiled-coil domain in
           its C-terminal half and is ubiquitously expressed. It is
           a novel binding partner of several syntaxin family
           members and is possibly involved in Ca2+-dependent
           exocytosis in neuroendocrine cells. Gamma-taxilin,
           described as leucine zipper protein Factor Inhibiting
           ATF4-mediated Transcription (FIAT), localises to the
           nucleus in osteoblasts and dimerises with ATF4 to form
           inactive dimers, thus inhibiting ATF4-mediated
           transcription.
          Length = 309

 Score = 37.0 bits (86), Expect = 0.014
 Identities = 30/146 (20%), Positives = 55/146 (37%), Gaps = 37/146 (25%)

Query: 267 TITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIE---------------------ILKQ 305
               E  + +LK  ++D++    N      +E  E                     +LK 
Sbjct: 106 KELTEKFQSTLKD-IQDQMEEHSNPNHKLCEENEELREKLKELIEQYELREQHFEKLLKT 164

Query: 306 TENE--LNQGKFKLARMF-----ERIDKEKAEL---ERSISFLKEKETELDEILAVLTEK 355
            + E  LN  K + A+       E+  +EK  L      +S LKE E EL   L + ++K
Sbjct: 165 KDLEQQLNDAKLEQAQEQAEQEQEKHKREKEILLKEAAQVSTLKETEKELRAQLNLYSDK 224

Query: 356 TEVDVDEAVTTTAPIYKQILNTFTEE 381
            + +  + +  +     ++  TF +E
Sbjct: 225 FK-EFQDTLNKS----NEVFLTFKQE 245


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
           glycoprotein (BLLF1).  This family consists of the BLLF1
           viral late glycoprotein, also termed gp350/220. It is
           the most abundantly expressed glycoprotein in the viral
           envelope of the Herpesviruses and is the major antigen
           responsible for stimulating the production of
           neutralising antibodies in vivo.
          Length = 830

 Score = 37.5 bits (86), Expect = 0.015
 Identities = 26/119 (21%), Positives = 36/119 (30%), Gaps = 9/119 (7%)

Query: 150 LPPLYSSRTKQ-EPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNP 208
           LPP  +  T      T+ T  G  +S  P  T      P S   +  P           P
Sbjct: 455 LPPASTGPTVSTADPTSGTPTGTTSSTLPEDTS-----PTSRTTSATPNATSPTPAVTTP 509

Query: 209 NAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTP---SPAPAVTS 264
           NA +  +     +      TP   G +     PP G+T       P     SP     +
Sbjct: 510 NATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTNT 568


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 37.6 bits (88), Expect = 0.015
 Identities = 19/63 (30%), Positives = 29/63 (46%)

Query: 295 QAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTE 354
             Q+E+  LKQ      + K +   + E   +E   LE   + L+EK+ EL+  L  L E
Sbjct: 146 ALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQE 205

Query: 355 KTE 357
           K  
Sbjct: 206 KAA 208


>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
           This domain is found associated with the Pfam tRNA
           synthetase class II domain (pfam00587) and represents
           the N-terminal domain of seryl-tRNA synthetase.
          Length = 108

 Score = 34.9 bits (81), Expect = 0.016
 Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 5/75 (6%)

Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQ--GKFKLARMFERIDKEKAELERSISFLKE 340
           D++  +++E++ + Q E+E L+   NEL++  GK K  +  E  +   AE++     LK 
Sbjct: 29  DELL-ELDEERRELQVELEELQAERNELSKEIGKAKKKK--EDAEALIAEVKELKDELKA 85

Query: 341 KETELDEILAVLTEK 355
            E EL E+ A L + 
Sbjct: 86  LEAELRELEAELDKL 100


>gnl|CDD|236583 PRK09601, PRK09601, GTP-binding protein YchF; Reviewed.
          Length = 364

 Score = 37.0 bits (87), Expect = 0.017
 Identities = 26/107 (24%), Positives = 46/107 (42%), Gaps = 17/107 (15%)

Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKE 342
           +++ +K      +A+ E+E+L++    L +G  K AR  E +  E+ +L +S+  L  K 
Sbjct: 145 ERLEKKAKGGDKEAKAELELLEKLLEHLEEG--KPARTLE-LTDEEEKLLKSLQLLTAK- 200

Query: 343 TELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIY 389
                +L V       +VDE        Y + +    E AA E    
Sbjct: 201 ----PVLYV------ANVDEDDLADGNPYVKKVR---EIAAKEGAEV 234


>gnl|CDD|217372 pfam03105, SPX, SPX domain.  We have named this region the SPX
           domain after (SYG1, Pho81 and XPR1). This 180 residue
           length domain is found at the amino terminus of a
           variety of proteins. In the yeast protein SYG1, the
           N-terminus directly binds to the G- protein beta subunit
           and inhibits transduction of the mating pheromone
           signal. This finding suggests that all the members of
           this family are involved in G-protein associated signal
           transduction. The N-termini of several proteins involved
           in the regulation of phosphate transport, including the
           putative phosphate level sensors PHO81 from
           Saccharomyces cerevisiae and NUC-2 from Neurospora
           crassa, are also members of this family. The SPX domain
           of S. cerevisiae low-affinity phosphate transporters
           Pho87 and Pho90 auto-regulates uptake and prevents
           efflux. This SPX dependent inhibition is mediated by the
           physical interaction with Spl2 NUC-2 contains several
           ankyrin repeats pfam00023. Several members of this
           family are annotated as XPR1 proteins: the xenotropic
           and polytropic retrovirus receptor confers
           susceptibility to infection with murine leukaemia
           viruses (MLV). The similarity between SYG1, phosphate
           regulators and XPR1 sequences has been previously noted,
           as has the additional similarity to several predicted
           proteins, of unknown function, from Drosophila
           melanogaster, Arabidopsis thaliana, Caenorhabditis
           elegans, Schizosaccharomyces pombe, and Saccharomyces
           cerevisiae. In addition, given the similarities between
           XPR1 and SYG1 and phosphate regulatory proteins, it has
           been proposed that XPR1 might be involved in G-protein
           associated signal transduction and may itself function
           as a phosphate sensor.
          Length = 176

 Score = 35.9 bits (83), Expect = 0.019
 Identities = 34/135 (25%), Positives = 57/135 (42%), Gaps = 20/135 (14%)

Query: 277 LKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSIS 336
           +K     K+    NE      E   +L + E E ++ +F     FE +DKE   LE+  S
Sbjct: 29  IKELTRAKLSLTTNED--GEDESSNLLNEEELEEDESEF-----FEALDKE---LEKVNS 78

Query: 337 FLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALR 396
           F KEKE EL E L  L ++ E   +    T++           +            E L+
Sbjct: 79  FYKEKEAELLERLEELEKQLEELEERRDETSSAKSSPSDKNLLKA----------FEELK 128

Query: 397 RGVIDLEVYLKQVRS 411
           + +++L   L+ ++S
Sbjct: 129 KALLELYRELQLLKS 143


>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
           non-fungal.  The approx. 70 residue Med15 domain of the
           ARC-Mediator co-activator is a three-helix bundle with
           marked similarity to the KIX domain. The sterol
           regulatory element binding protein (SREBP) family of
           transcription activators use the ARC105 subunit to
           activate target genes in the regulation of cholesterol
           and fatty acid homeostasis. In addition, Med15 is a
           critical transducer of gene activation signals that
           control early metazoan development.
          Length = 768

 Score = 36.9 bits (85), Expect = 0.019
 Identities = 22/80 (27%), Positives = 29/80 (36%), Gaps = 7/80 (8%)

Query: 181 QSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLY 240
           Q  +         P     PS  P    N +        PS P+PG  PG+      P  
Sbjct: 395 QGGLGANPMQQGQPGMMSSPSPVPQVQTNQSMPQPPQ--PSVPSPG-GPGSQP----PQS 447

Query: 241 PPGGSTPATPYPVPTPSPAP 260
             GG  P+ P  +P+PSP  
Sbjct: 448 VSGGMIPSPPALMPSPSPQM 467


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 36.5 bits (85), Expect = 0.020
 Identities = 20/62 (32%), Positives = 32/62 (51%), Gaps = 1/62 (1%)

Query: 297 QEEIEILKQTENELNQGKFK-LARMFERIDKEKAELERSISFLKEKETELDEILAVLTEK 355
           +EE+  LKQ E+EL       L R  E++ K   E+   +  L+E E EL E+ + + + 
Sbjct: 185 EEELRQLKQLEDELEDCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDL 244

Query: 356 TE 357
           T 
Sbjct: 245 TN 246


>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA
           binding.  This is the middle region of a family of TATA
           element modulatory factor 1 proteins conserved in
           eukaryotes that contains at its N-terminal section a
           number of leucine zippers that could potentially form
           coiled coil structures. The whole proteins bind to the
           TATA element of some RNA polymerase II promoters and
           repress their activity. by competing with the binding of
           TATA binding protein. TMFs are evolutionarily conserved
           golgins that bind Rab6, a ubiquitous ras-like
           GTP-binding Golgi protein, and contribute to Golgi
           organisation in animal and plant cells.
          Length = 74

 Score = 33.4 bits (77), Expect = 0.022
 Identities = 26/77 (33%), Positives = 40/77 (51%), Gaps = 8/77 (10%)

Query: 279 SAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFL 338
           S++E K+  K +EQ  Q  EE E L + E + N    KL         +  ELE+ I+ L
Sbjct: 1   SSLEKKLAEK-DEQIAQLMEEGEKLSKKELKHNNTIKKL-------RAKNKELEKEIAEL 52

Query: 339 KEKETELDEILAVLTEK 355
           K+K  +L++ L  L E+
Sbjct: 53  KKKLEKLEKELENLEER 69



 Score = 28.4 bits (64), Expect = 1.5
 Identities = 18/67 (26%), Positives = 33/67 (49%), Gaps = 9/67 (13%)

Query: 282 EDKIRRKMNE-QKVQAQE-----EIEILKQTENELNQGKFKLARMFERIDKEKAELERSI 335
           +++I + M E +K+  +E      I+ L+    EL +   +L +  E+++KE   LE   
Sbjct: 11  DEQIAQLMEEGEKLSKKELKHNNTIKKLRAKNKELEKEIAELKKKLEKLEKELENLE--- 67

Query: 336 SFLKEKE 342
             LK  E
Sbjct: 68  ERLKRAE 74


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 35.9 bits (83), Expect = 0.025
 Identities = 18/90 (20%), Positives = 39/90 (43%), Gaps = 14/90 (15%)

Query: 271 EHIKMSLK--SAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEK 328
           E ++ + K     E+     + E+  Q +EE ++L++  +EL +   +L       ++E+
Sbjct: 19  EDMRRAQKELEEYEETALE-LEEKLKQEEEEAQLLEKKADELEEENRRLEEEAAASEEER 77

Query: 329 -------AELERSISFLKE----KETELDE 347
                   E    ++ L+E    KE E  +
Sbjct: 78  ERLEAEVDEATAEVAKLEEEREKKEAETRQ 107



 Score = 28.2 bits (63), Expect = 8.0
 Identities = 17/71 (23%), Positives = 35/71 (49%), Gaps = 4/71 (5%)

Query: 291 EQKVQAQEEIE-ILKQTENELNQGKFKLARMFERIDKEKAEL---ERSISFLKEKETELD 346
           E+  + Q+E+E  ++Q E ++ + + +L    E   + + +L   E     L++K  EL+
Sbjct: 1   EEAEREQQELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELE 60

Query: 347 EILAVLTEKTE 357
           E    L E+  
Sbjct: 61  EENRRLEEEAA 71



 Score = 28.2 bits (63), Expect = 8.4
 Identities = 16/71 (22%), Positives = 31/71 (43%), Gaps = 1/71 (1%)

Query: 287 RKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETE-L 345
           R++ E+   ++EE E L+   +E      KL    E+ + E  +L++ +   +E      
Sbjct: 64  RRLEEEAAASEEERERLEAEVDEATAEVAKLEEEREKKEAETRQLQQELREAQEAHERAR 123

Query: 346 DEILAVLTEKT 356
            E+L      T
Sbjct: 124 QELLEAAAAPT 134


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 36.5 bits (85), Expect = 0.026
 Identities = 18/107 (16%), Positives = 48/107 (44%), Gaps = 7/107 (6%)

Query: 263 TSGGTITDEHIKMSLKSAVE-----DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKL 317
           +  GT T+  + +      E     ++  +++ E+  + ++ ++ L   +    Q   + 
Sbjct: 315 SPAGTKTEIEVGVDFPELKEELKELEEELKELEEELEKIKKLLKKLP--KKARGQLPPEK 372

Query: 318 ARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAV 364
               E++ + K +L   +  L+E+  EL E L  L  + ++ V++ +
Sbjct: 373 REQLEKLLETKEKLSEELEELEEELKELKEELESLYSEGKISVNKTI 419


>gnl|CDD|224312 COG1394, NtpD, Archaeal/vacuolar-type H+-ATPase subunit D [Energy
           production and conversion].
          Length = 211

 Score = 35.3 bits (82), Expect = 0.035
 Identities = 21/130 (16%), Positives = 54/130 (41%), Gaps = 11/130 (8%)

Query: 273 IKMSLKSAVED--KIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 330
           + M  ++ V++  ++R ++ ++  +A E + +    E         +  +   + KEK E
Sbjct: 36  LIMEFRAIVKEAKELREELEKELEEAYESLALASAAEGID-----AVEEIAL-VQKEKLE 89

Query: 331 LE---RSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDT 387
           ++    +I  +     EL E+     +   +     +      ++++L    E A  E T
Sbjct: 90  VDVDVENIMGVVVPTFELVELTPPPYDLGILSTSAWLDEAIEKFEELLEKLIELAELETT 149

Query: 388 IYYMGEALRR 397
           +  + E +R+
Sbjct: 150 LRLLLEEIRK 159


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 35.4 bits (82), Expect = 0.038
 Identities = 21/100 (21%), Positives = 40/100 (40%), Gaps = 5/100 (5%)

Query: 282 EDKIRRKMN--EQKVQA---QEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSIS 336
             +IR ++   E+K+ A   + E+  L        +    L      + +E  +LE+ I 
Sbjct: 68  IQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIE 127

Query: 337 FLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILN 376
            LKE+   L++ LA    + E +V E       +  +   
Sbjct: 128 DLKERLERLEKNLAEAEARLEEEVAEIREEGQELSSKREE 167



 Score = 31.6 bits (72), Expect = 0.59
 Identities = 20/72 (27%), Positives = 35/72 (48%), Gaps = 12/72 (16%)

Query: 280 AVEDKIRRKMNEQKVQAQEEIEILK----QTENELNQGKFKLARMFERIDKEKAELERSI 335
           ++ED++   M E + + ++EIE LK    + E  L + +        R+++E AE+    
Sbjct: 107 SLEDELAELMEEIE-KLEKEIEDLKERLERLEKNLAEAE-------ARLEEEVAEIREEG 158

Query: 336 SFLKEKETELDE 347
             L  K  EL E
Sbjct: 159 QELSSKREELKE 170



 Score = 29.6 bits (67), Expect = 2.6
 Identities = 26/147 (17%), Positives = 58/147 (39%), Gaps = 13/147 (8%)

Query: 277 LKSAVE-DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARM---FERIDKEKAELE 332
           LKS +   K+  + +  + + +E  + LK+ + EL      L  +    E ++ + ++LE
Sbjct: 6   LKSLLAIQKLDLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLE 65

Query: 333 RSISFLKEKETELDEILAVLT-----EKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDT 387
             I  ++E+    +E L+ +          +++  A      +  ++     E    E  
Sbjct: 66  SEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKE 125

Query: 388 IYYMGEALRRGVIDLEVYLKQVRSLSR 414
           I    E L+  +  LE  L +  +   
Sbjct: 126 I----EDLKERLERLEKNLAEAEARLE 148


>gnl|CDD|235360 PRK05183, hscA, chaperone protein HscA; Provisional.
          Length = 616

 Score = 35.9 bits (84), Expect = 0.039
 Identities = 16/39 (41%), Positives = 23/39 (58%), Gaps = 4/39 (10%)

Query: 269 TDEHI-KM---SLKSAVEDKIRRKMNEQKVQAQEEIEIL 303
           TD+ I +M   S+  A ED   R + EQKV+A+  +E L
Sbjct: 505 TDDEIARMLKDSMSHAEEDMQARALAEQKVEAERVLEAL 543


>gnl|CDD|191179 pfam05053, Menin, Menin.  MEN1, the gene responsible for multiple
           endocrine neoplasia type 1, is a tumour suppressor gene
           that encodes a protein called Menin which may be an
           atypical GTPase stimulated by nm23.
          Length = 618

 Score = 36.1 bits (83), Expect = 0.040
 Identities = 21/107 (19%), Positives = 37/107 (34%), Gaps = 15/107 (14%)

Query: 214 PSGSIYPSYPTP----GATPGAGGYSPYPLYPPGGST---------PATPYPVPTPSPAP 260
           PSG   P+   P     +       +         +T           T  P+P  S  P
Sbjct: 492 PSGGESPNPELPANNNNSNSNNNNNNGADRKEAAATTGNATTTSNGSGTSVPLPVSSEPP 551

Query: 261 AVTSGGTITDEHIKMS-LKS-AVEDKIRRKMNEQKVQAQEEIEILKQ 305
               G  IT    KM  +K   + +K+     + ++ AQ ++++  Q
Sbjct: 552 QHKEGPVITFYSEKMKGMKELLLAEKLNSSAIKLQLTAQSQVQMKVQ 598


>gnl|CDD|218107 pfam04484, DUF566, Family of unknown function (DUF566).  Family of
           related proteins that is plant specific.
          Length = 313

 Score = 35.7 bits (82), Expect = 0.040
 Identities = 33/181 (18%), Positives = 63/181 (34%), Gaps = 8/181 (4%)

Query: 172 MASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAA--APSGSIYPSYPTPGATP 229
           ++S+    + S  P   +   +P  + P       +    +   PS        +   +P
Sbjct: 25  LSSSFLSTSASSRPRRLNAPASPPSSSPARNTSSSSSFGLSKQRPSSLSRGRLSSRFVSP 84

Query: 230 GAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGTITDEHIKMSLKSAVEDKIRRKM 289
             G  S         +T +T     + SP+ +  +  +        S+ S + D  R K 
Sbjct: 85  SRGSPSAAASLNGSLATASTS---GSSSPSRSRRTTSSDLSSGNGPSVLSFMADVKRGKK 141

Query: 290 NEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEIL 349
              K    E+   L+   N L Q +F  AR    +  +K   E+ +     + +EL   +
Sbjct: 142 GPSK---IEDAHQLRLLYNRLLQWRFVNARAEAAMAVQKLNAEKQLFNAWLRISELRNSV 198

Query: 350 A 350
           A
Sbjct: 199 A 199


>gnl|CDD|169428 PRK08404, PRK08404, V-type ATP synthase subunit H; Validated.
          Length = 103

 Score = 33.6 bits (77), Expect = 0.043
 Identities = 27/91 (29%), Positives = 43/91 (47%), Gaps = 17/91 (18%)

Query: 282 EDKIRRKMNEQKV---QAQEEI-----EILKQTENELNQGKFKLARMFERIDKEKAELER 333
           E++I +   E K    +A+EE      EI+K+ E E         ++ E+  KE  E  +
Sbjct: 16  EERIEKAKEEAKKIIRKAKEEAKKIEEEIIKKAEEEAQ-------KLIEKKKKEGEEEAK 68

Query: 334 SISFLKEKETELDEILAVLTEKTEVDVDEAV 364
            I  L+E E E++E+     E  E  V EA+
Sbjct: 69  KI--LEEGEKEIEELKVKAEENFETAVSEAI 97


>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK.  Members of this
           family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
           chaperone system. All members of the seed alignment were
           taken from completely sequenced bacterial or archaeal
           genomes and (except for Mycoplasma sequence) found
           clustered with other genes of this systems. This model
           excludes DnaK homologs that are not DnaK itself, such as
           the heat shock cognate protein HscA (TIGR01991).
           However, it is not designed to distinguish among DnaK
           paralogs in eukaryotes. Note that a number of dnaK genes
           have shadow ORFs in the same reverse (relative to dnaK)
           reading frame, a few of which have been assigned
           glutamate dehydrogenase activity. The significance of
           this observation is unclear; lengths of such shadow ORFs
           are highly variable as if the presumptive protein
           product is not conserved [Protein fate, Protein folding
           and stabilization].
          Length = 595

 Score = 35.7 bits (83), Expect = 0.045
 Identities = 27/103 (26%), Positives = 50/103 (48%), Gaps = 15/103 (14%)

Query: 262 VTSGGTITDEHIKMSLKSA----VEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKL 317
           +T+   +++E I+  +K A     EDK R++  E +  A     +  Q E  L +   KL
Sbjct: 492 ITASSGLSEEEIERMVKEAEANAEEDKKRKEEIEARNNAD---SLAYQAEKTLKEAGDKL 548

Query: 318 ARMFERIDKEKAELERSISFLKE--KETELDEILAVLTEKTEV 358
                   +EK ++E++++ LKE  K  +++EI A   E  + 
Sbjct: 549 P------AEEKEKIEKAVAELKEALKGEDVEEIKAKTEELQQA 585


>gnl|CDD|217843 pfam04012, PspA_IM30, PspA/IM30 family.  This family includes PspA
           a protein that suppresses sigma54-dependent
           transcription. The PspA protein, a negative regulator of
           the Escherichia coli phage shock psp operon, is produced
           when virulence factors are exported through secretins in
           many Gram-negative pathogenic bacteria and its homologue
           in plants, VIPP1, plays a critical role in thylakoid
           biogenesis, essential for photosynthesis. Activation of
           transcription by the enhancer-dependent bacterial
           sigma(54) containing RNA polymerase occurs through ATP
           hydrolysis-driven protein conformational changes enabled
           by activator proteins that belong to the large AAA(+)
           mechanochemical protein family. It has been shown that
           PspA directly and specifically acts upon and binds to
           the AAA(+) domain of the PspF transcription activator.
          Length = 220

 Score = 34.7 bits (80), Expect = 0.066
 Identities = 19/115 (16%), Positives = 42/115 (36%), Gaps = 14/115 (12%)

Query: 266 GTITDEHIKMSLKSAVEDKIRRKM--NEQKVQAQE-EIEILKQTEN------ELNQ--GK 314
                   +++ + +  +++R+++   E K+Q  + +   LK           +N   G 
Sbjct: 98  KQAEALETQLTQQRSAVEQLRKQLAALETKIQQLKAKKTALKARLKAAKAQEAVNTSLGS 157

Query: 315 FKL---ARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTT 366
                    FERI+++  E E       E  +  D    +     +++V E V  
Sbjct: 158 ASTESATDSFERIEEKIEEREARADAAAELASAQDLDAKLEAAGIQMEVSEDVLA 212


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 35.2 bits (81), Expect = 0.069
 Identities = 19/74 (25%), Positives = 33/74 (44%)

Query: 274 KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELER 333
           +  +   + ++ +    E K +    IE L   E EL Q    L ++ +  +K K ELE+
Sbjct: 491 RYGIPHFIIEQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQ 550

Query: 334 SISFLKEKETELDE 347
            +  LKE+E     
Sbjct: 551 EMEELKERERNKKL 564



 Score = 29.0 bits (65), Expect = 6.8
 Identities = 17/73 (23%), Positives = 33/73 (45%), Gaps = 6/73 (8%)

Query: 291 EQKVQAQEEI-----EILKQTENELNQGKFKLARMFERIDKEKAELERSIS-FLKEKETE 344
               +  E+      ++LK+ E    + + ++  + ER   +K ELE+     LK  + E
Sbjct: 521 SALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKE 580

Query: 345 LDEILAVLTEKTE 357
           ++ I+  L EK  
Sbjct: 581 VESIIRELKEKKI 593


>gnl|CDD|218701 pfam05698, Trigger_C, Bacterial trigger factor protein (TF)
           C-terminus.  In the E. coli cytosol, a fraction of the
           newly synthesised proteins requires the activity of
           molecular chaperones for folding to the native state.
           The major chaperones implicated in this folding process
           are the ribosome-associated Trigger Factor (TF), and the
           DnaK and GroEL chaperones with their respective
           co-chaperones. Trigger Factor is an ATP-independent
           chaperone and displays chaperone and
           peptidyl-prolyl-cis-trans-isomerase (PPIase) activities
           in vitro. It is composed of at least three domains, an
           N-terminal domain which mediates association with the
           large ribosomal subunit, a central substrate binding and
           PPIase domain with homology to FKBP proteins, and a
           C-terminal domain of unknown function. The positioning
           of TF at the peptide exit channel, together with its
           ability to interact with nascent chains as short as 57
           residues renders TF a prime candidate for being the
           first chaperone that binds to the nascent polypeptide
           chains. This family represents the C-terminal region of
           the protein.
          Length = 162

 Score = 33.8 bits (78), Expect = 0.070
 Identities = 20/91 (21%), Positives = 42/91 (46%), Gaps = 10/91 (10%)

Query: 269 TDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEK 328
           ++E ++   K   E +++  +  +++  +E+IE+   T+ E+ +   +LA  +       
Sbjct: 78  SEEELREEFKEEAEKRVKLGLILEEIAKEEKIEV---TDEEIKEEIEELASQYG------ 128

Query: 329 AELERSISFLKEKETELDEILAVLTEKTEVD 359
            E E    F K K  +L  +   L E+  VD
Sbjct: 129 MEPEEVKEFYK-KNEQLSALKEDLLEEKAVD 158


>gnl|CDD|224802 COG1890, RPS1A, Ribosomal protein S3AE [Translation, ribosomal
           structure and biogenesis].
          Length = 214

 Score = 34.2 bits (79), Expect = 0.071
 Identities = 26/117 (22%), Positives = 43/117 (36%), Gaps = 27/117 (23%)

Query: 268 ITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKE 327
            T    K S K A    IR+ M           EI+++  +EL   +F    +  RI  E
Sbjct: 125 FTRRRAKTSQKRA----IRKIM----------FEIIEEKASELTFEEFVQELIPGRIAAE 170

Query: 328 KAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAAT 384
             E  + I  L++          V   K++V         A   + +L+  +EE  +
Sbjct: 171 IEEAAKKIYPLRK----------VEIRKSKV---LKEPKEAEPEQAVLHGESEEVGS 214


>gnl|CDD|216833 pfam01991, vATP-synt_E, ATP synthase (E/31 kDa) subunit.  This
           family includes the vacuolar ATP synthase E subunit, as
           well as the archaebacterial ATP synthase E subunit.
          Length = 195

 Score = 33.9 bits (78), Expect = 0.085
 Identities = 25/113 (22%), Positives = 49/113 (43%), Gaps = 13/113 (11%)

Query: 281 VEDKIRRKMNEQKVQAQEEIEILK-----QTENELNQ---GKFKLARMFERIDKEKAELE 332
           +  +   K  E + +A+EE EI K     + E ++ +    K K A M ++I    A+ E
Sbjct: 2   IRQEAEEKAEEIRAEAEEEFEIEKAEAVEEAEKKIEEIYEKKEKQAEMEKQIIISNAKNE 61

Query: 333 RSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATE 385
             +  L  +E  LD +     E+ +  +   ++     YK +L    ++A  +
Sbjct: 62  ARLKVLNAREELLDSVF----EEAKERLAN-LSEDKDEYKDLLKDLIKQALVK 109


>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
           sigma54-dependent transcription [Transcription / Signal
           transduction mechanisms].
          Length = 225

 Score = 34.2 bits (79), Expect = 0.089
 Identities = 27/100 (27%), Positives = 41/100 (41%), Gaps = 11/100 (11%)

Query: 270 DEHIKMSLKSAVEDKI-RRKMNEQKV-QAQEEIEIL-KQTENELNQGKFKLARMFERIDK 326
            E  K   + A+   I R+K  E+K+ +AQ   E L ++ E  L  G   LAR       
Sbjct: 38  SELAK--ARQALAQAIARQKQLERKLEEAQARAEKLEEKAELALQAGNEDLAREALE--- 92

Query: 327 EKAELERSISFLKEKETELDEILAVLT---EKTEVDVDEA 363
           EK  LE     L+ +  + +E +  L       E  + E 
Sbjct: 93  EKQSLEDLAKALEAELQQAEEQVEKLKKQLAALEQKIAEL 132


>gnl|CDD|219133 pfam06682, DUF1183, Protein of unknown function (DUF1183).  This
           family consists of several eukaryotic proteins of around
           360 residues in length. The function of this family is
           unknown.
          Length = 317

 Score = 34.3 bits (79), Expect = 0.093
 Identities = 22/130 (16%), Positives = 29/130 (22%), Gaps = 25/130 (19%)

Query: 154 YSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAA 213
           Y                      P P     P   S+   P          GY      +
Sbjct: 194 YGGGGGGGGGGGGGGGSG---PGPPP-----PGFKSS-FPPPYGPGAGPSSGYGSGGTRS 244

Query: 214 PSGSIYPSYPTPGATPGAGGY-----------SPYPLYPPGGST--PATPYPVPTPSPAP 260
             G   P + T     GA GY                     S    ++     + S + 
Sbjct: 245 GQGGWGPGFWTGLGAGGALGYLFGSRRNNNSSYGRSYGSGSPSYSPSSSSNSSSSSSSSS 304

Query: 261 AVTS---GGT 267
             TS   GGT
Sbjct: 305 TRTSSGFGGT 314


>gnl|CDD|233673 TIGR01991, HscA, Fe-S protein assembly chaperone HscA.  The Heat
           Shock Cognate proteins HscA and HscB act together as
           chaperones. HscA resembles DnaK but belongs in a
           separate clade. The apparent function is to aid assembly
           of iron-sulfur cluster proteins. Homologs from Buchnera
           and Wolbachia are clearly in the same clade but are
           highly derived and score lower than some examples of
           DnaK [Protein fate, Protein folding and stabilization].
          Length = 599

 Score = 34.6 bits (80), Expect = 0.10
 Identities = 17/43 (39%), Positives = 25/43 (58%), Gaps = 4/43 (9%)

Query: 267 TITDEHI-KM---SLKSAVEDKIRRKMNEQKVQAQEEIEILKQ 305
            ++DE I +M   S K A ED   R + EQKV+A+  +E L+ 
Sbjct: 487 GLSDEEIERMLKDSFKHAEEDMYARALAEQKVEAERILEALQA 529


>gnl|CDD|216868 pfam02084, Bindin, Bindin. 
          Length = 239

 Score = 34.1 bits (78), Expect = 0.11
 Identities = 20/86 (23%), Positives = 24/86 (27%), Gaps = 4/86 (4%)

Query: 188 GSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTP---GATPGAGGYSPYPLYPPGG 244
           G+  N P    P      Y      A  G        P   G  PGAGG +P      GG
Sbjct: 2   GNMQNYPQAMNPQMGGGNYPAPGQPAQQGYANQGMGGPVGGGGGPGAGGGAPGG-PVGGG 60

Query: 245 STPATPYPVPTPSPAPAVTSGGTITD 270
              +   P        A  +     D
Sbjct: 61  GGGSGGPPGGGEVAGEAEDAMSEFDD 86



 Score = 32.2 bits (73), Expect = 0.43
 Identities = 25/108 (23%), Positives = 29/108 (26%), Gaps = 13/108 (12%)

Query: 170 GNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATP 229
           GNM +   YP      M G N    YP     A  GY       P G         GA  
Sbjct: 2   GNMQN---YPQAMNPQMGGGN----YPAPGQPAQQGYANQGMGGPVGGGGGPGAGGGAPG 54

Query: 230 GAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTS------GGTITDE 271
           G  G        P G            S     +S        TI+ +
Sbjct: 55  GPVGGGGGGSGGPPGGGEVAGEAEDAMSEFDDYSSSSIEEGDTTISAD 102


>gnl|CDD|204194 pfam09304, Cortex-I_coil, Cortexillin I, coiled coil.  Members of
           this family are predominantly found in the
           actin-bundling protein Cortexillin I from Dictyostelium
           discoideum. They adopt a structure consisting of an
           18-heptad-repeat alpha-helical coiled-coil, and are a
           prerequisite for the assembly of Cortexillin I.
          Length = 107

 Score = 32.2 bits (73), Expect = 0.12
 Identities = 22/100 (22%), Positives = 47/100 (47%), Gaps = 4/100 (4%)

Query: 286 RRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETEL 345
           + ++  +    +  +E  K ++ EL + K +L  +   ++ +KAE E+    L+E + ++
Sbjct: 11  KNELANKLAGLENSLESEKASQEELAKQKDQLRSLLASLEAQKAEREQ---RLRELQAKI 67

Query: 346 DEILAVLTEKTEVDVD-EAVTTTAPIYKQILNTFTEEAAT 384
           DE+L  L  +    ++ E+  +     K IL     EA  
Sbjct: 68  DELLKNLELEKLAKMELESRLSKTEKDKAILELKLAEAKD 107


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 34.2 bits (79), Expect = 0.12
 Identities = 26/105 (24%), Positives = 46/105 (43%), Gaps = 11/105 (10%)

Query: 283 DKIRRKMNEQKVQAQEEIE-ILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEK 341
           D++ +K+ E K + +EE+E  LK+   EL     KL        +  A LE   + L+++
Sbjct: 173 DQLSKKLAELKAEEEEELERALKEKREELLS---KLEE------ELLARLESKEAALEKQ 223

Query: 342 -ETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATE 385
              E +     L +K E  + + +   A  ++Q L       A E
Sbjct: 224 LRLEFEREKEELRKKYEEKLRQELERQAEAHEQKLKNELALQAIE 268



 Score = 33.8 bits (78), Expect = 0.17
 Identities = 22/78 (28%), Positives = 39/78 (50%), Gaps = 9/78 (11%)

Query: 280 AVEDKIRRKMNE---QKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSIS 336
             ++++R+K  E   Q+++ Q E    ++ +NEL     +L R F +  KEK E ER+  
Sbjct: 230 REKEELRKKYEEKLRQELERQAEA-HEQKLKNELALQAIELQREFNKEIKEKVEEERN-G 287

Query: 337 FLKEKETELDEILAVLTE 354
            L     +L E+ + L  
Sbjct: 288 RLA----KLAELNSRLKG 301


>gnl|CDD|217933 pfam04156, IncA, IncA protein.  Chlamydia trachomatis is an
           obligate intracellular bacterium that develops within a
           parasitophorous vacuole termed an inclusion. The
           inclusion is non-fusogenic with lysosomes but intercepts
           lipids from a host cell exocytic pathway. Initiation of
           chlamydial development is concurrent with modification
           of the inclusion membrane by a set of C.
           trachomatis-encoded proteins collectively designated
           Incs. One of these Incs, IncA, is functionally
           associated with the homotypic fusion of inclusions. This
           family probably includes members of the wider Inc family
           rather than just IncA.
          Length = 186

 Score = 33.6 bits (77), Expect = 0.12
 Identities = 21/97 (21%), Positives = 43/97 (44%), Gaps = 5/97 (5%)

Query: 286 RRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETEL 345
               + +  + +E    L + + +L++    L    E +++  AELE  +  LKE    L
Sbjct: 64  APVQSVRPQKLEELQGELSELKQQLSE----LQEELEDLEERIAELESELEDLKEDLQLL 119

Query: 346 DEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEA 382
            E+L  L E+    ++E++   A   +++     EE 
Sbjct: 120 RELLKSLEER-LESLEESIKELAKELRELRQDLREEV 155


>gnl|CDD|202045 pfam01920, Prefoldin_2, Prefoldin subunit.  This family includes
           prefoldin subunits that are not detected by pfam02996.
          Length = 106

 Score = 32.2 bits (74), Expect = 0.12
 Identities = 14/73 (19%), Positives = 32/73 (43%), Gaps = 8/73 (10%)

Query: 291 EQKVQAQEEIEILKQTENELNQGK------FKLARMFERIDKEKA--ELERSISFLKEKE 342
           +Q  Q + +++ ++    EL            +  +  + DKE+   ELE     L+++ 
Sbjct: 19  QQIKQLETQLKEIELVLEELELLDEDTKVYKLIGDVLVKQDKEEVKEELEERKETLEKEI 78

Query: 343 TELDEILAVLTEK 355
             L++ L  L ++
Sbjct: 79  KTLEKQLEKLEKE 91


>gnl|CDD|218167 pfam04599, Pox_G5, Poxvirus G5 protein.  This protein has been
           predicted to be related to the FEN-1 endonuclease.
          Length = 426

 Score = 34.2 bits (79), Expect = 0.12
 Identities = 24/94 (25%), Positives = 40/94 (42%), Gaps = 16/94 (17%)

Query: 274 KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELER 333
            + +K ++ +K +  +     +  EEIE L+     L+        M+E I   K +LE 
Sbjct: 76  SIDIKESLREKRKNSLKNTSKRKTEEIENLEDAIKNLDVEDE----MYEEI---KTDLEL 128

Query: 334 SIS------FLKEK---ETELDEILAVLTEKTEV 358
            I       FL  K   +  L+EIL+ L E   +
Sbjct: 129 RIDKLSFHNFLANKSNLKITLEEILSSLKEDVTI 162


>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 193

 Score = 33.5 bits (77), Expect = 0.12
 Identities = 22/104 (21%), Positives = 40/104 (38%), Gaps = 10/104 (9%)

Query: 275 MSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERS 334
           MS K    ++   +  E+    + E E  ++ E E      +     + I + +A+LE  
Sbjct: 1   MSDKEQKTEEPDAEETEE--AEKSEEEEAEEEEPEEENELEEEQ---QEIAELEAQLEE- 54

Query: 335 ISFLKEKETELDEILAVLTEKTEVDVDEAVTT-TAPIYKQILNT 377
              LK+K          L ++TE + +EA         K +L  
Sbjct: 55  ---LKDKYLRAQAEFENLRKRTEREREEAKKYAIEKFAKDLLPV 95


>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein.  The WWbp domain is
           characterized by several short PY and PT-like motifs of
           the PPPPY form. These appear to bind directly to the WW
           domains of WWP1 and WWP2 and other such diverse proteins
           as dystrophin and YAP (Yes-associated protein). This is
           the WW-domain binding protein WWbp via PY and PY_like
           motifs. The presence of a phosphotyrosine residue in the
           pWBP-1 peptide abolishes WW domain binding which
           suggests a potential regulatory role for tyrosine
           phosphorylation in modulating WW domain-ligand
           interactions. Given the likelihood that WWP1 and WWP2
           function as E3 ubiquitin-protein ligases, it is possible
           that initial substrate-specific recognition occurs via
           WW domain-substrate protein interaction followed by
           ubiquitin transfer and subsequent proteolysis. This
           domain lies just downstream of the GRAM (pfam02893) in
           many members.
          Length = 111

 Score = 32.4 bits (74), Expect = 0.13
 Identities = 25/77 (32%), Positives = 26/77 (33%), Gaps = 15/77 (19%)

Query: 183 YMPMPGSNVNTPYPTYPPSAYP---GYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPL 239
           Y P        P P Y   A P   G  P    AP G  YP+ P P       GY   P 
Sbjct: 47  YPPPGAYVHLEPLPAYGQYAAPPPYGPPPPYYPAPPGV-YPTPPPPN-----SGYMADPQ 100

Query: 240 YPPGGSTPATPYPVPTP 256
            PP       PYP P  
Sbjct: 101 EPP------PPYPGPPQ 111


>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
          Length = 421

 Score = 34.3 bits (78), Expect = 0.13
 Identities = 27/123 (21%), Positives = 33/123 (26%), Gaps = 12/123 (9%)

Query: 138 GLIKTLASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPT 197
            +  + A   S+     S+     PA  P+            TQS      S V      
Sbjct: 303 AVTPSSAVTQSSAITPSSAAIPS-PAVIPS---------SVTTQSATTTQASAVALSSAG 352

Query: 198 YPPS--AYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPT 255
             PS    PG     AA P         T      + G        P  S PA P     
Sbjct: 353 VLPSDVTLPGTVALPAAEPVNMQPQPMSTTETQQSSTGNITSTANGPTTSLPAAPASNIP 412

Query: 256 PSP 258
            SP
Sbjct: 413 VSP 415



 Score = 32.4 bits (73), Expect = 0.43
 Identities = 22/93 (23%), Positives = 30/93 (32%), Gaps = 5/93 (5%)

Query: 158 TKQEPATNPTYNGNMA-SNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSG 216
               P    T +  +  S+A  P+ + +P P      P      SA       A A  S 
Sbjct: 296 HGTVPVAAVTPSSAVTQSSAITPSSAAIPSPAVI---PSSVTTQSATTTQAS-AVALSSA 351

Query: 217 SIYPSYPTPGATPGAGGYSPYPLYPPGGSTPAT 249
            + PS  T   T       P  + P   ST  T
Sbjct: 352 GVLPSDVTLPGTVALPAAEPVNMQPQPMSTTET 384


>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail.  The myosin molecule is a
           multi-subunit complex made up of two heavy chains and
           four light chains it is a fundamental contractile
           protein found in all eukaryote cell types. This family
           consists of the coiled-coil myosin heavy chain tail
           region. The coiled-coil is composed of the tail from two
           molecules of myosin. These can then assemble into the
           macromolecular thick filament. The coiled-coil region
           provides the structural backbone the thick filament.
          Length = 859

 Score = 34.2 bits (79), Expect = 0.14
 Identities = 20/84 (23%), Positives = 44/84 (52%), Gaps = 6/84 (7%)

Query: 277 LKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERS-- 334
           L++   +++++K+N++  + +E  E      + L + K +L      ++  + ELER+  
Sbjct: 314 LRAEELEELKKKLNQKISELEEAAEAANAKCDSLEKTKSRLQ---SELEDLQIELERANA 370

Query: 335 -ISFLKEKETELDEILAVLTEKTE 357
             S L++K+   D+ILA    K +
Sbjct: 371 AASELEKKQKNFDKILAEWKRKVD 394



 Score = 32.3 bits (74), Expect = 0.57
 Identities = 20/75 (26%), Positives = 36/75 (48%), Gaps = 7/75 (9%)

Query: 269 TDEHIKM--SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDK 326
            +E  K    L+  V DK++ K+   K Q +E  E+ +   ++  + + +L    ER D 
Sbjct: 787 VEEDKKNLERLQDLV-DKLQAKIKTYKRQLEEAEEVAQINLSKYRKAQRELEDAEERAD- 844

Query: 327 EKAELERSISFLKEK 341
                ERS++ L+ K
Sbjct: 845 ---TAERSLNKLRAK 856



 Score = 31.6 bits (72), Expect = 0.87
 Identities = 21/74 (28%), Positives = 34/74 (45%), Gaps = 7/74 (9%)

Query: 275 MSLKSAVE---DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAEL 331
             LK+ +E   D++     E K   Q+EI  L     E  +   +L +   R++ EK EL
Sbjct: 415 FRLKNELEELKDQVEALRRENK-NLQDEIHDLTDQLGEGGRNVHELEKARRRLEAEKDEL 473

Query: 332 ERSISFLKEKETEL 345
           + +   L+E E  L
Sbjct: 474 QAA---LEEAEAAL 484



 Score = 30.0 bits (68), Expect = 3.4
 Identities = 18/81 (22%), Positives = 39/81 (48%), Gaps = 4/81 (4%)

Query: 280 AVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSIS-FL 338
              +K  +++  Q  + Q +++ L++  N+L   K +L      + ++  E E  +S   
Sbjct: 176 LNAEKKAKQLESQLSELQVKLDELQRQLNDLTSQKSRLQSENSDLTRQLEEAEAQVSNLS 235

Query: 339 KEK---ETELDEILAVLTEKT 356
           K K   E++L+E    L E++
Sbjct: 236 KLKSQLESQLEEAKRSLEEES 256


>gnl|CDD|204784 pfam11932, DUF3450, Protein of unknown function (DUF3450).  This
           family of proteins are functionally uncharacterized.
           This protein is found in bacteria and eukaryotes.
           Proteins in this family are about 260 amino acids in
           length.
          Length = 250

 Score = 33.7 bits (78), Expect = 0.14
 Identities = 28/139 (20%), Positives = 54/139 (38%), Gaps = 29/139 (20%)

Query: 288 KMNEQKVQAQEEIE-ILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETEL- 345
           K  +Q  +   EI  +LK+ EN       +L R+     +E A L++ I  +++    L 
Sbjct: 47  KWADQTQELLAEIRQLLKEIENLRVYND-QLQRLVANQQQEIASLQQQIEQIEKTRQGLV 105

Query: 346 ---DEILAVLTEKTEVDV------------------DEAVTTTAPIYKQILNTFTEEAAT 384
                ++  L +   +D+                  D A  + A  ++++L  +  EA  
Sbjct: 106 PLMYRMIDGLEQFVALDLPFLKEERLARLARLRDLMDRADVSLAEKFRRVLEAYQIEADY 165

Query: 385 EDTIYYMGEALRRGVIDLE 403
             TI      + +G I L+
Sbjct: 166 GRTI-----EVYQGEITLD 179


>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
          Length = 977

 Score = 34.1 bits (78), Expect = 0.15
 Identities = 24/118 (20%), Positives = 46/118 (38%), Gaps = 10/118 (8%)

Query: 256 PSPAPAVTSGGTITD----EHIKMSLKSAVEDKIRRKMNEQ--KVQAQEEIEILKQTENE 309
                  TS     +    +  +       E +   K  EQ    Q ++ + +++  E  
Sbjct: 84  MELPQKSTSSDDDHNRASMQRDEAIAAIDNEQQTNSKDGEQLSDFQLEDLVGMIQNAEKN 143

Query: 310 ---LNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLT-EKTEVDVDEA 363
              LNQ + +     E+I  EK  L+  I+ L+ + +E D  + +   EK  V++ E 
Sbjct: 144 ILLLNQARLQALEDLEKILTEKEALQGKINILEMRLSETDARIKLAAQEKIHVEILEE 201


>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein.  Hsp70 chaperones help to fold
           many proteins. Hsp70 assisted folding involves repeated
           cycles of substrate binding and release. Hsp70 activity
           is ATP dependent. Hsp70 proteins are made up of two
           regions: the amino terminus is the ATPase domain and the
           carboxyl terminus is the substrate binding region.
          Length = 598

 Score = 34.1 bits (79), Expect = 0.15
 Identities = 27/99 (27%), Positives = 50/99 (50%), Gaps = 15/99 (15%)

Query: 262 VTSGGTITDEHI-KM---SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKL 317
           +T+   ++D+ I +M   + + A EDK R++  E K +A+E +      E  L +   KL
Sbjct: 495 ITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEAEEYVY---SLEKSLKEEGDKL 551

Query: 318 ARMFERIDKEKAELERSISFLKE--KETELDEILAVLTE 354
                  + +K ++E +I +LKE  +  + +EI A   E
Sbjct: 552 P------EADKKKVEEAIEWLKEELEGEDKEEIEAKTEE 584


>gnl|CDD|220596 pfam10138, Tellurium_res, Tellurium resistance protein.  Members of
           this family confer resistance to the metalloid element
           tellurium and its salts.
          Length = 98

 Score = 31.6 bits (72), Expect = 0.15
 Identities = 15/50 (30%), Positives = 20/50 (40%)

Query: 241 PPGGSTPATPYPVPTPSPAPAVTSGGTITDEHIKMSLKSAVEDKIRRKMN 290
           P     PA P P P P+  P   S  T+T E   +SL          ++N
Sbjct: 4   PVPPPAPAPPAPAPPPAAPPVSLSKITLTKEGPSVSLTKQGGGFGTIRVN 53


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 33.5 bits (77), Expect = 0.16
 Identities = 17/72 (23%), Positives = 38/72 (52%), Gaps = 9/72 (12%)

Query: 283 DKIRRK---MNEQKVQAQEEI----EILKQTENELNQGKFKLARMFERIDKEKAELERSI 335
           D++R++   ++E+ V+  ++I    E  +  +NEL + + K+  +  R  ++ A+     
Sbjct: 203 DELRKEADELHEEFVELSKKIDELHEEFRNLQNELRELEKKIKAL--RAKEKAAKRREKR 260

Query: 336 SFLKEKETELDE 347
             LKE+  E+ E
Sbjct: 261 EELKERAEEIYE 272



 Score = 32.3 bits (74), Expect = 0.45
 Identities = 14/85 (16%), Positives = 31/85 (36%), Gaps = 2/85 (2%)

Query: 270 DEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKA 329
           +E +K        D++++K  E   + QE     ++   E+ +   +   + +  D+   
Sbjct: 157 NEKLKELKAEI--DELKKKAREIHEKIQELANEAQEYHEEMIKLFEEADELRKEADELHE 214

Query: 330 ELERSISFLKEKETELDEILAVLTE 354
           E       + E   E   +   L E
Sbjct: 215 EFVELSKKIDELHEEFRNLQNELRE 239



 Score = 31.6 bits (72), Expect = 0.68
 Identities = 17/59 (28%), Positives = 31/59 (52%), Gaps = 3/59 (5%)

Query: 291 EQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEIL 349
            ++ Q +EEIE LK+  +EL +   + + + E+ D+  A++       +E   E DEI 
Sbjct: 13  LKRKQLKEEIEELKEKRDELRK---EASELAEKRDELNAKVRELREKAQELREERDEIN 68



 Score = 30.4 bits (69), Expect = 1.6
 Identities = 19/86 (22%), Positives = 34/86 (39%), Gaps = 11/86 (12%)

Query: 275 MSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELE-- 332
             L    E ++ +K+ E     ++E+E  K+   E      KL  +   ID+ K +    
Sbjct: 126 SVLTPEEERELVQKIKE----LRKELEDAKKALEE----NEKLKELKAEIDELKKKAREI 177

Query: 333 -RSISFLKEKETELDEILAVLTEKTE 357
              I  L  +  E  E +  L E+ +
Sbjct: 178 HEKIQELANEAQEYHEEMIKLFEEAD 203



 Score = 30.4 bits (69), Expect = 1.9
 Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 10/76 (13%)

Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERID---KEKAELERSISF-- 337
           D++  K+ E + +AQE  E   +   E+ + K K   +  ++    KE  EL+   +   
Sbjct: 44  DELNAKVRELREKAQELREERDEINEEVQELKEKRDEINAKLQELRKEYRELKEKRNEFN 103

Query: 338 -----LKEKETELDEI 348
                +K  E E++ +
Sbjct: 104 LGGRSIKSLEREIERL 119


>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
           interacts with SMC proteins [Cell division and
           chromosome partitioning].
          Length = 622

 Score = 33.8 bits (77), Expect = 0.17
 Identities = 22/109 (20%), Positives = 45/109 (41%), Gaps = 26/109 (23%)

Query: 270 DEHIKMSLKSAVEDKIRR------KMNEQKVQAQEEIEILKQTENEL------------- 310
           D +   +  +A++ K +       K+  +    +EEI+ L+   +EL             
Sbjct: 310 DSNKYENYVNAMKQKSQEWPGKLEKLKSEIELKEEEIKALQSNIDELHKQLRKQGISTEQ 369

Query: 311 ----NQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEK 355
               NQ + KL R  ++I+ +  +L +S+   K ++ E   I   L + 
Sbjct: 370 FELMNQEREKLTRELDKINIQSDKLTKSV---KSRKLEAQGIFKSLEKT 415



 Score = 28.8 bits (64), Expect = 6.2
 Identities = 16/86 (18%), Positives = 41/86 (47%), Gaps = 2/86 (2%)

Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERI--DKEKAELERSISFLKE 340
           DK+ + +  +K++AQ   + L++T  + +     + R   +I  +   + L+ +I  L  
Sbjct: 392 DKLTKSVKSRKLEAQGIFKSLEKTLRQYDSLIQNITRSRSQIGHNVNDSSLKINIEQLFP 451

Query: 341 KETELDEILAVLTEKTEVDVDEAVTT 366
           K + ++E +     +   ++ E + T
Sbjct: 452 KGSGINESIKKSILELNDEIQERIKT 477


>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
           subunit 62; Provisional.
          Length = 576

 Score = 33.7 bits (77), Expect = 0.17
 Identities = 28/112 (25%), Positives = 36/112 (32%), Gaps = 14/112 (12%)

Query: 162 PATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYP---PSAYPGYNPNAAAAPSGSI 218
           P+     N      A    +   P+      +PY  Y    P   P        +PS S 
Sbjct: 414 PSPGSASNVPEVEPAQVEAKKTRPL------SPYARYEDLKPPTSPSPTAPTGVSPSVSS 467

Query: 219 YPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPV-----PTPSPAPAVTSG 265
             S P    T  A   +     PP    P +PY V     P  SP+PA   G
Sbjct: 468 TSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVG 519


>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 429

 Score = 33.7 bits (78), Expect = 0.17
 Identities = 23/96 (23%), Positives = 44/96 (45%), Gaps = 13/96 (13%)

Query: 279 SAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQ--------GKFKLARMFERIDKEKAE 330
           +   DK+  +++E++ +   E+E L+   NEL++        G+     +   + + K +
Sbjct: 25  ALDVDKLL-ELDEERRKLLRELEELQAERNELSKEIGRALKRGEDDAEELIAEVKELKEK 83

Query: 331 LERSISFLKEKETELDEILAVL----TEKTEVDVDE 362
           L+   + L E E ELD +L  +     E   V  DE
Sbjct: 84  LKELEAALDELEAELDTLLLTIPNIPHESVPVGKDE 119


>gnl|CDD|135898 PRK06397, PRK06397, V-type ATP synthase subunit H; Validated.
          Length = 111

 Score = 31.8 bits (72), Expect = 0.18
 Identities = 15/81 (18%), Positives = 32/81 (39%), Gaps = 7/81 (8%)

Query: 276 SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELE-RS 334
           S+   + +    + NE K    +  E  K+TE E       L      + + + E E ++
Sbjct: 21  SIDKEIANIKNEQENEIKEAKSKYEEKAKKTEEES------LNMYNAALMEARKEAEKKA 74

Query: 335 ISFLKEKETELDEILAVLTEK 355
           +  + + + E   I   ++E 
Sbjct: 75  VEIINKAKQEASLIKLKISED 95


>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
          Length = 1228

 Score = 34.1 bits (79), Expect = 0.19
 Identities = 15/58 (25%), Positives = 22/58 (37%), Gaps = 2/58 (3%)

Query: 205 GYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAV 262
            Y P + AAP+ +   +     A+  A   +      P  + PA   P   P PA A 
Sbjct: 35  DYGPGSTAAPTAA--AAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAA 90



 Score = 32.6 bits (75), Expect = 0.57
 Identities = 15/63 (23%), Positives = 17/63 (26%)

Query: 199 PPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSP 258
           P S        AAAA + S   + P   A        P    P     PA          
Sbjct: 38  PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPA 97

Query: 259 APA 261
           AP 
Sbjct: 98  APP 100



 Score = 31.0 bits (71), Expect = 1.3
 Identities = 13/61 (21%), Positives = 20/61 (32%)

Query: 204 PGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVT 263
           PG      AA + +   +     A       +P P  P   +  A P P    + A A  
Sbjct: 38  PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPA 97

Query: 264 S 264
           +
Sbjct: 98  A 98



 Score = 29.9 bits (68), Expect = 3.4
 Identities = 17/81 (20%), Positives = 27/81 (33%)

Query: 194 PYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPV 253
           P  T  P+A       AA+AP+ +     P   A       +P     P  +  A   P 
Sbjct: 38  PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPA 97

Query: 254 PTPSPAPAVTSGGTITDEHIK 274
             P+ A A        ++ + 
Sbjct: 98  APPAAAAAAAPAAAAVEDEVT 118


>gnl|CDD|130214 TIGR01144, ATP_synt_b, ATP synthase, F0 subunit b.  This model
           describes the F1/F0 ATP synthase b subunit in bacteria
           only. Scoring just below the trusted cutoff are the
           N-terminal domains of Mycobacterial b/delta fusion
           proteins and a subunit from an archaeon, Methanosarcina
           barkeri, in which the ATP synthase homolog differs in
           architecture and is not experimentally confirmed. This
           model helps resolve b from the related b' subunit.
           Within the family is an example from a
           sodium-translocating rather than proton-translocating
           ATP synthase [Energy metabolism, ATP-proton motive force
           interconversion].
          Length = 147

 Score = 32.4 bits (74), Expect = 0.20
 Identities = 23/126 (18%), Positives = 47/126 (37%), Gaps = 22/126 (17%)

Query: 271 EHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 330
           + I   L SA   K            +E     K+ +  L + K +   + E  +K  +E
Sbjct: 29  KKIADGLASAERAK------------KEAALAQKKAQVILKEAKDEAQEIIENANKRGSE 76

Query: 331 LERSISFLKEKETELDEILA-----VLTEKTEV--DVDEAVTTTA-PIYKQILNTFTEEA 382
           +        E   E ++I A     +  EK +   ++ + V   +    ++I+    ++ 
Sbjct: 77  ILEEA--KAEAREEREKIKAQARAEIEAEKEQAREELRKQVADLSVLGAEKIIERNIDKQ 134

Query: 383 ATEDTI 388
           A +D I
Sbjct: 135 AQKDLI 140


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 33.6 bits (78), Expect = 0.20
 Identities = 17/70 (24%), Positives = 36/70 (51%), Gaps = 4/70 (5%)

Query: 286 RRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISF-LKEKETE 344
            R++ ++  +A+  ++  ++ + EL +   K  ++ E  DK   E E+     +KE + E
Sbjct: 529 ERELEQKAEEAEALLKEAEKLKEELEE---KKEKLQEEEDKLLEEAEKEAQQAIKEAKKE 585

Query: 345 LDEILAVLTE 354
            DEI+  L +
Sbjct: 586 ADEIIKELRQ 595



 Score = 32.5 bits (75), Expect = 0.45
 Identities = 12/81 (14%), Positives = 34/81 (41%), Gaps = 8/81 (9%)

Query: 278 KSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISF 337
           K  ++++  + + E + +AQ+ I+  K+  +E+      +  + +      A ++     
Sbjct: 557 KEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEI------IKELRQLQKGGYASVKAHE-- 608

Query: 338 LKEKETELDEILAVLTEKTEV 358
           L E    L++      +K + 
Sbjct: 609 LIEARKRLNKANEKKEKKKKK 629


>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
           2, 3, 4 family.  These eukaryotic proteins recognize the
           poly-A of mRNA and consists of four tandem RNA
           recognition domains at the N-terminus (rrm: pfam00076)
           followed by a PABP-specific domain (pfam00658) at the
           C-terminus. The protein is involved in the transport of
           mRNA's from the nucleus to the cytoplasm. There are four
           paralogs in Homo sapiens which are expressed in testis
           (GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
           broadly expressed (SP:P11940_PABP1) and of unknown
           tissue range (SP:Q15097_PABP2).
          Length = 562

 Score = 33.6 bits (77), Expect = 0.21
 Identities = 38/189 (20%), Positives = 60/189 (31%), Gaps = 21/189 (11%)

Query: 181 QSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPL- 239
               PM G+    PY    P     +N      P  S+ P+   PG      G +P    
Sbjct: 386 PMGSPMGGAMGQPPYYGQGPQQQ--FNGQPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAV 443

Query: 240 -YPPGGSTPATPYPVPTPSPA-PAVTSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQ 297
             P   +  A   P   P    P   S     D     S  S       +    +K+   
Sbjct: 444 RAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTAS-------QGGQNKKLAQV 496

Query: 298 EEIEILKQTENELNQGKFKLARMFERIDKEKAELERSIS--FLKEKETELDEILAVLTEK 355
                 +  +  L +      R+F  ++  +  L   I+   L+   +EL  +L    E 
Sbjct: 497 LASATPQMQKQVLGE------RLFPLVEAIEPALAAKITGMLLEMDNSELLHLLES-PEL 549

Query: 356 TEVDVDEAV 364
            +  VDEA+
Sbjct: 550 LKSKVDEAL 558


>gnl|CDD|227315 COG4982, COG4982, 3-oxoacyl-[acyl-carrier protein].
          Length = 866

 Score = 33.7 bits (77), Expect = 0.22
 Identities = 18/69 (26%), Positives = 27/69 (39%), Gaps = 1/69 (1%)

Query: 235 SPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGTITDEHIKMSLKSAVEDKIRRKMNEQKV 294
           +  P        PA   P P  + APA  +G    DE  K S        ++ K    ++
Sbjct: 16  ATPPAPAASAPAPAAAAPAPVAAAAPA-AAGPRPDDEPFKASDALHALVALKLKKRIDQI 74

Query: 295 QAQEEIEIL 303
           +A + IE L
Sbjct: 75  EALDSIEDL 83


>gnl|CDD|232931 TIGR00344, alaS, alanine--tRNA ligase.  The model describes
           alanine--tRNA ligase. This enzyme catalyzes the reaction
           (tRNAala + L-alanine + ATP = L-alanyl-tRNAala +
           pyrophosphate + AMP) [Protein synthesis, tRNA
           aminoacylation].
          Length = 851

 Score = 33.5 bits (77), Expect = 0.22
 Identities = 19/96 (19%), Positives = 37/96 (38%), Gaps = 12/96 (12%)

Query: 268 ITDEHIKMSLKSAVEDKIRR----KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFER 323
           I  E       S +   +RR      N       E+ + LK+  + L     +L +  ER
Sbjct: 678 IVKE-------SGIAAGVRRIEAVTGNAAIEYLNEQEDKLKELADILKVTPNELPKKIER 730

Query: 324 IDKEKAELERSISFLKEKETELDEILAVLTEKTEVD 359
             +E   L++ +  LK K  +  +I  +  +   ++
Sbjct: 731 FFEEWKALQKELESLKLKIADQ-KIDELKDKAETIN 765


>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
           production and conversion].
          Length = 660

 Score = 33.5 bits (77), Expect = 0.24
 Identities = 21/88 (23%), Positives = 38/88 (43%), Gaps = 2/88 (2%)

Query: 276 SLKSAVEDKIRRKMNEQKVQAQ--EEIEILKQTENELNQGKFKLARMFERIDKEKAELER 333
           SL S V D +R     +       EE+E L+     L +      +    +++   +LE 
Sbjct: 61  SLLSEVLDYLRSVKGLEGRLFILPEEVEKLEAELKSLEEVIKPAEKFSSEVEELTRKLEE 120

Query: 334 SISFLKEKETELDEILAVLTEKTEVDVD 361
            +S L E+  +L+++L  L     +D D
Sbjct: 121 RLSELDEELEDLEDLLEELEPLAYLDFD 148


>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
           glycoprotein G. 
          Length = 408

 Score = 33.1 bits (75), Expect = 0.25
 Identities = 25/130 (19%), Positives = 40/130 (30%), Gaps = 16/130 (12%)

Query: 155 SSRTKQEPA--TNPTYNGNMASNAPYPTQ-----------SYMPMPGSNVNTPYPTYPPS 201
           +S+T   P   ++PTY   +   +   TQ           S     G+   T      P 
Sbjct: 174 TSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTSSNPEPQ 233

Query: 202 AYPGYNPNAAAAPSGSIYPSYPTPGATPGAGG--YSPYPLYPPGGSTPATPYPVPTPSPA 259
             P  +    +       PS  +   +    G  ++     PP  S   +P+   TP P 
Sbjct: 234 TEPPPSQRGPSGSPQHP-PSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPHSTATPPPT 292

Query: 260 PAVTSGGTIT 269
                 G  T
Sbjct: 293 TKRQETGRPT 302


>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
          Length = 401

 Score = 33.0 bits (75), Expect = 0.26
 Identities = 25/120 (20%), Positives = 34/120 (28%), Gaps = 6/120 (5%)

Query: 162 PATNPTYNGNMASNAPYPTQSYMPMPG-SNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYP 220
            A  P   G+   +         P  G ++V  P  T  P+     +P     PS     
Sbjct: 172 LAAPPLGEGSADGSCDPALPLSAPRLGPADVFVP-ATPRPTPRTTASPETTPTPST---T 227

Query: 221 SYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGTITDEHIKMSLKSA 280
           + P     P        P          TP P PTP    A  +  T   E  +  L   
Sbjct: 228 TSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAP-PTPGGGEAPPANATPAPEASRYELTVT 286


>gnl|CDD|222968 PHA03065, PHA03065, Hypothetical protein; Provisional.
          Length = 438

 Score = 33.0 bits (76), Expect = 0.27
 Identities = 24/92 (26%), Positives = 38/92 (41%), Gaps = 16/92 (17%)

Query: 276 SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSI 335
            +K ++ +K R+       + +EEIE L+     L+        M+E I   K +LE  I
Sbjct: 79  PIKQSLREKRRKASKNTIKRKREEIEKLEDDIKNLDVDDE----MYEEI---KTDLELKI 131

Query: 336 S------FLKEK---ETELDEILAVLTEKTEV 358
                  FL      +  L+  LA L E  E+
Sbjct: 132 DKLSFQLFLANSNNLKRLLESALARLGENVEI 163


>gnl|CDD|115367 pfam06705, SF-assemblin, SF-assemblin/beta giardin.  This family
           consists of several eukaryotic SF-assemblin and related
           beta giardin proteins. During mitosis the
           SF-assemblin-based cytoskeleton is reorganised; it
           divides in prophase and is reduced to two dot-like
           structures at each spindle pole in metaphase. During
           anaphase, the two dots present at each pole are
           connected again. In telophase there is an asymmetrical
           outgrowth of new fibres. It has been suggested that
           SF-assemblin is involved in re-establishing the
           microtubular root system characteristic of interphase
           cells after mitosis.
          Length = 247

 Score = 32.8 bits (75), Expect = 0.27
 Identities = 20/66 (30%), Positives = 32/66 (48%), Gaps = 6/66 (9%)

Query: 288 KMNEQKVQAQEEIEILKQTENELNQGKFK--LARMFERIDKEKAELERSISFLKEKETEL 345
           K+NE      EE    KQ ++++ + K     AR+  RID EK   E  +S L+   +E+
Sbjct: 129 KVNECVAAIDEERISRKQEQSQILK-KVAEDFARISARIDTEKNTREAEVSALR---SEI 184

Query: 346 DEILAV 351
            E+   
Sbjct: 185 HEVKGN 190


>gnl|CDD|223092 COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 879

 Score = 33.4 bits (77), Expect = 0.27
 Identities = 17/65 (26%), Positives = 34/65 (52%), Gaps = 2/65 (3%)

Query: 296 AQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEK 355
            QE+ ++LK+  + L      L ++ ER+ +E  ELE+ +  LK+K     E+  +L+  
Sbjct: 710 LQEQEDLLKEIASLLKVPPELLPKV-ERLLEELKELEKELERLKKKL-AAAELADLLSNA 767

Query: 356 TEVDV 360
              ++
Sbjct: 768 KAEEI 772


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 33.2 bits (76), Expect = 0.27
 Identities = 37/168 (22%), Positives = 49/168 (29%), Gaps = 16/168 (9%)

Query: 131 AQNSKLEGLIKTLASAFSALPPLYSSRT---KQEPATNPTYNGNMASN-APYPTQSYMPM 186
           +Q + LE L K +  AFS L   +  RT    Q P         +    AP P +     
Sbjct: 319 SQIANLEALTKWV-KAFSQLD--FQLRTTSYGQLPLELAVIEALLVPVPAPQPAKPTAAA 375

Query: 187 PGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGST 246
           P     TP P+  P A    N         +  P  P     P A      P   P  + 
Sbjct: 376 PSPVRPTPAPSTRPKAAAAANIPPKEPVRETATP--PPVPPRPVAPPVPHTPESAPKLTR 433

Query: 247 PATPYPVPTPSPAPA-------VTSGGTITDEHIKMSLKSAVEDKIRR 287
            A P         PA                E ++   K  + D   R
Sbjct: 434 AAIPVDEKPKYTPPAPPKEEEKALIADGDVLEQLEAIWKQILRDVPPR 481


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 32.4 bits (74), Expect = 0.27
 Identities = 19/90 (21%), Positives = 38/90 (42%), Gaps = 4/90 (4%)

Query: 273 IKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELE 332
            K+  K   + + ++   +Q+   +EE E  K+ E +    + +   + E  +K+K E E
Sbjct: 1   KKIGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEE 60

Query: 333 RSISFLKEKETELDEILAVLTEKTEVDVDE 362
           R     + +E    E       K+   V+E
Sbjct: 61  R----KEREEQARKEQEEYEKLKSSFVVEE 86


>gnl|CDD|236733 PRK10672, PRK10672, rare lipoprotein A; Provisional.
          Length = 361

 Score = 33.1 bits (76), Expect = 0.28
 Identities = 24/105 (22%), Positives = 34/105 (32%), Gaps = 17/105 (16%)

Query: 166 PTYNGNMAS----NAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPS 221
           P  +G M +     AP P    +P+  S + +  PT  P    G+         G +  S
Sbjct: 198 PDLSGGMGTPSVQPAPAPQGDVLPVSNSTLKSEDPTGAPVTSSGFLGAPTTLAPGVLEGS 257

Query: 222 YPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGG 266
            PTP A   A          P  +  A        S +    SG 
Sbjct: 258 EPTPTAPSSA----------PATAPAAAAPQAAATSSS---ASGN 289


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 33.5 bits (77), Expect = 0.30
 Identities = 21/80 (26%), Positives = 41/80 (51%), Gaps = 7/80 (8%)

Query: 274  KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTE------NELNQGKFKLARMFERIDKE 327
             M + S  ++K+  K+N +  + ++E+E LK T        +L++ +  L    E  +KE
Sbjct: 1093 SMPIWSLTKEKVE-KLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKE 1151

Query: 328  KAELERSISFLKEKETELDE 347
             A+ +R  S  K K ++L +
Sbjct: 1152 IAKEQRLKSKTKGKASKLRK 1171


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
           functions are known are involvedin recombination,
           recombinational repair, and/or non-homologous end
           joining.They are components of an exonuclease complex
           with MRE11 homologs. This family is distantly related to
           the SbcC family of bacterial proteins.This family is
           based on the phylogenomic analysis of JA Eisen (1999,
           Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 33.1 bits (75), Expect = 0.32
 Identities = 19/85 (22%), Positives = 37/85 (43%), Gaps = 10/85 (11%)

Query: 270 DEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKA 329
            E      +   E+ I  K      +AQ+++  +K+   ++      +  +  +I   K 
Sbjct: 914 LETFLEKDQQEKEELISSK-ETSNKKAQDKVNDIKE---KVKNIHGYMKDIENKIQDGKD 969

Query: 330 ELERSISFLKEKETELDEILAVLTE 354
           +      +LK+KETEL+ + A L E
Sbjct: 970 D------YLKQKETELNTVNAQLEE 988



 Score = 32.7 bits (74), Expect = 0.44
 Identities = 20/73 (27%), Positives = 39/73 (53%), Gaps = 1/73 (1%)

Query: 285 IRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETE 344
           IR ++  ++ Q +   EI+K  ENEL+  K +L  +   + K   +L+  I  LK ++ +
Sbjct: 222 IRDQITSKEAQLESSREIVKSYENELDPLKNRLKEIEHNLSK-IMKLDNEIKALKSRKKQ 280

Query: 345 LDEILAVLTEKTE 357
           +++  + L  K E
Sbjct: 281 MEKDNSELELKME 293



 Score = 31.9 bits (72), Expect = 0.85
 Identities = 25/95 (26%), Positives = 45/95 (47%), Gaps = 9/95 (9%)

Query: 274 KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARM---FERIDKEKAE 330
           K  LK    D+IR +        + + EIL++ + EL     +L ++    +RI +   E
Sbjct: 420 KERLKQEQADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKELQQLEGSSDRILELDQE 479

Query: 331 L---ERSISFLKEK---ETELDEILAVLTEKTEVD 359
           L   ER +S  ++    ET   E+ ++  EK ++D
Sbjct: 480 LRKAERELSKAEKNSLTETLKKEVKSLQNEKADLD 514


>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding.  This
           family is the highly conserved central region of a
           number of metazoan proteins referred to as growth-arrest
           proteins. In mouse, Gas8 is predominantly a testicular
           protein, whose expression is developmentally regulated
           during puberty and spermatogenesis. In humans, it is
           absent in infertile males who lack the ability to
           generate gametes. The localisation of Gas8 in the
           motility apparatus of post-meiotic gametocytes and
           mature spermatozoa, together with the detection of Gas8
           also in cilia at the apical surfaces of epithelial cells
           lining the pulmonary bronchi and Fallopian tubes
           suggests that the Gas8 protein may have a role in the
           functioning of motile cellular appendages. Gas8 is a
           microtubule-binding protein localised to regions of
           dynein regulation in mammalian cells.
          Length = 201

 Score = 32.2 bits (74), Expect = 0.32
 Identities = 19/105 (18%), Positives = 41/105 (39%), Gaps = 26/105 (24%)

Query: 267 TITDEHIKM--SLKSAVEDKIRRKMNEQKVQAQE-EIEILKQTENELNQGKFKLARMFER 323
            IT  ++++  SLK  + + +++     +    E   E  +  E              ++
Sbjct: 20  DITHNNLELIKSLKEEIAE-MKKNEEHNEKLMAEIAQENKRLVEP------------LKK 66

Query: 324 IDKEKAELERSISF----------LKEKETELDEILAVLTEKTEV 358
            ++E  EL + +            LK +  EL++ L  L  ++EV
Sbjct: 67  AEEEVEELRKKLKDYEKDKQSLKNLKARLKELEKELKNLKWESEV 111


>gnl|CDD|221787 pfam12815, CTD, Spt5 C-terminal nonapeptide repeat binding Spt4.
           The C-terminal domain of the transcription elongation
           factor protein Spt5 is necessary for binding to Spt4 to
           form the functional complex that regulates early
           transcription elongation by RNA polymerase II. The
           complex may be involved in pre-mRNA processing through
           its association with mRNA capping enzymes. This CTD
           domain carries a regular nonapeptide repeat that can be
           present in up to 18 copies, as in S. pombe. The repeat
           has a characteristic TPA motif.
          Length = 112

 Score = 31.3 bits (71), Expect = 0.33
 Identities = 21/74 (28%), Positives = 22/74 (29%), Gaps = 9/74 (12%)

Query: 185 PMPGSNVNTP-YPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPG 243
           P  G    TP Y        P Y            Y    TPGA P     +P P    G
Sbjct: 46  PAFGDGSRTPAYGGESNGRTPAY-------GGSGSYLGASTPGAAPTPAKSAPTPGAWGG 98

Query: 244 GSTPAT-PYPVPTP 256
             TP       PTP
Sbjct: 99  APTPGYSADSAPTP 112


>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
          Length = 663

 Score = 33.0 bits (75), Expect = 0.35
 Identities = 35/179 (19%), Positives = 54/179 (30%), Gaps = 20/179 (11%)

Query: 158 TKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGS 217
           T    A +               Q++          P     P    G  P +  A S  
Sbjct: 350 TASLTAPSRVLAAAAKVAVIAAPQTH--------TGPADRQRPQRPDGI-PYSVPARS-P 399

Query: 218 IYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPY-PVPTPSPAPAVTS---GGTITDEHI 273
           +    P P      G  SPY    PG S    P  PVP     P V        +   H 
Sbjct: 400 MTAYPPVPQFCGDPGLVSPYNPQSPGTSYGPEPVGPVPPQPTNPYVMPISMANMVYPGHP 459

Query: 274 KMSLKSAVEDKIRRK--MNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 330
           +   +   E K +R   + E+ ++  + ++ LK+ +  L +     A     I K    
Sbjct: 460 Q---EHGHERKRKRGGELKEELIETLKLVKKLKEEQESLAKELEATAHK-SEIKKIAES 514


>gnl|CDD|218881 pfam06070, Herpes_UL32, Herpesvirus large structural phosphoprotein
           UL32.  The large phosphorylated protein (UL32-like) of
           herpes viruses is the polypeptide most frequently
           reactive in immuno-blotting analyses with antisera when
           compared with other viral proteins.
          Length = 777

 Score = 33.0 bits (75), Expect = 0.35
 Identities = 33/228 (14%), Positives = 52/228 (22%), Gaps = 17/228 (7%)

Query: 71  TLASAFSALPPLYSSRTKQEPATNPTYNGNIRPTADMRINVSQNVDHTGKIYSPYLHTW- 129
           TL  A S   P  +   + +P+      G   P A        +    G      +    
Sbjct: 548 TLIVALSLASPSTAGSPRPKPSLGKFVIGT-DPFAFANTVRLTDNMRGGNGVGSSVKPKG 606

Query: 130 TAQNSKLEGLIKTLASAFSALPPLYSSRTKQEPATNPT----YNGNMASNAPYPTQSYMP 185
           +A +  L G    L  A +      SS                  +       P      
Sbjct: 607 SASSKPLTGPGSDLKPA-TLNGKTPSSSLVGAARNAGASSKVKIPSGLGGFTSPISLLES 665

Query: 186 MPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTP---------GATPGAGGYSP 236
                + +   T      P           G    +  T            TP  G + P
Sbjct: 666 ALEDVLTSATSTPVKKNDPYLWDTNGEKAGGGTESASTTDVFQNFAGLNKKTPVGGPFQP 725

Query: 237 YPLYPPGGSTPATPYPVPTPSPAPAVTSGGTITDEHIKMSLKSAVEDK 284
            P       + ++P       P      G    D+ +   L     D 
Sbjct: 726 KPPLSRALDSASSPGGSG-GKPGLDGVEGAKGKDDDVVGILSKIKGDS 772


>gnl|CDD|223520 COG0443, DnaK, Molecular chaperone [Posttranslational modification,
           protein turnover, chaperones].
          Length = 579

 Score = 33.1 bits (76), Expect = 0.35
 Identities = 25/99 (25%), Positives = 45/99 (45%), Gaps = 5/99 (5%)

Query: 262 VTSGGTITDEHI-KM---SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKL 317
           + +   ++DE I +M   +  +A  DK  R++ E + +A+  I  L++   E+ +   + 
Sbjct: 477 IKASSGLSDEEIERMVEDAEANAALDKKFRELVEARNEAESLIYSLEKALKEIVKVSEEE 536

Query: 318 ARMFER-IDKEKAELERSISFLKEKETELDEILAVLTEK 355
               E  I   +  LE     +K K  EL E+   L EK
Sbjct: 537 KEKIEEAITDLEEALEGEKEEIKAKIEELQEVTQKLAEK 575


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 32.8 bits (75), Expect = 0.36
 Identities = 19/86 (22%), Positives = 36/86 (41%), Gaps = 5/86 (5%)

Query: 276 SLKSAVEDKI-RRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERS 334
            L  A+       +  E++   +EE   +   E  +     KL    ER+++E +EL+R 
Sbjct: 389 PLAEALSKVKEEERPREKEGTEEEERREITVYEKRIK----KLEETVERLEEENSELKRE 444

Query: 335 ISFLKEKETELDEILAVLTEKTEVDV 360
           +  LK +  +L+  L     +    V
Sbjct: 445 LEELKREIEKLESELERFRREVRDKV 470


>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 647

 Score = 32.9 bits (76), Expect = 0.36
 Identities = 13/90 (14%), Positives = 17/90 (18%)

Query: 172 MASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGA 231
                P           S   T  PT   +           A +    P+ P P  T   
Sbjct: 363 APLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQL 422

Query: 232 GGYSPYPLYPPGGSTPATPYPVPTPSPAPA 261
                      G +      P       P 
Sbjct: 423 LAARQQLQRAQGATKAKKSEPAAASRARPV 452


>gnl|CDD|222449 pfam13908, Shisa, Wnt and FGF inhibitory regulator.  Shisa is a
           transcription factor-type molecule that physically
           interacts with immature forms of the Wnt receptor
           Frizzled and the FGF receptor within the endoplasmic
           reticulum to inhibit their post-translational maturation
           and trafficking to the cell surface.
          Length = 177

 Score = 31.7 bits (72), Expect = 0.37
 Identities = 17/51 (33%), Positives = 18/51 (35%), Gaps = 7/51 (13%)

Query: 216 GSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGG 266
            S  PSYP P        Y  Y   PP    PA PY +  P P      G 
Sbjct: 131 PSTAPSYPGPQ-------YQGYHPMPPQPGMPAPPYSLQYPPPGLLQPQGP 174



 Score = 29.0 bits (65), Expect = 3.6
 Identities = 19/77 (24%), Positives = 23/77 (29%), Gaps = 14/77 (18%)

Query: 171 NMASNAPYPTQSY-MPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATP 229
            M        Q+  +P P S      P+YP   Y GY+P       G   P Y       
Sbjct: 113 VMTRATSTTVQTTPLPQPPST----APSYPGPQYQGYHPMPPQ--PGMPAPPYSLQYP-- 164

Query: 230 GAGGYSPYPLYPPGGST 246
                 P  L P G   
Sbjct: 165 -----PPGLLQPQGPPP 176


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 32.3 bits (74), Expect = 0.43
 Identities = 14/77 (18%), Positives = 31/77 (40%), Gaps = 7/77 (9%)

Query: 274 KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENE---LNQGKFKL--ARMFERIDKEK 328
           K+S      +   ++++E +    EE   LK+   E     + + +     + E   +  
Sbjct: 291 KLSGLKNSGEPSLKEIHEAR--LNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELN 348

Query: 329 AELERSISFLKEKETEL 345
           ++LE     L++ E  L
Sbjct: 349 SKLEEIQKKLEDLEKRL 365


>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
           Provisional.
          Length = 617

 Score = 32.5 bits (74), Expect = 0.44
 Identities = 22/97 (22%), Positives = 22/97 (22%), Gaps = 17/97 (17%)

Query: 179 PTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAA--PSGSIYPSYPT------------ 224
                 P          P      Y GY    A    P        PT            
Sbjct: 94  EPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARPAYPAYQQRP 153

Query: 225 -PGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAP 260
            PGA P A            G  P  PY  P  S AP
Sbjct: 154 EPGAWPRAADDYG-WQQQRLGFPPRAPYASPA-SYAP 188



 Score = 30.2 bits (68), Expect = 2.7
 Identities = 15/72 (20%), Positives = 16/72 (22%), Gaps = 5/72 (6%)

Query: 189 SNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPA 248
           S      P           P     P    Y  Y  P A     G       P   + PA
Sbjct: 91  SAGEPAPPPPHARRTSE--PELPR-PGRRPYEGYGGPRADDRPPGLPRQDQLPT--ARPA 145

Query: 249 TPYPVPTPSPAP 260
            P     P P  
Sbjct: 146 YPAYQQRPEPGA 157



 Score = 29.0 bits (65), Expect = 5.7
 Identities = 28/158 (17%), Positives = 29/158 (18%), Gaps = 28/158 (17%)

Query: 137 EGLIKTLASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYP 196
           EG     A       P         PA             P     Y          P  
Sbjct: 119 EGYGGPRADDRPPGLPRQDQLPTARPAYPAYQQRPEPGAWPRAADDYGWQQQRLGFPPRA 178

Query: 197 TYPPSAYPGYNPNAAAAPSGSIYPSYPT----------------------PGATPGAGGY 234
            Y   A           P  +  P Y                        P   PGAG  
Sbjct: 179 PYASPASYAPEQERDREPYDAGRPEYDQRRRDYDHPRPDWDRPRRDRTDRPEPPPGAGHV 238

Query: 235 ---SPYPLYPPGGSTPATPYPVPTP---SPAPAVTSGG 266
               P P               P P    PAPA   G 
Sbjct: 239 HRGGPGPPERDDAPVVPIRPSAPGPLAAQPAPAPGPGE 276


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 32.6 bits (75), Expect = 0.46
 Identities = 18/88 (20%), Positives = 39/88 (44%), Gaps = 18/88 (20%)

Query: 278 KSAVEDKIRRKMNEQKVQAQEEI---EILKQTENELNQGKFKLARMFERIDKEKAELERS 334
            S   ++  +K+  ++++ +EE    E++++ + EL +           I+KE+  L   
Sbjct: 189 LSDEVEEELKKLGFERLELEEEGTPSELIREIKEELEE-----------IEKERESLLEE 237

Query: 335 ISFLKEKETEL----DEILAVLTEKTEV 358
           +  L +K  E      E L +  E+ E 
Sbjct: 238 LKELAKKYLEELLALYEYLEIELERAEA 265



 Score = 31.4 bits (72), Expect = 1.1
 Identities = 21/79 (26%), Positives = 40/79 (50%), Gaps = 3/79 (3%)

Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKE 342
           DK+R  + +     +E+ ++  ++  EL +   +     E+I+KE  ELE  IS L+ + 
Sbjct: 60  DKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEE---ELEKIEKEIKELEEEISELENEI 116

Query: 343 TELDEILAVLTEKTEVDVD 361
            EL++ +  L      D+D
Sbjct: 117 KELEQEIERLEPWGNFDLD 135


>gnl|CDD|107179 PHA02414, PHA02414, hypothetical protein.
          Length = 111

 Score = 30.6 bits (69), Expect = 0.46
 Identities = 19/85 (22%), Positives = 35/85 (41%), Gaps = 2/85 (2%)

Query: 291 EQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILA 350
           E K+Q  E  +  K    EL     +L  +   +DK+ A      S +  +   L+E ++
Sbjct: 17  EDKIQEGELTD--KGDNKELEVAVAELRDIVVSLDKDVAVNSEKQSHIYYQIERLEEKIS 74

Query: 351 VLTEKTEVDVDEAVTTTAPIYKQIL 375
            L E  + +  E   T   ++  +L
Sbjct: 75  ALAESNKKEDTEKKDTVEKVFMIVL 99


>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).
           This family consists of several plant proteins of
           unknown function. Several sequences in this family are
           described as being "myosin heavy chain-like".
          Length = 484

 Score = 32.3 bits (73), Expect = 0.48
 Identities = 22/85 (25%), Positives = 41/85 (48%), Gaps = 18/85 (21%)

Query: 274 KMSLKSAVEDKIRRKMNEQKV---QAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 330
           ++S+ ++V   + +++ E K    +A EE + L+     L Q           +++EK E
Sbjct: 247 EISIMASV-ASVLKELEEAKANLEKAAEEEKSLRNLVESLKQ----------ELEEEKKE 295

Query: 331 LERSISFLKEKETELDEILAVLTEK 355
           LE     L+EKE E +E  + L  +
Sbjct: 296 LEE----LREKEGEAEEAASSLEAE 316



 Score = 30.7 bits (69), Expect = 1.7
 Identities = 22/93 (23%), Positives = 44/93 (47%), Gaps = 7/93 (7%)

Query: 276 SLKSAVEDKIRRKMNEQKVQAQEEIE----ILKQTENELNQGKFKLARMFERIDKEKAEL 331
              S++E ++ R  +E++ +A+E++      L+Q  +E  + + +     E + K K E 
Sbjct: 308 EAASSLEAELNRTKSEKESKAREKMVEIPLKLQQASSEAEEARKEAEAAREELRKLKEEA 367

Query: 332 ERSISFLKEKETELDEILAVLTEKTEVDVDEAV 364
           E++ + L+  E  L    A L E       EA+
Sbjct: 368 EQTKAALETAELRL---EAALKEAEAAKAAEAL 397


>gnl|CDD|225805 COG3266, DamX, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 292

 Score = 32.2 bits (73), Expect = 0.49
 Identities = 30/157 (19%), Positives = 50/157 (31%), Gaps = 15/157 (9%)

Query: 149 ALPPLYSSRT--KQEPATNPTYNGNMA---SNAPYPTQSYMPMPGSNVNTPYPTYPPSAY 203
           +LPP+ S+ T  ++  A +      +    +NA    Q+   +    V +  PT P +  
Sbjct: 77  SLPPISSTPTQGQEPLAQDGQQRVEVQGDLNNAAVQPQNLSQLNNVAVTSTLPTEPATVA 136

Query: 204 PGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPA-- 261
           P  N +   A      P+   P          P        +  AT   V T SPA    
Sbjct: 137 PVRNASVPTAER----PAITRPVRAQAVS--EPAVEPKAAKTATATEAKVQTASPAQTPA 190

Query: 262 --VTSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQA 296
                 G      +K +  S    ++    +   V  
Sbjct: 191 TPPAGKGAAASGQLKSAPSSHYTLQLSASGSYDNVNG 227


>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein.  The sequences
           featured in this family are similar to a region of human
           TPR protein and to yeast myosin-like proteins 1 (MLP1)
           and 2 (MLP2). These proteins share a number of features;
           for example, they all have coiled-coil regions and all
           three are associated with nuclear pores. TPR is thought
           to be a component of nuclear pore complex- attached
           intra-nuclear filaments, and is implicated in nuclear
           protein import. Moreover, its N-terminal region is
           involved in the activation of oncogenic kinases,
           possibly by mediating the dimerisation of kinase domains
           or by targeting these kinases to the nuclear pore
           complex. MLP1 and MLP2 are involved in the process of
           telomere length regulation, where they are thought to
           interact with proteins such as Tel1p and modulate their
           activity.
          Length = 132

 Score = 31.1 bits (71), Expect = 0.50
 Identities = 21/93 (22%), Positives = 44/93 (47%), Gaps = 14/93 (15%)

Query: 279 SAVEDKIRRKMNEQKVQAQE-----EIEILKQTEN---------ELNQGKFKLARMFERI 324
                K++  + EQ   A E     E E++K  E+         +LN+ K ++A++    
Sbjct: 23  EEKIQKLQEDLEEQAEIANEAQQKYERELVKHAEDIEELQALRKQLNELKKEIAQLKAEA 82

Query: 325 DKEKAELERSISFLKEKETELDEILAVLTEKTE 357
           +  +AEL  +    +E++  L++ L+ L ++ E
Sbjct: 83  ESAQAELSEAEESWEEQKKMLEDELSELEKRIE 115



 Score = 30.3 bits (69), Expect = 0.70
 Identities = 14/57 (24%), Positives = 28/57 (49%), Gaps = 4/57 (7%)

Query: 288 KMNEQKVQAQEEIEILK----QTENELNQGKFKLARMFERIDKEKAELERSISFLKE 340
            + +Q  + ++EI  LK      + EL++ +       + ++ E +ELE+ I  L E
Sbjct: 63  ALRKQLNELKKEIAQLKAEAESAQAELSEAEESWEEQKKMLEDELSELEKRIEELNE 119


>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
            This is a family of proteins that form part of the CAZ
           (cytomatrix at the active zone) complex which is
           involved in determining the site of synaptic vesicle
           fusion. The C-terminus is a PDZ-binding motif that binds
           directly to RIM (a small G protein Rab-3A effector). The
           family also contains four coiled-coil domains.
          Length = 774

 Score = 32.7 bits (74), Expect = 0.50
 Identities = 21/97 (21%), Positives = 36/97 (37%), Gaps = 22/97 (22%)

Query: 280 AVEDKIRRKMNEQK------VQAQEEIEILKQ-----------TENELNQGKFKLARMFE 322
           A +  I  K  + K       +A+ EI  L+             + E+   +  LA M  
Sbjct: 228 ASQKLIDEKDTKIKEFEKMLEKAENEIYRLQSRCDTSEADRNRLDKEVEAERSALAAMKA 287

Query: 323 RIDKEKAELERSISFLKEKETELDEILAVLTEKTEVD 359
           + D+   EL R  + L   +TEL+ +        + D
Sbjct: 288 KCDRAAQELSRKKTELLGLQTELETL-----ANQDSD 319


>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB.  Members
           of this protein family are the bacterial ATP-dependent
           chaperone ClpB. This protein belongs to the AAA family,
           ATPases associated with various cellular activities
           (pfam00004). This molecular chaperone does not act as a
           protease, but rather serves to disaggregate misfolded
           and aggregated proteins [Protein fate, Protein folding
           and stabilization].
          Length = 852

 Score = 32.6 bits (75), Expect = 0.50
 Identities = 26/90 (28%), Positives = 49/90 (54%), Gaps = 13/90 (14%)

Query: 273 IKMSLKSAVE--DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 330
           I+M + S  E  D++ R++    +Q + E E LK+ ++E ++      R+ E ++KE AE
Sbjct: 396 IRMEIDSKPEELDELDRRI----IQLEIEREALKKEKDEASK-----ERL-EDLEKELAE 445

Query: 331 LERSISFLKEK-ETELDEILAVLTEKTEVD 359
           LE   + L+E+ + E   I  +   K E++
Sbjct: 446 LEEEYADLEEQWKAEKAAIQGIQQIKEEIE 475


>gnl|CDD|221818 pfam12868, DUF3824, Domain of unknwon function (DUF3824).  This is
           a repeating domain found in fungal proteins. It is
           proline-rich, and the function is not known.
          Length = 135

 Score = 31.0 bits (70), Expect = 0.50
 Identities = 26/68 (38%), Positives = 28/68 (41%), Gaps = 3/68 (4%)

Query: 193 TPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYP 252
           TPYP  PP + P Y PN+   P        P PG  PG       P  PPG   P   YP
Sbjct: 57  TPYPPSPPVSDPRYYPNSNYFPPPPGSTPVPPPGPQPGYNPADYPP--PPGAVPPPQNYP 114

Query: 253 VPTPSPAP 260
            P P P  
Sbjct: 115 YP-PGPGQ 121



 Score = 30.2 bits (68), Expect = 0.84
 Identities = 24/62 (38%), Positives = 29/62 (46%), Gaps = 5/62 (8%)

Query: 178 YPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTP-GATPGAGGYSP 236
           YP  +Y P P  +  TP P  PP   PGYNP     P G++ P    P    PG   Y+P
Sbjct: 71  YPNSNYFPPPPGS--TPVP--PPGPQPGYNPADYPPPPGAVPPPQNYPYPPGPGQDPYAP 126

Query: 237 YP 238
            P
Sbjct: 127 RP 128


>gnl|CDD|177114 MTH00039, COX3, cytochrome c oxidase subunit III; Validated.
          Length = 260

 Score = 32.0 bits (73), Expect = 0.51
 Identities = 14/33 (42%), Positives = 19/33 (57%)

Query: 240 YPPGGSTPATPYPVPTPSPAPAVTSGGTITDEH 272
           +PP G  P  P+ VP  + A  ++SG TIT  H
Sbjct: 115 WPPTGINPINPFLVPLLNTAVLLSSGVTITWSH 147


>gnl|CDD|240403 PTZ00400, PTZ00400, DnaK-type molecular chaperone; Provisional.
          Length = 663

 Score = 32.5 bits (74), Expect = 0.51
 Identities = 25/86 (29%), Positives = 45/86 (52%), Gaps = 7/86 (8%)

Query: 262 VTSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEIL-KQTENELNQGKFKLARM 320
           + S G ++DE I+  +K A E K + +  ++ V A+ E E L    E +L+  K K++  
Sbjct: 535 IQSSGGLSDEEIEKMVKEAEEYKEQDEKKKELVDAKNEAETLIYSVEKQLSDLKDKIS-- 592

Query: 321 FERIDKEKAELERSISFLKEKETELD 346
               D +K EL++ I+ L+   +  D
Sbjct: 593 ----DADKDELKQKITKLRSTLSSED 614


>gnl|CDD|222160 pfam13476, AAA_23, AAA domain. 
          Length = 204

 Score = 31.8 bits (72), Expect = 0.53
 Identities = 25/124 (20%), Positives = 52/124 (41%), Gaps = 9/124 (7%)

Query: 267 TITDEHIKMSLKSAVEDKIRRKMNEQKVQAQE----EIEILKQTENELNQGKFKLARMF- 321
           T  +   K+ L+   E +   K   +KV+       EI+ L++  +EL +   ++  +  
Sbjct: 79  TFENNKGKLKLRLIEESRELTKKKGKKVKKSILEIVEIDELQEFIDELLKSDKEILPLLL 138

Query: 322 ----ERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNT 377
               ER+++ K + +     L+E E EL+E+        ++  ++         K+ L  
Sbjct: 139 YLGQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEELEE 198

Query: 378 FTEE 381
             EE
Sbjct: 199 LLEE 202


>gnl|CDD|223020 PHA03246, PHA03246, large tegument protein UL36; Provisional.
          Length = 3095

 Score = 32.6 bits (74), Expect = 0.54
 Identities = 17/112 (15%), Positives = 28/112 (25%), Gaps = 6/112 (5%)

Query: 209  NAAAAPSGSIYPSYPTPGATPGAGG-YSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGT 267
               A P    YP+  T G   G          Y P   T             P       
Sbjct: 2634 EDRAVPENVEYPALATQGDQDGFVSKVGSAMKYKPPNVTAMAQ-----SIVQPDTQKSSI 2688

Query: 268  ITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLAR 319
                 I+++    +    R+K N  + + +           E+     +  R
Sbjct: 2689 KKKTDIRLAGYRTMPADNRQKGNGLRHKPKTGRSHTTIKNVEVYPSPPRWTR 2740


>gnl|CDD|237695 PRK14376, PRK14376, hypothetical protein; Provisional.
          Length = 176

 Score = 31.5 bits (71), Expect = 0.55
 Identities = 13/36 (36%), Positives = 13/36 (36%), Gaps = 2/36 (5%)

Query: 231 AGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGG 266
           AGGY P P   PG S    P  V  P        G 
Sbjct: 102 AGGYDPVPGEAPGTS--VGPGTVRKPRGRTRTAGGP 135



 Score = 28.4 bits (63), Expect = 5.0
 Identities = 15/60 (25%), Positives = 20/60 (33%), Gaps = 5/60 (8%)

Query: 200 PSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPA 259
           P +  GY+P    AP  S       PG      G +     P G     TP+    P+  
Sbjct: 99  PLSAGGYDPVPGEAPGTS-----VGPGTVRKPRGRTRTAGGPVGAVDAGTPHRSRRPTGG 153


>gnl|CDD|237624 PRK14143, PRK14143, heat shock protein GrpE; Provisional.
          Length = 238

 Score = 31.6 bits (72), Expect = 0.55
 Identities = 21/66 (31%), Positives = 32/66 (48%), Gaps = 9/66 (13%)

Query: 290 NEQKVQAQEEIEILKQTENELNQGKFKLARMFE----RIDKEKAELERSISFLKEKETEL 345
             +  Q ++E+E LKQ   ELN    ++A  F+    R  +E+ +L      L+ K   L
Sbjct: 66  AARLAQLEQELESLKQELEELNSQYMRIAADFDNFRKRTSREQEDLR-----LQLKCNTL 120

Query: 346 DEILAV 351
            EIL V
Sbjct: 121 SEILPV 126


>gnl|CDD|218056 pfam04388, Hamartin, Hamartin protein.  This family includes the
           hamartin protein which is thought to function as a
           tumour suppressor. The hamartin protein interacts with
           the tuberin protein pfam03542. Tuberous sclerosis
           complex (TSC) is an autosomal dominant disorder and is
           characterized by the presence of hamartomas in many
           organs, such as brain, skin, heart, lung, and kidney. It
           is caused by mutation either TSC1 or TSC2 tumour
           suppressor gene. TSC1 encodes a protein, hamartin,
           containing two coiled-coil regions, which have been
           shown to mediate binding to tuberin. The TSC2 gene codes
           for tuberin pfam03542. These two proteins function
           within the same pathway(s) regulating cell cycle, cell
           growth, adhesion, and vesicular trafficking.
          Length = 667

 Score = 32.2 bits (73), Expect = 0.56
 Identities = 33/191 (17%), Positives = 57/191 (29%), Gaps = 24/191 (12%)

Query: 148 SALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPY----PTYPPSAY 203
           S+  P  S R    P+ +   N + + +      S  P    +           + PS+ 
Sbjct: 279 SSAYPNSSPRQALPPSISLPQNTSTSGSLHSAQTSRRPNTTFDKAASSGTKDSLWSPSSL 338

Query: 204 PGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVT 263
            G     A  PS          G +P     SP  L      T  +    P     P+ +
Sbjct: 339 CGM----ATPPSS--------IGMSPLILSLSPSHLSGRAPGTTGSGKGEPASESTPSTS 386

Query: 264 SGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEI---EILKQTENELNQGKFKLARM 320
                  + I  ++ +       RK   Q   +  ++   E L+  E     G    A  
Sbjct: 387 PPPPGLADDIVRAIFATSSRSAPRKEELQNESSFPKLVRQENLQNIEKSAEGGILDAA-- 444

Query: 321 FERIDKEKAEL 331
              + +E  +L
Sbjct: 445 ---VTEELLKL 452


>gnl|CDD|237782 PRK14666, uvrC, excinuclease ABC subunit C; Provisional.
          Length = 694

 Score = 32.2 bits (73), Expect = 0.57
 Identities = 20/67 (29%), Positives = 23/67 (34%), Gaps = 7/67 (10%)

Query: 208 PNAAAAPSGSIYPSYPTPGATPG-------AGGYSPYPLYPPGGSTPATPYPVPTPSPAP 260
           P    AP GS  P  P   ATP          G +P  L     + PA   P    + A 
Sbjct: 343 PLLPDAPEGSSDPVVPVAAATPVDASLPDVRTGTAPTSLANVSHADPAVAQPTQAATLAG 402

Query: 261 AVTSGGT 267
           A   G T
Sbjct: 403 AAPKGAT 409


>gnl|CDD|206687 cd01900, YchF, YchF GTPase.  YchF is a member of the Obg family,
           which includes four other subfamilies of GTPases: Obg,
           DRG, Ygr210, and NOG1. Obg is an essential gene that is
           involved in DNA replication in C. crescentus and
           Streptomyces griseus and is associated with the
           ribosome. Several members of the family, including YchF,
           possess the TGS domain related to the RNA-binding
           proteins. Experimental data and genomic analysis suggest
           that YchF may be part of a nucleoprotein complex and may
           function as a GTP-dependent translational factor.
          Length = 274

 Score = 31.7 bits (73), Expect = 0.59
 Identities = 17/56 (30%), Positives = 32/56 (57%), Gaps = 3/56 (5%)

Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFL 338
           +++ +K      +A+EE+E+L++ +  L +G  K AR  E  D+E   L +S+  L
Sbjct: 141 ERLEKKAKSGDKEAKEELELLEKIKEHLEEG--KPARTLELTDEEIKIL-KSLQLL 193


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 32.4 bits (73), Expect = 0.60
 Identities = 29/124 (23%), Positives = 44/124 (35%), Gaps = 10/124 (8%)

Query: 149 ALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNT---PYPTYPPSAYPG 205
           AL PL S  T   P+ +P     M  + P  +    P   +   +      T+P S    
Sbjct: 386 ALKPLSSLPTHHPPSAHPPPLQLMPQSQPLQSVPAQPPVLTQSQSLPPKASTHPHSGLHS 445

Query: 206 YNPNAAAA-----PSG--SIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSP 258
             P +  A       G  +I P    P +TP A   +     PPG + P++        P
Sbjct: 446 GPPQSPFAQHPFTSGGLPAIGPPPSLPTSTPAAPPRASSGSQPPGSALPSSGGCAGPGPP 505

Query: 259 APAV 262
            P +
Sbjct: 506 LPPI 509



 Score = 30.0 bits (67), Expect = 2.7
 Identities = 39/153 (25%), Positives = 51/153 (33%), Gaps = 33/153 (21%)

Query: 148 SALPPLYSSRTKQEPATNPTYNGNMASNA-PYPTQSYMPMPGSNVNTPYPT--------- 197
             LPP  S    + P T P       S+  P   Q   P P    N P P          
Sbjct: 335 QPLPPAPSMPHIKPPPTTPIPQLPNQSHKHPPHLQGPSPFPQMPSNLPPPPALKPLSSLP 394

Query: 198 --YPPSAYP----------GYNPNAAAAPSGSIYPSYPTPGAT-PGAGGY-----SPYPL 239
             +PPSA+P                A  P  +   S P   +T P +G +     SP+  
Sbjct: 395 THHPPSAHPPPLQLMPQSQPLQSVPAQPPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQ 454

Query: 240 YP--PGGSTPATPYPVP---TPSPAPAVTSGGT 267
           +P   GG     P P     TP+  P  +SG  
Sbjct: 455 HPFTSGGLPAIGPPPSLPTSTPAAPPRASSGSQ 487


>gnl|CDD|238453 cd00890, Prefoldin, Prefoldin is a hexameric molecular chaperone
           complex, found in both eukaryotes and archaea, that
           binds and stabilizes newly synthesized polypeptides
           allowing them to fold correctly.  The complex contains
           two alpha and four beta subunits, the two subunits being
           evolutionarily related. In archaea, there is usually
           only one gene for each subunit while in eukaryotes there
           two or more paralogous genes encoding each subunit
           adding heterogeneity to the structure of the hexamer.
           The structure of the complex consists of a double beta
           barrel assembly with six protruding coiled-coils.
          Length = 129

 Score = 30.8 bits (70), Expect = 0.60
 Identities = 23/99 (23%), Positives = 43/99 (43%), Gaps = 16/99 (16%)

Query: 274 KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMF---ERIDKEKA- 329
           ++        K+  ++ E + +A+E +E LK+ E E        A +F   E  D +K  
Sbjct: 14  QLEALQQQLQKLEAQLTEYE-KAKETLETLKKAEEEKELLVPLGAGLFVKAEVKDDDKVL 72

Query: 330 -----------ELERSISFLKEKETELDEILAVLTEKTE 357
                       LE +I FLK++   L++ +  L ++ E
Sbjct: 73  VDLGTGVYVEKSLEEAIEFLKKRLETLEKQIEKLEKQLE 111



 Score = 30.0 bits (68), Expect = 1.2
 Identities = 20/103 (19%), Positives = 37/103 (35%), Gaps = 18/103 (17%)

Query: 287 RKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELD 346
           +++  Q  Q Q+++E L+Q   +L              +K K  LE      +EKE  + 
Sbjct: 2   QELAAQLQQLQQQLEALQQQLQKLEA-------QLTEYEKAKETLETLKKAEEEKELLVP 54

Query: 347 EILAVLTEKTEVDVDEAVTTTAPIYKQI-LNTFTE---EAATE 385
               +  +    D D+       +   +    + E   E A E
Sbjct: 55  LGAGLFVKAEVKDDDK-------VLVDLGTGVYVEKSLEEAIE 90


>gnl|CDD|218545 pfam05300, DUF737, Protein of unknown function (DUF737).  This
           family consists of several uncharacterized mammalian
           proteins of unknown function.
          Length = 187

 Score = 31.3 bits (71), Expect = 0.62
 Identities = 33/182 (18%), Positives = 59/182 (32%), Gaps = 42/182 (23%)

Query: 220 PSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGTITDEHIKMSLKS 279
           PS P+    P           P           +          +     +    +    
Sbjct: 26  PSQPSSKPQPSP------GFGPSSVPESELRRLIAGELKGALEDAKRPSEETAGGLQSSE 79

Query: 280 AVEDKIRRKMNEQKVQAQEEIEILKQTENE----------------LNQGKFKLARMFER 323
             ED ++R   EQ    QEE+  + + E E                  Q + K   +  +
Sbjct: 80  VKEDLLKRYEQEQ-AIVQEELARIAKREREAAEEQLSRAVLREKASAEQERQKAKHLARQ 138

Query: 324 IDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAA 383
           +++++AEL+R  +F KE+       LA L EK            +  YK     +  +AA
Sbjct: 139 LEEKEAELKRLDAFYKEQ-------LARLEEK-----------NSEFYKLTSEQY-HKAA 179

Query: 384 TE 385
           ++
Sbjct: 180 SK 181


>gnl|CDD|184927 PRK14963, PRK14963, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 504

 Score = 32.1 bits (73), Expect = 0.63
 Identities = 9/53 (16%), Positives = 12/53 (22%)

Query: 209 NAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPA 261
            A          +   P   P         L     S  + P    T + AP 
Sbjct: 340 LALGGAPSEGVAAVAPPAPAPADLTQRLNRLEKEVRSLRSAPTAAATAAGAPL 392



 Score = 31.3 bits (71), Expect = 1.1
 Identities = 16/81 (19%), Positives = 20/81 (24%)

Query: 185 PMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGG 244
           P  G     P    P       N       S    P+     A      + P P  PP  
Sbjct: 346 PSEGVAAVAPPAPAPADLTQRLNRLEKEVRSLRSAPTAAATAAGAPLPDFDPRPRGPPAP 405

Query: 245 STPATPYPVPTPSPAPAVTSG 265
               +    P  +PA A    
Sbjct: 406 EPARSAEAPPLVAPAAAPAGL 426


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 32.0 bits (73), Expect = 0.65
 Identities = 14/62 (22%), Positives = 27/62 (43%)

Query: 284 KIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKET 343
              +++   + +   E   L    +E    + KLA++ E   K  A+L   +S  ++K  
Sbjct: 168 ATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLE 227

Query: 344 EL 345
           EL
Sbjct: 228 EL 229



 Score = 29.7 bits (67), Expect = 3.3
 Identities = 32/151 (21%), Positives = 61/151 (40%), Gaps = 33/151 (21%)

Query: 279 SAVEDKIRRKMNE-QKVQAQEEIEILKQTENELNQGKFKLARM---FERIDKEKAELERS 334
           +A+E KIR + ++  K++ Q     LK  E E+   + +L       +++ K+ A+L   
Sbjct: 48  AALEKKIREQQDQRAKLEKQ-----LKSLETEIASLEAQLIETADDLKKLRKQIADLNAR 102

Query: 335 ISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNT------FTEEAATED-- 386
           ++ L+ +E E    LA                 A + +   N         E+A      
Sbjct: 103 LNALEVQEREQRRRLAEQ--------------LAALQRSGRNPPPALLVSPEDAQRSVRL 148

Query: 387 TIYY--MGEALRRGVIDLEVYLKQVRSLSRK 415
            IYY  +  A    +  L+  LKQ+ ++  +
Sbjct: 149 AIYYGALNPARAERIDALKATLKQLAAVRAE 179



 Score = 29.3 bits (66), Expect = 3.8
 Identities = 17/60 (28%), Positives = 30/60 (50%), Gaps = 8/60 (13%)

Query: 303 LKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDE 362
           LKQ + E+   + K+         ++A+LE+ +   K  ETE+  + A L E T  D+ +
Sbjct: 40  LKQIQKEIAALEKKIRE----QQDQRAKLEKQL---KSLETEIASLEAQLIE-TADDLKK 91



 Score = 29.3 bits (66), Expect = 4.9
 Identities = 16/76 (21%), Positives = 34/76 (44%), Gaps = 10/76 (13%)

Query: 273 IKMSLKSAVEDKIRRKMNEQKVQA--QEEIEILKQTENELNQGKFKLARMFERIDKEKAE 330
           + +S + A +  +R  +    +     E I+ LK T  +L   + ++A        E+AE
Sbjct: 135 LLVSPEDA-QRSVRLAIYYGALNPARAERIDALKATLKQLAAVRAEIAA-------EQAE 186

Query: 331 LERSISFLKEKETELD 346
           L   +S  + ++ +L 
Sbjct: 187 LTTLLSEQRAQQAKLA 202



 Score = 28.1 bits (63), Expect = 10.0
 Identities = 16/60 (26%), Positives = 29/60 (48%), Gaps = 3/60 (5%)

Query: 289 MNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEI 348
            ++Q  Q Q+EI  L++   E    + KL +  + ++ E A LE     L E   +L ++
Sbjct: 36  DDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQ---LIETADDLKKL 92


>gnl|CDD|219780 pfam08286, Spc24, Spc24 subunit of Ndc80.  Spc24 is a component of
           the evolutionarily conserved kinetochore-associated
           Ndc80 complex and is involved in chromosome segregation.
          Length = 117

 Score = 30.4 bits (69), Expect = 0.66
 Identities = 20/64 (31%), Positives = 32/64 (50%), Gaps = 3/64 (4%)

Query: 309 ELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTA 368
           EL   KF+LA   + ++  ++EL    S L+E +  L+E+     E  EVD DE     +
Sbjct: 3   ELENEKFRLA---KELNDLESELNNLQSELEELKERLEELEEQEVEGDEVDEDETPVDDS 59

Query: 369 PIYK 372
            + K
Sbjct: 60  NVLK 63


>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 257 to 277 amino acids in length. This domain is
           found associated with pfam00004. This domain has a
           conserved LER sequence motif.
          Length = 276

 Score = 31.7 bits (72), Expect = 0.66
 Identities = 27/112 (24%), Positives = 43/112 (38%), Gaps = 27/112 (24%)

Query: 286 RRKMNEQKVQAQEEI-----EILKQTENELNQGKFKLARMFERIDKEKAELERSISFLK- 339
           +R+ NE+ ++ QEE       + + TE E+ +       M     +E+AELER     K 
Sbjct: 127 QRRQNEELLKMQEESVLRQEAMRRATEEEILE-------MRRETIEEEAELERENIRAKI 179

Query: 340 ----------EKETELDEILAVLTEKTEVD---VDEAVTTTAPIYKQILNTF 378
                     E+E E D    +L  K   +   V E++ TT           
Sbjct: 180 EAEARGRAKEERENE-DINREMLKLKANEERETVLESIKTTFSHIGGGFRAL 230


>gnl|CDD|203570 pfam07058, Myosin_HC-like, Myosin II heavy chain-like.  This family
           represents a conserved region within a number of myosin
           II heavy chain-like proteins that seem to be specific to
           Arabidopsis thaliana.
          Length = 351

 Score = 31.9 bits (72), Expect = 0.66
 Identities = 21/68 (30%), Positives = 33/68 (48%), Gaps = 13/68 (19%)

Query: 282 EDKIRRKMNEQKVQAQEEIEILKQTENELNQGKF----------KLARMFERIDKEKAEL 331
           E+KI  KM+ QKV    E+E L Q+  EL +                R F+ +++E+  L
Sbjct: 22  ENKILDKMHRQKVA---EVEKLSQSVRELEEAVLAGGAAANAVRDYQRKFQEMNEERKTL 78

Query: 332 ERSISFLK 339
           ER ++  K
Sbjct: 79  ERELARAK 86


>gnl|CDD|233508 TIGR01649, hnRNP-L_PTB, hnRNP-L/PTB/hephaestus splicing factor
           family.  Included in this family of heterogeneous
           ribonucleoproteins are PTB (polypyrimidine tract binding
           protein ) and hnRNP-L. These proteins contain four RNA
           recognition motifs (rrm: pfam00067).
          Length = 481

 Score = 32.1 bits (73), Expect = 0.67
 Identities = 18/79 (22%), Positives = 22/79 (27%), Gaps = 6/79 (7%)

Query: 196 PTYPPSAYPGYNPNAAAAPSGSIYPSY--PTPGATPGAGGYS-PYPLYPPGGSTP---AT 249
           P    +            PS   +  Y        P AGG     P  PP    P   A 
Sbjct: 199 PGLDQTHRQRQPALLGQHPSSYGHDGYSSHGGPLAPLAGGDRMGPPHGPPSRYRPAYEAA 258

Query: 250 PYPVPTPSPAPAVTSGGTI 268
           P      S  PA    G++
Sbjct: 259 PLAPAISSYGPAGGGPGSV 277


>gnl|CDD|185320 PRK15422, PRK15422, septal ring assembly protein ZapB; Provisional.
          Length = 79

 Score = 29.7 bits (66), Expect = 0.68
 Identities = 21/68 (30%), Positives = 34/68 (50%), Gaps = 1/68 (1%)

Query: 291 EQKVQ-AQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEIL 349
           E KVQ A + I +L+    EL +    L++  +    ++ ELER  + LKE++    E L
Sbjct: 10  EAKVQQAIDTITLLQMEIEELKEKNNSLSQEVQNAQHQREELERENNHLKEQQNGWQERL 69

Query: 350 AVLTEKTE 357
             L  + E
Sbjct: 70  QALLGRME 77


>gnl|CDD|234053 TIGR02903, spore_lon_C, ATP-dependent protease, Lon family.
           Members of this protein family resemble the widely
           distributed ATP-dependent protease La, also called Lon
           and LonA. It resembles even more closely LonB, which is
           a LonA paralog found in genomes if and only if the
           species is capable of endospore formation (as in
           Bacillus subtilis, Clostridium tetani, and select other
           members of the Firmicutes) and expressed specifically in
           the forespore compartment. Members of this family are
           restricted to a subset of spore-forming species, and are
           very likely to participate in the program of endospore
           formation. We propose the designation LonC [Protein
           fate, Degradation of proteins, peptides, and
           glycopeptides, Cellular processes, Sporulation and
           germination].
          Length = 615

 Score = 32.1 bits (73), Expect = 0.68
 Identities = 24/97 (24%), Positives = 48/97 (49%), Gaps = 7/97 (7%)

Query: 253 VPTPSPAPAV--TSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEI--EILKQTEN 308
           VP P   P +   +   I D   + ++++ +E K+  +M E++ +  EEI  ++LK+ + 
Sbjct: 60  VPDPRELPEILEDTEDHIADILARRTVENRIERKVETRMQERQNKYLEEIRLQVLKEEKG 119

Query: 309 ELNQGKFKLARMFERIDKEKAELERSI-SFLKEKETE 344
             N    K     E+++K+K  L +S  S L+ +   
Sbjct: 120 PENSSTLKKLERLEKLEKKK--LHKSAQSLLRPRAFS 154


>gnl|CDD|219569 pfam07777, MFMR, G-box binding protein MFMR.  This region is found
           to the N-terminus of the pfam00170 transcription factor
           domain. It is between 150 and 200 amino acids in length.
           The N-terminal half is rather rich in proline residues
           and has been termed the PRD (proline rich domain),
           whereas the C-terminal half is more polar and has been
           called the MFMR (multifunctional mosaic region). It has
           been suggested that this family is composed of three
           sub-families called A, B and C, classified according to
           motif composition. It has been suggested that some of
           these motifs may be involved in mediating
           protein-protein interactions. The MFMR region contains a
           nuclear localisation signal in bZIP opaque and GBF-2.
           The MFMR also contains a transregulatory activity in
           TAF-1. The MFMR in CPRF-2 contains cytoplasmic retention
           signals.
          Length = 189

 Score = 31.4 bits (71), Expect = 0.68
 Identities = 27/85 (31%), Positives = 34/85 (40%), Gaps = 13/85 (15%)

Query: 198 YPPSAYPGY-NPNAAAAPSGSIYPSYPTPGATPGAGGYSPYP-LYPPGG----------S 245
           Y P   P Y N + A++P    Y   P     P  G   PY  +YPPGG          S
Sbjct: 38  YGPRPPPPYFNSSVASSPQPHPYMWGPQQPMMPPYGTPPPYAAMYPPGGVYAHPSMPPGS 97

Query: 246 TPATPYPVPTPSPAPAVTSGGTITD 270
            P +PY +P+    P  T     TD
Sbjct: 98  HPFSPYAMPSAEV-PGSTPLSMETD 121


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 32.2 bits (73), Expect = 0.69
 Identities = 31/116 (26%), Positives = 41/116 (35%), Gaps = 13/116 (11%)

Query: 233 GYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGTITDEHIKMSLKSAVEDKIRRKMNEQ 292
           G S   +   GG+       +P     P   S           SL+  VE+     M   
Sbjct: 3   GESNTTVMADGGNQSILRSDLPKFLSQPLEGSHHLGVSTSALSSLQKQVEES----MTLL 58

Query: 293 KVQAQEEIE-ILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEK-ETELD 346
           +       +  L Q ENEL Q +       +R    + ELER  S L E  E ELD
Sbjct: 59  QRAELIRSKSKLIQLENELMQKE----LEHKR---AQIELERKASTLAENYERELD 107


>gnl|CDD|184943 PRK14981, PRK14981, DNA-directed RNA polymerase subunit F;
           Provisional.
          Length = 112

 Score = 30.2 bits (69), Expect = 0.69
 Identities = 19/73 (26%), Positives = 32/73 (43%), Gaps = 8/73 (10%)

Query: 284 KIRRKMNEQKVQAQEEIEILKQ--TENELN--QGK-FKLARMFERIDKEKAELERSISFL 338
             ++ + E+ +   E  EIL +   E EL+    +       F ++D E A  E  +  L
Sbjct: 2   IGKKILEEEYITIAEAKEILSEIEEERELSYELRRTLDYLNRFSKLDPEDA--EELVEEL 59

Query: 339 KEKETELDEILAV 351
            E E ++ E  AV
Sbjct: 60  LELE-KMKEKTAV 71


>gnl|CDD|191033 pfam04576, Zein-binding, Zein-binding.  This domain binds to zein
           proteins, pfam01559. Zein proteins are seed storage
           proteins.
          Length = 94

 Score = 30.0 bits (68), Expect = 0.70
 Identities = 20/75 (26%), Positives = 34/75 (45%), Gaps = 22/75 (29%)

Query: 269 TDEHIKMSL-----KSAVE---DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARM 320
             E + M L     K+A+E    + +R   E+    QEEIE+L+               +
Sbjct: 33  ASEAMAMILRLQEEKAALEMEARQYQRMAEEKMEYDQEEIELLEDL-------------L 79

Query: 321 FERIDKEKAELERSI 335
           +ER ++E A LE+ +
Sbjct: 80  YER-EREIAALEKEL 93


>gnl|CDD|191331 pfam05645, RNA_pol_Rpc82, RNA polymerase III subunit RPC82.  This
           family consists of several DNA-directed RNA polymerase
           III polypeptides which are related to the Saccharomyces
           cerevisiae RPC82 protein. RNA polymerase C (III)
           promotes the transcription of tRNA and 5S RNA genes. In
           Saccharomyces cerevisiae, the enzyme is composed of 15
           subunits, ranging from 160 to about 10 kDa.
          Length = 215

 Score = 31.2 bits (71), Expect = 0.70
 Identities = 28/136 (20%), Positives = 50/136 (36%), Gaps = 24/136 (17%)

Query: 252 PVPTPSPAPAVTSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELN 311
            VP+P   P       + +EH++ + +S+   +I+R+   +     E  ++ +  E  L 
Sbjct: 14  RVPSPHFRPIGDLWNQLYEEHLRSTPRSSTTSEIKRRAEAKVKAKTELRDLRESPEILLK 73

Query: 312 QGKFKLARMFERIDKEKAE------------LERSISFLKEKETELDEILAVLTEKTEVD 359
                LA   +    +K              LER    L+      DE L  L +     
Sbjct: 74  GLGRSLADDTDNPTGKKRTKILDPGLVWRVNLERFHKHLR------DEALVDLAKSR--- 124

Query: 360 VDEAVTTTAPIYKQIL 375
           +     TTA +Y+  L
Sbjct: 125 IGS---TTAEVYRAAL 137


>gnl|CDD|234428 TIGR03979, His_Ser_Rich, His-Xaa-Ser repeat protein HxsA.  Members
           of this protein share two defining regions. One is a
           histidine/serine-rich cluster, typically
           H-R-S-H-S-S-H-R-S-H-S-S-H. Members are found always in
           the context of a pair of radical SAM proteins, HxsB and
           HxsC, and a fourth protein HxsD. The system is predicted
           to perform peptide modifications, likely in the
           His-Xaa-Ser region, to produce some uncharacterized
           natural product.
          Length = 186

 Score = 31.0 bits (70), Expect = 0.71
 Identities = 15/35 (42%), Positives = 19/35 (54%), Gaps = 4/35 (11%)

Query: 230 GAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTS 264
           GAGG    P      ST +  YPVP+PS +P+  S
Sbjct: 71  GAGGSYSVPSGD--TSTYS--YPVPSPSYSPSPGS 101


>gnl|CDD|117486 pfam08919, F_actin_bind, F-actin binding.  The F-actin binding
           domain forms a compact bundle of four antiparallel
           alpha-helices, which are arranged in a left-handed
           topology. Binding of F-actin to the F-actin binding
           domain may result in cytoplasmic retention and
           subcellular distribution of the protein, as well as
           possible inhibition of protein function.
          Length = 179

 Score = 31.2 bits (70), Expect = 0.71
 Identities = 16/45 (35%), Positives = 23/45 (51%), Gaps = 6/45 (13%)

Query: 223 PTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGT 267
           P P A P     +     P G  TP +P P+P+ SP+P+  + GT
Sbjct: 6   PVPPAVPKPQSTAK----PVG--TPPSPVPLPSTSPSPSKMANGT 44


>gnl|CDD|212016 cd11626, HR1_ROCK, Protein kinase C-related kinase homology region
           1 (HR1) Rho-binding domain of Rho-associated coiled-coil
           containing protein kinase.  ROCK is also referred to as
           Rho-associated kinase or simply as Rho kinase. It is a
           serine/threonine protein kinase that is activated via
           interaction with Rho GTPases and is involved in many
           cellular functions including contraction, adhesion,
           migration, motility, proliferation, and apoptosis. ROCKs
           are the best-described effectors of RhoA. There are two
           isoforms, ROCK1 and ROCK2, which may be functionally
           redundant in some systems, but exhibit different tissue
           distributions. Both isoforms are ubiquitously expressed
           in most tissues, but ROCK2 is more prominent in brain
           and skeletal muscle while ROCK1 is more pronounced in
           the liver, testes, and kidney. Studies in knockout mice
           result in different phenotypes, suggesting that the two
           isoforms do not compensate for each other during
           embryonic development. ROCK contains an N-terminal
           extension, a catalytic kinase domain, and a long
           C-terminal extension, which contains a Rho-binding HR1
           domain and a pleckstrin homology (PH) domain. ROCK is
           auto-inhibited by HR1 and PH domains interacting with
           the catalytic domain. HR1 domains are anti-parallel
           coiled-coil (ACC) domains that bind small GTPases from
           the Rho family.
          Length = 66

 Score = 29.2 bits (65), Expect = 0.72
 Identities = 14/68 (20%), Positives = 33/68 (48%), Gaps = 4/68 (5%)

Query: 285 IRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETE 344
           ++ +  E + +A  E E  +  EN++      L    E + K+K E +++    ++ + +
Sbjct: 3   LQHRQQEYQRKADMEAEKRRNVENDVAA----LKDQLEDLKKQKQESQKAEEKARQLQKQ 58

Query: 345 LDEILAVL 352
           L+E   +L
Sbjct: 59  LEEANRLL 66


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 31.1 bits (71), Expect = 0.73
 Identities = 23/81 (28%), Positives = 35/81 (43%), Gaps = 4/81 (4%)

Query: 288 KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDE 347
           ++ E+  + QE++  L+Q   EL Q    L    ER+ KE A +++    L     ELDE
Sbjct: 77  ELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQ----LSANAIELDE 132

Query: 348 ILAVLTEKTEVDVDEAVTTTA 368
               L E+      E     A
Sbjct: 133 ENRELREELAELKQENEALEA 153


>gnl|CDD|214562 smart00212, UBCc, Ubiquitin-conjugating enzyme E2, catalytic domain
           homologues.  Proteins destined for proteasome-mediated
           degradation may be ubiquitinated. Ubiquitination follows
           conjugation of ubiquitin to a conserved cysteine residue
           of UBC homologues. This pathway functions in regulating
           many fundamental processes required for cell
           viability.TSG101 is one of several UBC homologues that
           lacks this active site cysteine.
          Length = 145

 Score = 30.7 bits (70), Expect = 0.78
 Identities = 21/82 (25%), Positives = 31/82 (37%), Gaps = 10/82 (12%)

Query: 2   TNLSGLVYHIPVCIWLMDTHPDHAPICYVRPTADMRINVSQNVDHTGKIYSPYLH--TWT 59
           T   G V+ +   I   + +P   P           I    NVD +G+I    L    W+
Sbjct: 40  TPYEGGVFKL--TIEFPEDYPFKPPKVKFITK----IY-HPNVDSSGEICLDILKQEKWS 92

Query: 60  PQNSKLEGLIKTLASAFSALPP 81
           P    LE ++ +L S  S   P
Sbjct: 93  P-ALTLETVLLSLQSLLSEPNP 113


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 31.9 bits (72), Expect = 0.79
 Identities = 19/98 (19%), Positives = 49/98 (50%), Gaps = 7/98 (7%)

Query: 271 EHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 330
           E+ ++  K  +E++    ++  K+  +E I++L++   +  +   ++    + ++KE+  
Sbjct: 212 EYYQLKEKLELEEENLLYLDYLKLN-EERIDLLQELLRDEQE---EIESSKQELEKEEEI 267

Query: 331 LERSISFLKEKETE---LDEILAVLTEKTEVDVDEAVT 365
           L + +   KE+E E    +E L +L ++ E    E + 
Sbjct: 268 LAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLK 305


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 31.8 bits (72), Expect = 0.80
 Identities = 27/91 (29%), Positives = 39/91 (42%), Gaps = 15/91 (16%)

Query: 282 EDKIRRKMNEQK-----VQAQ----EEIEILKQTENELNQGKFKLARMFERIDKEKAELE 332
           E+K R+K  E+       QA     E  E LKQ + +L     KL    +  ++E   LE
Sbjct: 501 EEKKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKKLEHDLKLKEEECRMLE 560

Query: 333 RSISFLK-----EKETE-LDEILAVLTEKTE 357
           +    L+     EKETE L   L  + +K  
Sbjct: 561 KEAQELRKYQESEKETEVLMSALQAMQDKNL 591


>gnl|CDD|227507 COG5180, PBP1, Protein interacting with poly(A)-binding protein
           [RNA processing and modification].
          Length = 654

 Score = 31.6 bits (71), Expect = 0.81
 Identities = 24/93 (25%), Positives = 32/93 (34%), Gaps = 11/93 (11%)

Query: 162 PATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNP----NAAAAPSGS 217
           P  NP    NM     +P    M  P ++ N     +   +   Y P         PS  
Sbjct: 510 PGMNPAMGMNMGGMMGFP----MGGPSASPNPMMNGFAAGSMGMYMPFQPQPMFYHPSPQ 565

Query: 218 IYPSYPTPGATPGAGGYSPYPLYPPGGSTPATP 250
           + P   + GA  G G  SP   + P G   A P
Sbjct: 566 MMPVMGSNGAEEGGGNISP---HVPAGFMAAGP 595


>gnl|CDD|206350 pfam14182, YgaB, YgaB-like protein.  The YgaB-like protein family
           includes the B. subtilis YgaB protein, which is
           functionally uncharacterized. This family of proteins is
           found in bacteria. Proteins in this family are
           approximately 90 amino acids in length.
          Length = 79

 Score = 29.2 bits (66), Expect = 0.86
 Identities = 17/61 (27%), Positives = 26/61 (42%), Gaps = 12/61 (19%)

Query: 297 QEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKT 356
           Q E+E  ++ E EL               ++KAELE     +     EL EI  +  ++T
Sbjct: 20  QSELERCQEIEKEL------------LELEKKAELESIQQEISRMRKELKEIQELFEKQT 67

Query: 357 E 357
           E
Sbjct: 68  E 68


>gnl|CDD|220555 pfam10079, DUF2317, Uncharacterized protein conserved in bacteria
           (DUF2317).  Members of this family of hypothetical
           bacterial proteins have no known function.
          Length = 541

 Score = 31.8 bits (73), Expect = 0.86
 Identities = 17/81 (20%), Positives = 34/81 (41%), Gaps = 6/81 (7%)

Query: 281 VEDKIRRKMNEQKVQAQEEIEILKQTENELNQ---GKFKLARMFERIDKEKAELERSISF 337
           +E K+ +K+   K+    E ++       L +    +   + +    ++ KAELE     
Sbjct: 384 IEPKVEKKL--DKLGLSIE-DVFLHGLELLKEKVLEEKANSEIDIDFEEAKAELEAQFKR 440

Query: 338 LKEKETELDEILAVLTEKTEV 358
           L+E+  ++D  L    E  E 
Sbjct: 441 LREEAAKIDPSLEGALEANEA 461


>gnl|CDD|177328 PHA01929, PHA01929, putative scaffolding protein.
          Length = 306

 Score = 31.2 bits (70), Expect = 0.87
 Identities = 26/98 (26%), Positives = 29/98 (29%), Gaps = 6/98 (6%)

Query: 150 LPPLYSSRTKQ-EPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNP 208
           LPP  +       PA  PT   N       P Q     PG+      PT  P   P    
Sbjct: 8   LPPGLAGLVANVPPAAAPTPQPNPVIQPQAPVQP--GQPGAPQQLAIPTQQPQPVPTSAM 65

Query: 209 NAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGST 246
                      P+ P P A P AG   P  L  P    
Sbjct: 66  TPHVVQQA---PAQPAPAAPPAAGAALPEALEVPPPPA 100



 Score = 28.9 bits (64), Expect = 5.0
 Identities = 19/72 (26%), Positives = 23/72 (31%), Gaps = 10/72 (13%)

Query: 199 PPSAYPGYNPNAAAAPSGSIYPSYP-------TPGATPGAGGYSPY-PLYPPGGSTPATP 250
           PP+A P   PN    P   + P  P        P   P     S   P        PA P
Sbjct: 20  PPAAAPTPQPNPVIQPQAPVQPGQPGAPQQLAIPTQQPQPVPTSAMTPHVVQQA--PAQP 77

Query: 251 YPVPTPSPAPAV 262
            P   P+   A+
Sbjct: 78  APAAPPAAGAAL 89


>gnl|CDD|203446 pfam06428, Sec2p, GDP/GTP exchange factor Sec2p.  In Saccharomyces
           cerevisiae, Sec2p is a GDP/GTP exchange factor for
           Sec4p, which is required for vesicular transport at the
           post-Golgi stage of yeast secretion.
          Length = 90

 Score = 29.2 bits (66), Expect = 0.90
 Identities = 24/80 (30%), Positives = 38/80 (47%), Gaps = 12/80 (15%)

Query: 280 AVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLA-RMFERIDKEKAELE----RS 334
           A E K R +  ++K + ++E+E       +L    F  A +M     +E+  +E    + 
Sbjct: 4   AEEKKKRLRAEKEKKKLEQEVE-------DLTASLFDEANKMVADARREREAVEIKNEKL 56

Query: 335 ISFLKEKETELDEILAVLTE 354
              LKEKET LD + A L E
Sbjct: 57  EEQLKEKETLLDSLQAQLKE 76


>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein.  This is a
           family of proteins which appears to have a highly
           conserved zinc finger domain at the C terminal end,
           described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
           predicted to contain a coiled coil. Members are
           annotated as being tumour-associated antigen HCA127 in
           humans but this could not confirmed.
          Length = 215

 Score = 31.0 bits (70), Expect = 0.91
 Identities = 23/83 (27%), Positives = 42/83 (50%), Gaps = 15/83 (18%)

Query: 283 DKIRRKMNEQKVQAQEEIE-----------ILKQTENELNQGKFKLARMFERIDKEKAEL 331
           DK   ++ E+K+Q  EE+             +KQ+++EL +   K+ARM      E   L
Sbjct: 38  DKEMEELLEEKMQHVEELRQIHADINDMETEIKQSKSELERRMGKIARMHG----EYNPL 93

Query: 332 ERSISFLKEKETELDEILAVLTE 354
           + SI+ +++ E  L+E+  +  E
Sbjct: 94  KESINEMRKLELGLEELPQLHEE 116


>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region.  The myc family
           belongs to the basic helix-loop-helix leucine zipper
           class of transcription factors, see pfam00010. Myc forms
           a heterodimer with Max, and this complex regulates cell
           growth through direct activation of genes involved in
           cell replication. Mutations in the C-terminal 20
           residues of this domain cause unique changes in the
           induction of apoptosis, transformation, and G2 arrest.
          Length = 329

 Score = 31.4 bits (71), Expect = 0.93
 Identities = 18/98 (18%), Positives = 28/98 (28%), Gaps = 9/98 (9%)

Query: 181 QSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGAT----PGAGGYSP 236
            S        + +  P   PS     +P + +    + Y    +  A+    P      P
Sbjct: 134 ASLQAARKEGLPSDSPAPAPSPRGRPHPASGSGRLSASYLQDLSTSASECIDPSV--VFP 191

Query: 237 YPL---YPPGGSTPATPYPVPTPSPAPAVTSGGTITDE 271
           YPL            TP       P  + +SG     E
Sbjct: 192 YPLNERSKSSKVASPTPRLGLRTPPNSSSSSGSDSESE 229


>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
          Length = 895

 Score = 31.8 bits (72), Expect = 0.93
 Identities = 14/81 (17%), Positives = 38/81 (46%), Gaps = 7/81 (8%)

Query: 282 EDKIRR---KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKA-ELERSISF 337
           E+K        NE+K + +E+I  ++    ++++    L +  E ++ E+  +     + 
Sbjct: 464 EEKSNHIINHYNEKKSRLEEKIREIEIEVKDIDEKIVDLKKRKEYLESEEINKSINEYNK 523

Query: 338 LKEKETELDEI---LAVLTEK 355
           ++    +L++I   +  L +K
Sbjct: 524 IESARADLEDIKIKINELKDK 544


>gnl|CDD|191136 pfam04949, Transcrip_act, Transcriptional activator.  This family
           of proteins may act as a transcriptional activator. It
           plays a role in stress response in plants.
          Length = 160

 Score = 30.6 bits (69), Expect = 0.98
 Identities = 28/92 (30%), Positives = 46/92 (50%), Gaps = 11/92 (11%)

Query: 275 MSLKSAVEDKIRRKMNE--QKVQAQ-----EEIEILKQTENEL----NQGKFKLARMFER 323
           +SL  A E++I R+  E  ++VQAQ     EE + L +   EL    +  + +++ + +R
Sbjct: 33  LSLFKAKEEEIERRKMEVRERVQAQLGRVEEETKRLAEIREELEGMADPMRKEVSMIRKR 92

Query: 324 IDKEKAELERSISFLKEKETELDEILAVLTEK 355
           ID    EL+      ++KE E  E L    EK
Sbjct: 93  IDSVNRELKPLGQTCQKKEKEYKEALEAYNEK 124


>gnl|CDD|177188 MTH00130, COX3, cytochrome c oxidase subunit III; Provisional.
          Length = 261

 Score = 30.9 bits (70), Expect = 1.0
 Identities = 17/45 (37%), Positives = 22/45 (48%), Gaps = 5/45 (11%)

Query: 228 TPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGTITDEH 272
           TP  GG      +PP G T   P+ VP  + A  + SG T+T  H
Sbjct: 109 TPELGG-----CWPPTGITTLDPFEVPLLNTAVLLASGVTVTWAH 148


>gnl|CDD|237256 PRK12901, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 1112

 Score = 31.5 bits (72), Expect = 1.0
 Identities = 19/77 (24%), Positives = 34/77 (44%), Gaps = 9/77 (11%)

Query: 277 LKSAVEDKIRRKMNE-----QKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAEL 331
           LK    DK  R + E     +K++A+   E+   + +EL     K     + I +  A++
Sbjct: 5   LKKLFGDKSERDLKEIQPIVEKIKAEYP-ELEALSNDELRA---KTDEFKQYIKEAVADI 60

Query: 332 ERSISFLKEKETELDEI 348
           +  I  LK +  E  +I
Sbjct: 61  DAKIEELKAEAIESLDI 77


>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase.  This model represents the
           seryl-tRNA synthetase found in most organisms. This
           protein is a class II tRNA synthetase, and is recognized
           by the pfam model tRNA-synt_2b. The seryl-tRNA
           synthetases of two archaeal species, Methanococcus
           jannaschii and Methanobacterium thermoautotrophicum,
           differ considerably and are included in a different
           model [Protein synthesis, tRNA aminoacylation].
          Length = 418

 Score = 31.2 bits (71), Expect = 1.1
 Identities = 23/87 (26%), Positives = 38/87 (43%), Gaps = 12/87 (13%)

Query: 290 NEQKVQAQEEIEILKQTENELN--------QGKFKLARMFERIDKEKAELERSISFLKEK 341
           ++++ +   EIE L+   NEL+        Q K K+  + + + + K EL    + LK  
Sbjct: 36  DDERKKLLSEIEELQAKRNELSKQIGKAKGQKKDKIEEIKKELKELKEELTELSAALKAL 95

Query: 342 ETELDEILA----VLTEKTEVDVDEAV 364
           E EL + L     +  E   V  DE  
Sbjct: 96  EAELQDKLLSIPNIPHESVPVGKDEED 122


>gnl|CDD|220001 pfam08748, DUF1789, Domain of unknown function (DUF1789).  Putative
           uncharacterized domain found in phage-related conserved
           hypothetical protein from Bordetella.
          Length = 125

 Score = 29.7 bits (67), Expect = 1.1
 Identities = 17/85 (20%), Positives = 27/85 (31%), Gaps = 14/85 (16%)

Query: 315 FKLARMFERIDKEKAELERSISFLK--------EKETELDEILAVLTEKTEVDVDEAVTT 366
           FKL     R+  E A++E +   +         +K  E D  L       E    + +  
Sbjct: 14  FKLPVKIPRLGGEDAKVEFTFKHMDRKELAELYDKWGESDAKLGEKAAAEEAFTAKQIDL 73

Query: 367 TAPIYKQIL------NTFTEEAATE 385
            A + K I         F +E    
Sbjct: 74  DAELIKAIAVGWDLDEEFNDENVEL 98


>gnl|CDD|203138 pfam04977, DivIC, Septum formation initiator.  DivIC from B.
           subtilis is necessary for both vegetative and
           sporulation septum formation. These proteins are mainly
           composed of an amino terminal coiled-coil.
          Length = 80

 Score = 29.1 bits (66), Expect = 1.1
 Identities = 14/59 (23%), Positives = 24/59 (40%), Gaps = 5/59 (8%)

Query: 286 RRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKE-KAELERSISFLKEKET 343
             ++N++    Q E+  LK    EL      L    + I++  ++EL       K  ET
Sbjct: 19  YYQLNQEIAALQAELAKLKAENEELEAEVKDLKSDPDYIEERARSELG----LSKPGET 73


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 31.1 bits (71), Expect = 1.2
 Identities = 12/60 (20%), Positives = 31/60 (51%), Gaps = 1/60 (1%)

Query: 291 EQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELER-SISFLKEKETELDEIL 349
           EQ++  +++ E  +  E +    +  + ++ E+++ E+ +L       L+ K  E +E+L
Sbjct: 219 EQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLAEQERMLEHKLQEQEELL 278


>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 830

 Score = 31.4 bits (71), Expect = 1.2
 Identities = 22/112 (19%), Positives = 31/112 (27%), Gaps = 5/112 (4%)

Query: 188 GSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTP 247
           G       P     A P     AAAA   S  P+        GA   +  P      +  
Sbjct: 365 GGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGA---ALAP--KAAAAAA 419

Query: 248 ATPYPVPTPSPAPAVTSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEE 299
           AT    P  +PAP  T+            + +    +        +  AQ  
Sbjct: 420 ATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPP 471



 Score = 30.6 bits (69), Expect = 1.9
 Identities = 21/113 (18%), Positives = 33/113 (29%), Gaps = 6/113 (5%)

Query: 155 SSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAP 214
            +     P          A +      +  P    + + P    PP A         A  
Sbjct: 441 DAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDA-----AFEPAPR 495

Query: 215 SGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPA-PAVTSGG 266
           + +   + P       A   +     P   + PA     PTP+ A PA  +GG
Sbjct: 496 AAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAAPAARAGG 548



 Score = 29.0 bits (65), Expect = 5.4
 Identities = 16/87 (18%), Positives = 22/87 (25%), Gaps = 10/87 (11%)

Query: 176 APYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPN-AAAAPSGSIYPSYPTPGATPGAGGY 234
            P P        G++          +   G      AAA + +     P     P A   
Sbjct: 380 VPAPGARAAAAVGASAVPAVT--AVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATAD 437

Query: 235 SPYPLYPPGGSTPATPYPVPTPSPAPA 261
                   G        PVP  + A A
Sbjct: 438 R-------GDDAADGDAPVPAKANARA 457


>gnl|CDD|189476 pfam00261, Tropomyosin, Tropomyosin. 
          Length = 237

 Score = 30.7 bits (70), Expect = 1.2
 Identities = 20/90 (22%), Positives = 40/90 (44%), Gaps = 19/90 (21%)

Query: 277 LKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSIS 336
           L +A+E     K+ E +  A E     K  EN   +         E++++ +A+L+ +  
Sbjct: 59  LATALE-----KLEEAEKAADESERGRKVLENRSLKDD-------EKMEQLEAQLKEAKE 106

Query: 337 FLKEKETELDEI---LAVLTEKTEVDVDEA 363
             +E + + +E+   L V     E D++ A
Sbjct: 107 IAEEADRKYEEVARKLVV----VEGDLERA 132



 Score = 29.9 bits (68), Expect = 2.1
 Identities = 23/68 (33%), Positives = 35/68 (51%), Gaps = 16/68 (23%)

Query: 303 LKQTENELNQGKFKLARMFERIDK-----EKAELE-----RSISFLKEKETELD---EIL 349
           +KQ E EL++ + +L    E++++     EKAE E     R I  L+E   +L+   E L
Sbjct: 3   IKQLEEELDKAEERLEEAQEKLEEAEKRAEKAEAEVASLNRRIQLLEE---DLERSEERL 59

Query: 350 AVLTEKTE 357
           A   EK E
Sbjct: 60  ATALEKLE 67



 Score = 29.9 bits (68), Expect = 2.3
 Identities = 17/60 (28%), Positives = 31/60 (51%), Gaps = 3/60 (5%)

Query: 288 KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDE 347
           K  E+  +AQE+   L++ E    + + ++A +  RI   + +LERS   L     +L+E
Sbjct: 12  KAEERLEEAQEK---LEEAEKRAEKAEAEVASLNRRIQLLEEDLERSEERLATALEKLEE 68


>gnl|CDD|205306 pfam13125, DUF3958, Protein of unknown function (DUF3958).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 120 amino acids in length.
           There are two conserved sequence motifs: RLF and TWH.
          Length = 99

 Score = 29.0 bits (65), Expect = 1.3
 Identities = 20/76 (26%), Positives = 35/76 (46%), Gaps = 19/76 (25%)

Query: 291 EQKVQAQEEIEILKQTENELNQGKFKLARMFERI------DKEKAEL-----------ER 333
           E++ + Q  I+  ++ E + ++ K +  R+F RI      DKE +             ER
Sbjct: 3   EEQDRNQSAIQTQERAEADFHELKNRSNRLFNRILETWHGDKEMSLFFMNIRQEAQQIER 62

Query: 334 SISFL--KEKETELDE 347
            ++F    +KET L E
Sbjct: 63  KLTFELENQKETLLKE 78


>gnl|CDD|184060 PRK13453, PRK13453, F0F1 ATP synthase subunit B; Provisional.
          Length = 173

 Score = 30.3 bits (68), Expect = 1.3
 Identities = 24/115 (20%), Positives = 57/115 (49%), Gaps = 18/115 (15%)

Query: 270 DEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKE-K 328
           +E  K  LK   ++++++ + + KVQA+      +Q E  +++   +   M E    E  
Sbjct: 72  EEENKQKLKET-QEEVQKILEDAKVQAR------QQQEQIIHEANVRANGMIETAQSEIN 124

Query: 329 AELERSISFLKEKETELDEILAVLTEKTEV-DVDEAVTTTAPIYKQILNTFTEEA 382
           ++ ER+I+ +  + +EL  ++A    + E+ + D+         K +++ + +EA
Sbjct: 125 SQKERAIADINNQVSELSVLIASKVLRKEISEQDQ---------KALVDKYLKEA 170


>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 576

 Score = 30.9 bits (70), Expect = 1.4
 Identities = 17/52 (32%), Positives = 18/52 (34%)

Query: 210 AAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPA 261
           A A PS +     P   A P      P P   P     A P P P P  A A
Sbjct: 381 APAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAPPAAAA 432


>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421).  This
           family represents a conserved region approximately 350
           residues long within a number of plant proteins of
           unknown function.
          Length = 357

 Score = 30.7 bits (69), Expect = 1.4
 Identities = 26/121 (21%), Positives = 31/121 (25%), Gaps = 11/121 (9%)

Query: 148 SALPPLYSSRTKQEPATNPT-YNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGY 206
           S LPP       Q+P      Y    +   P P Q              P   P      
Sbjct: 100 SQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQ-----PQPPPQVPQQQQYQ 154

Query: 207 NPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYP---VPTPSPAPAVT 263
           +P     P     P      A   +G Y     Y P    P  P P      P  + A  
Sbjct: 155 SP--PQQPQYQQNPPPQAQSAPQVSGLYPEESPYQPQSYPPNEPLPSSMAMQPPYSGAPP 212

Query: 264 S 264
           S
Sbjct: 213 S 213



 Score = 28.4 bits (63), Expect = 7.2
 Identities = 25/100 (25%), Positives = 33/100 (33%), Gaps = 7/100 (7%)

Query: 155 SSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPY-PTYPPSAYPGYNPNAAAA 213
           S +    P  +P   G           S    P S     Y  + PP +   +   A+ A
Sbjct: 213 SQQFYGPPQPSPYMYGGPGGRPNSGFPSGQQPPPSQGQEGYGYSGPPPSKGNHGSVASYA 272

Query: 214 PSGSI------YPSYPTPGATPGAGGYSPYPLYPPGGSTP 247
           P GS       YPS P     P A   S  P+   G  +P
Sbjct: 273 PQGSSQSYSTAYPSLPAATVLPQALPMSSAPMSGGGSGSP 312



 Score = 28.0 bits (62), Expect = 9.4
 Identities = 24/114 (21%), Positives = 33/114 (28%), Gaps = 7/114 (6%)

Query: 149 ALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNP 208
           ALPP  + ++ Q             S AP   Q    +P   V +      P   P Y P
Sbjct: 72  ALPPAPAPQSPQPDQQQQ-------SQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPP 124

Query: 209 NAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAV 262
            +   P  +  P    P   P       Y   P        P P    +P  + 
Sbjct: 125 PSQPQPPPAQQPQAQQPQPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSG 178


>gnl|CDD|214360 CHL00094, dnaK, heat shock protein 70.
          Length = 621

 Score = 30.9 bits (70), Expect = 1.4
 Identities = 20/78 (25%), Positives = 41/78 (52%), Gaps = 11/78 (14%)

Query: 273 IKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELE 332
           +K + K+A EDK +R+  + K QA+    +  Q E +L + K K++      +++K ++E
Sbjct: 511 VKEAEKNAAEDKEKREKIDLKNQAE---SLCYQAEKQLKELKDKIS------EEKKEKIE 561

Query: 333 RSISFLKE--KETELDEI 348
             I  L++  +    + I
Sbjct: 562 NLIKKLRQALQNDNYESI 579


>gnl|CDD|177553 PHA03185, PHA03185, UL14 tegument protein; Provisional.
          Length = 214

 Score = 30.4 bits (68), Expect = 1.4
 Identities = 20/56 (35%), Positives = 25/56 (44%), Gaps = 8/56 (14%)

Query: 212 AAPSGSIYPSYPT-PGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPA-PAVTSG 265
           +AP+  + P   + PG+ P      P     PGG  PA P P P P P  PA  S 
Sbjct: 156 SAPA--VRPGDLSGPGSRPT---SGPVAGGLPGGGAPAPPTP-PRPGPTDPASESD 205


>gnl|CDD|235540 PRK05641, PRK05641, putative acetyl-CoA carboxylase biotin carboxyl
           carrier protein subunit; Validated.
          Length = 153

 Score = 29.8 bits (67), Expect = 1.4
 Identities = 13/36 (36%), Positives = 17/36 (47%), Gaps = 2/36 (5%)

Query: 236 PYPLYPPGGSTPATPYPV--PTPSPAPAVTSGGTIT 269
           P P   P  + P+ P PV    P+PAPA      +T
Sbjct: 53  PTPAPAPAPAVPSAPTPVAPAAPAPAPASAGENVVT 88


>gnl|CDD|177614 PHA03377, PHA03377, EBNA-3C; Provisional.
          Length = 1000

 Score = 31.2 bits (70), Expect = 1.4
 Identities = 26/108 (24%), Positives = 31/108 (28%), Gaps = 21/108 (19%)

Query: 173 ASNAPYPTQSYMPMPGSNVNTPYPTY-----PPSAYPGYNPNAAAAPSGSIYPSYPTP-G 226
           +  APY               PYP Y     P + Y GY    A     S YP Y  P G
Sbjct: 743 SHQAPYSGHEEPQAQ----QAPYPGYWEPRPPQAPYLGYQEPQAQGVQVSSYPGYAGPWG 798

Query: 227 ATPGAGGY----SPYPLYPPGGST-----PATPYPVPT--PSPAPAVT 263
                  Y    + +  YP  G       P  P+  P    S      
Sbjct: 799 LRAQHPRYRHSWAYWSQYPGHGHPQGPWAPRPPHLPPQWDGSAGHGQD 846


>gnl|CDD|177161 MTH00099, COX3, cytochrome c oxidase subunit III; Validated.
          Length = 261

 Score = 30.5 bits (69), Expect = 1.5
 Identities = 17/45 (37%), Positives = 22/45 (48%), Gaps = 5/45 (11%)

Query: 228 TPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGTITDEH 272
           TP  GG      +PP G TP  P  VP  + +  + SG +IT  H
Sbjct: 109 TPELGGC-----WPPTGITPLNPLEVPLLNTSVLLASGVSITWAH 148


>gnl|CDD|177464 PHA02682, PHA02682, ORF080 virion core protein; Provisional.
          Length = 280

 Score = 30.6 bits (68), Expect = 1.5
 Identities = 21/77 (27%), Positives = 24/77 (31%), Gaps = 2/77 (2%)

Query: 185 PMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGG 244
                    P       A     P  A A   +  P+ P P   P      P P  PP  
Sbjct: 87  ACAAPAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPA--RPAPACPPST 144

Query: 245 STPATPYPVPTPSPAPA 261
                  P+PTP PAPA
Sbjct: 145 RQCPPAPPLPTPKPAPA 161


>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
           Provisional.
          Length = 475

 Score = 30.9 bits (70), Expect = 1.6
 Identities = 30/114 (26%), Positives = 39/114 (34%), Gaps = 18/114 (15%)

Query: 160 QEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIY 219
            E       N   A+  P P Q     PG     P P   P ++P     A   P+    
Sbjct: 117 AEQVKALGANPVTATGEPVP-QMPASPPGPE-GEPQPGNTPVSFPPQGSVAVPPPTA--- 171

Query: 220 PSYPTPGAT-PGAGGYSPYPL----------YPPGGSTPATPYPVPTPSPAPAV 262
             YP  G T P    Y   P+          Y  G   P+ PY +P+ S A A+
Sbjct: 172 -FYPGNGVTPPPQVTYQSVPVPNRIQRKTFTYNEGKKGPSLPY-IPSGSFAKAM 223


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 30.8 bits (70), Expect = 1.6
 Identities = 32/148 (21%), Positives = 62/148 (41%), Gaps = 15/148 (10%)

Query: 278 KSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAEL--ERSI 335
           + +++DK+   + E+K QA+ ++  L     +L Q   +     E +  +  EL  ER  
Sbjct: 665 QQSLKDKLELAIAERKQQAETQLRQLDA---QLKQLLEQQQAFLEALKDDFRELRTERL- 720

Query: 336 SFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEAL 395
           +  +  E ELD  LA L+   E    +A      + KQ            +T+    + L
Sbjct: 721 AKWQVVEGELDNQLAQLSAAIEAARTQAKARLKELKKQYDRELASLDVDPNTV----KEL 776

Query: 396 RRGVIDLEVYLKQVRS-----LSRKQFM 418
           +R + +LE  ++++          + FM
Sbjct: 777 KRQIEELETTIERIAVRRPEVREYRAFM 804


>gnl|CDD|214697 smart00498, FH2, Formin Homology 2 Domain.  FH proteins control
           rearrangements of the actin cytoskeleton, especially in
           the context of cytokinesis and cell polarisation.
           Members of this family have been found to interact with
           Rho-GTPases, profilin and other actin-assoziated
           proteins. These interactions are mediated by the
           proline-rich FH1 domain, usually located in front of FH2
           (but not listed in SMART). Despite this cytosolic
           function, vertebrate formins have been assigned
           functions within the nucleus. A set of Formin-Binding
           Proteins (FBPs) has been shown to bind FH1 with their WW
           domain.
          Length = 392

 Score = 30.8 bits (70), Expect = 1.7
 Identities = 18/89 (20%), Positives = 34/89 (38%), Gaps = 14/89 (15%)

Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE---LERSISFL- 338
           DK    M      A+E+ + L++  ++L     KL   +    K+ +     +    FL 
Sbjct: 269 DKFIEVMKPFLKAAKEKYDKLQKDLSDLKTRFEKLVEYYGEDPKDTSPEEFFKDFNEFLK 328

Query: 339 ----------KEKETELDEILAVLTEKTE 357
                     K++E E +    ++ E TE
Sbjct: 329 EFSKAAEENIKKEEEEEERRKKLVKETTE 357


>gnl|CDD|217495 pfam03326, Herpes_TAF50, Herpesvirus transcription activation
           factor (transactivator).  This family includes EBV BRLF1
           and similar ORF 50 proteins from other herpesviruses.
          Length = 500

 Score = 30.5 bits (69), Expect = 1.7
 Identities = 19/119 (15%), Positives = 28/119 (23%), Gaps = 6/119 (5%)

Query: 148 SALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYN 207
            A     +   +   +  P  + +  S  P P+ +       N N               
Sbjct: 282 IACSSPQAEPEQGVQSYVPQSSDSRPSCFPAPSTTQPTFLPPNTNKKAKRDRRPQMVTPK 341

Query: 208 PNAAAAPSGSIYPSYP-TPGATPGAGGYSP-----YPLYPPGGSTPATPYPVPTPSPAP 260
               AA S +        P   P   G SP              + A   P     PA 
Sbjct: 342 QEGGAAVSQNHDGGTVRAPRGRPSGSGQSPPSNSPLLSSLADTPSGAAHQPASLLPPAV 400


>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La.  This protein is induced
           by heat shock and other stresses in E. coli, B.
           subtilis, and other species. The yeast member,
           designated PIM1, is located in the mitochondrial matrix,
           required for mitochondrial function, and also induced by
           heat shock [Protein fate, Degradation of proteins,
           peptides, and glycopeptides].
          Length = 775

 Score = 30.7 bits (70), Expect = 1.8
 Identities = 17/73 (23%), Positives = 34/73 (46%), Gaps = 11/73 (15%)

Query: 273 IKMSLKSAVEDKIRRK-----MNEQKVQAQEEIEILKQTENELNQGKFKL------ARMF 321
           ++  +   VE+K+ +      + EQ    ++E+ I K  ++EL + K KL        + 
Sbjct: 207 LQNKITKKVEEKMEKTQREYYLREQLKAIKKELGIEKDDKDELEKLKEKLEELKLPEEVK 266

Query: 322 ERIDKEKAELERS 334
           + I+KE  +L   
Sbjct: 267 KVIEKELTKLSLL 279


>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1).  This family
           consists of several rhoptry-associated protein 1 (RAP-1)
           sequences which appear to be specific to Plasmodium
           falciparum.
          Length = 790

 Score = 30.8 bits (69), Expect = 1.8
 Identities = 28/160 (17%), Positives = 54/160 (33%), Gaps = 12/160 (7%)

Query: 196 PTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPT 255
           P  P         +++   S +   S     +  G    S       G ++ +    V  
Sbjct: 132 PGKPKGNQGEGLASSSDGKSKASAKSGSKSASKHGESNSSDESATDSGKASASVAGIVGA 191

Query: 256 PSPAPAVTSGGTITDEHI----------KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQ 305
              AP          E +          K  L+  +E++I    N+    A+EE   L +
Sbjct: 192 DEEAPPAPKNTLTPLEELYETNVNLFALKHPLEK-LEEEIDILKNDGDKVAEEEEFELDE 250

Query: 306 TENELNQGKFK-LARMFERIDKEKAELERSISFLKEKETE 344
              E  + K + L ++    D+EK + +  I F++    +
Sbjct: 251 EHEEAEEDKKEALEKIGAEGDEEKFKFDEEIKFIEHDVKD 290


>gnl|CDD|147949 pfam06068, TIP49, TIP49 C-terminus.  This family consists of the
           C-terminal region of several eukaryotic and archaeal
           RuvB-like 1 (Pontin or TIP49a) and RuvB-like 2 (Reptin
           or TIP49b) proteins. The N-terminal domain contains the
           pfam00004 domain. In zebrafish, the liebeskummer (lik)
           mutation, causes development of hyperplastic embryonic
           hearts. lik encodes Reptin, a component of a
           DNA-stimulated ATPase complex. Beta-catenin and Pontin,
           a DNA-stimulated ATPase that is often part of complexes
           with Reptin, are in the same genetic pathways. The
           Reptin/Pontin ratio serves to regulate heart growth
           during development, at least in part via the
           beta-catenin pathway. TBP-interacting protein 49 (TIP49)
           was originally identified as a TBP-binding protein, and
           two related proteins are encoded by individual genes,
           tip49a and b. Although the function of this gene family
           has not been elucidated, they are supposed to play a
           critical role in nuclear events because they interact
           with various kinds of nuclear factors and have DNA
           helicase activities.TIP49a has been suggested to act as
           an autoantigen in some patients with autoimmune
           diseases.
          Length = 395

 Score = 30.4 bits (69), Expect = 1.8
 Identities = 31/133 (23%), Positives = 47/133 (35%), Gaps = 28/133 (21%)

Query: 249 TPYPVPTP-SPAPAVTSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTE 307
                  P S        G IT +  KM     +  KI  ++ ++KVQA + I I     
Sbjct: 126 EIEEAENPLSGYGKTIKHGKITLKTTKMEKTLKLGPKIYEQLQKEKVQAGDVIYI----- 180

Query: 308 NELNQGKF-KLARMFERIDKEKAELERSIS-----FLKEKET-------ELDEI------ 348
            + N GK  KL R F R      E    +        K KE        ++D        
Sbjct: 181 -DKNTGKVKKLGRSFARATDFDLEATEFVPCPKGEVHKRKEVVQTVTLHDIDVANARPQG 239

Query: 349 -LAVLT-EKTEVD 359
            L++ + +K E+ 
Sbjct: 240 ILSLFSPKKGEIT 252


>gnl|CDD|215457 PLN02852, PLN02852, ferredoxin-NADP+ reductase.
          Length = 491

 Score = 30.4 bits (69), Expect = 1.8
 Identities = 12/35 (34%), Positives = 20/35 (57%)

Query: 321 FERIDKEKAELERSISFLKEKETELDEILAVLTEK 355
           +E+ID  + E  R+    +EK T ++E+L    EK
Sbjct: 457 WEKIDSAEKEAGRARGKPREKITSIEEMLKAANEK 491


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 30.8 bits (70), Expect = 1.8
 Identities = 16/90 (17%), Positives = 39/90 (43%)

Query: 268 ITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKE 327
           I D+  ++       D    ++ E   +  E+ + L + +N+++  K  L  + ++  K 
Sbjct: 304 IKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKV 363

Query: 328 KAELERSISFLKEKETELDEILAVLTEKTE 357
           KA +E   +   +   EL ++   L +  +
Sbjct: 364 KAAIEELQAEFVDNAEELAKLQDELDKIVK 393


>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
          Length = 191

 Score = 30.0 bits (68), Expect = 1.8
 Identities = 23/117 (19%), Positives = 50/117 (42%), Gaps = 17/117 (14%)

Query: 282 EDKIRRKMNEQKVQAQEEIEILKQT-------ENELNQGKFKLARM---FE------RID 325
           E+ +  ++ E+ V+ + E E+L +        E +L++ + +  R+   FE      + +
Sbjct: 18  EEAVEDEVEEETVEEESEAELLDEEQAKIAELEAKLDELEERYLRLQADFENYKRRIQKE 77

Query: 326 KEKAELERSISFLKEKETELDEI-LAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEE 381
            E AE  R+ S   +    LD    A+  E  +      +     +++Q+L    +E
Sbjct: 78  NEAAEKYRAQSLASDLLPALDNFERALQIEADDEQTKSLLKGVEMVHRQLLEALKKE 134


>gnl|CDD|222259 pfam13608, Potyvirid-P3, Protein P3 of Potyviral polyprotein.  This
           is the P3 protein section of the Potyviridae
           polyproteins. The function is not known except that the
           protein is essential to viral survival.
          Length = 445

 Score = 30.6 bits (70), Expect = 1.9
 Identities = 14/83 (16%), Positives = 37/83 (44%), Gaps = 12/83 (14%)

Query: 276 SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSI 335
           +L S + +  + K    + + ++E + +++              ++ +++ +    E  +
Sbjct: 264 TLVSIINEHRKLKREAAEEKEEKEEKEIRKLY------------LYSKLEGKLPTEEEFL 311

Query: 336 SFLKEKETELDEILAVLTEKTEV 358
            +++EK  EL E    L E+ EV
Sbjct: 312 EYIEEKNPELLETAEELIEEEEV 334


>gnl|CDD|129394 TIGR00293, TIGR00293, prefoldin, archaeal alpha subunit/eukaryotic
           subunit 5.  Members of this protein family, rich in
           coiled coil regions, are molecular chaperones in the
           class of the prefoldin (GimC) alpha subunit. Prefoldin
           is a hexamer of two alpha and four beta subunits. This
           protein appears universal in the archaea but is
           restricted to Aquifex aeolicus among bacteria so far.
           Eukaryotes have several related proteins; only prefoldin
           subunit 5, which appeared the most similar to archaeal
           prefoldin alpha, is included in this model. This model
           finds a set of small proteins from the Archaea and from
           Aquifex aeolicus that may represent two orthologous
           groups. The proteins are predicted to be mostly coiled
           coil [Protein fate, Protein folding and stabilization].
          Length = 126

 Score = 29.2 bits (66), Expect = 1.9
 Identities = 16/86 (18%), Positives = 31/86 (36%), Gaps = 25/86 (29%)

Query: 287 RKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKE-KETEL 345
           +++  +    Q+++E L+                   +    AELE +I  L++ K  E 
Sbjct: 2   QQLAAELQILQQQVESLQAQ--------------IAALRALIAELETAIETLEDLKGAEG 47

Query: 346 DEIL----------AVLTEKTEVDVD 361
            E L          A + +  +V V 
Sbjct: 48  KETLVPVGAGSFVKAKVKDTDKVLVS 73


>gnl|CDD|219444 pfam07516, SecA_SW, SecA Wing and Scaffold domain.  SecA protein
           binds to the plasma membrane where it interacts with
           proOmpA to support translocation of proOmpA through the
           membrane. SecA protein achieves this translocation, in
           association with SecY protein, in an ATP dependent
           manner. This family is composed of two C-terminal alpha
           helical subdomains: the wing and scaffold subdomains.
          Length = 213

 Score = 29.8 bits (68), Expect = 1.9
 Identities = 19/86 (22%), Positives = 32/86 (37%), Gaps = 12/86 (13%)

Query: 270 DEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKA 329
            E I   ++  ++D +     E+    + ++E L   E EL +    L    +  + E  
Sbjct: 57  KEIILEMIEDVIDDIVEEYAPEKSYPEEWDLEGL---EEELRE-LLGLDLDIDEEELEGL 112

Query: 330 ELERSISFL--------KEKETELDE 347
             E     L        +EKE EL E
Sbjct: 113 TEEELKERLIEAAKEAYEEKEAELGE 138


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 30.5 bits (69), Expect = 1.9
 Identities = 24/101 (23%), Positives = 30/101 (29%)

Query: 161 EPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYP 220
           E A      G  A +   P     P PG+          P+        A+ A  GS  P
Sbjct: 49  ELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTP 108

Query: 221 SYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPA 261
             P+    P        P   P         PV +P P PA
Sbjct: 109 PGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPA 149



 Score = 29.4 bits (66), Expect = 5.1
 Identities = 15/84 (17%), Positives = 17/84 (20%), Gaps = 3/84 (3%)

Query: 176 APYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYS 235
            P       P   S  +   P        G N      P     P      A P     +
Sbjct: 838 PPGAAARPPPARSSESSKSKPAAAGGRARGKNGRRRPRPPE---PRARPGAAAPPKAAAA 894

Query: 236 PYPLYPPGGSTPATPYPVPTPSPA 259
             P   P       P     P P 
Sbjct: 895 APPAGAPAPRPRPAPRVKLGPMPP 918



 Score = 29.0 bits (65), Expect = 6.3
 Identities = 23/77 (29%), Positives = 27/77 (35%), Gaps = 9/77 (11%)

Query: 187 PGSNVNTPYPTYPPSA--YPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGG 244
           PG+    P      S+   P      A   +G   P  P P A PGA         PP  
Sbjct: 839 PGAAARPPPARSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAA-------PPKA 891

Query: 245 STPATPYPVPTPSPAPA 261
           +  A P   P P P PA
Sbjct: 892 AAAAPPAGAPAPRPRPA 908


>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 618

 Score = 30.5 bits (69), Expect = 1.9
 Identities = 17/69 (24%), Positives = 24/69 (34%), Gaps = 5/69 (7%)

Query: 196 PTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPT 255
           P  P +A P   P A AA + +  P+     A       +  P   P     A     P 
Sbjct: 381 PARPEAAAPAAAPVAQAAAAPA--PAAAPAAAASAP---AAPPAAAPPAPVAAPAAAAPA 435

Query: 256 PSPAPAVTS 264
            +PA A  +
Sbjct: 436 AAPAAAPAA 444



 Score = 28.5 bits (64), Expect = 8.5
 Identities = 22/110 (20%), Positives = 30/110 (27%), Gaps = 2/110 (1%)

Query: 151 PPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNA 210
            P          A         A  A     +  P        P P   P+A       A
Sbjct: 380 TPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPA 439

Query: 211 AAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAP 260
           AA  + ++ P+ P   A        P  + P      A P P   P+ A 
Sbjct: 440 AAPAAVALAPAPPAQAAPETVA--IPVRVAPEPAVASAAPAPAAAPAAAR 487


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 30.7 bits (69), Expect = 2.0
 Identities = 23/95 (24%), Positives = 30/95 (31%), Gaps = 8/95 (8%)

Query: 184 MPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPG 243
             +PG+    P P       P  + + A + S +     P     P   G SP     P 
Sbjct: 180 GNIPGAI--QPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPP 237

Query: 244 GSTPATP----YPVPTPSPAPAVTSGGTITDEHIK 274
            S PA P         P P P       I   HI+
Sbjct: 238 ASIPAPPIPPVIQYVAPPPVPPPQPIIPIQ--HIR 270


>gnl|CDD|220410 pfam09798, LCD1, DNA damage checkpoint protein.  This is a family
           of proteins which regulate checkpoint kinases. In
           Schizosaccharomyces pombe this protein is called Rad26
           and in Saccharomyces cerevisiae it is called LCD1.
          Length = 648

 Score = 30.5 bits (69), Expect = 2.0
 Identities = 15/50 (30%), Positives = 23/50 (46%), Gaps = 6/50 (12%)

Query: 285 IRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERS 334
           +R K++  + Q QEE    K   NEL + K       + + K K EL+  
Sbjct: 2   LRDKLDMLQQQKQEERNKQKSRVNEL-KEKHD-----QELQKLKQELQSL 45


>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
           carboxylase.
          Length = 274

 Score = 30.2 bits (68), Expect = 2.0
 Identities = 28/82 (34%), Positives = 33/82 (40%), Gaps = 3/82 (3%)

Query: 185 PMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGG 244
           P P + V    P  P +  P   P A  APS       PTP + P A   +P   +PP  
Sbjct: 144 PPPPAPVVMMQPPPPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAK--APKSSHPPLK 201

Query: 245 STPA-TPYPVPTPSPAPAVTSG 265
           S  A T Y  P P   P V  G
Sbjct: 202 SPMAGTFYRSPAPGEPPFVKVG 223


>gnl|CDD|129241 TIGR00135, gatC, aspartyl/glutamyl-tRNA(Asn/Gln) amidotransferase,
           C subunit.  Archaea, organelles, and many bacteria
           charge Gln-tRNA by first misacylating it with Glu and
           then amidating Glu to Gln. This small protein is part of
           the amidotransferase heterotrimer and appears to be
           important to the stability of the amidase subunit encode
           by gatA, but its function may not be required in every
           organism that expresses gatA and gatB. The seed
           alignment for This model does not include any eukaryotic
           sequence and is not guaranteed to find eukaryotic
           examples, although it does find some. Saccharomyces
           cerevisiae, which expresses the amidotransferase for
           mitochondrial protein translation, seems to lack a gatC
           ortholog. This model has been revised to remove the
           candidate sequence from Methanococcus jannaschii, now
           part of a related model [Protein synthesis, tRNA
           aminoacylation].
          Length = 93

 Score = 28.4 bits (64), Expect = 2.0
 Identities = 17/56 (30%), Positives = 24/56 (42%), Gaps = 7/56 (12%)

Query: 338 LKEKE-----TELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTI 388
           L E+E      +LDEIL  + +  EVD +     T P+   I N   E+   E   
Sbjct: 17  LSEEEAESFAGDLDEILDYVEQLNEVDTENVEPMTHPLE--ISNVLREDEPEEPLS 70


>gnl|CDD|236382 PRK09111, PRK09111, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 598

 Score = 30.3 bits (69), Expect = 2.1
 Identities = 14/68 (20%), Positives = 16/68 (23%), Gaps = 5/68 (7%)

Query: 204 PGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVT 263
           P         P G        PGA   A         P  G   A    VP  + A A  
Sbjct: 392 PSPGGGGGGPPGGG-----GAPGAPAAAAAPGAAAAAPAAGGPAAALAAVPDAAAAAAAP 446

Query: 264 SGGTITDE 271
                  +
Sbjct: 447 PAPAAAPQ 454



 Score = 30.3 bits (69), Expect = 2.5
 Identities = 9/43 (20%), Positives = 11/43 (25%)

Query: 220 PSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAV 262
           P  P  G     GG           +  A         PA A+
Sbjct: 391 PPSPGGGGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAAL 433


>gnl|CDD|218232 pfam04731, Caudal_act, Caudal like protein activation region.  This
           family consists of the amino termini of proteins
           belonging to the caudal-related homeobox protein family.
           This region is thought to mediate transcription
           activation. The level of activation caused by mouse Cdx2
           is affected by phosphorylation at serine 60 via the
           mitogen-activated protein kinase pathway. Caudal family
           proteins are involved in the transcriptional regulation
           of multiple genes expressed in the intestinal
           epithelium, and are important in differentiation and
           maintenance of the intestinal epithelial lining. Caudal
           proteins always have a homeobox DNA binding domain
           (pfam00046).
          Length = 135

 Score = 29.3 bits (66), Expect = 2.1
 Identities = 24/95 (25%), Positives = 31/95 (32%), Gaps = 18/95 (18%)

Query: 179 PTQSYMPMPGSNVNTPYPTYPP-SAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPY 237
             Q+++  P       YP Y      PG N +    PSG+    Y  P        +S Y
Sbjct: 14  GAQNFVSAP------QYPDYGGYHHVPGMNLDPHGQPSGAWGSPYGPPRED-----WSAY 62

Query: 238 PLYPPGGSTPAT------PYPVPTPSPAPAVTSGG 266
              P   +T AT       Y  P  SP      G 
Sbjct: 63  GPGPGPSATAATGSPGQMAYSPPDYSPVHPPGPGP 97



 Score = 29.0 bits (65), Expect = 2.7
 Identities = 29/110 (26%), Positives = 35/110 (31%), Gaps = 26/110 (23%)

Query: 171 NMASNAPYPT-QSYMPMPGSNVNTP-------YPTYPP-----SAYPGYNPNAAAAPSGS 217
           N  S   YP    Y  +PG N++            Y P     SAY      +A A +GS
Sbjct: 17  NFVSAPQYPDYGGYHHVPGMNLDPHGQPSGAWGSPYGPPREDWSAYGPGPGPSATAATGS 76

Query: 218 IYPSYPTPGATPGAGGYSP--YPLYPPGGSTPATPYPVPTPSPAPAVTSG 265
                      PG   YSP  Y    P G  P    P  + S      S 
Sbjct: 77  -----------PGQMAYSPPDYSPVHPPGPGPGLLQPADSGSVEQLSPSA 115


>gnl|CDD|237255 PRK12900, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 1025

 Score = 30.5 bits (69), Expect = 2.1
 Identities = 28/101 (27%), Positives = 47/101 (46%), Gaps = 14/101 (13%)

Query: 273 IKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELE 332
           I+ SL S  +D +R K  E K + +  +E ++Q + +L +   KL      I  E+AE  
Sbjct: 30  IQASLASLSDDALREKGMELKSRVRGALEPIEQKKKDLEK---KLDN--PDISLEEAE-- 82

Query: 333 RSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQ 373
            SI+       ELD +     E T   ++E +  T  + K+
Sbjct: 83  -SIN------EELDTLAKEYEEATAAALEEILPETFALVKE 116


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 265

 Score = 30.1 bits (68), Expect = 2.1
 Identities = 22/78 (28%), Positives = 35/78 (44%), Gaps = 7/78 (8%)

Query: 276 SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELER-- 333
                 + K+     E+K   Q EIE L     E+     K+  + + ID+ KAE+++  
Sbjct: 31  DKIQNQDSKLSELQKEKK-NIQNEIESLDNQIEEIQS---KIDELQKEIDQSKAEIKKLQ 86

Query: 334 -SISFLKEKETELDEILA 350
             I+ LKE   E  E+L 
Sbjct: 87  KEIAELKENIVERQELLK 104



 Score = 28.1 bits (63), Expect = 7.5
 Identities = 16/75 (21%), Positives = 31/75 (41%), Gaps = 4/75 (5%)

Query: 287 RKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFE----RIDKEKAELERSISFLKEKE 342
           +K+ EQ+ + ++ +E  +    +  +    L    E     ++ +KAE    I+ L  KE
Sbjct: 147 KKILEQQKEDKKSLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAKE 206

Query: 343 TELDEILAVLTEKTE 357
                  A L E+  
Sbjct: 207 ASALGEKAALEEQKA 221


>gnl|CDD|177146 MTH00075, COX3, cytochrome c oxidase subunit III; Provisional.
          Length = 261

 Score = 30.1 bits (68), Expect = 2.1
 Identities = 14/33 (42%), Positives = 19/33 (57%)

Query: 240 YPPGGSTPATPYPVPTPSPAPAVTSGGTITDEH 272
           +PP G TP  P+ VP  + A  + SG T+T  H
Sbjct: 116 WPPTGITPLDPFEVPLLNTAVLLASGVTVTWAH 148


>gnl|CDD|177101 MTH00009, COX3, cytochrome c oxidase subunit III; Validated.
          Length = 259

 Score = 29.8 bits (67), Expect = 2.2
 Identities = 13/33 (39%), Positives = 18/33 (54%)

Query: 240 YPPGGSTPATPYPVPTPSPAPAVTSGGTITDEH 272
           +PP G  P  P+ VP  + A  + SG T+T  H
Sbjct: 114 WPPTGIEPLNPFSVPLLNTAVLLASGVTVTWAH 146


>gnl|CDD|215026 smart01104, CTD, Spt5 C-terminal nonapeptide repeat binding Spt4.
           The C-terminal domain of the transcription elongation
           factor protein Spt5 is necessary for binding to Spt4 to
           form the functional complex that regulates early
           transcription elongation by RNA polymerase II. The
           complex may be involved in pre-mRNA processing through
           its association with mRNA capping enzymes. This CTD
           domain carries a regular nonapeptide repeat that can be
           present in up to 18 copies, as in S. pombe. The repeat
           has a characteristic TPA motif.
          Length = 121

 Score = 29.0 bits (65), Expect = 2.2
 Identities = 22/110 (20%), Positives = 29/110 (26%), Gaps = 3/110 (2%)

Query: 162 PATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPS 221
            +  P +       A     +     GS   TP      S  P +            +  
Sbjct: 11  GSKTPAWGSRTPGTAAGGAPTARGGSGSR--TPAWGGAGSRTPAWGGAGPTGSRTPAWGG 68

Query: 222 YPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSP-APAVTSGGTITD 270
               G     G  S +   P G     TP    TPS   PA   GG +  
Sbjct: 69  ASAWGNKSSEGSASSWAAGPGGAYGAPTPGYGGTPSAYGPATPGGGAMAG 118


>gnl|CDD|179381 PRK02193, truB, tRNA pseudouridine synthase B; Provisional.
          Length = 279

 Score = 30.1 bits (68), Expect = 2.3
 Identities = 10/46 (21%), Positives = 18/46 (39%), Gaps = 1/46 (2%)

Query: 112 SQNVDHTGKIYSPYLHTWTAQNSKLEGLIKTLASAFSALPPLYSSR 157
           S   D  G+I              LE  +  L  +   +PP++S++
Sbjct: 74  STTYDSEGQII-NVSQNIKVTKENLEEALNNLVGSQKQVPPVFSAK 118



 Score = 29.0 bits (65), Expect = 4.3
 Identities = 10/46 (21%), Positives = 18/46 (39%), Gaps = 1/46 (2%)

Query: 41  SQNVDHTGKIYSPYLHTWTPQNSKLEGLIKTLASAFSALPPLYSSR 86
           S   D  G+I              LE  +  L  +   +PP++S++
Sbjct: 74  STTYDSEGQII-NVSQNIKVTKENLEEALNNLVGSQKQVPPVFSAK 118


>gnl|CDD|220373 pfam09738, DUF2051, Double stranded RNA binding protein (DUF2051). 
           This is a novel protein identified as interacting with
           the leucine-rich repeat domain of human flightless-I,
           FliI protein.
          Length = 294

 Score = 29.9 bits (67), Expect = 2.3
 Identities = 31/133 (23%), Positives = 61/133 (45%), Gaps = 26/133 (19%)

Query: 259 APAVTSGGTITDEHIKMSLKSAVEDKIRRKM-------NEQKVQAQEEIEILKQTENELN 311
           + +V +  +I +  +K SL   VE+K+++ M       NE+     E +++LK    EL 
Sbjct: 69  SSSVDTEASIRE--LKDSL-IEVEEKVKKAMVSNAQLDNEKTNLMYE-VDLLKDKLEELE 124

Query: 312 QGKFKLARMFERIDKEKAELERSI-------SFLKEKETELDEILA-----VLTEKTEVD 359
           +   +L R FE   ++   L+R+I         LKE+  + D+++      ++ E    +
Sbjct: 125 ESLAELQREFEEKTRDLEALKRAIDILQHQLKELKEQLRQRDQLIEEHGLVLVPEVPNGE 184

Query: 360 VDEA---VTTTAP 369
             +     + TAP
Sbjct: 185 GSDEVNNASVTAP 197


>gnl|CDD|219094 pfam06583, Neogenin_C, Neogenin C-terminus.  This family represents
           the C-terminus of eukaryotic neogenin precursor
           proteins, which contains several potential
           phosphorylation sites. Neogenin is a member of the N-CAM
           family of cell adhesion molecules (and therefore
           contains multiple copies of pfam00047 and pfam00041) and
           is closely related to the DCC tumour suppressor gene
           product - these proteins may play an integral role in
           regulating differentiation programmes and/or cell
           migration events within many adult and embryonic
           tissues.
          Length = 295

 Score = 29.9 bits (67), Expect = 2.3
 Identities = 21/94 (22%), Positives = 26/94 (27%), Gaps = 15/94 (15%)

Query: 179 PTQSYMPMPGSN--VNTPYPTYPPSAYPGYNPNAAAAPSGSI-----YPSYP-----TPG 226
             Q             TP P Y P+A    +  A  APS SI      P++P      P 
Sbjct: 140 LYQHGDRAESVESVRQTPEPPYLPAAQSESSNAAEEAPSRSIPTAHVRPTHPLKSFAVPA 199

Query: 227 ATPGAGGYSPYPLYPP---GGSTPATPYPVPTPS 257
                    P     P           + V T S
Sbjct: 200 LPASMSTIEPKLPSTPLLTQQGPTLPKHSVKTAS 233


>gnl|CDD|224307 COG1389, COG1389, DNA topoisomerase VI, subunit B [DNA replication,
           recombination, and repair].
          Length = 538

 Score = 30.1 bits (68), Expect = 2.3
 Identities = 17/66 (25%), Positives = 28/66 (42%), Gaps = 1/66 (1%)

Query: 301 EILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDV 360
           EI  +    L +   KL     R  +E  E ++  +  K    E+ + LA + EK E D+
Sbjct: 442 EIENEIRLALMEVARKLKLYLSRKRREMEERKKRKTIEKYLP-EIAKKLAEILEKPEEDI 500

Query: 361 DEAVTT 366
            E +  
Sbjct: 501 REKLAK 506


>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
           Provisional.
          Length = 357

 Score = 29.9 bits (66), Expect = 2.4
 Identities = 29/124 (23%), Positives = 36/124 (29%), Gaps = 1/124 (0%)

Query: 141 KTLASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPP 200
           K  A A +A  P  ++    + A  P       + A  P       P      P     P
Sbjct: 218 KAAAPAKAAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAP 277

Query: 201 SAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAP 260
            A     P  AAAP      +     A P     +P     P     A P    TP PA 
Sbjct: 278 PAKAAAPPAKAAAPPAKAAAAPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAATP-PAK 336

Query: 261 AVTS 264
           A   
Sbjct: 337 AAAP 340


>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
           protein.  Members of this protein family are designated
           TraM and are found in a proposed transfer region of a
           class of conjugative transposon found in the Bacteroides
           lineage [Cellular processes, DNA transformation].
          Length = 410

 Score = 30.1 bits (68), Expect = 2.4
 Identities = 34/140 (24%), Positives = 58/140 (41%), Gaps = 12/140 (8%)

Query: 289 MNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEI 348
           +   K++A E+ E L+Q + +  +    LA  FE+ DKE  +   S S   E+  E  E 
Sbjct: 54  IIGDKMKAYEQ-EALEQKQKDKKRTLQDLADSFEKSDKESQDYSSSDSGFDEEPDEPAE- 111

Query: 349 LAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQ 408
               T  +   +  +         + L +F E   T++      + L R V +LE  L  
Sbjct: 112 ----TAGSLRPIRSSA-AAYRDINRELGSFYEYPKTDEE-----KELLREVEELESRLAT 161

Query: 409 VRSLSRKQFMFRALLFKCYK 428
             S + +     AL+ K Y+
Sbjct: 162 EPSPAPELEEQLALMEKSYE 181


>gnl|CDD|187842 cd09711, Csc1_I-D, CRISPR/Cas system-associated protein Csc1.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           RAMP superfamily protein; predicted Cas5 ortholog; also
           known as CasA/Cse1 family.
          Length = 210

 Score = 29.8 bits (67), Expect = 2.4
 Identities = 15/87 (17%), Positives = 25/87 (28%), Gaps = 6/87 (6%)

Query: 137 EGLIKTLASAF-SALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPY 195
           E  I   A  +              +    PTYN ++A    +   +       N     
Sbjct: 21  EPYIGNTALDYALGYSLGEHPYRYVDATQRPTYNEDLAPINDFGYVTPAKPVYRNTMLGI 80

Query: 196 PTYPPSAYPGYNPNAAAAPSGSIYPSY 222
             Y  + YP     A    +   YP++
Sbjct: 81  WYYQTNNYP-----AEMVGTLYNYPNF 102


>gnl|CDD|220944 pfam11018, Cuticle_3, Pupal cuticle protein C1.  Insect cuticles
           are composite structures whose mechanical properties are
           optimised for biological function. The major components
           are the chitin filament system and the cuticular
           proteins, and the cuticle's properties are determined
           largely by the interactions between these two sets of
           molecules. The proteins can be ordered by species.
          Length = 164

 Score = 29.2 bits (65), Expect = 2.4
 Identities = 27/122 (22%), Positives = 35/122 (28%), Gaps = 17/122 (13%)

Query: 154 YSSRTKQE-PATNPTYN-GNMASNAPYPTQSYMPMPGSNVNTP--------YPTYPPSAY 203
           YSS  K +   +N  Y      +   Y   +                    Y      A 
Sbjct: 33  YSSVRKSDTRISNNAYQPAYAKTAYAYAAPAVYAAAAPVYAAHAYAAPAVHYAAAAHYAA 92

Query: 204 PGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTP---SPAP 260
           P Y   A AAP+ +   +Y    A P     + Y    P     A P         SPA 
Sbjct: 93  PAYAKYAYAAPAVTAKAAY----AAPAPVYKTAYAAAAPAVYAHAAPVVATATVAYSPAA 148

Query: 261 AV 262
           AV
Sbjct: 149 AV 150


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
           and conversion].
          Length = 161

 Score = 29.2 bits (66), Expect = 2.5
 Identities = 16/82 (19%), Positives = 32/82 (39%), Gaps = 11/82 (13%)

Query: 277 LKSAVEDKIRRKMNEQKVQAQEEIE-ILKQTENELNQGKFKLARMFERIDKEKAE--LER 333
           +   ++ +   ++   K  A+ EIE   ++   EL     +LA          AE  L +
Sbjct: 88  IAEEIKAEAEEELERIKEAAEAEIEAEKERALEELRAEVAELAV-------AIAEKLLGK 140

Query: 334 SISFLKEKETELDEILAVLTEK 355
            +     ++  +D  +A L E 
Sbjct: 141 KVDE-AAQKDLIDAFIAELGEN 161


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 30.1 bits (68), Expect = 2.5
 Identities = 25/170 (14%), Positives = 44/170 (25%), Gaps = 17/170 (10%)

Query: 174 SNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGG 233
             A +P Q     P      P           + P  + AP+    P        P    
Sbjct: 187 RQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQP 246

Query: 234 YSPYPLY--PPGGSTPA-------------TPYPVPTPSPAPAVTSGGTITDEHIKMSLK 278
             P      PP    P               P   PTP P         +        L 
Sbjct: 247 QFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLLP 306

Query: 279 SAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEK 328
              + + +++  + + Q  +  +  +Q E    +   +  R  + +   K
Sbjct: 307 LVQQPQGQQRGPQFREQLVQLSQ--QQREALSQEEAKRAKRRHKIVSLSK 354


>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
           [Transcription].
          Length = 175

 Score = 29.3 bits (66), Expect = 2.6
 Identities = 18/76 (23%), Positives = 37/76 (48%), Gaps = 4/76 (5%)

Query: 291 EQKVQAQEE---IEILKQTENELNQGKFKLARMFERIDKEKAELERSIS-FLKEKETELD 346
           ++++QA  E   I+   +  +   +G        E  ++E+ +   S+     +++ + D
Sbjct: 88  DEEIQAMTEKKDIKAKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDEDDD 147

Query: 347 EILAVLTEKTEVDVDE 362
           EI+ +L E  EVD DE
Sbjct: 148 EIVEILIEDDEVDEDE 163


>gnl|CDD|235246 PRK04184, PRK04184, DNA topoisomerase VI subunit B; Validated.
          Length = 535

 Score = 29.9 bits (68), Expect = 2.7
 Identities = 20/67 (29%), Positives = 33/67 (49%), Gaps = 7/67 (10%)

Query: 301 EILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEI---LAVLTEKTE 357
           EI K+    L +   KL +   R  KE+   +++ +F K     + EI   LA + EK E
Sbjct: 439 EIEKEIRLALQEVARKLKKYLSRKRKEEEAKKKAKTFEK----YIPEIARKLAEILEKEE 494

Query: 358 VDVDEAV 364
            D++E +
Sbjct: 495 PDIEEVL 501


>gnl|CDD|224729 COG1816, Add, Adenosine deaminase [Nucleotide transport and
           metabolism].
          Length = 345

 Score = 29.6 bits (67), Expect = 3.0
 Identities = 10/53 (18%), Positives = 22/53 (41%), Gaps = 2/53 (3%)

Query: 37  RINVSQNVDHTGKIYSPYLHTWTPQNSKLEGLIKTLASAFSALPPLYSSRTKQ 89
               + NV +    + PYLH  T +   ++ +++ L + F      +   +K 
Sbjct: 92  EDAAADNVVYAEIRFDPYLH--TKRGLSVDTVVEGLIAGFRPAERDFGIHSKL 142



 Score = 29.2 bits (66), Expect = 3.8
 Identities = 10/53 (18%), Positives = 22/53 (41%), Gaps = 2/53 (3%)

Query: 108 RINVSQNVDHTGKIYSPYLHTWTAQNSKLEGLIKTLASAFSALPPLYSSRTKQ 160
               + NV +    + PYLH  T +   ++ +++ L + F      +   +K 
Sbjct: 92  EDAAADNVVYAEIRFDPYLH--TKRGLSVDTVVEGLIAGFRPAERDFGIHSKL 142


>gnl|CDD|240289 PTZ00144, PTZ00144, dihydrolipoamide succinyltransferase;
           Provisional.
          Length = 418

 Score = 29.7 bits (67), Expect = 3.0
 Identities = 22/120 (18%), Positives = 35/120 (29%), Gaps = 13/120 (10%)

Query: 200 PSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPA 259
           P A       AA A   +                  P P  P     P      P   P 
Sbjct: 124 PPAAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTP--PAAAKPPEPA---PAAKPP 178

Query: 260 PAVTSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLAR 319
           P   +     +  + MS       ++R+++ E+   +Q    +L  T NE +       R
Sbjct: 179 PTPVARADPRETRVPMS-------RMRQRIAERLKASQNTCAMLT-TFNECDMSALMELR 230


>gnl|CDD|216412 pfam01285, TEA, TEA/ATTS domain family. 
          Length = 424

 Score = 29.7 bits (67), Expect = 3.1
 Identities = 10/58 (17%), Positives = 17/58 (29%), Gaps = 1/58 (1%)

Query: 191 VNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPA 248
            +  +  Y     P +  +     S S Y S+ +   +       PY    P    P 
Sbjct: 141 SSAQHLFYIIPGGPSWRTSIKPFSS-SHYGSHNSSAYSDHLRPLQPYSGELPPPLGPN 197


>gnl|CDD|224691 COG1777, COG1777, Predicted transcriptional regulators
           [Transcription].
          Length = 217

 Score = 29.3 bits (66), Expect = 3.2
 Identities = 22/105 (20%), Positives = 39/105 (37%), Gaps = 3/105 (2%)

Query: 284 KIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKET 343
           +  R    +  ++ E I  L     E+N+   +L+R    + K+  EL   I    E + 
Sbjct: 104 ESERSEVSKLFKSPEGISELISRLLEINREIEELSRAQTELQKQLNELMDRIKEEIEDKD 163

Query: 344 ELDEILAV---LTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATE 385
                  V   L +    DV+E    T    +++L    E+   E
Sbjct: 164 GDMTERIVLEYLLKNGAADVEETSRRTVLKIEEVLEILAEKGFVE 208


>gnl|CDD|234336 TIGR03734, PRTRC_parB, PRTRC system ParB family protein.  A novel
           genetic system characterized by six major proteins,
           included a ParB homolog and a ThiF homolog, is
           designated PRTRC, or ParB-Related,ThiF-Related Cassette.
           It is often found on plasmids. This protein family the
           member related to ParB, and is designated PRTRC system
           ParB family protein.
          Length = 554

 Score = 29.7 bits (67), Expect = 3.2
 Identities = 16/54 (29%), Positives = 20/54 (37%), Gaps = 5/54 (9%)

Query: 209 NAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAV 262
            AAA       P+ P  G    A   SP      G + PA    VP+  P+  V
Sbjct: 329 AAAAQK-----PAAPAAGPGTPAKEKSPAETATSGAAKPAAKKAVPSSQPSNRV 377


>gnl|CDD|219897 pfam08549, SWI-SNF_Ssr4, Fungal domain of unknown function
           (DUF1750).  This is a fungal domain of unknown function.
          Length = 669

 Score = 30.0 bits (67), Expect = 3.3
 Identities = 15/63 (23%), Positives = 23/63 (36%), Gaps = 7/63 (11%)

Query: 150 LPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPN 209
           LPP  + ++  +P            NA    Q + P PG+    P       A  G+  +
Sbjct: 204 LPPQMAGQSMYQPPGP-------YPNAMVGRQPFYPQPGAVAGPPKRRGGHKAPRGHRAS 256

Query: 210 AAA 212
            AA
Sbjct: 257 QAA 259


>gnl|CDD|218215 pfam04696, Pinin_SDK_memA, pinin/SDK/memA/ protein conserved
           region.  Members of this family have very varied
           localisations within the eukaryotic cell. pinin is known
           to localise at the desmosomes and is implicated in
           anchoring intermediate filaments to the desmosomal
           plaque. SDK2/3 is a dynamically localised nuclear
           protein thought to be involved in modulation of
           alternative pre-mRNA splicing. memA is a tumour marker
           preferentially expressed in human melanoma cell lines. A
           common feature of the members of this family is that
           they may all participate in regulating protein-protein
           interactions.
          Length = 131

 Score = 28.6 bits (64), Expect = 3.3
 Identities = 21/106 (19%), Positives = 36/106 (33%), Gaps = 30/106 (28%)

Query: 286 RRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETEL 345
           RR   EQK++ QE+ E                    E + KEK EL       +E+  + 
Sbjct: 36  RRAEIEQKLEEQEKQER-------------------EELRKEKREL------FEERRRKQ 70

Query: 346 DEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYM 391
            E+  +  +  +  + E            + T      T+  +YY 
Sbjct: 71  LELRKLEQKMEDEKLQETWHEHNLALANFIRT-----KTKPHLYYR 111


>gnl|CDD|225471 COG2919, COG2919, Septum formation initiator [Cell division and
           chromosome partitioning].
          Length = 117

 Score = 28.5 bits (64), Expect = 3.4
 Identities = 6/26 (23%), Positives = 9/26 (34%)

Query: 287 RKMNEQKVQAQEEIEILKQTENELNQ 312
            ++  Q    Q E+E L      L  
Sbjct: 53  LQLQRQIAAQQAELEKLSARNTALEA 78


>gnl|CDD|237463 PRK13668, PRK13668, hypothetical protein; Provisional.
          Length = 267

 Score = 29.2 bits (66), Expect = 3.4
 Identities = 22/79 (27%), Positives = 37/79 (46%), Gaps = 14/79 (17%)

Query: 320 MFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFT 379
           M +++ K  +  +   SF +EKET     L +       D  + VT + P    I+  + 
Sbjct: 6   MRDKLKKRLSHPDWDFSFDREKET-----LRI----ERKDNKKGVTISLP---PIVAKWE 53

Query: 380 E--EAATEDTIYYMGEALR 396
           E  E A ++ +YY+ EAL 
Sbjct: 54  EKKEKAIDEIVYYVEEALA 72


>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA.  During
           the bacterial cell cycle, the tubulin-like cell-division
           protein FtsZ polymerises into a ring structure that
           establishes the location of the nascent division site.
           EzrA modulates the frequency and position of FtsZ ring
           formation.
          Length = 559

 Score = 29.8 bits (68), Expect = 3.4
 Identities = 22/93 (23%), Positives = 42/93 (45%), Gaps = 23/93 (24%)

Query: 288 KMNEQKVQAQEEIEILKQT--------------ENELNQGKFKLARMFERIDKEKAELER 333
              EQ  Q   E++ L+Q+              E +L     +L   ++++ +  AE + 
Sbjct: 317 HAREQNKQLLLELDRLQQSYTLNEDELETVRELEKQLE----ELEAQYDQLVERIAEKKV 372

Query: 334 SISFLKEKETELDEILAVLT--EKTEVDVDEAV 364
           + S L+E   EL+EI   L   EK + ++ E++
Sbjct: 373 AYSELQE---ELEEIEKQLEEIEKEQEELSESL 402


>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57.  Eukaryotic
           cilia and flagella are specialised organelles found at
           the periphery of cells of diverse organisms.
           Intra-flagellar transport (IFT) is required for the
           assembly and maintenance of eukaryotic cilia and
           flagella, and consists of the bidirectional movement of
           large protein particles between the base and the distal
           tip of the organelle. IFT particles contain multiple
           copies of two distinct protein complexes, A and B, which
           contain at least 6 and 11 protein subunits. IFT57 is
           part of complex B but is not, however, required for the
           core subunits to stay associated. This protein is known
           as Huntington-interacting protein-1 in humans.
          Length = 355

 Score = 29.7 bits (67), Expect = 3.5
 Identities = 12/77 (15%), Positives = 36/77 (46%)

Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKE 342
           +KI  +      Q ++ ++  ++    L+Q + K  +  + + +   EL      L++ +
Sbjct: 251 EKISSREKHLNNQLEQLVQEYREARRTLSQVQEKYNQASQGVSELTRELNEISEELEQVK 310

Query: 343 TELDEILAVLTEKTEVD 359
            E++E  A +++ + + 
Sbjct: 311 QEMEERGASMSDGSPLV 327


>gnl|CDD|150430 pfam09755, DUF2046, Uncharacterized conserved protein H4 (DUF2046).
            This is the conserved N-terminal 350 residues of a
           family of proteins of unknown function possibly
           containing a coiled-coil domain.
          Length = 308

 Score = 29.6 bits (66), Expect = 3.5
 Identities = 22/98 (22%), Positives = 39/98 (39%), Gaps = 3/98 (3%)

Query: 252 PVPTPSPAPAVTSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQ--EEIEILKQTENE 309
           PV  P     + +    T   I   + S   +  R + N  K +A   E++E   + E +
Sbjct: 204 PVSEPPSPRDIFTEAEDTVGGIASHITSLRSEVRRLRQNLAKSEADYTEKMEQYAKEERQ 263

Query: 310 LNQGKFKLARMFERIDKEKAELERSISFLKEKETELDE 347
             +   +L R   R  + +  L R +S   E   E+D+
Sbjct: 264 TREENIRLQRKLLREVERREALCRQLS-ESESSLEMDD 300



 Score = 28.0 bits (62), Expect = 9.8
 Identities = 29/119 (24%), Positives = 54/119 (45%), Gaps = 30/119 (25%)

Query: 292 QKVQA-QEEIEILKQT-ENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEIL 349
           +K+QA ++E E L    E E       L+R   ++ +EK +LE+++   +E+E ++++++
Sbjct: 84  KKIQALKKEKETLAMNYEQEEEFLTNDLSRKLTQLRQEKIQLEQTLE--QEQEYQVNKLM 141

Query: 350 AVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLEVYLKQ 408
             + +K E D     T+                          E LRR  +DLE  L+Q
Sbjct: 142 RKI-DKLEADTLAKQTSL-------------------------EQLRREKVDLENTLEQ 174


>gnl|CDD|225502 COG2952, COG2952, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 183

 Score = 29.0 bits (65), Expect = 3.6
 Identities = 22/122 (18%), Positives = 53/122 (43%), Gaps = 8/122 (6%)

Query: 271 EHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 330
           E +K  +   +E  ++    E  +  Q + E+L++  +E+   +    ++F  + ++ A+
Sbjct: 32  ELLKELIIGILEKSVK---KENALDEQAK-ELLEENTDEIEFMRMDERQLFWMVKRKLAQ 87

Query: 331 LERSISFLKEKETEL-DEILAVLTEK--TEVDVDEAVTTTAPIYKQILNTFTEEAATEDT 387
            E      +E+  +L  +IL  + ++      V E +     IYK I        + E+ 
Sbjct: 88  EENFHLDWEERCNDLSHQILNKILDEDLIMFSVSENLIRNL-IYKSIDTYLKIYESIENE 146

Query: 388 IY 389
           ++
Sbjct: 147 VH 148


>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
          Length = 141

 Score = 28.6 bits (64), Expect = 3.7
 Identities = 15/55 (27%), Positives = 18/55 (32%), Gaps = 2/55 (3%)

Query: 206 YNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAP 260
             P    AP   + P  PT  +   A    P    P     PA   P P  +P P
Sbjct: 79  LAPLTPPAPPEPVTP--PTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPP 131


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 28.5 bits (64), Expect = 3.7
 Identities = 21/112 (18%), Positives = 39/112 (34%), Gaps = 18/112 (16%)

Query: 252 PVPTPSPAPAVTSGG------TITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQ 305
                SP              +  +       K A E K+RR   + K +  E+    ++
Sbjct: 34  KRSNSSPGSTSNEENEDEDDESTAESKRPEGRKKAKE-KLRRDKLKAKKEEAEK----EK 88

Query: 306 TENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTE 357
            + E      +  +     +KE+AELE+  +  K  + E  +I+   T    
Sbjct: 89  EKEE------RFMKALAEAEKERAELEKKKAEAKLMKEE-KKIMFADTSSLS 133


>gnl|CDD|216648 pfam01690, PLRV_ORF5, Potato leaf roll virus readthrough protein.
           This family consists mainly of the potato leaf roll
           virus readthrough protein. This is generated via a
           readthrough of open reading frame 3 a coat protein
           allowing transcription of open reading frame 5 to give
           an extended coat protein with a large c-terminal
           addition or read through domain. The readthrough protein
           is thought to play a role in the circulative aphid
           transmission of potato leaf roll virus. Also in the
           family is open reading frame 6 from beet western yellows
           virus and potato leaf roll virus both luteovirus and an
           unknown protein from cucurbit aphid-borne yellows virus
           a closterovirus.
          Length = 460

 Score = 29.7 bits (67), Expect = 3.8
 Identities = 12/21 (57%), Positives = 12/21 (57%)

Query: 241 PPGGSTPATPYPVPTPSPAPA 261
           P  G TP  P P PTP P PA
Sbjct: 7   PKPGPTPTPPPPAPTPEPTPA 27


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 29.6 bits (67), Expect = 3.8
 Identities = 16/67 (23%), Positives = 34/67 (50%), Gaps = 7/67 (10%)

Query: 298 EEIEILKQTENELNQGKFKLARMFERID--KEKAELERSISFLKEKETELDEILAVLTEK 355
           E +E L   E EL   + ++  + ER++  ++  E E  I  L+E+  +L+E++A    +
Sbjct: 475 ERVEEL---EAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIA--ERR 529

Query: 356 TEVDVDE 362
             ++   
Sbjct: 530 ETIEEKR 536


>gnl|CDD|177179 MTH00118, COX3, cytochrome c oxidase subunit III; Provisional.
          Length = 261

 Score = 29.1 bits (66), Expect = 3.8
 Identities = 17/45 (37%), Positives = 22/45 (48%), Gaps = 5/45 (11%)

Query: 228 TPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGTITDEH 272
           TP  GG      +PP G  P  P+ VP  + A  + SG T+T  H
Sbjct: 109 TPELGG-----QWPPTGIKPLNPFEVPLLNTAVLLASGVTVTWAH 148


>gnl|CDD|214403 MTH00024, COX3, cytochrome c oxidase subunit III; Validated.
          Length = 261

 Score = 29.3 bits (66), Expect = 3.8
 Identities = 13/33 (39%), Positives = 19/33 (57%)

Query: 240 YPPGGSTPATPYPVPTPSPAPAVTSGGTITDEH 272
           +PP G  P  P+ VP  + A  ++SG T+T  H
Sbjct: 116 WPPQGINPLNPFSVPLLNTAVLLSSGATVTWAH 148


>gnl|CDD|217305 pfam02970, TBCA, Tubulin binding cofactor A. 
          Length = 91

 Score = 27.6 bits (62), Expect = 4.1
 Identities = 16/58 (27%), Positives = 27/58 (46%), Gaps = 3/58 (5%)

Query: 291 EQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEI 348
           E+      +   LK+ E  L + +  L     R+ K  A+LE    +L+E E  L+E+
Sbjct: 31  EKLKAEGADEYDLKKQEEVLKETEMMLPDCKRRLQKAVADLE---EYLEEYEEGLEEL 85


>gnl|CDD|237292 PRK13182, racA, polar chromosome segregation protein; Reviewed.
          Length = 175

 Score = 28.9 bits (65), Expect = 4.1
 Identities = 17/94 (18%), Positives = 38/94 (40%), Gaps = 16/94 (17%)

Query: 271 EHIKMSLKSAVEDK-IRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKA 329
           E++K  ++     +  ++  +    + Q    +   +  +  Q + +L  +  R+D    
Sbjct: 47  EYVKSQIEEGQNMQDTQKPSSNDVEETQVNTIVQNISSVDFEQLEAQLNTITRRLD---- 102

Query: 330 ELER----------SISFLKEKETELDEILAVLT 353
           ELER          S   L+ +  E++E+L  L 
Sbjct: 103 ELERQLQQKADDVVSYQLLQHRR-EMEEMLERLQ 135


>gnl|CDD|184804 PRK14720, PRK14720, transcript cleavage factor/unknown domain
           fusion protein; Provisional.
          Length = 906

 Score = 29.7 bits (67), Expect = 4.1
 Identities = 16/58 (27%), Positives = 23/58 (39%), Gaps = 6/58 (10%)

Query: 340 EKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRR 397
            K  E  ++  V     + D D        I K+ L T  EE   E  I Y+ +A+ R
Sbjct: 127 AKLNENKKLKGVWERLVKADRDNPE-----IVKK-LATSYEEEDKEKAITYLKKAIYR 178


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 29.6 bits (67), Expect = 4.1
 Identities = 16/82 (19%), Positives = 39/82 (47%)

Query: 271 EHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 330
           +  K   K  +   +  ++ E     Q+E + L++  N L +   +L    + ++KE  E
Sbjct: 388 KKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALEKEIKE 447

Query: 331 LERSISFLKEKETELDEILAVL 352
           LE+ ++ ++    E++++L   
Sbjct: 448 LEKQLTNIEPTADEINKLLKAY 469


>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32.  This family
           consists of several Mycoplasma species specific
           Cytadhesin P32 and P30 proteins. P30 has been found to
           be membrane associated and localised on the tip
           organelle. It is thought that it is important in
           cytadherence and virulence.
          Length = 279

 Score = 29.3 bits (65), Expect = 4.2
 Identities = 14/65 (21%), Positives = 20/65 (30%), Gaps = 1/65 (1%)

Query: 179 PTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSY-PTPGATPGAGGYSPY 237
           P   +   P    N   P + P      +P     P+    P + P  G  P      P 
Sbjct: 215 PRPGFNQNPNMMPNMNRPGFRPQPGGFNHPGTPMGPNMQQRPGFNPNQGMNPPPHMAGPR 274

Query: 238 PLYPP 242
             +PP
Sbjct: 275 AGFPP 279


>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
           region.  This family includes the N-terminal regions of
           the junctin, junctate and aspartyl beta-hydroxylase
           proteins. Junctate is an integral ER/SR membrane calcium
           binding protein, which comes from an alternatively
           spliced form of the same gene that generates aspartyl
           beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
           catalyzes the post-translational hydroxylation of
           aspartic acid or asparagine residues contained within
           epidermal growth factor (EGF) domains of proteins.
          Length = 240

 Score = 29.1 bits (65), Expect = 4.3
 Identities = 13/75 (17%), Positives = 29/75 (38%), Gaps = 2/75 (2%)

Query: 270 DEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKA 329
           +E      K  +++K   + NE   +  E+ E  K   +++ +  +        ++  KA
Sbjct: 166 EETASEQYKQDMKEKASEQENEDSKEPVEKAERTKAETDDVTEEDYD--EEDNPVEDSKA 223

Query: 330 ELERSISFLKEKETE 344
             E       E++ E
Sbjct: 224 IKEELAKEPVEEQQE 238


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 140

 Score = 28.3 bits (64), Expect = 4.3
 Identities = 19/66 (28%), Positives = 38/66 (57%), Gaps = 4/66 (6%)

Query: 283 DKIRRKMNEQKVQAQEEIE-ILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEK 341
            KI ++    K  AQ+++E   K+ + EL + + +L ++ E++ K+ A L  S +  ++K
Sbjct: 7   QKILQESPAGK-AAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATL--SEAAREKK 63

Query: 342 ETELDE 347
           E EL +
Sbjct: 64  EKELQK 69


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 29.4 bits (66), Expect = 4.4
 Identities = 20/109 (18%), Positives = 38/109 (34%), Gaps = 17/109 (15%)

Query: 283 DKIRRKMNEQKVQAQEEIEILKQ------TENELNQGKFKLARMFERIDKEKAELERSIS 336
           +  R K  EQ  +A  ++E  +           L + K + A +  +  + +A    + S
Sbjct: 167 ELEREKFEEQLHEANLDLEFKENEEQRESKWAILKKLK-RRAELGSQQVQGEALELPNES 225

Query: 337 FLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATE 385
           F++ K    D       EK     D+       + K  + +   E   E
Sbjct: 226 FIRSK-FPSDIN-----EK----ADKGFRGFDCLNKVPIFSIQIEGKIE 264


>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
          Length = 1355

 Score = 29.7 bits (66), Expect = 4.4
 Identities = 22/114 (19%), Positives = 27/114 (23%), Gaps = 4/114 (3%)

Query: 149 ALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNP 208
           A P +  +   Q+P          A+  P     Y P P      PY   P    P    
Sbjct: 387 AQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYA-PAPEQPVAGN 445

Query: 209 NAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAV 262
              A    S +   P            P    P            P   P P V
Sbjct: 446 AWQAEEQQSTFA--PQSTYQTEQTYQQPAAQEPLYQQPQPVE-QQPVVEPEPVV 496


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 29.3 bits (66), Expect = 4.5
 Identities = 15/94 (15%), Positives = 32/94 (34%), Gaps = 11/94 (11%)

Query: 270 DEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKA 329
           DE      +S  E   +    ++K +AQ   +  ++          K      + +K+  
Sbjct: 255 DESAWEGFESEYEPINKPVRPKRKTKAQRN-KEKRR----------KELEREAKEEKQLK 303

Query: 330 ELERSISFLKEKETELDEILAVLTEKTEVDVDEA 363
           +    ++ LKE   E+ +       K E   +  
Sbjct: 304 KKLAQLARLKEIAKEVAQKEKARARKKEQRKERG 337


>gnl|CDD|203376 pfam06016, Reovirus_L2, Reovirus core-spike protein lambda-2 (L2).
            This family consists of several Reovirus core-spike
            protein lambda-2 (L2) sequences. The reovirus L2 genome
            segment encodes the core spike protein lambda-2, which
            mediates enzymatic reactions in 5' capping of the viral
            plus-strand transcripts.
          Length = 1290

 Score = 29.4 bits (66), Expect = 4.5
 Identities = 16/61 (26%), Positives = 22/61 (36%), Gaps = 3/61 (4%)

Query: 176  APYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYS 235
            APY ++ +      N      + P     G+      A S S+ PS PT    PG     
Sbjct: 1232 APYDSREWAVKESGNTFVILDSQPFVLPQGW---DVLALSTSVSPSRPTYIVPPGDYTLV 1288

Query: 236  P 236
            P
Sbjct: 1289 P 1289


>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
           Provisional.
          Length = 569

 Score = 29.4 bits (67), Expect = 4.6
 Identities = 27/89 (30%), Positives = 45/89 (50%), Gaps = 15/89 (16%)

Query: 288 KMNEQKVQAQEEIEILKQ----TENEL-NQGKFK-----LARMFERIDKEKAELERSISF 337
              EQ  + +EEI+ +KQ     E+EL +  + +     L + ++ I +  AE E + S 
Sbjct: 321 HAKEQNKELKEEIDRVKQSYTLNESELESVRQLEKQLESLEKQYDEITERIAEQEIAYSE 380

Query: 338 LKEKETELDEILAVLT--EKTEVDVDEAV 364
           L+E   EL+EIL  L   EK +  + E +
Sbjct: 381 LQE---ELEEILKQLEEIEKEQEKLSEML 406



 Score = 29.0 bits (66), Expect = 5.4
 Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 11/76 (14%)

Query: 301 EILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTE--KTEV 358
           E L + E   ++ +F+         K K E+    S L   E ++++IL  L E  ++E 
Sbjct: 86  EQLFEAEELNDKFRFR---------KAKHEINEIESLLDLIEEDIEQILEELQELLESEE 136

Query: 359 DVDEAVTTTAPIYKQI 374
              E V     +Y+++
Sbjct: 137 KNREEVEQLKDLYREL 152


>gnl|CDD|217443 pfam03234, CDC37_N, Cdc37 N terminal kinase binding.  Cdc37 is a
           molecular chaperone required for the activity of
           numerous eukaryotic protein kinases. This domain
           corresponds to the N terminal domain which binds
           predominantly to protein kinases and is found N terminal
           to the Hsp (Heat shocked protein) 90-binding domain
           pfam08565. Expression of a construct consisting of only
           the N-terminal domain of Saccharomyces pombe Cdc37
           results in cellular viability. This indicates that
           interactions with the cochaperone Hsp90 may not be
           essential for Cdc37 function.
          Length = 172

 Score = 28.6 bits (64), Expect = 4.6
 Identities = 10/44 (22%), Positives = 19/44 (43%), Gaps = 1/44 (2%)

Query: 319 RMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDE 362
           R  +R  +E  ELE+           +D++L+ L E++      
Sbjct: 40  RREQR-KQEIEELEKERIMNDRLLERVDKLLSELKEESLDSSQA 82


>gnl|CDD|129690 TIGR00602, rad24, checkpoint protein rad24.  All proteins in this
           family for which functions are known are involved in DNA
           damage tolerance (likely cell cycle checkpoints).This
           family is based on the phylogenomic analysis of JA Eisen
           (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 637

 Score = 29.2 bits (65), Expect = 4.7
 Identities = 24/110 (21%), Positives = 35/110 (31%), Gaps = 4/110 (3%)

Query: 274 KMSLKS-AVEDKIRRKMNEQKVQAQEEIEILKQTENEL-NQGKFKLARMFERIDKEKAEL 331
           +MS KS A   K + K         +EI+ L   +  L            +R    + + 
Sbjct: 339 RMSTKSDAHASKSKIKGKHSSNNENQEIQALGGKDVSLFLFRALGKILYCKRATLNELDS 398

Query: 332 ERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEE 381
            R  S L E     D ++    E  E+       T      Q  N F  E
Sbjct: 399 PRLPSHLSELSR--DTLMVGPEEVVEMSHMPGDKTFNLYSHQNYNDFFVE 446


>gnl|CDD|165539 PHA03282, PHA03282, envelope glycoprotein E; Provisional.
          Length = 540

 Score = 29.1 bits (65), Expect = 4.7
 Identities = 19/53 (35%), Positives = 21/53 (39%)

Query: 208 PNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAP 260
           PN+ AA +       P P  TP  G   P          PATP P   P PAP
Sbjct: 147 PNSTAARAVVFLVVGPRPVPTPPGGTPPPDDDEGDEAGAPATPAPPLHPPPAP 199


>gnl|CDD|219837 pfam08430, Fork_head_N, Forkhead N-terminal region.  The region
           described in this family is found towards the N-terminus
           of various eukaryotic fork head/HNF-3-related
           transcription factors (which contain the pfam00250
           domain). These proteins play key roles in embryogenesis,
           maintenance of differentiated cell states, and
           tumorigenesis.
          Length = 137

 Score = 28.0 bits (62), Expect = 4.9
 Identities = 18/85 (21%), Positives = 26/85 (30%), Gaps = 6/85 (7%)

Query: 170 GNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATP 229
            +M S   Y   +      ++         P A  G + ++    S    P Y   G+  
Sbjct: 19  YSMNSMNTYGPMNTSQGSANSSMNMSGYAGPGAMNGMSSSSMNGMS----PGYGGAGSPM 74

Query: 230 GAGGYSPYP--LYPPGGSTPATPYP 252
           G  G S     L P G      P P
Sbjct: 75  GMMGMSSMGTSLSPSGTMGAMGPMP 99


>gnl|CDD|214464 MTH00219, COX3, cytochrome c oxidase subunit III; Provisional.
          Length = 262

 Score = 28.6 bits (64), Expect = 5.3
 Identities = 13/33 (39%), Positives = 18/33 (54%)

Query: 240 YPPGGSTPATPYPVPTPSPAPAVTSGGTITDEH 272
           +PP G  P  P+ VP  + A  + SG T+T  H
Sbjct: 117 WPPTGINPLNPFQVPLLNTAVLLASGVTVTWAH 149


>gnl|CDD|227608 COG5283, COG5283, Phage-related tail protein [Function unknown].
          Length = 1213

 Score = 29.1 bits (65), Expect = 5.3
 Identities = 25/104 (24%), Positives = 47/104 (45%), Gaps = 20/104 (19%)

Query: 276 SLKSAVEDKIRR-KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERS 334
            LKS+++D  +  KM E++          K T++ L+  K K   + E ++K+K   E  
Sbjct: 26  VLKSSIKDSTQFWKMLEKQ---------QKLTKDGLSASKGKYEGLSEAMEKQKKAYED- 75

Query: 335 ISFLKEKETELDEILAVLTEKTEVDVDE--AVTTTAPIYKQILN 376
              LK++  E++      T+ ++    E  A  T A    + L+
Sbjct: 76  ---LKQEVKEVNR----ATQASKKAYQEYNAQYTQAENKLRSLS 112


>gnl|CDD|221827 pfam12881, NUT_N, NUT protein N terminus.  This family includes the
           NUT protein. The gene encoding for NUT protein (Nuclear
           Testis protein) is found fused to BRD3 or BRD4 genes, in
           some aggressive types of carcinoma, due to chromosomal
           translocations. Proteins of the BRD family contain two
           bromodomains that bind transcriptionally active
           chromatin through associations with acetylated histones
           H3 and H4. Such proteins are crucial for the regulation
           of cell cycle progression. On the other hand, little is
           known about NUT protein. NUT is known to have a Nuclear
           Export Sequence (NES) as well as a Nuclear Localization
           Signal (NLS), both located towards the C-terminal end of
           the protein. A fused NUT-GFP protein showed either
           cytoplasmic or nuclear localization, suggesting that it
           is subject to nuclear/cytoplasmic shuttling. Consistent
           with this possibility, treatment with leptomycin B an
           inhibitor of CRM1-dependent nuclear export resulted in
           re-distribution of NUT-GFP to the nucleus. Inspection of
           NUT revealed a C-terminal sequence similar to known
           nuclear export sequences (NES) which are often regulated
           by phosphorylation.
          Length = 328

 Score = 28.8 bits (64), Expect = 5.3
 Identities = 18/60 (30%), Positives = 23/60 (38%), Gaps = 1/60 (1%)

Query: 207 NPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGG 266
           NP A+ +P  ++    P P          P PL   G S P  P  +      P VT GG
Sbjct: 10  NPGASLSPFAALPFLPPAPQPPDQPFWEPPPPLVTAGFS-PGNPLVLSALPSMPLVTEGG 68


>gnl|CDD|164623 MTH00052, COX3, cytochrome c oxidase subunit III; Provisional.
          Length = 262

 Score = 28.6 bits (64), Expect = 5.4
 Identities = 13/34 (38%), Positives = 20/34 (58%)

Query: 239 LYPPGGSTPATPYPVPTPSPAPAVTSGGTITDEH 272
           ++PP G  P  P+ VP  + A  ++SG T+T  H
Sbjct: 116 VWPPRGVDPLNPFSVPLLNTAVLLSSGATVTWAH 149


>gnl|CDD|219419 pfam07462, MSP1_C, Merozoite surface protein 1 (MSP1) C-terminus.
           This family represents the C-terminal region of
           merozoite surface protein 1 (MSP1) which are found in a
           number of Plasmodium species. MSP-1 is a 200-kDa protein
           expressed on the surface of the P. vivax merozoite.
           MSP-1 of Plasmodium species is synthesised as a
           high-molecular-weight precursor and then processed into
           several fragments. At the time of red cell invasion by
           the merozoite, only the 19-kDa C-terminal fragment
           (MSP-119), which contains two epidermal growth
           factor-like domains, remains on the surface. Antibodies
           against MSP-119 inhibit merozoite entry into red cells,
           and immunisation with MSP-119 protects monkeys from
           challenging infections. Hence, MSP-119 is considered a
           promising vaccine candidate.
          Length = 574

 Score = 29.1 bits (65), Expect = 5.4
 Identities = 13/71 (18%), Positives = 25/71 (35%), Gaps = 1/71 (1%)

Query: 214 PSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTP-ATPYPVPTPSPAPAVTSGGTITDEH 272
           P G+   +  T   TP     +P PL      +  +       P+    +  G   T+  
Sbjct: 259 PKGTTQEAKVTTVVTPPQADAAPSPLSVRPAGSSGSASGSTQIPTSGSVLGPGAAATELQ 318

Query: 273 IKMSLKSAVED 283
             + L++  E+
Sbjct: 319 QVVQLQNYDEE 329


>gnl|CDD|165468 PHA03201, PHA03201, uracil DNA glycosylase; Provisional.
          Length = 318

 Score = 28.7 bits (64), Expect = 5.4
 Identities = 19/74 (25%), Positives = 22/74 (29%), Gaps = 6/74 (8%)

Query: 199 PPSAYPGYNPNAAAAPSGS---IYPSYPTPGATP--GAGGYSPYPLYPPGGSTPATPYPV 253
           PP       P    +P  S     PS P PGA P  G       P   P G      +  
Sbjct: 11  PPRRPSPPRPTPPRSPDASPEETPPSPPGPGAEPPPGRAAGPAAPRRRPRGCPAGVTFSS 70

Query: 254 PTPSPAPAVTSGGT 267
             P P P +     
Sbjct: 71  SAP-PRPPLGLDDA 83


>gnl|CDD|224333 COG1415, COG1415, Uncharacterized conserved protein [Function
           unknown].
          Length = 373

 Score = 28.9 bits (65), Expect = 5.5
 Identities = 4/31 (12%), Positives = 16/31 (51%)

Query: 385 EDTIYYMGEALRRGVIDLEVYLKQVRSLSRK 415
           ++ I ++ E + +  +  +  L+ ++ L+  
Sbjct: 334 DELIEFLEELVEKARLGRQEKLRALKRLAEL 364


>gnl|CDD|223091 COG0012, COG0012, Predicted GTPase, probable translation factor
           [Translation, ribosomal structure and biogenesis].
          Length = 372

 Score = 29.1 bits (66), Expect = 5.5
 Identities = 20/103 (19%), Positives = 40/103 (38%), Gaps = 15/103 (14%)

Query: 283 DKIRRKM---NEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLK 339
           +++ ++     +   + +EE+ +L + E  L +GK        +  +E  E   S++ L 
Sbjct: 146 ERLEKRAKAGKKLDKELKEELSLLGKLEEHLEEGKPARGLDLSKWSEEDLEALASLNLLT 205

Query: 340 EK------------ETELDEILAVLTEKTEVDVDEAVTTTAPI 370
            K               L+E +  L E    +  E V  +A I
Sbjct: 206 AKPMLYVANVSEDDLANLNEYVKRLKELAAKENAEVVPVSAAI 248


>gnl|CDD|173423 PTZ00133, PTZ00133, ADP-ribosylation factor; Provisional.
          Length = 182

 Score = 28.3 bits (63), Expect = 5.6
 Identities = 19/47 (40%), Positives = 25/47 (53%), Gaps = 8/47 (17%)

Query: 322 ERIDKEKAELERSISFLKEKETEL-DEILAVLTEKTEVDVDEAVTTT 367
           ERI   + ELER +S     E EL D +L V   K   D+  A++TT
Sbjct: 98  ERIGDAREELERMLS-----EDELRDAVLLVFANKQ--DLPNAMSTT 137


>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D.  BicD
           proteins consist of three coiled-coiled domains and are
           involved in dynein-mediated minus end-directed transport
           from the Golgi apparatus to the endoplasmic reticulum
           (ER). For full functioning they bind with GSK-3beta
           pfam05350 to maintain the anchoring of microtubules to
           the centromere. It appears that amino-acid residues
           437-617 of BicD and the kinase activity of GSK-3 are
           necessary for the formation of a complex between BicD
           and GSK-3beta in intact cells.
          Length = 711

 Score = 29.0 bits (65), Expect = 5.6
 Identities = 16/65 (24%), Positives = 33/65 (50%), Gaps = 4/65 (6%)

Query: 290 NEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEIL 349
           + +KV A  E    ++ E+ L +   K A   +RI + +AEL++  + L   + E + + 
Sbjct: 6   SHRKVAADGE----EREESLLQESASKEAYYLQRILELQAELKQLRAELSNVQAENERLS 61

Query: 350 AVLTE 354
           ++  E
Sbjct: 62  SLSQE 66


>gnl|CDD|215637 PLN03223, PLN03223, Polycystin cation channel protein; Provisional.
          Length = 1634

 Score = 29.1 bits (65), Expect = 5.6
 Identities = 19/101 (18%), Positives = 30/101 (29%), Gaps = 16/101 (15%)

Query: 79  LPPLYSSRTKQEPATNPT-------------YNGNIRPTADMRINVSQNVDHTGKIY--- 122
           L PL       E   +P              Y   +R     R N  Q+   +  I+   
Sbjct: 528 LDPLKDDGVINEYNLDPQAAAVNAARRASTVYLLPLRDRITARRNELQDPQASTPIFPYP 587

Query: 123 SPYLHTWTAQNSKLEGLIKTLASAFSALPPLYSSRTKQEPA 163
           +P   T+T   +     +  LA  +    P   S  +  P 
Sbjct: 588 APSNFTFTNITAGDVKYVAALAPGYDLTAPTDLSLVEYNPV 628



 Score = 28.8 bits (64), Expect = 8.7
 Identities = 16/48 (33%), Positives = 20/48 (41%), Gaps = 8/48 (16%)

Query: 206  YNPNAAAAPSGSIYPSYPTPGATPGAGGYSPY--PLYPPGGSTPATPY 251
            YN    + P+  I P+Y T G      G  PY   L+ P G     PY
Sbjct: 1035 YNCTELSNPTYGISPTYNTSGIKGETIG--PYCKELFNPQG----LPY 1076


>gnl|CDD|149453 pfam08397, IMD, IRSp53/MIM homology domain.  The N-terminal
           predicted helical stretch of the insulin receptor
           tyrosine kinase substrate p53 (IRSp53) is an
           evolutionary conserved F-actin bundling domain involved
           in filopodium formation. The domain has been named IMD
           after the IRSp53 and missing in metastasis (MIM)
           proteins in which it occurs. Filopodium-inducing IMD
           activity is regulated by Cdc42 and Rac1 and is
           SH3-independent.
          Length = 218

 Score = 28.6 bits (64), Expect = 5.7
 Identities = 19/84 (22%), Positives = 32/84 (38%), Gaps = 10/84 (11%)

Query: 273 IKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELE 332
            K  LK A+ED   +++  ++ + Q   E L +      + +F        +DK      
Sbjct: 139 SKYQLKEALEDVNDKQLELEEFEKQGVREALLE-----EKRRF---CFL--VDKHCPVSN 188

Query: 333 RSISFLKEKETELDEILAVLTEKT 356
             ++F  E  T L   L    E T
Sbjct: 189 EELAFHGEAITLLQTKLPDWQELT 212


>gnl|CDD|148614 pfam07106, TBPIP, Tat binding protein 1(TBP-1)-interacting protein
           (TBPIP).  This family consists of several eukaryotic
           TBP-1 interacting protein (TBPIP) sequences. TBP-1 has
           been demonstrated to interact with the human
           immunodeficiency virus type 1 (HIV-1) viral protein Tat,
           then modulate the essential replication process of HIV.
           In addition, TBP-1 has been shown to be a component of
           the 26S proteasome, a basic multiprotein complex that
           degrades ubiquitinated proteins in an ATP-dependent
           fashion. Human TBPIP interacts with human TBP-1 then
           modulates the inhibitory action of human TBP-1 on
           HIV-Tat-mediated transactivation.
          Length = 169

 Score = 28.0 bits (63), Expect = 5.7
 Identities = 16/63 (25%), Positives = 30/63 (47%), Gaps = 5/63 (7%)

Query: 287 RKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELD 346
            K++ +  + +EE+++LKQ  + L      L            EL+  I  LK++  E++
Sbjct: 75  NKLDMEIEELREEVQLLKQDCSTLEIELKSLTSDL-----TTEELQEEIQELKKEVREIE 129

Query: 347 EIL 349
           E L
Sbjct: 130 EKL 132


>gnl|CDD|223961 COG1030, NfeD, Membrane-bound serine protease (ClpP class)
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 436

 Score = 28.9 bits (65), Expect = 5.7
 Identities = 13/26 (50%), Positives = 16/26 (61%)

Query: 377 TFTEEAATEDTIYYMGEALRRGVIDL 402
           T+ E   TE+      EALR+GVIDL
Sbjct: 159 TWAERFVTENLSLTAEEALRQGVIDL 184


>gnl|CDD|221923 pfam13102, DUF3946, Protein of unknown function (DUF3946).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences. This family appears
           related to the N-terminal domain of phage integrases.
          Length = 140

 Score = 28.0 bits (63), Expect = 5.8
 Identities = 14/83 (16%), Positives = 34/83 (40%), Gaps = 8/83 (9%)

Query: 265 GGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERI 324
            G    E + + L    +DK  ++ N++ +   E   I  +   EL   ++      ++ 
Sbjct: 23  NGKRKYEFLGLYLYPEPKDKAEKEQNKETLLLAE--AIRAKRILELQNEEYGFE---DKT 77

Query: 325 DKEKAELERSISFLKEKETELDE 347
           +K K +    + + KE   + ++
Sbjct: 78  EKSKIDF---LDYFKEYIEKKEK 97


>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23.  All proteins
           in this family for which functions are known are
           components of a multiprotein complex used for targeting
           nucleotide excision repair to specific parts of the
           genome. In humans, Rad23 complexes with the XPC protein.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 378

 Score = 28.7 bits (64), Expect = 5.9
 Identities = 17/86 (19%), Positives = 25/86 (29%), Gaps = 11/86 (12%)

Query: 191 VNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATP 250
           V+ P       A P   P +A  P+ S  P+ P  G +            P       +P
Sbjct: 74  VSKPKTGTGKVAPPAATPTSAPTPTPS-PPASPASGMSAA----------PASAVEEKSP 122

Query: 251 YPVPTPSPAPAVTSGGTITDEHIKMS 276
                 + AP   S    +      S
Sbjct: 123 SEESATATAPESPSTSVPSSGSDAAS 148


>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
           division and chromosome partitioning].
          Length = 570

 Score = 28.9 bits (65), Expect = 5.9
 Identities = 32/132 (24%), Positives = 52/132 (39%), Gaps = 23/132 (17%)

Query: 288 KMNEQKVQAQEEIEILKQT----ENELNQGKFKLARMFERIDKEKAELERSISFLKEKET 343
           K  E     +EEIE +K++    E EL  G  +      + +KE          LKE E+
Sbjct: 320 KAKENNEHLKEEIERVKESYRLAETEL--GSVR------KFEKE----------LKELES 361

Query: 344 ELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATEDTIYYMGEALRRGVIDLE 403
            LDEIL    E  EV   E       I K + +   E+   ++ +  + +       +LE
Sbjct: 362 VLDEILEN-IEAQEVAYSELQDNLEEIEKALTDIEDEQEKVQEHLTSLRKDELEARENLE 420

Query: 404 VYLKQVRSLSRK 415
               ++  + R 
Sbjct: 421 RLKSKLHEIKRY 432


>gnl|CDD|234022 TIGR02813, omega_3_PfaA, polyketide-type polyunsaturated fatty acid
            synthase PfaA.  Members of the seed for this alignment
            are involved in omega-3 polyunsaturated fatty acid
            biosynthesis, such as the protein PfaA from the
            eicosapentaenoic acid biosynthesis operon in
            Photobacterium profundum strain SS9. PfaA is encoded
            together with PfaB, PfaC, and PfaD, and the functions of
            the individual polypeptides have not yet been described.
            More distant homologs of PfaA, also included with the
            reach of this model, appear to be involved in
            polyketide-like biosynthetic mechanisms of
            polyunsaturated fatty acid biosynthesis, an alternative
            to the more familiar iterated mechanism of chain
            extension and desaturation, and in most cases are encoded
            near genes for homologs of PfaB, PfaC, and/or PfaD.
          Length = 2582

 Score = 29.2 bits (65), Expect = 6.0
 Identities = 45/204 (22%), Positives = 71/204 (34%), Gaps = 21/204 (10%)

Query: 169  NGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGAT 228
            + N+   +P  TQ+  P+  S V T  P    +        A AAP      S P   A 
Sbjct: 1113 DSNIVKLSPLATQA--PVIKSVV-TQAPVVQVTISV-----APAAPVLPAVVSPPVVSAA 1164

Query: 229  PGAGGYSPYPLYPPGG---STPATPYPVPTPSPAPAVTSGGTITDEHIKMSLKSAVEDKI 285
            P     +   + P      + P        PS A A     ++ D        SA++  +
Sbjct: 1165 PAQSVATAVAMAPVAEVPIAVPVQQSVDYMPSVAQAAAPQASVND--------SAIQQVM 1216

Query: 286  RRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAEL-ERSISFLKEKETE 344
               + E+     E +E+    E +L     K   +   + +   +L E +   L E  T 
Sbjct: 1217 MEVVAEKTGYPTEMLELEMDMEADLGIDSIKRVEILGSVQEIINDLPELNPEDLAELRT- 1275

Query: 345  LDEILAVLTEKTEVDVDEAVTTTA 368
            L EI+  +  K    V  A   TA
Sbjct: 1276 LGEIVNYMQSKVSAAVASAPVNTA 1299


>gnl|CDD|235179 PRK03947, PRK03947, prefoldin subunit alpha; Reviewed.
          Length = 140

 Score = 28.0 bits (63), Expect = 6.0
 Identities = 8/28 (28%), Positives = 17/28 (60%)

Query: 328 KAELERSISFLKEKETELDEILAVLTEK 355
           + +L+ +I  L +++ EL++ L  L E 
Sbjct: 89  EKDLDEAIEILDKRKEELEKALEKLEEA 116


>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
          Length = 425

 Score = 28.9 bits (66), Expect = 6.1
 Identities = 23/88 (26%), Positives = 43/88 (48%), Gaps = 15/88 (17%)

Query: 280 AVEDKIRRK-----------MNEQKVQAQEEIEILKQTENELNQ--GKFKLARMFERIDK 326
           AV++ + ++           ++E++ + Q E+E L+   N L++  G+ K     E  + 
Sbjct: 13  AVKEALAKRGFPLDVDELLELDEERRELQTELEELQAERNALSKEIGQAKRKG--EDAEA 70

Query: 327 EKAELERSISFLKEKETELDEILAVLTE 354
             AE++     +K  E ELDE+ A L E
Sbjct: 71  LIAEVKELKEEIKALEAELDELEAELEE 98


>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
           This family consists of several Nucleopolyhedrovirus
           capsid protein P87 sequences. P87 is expressed late in
           infection and concentrated in infected cell nuclei.
          Length = 606

 Score = 28.7 bits (64), Expect = 6.5
 Identities = 35/179 (19%), Positives = 54/179 (30%), Gaps = 28/179 (15%)

Query: 173 ASNAPYPTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAG 232
             N P  T    PMP  +  TP  T                       S+ TP  TP   
Sbjct: 263 QKNTPAQT----PMPERSWQTPAQTPARRISTPMTEEI---------KSWQTPLQTPAMY 309

Query: 233 GYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGTITDEHIKMSLKSAVEDKIRRKMNEQ 292
                   P    T         PS   A++S      E          + K RR     
Sbjct: 310 SSDYQAPKPEPIYTWEELLRERFPSDLFAISSLPDSDSE--ASDSGPTRKRKRRRVPPLP 367

Query: 293 KVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAV 351
           +  + E+ +   + E +           +E+  K + E ++  +FL+ K  EL +   V
Sbjct: 368 EYSSDEDEDDSDEDEVD-----------YEKERKRRREEDK--NFLRLKALELSKYAGV 413


>gnl|CDD|224417 COG1500, COG1500, Predicted exosome subunit [Translation, ribosomal
           structure and biogenesis].
          Length = 234

 Score = 28.4 bits (64), Expect = 6.7
 Identities = 27/89 (30%), Positives = 42/89 (47%), Gaps = 17/89 (19%)

Query: 324 IDKEKAELERSISFLKEKETELDEILAVLT--------EK-TEVDVDEAVTTTAP--IYK 372
           +D  KA       + + KE +L+E+LA  T        EK +E D+ +A  TT P  I +
Sbjct: 23  VDPNKAL-----EYREGKEVDLEEVLATETVFKDASKGEKASEEDLKKAFGTTDPDEIAE 77

Query: 373 QILNTFTEEAATEDTIYYMGEALRRGVID 401
           +IL    E   T +    M E  +R +I+
Sbjct: 78  EILKK-GEIQLTAEQRREMLEEKKRQIIN 105


>gnl|CDD|113398 pfam04625, DEC-1_N, DEC-1 protein, N-terminal region.  The
           defective chorion-1 gene (dec-1) in Drosophila encodes
           follicle cell proteins necessary for proper eggshell
           assembly. Multiple products of the dec-1 gene are formed
           by alternative RNA splicing and proteolytic processing.
           Cleavage products include S80 (80 kDa) which is
           incorporated into the eggshell, and further proteolysis
           of S80 gives S60 (60 kDa).
          Length = 407

 Score = 28.6 bits (63), Expect = 6.7
 Identities = 13/41 (31%), Positives = 16/41 (39%)

Query: 220 PSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAP 260
           P+ P+     GA    P P   P  + PA P P      AP
Sbjct: 94  PAMPSMPGLLGAAAPVPAPAPAPAAAPPAAPAPAADTPAAP 134


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 28.7 bits (64), Expect = 6.7
 Identities = 12/61 (19%), Positives = 25/61 (40%), Gaps = 7/61 (11%)

Query: 287 RKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELD 346
                       E++ LK+ E  L           E ++KE  +L+  +  L+E++ +L+
Sbjct: 59  NVEISNYEALDSELDELKKEEERLLD-------ELEELEKEDDDLDGELVELQEEKEQLE 111

Query: 347 E 347
            
Sbjct: 112 N 112


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
           family consists of several Theileria P67 surface
           antigens. A stage specific surface antigen of Theileria
           parva, p67, is the basis for the development of an
           anti-sporozoite vaccine for the control of East Coast
           fever (ECF) in cattle. The antigen has been shown to
           contain five distinct linear peptide sequences
           recognised by sporozoite-neutralising murine monoclonal
           antibodies.
          Length = 727

 Score = 28.9 bits (64), Expect = 6.9
 Identities = 21/84 (25%), Positives = 32/84 (38%), Gaps = 4/84 (4%)

Query: 217 SIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPY---PVPTPSPAPAVTSGGTITDEHI 273
            I P    PG   G    SP    P G +T  TP    P     P  +  +  T + + I
Sbjct: 258 VIEPGVGLPGVRVGDSTSSPSTTRPSGSTTTTTPASSGPSAPGGPGSSSRNAVTRSTDSI 317

Query: 274 KMSLKSAVEDK-IRRKMNEQKVQA 296
              + S    + I  +M E+++ A
Sbjct: 318 SGPIPSPGAPRAITGQMGEREMFA 341


>gnl|CDD|152115 pfam11679, DUF3275, Protein of unknown function (DUF3275).  This
           family of proteins with unknown function appear to be
           restricted to Proteobacteria.
          Length = 211

 Score = 28.3 bits (63), Expect = 7.0
 Identities = 13/62 (20%), Positives = 20/62 (32%), Gaps = 6/62 (9%)

Query: 210 AAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTSGGTIT 269
           A+AAP  S     P+P +        P             P+ +   +PA       +  
Sbjct: 106 ASAAPVASAPAPAPSPQSPK------PASRRASRDMRRIAPFGMNASAPAQEAAQASSDE 159

Query: 270 DE 271
           DE
Sbjct: 160 DE 161


>gnl|CDD|219111 pfam06625, DUF1151, Protein of unknown function (DUF1151).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 122

 Score = 27.4 bits (61), Expect = 7.2
 Identities = 23/73 (31%), Positives = 38/73 (52%), Gaps = 8/73 (10%)

Query: 274 KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKA---E 330
           K  L+  +E + R     Q ++ Q+E E  K+ ++EL +   K A+  E+++ EKA   E
Sbjct: 50  KPELQRVLEKRKRD----QVLKEQKEEEEAKRLQSELERELMKRAQKLEQLELEKAKEDE 105

Query: 331 LERSIS-FLKEKE 342
            +  I  FLK K 
Sbjct: 106 EQEQIPEFLKVKA 118


>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
          Length = 991

 Score = 28.7 bits (64), Expect = 7.3
 Identities = 17/53 (32%), Positives = 19/53 (35%), Gaps = 3/53 (5%)

Query: 210 AAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAV 262
             A   G    S P PGA P A    P  + PP  + P    P     P  AV
Sbjct: 901 RVATAPGGDAASAPPPGAGPPA---PPQAVPPPRTTQPPAAPPRGPDVPPAAV 950


>gnl|CDD|237866 PRK14952, PRK14952, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 584

 Score = 28.7 bits (64), Expect = 7.8
 Identities = 20/76 (26%), Positives = 26/76 (34%), Gaps = 3/76 (3%)

Query: 143 LASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSA 202
           L SA  A   L     + E   + +   N+  NAP    +         + P P   P  
Sbjct: 362 LPSASDAESALLQRVERIETRLDMSIPANLLHNAPQAAPAPSAAAPEPKHQPAPEPRPVL 421

Query: 203 YP---GYNPNAAAAPS 215
            P      PNAAA  S
Sbjct: 422 APTPASGEPNAAAVRS 437


>gnl|CDD|178806 PRK00030, minC, septum formation inhibitor; Provisional.
          Length = 292

 Score = 28.5 bits (63), Expect = 7.8
 Identities = 17/91 (18%), Positives = 24/91 (26%), Gaps = 9/91 (9%)

Query: 179 PTQSYMPMPGSNVNTPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYP 238
             Q     P ++V TP P+ P +        A A P            A        P  
Sbjct: 107 APQVIDTAPPNDVATPVPSVPEATAEA---AAKAGPQDDEADGEQADEAPAHNPESVPTR 163

Query: 239 LYPPGGSTPATPYPVPTPSPAPAVTSGGTIT 269
                 +   T       +  P  +S   IT
Sbjct: 164 ------AARETTEANRPTATPPQSSSALVIT 188


>gnl|CDD|235262 PRK04235, PRK04235, hypothetical protein; Provisional.
          Length = 196

 Score = 28.0 bits (63), Expect = 8.0
 Identities = 11/32 (34%), Positives = 20/32 (62%), Gaps = 1/32 (3%)

Query: 298 EEIEILKQTENE-LNQGKFKLARMFERIDKEK 328
           E ++ L +  NE L +GK +L R+ E +++ K
Sbjct: 163 EYLKFLVEVANEKLLRGKERLNRLEEELEELK 194


>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
           family.  The proteins represented by this model contain
           three RNA recognition motifs (rrm: pfam00076) and have
           been characterized as poly-pyrimidine tract binding
           proteins associated with RNA splicing factors. In the
           case of PUF60 (GP|6176532), in complex with p54, and in
           the presence of U2AF, facilitates association of U2
           snRNP with pre-mRNA.
          Length = 612

 Score = 28.5 bits (63), Expect = 8.4
 Identities = 26/123 (21%), Positives = 38/123 (30%), Gaps = 21/123 (17%)

Query: 144 ASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVNTPYPTYPPSAY 203
           A A +A+    +       ++ PT  GN A  +    ++             P  P +A 
Sbjct: 317 AVAGAAVLGPRAQSPATPSSSLPTDIGNKAVVSSAKKEA----------EEVPPLPQAA- 365

Query: 204 PGYNPNAAAAPSGSIYPSYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVT 263
                  A    G +    PTP   PG     P  + PPG   P    P    SP   + 
Sbjct: 366 ------PAVVKPGPM--EIPTPVPPPGLA--IPSLVAPPGLVAPTEINPSFLASPRKKMK 415

Query: 264 SGG 266
              
Sbjct: 416 REK 418


>gnl|CDD|233004 TIGR00513, accA, acetyl-CoA carboxylase, carboxyl transferase,
           alpha subunit.  The enzyme acetyl-CoA carboxylase
           contains a biotin carboxyl carrier protein or domain, a
           biotin carboxylase, and a carboxyl transferase. This
           model represents the alpha chain of the carboxyl
           transferase for cases in which the architecture of the
           protein is as in E. coli, in which the
           carboxyltransferase portion consists of two
           non-identical subnits, alpha and beta [Fatty acid and
           phospholipid metabolism, Biosynthesis].
          Length = 316

 Score = 28.2 bits (63), Expect = 8.6
 Identities = 19/81 (23%), Positives = 35/81 (43%), Gaps = 9/81 (11%)

Query: 283 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKE 342
           + +R +  ++ V   EEIE L++   EL +  F     ++R      +L R      ++ 
Sbjct: 20  ESLRARSRDEDVDLSEEIERLEKRSVELTKKIFSNLGAWQR-----LQLAR----HPDRP 70

Query: 343 TELDEILAVLTEKTEVDVDEA 363
             LD I  +  +  E+  D A
Sbjct: 71  YTLDYIELIFDDFFELAGDRA 91


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 28.8 bits (64), Expect = 8.7
 Identities = 24/130 (18%), Positives = 48/130 (36%), Gaps = 21/130 (16%)

Query: 267  TITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDK 326
            +       M+  +  ED+     ++Q    Q+  +I +  E+ L             I++
Sbjct: 4234 SANQNPDSMNSTNIAEDEADEVGDKQLQDGQDISDIKQTGEDTLPTEFGS-------INQ 4286

Query: 327  EKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVTTTAPIYKQILNTFTEEAATED 386
             +   E S    ++ E EL +    +T    + +DEA        + + N    E +T+ 
Sbjct: 4287 SEKVFELSE--DEDIEDELPDYNVKIT--PAMPIDEA--------RDLWN--KHEDSTKQ 4332

Query: 387  TIYYMGEALR 396
                + E LR
Sbjct: 4333 LSIELCEQLR 4342


>gnl|CDD|221480 pfam12238, MSA-2c, Merozoite surface antigen 2c.  This family of
           proteins is found in eukaryotes. Proteins in this family
           are typically between 263 and 318 amino acids in length.
           There is a conserved SFT sequence motif. MSA-2 is a
           plasma membrane glycoprotein which can be found in
           Babesia bovis species.
          Length = 201

 Score = 27.8 bits (62), Expect = 8.7
 Identities = 14/56 (25%), Positives = 18/56 (32%), Gaps = 1/56 (1%)

Query: 210 AAAAPSGSIYPS-YPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAPAVTS 264
           A A      + S   T    P  G  +P    PP    PA     P  +P    +S
Sbjct: 137 APAEYYSPKHSSSQGTSTTRPSDGSATPNTSAPPTPGNPAAQPEKPAETPKGNGSS 192


>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
          Length = 566

 Score = 28.5 bits (63), Expect = 8.7
 Identities = 20/100 (20%), Positives = 33/100 (33%), Gaps = 5/100 (5%)

Query: 162 PATNPTYNGNMASNAPYPTQSYMPMPGSNVNT-PYPTYPPSAYPGYNPNAAAAPSGSIYP 220
               P  + + A+  P P     P P    +  P P   P++     P+ A AP+ +   
Sbjct: 25  NIPIPELHTSAATQKPDPA----PAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAASE 80

Query: 221 SYPTPGATPGAGGYSPYPLYPPGGSTPATPYPVPTPSPAP 260
            +    A   A   +P P   P  +    P      + A 
Sbjct: 81  KFDPAPAPHQAASRAPDPAVAPQLAAAPKPDAAEAFTSAA 120


>gnl|CDD|227430 COG5099, COG5099, RNA-binding protein of the Puf family,
           translational repressor [Translation, ribosomal
           structure and biogenesis].
          Length = 777

 Score = 28.6 bits (64), Expect = 8.9
 Identities = 32/141 (22%), Positives = 48/141 (34%), Gaps = 13/141 (9%)

Query: 133 NSKLEGLIKTLASAFSALPPLYSSRTKQEPATNPTYNGNMASNAPYPTQSYMPMPGSNVN 192
            S  + L+  L S  S+  P  SSR K   A + T  GN  S+    T S      S  N
Sbjct: 92  TSGSQSLLMELPS--SSFNPSTSSRNKSNSALSSTQQGNANSSV---TLSSSTAS-SMFN 145

Query: 193 TPYPTYPPSAYPGYNPNAAAAPSGSIYPSYPTPGATPGAGGY---SPYPLYPPGGSTPAT 249
           +      P   P ++ +A    SGS + + P   ++         S          +P  
Sbjct: 146 SN---KLPLPNPNHSNSATTNQSGSSFINTPASSSSQPLTNLVVSSIKRFPYLTSLSPFF 202

Query: 250 PYPVPTPSPAPAVTSGGTITD 270
            Y +   S +    S  T   
Sbjct: 203 NYLIDPSSDSA-TASADTSPS 222


>gnl|CDD|235570 PRK05703, flhF, flagellar biosynthesis regulator FlhF; Validated.
          Length = 424

 Score = 28.3 bits (64), Expect = 9.0
 Identities = 24/121 (19%), Positives = 40/121 (33%), Gaps = 11/121 (9%)

Query: 251 YPVPTPSPAPAVTSGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENE- 309
                 +   A+        E  K   K +   +   K   +    Q+E++ L+    E 
Sbjct: 83  KRPSRTNSQDALLQAENALPEWKKELEKPSEPKEEEPKAAAESKVVQKELDELRDELKEL 142

Query: 310 ---LNQGKFKLARMFERIDKEKAELERSISFLKE---KETELDEILAVLTEKTEVDVDEA 363
              L      L ++ ERI  E AEL +    LK         +++L +L E        A
Sbjct: 143 KNLLEDQLSGLRQV-ERIPPEFAELYK---RLKRSGLSPEIAEKLLKLLLEHMPPRERTA 198

Query: 364 V 364
            
Sbjct: 199 W 199


>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
           (CALCOCO1) like.  Proteins found in this family are
           similar to the coiled-coil transcriptional coactivator
           protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
           This protein binds to a highly conserved N-terminal
           domain of p160 coactivators, such as GRIP1, and thus
           enhances transcriptional activation by a number of
           nuclear receptors. CALCOCO1 has a central coiled-coil
           region with three leucine zipper motifs, which is
           required for its interaction with GRIP1 and may regulate
           the autonomous transcriptional activation activity of
           the C-terminal region.
          Length = 546

 Score = 28.4 bits (63), Expect = 9.1
 Identities = 18/56 (32%), Positives = 28/56 (50%)

Query: 286 RRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEK 341
           R  +  Q+ + Q+ I  L+Q    L Q K +  R+ E     +AELER    LK++
Sbjct: 208 RNALLAQRAENQQRILELEQDIQTLTQKKQENDRVLEGTQDIEAELERMKGELKQR 263


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.313    0.130    0.382 

Gapped
Lambda     K      H
   0.267   0.0630    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 22,168,786
Number of extensions: 2143914
Number of successful extensions: 4324
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3476
Number of HSP's successfully gapped: 642
Length of query: 434
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 334
Effective length of database: 6,502,202
Effective search space: 2171735468
Effective search space used: 2171735468
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 60 (27.1 bits)