RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy10797
         (517 letters)



>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 55.9 bits (135), Expect = 3e-08
 Identities = 32/151 (21%), Positives = 76/151 (50%)

Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
           EE  R++++LE Q  +L++++A  ++E E L + +E   +  E+++E+   L ++L E +
Sbjct: 705 EELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELE 764

Query: 356 DANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLT 415
           +    L     K  +  +   E++  L+E+++ L  ++E     +  LE +   L+    
Sbjct: 765 EELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRE 824

Query: 416 NAEKELHLRNQAMDLNKRKAIESAQSAADLK 446
             E+E+    + ++  + K  E  +   +L+
Sbjct: 825 RLEQEIEELEEEIEELEEKLDELEEELEELE 855



 Score = 51.6 bits (124), Expect = 8e-07
 Identities = 29/151 (19%), Positives = 69/151 (45%), Gaps = 3/151 (1%)

Query: 285 SKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQN 344
            +  E  +   EE   ++++LE++   L++ +A  K+E E L  + +   +  E+++E+ 
Sbjct: 743 EEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEEL 802

Query: 345 SRLLQQLRE-KDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKL 403
               ++L   + +             ++ +L  EE + L E++  L  ++E +   + +L
Sbjct: 803 EEAERRLDALERELESLEQRRERLEQEIEEL-EEEIEELEEKLDELEEELEELEKELEEL 861

Query: 404 EEKERFLQTVLTNAEKEL-HLRNQAMDLNKR 433
           +E+   L+      E EL  L  +  +L + 
Sbjct: 862 KEELEELEAEKEELEDELKELEEEKEELEEE 892



 Score = 50.9 bits (122), Expect = 2e-06
 Identities = 32/141 (22%), Positives = 64/141 (45%), Gaps = 10/141 (7%)

Query: 284 DSKRDERKKLADE-EAMRKIKQLEDQTH--QLQKQVATHKQEEEALLNEMEVTGQAFEDM 340
           +    E ++  +E E        E +     L++++A  + E   + NE+E   +  E +
Sbjct: 350 EQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESL 409

Query: 341 QEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAI 400
           +E+  RL ++L +       L  E  +     +  + E + L E+++ L  Q+E +   +
Sbjct: 410 EERLERLSERLED-------LKEELKELEAELEELQTELEELNEELEELEEQLEELRDRL 462

Query: 401 RKLEEKERFLQTVLTNAEKEL 421
           ++LE +   LQ  L   EKEL
Sbjct: 463 KELERELAELQEELQRLEKEL 483



 Score = 49.7 bits (119), Expect = 3e-06
 Identities = 28/162 (17%), Positives = 73/162 (45%), Gaps = 4/162 (2%)

Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQE 342
              +  E ++   EE  + + +LE+   +L+++++   +E E L   +    +   +++ 
Sbjct: 334 ALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALR---EELAELEA 390

Query: 343 QNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRK 402
           + + +  +L E       L     + ++  +  +EE   L  +++ L  ++E ++  + +
Sbjct: 391 ELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEE 450

Query: 403 LEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAAD 444
           LEE+   L+  L   E+EL    + +     K + S ++  D
Sbjct: 451 LEEQLEELRDRLKELERELAELQEELQ-RLEKELSSLEARLD 491



 Score = 48.2 bits (115), Expect = 9e-06
 Identities = 39/148 (26%), Positives = 75/148 (50%), Gaps = 9/148 (6%)

Query: 283 RDSKRDERKKLADE--EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDM 340
           R + ++E ++L +E  EA R++  LE +   L+++    +QE E L  E+E   +  +++
Sbjct: 788 RQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDEL 847

Query: 341 QEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALS-------LQI 393
           +E+   L ++L E  +   +L  E+ +     K   EEK+ L E+++ L         +I
Sbjct: 848 EEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEI 907

Query: 394 EAMHTAIRKLEEKERFLQTVLTNAEKEL 421
           E +   + +LE K   L+  L   E+EL
Sbjct: 908 EKLRERLEELEAKLERLEVELPELEEEL 935



 Score = 45.1 bits (107), Expect = 8e-05
 Identities = 32/151 (21%), Positives = 73/151 (48%), Gaps = 1/151 (0%)

Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
           +     +   E+   ++++LE++  +L++++   ++  E L  E+E   +A   ++E+  
Sbjct: 723 RELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIE 782

Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
            L ++ +   +   +L  E  ++ +       E ++L ++ + L  +IE +   I +LEE
Sbjct: 783 ELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEE 842

Query: 406 KERFLQTVLTNAEKEL-HLRNQAMDLNKRKA 435
           K   L+  L   EKEL  L+ +  +L   K 
Sbjct: 843 KLDELEEELEELEKELEELKEELEELEAEKE 873



 Score = 40.9 bits (96), Expect = 0.002
 Identities = 36/174 (20%), Positives = 88/174 (50%), Gaps = 8/174 (4%)

Query: 273 LEFRKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEV 332
           L   + A ++    ++ RK+L  EE   ++ +LE++  +LQ+++   ++E E L +E+E 
Sbjct: 222 LRELELALLLAK-LKELRKEL--EELEEELSRLEEELEELQEELEEAEKEIEELKSELEE 278

Query: 333 TGQAFEDMQEQNSRLLQQLREKDD--ANFKLMTERIKSNQLHKLAREEKDTLREQVQALS 390
             +  E++QE+   L +++ E +   +  +   E +++       R E+  L+E+++AL 
Sbjct: 279 LREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEE--LKEKIEALK 336

Query: 391 LQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAAD 444
            ++E   T + +LE+    L+      E++L    + ++    +A+    +  +
Sbjct: 337 EELEERETLLEELEQLLAELEEAKEELEEKLSALLEELE-ELFEALREELAELE 389



 Score = 39.3 bits (92), Expect = 0.005
 Identities = 31/167 (18%), Positives = 73/167 (43%), Gaps = 6/167 (3%)

Query: 287 RDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSR 346
            + ++++ + E   +I  L ++  +L+ ++   ++  E L  ++E   +  E+ +     
Sbjct: 291 LELKEEIEELE--GEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEE 348

Query: 347 LLQQL----REKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRK 402
           L Q L      K++   KL     +  +L +  REE   L  ++  +  ++E +   I  
Sbjct: 349 LEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIES 408

Query: 403 LEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLHL 449
           LEE+   L   L + ++EL      ++  + +  E  +   +L+  L
Sbjct: 409 LEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQL 455



 Score = 39.3 bits (92), Expect = 0.005
 Identities = 37/177 (20%), Positives = 77/177 (43%), Gaps = 25/177 (14%)

Query: 285 SKRDERKKLADEEAMRKIKQ-------LEDQTHQLQKQVATHKQEEEALLNEMEVTGQAF 337
           SK  ERK    EEA RK+++       LED   +L+KQ+   +++ E            +
Sbjct: 168 SKYKERK----EEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAER--------Y 215

Query: 338 EDMQEQNSRL--LQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEA 395
           ++++ +   L     L +  +   +L  E ++         EE + L+E+++    +IE 
Sbjct: 216 QELKAELRELELALLLAKLKELRKEL--EELEEEL--SRLEEELEELQEELEEAEKEIEE 271

Query: 396 MHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLHLVII 452
           + + + +L E+   LQ  L   ++E+      + L + +  E      +L+  L  +
Sbjct: 272 LKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEEL 328



 Score = 35.8 bits (83), Expect = 0.064
 Identities = 33/156 (21%), Positives = 68/156 (43%), Gaps = 30/156 (19%)

Query: 283  RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQE 342
             +    E+++L DE     +K+LE++  +L++++   + E   L  E+E   +  E+++ 
Sbjct: 865  LEELEAEKEELEDE-----LKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEA 919

Query: 343  QNSRLLQQLREKDDAN---FKLMTERIKSNQLHKLAR-------------EEKDTLREQV 386
            +  RL  +L E ++     ++   E     ++ +L               EE + + E+ 
Sbjct: 920  KLERLEVELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRAIEEYEEVEERY 979

Query: 387  QALSLQIEAMHTAIRKLEE---------KERFLQTV 413
            + L  Q E +  A  KL E         +ERF +T 
Sbjct: 980  EELKSQREDLEEAKEKLLEVIEELDKEKRERFKETF 1015



 Score = 31.2 bits (71), Expect = 1.6
 Identities = 29/131 (22%), Positives = 56/131 (42%), Gaps = 16/131 (12%)

Query: 319 HKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREE 378
           +K+   A   E++   +   +++ Q  +L ++L+   +    L               EE
Sbjct: 658 NKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLL-----------EE 706

Query: 379 KDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKEL-HLRNQAMDLNKRKA-I 436
              LR Q++ L  Q+E +   +  LEE+   LQ+ L   E+EL  L  +  +L +R   +
Sbjct: 707 ---LRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEEL 763

Query: 437 ESAQSAADLKL 447
           E    + +  L
Sbjct: 764 EEELESLEEAL 774


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 52.8 bits (127), Expect = 3e-07
 Identities = 27/157 (17%), Positives = 73/157 (46%)

Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
            E  +++++LE++  QL+K++    ++  AL  ++       E ++E+ ++L ++L E +
Sbjct: 701 AELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELE 760

Query: 356 DANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLT 415
               +L     ++ +    A  E + L  Q++ L  +++A+  A+ +L  +   L     
Sbjct: 761 AEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAA 820

Query: 416 NAEKELHLRNQAMDLNKRKAIESAQSAADLKLHLVII 452
           N  + L    + +   +R+  +  +   +L   +  +
Sbjct: 821 NLRERLESLERRIAATERRLEDLEEQIEELSEDIESL 857



 Score = 47.7 bits (114), Expect = 1e-05
 Identities = 33/172 (19%), Positives = 74/172 (43%), Gaps = 15/172 (8%)

Query: 287 RDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ--- 343
             E K     E   ++ +LE++  +L++++ + + E E L  E+E      E+++EQ   
Sbjct: 327 ELESKLDELAE---ELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLET 383

Query: 344 -NSRLLQQLREKDDANFKLMTERIKSNQL-HKLAR-------EEKDTLREQVQALSLQIE 394
             S++ Q   +    N ++     +  +L  +  R         K     +++ L  ++E
Sbjct: 384 LRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELE 443

Query: 395 AMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLK 446
            +   + +L+E+   L+  L    +EL    QA+D  +R+  +       L+
Sbjct: 444 ELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLE 495



 Score = 45.4 bits (108), Expect = 7e-05
 Identities = 24/123 (19%), Positives = 52/123 (42%)

Query: 300 RKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANF 359
             + +L  +   L ++ A  ++  E+L   +  T +  ED++EQ   L + +        
Sbjct: 803 EALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIE 862

Query: 360 KLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEK 419
           +L     +     +    E+ +L E +  L  ++E +   +R+LE K   L+  L    +
Sbjct: 863 ELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELRE 922

Query: 420 ELH 422
           +L 
Sbjct: 923 KLA 925



 Score = 42.7 bits (101), Expect = 4e-04
 Identities = 29/167 (17%), Positives = 68/167 (40%), Gaps = 17/167 (10%)

Query: 284 DSKRDERKKLADEEAM--RKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQ 341
           +  R+E ++L +E      ++++L  +  +L++++   + E   L  E+E   +    + 
Sbjct: 235 EELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALA 294

Query: 342 EQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLRE--------------QVQ 387
            + SRL QQ +   +    L  +  +     +    + D L E              +++
Sbjct: 295 NEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELE 354

Query: 388 ALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELH-LRNQAMDLNKR 433
           +L  ++E +   + +LE +   L+  L     ++  L  Q   LN  
Sbjct: 355 SLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNE 401



 Score = 42.0 bits (99), Expect = 0.001
 Identities = 46/186 (24%), Positives = 88/186 (47%), Gaps = 34/186 (18%)

Query: 285 SKRDERKKLADEEAMRKI-------KQLEDQTHQLQKQVAT-HKQEEEAL--------LN 328
           SK  ER+K    E  RK+        +LED  ++L++Q+ +  +Q E+A         L 
Sbjct: 168 SKYKERRK----ETERKLERTRENLDRLEDILNELERQLKSLERQAEKAERYKELKAELR 223

Query: 329 EME--VTGQAFEDMQEQNSRLLQQL----REKDDANFKLMTERIKSNQLHKLAREEKDTL 382
           E+E  +     E+++E+   L ++L     E ++   +L     K  +L +L   E +  
Sbjct: 224 ELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEEL-RLEVSELEEE 282

Query: 383 REQVQA--LSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQ 440
            E++Q    +L  E     I +LE++++ L+  L N E++L      ++  + K  E A+
Sbjct: 283 IEELQKELYALANE-----ISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAE 337

Query: 441 SAADLK 446
             A+L+
Sbjct: 338 ELAELE 343



 Score = 41.2 bits (97), Expect = 0.001
 Identities = 37/168 (22%), Positives = 69/168 (41%), Gaps = 25/168 (14%)

Query: 264  DSASNIYGSLEFRKFACVVRDSKRDERKKLADE--EAMRKIKQLEDQTHQLQKQVATHKQ 321
            ++  N   SLE            R E ++L++E  E   K  +L  +  +L++++A  + 
Sbjct: 876  EALLNERASLEEA------LALLRSELEELSEELRELESKRSELRRELEELREKLAQLEL 929

Query: 322  EEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKS--NQLHKL----- 374
              E L   ++   +   +         + L  K + + +    R+K   N++ +L     
Sbjct: 930  RLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKRLENKIKELGPVNL 989

Query: 375  -AREEKDTLREQVQALSLQIEAMHTAIRKLEE---------KERFLQT 412
             A EE + L+E+   L+ Q E +  A   LEE         +ERF  T
Sbjct: 990  AAIEEYEELKERYDFLTAQKEDLTEAKETLEEAIEEIDREARERFKDT 1037



 Score = 41.2 bits (97), Expect = 0.001
 Identities = 32/162 (19%), Positives = 75/162 (46%), Gaps = 8/162 (4%)

Query: 285 SKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQN 344
           S+ +++K++  E    ++  LE Q  +L+ Q+   + + + L  E+    +  E+++E+ 
Sbjct: 298 SRLEQQKQILRE----RLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEEL 353

Query: 345 SRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLE 404
             L  +L E +    +L +   +  +  +  R +   L  Q+ +L+ +IE +   + +LE
Sbjct: 354 ESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLE 413

Query: 405 EKERFLQTVLTNAEKELH---LRNQAMDLN-KRKAIESAQSA 442
           ++   LQ  +    K+L    L+    +L    + +E  Q  
Sbjct: 414 DRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEE 455



 Score = 41.2 bits (97), Expect = 0.001
 Identities = 35/193 (18%), Positives = 77/193 (39%), Gaps = 4/193 (2%)

Query: 253 KSRWFDFRSLSDSASNIYGSLEFRKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQL 312
           +    +      +       LE      V +  +R  +      E   +I++LE++  + 
Sbjct: 718 RKELEELSRQISALRKDLARLEAE----VEQLEERIAQLSKELTELEAEIEELEERLEEA 773

Query: 313 QKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLH 372
           ++++A  + E E L  ++E   +  + ++E    L  +L   ++    L        +  
Sbjct: 774 EEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRI 833

Query: 373 KLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNK 432
                  + L EQ++ LS  IE++   I +LEE    L++ L     E     +A+ L +
Sbjct: 834 AATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLR 893

Query: 433 RKAIESAQSAADL 445
            +  E ++   +L
Sbjct: 894 SELEELSEELREL 906



 Score = 40.0 bits (94), Expect = 0.003
 Identities = 22/103 (21%), Positives = 51/103 (49%)

Query: 320 KQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEK 379
           ++E E L  ++E   +   ++++  + L ++L E ++   +L  E  + ++     R++ 
Sbjct: 676 RREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDL 735

Query: 380 DTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELH 422
             L  +V+ L  +I  +   + +LE +   L+  L  AE+EL 
Sbjct: 736 ARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELA 778



 Score = 39.7 bits (93), Expect = 0.004
 Identities = 38/208 (18%), Positives = 69/208 (33%), Gaps = 37/208 (17%)

Query: 252 YKSRWFDFRSLSDSASNIYGSLEFRKFACVVRDSKRDERKKLADEEAMRKI-KQLEDQTH 310
            K      R   D        L   + A +    +  ER+  A E  +  + +Q+E+ + 
Sbjct: 794 LKEELKALREALDELRAELTLLN-EEAANLRERLESLERRIAATERRLEDLEEQIEELSE 852

Query: 311 Q----------LQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFK 360
                      L++ +   + E EALLNE        E +    S L             
Sbjct: 853 DIESLAAEIEELEELIEELESELEALLNERA---SLEEALALLRSEL------------- 896

Query: 361 LMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKE 420
                  S +L +L   ++  LR +++ L  ++  +   +  LE +   LQ  L+    E
Sbjct: 897 ----EELSEELREL-ESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLS----E 947

Query: 421 LHLRNQAMDLNKRKAIESAQSAADLKLH 448
            +             IE  +  A  +L 
Sbjct: 948 EYSLTLEEAEALENKIEDDEEEARRRLK 975



 Score = 39.3 bits (92), Expect = 0.005
 Identities = 33/180 (18%), Positives = 74/180 (41%), Gaps = 21/180 (11%)

Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
            + E ++L     + ++++L ++  +LQ+++   ++E E L  E++   +  E+++ + S
Sbjct: 218 LKAELRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVS 277

Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
            L +++ E     + L  E  +  Q  ++ RE    L  Q++ L  Q+E       KL+E
Sbjct: 278 ELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLE---ELESKLDE 334

Query: 406 KERFLQTV-----------------LTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLH 448
               L  +                 L   E EL      ++    + +E+ +S       
Sbjct: 335 LAEELAELEEKLEELKEELESLEAELEELEAELEELESRLE-ELEEQLETLRSKVAQLEL 393


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 50.4 bits (121), Expect = 5e-07
 Identities = 32/141 (22%), Positives = 61/141 (43%), Gaps = 4/141 (2%)

Query: 313 QKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRL---LQQLREKDDANFKLMTERIKSN 369
           +K +   K E EAL   +E      ED++ Q S+L   +Q++RE+     + ++      
Sbjct: 30  RKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDER 89

Query: 370 QLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMD 429
           +L  L  E     +E++ +L  ++  +   I KLE++   L+  L   EK L      ++
Sbjct: 90  ELRALNIEI-QIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLE 148

Query: 430 LNKRKAIESAQSAADLKLHLV 450
               +  E  Q  +  +  L 
Sbjct: 149 EEVAEIREEGQELSSKREELK 169


>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail.  The myosin molecule is a
           multi-subunit complex made up of two heavy chains and
           four light chains it is a fundamental contractile
           protein found in all eukaryote cell types. This family
           consists of the coiled-coil myosin heavy chain tail
           region. The coiled-coil is composed of the tail from two
           molecules of myosin. These can then assemble into the
           macromolecular thick filament. The coiled-coil region
           provides the structural backbone the thick filament.
          Length = 859

 Score = 47.7 bits (114), Expect = 1e-05
 Identities = 44/200 (22%), Positives = 78/200 (39%), Gaps = 43/200 (21%)

Query: 287 RDERKKLADEEAMRK--IKQLEDQTHQL-----QKQVATH-KQEEEALLNEMEVT----G 334
           R   +K  + E  RK   + +E     L      K  A+  K++ E  +NE+E+      
Sbjct: 507 RRLAEKEEEFENTRKNHQRAIESLQATLEAEAKGKAEASRLKKKLEGDINELEIALDHAN 566

Query: 335 QAFEDMQEQNSRLLQQLRE-----------KDDA--NFKLMTERIK-------------- 367
           +A  + Q+   +  QQ++E           ++DA     +   R                
Sbjct: 567 KANAEAQKNVKKYQQQVKELQTQVEEEQRAREDAREQLAVAERRATALEAELEELRSALE 626

Query: 368 -SNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQ 426
            + +  K A  E     E+V  L+ Q  ++    RKLE +   LQ+ L  A  EL     
Sbjct: 627 QAERARKQAETELAEASERVNELTAQNSSLIAQKRKLEGELAALQSDLDEAVNELKA--- 683

Query: 427 AMDLNKRKAIESAQSAADLK 446
           A +  K+   ++A+ A +L+
Sbjct: 684 AEERAKKAQADAARLAEELR 703



 Score = 37.3 bits (87), Expect = 0.018
 Identities = 36/155 (23%), Positives = 72/155 (46%), Gaps = 21/155 (13%)

Query: 284 DSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ 343
           +S+ +E K+  +EE+  +   L+ Q  QL+        + ++L  ++E   +A  +++ Q
Sbjct: 242 ESQLEEAKRSLEEESRERAN-LQAQLRQLE-------HDLDSLREQLEEESEAKAELERQ 293

Query: 344 NSR---LLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAI 400
            S+    +QQ R K ++   L  E ++         E K  L +++  L    EA +   
Sbjct: 294 LSKANAEIQQWRSKFESEGALRAEELE---------ELKKKLNQKISELEEAAEAANAKC 344

Query: 401 RKLEEKERFLQTVLTNAEKEL-HLRNQAMDLNKRK 434
             LE+ +  LQ+ L + + EL      A +L K++
Sbjct: 345 DSLEKTKSRLQSELEDLQIELERANAAASELEKKQ 379



 Score = 31.2 bits (71), Expect = 1.8
 Identities = 32/133 (24%), Positives = 54/133 (40%), Gaps = 9/133 (6%)

Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
           E A     +LE +     K +A  K++ + L  E++   +   ++  +  RL  +L E  
Sbjct: 366 ERANAAASELEKKQKNFDKILAEWKRKVDELQAELDTAQREARNLSTELFRLKNELEELK 425

Query: 356 DANFKLMTERIK-SNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVL 414
           D    L  E     +++H L     D L E  + +      +  A R+LE ++  LQ  L
Sbjct: 426 DQVEALRRENKNLQDEIHDL----TDQLGEGGRNV----HELEKARRRLEAEKDELQAAL 477

Query: 415 TNAEKELHLRNQA 427
             AE  L L    
Sbjct: 478 EEAEAALELEESK 490



 Score = 30.8 bits (70), Expect = 2.0
 Identities = 30/149 (20%), Positives = 65/149 (43%), Gaps = 12/149 (8%)

Query: 294 ADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLRE 353
           A EEA   ++  E +  + Q +++  + E E  L E E   + FE+ ++ + R ++ L+ 
Sbjct: 476 ALEEAEAALELEESKVLRAQVELSQIRSEIERRLAEKE---EEFENTRKNHQRAIESLQA 532

Query: 354 KDDANFKLMTE--RIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQ 411
             +A  K   E  R+K     KL  +  + L   +   +         ++K +++ + LQ
Sbjct: 533 TLEAEAKGKAEASRLKK----KLEGDINE-LEIALDHANKANAEAQKNVKKYQQQVKELQ 587

Query: 412 TVLTNAEKELH--LRNQAMDLNKRKAIES 438
           T +   ++         A+   +  A+E+
Sbjct: 588 TQVEEEQRAREDAREQLAVAERRATALEA 616



 Score = 30.4 bits (69), Expect = 3.2
 Identities = 30/159 (18%), Positives = 77/159 (48%), Gaps = 16/159 (10%)

Query: 289 ERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLL 348
           E  K  + E  +  K LE+   Q ++ +AT +++ +  +NE+    +  E +Q+Q ++  
Sbjct: 95  ELNKKREAELAKLRKDLEEANLQHEEALATLRKKHQDAINEL---SEQIEQLQKQKAKA- 150

Query: 349 QQLREKDDANFKLMTERIKSN--QLHKLAREEKDTLREQVQA----LSLQIEAMHTAIRK 402
               EK+ +  +   + + +   Q+ K A+   +   +Q+++    L ++++ +   +  
Sbjct: 151 ----EKEKSQLQAEVDDLLAQLDQIAK-AKLNAEKKAKQLESQLSELQVKLDELQRQLND 205

Query: 403 LEEKERFLQTVLTNAEKELH-LRNQAMDLNKRKAIESAQ 440
           L  ++  LQ+  ++  ++L     Q  +L+K K+   +Q
Sbjct: 206 LTSQKSRLQSENSDLTRQLEEAEAQVSNLSKLKSQLESQ 244


>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction
           endonuclease-like fold, RmuC family [General function
           prediction only].
          Length = 448

 Score = 45.8 bits (109), Expect = 4e-05
 Identities = 22/118 (18%), Positives = 45/118 (38%), Gaps = 22/118 (18%)

Query: 301 KIKQLEDQTHQLQKQVATHKQEEEALLNEM-EVTGQAFEDMQEQNS-RLLQQLREKDDAN 358
           +++ L +   QL         E + L NE+ E   +   ++ +QN  +LL+ LRE  +  
Sbjct: 92  QLQLLIESLAQLS-------SEFQELANEIFEELNRRLAELNQQNLKQLLKPLREVLEK- 143

Query: 359 FKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTN 416
                         +   +      E+   L  +I+ +   I++L ++   L   L  
Sbjct: 144 ------------FREQLEQRIHESAEERSTLLEEIDRLLGEIQQLAQEAGNLTAALKG 189



 Score = 28.9 bits (65), Expect = 6.8
 Identities = 30/136 (22%), Positives = 47/136 (34%), Gaps = 17/136 (12%)

Query: 297 EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDD 356
           + + ++ +LE    +L   V        A   E E        +Q  N  L+Q+L E   
Sbjct: 22  QLLLRLGRLEQMLGELA-AVLEQLLLLLAFRAEAEQLRTFARSLQALNLELIQELNE--- 77

Query: 357 ANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFL----QT 412
                +  R++   L    RE+   L E +  LS + +    A    EE  R L    Q 
Sbjct: 78  -----LKARLQQQLLQS--REQLQLLIESLAQLSSEFQ--ELANEIFEELNRRLAELNQQ 128

Query: 413 VLTNAEKELHLRNQAM 428
            L    K L    +  
Sbjct: 129 NLKQLLKPLREVLEKF 144


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 45.1 bits (107), Expect = 8e-05
 Identities = 34/177 (19%), Positives = 76/177 (42%), Gaps = 22/177 (12%)

Query: 285 SKRDERKKLADE------EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFE 338
           S+    +   DE      +A RKI ++E +  QL+++    K+  E L  ++    Q  E
Sbjct: 695 SELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIE 754

Query: 339 D----MQEQNSRLLQQLREKDDANFKLMTERIKSNQLH---KLAREEKDTLREQVQALSL 391
           +    ++E  +R+ +   E+D    +     +++   H      + E   L E+V  +  
Sbjct: 755 NVKSELKELEARIEE--LEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEA 812

Query: 392 QIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDL-NKRKAIESAQSAADLKL 447
           ++  +   + +L  ++ +L+      ++   L+ Q +DL  + K+IE      + K 
Sbjct: 813 RLREIEQKLNRLTLEKEYLE------KEIQELQEQRIDLKEQIKSIEKEIENLNGKK 863



 Score = 40.4 bits (95), Expect = 0.002
 Identities = 35/159 (22%), Positives = 69/159 (43%), Gaps = 28/159 (17%)

Query: 290 RKKLAD-----EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEME------------- 331
           ++K+ +         R I + E +    ++++A  + E + LL E+E             
Sbjct: 293 KEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRR 352

Query: 332 -VTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQ--LHKLARE------EKDTL 382
               + + +++E+   L  +L E  D  F    + +K  +  L KL RE      E D L
Sbjct: 353 DKLTEEYAELKEELEDLRAELEEV-DKEFAETRDELKDYREKLEKLKREINELKRELDRL 411

Query: 383 REQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKEL 421
           +E++Q LS ++  ++ AI  +E K   L+    +   E+
Sbjct: 412 QEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEI 450



 Score = 39.3 bits (92), Expect = 0.006
 Identities = 20/121 (16%), Positives = 54/121 (44%), Gaps = 3/121 (2%)

Query: 290 RKKLADEEAMR-KIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLL 348
           R + A+ + +R +++ L+ +   LQ ++   +   + L  E+    +   +++++  +L 
Sbjct: 670 RSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLE 729

Query: 349 QQLREKDDANFKLMTERIKSNQLHKLA-REEKDTLREQVQALSLQIEAMHTAIRKLEEKE 407
           Q+  EK     + + E + S +      + E   L  +++ L   +  +  A+  LE + 
Sbjct: 730 QE-EEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARL 788

Query: 408 R 408
            
Sbjct: 789 S 789



 Score = 38.9 bits (91), Expect = 0.008
 Identities = 32/146 (21%), Positives = 72/146 (49%), Gaps = 14/146 (9%)

Query: 284 DSKRDERKKLADE-------EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQA 336
           D KR + ++L  E       +A+ K K+ E + ++L K+    ++++EA+  ++    + 
Sbjct: 194 DEKRQQLERLRREREKAERYQALLKEKR-EYEGYELLKEKEALERQKEAIERQLASLEEE 252

Query: 337 FEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAM 396
            E + E+ S L ++L E +    + + ++IK      L  EE+  ++E++  L  +I ++
Sbjct: 253 LEKLTEEISELEKRLEEIEQL-LEELNKKIK-----DLGEEEQLRVKEKIGELEAEIASL 306

Query: 397 HTAIRKLEEKERFLQTVLTNAEKELH 422
             +I + E +    +  L   E E+ 
Sbjct: 307 ERSIAEKERELEDAEERLAKLEAEID 332



 Score = 36.6 bits (85), Expect = 0.037
 Identities = 20/144 (13%), Positives = 58/144 (40%), Gaps = 12/144 (8%)

Query: 283 RDSKRDERKKLADEEA-----MRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAF 337
           ++   D R +L + +        ++K   ++  +L++++   K+E + L  E++   +  
Sbjct: 363 KEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEEL 422

Query: 338 EDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMH 397
            D+    + +  ++ E ++   +     IK          + + L   +     ++  + 
Sbjct: 423 ADLNAAIAGIEAKINELEEE-KEDKALEIKK------QEWKLEQLAADLSKYEQELYDLK 475

Query: 398 TAIRKLEEKERFLQTVLTNAEKEL 421
               ++E++   LQ  L  AE + 
Sbjct: 476 EEYDRVEKELSKLQRELAEAEAQA 499



 Score = 32.3 bits (74), Expect = 0.67
 Identities = 16/84 (19%), Positives = 39/84 (46%), Gaps = 7/84 (8%)

Query: 338 EDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMH 397
           E ++ + S L  +LR  ++   +L  E   +++      +E + L ++ + L  ++E + 
Sbjct: 684 EGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELE 743

Query: 398 TAIRKLEEKERFLQTVLTNAEKEL 421
             +  LE++       + N + EL
Sbjct: 744 EDLSSLEQE-------IENVKSEL 760



 Score = 32.3 bits (74), Expect = 0.69
 Identities = 33/215 (15%), Positives = 73/215 (33%), Gaps = 40/215 (18%)

Query: 287  RDERKKLADE--EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQN 344
              E ++L ++  +   +IK +E +   L  +    ++E E L   +        D++++ 
Sbjct: 832  EKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKER 891

Query: 345  SRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQA---------------- 388
              L  QLRE +    +L  +  K  +     + + + L E++                  
Sbjct: 892  DELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEEL 951

Query: 389  ----LSLQIEAMHTAIRKLEE------------KERFL-----QTVLTNAEKELHLRNQA 427
                +  +++ +   IR LE              +R       +  L    K +  R + 
Sbjct: 952  SLEDVQAELQRVEEEIRALEPVNMLAIQEYEEVLKRLDELKEKRAKLEEERKAILERIEE 1011

Query: 428  MDLNKRKAIESAQSAADLKLHLVI-IYTNGPARLE 461
             +  KR+    A  A +   + +    + G   L 
Sbjct: 1012 YEKKKREVFMEAFEAINENFNEIFAELSGGTGELI 1046


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 44.1 bits (105), Expect = 1e-04
 Identities = 34/143 (23%), Positives = 67/143 (46%), Gaps = 21/143 (14%)

Query: 281 VVRDSKRD--ERKKL----ADEEAMRKIKQLEDQTHQ----LQKQVATHKQEEEALLNEM 330
           ++ ++K++    KK     A EE  +   +LE +  +    LQ+      Q EE L  +M
Sbjct: 37  IIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKM 96

Query: 331 EVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAR-------EEKDTLR 383
           E   +  E+++++   L  + +  D+   +L  E + + Q  +L R       E K+ L 
Sbjct: 97  ESLDKKEENLEKKEKELSNKEKNLDEKEEEL--EELIAEQREELERISGLTQEEAKEILL 154

Query: 384 EQVQALSLQIEAMHTAIRKLEEK 406
           E+V+    + EA    I+++EE+
Sbjct: 155 EEVEE-EARHEAAKL-IKEIEEE 175


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 43.9 bits (104), Expect = 2e-04
 Identities = 38/159 (23%), Positives = 68/159 (42%), Gaps = 13/159 (8%)

Query: 288 DERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEM--EVTGQAFEDMQEQNS 345
            E     D E+   I   +++  QL K++A  K EEE  L     E   +    ++E+  
Sbjct: 153 KEDNLKDDLES--LIASAKEELDQLSKKLAELKAEEEEELERALKEKREELLSKLEEELL 210

Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHT---AIRK 402
             L+      +   +L  ER K  +L K   E+   LR++++  +   E       A++ 
Sbjct: 211 ARLESKEAALEKQLRLEFEREKE-ELRKKYEEK---LRQELERQAEAHEQKLKNELALQA 266

Query: 403 LEEKERFLQTVLTNAEKELHLRNQAMD-LNKR-KAIESA 439
           +E +  F + +    E+E + R   +  LN R K +E A
Sbjct: 267 IELQREFNKEIKEKVEEERNGRLAKLAELNSRLKGLEKA 305



 Score = 34.6 bits (80), Expect = 0.13
 Identities = 31/150 (20%), Positives = 58/150 (38%), Gaps = 21/150 (14%)

Query: 274 EFRKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQE-----EEALLN 328
           E  +    +++ + +   KL +E   R   +      QL+ +    K+E     EE L  
Sbjct: 186 EEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQ 245

Query: 329 EMEVTGQAFED-------MQ--EQNSRLLQQLREKDD---ANFKLMTERIKSN--QLHKL 374
           E+E   +A E        +Q  E      ++++EK +            + S    L K 
Sbjct: 246 ELERQAEAHEQKLKNELALQAIELQREFNKEIKEKVEEERNGRLAKLAELNSRLKGLEKA 305

Query: 375 AREEKDTLRE--QVQALSLQIEAMHTAIRK 402
                +   E  +VQ L L +EA+ +A++ 
Sbjct: 306 LDSRSEAEDENHKVQQLWLAVEALKSALKS 335


>gnl|CDD|132364 TIGR03321, alt_F1F0_F0_B, alternate F1F0 ATPase, F0 subunit B.  A
           small number of taxonomically diverse prokaryotic
           species, including Methanosarcina barkeri, have what
           appears to be a second ATP synthase, in addition to the
           normal F1F0 ATPase in bacteria and A1A0 ATPase in
           archaea. These enzymes use ion gradients to synthesize
           ATP, CC and in principle may run in either direction.
           This model represents the F0 subunit B of this apparent
           second ATP synthase.
          Length = 246

 Score = 43.1 bits (102), Expect = 2e-04
 Identities = 41/171 (23%), Positives = 73/171 (42%), Gaps = 45/171 (26%)

Query: 288 DER-KKLADE--EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQN 344
           D R KK+A E  +A  K ++ E +  + +++     Q+ E LL       +A E+ Q + 
Sbjct: 35  DAREKKIAGELADADTKKREAEQERREYEEKNEELDQQREVLLT------KAKEEAQAER 88

Query: 345 SRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLRE-QVQALSLQIEAMHTAIRKL 403
            RLL +                        AREE D +RE   +AL  +  A+   +R+ 
Sbjct: 89  QRLLDE------------------------AREEADEIREKWQEALRREQAALSDELRRR 124

Query: 404 EEKERFL--QTVLTN-AEKELHL--------RNQAMDLNKRKAIESAQSAA 443
              E F   + VLT+ A+ +L          R + +D +++ A+  A + +
Sbjct: 125 TGAEVFAIARKVLTDLADTDLEERMVDVFVQRLRTLDPDEKAALAEALADS 175


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 43.6 bits (104), Expect = 2e-04
 Identities = 36/143 (25%), Positives = 71/143 (49%), Gaps = 21/143 (14%)

Query: 281 VVRDSKRD--ERKKLADEEAMRKIKQLEDQTHQ--------LQKQVATHKQEEEALLNEM 330
           ++ ++K++    KK A  EA  +I +L ++  +        LQK      Q+EE L  ++
Sbjct: 43  ILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKL 102

Query: 331 EVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAR-------EEKDTLR 383
           E+  +  E+++++   L Q+ +E +    +L  E +   QL +L R       E K+ L 
Sbjct: 103 ELLEKREEELEKKEKELEQKQQELEKKEEEL--EELIEEQLQELERISGLTAEEAKEILL 160

Query: 384 EQVQALSLQIEAMHTAIRKLEEK 406
           E+V+    + EA    I+++EE+
Sbjct: 161 EKVEE-EARHEAAVL-IKEIEEE 181



 Score = 32.1 bits (74), Expect = 0.77
 Identities = 30/139 (21%), Positives = 65/139 (46%), Gaps = 23/139 (16%)

Query: 286 KRDERKKL--ADEEAMRKIKQLEDQTHQLQKQVATH-KQEEEALLNEMEVTGQAFEDMQE 342
           K+    K+  A+EEA R +++ + +   ++K+     K+E   L NE E   +   + + 
Sbjct: 26  KKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFE---KELRERRN 82

Query: 343 QNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRK 402
           +  +L ++L +K         E     +L  L + E++ L ++ + L  + +       +
Sbjct: 83  ELQKLEKRLLQK---------EENLDRKLELLEKREEE-LEKKEKELEQKQQ-------E 125

Query: 403 LEEKERFLQTVLTNAEKEL 421
           LE+KE  L+ ++    +EL
Sbjct: 126 LEKKEEELEELIEEQLQEL 144


>gnl|CDD|192111 pfam08647, BRE1, BRE1 E3 ubiquitin ligase.  BRE1 is an E3 ubiquitin
           ligase that has been shown to act as a transcriptional
           activator through direct activator interactions.
          Length = 96

 Score = 40.0 bits (94), Expect = 2e-04
 Identities = 22/96 (22%), Positives = 46/96 (47%)

Query: 326 LLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQ 385
           L  E+    +AF+++     + + +L   +    +L  E+ K++Q +  A   KD L  +
Sbjct: 1   LQKELVSLEKAFKELSSLALKKVAELTILEQKKLRLTAEKAKADQKYFAAMRSKDALDAE 60

Query: 386 VQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKEL 421
            + L+ Q+      I +L++ E  L+  +   EK+L
Sbjct: 61  NKKLNTQLNKSSELIEQLKDLENELRRRIKELEKQL 96


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 41.9 bits (99), Expect = 3e-04
 Identities = 29/133 (21%), Positives = 62/133 (46%), Gaps = 16/133 (12%)

Query: 282 VRDSKRDE----RKKLADEEAMR-KIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQA 336
           VR  K  E     + L+D  + R ++ +L+ +  +LQ+++A  +++   L  E +   Q 
Sbjct: 43  VRTPKGREGWVLSRYLSDTPSARERLPELQQELAELQEELAELQEQLAELQQENQELKQE 102

Query: 337 FEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAM 396
              ++ +  RL ++L            +++ +N +     EE   LRE++  L  + EA+
Sbjct: 103 LSTLEAELERLQKELAR---------IKQLSANAI--ELDEENRELREELAELKQENEAL 151

Query: 397 HTAIRKLEEKERF 409
                +L+E E+ 
Sbjct: 152 EAENERLQENEQR 164



 Score = 29.6 bits (67), Expect = 2.8
 Identities = 14/80 (17%), Positives = 33/80 (41%), Gaps = 3/80 (3%)

Query: 368 SNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQA 427
             +L +L +EE   L+EQ+  L  + + +   +  LE +   LQ  L   ++      + 
Sbjct: 72  QQELAEL-QEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIEL 130

Query: 428 MDLNKR--KAIESAQSAADL 445
            + N+   + +   +   + 
Sbjct: 131 DEENRELREELAELKQENEA 150


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 43.2 bits (102), Expect = 3e-04
 Identities = 27/162 (16%), Positives = 67/162 (41%), Gaps = 13/162 (8%)

Query: 282 VRDSKRDERKKLADEEA--MRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFED 339
            ++   + + +L + E+  +  +K  E++  +L++     ++ EE +    E+  +  E 
Sbjct: 247 EKERLEELKARLLEIESLELEALKIREEELRELERL---LEELEEKIERLEELEREIEEL 303

Query: 340 MQEQNSRLLQQLREKDD--ANFKLMTERIK--SNQLHKLAREEKDTLREQVQALSLQIEA 395
            +E+   L   L E ++     K + ER++    +L KL  E ++   E+ +   L  E 
Sbjct: 304 -EEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEER 362

Query: 396 MHT---AIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRK 434
           +      + +LE++       L   E+ +    + +      
Sbjct: 363 LKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAA 404



 Score = 40.9 bits (96), Expect = 0.002
 Identities = 33/155 (21%), Positives = 74/155 (47%), Gaps = 18/155 (11%)

Query: 285 SKRDERKKLADEEAMRKIKQLEDQTH---QLQKQVATHKQEE---EALLNEMEVTGQAFE 338
            +  E ++L  EE   KI++LE+      +L++++   +      E LL +++   +  E
Sbjct: 274 EELRELERLL-EELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLE 332

Query: 339 DMQEQN---SRLLQQLREKDDANFKLMTERIKSNQLH-KLAREEKDTLREQVQALSLQIE 394
            ++E+       L++L E+ +   KL+ ER+K  +   +   +E +   E+++ L   I+
Sbjct: 333 KLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQ 392

Query: 395 AMHT-------AIRKLEEKERFLQTVLTNAEKELH 422
            +         A+ +++E+   L+  L   E+EL 
Sbjct: 393 ELKEELAELSAALEEIQEELEELEKELEELERELE 427



 Score = 40.5 bits (95), Expect = 0.002
 Identities = 30/163 (18%), Positives = 69/163 (42%), Gaps = 22/163 (13%)

Query: 284 DSKRDERKKLADEEAMRKIKQLEDQTH--QLQKQVATHKQEEEALLNEMEVTGQAFEDMQ 341
           D +        + + ++K+++++++    +L++++   ++    L  E E   +    + 
Sbjct: 200 DIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLL 259

Query: 342 EQNSRLLQQL--REKDDANFKLMTERI--KSNQLHKLARE----------------EKDT 381
           E  S  L+ L  RE++    + + E +  K  +L +L RE                E + 
Sbjct: 260 EIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEE 319

Query: 382 LREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLR 424
           L E++++L  ++E +   + KLE +   L        K L  R
Sbjct: 320 LLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEER 362



 Score = 38.6 bits (90), Expect = 0.008
 Identities = 34/175 (19%), Positives = 76/175 (43%), Gaps = 15/175 (8%)

Query: 285 SKRDERKKLADEEAMRKIKQLEDQTHQLQKQVA-THKQEEEALLNEMEVTGQAFEDMQEQ 343
            +  E ++   EE   K+++LE+   +L++       Q+ +  L ++E   Q  ++   +
Sbjct: 518 IELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKE-LLE 576

Query: 344 NSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSL---------QIE 394
             RLL+  +E+     +   + +K     K   E    L E +Q+L L           E
Sbjct: 577 ELRLLRTRKEE-LEELRERLKELKKKL--KELEERLSQLEELLQSLELSEAENELEEAEE 633

Query: 395 AMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLHL 449
            + + + KL  +   L+ +L  A +EL  + + ++   R+ ++  ++   L+  L
Sbjct: 634 ELESELEKLNLQAE-LEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKL 687



 Score = 38.2 bits (89), Expect = 0.012
 Identities = 34/180 (18%), Positives = 74/180 (41%), Gaps = 25/180 (13%)

Query: 288 DERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRL 347
           +E  + A EE   K+++LE +  +  +++   +Q EE L    ++  +  E ++E+   L
Sbjct: 649 EELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEE-LEQLREELEEL 707

Query: 348 LQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEK- 406
           L++L E +        E ++S       + E + L+++++ L   +E +   +R+   K 
Sbjct: 708 LKKLGEIEQLI-----EELESR------KAELEELKKELEKLEKALE-LLEELREKLGKA 755

Query: 407 -------ERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLHLVIIYTNGPAR 459
                     L  +   A + L      + LN+                +V++Y  G  R
Sbjct: 756 GLRADILRNLLAQIEAEANEIL----SKLSLNRYDLRRLTIRKDGNGGLVVVVYDGGEVR 811



 Score = 35.9 bits (83), Expect = 0.054
 Identities = 36/166 (21%), Positives = 82/166 (49%), Gaps = 9/166 (5%)

Query: 282 VRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQ 341
           +R+  ++ +KKL + E      +   Q+ +L +     ++ EE L +E+E       +++
Sbjct: 591 LRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELE-KLNLQAELE 649

Query: 342 EQNSRLLQQLREKD---DANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHT 398
           E     L++L EK    +A  +   +RI++ +  +   EE + L E+++ L  ++E    
Sbjct: 650 ELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELE---E 706

Query: 399 AIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAAD 444
            ++KL E E+ ++  L + + EL    + ++  + KA+E  +   +
Sbjct: 707 LLKKLGEIEQLIEE-LESRKAELEELKKELEKLE-KALELLEELRE 750



 Score = 35.9 bits (83), Expect = 0.064
 Identities = 26/160 (16%), Positives = 71/160 (44%), Gaps = 13/160 (8%)

Query: 285 SKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQN 344
            + +  + L +E     +++L ++   L++++   +++ E L +E+E       + + + 
Sbjct: 305 EELEGLRALLEE-----LEELLEKLKSLEERLEKLEEKLEKLESELE----ELAEEKNEL 355

Query: 345 SRLLQQL-REKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKL 403
           ++LL++  +E ++   +L  E  K+ +  K   E    ++E  + L+    A+     +L
Sbjct: 356 AKLLEERLKELEERLEELEKELEKALERLKQLEEA---IQELKEELAELSAALEEIQEEL 412

Query: 404 EEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAA 443
           EE E+ L+ +    E+      +  +   +   +    A 
Sbjct: 413 EELEKELEELERELEELEEEIKKLEEQINQLESKELMIAE 452



 Score = 35.1 bits (81), Expect = 0.090
 Identities = 38/173 (21%), Positives = 74/173 (42%), Gaps = 23/173 (13%)

Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATH-------KQEEEALLNEMEVTGQAFE 338
            R++ +    EE     K+L +   +L + +          +++ E L N +E   +  E
Sbjct: 491 SREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKE 550

Query: 339 DMQEQN--------SRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALS 390
            +Q Q            LQ+L+E  +   +L   R +  +L +L RE    L+++++ L 
Sbjct: 551 KLQLQQLKEELRQLEDRLQELKELLE---ELRLLRTRKEELEEL-RERLKELKKKLKELE 606

Query: 391 LQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAA 443
            ++  +   ++ LE  E   +  L  AE+EL    +   LN +  +E    AA
Sbjct: 607 ERLSQLEELLQSLELSEAENE--LEEAEEELESELEK--LNLQAELEELLQAA 655



 Score = 33.6 bits (77), Expect = 0.30
 Identities = 29/170 (17%), Positives = 64/170 (37%), Gaps = 19/170 (11%)

Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ-- 343
           + +E  +   EE       LE+   +L++      +E E  L E+E   +  E+   Q  
Sbjct: 386 QLEEAIQELKEELAELSAALEEIQEELEELEKEL-EELERELEELEEEIKKLEEQINQLE 444

Query: 344 -NSRLLQQLR---------------EKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQ 387
               ++ +L                E +    +L    ++  +      +E+  LRE+++
Sbjct: 445 SKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSREKEEAELREEIE 504

Query: 388 ALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIE 437
            L  ++  +   + +L E E  L+  L    ++L    + ++  K K   
Sbjct: 505 ELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQL 554


>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
            This is a family of proteins that form part of the CAZ
           (cytomatrix at the active zone) complex which is
           involved in determining the site of synaptic vesicle
           fusion. The C-terminus is a PDZ-binding motif that binds
           directly to RIM (a small G protein Rab-3A effector). The
           family also contains four coiled-coil domains.
          Length = 774

 Score = 42.3 bits (99), Expect = 6e-04
 Identities = 40/172 (23%), Positives = 68/172 (39%), Gaps = 20/172 (11%)

Query: 289 ERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLL 348
           + K    +E  + +++ E++ ++LQ +  T + +   L  E+E    A   M+ +  R  
Sbjct: 234 DEKDTKIKEFEKMLEKAENEIYRLQSRCDTSEADRNRLDKEVEAERSALAAMKAKCDRAA 293

Query: 349 QQLREKDDANFKLMTE-RIKSNQ-------LHKL------AREEKDTLREQVQALSLQIE 394
           Q+L  K      L TE    +NQ       L KL      A +EK  L+ +V AL  ++E
Sbjct: 294 QELSRKKTELLGLQTELETLANQDSDMRQHLDKLKEDLTRAEQEKAILQTEVDALRYELE 353

Query: 395 AMHTAIRKLEEKERFLQTVLTNAEKELH-----LRNQAMDLNK-RKAIESAQ 440
             H  + K     +  Q        E+             L   +K IE+ Q
Sbjct: 354 RKHNTLTKKTASLQAAQEEKATYAGEIEDMRDRYEKTERKLRVLQKKIENLQ 405



 Score = 31.9 bits (72), Expect = 1.0
 Identities = 34/156 (21%), Positives = 63/156 (40%), Gaps = 15/156 (9%)

Query: 300 RKIKQLEDQTHQLQ---KQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDD 356
           RK   L  +T  LQ   ++ AT+  E E + +  E T +    +Q++   L +  R K+ 
Sbjct: 354 RKHNTLTKKTASLQAAQEEKATYAGEIEDMRDRYEKTERKLRVLQKKIENLQETFRRKER 413

Query: 357 ANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIR-----KLEEKERF-- 409
              K   ER++S Q         + L + +      IE +          + EE E +  
Sbjct: 414 -RLKEEKERLRSLQTDTNTDTALEKLEKALAEKERIIERLKEQRDRDERYEQEEFETYKK 472

Query: 410 ----LQTVLTNAEKELHLRNQAMDLNKRKAIESAQS 441
               L+  + N + +L  R   ++L K +  + A +
Sbjct: 473 EFEDLKEEVQNLQLKLSERELQLELLKEEVSKLASN 508



 Score = 31.5 bits (71), Expect = 1.1
 Identities = 31/142 (21%), Positives = 60/142 (42%), Gaps = 7/142 (4%)

Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEAL------LNEMEVTGQAFEDMQEQNSRLLQ 349
           E    ++  L D   +LQ ++   +Q  +A       L EM  +       +E++S  L 
Sbjct: 124 ERQAMELFLLRDTLEKLQLRMERLQQTLDARDEEIKKLLEMLQSKGLSAKAEEEDSERLG 183

Query: 350 QLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERF 409
           ++RE +D    L     +  + H + REE    + Q++    + EA    I + + K + 
Sbjct: 184 RIREAEDQQSHLEVLLEQKEKEHMMLREEIHQ-KLQMERDDAKTEASQKLIDEKDTKIKE 242

Query: 410 LQTVLTNAEKELHLRNQAMDLN 431
            + +L  AE E++      D +
Sbjct: 243 FEKMLEKAENEIYRLQSRCDTS 264



 Score = 31.1 bits (70), Expect = 1.7
 Identities = 30/160 (18%), Positives = 66/160 (41%), Gaps = 19/160 (11%)

Query: 277 KFACVVRDSKRDERKKL--------ADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLN 328
           + A +  +  R++ +KL        A+ E+  +   ++  T + +   A  + E + LL+
Sbjct: 517 ERAHIELEKIREKHEKLEKELKRLRANPESADRGSAVDAGTSRSRADSAGARNEVDRLLD 576

Query: 329 EMEVTGQAFEDMQEQNSRLLQQL---------REKDDANFKLMTERIKSNQLHKLAREEK 379
            +E   Q  +D + +  RL ++L         +++     ++  ER  +  L +  R EK
Sbjct: 577 RLEKAEQERDDTEMEAGRLAKELEKAQRHLTKQQEKTEATRIEFERKSAELLEEAERLEK 636

Query: 380 DTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEK 419
               E+    S QI     A     E    +Q + ++ ++
Sbjct: 637 SEAEEETLRQSTQIGHAQAAAHNHIEHH--VQKLESDLKQ 674



 Score = 30.8 bits (69), Expect = 1.9
 Identities = 28/128 (21%), Positives = 48/128 (37%), Gaps = 9/128 (7%)

Query: 313 QKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLH 372
           + Q +  + E E    E+ +     E +Q +  RL Q L  +D+   KL+          
Sbjct: 113 EGQFSRIQAERERQAMELFLLRDTLEKLQLRMERLQQTLDARDEEIKKLLEMLQSKGLSA 172

Query: 373 KLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQ-AMDLN 431
           K   E+ + L    +A   Q          LE+KE+          +E+H + Q   D  
Sbjct: 173 KAEEEDSERLGRIREAEDQQSHLEVL----LEQKEKEHMM----LREEIHQKLQMERDDA 224

Query: 432 KRKAIESA 439
           K +A +  
Sbjct: 225 KTEASQKL 232


>gnl|CDD|131391 TIGR02338, gimC_beta, prefoldin, beta subunit, archaeal.
           Chaperonins are cytosolic, ATP-dependent molecular
           chaperones, with a conserved toroidal architecture, that
           assist in the folding of nascent and/or denatured
           polypeptide chains. The group I chaperonin system
           consists of GroEL and GroES, and is found (usually) in
           bacteria and organelles of bacterial origin. The group
           II chaperonin system, called the thermosome in Archaea
           and TRiC or CCT in the Eukaryota, is structurally
           similar but only distantly related. Prefoldin, also
           called GimC, is a complex in Archaea and Eukaryota, that
           works with group II chaperonins. Members of this protein
           family are the archaeal clade of the beta class of
           prefoldin subunit. Closely related, but outside the
           scope of this family are the eukaryotic beta-class
           prefoldin subunits, Gim-1,3,4 and 6. The alpha class
           prefoldin subunits are more distantly related.
          Length = 110

 Score = 38.5 bits (90), Expect = 0.001
 Identities = 32/122 (26%), Positives = 53/122 (43%), Gaps = 19/122 (15%)

Query: 297 EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDD 356
           +   ++ QL+    QLQ  VAT KQ+ EA L E E              + L++L    D
Sbjct: 4   QVQNQLAQLQQLQQQLQA-VATQKQQVEAQLKEAE--------------KALEELERLPD 48

Query: 357 ANFKLMTERIKSNQLHKLAREE-KDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLT 415
                   +   N L K  +EE    L+E+ + L L+++ +     +L E+ + LQ  + 
Sbjct: 49  ---DTPVYKSVGNLLVKTDKEEAIQELKEKKETLELRVKTLQRQEERLREQLKELQEKIQ 105

Query: 416 NA 417
            A
Sbjct: 106 EA 107


>gnl|CDD|118696 pfam10168, Nup88, Nuclear pore component.  Nup88 can be divided
           into two structural domains; the N-terminal two-thirds
           of the protein has no obvious structural motifs but is
           the region for binding to Nup98, one of the components
           of the nuclear pore. the C-terminal end is a predicted
           coiled-coil domain. Nup88 is overexpressed in tumour
           cells.
          Length = 717

 Score = 41.4 bits (97), Expect = 0.001
 Identities = 27/151 (17%), Positives = 73/151 (48%), Gaps = 15/151 (9%)

Query: 290 RKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQ 349
           +  LA EE  R++K L+ Q  +  + +   ++E ++L    E   + FE+ +     L+ 
Sbjct: 555 KHDLAREEFQRRVKLLQLQKEKQLEDIQDCREERKSLSERAEKLAEKFEEAKYNQELLVN 614

Query: 350 QLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERF 409
           + +       +L+  +  ++QL  L+  E+D + +++Q ++ Q++ +   I+++++K+ +
Sbjct: 615 RCK-------RLL--QSANSQLPVLSDSERD-MSKELQRINKQLQHLANGIKQVKKKKNY 664

Query: 410 LQTVLTNAEKELHLRNQAMDLN--KRKAIES 438
            +  +     +   +  +  L   + K I  
Sbjct: 665 QRYHM---ASQESPKKSSYTLPEKQHKTITE 692


>gnl|CDD|109486 pfam00430, ATP-synt_B, ATP synthase B/B' CF(0).  Part of the CF(0)
           (base unit) of the ATP synthase. The base unit is
           thought to translocate protons through membrane (inner
           membrane in mitochondria, thylakoid membrane in plants,
           cytoplasmic membrane in bacteria). The B subunits are
           thought to interact with the stalk of the CF(1)
           subunits. This domain should not be confused with the ab
           CF(1) proteins (in the head of the ATP synthase) which
           are found in pfam00006.
          Length = 132

 Score = 38.8 bits (91), Expect = 0.001
 Identities = 24/111 (21%), Positives = 52/111 (46%), Gaps = 17/111 (15%)

Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
           +R E+     +EA  ++KQ      + ++Q+A  + E   ++N  +   Q          
Sbjct: 30  ERKEKIANNIKEAEERLKQAAALLAEAEQQLAQARAEASEIINNAKKEAQ---------- 79

Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDT---LREQVQALSLQI 393
           +L +++    +A      ER+  +   ++ +E++     LR+QV AL++QI
Sbjct: 80  KLKEEIL--AEAQKDA--ERLLESARAEIEQEKEQALAELRQQVAALAVQI 126


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 40.1 bits (94), Expect = 0.001
 Identities = 30/147 (20%), Positives = 65/147 (44%), Gaps = 16/147 (10%)

Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
           EEA R+ ++LE++  Q+++ +   ++E    L E E T    E+  +Q     Q L EK 
Sbjct: 1   EEAEREQQELEERMEQMEEDMRRAQKE----LEEYEETALELEEKLKQEEEEAQLL-EKK 55

Query: 356 DANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAI-RKLEEKERFLQTVL 414
               +    R++         EE++ L  +V   + ++  +     +K  E  +      
Sbjct: 56  ADELEEENRRLEEEAAAS--EEERERLEAEVDEATAEVAKLEEEREKKEAETRQL----- 108

Query: 415 TNAEKELHLRNQAMDLNKRKAIESAQS 441
              ++EL    +A +  +++ +E+A +
Sbjct: 109 ---QQELREAQEAHERARQELLEAAAA 132



 Score = 36.6 bits (85), Expect = 0.021
 Identities = 30/168 (17%), Positives = 60/168 (35%), Gaps = 7/168 (4%)

Query: 289 ERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLL 348
            R +   EE      +LE++  Q +++    +++ + L  E     +     +E+  RL 
Sbjct: 22  RRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEEENRRLEEEAAASEEERERLE 81

Query: 349 QQLREKDDANFKLMTERIK----SNQLHKLAREEKDTLREQVQALSLQIEAMHTA---IR 401
            ++ E      KL  ER K    + QL +  RE ++      Q L     A         
Sbjct: 82  AEVDEATAEVAKLEEEREKKEAETRQLQQELREAQEAHERARQELLEAAAAPTAPPHVAA 141

Query: 402 KLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLHL 449
            +  ++         A  +L       D ++ + +  A+    L+  L
Sbjct: 142 PVNGEQLEPDDNGEEASADLETDPDMKDRSEEERVTYAEKNERLQTQL 189


>gnl|CDD|218852 pfam06008, Laminin_I, Laminin Domain I.  coiled-coil structure. It
           has been suggested that the domains I and II from
           laminin A, B1 and B2 may come together to form a triple
           helical coiled-coil structure.
          Length = 263

 Score = 40.1 bits (94), Expect = 0.002
 Identities = 29/142 (20%), Positives = 53/142 (37%), Gaps = 21/142 (14%)

Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
           +     IK L D   ++ +++A   +      N   + G A   M  +  R+L ++RE+D
Sbjct: 89  KTLAEAIKNLRDNIKEILEKLAGLGE------NAFALPGSALSRMLAEAQRMLGEIRERD 142

Query: 356 DANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIR--------KLEEKE 407
                   +   +    K A E  + ++   Q    + E +  A+R        KL +  
Sbjct: 143 -----FQKQLQNAEAELKAAEELLNRVQTWFQTPQEENEGLFEALRDDLNEYEAKLSDLR 197

Query: 408 RFLQTVL--TNAEKELHLRNQA 427
             L      T   + L+L NQ 
Sbjct: 198 ELLDEAQAKTRDAERLNLANQK 219



 Score = 36.6 bits (85), Expect = 0.019
 Identities = 30/148 (20%), Positives = 50/148 (33%), Gaps = 32/148 (21%)

Query: 297 EAMRKI-KQLEDQTHQLQ---KQVATHKQEEEALLNEMEVTGQAFEDMQEQ---NSRLLQ 349
            A  KI   +E+ T QL      ++  K++ E L   +    Q  E++Q++        Q
Sbjct: 16  PAPYKILYNIENLTQQLSGRLSPISATKRQLEDLEKALSKLAQDAENLQKKANQALAKAQ 75

Query: 350 QLREKDDANFKLMTERIKS----------------NQLHKLAREEKDTLREQVQALSLQI 393
           ++     A  +      K+                 +L  L           +  +  + 
Sbjct: 76  KVN----AAIERTLGHAKTLAEAIKNLRDNIKEILEKLAGLGENAFALPGSALSRMLAEA 131

Query: 394 EAMHTAIRKLEEKERFLQTVLTNAEKEL 421
           + M   IR     ER  Q  L NAE EL
Sbjct: 132 QRMLGEIR-----ERDFQKQLQNAEAEL 154


>gnl|CDD|221016 pfam11180, DUF2968, Protein of unknown function (DUF2968).  This
           family of proteins has no known function.
          Length = 192

 Score = 39.3 bits (92), Expect = 0.002
 Identities = 31/114 (27%), Positives = 45/114 (39%), Gaps = 34/114 (29%)

Query: 293 LADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ--NSRLLQQ 350
           LAD E  R   QLE Q  QL++Q+A  +   E L  ++++        QEQ   +R  Q 
Sbjct: 100 LADVEIRRA--QLEAQKAQLERQIAASEARAERLQADLQLAQA-----QEQQVAARQQQ- 151

Query: 351 LREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLE 404
                                   AR+E   L  Q QA   Q+  +   IR+L+
Sbjct: 152 ------------------------ARQEAVALEAQRQAAQAQLRKLQRQIRQLQ 181


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 40.3 bits (95), Expect = 0.002
 Identities = 27/128 (21%), Positives = 56/128 (43%), Gaps = 13/128 (10%)

Query: 282 VRDSKRDERKKLADEEAMRKIKQ-LEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDM 340
           ++    +E ++   +E  R+ +  L++Q  + +K+    ++E E  L E E   +  E +
Sbjct: 52  LKALAEEEERERKRKEERREGRAVLQEQIEEREKR---RQEEYEERLQEREQMDEIIERI 108

Query: 341 QEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAI 400
           QE++    Q+ REK     KL  E  + N+     +E      E+ +    +   +    
Sbjct: 109 QEEDEAEAQEKREKQK---KLREEIDEFNEERIERKE------EEKEREREEELKILEYQ 159

Query: 401 RKLEEKER 408
           R+  E+E 
Sbjct: 160 REKAEREE 167



 Score = 36.4 bits (85), Expect = 0.028
 Identities = 24/127 (18%), Positives = 50/127 (39%), Gaps = 18/127 (14%)

Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
           +R ER+K   EEA ++ +Q ++     ++Q+   ++  +    E E      E M E+ +
Sbjct: 214 ERKERQKEK-EEAEKRRRQKQELQRAREEQIEEKEERLQEERAEEE---AERERMLEKQA 269

Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLA----REEKDTLREQVQALSLQIEAMHTAIR 401
              +  +E          +R      H+       EEK+  R   +   L+       +R
Sbjct: 270 EDEELEQENA-------EKRRMKRLEHRRELEQQIEEKEERRAAEREEELEEGER---LR 319

Query: 402 KLEEKER 408
           + E + +
Sbjct: 320 EEEAERQ 326



 Score = 36.4 bits (85), Expect = 0.028
 Identities = 27/141 (19%), Positives = 57/141 (40%), Gaps = 9/141 (6%)

Query: 303 KQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD-DANFKL 361
           K+ + Q  + ++  A  K+EE  +   ME         +E+  R  ++ R +      + 
Sbjct: 20  KERDAQIEEKKRIKAEEKEEERRIDEMMEEERLKALAEEEERERKRKEERREGRAVLQEQ 79

Query: 362 MTERIKSNQLHKLAREEKDTLREQVQALSLQI--EAMHTAIRKLEEKERFLQTVL-TNAE 418
           + ER    +  +   EE+   REQ+  +  +I  E    A  K E++++  + +   N E
Sbjct: 80  IEERE---KRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREKQKKLREEIDEFNEE 136

Query: 419 KELH--LRNQAMDLNKRKAIE 437
           +        +     + K +E
Sbjct: 137 RIERKEEEKEREREEELKILE 157



 Score = 34.9 bits (81), Expect = 0.092
 Identities = 23/121 (19%), Positives = 49/121 (40%), Gaps = 8/121 (6%)

Query: 289 ERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAF-EDMQEQNSRL 347
           ER+K   EE     ++   +  Q+ + +   ++E+EA   E     +   E++ E N   
Sbjct: 82  EREKRRQEEY----EERLQEREQMDEIIERIQEEDEAEAQEKREKQKKLREEIDEFN-EE 136

Query: 348 LQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKE 407
             + +E++    +    +I   Q  K  REE+     + +    + E     +R  +E+ 
Sbjct: 137 RIERKEEEKEREREEELKILEYQREKAEREEE--REAERRERKEEKEREVARLRAQQEEA 194

Query: 408 R 408
            
Sbjct: 195 E 195



 Score = 34.1 bits (79), Expect = 0.14
 Identities = 25/150 (16%), Positives = 63/150 (42%), Gaps = 12/150 (8%)

Query: 289 ERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLL 348
           ER++  + + +   ++  ++  + + +    K+E+E  +  +    +  ED +E+   L 
Sbjct: 146 EREREEELKILEYQREKAEREEEREAERRERKEEKEREVARLRAQQEEAEDEREELDELR 205

Query: 349 QQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQV-QALSLQIEA-MHTAIRKLEEK 406
             L +++        ER K  Q  K   E++   ++++ +A   QIE        +  E+
Sbjct: 206 ADLYQEEY-------ER-KERQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERAEE 257

Query: 407 ERFLQTVLTNAEKELHLRNQAMDLNKRKAI 436
           E   + +L   ++      +  +  KR+  
Sbjct: 258 EAERERML--EKQAEDEELEQENAEKRRMK 285



 Score = 33.3 bits (77), Expect = 0.26
 Identities = 26/156 (16%), Positives = 61/156 (39%), Gaps = 11/156 (7%)

Query: 287 RDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSR 346
           ++E +  A E+  ++ K  E+     ++++   ++E+E    E       ++  + +   
Sbjct: 109 QEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKI-LEYQREKAEREE 167

Query: 347 LLQQLREKDDANFKLMTERIKSNQLHKLA-REEKDTLREQVQALSLQIEAMHTAIRKLEE 405
             +  R +     +    R+++ Q      REE D LR       L  E      R+ E+
Sbjct: 168 EREAERRERKEEKEREVARLRAQQEEAEDEREELDELRAD-----LYQEEYERKERQKEK 222

Query: 406 KERFLQTVLTNAEKELHL-RNQAMDLNKRKAIESAQ 440
           +E   +      ++EL   R + ++  + +  E   
Sbjct: 223 EEAEKR---RRQKQELQRAREEQIEEKEERLQEERA 255



 Score = 29.1 bits (66), Expect = 6.0
 Identities = 10/74 (13%), Positives = 32/74 (43%), Gaps = 2/74 (2%)

Query: 281 VVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDM 340
           ++     DE  +   E A ++  +  +   +L++Q+   ++   A   E    G+   + 
Sbjct: 264 MLEKQAEDEELEQ--ENAEKRRMKRLEHRRELEQQIEEKEERRAAEREEELEEGERLREE 321

Query: 341 QEQNSRLLQQLREK 354
           + +    +++ R++
Sbjct: 322 EAERQARIEEERQR 335


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 40.4 bits (95), Expect = 0.002
 Identities = 25/124 (20%), Positives = 54/124 (43%), Gaps = 2/124 (1%)

Query: 282 VRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQ 341
           +RD   + R  +A +    + + L +    L+++    ++E   L +E+E   +A ED +
Sbjct: 326 LRDRLEECR--VAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRR 383

Query: 342 EQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIR 401
           E+   L +++ E  +       +   +    +  REE+D LRE+   L   +      + 
Sbjct: 384 EEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARERVE 443

Query: 402 KLEE 405
           + E 
Sbjct: 444 EAEA 447



 Score = 36.2 bits (84), Expect = 0.046
 Identities = 31/129 (24%), Positives = 61/129 (47%), Gaps = 20/129 (15%)

Query: 283 RDSKRDERKKLADEEAMRK--IKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDM 340
            +  R++R+ LA+    R+  + +  ++  +L+ +       +EA + E     +  E+ 
Sbjct: 608 IERLREKREALAELNDERRERLAEKRERKRELEAEF------DEARIEEAREDKERAEEY 661

Query: 341 QEQNSRLLQQLREKDDANFKLMTERIKS--NQLHKLA--REEKDTLREQVQALSLQIEAM 396
            EQ    L +LRE+ D     +   I +  N+L +L   RE ++ L  +V+AL    EA+
Sbjct: 662 LEQVEEKLDELREERDD----LQAEIGAVENELEELEELRERREALENRVEAL----EAL 713

Query: 397 HTAIRKLEE 405
           +    +LE 
Sbjct: 714 YDEAEELES 722



 Score = 31.5 bits (72), Expect = 1.3
 Identities = 36/195 (18%), Positives = 68/195 (34%), Gaps = 48/195 (24%)

Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
             +ER +   EEA     +LE +  + ++ V   ++E E L  E+E   + F D      
Sbjct: 353 DLEERAEELREEA----AELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLG 408

Query: 346 RL---LQQLRE-KDDANFKLMT---------ERIKSNQLHKLA----------------- 375
                L++LRE +D+   +            ER++  +    A                 
Sbjct: 409 NAEDFLEELREERDELREREAELEATLRTARERVEEAEALLEAGKCPECGQPVEGSPHVE 468

Query: 376 -----REEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELH-LRNQAMD 429
                RE  + L  +++ L  ++E +   + + E+        L  AE  +  L  +  D
Sbjct: 469 TIEEDRERVEELEAELEDLEEEVEEVEERLERAED--------LVEAEDRIERLEERRED 520

Query: 430 LNKRKAIESAQSAAD 444
           L +  A         
Sbjct: 521 LEELIAERRETIEEK 535


>gnl|CDD|200948 pfam00038, Filament, Intermediate filament protein. 
          Length = 312

 Score = 39.9 bits (94), Expect = 0.002
 Identities = 34/170 (20%), Positives = 76/170 (44%), Gaps = 35/170 (20%)

Query: 291 KKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVT----------------- 333
           +K  DE  + ++  LE +   LQ+++A  K+  E  + E++                   
Sbjct: 109 RKDLDEATLARV-DLEMKIESLQEELAFLKKNHEEEVRELQSQIQGQVNVEMDAARKLDL 167

Query: 334 GQAFEDMQEQNSRLLQQLREKDDANFKLMTERI-----KSNQLHKLAREEKDTLREQVQA 388
            +A  +++ Q   L ++ R++ +  +K   E +     ++    + A+EE   LR Q+Q+
Sbjct: 168 TKALAEIRAQYEELAEKNRQEAEEWYKSKLEELQQAAARNGDALRSAKEEITELRRQIQS 227

Query: 389 LSLQIEAMHTAIRKLEE-----KERF------LQTVLTNAEKEL-HLRNQ 426
           L ++++++      LE      +ER+       Q  ++  E+EL  L+ +
Sbjct: 228 LEIELQSLKKQKASLERQLAELEERYELELADYQDTISELEEELQQLKAE 277


>gnl|CDD|177131 MTH00059, ND2, NADH dehydrogenase subunit 2; Provisional.
          Length = 289

 Score = 39.1 bits (92), Expect = 0.003
 Identities = 28/117 (23%), Positives = 47/117 (40%), Gaps = 16/117 (13%)

Query: 118 FGIFRLSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVFV 177
           FG+F      +VY  FS    +SS V ++    + +F      ++ CF +     S VFV
Sbjct: 96  FGLFPFM--FWVYRVFS----NSSWVVIWLISVVMKFP----VLFFCFFLQGGGLSLVFV 145

Query: 178 YHCFSIFRFSSSTVFVY-----HCFSIFRFSSSTVFVYHCFSIFRFSSSTIFVYHCF 229
               +I  F     + Y     + +     SSS   V  CF +       +++Y+ F
Sbjct: 146 DCFLTIL-FCGFLFWFYSLNWEYVWCHISLSSSATLVVACFCLSIDLCFFLYLYYFF 201



 Score = 34.1 bits (79), Expect = 0.12
 Identities = 28/114 (24%), Positives = 45/114 (39%), Gaps = 18/114 (15%)

Query: 133 FSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVF 192
           F +F F     +VY  FS    +SS V ++    + +F      ++ CF +     S VF
Sbjct: 96  FGLFPFM---FWVYRVFS----NSSWVVIWLISVVMKFP----VLFFCFFLQGGGLSLVF 144

Query: 193 VYHCFSIFRFSSSTVFVY-----HCFSIFRFSSSTIFVYHCFDI-FSLTSTLFV 240
           V    +I  F     + Y     + +     SSS   V  CF +   L   L++
Sbjct: 145 VDCFLTIL-FCGFLFWFYSLNWEYVWCHISLSSSATLVVACFCLSIDLCFFLYL 197



 Score = 30.3 bits (69), Expect = 2.2
 Identities = 27/110 (24%), Positives = 44/110 (40%), Gaps = 17/110 (15%)

Query: 149 FSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVF 208
           F +F F     +VY  FS    +SS V ++    + +F      ++ CF +     S VF
Sbjct: 96  FGLFPFM---FWVYRVFS----NSSWVVIWLISVVMKFP----VLFFCFFLQGGGLSLVF 144

Query: 209 VYHCFSI------FRFSSSTIFVYHCFDIFSLTSTLFVSICTSAFQMSFY 252
           V    +I      F F S       C    S ++TL V+    +  + F+
Sbjct: 145 VDCFLTILFCGFLFWFYSLNWEYVWCHISLSSSATLVVACFCLSIDLCFF 194


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 39.7 bits (93), Expect = 0.003
 Identities = 34/209 (16%), Positives = 76/209 (36%), Gaps = 29/209 (13%)

Query: 265 SASNIYGSLEFRKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEE 324
            AS +  ++    F+    D +  + +K       +KI++ +DQ  +L+KQ+ + + E  
Sbjct: 18  LASLLSAAVLAAAFSAAADDKQLKQIQK-EIAALEKKIREQQDQRAKLEKQLKSLETEIA 76

Query: 325 AL---LNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERI--------------- 366
           +L   L E     +         +  L  L  ++    + + E++               
Sbjct: 77  SLEAQLIETADDLKKLRKQIADLNARLNALEVQEREQRRRLAEQLAALQRSGRNPPPALL 136

Query: 367 -------KSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEK 419
                  +S +L            E++ AL   ++ +     ++  ++  L T+L+    
Sbjct: 137 VSPEDAQRSVRLAIYYGALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRA 196

Query: 420 ELHLRNQAMDLNKRKAIESAQSAADLKLH 448
           +      A  L +RK    AQ  ++L   
Sbjct: 197 Q--QAKLAQLLEERKKTL-AQLNSELSAD 222



 Score = 30.1 bits (68), Expect = 3.5
 Identities = 14/47 (29%), Positives = 23/47 (48%), Gaps = 2/47 (4%)

Query: 287 RDERKKLAD--EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEME 331
           R ++ KLA   EE  + + QL  +    QK++   +  E  L NE+ 
Sbjct: 195 RAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIA 241


>gnl|CDD|236795 PRK10920, PRK10920, putative uroporphyrinogen III
           C-methyltransferase; Provisional.
          Length = 390

 Score = 39.3 bits (92), Expect = 0.003
 Identities = 14/51 (27%), Positives = 23/51 (45%)

Query: 304 QLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREK 354
            L +Q   LQK   + KQE E +L +           Q   ++ L +L++K
Sbjct: 71  ALANQLTALQKAQESQKQELEGILKQQAKALDQANRQQAALAKQLDELQQK 121


>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD.  This
           model describes the DndB protein encoded by an operon
           associated with a sulfur-containing modification to DNA.
           The operon is sporadically distributed in bacteria, much
           like some restriction enzyme operons. DndD is described
           as a putative ATPase. The small number of examples known
           so far include species from among the Firmicutes,
           Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
           metabolism, Restriction/modification].
          Length = 650

 Score = 39.3 bits (92), Expect = 0.004
 Identities = 31/175 (17%), Positives = 60/175 (34%), Gaps = 24/175 (13%)

Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
            R  +K       + +I+ LE +  +  ++     QE   L NE+E          E   
Sbjct: 195 LRRRKKSELPSSILSEIEALEAELKEQSEKYEDLAQEIAHLRNELE----------EAQR 244

Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQ----------ALSLQIEA 395
            L    ++       L  ER +  +  K     +   R Q++           +   +++
Sbjct: 245 SLESLEKKFRSEGGDLFEEREQLERQLKEIEAARKANRAQLRELAADPLPLLLIPNLLDS 304

Query: 396 MHTAIRKLEEKERFLQT--VLTNAEKEL--HLRNQAMDLNKRKAIESAQSAADLK 446
               ++K E+ ++   T   L   +KEL   L   A+     K I +  +  D  
Sbjct: 305 TKAQLQKEEQSQQNQLTQEELEERDKELLESLPKLALPAEHVKEIAAELAEIDKP 359


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 38.5 bits (90), Expect = 0.005
 Identities = 35/165 (21%), Positives = 72/165 (43%), Gaps = 17/165 (10%)

Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
              + K+ +LE +  QL++++   K++ + L  E     +  +++  +   L ++ +E  
Sbjct: 2   LAMLDKLDELELKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELR 61

Query: 356 DANFKLMTE-----------RIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLE 404
           +   ++  E             K  +L K  RE K+   E        I+++   I +LE
Sbjct: 62  EERDEINEEVQELKEKRDEINAKLQELRKEYRELKEKRNEFNLGG-RSIKSLEREIERLE 120

Query: 405 EKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLHL 449
           +K+   QT +   E+E  L  +  +L  RK +E A+ A +    L
Sbjct: 121 KKQ---QTSVLTPEEERELVQKIKEL--RKELEDAKKALEENEKL 160



 Score = 34.3 bits (79), Expect = 0.13
 Identities = 29/154 (18%), Positives = 65/154 (42%), Gaps = 30/154 (19%)

Query: 290 RKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQ 349
           RK+L   E  +K  +  ++  +L+ ++   K++   +  +++   +   + QE +  +++
Sbjct: 144 RKEL---EDAKKALEENEKLKELKAEIDELKKKAREIHEKIQ---ELANEAQEYHEEMIK 197

Query: 350 QLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERF 409
              E D+                   R+E D L E+   LS +I+ +H   R L+ + R 
Sbjct: 198 LFEEADEL------------------RKEADELHEEFVELSKKIDELHEEFRNLQNELRE 239

Query: 410 LQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAA 443
           L+  +    K L  + +     +R+  E  +  A
Sbjct: 240 LEKKI----KALRAKEK--AAKRREKREELKERA 267



 Score = 31.2 bits (71), Expect = 1.3
 Identities = 21/89 (23%), Positives = 42/89 (47%), Gaps = 8/89 (8%)

Query: 285 SKRDERKKLADE------EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFE 338
            + DE +K ADE      E  +KI +L ++   LQ ++   +++ +AL  + +   +  +
Sbjct: 200 EEADELRKEADELHEEFVELSKKIDELHEEFRNLQNELRELEKKIKALRAKEKAAKRREK 259

Query: 339 DMQEQNSRLLQQLREKDDANFKLMTERIK 367
              E+     +++ EK     KL TE + 
Sbjct: 260 R--EELKERAEEIYEKFKRGEKLTTEELL 286


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 38.9 bits (91), Expect = 0.007
 Identities = 30/150 (20%), Positives = 66/150 (44%), Gaps = 18/150 (12%)

Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
            EA+ K+K+ E    +   +    ++E       ++   +  E ++E+NS L ++L E  
Sbjct: 391 AEALSKVKEEERPREKEGTE-EEERREITVYEKRIKKLEETVERLEEENSELKRELEELK 449

Query: 356 DANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLT 415
               KL       ++L +  RE +D +R+       +I A    I +LE++    +  + 
Sbjct: 450 REIEKL------ESELERFRREVRDKVRK-----DREIRARDRRIERLEKELEEKKKRVE 498

Query: 416 NAEKELH--LRNQAMDLNKR----KAIESA 439
             E++L    + + ++L+ +    K +E  
Sbjct: 499 ELERKLAELRKMRKLELSGKGTPVKVVEKL 528



 Score = 30.1 bits (68), Expect = 3.3
 Identities = 24/123 (19%), Positives = 56/123 (45%), Gaps = 14/123 (11%)

Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEA-LLNEMEVTGQAFEDMQEQN 344
           +  ER +  + E  R++++L+ +  +L+ ++   ++E    +  + E+          + 
Sbjct: 429 ETVERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRKDREI-----RARDRRI 483

Query: 345 SRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMH-TAIRKL 403
            RL ++L EK     K   E ++  +L +L +  K  L  +   +   +E +   AI + 
Sbjct: 484 ERLEKELEEK-----KKRVEELE-RKLAELRKMRKLELSGKGTPVK-VVEKLTLEAIEEA 536

Query: 404 EEK 406
           EE+
Sbjct: 537 EEE 539


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 38.6 bits (89), Expect = 0.008
 Identities = 47/167 (28%), Positives = 79/167 (47%), Gaps = 12/167 (7%)

Query: 286  KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
            K DE KK A EEA +K    + +  + +K     K E EA  +E E    A E+  E   
Sbjct: 1316 KADEAKKKA-EEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAE----AAEEKAEAAE 1370

Query: 346  RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
            +  ++ ++K DA  K   E+ K+++  K A E+K    E  +A + + +A   A +K EE
Sbjct: 1371 KKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKA-DEAKKKAEE 1429

Query: 406  KERF--LQTVLTNAEKELHLRNQAMDLNK----RKAIESAQSAADLK 446
            K++    +     A+K    + +A +  K    +K  E A+ A + K
Sbjct: 1430 KKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAK 1476



 Score = 37.0 bits (85), Expect = 0.026
 Identities = 33/174 (18%), Positives = 68/174 (39%), Gaps = 21/174 (12%)

Query: 282  VRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQ 341
              +++ +E  KL +EE   K ++ +       K     K EEE    E        +  +
Sbjct: 1589 AEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQ------LKKKE 1642

Query: 342  EQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIR 401
             +  +  ++L++ ++ N      +IK+ +  K A E+K    E  +A   + +A     +
Sbjct: 1643 AEEKKKAEELKKAEEEN------KIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKK 1696

Query: 402  KLEEKERFLQTVLTNAE---------KELHLRNQAMDLNKRKAIESAQSAADLK 446
            + EE ++  +     AE         K         +  K++A E  + A + K
Sbjct: 1697 EAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAK 1750



 Score = 33.2 bits (75), Expect = 0.49
 Identities = 38/163 (23%), Positives = 73/163 (44%), Gaps = 9/163 (5%)

Query: 286  KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
            K +E KK  +++   + K+ E+  +   ++    K+ EEA + E+    +  + M+ + +
Sbjct: 1553 KAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEA 1612

Query: 346  RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
            +  ++ + K +   K   E+ K  QL K   EEK       +A  L+       I+  EE
Sbjct: 1613 KKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKK------KAEELKKAEEENKIKAAEE 1666

Query: 406  --KERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLK 446
              K    +     A+K      +A +  K++A E A+ A +LK
Sbjct: 1667 AKKAEEDKKKAEEAKKAEEDEKKAAEALKKEA-EEAKKAEELK 1708



 Score = 31.3 bits (70), Expect = 1.7
 Identities = 27/138 (19%), Positives = 63/138 (45%), Gaps = 4/138 (2%)

Query: 286  KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFED---MQE 342
            K+ E+ K  + E  +K ++L+    + + + A   ++ E    + E   +A ED     E
Sbjct: 1633 KKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAE 1692

Query: 343  QNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRK 402
               +  ++ ++ ++   K   E+ K+ +L K A EE     E+ +  + + +      +K
Sbjct: 1693 ALKKEAEEAKKAEELKKKEAEEKKKAEELKK-AEEENKIKAEEAKKEAEEDKKKAEEAKK 1751

Query: 403  LEEKERFLQTVLTNAEKE 420
             EE+++ +  +    EK+
Sbjct: 1752 DEEEKKKIAHLKKEEEKK 1769



 Score = 30.5 bits (68), Expect = 2.9
 Identities = 37/166 (22%), Positives = 75/166 (45%), Gaps = 15/166 (9%)

Query: 286  KRDERKKLADE-----EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDM 340
            K DE KK A+E     EA +K ++ + +  + +K     K+ +EA         +A E  
Sbjct: 1471 KADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEA--------KKAEEAK 1522

Query: 341  QEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAI 400
            +   ++  ++ ++ D+A  K   E+ K+++L K    +K   +++ +      E  + A+
Sbjct: 1523 KADEAKKAEEAKKADEA--KKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMAL 1580

Query: 401  RKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLK 446
            RK EE ++  +  +    K      +      +KA E+   A +LK
Sbjct: 1581 RKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELK 1626



 Score = 29.7 bits (66), Expect = 5.6
 Identities = 32/161 (19%), Positives = 68/161 (42%), Gaps = 8/161 (4%)

Query: 286  KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
            ++ E  +   EEA +K    + +  + +K     K+ EE      E+   A    +   +
Sbjct: 1364 EKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEA 1423

Query: 346  RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
            +   + ++K D   K   E  K+++  K A E K    E+ +  + + +    A +K EE
Sbjct: 1424 KKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKA--EEAKKKAEEAKKADEAKKKAEE 1481

Query: 406  KERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLK 446
             ++  +      +K    + +A +   +KA E+ + A + K
Sbjct: 1482 AKKADEA----KKKAEEAKKKADEA--KKAAEAKKKADEAK 1516


>gnl|CDD|153280 cd07596, BAR_SNX, The Bin/Amphiphysin/Rvs (BAR) domain of Sorting
           Nexins.  BAR domains are dimerization, lipid binding and
           curvature sensing modules found in many different
           proteins with diverse functions. Sorting nexins (SNXs)
           are Phox homology (PX) domain containing proteins that
           are involved in regulating membrane traffic and protein
           sorting in the endosomal system. SNXs differ from each
           other in their lipid-binding specificity, subcellular
           localization and specific function in the endocytic
           pathway. A subset of SNXs also contain BAR domains. The
           PX-BAR structural unit determines the specific membrane
           targeting of SNXs. BAR domains form dimers that bind to
           membranes, induce membrane bending and curvature, and
           may also be involved in protein-protein interactions.
          Length = 218

 Score = 37.3 bits (87), Expect = 0.010
 Identities = 36/196 (18%), Positives = 60/196 (30%), Gaps = 17/196 (8%)

Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
           EEA   I +LE+Q  +L KQ     +    L + +   G+A   + +    +  +L E  
Sbjct: 7   EEAKDYILKLEEQLKKLSKQAQRLVKRRRELGSALGEFGKALIKLAKCEEEVGGELGEAL 66

Query: 356 DANFKLMTERIKSNQLHKLAREEKDTLREQ-------VQALSLQIEAMHTAIRKLEEKER 408
               K   E          A +E   L E         QA+   ++    A+  L+  ++
Sbjct: 67  SKLGKAAEELSSL--SEAQANQELVKLLEPLKEYLRYCQAVKETLDDRADALLTLQSLKK 124

Query: 409 FLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLHLVIIYTNGPA-------RLE 461
            L +     EK              +  E  + A          Y            R  
Sbjct: 125 DLASKKAQLEKLKAAPGIKPA-KVEELEEELEEAESALEEARKRYEEISERLKEELKRFH 183

Query: 462 FGKDREFSAKFVNFRR 477
             + R+  A    F R
Sbjct: 184 EERARDLKAALKEFAR 199


>gnl|CDD|218452 pfam05130, FlgN, FlgN protein.  This family includes the FlgN
           protein and export chaperone involved in flagellar
           synthesis.
          Length = 140

 Score = 36.2 bits (84), Expect = 0.011
 Identities = 15/75 (20%), Positives = 40/75 (53%), Gaps = 1/75 (1%)

Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
           +E + +++QLE++  QL  ++         L   +    +   ++ ++   LL++L+E +
Sbjct: 43  QELLAQLEQLEEERQQLLAELGL-SPGPAGLSELLAAEDEELRELWQELKELLEELQELN 101

Query: 356 DANFKLMTERIKSNQ 370
           + N +L+ + ++ NQ
Sbjct: 102 ERNGRLLEQSLEFNQ 116


>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
           Provisional.
          Length = 569

 Score = 37.9 bits (89), Expect = 0.011
 Identities = 32/166 (19%), Positives = 64/166 (38%), Gaps = 39/166 (23%)

Query: 295 DEEAMRKIKQLEDQTHQLQKQVATHK-------QEEEALLNEMEVTGQAFEDMQE----- 342
           +E+   +++QL+D   +L+K +  ++        E E  L  +E     F ++ E     
Sbjct: 135 EEKNREEVEQLKDLYRELRKSLLANRFSFGPALDELEKQLENLEEEFSQFVELTESGDYV 194

Query: 343 QNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSL----------- 391
           +   +L QL E+  A  + + E I      +L +E +  L +Q+Q L             
Sbjct: 195 EAREILDQLEEELAA-LEQIMEEIP-----ELLKELQTELPDQLQELKAGYRELVEEGYH 248

Query: 392 --------QIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMD 429
                   +I+ +   I +       L   L  AE++     + +D
Sbjct: 249 LDHLDIEKEIQDLKEQIDENLALLEELD--LDEAEEKNEEIQERID 292



 Score = 32.1 bits (74), Expect = 0.76
 Identities = 38/148 (25%), Positives = 66/148 (44%), Gaps = 31/148 (20%)

Query: 295 DEEAMRKIKQLEDQTHQLQKQV-------ATHKQ---EEEALLNEMEVTGQAFEDMQEQN 344
           +E  +  ++QLE Q   L+KQ        A  +    E +  L E+    +  E  QE+ 
Sbjct: 343 NESELESVRQLEKQLESLEKQYDEITERIAEQEIAYSELQEELEEILKQLEEIEKEQEKL 402

Query: 345 SRLLQQLREKDDANFKLMTERIKSNQLHKLAR---------------EEKDTLREQVQAL 389
           S +LQ LR KD+   +   ER + N+LH++ R               E    + ++++AL
Sbjct: 403 SEMLQGLR-KDELEAREKLERYR-NKLHEIKRYLEKSNLPGLPEDYLEMFFEVSDEIEAL 460

Query: 390 SLQIEA----MHTAIRKLEEKERFLQTV 413
           + ++E     M    R LEE    ++T+
Sbjct: 461 AEELEEKPINMEAVNRLLEEATEDVETL 488



 Score = 31.7 bits (73), Expect = 1.0
 Identities = 27/146 (18%), Positives = 60/146 (41%), Gaps = 31/146 (21%)

Query: 301 KIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRL------LQQ---L 351
           +I QL D    L+++V   K  E+      +      E  +EQN  L      ++Q   L
Sbjct: 290 RIDQLYDI---LEREVKARKYVEK----NSDTLPDFLEHAKEQNKELKEEIDRVKQSYTL 342

Query: 352 REKDDANFKLMTERIKS--NQLHKLA-------------REEKDTLREQVQALSLQIEAM 396
            E +  + + + ++++S   Q  ++              +EE + + +Q++ +  + E +
Sbjct: 343 NESELESVRQLEKQLESLEKQYDEITERIAEQEIAYSELQEELEEILKQLEEIEKEQEKL 402

Query: 397 HTAIRKLEEKERFLQTVLTNAEKELH 422
              ++ L + E   +  L     +LH
Sbjct: 403 SEMLQGLRKDELEAREKLERYRNKLH 428


>gnl|CDD|205692 pfam13514, AAA_27, AAA domain.  This domain is found in a number of
           double-strand DNA break proteins. This domain contains a
           P-loop motif.
          Length = 1118

 Score = 37.8 bits (88), Expect = 0.015
 Identities = 27/138 (19%), Positives = 52/138 (37%), Gaps = 3/138 (2%)

Query: 284 DSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ 343
             KR      A E+   +I++ E +  +    +   +    ALL     T         +
Sbjct: 799 RLKRARDTAAAAEKLAEEIEEAEKEVSEAAAALDEAEARLTALLRAARCTTIEELLAAVE 858

Query: 344 NSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKL 403
            S   ++LR++  A  + +        L  L  E      +++ A    +E +   I +L
Sbjct: 859 RSDTYRELRKRIAALERTLVRAGGGLSLEALVAEAAALDPDELPAR---LEELARDIEEL 915

Query: 404 EEKERFLQTVLTNAEKEL 421
           EE+   L   +  A++EL
Sbjct: 916 EEELNELAQEVGAAKQEL 933



 Score = 32.4 bits (74), Expect = 0.63
 Identities = 28/157 (17%), Positives = 61/157 (38%), Gaps = 22/157 (14%)

Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
           ++ ++ + + E++   L+K+     ++ E     +E   +    + E+ + L QQL    
Sbjct: 177 KDLVKALDEAEEELANLRKE----LRQLEKEKQRLERLRRLLPLLAERKA-LEQQLAALG 231

Query: 356 DANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQ---- 411
           +     +            AR E    R  ++ L+ ++EA+   + ++   E  L     
Sbjct: 232 EVI--DLPPDAVERYEE--ARAELRAARRNLELLTERLEALQAELDEISLDEELLAQAAA 287

Query: 412 -TVL-------TNAEKELHLRNQAMDLNKRKAIESAQ 440
              L        NAE++L  R   +  N R+A  +  
Sbjct: 288 IEALHQQRGEYRNAEQDLPDREGEIA-NAREAAAALL 323



 Score = 31.3 bits (71), Expect = 1.4
 Identities = 28/101 (27%), Positives = 48/101 (47%), Gaps = 14/101 (13%)

Query: 294 ADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLRE 353
             EE  R I++LE++ ++L ++V   KQE    L  M+  G    +++ +   LL QLR+
Sbjct: 904 RLEELARDIEELEEELNELAQEVGAAKQE----LARMD-GGSTAAELEAERESLLAQLRD 958

Query: 354 KDDANFKLMTER-IKSNQLHKLAREEKDTLREQVQALSLQI 393
                   + ER ++     +L R+  +  REQ Q   L  
Sbjct: 959 --------LAERYLELALASRLLRKAIERYREQRQPPVLNR 991



 Score = 30.5 bits (69), Expect = 2.4
 Identities = 27/151 (17%), Positives = 53/151 (35%), Gaps = 17/151 (11%)

Query: 300 RKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANF 359
            ++ ++      L +  A      EAL  +      A +D+ ++   +            
Sbjct: 270 AELDEISLDEELLAQAAAI-----EALHQQRGEYRNAEQDLPDREGEIANAREAAAALLA 324

Query: 360 KLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAI----RKLEEKERFLQTVLT 415
           ++  +  +       +       +E V  L  + EA+  A+      LEE+ER L+ V  
Sbjct: 325 QIGPDADEE---AVESLRPSLAAKETVTELEKRKEALDQALKSARDALEERERELKQV-- 379

Query: 416 NAEKELH-LRNQAMDLNKRKAIESAQSAADL 445
               +L  L    +    R A+  A  A D+
Sbjct: 380 --RAQLAALPTVTVPEPLRAALADALRAGDI 408


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
           family is found in eukaryotes. It is a coiled-coil
           domain of unknwon function.
          Length = 126

 Score = 35.3 bits (82), Expect = 0.016
 Identities = 22/124 (17%), Positives = 55/124 (44%), Gaps = 13/124 (10%)

Query: 303 KQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLM 362
           ++  ++            +  E LL + E      E+++++   L + L + D    +  
Sbjct: 6   RREMEEVQLALDAKREEFERREELLKQRE------EELEKKEEELQESLIKFDKFLKENE 59

Query: 363 TERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEK-------ERFLQTVLT 415
            +R ++ +  +  ++ +    E+++ L  ++E +   I KLEEK       E FL++V+ 
Sbjct: 60  AKRRRAEKKAEEEKKLRKEKEEEIKELKAELEELKAEIEKLEEKLEEYQPYEEFLESVVE 119

Query: 416 NAEK 419
            + +
Sbjct: 120 RSPE 123


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 37.6 bits (87), Expect = 0.017
 Identities = 23/170 (13%), Positives = 66/170 (38%), Gaps = 9/170 (5%)

Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
           K     +  +   +  +K  E++   LQ+ +   ++E E+   E+E   +    + ++N 
Sbjct: 217 KEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENK 276

Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
              ++ + +++    L  E        +  + E   L  +      +++     ++KLE+
Sbjct: 277 EEEKEKKLQEEELKLLAKEE-------EELKSELLKLERRKVDDEEKLKESEKELKKLEK 329

Query: 406 KERFLQTVLTNAEKEL--HLRNQAMDLNKRKAIESAQSAADLKLHLVIIY 453
           + +  +  +   EKEL      +  +  + + +E  Q   +     ++  
Sbjct: 330 ELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAK 379


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 37.3 bits (86), Expect = 0.019
 Identities = 33/157 (21%), Positives = 63/157 (40%), Gaps = 20/157 (12%)

Query: 293 LADEEAMRKIKQLED---QTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQ 349
           L +      + QL+D   Q   L+ ++ T + E+ A   E E      +  +++   + Q
Sbjct: 64  LLNRNLRSGVFQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQ 123

Query: 350 QLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERF 409
           +L               K+ Q   LAR     L +Q Q L  +++ +    R+LE + + 
Sbjct: 124 ELAA-------ARQNLAKAQQE--LAR-----LTKQAQDLQTRLKTLAEQRRQLEAQAQS 169

Query: 410 L---QTVLTNAEKELHLRNQAMDLNKRKAIESAQSAA 443
           L   Q  L  +  +L  +   + L   +  + AQ+ A
Sbjct: 170 LQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLA 206



 Score = 36.9 bits (85), Expect = 0.027
 Identities = 28/124 (22%), Positives = 56/124 (45%), Gaps = 3/124 (2%)

Query: 326 LLNEMEVTGQA-FEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLRE 384
           LLN    +G    +D++ Q   L  +L           TER  +    + AR+E++ +R+
Sbjct: 64  LLNRNLRSGVFQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQ 123

Query: 385 QVQALSLQIEAMHTAIRKLEEKERFLQTVLTN-AEKELHLRNQAMDL-NKRKAIESAQSA 442
           ++ A    +      + +L ++ + LQT L   AE+   L  QA  L   +K ++++ + 
Sbjct: 124 ELAAARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQ 183

Query: 443 ADLK 446
              +
Sbjct: 184 LKSQ 187



 Score = 35.4 bits (81), Expect = 0.079
 Identities = 30/156 (19%), Positives = 58/156 (37%), Gaps = 19/156 (12%)

Query: 282 VRDSKRDERKKLADEEAMRKIKQLEDQTHQLQ---KQVATHKQEEEALLNEMEVTGQAFE 338
           VR      R+ LA  +A +++ +L  Q   LQ   K +A  +++ EA    ++ + +  +
Sbjct: 121 VRQELAAARQNLA--KAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQ 178

Query: 339 DMQEQNSRLLQQL---------REKDDAN----FKLMTERIKSNQLHKLAREEK-DTLRE 384
               Q    +  L           ++ A      +  TE +           +       
Sbjct: 179 ASATQLKSQVLDLKLRSAQIEQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDA 238

Query: 385 QVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKE 420
           Q+   + QI A    IR+ E + + L+T     E+E
Sbjct: 239 QISQKAQQIAARAEQIRERERQLQRLETAQARLEQE 274


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 37.4 bits (87), Expect = 0.023
 Identities = 39/146 (26%), Positives = 76/146 (52%), Gaps = 11/146 (7%)

Query: 296 EEAMRKIKQLED---QTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLR 352
           E+ +R+I  L+D       L + +   K+  E L   ++ T    E ++E+   L + LR
Sbjct: 148 EKVVRQILGLDDYENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLR 207

Query: 353 EKDDANFKLMTERIK-SNQLHKLAREEK--DTLREQVQALSLQIEAMHTAIRKLEEKERF 409
           E +    ++ +E  +   +L KL +E K  + L+E+++ L  ++E++  + RKLEEK R 
Sbjct: 208 EIN----EISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRE 263

Query: 410 LQTVLTNAEKELH-LRNQAMDLNKRK 434
           L+  +   +KE+  L  +  +L + K
Sbjct: 264 LEERIEELKKEIEELEEKVKELKELK 289



 Score = 35.4 bits (82), Expect = 0.071
 Identities = 30/139 (21%), Positives = 69/139 (49%), Gaps = 9/139 (6%)

Query: 300 RKIKQLED---QTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDD 356
           R+I++LE    +T  +++ +   ++E E +L E+        +++E+  +L ++++E ++
Sbjct: 176 RRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEE 235

Query: 357 ANFKLMTERIKSNQLHKLARE-EKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLT 415
                + E I+  +    + E  K  L E+++ L  +IE +   I +LEEK + L+ +  
Sbjct: 236 -----LKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKE 290

Query: 416 NAEKELHLRNQAMDLNKRK 434
            AE+ + L     +     
Sbjct: 291 KAEEYIKLSEFYEEYLDEL 309



 Score = 34.7 bits (80), Expect = 0.14
 Identities = 33/174 (18%), Positives = 78/174 (44%), Gaps = 9/174 (5%)

Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQE---EEALLNEMEVTGQAFEDMQEQNSRLLQQLR 352
           E+   + ++L+++  +L+ ++ + K+E    E L  ++    +  ++++E+ + LL++L 
Sbjct: 521 EKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELE 580

Query: 353 EKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQT 412
           E    + + + ER+K   L      E   L++  + L  + + +     +L++    L  
Sbjct: 581 ELGFESVEELEERLKE--LEPFYN-EYLELKDAEKELEREEKELKKLEEELDKAFEELAE 637

Query: 413 VLTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLHLVIIYTNGPARLEFGKDR 466
                E+   LR +  +L K+ + E  +   +  L L        A LE  + R
Sbjct: 638 TEKRLEE---LRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKR 688



 Score = 32.3 bits (74), Expect = 0.72
 Identities = 31/151 (20%), Positives = 62/151 (41%), Gaps = 22/151 (14%)

Query: 289 ERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLL 348
           E  +   E      ++LE++  +L++++   K+E E L               E+  + L
Sbjct: 241 EELEKELESLEGSKRKLEEKIRELEERIEELKKEIEEL---------------EEKVKEL 285

Query: 349 QQLREKDDANFKLMTERIKSNQLHKLAREEK--DTLREQVQALSLQI---EAMHTAIRKL 403
           ++L+EK +   KL         L +L   EK    L E++  +  +I   E     + +L
Sbjct: 286 KELKEKAEEYIKL--SEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEEL 343

Query: 404 EEKERFLQTVLTNAEKELHLRNQAMDLNKRK 434
           ++K + L+  L   E+   L  +A    +  
Sbjct: 344 KKKLKELEKRLEELEERHELYEEAKAKKEEL 374



 Score = 31.6 bits (72), Expect = 1.2
 Identities = 38/174 (21%), Positives = 76/174 (43%), Gaps = 22/174 (12%)

Query: 260 RSLSDSASNIYGSLEFRKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATH 319
             L +  + +   LE   F  V    +R +      E    +  +L+D   +L+++    
Sbjct: 566 DELEEELAELLKELEELGFESVEELEERLKEL----EPFYNEYLELKDAEKELEREEKEL 621

Query: 320 KQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEK 379
           K+ EE L        +AFE++ E   RL ++LR++ +       E+  S + ++  REE 
Sbjct: 622 KKLEEEL-------DKAFEELAETEKRL-EELRKELEE-----LEKKYSEEEYEELREEY 668

Query: 380 DTLREQVQALSLQIEAMH----TAIRKLEEKERFLQTVLTNAEKELHLRNQAMD 429
             L  ++  L  ++E +        + LE+ +  L+     A+KEL    +A++
Sbjct: 669 LELSRELAGLRAELEELEKRREEIKKTLEKLKEELEE-REKAKKELEKLEKALE 721



 Score = 30.4 bits (69), Expect = 2.6
 Identities = 31/135 (22%), Positives = 66/135 (48%), Gaps = 13/135 (9%)

Query: 295 DEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLRE- 353
             E   + K  +++  +L+K++     E+  L  E+E   +A E+++E+ S++  ++ E 
Sbjct: 360 RHELYEEAKAKKEELERLKKRLTGLTPEK--LEKELEELEKAKEEIEEEISKITARIGEL 417

Query: 354 -KDDANFKLMTERIKSNQL------HKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEK 406
            K+    K   E +K  +        +L  E +  L E+    + +++ +   ++++EEK
Sbjct: 418 KKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEE---YTAELKRIEKELKEIEEK 474

Query: 407 ERFLQTVLTNAEKEL 421
           ER L+  L   EK L
Sbjct: 475 ERKLRKELRELEKVL 489


>gnl|CDD|217933 pfam04156, IncA, IncA protein.  Chlamydia trachomatis is an
           obligate intracellular bacterium that develops within a
           parasitophorous vacuole termed an inclusion. The
           inclusion is non-fusogenic with lysosomes but intercepts
           lipids from a host cell exocytic pathway. Initiation of
           chlamydial development is concurrent with modification
           of the inclusion membrane by a set of C.
           trachomatis-encoded proteins collectively designated
           Incs. One of these Incs, IncA, is functionally
           associated with the homotypic fusion of inclusions. This
           family probably includes members of the wider Inc family
           rather than just IncA.
          Length = 186

 Score = 35.9 bits (83), Expect = 0.023
 Identities = 27/143 (18%), Positives = 68/143 (47%), Gaps = 14/143 (9%)

Query: 271 GSLEFRKFACVVRDSKRDERKKLAD-EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNE 329
           G L     A V     +   +   +  E  +++ +L+++   L++++A  + E E L  +
Sbjct: 56  GLLCLLLKAPVQSVRPQKLEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKED 115

Query: 330 MEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQAL 389
           +++  +  + ++E+    L+ L E    + K + + ++  +L +  REE + LRE+++ L
Sbjct: 116 LQLLRELLKSLEER----LESLEE----SIKELAKELR--ELRQDLREEVEELREELERL 165

Query: 390 SLQIEAMHTAIRKLEEKERFLQT 412
               E +      ++E +  L+ 
Sbjct: 166 Q---ENLQRLQEAIQELQSLLEQ 185



 Score = 34.7 bits (80), Expect = 0.049
 Identities = 24/116 (20%), Positives = 53/116 (45%), Gaps = 14/116 (12%)

Query: 310 HQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSN 369
            Q  +++     E +  L+E++   +  ED++E+ + L  +L +  +    L        
Sbjct: 71  PQKLEELQGELSELKQQLSELQ---EELEDLEERIAELESELEDLKEDLQLLRELLKSLE 127

Query: 370 Q-LHKLAREEKDT----------LREQVQALSLQIEAMHTAIRKLEEKERFLQTVL 414
           + L  L    K+           LRE+V+ L  ++E +   +++L+E  + LQ++L
Sbjct: 128 ERLESLEESIKELAKELRELRQDLREEVEELREELERLQENLQRLQEAIQELQSLL 183



 Score = 27.8 bits (62), Expect = 9.6
 Identities = 14/69 (20%), Positives = 32/69 (46%), Gaps = 2/69 (2%)

Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQE 342
           R+  +   ++L  E     IK+L  +  +L++ +    +E    L  ++   Q  ++  +
Sbjct: 120 RELLKSLEERL--ESLEESIKELAKELRELRQDLREEVEELREELERLQENLQRLQEAIQ 177

Query: 343 QNSRLLQQL 351
           +   LL+QL
Sbjct: 178 ELQSLLEQL 186


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 37.0 bits (86), Expect = 0.024
 Identities = 22/111 (19%), Positives = 44/111 (39%), Gaps = 7/111 (6%)

Query: 317 ATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAR 376
           A ++ E    L + E   Q+    Q+Q    L Q   + +   +   E        K AR
Sbjct: 599 AANETELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEA---RTALKQAR 655

Query: 377 EEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQA 427
            +   L+ + Q+L  ++E      ++  E +  L+ +  +A+ +  L  Q 
Sbjct: 656 LDLQRLQNEQQSLKDKLELAIAERKQQAETQ--LRQL--DAQLKQLLEQQQ 702



 Score = 37.0 bits (86), Expect = 0.031
 Identities = 22/162 (13%), Positives = 66/162 (40%), Gaps = 8/162 (4%)

Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQE 342
           R + +D    +AD +A+R I+++  +  +LQ+          +L   ++         +E
Sbjct: 222 RLNPQDVEHWIADIQALRAIQKVAPEIEKLQEDFEQL----LSLELRLQHLHGELVADEE 277

Query: 343 QNSRLLQQLRE-KDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIR 401
           + +   ++ +E K+    +L T   +  +      +E      ++ A   ++E +     
Sbjct: 278 RLAEEQEERQEAKNRLRQQLRTLEDQLKEARDELNQELSAANAKLAADRSELELLEDQKG 337

Query: 402 KLEEKERFLQTVLTNAEKELHLRNQAMDLNKR-KAIESAQSA 442
             E+ +  ++ +  + ++   +R++  ++  R  A+      
Sbjct: 338 AFEDAD--IEQLQADLDQLPSIRSELEEVEARLDALTGKHQD 377



 Score = 35.8 bits (83), Expect = 0.070
 Identities = 33/179 (18%), Positives = 59/179 (32%), Gaps = 34/179 (18%)

Query: 293 LADEEAMR-KIKQLED--QTH-----QLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQN 344
            A+E  +R +++Q E+  Q+      Q ++Q+     E E           A +  +   
Sbjct: 599 AANETELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDL 658

Query: 345 SRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLE 404
            RL  + +   D     + ER +       A  +   L  Q++ L  Q +A         
Sbjct: 659 QRLQNEQQSLKDKLELAIAERKQQ------AETQLRQLDAQLKQLLEQQQAF-----LEA 707

Query: 405 EKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSA--ADLKLHLVIIYTNGPARLE 461
            K+ F +  L                 K + +E       A L   +    T   ARL+
Sbjct: 708 LKDDFRE--LRTERLA-----------KWQVVEGELDNQLAQLSAAIEAARTQAKARLK 753



 Score = 35.1 bits (81), Expect = 0.094
 Identities = 40/197 (20%), Positives = 77/197 (39%), Gaps = 27/197 (13%)

Query: 272 SLEFRKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALL-NEM 330
           ++  RK     +  + D + K   E+    ++ L+D   +L+ +     Q  E  L N++
Sbjct: 675 AIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRTERLAKWQVVEGELDNQL 734

Query: 331 EVTGQAFEDMQEQNSRLLQQLREKDD--------------------ANFKLMTERIKSNQ 370
                A E  + Q    L++L+++ D                       +   ERI   +
Sbjct: 735 AQLSAAIEAARTQAKARLKELKKQYDRELASLDVDPNTVKELKRQIEELETTIERIAVRR 794

Query: 371 LHKLAREEKDTLREQ---VQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQA 427
             +  RE +  ++E      +L  +   +   +R+LE     LQ  LT   K+  LR + 
Sbjct: 795 -PE-VREYRAFMQETWLHRDSLREERPNLAIQLRELESSAEELQQELTRLIKDTKLRRKK 852

Query: 428 MDLNKRKAIESAQSAAD 444
           ++  +RKA+E      D
Sbjct: 853 LE-QERKALEKQLDQLD 868



 Score = 31.6 bits (72), Expect = 1.3
 Identities = 18/155 (11%), Positives = 57/155 (36%), Gaps = 11/155 (7%)

Query: 302 IKQLEDQTHQLQKQVATHKQEEEALLNEM-----EVTGQAFEDMQEQN---SRLLQQLRE 353
           +   E++  + Q++    K      L  +     E   +  +++   N   +    +L  
Sbjct: 272 LVADEERLAEEQEERQEAKNRLRQQLRTLEDQLKEARDELNQELSAANAKLAADRSELEL 331

Query: 354 KDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTV 413
            +D         I+  Q      ++  ++R +++ +  +++A+    + ++ K   L+  
Sbjct: 332 LEDQKGAFEDADIEQLQADL---DQLPSIRSELEEVEARLDALTGKHQDVQRKYERLKQK 388

Query: 414 LTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLH 448
           +    +    +N       R+  +  ++A +  L 
Sbjct: 389 IKEQLERDLEKNNERLAAIREEKDRQKAAIEEDLQ 423



 Score = 30.8 bits (70), Expect = 2.4
 Identities = 24/159 (15%), Positives = 59/159 (37%), Gaps = 18/159 (11%)

Query: 295 DEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREK 354
               +R+ +    +T   +  +   +     L  ++     + E++Q++ +RL++  + +
Sbjct: 793 RRPEVREYRAFMQETWLHRDSLREERP---NLAIQLRELESSAEELQQELTRLIKDTKLR 849

Query: 355 DDANFKLMTERIKSNQLHKLAREEKDTLREQVQALS----------LQIEAMHTAIRKLE 404
                KL  ER    +      E    LR++++ L+           +       + +LE
Sbjct: 850 ---RKKLEQERKALEKQLDQLDELLRGLRDEMRQLAELKEPANANQAEGSISER-LDQLE 905

Query: 405 EKERFLQTVLTNAEKEL-HLRNQAMDLNKRKAIESAQSA 442
           E +R  + +  + +K L        D +     E+ Q  
Sbjct: 906 EFKRKRKRLSGDLKKFLERFDGVIKDHSGSGLYETWQKL 944



 Score = 30.4 bits (69), Expect = 3.0
 Identities = 22/109 (20%), Positives = 42/109 (38%), Gaps = 5/109 (4%)

Query: 285 SKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQN 344
            + +  K   +EE      +L     +L    AT     E L   +E+  +A E  QE+ 
Sbjct: 431 QQLEAGKLEFNEEEYELELRLGRLKQRLDSATAT----PEELEQ-LEINDEALEKAQEEQ 485

Query: 345 SRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQI 393
            +    + +      +L   R ++ +  + A      LR+ +  L LQ+
Sbjct: 486 EQAEANVEQLQSELRQLRKRRDEALEALQRAERRLLQLRQALDELELQL 534



 Score = 29.7 bits (67), Expect = 5.0
 Identities = 23/113 (20%), Positives = 45/113 (39%), Gaps = 16/113 (14%)

Query: 291 KKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQ 350
            K A  +  R   + +    +L+  +A  KQ+ E  L +++          +       Q
Sbjct: 651 LKQARLDLQRLQNEQQSLKDKLELAIAERKQQAETQLRQLD---------AQLK-----Q 696

Query: 351 LREKDDANFKLMTERIKSNQLHKLA--REEKDTLREQVQALSLQIEAMHTAIR 401
           L E+  A  + + +  +  +  +LA  +  +  L  Q+  LS  IEA  T  +
Sbjct: 697 LLEQQQAFLEALKDDFRELRTERLAKWQVVEGELDNQLAQLSAAIEAARTQAK 749


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 36.7 bits (85), Expect = 0.025
 Identities = 30/163 (18%), Positives = 65/163 (39%), Gaps = 20/163 (12%)

Query: 290 RKKLADEEAMRK----IKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
             +  ++EA  K        + +  QL++Q+   K+E++ L NE     Q    +Q++ +
Sbjct: 43  TLEEFEKEANEKRAQYRSAKKKELSQLEEQLINQKKEQKNLFNEQ--IKQFELALQDEIA 100

Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSL---QIEAM----HT 398
           +L        + + +L        +L +L++E +  L+   + +       +        
Sbjct: 101 KLEALELLNLEKDKELELL---EKELDELSKELQKQLQNTAEIIEKKRENNKNEERLKFE 157

Query: 399 AIRKLEEKERFLQTVLTNAEKELHLRNQAMD-LNKRKAIESAQ 440
             +KLEE    L+      E++LH  N  ++     +  ES  
Sbjct: 158 NEKKLEES---LELEREKFEEQLHEANLDLEFKENEEQRESKW 197



 Score = 29.8 bits (67), Expect = 3.8
 Identities = 23/131 (17%), Positives = 53/131 (40%), Gaps = 15/131 (11%)

Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
             +E+ K  +     +I +LE     L+       +E E L  E++   +  +   +  +
Sbjct: 83  LFNEQIKQFELALQDEIAKLE----ALELLNLEKDKELELLEKELDELSKELQKQLQNTA 138

Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIE---------AM 396
            ++++ RE +    +L  E  K  +   L  E ++   EQ+   +L +E         + 
Sbjct: 139 EIIEKKRENNKNEERLKFENEKKLEE-SLELE-REKFEEQLHEANLDLEFKENEEQRESK 196

Query: 397 HTAIRKLEEKE 407
              ++KL+ + 
Sbjct: 197 WAILKKLKRRA 207


>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
           sigma54-dependent transcription [Transcription / Signal
           transduction mechanisms].
          Length = 225

 Score = 36.1 bits (84), Expect = 0.026
 Identities = 29/128 (22%), Positives = 51/128 (39%), Gaps = 25/128 (19%)

Query: 285 SKRDERKKLAD------EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFE 338
            K ++ +K+ +      E  + K +Q       L + +A  KQ E  L  E +      E
Sbjct: 20  DKAEDPEKMLEQAIRDMESELAKARQ------ALAQAIARQKQLERKL-EEAQARA---E 69

Query: 339 DMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHT 398
            ++E+    LQ   E D A   L  E  +S +      +    L  ++Q    Q+E +  
Sbjct: 70  KLEEKAELALQAGNE-DLAREAL--EEKQSLE------DLAKALEAELQQAEEQVEKLKK 120

Query: 399 AIRKLEEK 406
            +  LE+K
Sbjct: 121 QLAALEQK 128



 Score = 30.4 bits (69), Expect = 1.7
 Identities = 22/132 (16%), Positives = 41/132 (31%), Gaps = 41/132 (31%)

Query: 292 KLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQ---------------- 335
            LA E A+ + + LED    L+ ++   +++ E L  ++    Q                
Sbjct: 85  DLARE-ALEEKQSLEDLAKALEAELQQAEEQVEKLKKQLAALEQKIAELRAKKEALKARK 143

Query: 336 ----AFEDMQEQ--------NSRLLQQLREKDD---ANFKLMTERIKSNQLHKLAREEKD 380
               A E +                +++ EK +   A  +   E         LA    D
Sbjct: 144 AAAKAQEKVNRSLGGGSSSSAMAAFERMEEKIEEREARAEAAAE---------LAEGSGD 194

Query: 381 TLREQVQALSLQ 392
            L ++      Q
Sbjct: 195 DLDKEFAQAGAQ 206


>gnl|CDD|203738 pfam07716, bZIP_2, Basic region leucine zipper. 
          Length = 54

 Score = 33.0 bits (76), Expect = 0.029
 Identities = 12/34 (35%), Positives = 24/34 (70%)

Query: 289 ERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQE 322
           E+KK  +EE   ++K+LE++  QL+++V   ++E
Sbjct: 21  EKKKQREEELEERVKELEEENAQLRQKVEQLEKE 54


>gnl|CDD|222290 pfam13654, AAA_32, AAA domain.  This family includes a wide variety
           of AAA domains including some that have lost essential
           nucleotide binding residues in the P-loop.
          Length = 509

 Score = 36.7 bits (86), Expect = 0.032
 Identities = 24/118 (20%), Positives = 46/118 (38%), Gaps = 27/118 (22%)

Query: 318 THKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREK-DDANFKL-MT------------ 363
            ++  +E +  E           QE+     ++L E+  +  F L  T            
Sbjct: 112 EYEARKEEIEEEF----------QEKREEAFEELEEEAKEKGFALVRTPGGFVFAPLKDG 161

Query: 364 ERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKEL 421
           E +   +   L  EE++ L E++  L    E +   +R+L E ER  +  L   ++E+
Sbjct: 162 EPLTEEEFEALPEEEREELEEKIDELE---EELQEILRQLRELEREAREKLRELDREV 216


>gnl|CDD|202045 pfam01920, Prefoldin_2, Prefoldin subunit.  This family includes
           prefoldin subunits that are not detected by pfam02996.
          Length = 106

 Score = 34.1 bits (79), Expect = 0.034
 Identities = 21/121 (17%), Positives = 51/121 (42%), Gaps = 24/121 (19%)

Query: 301 KIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFK 360
            I + + Q  Q  + +A   ++ E  L E+E+  +  E +              +D    
Sbjct: 3   LINKFQ-QLQQQLQLLAQQIKQLETQLKEIELVLEELELLD-------------EDTKVY 48

Query: 361 LMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKE 420
            +   +       L +++K+ ++E+++    + E +   I+ LE++   L+  L   ++E
Sbjct: 49  KLIGDV-------LVKQDKEEVKEELEE---RKETLEKEIKTLEKQLEKLEKELEELKEE 98

Query: 421 L 421
           L
Sbjct: 99  L 99


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 36.0 bits (83), Expect = 0.035
 Identities = 23/91 (25%), Positives = 39/91 (42%), Gaps = 7/91 (7%)

Query: 284 DSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ 343
           DS+ DE KK  +E  + ++++LE +   L  ++   ++E+E L NE     Q   +    
Sbjct: 69  DSELDELKK-EEERLLDELEELEKEDDDLDGELVELQEEKEQLENEEL---QYLREYNLF 124

Query: 344 NSRLLQQLREKDDANFKLMTERIKSNQLHKL 374
           +   LQ          +        NQL KL
Sbjct: 125 DRNNLQLEDNLQSLELQY---EYSLNQLDKL 152



 Score = 31.8 bits (72), Expect = 0.94
 Identities = 24/104 (23%), Positives = 47/104 (45%), Gaps = 7/104 (6%)

Query: 306 EDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTER 365
                 L  ++   K+EEE LL+E+E       ++++++  L  +L E  +   +L  E 
Sbjct: 62  ISNYEALDSELDELKKEEERLLDELE-------ELEKEDDDLDGELVELQEEKEQLENEE 114

Query: 366 IKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERF 409
           ++  + + L       L + +Q+L LQ E     + KL +   F
Sbjct: 115 LQYLREYNLFDRNNLQLEDNLQSLELQYEYSLNQLDKLRKTNIF 158


>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding.  This
           family is the highly conserved central region of a
           number of metazoan proteins referred to as growth-arrest
           proteins. In mouse, Gas8 is predominantly a testicular
           protein, whose expression is developmentally regulated
           during puberty and spermatogenesis. In humans, it is
           absent in infertile males who lack the ability to
           generate gametes. The localisation of Gas8 in the
           motility apparatus of post-meiotic gametocytes and
           mature spermatozoa, together with the detection of Gas8
           also in cilia at the apical surfaces of epithelial cells
           lining the pulmonary bronchi and Fallopian tubes
           suggests that the Gas8 protein may have a role in the
           functioning of motile cellular appendages. Gas8 is a
           microtubule-binding protein localised to regions of
           dynein regulation in mammalian cells.
          Length = 201

 Score = 35.3 bits (82), Expect = 0.036
 Identities = 32/143 (22%), Positives = 60/143 (41%), Gaps = 24/143 (16%)

Query: 296 EEAMRKIKQ-LEDQTH-------QLQKQVATHKQEEEALLNEMEVTGQAFEDM------- 340
           E+A  +IK    D TH        L++++A  K+ EE     M    Q  + +       
Sbjct: 8   EKAFNEIKNYYNDITHNNLELIKSLKEEIAEMKKNEEHNEKLMAEIAQENKRLVEPLKKA 67

Query: 341 QEQNSRLLQQLR--EKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHT 398
           +E+   L ++L+  EKD  + K +  R+K  +      +E   L+ + + L  + E +  
Sbjct: 68  EEEVEELRKKLKDYEKDKQSLKNLKARLKELE------KELKNLKWESEVLEQRFEKVER 121

Query: 399 AIRKLEEK-ERFLQTVLTNAEKE 420
              +L +K E  +Q V      +
Sbjct: 122 ERDELYDKFEAAIQDVQQKTGLK 144



 Score = 31.0 bits (71), Expect = 0.93
 Identities = 32/147 (21%), Positives = 65/147 (44%), Gaps = 29/147 (19%)

Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
           K +E  +    E  ++ K+L +   + +++V       E L  +++   +  + ++   +
Sbjct: 41  KNEEHNEKLMAEIAQENKRLVEPLKKAEEEV-------EELRKKLKDYEKDKQSLKNLKA 93

Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQA---------------LS 390
           RL +   EK+  N K  +E +   +  K+ R E+D L ++ +A               L 
Sbjct: 94  RLKEL--EKELKNLKWESE-VLEQRFEKVER-ERDELYDKFEAAIQDVQQKTGLKNLLLE 149

Query: 391 LQIEAMHTAIRKLEEKERFLQTVLTNA 417
            ++EA++     LE+KE  L  VL  A
Sbjct: 150 QKLEALNEE---LEKKEAQLNEVLAAA 173


>gnl|CDD|214599 smart00283, MA, Methyl-accepting chemotaxis-like domains
           (chemotaxis sensory transducer).  Thought to undergo
           reversible methylation in response to attractants or
           repellants during bacterial chemotaxis.
          Length = 262

 Score = 35.7 bits (83), Expect = 0.041
 Identities = 32/145 (22%), Positives = 64/145 (44%), Gaps = 20/145 (13%)

Query: 276 RKFACVVRDSKRDERKKLADE--EAMRKIKQL----EDQTHQLQKQVATHKQEEEALLNE 329
           R FA V      DE +KLA+   E+ ++I+ L    +++T++    +     E E  +  
Sbjct: 126 RGFAVVA-----DEVRKLAERSAESAKEIESLIKEIQEETNEAVAAMEESSSEVEEGVEL 180

Query: 330 MEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMT--ERIKS--NQLHKLAREEKDTLREQ 385
           +E TG A E++ +    +   ++E   A  +     E + +  +++ ++ +E      E 
Sbjct: 181 VEETGDALEEIVDSVEEIADLVQEIAAATDEQAAGSEEVNAAIDEIAQVTQETAAMSEEI 240

Query: 386 VQALSLQIEAMHTAIRKLEE-KERF 409
             A     E +     +L+E  ERF
Sbjct: 241 SAA----AEELSGLAEELDELVERF 261


>gnl|CDD|218051 pfam04380, BMFP, Membrane fusogenic activity.  BMFP consists of two
           structural domains, a coiled-coil C-terminal domain via
           which the protein self-associates as a trimer, and an
           N-terminal domain disordered at neutral pH but adopting
           an amphipathic alpha-helical structure in the presence
           of phospholipid vesicles, high ionic strength, acidic pH
           or SDS. BMFP interacts with phospholipid vesicles though
           the predicted amphipathic alpha-helix induced in the
           N-terminal half of the protein and promotes aggregation
           and fusion of vesicles in vitro.
          Length = 79

 Score = 32.9 bits (76), Expect = 0.045
 Identities = 18/83 (21%), Positives = 34/83 (40%), Gaps = 10/83 (12%)

Query: 330 MEVTGQAFEDMQEQNSRLL---QQLREKDDANFKLMTERIKSNQLHKL---AREEKDTLR 383
           M+   +  +++ +  +  L   Q  R + + N +   +      L KL    REE D  R
Sbjct: 1   MQDPNRLLDELAKLLTDALGAAQGPRREAEKNVRAQLQS----ALSKLDLVTREEFDVQR 56

Query: 384 EQVQALSLQIEAMHTAIRKLEEK 406
             +     ++EA+   +  LE K
Sbjct: 57  AVLARTREKLEALEARVAALEAK 79


>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
          Length = 281

 Score = 35.8 bits (83), Expect = 0.046
 Identities = 23/135 (17%), Positives = 56/135 (41%), Gaps = 26/135 (19%)

Query: 294 ADEEAMRKIKQLEDQ----THQLQKQVATHKQEEEALLNEMEVTG------QAFEDMQEQ 343
           A EE ++K  +          Q++++    ++E E L+ E +  G      +  E+  E+
Sbjct: 79  AKEELLKKTDEASSIIEKLQMQIEREQEEWEEELERLIEEAKAEGYEEGYEKGREEGLEE 138

Query: 344 NSRLLQQLREKDDANFKLMTERIKSNQLH------KLAR----EEKDTLREQVQALSLQI 393
              L++QL +  +   K   E ++S++         +A+    E  +  +E   AL  + 
Sbjct: 139 VRELIEQLNKIIEKLIKKREEILESSEEEIVELALDIAKKVIKEISENSKEIALALVKE- 197

Query: 394 EAMHTAIRKLEEKER 408
                 ++++++   
Sbjct: 198 -----LLKEVKDATD 207


>gnl|CDD|220404 pfam09789, DUF2353, Uncharacterized coiled-coil protein (DUF2353). 
           Members of this family of uncharacterized proteins have
           no known function.
          Length = 319

 Score = 35.6 bits (82), Expect = 0.049
 Identities = 27/143 (18%), Positives = 51/143 (35%), Gaps = 7/143 (4%)

Query: 285 SKRDERKKLAD--EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQE 342
           SK +  K L    E+   +  Q +     LQ +    K+ E  L  E+    Q    ++ 
Sbjct: 6   SKTEALKILGQDLEKCQNERDQFKLMAETLQMRYRNLKKSEREL--ELVAACQGDMKIKP 63

Query: 343 --QNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAI 400
             +   L Q L +  + N +L T+  +  Q     + +   LR+ +    ++        
Sbjct: 64  GTKRQDLSQLLEKYREENQQLSTDVQELRQRKAELQGDIKLLRQTIAQQRVEFMGGGVR- 122

Query: 401 RKLEEKERFLQTVLTNAEKELHL 423
              EE+   +  +    EK   L
Sbjct: 123 ESFEERNDLVSQLEKLREKCKQL 145



 Score = 29.8 bits (67), Expect = 3.1
 Identities = 32/162 (19%), Positives = 65/162 (40%), Gaps = 28/162 (17%)

Query: 287 RDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQ---EEEALLNEMEVTGQAFEDMQEQ 343
           R+E ++L+ +     +++L  +  +LQ  +   +Q   ++        V     E  +E+
Sbjct: 78  REENQQLSTD-----VQELRQRKAELQGDIKLLRQTIAQQRVEFMGGGVR----ESFEER 128

Query: 344 NSRLLQQLREKDDANFKLMTERIKSNQLHKLARE---EKDTLREQVQALSLQIEAMHTA- 399
           N  L+ QL EK     K +   ++S  L     E   E+D  + +   L+ ++  +    
Sbjct: 129 N-DLVSQL-EKLREKCKQLESDLRS--LLDEKEEVLMERDEYQRKAHRLNNELSYLLGGD 184

Query: 400 ------IRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKA 435
                 +  L  + R+L+  +   E+E  L      L K K 
Sbjct: 185 PRRVVDVDALVMENRYLKEKINTLEEESELIKMT--LAKYKQ 224


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 35.9 bits (83), Expect = 0.050
 Identities = 40/172 (23%), Positives = 62/172 (36%), Gaps = 29/172 (16%)

Query: 284 DSKRDE--RKKLADEEA----------MRKIKQLEDQTHQLQKQVATHKQEEEALLNEME 331
            +KR E  RKK   ++A            ++KQLE +    Q+Q    KQ EEA      
Sbjct: 73  SAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQ---KKQAEEAAKQAAL 129

Query: 332 VTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSL 391
              QA E             +    A  K   E  ++    K A  E    + + +A   
Sbjct: 130 KQKQAEE----------AAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKK-KAEAEA--- 175

Query: 392 QIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAA 443
             +A   A +K E +           + E   + +A    K+KA   A++AA
Sbjct: 176 AKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAA 227


>gnl|CDD|113545 pfam04778, LMP, LMP repeated region.  This family consists of a
           repeated sequence element found in the LMP group of
           surface-located membrane proteins of Mycoplasma hominis.
           The the number of repeats in the protein affects the
           tendency of cells to spontaneously aggregate.
           Agglutination may be an important factor in
           colonisation. Non-agglutinating microorganisms might
           easily be distributed whereas aggregation might provide
           a better chance to avoid an antibody response since some
           of the epitopes may be buried.
          Length = 157

 Score = 34.5 bits (79), Expect = 0.051
 Identities = 26/76 (34%), Positives = 41/76 (53%), Gaps = 9/76 (11%)

Query: 398 TAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLHLVIIYTNGP 457
           T + KL + +  +Q   T  +KE+   NQA+  N   +++SA+S+ D K   V   T   
Sbjct: 4   TLLAKLTDNDNTIQKAKTELDKEIQKANQAVASNNTASMQSAKSSLDAK---VTEITK-- 58

Query: 458 ARLE-FGKDREFSAKF 472
            +LE F KD++  AKF
Sbjct: 59  -KLETFNKDKD--AKF 71



 Score = 29.5 bits (66), Expect = 2.8
 Identities = 22/128 (17%), Positives = 49/128 (38%), Gaps = 23/128 (17%)

Query: 285 SKRDERKKLADEEAMRKIKQ-LEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ 343
            K ++     +  +M+  K  L+ +  ++ K++ T  ++++A   E+E T +  ++    
Sbjct: 28  QKANQAVASNNTASMQSAKSSLDAKVTEITKKLETFNKDKDAKFKELEQTRKDIDEFINT 87

Query: 344 N------SRLLQQLREKDD----------------ANFKLMTERIKSNQLHKLAREEKDT 381
           N      S L++ L  K D                AN +L     K+            +
Sbjct: 88  NKNNPNYSTLVKDLTSKKDSKNSVTNSSNKSDIEAANTELKQALAKAKAAKDQIDNANKS 147

Query: 382 LREQVQAL 389
           ++EQ+   
Sbjct: 148 IKEQLNDS 155


>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
           unknown].
          Length = 984

 Score = 36.0 bits (83), Expect = 0.059
 Identities = 44/190 (23%), Positives = 72/190 (37%), Gaps = 27/190 (14%)

Query: 278 FACVVRD----SKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALL------ 327
             CV R       + E  +LA E  + + K+L D+   L+ ++  H++E   L       
Sbjct: 667 LFCVQRLRVAAELQKEEARLALEGNIERTKELNDE---LRAELELHRKEILDLFDCGTAD 723

Query: 328 NEMEVTGQAFEDMQEQNSR-----LLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTL 382
            E      A E+ Q          L  QL       ++L      S    +L  EE   L
Sbjct: 724 TEDAFREAAREEQQLTQRESRLESLEAQLEGVAAEAYELSA----SLDQRELKEEELALL 779

Query: 383 REQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSA 442
            E + AL  ++E +H  +  L  +   L+   T AE      +   DL      E A+  
Sbjct: 780 EEAIDALDEEVEELHAQVAALSRQIAQLEGGGTVAELRQRRESLKEDLE-----EKARKW 834

Query: 443 ADLKLHLVII 452
           A L+L + ++
Sbjct: 835 ASLRLAVQVL 844



 Score = 33.3 bits (76), Expect = 0.41
 Identities = 32/173 (18%), Positives = 64/173 (36%), Gaps = 20/173 (11%)

Query: 282 VRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQ 341
           +R   +  RK L   EA    + LE +    +  +A  + E E  L+E+ ++ +   + Q
Sbjct: 555 IRQHWQQLRKALDQLEA--AYEALEGRFAAAEAAMAEWQSEWEEALDELGLSRELSPEQQ 612

Query: 342 EQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAR--EEKDTLREQVQALSLQIEA---- 395
                 ++ L++           + K+   H++AR  EE+    E+V+ L   +EA    
Sbjct: 613 LDILSTMKDLKKLM---------QKKAELTHQVARLREEQAAFEERVEGLLAVLEAQFID 663

Query: 396 ---MHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADL 445
              +    R     E   +      E  +    +  D  + +     +   DL
Sbjct: 664 LSTLFCVQRLRVAAELQKEEARLALEGNIERTKELNDELRAELELHRKEILDL 716



 Score = 30.2 bits (68), Expect = 3.5
 Identities = 34/192 (17%), Positives = 69/192 (35%), Gaps = 13/192 (6%)

Query: 266 ASNIYGSLEFRKFACVVRDSKRDERKKLADE----EAMRKIKQLEDQTHQLQKQVATH-- 319
            S   GS    K   V+        K         + + K+KQ  ++  + +K+ AT+  
Sbjct: 150 FSGTSGSPASTKLLEVLNKEADSLYKPSGRNPQINQLLEKLKQERNEIDEAEKEYATYHK 209

Query: 320 ----KQEEEALLNEMEVTGQAFED-MQEQNS--RLLQQLREKDDANFKLMTERIKSNQLH 372
               ++ E A L E+    +A  D ++       L  +L+E      +L   R +     
Sbjct: 210 LLESRRAEHARLAELRSELRADRDHIRALRDAVELWPRLQEWKQLEQELTRRREELATFP 269

Query: 373 KLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNK 432
           +      +     +Q    +I+A+   + +L++    L        + L   +Q +   K
Sbjct: 270 RDGVLRLEKREAHLQKTEAEIDALLVRLAELKDLASQLIPAKEAVLQALVRLHQQLSEIK 329

Query: 433 RKAIESAQSAAD 444
             A E  ++ A 
Sbjct: 330 ASAFELTETLAG 341


>gnl|CDD|224300 COG1382, GimC, Prefoldin, chaperonin cofactor [Posttranslational
           modification, protein turnover, chaperones].
          Length = 119

 Score = 33.4 bits (77), Expect = 0.061
 Identities = 36/122 (29%), Positives = 57/122 (46%), Gaps = 19/122 (15%)

Query: 297 EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDD 356
           E   ++ QL+    QLQK V   KQ+ EA L E+E   +  E + E            D 
Sbjct: 7   EVQAQLAQLQQLQQQLQK-VILQKQQLEAQLKEIEKALEELEKLDE------------DA 53

Query: 357 ANFKLMTERIKSNQLHKLAREE-KDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLT 415
             +K +      N L K+++EE  D L E+ + L L+I+ +     KL+E+   LQ+ + 
Sbjct: 54  PVYKKV-----GNLLVKVSKEEAVDELEERKETLELRIKTLEKQEEKLQERLEELQSEIQ 108

Query: 416 NA 417
            A
Sbjct: 109 KA 110


>gnl|CDD|227606 COG5281, COG5281, Phage-related minor tail protein [Function
           unknown].
          Length = 833

 Score = 35.8 bits (82), Expect = 0.066
 Identities = 34/160 (21%), Positives = 54/160 (33%), Gaps = 18/160 (11%)

Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQE 342
           ++      K L   +          +    QK +    +E+ ALL E          +QE
Sbjct: 484 QEQMTAALKALLAFQQQIADLSGAKEKASDQKSLLWKAEEQYALLKEE----AKQRQLQE 539

Query: 343 QNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRK 402
           Q +  L + +++        T    S QL +L  ++ D      QA   Q E      R+
Sbjct: 540 QKA--LLEHKKE--------TLEYTS-QLAELLDQQADRFELSAQAAGSQKERGSDLYRE 588

Query: 403 LEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSA 442
              +       L  A  EL     A+DL +      A SA
Sbjct: 589 ALAQ---NAAALNKALNELAAYWSALDLLQGDWKAGALSA 625


>gnl|CDD|191111 pfam04849, HAP1_N, HAP1 N-terminal conserved region.  This family
           represents an N-terminal conserved region found in
           several huntingtin-associated protein 1 (HAP1)
           homologues. HAP1 binds to huntingtin in a polyglutamine
           repeat-length-dependent manner. However, its possible
           role in the pathogenesis of Huntington's disease is
           unclear. This family also includes a similar N-terminal
           conserved region from hypothetical protein products of
           ALS2CR3 genes found in the human juvenile amyotrophic
           lateral sclerosis critical region 2q33-2q34.
          Length = 307

 Score = 35.2 bits (81), Expect = 0.067
 Identities = 27/144 (18%), Positives = 67/144 (46%), Gaps = 9/144 (6%)

Query: 289 ERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLN----EMEVTGQAFEDMQEQN 344
           E+ KL +EE       L  +   L+ +  T++++E+ L+N    ++         + E+ 
Sbjct: 168 EKLKLLEEENEH----LRSEASHLKTETVTYEEKEQQLVNDCVKQLREANDQIASLSEEL 223

Query: 345 SRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLE 404
           ++  + L  + +    L+++ +   +  K    E + L++ + A       +   +++L+
Sbjct: 224 AKKTEDLERQQEEITHLLSQIVDLQKKCKSYALENEELQQHLAAAKDAQRQLQAELQELQ 283

Query: 405 EKERFLQTVLTNAEKEL-HLRNQA 427
           +K      +L  A++EL +LRN+ 
Sbjct: 284 DKYAECMEMLHEAQEELKNLRNKG 307


>gnl|CDD|214699 smart00503, SynN, Syntaxin N-terminal domain.  Three-helix domain
           that (in Sso1p) slows the rate of its reaction with the
           SNAP-25 homologue Sec9p.
          Length = 117

 Score = 33.5 bits (77), Expect = 0.076
 Identities = 15/88 (17%), Positives = 38/88 (43%), Gaps = 5/88 (5%)

Query: 302 IKQLEDQTHQLQKQVATHKQE---EEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDAN 358
           I+++     +LQK           ++ L  ++E      + + ++    L++L +++  N
Sbjct: 17  IQKISQNVAELQKLHEELLTPPDADKELREKLERLIDDIKRLAKEIRAKLKELEKENLEN 76

Query: 359 FKLMTE--RIKSNQLHKLAREEKDTLRE 384
               +   R +  Q  KL ++ K+ + E
Sbjct: 77  RASGSASDRTRKAQTEKLRKKFKEVMNE 104


>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D.  BicD
           proteins consist of three coiled-coiled domains and are
           involved in dynein-mediated minus end-directed transport
           from the Golgi apparatus to the endoplasmic reticulum
           (ER). For full functioning they bind with GSK-3beta
           pfam05350 to maintain the anchoring of microtubules to
           the centromere. It appears that amino-acid residues
           437-617 of BicD and the kinase activity of GSK-3 are
           necessary for the formation of a complex between BicD
           and GSK-3beta in intact cells.
          Length = 711

 Score = 35.6 bits (82), Expect = 0.077
 Identities = 36/160 (22%), Positives = 77/160 (48%), Gaps = 15/160 (9%)

Query: 294 ADEEAMR-KIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLR 352
           + E     +I +L+ +  QL+ +++  + E E L +  +   +  E ++ Q  RL  +++
Sbjct: 27  SKEAYYLQRILELQAELKQLRAELSNVQAENERLSSLSQELKEENEMLELQRGRLRDEIK 86

Query: 353 EKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQAL---SLQIEAMHTAIRKLEEKERF 409
           E      +L+ +    ++L     EE  +L++QV  L    ++ E +   IR+LEE+   
Sbjct: 87  EYKFREARLLQD---YSEL----EEENISLQKQVSVLRQSQVEFEGLKHEIRRLEEETEL 139

Query: 410 LQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLHL 449
           L + L  A +   L+  A +    +A+E+ ++  + K  L
Sbjct: 140 LNSQLEEAAR---LKEIA-EKQLEEALETLKTEREQKNAL 175



 Score = 35.2 bits (81), Expect = 0.092
 Identities = 28/138 (20%), Positives = 63/138 (45%), Gaps = 12/138 (8%)

Query: 297 EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDD 356
            A+R ++  ++Q   L  +      E+    +  EV     E ++ +    + ++ E   
Sbjct: 311 NALRNLQASKEQQDGLDSEKDRGSHEDG---DYYEVDINGPEILECKYRVAVSEVGELKA 367

Query: 357 ANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEK----ERFLQT 412
              +L   + + N+L +   +EKD LR ++Q L+ ++ ++  +  + +E+    E+ L+ 
Sbjct: 368 ---ELKALKARYNELQERYEDEKDRLRNEIQNLAEKLLSLEKSSHEDQERSGDLEKELRK 424

Query: 413 VLTNA-EKELHLRNQAMD 429
           +   A E +  L N A D
Sbjct: 425 LTHVAGESQGSL-NAAQD 441



 Score = 29.8 bits (67), Expect = 3.8
 Identities = 50/241 (20%), Positives = 84/241 (34%), Gaps = 39/241 (16%)

Query: 291 KKLAD---EEAMRKIKQLEDQTHQLQKQVATHKQEEEAL-LNEME--VTGQAF----EDM 340
           K++A+   EEA+  +K   +Q + L+K+++ H   E+   L+ +   + G  F    E  
Sbjct: 151 KEIAEKQLEEALETLKTEREQKNALRKELSHHLNLEDFYSLSNLSISLDGLKFSEDPEAS 210

Query: 341 QEQNSRL-LQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLRE----QVQALSLQIEA 395
            E N+    +                 +  +L   A    D   E    ++Q L  Q+E 
Sbjct: 211 TEPNNDGEDENGHLNGGPGLAKSLGTPRKGELFPPAPGVSDLFSELNLSEIQKLKQQLEQ 270

Query: 396 MHTAIRKLEEKERFLQTVLTNAEKEL------------HLRNQAMDLNKRKAIESAQSAA 443
           +      L    + LQT L  +E  L            HL         ++  +   S  
Sbjct: 271 VEREKASLLANLQELQTQLEQSEGALSEQHEKVNRLTEHLNALRNLQASKEQQDGLDSEK 330

Query: 444 DLKLHLVIIY----TNGPARLEFGKDREFSAKFVNFRRKSYFFVEFLDCLRTRYETRQRK 499
           D   H    Y     NGP  LE  K R   ++    + +       L  L+ RY   Q +
Sbjct: 331 DRGSHEDGDYYEVDINGPEILE-CKYRVAVSEVGELKAE-------LKALKARYNELQER 382

Query: 500 C 500
            
Sbjct: 383 Y 383


>gnl|CDD|192486 pfam10224, DUF2205, Predicted coiled-coil protein (DUF2205).  This
           entry represent a highly conserved 100 residue region
           which is likely to be a coiled-coil structure. The exact
           function is unknown.
          Length = 80

 Score = 32.5 bits (74), Expect = 0.079
 Identities = 19/62 (30%), Positives = 29/62 (46%), Gaps = 7/62 (11%)

Query: 362 MTERIKSNQLHKLAREEKDTLREQV-------QALSLQIEAMHTAIRKLEEKERFLQTVL 414
           M+ R  S  L  L +E +  L EQ        + LS ++EA+     KL  + +FLQ  +
Sbjct: 1   MSPRRDSEDLEALEKEARQRLIEQALELQDSLEDLSQRVEAVKEENLKLRSENQFLQQYI 60

Query: 415 TN 416
            N
Sbjct: 61  EN 62


>gnl|CDD|218661 pfam05622, HOOK, HOOK protein.  This family consists of several
           HOOK1, 2 and 3 proteins from different eukaryotic
           organisms. The different members of the human gene
           family are HOOK1, HOOK2 and HOOK3. Different domains
           have been identified in the three human HOOK proteins,
           and it was demonstrated that the highly conserved
           NH2-domain mediates attachment to microtubules, whereas
           the central coiled-coil motif mediates homodimerisation
           and the more divergent C-terminal domains are involved
           in binding to specific organelles (organelle-binding
           domains). It has been demonstrated that endogenous HOOK3
           binds to Golgi membranes, whereas both HOOK1 and HOOK2
           are localised to discrete but unidentified cellular
           structures. In mice the Hook1 gene is predominantly
           expressed in the testis. Hook1 function is necessary for
           the correct positioning of microtubular structures
           within the haploid germ cell. Disruption of Hook1
           function in mice causes abnormal sperm head shape and
           fragile attachment of the flagellum to the sperm head.
          Length = 713

 Score = 35.2 bits (81), Expect = 0.081
 Identities = 38/177 (21%), Positives = 79/177 (44%), Gaps = 27/177 (15%)

Query: 288 DERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDM---QEQN 344
           +   KL  EE+ +  K LE +  +L++++   ++E+E LL E +   +  E++   Q Q 
Sbjct: 374 ELHAKL-SEESKKADK-LEFEYKRLEEKLEALQKEKERLLAERDSLRETNEELRCGQAQQ 431

Query: 345 SRLLQQL-----------REKDDANFKLMTERIK-SNQLHKLAREEKDTLREQVQALSLQ 392
            +L Q              E   + ++    R++  N++ +L +E  +   E++  L   
Sbjct: 432 DQLTQADAGTSPSGDNLAAELLPSEYREKLIRLQHENKMLRLGQEGSE--NERITELQQL 489

Query: 393 IEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLHL 449
           +E  +    +LE + R         ++ L L+ Q  DL  +KA++   S ++    L
Sbjct: 490 LEDANRRNNELETQLR------LANQRILELQQQVEDL--QKALQEQGSKSEDSSLL 538



 Score = 33.6 bits (77), Expect = 0.27
 Identities = 35/146 (23%), Positives = 63/146 (43%), Gaps = 28/146 (19%)

Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
           E+A R+  +LE Q     +++   +Q+ E L   ++  G   ED     S+L + L    
Sbjct: 491 EDANRRNNELETQLRLANQRILELQQQVEDLQKALQEQGSKSEDSSLLKSKLEEHLE--- 547

Query: 356 DANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLT 415
                         QLH+ A EE    REQ++   L+ +      RK+ E    L+  L 
Sbjct: 548 --------------QLHE-ANEELQKKREQIE--ELEPDQDQNLSRKIAE----LEAALQ 586

Query: 416 NAEKELHLRNQAMDLNKRKAIESAQS 441
             ++++    +AM+   +K +E A+ 
Sbjct: 587 KKDEDM----RAMEERYKKYVEKARE 608



 Score = 33.6 bits (77), Expect = 0.30
 Identities = 46/198 (23%), Positives = 79/198 (39%), Gaps = 34/198 (17%)

Query: 260 RSLSDSASNIYGSLEFRKFACVVRDSKRDERKKLAD-EEAMRKIKQLEDQTHQLQKQVAT 318
              S  +    G+L+            +  +K L D +EA  +  +L  + H+L KQV  
Sbjct: 157 EQGSSPSRESAGNLD------------QQLKKALEDLKEAQEEKDELAQRCHELDKQVLL 204

Query: 319 HKQEEEALLNEMEVTGQAFEDM--------QEQNSRLLQ---QLREKDDANFKLMT---- 363
            ++E+ +L  E E   +    +        Q  + +      QL +  + NF+L      
Sbjct: 205 LQEEKNSLQQENEKLQERLAQLEGSSLGPNQLGSKKYNLLQSQLEQLQEENFRLEAARDD 264

Query: 364 ERIKSNQLHKLARE------EKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNA 417
            RIK  +L K   E      E  +L  + QAL  +I+ +  +  K ++ E  ++T     
Sbjct: 265 YRIKCEELEKELAELQHRNDELTSLAAESQALKDEIDVLRESSDKAKKLEAQVETYKKKL 324

Query: 418 EKELHLRNQAMDLNKRKA 435
           E    LR Q   L +R A
Sbjct: 325 EDLNDLRRQVKLLEERNA 342



 Score = 32.1 bits (73), Expect = 0.81
 Identities = 37/166 (22%), Positives = 76/166 (45%), Gaps = 29/166 (17%)

Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALL-NEMEVTGQAFEDMQ 341
           R+++ + + +LA++  +   +Q+ED    LQ+Q    K E+ +LL +++E   +   +  
Sbjct: 496 RNNELETQLRLANQRILELQQQVEDLQKALQEQ--GSKSEDSSLLKSKLEEHLEQLHEAN 553

Query: 342 EQNSRLLQQLRE---------------------KDDANFKLMTERIKSNQLHKLAREEKD 380
           E+  +  +Q+ E                     K D + + M ER K   + K ARE   
Sbjct: 554 EELQKKREQIEELEPDQDQNLSRKIAELEAALQKKDEDMRAMEERYKK-YVEK-AREVIK 611

Query: 381 TLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQ 426
           TL  +    S +I+ +     +L E+++ ++ + +  EK   +R Q
Sbjct: 612 TLDPKQNPASPEIQLLKK---QLTERDKRIRHLESEYEKAKPMREQ 654


>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
          Length = 1486

 Score = 35.3 bits (82), Expect = 0.086
 Identities = 27/110 (24%), Positives = 45/110 (40%), Gaps = 21/110 (19%)

Query: 288 DERKKLADEEAM---RKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQN 344
            E    A E  M   ++++QL+ +  +L  +       ++AL    E +G+ FED Q+  
Sbjct: 571 SESVSEARERRMALRQQLEQLQARIQRLAARAPAWLAAQDALARLREQSGEEFEDSQDVT 630

Query: 345 SRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIE 394
             + Q L            ER +          E+D L  + QAL  +IE
Sbjct: 631 EYMQQLL------------ERERE------LTVERDELAARKQALDEEIE 662



 Score = 30.7 bits (70), Expect = 2.2
 Identities = 26/129 (20%), Positives = 52/129 (40%), Gaps = 29/129 (22%)

Query: 287  RDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSR 346
            R++  +   EEA R ++Q  +   QL+  V+  + + E    + E   Q ++  Q+    
Sbjct: 900  REQLDEA--EEAKRFVQQHGNALAQLEPIVSVLQSDPE----QFEQLKQDYQQAQQTQRD 953

Query: 347  LLQQLREKDD-----ANFKL------------MTERIKSNQLHKLAREEKDTLREQVQAL 389
              QQ     +     A+F              + E+++  Q  + A +E+   REQ++  
Sbjct: 954  AKQQAFALTEVVQRRAHFSYEDAAEMLAKNSDLNEKLR--QRLEQAEQERTRAREQLR-- 1009

Query: 390  SLQIEAMHT 398
              Q +A   
Sbjct: 1010 --QAQAQLA 1016



 Score = 30.7 bits (70), Expect = 2.6
 Identities = 29/143 (20%), Positives = 56/143 (39%), Gaps = 30/143 (20%)

Query: 290  RKKLADEEAMR-----KIKQLEDQTHQ-------LQKQVATHKQEEEALLNEMEVTG-QA 336
            R++L   E  R     +++Q + Q  Q       L+      +Q  + L  E++  G  A
Sbjct: 991  RQRLEQAEQERTRAREQLRQAQAQLAQYNQVLASLKSSYDAKRQMLQELKQELQDLGVPA 1050

Query: 337  FEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAM 396
                +E+     +  R++  A   L   R + NQL K           Q+     +++ +
Sbjct: 1051 DSGAEER----ARARRDELHAR--LSANRSRRNQLEK-----------QLTFCEAEMDNL 1093

Query: 397  HTAIRKLEEKERFLQTVLTNAEK 419
               +RKLE     ++  + NA+ 
Sbjct: 1094 TKKLRKLERDYHEMREQVVNAKA 1116


>gnl|CDD|222465 pfam13942, Lipoprotein_20, YfhG lipoprotein.  This family includes
           the YfhG protein from E. coli. Members of this family
           have an N-terminal lipoprotein attachment site. The
           members of this family are functionally uncharacterized.
          Length = 180

 Score = 34.2 bits (79), Expect = 0.092
 Identities = 28/81 (34%), Positives = 38/81 (46%), Gaps = 11/81 (13%)

Query: 347 LLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEK 406
           LLQ  RE      +L  ER +  +L + +  E D LREQ   L  ++E   T  RKLE  
Sbjct: 100 LLQLWREGQVLQLQLAEERARYQRLQQSSDSELDRLREQQARLQYELE---TTTRKLEN- 155

Query: 407 ERFLQTVLTNAEKELHLRNQA 427
                  LT+ E++L  R Q 
Sbjct: 156 -------LTDIERQLSSRKQG 169


>gnl|CDD|182330 PRK10246, PRK10246, exonuclease subunit SbcC; Provisional.
          Length = 1047

 Score = 35.2 bits (81), Expect = 0.095
 Identities = 36/138 (26%), Positives = 55/138 (39%), Gaps = 23/138 (16%)

Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQ---VATHKQEEEALLNEMEVTGQAFED 339
            D+   E KKL +E A  +  QL+  T QLQ+      + +QEE+AL  +          
Sbjct: 532 LDALEKEVKKLGEEGAALR-GQLDALTKQLQRDESEAQSLRQEEQALTQQW--------- 581

Query: 340 MQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTA 399
            Q   + L   L+ +DD    L  +     QL  L+         Q   L  QI A +  
Sbjct: 582 -QAVCASLNITLQPQDDIQPWLDAQEEHERQLRLLS---------QRHELQGQIAAHNQQ 631

Query: 400 IRKLEEKERFLQTVLTNA 417
           I + +++    Q  L  A
Sbjct: 632 IIQYQQQIEQRQQQLLTA 649



 Score = 32.9 bits (75), Expect = 0.56
 Identities = 36/176 (20%), Positives = 61/176 (34%), Gaps = 39/176 (22%)

Query: 308 QTHQLQKQVATHKQEEEALLNEMEVTG-QAFEDMQEQNSRLLQQLREKDDANFKLMTERI 366
            T Q  ++V T  Q   AL   +     +   ++Q Q   L   L E D   F+     +
Sbjct: 308 HTRQQIEEVNTRLQSTMALRARIRHHAAKQSAELQAQQQSLNTWLAEHD--RFRQWNNEL 365

Query: 367 ------------KSNQLHKLA------REEKDTL---------REQVQALSLQIE----- 394
                          QL +         ++ + L          E   AL+   E     
Sbjct: 366 AGWRAQFSQQTSDREQLRQWQQQLTHAEQKLNALPAITLTLTADEVAAALAQHAEQRPLR 425

Query: 395 ----AMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLK 446
               A+H  I   +++   LQ  + N  +E   RN A++  +++  E  Q  AD+K
Sbjct: 426 QRLVALHGQIVPQQKRLAQLQVAIQNVTQEQTQRNAALNEMRQRYKEKTQQLADVK 481



 Score = 32.1 bits (73), Expect = 0.85
 Identities = 24/96 (25%), Positives = 41/96 (42%), Gaps = 5/96 (5%)

Query: 294 ADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLRE 353
           A EE  R+++ L  Q H+LQ Q+A H Q+      ++E   Q         +  L Q   
Sbjct: 604 AQEEHERQLRLLS-QRHELQGQIAAHNQQIIQYQQQIEQRQQQLLTALAGYALTLPQ--- 659

Query: 354 KDDANFKLMTERIKSNQLHKLAREEKDTLREQVQAL 389
            +D     +  R +  Q  +  + E   L+ ++Q L
Sbjct: 660 -EDEEASWLATRQQEAQSWQQRQNELTALQNRIQQL 694



 Score = 32.1 bits (73), Expect = 0.99
 Identities = 17/81 (20%), Positives = 37/81 (45%), Gaps = 10/81 (12%)

Query: 372 HKLAREEKDTLREQVQALSL----QIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQA 427
           HK AR E + L+ Q   ++L    Q++++  +++ L ++E+ L T     ++ L+     
Sbjct: 193 HKSARTELEKLQAQASGVALLTPEQVQSLTASLQVLTDEEKQLLTAQQQQQQSLNW---- 248

Query: 428 MDLNKRKAIESAQSAADLKLH 448
             L +   ++   S     L 
Sbjct: 249 --LTRLDELQQEASRRQQALQ 267


>gnl|CDD|192445 pfam10018, Med4, Vitamin-D-receptor interacting Mediator subunit 4.
            Members of this family function as part of the Mediator
           (Med) complex, which links DNA-bound transcriptional
           regulators and the general transcription machinery,
           particularly the RNA polymerase II enzyme. They play a
           role in basal transcription by mediating activation or
           repression according to the specific complement of
           transcriptional regulators bound to the promoter.
          Length = 176

 Score = 33.5 bits (77), Expect = 0.12
 Identities = 18/94 (19%), Positives = 37/94 (39%), Gaps = 20/94 (21%)

Query: 347 LLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEK 406
           L + L   DD     +   +K  + H+  ++    LR +V++L  +I             
Sbjct: 3   LAEDLISADDE----LKSALKELEEHQENQQRILELRAEVESLDEKI------------- 45

Query: 407 ERFLQTVLTNAEKELH--LRNQAMDLNKRKAIES 438
            + +   L  AEKEL       +++   ++ + S
Sbjct: 46  -KDILKELKEAEKELRTLPDLSSINKANKRKVSS 78


>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).
           This family consists of several plant proteins of
           unknown function. Several sequences in this family are
           described as being "myosin heavy chain-like".
          Length = 484

 Score = 34.6 bits (79), Expect = 0.12
 Identities = 39/165 (23%), Positives = 72/165 (43%), Gaps = 21/165 (12%)

Query: 287 RDERKKLADEEAMRKI-KQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
              +K+++   ++  + K+LE+    L+K      +EE++L N +E   Q  E+ +++  
Sbjct: 242 ESLQKEISIMASVASVLKELEEAKANLEKA----AEEEKSLRNLVESLKQELEEEKKELE 297

Query: 346 RLLQQLREKDDANFKLMTERIKSNQ-LHKLAREEKDTLREQVQALSLQIE-------AMH 397
            L ++  E ++A   L  E  ++       ARE+   +  ++Q  S + E       A  
Sbjct: 298 ELREKEGEAEEAASSLEAELNRTKSEKESKAREKMVEIPLKLQQASSEAEEARKEAEAAR 357

Query: 398 TAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSA 442
             +RKL  KE   QT       EL L          K  E+A++A
Sbjct: 358 EELRKL--KEEAEQTKAALETAELRLEAA------LKEAEAAKAA 394



 Score = 33.8 bits (77), Expect = 0.20
 Identities = 35/171 (20%), Positives = 66/171 (38%), Gaps = 37/171 (21%)

Query: 286 KRDERKKLADEEAMRKIKQL-------EDQTHQLQKQVATHKQEEEALLNEMEVTGQAFE 338
           KR E    A +   +K+++L       ++   +        ++E      E +   + +E
Sbjct: 149 KRAEEAICASKVNEKKVEELTKEIIAMKESLERAHAAHLEAEEERIGAALEKDQDRETYE 208

Query: 339 DMQEQNSRLLQQLREKDDANF---KLMT-------------------------ERIKSNQ 370
              ++  + L++L++  D      KL                           E  K+N 
Sbjct: 209 KELKEAEKELERLKQDLDPEKDLEKLAEASAELESLQKEISIMASVASVLKELEEAKAN- 267

Query: 371 LHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKEL 421
           L K A EEK +LR  V++L  ++E     + +L EKE   +   ++ E EL
Sbjct: 268 LEKAAEEEK-SLRNLVESLKQELEEEKKELEELREKEGEAEEAASSLEAEL 317



 Score = 32.3 bits (73), Expect = 0.59
 Identities = 26/139 (18%), Positives = 53/139 (38%), Gaps = 18/139 (12%)

Query: 285 SKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNE----MEVTGQAFEDM 340
           S+++ + +    E   K++Q   +  +        ++E EA   E     E   Q    +
Sbjct: 322 SEKESKAREKMVEIPLKLQQASSEAEE-------ARKEAEAAREELRKLKEEAEQTKAAL 374

Query: 341 QEQNSRLLQQLREKDDANF--KLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHT 398
           +    RL   L+E + A     L    IK+ Q  + + +      +  + ++L +E    
Sbjct: 375 ETAELRLEAALKEAEAAKAAEALALAEIKALQESEESAKA-----DSPRKITLSLEEYEE 429

Query: 399 AIRKLEEKERFLQTVLTNA 417
             +K EE E   +  +  A
Sbjct: 430 LSKKAEEAEELAEKKVAAA 448



 Score = 32.3 bits (73), Expect = 0.67
 Identities = 39/183 (21%), Positives = 77/183 (42%), Gaps = 8/183 (4%)

Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQ-KQVATHKQEEEALLNEMEVTGQAFEDMQ 341
           +  K +++ K   E A  + ++LE    +L+ ++  T   E +++  E+    Q ++ + 
Sbjct: 82  KAEKEEQQAKQDSELAKLRAEELEQGIQELEVERYITATAELDSVKEELRKIRQEYDALV 141

Query: 342 EQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIR 401
           E+    L++  E   A         K  +L K     K++L E+  A  L+ E       
Sbjct: 142 EERDAALKRAEE---AICASKVNEKKVEELTKEIIAMKESL-ERAHAAHLEAEEERIGAA 197

Query: 402 KLEEKER-FLQTVLTNAEKELHLRNQAMDLNK--RKAIESAQSAADLKLHLVIIYTNGPA 458
             ++++R   +  L  AEKEL    Q +D  K   K  E++     L+  + I+ +    
Sbjct: 198 LEKDQDRETYEKELKEAEKELERLKQDLDPEKDLEKLAEASAELESLQKEISIMASVASV 257

Query: 459 RLE 461
             E
Sbjct: 258 LKE 260



 Score = 31.1 bits (70), Expect = 1.8
 Identities = 37/163 (22%), Positives = 66/163 (40%), Gaps = 24/163 (14%)

Query: 292 KLADEEAMRKIKQL---EDQTHQL-------QKQVATHKQEEEALLNEMEVTGQAFEDMQ 341
            LA EEA +  K+L   E +  Q+       ++ V   K + E    E +   Q  E  +
Sbjct: 39  MLAQEEANKLKKELEVAEKEKLQVLKELESTKRTVEDLKLKLEKAEKEEQQAKQDSELAK 98

Query: 342 EQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIR 401
            +   L Q ++E +   +   T  + S       +EE   +R++  AL   +E    A++
Sbjct: 99  LRAEELEQGIQELEVERYITATAELDS------VKEELRKIRQEYDAL---VEERDAALK 149

Query: 402 KLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAAD 444
           + EE     +      E EL     AM    ++++E A +A  
Sbjct: 150 RAEEAICASKVNEKKVE-ELTKEIIAM----KESLERAHAAHL 187


>gnl|CDD|113156 pfam04375, HemX, HemX.  This family consists of several bacterial
           HemX proteins. The hemX gene is not essential for haem
           synthesis in B. subtilis. HemX is a polytopic membrane
           protein which by an unknown mechanism down-regulates the
           level of HemA.
          Length = 372

 Score = 34.5 bits (79), Expect = 0.14
 Identities = 19/85 (22%), Positives = 33/85 (38%), Gaps = 8/85 (9%)

Query: 335 QAFEDMQEQNSRL---LQQLREKDDANFKLMTERIKSNQLHKLAREEK--DTLREQVQAL 389
           Q    +Q +   L   L QL +  +A        +  +    LA+ EK    L + +  L
Sbjct: 53  QQVAGLQIKLEALAQELTQLAQALEAQLAQEVRVLIRDNQEALAQLEKVLGQLDKSLAEL 112

Query: 390 SLQIEAMHTAIRK---LEEKERFLQ 411
             ++ A+  A R    L E +  L+
Sbjct: 113 QDRVAAISGANRNDWLLAEADFLLK 137


>gnl|CDD|151039 pfam10473, Cenp-F_leu_zip, Leucine-rich repeats of kinetochore
           protein Cenp-F/LEK1.  Cenp-F, a centromeric kinetochore,
           microtubule-binding protein consisting of two
           1,600-amino acid-long coils, is essential for the full
           functioning of the mitotic checkpoint pathway. There are
           several leucine-rich repeats along the sequence of LEK1
           that are considered to be zippers, though they do not
           appear to be binding DNA directly in this instance.
          Length = 140

 Score = 32.8 bits (75), Expect = 0.14
 Identities = 28/136 (20%), Positives = 60/136 (44%), Gaps = 13/136 (9%)

Query: 276 RKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQK-------QVATHKQEEEALLN 328
           +K   V+   K  ER+    ++   K++ LE +    ++       +    K E E L  
Sbjct: 3   KKQLHVLEKLKESEREADLLKD---KVENLERELEMSEENQELVILEAENSKAEVETLKT 59

Query: 329 EMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQA 388
           ++E   ++ +D++     L+    EK++   +L  ++ + ++L KL    ++ L E+ Q 
Sbjct: 60  KIEEMAESLKDLE---LDLVTVRSEKENLTKQLQEKQERVSELEKLNSSTENLLEEKEQE 116

Query: 389 LSLQIEAMHTAIRKLE 404
                E   TA+  L+
Sbjct: 117 KIQMKEESKTAVEMLQ 132


>gnl|CDD|204784 pfam11932, DUF3450, Protein of unknown function (DUF3450).  This
           family of proteins are functionally uncharacterized.
           This protein is found in bacteria and eukaryotes.
           Proteins in this family are about 260 amino acids in
           length.
          Length = 250

 Score = 33.7 bits (78), Expect = 0.15
 Identities = 16/63 (25%), Positives = 30/63 (47%), Gaps = 3/63 (4%)

Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEAL---LNEMEVTGQAFEDMQEQNSRLLQQLR 352
            + +++I+ L     QLQ+ VA  +QE  +L   + ++E T Q    +  +    L+Q  
Sbjct: 60  RQLLKEIENLRVYNDQLQRLVANQQQEIASLQQQIEQIEKTRQGLVPLMYRMIDGLEQFV 119

Query: 353 EKD 355
             D
Sbjct: 120 ALD 122



 Score = 29.5 bits (67), Expect = 4.0
 Identities = 12/63 (19%), Positives = 28/63 (44%)

Query: 367 KSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQ 426
           ++      ++++ D   +Q Q L  +I  +   I  L      LQ ++ N ++E+    Q
Sbjct: 33  QTVAAAAQSQKKIDKWADQTQELLAEIRQLLKEIENLRVYNDQLQRLVANQQQEIASLQQ 92

Query: 427 AMD 429
            ++
Sbjct: 93  QIE 95


>gnl|CDD|238103 cd00176, SPEC, Spectrin repeats, found in several proteins involved
           in cytoskeletal structure; family members include
           spectrin, alpha-actinin and dystrophin; the spectrin
           repeat forms a three helix bundle with the second helix
           interrupted by proline in some sequences; the repeats
           are independent folding units; tandem repeats are found
           in differing numbers and arrange in an antiparallel
           manner to form dimers; the repeats are defined by a
           characteristic tryptophan (W) residue in helix A and a
           leucine (L) at the carboxyl end of helix C and separated
           by a linker of 5 residues; two copies of the repeat are
           present here.
          Length = 213

 Score = 33.6 bits (77), Expect = 0.15
 Identities = 26/131 (19%), Positives = 53/131 (40%), Gaps = 21/131 (16%)

Query: 287 RDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSR 346
            ++ + L+  +    ++ +E     L+K         EAL  E+    +  E + E   +
Sbjct: 17  SEKEELLSSTDYGDDLESVEAL---LKKH--------EALEAELAAHEERVEALNELGEQ 65

Query: 347 LLQQLREKDDANFKLMTERIKS-----NQLHKLAREEKDTLREQVQALSLQIEAMHTAIR 401
           L+++     +     + ER++       +L +LA E +  L E +       +A     +
Sbjct: 66  LIEEGHPDAEE----IQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADD-LEQ 120

Query: 402 KLEEKERFLQT 412
            LEEKE  L +
Sbjct: 121 WLEEKEAALAS 131


>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding.  This family
           is found in mammals where it is localised at cell-cell
           adherens junctions, and in Sch. pombe and other fungi
           where it anchors spindle-pole bodies to spindle
           microtubules. It is a coiled-coil structure, and in
           pombe, it is required for anchoring the minus end of
           spindle microtubules to the centrosome equivalent, the
           spindle-pole body. The name ADIP derives from the family
           being composed of Afadin- and alpha -Actinin-Binding
           Proteins Localised at Cell-Cell Adherens Junctions.
          Length = 149

 Score = 33.0 bits (76), Expect = 0.15
 Identities = 28/104 (26%), Positives = 47/104 (45%), Gaps = 12/104 (11%)

Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
            LLQQ     D  F+   E      L KL   E + L+  ++ L  Q+E +   +  L+ 
Sbjct: 40  SLLQQRDR--DLEFRESLEE----TLRKL-EAEIERLQNTIERLKTQLEDLERELALLQA 92

Query: 406 KERFLQTVLTNAEKELHLRNQAMDLNK-RKAIES--AQSAADLK 446
           KER L+  L   E+   L+N+  ++ + +  I+    Q   +LK
Sbjct: 93  KERQLEKKLKTLEQ--KLKNEKEEVQRLKNIIQQRKTQYNHELK 134



 Score = 29.6 bits (67), Expect = 2.4
 Identities = 21/102 (20%), Positives = 51/102 (50%), Gaps = 19/102 (18%)

Query: 290 RKKLADEEAMRK-IKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLL 348
           RK  A+ E ++  I++L+ Q   L++++A  + +E  L              +++   L 
Sbjct: 60  RKLEAEIERLQNTIERLKTQLEDLERELALLQAKERQL--------------EKKLKTLE 105

Query: 349 QQLR-EKDDANFKLMT-ERIKSNQLHKLAREEK--DTLREQV 386
           Q+L+ EK++        ++ K+   H+L + ++  + L+E++
Sbjct: 106 QKLKNEKEEVQRLKNIIQQRKTQYNHELKKRDREIEKLKERL 147


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 34.3 bits (79), Expect = 0.16
 Identities = 37/173 (21%), Positives = 56/173 (32%), Gaps = 33/173 (19%)

Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
           +RD+ K+LA E      K+LE +  +L+ ++   + E E L         A E    Q  
Sbjct: 276 ERDKIKQLASE----LEKKLEKELKKLENKLEKQEDELEELEKA------AEEL--RQKG 323

Query: 346 RLLQQ--------LREKDDANFKLMTERIKSNQLHKLAREEKDT-------LREQVQALS 390
            LL          L+    A+F    E        K   E           L+     L 
Sbjct: 324 ELLYANLQLIEEGLKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLD 383

Query: 391 LQIEAMHTAIRKLEEKERFLQT-----VLTNAEKELHLRNQ-AMDLNKRKAIE 437
            Q+  +  AI   E  +  L+       +    +EL           KRK  E
Sbjct: 384 RQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKE 436


>gnl|CDD|221776 pfam12795, MscS_porin, Mechanosensitive ion channel porin domain.
           The small mechanosensitive channel, MscS, is a part of
           the turgor-driven solute efflux system that protects
           bacteria from lysis in the event of osmotic shock. The
           MscS protein alone is sufficient to form a functional
           mechanosensitive channel gated directly by tension in
           the lipid bilayer. The MscS proteins are heptamers of
           three transmembrane subunits with seven converging M3
           domains, and this MscS_porin is towards the N-terminal
           of the molecules. The high concentration of negative
           charges at the extracellular entrance of the pore helps
           select the cations for efflux.
          Length = 239

 Score = 33.8 bits (78), Expect = 0.16
 Identities = 27/168 (16%), Positives = 64/168 (38%), Gaps = 23/168 (13%)

Query: 284 DSKRDERKKLAD-----EEAMRKIKQLEDQTHQLQKQVATHKQEEEAL-LNEMEVTGQAF 337
           D     ++K        ++A +++++L  +   L+K  A    E   L L+++E      
Sbjct: 31  DEIDKSKQKADQYQKQIDDAPKELRELRQELEALKKTDAPVFPELANLSLSQLE------ 84

Query: 338 EDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMH 397
           + + +  S+L +   +    N +L+  + +  +    A+++    R ++Q    +I    
Sbjct: 85  QRLAQTLSQLQELQEQLQQENSQLIELQTRPER----AQQQLSEARRRLQ----EIRNRL 136

Query: 398 TAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADL 445
            A+          Q  L  A  EL      ++      + S  +  +L
Sbjct: 137 QALSPGGTPLAQAQRTLLQA--ELAALKAQIEEL-ELELLSNNNRQEL 181


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 33.7 bits (77), Expect = 0.16
 Identities = 19/85 (22%), Positives = 39/85 (45%), Gaps = 8/85 (9%)

Query: 337 FEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAM 396
            E++Q++   LL++L E + A ++ + ER+K          E   L E ++ L  ++  +
Sbjct: 144 LEELQKEKEELLKELEELE-AEYEEVQERLKR------LEVENSRLEEMLKKLPGEVYDL 196

Query: 397 HTAIRKLEEKERFLQTVL-TNAEKE 420
                +LE      +  L ++  KE
Sbjct: 197 KKRWDELEPGVELPEEELISDLVKE 221


>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
           This family consists of a number of sequences found in
           Arabidopsis thaliana, Oryza sativa and Lycopersicon
           esculentum (Tomato). The function of this family is
           unknown.
          Length = 767

 Score = 34.5 bits (79), Expect = 0.17
 Identities = 32/163 (19%), Positives = 64/163 (39%), Gaps = 41/163 (25%)

Query: 287 RDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSR 346
           R+E+++   + A  K K+ E    +L+K++   ++E      E              N  
Sbjct: 65  REEQEQKIQDAATEKTKEWEKVKAELEKKLLELQKELARAAAE--------------NDA 110

Query: 347 LLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEK 406
           L + L+EK     +L               EEK     +++ L  ++E+       L+ +
Sbjct: 111 LSRSLQEKSKMIMEL--------------SEEKSRAESEIEDLKARLESAEKENSSLKYE 156

Query: 407 ERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLHL 449
              L        KEL +RN+  +++  ++ ++A      K HL
Sbjct: 157 VHVLS-------KELEIRNEERNMS-LQSADAAS-----KQHL 186


>gnl|CDD|214436 MTH00151, ND5, NADH dehydrogenase subunit 5; Provisional.
          Length = 565

 Score = 34.0 bits (79), Expect = 0.18
 Identities = 16/76 (21%), Positives = 22/76 (28%), Gaps = 7/76 (9%)

Query: 180 CFSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTIFVYHCFDIFSLT---S 236
             S   FS S               S +F    + +   SS +       D  SL+    
Sbjct: 6   LSSFLLFSYSVFLFPLLGYFGLNGISLIF---EWELLSVSSCSFSFELVLDWVSLSFSGV 62

Query: 237 TLFVSICTSAFQMSFY 252
              +S C   F  S Y
Sbjct: 63  VCLISGCVFLF-SSSY 77



 Score = 29.0 bits (66), Expect = 6.3
 Identities = 16/74 (21%), Positives = 21/74 (28%), Gaps = 3/74 (4%)

Query: 132 CFSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTV 191
             S   FS S               S +F    + +   SS +            S S V
Sbjct: 6   LSSFLLFSYSVFLFPLLGYFGLNGISLIF---EWELLSVSSCSFSFELVLDWVSLSFSGV 62

Query: 192 FVYHCFSIFRFSSS 205
                  +F FSSS
Sbjct: 63  VCLISGCVFLFSSS 76


>gnl|CDD|218549 pfam05308, Mito_fiss_reg, Mitochondrial fission regulator.  In
           eukaryotes, this family of proteins induces
           mitochondrial fission.
          Length = 248

 Score = 33.6 bits (77), Expect = 0.20
 Identities = 13/38 (34%), Positives = 23/38 (60%)

Query: 281 VVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVAT 318
           + ++  R +  K  +EEA++KI  LED+   L+ Q+A 
Sbjct: 103 LRQEEPRLQSLKPPNEEALQKISALEDELAFLRAQIAK 140


>gnl|CDD|219934 pfam08614, ATG16, Autophagy protein 16 (ATG16).  Autophagy is a
           ubiquitous intracellular degradation system for
           eukaryotic cells. During autophagy, cytoplasmic
           components are enclosed in autophagosomes and delivered
           to lysosomes/vacuoles. ATG16 (also known as Apg16) has
           been shown to be bind to Apg5 and is required for the
           function of the Apg12p-Apg5p conjugate in the yeast
           autophagy pathway.
          Length = 194

 Score = 33.3 bits (76), Expect = 0.21
 Identities = 31/135 (22%), Positives = 57/135 (42%), Gaps = 9/135 (6%)

Query: 307 DQTHQLQKQVATHKQEEEALLNEMEVTGQAFED-MQEQNSRLLQQLREKDDANFKLMTER 365
           +Q  QL  +    K E +AL  E      +               + E +    KL   R
Sbjct: 24  EQYRQLADRTRLLKVENDALQAEKYEQQSSHSSSPSADGPGSDAAIAEMEQKLAKL---R 80

Query: 366 IKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRN 425
            +  +LHK  R E   L +++  L+ ++E +   I++LE+    L++ +T+ E E+    
Sbjct: 81  EELTELHK-KRGE---LAQRLLLLNDELEQLRREIQQLEKTIAELRSEITSLETEIRDLR 136

Query: 426 QAMDLNKRKAIESAQ 440
           + +   K K  E+ Q
Sbjct: 137 EELQ-EKEKDNETLQ 150



 Score = 29.0 bits (65), Expect = 4.3
 Identities = 24/137 (17%), Positives = 57/137 (41%), Gaps = 24/137 (17%)

Query: 288 DERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRL 347
            +  KL +E      K+ E     L       +   E    E  +      +++ + + L
Sbjct: 74  QKLAKLREELTELHKKRGELAQRLLLLNDELEQLRREIQQLEKTI-----AELRSEITSL 128

Query: 348 LQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEK- 406
             ++R+  +     + E+ K N+          TL++++ +L++++ A+   +RKL+++ 
Sbjct: 129 ETEIRDLREE----LQEKEKDNE----------TLQDELISLNIELNALEEKLRKLQKEN 174

Query: 407 ----ERFLQTVLTNAEK 419
               ER++      AE 
Sbjct: 175 QELVERWMAKKGQEAEA 191


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 34.2 bits (79), Expect = 0.21
 Identities = 23/75 (30%), Positives = 40/75 (53%), Gaps = 7/75 (9%)

Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFE--DM 340
           +  +    + LA  E   K ++LE Q  QLQ++ A  +  +E      E+T QA +  ++
Sbjct: 175 QQQELVALEGLA-AELEEKQQELEAQLEQLQEKAA--ETSQERKQKRKEITDQAAKRLEL 231

Query: 341 QEQNSRLL--QQLRE 353
            E+ +R+L  QQLR+
Sbjct: 232 SEEETRILIDQQLRK 246



 Score = 31.8 bits (73), Expect = 1.1
 Identities = 25/108 (23%), Positives = 45/108 (41%), Gaps = 12/108 (11%)

Query: 306 EDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMT-- 363
           E+  H LQ++V T KQ+ E    E   +    E  Q++   L     E ++   +L    
Sbjct: 141 ENLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQL 200

Query: 364 ERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKE-RFL 410
           E+++       ++E K   +E     + ++E        L E+E R L
Sbjct: 201 EQLQEKAAET-SQERKQKRKEITDQAAKRLE--------LSEEETRIL 239


>gnl|CDD|236745 PRK10722, PRK10722, hypothetical protein; Provisional.
          Length = 247

 Score = 33.4 bits (77), Expect = 0.21
 Identities = 27/81 (33%), Positives = 37/81 (45%), Gaps = 11/81 (13%)

Query: 347 LLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEK 406
           L Q  R+       L  ER +  +L + +  E D LR+Q Q L  Q+E      RKLE  
Sbjct: 146 LYQLWRDGQALQLALAEERQRYQKLQQSSDSELDALRQQQQRLQYQLE---LTTRKLEN- 201

Query: 407 ERFLQTVLTNAEKELHLRNQA 427
                  LT+ E++L  R QA
Sbjct: 202 -------LTDIERQLSSRKQA 215


>gnl|CDD|189017 cd09610, M3B_PepF_5, Peptidase family M3B Oligopeptidase F (PepF). 
           Peptidase family M3B Oligopeptidase F (PepF;
           Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and
           includes oligoendopeptidase F from Lactococcus lactis.
           This enzyme hydrolyzes peptides containing between 7 and
           17 amino acids with fairly broad specificity. The PepF
           gene is duplicated in L. lactis on the plasmid that
           bears it, while a shortened second copy is found in
           Bacillus subtilis. Most bacterial PepFs are cytoplasmic
           endopeptidases; however, the PepF Bacillus
           amyloliquefaciens oligopeptidase is a secreted protein
           and may facilitate the process of sporulation.
           Specifically, the yjbG gene encoding the homolog of the
           PepF1 and PepF2 oligoendopeptidases of Lactococcus
           lactis has been identified in Bacillus subtilis as an
           inhibitor of sporulation initiation when over expressed
           from a multicopy plasmid.
          Length = 472

 Score = 33.7 bits (78), Expect = 0.22
 Identities = 36/148 (24%), Positives = 56/148 (37%), Gaps = 29/148 (19%)

Query: 289 ERKKLADE--EAMRKIKQLEDQTHQLQKQ--VATH--KQEEEALLNEMEVTGQ-AFEDMQ 341
           E  KL +    A+  I +L D  H L+K    A H   + EE +LNE   TG+ A+    
Sbjct: 7   EWAKLDEAKAAALLAIPELADYRHYLEKVRRYAPHTLSEPEEKILNEKSPTGRGAWV--- 63

Query: 342 EQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIR 401
                    LR + D     + E      +  L R     +R++         A     +
Sbjct: 64  RLFDETTSSLRFEFDGEKLTLEE------ILALLRSPDREVRKK---------AAKALTK 108

Query: 402 KLEEKERFL----QTVLTNAEKELHLRN 425
            L++  R L     TV+ + + E  LR 
Sbjct: 109 GLKKNSRTLTLIYNTVVKDKDIEDRLRG 136


>gnl|CDD|227608 COG5283, COG5283, Phage-related tail protein [Function unknown].
          Length = 1213

 Score = 34.1 bits (78), Expect = 0.23
 Identities = 29/177 (16%), Positives = 64/177 (36%), Gaps = 18/177 (10%)

Query: 284 DSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQE- 342
             K     K + +++ +  K LE Q    +  ++  K + E L   ME   +A+ED+++ 
Sbjct: 20  AVKNINVLKSSIKDSTQFWKMLEKQQKLTKDGLSASKGKYEGLSEAMEKQKKAYEDLKQE 79

Query: 343 --QNSRLLQQLREKDDANFKLMTE-----RIKSNQL------HKLAREEKDTLREQVQAL 389
             + +R  Q  ++         T+     R  S Q         L ++E   L+  +  L
Sbjct: 80  VKEVNRATQASKKAYQEYNAQYTQAENKLRSLSGQFGVASEQLMLQQKEIQRLQYAISTL 139

Query: 390 S----LQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSA 442
           +     Q   +     K    +  +  +  +  ++    N+ ++  K+ A       
Sbjct: 140 NKSMAAQARLLEQTGNKFGTADAKVVGLRESFGRQTEALNKQLERTKKVADALTYVL 196


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
           functions are known are involvedin recombination,
           recombinational repair, and/or non-homologous end
           joining.They are components of an exonuclease complex
           with MRE11 homologs. This family is distantly related to
           the SbcC family of bacterial proteins.This family is
           based on the phylogenomic analysis of JA Eisen (1999,
           Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 33.9 bits (77), Expect = 0.23
 Identities = 38/172 (22%), Positives = 71/172 (41%), Gaps = 16/172 (9%)

Query: 284 DSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEV----TGQAFED 339
           D  ++  K++  E  + KI +L+++   L+ +    +++   L  +ME     T +   D
Sbjct: 248 DPLKNRLKEI--EHNLSKIMKLDNEIKALKSRKKQMEKDNSELELKMEKVFQGTDEQLND 305

Query: 340 MQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTA 399
           +   + R    +REK+        E  K N+  +L  +EK  L  +   L LQ +     
Sbjct: 306 LYHNHQR---TVREKERELVDCQRELEKLNKERRLLNQEKTELLVEQGRLQLQADRHQEH 362

Query: 400 IRKLEEKERFLQTVLT------NAEKELHLRNQAMDLNKRKAIESAQSAADL 445
           IR  +   + L T L           E  ++N    L   +  + A++AA L
Sbjct: 363 IRARDSLIQSLATRLELDGFERGPFSERQIKN-FHTLVIERQEDEAKTAAQL 413


>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
          Length = 343

 Score = 33.4 bits (77), Expect = 0.24
 Identities = 28/144 (19%), Positives = 59/144 (40%), Gaps = 18/144 (12%)

Query: 293 LADEEAMRKIK--QLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQ 350
           LAD  ++ +     L+D    L+  ++  + E   L   +     A    + +   L Q+
Sbjct: 65  LADLLSLERQGNQDLQDSVANLRASLSAAEAERSRLQALLAELAGAGAAAEGRAGELAQE 124

Query: 351 LREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFL 410
           L  +               Q+   A  + + L +Q+ AL  Q+ A+  A+   E+++R  
Sbjct: 125 LDSEK--------------QVSARALAQVELLNQQIAALRRQLAALEAALDASEKRDRES 170

Query: 411 QTVLTNAEKELH--LRNQAMDLNK 432
           Q  + +  + L+  L  +  +LN+
Sbjct: 171 QAKIADLGRRLNVALAQRVQELNR 194



 Score = 29.2 bits (66), Expect = 5.6
 Identities = 29/134 (21%), Positives = 50/134 (37%), Gaps = 22/134 (16%)

Query: 261 SLSDSASNIYGSLEFRKFACVVRDSKRDERKKL--ADEEAMRKIKQLEDQTHQLQKQVAT 318
            L DS +N+  SL           +   ER +L     E        E +  +L +++ +
Sbjct: 78  DLQDSVANLRASL----------SAAEAERSRLQALLAELAGAGAAAEGRAGELAQELDS 127

Query: 319 HKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLR--EKDDANFKLMTERIKSNQLHKLAR 376
            KQ     L ++E+  Q    ++ Q + L   L   EK D             ++  L R
Sbjct: 128 EKQVSARALAQVELLNQQIAALRRQLAALEAALDASEKRDRE--------SQAKIADLGR 179

Query: 377 EEKDTLREQVQALS 390
                L ++VQ L+
Sbjct: 180 RLNVALAQRVQELN 193


>gnl|CDD|197664 smart00338, BRLZ, basic region leucin zipper. 
          Length = 65

 Score = 30.6 bits (70), Expect = 0.25
 Identities = 18/44 (40%), Positives = 29/44 (65%)

Query: 289 ERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEV 332
           ERKK   EE  RK++QLE +  +L+K++   ++E E L +E+E 
Sbjct: 22  ERKKAEIEELERKVEQLEAENERLKKEIERLRRELEKLKSELEE 65



 Score = 28.7 bits (65), Expect = 0.96
 Identities = 12/61 (19%), Positives = 31/61 (50%), Gaps = 1/61 (1%)

Query: 287 RDERKKLADEEAMRK-IKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
           +  R++  + EA R+  ++ + +  +L+++V   + E E L  E+E   +  E ++ +  
Sbjct: 5   KRRRRRERNREAARRSRERKKAEIEELERKVEQLEAENERLKKEIERLRRELEKLKSELE 64

Query: 346 R 346
            
Sbjct: 65  E 65


>gnl|CDD|238345 cd00632, Prefoldin_beta, Prefoldin beta; Prefoldin is a hexameric
           molecular chaperone complex, composed of two
           evolutionarily related subunits (alpha and beta), which
           are found in both eukaryotes and archaea.  Prefoldin
           binds and stabilizes newly synthesized polypeptides
           allowing them to fold correctly.  The hexameric
           structure consists of a double beta barrel assembly with
           six protruding coiled-coils. The alpha prefoldin
           subunits have two beta hairpin structures while the beta
           prefoldin subunits (this CD) have only one hairpin that
           is most similar to the second hairpin of the alpha
           subunit. The prefoldin hexamer consists of two alpha and
           four beta subunits and is assembled from the beta
           hairpins of all six subunits. The alpha subunits
           initially dimerize providing a structural nucleus for
           the assembly of the beta subunits. In archaea, there is
           usually only one gene for each subunit while in
           eukaryotes there two or more paralogous genes encoding
           each subunit adding heterogeneity to the structure of
           the hexamer.
          Length = 105

 Score = 31.6 bits (72), Expect = 0.25
 Identities = 34/117 (29%), Positives = 58/117 (49%), Gaps = 23/117 (19%)

Query: 308 QTHQLQKQ---VATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDAN-FKLMT 363
           Q  QLQ+Q       +Q+ EA LNE     +A E+++          +  DDA  +KL+ 
Sbjct: 7   QLQQLQQQLQAYIVQRQKVEAQLNE---NKKALEELE----------KLADDAEVYKLV- 52

Query: 364 ERIKSNQLHKLAREE-KDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEK 419
                N L K  +EE +  L+E+++ + L+I+ +      L+EK + LQ  +  A+K
Sbjct: 53  ----GNVLVKQEKEEARTELKERLETIELRIKRLERQEEDLQEKLKELQEKIQQAQK 105


>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
           (CALCOCO1) like.  Proteins found in this family are
           similar to the coiled-coil transcriptional coactivator
           protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
           This protein binds to a highly conserved N-terminal
           domain of p160 coactivators, such as GRIP1, and thus
           enhances transcriptional activation by a number of
           nuclear receptors. CALCOCO1 has a central coiled-coil
           region with three leucine zipper motifs, which is
           required for its interaction with GRIP1 and may regulate
           the autonomous transcriptional activation activity of
           the C-terminal region.
          Length = 546

 Score = 33.8 bits (77), Expect = 0.26
 Identities = 44/184 (23%), Positives = 79/184 (42%), Gaps = 16/184 (8%)

Query: 281 VVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDM 340
           V     R E+ + A   +  K ++LE+Q  + Q    +   E  ALL +     Q   ++
Sbjct: 166 VSDMRSRIEQLETALRHSTEKTEELEEQHKEAQSSSESMSAERNALLAQRAENQQRILEL 225

Query: 341 QEQNSRLLQQLREKD-------DANFKLMTERIKSNQ-LHKLAREEKDTLREQVQALSLQ 392
           ++    L Q+ +E D       D   +L   + +  Q L K+  + +D   E++  L L+
Sbjct: 226 EQDIQTLTQKKQENDRVLEGTQDIEAELERMKGELKQRLKKMTIQRRDEETERID-LQLE 284

Query: 393 IEAMHTAIRKLEEKERFLQTVLTNAEKEL----HLRNQAMDLNKRKAIESAQSA---ADL 445
            E +H  +R L+E+    Q       +EL     LR+  +    +  +ESAQ +   + L
Sbjct: 285 NEQLHEDLRTLQERLESSQQKAGLLGEELASLGSLRDHTIAELHQSRLESAQMSLQLSQL 344

Query: 446 KLHL 449
            L L
Sbjct: 345 NLAL 348



 Score = 29.6 bits (66), Expect = 5.4
 Identities = 30/138 (21%), Positives = 57/138 (41%), Gaps = 5/138 (3%)

Query: 288 DERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRL 347
           DE   L  E+    +  +  +   LQ Q+   ++E   L+ +         DM+ +  +L
Sbjct: 117 DELVTLEKEDDGDDLLLVVPKATILQNQLEGCQRERNDLMKKFLALEGDVSDMRSRIEQL 176

Query: 348 LQQLREKDDANFKLMTERIKSNQLHKLAREEKDTL---REQVQALSLQIEA-MHTAIRKL 403
              LR   +   +L  +  ++    +    E++ L   R + Q   L++E  + T  +K 
Sbjct: 177 ETALRHSTEKTEELEEQHKEAQSSSESMSAERNALLAQRAENQQRILELEQDIQTLTQKK 236

Query: 404 EEKERFL-QTVLTNAEKE 420
           +E +R L  T    AE E
Sbjct: 237 QENDRVLEGTQDIEAELE 254



 Score = 29.2 bits (65), Expect = 6.6
 Identities = 25/115 (21%), Positives = 46/115 (40%), Gaps = 13/115 (11%)

Query: 304 QLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMT 363
           QL      L++  +   QE E L    E      + +  +  +L + L+E+     KL  
Sbjct: 340 QLSQLNLALKEGQSQWAQERETLRQSAEADKDRIQKLSAELLKLEEWLQEERSQREKLEV 399

Query: 364 E--------RIKSNQLHKLAREEKDTLR----EQVQALSLQIEAMHTAIRKLEEK 406
           E        R++ ++  +   E +  LR    E+ Q L  + + +   IR LE +
Sbjct: 400 ELGIEKDCNRVQLSENRRELSELRSALRVLQKEKEQ-LQEEKQELLDYIRVLELR 453


>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 1104

 Score = 33.8 bits (77), Expect = 0.26
 Identities = 24/155 (15%), Positives = 57/155 (36%), Gaps = 12/155 (7%)

Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
           ++D+R+KL D    R     + +   L++ V      E+  + ++      +  +QEQ +
Sbjct: 595 EKDDRRKLGDRSTYRLGSTNDAKVETLRETVKAMLSREDFYMIKIMRQQGEYIKLQEQ-A 653

Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
             L  ++  + A+  L +         +    E     E++         +  A   L+ 
Sbjct: 654 NALAHIQALNFASIDLPSA--------QRQIAELQARLERLTHTQ---SDIAIAKAALDA 702

Query: 406 KERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQ 440
            +   + +    ++E+          KR A+ S +
Sbjct: 703 AQTRQKVLERQYQQEVTECAGLKKDLKRAAMLSRK 737



 Score = 30.4 bits (68), Expect = 3.0
 Identities = 35/176 (19%), Positives = 69/176 (39%), Gaps = 22/176 (12%)

Query: 289 ERKKLADEEAMRKIKQLE---DQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
           E      E+  R+I  L+      ++ +K   + K    AL   +   G   +  +EQ +
Sbjct: 245 EGAYEQVEDIKRQIHTLDPLVQLKNRREKAQQS-KDHANALKKALPTVGNRIKK-EEQET 302

Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLRE----QVQALSLQIEAMHTAIR 401
            + Q   E+  A  K+ + +I++++  ++     D +++    Q   LS + E      R
Sbjct: 303 LVRQFTVEQTQAKSKVESAKIETDRAREMETLAHDNVKQIVGAQHGILSAKREGAVDKRR 362

Query: 402 KLEEKERFLQTVLT--------NAEKELHLRNQAM----DLN-KRKAIESAQSAAD 444
            +      L  ++         +AE+ L L N A     +    R+A+ESA     
Sbjct: 363 TISTARAGLDALVKGLGGAAPESAEELLELNNAARLTVDEYPAAREALESAGQRNV 418


>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
           interacts with SMC proteins [Cell division and
           chromosome partitioning].
          Length = 622

 Score = 33.8 bits (77), Expect = 0.26
 Identities = 21/116 (18%), Positives = 55/116 (47%), Gaps = 6/116 (5%)

Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
           +    K + L+  +++ +  V   KQ+ +    ++E      E  +E+    ++ L+   
Sbjct: 298 KTLREKWRALKSDSNKYENYVNAMKQKSQEWPGKLEKLKSEIELKEEE----IKALQSNI 353

Query: 356 DANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAI--RKLEEKERF 409
           D   K + ++  S +  +L  +E++ L  ++  +++Q + +  ++  RKLE +  F
Sbjct: 354 DELHKQLRKQGISTEQFELMNQEREKLTRELDKINIQSDKLTKSVKSRKLEAQGIF 409



 Score = 29.6 bits (66), Expect = 5.6
 Identities = 31/137 (22%), Positives = 59/137 (43%), Gaps = 10/137 (7%)

Query: 295 DEEAMRKIK-QLEDQTHQLQKQVATHKQEEEALLN---EMEVTGQAFEDMQEQNSRLLQQ 350
            E + +++K   E   H +   +A  K + + L     E     Q  + ++E+  R L+ 
Sbjct: 251 YEPSEQELKLGFEKFVHIINTDIANLKTQNDNLYEKIQEAMKISQKIKTLREK-WRALKS 309

Query: 351 LREKDDANFKLMTERIKS--NQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKER 408
              K +     M ++ +    +L KL + E +   E+++AL   I+ +H  +RK  +   
Sbjct: 310 DSNKYENYVNAMKQKSQEWPGKLEKL-KSEIELKEEEIKALQSNIDELHKQLRK--QGIS 366

Query: 409 FLQTVLTNAEKELHLRN 425
             Q  L N E+E   R 
Sbjct: 367 TEQFELMNQEREKLTRE 383


>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA.  During
           the bacterial cell cycle, the tubulin-like cell-division
           protein FtsZ polymerises into a ring structure that
           establishes the location of the nascent division site.
           EzrA modulates the frequency and position of FtsZ ring
           formation.
          Length = 559

 Score = 33.7 bits (78), Expect = 0.26
 Identities = 26/140 (18%), Positives = 61/140 (43%), Gaps = 19/140 (13%)

Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVT--GQAFEDMQEQNSRLLQQ--- 350
            EA   + +LE++T  L++++    +E   LL E++     Q  E+++     + ++   
Sbjct: 190 LEAREVLLKLEEETDALEQKM----EEIPPLLKELQNEFPDQ-LEELKAGYREMTEEGYH 244

Query: 351 LREKD-DANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMH-------TAIRK 402
               D +   + + E+I  N L  L   + D   E+ + +  +I+ ++        A + 
Sbjct: 245 FDHLDIEKELQDLKEQIDQN-LALLEELDLDEAEEENEEIEERIDTLYDILEKEVKAKKF 303

Query: 403 LEEKERFLQTVLTNAEKELH 422
           +E+    L   L +A ++  
Sbjct: 304 VEKNIDKLTDFLEHAREQNK 323



 Score = 33.7 bits (78), Expect = 0.27
 Identities = 29/128 (22%), Positives = 60/128 (46%), Gaps = 24/128 (18%)

Query: 300 RKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRL---LQQLREKDD 356
           +++++LE Q  QL +++A  K     L  E+E   +  E+++++   L   LQ LR KD+
Sbjct: 351 KQLEELEAQYDQLVERIAEKKVAYSELQEELEEIEKQLEEIEKEQEELSESLQGLR-KDE 409

Query: 357 ANFKLMTERIKSNQLHKLAR---------------EEKDTLREQVQALSLQIEA----MH 397
              +   +  +  +LH++ R               E    + ++++AL+ ++      M 
Sbjct: 410 LEAREKLQEYR-QKLHEIKRYLEKSNLPGLPEDYLEYFFVVSDEIEALADELNEVPINMD 468

Query: 398 TAIRKLEE 405
              R+LEE
Sbjct: 469 AVNRQLEE 476



 Score = 33.3 bits (77), Expect = 0.37
 Identities = 31/167 (18%), Positives = 68/167 (40%), Gaps = 39/167 (23%)

Query: 294 ADEEAMRKIKQLEDQTHQLQKQVATHK-------QEEEALLNEMEVTGQAFEDMQE---- 342
           ++E+  +++++L+D+  +L+K +            E E  L+E+E   + F ++ E    
Sbjct: 130 SEEKNRKEVEELKDKYRELRKTLLAKSFSYGPALDELEKQLDELEEEFEQFVELTESGDY 189

Query: 343 -QNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALS----------- 390
            +   +L +L E+ DA  + M E I       L +E ++   +Q++ L            
Sbjct: 190 LEAREVLLKLEEETDALEQKM-EEIP-----PLLKELQNEFPDQLEELKAGYREMTEEGY 243

Query: 391 --------LQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMD 429
                    +++ +   I +       L   L  AE+E     + +D
Sbjct: 244 HFDHLDIEKELQDLKEQIDQNLALLEELD--LDEAEEENEEIEERID 288



 Score = 31.4 bits (72), Expect = 1.4
 Identities = 30/152 (19%), Positives = 62/152 (40%), Gaps = 29/152 (19%)

Query: 296 EEAMRKIKQLEDQTH-QLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQ----- 349
           EE   +I++  D  +  L+K+V   K  E+     ++      E  +EQN +LL      
Sbjct: 277 EEENEEIEERIDTLYDILEKEVKAKKFVEK----NIDKLTDFLEHAREQNKQLLLELDRL 332

Query: 350 ----QLREKD--------------DANFKLMTERIKSN-QLHKLAREEKDTLREQVQALS 390
                L E +              +A +  + ERI      +   +EE + + +Q++ + 
Sbjct: 333 QQSYTLNEDELETVRELEKQLEELEAQYDQLVERIAEKKVAYSELQEELEEIEKQLEEIE 392

Query: 391 LQIEAMHTAIRKLEEKERFLQTVLTNAEKELH 422
            + E +  +++ L + E   +  L    ++LH
Sbjct: 393 KEQEELSESLQGLRKDELEAREKLQEYRQKLH 424


>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria, archaea and eukaryotes.
           This domain is about 200 amino acids in length. This
           domain is found associated with pfam00013, pfam01966.
           This domain has a single completely conserved residue A
           that may be functionally important.
          Length = 201

 Score = 32.9 bits (76), Expect = 0.26
 Identities = 32/127 (25%), Positives = 62/127 (48%), Gaps = 30/127 (23%)

Query: 287 RDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLN---EMEVTGQAFEDMQEQ 343
           ++ R +L  +E     K+L  +   L ++  + +++EE+L     E+    Q  E+ +E+
Sbjct: 74  KERRNELQRQE-----KRLLQKEETLDRKDESLEKKEESLEEKEKELAARQQQLEEKEEE 128

Query: 344 NSRLLQQLREKDDANFKLMTERIKSNQLHKLAREE-KDTLREQVQALSLQIEAMHTA--- 399
              L+++ +++         ERI S     L  EE K+ L E+V+A     E  H A   
Sbjct: 129 LEELIEEQQQE--------LERI-SG----LTAEEAKEILLEEVEA-----ELKHEAAKM 170

Query: 400 IRKLEEK 406
           I+++EE+
Sbjct: 171 IKEIEEE 177


>gnl|CDD|219189 pfam06818, Fez1, Fez1.  This family represents the eukaryotic Fez1
           protein. Fez1 contains a leucine-zipper region with
           similarity to the DNA-binding domain of the
           cAMP-responsive activating-transcription factor 5. There
           is evidence that Fez1 inhibits cancer cell growth
           through regulation of mitosis, and that its alterations
           result in abnormal cell growth. Note that some family
           members contain more than one copy of this region.
          Length = 200

 Score = 32.6 bits (74), Expect = 0.29
 Identities = 31/137 (22%), Positives = 56/137 (40%), Gaps = 6/137 (4%)

Query: 297 EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDD 356
           EA  ++++ E+Q  +LQ  + T   E E   NE++      E ++E+  RL  +L E  +
Sbjct: 42  EARAQLRESEEQLQELQDALRTKTLELEVCENELQRKKSEAELLREKAGRLEAELAELRE 101

Query: 357 ANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAI-----RKLEEKERFLQ 411
           A                 A+ ++      ++ L  Q+E +   +     R+ E+   F Q
Sbjct: 102 ALASCPAPLPVLLTESDEAKAQRQAGEAGLRELRRQVERLRAELRYERRRREEQASSFEQ 161

Query: 412 TVLT-NAEKELHLRNQA 427
                  EKE  +R Q 
Sbjct: 162 ERRVWQEEKEKVIRYQK 178


>gnl|CDD|185747 cd09234, V_HD-PTP_like, Protein-interacting V-domain of mammalian
           His-Domain type N23 protein tyrosine phosphatase and
           related domains.  This family contains the V-shaped (V)
           domain of mammalian His-Domain type N23 protein tyrosine
           phosphatase (HD-PTP, also known as PTPN23) and related
           domains. It belongs to the V_Alix_like superfamily which
           includes the V domains of  Bro1 and Rim20 (also known as
           PalA) from Saccharomyces cerevisiae, mammalian Alix
           (apoptosis-linked gene-2 interacting protein X/ also
           known as apoptosis-linked gene-2 interacting protein 1,
           AIP1), and related domains. HD_PTP interacts with the
           ESCRT (Endosomal Sorting Complexes Required for
           Transport) system, and participates in cell migration
           and endosomal trafficking. The related Alix V-domain
           (belonging to a different family in this superfamily)
           contains a binding site, partially conserved in the
           superfamily, for the retroviral late assembly (L) domain
           YPXnL motif. The Alix V-domain is also a dimerization
           domain. In addition to the V-domain, HD_PTP also has an
           N-terminal Bro1-like domain, a proline-rich region
           (PRR), a catalytically inactive tyrosine phosphatase
           domain, and a region containing a PEST motif. Bro1-like
           domains bind components of the ESCRT-III complex,
           specifically to CHMP4 in the case of HD-PTP. The
           Bro1-like domain of HD-PTP can also bind human
           immunodeficiency virus type 1 (HIV-1) nucleocapsid.
           HD-PTP is encoded by the PTPN23 gene, a tumor suppressor
           gene candidate frequently absent in human kidney,
           breast, lung, and cervical tumors. This family also
           contains Drosophila Myopic, which promotes epidermal
           growth factor receptor (EGFR) signaling, and
           Caenorhabditis elegans (enhancer of glp-1) EGO-2 which
           promotes Notch signaling.
          Length = 337

 Score = 33.0 bits (76), Expect = 0.30
 Identities = 32/158 (20%), Positives = 56/158 (35%), Gaps = 43/158 (27%)

Query: 305 LEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLRE---KDDANFKL 361
           L++   +L       + E+EA+  E++       +M++Q   L QQLR+   +DD   KL
Sbjct: 168 LDELQKKLPSPSLLDRPEDEAIEKELKRILNKVNEMRKQRRSLEQQLRDAIHEDDITSKL 227

Query: 362 MT------ERIKSNQLHK---------------------------------LAREEKDTL 382
           +T      E +   +L K                                  A  E    
Sbjct: 228 VTTTGGDMEDLFKEELKKHDQLVNLIEQNLAAQENILKALTEANAKYAPVRKALSETKQK 287

Query: 383 RE-QVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEK 419
           RE  + +L    EA    ++K ++   F + +  N  K
Sbjct: 288 RESTISSLIASYEAYEDLLKKSQKGIDFYKKLEGNVSK 325


>gnl|CDD|114219 pfam05483, SCP-1, Synaptonemal complex protein 1 (SCP-1).
           Synaptonemal complex protein 1 (SCP-1) is the major
           component of the transverse filaments of the
           synaptonemal complex. Synaptonemal complexes are
           structures that are formed between homologous
           chromosomes during meiotic prophase.
          Length = 787

 Score = 33.5 bits (76), Expect = 0.32
 Identities = 39/156 (25%), Positives = 67/156 (42%), Gaps = 7/156 (4%)

Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
           K  E  K  ++E +  ++  E + H L+ Q+   K  EE  L E+E      E  + +N 
Sbjct: 429 KIAEELKGKEQELIFLLQAREKEIHDLEIQLTAIKTSEEHYLKEVEDLKTELEKEKLKNI 488

Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
            L     +    N +L  E   S+   +L + ++D +  + Q      E M   I  LEE
Sbjct: 489 ELTAHCDKLLLENKELTQE--ASDMTLELKKHQEDIINCKKQE-----ERMLKQIENLEE 541

Query: 406 KERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQS 441
           KE  L+  L +  +E   +   +     K+ E+A+S
Sbjct: 542 KEMNLRDELESVREEFIQKGDEVKCKLDKSEENARS 577



 Score = 33.2 bits (75), Expect = 0.42
 Identities = 35/151 (23%), Positives = 66/151 (43%), Gaps = 15/151 (9%)

Query: 287 RDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSR 346
           +DE  K   E+     K+LED    LQ+ ++T K  EE    ++++  +    + E+   
Sbjct: 280 QDENLKELIEKKDHLTKELEDIKMSLQRSMSTQKALEE----DLQIATKTICQLTEEKEA 335

Query: 347 LLQQLREKDDANFKLMTE-RIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
            +++L +   A+  ++TE    +  L +L R E+  L +    L +    +     +LEE
Sbjct: 336 QMEELNKAKAAHSFVVTEFEATTCSLEELLRTEQQRLEKNEDQLKIITMELQKKSSELEE 395

Query: 406 KERF----------LQTVLTNAEKELHLRNQ 426
             +F          L+ +L   EK L  + Q
Sbjct: 396 MTKFKNNKEVELEELKKILAEDEKLLDEKKQ 426



 Score = 32.8 bits (74), Expect = 0.58
 Identities = 42/182 (23%), Positives = 82/182 (45%), Gaps = 23/182 (12%)

Query: 274 EFRKFACVVRDSKRDERKKLADEEAMRKI--KQLEDQTHQLQ-------------KQVAT 318
           EF    C + +  R E+++L   E   KI   +L+ ++ +L+             +++  
Sbjct: 353 EFEATTCSLEELLRTEQQRLEKNEDQLKIITMELQKKSSELEEMTKFKNNKEVELEELKK 412

Query: 319 HKQEEEALLNEMEVTGQAFEDMQ--EQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAR 376
              E+E LL+E +   +  E+++  EQ    L Q REK+  + ++    IK+++ H L  
Sbjct: 413 ILAEDEKLLDEKKQFEKIAEELKGKEQELIFLLQAREKEIHDLEIQLTAIKTSEEHYLKE 472

Query: 377 -EEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKA 435
            E+  T  E+ +  ++++ A H     LE KE   +     ++  L L+    D+   K 
Sbjct: 473 VEDLKTELEKEKLKNIELTA-HCDKLLLENKELTQEA----SDMTLELKKHQEDIINCKK 527

Query: 436 IE 437
            E
Sbjct: 528 QE 529



 Score = 30.5 bits (68), Expect = 2.5
 Identities = 37/168 (22%), Positives = 86/168 (51%), Gaps = 14/168 (8%)

Query: 272 SLEFRKFACVVRDSKRDERKKLADEEAMRKI----KQLEDQTHQLQKQVATHKQEEEALL 327
           ++E +K +  + +  + +  K  + E ++KI    ++L D+  Q +K     K +E+ L+
Sbjct: 383 TMELQKKSSELEEMTKFKNNKEVELEELKKILAEDEKLLDEKKQFEKIAEELKGKEQELI 442

Query: 328 NEMEVTGQAFEDMQEQNSRLLQQ----LREKDDANFKLMTERIKSNQL----HKLAREEK 379
             ++   +   D++ Q + +       L+E +D   +L  E++K+ +L     KL  E K
Sbjct: 443 FLLQAREKEIHDLEIQLTAIKTSEEHYLKEVEDLKTELEKEKLKNIELTAHCDKLLLENK 502

Query: 380 DTLREQVQALSLQIEAMHTAIRKLE-EKERFLQTVLTNAEKELHLRNQ 426
           + L ++   ++L+++     I   + ++ER L+ +    EKE++LR++
Sbjct: 503 E-LTQEASDMTLELKKHQEDIINCKKQEERMLKQIENLEEKEMNLRDE 549



 Score = 30.1 bits (67), Expect = 3.8
 Identities = 37/131 (28%), Positives = 61/131 (46%), Gaps = 12/131 (9%)

Query: 320 KQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD----DANFKLMTERIKSNQLH--- 372
           K++ E + +  E   +   D ++Q S LL Q+ EK+    D  F L   R K+NQL    
Sbjct: 218 KEDHEKIQHLEEEYKKEINDKEKQVSLLLIQITEKENKMKDLTFLLEESRDKANQLEEKT 277

Query: 373 KLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHL-----RNQA 427
           KL  E    L E+   L+ ++E +  ++++    ++ L+  L  A K +         Q 
Sbjct: 278 KLQDENLKELIEKKDHLTKELEDIKMSLQRSMSTQKALEEDLQIATKTICQLTEEKEAQM 337

Query: 428 MDLNKRKAIES 438
            +LNK KA  S
Sbjct: 338 EELNKAKAAHS 348


>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein.  Members of this
           protein family belong to a conserved gene four-gene
           neighborhood found sporadically in a phylogenetically
           broad range of bacteria: Nocardia farcinica,
           Symbiobacterium thermophilum, and Streptomyces
           avermitilis (Actinobacteria), Geobacillus kaustophilus
           (Firmicutes), Azoarcus sp. EbN1 and Ralstonia
           solanacearum (Betaproteobacteria). Proteins in this
           family average over 1400 amino acids in length
           [Hypothetical proteins, Conserved].
          Length = 1353

 Score = 33.6 bits (77), Expect = 0.32
 Identities = 27/170 (15%), Positives = 64/170 (37%), Gaps = 13/170 (7%)

Query: 289 ERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEE--ALLNEMEVTGQAFEDMQEQNSR 346
           E  +     A RK         Q ++++     + +     + +E  G A +   +    
Sbjct: 803 ESAERELARAARKAAAAAAAWKQARRELERDAADLDLPTDPDALEAVGLALKRFGDHLHT 862

Query: 347 LLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTL---REQVQALSLQIEAMHTAI--- 400
           L   +RE   A  +   +R ++ +    ARE  +     R + +  SL++  +  ++   
Sbjct: 863 LEVAVRELRHAATRAAEQRARAARAESDAREAAEDAAEARAEAEEASLRLRTLEESVGAM 922

Query: 401 -RKLEEKERFLQTVLTNAEKELHLRNQAM----DLNKRKAIESAQSAADL 445
             ++  +    +  L +  +EL    +A+    +   R   + A++ A L
Sbjct: 923 VDEIRARLAETRAALASGGRELPRLAEALATAEEARGRAEEKRAEADATL 972



 Score = 32.9 bits (75), Expect = 0.48
 Identities = 29/179 (16%), Positives = 54/179 (30%), Gaps = 19/179 (10%)

Query: 282 VRDSKRDERKKLADEEAMRKIKQLE---DQTHQLQKQVATHKQ---EEEALLNE----ME 331
             D+ R   + L    A +  ++LE        LQ   A  +Q   E E+ L E    ++
Sbjct: 312 EADALRTRLEALQGSPAYQDAEELERARADAEALQAAAADARQAIREAESRLEEERRRLD 371

Query: 332 VTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEK------DTLREQ 385
                 +D + +     +QL     A    ++    +     LA +E       D  R+ 
Sbjct: 372 EEAGRLDDAERELRAAREQLAR--AAERAGLSPAHTAEPDAALAAQELQELGALDARRQD 429

Query: 386 VQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAAD 444
              +  Q       +R+ ++     +     A +                 E A  A  
Sbjct: 430 ADRVIAQRSEQVALLRRRDDVADRAEATHAAA-RARRDELDEEAEQAAARAELADEAVH 487


>gnl|CDD|183473 PRK12361, PRK12361, hypothetical protein; Provisional.
          Length = 547

 Score = 33.4 bits (77), Expect = 0.34
 Identities = 27/100 (27%), Positives = 45/100 (45%), Gaps = 15/100 (15%)

Query: 291 KKLADEEAMRKIKQLEDQTHQLQK-QVAT-HKQEEEALLNEMEVT---------GQAFED 339
             L  EE +++IKQ+  +T +L K Q+    K  E+  LN  +           G  +++
Sbjct: 202 PDLTVEEVLQQIKQIR-KTARLNKRQLRALEKMLEQGKLNIHKRAWLIANPVSGGGKWQE 260

Query: 340 MQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEK 379
             EQ   + ++L+   D   KL T  I +  L K AR+  
Sbjct: 261 YGEQ---IQRELKAYFDLTVKLTTPEISAEALAKQARKAG 297


>gnl|CDD|234669 PRK00153, PRK00153, hypothetical protein; Validated.
          Length = 104

 Score = 30.9 bits (71), Expect = 0.36
 Identities = 14/38 (36%), Positives = 22/38 (57%), Gaps = 4/38 (10%)

Query: 299 MRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQA 336
           M  ++ L  Q  Q+Q+++    QEE   L +MEV G+A
Sbjct: 2   MGNMQNLMKQAQQMQEKMQK-MQEE---LAQMEVEGEA 35


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 33.0 bits (76), Expect = 0.36
 Identities = 16/109 (14%), Positives = 45/109 (41%), Gaps = 18/109 (16%)

Query: 302 IKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKL 361
           I Q +      +K +   + + EA   E     +   + Q++  ++++        + K 
Sbjct: 192 ILQTDQALTAKEKAIEAERAKAEAAEAEQ----ELLREKQKEEEQMMEAQERSYQEHVKQ 247

Query: 362 MTERIKSNQ-----------LHKLAREE---KDTLREQVQALSLQIEAM 396
           + E++++ +            HKL  +E   K+  + + ++L  +I+ +
Sbjct: 248 LIEKMEAEREKLLAEQERMLEHKLQEQEELLKEGFKTEAESLQKEIQDL 296


>gnl|CDD|115307 pfam06637, PV-1, PV-1 protein (PLVAP).  This family consists of
           several PV-1 (PLVAP) proteins which seem to be specific
           to mammals. PV-1 is a novel protein component of the
           endothelial fenestral and stomatal diaphragms. The
           function of this family is unknown.
          Length = 442

 Score = 33.1 bits (75), Expect = 0.37
 Identities = 30/124 (24%), Positives = 51/124 (41%), Gaps = 24/124 (19%)

Query: 316 VATHKQEEEA--LLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTE--------R 365
           + + K EE A  L  ++E   +   D+Q Q     Q  +   +A  K   E        +
Sbjct: 278 IMSSKIEELARGLRADIERVARENADLQRQKLEAEQAAQAAQEAKAKAEKEAQAREAKLQ 337

Query: 366 IKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRN 425
            +  +  +LA EEK  LR +   L+          ++LE K+R  + +      EL +R 
Sbjct: 338 AECARQTQLALEEKAALRAERDNLA----------KELEAKKREAEQLRM----ELAIRI 383

Query: 426 QAMD 429
            A+D
Sbjct: 384 SALD 387


>gnl|CDD|144876 pfam01442, Apolipoprotein, Apolipoprotein A1/A4/E domain.  These
           proteins contain several 22 residue repeats which form a
           pair of alpha helices. This family includes:
           Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein
           E.
          Length = 191

 Score = 32.2 bits (74), Expect = 0.38
 Identities = 30/169 (17%), Positives = 67/169 (39%), Gaps = 34/169 (20%)

Query: 303 KQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLM 362
           +QL     +   +++   +E EAL  E+    +  ED++ +    L +L+ K   N + +
Sbjct: 19  EQLGPYAQEFWARLS---KETEALREEL---QKDLEDVRARLQPYLDELKAKVGQNLEEL 72

Query: 363 TERIKS--NQLHKLAREEKDTLREQVQALSLQIEAMHTAIR------------------- 401
            +R+     +L K    + + LR ++   +   E +   +R                   
Sbjct: 73  RQRLAPYAEELRKRLNRDAEELRRKLAPYA---EELRDRLRQNVEALRARLGPYVEELRQ 129

Query: 402 KLEEKERFLQTVLTNAEKELHLR-NQAMDLNKRKAIESAQSAADLKLHL 449
           KL E+   L+  +    +E   + ++ ++  + K    A+   DL+  L
Sbjct: 130 KLAERLEELKESVGPYAEEYKAQLSEQVEELREKLEPYAE---DLREKL 175



 Score = 29.5 bits (67), Expect = 3.1
 Identities = 31/133 (23%), Positives = 58/133 (43%), Gaps = 27/133 (20%)

Query: 286 KRDE-RKKLADE-EAMRK---------IKQLEDQTHQLQKQVATHKQEEEALLNEMEVTG 334
             DE + K+    E +R+          K+L     +L++++A + +E    L    V  
Sbjct: 57  YLDELKAKVGQNLEELRQRLAPYAEELRKRLNRDAEELRRKLAPYAEELRDRL-RQNV-- 113

Query: 335 QAFEDMQEQNSRLLQQLREKDDANFKLMTERIK--SNQLHKLAREEKDTLREQVQALSLQ 392
              E ++ +    +++LR+K       + ER++     +   A E K  L EQV+ L  +
Sbjct: 114 ---EALRARLGPYVEELRQK-------LAERLEELKESVGPYAEEYKAQLSEQVEELREK 163

Query: 393 IEAMHTAIR-KLE 404
           +E     +R KLE
Sbjct: 164 LEPYAEDLREKLE 176


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 31.8 bits (73), Expect = 0.39
 Identities = 19/70 (27%), Positives = 37/70 (52%), Gaps = 1/70 (1%)

Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEE-EALLNEMEVTGQAFEDMQEQN 344
           + ++  K    E  +K K+L+ +  +LQKQ AT  +E  +A   E++   Q  +  Q+  
Sbjct: 39  QLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQKQQAA 98

Query: 345 SRLLQQLREK 354
            + LQQ +++
Sbjct: 99  QQELQQKQQE 108



 Score = 29.1 bits (66), Expect = 3.2
 Identities = 12/86 (13%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 324 EALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLR 383
           + +L+E      A + ++++  +L  +L++K+    +   ++++      L+ E +   +
Sbjct: 24  QKVLSESPAGKAAQKQLEKEFKKLQAELQKKEKE-LQKEEQKLQKQAA-TLSEEARKAKQ 81

Query: 384 EQVQALSLQIE-AMHTAIRKLEEKER 408
           +++Q    +++     A ++L++K++
Sbjct: 82  QELQQKQQELQQKQQAAQQELQQKQQ 107


>gnl|CDD|218434 pfam05103, DivIVA, DivIVA protein.  The Bacillus subtilis divIVA1
           mutation causes misplacement of the septum during cell
           division, resulting in the formation of small, circular,
           anucleate mini-cells. Inactivation of divIVA produces a
           mini-cell phenotype, whereas overproduction of DivIVA
           results in a filamentation phenotype. These proteins
           appear to contain coiled-coils.
          Length = 131

 Score = 31.5 bits (72), Expect = 0.39
 Identities = 8/47 (17%), Positives = 28/47 (59%)

Query: 300 RKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSR 346
           ++ ++L+++  +L++++  +K+ EE L   + V  +  E+++    +
Sbjct: 39  KENEELKEEIERLEEKLEEYKELEETLQKTLVVAQETAEEVKANAQK 85


>gnl|CDD|180240 PRK05759, PRK05759, F0F1 ATP synthase subunit B; Validated.
          Length = 156

 Score = 31.7 bits (73), Expect = 0.39
 Identities = 27/110 (24%), Positives = 51/110 (46%), Gaps = 16/110 (14%)

Query: 288 DER-KKLAD--EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAF-EDMQEQ 343
           +ER KK+AD    A R  K+LE    + + Q+A  + E   ++ + +       E+ + +
Sbjct: 34  EERQKKIADGLAAAERAKKELELAQAKYEAQLAEARAEAAEIIEQAKKRAAQIIEEAKAE 93

Query: 344 NSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQI 393
                 +++ +  A  ++  ER +       AREE   LR+QV  L++  
Sbjct: 94  AEAEAARIKAQ--AQAEIEQERKR-------AREE---LRKQVADLAVAG 131


>gnl|CDD|217843 pfam04012, PspA_IM30, PspA/IM30 family.  This family includes PspA
           a protein that suppresses sigma54-dependent
           transcription. The PspA protein, a negative regulator of
           the Escherichia coli phage shock psp operon, is produced
           when virulence factors are exported through secretins in
           many Gram-negative pathogenic bacteria and its homologue
           in plants, VIPP1, plays a critical role in thylakoid
           biogenesis, essential for photosynthesis. Activation of
           transcription by the enhancer-dependent bacterial
           sigma(54) containing RNA polymerase occurs through ATP
           hydrolysis-driven protein conformational changes enabled
           by activator proteins that belong to the large AAA(+)
           mechanochemical protein family. It has been shown that
           PspA directly and specifically acts upon and binds to
           the AAA(+) domain of the PspF transcription activator.
          Length = 220

 Score = 32.4 bits (74), Expect = 0.39
 Identities = 28/184 (15%), Positives = 62/184 (33%), Gaps = 35/184 (19%)

Query: 275 FRKFACVVRDSKRDERKKLADEEAM--RKIKQLEDQ----THQLQKQVATHKQEEEALLN 328
           F++   ++R +  +   K  D E M  + I+ ++ +       L + +A  KQ E  L  
Sbjct: 3   FKRLLRLLRANIHEGLDKAEDPEKMLEQAIRDMQSELGKARQALAQVIARQKQLERKL-- 60

Query: 329 EMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKS--------NQLHKLAREEKD 380
                 +  E  ++  ++    L + ++   +     I +               R   +
Sbjct: 61  -----EEQKEQAKKLENKARAALTKGNEELAREALAEIATLEKQAEALETQLTQQRSAVE 115

Query: 381 TLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQ 440
            LR+Q+ AL            K+++ +     +    +           +N      S +
Sbjct: 116 QLRKQLAALE----------TKIQQLKAKKTALKARLKAAKA----QEAVNTSLGSASTE 161

Query: 441 SAAD 444
           SA D
Sbjct: 162 SATD 165


>gnl|CDD|177136 MTH00064, ND6, NADH dehydrogenase subunit 6; Provisional.
          Length = 151

 Score = 31.8 bits (73), Expect = 0.40
 Identities = 34/156 (21%), Positives = 56/156 (35%), Gaps = 32/156 (20%)

Query: 123 LSSTVFVYHCFSI-FRFSSSTVFVYHCFSIF--RFSSSTVFVYHCFSIFRFS-SSTVF-- 176
           L     +Y    + F FSS  V  Y+C  +      SS +    C+ ++ FS  S +F  
Sbjct: 3   LEVLFSLYFIVLVLFSFSSHPV--YYCVLLVVNSLISSLI----CYLVYGFSWYSLLFCL 56

Query: 177 -----VYHCF----------SIFRFSS-STVFVYHCFSIFRFSSSTVFVYHC---FSIFR 217
                VY  F          S   +S+   + V+  F         VF       FS + 
Sbjct: 57  VYVGGVYILFIFVSVFSPNSSFVSYSNLGELGVFFFFFFGLLLCVLVFYSLVNVEFSFYL 116

Query: 218 FSSSTIFVYHCFDIFSLTSTLFVSICTSAFQMSFYK 253
            +SS    Y    +  +   + +S   S  + S+Y+
Sbjct: 117 CTSSEGVFYLFLCLVLMFGFIVLSFIVS-SKTSYYR 151


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 33.0 bits (75), Expect = 0.40
 Identities = 30/146 (20%), Positives = 66/146 (45%), Gaps = 10/146 (6%)

Query: 295 DEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQL--R 352
           + E   +I  L      L+  +   K+E + L  ++     A    ++++ + +Q +  R
Sbjct: 429 ESELRNQISLLTSLERSLKSDLGQLKKENDMLQTKLNSMVSA----KQKDKQSMQSMEKR 484

Query: 353 EKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIE---AMHTAIRKLEEKERF 409
            K +A+ ++  E+  + +  +   EE+   R   QA + + E   ++  A + LE + + 
Sbjct: 485 LKSEADSRVNAEKQLAEEKKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKK 544

Query: 410 LQTVLTNAEKE-LHLRNQAMDLNKRK 434
           L+  L   E+E   L  +A +L K +
Sbjct: 545 LEHDLKLKEEECRMLEKEAQELRKYQ 570


>gnl|CDD|222426 pfam13870, DUF4201, Domain of unknown function (DUF4201).  This is
           a family of coiled-coil proteins from eukaryotes. The
           function is not known.
          Length = 177

 Score = 31.8 bits (73), Expect = 0.43
 Identities = 20/120 (16%), Positives = 54/120 (45%), Gaps = 23/120 (19%)

Query: 340 MQEQNSRLLQQLREKDDANFKL--------------MTERI--KSNQLHKLAREEKDTL- 382
           ++   +++ ++L +K++    L              + E+I  ++ +L +L  +  +T+ 
Sbjct: 18  LKHTLAKIQEKLEQKEELGEGLHLIDFEQLQIENQALNEKIEERNKELKRLKLKYTNTVH 77

Query: 383 -----REQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKEL-HLRNQAMDLNKRKAI 436
                +E++  L  ++E +   +++ EE+   L+  L   + E   LR Q   L ++  +
Sbjct: 78  ILTHLKEKLHMLEEELERLKKELKEKEERLAKLRKELYKVKLERDKLRKQNKKLRQQGGL 137


>gnl|CDD|148617 pfam07111, HCR, Alpha helical coiled-coil rod protein (HCR).  This
           family consists of several mammalian alpha helical
           coiled-coil rod HCR proteins. The function of HCR is
           unknown but it has been implicated in psoriasis in
           humans and is thought to affect keratinocyte
           proliferation.
          Length = 739

 Score = 33.1 bits (75), Expect = 0.43
 Identities = 30/135 (22%), Positives = 60/135 (44%), Gaps = 21/135 (15%)

Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQE 342
           R+   +ER++LA+       +QLE +  + Q+ +A+   + EA        GQ  ++  E
Sbjct: 509 REQGEEERQRLAEVA-----QQLERELQESQESLASAGAQLEAAR-----AGQ--QESTE 556

Query: 343 QNSRLLQQLREKDDANFKLMTERIKS------NQLHKLAREEKDTLREQVQALSLQIEAM 396
           + + L Q+L ++ +   + + E++         QL    R   +  RE  +A+    +  
Sbjct: 557 EAASLRQELTQQQEVYGQALQEKVAEVETRLREQLSDTERRLNEARREHAKAVVSLRQIQ 616

Query: 397 HTAIRKLEEKERFLQ 411
             A +   EKER  +
Sbjct: 617 RQAAQ---EKERNQE 628



 Score = 33.1 bits (75), Expect = 0.45
 Identities = 26/118 (22%), Positives = 52/118 (44%), Gaps = 27/118 (22%)

Query: 292 KLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFE-DMQEQNSRLLQQ 350
           K  +        QL+ Q  +LQ++V +  QE+  L   ++   +  E +++  NS++LQ 
Sbjct: 323 KAQELSHADSTCQLQKQVAELQEEVTSQSQEQAILQRSLQ--DKEAEVEVERMNSKVLQG 380

Query: 351 LREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQA----LSLQIEAMHTAIRKLE 404
              +                    A+E +  L++QV +    L L +EA+H++ + L 
Sbjct: 381 ELSR--------------------AQEARRRLQQQVASAEEQLQLVVEAVHSSQQGLL 418



 Score = 32.7 bits (74), Expect = 0.49
 Identities = 30/114 (26%), Positives = 47/114 (41%), Gaps = 11/114 (9%)

Query: 290 RKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQ 349
           R  LA  E +RK   LE+   Q  ++     QE+ + L +            E  S L  
Sbjct: 121 RAALAGAEEVRK--NLEEGGQQELEEAQRLHQEQLSSLTQA---------HLEALSSLRS 169

Query: 350 QLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKL 403
           +    + +   L T R    +    A+ E DTLREQ+     ++EA  T + +L
Sbjct: 170 KAEGLEKSLQSLETRRAGEAKALAAAQAEADTLREQLSKTQEELEAQVTLVEQL 223



 Score = 30.0 bits (67), Expect = 3.2
 Identities = 30/146 (20%), Positives = 66/146 (45%), Gaps = 11/146 (7%)

Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
           E   + ++ LE +     K +A  + E + L  ++  T    E+++ Q + L++QLR   
Sbjct: 172 EGLEKSLQSLETRRAGEAKALAAAQAEADTLREQLSKTQ---EELEAQVT-LVEQLR--- 224

Query: 356 DANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLT 415
               K + E++ +    +    E+  L + V+ L     A+   +  L+ + + L  +L 
Sbjct: 225 ----KYVGEQVVAESRSQAWELERGELLQTVKHLQEDRAALQATVELLQVRVQSLTHILA 280

Query: 416 NAEKELHLRNQAMDLNKRKAIESAQS 441
             E+EL  + Q +D+ + +     +S
Sbjct: 281 LQEEELTRKAQPVDILEPEFARKCRS 306


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 32.7 bits (74), Expect = 0.44
 Identities = 35/196 (17%), Positives = 77/196 (39%), Gaps = 24/196 (12%)

Query: 262 LSDSASNI-YGSLEFRKFACVVR---DSKRDERKKLAD------EEAMRKIKQLEDQTHQ 311
           LS + S++   S+  +K    +R   +   + R+ + D      +E  ++ +QL+++  +
Sbjct: 165 LSGNVSDVDTDSISDKKVVEALREDNEKGVNFRRDMTDLKERESQEDAKRAQQLKEELDK 224

Query: 312 LQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQ----LREKDDANFKLMTERIK 367
            Q      +Q+ +          Q   D Q    R  QQ    L +  D +     +++ 
Sbjct: 225 KQIDADKAQQKAD--------FAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVA 276

Query: 368 SNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQA 427
            NQ  ++ + + +  +   +AL  +        ++ +  E+  +     A+K+      A
Sbjct: 277 ENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKK--REPVA 334

Query: 428 MDLNKRKAIESAQSAA 443
            DL K K    AQ  +
Sbjct: 335 EDLQKTKPQVEAQPTS 350


>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC.  All proteins in this family for
           which functions are known are part of an exonuclease
           complex with sbcD homologs. This complex is involved in
           the initiation of recombination to regulate the levels
           of palindromic sequences in DNA. This family is based on
           the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1042

 Score = 33.0 bits (75), Expect = 0.45
 Identities = 32/194 (16%), Positives = 62/194 (31%), Gaps = 21/194 (10%)

Query: 285 SKRDERKKLADEEAMR-----KIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTG--QAF 337
           S+    +   +          +   L    H LQ+Q  T  Q+ ++L  E+++    QA 
Sbjct: 352 SQEIHIRDAHEVATSIREISCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQAT 411

Query: 338 EDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMH 397
            D +    R LQ          +L     +              +    Q   L+   + 
Sbjct: 412 IDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAAA---------ITCTAQCEKLEKIHLQ 462

Query: 398 TAIRKLEEKERFLQTV--LTNAEKELH-LRNQAMDLNKRKAIESAQSAADLKLHLVIIYT 454
            + + L+E+E+ LQT   +   E     +    +   + +      S          I  
Sbjct: 463 ESAQSLKEREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLCGSCIHPNPARQDIDN 522

Query: 455 NGP--ARLEFGKDR 466
            GP   R++ G+  
Sbjct: 523 PGPLTRRMQRGEQT 536



 Score = 32.2 bits (73), Expect = 0.69
 Identities = 29/197 (14%), Positives = 62/197 (31%), Gaps = 31/197 (15%)

Query: 275 FRKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTG 334
           F     V    +          E     +  E+ TH L+   A   QE  +  + + +  
Sbjct: 764 FNNNEEVTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQC 823

Query: 335 QAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEK--DTLREQVQALSLQ 392
              E + ++  + L +L E             KS  L ++  +    +   +Q+  L+ +
Sbjct: 824 ---ETLVQEEEQFLSRLEE-------------KSATLGEITHQLLKYEECSKQLAQLTQE 867

Query: 393 IEAMHTAIRKLE---------EKERFLQTVL-TNAEKELHLRNQA---MDLNKRKAIESA 439
              +     KL          + +  ++ +        + L NQ+          +  +A
Sbjct: 868 QAKIIQLSDKLNGINQIKIQFDGDALIKFLHEITLYANVRLANQSEGRFHGRYADSHVNA 927

Query: 440 QSAADLKLHLVIIYTNG 456
           +    L L +   YT  
Sbjct: 928 RKYQGLALLVADAYTGS 944



 Score = 31.9 bits (72), Expect = 1.0
 Identities = 24/166 (14%), Positives = 55/166 (33%), Gaps = 9/166 (5%)

Query: 268 NIYGSLEFRKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALL 327
           N++   ++ + A +    K+    K        ++  L   T  +       KQ  E  L
Sbjct: 171 NLFPLDQYTQLALMEFAKKKSLHGKAELLTLRSQL--LTLCTPCMPDTYHERKQVLEKEL 228

Query: 328 NEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQL------HKLAREEKDT 381
             +    Q  +      ++  +   E+      L   R +  +L       +  +E  + 
Sbjct: 229 KHLREALQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERIN- 287

Query: 382 LREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQA 427
              +   L+  I+A+    ++ +     LQ+ + +  K L  R   
Sbjct: 288 RARKAAPLAAHIKAVTQIEQQAQRIHTELQSKMRSRAKLLMKRAAH 333



 Score = 28.8 bits (64), Expect = 8.9
 Identities = 24/194 (12%), Positives = 50/194 (25%), Gaps = 32/194 (16%)

Query: 279 ACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEM-----EVT 333
            C  +  K ++       +++++ +Q      Q+  Q    K    A L E+      + 
Sbjct: 448 TCTAQCEKLEKIHLQESAQSLKEREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLC 507

Query: 334 GQAFEDMQ------------------EQNSRLLQQLREKDDANFKLMTERIKSNQLHKLA 375
           G                         EQ    L+   E          ++  S +     
Sbjct: 508 GSCIHPNPARQDIDNPGPLTRRMQRGEQTYAQLETSEEDVYHQLTSERKQRASLK----- 562

Query: 376 REEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKA 435
            E+   +++    L+         I  L+     LQ +    EK     +          
Sbjct: 563 -EQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLT---EKLSEAEDMLACEQHALL 618

Query: 436 IESAQSAADLKLHL 449
            +         + L
Sbjct: 619 RKLQPEQDLQDVRL 632


>gnl|CDD|219487 pfam07611, DUF1574, Protein of unknown function (DUF1574).  A
           family of hypothetical proteins in Leptospira
           interrogans.
          Length = 345

 Score = 32.5 bits (74), Expect = 0.46
 Identities = 21/100 (21%), Positives = 39/100 (39%), Gaps = 11/100 (11%)

Query: 167 IFRFSS-STVFVYHCFSIFRFSSSTV---FVYHCFSIFRFSSSTVFVYHCFSIFRFSSST 222
           ++ FSS     VY+ + + + +S  V    +    S  +F+S++       S   +S   
Sbjct: 91  LYNFSSPVATPVYYLYWLEKIASGGVKPDLIVIESSPEQFNSNSTTFKD--SNLAYSFDP 148

Query: 223 IFVYHCFDIFSLT--STLFVSICTSAFQMSFYKSRWFDFR 260
           IF+    D  S    +  F +     F  S+ K R  +  
Sbjct: 149 IFILRYADTLSKEDVNYYFAN---RLFGTSYNKPRIKNVI 185



 Score = 31.4 bits (71), Expect = 1.2
 Identities = 15/84 (17%), Positives = 30/84 (35%), Gaps = 4/84 (4%)

Query: 115 DSSFGIFRLSSTVFVYHCFSIFRFSS-STVFVYHCFSIFRFSSSTV---FVYHCFSIFRF 170
            S    F        Y  + ++ FSS     VY+ + + + +S  V    +    S  +F
Sbjct: 71  SSRMLYFSNDDIEKKYPDWDLYNFSSPVATPVYYLYWLEKIASGGVKPDLIVIESSPEQF 130

Query: 171 SSSTVFVYHCFSIFRFSSSTVFVY 194
           +S++         + F    +  Y
Sbjct: 131 NSNSTTFKDSNLAYSFDPIFILRY 154


>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 257 to 277 amino acids in length. This domain is
           found associated with pfam00004. This domain has a
           conserved LER sequence motif.
          Length = 276

 Score = 32.4 bits (74), Expect = 0.46
 Identities = 27/137 (19%), Positives = 56/137 (40%), Gaps = 23/137 (16%)

Query: 289 ERKKLADEEAMRKIKQ---LEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
            RK+   E   ++ +    L+ Q   + +Q A  +  EE +L     T +   +++ +N 
Sbjct: 116 ARKRYQKELEQQRRQNEELLKMQEESVLRQEAMRRATEEEILEMRRETIEEEAELERENI 175

Query: 346 RLL---------QQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAM 396
           R           ++ RE +D N +++  +           EE++T+ E ++     I   
Sbjct: 176 RAKIEAEARGRAKEERENEDINREMLKLKAN---------EERETVLESIKTTFSHIGGG 226

Query: 397 HTAIRKLEEKERFLQTV 413
             A+  L +K +   TV
Sbjct: 227 FRAL--LTDKSKLTMTV 241



 Score = 28.6 bits (64), Expect = 7.1
 Identities = 21/112 (18%), Positives = 46/112 (41%), Gaps = 11/112 (9%)

Query: 288 DERKKLADEEAM--RKIKQLEDQ--THQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ 343
           +ER+K   E+    ++  Q +D+    + QK++   +++ E LL   E +    E M+  
Sbjct: 92  EERRKTLQEQTQQEQQRAQYQDELARKRYQKELEQQRRQNEELLKMQEESVLRQEAMRRA 151

Query: 344 NSRLLQQLREKDDANFKLMTERIKSNQLH-------KLAREEKDTLREQVQA 388
               + ++R +       +       ++        K  RE +D  RE ++ 
Sbjct: 152 TEEEILEMRRETIEEEAELERENIRAKIEAEARGRAKEERENEDINREMLKL 203


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 32.6 bits (74), Expect = 0.46
 Identities = 34/171 (19%), Positives = 60/171 (35%), Gaps = 28/171 (16%)

Query: 283 RDSKRDERKKLADEEAMR-----KIKQLEDQTHQL----QKQVATHKQEEEALLNEMEVT 333
           + ++  + K+ A++E ++     ++K  E Q        Q Q+   +QEE+A     E  
Sbjct: 87  QVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQK 146

Query: 334 GQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQI 393
            +A     +  +   +  + K  A  K   E          A EE     E   A   + 
Sbjct: 147 KKAEAAKAKAAA---EAAKLKAAAEAKKKAEEAAK------AAEEAKAKAEAAAA---KK 194

Query: 394 EAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAAD 444
           +A   A    E+ +         AEK      +A    + KA    + AA 
Sbjct: 195 KAEAEAKAAAEKAKA-EAEAKAKAEK------KAEAAAEEKAAAEKKKAAA 238



 Score = 31.1 bits (70), Expect = 1.4
 Identities = 25/137 (18%), Positives = 58/137 (42%), Gaps = 7/137 (5%)

Query: 308 QTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIK 367
           Q  ++Q Q ++ K+ E+    + E   Q  E+++ + +   ++L++ +    K   ++ +
Sbjct: 63  QYGRIQSQQSSAKKGEQQRKKKEE---QVAEELKPKQAAEQERLKQLEKERLKAQEQQKQ 119

Query: 368 SNQLHKLAREEKDTLREQV--QALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRN 425
           + +  K A+ E+    EQ    A   + +A     +   E  +        A+K+     
Sbjct: 120 AEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAK--LKAAAEAKKKAEEAA 177

Query: 426 QAMDLNKRKAIESAQSA 442
           +A +  K KA  +A   
Sbjct: 178 KAAEEAKAKAEAAAAKK 194


>gnl|CDD|219286 pfam07072, DUF1342, Protein of unknown function (DUF1342).  This
           family consists of several hypothetical bacterial
           proteins of around 250 residues in length. Members of
           this family are often known as YacF after the
           Escherichia coli protein. The function of this family is
           unknown.
          Length = 211

 Score = 32.1 bits (74), Expect = 0.47
 Identities = 18/57 (31%), Positives = 24/57 (42%), Gaps = 10/57 (17%)

Query: 302 IKQLEDQTHQLQK-----QVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLRE 353
           +K LE Q   L K      V       E LL+E+E   QA   +     R+ Q L+E
Sbjct: 46  LKDLERQRQTLLKWRGNPGVDQEA--LEQLLDEIE---QASAALNAAPGRIGQSLKE 97


>gnl|CDD|216292 pfam01086, Clathrin_lg_ch, Clathrin light chain. 
          Length = 225

 Score = 32.0 bits (73), Expect = 0.54
 Identities = 14/56 (25%), Positives = 26/56 (46%), Gaps = 5/56 (8%)

Query: 285 SKRDERKKLADEEAMRK-IKQLED----QTHQLQKQVATHKQEEEALLNEMEVTGQ 335
            +RDE  +   EE + K  K+++D       + +K  A +++E E  L E +    
Sbjct: 126 EERDEASEKKKEELIEKAQKEIDDFYENYNEKKEKTKAQNRKEAEQFLAERDDFSS 181


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 536

 Score = 32.4 bits (74), Expect = 0.55
 Identities = 29/139 (20%), Positives = 60/139 (43%), Gaps = 16/139 (11%)

Query: 289 ERKKLADEEAMRK-----IKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ 343
              ++  EE   K     ++ L++Q  +L+  +     E +AL ++++   +    ++++
Sbjct: 254 PPAEMDTEEDRTKEREAELEALQEQIDELESSIEEVLSEIKALASKIKQVNEELTTVRQE 313

Query: 344 NSRLLQQLREKDDANFKLMTERIKS--NQLHKLAREEKDTLREQVQALSLQIEAMHTAIR 401
           N  L ++ + K     K   E +    N + KL      +  E++  L+ Q EA  T   
Sbjct: 314 NEELEEEYKIK-----KRTVELLPDAENNVAKLQALVVAS-SERLLELAQQWEAHRTP-- 365

Query: 402 KLEEKERFLQTVLTNAEKE 420
            L ++ R L+    N E E
Sbjct: 366 -LIDEYRSLKEKNRNKEDE 383


>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA
           binding.  This is the middle region of a family of TATA
           element modulatory factor 1 proteins conserved in
           eukaryotes that contains at its N-terminal section a
           number of leucine zippers that could potentially form
           coiled coil structures. The whole proteins bind to the
           TATA element of some RNA polymerase II promoters and
           repress their activity. by competing with the binding of
           TATA binding protein. TMFs are evolutionarily conserved
           golgins that bind Rab6, a ubiquitous ras-like
           GTP-binding Golgi protein, and contribute to Golgi
           organisation in animal and plant cells.
          Length = 74

 Score = 30.0 bits (68), Expect = 0.57
 Identities = 15/75 (20%), Positives = 35/75 (46%), Gaps = 9/75 (12%)

Query: 288 DERKKLADEEAMRKIKQLEDQTHQLQKQVATH-------KQEEEALLNEMEVTGQAFEDM 340
              KKLA+++    I QL ++  +L K+   H       + + + L  E+    +  E +
Sbjct: 2   SLEKKLAEKDEQ--IAQLMEEGEKLSKKELKHNNTIKKLRAKNKELEKEIAELKKKLEKL 59

Query: 341 QEQNSRLLQQLREKD 355
           +++   L ++L+  +
Sbjct: 60  EKELENLEERLKRAE 74



 Score = 29.2 bits (66), Expect = 0.94
 Identities = 22/73 (30%), Positives = 35/73 (47%), Gaps = 2/73 (2%)

Query: 347 LLQQLREKDDANFKLMTERIK-SNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
           L ++L EKD+   +LM E  K S +  K        LR + + L  +I  +   + KLE+
Sbjct: 3   LEKKLAEKDEQIAQLMEEGEKLSKKELKH-NNTIKKLRAKNKELEKEIAELKKKLEKLEK 61

Query: 406 KERFLQTVLTNAE 418
           +   L+  L  AE
Sbjct: 62  ELENLEERLKRAE 74



 Score = 26.5 bits (59), Expect = 8.6
 Identities = 16/57 (28%), Positives = 27/57 (47%), Gaps = 5/57 (8%)

Query: 369 NQLHKLAREEKDT----LREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKEL 421
           + L K   E KD     L E+ + LS +    +  I+KL  K + L+  +   +K+L
Sbjct: 1   SSLEKKLAE-KDEQIAQLMEEGEKLSKKELKHNNTIKKLRAKNKELEKEIAELKKKL 56


>gnl|CDD|220402 pfam09787, Golgin_A5, Golgin subfamily A member 5.  Members of this
           family of proteins are involved in maintaining Golgi
           structure. They stimulate the formation of Golgi stacks
           and ribbons, and are involved in intra-Golgi retrograde
           transport. Two main interactions have been
           characterized: one with RAB1A that has been activated by
           GTP-binding and another with isoform CASP of CUTL1.
          Length = 509

 Score = 32.5 bits (74), Expect = 0.58
 Identities = 27/170 (15%), Positives = 47/170 (27%), Gaps = 42/170 (24%)

Query: 290 RKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAF-------EDMQE 342
           + KL   +A  K                    + E L  E+     A        ++  E
Sbjct: 159 QDKLLLLDAAVKRHDGN--VITAVLIFLKRTLKLEILFQELAERQTALRHLSKHLQEEGE 216

Query: 343 QNSRL-------------LQQLREKDDANFKLMTERI---KSNQLHKLA----------- 375
               L             LQQ R+K     +   +RI   K   L +             
Sbjct: 217 LQQLLKLLRAEGESEKQELQQYRQKAHRILQSKEKRINFLKEGCLFEGLDSSTAQIELEE 276

Query: 376 -REEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLR 424
            + E + ++E++  L  QI        + E ++R  +            R
Sbjct: 277 LKHESEHVQEEITKLEGQII-----QLRSEAQDREAEASGEAESFRKQPR 321


>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein.  The sequences
           featured in this family are similar to a region of human
           TPR protein and to yeast myosin-like proteins 1 (MLP1)
           and 2 (MLP2). These proteins share a number of features;
           for example, they all have coiled-coil regions and all
           three are associated with nuclear pores. TPR is thought
           to be a component of nuclear pore complex- attached
           intra-nuclear filaments, and is implicated in nuclear
           protein import. Moreover, its N-terminal region is
           involved in the activation of oncogenic kinases,
           possibly by mediating the dimerisation of kinase domains
           or by targeting these kinases to the nuclear pore
           complex. MLP1 and MLP2 are involved in the process of
           telomere length regulation, where they are thought to
           interact with proteins such as Tel1p and modulate their
           activity.
          Length = 132

 Score = 30.7 bits (70), Expect = 0.63
 Identities = 12/54 (22%), Positives = 28/54 (51%), Gaps = 4/54 (7%)

Query: 302 IKQLEDQTHQLQKQVATHKQ----EEEALLNEMEVTGQAFEDMQEQNSRLLQQL 351
           I QL+ +    Q +++  ++    +++ L +E+    +  E++ EQN  L  Q+
Sbjct: 75  IAQLKAEAESAQAELSEAEESWEEQKKMLEDELSELEKRIEELNEQNKLLHDQI 128



 Score = 28.4 bits (64), Expect = 4.0
 Identities = 23/130 (17%), Positives = 56/130 (43%), Gaps = 24/130 (18%)

Query: 301 KIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFED-MQEQ--NSRLLQQLREKDDA 357
           ++K+L ++    ++++   +++ E          Q +E  + +   +   LQ LR+    
Sbjct: 11  ELKRLLEKKEDAEEKIQKLQEDLEEQAEIANEAQQKYERELVKHAEDIEELQALRK---- 66

Query: 358 NFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNA 417
                       QL++L ++E   L+ + ++   ++     A    EE+++ L+  L+  
Sbjct: 67  ------------QLNEL-KKEIAQLKAEAESAQAELS---EAEESWEEQKKMLEDELSEL 110

Query: 418 EKELH-LRNQ 426
           EK +  L  Q
Sbjct: 111 EKRIEELNEQ 120


>gnl|CDD|220964 pfam11068, DUF2869, Protein of unknown function (DUF2869).  This
           bacterial family of proteins has no known function.
          Length = 131

 Score = 31.0 bits (71), Expect = 0.63
 Identities = 22/90 (24%), Positives = 40/90 (44%), Gaps = 16/90 (17%)

Query: 314 KQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSN---- 369
           KQ+ T + +EE              ++QEQ ++L Q+L++ +    K + E  K +    
Sbjct: 10  KQIVTERWKEELQ-----------AELQEQLTQLEQELQQLEFQGQKAIKEIRKQSAQQI 58

Query: 370 -QLHKLAREEKDTLREQVQALSLQIEAMHT 398
            Q+ +   +E+    EQ   L  QIE +  
Sbjct: 59  EQIQQQVEQERAKRLEQKNQLLFQIEQVQE 88



 Score = 27.6 bits (62), Expect = 8.2
 Identities = 16/85 (18%), Positives = 35/85 (41%), Gaps = 14/85 (16%)

Query: 291 KKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTG-QAFEDMQEQNSRLLQ 349
           K+    E   ++ QLE +  QL       + + +  + E+     Q  E +Q+Q  +   
Sbjct: 18  KEELQAELQEQLTQLEQELQQL-------EFQGQKAIKEIRKQSAQQIEQIQQQVEQERA 70

Query: 350 QLREKDDANFKLMTERIKSNQLHKL 374
           +  E+       +  +I+  Q+ +L
Sbjct: 71  KRLEQK----NQLLFQIE--QVQEL 89


>gnl|CDD|222647 pfam14282, FlxA, FlxA-like protein.  This family includes FlxA from
           E. coli. The expression of FlxA is regulated by the FliA
           sigma factor, a transcription factor specific for class
           3 flagellar operons. However FlxA is not required for
           flagellar function or formation.
          Length = 106

 Score = 30.5 bits (69), Expect = 0.65
 Identities = 13/53 (24%), Positives = 24/53 (45%), Gaps = 2/53 (3%)

Query: 293 LADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
           L  EE  ++ + ++ Q   LQ Q+A  +Q++       +    + ED   Q S
Sbjct: 44  LTAEEKQKQQQLIQQQIQMLQAQIAQLQQQQS--EEAQQQQQSSIEDSPSQTS 94


>gnl|CDD|214457 MTH00205, ND4, NADH dehydrogenase subunit 4; Provisional.
          Length = 448

 Score = 32.3 bits (74), Expect = 0.65
 Identities = 24/126 (19%), Positives = 43/126 (34%), Gaps = 3/126 (2%)

Query: 116 SSFGIFRLSSTVFVY--HCFSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSS 173
           S FG+F L   +FV     +S F         +  F        T F+   FS    S  
Sbjct: 2   SIFGVFSLLIILFVLKFKDWSFFSNLLLIGLFFSSFFFSNSFLMTSFISSSFSFDSLSLP 61

Query: 174 TVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTIFVYHCFDIFS 233
            + +   +       ++  V+      +  + +V +     I  FS   + +++     S
Sbjct: 62  LI-MLSFWISLLMVLASYKVFKSNKNSKLFTVSVIMLLLILILSFSVDNLLLFYITFEAS 120

Query: 234 LTSTLF 239
           L  TL 
Sbjct: 121 LIPTLI 126



 Score = 30.3 bits (69), Expect = 2.7
 Identities = 21/128 (16%), Positives = 34/128 (26%), Gaps = 13/128 (10%)

Query: 133 FSIFRFS-SSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTV 191
           F +F       V  +  +S F         +  F        T F+   FS    S   +
Sbjct: 4   FGVFSLLIILFVLKFKDWSFFSNLLLIGLFFSSFFFSNSFLMTSFISSSFSFDSLSLPLI 63

Query: 192 FVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTIFVYHCFDIFSLTSTLFVSICTSAFQMSF 251
                 S +        +    S   F S+         +  L   L +S       + F
Sbjct: 64  ----MLSFW-----ISLLMVLASYKVFKSNKNSKLFTVSVIMLLLILILSFSVDNLLL-F 113

Query: 252 YKSRWFDF 259
           Y    F+ 
Sbjct: 114 YI--TFEA 119


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 32.0 bits (73), Expect = 0.66
 Identities = 20/97 (20%), Positives = 37/97 (38%), Gaps = 22/97 (22%)

Query: 289 ERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLL 348
             K+  +E A   +K+LE++T   +               EMEV     E+++E  SR  
Sbjct: 121 VEKEREEELAGDAMKKLENRTADSK--------------REMEVLE-RLEELKELQSR-- 163

Query: 349 QQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQ 385
                + D +   M E +   +  +   EE++     
Sbjct: 164 -----RADVDVNSMLEALFRREKKEEEEEEEEDEALI 195


>gnl|CDD|218215 pfam04696, Pinin_SDK_memA, pinin/SDK/memA/ protein conserved
           region.  Members of this family have very varied
           localisations within the eukaryotic cell. pinin is known
           to localise at the desmosomes and is implicated in
           anchoring intermediate filaments to the desmosomal
           plaque. SDK2/3 is a dynamically localised nuclear
           protein thought to be involved in modulation of
           alternative pre-mRNA splicing. memA is a tumour marker
           preferentially expressed in human melanoma cell lines. A
           common feature of the members of this family is that
           they may all participate in regulating protein-protein
           interactions.
          Length = 131

 Score = 30.9 bits (70), Expect = 0.66
 Identities = 15/56 (26%), Positives = 27/56 (48%)

Query: 371 LHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQ 426
           L +  ++E++ LR++ + L  +       +RKLE+K    +   T  E  L L N 
Sbjct: 44  LEEQEKQEREELRKEKRELFEERRRKQLELRKLEQKMEDEKLQETWHEHNLALANF 99


>gnl|CDD|225647 COG3105, COG3105, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 138

 Score = 30.9 bits (70), Expect = 0.67
 Identities = 20/105 (19%), Positives = 46/105 (43%), Gaps = 12/105 (11%)

Query: 281 VVRDSKRDERKKLADEEAMRKIK-QLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFED 339
           + R + R  +++   +  + K+K QL++     ++++  H      LL  +    Q ++ 
Sbjct: 25  IARLTNRKLKQQQKLQYELEKVKAQLDEY----RQELVKHFARSAELLKTL---AQDYQK 77

Query: 340 MQEQ----NSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKD 380
           + +     ++ LL +L  KD+    L+ E   SN    +    +D
Sbjct: 78  LYQHMAKSSTSLLPELSAKDNPFRNLLAEEEASNDQAPVQMPPRD 122


>gnl|CDD|220173 pfam09311, Rab5-bind, Rabaptin-like protein.  Members of this
           family are predominantly found in Rabaptin and allow for
           binding to the GTPase Rab5. This interaction is
           necessary and sufficient for Rab5-dependent recruitment
           of Rabaptin5 to early endosomal membranes.
          Length = 180

 Score = 31.2 bits (71), Expect = 0.68
 Identities = 26/135 (19%), Positives = 48/135 (35%), Gaps = 14/135 (10%)

Query: 318 THKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAR- 376
           +  Q +EA+ ++++    + + ++ Q  RL Q+       N  L  E   + Q  + +  
Sbjct: 5   SFSQVQEAVQDQLQELEASRQKLRAQLRRLQQE-------NQWLRGELALTQQELQASEQ 57

Query: 377 ------EEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDL 430
                 EE   L+        ++    TA   LEE+ +     L   +  L     A   
Sbjct: 58  EVIQLPEEVKHLQFLCLQRREELIESRTASEHLEEELKSEIDPLKELKPNLEEELAAEIE 117

Query: 431 NKRKAIESAQSAADL 445
             +   E  Q  A L
Sbjct: 118 LPQIQSEKDQLQASL 132



 Score = 29.3 bits (66), Expect = 3.3
 Identities = 19/113 (16%), Positives = 42/113 (37%), Gaps = 17/113 (15%)

Query: 304 QLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ-NSR--LLQQLREKDDANFK 360
           QL ++   LQ      ++E             A E ++E+  S    L++L+   +    
Sbjct: 61  QLPEEVKHLQFLCLQRREELIESR-------TASEHLEEELKSEIDPLKELKPNLEE--- 110

Query: 361 LMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTV 413
                 +       +  EKD L+  +  +  ++  +H  + + E + R+   V
Sbjct: 111 --ELAAEIELPQIQS--EKDQLQASLYEIPARLRTLHNLVIQYESQGRYEVAV 159


>gnl|CDD|148750 pfam07321, YscO, Type III secretion protein YscO.  This family
           contains the bacterial type III secretion protein YscO,
           which is approximately 150 residues long. YscO has been
           shown to be required for high-level expression and
           secretion of the anti-host proteins V antigen and Yops
           in Yersinia pestis.
          Length = 152

 Score = 31.2 bits (71), Expect = 0.69
 Identities = 23/115 (20%), Positives = 46/115 (40%), Gaps = 3/115 (2%)

Query: 291 KKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQ 350
           KKL  E A + +K+ E     L +  A H++ +++L +  +   Q    +  +       
Sbjct: 9   KKLRAERAEKAVKRQE---QALVQARARHREAQQSLQDYRQWRQQEENRLFAELQGATVL 65

Query: 351 LREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
           L+E +    ++   R     L +   E K  L  + Q L    + +  A +  E+
Sbjct: 66  LKELEKWQQQVGLLRENEASLEQQLAEAKQRLEAERQRLRQARQQLQEARKAQEK 120


>gnl|CDD|224643 COG1729, COG1729, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 262

 Score = 31.6 bits (72), Expect = 0.71
 Identities = 19/88 (21%), Positives = 27/88 (30%), Gaps = 9/88 (10%)

Query: 272 SLEFRKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVA--THKQEEEALLNE 329
           S      A       R    +        ++ QLE Q  QLQ ++      QE +   N+
Sbjct: 28  SGAPLSSAGNGSVEDRVTPLERVQNAHSYRLTQLEQQLRQLQGKIEELRGIQELQYQNNQ 87

Query: 330 MEVTGQAFEDMQEQNSRLLQQLREKDDA 357
                      QE+N   L  L     A
Sbjct: 88  NVER-------QEENEARLDSLESGRQA 108


>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
          Length = 895

 Score = 32.2 bits (73), Expect = 0.72
 Identities = 25/158 (15%), Positives = 64/158 (40%), Gaps = 19/158 (12%)

Query: 290 RKKLADEEA-----MRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQN 344
           +K++AD+E      +++I++L  + +         K     L +  ++  +   +++   
Sbjct: 203 KKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLEDMKNRYESEIKTAE 262

Query: 345 SRLLQQLREKDDANFKLMTER---IKSNQLHKLAREEKD---------TLREQVQALSLQ 392
           S L   +  + +  +K + ER   I ++ ++K      D           ++ +  +  +
Sbjct: 263 SDL--SMELEKNNYYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKKQILSNIDAE 320

Query: 393 IEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDL 430
           I   H  I+KL   ++     +    +   L NQ ++L
Sbjct: 321 INKYHAIIKKLSVLQKDYNDYIKKKSRYDDLNNQILEL 358



 Score = 32.2 bits (73), Expect = 0.83
 Identities = 32/176 (18%), Positives = 66/176 (37%), Gaps = 17/176 (9%)

Query: 279 ACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFE 338
           A  V      E  +    E  +++  LE +  +++      K   +  + E+E       
Sbjct: 573 ALAVISLIDIETNRSRSNEIKKQLNDLESRLQEIEIGFPDDKSYIDKSIREIENEANNLN 632

Query: 339 DMQ---EQNSRLLQQLREKDDANFKL----MTERIKS-NQLHKLAREEKDTLREQVQALS 390
           +     ++N  L+++LR K D N+K     +   I    ++     + +D L++      
Sbjct: 633 NKYNEIQENKILIEKLRGKID-NYKKQIAEIDSIIPDLKEITSRINDIEDNLKK----SR 687

Query: 391 LQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLK 446
             ++       +LE     L+T +      ++  N+   L   K I+ A    DLK
Sbjct: 688 KALDDAKANRARLESTIEILRTRINELSDRINDINE--TLESMKKIKKA--IGDLK 739


>gnl|CDD|189476 pfam00261, Tropomyosin, Tropomyosin. 
          Length = 237

 Score = 31.8 bits (73), Expect = 0.73
 Identities = 34/134 (25%), Positives = 55/134 (41%), Gaps = 15/134 (11%)

Query: 300 RKIKQLEDQTHQLQKQV--ATHK-QEEEALL--NEMEVTG-----QAFEDMQEQNSRLLQ 349
           +KIKQLE++  + ++++  A  K +E E      E EV       Q  E+  E++   L 
Sbjct: 1   KKIKQLEEELDKAEERLEEAQEKLEEAEKRAEKAEAEVASLNRRIQLLEEDLERSEERLA 60

Query: 350 QLREKDDANFKLMTERIKS-NQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKER 408
              EK +   K   E  +    L   + ++ + + +    L    E    A RK EE  R
Sbjct: 61  TALEKLEEAEKAADESERGRKVLENRSLKDDEKMEQLEAQLKEAKEIAEEADRKYEEVAR 120

Query: 409 FLQTVLTNAEKELH 422
            L  V    E +L 
Sbjct: 121 KLVVV----EGDLE 130


>gnl|CDD|184039 PRK13411, PRK13411, molecular chaperone DnaK; Provisional.
          Length = 653

 Score = 32.0 bits (73), Expect = 0.75
 Identities = 18/107 (16%), Positives = 45/107 (42%), Gaps = 18/107 (16%)

Query: 297 EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDD 356
           E MR+  +   +  + +KQ+   K + ++LL   E T      ++E    + ++L+++ +
Sbjct: 508 ERMRQEAEKYAEEDRRRKQLIELKNQADSLLYSYEST------LKENGELISEELKQRAE 561

Query: 357 ANFKLMTERIKSNQLHKLARE---EKDTLREQVQALSLQIEAMHTAI 400
                     K  QL     +     + L++Q++     + A+   +
Sbjct: 562 Q---------KVEQLEAALTDPNISLEELKQQLEEFQQALLAIGAEV 599


>gnl|CDD|163506 TIGR03794, NHLM_micro_HlyD, NHLM bacteriocin system secretion
           protein.  Members of this protein family are homologs of
           the HlyD membrane fusion protein of type I secretion
           systems. Their occurrence in prokaryotic genomes is
           associated with the occurrence of a novel class of
           microcin (small bacteriocins) with a leader peptide
           region related to nitrile hydratase. We designate the
           class of bacteriocin as Nitrile Hydratase Leader
           Microcin, or NHLM. This family, therefore, is designated
           as NHLM bacteriocin system secretion protein. Some but
           not all NHLM-class putative microcins belong to the TOMM
           (thiazole/oxazole modified microcin) class as assessed
           by the presence of the scaffolding protein and/or
           cyclodehydratase in the same gene clusters [Transport
           and binding proteins, Amino acids, peptides and amines,
           Cellular processes, Biosynthesis of natural products].
          Length = 421

 Score = 32.1 bits (73), Expect = 0.78
 Identities = 29/166 (17%), Positives = 59/166 (35%), Gaps = 37/166 (22%)

Query: 296 EEAMRKIKQLEDQ----------------THQLQK-----QVATHKQEEEALLNEMEVTG 334
           +E+ +K+ QL++Q                 H  +      +     +EE A L+      
Sbjct: 99  QESYQKLTQLQEQLEEVRNYTGRLKEGRERHFQKSKEALEETIGRLREELAALSREVGKQ 158

Query: 335 QAFEDMQEQ---NSRLLQQLREKDDANFKLMTERIKSNQLHKLARE-EKDTLREQVQALS 390
           +             R+LQQ   ++   +       K+  ++ L  + ++  L   +Q+LS
Sbjct: 159 RGLLSRGLATFKRDRILQQQWREEQEKYD---AADKARAIYALQTKADERNLETVLQSLS 215

Query: 391 LQIEAMHTAIRKLEEKERFLQTV---LTNAEKELHLRNQAMDLNKR 433
                +          E+ L+TV   +  A  E+      ++LN R
Sbjct: 216 QADFQLAGV------AEKELETVEARIKEARYEIEELENKLNLNTR 255


>gnl|CDD|240364 PTZ00332, PTZ00332, paraflagellar rod protein; Provisional.
          Length = 589

 Score = 32.2 bits (73), Expect = 0.80
 Identities = 33/163 (20%), Positives = 73/163 (44%), Gaps = 16/163 (9%)

Query: 288 DERKKLADEEAMRKIKQLEDQTHQLQKQ--VATHKQEEEALLNEM-EVTGQAFED---MQ 341
           D   ++A+E+   K++  E     +  +  +    +EE    +++ EV  QA ++   M+
Sbjct: 207 DGEMQVAEEQMWTKVQLQERLIELVADKFRLIGKCEEENKSFSKIHEVQKQANQETSQMK 266

Query: 342 EQNSRLLQQ----LREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMH 397
           +   RL Q+    L+   DA  K   E  ++ + +   +E+ +      + +    +   
Sbjct: 267 DAKRRLKQRCETDLKHIHDAIQKADLEDAEAMKRYATNKEKSE------RFIRENEDRQE 320

Query: 398 TAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQ 440
            A  K+++ ER LQ + T   +E+  R +  D  +++ +E  Q
Sbjct: 321 EAWNKIQDLERQLQRLGTERFEEVKRRIEENDREEKRRVEYQQ 363


>gnl|CDD|219176 pfam06785, UPF0242, Uncharacterized protein family (UPF0242). 
          Length = 401

 Score = 31.8 bits (72), Expect = 0.82
 Identities = 32/145 (22%), Positives = 60/145 (41%), Gaps = 12/145 (8%)

Query: 290 RKKLADEEAMRKIKQLEDQTHQLQKQVATHKQE-EEALLNEMEVTGQAFEDMQEQNSRLL 348
           RK    +E +R I++  ++  +   Q+    Q+    L +   V  +     Q+      
Sbjct: 78  RKLTEQDEGLRNIRESIEERQKESDQLKLQNQKLVNQLGHARGVFIKTKGRYQKLEELSR 137

Query: 349 ------QQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRK 402
                 Q L+ + DA  +   E+I+ NQ  +L RE  +TL  Q Q L+ + +A       
Sbjct: 138 HLKEENQCLQIQLDALVQECNEKIEENQ--ELNRELAETLAYQ-QELNDEYQATFVEQHN 194

Query: 403 LEEKERFLQTVLTNAEKEL--HLRN 425
           + +K +     L    ++L   +RN
Sbjct: 195 MLDKRQAYIGKLEAKVQDLMCEIRN 219



 Score = 29.5 bits (66), Expect = 4.6
 Identities = 27/148 (18%), Positives = 65/148 (43%), Gaps = 24/148 (16%)

Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
           +  +RK+ + ++    +++ +   ++E + L              + QN +L+ QL    
Sbjct: 74  QTKIRKLTEQDEGLRNIRESIEERQKESDQL--------------KLQNQKLVNQLGH-- 117

Query: 356 DANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLT 415
            A    +  + +  +L +L+R     L+E+ Q L +Q++A+     +  E+ + L   L 
Sbjct: 118 -ARGVFIKTKGRYQKLEELSRH----LKEENQCLQIQLDALVQECNEKIEENQELNRELA 172

Query: 416 NA---EKELHLRNQAMDLNKRKAIESAQ 440
                ++EL+   QA  + +   ++  Q
Sbjct: 173 ETLAYQQELNDEYQATFVEQHNMLDKRQ 200


>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein.  Taxilin
           contains an extraordinarily long coiled-coil domain in
           its C-terminal half and is ubiquitously expressed. It is
           a novel binding partner of several syntaxin family
           members and is possibly involved in Ca2+-dependent
           exocytosis in neuroendocrine cells. Gamma-taxilin,
           described as leucine zipper protein Factor Inhibiting
           ATF4-mediated Transcription (FIAT), localises to the
           nucleus in osteoblasts and dimerises with ATF4 to form
           inactive dimers, thus inhibiting ATF4-mediated
           transcription.
          Length = 309

 Score = 31.6 bits (72), Expect = 0.88
 Identities = 31/139 (22%), Positives = 70/139 (50%), Gaps = 23/139 (16%)

Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQ- 341
            D+K ++ ++ A++E  +  K+ ++   +   QV+T K+ E+ L  ++ +    F++ Q 
Sbjct: 172 NDAKLEQAQEQAEQEQEKH-KREKEILLKEAAQVSTLKETEKELRAQLNLYSDKFKEFQD 230

Query: 342 ----------------EQNSRLLQQLREKDDANFKLMTERIKSNQ-LHKLAREEKDTLRE 384
                           E+ S+ +++L EK++  +K   E  KSN+ L ++A EE+    +
Sbjct: 231 TLNKSNEVFLTFKQEMEKMSKKIKKL-EKENLAWKRKWE--KSNKALLEMA-EERQKRDK 286

Query: 385 QVQALSLQIEAMHTAIRKL 403
           +++ L  +I+ +    R L
Sbjct: 287 EIEKLKKKIDKLEKLCRAL 305



 Score = 30.4 bits (69), Expect = 2.4
 Identities = 35/161 (21%), Positives = 60/161 (37%), Gaps = 28/161 (17%)

Query: 287 RDERKKLADEEAMRK---IKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ 343
           ++E K  +DEE  ++    ++ +     +Q Q+  H      L  E E   +  +++ EQ
Sbjct: 91  KEENKARSDEEEEKRKELTEKFQSTLKDIQDQMEEHSNPNHKLCEENEELREKLKELIEQ 150

Query: 344 NSRLLQQLREKD-DANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRK 402
                 +LRE+  +   K      + N   KL +               Q +A     + 
Sbjct: 151 -----YELREQHFEKLLKTKDLEQQLNDA-KLEQA--------------QEQAEQEQEKH 190

Query: 403 LEEKERFLQT---VLTNAEKELHLRNQ-AMDLNKRKAIESA 439
             EKE  L+    V T  E E  LR Q  +  +K K  +  
Sbjct: 191 KREKEILLKEAAQVSTLKETEKELRAQLNLYSDKFKEFQDT 231


>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
          Length = 425

 Score = 31.6 bits (73), Expect = 0.92
 Identities = 17/70 (24%), Positives = 34/70 (48%), Gaps = 10/70 (14%)

Query: 347 LLQQLREKDDANFKLMTE----RIKSNQL----HKLAREEKDT--LREQVQALSLQIEAM 396
            + +L E D+   +L TE    + + N L     +  R+ +D   L  +V+ L  +I+A+
Sbjct: 26  DVDELLELDEERRELQTELEELQAERNALSKEIGQAKRKGEDAEALIAEVKELKEEIKAL 85

Query: 397 HTAIRKLEEK 406
              + +LE +
Sbjct: 86  EAELDELEAE 95


>gnl|CDD|182409 PRK10361, PRK10361, DNA recombination protein RmuC; Provisional.
          Length = 475

 Score = 31.9 bits (72), Expect = 0.93
 Identities = 36/146 (24%), Positives = 65/146 (44%), Gaps = 25/146 (17%)

Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQ--EEEALLNEMEVTGQAFEDMQEQNSRLLQQLRE 353
           + A +K +QL ++   + +  A  +Q  + E    E E+       +Q  N+ L   LRE
Sbjct: 26  QHAQQKAEQLAEREEMVAELSAAKQQITQSEHWRAECELLNNEVRSLQSINTSLEADLRE 85

Query: 354 KDDANFKLMTERIKSNQLHKLAREEKDTLREQV---QALSLQIEAMHTAI-----RKLEE 405
                   +T R+++ Q H       D +R+ +   Q LS Q E +   I     R+++E
Sbjct: 86  --------VTTRMEAAQQHA-----DDKIRQMINSEQRLSEQFENLANRIFEHSNRRVDE 132

Query: 406 KER-FLQTVLTNAEKELH-LRNQAMD 429
           + R  L ++L+   ++L   R Q  D
Sbjct: 133 QNRQSLNSLLSPLREQLDGFRRQVQD 158


>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
           HlyD family.  Type I secretion is an ABC transport
           process that exports proteins, without cleavage of any
           signal sequence, from the cytosol to extracellular
           medium across both inner and outer membranes. The
           secretion signal is found in the C-terminus of the
           transported protein. This model represents the adaptor
           protein between the ATP-binding cassette (ABC) protein
           of the inner membrane and the outer membrane protein,
           and is called the membrane fusion protein. This model
           selects a subfamily closely related to HlyD; it is
           defined narrowly and excludes, for example, colicin V
           secretion protein CvaA and multidrug efflux proteins
           [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 423

 Score = 31.9 bits (73), Expect = 0.93
 Identities = 25/116 (21%), Positives = 44/116 (37%), Gaps = 10/116 (8%)

Query: 321 QEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKD 380
            +   L  E+    +A  D Q         L   +D       E IK      L    K 
Sbjct: 88  SQVLRLEAEVARL-RAEADSQAAIE-FPDDLLSAEDPAVP---ELIK--GQQSLFESRKS 140

Query: 381 TLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAI 436
           TLR Q++ +  QI+ +   +  L+ + + L+  L    +EL  R +   L ++  +
Sbjct: 141 TLRAQLELILAQIKQLEAELAGLQAQLQALRQQLEVISEELEARRK---LKEKGLV 193



 Score = 29.2 bits (66), Expect = 5.3
 Identities = 37/163 (22%), Positives = 66/163 (40%), Gaps = 32/163 (19%)

Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQE 342
           + + R +      E  + +IKQLE +   LQ Q+   +Q+ E +  E+E   +  E    
Sbjct: 139 KSTLRAQL-----ELILAQIKQLEAELAGLQAQLQALRQQLEVISEELEARRKLKEKGLV 193

Query: 343 QNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRK 402
               LL+  RE+ +                  A+ E   L  +++ L  QI+     + +
Sbjct: 194 SRLELLELERERAE------------------AQGELGRLEAELEVLKRQIDE--LQLER 233

Query: 403 LEEKERFLQTV---LTNAEKEL----HLRNQAMDLNKRKAIES 438
            + ++ F + V   LT A+  L       N+A D  +R  I S
Sbjct: 234 QQIEQTFREEVLEELTEAQARLAELRERLNKARDRLQRLIIRS 276


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 31.5 bits (72), Expect = 0.96
 Identities = 30/141 (21%), Positives = 60/141 (42%), Gaps = 25/141 (17%)

Query: 303 KQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLM 362
           K LE     L + +   K++ + L+ E+E+            + +  +LR++ DA   L 
Sbjct: 140 KLLEGLKEGLDENLEGLKEDYKLLMKELELL-----------NSIKPKLRDRKDA---LE 185

Query: 363 TERIKSNQLHKLARE-------EKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVL- 414
            E     QL +L  E       E D  +E+++ L  +I      + +LEE+ + L++ + 
Sbjct: 186 EE---LRQLKQLEDELEDCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIE 242

Query: 415 TNAEKELHLRNQAMDLNKRKA 435
               K+  L  +  +  K+  
Sbjct: 243 DLTNKKSELNTEIAEAEKKLE 263


>gnl|CDD|220555 pfam10079, DUF2317, Uncharacterized protein conserved in bacteria
           (DUF2317).  Members of this family of hypothetical
           bacterial proteins have no known function.
          Length = 541

 Score = 31.8 bits (73), Expect = 0.99
 Identities = 24/120 (20%), Positives = 46/120 (38%), Gaps = 18/120 (15%)

Query: 300 RKIKQLEDQTHQLQKQVATHKQEE---EALLNEMEVTG--QAFEDMQEQNSRLLQQLRE- 353
            K+++  D+   L  +       E   E +L E   +     FE+ + +     ++LRE 
Sbjct: 386 PKVEKKLDK-LGLSIEDVFLHGLELLKEKVLEEKANSEIDIDFEEAKAELEAQFKRLREE 444

Query: 354 --KDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMH-TAIRKLEEKERFL 410
             K D + +   E  ++  L +L   EK  L+ Q           H   +R+ +  +  L
Sbjct: 445 AAKIDPSLEGALEANEAKILKQLEFLEKRLLKAQ--------RRKHEELLRQFDRLQTAL 496


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 31.5 bits (72), Expect = 1.1
 Identities = 28/135 (20%), Positives = 63/135 (46%), Gaps = 6/135 (4%)

Query: 292 KLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQL 351
           KL  EE    ++ LE+   +  +++   K ++ +   E+E      E + +    + + +
Sbjct: 319 KLDVEELKALLEALEEILEKNLQKLEE-KLKDPSTSIELESITDLIESINDIIDAINELI 377

Query: 352 REKDD--ANFKLMTERIKSNQ-LHKLAREEKD--TLREQVQALSLQIEAMHTAIRKLEEK 406
           RE ++   N K    + K    LH +A  ++D    +++ + L   I ++   I++LE +
Sbjct: 378 REHNEKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAE 437

Query: 407 ERFLQTVLTNAEKEL 421
            + L+  +   EK+L
Sbjct: 438 IKALEKEIKELEKQL 452



 Score = 30.0 bits (68), Expect = 3.4
 Identities = 39/161 (24%), Positives = 70/161 (43%), Gaps = 16/161 (9%)

Query: 295 DEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREK 354
           DEE    I+QLE+   + +  +    +E E++L+  E     F+   E+   LL+ L E 
Sbjct: 277 DEEYQELIEQLEELIDKYESHIEKALEELESILD-TEKENSEFKLDVEELKALLEALEEI 335

Query: 355 DDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVL 414
            + N + + E++K +    +  E    L E +  +   I+A++  IR+  EK   L+   
Sbjct: 336 LEKNLQKLEEKLK-DPSTSIELESITDLIESINDI---IDAINELIREHNEKIDNLKKEK 391

Query: 415 TNAEKEL--HLRNQAMDL---------NKRKAIESAQSAAD 444
             A+K+L  HL  +  +             KAI S +    
Sbjct: 392 NKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIK 432


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 31.8 bits (72), Expect = 1.1
 Identities = 29/158 (18%), Positives = 61/158 (38%), Gaps = 12/158 (7%)

Query: 301 KIKQLEDQTHQLQKQVA----THKQEEEALLNEMEVTGQAFEDMQEQNSRL---LQQLRE 353
           ++ + + +  QL+ ++      HK+ +  L  +     + +E   ++N  L   L+ L E
Sbjct: 62  ELIRSKSKLIQLENELMQKELEHKRAQIELERKASTLAENYERELDRNLELEVRLKALEE 121

Query: 354 KD-DANFKLMTERIKSNQLH-KLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQ 411
            +  A  +      ++  L  KL  E      E+   L    E++      L E +   Q
Sbjct: 122 LEKKAENEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESISRIKNDLSEMQCRAQ 181

Query: 412 ---TVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLK 446
              T L   E EL    + ++  +++  E+ +    L 
Sbjct: 182 NADTELKLLESELEELREQLEECQKELAEAEKKLQSLT 219


>gnl|CDD|215715 pfam00097, zf-C3HC4, Zinc finger, C3HC4 type (RING finger).  The
           C3HC4 type zinc-finger (RING finger) is a cysteine-rich
           domain of 40 to 60 residues that coordinates two zinc
           ions, and has the consensus sequence:
           C-X2-C-X(9-39)-C-X(1-3)-H-X(2-3)-C-X2-C-X(4-48)-C-X2-C
           where X is any amino acid. Many proteins containing a
           RING finger play a key role in the ubiquitination
           pathway.
          Length = 40

 Score = 28.2 bits (63), Expect = 1.1
 Identities = 5/17 (29%), Positives = 8/17 (47%)

Query: 487 DCLRTRYETRQRKCPKC 503
            C+ +  E+    CP C
Sbjct: 24  KCILSWLESGNVTCPLC 40


>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
           This domain is found associated with the Pfam tRNA
           synthetase class II domain (pfam00587) and represents
           the N-terminal domain of seryl-tRNA synthetase.
          Length = 108

 Score = 29.5 bits (67), Expect = 1.2
 Identities = 17/78 (21%), Positives = 40/78 (51%), Gaps = 11/78 (14%)

Query: 347 LLQQLREKDDANFKLMTE----RIKSNQLHKL------AREEKDTLREQVQALSLQIEAM 396
            + +L E D+   +L  E    + + N+L K        +E+ + L  +V+ L  +++A+
Sbjct: 27  DVDELLELDEERRELQVELEELQAERNELSKEIGKAKKKKEDAEALIAEVKELKDELKAL 86

Query: 397 HTAIRKLEEK-ERFLQTV 413
              +R+LE + ++ L ++
Sbjct: 87  EAELRELEAELDKLLLSI 104


>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
          Length = 428

 Score = 31.2 bits (71), Expect = 1.3
 Identities = 22/74 (29%), Positives = 39/74 (52%), Gaps = 14/74 (18%)

Query: 341 QEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAI 400
           Q+Q + LL QL+++         E   S    KL RE ++TL      L+ QI+ ++ +I
Sbjct: 67  QQQRASLLAQLKKQ---------EEAISQASRKL-RETQNTL----NQLNKQIDELNASI 112

Query: 401 RKLEEKERFLQTVL 414
            KLE+++   + +L
Sbjct: 113 AKLEQQQAAQERLL 126


>gnl|CDD|223091 COG0012, COG0012, Predicted GTPase, probable translation factor
           [Translation, ribosomal structure and biogenesis].
          Length = 372

 Score = 31.0 bits (71), Expect = 1.4
 Identities = 35/158 (22%), Positives = 52/158 (32%), Gaps = 41/158 (25%)

Query: 269 IYGSLEFRKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLN 328
              SLE R      R  KR +  K  D+E               +++++   + EE L  
Sbjct: 137 DLESLEKR----WERLEKRAKAGKKLDKEL--------------KEELSLLGKLEEHLEE 178

Query: 329 EMEVTGQAFEDMQEQNSRLLQQLR--------------EKDDANFKLMTERIKSNQLHKL 374
                G       E++   L  L               E D AN     +R+K     +L
Sbjct: 179 GKPARGLDLSKWSEEDLEALASLNLLTAKPMLYVANVSEDDLANLNEYVKRLK-----EL 233

Query: 375 AREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQT 412
           A +E      +V  +S  IE     +   EEK  FL  
Sbjct: 234 AAKENA----EVVPVSAAIELELRELADAEEKGEFLIE 267


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 30.8 bits (70), Expect = 1.4
 Identities = 19/92 (20%), Positives = 41/92 (44%), Gaps = 9/92 (9%)

Query: 282 VRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQ 341
              +K    ++L + E  RK  Q + +  + +K+     QEE   L E + T       +
Sbjct: 146 TVQNKEATHERLKEREIRRKKIQAKARKRKEKKKEKELTQEE--RLAEAKET-------E 196

Query: 342 EQNSRLLQQLREKDDANFKLMTERIKSNQLHK 373
             N + L++  E+++   K   + +K  +L++
Sbjct: 197 RINLKSLERYEEQEEEKKKAKIQALKKRRLYE 228


>gnl|CDD|178657 PLN03110, PLN03110, Rab GTPase; Provisional.
          Length = 216

 Score = 30.7 bits (69), Expect = 1.4
 Identities = 25/84 (29%), Positives = 42/84 (50%), Gaps = 11/84 (13%)

Query: 335 QAFEDMQEQNSRLLQQLREKDDANFKLMTERIKS--NQLHKLAREEKDTLREQVQALSLQ 392
           Q F+++Q    R L++LR+  D+N  +M    KS  N L  +A E+   L E+ + LS  
Sbjct: 98  QTFDNVQ----RWLRELRDHADSNIVIMMAGNKSDLNHLRSVAEEDGQALAEK-EGLSF- 151

Query: 393 IEAMHTAIRKLEEKERFLQTVLTN 416
              + T+  +    E+  QT+L  
Sbjct: 152 ---LETSALEATNVEKAFQTILLE 172


>gnl|CDD|225638 COG3096, MukB, Uncharacterized protein involved in chromosome
            partitioning [Cell division and chromosome partitioning].
          Length = 1480

 Score = 31.4 bits (71), Expect = 1.4
 Identities = 26/132 (19%), Positives = 61/132 (46%), Gaps = 10/132 (7%)

Query: 290  RKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQ 349
            R++L   EA R   +  +Q  Q Q Q++ + Q    +L  ++ +    +++  +  + LQ
Sbjct: 990  RQRLEQAEAERTRAR--EQLRQHQAQLSQYNQ----VLASLKSSYDTKKELLNELQQELQ 1043

Query: 350  QLREKDDANFKLMTERIKSNQLHKL---AREEKDTLREQVQALSLQIEAMHTAIRKLEEK 406
             +  + D+       RI+ ++LH      R  ++ L +Q+     +++ +   +RKLE  
Sbjct: 1044 DIGVRADSG-AEERARIRRDELHAQLSTNRSRRNQLEKQLTFCEAEMDNLTRKLRKLERD 1102

Query: 407  ERFLQTVLTNAE 418
               ++  +  A+
Sbjct: 1103 YFEMREQVVTAK 1114


>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612.  This
           family includes several uncharacterized proteins from
           Caenorhabditis elegans.
          Length = 517

 Score = 31.2 bits (69), Expect = 1.4
 Identities = 30/123 (24%), Positives = 52/123 (42%), Gaps = 12/123 (9%)

Query: 261 SLSDSASNIYGSLEFRKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHK 320
           S++D     + SLE  +    V   K+ E+KK          KQ+       QK  A   
Sbjct: 46  SINDQRKEAFASLELTEQPQQVEKVKKSEKKKAQ--------KQIAKDHEAEQKVNAKKA 97

Query: 321 QEEEALLNEMEVTGQAFEDMQEQNSRL----LQQLREKDDANFKLMTERIKSNQLHKLAR 376
            E+EA   E E   +A ++ + +  +     +Q+ +EK +A+ K +    K  +  K  +
Sbjct: 98  AEKEARRAEAEAKKRAAQEEEHKQWKAEQERIQKEQEKKEADLKKLQAEKKKEKAVKAEK 157

Query: 377 EEK 379
            EK
Sbjct: 158 AEK 160


>gnl|CDD|184696 PRK14474, PRK14474, F0F1 ATP synthase subunit B; Provisional.
          Length = 250

 Score = 30.9 bits (70), Expect = 1.4
 Identities = 20/87 (22%), Positives = 33/87 (37%), Gaps = 9/87 (10%)

Query: 304 QLEDQTHQLQ--KQVATHKQEEEALLNEM--EVTGQAFEDMQEQNSRLLQQLREKDDANF 359
           + +D   + Q   Q A   ++++  L +       QA E   EQ   LL + RE      
Sbjct: 44  RWQDAEQRQQEAGQEAERYRQKQQSLEQQRASFMAQAQEAADEQRQHLLNEAREDVATAR 103

Query: 360 KLMTERIKSNQLHKLAREEKDTLREQV 386
               E     QL +  +E    L++Q 
Sbjct: 104 DEWLE-----QLEREKQEFFKALQQQT 125



 Score = 30.2 bits (68), Expect = 2.1
 Identities = 15/84 (17%), Positives = 36/84 (42%), Gaps = 9/84 (10%)

Query: 302 IKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKL 361
           I+ ++ +  ++  +    +Q ++    E E   Q  + +++Q +  + Q +E  D     
Sbjct: 31  IQVMKKRQQRIANRWQDAEQRQQEAGQEAERYRQKQQSLEQQRASFMAQAQEAADE---- 86

Query: 362 MTERIKSNQLHKLAREEKDTLREQ 385
                +   L   ARE+  T R++
Sbjct: 87  -----QRQHLLNEAREDVATARDE 105


>gnl|CDD|181787 PRK09343, PRK09343, prefoldin subunit beta; Provisional.
          Length = 121

 Score = 29.7 bits (67), Expect = 1.5
 Identities = 24/111 (21%), Positives = 48/111 (43%), Gaps = 19/111 (17%)

Query: 302 IKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKL 361
           + QL+    QL++ +   K + +  L E+               + L++L +  D     
Sbjct: 13  LAQLQQLQQQLER-LLQQKSQIDLELREIN--------------KALEELEKLPD---DT 54

Query: 362 MTERIKSNQLHKLAREE-KDTLREQVQALSLQIEAMHTAIRKLEEKERFLQ 411
              +I  N L K+ + + +  L+E+ + L L+   +    +KL EK + LQ
Sbjct: 55  PIYKIVGNLLVKVDKTKVEKELKERKELLELRSRTLEKQEKKLREKLKELQ 105


>gnl|CDD|163064 TIGR02894, DNA_bind_RsfA, transcription factor, RsfA family.  In a
           subset of endospore-forming members of the Firmcutes,
           members of this protein family are found, several to a
           genome. Two very strongly conserved sequences regions
           are separated by a highly variable linker region. Much
           of the linker region was excised from the seed alignment
           for this model. A characterized member is the
           prespore-specific transcription RsfA from Bacillus
           subtilis, previously called YwfN, which is controlled by
           sigma factor F and seems to fine-tune expression of some
           genes in the sigma-F regulon. A paralog in Bacillus
           subtilis is designated YlbO [Regulatory functions, DNA
           interactions, Cellular processes, Sporulation and
           germination].
          Length = 161

 Score = 30.1 bits (68), Expect = 1.5
 Identities = 13/65 (20%), Positives = 28/65 (43%)

Query: 287 RDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSR 346
           R E K+ A    ++ +                 ++E E L N+ E   +  E+++++  +
Sbjct: 70  RKELKREAGSLTLQDVISFLQNLKTTNPSDQALQKENERLKNQNESLQKRNEELEKELEK 129

Query: 347 LLQQL 351
           L Q+L
Sbjct: 130 LRQRL 134


>gnl|CDD|205039 pfam12718, Tropomyosin_1, Tropomyosin like.  This family is a set
           of eukaryotic tropomyosins. Within the yeast Tmp1 and
           Tmp2, biochemical and sequence analyses indicate that
           Tpm2p spans four actin monomers along a filament,
           whereas Tpmlp spans five. Despite its shorter length,
           Tpm2p can compete with Tpm1p for binding to F-actin.
           Over-expression of Tpm2p in vivo alters the axial
           budding of haploids to a bipolar pattern, and this can
           be partially suppressed by co-over-expression of Tpm1p.
           This suggests distinct functions for the two
           tropomyosins, and indicates that the ratio between them
           is important for correct morphogenesis. The family also
           contains higher eukaryote Tmp3 members.
          Length = 143

 Score = 29.9 bits (68), Expect = 1.5
 Identities = 15/67 (22%), Positives = 34/67 (50%)

Query: 292 KLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQL 351
           KL  E A  + ++LE++  +L+++    +QE  +L  + +   +  E ++EQ     ++L
Sbjct: 6   KLEAENAQERAEELEEKLKELEQENLEKEQEITSLQKKNQQLEEEVEKLEEQLKEAKEKL 65

Query: 352 REKDDAN 358
            E +   
Sbjct: 66  EESEKLA 72



 Score = 28.3 bits (64), Expect = 4.7
 Identities = 17/87 (19%), Positives = 38/87 (43%), Gaps = 8/87 (9%)

Query: 348 LQQLR-EKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEK 406
           +  L+ E ++A  +   E ++  +  K   +E     +++ +L  + + +   + KLEE+
Sbjct: 2   MNALKLEAENAQERA--EELE--EKLKELEQENLEKEQEITSLQKKNQQLEEEVEKLEEQ 57

Query: 407 ERFLQTVLTNAEKELHLRNQAMDLNKR 433
              L+      E+   L   A  L +R
Sbjct: 58  ---LKEAKEKLEESEKLATNAEALTRR 81


>gnl|CDD|215791 pfam00210, Ferritin, Ferritin-like domain.  This family contains
           ferritins and other ferritin-like proteins such as
           members of the DPS family and bacterioferritins.
          Length = 142

 Score = 30.0 bits (68), Expect = 1.6
 Identities = 16/67 (23%), Positives = 28/67 (41%), Gaps = 11/67 (16%)

Query: 353 EKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQT 412
           E    + K +TE +   +L +LA EE D               +   + + EE E FL+ 
Sbjct: 86  EAALEHEKEVTESL--RELIELAEEEGDYATADF---------LEWFLEEQEEHEWFLEA 134

Query: 413 VLTNAEK 419
           +L   ++
Sbjct: 135 LLEKLKR 141


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 30.3 bits (69), Expect = 1.6
 Identities = 21/74 (28%), Positives = 40/74 (54%), Gaps = 12/74 (16%)

Query: 290 RKKLADEEAM----RKIKQLEDQTHQLQKQVATHKQEEEAL----LNEMEVTGQAFED-- 339
           RK L  E+      ++IK+LE++  +L+K+VA  + + EA+      E ++  +   D  
Sbjct: 113 RKALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEI 172

Query: 340 --MQEQNSRLLQQL 351
             +++QN +L  QL
Sbjct: 173 AFLKKQNQQLKSQL 186


>gnl|CDD|238453 cd00890, Prefoldin, Prefoldin is a hexameric molecular chaperone
           complex, found in both eukaryotes and archaea, that
           binds and stabilizes newly synthesized polypeptides
           allowing them to fold correctly.  The complex contains
           two alpha and four beta subunits, the two subunits being
           evolutionarily related. In archaea, there is usually
           only one gene for each subunit while in eukaryotes there
           two or more paralogous genes encoding each subunit
           adding heterogeneity to the structure of the hexamer.
           The structure of the complex consists of a double beta
           barrel assembly with six protruding coiled-coils.
          Length = 129

 Score = 29.6 bits (67), Expect = 1.6
 Identities = 26/127 (20%), Positives = 54/127 (42%), Gaps = 11/127 (8%)

Query: 302 IKQLEDQTHQLQKQ---VATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDAN 358
           +++L  Q  QLQ+Q   +    Q+ EA L E E   +  E +++        +       
Sbjct: 1   LQELAAQLQQLQQQLEALQQQLQKLEAQLTEYEKAKETLETLKKAEEEKELLVPLGAGLF 60

Query: 359 FK---LMTER----IKSNQLHKLAREE-KDTLREQVQALSLQIEAMHTAIRKLEEKERFL 410
            K      ++    + +    + + EE  + L+++++ L  QIE +   + KL+++   L
Sbjct: 61  VKAEVKDDDKVLVDLGTGVYVEKSLEEAIEFLKKRLETLEKQIEKLEKQLEKLQDQITEL 120

Query: 411 QTVLTNA 417
           Q  L   
Sbjct: 121 QEELQQL 127


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 31.0 bits (70), Expect = 1.6
 Identities = 16/76 (21%), Positives = 38/76 (50%), Gaps = 5/76 (6%)

Query: 305 LEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTE 364
           ++    +L+   + ++Q+ +AL  E   T +A E +++Q    L +L E+   + +    
Sbjct: 432 IDSMQRELEMWRSEYRQQAQALQQERRATEEALEPLKDQ----LAEL-EQAIKDQRSKIS 486

Query: 365 RIKSNQLHKLAREEKD 380
            +++N L   A+ +K 
Sbjct: 487 NVRANILKNEAKIQKL 502


>gnl|CDD|224236 COG1317, FliH, Flagellar biosynthesis/type III secretory pathway
           protein [Cell motility and secretion / Intracellular
           trafficking and secretion].
          Length = 234

 Score = 30.4 bits (69), Expect = 1.7
 Identities = 36/188 (19%), Positives = 65/188 (34%), Gaps = 50/188 (26%)

Query: 272 SLEFRKFACVVRDSKRDERKKLADEEA-----MRKIKQLEDQTHQLQKQVATHKQEEEAL 326
            +EF+  A      K    K + +EE        K ++LE    +LQ+ +   +   E  
Sbjct: 17  VIEFKDLA--NAGPKVAPEKAVLEEEELEQALEAKEEELESAAQELQEGIE--EGAREGY 72

Query: 327 LNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQV 386
               +     +E+  E+     + L                  +L    + E + L+E V
Sbjct: 73  EEGFQ---LGYEEGFEEGQEEGRVLERL--------------AKLIAEFQAELEALKEVV 115

Query: 387 QALSLQIEAMHTAIRKLEEKERFLQTVLTNAEK----ELHLRNQAMDLNKRKAIESA-QS 441
           +                   ++ +Q VL  A K    EL L  +A+    R+A+E     
Sbjct: 116 E-------------------KQLVQLVLEIARKVIGKELELDPEALLAAVREALEEVPLF 156

Query: 442 AADLKLHL 449
           AA + L +
Sbjct: 157 AAAITLRV 164


>gnl|CDD|219186 pfam06810, Phage_GP20, Phage minor structural protein GP20.  This
           family consists of several phage minor structural
           protein GP20 sequences of around 180 residues in length.
           The function of this family is unknown.
          Length = 156

 Score = 30.0 bits (68), Expect = 1.7
 Identities = 21/82 (25%), Positives = 34/82 (41%), Gaps = 19/82 (23%)

Query: 375 AREEKDTLREQVQALSLQI----------EAMHTAIRKLEEKERFLQTVLTNAEKELHLR 424
           A EE+D+L++Q++    QI          E +   I KL+++    +T     E +L   
Sbjct: 25  ANEERDSLKKQLKNRDKQIEDLKKKVKDNEELQKKIEKLKQQ---NKTAKEEYEAKLA-- 79

Query: 425 NQAMDLNKRKAIESAQSAADLK 446
               D     AIE A   A  +
Sbjct: 80  ----DTKLNNAIELALKKAKAR 97


>gnl|CDD|238105 cd00179, SynN, Syntaxin N-terminus domain; syntaxins are nervous
           system-specific proteins implicated in the docking of
           synaptic vesicles with the presynaptic plasma membrane;
           they are a family of receptors for intracellular
           transport vesicles; each target membrane may be
           identified by a specific member of the syntaxin family;
           syntaxins contain a moderately well conserved
           amino-terminal domain, called Habc, whose structure is
           an antiparallel three-helix bundle; a linker of about 30
           amino acids connects this to the carboxy-terminal
           region, designated H3 (t_SNARE), of the syntaxin
           cytoplasmic domain; the highly conserved H3 region forms
           a single, long alpha-helix when it is part of the core
           SNARE complex and anchors the protein on the cytoplasmic
           surface of cellular membranes; H3 is not included in
           defining this domain.
          Length = 151

 Score = 29.9 bits (68), Expect = 1.7
 Identities = 17/90 (18%), Positives = 39/90 (43%), Gaps = 8/90 (8%)

Query: 302 IKQLEDQTHQLQKQV----ATHKQEEEALLNEMEVTGQAFEDMQEQNSRLL---QQLREK 354
           I ++ +   +LQK           + E L  E+E   Q  + + ++    L   ++  E+
Sbjct: 15  IDKISEDVEELQKLHSQLLTAPDADPE-LKQELESLVQEIKKLAKEIKGKLKELEESNEQ 73

Query: 355 DDANFKLMTERIKSNQLHKLAREEKDTLRE 384
           ++A      +RI+  Q   L+++  + + E
Sbjct: 74  NEALNGSSVDRIRKTQHSGLSKKFVEVMTE 103


>gnl|CDD|217894 pfam04102, SlyX, SlyX.  The SlyX protein has no known function. It
           is short less than 80 amino acids and is found close to
           the slyD gene. The SlyX protein has a conserved PPH(Y/W)
           motif at its C-terminus. The protein may be a
           coiled-coil structure.
          Length = 68

 Score = 28.3 bits (64), Expect = 1.8
 Identities = 15/59 (25%), Positives = 33/59 (55%), Gaps = 8/59 (13%)

Query: 304 QLEDQTHQLQKQVATHKQEEEAL--LNEMEVTGQAFE--DMQEQNSRLLQQLREKDDAN 358
            LE++  +L+ ++A    +E+ +  LN+  V  Q  +   +Q Q   L+++L+E + +N
Sbjct: 1   SLEERIEELEIRLA---FQEDTIEELNQ-VVAEQQRQIDRLQRQLRLLVERLKEVESSN 55


>gnl|CDD|218545 pfam05300, DUF737, Protein of unknown function (DUF737).  This
           family consists of several uncharacterized mammalian
           proteins of unknown function.
          Length = 187

 Score = 30.1 bits (68), Expect = 1.8
 Identities = 28/124 (22%), Positives = 51/124 (41%), Gaps = 23/124 (18%)

Query: 288 DERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRL 347
            E ++L   E    ++    +  +         + +E LL   E   Q    +QE+ +R+
Sbjct: 47  SELRRLIAGELKGALED-AKRPSEETAGGLQSSEVKEDLLKRYE---QEQAIVQEELARI 102

Query: 348 LQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKE 407
            +  RE++ A      E++    L    RE+    +E+ +A  L         R+LEEKE
Sbjct: 103 AK--REREAAE-----EQLSRAVL----REKASAEQERQKAKHLA--------RQLEEKE 143

Query: 408 RFLQ 411
             L+
Sbjct: 144 AELK 147


>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
           synthetase/uroporphyrin-III C-methyltransferase;
           Reviewed.
          Length = 656

 Score = 30.8 bits (70), Expect = 1.8
 Identities = 14/50 (28%), Positives = 20/50 (40%), Gaps = 4/50 (8%)

Query: 297 EAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSR 346
           E   K +Q +   HQL  Q A      +  L + +   QA E   +  SR
Sbjct: 368 ELRVKTEQAQASVHQLDSQFAQL----DGKLADAQSAQQALEQQYQDLSR 413


>gnl|CDD|226419 COG3904, COG3904, Predicted periplasmic protein [Function unknown].
          Length = 245

 Score = 30.3 bits (68), Expect = 1.8
 Identities = 13/47 (27%), Positives = 20/47 (42%)

Query: 325 ALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQL 371
           A L EM +     + M       ++QL  K+    KL+T R   + L
Sbjct: 197 AYLREMGIGPGLLQMMLATPPSDIRQLGLKEMTAMKLVTSRDAVDLL 243


>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
           division and chromosome partitioning].
          Length = 570

 Score = 30.8 bits (70), Expect = 1.9
 Identities = 23/79 (29%), Positives = 39/79 (49%), Gaps = 5/79 (6%)

Query: 301 KIKQLEDQTHQLQKQVATHKQEEEAL---LNEMEVTGQAFEDMQEQNSRLLQQLREKDDA 357
           ++K+LE    ++ + +   +     L   L E+E      ED QE+    L  LR KD+ 
Sbjct: 355 ELKELESVLDEILENIEAQEVAYSELQDNLEEIEKALTDIEDEQEKVQEHLTSLR-KDEL 413

Query: 358 NFKLMTERIKSNQLHKLAR 376
             +   ER+KS +LH++ R
Sbjct: 414 EARENLERLKS-KLHEIKR 431



 Score = 29.7 bits (67), Expect = 4.7
 Identities = 28/132 (21%), Positives = 55/132 (41%), Gaps = 12/132 (9%)

Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKD 355
           EEA   +  L     ++   +A  + E    L +++     + DM+E+   L        
Sbjct: 200 EEAEEHMIALRSIMERIPSLLAELQTELPGQLQDLK---AGYRDMKEEGYHLEHV---NI 253

Query: 356 DANFKLMTERIKSN--QLHKL----AREEKDTLREQVQALSLQIEAMHTAIRKLEEKERF 409
           D+  + + E++  N   L +L    A EE   ++E++++L   +E    A   +EE    
Sbjct: 254 DSRLERLKEQLVENSELLTQLELDEAEEELGLIQEKIESLYDLLEREVEAKNVVEENLPI 313

Query: 410 LQTVLTNAEKEL 421
           L   L  A++  
Sbjct: 314 LPDYLEKAKENN 325


>gnl|CDD|237953 PRK15378, PRK15378, inositol phosphate phosphatase SopB;
           Provisional.
          Length = 564

 Score = 30.6 bits (69), Expect = 2.0
 Identities = 26/116 (22%), Positives = 42/116 (36%), Gaps = 9/116 (7%)

Query: 333 TGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQ 392
           T +AF+ +Q+     +Q L  +  A  K    R +   L    R +  T    +Q     
Sbjct: 14  TQEAFKSLQKTLYNGMQILSGQGKAPAKAPDARPEIIVL----RNKGATWGNYLQHQKTS 69

Query: 393 IEAMHTAIRKLEEKERFLQTVLTNAEKE----LHLRNQAMDLNKRKAIESAQSAAD 444
             ++H       +      T L  A K+      + NQ M+L K KA   A    +
Sbjct: 70  NHSLHNLYNLQRDLLTVAATALKLAGKQDPVLKAMANQ-MELAKVKADRPATKQEE 124


>gnl|CDD|223181 COG0103, RpsI, Ribosomal protein S9 [Translation, ribosomal
           structure and biogenesis].
          Length = 130

 Score = 29.4 bits (67), Expect = 2.1
 Identities = 8/19 (42%), Positives = 11/19 (57%)

Query: 274 EFRKFACVVRDSKRDERKK 292
             +K   + RD +R ERKK
Sbjct: 97  ALKKAGLLTRDPRRVERKK 115


>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 548

 Score = 30.6 bits (69), Expect = 2.2
 Identities = 21/120 (17%), Positives = 39/120 (32%), Gaps = 13/120 (10%)

Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
             +  + +  +         E +  +  +Q A     E A   ++E+  +A     E  +
Sbjct: 358 AANIAEAIGAQAEAAVETARETEEAERAEQAALVAAAEAAEQEQVEIAVRAEAAKAEAEA 417

Query: 346 R------LLQQLREKDDAN---FKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAM 396
           +        + +REK  A     + + E I+       A   K      VQAL    E  
Sbjct: 418 QAAEIKAEAEAIREKGKAEAEAKRALAEAIQVLGDAAAAELFKAL----VQALPEVAEEA 473


>gnl|CDD|221919 pfam13094, CENP-Q, CENP-A-nucleosome distal (CAD) centromere
           subunit.  CENP-Q is one of the components that assembles
           onto the CENP-A-nucleosome distal (CAD) centromere. The
           centromere, which is the basic element of chromosome
           inheritance, is epigenetically determined in mammals.
           CENP-A, the centromere-specific histone H3 variant,
           assembles an array of nucleosomes and it is this that
           seems to be the prime candidate for specifying
           centromere identity. CENP-A nucleosomes directly recruit
           a proximal CENP-A nucleosome associated complex (NAC)
           comprised of CENP-M, CENP-N and CENP-T, CENP-U(50),
           CENP-C and CENP-H. Assembly of the CENP-A NAC at
           centromeres is dependent on CENP-M, CENP-N and CENP-T.
           Additionally, there are seven other subunits which make
           up the CENP-A-nucleosome distal (CAD) centromere,
           CENP-K, CENP-L, CENP-O, CENP-P, CENP-Q, CENP-R and
           CENP-S, also assembling on the CENP-A NAC. Fta7 is the
           equivalent component of the fission yeast Sim4 complex.
          Length = 159

 Score = 29.6 bits (67), Expect = 2.5
 Identities = 25/125 (20%), Positives = 50/125 (40%), Gaps = 13/125 (10%)

Query: 292 KLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEAL------LNEMEVTGQAFED-MQEQN 344
           +L   +   + +QL  +  QL+      ++EE  L      L E+E   +A E   +E++
Sbjct: 24  QLLRRKRALE-RQLAAELKQLELLEEEIRREEAELEKDLEYLQELEKNAKALEREREEES 82

Query: 345 SRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQI----EAMHTAI 400
             L   LR  +    +   E ++ + L    R     L  ++  L  Q+    E++   +
Sbjct: 83  KNLHPVLRLLESEVLEEN-ELLQDSLLELSERNFSPNLDPELLDLLKQLNKHLESLQGNL 141

Query: 401 RKLEE 405
            +L  
Sbjct: 142 EQLAG 146


>gnl|CDD|227463 COG5134, COG5134, Uncharacterized conserved protein [Function
           unknown].
          Length = 272

 Score = 30.0 bits (67), Expect = 2.5
 Identities = 22/130 (16%), Positives = 41/130 (31%), Gaps = 12/130 (9%)

Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAF-EDMQ 341
           +D   D  K    E+          +    Q++   H     + +N ++   +    D  
Sbjct: 113 QDINEDPAKAENVEKVPESDAIEALEKQLTQQKSEKHN---SSAINFIDELNKRLWSDPF 169

Query: 342 EQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQI-----EAM 396
             + RL +Q RE+         +  K   L   A  + D L          +        
Sbjct: 170 VSSQRLRKQFRERKKIE---KKQEAKDLSLKNRAALDIDILPSSSDKDKALLLLDNELGK 226

Query: 397 HTAIRKLEEK 406
           +  IRKL+ +
Sbjct: 227 NKFIRKLDYR 236


>gnl|CDD|221380 pfam12017, Tnp_P_element, Transposase protein.  Protein in this
           family are transposases found in insects. This region is
           about 230 amino acids in length and is found associated
           with pfam05485.
          Length = 236

 Score = 29.8 bits (67), Expect = 2.5
 Identities = 14/47 (29%), Positives = 25/47 (53%), Gaps = 5/47 (10%)

Query: 280 CVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEAL 326
            V+  +   E K L      +KI+ LE + H L++Q+   +Q E++L
Sbjct: 10  KVINHTTLMENKSLR-----QKIRALEKEMHSLRQQLEESQQLEKSL 51


>gnl|CDD|184695 PRK14471, PRK14471, F0F1 ATP synthase subunit B; Provisional.
          Length = 164

 Score = 29.4 bits (66), Expect = 2.5
 Identities = 34/118 (28%), Positives = 61/118 (51%), Gaps = 16/118 (13%)

Query: 320 KQEEEALLNEMEVTGQAFEDMQE---QNSRLLQQLREKDDANFK---LMTERIKSNQLHK 373
           K+ E+++ N +    +A ++MQ     N RLL++ R + DA  K    + E++ ++   +
Sbjct: 38  KEREDSIKNALASAEEARKEMQNLQADNERLLKEARAERDAILKEAREIKEKMIADAKEE 97

Query: 374 LAREEKDTLREQVQALSLQIE---AMH-----TAIRKLEEKERFLQTVLTNAEKELHL 423
            A+ E D + EQ +A S++ E   AM       A   +E  E+ L+  L+N EK+  L
Sbjct: 98  -AQVEGDKMIEQAKA-SIESEKNAAMAEIKNQVANLSVEIAEKVLRKELSNKEKQHKL 153


>gnl|CDD|214448 MTH00173, ATP6, ATP synthase F0 subunit 6; Provisional.
          Length = 231

 Score = 29.8 bits (68), Expect = 2.5
 Identities = 29/133 (21%), Positives = 49/133 (36%), Gaps = 22/133 (16%)

Query: 133 FSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVF 192
           FS F   +S+       S   +  S + ++   S    SSS +     F +F  + S+  
Sbjct: 6   FSSFDDHNSSFSS---LSFLMWLLSLMSLFFFSSSVWVSSSNLSSV--FKLFVLTVSS-Q 59

Query: 193 VYHCFSIFR--FSSSTVFVYHCFSI---------FRFSSSTIFVYHCFDIFSLTSTLFVS 241
           V     +    FS     ++  F I         F FS ++   +     FSL   L++S
Sbjct: 60  VTRSSGLNLGGFSLLLSSLF-LFLISLNLSGLLPFVFSVTSHLAF----TFSLALPLWLS 114

Query: 242 ICTSAFQMSFYKS 254
           +  S    +  KS
Sbjct: 115 LILSGLFYNPSKS 127


>gnl|CDD|217754 pfam03836, RasGAP_C, RasGAP C-terminus. 
          Length = 139

 Score = 29.1 bits (66), Expect = 2.5
 Identities = 13/56 (23%), Positives = 29/56 (51%), Gaps = 5/56 (8%)

Query: 379 KDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRK 434
           K   RE+ +    +++ +   ++KL +K ++L++ L   +   +L N    L K+K
Sbjct: 42  KRRYRERRKK---ELKKLEQTLKKLRDKNKYLESQLDYYKS--YLDNVLSTLQKKK 92


>gnl|CDD|203394 pfam06133, DUF964, Protein of unknown function (DUF964).  This
           family consists of several relatively short bacterial
           and archaeal hypothetical sequences. The function of
           this family is unknown.
          Length = 108

 Score = 28.6 bits (65), Expect = 2.5
 Identities = 15/68 (22%), Positives = 31/68 (45%), Gaps = 11/68 (16%)

Query: 294 ADEEAMRKIKQLEDQTHQLQ----------KQVATHKQEEEALLNEMEVTGQAFEDMQEQ 343
           ADEEA + I + +    ++Q          K+V    QE +  ++   +    F++ ++ 
Sbjct: 31  ADEEAQKLIDEFQKLQEEIQEKQMFGKEIPKEVQQKIQELKREIDLNPLV-AEFKEAEQA 89

Query: 344 NSRLLQQL 351
              LLQ++
Sbjct: 90  LQDLLQEV 97


>gnl|CDD|213755 TIGR02977, phageshock_pspA, phage shock protein A.  Members of this
           family are the phage shock protein PspA, from the phage
           shock operon. This is a narrower family than the set of
           PspA and its homologs, sometimes several in a genome, as
           described by pfam04012. PspA appears to maintain the
           protonmotive force under stress conditions that include
           overexpression of certain phage secretins, heat shock,
           ethanol, and protein export defects [Cellular processes,
           Adaptations to atypical conditions].
          Length = 219

 Score = 29.9 bits (68), Expect = 2.5
 Identities = 29/96 (30%), Positives = 46/96 (47%), Gaps = 13/96 (13%)

Query: 339 DMQEQNSRLLQQLRE-KDDANFKLMTERIKSNQLHKLARE---EKDTLREQVQALSLQIE 394
           +++ + SRL  Q+ + +D A   L   R        LAR    EK   +E  +AL  ++ 
Sbjct: 56  ELERRVSRLEAQVADWQDKAELALSKGR------EDLARAALIEKQKAQELAEALERELA 109

Query: 395 AMHTAIRKLEEKERFLQTVLTNA---EKELHLRNQA 427
           A+   + KL+E    LQ  L  A   +K L +R+QA
Sbjct: 110 AVEETLAKLQEDIAKLQAKLAEARARQKALAIRHQA 145


>gnl|CDD|224011 COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases
           [Cell envelope biogenesis, outer membrane / Carbohydrate
           transport and metabolism].
          Length = 588

 Score = 30.3 bits (69), Expect = 2.6
 Identities = 17/95 (17%), Positives = 32/95 (33%), Gaps = 22/95 (23%)

Query: 149 FSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVF 208
            S   +S + +     F +F        +Y   +++R S S +F      I +   ++  
Sbjct: 24  LSASLWSGAQLLTICVFILFG-------LYR--AVWRASVSDLF-----IILKAVLTSAL 69

Query: 209 VYHCFSIFR--FSSSTIFVYHCFDIFSLTSTLFVS 241
           V   F   R     S + VY       L   + + 
Sbjct: 70  VSLLFFSTRTDLPRSVVLVY------WLLLFVLIG 98



 Score = 29.9 bits (68), Expect = 3.5
 Identities = 17/80 (21%), Positives = 29/80 (36%), Gaps = 16/80 (20%)

Query: 133 FSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVF 192
            S   +S + +     F +F        +Y   +++R S S +F      I +   ++  
Sbjct: 24  LSASLWSGAQLLTICVFILFG-------LYR--AVWRASVSDLF-----IILKAVLTSAL 69

Query: 193 VYHCFSIFR--FSSSTVFVY 210
           V   F   R     S V VY
Sbjct: 70  VSLLFFSTRTDLPRSVVLVY 89


>gnl|CDD|132534 TIGR03495, phage_LysB, phage lysis regulatory protein, LysB family.
            Members of this protein family are phage lysis
           regulatory protein, including the well-studied protein
           LysB (lysis protein B) of Enterobacteria phage P2. For
           members of this family, genes are found in phage or in
           prophage regions of bacterial genomes, typically near a
           phage lysozyme or phage holin.
          Length = 135

 Score = 29.3 bits (66), Expect = 2.6
 Identities = 10/52 (19%), Positives = 21/52 (40%)

Query: 375 AREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQ 426
           A       + ++ + + Q+  +    ++ EE +  L+  L  A   L  R Q
Sbjct: 31  ANRVLKAQQAELASKANQLIVLLALAKRNEEAQAQLRQQLAQARALLAQREQ 82


>gnl|CDD|225616 COG3074, COG3074, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 79

 Score = 28.0 bits (62), Expect = 2.6
 Identities = 11/50 (22%), Positives = 26/50 (52%)

Query: 301 KIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQ 350
           K++Q  D    LQ ++   K++  +L  E++      E ++ +N +L ++
Sbjct: 12  KVQQAIDTITLLQMEIEELKEKNNSLSQEVQNAQHQREALERENEQLKEE 61


>gnl|CDD|225468 COG2916, Hns, DNA-binding protein H-NS [General function prediction
           only].
          Length = 128

 Score = 29.0 bits (65), Expect = 2.8
 Identities = 19/77 (24%), Positives = 35/77 (45%), Gaps = 14/77 (18%)

Query: 301 KIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRL--LQQLREKD--D 356
            ++ LE +  + ++QV   +QEEEA          A  +++E+  +   +++L  KD   
Sbjct: 16  YLELLE-EMLEKEEQVVQERQEEEA---------AAIAEIEERQEKYGTIRELLIKDGIT 65

Query: 357 ANFKLMTERIKSNQLHK 373
               L TE  K  +  K
Sbjct: 66  PEELLNTEVTKPRRKAK 82


>gnl|CDD|203446 pfam06428, Sec2p, GDP/GTP exchange factor Sec2p.  In Saccharomyces
           cerevisiae, Sec2p is a GDP/GTP exchange factor for
           Sec4p, which is required for vesicular transport at the
           post-Golgi stage of yeast secretion.
          Length = 90

 Score = 28.1 bits (63), Expect = 3.0
 Identities = 22/78 (28%), Positives = 40/78 (51%), Gaps = 19/78 (24%)

Query: 283 RDSKRDERKKLADEEAMRKIKQ-LEDQTHQL----QKQVATHKQEEEALLNEMEVTGQAF 337
            D   +++K+L  E+  +K++Q +ED T  L     K VA  ++E EA+           
Sbjct: 1   ADLAEEKKKRLRAEKEKKKLEQEVEDLTASLFDEANKMVADARREREAV----------- 49

Query: 338 EDMQEQNSRLLQQLREKD 355
              + +N +L +QL+EK+
Sbjct: 50  ---EIKNEKLEEQLKEKE 64


>gnl|CDD|222469 pfam13949, ALIX_LYPXL_bnd, ALIX V-shaped domain binding to HIV.
           The binding of the LYPxL motif of late HIV p6Gag and
           EIAV p9Gag to this domain is necessary for viral
           budding.This domain is generally central between an
           N-terminal Bro1 domain, pfam03097 and a C-terminal
           proline-rich domain. The retroviruses thus used this
           domain to hijack the ESCRT system of the cell.
          Length = 292

 Score = 30.0 bits (68), Expect = 3.0
 Identities = 36/134 (26%), Positives = 62/134 (46%), Gaps = 17/134 (12%)

Query: 290 RKKLADEEAMRKIKQLEDQTHQLQKQV----ATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
           R KL  EE    ++ L     +L+  +     +   E    ++ +        +++ +  
Sbjct: 104 RSKL--EEHEENLELLSGPEEELEALLPSSSPSKTPEVSEQISRLRELLNKLNELKAERE 161

Query: 346 RLLQQLREK--DDANFKLMTE--RIKSNQLHKLAREEK---DTLREQV-QALSLQIEAMH 397
           +LL++L+EK  DD   KL+    ++ S+   +L  EE    D L+E+V Q LS Q E + 
Sbjct: 162 KLLEELKEKAQDDDISKLLIALNKLGSSNEEQLFEEELKKFDPLQERVEQNLSKQEELL- 220

Query: 398 TAIRKLEEKERFLQ 411
             I+  E  E FLQ
Sbjct: 221 KEIQ--EANEEFLQ 232


>gnl|CDD|224482 COG1566, EmrA, Multidrug resistance efflux pump [Defense
           mechanisms].
          Length = 352

 Score = 30.0 bits (68), Expect = 3.0
 Identities = 25/126 (19%), Positives = 46/126 (36%), Gaps = 13/126 (10%)

Query: 288 DERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRL 347
               + A E+A   +   E Q   L+ Q+A+ +        E +   QA  +++ +   L
Sbjct: 86  PRDYRAALEQAEAALAAAEAQLRNLRAQLASAQALIAQ--AEAQDLDQAQNELERRA-EL 142

Query: 348 LQ------QLREKDDANFKLM---TERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHT 398
            Q      +  ++  A  +         ++ Q   LA  E +    Q Q  S +  A+  
Sbjct: 143 AQRGVVSREELDRARAALQAAEAALAAAQAAQKQNLALLESEVSGAQAQVASAE-AALDQ 201

Query: 399 AIRKLE 404
           A   LE
Sbjct: 202 AKLDLE 207


>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
           binding.  This is the C-terminal conserved coiled coil
           region of a family of TATA element modulatory factor 1
           proteins conserved in eukaryotes. The proteins bind to
           the TATA element of some RNA polymerase II promoters and
           repress their activity. by competing with the binding of
           TATA binding protein. TMF1_TATA_bd is the most conserved
           part of the TMFs. TMFs are evolutionarily conserved
           golgins that bind Rab6, a ubiquitous ras-like
           GTP-binding Golgi protein, and contribute to Golgi
           organisation in animal and plant cells. The Rab6-binding
           domain appears to be the same region as this C-terminal
           family.
          Length = 121

 Score = 28.7 bits (65), Expect = 3.0
 Identities = 14/60 (23%), Positives = 29/60 (48%), Gaps = 10/60 (16%)

Query: 340 MQEQNSRLLQQLREKDDAN---FKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAM 396
           ++++ +RL     E+D+A     KL  E  +  +L K    E + L ++++ L  + E  
Sbjct: 36  LKDELARLE---AERDEARQEIVKLTEENEELRELKK----EIEELEKELEDLEQRYETT 88


>gnl|CDD|179749 PRK04147, PRK04147, N-acetylneuraminate lyase; Provisional.
          Length = 293

 Score = 30.0 bits (68), Expect = 3.0
 Identities = 14/44 (31%), Positives = 24/44 (54%)

Query: 403 LEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLK 446
            EEK++ L+ V   A+ ++ L  Q   +N  +A E A+ A +L 
Sbjct: 56  TEEKKQVLEIVAEEAKGKVKLIAQVGSVNTAEAQELAKYATELG 99


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 30.2 bits (68), Expect = 3.1
 Identities = 31/121 (25%), Positives = 57/121 (47%), Gaps = 22/121 (18%)

Query: 240 VSICTSAFQMSFYKSRWFDFRSLSDSASN------IYGSLEFRKFACVVRDSKRDERKKL 293
           + I   A++M F    +F+ +++ D+  +      I G ++   F   + + KR E +KL
Sbjct: 205 LHIPKEAYKMDFV---FFNGQNVYDNNDHKDFCVEIEGGMDEHSFEDFLLEEKRRELEKL 261

Query: 294 ADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLRE 353
           A EEA R+ +Q E+Q  + +++ A      +A   + EV         E+    LQ L +
Sbjct: 262 AKEEAERE-RQAEEQRRREEEKAAMEADRAQA---KAEV---------EKRREKLQNLLK 308

Query: 354 K 354
           K
Sbjct: 309 K 309


>gnl|CDD|221352 pfam11975, Glyco_hydro_4C, Family 4 glycosyl hydrolase C-terminal
           domain. 
          Length = 231

 Score = 29.7 bits (67), Expect = 3.1
 Identities = 9/58 (15%), Positives = 20/58 (34%), Gaps = 6/58 (10%)

Query: 403 LEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADLKLHLVIIYTNGPARL 460
           L++ E+ +   L    KE  L  +   ++ + +             +  IY +  A  
Sbjct: 102 LDKVEKRIFEQLEKWYKEPELIVKPKKIDDKGSRYYGD------AAIAAIYNDKEAVF 153


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 140

 Score = 29.1 bits (66), Expect = 3.2
 Identities = 17/78 (21%), Positives = 36/78 (46%), Gaps = 9/78 (11%)

Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVAT----HKQEEEALLNEMEVTGQAF---- 337
           + ++  K    E  +  K+L+    +LQK  AT     ++++E  L +     Q      
Sbjct: 22  QLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQRKQQKL 81

Query: 338 -EDMQEQNSRLLQQLREK 354
            +D+Q++    LQ++ +K
Sbjct: 82  QQDLQKRQQEELQKILDK 99


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 28.8 bits (65), Expect = 3.2
 Identities = 13/76 (17%), Positives = 32/76 (42%), Gaps = 2/76 (2%)

Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQE 342
            + KR+E+++  +E+   + ++ E +  +   Q    ++  E L          + D ++
Sbjct: 14  IEEKREEKEREKEEK--ERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARYADDED 71

Query: 343 QNSRLLQQLREKDDAN 358
            +  L +Q R  D   
Sbjct: 72  YDEELKEQERWDDPMA 87


>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 239

 Score = 29.6 bits (67), Expect = 3.2
 Identities = 21/97 (21%), Positives = 37/97 (38%), Gaps = 20/97 (20%)

Query: 346 RLLQQLREK------DDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQ----IEA 395
           + L+ L+EK      D+  F +++ +           E++   +EQV+ L  Q    +  
Sbjct: 37  KTLKALKEKAAERNPDEFYFGMISSKTDDGVHILKREEKEVLSKEQVKLLKTQDLNYVRT 96

Query: 396 MHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNK 432
              A  K  EK           ++ELHL +      K
Sbjct: 97  KRQAEAKKIEKL----------KEELHLTDSGSAGPK 123


>gnl|CDD|184282 PRK13730, PRK13730, conjugal transfer pilus assembly protein TrbC;
           Provisional.
          Length = 212

 Score = 29.5 bits (66), Expect = 3.3
 Identities = 16/48 (33%), Positives = 22/48 (45%), Gaps = 1/48 (2%)

Query: 342 EQNSRLLQQLREKDDANFKLMTER-IKSNQLHKLAREEKDTLREQVQA 388
           +Q   L +QLREK D   K   E+ +  N L +      D L  + QA
Sbjct: 35  KQQENLSRQLREKPDHQLKAWAEKQVLENPLQRSDNHFLDELVRKQQA 82


>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
           cryoelectron microscopy indicates that VP5 is a trimer
           implying that there are 360 copies of VP5 per virion.
          Length = 507

 Score = 30.0 bits (68), Expect = 3.4
 Identities = 26/124 (20%), Positives = 59/124 (47%), Gaps = 6/124 (4%)

Query: 276 RKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQ 335
           RK   + R+ K +E ++  +++ + K  +  ++ ++  K  A  ++EEE    +ME+  +
Sbjct: 91  RKLKELEREQKEEEVREKHNKKIIEKFGEDLEEVYKFMKGEAKVEEEEE---KQMEILEK 147

Query: 336 AFEDMQEQNSRLLQQLREKDDANFKLMTERIK-SNQLHKLAREEKDTLREQVQALS--LQ 392
           A +   +      + L+    A  K   ER +   ++ +  R++ D L+  ++     LQ
Sbjct: 148 ALKSYLKIVKEENKSLQRLAKALQKESEERTQDETKMIEEYRDKIDALKNAIEVEKEGLQ 207

Query: 393 IEAM 396
            EA+
Sbjct: 208 EEAI 211


>gnl|CDD|193185 pfam12709, Kinetocho_Slk19, Central kinetochore-associated.  This
           is a family of proteins integrally involved in the
           central kinetochore. Slk19 is a yeast member and it may
           play an important role in the timing of nuclear
           migration. It may also participate, directly or
           indirectly, in the maintenance of centromeric tensile
           strength during mitotic stagnation, for instance during
           activation of checkpoint controls, when cells need to
           preserve nuclear integrity until cell cycle progression
           can be resumed.
          Length = 87

 Score = 28.1 bits (63), Expect = 3.4
 Identities = 14/41 (34%), Positives = 26/41 (63%), Gaps = 2/41 (4%)

Query: 291 KKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEME 331
           KKL  +E   + K L+ +  QL+KQ+ T ++E++ LL  ++
Sbjct: 49  KKL--DELELENKALKQEIEQLKKQLDTERKEKQELLKLLD 87


>gnl|CDD|221270 pfam11855, DUF3375, Protein of unknown function (DUF3375).  This
           family of proteins are functionally uncharacterized.
           This protein is found in bacteria. Proteins in this
           family are typically between 479 to 499 amino acids in
           length.
          Length = 471

 Score = 30.0 bits (68), Expect = 3.7
 Identities = 27/156 (17%), Positives = 58/156 (37%), Gaps = 24/156 (15%)

Query: 259 FRSLSDSASNIYGSLEFRKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVAT 318
           FR + D A  +    +FR+     R   R  R+++ D +  R   ++ ++          
Sbjct: 177 FRQILDLARELPA--DFRRVEDNFRQLDRQLRERIIDWDGSRG--EVLEEIFD------- 225

Query: 319 HKQEEEALLNEMEVTGQAFEDMQEQ-NSRLLQQLREKDDANFKLMTERIKSNQLHKLARE 377
                +A+ +  +  G++F    +       ++  E D+     +   +       L  E
Sbjct: 226 ---GYDAIADSDQ--GRSFRAFWDFLLDP--ERQEELDEL----LRAVLARPPAQGLDPE 274

Query: 378 EKDTLREQVQALSLQIEAMHTAIRKLEEK-ERFLQT 412
            ++ LR   + L    E +    R+L E   RF+ +
Sbjct: 275 LREFLRRLHRDLLEAGEEVQRTRRRLSESLRRFVDS 310


>gnl|CDD|211667 TIGR01530, nadN, NAD pyrophosphatase/5'-nucleotidase NadN.  This
           model describes NadN of Haemophilus influenzae and a
           small number of close homologs in pathogenic,
           Gram-negative bacteria. NadN is a periplasmic enzyme
           that cleaves NAD (nicotinamide adenine dinucleotide) to
           NMN (nicotinamide mononucleotide) and AMP. The NMN must
           be converted by a 5'-nucleotidase to nicotinamide
           riboside for import. NadN belongs a large family of
           5'-nucleotidases and has NMN 5'-nucleotidase activity
           for NMN, AMP, etc [Transport and binding proteins,
           Other, Biosynthesis of cofactors, prosthetic groups, and
           carriers, Pyridine nucleotides].
          Length = 545

 Score = 29.9 bits (67), Expect = 3.7
 Identities = 16/51 (31%), Positives = 24/51 (47%), Gaps = 2/51 (3%)

Query: 288 DERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEM--EVTGQA 336
           DERKK  D     K   L+D   +    +  +K E++ L  E+   +TG A
Sbjct: 303 DERKKALDTLKSMKSISLDDHDAKTDSLIEKYKSEKDRLAQEIVGVITGSA 353


>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
           autophagy-related subunit 14.  The Atg14 or Apg14
           proteins are hydrophilic proteins with a predicted
           molecular mass of 40.5 kDa, and have a coiled-coil motif
           at the N terminus region. Yeast cells with mutant Atg14
           are defective not only in autophagy but also in sorting
           of carboxypeptidase Y (CPY), a vacuolar-soluble
           hydrolase, to the vacuole. Subcellular fractionation
           indicate that Apg14p and Apg6p are peripherally
           associated with a membrane structure(s). Apg14p was
           co-immunoprecipitated with Apg6p, suggesting that they
           form a stable protein complex. These results imply that
           Apg6/Vps30p has two distinct functions: in the
           autophagic process and in the vacuolar protein sorting
           pathway. Apg14p may be a component specifically required
           for the function of Apg6/Vps30p through the autophagic
           pathway. There are 17 auto-phagosomal component proteins
           which are categorized into six functional units, one of
           which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
           AS-PI3K complex and the Atg2-Atg18 complex are essential
           for nucleation, and the specific function of the AS-PI3K
           apparently is to produce phosphatidylinositol
           3-phosphate (PtdIns(3)P) at the pre-autophagosomal
           structure (PAS). The localisation of this complex at the
           PAS is controlled by Atg14. Autophagy mediates the
           cellular response to nutrient deprivation, protein
           aggregation, and pathogen invasion in humans, and
           malfunction of autophagy has been implicated in multiple
           human diseases including cancer. This effect seems to be
           mediated through direct interaction of the human Atg14
           with Beclin 1 in the human phosphatidylinositol 3-kinase
           class III complex.
          Length = 307

 Score = 29.6 bits (67), Expect = 3.8
 Identities = 19/118 (16%), Positives = 44/118 (37%), Gaps = 15/118 (12%)

Query: 288 DERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRL 347
           +E  KLA +    ++ + +++ +Q++ +++  K+E E                 E+  R 
Sbjct: 52  NEDGKLAADLLKLEVARKKERLNQIRARISQLKEEIEQKRER-----------IEELKRA 100

Query: 348 LQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
           L Q R    +    + +R  S     L + + +  R + +  +L             E
Sbjct: 101 LAQRRSDLSSASYQLEKRRASQ----LEKLQDEIKRTRSKLNALHSLLAEKRSFLCRE 154


>gnl|CDD|219841 pfam08438, MMR_HSR1_C, GTPase of unknown function C-terminal.  This
           domain is found at the C-terminus of pfam01926 in
           archaeal and eukaryotic GTP-binding proteins. The
           C-terminal domain of the GTP-binding proteins is
           necessary for the complete activity of the protein of
           interacting with the 50S ribosome and binding of both
           adenine and guanine nucleotides, with a preference for
           guanine nucleotides.
          Length = 109

 Score = 28.3 bits (64), Expect = 3.9
 Identities = 14/28 (50%), Positives = 16/28 (57%)

Query: 400 IRKLEEKERFLQTVLTNAEKELHLRNQA 427
           I KL+EK      V T+AE EL LR  A
Sbjct: 14  IEKLKEKYPDEPVVPTSAEAELALRKAA 41


>gnl|CDD|214826 smart00806, AIP3, Actin interacting protein 3.  Aip3p/Bud6p is a
           regulator of cell and cytoskeletal polarity in
           Saccharomyces cerevisiae that was previously identified
           as an actin-interacting protein. Actin-interacting
           protein 3 (Aip3p) localizes at the cell cortex where
           cytoskeleton assembly must be achieved to execute
           polarized cell growth, and deletion of AIP3 causes gross
           defects in cell and cytoskeletal polarity. Aip3p
           localization is mediated by the secretory pathway,
           mutations in early- or late-acting components of the
           secretory apparatus lead to Aip3p mislocalization.
          Length = 426

 Score = 29.6 bits (67), Expect = 4.0
 Identities = 31/130 (23%), Positives = 52/130 (40%), Gaps = 11/130 (8%)

Query: 314 KQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDA------NFKLMTERIK 367
           K     + E ++L  E+ V  Q       +    ++ + EK D       +    + R  
Sbjct: 148 KLNKEQRAELKSLQRELAVLRQTHNSFFTEIKESIKDILEKIDKFKSSSLSASGSSNRAY 207

Query: 368 SNQLHKLAREEKDTLREQVQALSLQIEAMH--TAIRKLEEKERFLQTV---LTNAEKELH 422
                K   E+ D+L  +V  L   IEA+    A R +   ++ L+TV   L  A KEL 
Sbjct: 208 VESSKKKLSEDSDSLLTKVDDLQDIIEALRKDVAQRGVRPSKKQLETVQKELETARKELK 267

Query: 423 LRNQAMDLNK 432
              + +D+ K
Sbjct: 268 KMEEYIDIEK 277


>gnl|CDD|133905 PHA00489, PHA00489, scaffolding protein.
          Length = 101

 Score = 28.0 bits (62), Expect = 4.1
 Identities = 22/97 (22%), Positives = 37/97 (38%), Gaps = 6/97 (6%)

Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQE 342
            +       KL D E     +   +   QL++   +   E E L   +E      ED+  
Sbjct: 5   LEEHEAILNKLGDPELTESERT--EALQQLRESYGSFHSEYEELTEALEKLTAEKEDLIV 62

Query: 343 QNSRLLQQL----REKDDANFKLMTERIKSNQLHKLA 375
            NS+L +QL     E+++   K  +E I    L   +
Sbjct: 63  SNSKLFRQLGPTKEEEEEDKQKSFSETITIEDLEGNS 99


>gnl|CDD|223790 COG0718, COG0718, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 105

 Score = 28.0 bits (63), Expect = 4.2
 Identities = 15/38 (39%), Positives = 23/38 (60%), Gaps = 4/38 (10%)

Query: 299 MRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQA 336
           M  +++L  Q  Q+QK++    QEE   L + EVTG+A
Sbjct: 4   MMDMQKLMKQAQQMQKKMQ-KMQEE---LAQKEVTGKA 37


>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
          Length = 865

 Score = 29.7 bits (68), Expect = 4.4
 Identities = 26/126 (20%), Positives = 53/126 (42%), Gaps = 13/126 (10%)

Query: 332 VTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKS--NQLHKLAREEKDTLREQVQAL 389
           VTG+A  +   +   LL++L     A    + ER+++   +L +L +E +    +   A 
Sbjct: 687 VTGEAALEYLNEQEALLKELAALLKAKPSELPERVEALLEELKELEKELEQLKAKLAAAA 746

Query: 390 SLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESA----QSAADL 445
           +  + A    +  +    + L   +   + +  LR  A DL  +  + S      +  D 
Sbjct: 747 AGDLLAQAKEVNGV----KVLAAQVEGVDAK-ALRTLADDL--KDKLGSGVVVLATVEDG 799

Query: 446 KLHLVI 451
           K+ LV+
Sbjct: 800 KVSLVV 805


>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3.  This family
           consists of several radial spoke protein 3 (RSP3)
           sequences. Eukaryotic cilia and flagella present in
           diverse types of cells perform motile, sensory, and
           developmental functions in organisms from protists to
           humans. They are centred by precisely organised,
           microtubule-based structures, the axonemes. The axoneme
           consists of two central singlet microtubules, called the
           central pair, and nine outer doublet microtubules. These
           structures are well-conserved during evolution. The
           outer doublet microtubules, each composed of A and B
           sub-fibres, are connected to each other by nexin links,
           while the central pair is held at the centre of the
           axoneme by radial spokes. The radial spokes are T-shaped
           structures extending from the A-tubule of each outer
           doublet microtubule to the centre of the axoneme. Radial
           spoke protein 3 (RSP3), is present at the proximal end
           of the spoke stalk and helps in anchoring the radial
           spoke to the outer doublet. It is thought that radial
           spokes regulate the activity of inner arm dynein through
           protein phosphorylation and dephosphorylation.
          Length = 288

 Score = 29.3 bits (66), Expect = 4.5
 Identities = 14/68 (20%), Positives = 29/68 (42%), Gaps = 5/68 (7%)

Query: 321 QEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKD 380
           +EEE  L E+    + FE  +       Q+L E +    +   E+ +  +  K  ++ + 
Sbjct: 150 EEEE--LAELRQQQRQFEQRRNAELAETQRLEEAER---RRREEKERRKKQDKERKQREK 204

Query: 381 TLREQVQA 388
              E++ A
Sbjct: 205 ETAEKIAA 212


>gnl|CDD|217135 pfam02601, Exonuc_VII_L, Exonuclease VII, large subunit.  This
           family consist of exonuclease VII, large subunit
           EC:3.1.11.6 This enzyme catalyzes exonucleolytic
           cleavage in either 5'->3' or 3'->5' direction to yield
           5'-phosphomononucleotides. This exonuclease VII enzyme
           is composed of one large subunit and 4 small ones.
          Length = 295

 Score = 29.2 bits (66), Expect = 4.7
 Identities = 34/155 (21%), Positives = 56/155 (36%), Gaps = 30/155 (19%)

Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNE------MEVTGQAFEDMQEQNSRL-- 347
            E ++K++ LE +  +  K     +Q+   LL E        +  Q  E + E   RL  
Sbjct: 146 TELLQKLEGLEQRLSRALKNRLEKEQDRLNLLRERLKSLSRRLLEQHEERLAELRDRLIS 205

Query: 348 ----LQQLREKDDANFKLMTERIKSNQL-HKLAREEKDTLREQVQALSLQIEAMHTAIRK 402
               L   ++       L  E  K++ L +KLA      L  Q++ALS          + 
Sbjct: 206 SIQNLLSRKQSRLERLILNRELEKNSLLENKLATA---NLTAQLKALSPL--------KT 254

Query: 403 LEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIE 437
           L   +R    V     K +     A +L     +E
Sbjct: 255 L---KRGFAIVRRKDGKIVT---SAAELKPGDNLE 283


>gnl|CDD|233885 TIGR02479, FliA_WhiG, RNA polymerase sigma factor, FliA/WhiG
           family.  Most members of this family are the flagellar
           operon sigma factor FliA, controlling transcription of
           bacterial flagellar genes by RNA polymerase. An
           exception is the sigma factor WhiG in the genus
           Streptomyces, involved in the production of sporulating
           aerial mycelium.
          Length = 224

 Score = 29.2 bits (66), Expect = 4.8
 Identities = 32/134 (23%), Positives = 56/134 (41%), Gaps = 25/134 (18%)

Query: 299 MRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ-NSRLLQQLREKDDA 357
            +K ++LE    +L+ ++     EEE +  E+ +  + +     + N+  L  L E  ++
Sbjct: 79  RQKARKLERAIRELEARLGREPTEEE-IAEELGMDLKEYRQALNEINALSLVSLDELLES 137

Query: 358 N-----FKLMTERIKS-NQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQ 411
                      E  KS +   +L REE               EA+  AI  L E+E   Q
Sbjct: 138 GDDGGSLIDRIEDDKSEDPEEELEREEL-------------REALAEAIESLSERE---Q 181

Query: 412 TVLT-NAEKELHLR 424
            VL+    +EL+L+
Sbjct: 182 LVLSLYYYEELNLK 195


>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57.  Eukaryotic
           cilia and flagella are specialised organelles found at
           the periphery of cells of diverse organisms.
           Intra-flagellar transport (IFT) is required for the
           assembly and maintenance of eukaryotic cilia and
           flagella, and consists of the bidirectional movement of
           large protein particles between the base and the distal
           tip of the organelle. IFT particles contain multiple
           copies of two distinct protein complexes, A and B, which
           contain at least 6 and 11 protein subunits. IFT57 is
           part of complex B but is not, however, required for the
           core subunits to stay associated. This protein is known
           as Huntington-interacting protein-1 in humans.
          Length = 355

 Score = 29.3 bits (66), Expect = 4.9
 Identities = 29/136 (21%), Positives = 58/136 (42%), Gaps = 11/136 (8%)

Query: 317 ATHKQEEEALLNEMEVTGQA--------FEDMQEQNSRLLQQLREKDDANFKLMTERIKS 368
           A  K E E +L +++VT +          E M +    + + L E      KL  +   S
Sbjct: 190 AEWKLEVERVLPQLKVTIKTDAKDWRSHLEQMHQLKKAISKSLPETKQYLKKLSQD--LS 247

Query: 369 NQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAM 428
             L K++  EK  L  Q++ L  +       + +++EK       ++   +EL+  ++ +
Sbjct: 248 KALEKISSREK-HLNNQLEQLVQEYREARRTLSQVQEKYNQASQGVSELTRELNEISEEL 306

Query: 429 DLNKRKAIESAQSAAD 444
           +  K++  E   S +D
Sbjct: 307 EQVKQEMEERGASMSD 322


>gnl|CDD|217472 pfam03280, Lipase_chap, Proteobacterial lipase chaperone protein. 
          Length = 189

 Score = 28.8 bits (65), Expect = 4.9
 Identities = 15/70 (21%), Positives = 28/70 (40%), Gaps = 8/70 (11%)

Query: 284 DSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ 343
              R  R  L   EA +++ QL+ +    Q++   +  E + +L           D  +Q
Sbjct: 124 QDLRAARAALVGPEAAQRLAQLDQERAAWQQRYDDYLAERDQILA------AGLSDADKQ 177

Query: 344 NSRLLQQLRE 353
               + QLR+
Sbjct: 178 --AQIAQLRQ 185


>gnl|CDD|188641 cd00954, NAL, N-Acetylneuraminic acid aldolase, also called
           N-acetylneuraminate lyase (NAL).  N-Acetylneuraminic
           acid aldolase, also called N-acetylneuraminate lyase
           (NAL), which catalyses the reversible aldol reaction of
           N-acetyl-D-mannosamine and pyruvate to give
           N-acetyl-D-neuraminic acid (D-sialic acid). It has a
           widespread application as biocatalyst for the synthesis
           of sialic acid and its derivatives. This enzyme has been
           shown to be quite specific for pyruvate as the donor,
           but flexible to a variety of D- and, to some extent,
           L-hexoses and pentoses as acceptor substrates. NAL is
           member of dihydrodipicolinate synthase family that
           comprises several pyruvate-dependent class I aldolases.
          Length = 288

 Score = 29.2 bits (66), Expect = 5.1
 Identities = 12/52 (23%), Positives = 26/52 (50%), Gaps = 3/52 (5%)

Query: 394 EAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSAADL 445
           E    +   +EE+++  + V   A+ ++ L      LN +++ E A+ A +L
Sbjct: 47  EGFLLS---VEERKQIAEIVAEAAKGKVTLIAHVGSLNLKESQELAKHAEEL 95


>gnl|CDD|220719 pfam10368, YkyA, Putative cell-wall binding lipoprotein.  YkyA is a
           family of proteins containing a lipoprotein signal and a
           hydrolase domain. It is similar to cell wall binding
           proteins and might also be recognisable by a host immune
           defence system. It is thus likely to belong to pathways
           important for pathogenicity.
          Length = 205

 Score = 28.9 bits (65), Expect = 5.1
 Identities = 37/173 (21%), Positives = 77/173 (44%), Gaps = 47/173 (27%)

Query: 292 KLADEEAM-----RKIKQLEDQTHQLQKQV-------------------ATHKQEEEALL 327
           K A++E       +K+++LE +  +L  Q+                   A   + E+ L 
Sbjct: 27  KAAEQEKPFEEEQKKLEELEKKEQELYDQIIQLGKDDNEAIKKLSDQALANVDKREKLLK 86

Query: 328 NEMEVTGQAFEDMQEQNSRL----LQQLREKDDANFKLMTERIKS-NQLHK-------LA 375
            E E   ++ E+ +     +     ++L++K     K+M ER KS ++L+K       L 
Sbjct: 87  KEKESIEKSEEEFKSAKKYIEKIEDKKLKKKAKQLVKVMKERYKSYDKLYKAYKKALNLE 146

Query: 376 RE------EKDTLREQVQALSLQIEAMHTAIRKL-EEKERFLQ-TVLTNAEKE 420
           +E      +KD   +++     +I+A++ +  K+ + K++F + T   N EK+
Sbjct: 147 KELYEYLKDKDLTLKELDE---KIKAVNQSYEKIQKAKKKFNEYTKKYNKEKQ 196


>gnl|CDD|192780 pfam11594, Med28, Mediator complex subunit 28.  Mediator is a large
           complex of up to 33 proteins that is conserved from
           plants to fungi to humans - the number and
           representation of individual subunits varying with
           species. It is arranged into four different sections, a
           core, a head, a tail and a kinase-activity part, and the
           number of subunits within each of these is what varies
           with species. Overall, Mediator regulates the
           transcriptional activity of RNA polymerase II but it
           would appear that each of the four different sections
           has a slightly different function. Subunit Med28 of the
           Mediator may function as a scaffolding protein within
           Mediator by maintaining the stability of a submodule
           within the head module, and components of this submodule
           act together in a gene-regulatory programme to suppress
           smooth muscle cell differentiation. Thus, mammalian
           Mediator subunit Med28 functions as a repressor of
           smooth muscle-cell differentiation, which could have
           implications for disorders associated with abnormalities
           in smooth muscle cell growth and differentiation,
           including atherosclerosis, asthma, hypertension, and
           smooth muscle tumours.
          Length = 106

 Score = 28.0 bits (62), Expect = 5.2
 Identities = 15/52 (28%), Positives = 27/52 (51%), Gaps = 2/52 (3%)

Query: 367 KSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAE 418
           K N+L     + K TL+E+ Q L  +++       K + K  + + +LT+AE
Sbjct: 27  KRNELS--VFKPKKTLKEEAQKLKEEMQRKDQLQTKHDSKIDYWENLLTDAE 76


>gnl|CDD|148013 pfam06152, Phage_min_cap2, Phage minor capsid protein 2.  Family of
           related phage minor capsid proteins.
          Length = 361

 Score = 29.3 bits (66), Expect = 5.2
 Identities = 25/94 (26%), Positives = 43/94 (45%), Gaps = 5/94 (5%)

Query: 262 LSDSASNIYGSLEFRKFACVVRDSKRDERKKLADEEA---MRKIKQLEDQTHQLQKQVAT 318
           LS    +IY  LE   F  ++R  K  +  +LAD      + K+ +L     Q  K +A 
Sbjct: 11  LSQPVVDIYSKLELDLFDNMIRRLKTRKSLQLADNVEQWQIEKLNELGLLNKQNIKLIAK 70

Query: 319 H-KQEEEALLNEMEVTG-QAFEDMQEQNSRLLQQ 350
           +    EEA+   ++  G +  +D+ +Q S  L +
Sbjct: 71  YTGVAEEAITRVIKNAGLKIIKDIDKQLSEALNK 104


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 29.4 bits (67), Expect = 5.3
 Identities = 17/83 (20%), Positives = 29/83 (34%), Gaps = 24/83 (28%)

Query: 383 REQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELH--------LRNQAMDLNKR- 433
           ++ +     ++  +   I  LEE ER L+     AE  L         L  +   L +  
Sbjct: 508 KKLIGEDKEKLNEL---IASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEE 564

Query: 434 ------------KAIESAQSAAD 444
                       +AI+ A+  AD
Sbjct: 565 DKLLEEAEKEAQQAIKEAKKEAD 587



 Score = 29.0 bits (66), Expect = 8.2
 Identities = 26/128 (20%), Positives = 54/128 (42%), Gaps = 16/128 (12%)

Query: 284 DSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQ 343
           + +R+  +K   EEA   +K+ E    +L+++    ++EE+ LL E E   Q        
Sbjct: 527 ELERELEQKA--EEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQ------- 577

Query: 344 NSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKL 403
               +++ +++ D   K + +  K       A E  +  +     L+   E      +K 
Sbjct: 578 ---AIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARK----RLNKANEKKEKKKKKQ 630

Query: 404 EEKERFLQ 411
           +EK+  L+
Sbjct: 631 KEKQEELK 638


>gnl|CDD|234437 TIGR03998, thiol_BshC, bacillithiol biosynthesis cysteine-adding
           enzyme BshC.  Members of this protein family are BshC,
           an enzyme required for bacillithiol biosynthesis and
           described as a cysteine-adding enzyme. Bacillithiol is a
           low-molecular-weight thiol, an analog of glutathione and
           mycothiol, and is found largely in the Firmicutes
           [Biosynthesis of cofactors, prosthetic groups, and
           carriers, Glutathione and analogs].
          Length = 528

 Score = 29.2 bits (66), Expect = 5.5
 Identities = 15/99 (15%), Positives = 37/99 (37%), Gaps = 11/99 (11%)

Query: 313 QKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLH 372
            K+    +Q+ E +    E   +A E +   +  L ++L    D   + + E+  +    
Sbjct: 400 LKERFLERQDPEPIDALFE---EAKEQITAAHKPLREEL-AAVDPTLQGLAEKNLAKITK 455

Query: 373 KLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQ 411
           ++   EK  LR   +            +++ +  +  L+
Sbjct: 456 QVEFLEKRLLRAIRRKH-------DVELQQFDRIQTNLR 487


>gnl|CDD|150420 pfam09744, Jnk-SapK_ap_N, JNK_SAPK-associated protein-1.  This is
           the N-terminal 200 residues of a set of proteins
           conserved from yeasts to humans. Most of the proteins in
           this entry have an RhoGEF pfam00621 domain at their
           C-terminal end.
          Length = 158

 Score = 28.5 bits (64), Expect = 5.5
 Identities = 23/91 (25%), Positives = 42/91 (46%), Gaps = 5/91 (5%)

Query: 337 FEDMQEQNSRLLQQLR----EKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQ 392
            E ++E N RL  Q       +  A  KL+    +  Q  K  +++ + L E V+ L L+
Sbjct: 52  LELLREDNERLSTQYEREKELRKQAEQKLLEFEDELEQEKKELQKKIEDLEENVRQLELK 111

Query: 393 IEAMHTAIRKLEEKERFLQTVLTNAEKELHL 423
            + +   + +LEE+E  L+    N+  E + 
Sbjct: 112 AKNLSDQVSRLEERETELKKEY-NSLHERYT 141


>gnl|CDD|222255 pfam13600, DUF4140, N-terminal domain of unknown function
           (DUF4140).  This family is often found at the N-terminus
           of its member proteins, with DUF4139, pfam13598, at the
           C-terminus.
          Length = 101

 Score = 27.6 bits (62), Expect = 5.5
 Identities = 12/41 (29%), Positives = 22/41 (53%), Gaps = 3/41 (7%)

Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEAL 326
           +RD     A   A   + +LE++   L+ ++A  + E+EAL
Sbjct: 56  RRDYLPDEATSPA---LAELEEELRALEDELAALEAEKEAL 93


>gnl|CDD|235855 PRK06707, PRK06707, amidase; Provisional.
          Length = 536

 Score = 29.1 bits (65), Expect = 5.8
 Identities = 22/91 (24%), Positives = 39/91 (42%), Gaps = 11/91 (12%)

Query: 341 QEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAI 400
            E    +     ++ +   K + E I  N+        KD+ R      +L   +  +AI
Sbjct: 388 YEFKHNVNDYFSQQKNVPVKSLEEIIAFNK--------KDSKRRIKYGQTLIEASEKSAI 439

Query: 401 RKLEEKERFLQTVLTNAEKEL--HLRNQAMD 429
            K E ++  +QT   NA+KEL  +L  + +D
Sbjct: 440 TKDEFEKV-VQTSQENAKKELDRYLVEKGLD 469


>gnl|CDD|225521 COG2974, RdgC, DNA recombination-dependent growth factor C [DNA
           replication, recombination, and repair].
          Length = 303

 Score = 28.9 bits (65), Expect = 5.9
 Identities = 10/23 (43%), Positives = 16/23 (69%)

Query: 364 ERIKSNQLHKLAREEKDTLREQV 386
           E I++ Q  KL ++EKD L++ V
Sbjct: 88  EEIEAKQGRKLKKKEKDELKDDV 110


>gnl|CDD|191593 pfam06721, DUF1204, Protein of unknown function (DUF1204).  This
           family represents the C-terminus of a number of
           Arabidopsis thaliana hypothetical proteins of unknown
           function. Family members contain a conserved DFD motif.
          Length = 232

 Score = 28.9 bits (64), Expect = 6.0
 Identities = 24/81 (29%), Positives = 34/81 (41%), Gaps = 14/81 (17%)

Query: 305 LEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTE 364
           LE       K+ A  K    AL  E E   +A  D+           R +DDA      E
Sbjct: 7   LEKCLESASKEAAHAKDAHAALAVEKEKMAEALADLA----------RCQDDAE----EE 52

Query: 365 RIKSNQLHKLAREEKDTLREQ 385
           + K+++L   A+ EK  LRE+
Sbjct: 53  KEKTDELADRAKSEKKRLRER 73


>gnl|CDD|131526 TIGR02473, flagell_FliJ, flagellar export protein FliJ.  Members of
           this family are the FliJ protein found, in nearly every
           case, in the midst of other flagellar biosynthesis genes
           in bacgterial genomes. Typically the fliJ gene is found
           adjacent to the gene for the flagellum-specific ATPase
           FliI. Sequence scoring in the gray zone between trusted
           and noise cutoffs include both probable FliJ proteins
           and components of bacterial type III secretion systems.
          Length = 141

 Score = 28.0 bits (63), Expect = 6.0
 Identities = 19/112 (16%), Positives = 45/112 (40%), Gaps = 15/112 (13%)

Query: 289 ERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQE------ 342
           + ++  +E+A  ++ + + +  +L+ Q+    +  E    +      A     E      
Sbjct: 9   DLREKEEEQAKLELAKAQAEFERLETQLQQLIKYREEYEQQALEKVGAGTSALELSNYQR 68

Query: 343 ----QNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEK--DTLREQVQA 388
                + R+ QQ +E      ++  +R    +L +  RE K  + L+E+ Q 
Sbjct: 69  FIRQLDQRIQQQQQELALLQQEVEAKR---ERLLEARRELKALEKLKEKKQK 117


>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
          Length = 278

 Score = 29.1 bits (65), Expect = 6.0
 Identities = 23/81 (28%), Positives = 41/81 (50%), Gaps = 8/81 (9%)

Query: 296 EEAMRKIKQLED---QTHQLQKQVATHKQEEEALL----NEMEVTGQAFEDMQEQNSRLL 348
           EE +RK + LE+   +  + QKQ+A  K  +         E E+  Q FE ++ + S+  
Sbjct: 11  EEELRKAQSLEELKQKYEEAQKQIADGKTLKRLYKVYEKREFELKKQQFEQLKAELSKKK 70

Query: 349 QQLR-EKDDANFKLMTERIKS 368
           ++ + EK D   K++ + I S
Sbjct: 71  KKFKKEKVDVRVKVVKKWINS 91


>gnl|CDD|214787 smart00721, BAR, BAR domain. 
          Length = 239

 Score = 28.9 bits (65), Expect = 6.4
 Identities = 27/105 (25%), Positives = 39/105 (37%), Gaps = 14/105 (13%)

Query: 286 KRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNS 345
           K+ ERK L  + A  K+K+ +    + +K     K EEE     +    Q FE+   Q  
Sbjct: 141 KKLERKLLDYDSARHKLKKAKKSK-EKKKDEKLAKAEEE-----LRKAKQEFEESNAQLV 194

Query: 346 RLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALS 390
             L QL       F         N L  L   + +  RE  + L 
Sbjct: 195 EELPQLVASRVDFFV--------NCLQALIEAQLNFHRESYKLLQ 231


>gnl|CDD|234967 PRK01642, cls, cardiolipin synthetase; Reviewed.
          Length = 483

 Score = 29.0 bits (66), Expect = 6.4
 Identities = 7/46 (15%), Positives = 16/46 (34%)

Query: 283 RDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLN 328
              +R ER +L      + ++ L+   H   ++ +        L  
Sbjct: 60  LGKRRAERARLMWPSTAKWLRDLKACKHIFAEENSEVAAPLFRLCE 105


>gnl|CDD|151391 pfam10944, DUF2630, Protein of unknown function (DUF2630).  This
           bacterial family of proteins have no known function.
          Length = 81

 Score = 26.9 bits (60), Expect = 6.6
 Identities = 10/44 (22%), Positives = 22/44 (50%), Gaps = 1/44 (2%)

Query: 362 MTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEE 405
           M ++    ++++L  EE + LR ++Q   +     H  + +LE 
Sbjct: 1   MDDQDILARINELVAEEHE-LRAKLQQGEIDESEEHARLARLEV 43


>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein.  This model
           represents a relatively rare but broadly distributed
           uncharacterized protein family, distributed in 1-2
           percent of bacterial genomes, all of which have outer
           membranes. In many of these genomes, it is part of a
           two-gene pair.
          Length = 555

 Score = 28.9 bits (65), Expect = 6.7
 Identities = 24/158 (15%), Positives = 58/158 (36%), Gaps = 18/158 (11%)

Query: 291 KKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQ 350
           + L   E ++ ++  E+    L+      K+ ++ L N      Q  E+ +       ++
Sbjct: 152 RALLKGEDLKTVETAEEIEKSLKAMQQKWKKRKKDLPN-----KQDLEEYK-------KR 199

Query: 351 LREKDDANFKLMTERIKSNQLHKLAREEKDTLREQV----QALSLQIEAMHTAIRKLEE- 405
           L      + K   E  K  +     ++E    ++++      L    + +   + +L++ 
Sbjct: 200 LEAIKKKDIKNPLELQKIKEEFDKLKKEGKADKQKIKSAKNDLQNDKKQLKADLAELKKA 259

Query: 406 -KERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQSA 442
            +    +     A K   L+N A+DL   +  +  Q  
Sbjct: 260 PQNDLKRLENKYAIKSGDLKNFAVDLFGPEIRKYLQKF 297


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 265

 Score = 28.5 bits (64), Expect = 6.9
 Identities = 17/89 (19%), Positives = 42/89 (47%), Gaps = 10/89 (11%)

Query: 301 KIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFK 360
           KI+  + +  +LQK+    + E E+L N++E      +++Q++  +  +   +K      
Sbjct: 32  KIQNQDSKLSELQKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQ-SKAEIKKLQKEIA 90

Query: 361 LMTERIKSNQLHKLAREEKDTLREQVQAL 389
            + E I          E ++ L+++ +A+
Sbjct: 91  ELKENI---------VERQELLKKRARAM 110


>gnl|CDD|177251 MTH00208, ND5, NADH dehydrogenase subunit 5; Provisional.
          Length = 628

 Score = 29.2 bits (66), Expect = 7.3
 Identities = 18/134 (13%), Positives = 47/134 (35%), Gaps = 18/134 (13%)

Query: 123 LSSTVFVYHCFSIFRFSSSTVF----VYHCFSIFRFSSSTVFVYHCFSIFRFSSSTVFVY 178
            S  + ++    I  F SS +               ++S +        +  S+ ++ + 
Sbjct: 10  SSINLSIFIILLISIFFSSNLPNISTSNASLGGNNNNNSIIN-------YNNSNFSISLL 62

Query: 179 HCFSIFRFSSSTVFVYHCFSIFRFSSSTVFVYHCFSIFRFSSSTIFVYHCFDIFSLTSTL 238
              +     +  +++     +    +  + + +  S   F+ S  F++  + I   +  L
Sbjct: 63  KTLAFLSLINLLIWI-----LLGCPNINISLSNWLSNTLFNFSLSFIFDFYFILFFSVAL 117

Query: 239 FVSICTSAFQMSFY 252
           FV+   S  + S Y
Sbjct: 118 FVT--WSIIEFSHY 129


>gnl|CDD|236798 PRK10929, PRK10929, putative mechanosensitive channel protein;
           Provisional.
          Length = 1109

 Score = 29.3 bits (66), Expect = 7.4
 Identities = 38/142 (26%), Positives = 68/142 (47%), Gaps = 19/142 (13%)

Query: 285 SKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQE--EEALLN-EM--EVTGQAFED 339
           + R E  +L  E A ++ +QL+     L+ Q+ + +Q   E AL + E+  E +G   + 
Sbjct: 200 NNRQELARLRSELAKKRSQQLDAYLQALRNQLNSQRQREAERALESTELLAEQSGDLPKS 259

Query: 340 MQEQ---N---SRLL-QQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQ 392
           +  Q   N   S+ L QQ +  D    +   +R  ++Q  ++ R+  +TLREQ Q L + 
Sbjct: 260 IVAQFKINRELSQALNQQAQRMDLIASQ---QRQAASQTLQV-RQALNTLREQSQWLGVS 315

Query: 393 I---EAMHTAIRKLEEKERFLQ 411
               EA+   + +L E  +  Q
Sbjct: 316 NALGEALRAQVARLPEMPKPQQ 337


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
           and conversion].
          Length = 161

 Score = 28.0 bits (63), Expect = 7.6
 Identities = 22/96 (22%), Positives = 42/96 (43%), Gaps = 12/96 (12%)

Query: 349 QQLREKDDANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKER 408
           ++L+E+  A         +  Q  + ARE+   + EQ +  + QI     A     E E 
Sbjct: 50  ERLKEEAQALLA------EYEQELEEAREQASEIIEQAKKEAEQI-----AEEIKAEAEE 98

Query: 409 FLQTVLTNAEKELHL-RNQAMDLNKRKAIESAQSAA 443
            L+ +   AE E+   + +A++  + +  E A + A
Sbjct: 99  ELERIKEAAEAEIEAEKERALEELRAEVAELAVAIA 134


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 27.6 bits (62), Expect = 7.7
 Identities = 13/46 (28%), Positives = 29/46 (63%), Gaps = 4/46 (8%)

Query: 284 DSKRDERKKLA----DEEAMRKIKQLEDQTHQLQKQVATHKQEEEA 325
           + K+ E +KLA    +E+A  ++++LE +  +L+ ++A  + + EA
Sbjct: 29  EEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARRELKAEA 74


>gnl|CDD|146145 pfam03357, Snf7, Snf7.  This family of proteins are involved in
           protein sorting and transport from the endosome to the
           vacuole/lysosome in eukaryotic cells. Vacuoles/lysosomes
           play an important role in the degradation of both lipids
           and cellular proteins. In order to perform this
           degradative function, vacuoles/lysosomes contain
           numerous hydrolases which have been transported in the
           form of inactive precursors via the biosynthetic pathway
           and are proteolytically activated upon delivery to the
           vacuole/lysosome. The delivery of transmembrane
           proteins, such as activated cell surface receptors to
           the lumen of the vacuole/lysosome, either for
           degradation/downregulation, or in the case of
           hydrolases, for proper localisation, requires the
           formation of multivesicular bodies (MVBs). These late
           endosomal structures are formed by invaginating and
           budding of the limiting membrane into the lumen of the
           compartment. During this process, a subset of the
           endosomal membrane proteins is sorted into the forming
           vesicles. Mature MVBs fuse with the vacuole/lysosome,
           thereby releasing cargo containing vesicles into its
           hydrolytic lumen for degradation. Endosomal proteins
           that are not sorted into the intralumenal MVB vesicles
           are either recycled back to the plasma membrane or Golgi
           complex, or remain in the limiting membrane of the MVB
           and are thereby transported to the limiting membrane of
           the vacuole/lysosome as a consequence of fusion.
           Therefore, the MVB sorting pathway plays a critical role
           in the decision between recycling and degradation of
           membrane proteins. A few archaeal sequences are also
           present within this family.
          Length = 169

 Score = 28.0 bits (63), Expect = 7.7
 Identities = 13/44 (29%), Positives = 24/44 (54%), Gaps = 2/44 (4%)

Query: 393 IEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAI 436
           I ++  AIR+L++K+  L+  +   E E  ++  A   NK  A+
Sbjct: 3   ILSLRKAIRELDKKQESLEKKIKKLEAE--IKKLAKKGNKDAAL 44


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 28.5 bits (64), Expect = 8.0
 Identities = 24/126 (19%), Positives = 58/126 (46%), Gaps = 10/126 (7%)

Query: 301 KIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRL---LQQLRE-KDD 356
           +++ LE     L++ +   K++EE L  ++ +       ++++   L   +  LR+  D+
Sbjct: 141 RMQLLEGLKEGLEENLEGMKRDEELLNKDLNLINSIKPKLRKKLQALKEEIASLRQLADE 200

Query: 357 ANFKLMTERIKSNQLHK--LAREEK-DTLREQVQALSLQIEAMHTAIRKLEEKERFLQTV 413
            N   + + ++  +  +   +   K    R+Q++ L  +++ +  AI  L  K+  L   
Sbjct: 201 LN---LCDPLELEKARQELRSLSVKISEKRKQLEELQQELQELTIAIEALTNKKSELLEE 257

Query: 414 LTNAEK 419
           +  AEK
Sbjct: 258 IAEAEK 263


>gnl|CDD|147845 pfam05914, RIB43A, RIB43A.  This family consists of several
           RIB43A-like eukaryotic proteins. Ciliary and flagellar
           microtubules contain a specialised set of
           protofilaments, termed ribbons, that are composed of
           tubulin and several associated proteins. RIB43A was
           first characterized in the unicellular biflagellate,
           Chlamydomonas reinhardtii although highly related
           sequences are present in several higher eukaryotes
           including humans. The function of this protein is
           unknown although the structure of RIB43A and its
           association with the specialised protofilament ribbons
           and with basal bodies is relevant to the proposed role
           of ribbons in forming and stabilising doublet and
           triplet microtubules and in organising their
           three-dimensional structure. Human RIB43A homologues
           could represent a structural requirement in centriole
           replication in dividing cells.
          Length = 379

 Score = 28.9 bits (65), Expect = 8.0
 Identities = 15/94 (15%), Positives = 40/94 (42%), Gaps = 12/94 (12%)

Query: 274 EFRKFACVVRDSKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVT 333
             RK     ++ +  E+++  +EE +R+ K+ + Q     +     +++      E  + 
Sbjct: 284 AIRK----GQEQQLQEKERRREEEQLRE-KEWDRQAINQARAAVLLERQ------ERRLR 332

Query: 334 GQAFEDMQEQNSRL-LQQLREKDDANFKLMTERI 366
            +  + +   N +L  +Q  +++  N +L T   
Sbjct: 333 KELRKQLDSANQQLAKEQKAQQEYLNKELYTNAP 366



 Score = 28.5 bits (64), Expect = 9.5
 Identities = 41/208 (19%), Positives = 70/208 (33%), Gaps = 43/208 (20%)

Query: 276 RKFACVVRDSKRDERKKLA-----DEEAMRK----------IKQLEDQTHQLQKQVATHK 320
            K A  + D  R E    A      EE  R+              E+Q    +++    +
Sbjct: 178 EKQADTLHDQLRVELDHRAQELASLEEECRRAVEQATADFNKALAEEQR---ERERREKQ 234

Query: 321 QEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLA----- 375
           +E+E  L E+    Q   D+  +N       +     + +++ +R K     +LA     
Sbjct: 235 EEQEDNLAEIY--NQLTSDLLTENP---DVAQSSFGPH-RVIPDRWKGMSPEQLAAIRKG 288

Query: 376 ----REEKDTLREQVQALSLQIEAM-HTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDL 430
                +EK+  RE+ Q    + +       R     ER  +           LR Q    
Sbjct: 289 QEQQLQEKERRREEEQLREKEWDRQAINQARAAVLLERQER------RLRKELRKQLDSA 342

Query: 431 NKRKAIESAQSAADLKLHLVIIYTNGPA 458
           N++ A E     A  +     +YTN P 
Sbjct: 343 NQQLAKE---QKAQQEYLNKELYTNAPT 367


>gnl|CDD|182123 PRK09866, PRK09866, hypothetical protein; Provisional.
          Length = 741

 Score = 29.0 bits (65), Expect = 8.1
 Identities = 17/74 (22%), Positives = 33/74 (44%), Gaps = 1/74 (1%)

Query: 357 ANFKLMTERIKSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTN 416
           AN  L   R  +++L   A++ ++ L  +   L++  E +   I ++EE  + LQ     
Sbjct: 412 ANASLYALRSAAHKLLNYAQQAREYLDFRAHGLNVACEQLRQNIHQVEESLQLLQLNQAQ 471

Query: 417 AEKEL-HLRNQAMD 429
              E+ H    A+ 
Sbjct: 472 VSGEIKHEIELALT 485


>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
          Length = 977

 Score = 28.7 bits (64), Expect = 8.4
 Identities = 49/253 (19%), Positives = 91/253 (35%), Gaps = 68/253 (26%)

Query: 282 VRDSKRDERKKLADEEAMR---KIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFE 338
           +R S+ D R KLA +E +      +QLE   ++L  + AT      +L  E        +
Sbjct: 177 MRLSETDARIKLAAQEKIHVEILEEQLEKLRNELLIRGATEGLCVHSLSKE-------LD 229

Query: 339 DMQEQNSRLLQQLREKDDANF---KLMTERIKSNQLHKLAREE---KDTLRE-------- 384
            ++E+N  L      KDD  F   +L+       ++ KL +E      +LRE        
Sbjct: 230 VLKEENMLL------KDDIQFLKAELIEVAETEERVFKLEKERSLLDASLRELESKFIVA 283

Query: 385 ---QVQALSLQIEAMHTAIRKLEEKERFLQTVLTNAEKELHLRNQAMDLNKRKAIESAQS 441
                +   LQ +       K+E  +  L       EK   + +Q  DL  R  ++  + 
Sbjct: 284 QEDVSKLSPLQYDCW---WEKVENLQDLLDRATNQVEKAALVLDQNQDL--RDKVDKLE- 337

Query: 442 AADLKLHLVIIYTNGPARLEFGKDREFSAKFVNFRRKSYFFVEFLDCLRTRYETRQRKCP 501
                           A L+     +FS+               ++ L+ + +  + +  
Sbjct: 338 ----------------ASLKEANVSKFSS-------------YKVELLQQKLKLLEERLQ 368

Query: 502 KCNAKANDYHRLY 514
             + + + Y +LY
Sbjct: 369 ASDHEIHSYIQLY 381


>gnl|CDD|233419 TIGR01450, recC, exodeoxyribonuclease V, gamma subunit.  This model
           describes the gamma subunit of exodeoxyribonuclease V.
           Species containing this protein should also have the
           alpha (TIGR01447) and beta (TIGR00609) subunits.
           Candidates from Borrelia and from the Chlamydias differ
           dramatically and score between trusted and noise cutoffs
           [DNA metabolism, DNA replication, recombination, and
           repair].
          Length = 1067

 Score = 28.9 bits (65), Expect = 8.5
 Identities = 20/81 (24%), Positives = 36/81 (44%), Gaps = 18/81 (22%)

Query: 326 LLNEMEVTGQ--AFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDTLR 383
           LL E+E + +  AF +       LL QL + D  + +    R   N+   L  +++    
Sbjct: 302 LLPELEQSQEIDAFLEATPDT--LLHQL-QADILHLE---PRPVDNKRPSLPLDDR---- 351

Query: 384 EQVQALSLQIEAMHTAIRKLE 404
                 S+Q  A H+ +R++E
Sbjct: 352 ------SIQFHACHSPLREVE 366


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 28.5 bits (64), Expect = 8.6
 Identities = 16/96 (16%), Positives = 35/96 (36%), Gaps = 11/96 (11%)

Query: 285 SKRDERKKLADEEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQN 344
             R +RK    +    K K+   +  +  K+    K++   L        +  E  +E  
Sbjct: 272 PVRPKRKT---KAQRNKEKR-RKELEREAKEEKQLKKKLAQLA-------RLKEIAKEVA 320

Query: 345 SRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKD 380
            +   + R+K+    +   +++K  +L K    E  
Sbjct: 321 QKEKARARKKEQRKERGEKKKLKRRKLGKHKYPEPP 356


>gnl|CDD|219969 pfam08687, ASD2, Apx/Shroom domain ASD2.  This region is found in
           the actin binding protein Shroom which mediates apical
           contriction in epithelial cells and is required for
           neural tube closure.
          Length = 274

 Score = 28.5 bits (64), Expect = 8.9
 Identities = 29/100 (29%), Positives = 42/100 (42%), Gaps = 21/100 (21%)

Query: 315 QVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKL 374
             +  K E   LL +M+       DMQ+      Q   E D+   +   E I S    KL
Sbjct: 69  STSAAKAE---LLIKMK-------DMQQ-----GQSEPEPDNDLQQKKVELISSLG-KKL 112

Query: 375 A--REEKDTLREQVQA---LSLQIEAMHTAIRKLEEKERF 409
               EE+++L E +QA   L  ++EA      K  E E+F
Sbjct: 113 QDLHEEQESLLEDIQANNRLGEEVEAKVKQACKPNEFEKF 152


>gnl|CDD|201451 pfam00804, Syntaxin, Syntaxin.  Syntaxins are the prototype family
           of SNARE proteins. They usually consist of three main
           regions - a C-terminal transmembrane region, a central
           SNARE domain which is characteristic of and conserved in
           all syntaxins (pfam05739), and an N-terminal domain that
           is featured in this entry. This domain varies between
           syntaxin isoforms; in syntaxin 1A it is found as three
           alpha-helices with a left-handed twist. It may fold back
           on the SNARE domain to allow the molecule to adopt a
           'closed' configuration that prevents formation of the
           core fusion complex - it thus has an auto-inhibitory
           role. The function of syntaxins is determined by their
           localisation. They are involved in neuronal exocytosis,
           ER-Golgi transport and Golgi-endosome transport, for
           example. They also interact with other proteins as well
           as those involved in SNARE complexes. These include
           vesicle coat proteins, Rab GTPases, and tethering
           factors.
          Length = 103

 Score = 26.9 bits (60), Expect = 9.4
 Identities = 22/94 (23%), Positives = 43/94 (45%), Gaps = 7/94 (7%)

Query: 296 EEAMRKIKQLEDQTHQLQKQVATHK---QEEEALLNEMEVTGQAFEDMQEQNSRLLQQLR 352
           EE   +I+++     +LQK           ++ L  E+E   Q  + +     + L+ L+
Sbjct: 10  EEIREEIQKIRQNLEELQKLHKRILTAPDSDKELREELEELTQEIKQLARAIKKKLKSLQ 69

Query: 353 EKDDANFKLMT----ERIKSNQLHKLAREEKDTL 382
           + + +   L      +RI+ NQL  L+++ KD L
Sbjct: 70  QLNKSEQGLGHSSAVDRIRKNQLANLSKKFKDVL 103


>gnl|CDD|226127 COG3599, DivIVA, Cell division initiation protein [Cell division
           and chromosome partitioning].
          Length = 212

 Score = 28.2 bits (63), Expect = 9.5
 Identities = 20/149 (13%), Positives = 55/149 (36%), Gaps = 18/149 (12%)

Query: 300 RKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANF 359
            + + LED+  +L++++      E+                 +Q     ++L++  +A  
Sbjct: 44  DENEDLEDEIDELKEELKEAADAED-------------SQAIQQAETEAEELKQAAEAEA 90

Query: 360 KLMTERI--KSNQLHKLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTVLTNA 417
             + +R   ++ ++   A+ + D   E  + ++ ++E +    +   ++ R        +
Sbjct: 91  DDILKRASAQAQRVFGKAQYKADRYLEDAK-IAQELEDLKRQAQVERQRLRSDIEAQLAS 149

Query: 418 EKELHLRNQAMDLNKR--KAIESAQSAAD 444
            K+                A E A+  AD
Sbjct: 150 AKQEDWDEILRSTVDEVEAANEEAERLAD 178


>gnl|CDD|163611 pfam10046, BLOC1_2, Biogenesis of lysosome-related organelles
           complex-1 subunit 2.  Members of this family of proteins
           play a role in cellular proliferation, as well as in the
           biogenesis of specialized organelles of the
           endosomal-lysosomal system.
          Length = 99

 Score = 26.9 bits (60), Expect = 9.5
 Identities = 15/64 (23%), Positives = 32/64 (50%), Gaps = 4/64 (6%)

Query: 346 RLLQQLREKDDANFKLMTERIKS--NQLHKLAR--EEKDTLREQVQALSLQIEAMHTAIR 401
           +LL+++ E     +K M +  K    ++  L +  EE     +Q+ A+  Q+  +  A+ 
Sbjct: 24  KLLEKMNENTALKYKKMRQSAKGLRVEIEALNQKYEELKPYLQQIDAIEQQVTTLEEAVY 83

Query: 402 KLEE 405
           +L+E
Sbjct: 84  ELDE 87


>gnl|CDD|202101 pfam02050, FliJ, Flagellar FliJ protein. 
          Length = 122

 Score = 27.2 bits (61), Expect = 9.6
 Identities = 24/135 (17%), Positives = 54/135 (40%), Gaps = 23/135 (17%)

Query: 296 EEAMRKIKQLEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQN-SRLLQQLREK 354
           ++A R + + + +  Q ++++   +Q       ++  +GQ     + +N    +  L E 
Sbjct: 1   DKAARNLAEAQQELEQAEEKLEELQQYRLEYRQQLSGSGQGISAAELRNYQAFISALDE- 59

Query: 355 DDANFKLMTERIKSNQLH-KLAREEKDTLREQVQALSLQIEAMHTAIRKLEEKERFLQTV 413
                      I   Q   + A ++ +  REQ Q  + + + +     KL E+E+     
Sbjct: 60  ----------AIAQQQQELEQAEKQVEQAREQWQEANQERKKL----EKLLEREK----- 100

Query: 414 LTNAEKELHLRNQAM 428
               +KE + R Q +
Sbjct: 101 -KKEQKEENRREQKL 114


>gnl|CDD|235389 PRK05287, PRK05287, hypothetical protein; Provisional.
          Length = 250

 Score = 28.2 bits (64), Expect = 9.7
 Identities = 21/57 (36%), Positives = 27/57 (47%), Gaps = 11/57 (19%)

Query: 302 IKQLEDQTHQLQK-----QVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLRE 353
           +K+LE Q  +LQK      V       EALL E+E   QA   +     R+ Q LRE
Sbjct: 64  LKELERQRQKLQKWRGNPGVDQEA--LEALLQELE---QASAALNAAP-RIGQSLRE 114


>gnl|CDD|217869 pfam04065, Not3, Not1 N-terminal domain, CCR4-Not complex
           component. 
          Length = 233

 Score = 28.0 bits (63), Expect = 9.8
 Identities = 24/97 (24%), Positives = 36/97 (37%), Gaps = 21/97 (21%)

Query: 305 LEDQTHQLQKQVATHKQEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTE 364
           L +   +L KQ+   + E E+L  +     +A    QE+    + +L E          E
Sbjct: 127 LSNTIDELNKQIELLEAEVESLSVQR-KKRKADAAKQER----IAELEEL--------LE 173

Query: 365 RIKSNQ------LHKLAREEKDTLREQVQALSLQIEA 395
           R K +       L  L   E D   EQV  +   IE 
Sbjct: 174 RHKFHISKLELILRLLENGELDP--EQVDDIKEDIEY 208


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.325    0.134    0.390 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 25,339,082
Number of extensions: 2470202
Number of successful extensions: 5613
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5003
Number of HSP's successfully gapped: 633
Length of query: 517
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 416
Effective length of database: 6,457,848
Effective search space: 2686464768
Effective search space used: 2686464768
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 61 (27.4 bits)