RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy2547
         (1807 letters)



>gnl|CDD|215682 pfam00055, Laminin_N, Laminin N-terminal (Domain VI). 
          Length = 237

 Score =  271 bits (695), Expect = 5e-83
 Identities = 127/246 (51%), Positives = 158/246 (64%), Gaps = 18/246 (7%)

Query: 40  CYPATGNLLIGRENKLRASSTCGLHGNSTFCIVSFLEDK-KKCFTCNSSPRFDNNPAFSH 98
           CYPATGNL I R   L A+STCGLHG   +CI+S L+ + KKCF C+S+     NP  SH
Sbjct: 1   CYPATGNLAINRA--LSATSTCGLHGPEPYCILSHLQPRDKKCFLCDSN---SPNPRESH 55

Query: 99  KIHNIVYATVPRTKTISWWQSE---NGVE--NVTIQLDMEAEFHFTHLIIYFKTFRPAAM 153
            I  +     P+  T  WWQSE   NGV+  NVTI LD+EAEFHFT++II FKTFRPAAM
Sbjct: 56  PISFLTDTFNPQDTT--WWQSETMQNGVQYPNVTITLDLEAEFHFTYVIIKFKTFRPAAM 113

Query: 154 LIERSHDFGKTWQVYRYFAADCAKSFPGIPRDGLRKL---TDVYCESRYSSNNPSSGGEV 210
           + ERS DFG TW  Y+Y+A  C  ++PGIPR  +R      DV C SRYS   P + GEV
Sbjct: 114 IYERSSDFG-TWIPYQYYAYSCESTYPGIPRRPIRTGRAEDDVICTSRYSDIEPLTEGEV 172

Query: 211 ILRVLPPSLHHSNIDPYSSDVQNLIKMTNLRINFTKLHTLGDDLLDTRRQIQEKYYYAIT 270
           I   L       N DP S  +Q  +K TN+RI   +LHTLGD+LLD+  ++ EKYYYAI+
Sbjct: 173 IFSTLEGRPSADNFDP-SPRLQEWLKATNIRITLQRLHTLGDNLLDSDPEVLEKYYYAIS 231

Query: 271 QMIVRG 276
            ++V G
Sbjct: 232 DIVVGG 237


>gnl|CDD|214532 smart00136, LamNT, Laminin N-terminal domain (domain VI).
           N-terminal domain of laminins and laminin-related
           protein such as Unc-6/ netrins.
          Length = 238

 Score =  253 bits (648), Expect = 1e-76
 Identities = 99/250 (39%), Positives = 139/250 (55%), Gaps = 19/250 (7%)

Query: 34  SCERKSCYPATGNLLIGRENKLRASSTCGLHGNSTFCI-VSFLEDKKKCFTCNSSPRFDN 92
           +   +SCYP   NL  GRE  + A+STCG  G   +C  V   E  KKC  C++      
Sbjct: 1   AGRPRSCYPPFVNLAFGRE--VTATSTCGEPGPERYCKLVGHTEQGKKCDYCDA-----R 53

Query: 93  NPAFSHKIHNIVYATVPRTKTISWWQSEN---GVENVTIQLDMEAEFHFTHLIIYFKTFR 149
           NP  SH   N+     P   T  WWQSE    G +NV + LD+  EFH T++I+ F + R
Sbjct: 54  NPRRSHPAENLTDGNNPNNPT--WWQSEPLSNGPQNVNLTLDLGKEFHVTYVILKFCSPR 111

Query: 150 PAAMLIERSHDFGKTWQVYRYFAADCAKSFPGIPRDGLRKL--TDVYCESRYSSNNPSSG 207
           P+  ++ERS DFGKTWQ ++YF++DC ++F   PR  + K    +V C S YS   P  G
Sbjct: 112 PSLWILERS-DFGKTWQPWQYFSSDCRRTFGRPPRGPITKGNEDEVICTSEYSDIVPLEG 170

Query: 208 GEVILRVLPPSLHHSNIDPYSSDV-QNLIKMTNLRINFTKLHTLGDDLLDTRRQIQEKYY 266
           GE+   +L      S  D  +S V Q  +  TN+R+  T+L TLGD+L+D R ++  +YY
Sbjct: 171 GEIAFSLLEG--RPSATDFDNSPVLQEWVTATNIRVRLTRLRTLGDELMDDRPEVTRRYY 228

Query: 267 YAITQMIVRG 276
           YAI+ + V G
Sbjct: 229 YAISDIAVGG 238


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
            chromosome partitioning].
          Length = 1163

 Score = 86.3 bits (214), Expect = 1e-16
 Identities = 101/607 (16%), Positives = 230/607 (37%), Gaps = 103/607 (16%)

Query: 1214 KEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQR 1273
             E  LL  K+++++  +E       +L+ +   ++++++++   E             + 
Sbjct: 225  LELALLLAKLKELRKELEELEE---ELSRLEEELEELQEELEEAE-------------KE 268

Query: 1274 INLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTE 1333
            I   +  L +L EE+++L +  E+L+    +     +EG + L R+  +   +L+ +  E
Sbjct: 269  IEELKSELEELREELEELQE--ELLELKEEI---EELEGEISLLRERLE---ELENELEE 320

Query: 1334 TEKQINDAERQCKRTENLVTKTFPQ--------------FTQGQEENEKALQRLNEKITS 1379
             E+++ + + + +  +  + +                    + +E+    L+ L E   +
Sbjct: 321  LEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEA 380

Query: 1380 LEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLIN 1439
            L + L +L   + + R +                       L E     +  E   + ++
Sbjct: 381  LREELAELEAELAEIRNE-----------------------LEELKREIESLEERLERLS 417

Query: 1440 EKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANIL 1499
            E+    +E  K   + +E    +   E   + + E E + EE+   +   ++E       
Sbjct: 418  ERLEDLKEELK--ELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQEE 475

Query: 1500 AKEAYDHAASVRNKTAAYVA--STSNITKQLDEFLNA--PGA--TLADIRNISGLAVNKN 1553
             +      +S+  +     A    S   + + E L +  PG    +A++     + V + 
Sbjct: 476  LQRLEKELSSLEARLDRLEAEQRASQGVRAVLEALESGLPGVYGPVAEL-----IKVKEK 530

Query: 1554 IQSNPEQIRERA--NEINNVIKSLTDIDTILTETAGDLAKANDLKR----KANLTKAGAD 1607
                 E   E A  N +  V+    + + +  + A +  K N   R      +  K    
Sbjct: 531  Y----ETALEAALGNRLQAVV---VENEEVAKK-AIEFLKENKAGRATFLPLDRIKPLRS 582

Query: 1608 STKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNV 1667
               +     + + ++        E A++    D          + +DL+ A++ + +  +
Sbjct: 583  LKSDAAPGFLGLASDLIDFDPKYEPAVRFVLGDTL--------VVDDLEQARRLARKLRI 634

Query: 1668 KVKL------LQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQET 1721
            K ++      L E   SI  G     RS++  + E K LE+E+A  + Q   L    +  
Sbjct: 635  KYRIVTLDGDLVEPSGSITGGSRNK-RSSLAQKRELKELEEELAELEAQLEKLEEELKSL 693

Query: 1722 DNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNI 1781
             N+L S  +     R + + L  +  +L     A  ++L + ++     E EL EL + +
Sbjct: 694  KNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEEL 753

Query: 1782 AELQKRI 1788
             ELQ+R+
Sbjct: 754  EELQERL 760



 Score = 70.1 bits (172), Expect = 1e-11
 Identities = 51/360 (14%), Positives = 129/360 (35%), Gaps = 63/360 (17%)

Query: 1478 KAEEVFRGITTAKQESHAANILA--KEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAP 1535
            K   + +     + E   A + A  ++  +   S++N+  +       + +QL+E     
Sbjct: 659  KRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELE--- 715

Query: 1536 GATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDL 1595
                   R +  L   + + +  E++ +  + +  + + L +++  L E    L +  + 
Sbjct: 716  -------RQLEEL--KRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEE 766

Query: 1596 KRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDL 1655
                    A        +++    L E     +     ++ A+  + A  ++L  +    
Sbjct: 767  LESLEEALAKLKEEIEELEEKRQALQEELEELEEE---LEEAERRLDALERELESLEQRR 823

Query: 1656 DDAQQKSNETNVKVKLLQERLKSIQA--GFLQNGRSAM-----DVENEEKNLEKEVALAQ 1708
            +  +Q+  E   +++ L+E+L  ++     L+     +     ++E E++ LE E+   +
Sbjct: 824  ERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELE 883

Query: 1709 KQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLS------------------ 1750
            ++   L    +E +++L    E     R R + L  K  +L                   
Sbjct: 884  EEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTL 943

Query: 1751 --------------------VNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQS 1790
                                VN  A +++  E E  +   +S+  +L +   +L + I+ 
Sbjct: 944  ETELEREIERLEEEIEALGPVNLRA-IEEYEEVEERYEELKSQREDLEEAKEKLLEVIEE 1002



 Score = 61.7 bits (150), Expect = 3e-09
 Identities = 57/371 (15%), Positives = 127/371 (34%), Gaps = 51/371 (13%)

Query: 1442 ESKAEELRKHMIILSE-SNSAK---DYAETSKKL--INEKEAKAEEVFRGITT-----AK 1490
             +K EE RK   ++ E +  +K      E  +KL    E   + E++   +        +
Sbjct: 151  NAKPEERRK---LIEEAAGVSKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLER 207

Query: 1491 QESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAV 1550
            Q   A      +A      +    A  +A    + K+L+E          ++  +     
Sbjct: 208  QAEKAERYQELKAELRELEL----ALLLAKLKELRKELEELEEELSRLEEELEELQ---- 259

Query: 1551 NKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTK 1610
                    E++ E   EI  +   L ++         +L +  +   +        +   
Sbjct: 260  --------EELEEAEKEIEELKSELEELRE-------ELEELQEELLELKEEIEELEGEI 304

Query: 1611 NTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVK 1670
            +    + + L E     +  E  ++  K+ I A +++L +    L++ +Q   E     +
Sbjct: 305  S---LLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKE 361

Query: 1671 LLQERLKSIQAGF---LQNGRSAMDVENEEK--------NLEKEVALAQKQASGLRSRYQ 1719
             L+E+L ++        +  R  +     E          L++E+   +++   L  R +
Sbjct: 362  ELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLE 421

Query: 1720 ETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSK 1779
            +   +L          +   + L  +  +L         +L E E      + EL  L K
Sbjct: 422  DLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQEELQRLEK 481

Query: 1780 NIAELQKRIQS 1790
             ++ L+ R+  
Sbjct: 482  ELSSLEARLDR 492



 Score = 53.2 bits (128), Expect = 1e-06
 Identities = 74/439 (16%), Positives = 165/439 (37%), Gaps = 77/439 (17%)

Query: 1259 DPNGVGKTLGNTTQRINLAR-LALNDLTEEIKKLNK-TGEMLKENATLLQEN-NVEGAL- 1314
            +P+G   T G+  +R +LA+   L +L EE+ +L     ++ +E  +L  E  ++E  L 
Sbjct: 647  EPSGS-ITGGSRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLE 705

Query: 1315 ELTRQAHDTSLK---LKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQ 1371
            EL RQ  +   +   LK +    E+++   + + +  E  +     +  +  EE ++ L+
Sbjct: 706  ELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELE----ELEEELEELQERLE 761

Query: 1372 RLNEKITSLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYA 1431
             L E++ SLE+ L  L E I                                    ++  
Sbjct: 762  ELEEELESLEEALAKLKEEI---------------------------------EELEEKR 788

Query: 1432 ETSKKLINEKESKAEELRKHMIILSE----SNSAKDYAETSKKLINEKEAKAEEVFRGIT 1487
            +  ++ + E E + EE  + +  L          ++  E   + + E+  + EE    + 
Sbjct: 789  QALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELE 848

Query: 1488 TAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISG 1547
               +E        KE  +   + + +    +       ++L+E          ++R +  
Sbjct: 849  EELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEE----------ELRELES 898

Query: 1548 LAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKAN---DLKRKANLTKA 1604
                   +   E++RER  E+   ++ L      L E   +  +     +L+R+    + 
Sbjct: 899  ELAELKEEI--EKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEE 956

Query: 1605 GADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNE 1664
              ++    V      + E    ++  E  +++ ++D+  A++ L ++  +LD        
Sbjct: 957  EIEALGP-VNL--RAIEEYEEVEERYE-ELKSQREDLEEAKEKLLEVIEELD-------- 1004

Query: 1665 TNVKVKLLQERLKSIQAGF 1683
               K +  +E    I   F
Sbjct: 1005 -KEKRERFKETFDKINENF 1022



 Score = 44.7 bits (106), Expect = 5e-04
 Identities = 52/320 (16%), Positives = 108/320 (33%), Gaps = 63/320 (19%)

Query: 1214 KEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMN-----NEDPNGVGKTLG 1268
             E   LE  +E+++  +E   R   +L   L  +++  +Q+ +      E+   + + L 
Sbjct: 695  NELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELE 754

Query: 1269 NTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQEN--NVEGALELTR-------- 1318
               +R+      L  L E + KL +  E L+E    LQE    +E  LE           
Sbjct: 755  ELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALER 814

Query: 1319 ----------QAHDTSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEK 1368
                      +      +L+E+  E E+++++ E + +  E  + +   +     EE E 
Sbjct: 815  ELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEEL----EELEA 870

Query: 1369 ALQRLNEKITSLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAK 1428
              + L +++  LE+   +L E + +   +                       L+E     
Sbjct: 871  EKEELEDELKELEEEKEELEEELRELESE-----------------------LAELKEEI 907

Query: 1429 DYAETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITT 1488
            +      + +  K  + E     +    E           ++ I   E + E     +  
Sbjct: 908  EKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEEIEA-LGPV-- 964

Query: 1489 AKQESHAANILAKEAYDHAA 1508
                    N+ A E Y+   
Sbjct: 965  --------NLRAIEEYEEVE 976



 Score = 44.7 bits (106), Expect = 6e-04
 Identities = 43/276 (15%), Positives = 99/276 (35%), Gaps = 50/276 (18%)

Query: 1219 LEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIM-NNEDPNGVGKTLGNTTQRINLA 1277
            LE+ +  +K  +E        L      ++++++++       + + + L +  QR    
Sbjct: 770  LEEALAKLKEEIEELEEKRQALQ---EELEELEEELEEAERRLDALERELESLEQRRERL 826

Query: 1278 RLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQ 1337
               + +L EEI++L       +E    L+E   E   EL  +  +   +L+ +  E E +
Sbjct: 827  EQEIEELEEEIEEL-------EEKLDELEEELEELEKEL-EELKEELEELEAEKEELEDE 878

Query: 1338 INDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDKRGD 1397
            + + E + +  E  + +          E ++ +++L E++  LE  L  L   +      
Sbjct: 879  LKELEEEKEELEEELRE----LESELAELKEEIEKLRERLEELEAKLERLEVEL------ 928

Query: 1398 PCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIILSE 1457
                                          +   E    L  E E + E L + +  L  
Sbjct: 929  -------------------------PELEEELEEEYEDTLETELEREIERLEEEIEALGP 963

Query: 1458 SNSA--KDYAETSKKLINEKEAKAEEVFRGITTAKQ 1491
             N    ++Y E  ++   E +++ E++        +
Sbjct: 964  VNLRAIEEYEEVEERY-EELKSQREDLEEAKEKLLE 998



 Score = 44.3 bits (105), Expect = 6e-04
 Identities = 32/179 (17%), Positives = 71/179 (39%), Gaps = 14/179 (7%)

Query: 1214 KEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQR 1273
            +E E LE+K+++++  +E   +    L  +   +++++ +         +   L    + 
Sbjct: 835  EEIEELEEKLDELEEELEELEKE---LEELKEELEELEAEKEE------LEDELKELEEE 885

Query: 1274 INLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELT-RQAHDTSLKLKEQST 1332
                   L +L  E+ +L +  E L+E    L+       +EL   +            T
Sbjct: 886  KEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLET 945

Query: 1333 ETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELI 1391
            E E++I   E + +    +  +   ++    EE E+  + L  +   LE+    L E+I
Sbjct: 946  ELEREIERLEEEIEALGPVNLRAIEEY----EEVEERYEELKSQREDLEEAKEKLLEVI 1000


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
            recombination, and repair].
          Length = 908

 Score = 80.6 bits (199), Expect = 5e-15
 Identities = 93/552 (16%), Positives = 210/552 (38%), Gaps = 72/552 (13%)

Query: 1213 TKEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQ 1272
             +  E LE+KIE ++ L          L   L  +  + +++    +       L +  +
Sbjct: 280  ERLLEELEEKIERLEELEREIEE----LEEELEGLRALLEELEELLEK------LKSLEE 329

Query: 1273 RINLARLALNDLTEEIKKL----NKTGEMLKENATLLQENN------VEGALELTRQAHD 1322
            R+      L  L  E+++L    N+  ++L+E    L+E        +E ALE  +Q  +
Sbjct: 330  RLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEE 389

Query: 1323 TSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEK 1382
               +LKE+  E    + + + + +  E  + +      +  EE E+ +++L E+I  LE 
Sbjct: 390  AIQELKEELAELSAALEEIQEELEELEKELEEL----ERELEELEEEIKKLEEQINQLES 445

Query: 1383 NLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKE 1442
                + EL     G+ C  +C G    +                  +  E   + + E+ 
Sbjct: 446  KELMIAEL--AGAGEKCP-VC-GQELPE-----------EHEKELLELYELELEELEEEL 490

Query: 1443 SKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKE 1502
            S+ +E  +    + E        E     + E E   +E        ++     N+L + 
Sbjct: 491  SREKEEAELREEIEELEKELRELEEELIELLELEEALKEELE-----EKLEKLENLLEEL 545

Query: 1503 AYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIR 1562
                      +    +    +  ++L E L      L  +R        + ++   E+++
Sbjct: 546  EELKEKLQLQQLKEELRQLEDRLQELKELL----EELRLLRT-----RKEELEELRERLK 596

Query: 1563 ERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTE 1622
            E   ++  + + L+ ++ +L       A+ N+L+      +  ++  K  +Q  ++ L +
Sbjct: 597  ELKKKLKELEERLSQLEELLQSLELSEAE-NELEEA--EEELESELEKLNLQAELEELLQ 653

Query: 1623 ARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKL--LQERLKSIQ 1680
            A   +   E  ++  + +I   R++L +I N+    ++      ++ +L  L+E L+ + 
Sbjct: 654  AALEE--LEEKVEELEAEI---RRELQRIENEEQLEEKLEELEQLEEELEQLREELEELL 708

Query: 1681 AGFLQNGRSAMDVENEEKNLE---KEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRA 1737
                +  +   ++E+ +  LE   KE+   +K    L     E   + L KA        
Sbjct: 709  KKLGEIEQLIEELESRKAELEELKKELEKLEKALELL-----EELREKLGKAGLRADILR 763

Query: 1738 RGQM-LLNKASQ 1748
                 +  +A++
Sbjct: 764  NLLAQIEAEANE 775



 Score = 69.4 bits (170), Expect = 1e-11
 Identities = 98/559 (17%), Positives = 202/559 (36%), Gaps = 80/559 (14%)

Query: 1281 LNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQIND 1340
            L  L EE+K+L K  E+ +E     +E  +E  +E   +      + KE+  E + ++ +
Sbjct: 205  LEALEEELKELKKLEEIQEE----QEEEELEQEIEALEERLAELEEEKERLEELKARLLE 260

Query: 1341 AERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNEL---------I 1391
             E        +  +   +  +  EE E+ ++RL E    +E+   +L  L         +
Sbjct: 261  IESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEEL 320

Query: 1392 CDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEE---- 1447
             +K     + L       K           SE     +      KL+ E+  + EE    
Sbjct: 321  LEKLKSLEERL--EKLEEKL------EKLESELEELAEEKNELAKLLEERLKELEERLEE 372

Query: 1448 --------------LRKHMIILSESNSAKDYA-ETSKKLINEKEAKAEEVFRGITTAKQE 1492
                          L + +  L E  +    A E  ++ + E E + EE+ R +   ++E
Sbjct: 373  LEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEE 432

Query: 1493 SHAANILAKEAYDHAASVRNKTAAYVASTSNI----TKQLDEFLNAPGATLADIRNISGL 1548
                 I   E   +    +    A +A          ++L E        L ++  +   
Sbjct: 433  -----IKKLEEQINQLESKELMIAELAGAGEKCPVCGQELPEEHE---KELLELYELELE 484

Query: 1549 AVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADS 1608
             + + +    ++  E   EI  + K L +++  L E         +   +  L K     
Sbjct: 485  ELEEELSRE-KEEAELREEIEELEKELRELEEELIELLELEEALKEEL-EEKLEKL---- 538

Query: 1609 TKNTVQKIVDVLTEARTAQ-----DMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSN 1663
             +N ++++ ++  + +  Q        E  +Q  K+ +   R  L     +L++ +++  
Sbjct: 539  -ENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRL-LRTRKEELEELRERLK 596

Query: 1664 ETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEV--ALAQKQASG-----LRS 1716
            E   K+K L+ERL  ++   L       + ENE +  E+E+   L +          L++
Sbjct: 597  ELKKKLKELEERLSQLEE--LLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQA 654

Query: 1717 RYQETDNKL------LSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQ 1770
              +E + K+      + +         + +  L +  QL        ++L E        
Sbjct: 655  ALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEI 714

Query: 1771 ESELTELSKNIAELQKRIQ 1789
            E  + EL    AEL++  +
Sbjct: 715  EQLIEELESRKAELEELKK 733



 Score = 69.0 bits (169), Expect = 2e-11
 Identities = 65/474 (13%), Positives = 154/474 (32%), Gaps = 88/474 (18%)

Query: 1364 EENEKALQRLNEKITSLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSE 1423
            E+ EK  + L E I   +  + +L   +                            +   
Sbjct: 167  EKYEKLSELLKEVIKEAKAKIEELEGQL----------------------------SELL 198

Query: 1424 SNSAKDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVF 1483
             +         ++L  ++  K EE+++           +   E   +L  EKE   E   
Sbjct: 199  EDIEDLLEALEEEL--KELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKA 256

Query: 1484 RGITTAKQESHAANILA---KEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLA 1540
            R +     E  A  I     +E       +  K          I +  +E         A
Sbjct: 257  RLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGL----RA 312

Query: 1541 DIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKAN 1600
             +  +      + +    + + ER  ++   ++ L      L E   +LAK   L+ +  
Sbjct: 313  LLEEL------EELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKL--LEERLK 364

Query: 1601 LTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQ 1660
              +   +  +  ++K ++ L +   A    +  +      +   +++L ++  +L++ ++
Sbjct: 365  ELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELER 424

Query: 1661 KSNETNVKVKLLQERLKSIQAG--FLQNGRSAMDV--------------------ENEEK 1698
            +  E   ++K L+E++  +++    +     A +                     E E +
Sbjct: 425  ELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELE 484

Query: 1699 NLEKEVALAQKQASGLRSRYQETDNKL-------------------LSKAESSGLKRARG 1739
             LE+E++   K+ + LR   +E + +L                     + +   L+    
Sbjct: 485  ELEEELSRE-KEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLE 543

Query: 1740 QMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCIN 1793
            ++   K          +L+QL +          EL  L     EL++ ++  + 
Sbjct: 544  ELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEE-LRERLK 596



 Score = 54.0 bits (130), Expect = 8e-07
 Identities = 49/284 (17%), Positives = 110/284 (38%), Gaps = 36/284 (12%)

Query: 1527 QLDEFLNAPGATLADI-RNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTET 1585
            + D FL +      +I   + GL   + +    E ++E   E    I+ L    + L E 
Sbjct: 144  EFDAFLKSKPKERKEILDELFGLEKYEKLS---ELLKEVIKEAKAKIEELEGQLSELLED 200

Query: 1586 AGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAAR 1645
              DL +A + + K           +           E    +   E A++    ++   +
Sbjct: 201  IEDLLEALEEELKELKKLEEIQEEQ-----------EEEELEQEIE-ALEERLAELEEEK 248

Query: 1646 KDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLE---K 1702
            + L ++   L + +    E    +K+ +E L+ ++    +       +E + + LE   +
Sbjct: 249  ERLEELKARLLEIESLELE---ALKIREEELRELERLLEE-------LEEKIERLEELER 298

Query: 1703 EVALAQKQASGLRSRYQETDNKL-LSKAESSGLKRARGQM--LLNKASQLSVNTTAKLKQ 1759
            E+   +++  GLR+  +E +  L   K+    L++   ++  L ++  +L+       K 
Sbjct: 299  EIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKL 358

Query: 1760 LNETEAMFNSQ----ESELTELSKNIAELQKRIQSCINFIVDKS 1799
            L E       +    E EL +  + + +L++ IQ     + + S
Sbjct: 359  LEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELS 402



 Score = 48.6 bits (116), Expect = 4e-05
 Identities = 43/328 (13%), Positives = 112/328 (34%), Gaps = 20/328 (6%)

Query: 1470 KLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLD 1529
              +  K  + +E+   +    ++    + L KE    A +   +    ++      + L 
Sbjct: 147  AFLKSKPKERKEILDELF-GLEKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLL 205

Query: 1530 EFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDT-ILTETAGD 1588
            E L      L  +  I      + ++   E + ER  E+    + L ++   +L   + +
Sbjct: 206  EALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLE 265

Query: 1589 LAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAK-DDISAARKD 1647
            L      + +    +   +  +  ++++ ++  E    +   E+    A  +++    + 
Sbjct: 266  LEALKIREEELRELERLLEELEEKIERLEELEREI--EELEEELEGLRALLEELEELLEK 323

Query: 1648 LSQITNDLDDAQQKSNETNVKVKLLQERLKS----IQAGFLQNGRSAMDVENEEKNLEKE 1703
            L  +   L+  ++K  +   +++ L E        ++    +      ++E E +   + 
Sbjct: 324  LKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALER 383

Query: 1704 VALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNET 1763
            +   ++    L+    E    L    E            L +  +       +L++L E 
Sbjct: 384  LKQLEEAIQELKEELAELSAALEEIQEE-----------LEELEKELEELERELEELEEE 432

Query: 1764 EAMFNSQESELTELSKNIAELQKRIQSC 1791
                  Q ++L      IAEL    + C
Sbjct: 433  IKKLEEQINQLESKELMIAELAGAGEKC 460


>gnl|CDD|214543 smart00180, EGF_Lam, Laminin-type epidermal growth factor-like
           domai. 
          Length = 46

 Score = 68.1 bits (167), Expect = 3e-14
 Identities = 33/46 (71%), Positives = 33/46 (71%)

Query: 773 CDCDPTGSTSKYCDQLGGNCVCKPNVVGRRCDRCAPGTYGFGPEGC 818
           CDCDP GS S  CD   G C CKPNV GRRCDRCAPG YG GP GC
Sbjct: 1   CDCDPGGSASGTCDPDTGQCECKPNVTGRRCDRCAPGYYGDGPPGC 46



 Score = 58.9 bits (143), Expect = 6e-11
 Identities = 18/42 (42%), Positives = 26/42 (61%)

Query: 821 CDCNHVGALDNICDVVSGQCKCRAQTYGRECDQCSPGYWNYP 862
           CDC+  G+    CD  +GQC+C+    GR CD+C+PGY+   
Sbjct: 1   CDCDPGGSASGTCDPDTGQCECKPNVTGRRCDRCAPGYYGDG 42



 Score = 57.7 bits (140), Expect = 1e-10
 Identities = 22/47 (46%), Positives = 29/47 (61%), Gaps = 1/47 (2%)

Query: 1079 CDCDPIGSVDVTECNEYTGQCQCKSGFGGRQCDQCQTNFWGNPNVEC 1125
            CDCDP GS   T C+  TGQC+CK    GR+CD+C   ++G+    C
Sbjct: 1    CDCDPGGSASGT-CDPDTGQCECKPNVTGRRCDRCAPGYYGDGPPGC 46



 Score = 56.6 bits (137), Expect = 3e-10
 Identities = 29/56 (51%), Positives = 32/56 (57%), Gaps = 10/56 (17%)

Query: 404 CDCDPGGSLDDGICDPYTDDVLVSGKCHCKPNVGGRRCDMCKDGFWNFNETNPNGC 459
           CDCDPGGS     CDP T      G+C CKPNV GRRCD C  G++      P GC
Sbjct: 1   CDCDPGGSASGT-CDPDT------GQCECKPNVTGRRCDRCAPGYYGD---GPPGC 46



 Score = 53.9 bits (130), Expect = 3e-09
 Identities = 18/43 (41%), Positives = 26/43 (60%), Gaps = 1/43 (2%)

Query: 462 CTCNIQGTINSQGCNKDTGDCTCKRNVEGRDCNQCLPHFWGLS 504
           C C+  G+ +   C+ DTG C CK NV GR C++C P ++G  
Sbjct: 1   CDCDPGGSASGT-CDPDTGQCECKPNVTGRRCDRCAPGYYGDG 42



 Score = 53.1 bits (128), Expect = 5e-09
 Identities = 23/46 (50%), Positives = 26/46 (56%), Gaps = 1/46 (2%)

Query: 1128 CDCDPSGSATSQCHQNNGTCVCHQGIGGVRCDTCARGYIG-TAPDC 1172
            CDCDP GSA+  C  + G C C   + G RCD CA GY G   P C
Sbjct: 1    CDCDPGGSASGTCDPDTGQCECKPNVTGRRCDRCAPGYYGDGPPGC 46



 Score = 50.4 bits (121), Expect = 6e-08
 Identities = 21/49 (42%), Positives = 25/49 (51%), Gaps = 3/49 (6%)

Query: 1028 CTCNILGIDHSKGPCDRTTGQCPCLPNVIGVSCDRCLKNHWKIASGTGC 1076
            C C+  G   + G CD  TGQC C PNV G  CDRC   ++      GC
Sbjct: 1    CDCDPGG--SASGTCDPDTGQCECKPNVTGRRCDRCAPGYYG-DGPPGC 46



 Score = 45.8 bits (109), Expect = 2e-06
 Identities = 20/39 (51%), Positives = 24/39 (61%)

Query: 514 CDCDPGASYDNNCDVISGQCKCKPHASGRTCSTPEQSYY 552
           CDCDPG S    CD  +GQC+CKP+ +GR C      YY
Sbjct: 1   CDCDPGGSASGTCDPDTGQCECKPNVTGRRCDRCAPGYY 39



 Score = 39.6 bits (93), Expect = 3e-04
 Identities = 20/46 (43%), Positives = 24/46 (52%), Gaps = 4/46 (8%)

Query: 867 CICN--GHADL-CDSKTGTCISCRNSTDGENCERCINGYYGDPRLG 909
           C C+  G A   CD  TG C  C+ +  G  C+RC  GYYGD   G
Sbjct: 1   CDCDPGGSASGTCDPDTGQCE-CKPNVTGRRCDRCAPGYYGDGPPG 45



 Score = 38.8 bits (91), Expect = 6e-04
 Identities = 15/44 (34%), Positives = 24/44 (54%), Gaps = 3/44 (6%)

Query: 976  CDCSNNTDLALPGNCDTLTGKCLQCRYDTEGDHCQVCKAGFFGN 1019
            CDC  +   +  G CD  TG+C +C+ +  G  C  C  G++G+
Sbjct: 1    CDC--DPGGSASGTCDPDTGQC-ECKPNVTGRRCDRCAPGYYGD 41



 Score = 36.1 bits (84), Expect = 0.005
 Identities = 14/50 (28%), Positives = 17/50 (34%), Gaps = 7/50 (14%)

Query: 278 CSCYGHASRCLPISEADTKPDMVHGRCECTHNTQGLNCEKCIDFYNDLPW 327
           C C    S       A    D   G+CEC  N  G  C++C   Y     
Sbjct: 1   CDCDPGGS-------ASGTCDPDTGQCECKPNVTGRRCDRCAPGYYGDGP 43



 Score = 35.7 bits (83), Expect = 0.008
 Identities = 16/54 (29%), Positives = 23/54 (42%), Gaps = 15/54 (27%)

Query: 341 CNCN---NHSTKCHFDEAVYNSTGRISGGVCDSCEHNTEGRNCERCKQYFYKDP 391
           C+C+   + S  C  D            G C+ C+ N  GR C+RC   +Y D 
Sbjct: 1   CDCDPGGSASGTCDPD-----------TGQCE-CKPNVTGRRCDRCAPGYYGDG 42



 Score = 34.2 bits (79), Expect = 0.029
 Identities = 18/50 (36%), Positives = 21/50 (42%), Gaps = 8/50 (16%)

Query: 917 CPCPGTVESGLSYADTCQLDPRSQDVICECKEGYRGTRCDGCLDNYFGNP 966
           C C      G S + TC  D       CECK    G RCD C   Y+G+ 
Sbjct: 1   CDCD----PGGSASGTCDPD----TGQCECKPNVTGRRCDRCAPGYYGDG 42


>gnl|CDD|238012 cd00055, EGF_Lam, Laminin-type epidermal growth factor-like domain;
           laminins are the major noncollagenous components of
           basement membranes that mediate cell adhesion, growth
           migration, and differentiation; the laminin-type
           epidermal growth factor-like module occurs in tandem
           arrays; the domain contains 4 disulfide bonds (loops
           a-d) the first three resemble epidermal growth factor
           (EGF); the number of copies of this domain in the
           different forms of laminins is highly variable ranging
           from 3 up to 22 copies.
          Length = 50

 Score = 65.8 bits (161), Expect = 2e-13
 Identities = 28/49 (57%), Positives = 30/49 (61%), Gaps = 2/49 (4%)

Query: 773 CDCDPTGSTSKYCDQLGGNCVCKPNVVGRRCDRCAPGTYGF--GPEGCK 819
           CDC+  GS S  CD   G C CKPN  GRRCDRCAPG YG      GC+
Sbjct: 2   CDCNGHGSLSGQCDPGTGQCECKPNTTGRRCDRCAPGYYGLPSQGGGCQ 50



 Score = 61.2 bits (149), Expect = 9e-12
 Identities = 20/44 (45%), Positives = 29/44 (65%), Gaps = 1/44 (2%)

Query: 1078 PCDCDPIGSVDVTECNEYTGQCQCKSGFGGRQCDQCQTNFWGNP 1121
            PCDC+  GS+   +C+  TGQC+CK    GR+CD+C   ++G P
Sbjct: 1    PCDCNGHGSLS-GQCDPGTGQCECKPNTTGRRCDRCAPGYYGLP 43



 Score = 60.8 bits (148), Expect = 1e-11
 Identities = 22/44 (50%), Positives = 30/44 (68%)

Query: 820 ACDCNHVGALDNICDVVSGQCKCRAQTYGRECDQCSPGYWNYPT 863
            CDCN  G+L   CD  +GQC+C+  T GR CD+C+PGY+  P+
Sbjct: 1   PCDCNGHGSLSGQCDPGTGQCECKPNTTGRRCDRCAPGYYGLPS 44



 Score = 59.3 bits (144), Expect = 4e-11
 Identities = 26/58 (44%), Positives = 32/58 (55%), Gaps = 8/58 (13%)

Query: 403 RCDCDPGGSLDDGICDPYTDDVLVSGKCHCKPNVGGRRCDMCKDGFWNFNETNPNGCE 460
            CDC+  GSL  G CDP T      G+C CKPN  GRRCD C  G++    +   GC+
Sbjct: 1   PCDCNGHGSLS-GQCDPGT------GQCECKPNTTGRRCDRCAPGYYGL-PSQGGGCQ 50



 Score = 56.2 bits (136), Expect = 5e-10
 Identities = 22/52 (42%), Positives = 31/52 (59%), Gaps = 2/52 (3%)

Query: 461 PCTCNIQGTINSQGCNKDTGDCTCKRNVEGRDCNQCLPHFWGLSESEEYGCK 512
           PC CN  G+++ Q C+  TG C CK N  GR C++C P ++GL      GC+
Sbjct: 1   PCDCNGHGSLSGQ-CDPGTGQCECKPNTTGRRCDRCAPGYYGLPSQGG-GCQ 50



 Score = 55.1 bits (133), Expect = 1e-09
 Identities = 21/49 (42%), Positives = 23/49 (46%)

Query: 1127 PCDCDPSGSATSQCHQNNGTCVCHQGIGGVRCDTCARGYIGTAPDCSPC 1175
            PCDC+  GS + QC    G C C     G RCD CA GY G       C
Sbjct: 1    PCDCNGHGSLSGQCDPGTGQCECKPNTTGRRCDRCAPGYYGLPSQGGGC 49



 Score = 50.8 bits (122), Expect = 3e-08
 Identities = 19/43 (44%), Positives = 24/43 (55%)

Query: 513 PCDCDPGASYDNNCDVISGQCKCKPHASGRTCSTPEQSYYSPP 555
           PCDC+   S    CD  +GQC+CKP+ +GR C      YY  P
Sbjct: 1   PCDCNGHGSLSGQCDPGTGQCECKPNTTGRRCDRCAPGYYGLP 43



 Score = 48.5 bits (116), Expect = 3e-07
 Identities = 23/51 (45%), Positives = 27/51 (52%), Gaps = 3/51 (5%)

Query: 1028 CTCNILGIDHSKGPCDRTTGQCPCLPNVIGVSCDRCLKNHWKIAS-GTGCE 1077
            C CN  G     G CD  TGQC C PN  G  CDRC   ++ + S G GC+
Sbjct: 2    CDCNGHGSLS--GQCDPGTGQCECKPNTTGRRCDRCAPGYYGLPSQGGGCQ 50



 Score = 48.1 bits (115), Expect = 3e-07
 Identities = 23/47 (48%), Positives = 26/47 (55%), Gaps = 4/47 (8%)

Query: 866 RCICNGHADL---CDSKTGTCISCRNSTDGENCERCINGYYGDPRLG 909
            C CNGH  L   CD  TG C  C+ +T G  C+RC  GYYG P  G
Sbjct: 1   PCDCNGHGSLSGQCDPGTGQCE-CKPNTTGRRCDRCAPGYYGLPSQG 46



 Score = 47.0 bits (112), Expect = 1e-06
 Identities = 18/53 (33%), Positives = 23/53 (43%), Gaps = 7/53 (13%)

Query: 277 SCSCYGHASRCLPISEADTKPDMVHGRCECTHNTQGLNCEKCIDFYNDLPWKP 329
            C C GH S          + D   G+CEC  NT G  C++C   Y  LP + 
Sbjct: 1   PCDCNGHGSL-------SGQCDPGTGQCECKPNTTGRRCDRCAPGYYGLPSQG 46



 Score = 42.7 bits (101), Expect = 3e-05
 Identities = 19/50 (38%), Positives = 27/50 (54%), Gaps = 3/50 (6%)

Query: 975  PCDCSNNTDLALPGNCDTLTGKCLQCRYDTEGDHCQVCKAGFFGNALEQN 1024
            PCDC  N   +L G CD  TG+C +C+ +T G  C  C  G++G   +  
Sbjct: 1    PCDC--NGHGSLSGQCDPGTGQC-ECKPNTTGRRCDRCAPGYYGLPSQGG 47



 Score = 39.6 bits (93), Expect = 4e-04
 Identities = 17/54 (31%), Positives = 23/54 (42%), Gaps = 15/54 (27%)

Query: 341 CNCNNHSTK---CHFDEAVYNSTGRISGGVCDSCEHNTEGRNCERCKQYFYKDP 391
           C+CN H +    C               G C+ C+ NT GR C+RC   +Y  P
Sbjct: 2   CDCNGHGSLSGQCDPG-----------TGQCE-CKPNTTGRRCDRCAPGYYGLP 43



 Score = 39.3 bits (92), Expect = 4e-04
 Identities = 21/59 (35%), Positives = 25/59 (42%), Gaps = 9/59 (15%)

Query: 916 PCPCPGTVESGLSYADTCQLDPRSQDVICECKEGYRGTRCDGCLDNYFGNPQERGGSCE 974
           PC C G      S +  C          CECK    G RCD C   Y+G P  +GG C+
Sbjct: 1   PCDCNGHG----SLSGQC----DPGTGQCECKPNTTGRRCDRCAPGYYGLP-SQGGGCQ 50


>gnl|CDD|215680 pfam00053, Laminin_EGF, Laminin EGF-like (Domains III and V).  This
           family is like pfam00008 but has 8 conserved cysteines
           instead of six.
          Length = 49

 Score = 65.1 bits (159), Expect = 3e-13
 Identities = 26/49 (53%), Positives = 30/49 (61%)

Query: 773 CDCDPTGSTSKYCDQLGGNCVCKPNVVGRRCDRCAPGTYGFGPEGCKAC 821
           CDC+P GS S  CD   G C+CKP V GR CDRC PG YG   +  + C
Sbjct: 1   CDCNPHGSLSDTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPSDPGQGC 49



 Score = 64.3 bits (157), Expect = 6e-13
 Identities = 27/56 (48%), Positives = 31/56 (55%), Gaps = 7/56 (12%)

Query: 404 CDCDPGGSLDDGICDPYTDDVLVSGKCHCKPNVGGRRCDMCKDGFWNFNETNPNGC 459
           CDC+P GSL D  CDP T      G+C CKP V GR CD CK G++        GC
Sbjct: 1   CDCNPHGSLSDT-CDPET------GQCLCKPGVTGRHCDRCKPGYYGLPSDPGQGC 49



 Score = 61.2 bits (149), Expect = 7e-12
 Identities = 22/50 (44%), Positives = 30/50 (60%), Gaps = 1/50 (2%)

Query: 1079 CDCDPIGSVDVTECNEYTGQCQCKSGFGGRQCDQCQTNFWGNPNVECFPC 1128
            CDC+P GS+    C+  TGQC CK G  GR CD+C+  ++G P+     C
Sbjct: 1    CDCNPHGSLS-DTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPSDPGQGC 49



 Score = 58.9 bits (143), Expect = 5e-11
 Identities = 21/43 (48%), Positives = 28/43 (65%)

Query: 821 CDCNHVGALDNICDVVSGQCKCRAQTYGRECDQCSPGYWNYPT 863
           CDCN  G+L + CD  +GQC C+    GR CD+C PGY+  P+
Sbjct: 1   CDCNPHGSLSDTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPS 43



 Score = 55.1 bits (133), Expect = 1e-09
 Identities = 20/50 (40%), Positives = 29/50 (58%), Gaps = 1/50 (2%)

Query: 462 CTCNIQGTINSQGCNKDTGDCTCKRNVEGRDCNQCLPHFWGLSESEEYGC 511
           C CN  G+++   C+ +TG C CK  V GR C++C P ++GL      GC
Sbjct: 1   CDCNPHGSLSDT-CDPETGQCLCKPGVTGRHCDRCKPGYYGLPSDPGQGC 49



 Score = 53.1 bits (128), Expect = 7e-09
 Identities = 21/51 (41%), Positives = 25/51 (49%), Gaps = 2/51 (3%)

Query: 1128 CDCDPSGSATSQCHQNNGTCVCHQGIGGVRCDTCARGYIGTAPDCSPCGEC 1178
            CDC+P GS +  C    G C+C  G+ G  CD C  GY G   D  P   C
Sbjct: 1    CDCNPHGSLSDTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPSD--PGQGC 49



 Score = 51.2 bits (123), Expect = 3e-08
 Identities = 21/51 (41%), Positives = 23/51 (45%), Gaps = 4/51 (7%)

Query: 1028 CTCNILGIDHSKGPCDRTTGQCPCLPNVIGVSCDRCLKNHW--KIASGTGC 1076
            C CN  G       CD  TGQC C P V G  CDRC   ++      G GC
Sbjct: 1    CDCNPHGSLS--DTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPSDPGQGC 49



 Score = 43.9 bits (104), Expect = 1e-05
 Identities = 20/43 (46%), Positives = 24/43 (55%), Gaps = 4/43 (9%)

Query: 867 CICNGHADL---CDSKTGTCISCRNSTDGENCERCINGYYGDP 906
           C CN H  L   CD +TG C  C+    G +C+RC  GYYG P
Sbjct: 1   CDCNPHGSLSDTCDPETGQC-LCKPGVTGRHCDRCKPGYYGLP 42



 Score = 42.3 bits (100), Expect = 4e-05
 Identities = 16/31 (51%), Positives = 20/31 (64%)

Query: 514 CDCDPGASYDNNCDVISGQCKCKPHASGRTC 544
           CDC+P  S  + CD  +GQC CKP  +GR C
Sbjct: 1   CDCNPHGSLSDTCDPETGQCLCKPGVTGRHC 31



 Score = 41.2 bits (97), Expect = 1e-04
 Identities = 17/45 (37%), Positives = 23/45 (51%), Gaps = 3/45 (6%)

Query: 976  CDCSNNTDLALPGNCDTLTGKCLQCRYDTEGDHCQVCKAGFFGNA 1020
            CDC+ +  L     CD  TG+C  C+    G HC  CK G++G  
Sbjct: 1    CDCNPHGSL--SDTCDPETGQC-LCKPGVTGRHCDRCKPGYYGLP 42



 Score = 39.6 bits (93), Expect = 4e-04
 Identities = 19/57 (33%), Positives = 23/57 (40%), Gaps = 8/57 (14%)

Query: 917 CPCPGTVESGLSYADTCQLDPRSQDVICECKEGYRGTRCDGCLDNYFGNPQERGGSC 973
           C C        S +DTC      +   C CK G  G  CD C   Y+G P + G  C
Sbjct: 1   CDCNPHG----SLSDTC----DPETGQCLCKPGVTGRHCDRCKPGYYGLPSDPGQGC 49



 Score = 38.9 bits (91), Expect = 6e-04
 Identities = 16/55 (29%), Positives = 23/55 (41%), Gaps = 15/55 (27%)

Query: 341 CNCNNHSTK---CHFDEAVYNSTGRISGGVCDSCEHNTEGRNCERCKQYFYKDPN 392
           C+CN H +    C               G C  C+    GR+C+RCK  +Y  P+
Sbjct: 1   CDCNPHGSLSDTCDP-----------ETGQCL-CKPGVTGRHCDRCKPGYYGLPS 43



 Score = 36.6 bits (85), Expect = 0.004
 Identities = 17/58 (29%), Positives = 21/58 (36%), Gaps = 13/58 (22%)

Query: 278 CSCYGHAS---RCLPISEADTKPDMVHGRCECTHNTQGLNCEKCIDFYNDLPWKPAFG 332
           C C  H S    C          D   G+C C     G +C++C   Y  LP  P  G
Sbjct: 1   CDCNPHGSLSDTC----------DPETGQCLCKPGVTGRHCDRCKPGYYGLPSDPGQG 48


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 74.3 bits (183), Expect = 5e-13
 Identities = 115/636 (18%), Positives = 237/636 (37%), Gaps = 112/636 (17%)

Query: 1216 FELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQRIN 1275
            +E   K + +V   ++            +   +++++ I   E      K L    + IN
Sbjct: 160  YENAYKNLGEVIKEIKRRIERLEKF---IKRTENIEELIKEKE------KELEEVLREIN 210

Query: 1276 LARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLK-----LKEQ 1330
                 L +L EE++KL K  + L+E    ++E   E     + +     L+     L+E+
Sbjct: 211  EISSELPELREELEKLEKEVKELEELKEEIEELEKE---LESLEGSKRKLEEKIRELEER 267

Query: 1331 STETEKQINDAERQCKRTENL---------VTKTFPQFTQGQEENEKALQRLNEKITSLE 1381
              E +K+I + E + K  + L         +++ + ++     E EK L RL E+I  +E
Sbjct: 268  IEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIE 327

Query: 1382 KNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEK 1441
            + + +L E                               L E     +  E   +L  E 
Sbjct: 328  ERIKELEEKEERLE-------------------ELKK-KLKELEKRLEELEERHELYEEA 367

Query: 1442 ESKAEELRKHMIILSESNSAK-----DYAETSKKLINEKEAKAEEVFRGITTAKQESHAA 1496
            ++K EEL +    L+     K     +  E +K+ I E+ +K       +    +E   A
Sbjct: 368  KAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKA 427

Query: 1497 NILAKEAY------------DHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRN 1544
                K+A             +H   +  +   Y A    I K+L E          ++R 
Sbjct: 428  IEELKKAKGKCPVCGRELTEEHRKELLEE---YTAELKRIEKELKEIEEKERKLRKELRE 484

Query: 1545 ISGLAVN--------------KNIQS-----NPEQIRERANEINNVIKSLTDIDTILTET 1585
            +  +                 K ++      N E++ ++A E   + + L  +   +   
Sbjct: 485  LEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSL 544

Query: 1586 AGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTE---------ARTAQDMAE----- 1631
              +L K  +LK+K    +   D  +  + +++  L E             +++       
Sbjct: 545  KKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEY 604

Query: 1632 VAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAM 1691
            + ++ A+ ++    K+L ++  +LD A ++  ET  +++ L++ L+ ++  +  +     
Sbjct: 605  LELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKY--SEEEYE 662

Query: 1692 DVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLS-KAESSGLKRARGQM-LLNKA-SQ 1748
            ++  E   L +E+A  + +   L  R +E    L   K E    ++A+ ++  L KA  +
Sbjct: 663  ELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALER 722

Query: 1749 LSV--NTTAKLKQLNETEAMFNSQE------SELTE 1776
            +        K K L +  A+    E       ELTE
Sbjct: 723  VEELREKVKKYKALLKERALSKVGEIASEIFEELTE 758



 Score = 51.6 bits (124), Expect = 4e-06
 Identities = 111/586 (18%), Positives = 209/586 (35%), Gaps = 133/586 (22%)

Query: 1252 KQIMNNEDPNGVGKTLGNTTQRINLARLALNDL---TEEIKKLNKTGEMLKENATLLQEN 1308
            +QI+  +D     K LG   + I      L      TE I++L K  E  KE   +L+E 
Sbjct: 152  RQILGLDDYENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKE--KELEEVLREI 209

Query: 1309 NVEGALELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEEN-- 1366
            N                ++  +  E  +++   E++ K  E    +   +  + ++E   
Sbjct: 210  N----------------EISSELPELREELEKLEKEVKELE----ELKEEIEELEKELES 249

Query: 1367 -EKALQRLNEKITSLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESN 1425
             E + ++L EKI  LE+ + +L + I +                           + E  
Sbjct: 250  LEGSKRKLEEKIRELEERIEELKKEIEELE-----------------------EKVKELK 286

Query: 1426 SAKDYAETSKKLINEKESKAEELRKHMIILSE-SNSAKDYAETSKKLINEKEAKAEEVFR 1484
              K+ AE   KL    E   +ELR+    LS          E  K+L  EKE + EE+ +
Sbjct: 287  ELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKEL-EEKEERLEELKK 345

Query: 1485 GITTAK------QESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGAT 1538
             +   +      +E H     AK   +    ++ +      +   + K+L+E   A    
Sbjct: 346  KLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGL--TPEKLEKELEELEKA---- 399

Query: 1539 LADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTD------------------ID- 1579
                         + I+    +I  R  E+   IK L                    +  
Sbjct: 400  ------------KEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTE 447

Query: 1580 ----TILTETAGDLAK-ANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAI 1634
                 +L E   +L +   +LK      +      +  ++++  VL +      + E+A 
Sbjct: 448  EHRKELLEEYTAELKRIEKELKEIEEKER----KLRKELRELEKVLKKESELIKLKELA- 502

Query: 1635 QTAKDDISAARKDLSQITNDLDDAQQKSNE-TNVKVKL---------LQERLKSIQAGFL 1684
                + +    + L +   +L++ ++K+ E   +K KL         L++ L+ ++   L
Sbjct: 503  ----EQLKELEEKLKKY--NLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEE--L 554

Query: 1685 QNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESS-----GLKRARG 1739
            +   +   +E +   LE+E+A   K+   L     E   + L + E        LK A  
Sbjct: 555  KKKLAE--LEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEK 612

Query: 1740 QM--LLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAE 1783
            ++     +  +L        ++L ETE        EL EL K  +E
Sbjct: 613  ELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSE 658



 Score = 40.4 bits (95), Expect = 0.011
 Identities = 37/248 (14%), Positives = 97/248 (39%), Gaps = 36/248 (14%)

Query: 1542 IRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANL 1601
            +R I GL   +N   N  ++ +   EI   I+ L             + +  +++     
Sbjct: 151  VRQILGLDDYENAYKNLGEVIK---EIKRRIERLEKF----------IKRTENIEELIKE 197

Query: 1602 TKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQK 1661
             +   +     + +I   L E R      E+      + +    K+L ++  ++++ +++
Sbjct: 198  KEKELEEVLREINEISSELPELR-----EEL------EKLEKEVKELEELKEEIEELEKE 246

Query: 1662 SNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQET 1721
                    + L+E+++ ++           +++ E + LE++V    K+       Y + 
Sbjct: 247  LESLEGSKRKLEEKIRELE-------ERIEELKKEIEELEEKVK-ELKELKEKAEEYIKL 298

Query: 1722 DNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNI 1781
             ++   +      +    +  L++  +       ++K+L E E      + +L EL K +
Sbjct: 299  -SEFYEEYLD---ELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRL 354

Query: 1782 AELQKRIQ 1789
             EL++R +
Sbjct: 355  EELEERHE 362



 Score = 37.4 bits (87), Expect = 0.087
 Identities = 28/185 (15%), Positives = 79/185 (42%), Gaps = 14/185 (7%)

Query: 1211 AYTKEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNT 1270
               K+   LEKK+++++  +    +   +L     +++++++++   E        L + 
Sbjct: 553  ELKKKLAELEKKLDELEEELAELLKELEELG--FESVEELEERLKELEPFYNEYLELKDA 610

Query: 1271 TQRIN--LARLAL--NDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLK 1326
             + +      L     +L +  ++L +T + L+E    L+E   + + E   +  +  L+
Sbjct: 611  EKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLE 670

Query: 1327 LKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPD 1386
            L  +      ++ + E++ +  +  + K         +E  +  ++  +++  LEK L  
Sbjct: 671  LSRELAGLRAELEELEKRREEIKKTLEKL--------KEELEEREKAKKELEKLEKALER 722

Query: 1387 LNELI 1391
            + EL 
Sbjct: 723  VEELR 727



 Score = 35.4 bits (82), Expect = 0.33
 Identities = 49/289 (16%), Positives = 105/289 (36%), Gaps = 48/289 (16%)

Query: 1517 YVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLT 1576
            ++  T NI + + E        L +I  IS             ++ E   E+  + K + 
Sbjct: 184  FIKRTENIEELIKEKEKELEEVLREINEIS------------SELPELREELEKLEKEVK 231

Query: 1577 DIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQT 1636
            +++    E   ++ +          +K   +      +KI +   E R  +   E+    
Sbjct: 232  ELE----ELKEEIEELEKELESLEGSKRKLE------EKIRE--LEERIEELKKEI---- 275

Query: 1637 AKDDISAARKDLSQITND----------LDDAQQKSNETNVKVKLLQERLKSIQAGFLQN 1686
              +++    K+L ++              ++   +  E   ++  L+E +  I+   ++ 
Sbjct: 276  --EELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEE-RIKE 332

Query: 1687 GRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKA 1746
                 +   E K   KE+   +K+   L  R++  +     K E   LK+    +   K 
Sbjct: 333  LEEKEERLEELKKKLKEL---EKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKL 389

Query: 1747 -SQLSVNTTAKL---KQLNETEAMFNSQESELTELSKNIAELQKRIQSC 1791
              +L     AK    +++++  A     + E+ EL K I EL+K    C
Sbjct: 390  EKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKC 438


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
            bacterial type.  SMC (structural maintenance of
            chromosomes) proteins bind DNA and act in organizing and
            segregating chromosomes for partition. SMC proteins are
            found in bacteria, archaea, and eukaryotes. This family
            represents the SMC protein of most bacteria. The smc gene
            is often associated with scpB (TIGR00281) and scpA genes,
            where scp stands for segregation and condensation
            protein. SMC was shown (in Caulobacter crescentus) to be
            induced early in S phase but present and bound to DNA
            throughout the cell cycle [Cellular processes, Cell
            division, DNA metabolism, Chromosome-associated
            proteins].
          Length = 1179

 Score = 74.3 bits (183), Expect = 5e-13
 Identities = 110/657 (16%), Positives = 229/657 (34%), Gaps = 137/657 (20%)

Query: 1214 KEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQR 1273
            ++ E L++++E ++A +E       +L  + + ++++++Q+                T R
Sbjct: 344  EKLEELKEELESLEAELEELEA---ELEELESRLEELEEQL---------------ETLR 385

Query: 1274 --INLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQS 1331
              +    L +  L  EI++L    E L++    LQ+   E   +L         +L+ + 
Sbjct: 386  SKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELK---ELQAEL 442

Query: 1332 TETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELI 1391
             E E+++ + + + +R E  + +   +     EE E+AL     ++  L+  L  L  L 
Sbjct: 443  EELEEELEELQEELERLEEALEELREEL----EEAEQALDAAERELAQLQARLDSLERLQ 498

Query: 1392 CDKRGDPC------DNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLIN--EKES 1443
             +  G          N  G +G           G LSE             LI+  E   
Sbjct: 499  ENLEGFSEGVKALLKNQSGLSGIL---------GVLSE-------------LISVDEGYE 536

Query: 1444 KAEE--LRKHM--IILSESNSAKDYAETSKK-----------------LINEKEAKAEEV 1482
             A E  L   +  +++   N+AK      K+                  I   + +  + 
Sbjct: 537  AAIEAALGGRLQAVVVENLNAAKKAIAFLKQNELGRVTFLPLDSIKGTEIQGNDREILKN 596

Query: 1483 FRG--------ITTAKQESHA-----ANILAKEAYDHAASVRNKTAAYVASTSNITKQLD 1529
              G        +    +   A       +L  +  D+A  +  K          +T    
Sbjct: 597  IEGFLGVAKDLVKFDPKLRKALSYLLGGVLVVDDLDNALELAKKLRPG---YRIVTLD-G 652

Query: 1530 EFLNAPGATL--ADIRNISGLAVNKNIQSNPEQIRERANEINN----VIKSLTDIDTILT 1583
            + +   G     +   N S L   + I+   E+I E   +I      + +   +++ +  
Sbjct: 653  DLVRPGGVITGGSAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEE 712

Query: 1584 ETAGDLAKANDLKRKANLTKAGADSTKNTVQKIV---------------------DVLTE 1622
            E      +  +L R+ +  +      +  V+++                      + L E
Sbjct: 713  ELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEE 772

Query: 1623 ARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAG 1682
            A      AE  I+  +  I   +++L  +   LD+ + +    N +   L+ERL+S++  
Sbjct: 773  AEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERR 832

Query: 1683 FLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQML 1742
                 R   D+E + + L +++     +        +E   +L S+ E+   +RA  +  
Sbjct: 833  IAATERRLEDLEEQIEELSEDIESLAAE----IEELEELIEELESELEALLNERASLEEA 888

Query: 1743 LNKASQLSVNTTAKLK-----------QLNETEAMFNSQESELTELSKNIAELQKRI 1788
            L          + +L+           +L E        E  L  L   I  LQ+R+
Sbjct: 889  LALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERL 945



 Score = 72.4 bits (178), Expect = 2e-12
 Identities = 96/566 (16%), Positives = 196/566 (34%), Gaps = 102/566 (18%)

Query: 1272 QRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQS 1331
            + I   +  L  L  EI +L +  ++L+E    L+    E   +L         +L E+ 
Sbjct: 281  EEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESK-LDELAEEL 339

Query: 1332 TETEKQINDAERQCKRTENLVTK---TFPQFTQGQEENEKALQRLNEKITSLEKNLPDLN 1388
             E E+++ + + + +  E  + +      +     EE E+ L+ L  K+  LE  +  LN
Sbjct: 340  AELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLN 399

Query: 1389 ELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSE-SNSAKDYAETSKKLINEKESKAEE 1447
                                            +       +   +  ++L  E E   ++
Sbjct: 400  ------------------------------NEIERLEARLERLEDRRERLQQEIEELLKK 429

Query: 1448 LRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHA 1507
            L +  +   ++       E  ++ + E + + E +   +   ++E   A      A    
Sbjct: 430  LEEAELKELQAEL-----EELEEELEELQEELERLEEALEELREELEEAEQALDAAEREL 484

Query: 1508 ASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIR----- 1562
            A ++ +  +       + +  +          A ++N SGL+    + S  E I      
Sbjct: 485  AQLQARLDS-------LERLQENLEGFSEGVKALLKNQSGLSGILGVLS--ELISVDEGY 535

Query: 1563 ERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAG---ADSTKNTVQKIVDV 1619
            E A  I   +     +  ++ E      KA    ++  L +      DS K T  +I   
Sbjct: 536  EAA--IEAALGG--RLQAVVVENLNAAKKAIAFLKQNELGRVTFLPLDSIKGT--EIQGN 589

Query: 1620 LTEARTAQDMAEVAIQTAKDDISAA---RKDLS------QITNDLDDAQQKSNETNVKVK 1670
              E        E  +  AKD +      RK LS       + +DLD+A + + +     +
Sbjct: 590  DREILKNI---EGFLGVAKDLVKFDPKLRKALSYLLGGVLVVDDLDNALELAKKLRPGYR 646

Query: 1671 LLQERLKSIQAGFLQNGRSA------MDVENEEKNLEKEVALAQ---------------- 1708
            ++      ++ G +  G SA      ++   E + LE+++   +                
Sbjct: 647  IVTLDGDLVRPGGVITGGSAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRKE 706

Query: 1709 -----KQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNET 1763
                 ++   LR   +E   ++ +  +      A  + L  + +QLS   T    ++ E 
Sbjct: 707  LEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEEL 766

Query: 1764 EAMFNSQESELTELSKNIAELQKRIQ 1789
            E      E EL E    I EL+ +I+
Sbjct: 767  EERLEEAEEELAEAEAEIEELEAQIE 792



 Score = 58.9 bits (143), Expect = 3e-08
 Identities = 81/414 (19%), Positives = 149/414 (35%), Gaps = 78/414 (18%)

Query: 1269 NTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQ------ENNVEGALELTRQAHD 1322
               + +   R  L +L EE+++L K  E L    + L+      E  VE   E   Q   
Sbjct: 695  ELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSK 754

Query: 1323 TSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEK 1382
               +L+ +  E E+++ +AE +    E  + +   Q  Q +EE    L+ L E +  L  
Sbjct: 755  ELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEE----LKALREALDELRA 810

Query: 1383 NLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKL---IN 1439
             L  LNE   + R          A   +          L E    ++ +E  + L   I 
Sbjct: 811  ELTLLNEEAANLRERLESLERRIAATER------RLEDLEEQ--IEELSEDIESLAAEIE 862

Query: 1440 EKESKAEELRKHMIILS-ESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANI 1498
            E E   EEL   +  L  E  S ++     +  + E   +  E+    +  ++E      
Sbjct: 863  ELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELR- 921

Query: 1499 LAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNP 1558
                  +  A +  +         N+ ++L E  +    TL +       A+   I+ + 
Sbjct: 922  ------EKLAQLELRLEGLEVRIDNLQERLSEEYSL---TLEEA-----EALENKIEDDE 967

Query: 1559 EQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLK-RKANLTKAGADSTKNTVQKIV 1617
            E+ R R   + N IK L  ++      A  + +  +LK R   LT           Q   
Sbjct: 968  EEARRRLKRLENKIKELGPVN----LAA--IEEYEELKERYDFLTA----------Q--- 1008

Query: 1618 DVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDD-AQQKSNETNVKVK 1670
                                K+D++ A++ L +   ++D  A+++  +T  +V 
Sbjct: 1009 --------------------KEDLTEAKETLEEAIEEIDREARERFKDTFDQVN 1042



 Score = 58.5 bits (142), Expect = 3e-08
 Identities = 58/295 (19%), Positives = 124/295 (42%), Gaps = 45/295 (15%)

Query: 1531 FLNAPGATLADIRNI---SGLA-----------VNKNIQSNPEQIR-------------E 1563
            F+N     L DI+++   +GL            +++ I++ PE+ R             E
Sbjct: 113  FINGQPCRLKDIQDLFLDTGLGKRSYSIIEQGKISEIIEAKPEERRAIFEEAAGISKYKE 172

Query: 1564 RANE----INNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGAD--STKNTVQK-- 1615
            R  E    +    ++L  ++ IL E    L     L+R+A   +   +  +    ++   
Sbjct: 173  RRKETERKLERTRENLDRLEDILNELERQL---KSLERQAEKAERYKELKAELRELELAL 229

Query: 1616 IVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQER 1675
            +V  L E R   +         ++++  A ++L ++T +L + ++K  E  ++V  L+E 
Sbjct: 230  LVLRLEELREELE-------ELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEE 282

Query: 1676 LKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLK 1735
            ++ +Q            +E +++ L + +A  ++Q   L ++ +E ++KL   AE     
Sbjct: 283  IEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAEL 342

Query: 1736 RARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQS 1790
              + + L  +   L         +L E E+     E +L  L   +A+L+ +I S
Sbjct: 343  EEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIAS 397



 Score = 56.2 bits (136), Expect = 2e-07
 Identities = 104/604 (17%), Positives = 206/604 (34%), Gaps = 139/604 (23%)

Query: 1273 RINLARLALNDLTEEIKKLNKTGEMLKENATLLQE-----NNVEGALELTRQAHDTSLKL 1327
            R NL RL   D+  E+++  K+ E   E A   +E       +E AL L  +  +    L
Sbjct: 185  RENLDRL--EDILNELERQLKSLERQAEKAERYKELKAELRELELAL-LVLRLEE----L 237

Query: 1328 KEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQ-EENEKALQRLNEKITSLEKNLPD 1386
            +E+  E ++++ +AE + +     + +      + + EE    +  L E+I  L+K L  
Sbjct: 238  REELEELQEELKEAEEELEELTAELQE-----LEEKLEELRLEVSELEEEIEELQKELYA 292

Query: 1387 LNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLIN----EKE 1442
            L   I                                        E   + +     E E
Sbjct: 293  LANEISRL-----------------------EQQKQILRERLANLERQLEELEAQLEELE 329

Query: 1443 SKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKE 1502
            SK +EL +    L+E     +  +   + +  +  + E          +E  +     +E
Sbjct: 330  SKLDELAE---ELAELEEKLEELKEELESLEAELEELEAE-------LEELESRLEELEE 379

Query: 1503 AYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLAD-IRNISGL---AVNKNIQSNP 1558
              +   S   +    +AS +N  ++L+  L           + I  L        ++   
Sbjct: 380  QLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQ 439

Query: 1559 EQIRERANEINNVIKSLTDIDTILTETAGDLAKAN----DLKRKANLTKAGADSTKNTVQ 1614
             ++ E   E+  + + L  ++  L E   +L +A       +R+    +A  DS +   +
Sbjct: 440  AELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQE 499

Query: 1615 K--------------------IVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITND 1654
                                 I+ VL+E  +  +  E AI+ A   +    + +  +  +
Sbjct: 500  NLEGFSEGVKALLKNQSGLSGILGVLSELISVDEGYEAAIEAA---LGGRLQAV--VVEN 554

Query: 1655 LDDAQQ-----KSNE---------TNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNL 1700
            L+ A++     K NE          ++K   +Q   + I          A D+   +  L
Sbjct: 555  LNAAKKAIAFLKQNELGRVTFLPLDSIKGTEIQGNDREILKNIEGFLGVAKDLVKFDPKL 614

Query: 1701 EK-------------EVALAQKQASGLRSRYQ--------------------ETDNKLLS 1727
             K             ++  A + A  LR  Y+                    +T++ +L 
Sbjct: 615  RKALSYLLGGVLVVDDLDNALELAKKLRPGYRIVTLDGDLVRPGGVITGGSAKTNSSILE 674

Query: 1728 KAESSGLKRARGQM--LLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQ 1785
            +     ++    ++  L  K ++L        K+L E E        EL ELS+ I+ L+
Sbjct: 675  R--RREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALR 732

Query: 1786 KRIQ 1789
            K + 
Sbjct: 733  KDLA 736


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria with
            the myosin-like domain [Function unknown].
          Length = 499

 Score = 61.2 bits (148), Expect = 3e-09
 Identities = 41/188 (21%), Positives = 82/188 (43%), Gaps = 17/188 (9%)

Query: 1620 LTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSI 1679
               A T ++ A   +Q A+ +  A R++L+    +L  AQQ+      + + LQ RLK++
Sbjct: 97   KRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELARLTKQAQDLQTRLKTL 156

Query: 1680 ---------QAGFLQNGRSAM-----DVENEEKNLEKEVALAQKQASGLRSR---YQETD 1722
                     QA  LQ  +  +      ++++  +L+   A  +++A  L +R    Q   
Sbjct: 157  AEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLATRANAAQART 216

Query: 1723 NKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIA 1782
             +L  +A ++       Q    + SQ +    A+ +Q+ E E      E+    L + +A
Sbjct: 217  EELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRERERQLQRLETAQARLEQEVA 276

Query: 1783 ELQKRIQS 1790
            +L+   Q+
Sbjct: 277  QLEAYYQA 284



 Score = 33.8 bits (77), Expect = 0.87
 Identities = 45/257 (17%), Positives = 88/257 (34%), Gaps = 37/257 (14%)

Query: 1559 EQIRERANEIN-NVIKSL--------------TDIDTILTETAGDLAKANDLKRKANLTK 1603
            E +R+       N+ K+               T + T+  +     A+A  L+      +
Sbjct: 119  EAVRQELAAARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQ 178

Query: 1604 AGADSTKNTVQKI----VDVLTEARTAQDMAEVAIQTAKD--DISAARKDLSQITNDLD- 1656
            A A   K+ V  +      +  EA+     A  A    ++    +AA +  +Q     D 
Sbjct: 179  ASATQLKSQVLDLKLRSAQIEQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDA 238

Query: 1657 DAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRS 1716
               QK+ +   + + ++ER + +Q            +E  +  LE+EVA  +        
Sbjct: 239  QISQKAQQIAARAEQIRERERQLQ-----------RLETAQARLEQEVAQLEAYYQAYVR 287

Query: 1717 RYQETDNKLLSKAESSGLKR-ARGQMLLNKASQL--SVNTTAKLKQLNETEAMFNSQESE 1773
              Q+       +  +   +R A+ Q      +QL  S N  A L +L  +      Q   
Sbjct: 288  LRQQAAATQRGQVLAGAAQRVAQAQAQAQAQAQLLSSANRPAAL-RLRRSPRRGRRQRPV 346

Query: 1774 LTELSKNIAELQKRIQS 1790
                ++      ++  S
Sbjct: 347  TRHTTRRRRPATRQSPS 363


>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
            This is a family of proteins that form part of the CAZ
            (cytomatrix at the active zone) complex which is involved
            in determining the site of synaptic vesicle fusion. The
            C-terminus is a PDZ-binding motif that binds directly to
            RIM (a small G protein Rab-3A effector). The family also
            contains four coiled-coil domains.
          Length = 774

 Score = 59.6 bits (144), Expect = 1e-08
 Identities = 106/654 (16%), Positives = 220/654 (33%), Gaps = 105/654 (16%)

Query: 1205 QTGASGAYTKEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVG 1264
            +   SG  T E + LEK       +     R + +L ++  T++ ++ ++          
Sbjct: 95   KLEKSGNETGELQQLEKSEGQFSRIQAERERQAMELFLLRDTLEKLQLRM---------- 144

Query: 1265 KTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTS 1324
            + L  T          L+   EEIKKL +  +    +A   +E++    L   R+A D  
Sbjct: 145  ERLQQT----------LDARDEEIKKLLEMLQSKGLSAKAEEEDS--ERLGRIREAEDQQ 192

Query: 1325 LKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNL 1384
              L+    + EK+      +  +   +      +     E ++K +   + KI   EK L
Sbjct: 193  SHLEVLLEQKEKEHMMLREEIHQKLQM-----ERDDAKTEASQKLIDEKDTKIKEFEKML 247

Query: 1385 PDLNELI------CDKRGDPCDNLCGGAGCGKCGGMWCSNGT------LSESNSAKDYAE 1432
                  I      CD      + L       +                LS   +     +
Sbjct: 248  EKAENEIYRLQSRCDTSEADRNRLDKEVEAERSALAAMKAKCDRAAQELSRKKTELLGLQ 307

Query: 1433 TSKKLINEKESKAEELRKHMIILSESNSAKDYA---------------ETSKKLINEKEA 1477
            T    +    ++  ++R+H+  L E  +  +                 E     + +K A
Sbjct: 308  TE---LETLANQDSDMRQHLDKLKEDLTRAEQEKAILQTEVDALRYELERKHNTLTKKTA 364

Query: 1478 KAEEVFRGITTAKQESH-----------AANILAKEAYDHAASVRNKTAAY------VAS 1520
              +       T   E                +L K+  +   + R K          + S
Sbjct: 365  SLQAAQEEKATYAGEIEDMRDRYEKTERKLRVLQKKIENLQETFRRKERRLKEEKERLRS 424

Query: 1521 TSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQ------------IRERANEI 1568
                T   D  L      LA+   I      K  +   E+              +   E+
Sbjct: 425  LQTDTNT-DTALEKLEKALAEKERIIERL--KEQRDRDERYEQEEFETYKKEFEDLKEEV 481

Query: 1569 NNVIKSLTDIDTILTETAGDLAK--ANDLKRKANLTKAGA--DSTKNTVQKIVDVLTEAR 1624
             N+   L++ +  L     +++K  +N LK++++L +A    +  +   +K+   L   R
Sbjct: 482  QNLQLKLSERELQLELLKEEVSKLASNQLKQRSDLERAHIELEKIREKHEKLEKELKRLR 541

Query: 1625 TAQ------DMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLK- 1677
                        +     ++ D + AR ++ ++ + L+ A+Q+ ++T ++   L + L+ 
Sbjct: 542  ANPESADRGSAVDAGTSRSRADSAGARNEVDRLLDRLEKAEQERDDTEMEAGRLAKELEK 601

Query: 1678 -SIQAGFLQNGRSAMDVENEEKNLEK-EVALAQKQASGLRSRYQETDNKLLSKAESSGLK 1735
                    Q    A  +E E K+ E  E A   +++       +++     ++A +    
Sbjct: 602  AQRHLTKQQEKTEATRIEFERKSAELLEEAERLEKSEAEEETLRQSTQIGHAQAAAHNHI 661

Query: 1736 RARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQ 1789
                Q L +   QL       + QL +++      + EL       AE +K ++
Sbjct: 662  EHHVQKLESDLKQLRAEREQLVAQLEKSQQSLMEFQQELNAAE---AERRKHLE 712



 Score = 38.5 bits (89), Expect = 0.039
 Identities = 55/301 (18%), Positives = 105/301 (34%), Gaps = 47/301 (15%)

Query: 1215 EFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQRI 1274
            EFE  +K+ ED+K  V+N      +  + L  + +   ++ +N+             QR 
Sbjct: 466  EFETYKKEFEDLKEEVQNLQLKLSERELQLELLKEEVSKLASNQL-----------KQRS 514

Query: 1275 NLARL--ALNDLTEEIKKLNKTGEMLKENATLLQ-----ENNVEGALELTRQAHDTSLKL 1327
            +L R    L  + E+ +KL K  + L+ N          +     +   +  A +   +L
Sbjct: 515  DLERAHIELEKIREKHEKLEKELKRLRANPESADRGSAVDAGTSRSRADSAGARNEVDRL 574

Query: 1328 KEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDL 1387
             ++  + E++ +D E +  R    + K     T+ QE+ E        K   L +    L
Sbjct: 575  LDRLEKAEQERDDTEMEAGRLAKELEKAQRHLTKQQEKTEATRIEFERKSAELLEEAERL 634

Query: 1388 NELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEE 1447
             +   ++                         T      A  +      +  + ES  ++
Sbjct: 635  EKSEAEEET-------------------LRQSTQIGHAQAAAHNHIEHHV-QKLESDLKQ 674

Query: 1448 LRKHM--IILSESNSAKDYAETSKKLIN---EKEAKAEEVFRGITTAKQESHAANILAKE 1502
            LR     ++     S +   E  ++L     E+    EE        KQE+  A I  K+
Sbjct: 675  LRAEREQLVAQLEKSQQSLMEFQQELNAAEAERRKHLEENRE----LKQEALLAAISEKD 730

Query: 1503 A 1503
            A
Sbjct: 731  A 731


>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).  This
            family consists of several plant proteins of unknown
            function. Several sequences in this family are described
            as being "myosin heavy chain-like".
          Length = 484

 Score = 57.7 bits (139), Expect = 4e-08
 Identities = 95/473 (20%), Positives = 171/473 (36%), Gaps = 82/473 (17%)

Query: 1286 EEIKKLNKTGEMLKENATL--LQENNVEGALELTRQAHDTSLKLKEQSTETEKQINDAER 1343
            EE  KL K  E+  E   L  L+E      LE T++  +  LKLK +  E E+Q      
Sbjct: 43   EEANKLKKELEV-AEKEKLQVLKE------LESTKRTVED-LKLKLEKAEKEEQ------ 88

Query: 1344 QCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDKRGDPCDNLC 1403
            Q K+   L      +  QG +E E    R       L+    +L ++  +          
Sbjct: 89   QAKQDSELAKLRAEELEQGIQELEVE--RYITATAELDSVKEELRKIRQE---------- 136

Query: 1404 GGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIILSES----N 1459
                          +  + E ++A   AE +       E K EEL K +I + ES    +
Sbjct: 137  -------------YDALVEERDAALKRAEEAICASKVNEKKVEELTKEIIAMKESLERAH 183

Query: 1460 SAKDYAETSKKLINEK--------EAKAEEVFRGITTAKQESHAANILAK-EAYDHAASV 1510
            +A   AE  +     +        E + +E  + +   KQ+      L K          
Sbjct: 184  AAHLEAEEERIGAALEKDQDRETYEKELKEAEKELERLKQDLDPEKDLEKLAEASAELES 243

Query: 1511 RNKTAAYVASTSNITKQLDE---FLNAPGATLADIRNISG-LAVN-KNIQSNPEQIRERA 1565
              K  + +AS +++ K+L+E    L         +RN+   L    +  +   E++RE+ 
Sbjct: 244  LQKEISIMASVASVLKELEEAKANLEKAAEEEKSLRNLVESLKQELEEEKKELEELREKE 303

Query: 1566 NEINNVIKSLTDIDTILTETAGDLA---------KANDLKRKANLTKAGADSTKNTVQKI 1616
             E      SL              A         K      +A   +  A++ +  ++K+
Sbjct: 304  GEAEEAASSLEAELNRTKSEKESKAREKMVEIPLKLQQASSEAEEARKEAEAAREELRKL 363

Query: 1617 VDVLTEARTAQDMAEVAIQTAKDDISAAR----KDLSQITNDLDDAQQKSNETNVKVKLL 1672
             +   + + A + AE+ ++ A  +  AA+      L++I    +  +    ++  K+ L 
Sbjct: 364  KEEAEQTKAALETAELRLEAALKEAEAAKAAEALALAEIKALQESEESAKADSPRKITLS 423

Query: 1673 QERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKL 1725
             E  + +          +   E  E+  EK+VA A  Q    +    E+  KL
Sbjct: 424  LEEYEEL----------SKKAEEAEELAEKKVAAALAQVEEAKESENESLKKL 466



 Score = 46.1 bits (109), Expect = 1e-04
 Identities = 63/370 (17%), Positives = 143/370 (38%), Gaps = 34/370 (9%)

Query: 1432 ETSKKLINEKESKAEELRKHMIILSESNSAK--DYAETSKKLINEKEAKAEEVFRGITTA 1489
            ET   L  E+   A +L+K + + +E    +     E++K+ + + + K E+  +    A
Sbjct: 35   ETELMLAQEE---ANKLKKELEV-AEKEKLQVLKELESTKRTVEDLKLKLEKAEKEEQQA 90

Query: 1490 KQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLA 1549
            KQ+S  A + A+E       +  +      +  +  K+    +      L + R+ +   
Sbjct: 91   KQDSELAKLRAEELEQGIQELEVERYITATAELDSVKEELRKIRQEYDALVEERDAALKR 150

Query: 1550 VNKNIQS---NPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGA 1606
              + I +   N +++ E   EI  + +SL        E   +   A     K    +   
Sbjct: 151  AEEAICASKVNEKKVEELTKEIIAMKESLERAHAAHLEAEEERIGAA--LEKDQDRETYE 208

Query: 1607 DSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQ------ITNDLDDAQQ 1660
               K   +++  +  +    +D+ +  +  A  ++ + +K++S       +  +L++A+ 
Sbjct: 209  KELKEAEKELERLKQDLDPEKDLEK--LAEASAELESLQKEISIMASVASVLKELEEAKA 266

Query: 1661 KSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQE 1720
               +   + K L+  ++S++           ++E E+K LE+   L +K+     +    
Sbjct: 267  NLEKAAEEEKSLRNLVESLK----------QELEEEKKELEE---LREKEGEAEEAASSL 313

Query: 1721 TDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNET--EAMFNSQESELTELS 1778
                  +K+E     R +   +  K  Q S       K+      E     +E+E T+ +
Sbjct: 314  EAELNRTKSEKESKAREKMVEIPLKLQQASSEAEEARKEAEAAREELRKLKEEAEQTKAA 373

Query: 1779 KNIAELQKRI 1788
               AEL+   
Sbjct: 374  LETAELRLEA 383



 Score = 43.4 bits (102), Expect = 0.001
 Identities = 54/292 (18%), Positives = 98/292 (33%), Gaps = 44/292 (15%)

Query: 1211 AYTKEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNT 1270
             Y KE +  EK++E +K  ++   +    L      ++ ++K+I        V K L   
Sbjct: 206  TYEKELKEAEKELERLKQDLD-PEKDLEKLAEASAELESLQKEISIMASVASVLKELEEA 264

Query: 1271 TQRINLA-------RLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDT 1323
               +  A       R  +  L +E+++  K  E L+E     +E       EL R   + 
Sbjct: 265  KANLEKAAEEEKSLRNLVESLKQELEEEKKELEELREKEGEAEEAASSLEAELNRTKSEK 324

Query: 1324 SLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKN 1383
              K +E+  E   ++  A  + +           +  + +EE E+    L       E  
Sbjct: 325  ESKAREKMVEIPLKLQQASSEAEEARKEAEAAREELRKLKEEAEQTKAALET----AELR 380

Query: 1384 LPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKES 1443
            L    +                                  + +A+  A    K + E E 
Sbjct: 381  LEAALKEA------------------------------EAAKAAEALALAEIKALQESEE 410

Query: 1444 --KAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQES 1493
              KA+  RK  + L E       AE +++L  +K A A         ++ ES
Sbjct: 411  SAKADSPRKITLSLEEYEELSKKAEEAEELAEKKVAAALAQVEEAKESENES 462


>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
          Length = 895

 Score = 57.6 bits (139), Expect = 5e-08
 Identities = 108/648 (16%), Positives = 225/648 (34%), Gaps = 101/648 (15%)

Query: 1183 DSILRGLADQTTKVITDASNIKQTGASGAYTKEFELLEKKIEDVKALVENTTRSSHDLTV 1242
            DS++ G   Q  K++ +   I     +      ++ L+  I+ ++A + N       L  
Sbjct: 141  DSLISGDPAQRKKILDEILEINSLERN------YDKLKDVIDMLRAEISNIDYLEEKLKS 194

Query: 1243 MLTTIDDMKKQIMNNEDPNG-VGKTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLK-- 1299
                ++++KKQI ++E  +    K +   +   N A    N+L   + +L+   +M    
Sbjct: 195  SNLELENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLEDMKNRY 254

Query: 1300 ---------ENATLLQENNVEGALE--LTRQAHDTSLKLKEQSTETEKQINDAERQCKRT 1348
                     + +  L++NN    LE    +  +D   K +    +  K  ND E + +  
Sbjct: 255  ESEIKTAESDLSMELEKNNYYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKKQIL 314

Query: 1349 ENLVTKTFPQFTQGQEENEKA---LQRLNEKITSLEKNLPDLNELICDKRGDPCDNLCGG 1405
             N+  +             K    LQ+        +    DLN  I +  G         
Sbjct: 315  SNIDAEI-----NKYHAIIKKLSVLQKDYNDYIKKKSRYDDLNNQILELEGYE------- 362

Query: 1406 AGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYA 1465
                             + NS     E+ KK I E     E +     I       +   
Sbjct: 363  ----------------MDYNSYLKSIESLKKKIEEYSKNIERMSAF--ISEILKIQEIDP 404

Query: 1466 ETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNIT 1525
            +  KK +NE   K +++   +++  Q   A      E   +   +  ++   V  T+   
Sbjct: 405  DAIKKELNEINVKLQDISSKVSSLNQRIRALRENLDELSRNMEMLNGQSVCPVCGTTLGE 464

Query: 1526 KQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTET 1585
            ++ +  +N      + +                E+IRE   E+ ++ + + D+       
Sbjct: 465  EKSNHIINHYNEKKSRLE---------------EKIREIEIEVKDIDEKIVDLKKRKEYL 509

Query: 1586 AGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAK----DDI 1641
              +  + N    + N  +    S +  ++ I   + E +   D  E      K    +D+
Sbjct: 510  ESE--EINKSINEYNKIE----SARADLEDIKIKINELKDKHDKYEEIKNRYKSLKLEDL 563

Query: 1642 SAARKD----LSQITN-DLDDAQQKSNETNVKVKLLQERLKSIQAGFLQ----NGRSAMD 1692
             + R      L+ I+  D++  + +SNE   ++  L+ RL+ I+ GF        +S  +
Sbjct: 564  DSKRTSWLNALAVISLIDIETNRSRSNEIKKQLNDLESRLQEIEIGFPDDKSYIDKSIRE 623

Query: 1693 VENEEKNLEKEVALAQ----------KQASGLRSRYQETDNKLLSKAESSGLKRARGQML 1742
            +ENE  NL  +    Q           +    + +  E D+ +    E +    +R   +
Sbjct: 624  IENEANNLNNKYNEIQENKILIEKLRGKIDNYKKQIAEIDSIIPDLKEIT----SRINDI 679

Query: 1743 LNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQS 1790
             +   +               E+      + + ELS  I ++ + ++S
Sbjct: 680  EDNLKKSRKALDDAKANRARLESTIEILRTRINELSDRINDINETLES 727



 Score = 40.3 bits (94), Expect = 0.012
 Identities = 86/536 (16%), Positives = 182/536 (33%), Gaps = 80/536 (14%)

Query: 1287 EIKKLNKTGEMLKENATLLQEN--NVEGALELTRQAHDTSLKLKEQSTETEKQINDAERQ 1344
            EI  L +  + LK+   +L+    N++   E  + ++     +K+Q  + EK  +   ++
Sbjct: 160  EINSLERNYDKLKDVIDMLRAEISNIDYLEEKLKSSNLELENIKKQIADDEKSHSITLKE 219

Query: 1345 CKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDKRGDPCDNLCG 1404
             +R           +   +      L  L+    SLE         I     D       
Sbjct: 220  IERLSIEYNNAMDDYNNLKSA----LNELS----SLEDMKNRYESEIKTAESD------- 264

Query: 1405 GAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKE-SKAEELRKHMIILSESNSAKD 1463
                         +  L ++N  K+  E   K+IN+        +  +    ++  + K 
Sbjct: 265  ------------LSMELEKNNYYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKKQ 312

Query: 1464 YAETSKKLINEKEA---KAEEVFRGITTAKQESHAANILAKEAYDHAASVR-------NK 1513
                    IN+  A   K   +        Q+ +   I  K  YD   +           
Sbjct: 313  ILSNIDAEINKYHAIIKKLSVL--------QKDYNDYIKKKSRYDDLNNQILELEGYEMD 364

Query: 1514 TAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIK 1573
              +Y+ S  ++ K+++E+        A I  I  +      + +P+ I++  NEIN  ++
Sbjct: 365  YNSYLKSIESLKKKIEEYSKNIERMSAFISEILKIQ-----EIDPDAIKKELNEINVKLQ 419

Query: 1574 SL-TDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEV 1632
             + + + ++           ++L R   +            Q +  V         + E 
Sbjct: 420  DISSKVSSLNQRIRALRENLDELSRNMEMLNG---------QSVCPV-----CGTTLGEE 465

Query: 1633 AIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMD 1692
                  +  +  +  L +   +++   +  +E  V +K  +E L+S +     N  + ++
Sbjct: 466  KSNHIINHYNEKKSRLEEKIREIEIEVKDIDEKIVDLKKRKEYLESEEINKSINEYNKIE 525

Query: 1693 VENEE----KNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQ 1748
                +    K    E+     +   +++RY+    + L    +S L  A   + L     
Sbjct: 526  SARADLEDIKIKINELKDKHDKYEEIKNRYKSLKLEDLDSKRTSWLN-ALAVISLIDIET 584

Query: 1749 LSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCINFIVDKSNNYKN 1804
                +    KQLN+        ES L E+     + +  I   I  I +++NN  N
Sbjct: 585  NRSRSNEIKKQLNDL-------ESRLQEIEIGFPDDKSYIDKSIREIENEANNLNN 633



 Score = 31.0 bits (70), Expect = 6.7
 Identities = 57/337 (16%), Positives = 118/337 (35%), Gaps = 54/337 (16%)

Query: 1212 YTKEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTT 1271
            Y ++   LE+KI +++  V++      DL      ++  +     NE             
Sbjct: 474  YNEKKSRLEEKIREIEIEVKDIDEKIVDLKKRKEYLESEEINKSINEYNKIESARADLED 533

Query: 1272 QRINLARLA-----LNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLK 1326
             +I +  L        ++    K L K  ++  +  + L    V   +++          
Sbjct: 534  IKIKINELKDKHDKYEEIKNRYKSL-KLEDLDSKRTSWLNALAVISLIDIE--------T 584

Query: 1327 LKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLE--KNL 1384
             + +S E +KQ+ND E    R + +       F   +   +K+++ +  +  +L    N 
Sbjct: 585  NRSRSNEIKKQLNDLE---SRLQEIEI----GFPDDKSYIDKSIREIENEANNLNNKYNE 637

Query: 1385 PDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESK 1444
               N+++ +K     DN                     +  +  D      K I  + + 
Sbjct: 638  IQENKILIEKLRGKIDN-------------------YKKQIAEIDSIIPDLKEITSRIND 678

Query: 1445 AEE----LRKHM-IILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANIL 1499
             E+     RK +    +     +   E  +  INE   +  ++   + + K+   A   L
Sbjct: 679  IEDNLKKSRKALDDAKANRARLESTIEILRTRINELSDRINDINETLESMKKIKKAIGDL 738

Query: 1500 --AKEAYDHA---ASVRNKTAAYVASTSNITKQLDEF 1531
               +EA+D +   A +R   +  +  TS   K L EF
Sbjct: 739  KRLREAFDKSGVPAMIRKSASQAM--TSLTRKYLFEF 773


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily
            archaeal type.  SMC (structural maintenance of
            chromosomes) proteins bind DNA and act in organizing and
            segregating chromosomes for partition. SMC proteins are
            found in bacteria, archaea, and eukaryotes. It is found
            in a single copy and is homodimeric in prokaryotes, but
            six paralogs (excluded from this family) are found in
            eukarotes, where SMC proteins are heterodimeric. This
            family represents the SMC protein of archaea and a few
            bacteria (Aquifex, Synechocystis, etc); the SMC of other
            bacteria is described by TIGR02168. The N- and C-terminal
            domains of this protein are well conserved, but the
            central hinge region is skewed in composition and highly
            divergent [Cellular processes, Cell division, DNA
            metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 55.8 bits (135), Expect = 2e-07
 Identities = 109/646 (16%), Positives = 235/646 (36%), Gaps = 110/646 (17%)

Query: 1213 TKEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQI--MNNEDPNGVGKTLGNT 1270
             ++   LE+++E +   +    +   ++  +L   +++ K+I  +  E+   V + +G  
Sbjct: 243  ERQLASLEEELEKLTEEISELEKRLEEIEQLL---EELNKKIKDLGEEEQLRVKEKIGEL 299

Query: 1271 TQRINLARLALNDLTEEIKKLNKTGEMLKE--NATLLQENNVEGALELTRQAHDTSLK-- 1326
               I     ++ +   E++   +    L+   +  L +   +E  +E   +     L   
Sbjct: 300  EAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIE-EERKRRDKLTEE 358

Query: 1327 LKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPD 1386
              E   E E    + E         V K F +     ++  + L++L  +I  L++ L  
Sbjct: 359  YAELKEELEDLRAELEE--------VDKEFAETRDELKDYREKLEKLKREINELKRELDR 410

Query: 1387 LNELICDKRGDPCD--NLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESK 1444
            L E +     +  D      G                ++ N  ++  E     I ++E K
Sbjct: 411  LQEELQRLSEELADLNAAIAGI--------------EAKINELEEEKEDKALEIKKQEWK 456

Query: 1445 AEELRKHMIILSESNSAK--DYAETSKKL--INEKEAKAEEVFRGITTAKQESHAANIL- 1499
             E+L   +    +       +Y    K+L  +  + A+AE   R      +   A   + 
Sbjct: 457  LEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERVRGGRAVEEVL 516

Query: 1500 ----------------AKEAYDHAASVR--NKTAAYVASTSNITKQLDEFL---NAPGAT 1538
                              E Y  A  V   N+    V     + K+  E L    A  AT
Sbjct: 517  KASIQGVHGTVAQLGSVGERYATAIEVAAGNRLNNVVVEDDAVAKEAIELLKRRKAGRAT 576

Query: 1539 LADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDI----------------DTIL 1582
               +  +      ++ + +   + E     + VI    D+                DT++
Sbjct: 577  FLPLNKM------RDERRDLSILSE-----DGVIGFAVDLVEFDPKYEPAFKYVFGDTLV 625

Query: 1583 TETAGDLAK-----------ANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAE 1631
             E   + A+             +L  K+     G+ + +     I+   +E    Q + E
Sbjct: 626  VEDI-EAARRLMGKYRMVTLEGELFEKSGAMTGGSRAPRG---GILFSRSEPAELQRLRE 681

Query: 1632 VAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAM 1691
              ++  K ++S+ + +L +I N LD+  Q+ ++ + K+  +++ ++ ++    +      
Sbjct: 682  -RLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLE 740

Query: 1692 DVENEEKNLEKEVALAQKQASGLRSR---YQETDNKL---LSKAESSGLKRARGQMLLNK 1745
            ++E +  +LE+E+   + +   L +R    +E  +KL   L+  E+  L  +R   +  +
Sbjct: 741  ELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEAR-LSHSRIPEIQAE 799

Query: 1746 ASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSC 1791
             S+L    +    +L E E   N    E   L K I ELQ++    
Sbjct: 800  LSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDL 845



 Score = 55.8 bits (135), Expect = 2e-07
 Identities = 64/384 (16%), Positives = 122/384 (31%), Gaps = 75/384 (19%)

Query: 1423 ESNSAKDYAETSKKLINEKESKAEEL-RKHMIILSESNSAKDYAETSKKLINEKEAKAEE 1481
            E +S +      +  ++E   +  +  RK   I  E    +   E  K+ + E E     
Sbjct: 689  ELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSS 748

Query: 1482 VFRGITTAKQE--SHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATL 1539
            + + I   K E     A I   E   H                    +L+E LN   A L
Sbjct: 749  LEQEIENVKSELKELEARIEELEEDLH--------------------KLEEALNDLEARL 788

Query: 1540 AD--IRNISGL--AVNKNIQSNPEQIRERANEINNVIKSLTDIDTIL-TETAGDLAKAND 1594
            +   I  I      + + +      I  R  EI   +  LT     L  E      +  D
Sbjct: 789  SHSRIPEIQAELSKLEEEVSR----IEARLREIEQKLNRLTLEKEYLEKEIQELQEQRID 844

Query: 1595 LKRKANLTKAGADSTKNTVQKIVDVLTEARTA-QDMAEVAIQTAKD------DISAARKD 1647
            LK +    +   ++     +++ + L E   A +D+        K+       +    + 
Sbjct: 845  LKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERK 904

Query: 1648 LSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALA 1707
            + ++   ++  +++ +E   K++ L+E L  I+    +      D E  E+ L  E   A
Sbjct: 905  IEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGE------DEEIPEEELSLEDVQA 958

Query: 1708 QKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMF 1767
            + Q      R  E  N                      A Q          +  E     
Sbjct: 959  ELQRVEEEIRALEPVN--------------------MLAIQ----------EYEEVLKRL 988

Query: 1768 NSQESELTELSKNIAELQKRIQSC 1791
            +  + +  +L +    + +RI+  
Sbjct: 989  DELKEKRAKLEEERKAILERIEEY 1012



 Score = 55.5 bits (134), Expect = 3e-07
 Identities = 63/375 (16%), Positives = 143/375 (38%), Gaps = 71/375 (18%)

Query: 1432 ETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQ 1491
            E  ++ I   +   +E R+ +                ++L  E+E KAE  ++ +   K+
Sbjct: 180  EEVEENIERLDLIIDEKRQQL----------------ERLRRERE-KAER-YQALLKEKR 221

Query: 1492 ESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADI-RNISGLAV 1550
            E         E Y+          A       I +QL          L  +   IS L  
Sbjct: 222  EY--------EGYELLKEKE----ALERQKEAIERQLASL----EEELEKLTEEISEL-- 263

Query: 1551 NKNIQSNPEQIRERANEINNVIKSLTD-----IDTILTETAGDLAKA-----------ND 1594
             K +    E+I +   E+N  IK L +     +   + E   ++A              D
Sbjct: 264  EKRL----EEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELED 319

Query: 1595 LKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITND 1654
             + +    +A  D     ++++   + E R  +D         K+++   R +L ++  +
Sbjct: 320  AEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKE 379

Query: 1655 LDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGL 1714
              + + +  +   K++ L+  +  ++       R    ++ E + L +E+A      +G+
Sbjct: 380  FAETRDELKDYREKLEKLKREINELK-------RELDRLQEELQRLSEELADLNAAIAGI 432

Query: 1715 RSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESEL 1774
             ++  E + +   KA    +K+        K  QL+ + +   ++L + +  ++  E EL
Sbjct: 433  EAKINELEEEKEDKALE--IKKQEW-----KLEQLAADLSKYEQELYDLKEEYDRVEKEL 485

Query: 1775 TELSKNIAELQKRIQ 1789
            ++L + +AE + + +
Sbjct: 486  SKLQRELAEAEAQAR 500



 Score = 48.1 bits (115), Expect = 5e-05
 Identities = 75/471 (15%), Positives = 143/471 (30%), Gaps = 140/471 (29%)

Query: 1267 LGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQE-NNVEGALELTR-QAHDTS 1324
            L    + I    L +++  +++++L +  E  +    LL+E    EG   L   +A +  
Sbjct: 179  LEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQ 238

Query: 1325 LKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNL 1384
             +        E+Q+   E +                   E+  + +  L +++  +E+ L
Sbjct: 239  KE------AIERQLASLEEE------------------LEKLTEEISELEKRLEEIEQLL 274

Query: 1385 PDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESK 1444
             +LN+ I D   +                         E           K+ I E E++
Sbjct: 275  EELNKKIKDLGEE-------------------------EQL-------RVKEKIGELEAE 302

Query: 1445 AEELRKHMIILSESNS-AKDYAETSKKL---INEKEAKAEEVFRGITTAKQESHAANILA 1500
               L +    ++E     +D  E   KL   I++  A+ EE+ R I   ++         
Sbjct: 303  IASLER---SIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKL---- 355

Query: 1501 KEAYDHAASVRNKT---AAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSN 1557
             E Y                V      T+  DE                     K+ +  
Sbjct: 356  TEEYAELKEELEDLRAELEEVDKEFAETR--DEL--------------------KDYREK 393

Query: 1558 PEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIV 1617
             E+++   NE+   +  L +    L+E   DL     +                   KI 
Sbjct: 394  LEKLKREINELKRELDRLQEELQRLSEELADLN--AAIAGI--------------EAKIN 437

Query: 1618 DVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLK 1677
            ++  E                 +I      L Q+  DL   +Q+  +   +   +++ L 
Sbjct: 438  ELEEEKEDKAL-----------EIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELS 486

Query: 1678 SIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSK 1728
             +Q                    E   A AQ +AS  R R      ++L  
Sbjct: 487  KLQ-------------------RELAEAEAQARASEERVRGGRAVEEVLKA 518



 Score = 38.1 bits (89), Expect = 0.054
 Identities = 51/310 (16%), Positives = 105/310 (33%), Gaps = 56/310 (18%)

Query: 1209 SGAYTKEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLG 1268
            SGA T         I   ++      R           ++ +K+++      + +   L 
Sbjct: 652  SGAMTGGSRAPRGGILFSRSEPAELQRLRER-------LEGLKREL------SSLQSELR 698

Query: 1269 NTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQE--NNVE---GALELTRQAHDT 1323
                R++     L+D + +I ++ K  E L++    L+E    +E    +LE   +   +
Sbjct: 699  RIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKS 758

Query: 1324 SLK--------LKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNE 1375
             LK        L+E   + E+ +ND E +   +        P+      + E+ + R+  
Sbjct: 759  ELKELEARIEELEEDLHKLEEALNDLEARLSHSR------IPEIQAELSKLEEEVSRIEA 812

Query: 1376 KITSLEKNLPDLN---ELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAE 1432
            ++  +E+ L  L    E +  +  +  +                      +  S +   E
Sbjct: 813  RLREIEQKLNRLTLEKEYLEKEIQELQEQR---------------IDLKEQIKSIEKEIE 857

Query: 1433 TSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQE 1492
                   E E + EEL   +  L          +  K+  +E EA+  E+ R I   + +
Sbjct: 858  NLNGKKEELEEELEELEAALRDLESR-----LGDLKKE-RDELEAQLRELERKIEELEAQ 911

Query: 1493 SHAANILAKE 1502
                     E
Sbjct: 912  IEKKRKRLSE 921



 Score = 37.4 bits (87), Expect = 0.094
 Identities = 49/305 (16%), Positives = 105/305 (34%), Gaps = 64/305 (20%)

Query: 1215 EFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQRI 1274
            E   L K+ E ++   E   R    L      ++ + ++I   E             +R+
Sbjct: 224  EGYELLKEKEALERQKEAIERQLASLE---EELEKLTEEISELE-------------KRL 267

Query: 1275 NLARLALNDLTEEIKKLNKTGEMLKENATLLQ--------ENNVEGALELTRQAHDTSLK 1326
                  L +L ++IK L +  E L+    + +        E ++         A +   K
Sbjct: 268  EEIEQLLEELNKKIKDLGE-EEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAK 326

Query: 1327 LKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNE----------K 1376
            L+ +  +   +I + ER+ +       K   ++ + +EE E     L E          +
Sbjct: 327  LEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDE 386

Query: 1377 ITSLEKNLPDL----NELI--CDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSE-SNSAKD 1429
            +    + L  L    NEL    D+  +    L               +  L++ + +   
Sbjct: 387  LKDYREKLEKLKREINELKRELDRLQEELQRL---------------SEELADLNAAIAG 431

Query: 1430 YAETSKKLINEKESKAEELRKHMIILSESN--SAKDYAETSKKLINEKEAKAEEVFRGIT 1487
                  +L  EKE KA E++K      E             ++ + + + + + V + ++
Sbjct: 432  IEAKINELEEEKEDKALEIKK-----QEWKLEQLAADLSKYEQELYDLKEEYDRVEKELS 486

Query: 1488 TAKQE 1492
              ++E
Sbjct: 487  KLQRE 491



 Score = 35.0 bits (81), Expect = 0.45
 Identities = 40/181 (22%), Positives = 66/181 (36%), Gaps = 28/181 (15%)

Query: 1214 KEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQR 1273
            KE E L  K E+++  +E    +  DL   L    D+KK+    E        L    ++
Sbjct: 854  KEIENLNGKKEELEEELEELEAALRDLESRLG---DLKKERDELE------AQLRELERK 904

Query: 1274 INLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTE 1333
            I      +    + + +L    E L+E  + +++   E                 E+  E
Sbjct: 905  IEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGED----------------EEIPE 948

Query: 1334 TEKQINDAERQCKRTENLVTKTFP---QFTQGQEENEKALQRLNEKITSLEKNLPDLNEL 1390
             E  + D + + +R E  +    P      Q  EE  K L  L EK   LE+    + E 
Sbjct: 949  EELSLEDVQAELQRVEEEIRALEPVNMLAIQEYEEVLKRLDELKEKRAKLEEERKAILER 1008

Query: 1391 I 1391
            I
Sbjct: 1009 I 1009


>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA.  During
            the bacterial cell cycle, the tubulin-like cell-division
            protein FtsZ polymerises into a ring structure that
            establishes the location of the nascent division site.
            EzrA modulates the frequency and position of FtsZ ring
            formation.
          Length = 559

 Score = 55.3 bits (134), Expect = 2e-07
 Identities = 101/572 (17%), Positives = 205/572 (35%), Gaps = 154/572 (26%)

Query: 1271 TQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQ 1330
             ++  L  L +N+   ++KKLN TGE  +               E  RQ  D      + 
Sbjct: 32   ERKEELMNLPVNEELSKVKKLNLTGESEET-------------FEEWRQKWD------DI 72

Query: 1331 ST----ETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKN--- 1383
             T    + E+ + +AE             F +F + +    KA++ + E + ++E++   
Sbjct: 73   VTNSLPDIEELLFEAEEL--------NDKF-RFLKAK----KAIKEIEELLDTIEEDIEQ 119

Query: 1384 -LPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKE 1442
             L +LNEL                                ES       E ++K + E +
Sbjct: 120  ILEELNELK-------------------------------ESE------EKNRKEVEELK 142

Query: 1443 SKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKE 1502
             K  ELRK   +L++S S   Y     +L    E + +E+        + + + + L  E
Sbjct: 143  DKYRELRKT--LLAKSFS---YGPALDEL----EKQLDELEEEFEQFVELTESGDYL--E 191

Query: 1503 AYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLA---------VNKN 1553
            A +    +  +T A       I   L E  N     L +++   G            + +
Sbjct: 192  AREVLLKLEEETDALEQKMEEIPPLLKELQNEFPDQLEELKA--GYREMTEEGYHFDHLD 249

Query: 1554 IQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTV 1613
            I+   + ++E+ ++   +++ L            DL +A +            +  +  +
Sbjct: 250  IEKELQDLKEQIDQNLALLEEL------------DLDEAEE----------ENEEIEERI 287

Query: 1614 QKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQK---SNETNVKVK 1670
              + D+L +   A+   E  I    D +  AR+   Q+  +LD  QQ    + +    V+
Sbjct: 288  DTLYDILEKEVKAKKFVEKNIDKLTDFLEHAREQNKQLLLELDRLQQSYTLNEDELETVR 347

Query: 1671 LLQERLKSIQAGFLQNGRSAMDVENEEK---NLEKEVALAQKQASGLRSRYQETDNKL-- 1725
             L+++L+ ++A + Q       +  ++     L++E+   +KQ   +    +E    L  
Sbjct: 348  ELEKQLEELEAQYDQ---LVERIAEKKVAYSELQEELEEIEKQLEEIEKEQEELSESLQG 404

Query: 1726 LSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLN------ETEAMFNSQESELTELSK 1779
            L K E      AR + L     +L       L++ N      +    F     E+  L+ 
Sbjct: 405  LRKDE----LEAR-EKLQEYRQKLH-EIKRYLEKSNLPGLPEDYLEYFFVVSDEIEALAD 458

Query: 1780 -------NIAELQKR---IQSCINFIVDKSNN 1801
                   N+  + ++       +  + +K+  
Sbjct: 459  ELNEVPINMDAVNRQLEEATDDVETLKEKTEE 490



 Score = 47.6 bits (114), Expect = 6e-05
 Identities = 98/550 (17%), Positives = 188/550 (34%), Gaps = 171/550 (31%)

Query: 1217 ELLEKKIEDVKA----LVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQ 1272
            + +E+ IE +      L E+  ++  ++  +     +++K ++          + G    
Sbjct: 111  DTIEEDIEQILEELNELKESEEKNRKEVEELKDKYRELRKTLLAKSF------SYGPALD 164

Query: 1273 RINLARLA-LNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQS 1331
             +   +L  L +  E+  +L ++G+ L                    +A +  LKL+E++
Sbjct: 165  ELE-KQLDELEEEFEQFVELTESGDYL--------------------EAREVLLKLEEET 203

Query: 1332 TETEKQINDAERQCKRTENLVTKTFP-QFT-------QGQEEN--------EKALQRLNE 1375
               E+++ +     K  +N     FP Q         +  EE         EK LQ L E
Sbjct: 204  DALEQKMEEIPPLLKELQN----EFPDQLEELKAGYREMTEEGYHFDHLDIEKELQDLKE 259

Query: 1376 KITSLEKNLPDL--------NELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSA 1427
            +I      L +L        NE I ++     D L                  L +   A
Sbjct: 260  QIDQNLALLEELDLDEAEEENEEIEER----IDTL---------------YDILEKEVKA 300

Query: 1428 KDYAETSKKLINEKESKAEELRKHMII----LSESNSAKDYAETSKKLINEKEAKAEEVF 1483
            K + E +   + +    A E  K +++    L +S     Y       +NE E    E  
Sbjct: 301  KKFVEKNIDKLTDFLEHAREQNKQLLLELDRLQQS-----Y------TLNEDEL---ETV 346

Query: 1484 RGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIR 1543
            R +   KQ       L  +       +  K  AY +      +++++ L           
Sbjct: 347  RELE--KQLEE----LEAQYDQLVERIAEKKVAY-SELQEELEEIEKQL----------- 388

Query: 1544 NISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKA-NDLKR---KA 1599
                           E+I +   E++  ++ L   +    E   +  +  +++KR   K+
Sbjct: 389  ---------------EEIEKEQEELSESLQGLRKDELEAREKLQEYRQKLHEIKRYLEKS 433

Query: 1600 NLTKAGAD------STKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITN 1653
            NL     D         + ++ + D L E     +M  V             + L + T+
Sbjct: 434  NLPGLPEDYLEYFFVVSDEIEALADELNEVPI--NMDAVN------------RQLEEATD 479

Query: 1654 DLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASG 1713
            D++  ++K+ E       L E+L  IQ              N  ++  +EVA A K+A  
Sbjct: 480  DVETLKEKTEELVDNA-TLAEQL--IQYA------------NRYRSDNEEVAEALKEAER 524

Query: 1714 L--RSRYQET 1721
            L     Y+E 
Sbjct: 525  LFENYDYKEA 534



 Score = 36.0 bits (84), Expect = 0.22
 Identities = 39/178 (21%), Positives = 80/178 (44%), Gaps = 31/178 (17%)

Query: 1214 KEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQR 1273
            KE + L+++I+   AL+E       DL       ++++++I           TL +  ++
Sbjct: 252  KELQDLKEQIDQNLALLEEL-----DLDEAEEENEEIEERI----------DTLYDILEK 296

Query: 1274 INLARLALNDLTEEIKKL-NKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQST 1332
               A+     + + I KL +      ++N  LL E      L+  +Q++  +    E   
Sbjct: 297  EVKAK---KFVEKNIDKLTDFLEHAREQNKQLLLE------LDRLQQSYTLNEDELETVR 347

Query: 1333 ETEKQINDAERQCKR-TENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNE 1389
            E EKQ+ + E Q  +  E +  K    +++ QEE    L+ + +++  +EK   +L+E
Sbjct: 348  ELEKQLEELEAQYDQLVERIAEKKVA-YSELQEE----LEEIEKQLEEIEKEQEELSE 400


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
            protein is found in bacteria and eukaryotes. Proteins in
            this family are typically between 943 to 1234 amino acids
            in length. This family contains a P-loop motif suggesting
            it is a nucleotide binding protein. It may be involved in
            replication.
          Length = 1198

 Score = 55.1 bits (133), Expect = 4e-07
 Identities = 34/168 (20%), Positives = 63/168 (37%), Gaps = 28/168 (16%)

Query: 1621 TEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVK---LLQERLK 1677
            TE R     AE A+Q+A      A + L Q   +L++ ++   E    +K   L  +RL+
Sbjct: 603  TELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDLQRLQ 662

Query: 1678 SIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRA 1737
            + Q             ++ +  LE  +A  ++QA       +    +L   A+   L   
Sbjct: 663  NEQ-------------QSLKDKLELAIAERKQQA-------ETQLRQL--DAQLKQLLEQ 700

Query: 1738 RGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQ 1785
            +   L           T +L +    E      +++L +LS  I   +
Sbjct: 701  QQAFLEALKDDFRELRTERLAKWQVVEG---ELDNQLAQLSAAIEAAR 745



 Score = 48.9 bits (117), Expect = 3e-05
 Identities = 91/552 (16%), Positives = 186/552 (33%), Gaps = 106/552 (19%)

Query: 1282 NDLTEEIKKLNKTGEMLKE--NATLLQENNVEGALE--LTRQAHDTSLKLKEQSTETEKQ 1337
              L  +++K N+    ++E  +           ALE  L +Q     L+  E+  E E +
Sbjct: 391  EQLERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLEAGKLEFNEEEYELELR 450

Query: 1338 INDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLN--ELICDKR 1395
            +   +++     +  T T P+  +  E N++AL++  E+    E N+  L        KR
Sbjct: 451  LGRLKQR----LDSATAT-PEELEQLEINDEALEKAQEEQEQAEANVEQLQSELRQLRKR 505

Query: 1396 GDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIIL 1455
             D        A              L +   A    E   +L  +  S    LR      
Sbjct: 506  RDEALEALQRA-----------ERRLLQLRQA--LDELELQLSPQAGSLLHFLRNE---- 548

Query: 1456 SESNSAKDYAETSKKLINEK------------EAKAEEVFRGITTAKQESHAANILAKEA 1503
                 A  + E+  K+I+ +            E    +   G++           L  + 
Sbjct: 549  -----APGWEESIGKVISPELLERTDLDPQLVEGSDSDTLYGVS-----------LDLQR 592

Query: 1504 YDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRE 1563
             D      N+T            +L E L      L         AV K  +   EQ+ +
Sbjct: 593  LDVPDYAANET------------ELRERLQQAEEALQS-------AVAK-QKQAEEQLVQ 632

Query: 1564 RANEINNVIKSLTDIDTILTETAGDLAKANDLKR--KANLTKAGADSTKNTVQKIVDVLT 1621
               E+    ++  +  T L +   DL +  + ++  K  L  A A+  +    ++  +  
Sbjct: 633  ANAELEEQKRAEAEARTALKQARLDLQRLQNEQQSLKDKLELAIAERKQQAETQLRQLDA 692

Query: 1622 EARTAQDMAEVAIQTAKDDISAARKDLSQ----ITNDLDD------AQQKSNETNVKVKL 1671
            + +   +  +  ++  KDD    R +       +  +LD+      A  ++  T  K +L
Sbjct: 693  QLKQLLEQQQAFLEALKDDFRELRTERLAKWQVVEGELDNQLAQLSAAIEAARTQAKARL 752

Query: 1672 LQERLKSIQAGFLQN----GRSAMDVENEEKNLEKEVALAQKQASGLRSR---YQETDNK 1724
              + LK      L +      +  +++ + + LE  +     +   +R      QET   
Sbjct: 753  --KELKKQYDRELASLDVDPNTVKELKRQIEELETTIERIAVRRPEVREYRAFMQETWLH 810

Query: 1725 LLS--------KAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTE 1776
              S          +   L+ +  ++       +   T  + K+L +       Q  +L E
Sbjct: 811  RDSLREERPNLAIQLRELESSAEELQQELTRLIKD-TKLRRKKLEQERKALEKQLDQLDE 869

Query: 1777 LSKNIAELQKRI 1788
            L + + +  +++
Sbjct: 870  LLRGLRDEMRQL 881



 Score = 43.9 bits (104), Expect = 0.001
 Identities = 31/204 (15%), Positives = 71/204 (34%), Gaps = 25/204 (12%)

Query: 1595 LKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITND 1654
            L+ +    +    S     ++  + L +A    +  + A   A+  +  AR DL ++ N 
Sbjct: 605  LRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDLQRLQN- 663

Query: 1655 LDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGL 1714
                 ++ +  +     + ER +  +            ++ + K L ++     +    L
Sbjct: 664  -----EQQSLKDKLELAIAERKQQAE-------TQLRQLDAQLKQLLEQQQAFLEA---L 708

Query: 1715 RSRYQETDNKLLSK-AESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNS---- 1769
            +  ++E   + L+K     G    +   L            A+LK+L   +         
Sbjct: 709  KDDFRELRTERLAKWQVVEGELDNQLAQLSAAIEAARTQAKARLKELK--KQYDRELASL 766

Query: 1770 --QESELTELSKNIAELQKRIQSC 1791
                + + EL + I EL+  I+  
Sbjct: 767  DVDPNTVKELKRQIEELETTIERI 790



 Score = 39.7 bits (93), Expect = 0.018
 Identities = 46/291 (15%), Positives = 106/291 (36%), Gaps = 58/291 (19%)

Query: 1539 LADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRK 1598
            +ADI+ +  +          +++     ++    + L  ++  L    G+L    D +R 
Sbjct: 232  IADIQALRAI----------QKVAPEIEKLQEDFEQLLSLELRLQHLHGELV--ADEERL 279

Query: 1599 ANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLD-- 1656
            A   +   ++     Q       + RT +D  + A      ++SAA   L+   ++L+  
Sbjct: 280  AEEQEERQEAKNRLRQ-------QLRTLEDQLKEARDELNQELSAANAKLAADRSELELL 332

Query: 1657 ----------DAQQKS-------------NETNVKVKLLQERLKSIQAGFLQNGRSAMDV 1693
                      D +Q                E   ++  L  + + +Q  + +  +  +  
Sbjct: 333  EDQKGAFEDADIEQLQADLDQLPSIRSELEEVEARLDALTGKHQDVQRKYERL-KQKIKE 391

Query: 1694 ENE---EKNLEKEVALAQKQASGL---RSRYQETDNKLLSKAESSGLKRARGQML----L 1743
            + E   EKN E+  A+ +++           Q  +++L  + E+  L+    +      L
Sbjct: 392  QLERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLEAGKLEFNEEEYELELRL 451

Query: 1744 NKASQLSVNTTA---KLKQLNETEAMFNSQESELTELSKNIAELQKRIQSC 1791
             +  Q   + TA   +L+QL   +      + E  +   N+ +LQ  ++  
Sbjct: 452  GRLKQRLDSATATPEELEQLEINDEALEKAQEEQEQAEANVEQLQSELRQL 502



 Score = 32.4 bits (74), Expect = 2.7
 Identities = 32/179 (17%), Positives = 65/179 (36%), Gaps = 31/179 (17%)

Query: 1219 LEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNE----DPNGVGKTLGNTTQRI 1274
            L+ ++  + A +E   R+          + ++KKQ  + E    D +             
Sbjct: 730  LDNQLAQLSAAIEAA-RTQAK-----ARLKELKKQ-YDRELASLDVDP-----------N 771

Query: 1275 NLARLA--LNDLTEEIKKLNKTGEMLKENATLLQENNV-EGALELTRQAHDTSLKLKEQS 1331
             +  L   + +L   I+++      ++E    +QE  +   +L   R        L  Q 
Sbjct: 772  TVKELKRQIEELETTIERIAVRRPEVREYRAFMQETWLHRDSLREERPN------LAIQL 825

Query: 1332 TETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNEL 1390
             E E    + +++  R          +  Q ++  EK L +L+E +  L   +  L EL
Sbjct: 826  RELESSAEELQQELTRLIKDTKLRRKKLEQERKALEKQLDQLDELLRGLRDEMRQLAEL 884


>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail.  The myosin molecule is a
            multi-subunit complex made up of two heavy chains and
            four light chains it is a fundamental contractile protein
            found in all eukaryote cell types. This family consists
            of the coiled-coil myosin heavy chain tail region. The
            coiled-coil is composed of the tail from two molecules of
            myosin. These can then assemble into the macromolecular
            thick filament. The coiled-coil region provides the
            structural backbone the thick filament.
          Length = 859

 Score = 53.5 bits (129), Expect = 9e-07
 Identities = 125/657 (19%), Positives = 245/657 (37%), Gaps = 155/657 (23%)

Query: 1208 ASGAYTKEFELLEKK---IEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVG 1264
            A GA   + EL +K+   +  ++  +E       +       +  ++K+  + +      
Sbjct: 86   AGGATAAQIELNKKREAELAKLRKDLEEANLQHEE------ALATLRKK--HQD------ 131

Query: 1265 KTLGNTTQRINLARLALNDLTEEIKKLNKT--------GEMLKEN----ATLLQENNVEG 1312
                           A+N+L+E+I++L K          ++  E     A L Q    + 
Sbjct: 132  ---------------AINELSEQIEQLQKQKAKAEKEKSQLQAEVDDLLAQLDQIAKAKL 176

Query: 1313 ALELTRQAHDTSL-KLKEQSTETEKQINDAERQCKR--TENLVTKTFPQFTQGQEENEKA 1369
              E   +  ++ L +L+ +  E ++Q+ND   Q  R  +EN         T+  EE E  
Sbjct: 177  NAEKKAKQLESQLSELQVKLDELQRQLNDLTSQKSRLQSENS------DLTRQLEEAEAQ 230

Query: 1370 LQRLNEKITSLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSES--NSA 1427
            +  L++  + LE  L +    + ++  +                       L        
Sbjct: 231  VSNLSKLKSQLESQLEEAKRSLEEESRE--------------------RANLQAQLRQLE 270

Query: 1428 KDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEV--FRG 1485
             D     ++L  E E+KAE  R+    LS++N+  +  +   K  +E   +AEE+   + 
Sbjct: 271  HDLDSLREQLEEESEAKAELERQ----LSKANA--EIQQWRSKFESEGALRAEELEELKK 324

Query: 1486 ITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLD----EFLNAPGATLAD 1541
                K           E  + A +   K  +   + S +  +L+    E   A  A    
Sbjct: 325  KLNQKIS---------ELEEAAEAANAKCDSLEKTKSRLQSELEDLQIELERANAAASEL 375

Query: 1542 IRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKA----NDLKR 1597
             +       +K +     ++ E   E++   +   ++ T L     +L +       L+R
Sbjct: 376  EKKQKNF--DKILAEWKRKVDELQAELDTAQREARNLSTELFRLKNELEELKDQVEALRR 433

Query: 1598 K--------ANLTKAGADSTKNT--VQKIVDVL----TEARTAQDMAEVAIQTAKDDISA 1643
            +         +LT    +  +N   ++K    L     E + A + AE A++  +  +  
Sbjct: 434  ENKNLQDEIHDLTDQLGEGGRNVHELEKARRRLEAEKDELQAALEEAEAALELEESKVLR 493

Query: 1644 ARKDLSQITNDLD-DAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEK 1702
            A+ +LSQI ++++    +K  E     K  Q  ++S+QA      +   +    +K LE 
Sbjct: 494  AQVELSQIRSEIERRLAEKEEEFENTRKNHQRAIESLQATLEAEAKGKAEASRLKKKLEG 553

Query: 1703 -----EVAL---------AQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLN---- 1744
                 E+AL         AQK       +YQ+   +L ++ E    +RAR          
Sbjct: 554  DINELEIALDHANKANAEAQKNV----KKYQQQVKELQTQVEEE--QRAREDAREQLAVA 607

Query: 1745 --KASQLSVNTTAKLKQLNETEAMFNSQ------ESELTELSKNIAELQKRIQSCIN 1793
              +A+ L     A+L++L    A+  ++      E+EL E S+ + EL  +  S I 
Sbjct: 608  ERRATALE----AELEEL--RSALEQAERARKQAETELAEASERVNELTAQNSSLIA 658



 Score = 43.5 bits (103), Expect = 0.001
 Identities = 111/563 (19%), Positives = 195/563 (34%), Gaps = 113/563 (20%)

Query: 1277 ARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEK 1336
            A  A ++L ++ K  +K     K     LQ   ++ A    R       +LK +  E + 
Sbjct: 368  ANAAASELEKKQKNFDKILAEWKRKVDELQAE-LDTAQREARNLSTELFRLKNELEELKD 426

Query: 1337 QINDAERQCKRTENLVTKTFPQFTQGQE---ENEKALQRLNEKITSLEKNLPDLNELICD 1393
            Q+    R+ K  ++ +     Q  +G     E EKA +RL  +   L+  L +    +  
Sbjct: 427  QVEALRRENKNLQDEIHDLTDQLGEGGRNVHELEKARRRLEAEKDELQAALEEAEAALEL 486

Query: 1394 KRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMI 1453
            +                      S    ++   ++  +E  ++L  EKE + E  RK+  
Sbjct: 487  EE---------------------SKVLRAQVELSQIRSEIERRL-AEKEEEFENTRKNHQ 524

Query: 1454 ILSESNSAKDYAET---------SKKL---INEKEAKAEEVFRGITTAKQESHAANILAK 1501
               ES  A   AE           KKL   INE E   +   +    A++         K
Sbjct: 525  RAIESLQATLEAEAKGKAEASRLKKKLEGDINELEIALDHANKANAEAQKNVKKYQQQVK 584

Query: 1502 EAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLA--VNKNIQSNPE 1559
            E        +          +   ++      A  A L ++R+    A    K  ++   
Sbjct: 585  ELQTQVEEEQRAREDAREQLAVAERR----ATALEAELEELRSALEQAERARKQAETELA 640

Query: 1560 QIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRK--ANLTKAGADSTKNTVQKIV 1617
            +  ER NE+     SL                    KRK    L    +D        + 
Sbjct: 641  EASERVNELTAQNSSLIAQ-----------------KRKLEGELAALQSD--------LD 675

Query: 1618 DVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKS--NETNVK-----VK 1670
            + + E + A++ A+ A             D +++  +L   Q+ S   E   K     VK
Sbjct: 676  EAVNELKAAEERAKKAQA-----------DAARLAEELRQEQEHSQHLERLRKQLESQVK 724

Query: 1671 LLQERLKSIQAGFLQNGRSAMDV-ENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKA 1729
             LQ RL   +A  L+ G+  +   E   + LE E+   Q+       R+ ET  K L K 
Sbjct: 725  ELQVRLDEAEAAALKGGKKMIQKLEARVRELEAELDGEQR-------RHAET-QKNLRKM 776

Query: 1730 E------SSGLKRARGQMLLNKASQLSVNTTAKLK----QLNETEAMFNSQESELTELSK 1779
            E         ++  +  +   +   L     AK+K    QL E E +     S+  +  +
Sbjct: 777  ERRVKELQFQVEEDKKNLERLQ--DLVDKLQAKIKTYKRQLEEAEEVAQINLSKYRKAQR 834

Query: 1780 NIAELQKR---IQSCINFIVDKS 1799
             + + ++R    +  +N +  KS
Sbjct: 835  ELEDAEERADTAERSLNKLRAKS 857



 Score = 36.9 bits (86), Expect = 0.12
 Identities = 41/212 (19%), Positives = 82/212 (38%), Gaps = 33/212 (15%)

Query: 1593 NDLKRK---ANLTKAGADSTKNTVQKIVDVLTEART----AQDMAEVAIQTAKDDISAAR 1645
            N L RK    +   +  +  +  V ++   + E        ++  E   + A+     AR
Sbjct: 11   NQLYRKESELSQLSSKLEDEQALVAQLQKKIKELEARIRELEEELEAE-RAARAKAEKAR 69

Query: 1646 KDLS----QITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLE 1701
             DLS    +++  L++A   +           E  K  +A   +  +   D+E      E
Sbjct: 70   ADLSRELEELSERLEEAGGATAAQ-------IELNKKREAELAKLRK---DLEEANLQHE 119

Query: 1702 KEVALAQKQASGLRSRYQET-DNKLLSKAESSGLKRARGQMLL---NKASQLSVNTTAKL 1757
            + +A  +K+     +   E  +   L K ++   ++ + Q+     +  +QL     AKL
Sbjct: 120  EALATLRKKHQDAINELSEQIEQ--LQKQKAK-AEKEKSQLQAEVDDLLAQLDQIAKAKL 176

Query: 1758 KQLNETEAMFNSQESELTELSKNIAELQKRIQ 1789
                  E      ES+L+EL   + ELQ+++ 
Sbjct: 177  ----NAEKKAKQLESQLSELQVKLDELQRQLN 204


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
            functions are known are involvedin recombination,
            recombinational repair, and/or non-homologous end
            joining.They are components of an exonuclease complex
            with MRE11 homologs. This family is distantly related to
            the SbcC family of bacterial proteins.This family is
            based on the phylogenomic analysis of JA Eisen (1999,
            Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 53.1 bits (127), Expect = 2e-06
 Identities = 97/564 (17%), Positives = 203/564 (35%), Gaps = 90/564 (15%)

Query: 1263 VGKTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQEN--NVEGALELTRQA 1320
              +   +   +  L +   +++ +E K L +T E+ KE     QE    V   L+    +
Sbjct: 410  AAQLCADLQSKERLKQEQADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKELQQLEGS 469

Query: 1321 HDTSLKLKEQSTETEKQINDAER----QCKRTE--NLVTKTFPQFTQGQEENEKALQRLN 1374
             D  L+L ++  + E++++ AE+    +  + E  +L  +      + ++ +++  Q  +
Sbjct: 470  SDRILELDQELRKAERELSKAEKNSLTETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNH 529

Query: 1375 EKIT-----SLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKD 1429
               T      L K+  D +E I   +    D L    G                  + K 
Sbjct: 530  HTTTRTQMEMLTKDKMDKDEQIRKIKSRHSDELTSLLG---------------YFPNKKQ 574

Query: 1430 YAETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEE-VFRGITT 1488
              +       E     + L K    L+     K++     +   E+ +  E+ +F    +
Sbjct: 575  LEDWLHSKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYEDKLFDVCGS 634

Query: 1489 AKQESHAANILA--KEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNIS 1546
              +ES    +    +++    A +   TA Y   +  IT+  DE             N S
Sbjct: 635  QDEESDLERLKEEIEKSSKQRAMLAGATAVY---SQFITQLTDE-------------NQS 678

Query: 1547 GLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGA 1606
               V + +     +++E  +++ + ++   D    L  T  +L K    +++ +     A
Sbjct: 679  CCPVCQRVFQTEAELQEFISDLQSKLRLAPDK---LKSTESELKK---KEKRRDEMLGLA 732

Query: 1607 DSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETN 1666
               ++ +      + E R         IQ  K+DI      L  I  + + A  K   T+
Sbjct: 733  PGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNDIEEQETLLGTIMPEEESA--KVCLTD 790

Query: 1667 VKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLL 1726
            V +    ER +             M++++ E+ + ++ A  + Q S L    Q+ + +  
Sbjct: 791  VTI---MERFQ-------------MELKDVERKIAQQAA--KLQGSDLDRTVQQVNQEKQ 832

Query: 1727 SKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIA---- 1782
             K        ++ + L  K  Q       + +Q+   ++  N  +SE  ++  N+     
Sbjct: 833  EKQHELDTVVSKIE-LNRKLIQ------DQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQ 885

Query: 1783 ------ELQKRIQSCINFIVDKSN 1800
                  EL   +QS I  I D   
Sbjct: 886  FEEQLVELSTEVQSLIREIKDAKE 909



 Score = 41.6 bits (97), Expect = 0.005
 Identities = 80/483 (16%), Positives = 180/483 (37%), Gaps = 68/483 (14%)

Query: 1281 LNDLTEEIKKLNKTGEMLKENATLLQE------NNVEGALELTRQAHDTSLKLKEQSTET 1334
            L  L EEI+K +K   ML     +  +      +  +    + ++   T  +L+E  ++ 
Sbjct: 641  LERLKEEIEKSSKQRAMLAGATAVYSQFITQLTDENQSCCPVCQRVFQTEAELQEFISDL 700

Query: 1335 EKQINDAERQCKRTENLVTKTFPQFTQ---GQEENEKALQRLNEKITSLEKNLPDLNELI 1391
            + ++  A  + K TE+ + K   +  +        +  +    ++I  L   L  +N  I
Sbjct: 701  QSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDI 760

Query: 1392 CDKRGDPCDN--LCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELR 1449
               + D  +   L G                + E  SAK    T   ++   + + +++ 
Sbjct: 761  QRLKNDIEEQETLLGT--------------IMPEEESAKV-CLTDVTIMERFQMELKDVE 805

Query: 1450 KHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAAS 1509
            + +   +      D   T +++  EK+ K  E    + +  + +       +E   H  S
Sbjct: 806  RKIAQQAAKLQGSDLDRTVQQVNQEKQEKQHE-LDTVVSKIELNRKLIQDQQEQIQHLKS 864

Query: 1510 VRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEIN 1569
              N+  +         ++  +F                           EQ+ E + E+ 
Sbjct: 865  KTNELKSEKLQIGTNLQRRQQF--------------------------EEQLVELSTEVQ 898

Query: 1570 NVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTA--- 1626
            ++I+ + D     +          D + K  L  +   S K    K+ D+  + +     
Sbjct: 899  SLIREIKDAKEQDSPLE--TFLEKDQQEKEELISSKETSNKKAQDKVNDIKEKVKNIHGY 956

Query: 1627 QDMAEVAIQTAKDDISAARK-DLSQITNDLDDAQQKSNETNVKVKLLQERLKS--IQAGF 1683
                E  IQ  KDD    ++ +L+ +   L++ ++   + N  ++L+++ + +  IQ  +
Sbjct: 957  MKDIENKIQDGKDDYLKQKETELNTVNAQLEECEKHQEKINEDMRLMRQDIDTQKIQERW 1016

Query: 1684 LQNGRSAMDVENEEKNLEKEVA-----LAQKQASGLRSRYQETDNKL--LSKAESSGLKR 1736
            LQ+  +    ENE K +E+E+      + Q Q   ++  +Q+ +  +  + +     L R
Sbjct: 1017 LQDNLTLRKRENELKEVEEELKQHLKEMGQMQVLQMKQEHQKLEENIDLIKRNHVLALGR 1076

Query: 1737 ARG 1739
             +G
Sbjct: 1077 QKG 1079



 Score = 36.9 bits (85), Expect = 0.13
 Identities = 76/401 (18%), Positives = 146/401 (36%), Gaps = 60/401 (14%)

Query: 1432 ETSKKLINEKESKAEELR------KHMI--ILSESNSAKDYAETSKKLINEK---EAKAE 1480
            E+S++++   E++ + L+      +H +  I+   N  K      K++  +    E K E
Sbjct: 234  ESSREIVKSYENELDPLKNRLKEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSELELKME 293

Query: 1481 EVFRGITTAKQ-----ESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAP 1535
            +VF+G  T +Q      +H   +  KE        + +          + ++  E L   
Sbjct: 294  KVFQG--TDEQLNDLYHNHQRTVREKE--RELVDCQRELEKLNKERRLLNQEKTELLVEQ 349

Query: 1536 GATL--ADIRNISGLAVNKNIQSNPEQIR----ERANEINNVIKSLTDIDTILTETAGDL 1589
            G     AD       A +  IQS   ++     ER       IK+     T++ E   D 
Sbjct: 350  GRLQLQADRHQEHIRARDSLIQSLATRLELDGFERGPFSERQIKNFH---TLVIERQEDE 406

Query: 1590 AKA-----NDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAA 1644
            AK       DL+ K  L +  AD       +I D         ++ +  ++  ++++   
Sbjct: 407  AKTAAQLCADLQSKERLKQEQAD-------EIRDEKKGLGRTIELKKEILEKKQEELKFV 459

Query: 1645 RKDLSQITNDLDDAQQKSN------------ETNVKVKLLQERLKSIQAGFLQNGRS--A 1690
             K+L Q+    D   +               E N   + L++ +KS+Q       R    
Sbjct: 460  IKELQQLEGSSDRILELDQELRKAERELSKAEKNSLTETLKKEVKSLQNEKADLDRKLRK 519

Query: 1691 MDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQM-LLNKASQL 1749
            +D E E+ N        + Q   L ++ +   ++ + K +S         +       QL
Sbjct: 520  LDQEMEQLNHHTT---TRTQMEML-TKDKMDKDEQIRKIKSRHSDELTSLLGYFPNKKQL 575

Query: 1750 SVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQS 1790
                 +K K++N+T         EL  L +N   +   ++S
Sbjct: 576  EDWLHSKSKEINQTRDRLAKLNKELASLEQNKNHINNELES 616



 Score = 36.9 bits (85), Expect = 0.14
 Identities = 92/565 (16%), Positives = 191/565 (33%), Gaps = 78/565 (13%)

Query: 1267 LGNTTQRINLARLALNDLTEEIKKLNKTG-EMLKENATLLQENNVEGALELTRQAHDTSL 1325
            L N  + I      +  L  EIK L     +M K+N+ L                    L
Sbjct: 250  LKNRLKEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSEL-------------------EL 290

Query: 1326 KLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLP 1385
            K+++    T++Q+ND     +RT     +      +  E+  K  + LN++ T L     
Sbjct: 291  KMEKVFQGTDEQLNDLYHNHQRTVREKERELVDCQRELEKLNKERRLLNQEKTELLVEQG 350

Query: 1386 DLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKA 1445
             L +L  D+                   +  S  T  E +  +    + +++ N      
Sbjct: 351  RL-QLQADRH--QEHIR-------ARDSLIQSLATRLELDGFERGPFSERQIKNFHTLVI 400

Query: 1446 EEL-RKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAY 1504
            E    +           +      ++  +E   + + + R I   K+      IL K+  
Sbjct: 401  ERQEDEAKTAAQLCADLQSKERLKQEQADEIRDEKKGLGRTIELKKE------ILEKKQ- 453

Query: 1505 DHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRER 1564
                 ++      +     +    D  L          R +S    N   ++  ++++  
Sbjct: 454  ---EELKFV----IKELQQLEGSSDRILELDQELRKAERELSKAEKNSLTETLKKEVKSL 506

Query: 1565 ANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEAR 1624
             NE  ++ + L  +D    E            +   LTK   D  +  ++KI    ++  
Sbjct: 507  QNEKADLDRKLRKLDQ---EMEQLNHHTTTRTQMEMLTKDKMDKDEQ-IRKIKSRHSDEL 562

Query: 1625 TAQ-------DMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLK 1677
            T+           E  + +   +I+  R  L+++  +L   +Q  N  N +++  +E+L 
Sbjct: 563  TSLLGYFPNKKQLEDWLHSKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLS 622

Query: 1678 SIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQ------ASGLRSRYQET---------- 1721
            S +         + D E++ + L++E+  + KQ      A+ + S++             
Sbjct: 623  SYEDKLFD-VCGSQDEESDLERLKEEIEKSSKQRAMLAGATAVYSQFITQLTDENQSCCP 681

Query: 1722 --DNKLLSKAESSGLKRARGQMLL---NKASQLSVNTTAKLKQLNETEAMFNSQESELTE 1776
                   ++AE           L    +K          K K+ +E   +   ++S +  
Sbjct: 682  VCQRVFQTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIIDL 741

Query: 1777 LSKNIAELQKRIQSCINFIVDKSNN 1801
              K I EL+ ++Q     I    N+
Sbjct: 742  KEKEIPELRNKLQKVNRDIQRLKND 766



 Score = 35.0 bits (80), Expect = 0.51
 Identities = 35/184 (19%), Positives = 76/184 (41%), Gaps = 16/184 (8%)

Query: 1612 TVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKL 1671
            T++++     +      M    ++  K+     R  ++     L+ +++       ++  
Sbjct: 190  TLRQVRQTQGQKVQEHQMELKYLKQYKEKACEIRDQITSKEAQLESSREIVKSYENELDP 249

Query: 1672 LQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSR----YQETDNKLLS 1727
            L+ RLK I+     N    M ++NE K L+      +K  S L  +    +Q TD +L  
Sbjct: 250  LKNRLKEIE----HNLSKIMKLDNEIKALKSRKKQMEKDNSELELKMEKVFQGTDEQL-- 303

Query: 1728 KAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKR 1787
                + L     + +  K  +L V+   +L++LN+   + N +++EL  + +   +LQ  
Sbjct: 304  ----NDLYHNHQRTVREKEREL-VDCQRELEKLNKERRLLNQEKTEL-LVEQGRLQLQAD 357

Query: 1788 IQSC 1791
                
Sbjct: 358  RHQE 361


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists of
            several eukaryotic mitotic checkpoint (Mitotic arrest
            deficient or MAD) proteins. The mitotic spindle
            checkpoint monitors proper attachment of the bipolar
            spindle to the kinetochores of aligned sister chromatids
            and causes a cell cycle arrest in prometaphase when
            failures occur. Multiple components of the mitotic
            spindle checkpoint have been identified in yeast and
            higher eukaryotes. In S.cerevisiae, the existence of a
            Mad1-dependent complex containing Mad2, Mad3, Bub3 and
            Cdc20 has been demonstrated.
          Length = 722

 Score = 51.9 bits (124), Expect = 3e-06
 Identities = 86/594 (14%), Positives = 194/594 (32%), Gaps = 85/594 (14%)

Query: 1217 ELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQRINL 1276
              LE+K   +    E     + +L V L       K +   E             +   L
Sbjct: 89   IELERKASTLAENYERELDRNLELEVRL-------KALEELE--KKAENEAAEAEEEAKL 139

Query: 1277 ARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETE- 1335
             +  L+    E  KL    E   + A            E+  +A +   +LK   +E E 
Sbjct: 140  LKDKLD---AESLKLQNEKEDQLKEAKESISRIKNDLSEMQCRAQNADTELKLLESELEE 196

Query: 1336 -KQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDK 1394
             ++  +  ++         ++         +N   ++ L E++   E++   +  +    
Sbjct: 197  LREQLEECQKELAEAEKKLQSLTSEQASSADNSVKIKHLEEELKRYEQDAEVVKSMKEQL 256

Query: 1395 RGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMII 1454
               P                        E  + ++     + +  + E   EEL      
Sbjct: 257  LQIP--------------------ELERELAALREENRKLRSMKEDNELLKEELEDLQSR 296

Query: 1455 LSESNSAKDYAETSK----KLINE-KEAK--AEEVFRGITTAKQESHAANILAKEAYDHA 1507
            L      ++     +    KL NE K  K   +++   + T    S    +L  E     
Sbjct: 297  LERFEKMREKLADLELEKEKLENELKSWKSLLQDIGLNLRTPDDLSRRIVVLQNEELQLK 356

Query: 1508 ASVRNKTAAYVASTSNITKQL--DEFLNA-PGATLADIRNISGLAVNKNIQSNPEQIRER 1564
               +N + +  A     T Q    E   A         +  +  A+ + +Q     + + 
Sbjct: 357  E--KNGSISSSAKQLETTLQQLQLERQKAVSEILELKKKLEALKALVRRLQRRLTLVTKE 414

Query: 1565 ANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEAR 1624
             + +  ++ S     T  + +   + +   L+   +L +    S    ++  +  L E  
Sbjct: 415  RDGLRAILNSYDKELTETSVSGQLMKR---LEEAEDLVQ-KVQSHLAKMENQLSELEE-- 468

Query: 1625 TAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFL 1684
                  +V  Q  KD  +    ++  +   L   ++  +        L+ ++++++    
Sbjct: 469  ------DVGQQ--KDRNNTLETEIKLLKEQLVSNERLLSFVKEATNALRLKIETLER--- 517

Query: 1685 QNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLK--RARGQML 1742
            +  R    +  E+  LE ++     +   L+  Y  +  K+L  + +   +  +   Q +
Sbjct: 518  ERDR----LRQEKSLLEMKL-----EHLCLQGDYNASRTKVLHMSLNPASEAEQIAKQTI 568

Query: 1743 LNKASQLSVNTTAKLKQLNETEAMFNSQESELTEL------SKNIAELQKRIQS 1790
                ++       KLK+  +      SQ  +L +       SK IA+L+K+++S
Sbjct: 569  EALQAECE-----KLKERLQALEEGKSQPGDLEKAVGSHISSKEIAQLKKQVES 617



 Score = 43.8 bits (103), Expect = 0.001
 Identities = 44/275 (16%), Positives = 92/275 (33%), Gaps = 35/275 (12%)

Query: 1522 SNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPE-QIRERANEINNVIKSLTDIDT 1580
            S +   L +FL+ P      +     L V+ +  S+ + Q+ E    +      L    +
Sbjct: 17   SILRSDLPKFLSQP------LEGSHHLGVSTSALSSLQKQVEESMTLLQRA--ELIRSKS 68

Query: 1581 ILTETAGDLAKAN--------DLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEV 1632
             L +   +L +          +L+RKA+      +   +   ++   L      +  AE 
Sbjct: 69   KLIQLENELMQKELEHKRAQIELERKASTLAENYERELDRNLELEVRLKALEELEKKAEN 128

Query: 1633 AIQTAKDDISAARKDLSQITNDL--DDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSA 1690
                A+++    +  L   +  L  +   Q                  I+    +    A
Sbjct: 129  EAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESIS--------RIKNDLSEMQCRA 180

Query: 1691 MDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLS 1750
             + + E K LE E+   ++Q    +    E + KL        L   +     N  S   
Sbjct: 181  QNADTELKLLESELEELREQLEECQKELAEAEKKL------QSLTSEQASSADN--SVKI 232

Query: 1751 VNTTAKLKQLNETEAMFNSQESELTELSKNIAELQ 1785
             +   +LK+  +   +  S + +L ++ +   EL 
Sbjct: 233  KHLEEELKRYEQDAEVVKSMKEQLLQIPELERELA 267



 Score = 38.8 bits (90), Expect = 0.031
 Identities = 83/493 (16%), Positives = 161/493 (32%), Gaps = 105/493 (21%)

Query: 1314 LELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRL 1373
            ++L  +     L+ K    E E++ +      +R                 E E  L+ L
Sbjct: 71   IQLENELMQKELEHKRAQIELERKASTLAENYERE-----------LDRNLELEVRLKAL 119

Query: 1374 NEKITSLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKD-YAE 1432
             E     E    +  E                                 E+   KD    
Sbjct: 120  EELEKKAENEAAEAEE---------------------------------EAKLLKDKLDA 146

Query: 1433 TSKKLINEKESKAEELRKHMIILSESNSAKDY------AET-SKKLINEKE---AKAEEV 1482
             S KL NEKE + +E ++   I    N   +       A+T  K L +E E    + EE 
Sbjct: 147  ESLKLQNEKEDQLKEAKES--ISRIKNDLSEMQCRAQNADTELKLLESELEELREQLEEC 204

Query: 1483 FRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADI 1542
             + +  A+++  +       + D+  SV+ K              L+E L         +
Sbjct: 205  QKELAEAEKKLQSLTSEQASSADN--SVKIK-------------HLEEELKRYEQDAEVV 249

Query: 1543 R-NISGLAVNKNIQSNPEQIRERANEINNVIKS-------LTDIDTILTETAGDLAKAND 1594
            +     L     ++     +RE   ++ ++ +        L D+ + L        K  D
Sbjct: 250  KSMKEQLLQIPELERELAALREENRKLRSMKEDNELLKEELEDLQSRLERFEKMREKLAD 309

Query: 1595 LKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDI-SAARKDLSQI-T 1652
            L+ +    +    S K+ +Q   D+    RT  D++   +    +++    +        
Sbjct: 310  LELEKEKLENELKSWKSLLQ---DIGLNLRTPDDLSRRIVVLQNEELQLKEKNGSISSSA 366

Query: 1653 NDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQAS 1712
              L+   Q+          LQ   +   +  L+  +    ++   + L++ + L  K+  
Sbjct: 367  KQLETTLQQ----------LQLERQKAVSEILELKKKLEALKALVRRLQRRLTLVTKERD 416

Query: 1713 GLRSRYQETDNKL----LSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNET----E 1764
            GLR+     D +L    +S      L+ A    L+ K          +L +L E     +
Sbjct: 417  GLRAILNSYDKELTETSVSGQLMKRLEEAED--LVQKVQSHLAKMENQLSELEEDVGQQK 474

Query: 1765 AMFNSQESELTEL 1777
               N+ E+E+  L
Sbjct: 475  DRNNTLETEIKLL 487



 Score = 33.0 bits (75), Expect = 1.9
 Identities = 32/151 (21%), Positives = 61/151 (40%), Gaps = 11/151 (7%)

Query: 1648 LSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALA 1707
            L Q+ N+L    QK  E       L+ +  ++   + +     +++E   K LE+    A
Sbjct: 70   LIQLENEL---MQKELEHKRAQIELERKASTLAENYERELDRNLELEVRLKALEELEKKA 126

Query: 1708 QKQASGLRSRYQETDNKL------LSKAESSGLKRARGQ--MLLNKASQLSVNTTAKLKQ 1759
            + +A+      +   +KL      L   +   LK A+     + N  S++         +
Sbjct: 127  ENEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESISRIKNDLSEMQCRAQNADTE 186

Query: 1760 LNETEAMFNSQESELTELSKNIAELQKRIQS 1790
            L   E+       +L E  K +AE +K++QS
Sbjct: 187  LKLLESELEELREQLEECQKELAEAEKKLQS 217


>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
          Length = 1486

 Score = 51.9 bits (125), Expect = 4e-06
 Identities = 71/373 (19%), Positives = 140/373 (37%), Gaps = 73/373 (19%)

Query: 1443 SKAEELRKHMIILSESNSAKDYAETSKKL-INEKEAKA--EEVFRGITTAKQESHAANIL 1499
                +L KH+I  S +  A DY   + +  ++ +EA     E++        E +    +
Sbjct: 253  QSDRDLFKHLITESTNYVAADYMRHANERRVHLEEALELRRELYTSRRQLAAEQYRLVEM 312

Query: 1500 AKE-----------------AYDHAASVRN------KTAAYVASTSNITKQLDEFLNAPG 1536
            A+E                 A DH   V+       K   Y A    + ++L+E      
Sbjct: 313  ARELAELNEAESDLEQDYQAASDHLNLVQTALRQQEKIERYQADLEELEERLEE------ 366

Query: 1537 ATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLK 1596
                         V +      E+   RA           ++D + ++ A D  +A D++
Sbjct: 367  ----------QNEVVEEADEQQEENEARAEAAE------EEVDELKSQLA-DYQQALDVQ 409

Query: 1597 RKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLD 1656
            +    T+A         Q+ V  L  A+    + ++    A+D +   +    + T +L 
Sbjct: 410  Q----TRAIQ------YQQAVQALERAKQLCGLPDLTADNAEDWLEEFQAKEQEATEELL 459

Query: 1657 DAQQKSNETNVKVKLLQERLKSIQAGFLQNGRS-AMDVENE-EKNLEKEVALAQKQASGL 1714
              +QK +         ++  + ++    +  RS A DV  E  + L ++  LA+ Q   L
Sbjct: 460  SLEQKLSVAQAAHSQFEQAYQLVRKIAGEVSRSEAWDVARELLRRLREQRHLAE-QLQQL 518

Query: 1715 RSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESEL 1774
            R R  E + +L  +      +R   +    K    +++   +L+QL E        E+ L
Sbjct: 519  RMRLSELEQRLRQQQR---AERLLAE--FCKRLGKNLDDEDELEQLQE------ELEARL 567

Query: 1775 TELSKNIAELQKR 1787
              LS++++E ++R
Sbjct: 568  ESLSESVSEARER 580



 Score = 33.4 bits (77), Expect = 1.5
 Identities = 32/146 (21%), Positives = 54/146 (36%), Gaps = 29/146 (19%)

Query: 1643 AARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEK 1702
              R++L      L   Q +  E   ++  L E    ++    Q+ ++A D  N  +    
Sbjct: 290  ELRRELYTSRRQLAAEQYRLVEMARELAELNEAESDLE----QDYQAASDHLNLVQT--- 342

Query: 1703 EVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNE 1762
              AL Q++      RYQ    +L  + E                 Q  V   A  +Q  E
Sbjct: 343  --ALRQQEKIE---RYQADLEELEERLEE----------------QNEVVEEAD-EQQEE 380

Query: 1763 TEAMFNSQESELTELSKNIAELQKRI 1788
             EA   + E E+ EL   +A+ Q+ +
Sbjct: 381  NEARAEAAEEEVDELKSQLADYQQAL 406



 Score = 33.0 bits (76), Expect = 2.0
 Identities = 38/192 (19%), Positives = 78/192 (40%), Gaps = 41/192 (21%)

Query: 1606 ADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNET 1665
            A  T+   ++    LTE    Q  A  + + A + ++       ++   L+ A+Q+    
Sbjct: 947  AQQTQRDAKQQAFALTEVV--QRRAHFSYEDAAEMLAKNSDLNEKLRQRLEQAEQERTR- 1003

Query: 1666 NVKVKLLQERLKSIQAGFLQNG------RSAMDVENEE-KNLEKEVALAQKQASGLRSRY 1718
                   +E+L+  QA   Q        +S+ D + +  + L++E+     Q  G+    
Sbjct: 1004 ------AREQLRQAQAQLAQYNQVLASLKSSYDAKRQMLQELKQEL-----QDLGVP--- 1049

Query: 1719 QETDNKLLSKAESSGLKRAR-GQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTEL 1777
                      A+S   +RAR  +  L+   +LS N + +    N+ E      E+E+  L
Sbjct: 1050 ----------ADSGAEERARARRDELHA--RLSANRSRR----NQLEKQLTFCEAEMDNL 1093

Query: 1778 SKNIAELQKRIQ 1789
            +K + +L++   
Sbjct: 1094 TKKLRKLERDYH 1105


>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
            sigma54-dependent transcription [Transcription / Signal
            transduction mechanisms].
          Length = 225

 Score = 49.2 bits (118), Expect = 5e-06
 Identities = 39/191 (20%), Positives = 76/191 (39%), Gaps = 36/191 (18%)

Query: 1595 LKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITND 1654
              R  +L KA  +     + K  D        + M E AI+  + +++ AR+ L+Q    
Sbjct: 4    FSRLKDLVKANINEL---LDKAED-------PEKMLEQAIRDMESELAKARQALAQAI-- 51

Query: 1655 LDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEE---KNLEKEVALAQKQA 1711
               A+QK  E   K++  Q R + ++    +    A+   NE+   + LE E    +  A
Sbjct: 52   ---ARQKQLER--KLEEAQARAEKLE----EKAELALQAGNEDLAREALE-EKQSLEDLA 101

Query: 1712 SGLRSRYQETDNKLL-SKAESSGLKR----ARGQMLLNKASQLSVNTTAKLKQL------ 1760
              L +  Q+ + ++   K + + L++     R +    KA + +     K+ +       
Sbjct: 102  KALEAELQQAEEQVEKLKKQLAALEQKIAELRAKKEALKARKAAAKAQEKVNRSLGGGSS 161

Query: 1761 NETEAMFNSQE 1771
            +   A F   E
Sbjct: 162  SSAMAAFERME 172



 Score = 34.6 bits (80), Expect = 0.34
 Identities = 28/156 (17%), Positives = 50/156 (32%), Gaps = 26/156 (16%)

Query: 1247 IDDMKKQIMNNEDPNGVGKTLGNTTQRINLARLALNDLTEEIKKLNKTGE--MLKENATL 1304
            I DM+ ++          + L     R       L +     +KL +  E  +   N  L
Sbjct: 33   IRDMESELAK------ARQALAQAIARQKQLERKLEEAQARAEKLEEKAELALQAGNEDL 86

Query: 1305 LQE-----NNVEGALELTRQ----AHDTSLKLKEQSTETEKQINDAERQCKRTENL---V 1352
             +E      ++E   +        A +   KLK+Q    E++I +   + K         
Sbjct: 87   AREALEEKQSLEDLAKALEAELQQAEEQVEKLKKQLAALEQKIAELRAK-KEALKARKAA 145

Query: 1353 TKTFPQFTQ-----GQEENEKALQRLNEKITSLEKN 1383
             K   +  +            A +R+ EKI   E  
Sbjct: 146  AKAQEKVNRSLGGGSSSSAMAAFERMEEKIEEREAR 181


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 51.7 bits (123), Expect = 5e-06
 Identities = 84/367 (22%), Positives = 153/367 (41%), Gaps = 41/367 (11%)

Query: 1431 AETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAK 1490
            A+ +KK   E++ KA+E +K      +++ AK  AE +KK  +  + KAEE  +    AK
Sbjct: 1292 ADEAKK--AEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAK 1349

Query: 1491 QESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAV 1550
             E+ AA   A+ A + A +   K            K+ +E   A                
Sbjct: 1350 AEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKA---------------- 1393

Query: 1551 NKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTK 1610
                +   E+ +++A+E+     +    D    + A +  KA++ K+KA   K  AD  K
Sbjct: 1394 -DEAKKKAEEDKKKADELKKAAAAKKKADE-AKKKAEEKKKADEAKKKAEEAKK-ADEAK 1450

Query: 1611 NTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKV- 1669
               +       EA+ A++  + A +  K D     K  ++     D+A++K+ E   K  
Sbjct: 1451 KKAE-------EAKKAEEAKKKAEEAKKAD---EAKKKAEEAKKADEAKKKAEEAKKKAD 1500

Query: 1670 ---KLLQERLKSIQAGFLQNGRSAMDVEN-EEKNLEKEVALAQ--KQASGLRSRYQETDN 1723
               K  + + K+ +A   +  + A + +  EE     E   A+  K+A  L+   +    
Sbjct: 1501 EAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKA 1560

Query: 1724 KLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQ---LNETEAMFNSQESELTELSKN 1780
            +   KAE +        M L KA +      A++++   L E E    ++E++  E +K 
Sbjct: 1561 EEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKI 1620

Query: 1781 IAELQKR 1787
             AE  K+
Sbjct: 1621 KAEELKK 1627



 Score = 39.7 bits (92), Expect = 0.019
 Identities = 70/369 (18%), Positives = 143/369 (38%), Gaps = 20/369 (5%)

Query: 1423 ESNSAKDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEV 1482
            E+ +A++ AE ++K   E + KA+  +K      +++ AK  AE  KK  +E +  A   
Sbjct: 1358 EAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAK 1417

Query: 1483 FRGITTAK--QESHAANILAKEAYD-HAASVRNKTAAYVASTSNITKQLDEFLNAPGATL 1539
             +     K  +E   A+   K+A +   A    K A          K+ +E   A  A  
Sbjct: 1418 KKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAK- 1476

Query: 1540 ADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKA 1599
               +           +   E+ +++A+E     ++    D    + A +  KA++  +KA
Sbjct: 1477 ---KKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEA--KKAEEAKKADEA-KKA 1530

Query: 1600 NLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQ 1659
               K   ++ K   +K  D L +A   +   E             +    +   +   A+
Sbjct: 1531 EEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAE 1590

Query: 1660 QKSNETNVKVKLLQERLKSIQAGFLQNGR-SAMDVENEEKNLEKEVALAQKQASGLRSRY 1718
            +   E  +K+   ++++K+ +A   +  +  A +++  E+  +K   L +K+A   +   
Sbjct: 1591 EARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAE 1650

Query: 1719 QETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELS 1778
            +      L KAE     +A       K ++       + K+  E E        +  E +
Sbjct: 1651 E------LKKAEEENKIKAAE---EAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEA 1701

Query: 1779 KNIAELQKR 1787
            K   EL+K+
Sbjct: 1702 KKAEELKKK 1710



 Score = 30.9 bits (69), Expect = 7.9
 Identities = 63/287 (21%), Positives = 103/287 (35%), Gaps = 49/287 (17%)

Query: 1286 EEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQINDAERQC 1345
            EE KK  +    ++E   L +E     A E  +       K+K +  +  ++      Q 
Sbjct: 1584 EEAKKAEE--ARIEEVMKLYEEEKKMKAEEAKKAEEA---KIKAEELKKAEEEKKKVEQL 1638

Query: 1346 KRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDKRGDPCDNLCGG 1405
            K+ E    K   +  + +EEN+       +K    +K   +  +   D++          
Sbjct: 1639 KKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEK---------- 1688

Query: 1406 AGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYA 1465
                              + + K  AE +KK    K+ +AEE +K   +       K  A
Sbjct: 1689 ----------------KAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKA 1732

Query: 1466 ETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNIT 1525
            E +KK   E + KAEE  +     K+ +H    L KE    A  +R +  A       I 
Sbjct: 1733 EEAKKEAEEDKKKAEEAKKDEEEKKKIAH----LKKEEEKKAEEIRKEKEAV------IE 1782

Query: 1526 KQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVI 1572
            ++LDE          + R +      K+I  N   I E   E N VI
Sbjct: 1783 EELDE--------EDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLVI 1821


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
            [Function unknown].
          Length = 265

 Score = 49.7 bits (119), Expect = 5e-06
 Identities = 51/241 (21%), Positives = 93/241 (38%), Gaps = 35/241 (14%)

Query: 1552 KNIQSNPEQIRERANEINNVIKSLT-DIDTI---LTETAGDLAKA-NDLKRKANLTKAGA 1606
            KNIQ+  E +  +  EI + I  L  +ID     + +   ++A+   ++  +  L K  A
Sbjct: 48   KNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEIAELKENIVERQELLKKRA 107

Query: 1607 DS--TKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDL--SQIT--NDLDDAQQ 1660
             +     T    +DV+  +++  D+      TA   I  A K +   Q      L++ Q 
Sbjct: 108  RAMQVNGTATSYIDVILNSKSFSDLISRV--TAISVIVDADKKILEQQKEDKKSLEEKQA 165

Query: 1661 KSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEK-EVALAQKQASGLRSRYQ 1719
               +    +  LQ  L++              + +++        ALA K+AS L  +  
Sbjct: 166  ALEDKLETLVALQNELETQLN----------SLNSQKAEKNALIAALAAKEASALGEKAA 215

Query: 1720 ETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSK 1779
              + K L  AE++  + A+ +     A+Q          Q   T A    Q S +TE + 
Sbjct: 216  LEEQKAL--AEAAAAEAAKQEAAAKAAAQ-----EQAALQAAATAA----QPSAVTESAS 264

Query: 1780 N 1780
             
Sbjct: 265  A 265



 Score = 45.9 bits (109), Expect = 1e-04
 Identities = 35/196 (17%), Positives = 72/196 (36%), Gaps = 39/196 (19%)

Query: 1622 EARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQA 1681
             A    D     IQ     +S  +K+   I N+++    +  E   K+  LQ+ +   +A
Sbjct: 25   FAALLSDK----IQNQDSKLSELQKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKA 80

Query: 1682 GFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQ------------ETDN------ 1723
               +  +   ++   ++N+ +   L +K+A   R+                + +      
Sbjct: 81   EIKKLQK---EIAELKENIVERQELLKKRA---RAMQVNGTATSYIDVILNSKSFSDLIS 134

Query: 1724 -----KLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQL----NETEAMFNSQESEL 1774
                  ++  A+   L++ +      +  Q ++    KL+ L    NE E   NS  S+ 
Sbjct: 135  RVTAISVIVDADKKILEQQKEDKKSLEEKQAALED--KLETLVALQNELETQLNSLNSQK 192

Query: 1775 TELSKNIAELQKRIQS 1790
             E +  IA L  +  S
Sbjct: 193  AEKNALIAALAAKEAS 208



 Score = 32.0 bits (73), Expect = 2.5
 Identities = 40/243 (16%), Positives = 78/243 (32%), Gaps = 75/243 (30%)

Query: 1332 TETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKA---LQRLNEKITSLEKNLPDLN 1388
            +E +K+  + + + +  +N + +   +  + Q+E +++   +++L ++I  L++N+ +  
Sbjct: 41   SELQKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEIAELKENIVERQ 100

Query: 1389 ELICDKRGDPCDNLCGGAGCGKCGGMWCSNGT------LSESNSAKDYA----------- 1431
            EL+  KR                  M  +         +  S S  D             
Sbjct: 101  ELL-KKRAR---------------AMQVNGTATSYIDVILNSKSFSDLISRVTAISVIVD 144

Query: 1432 ---------ETSKKLINEK----ESKAEELRKHMIILSES----------------NSAK 1462
                     +  KK + EK    E K E L      L                     A 
Sbjct: 145  ADKKILEQQKEDKKSLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAA 204

Query: 1463 DYAETSKK---LINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVA 1519
              A    +   L  +K             A ++  AA   A+E    AA     TAA  +
Sbjct: 205  KEASALGEKAALEEQKALAEAAAAE----AAKQEAAAKAAAQEQ---AALQAAATAAQPS 257

Query: 1520 STS 1522
            + +
Sbjct: 258  AVT 260



 Score = 30.5 bits (69), Expect = 7.6
 Identities = 11/50 (22%), Positives = 22/50 (44%)

Query: 1755 AKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCINFIVDKSNNYKN 1804
            +   Q+ E ++  +  + E+ +    I +LQK I      IV++    K 
Sbjct: 56   SLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEIAELKENIVERQELLKK 105


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
            P-loop motif that is characteristic of the AAA
            superfamily. Many of the proteins in this family are
            conjugative transfer proteins. This family includes the
            PrrC protein that is thought to be the active component
            of the anticodon nuclease.
          Length = 713

 Score = 51.2 bits (123), Expect = 5e-06
 Identities = 55/375 (14%), Positives = 123/375 (32%), Gaps = 60/375 (16%)

Query: 1452 MIILSESNS-AKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAA-NILAKEAYDHAAS 1509
            +  L E N   +   E  KK + + E K E++   I   ++E   A N    +A+   A 
Sbjct: 83   VFTLGEENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAK 142

Query: 1510 VRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEIN 1569
              +         SN+++ L             ++ +  + +N +   + E+++ +   + 
Sbjct: 143  KYD---------SNLSEALKGLNYKKNFKEKLLKELKSVILNASSLLSLEELKAKIKTLF 193

Query: 1570 NVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDM 1629
            +  K    + T+      ++ +A  L++    +     S           L       D 
Sbjct: 194  SSNKPELALLTLSVIDFDEIEQAEILEKSIIGSSDVPISE----------LINNLGNSDW 243

Query: 1630 AEVAIQTAKDD---------ISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQ 1680
             +  ++  ++          I+  RK   ++    D+  Q+  E   +++ L ++ +S  
Sbjct: 244  VKEGLEYHEEGDTCPFCQQTITEERKA--ELEAHFDEEYQELIE---QLEELIDKYESHI 298

Query: 1681 AGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQ 1740
               L+   S +D E E    + +V   +     L    ++   KL  K            
Sbjct: 299  EKALEELESILDTEKENSEFKLDVEELKALLEALEEILEKNLQKLEEK------------ 346

Query: 1741 MLLNKASQLS--VNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKR---------IQ 1789
              L   S      + T  ++ +N+     N    E  E   N+ + + +         + 
Sbjct: 347  --LKDPSTSIELESITDLIESINDIIDAINELIREHNEKIDNLKKEKNKAKKKLWLHLVA 404

Query: 1790 SCINFIVDKSNNYKN 1804
                 I       K 
Sbjct: 405  ELKEDIDAYQKEKKG 419



 Score = 45.8 bits (109), Expect = 2e-04
 Identities = 20/181 (11%), Positives = 69/181 (38%), Gaps = 18/181 (9%)

Query: 1505 DHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRER 1564
            +      +     +    +I    ++  +     + +++ +   A+ + ++ N +++ E+
Sbjct: 289  ELIDKYESHIEKALEELESIL-DTEKENSEFKLDVEELKALLE-ALEEILEKNLQKLEEK 346

Query: 1565 ANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEAR 1624
              + +  I+ L  I  ++      +   N+L R+ N            +  +     +A+
Sbjct: 347  LKDPSTSIE-LESITDLIESINDIIDAINELIREHN----------EKIDNLKKEKNKAK 395

Query: 1625 TAQDMAEVA-----IQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSI 1679
                +  VA     I   + +     K ++ +  ++   + +      ++K L+++L +I
Sbjct: 396  KKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALEKEIKELEKQLTNI 455

Query: 1680 Q 1680
            +
Sbjct: 456  E 456



 Score = 33.1 bits (76), Expect = 1.7
 Identities = 24/125 (19%), Positives = 53/125 (42%), Gaps = 14/125 (11%)

Query: 1266 TLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSL 1325
             L + T  I      ++ + E I++ N+  + LK+      +  +   L L  +  +   
Sbjct: 355  ELESITDLIESINDIIDAINELIREHNEKIDNLKKEKNKA-KKKLW--LHLVAELKEDID 411

Query: 1326 KLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLP 1385
              +++    EK IN  E++ K           Q     +  EK ++ L +++T++E    
Sbjct: 412  AYQKEKKGLEKAINSLEKEIK-----------QLEAEIKALEKEIKELEKQLTNIEPTAD 460

Query: 1386 DLNEL 1390
            ++N+L
Sbjct: 461  EINKL 465


>gnl|CDD|234541 TIGR04320, Surf_Exclu_PgrA, SEC10/PgrA surface exclusion domain.
            This model describes a conserved domain found in surface
            proteins of a number of Firmutes. Many members have LPXTG
            C-terminal anchoring motifs and a substantial number have
            the KxYKxGKxW putative sorting signal at the N-terminus.
            The tetracycline resistance plasmid pCF10 in Enterococcus
            faecalis promotes conjugal plasmid transfer in response
            to sex pheromones, but PgrA/Sec10 encoded by that
            plasmid, a member of this family, specifically inhibits
            the ability of cells to receive homologous plasmids. The
            phenomenon is called surface exclusion.
          Length = 356

 Score = 50.1 bits (120), Expect = 6e-06
 Identities = 35/181 (19%), Positives = 54/181 (29%), Gaps = 39/181 (21%)

Query: 1494 HAANILAKEAYDHAASVRNKTAAYVA-STSNITKQLDEFLNAPGATLADIRNISGLAVNK 1552
            HA N+L     +          AY+  S SN       F+N   + +AD           
Sbjct: 200  HAQNLLGDGKGN--------PGAYLGVSISNDGGVNIHFVNFNDSYIADGNKFDK----- 246

Query: 1553 NIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDL--AKANDLKRKANLTKAGADSTK 1610
                           I N   SL  +   L     DL  A+      +A LT A      
Sbjct: 247  -------------TPIPNTPNSLKALQAALATAQADLAAAQTALNTAQAALTTAQTAYAA 293

Query: 1611 --NTVQKIVDVLTEARTAQD-MAEVAIQTAKDDISAAR-------KDLSQITNDLDDAQQ 1660
                +      L  A+      A+  + TA+  ++ A        + L+ +  DL   Q 
Sbjct: 294  AQAALATAQKELANAQAQALQTAQNNLATAQAALANAEARLAKAKEALANLNADLAKKQA 353

Query: 1661 K 1661
             
Sbjct: 354  A 354


>gnl|CDD|223910 COG0840, Tar, Methyl-accepting chemotaxis protein [Cell motility and
            secretion / Signal transduction mechanisms].
          Length = 408

 Score = 49.6 bits (118), Expect = 1e-05
 Identities = 46/323 (14%), Positives = 121/323 (37%), Gaps = 45/323 (13%)

Query: 1417 SNGTLSE--SNSAKDYAETSKKLINEKESKAEELRKHMIILSE--SNSAKDYAETSKKLI 1472
            + G L++    S+ D      K  NE      ++   +   +E  S ++++ A ++ +L 
Sbjct: 99   AAGDLTKRIDESSNDEFGQLAKSFNEMILNLRQIIDAVQDNAEALSGASEEIAASATELS 158

Query: 1473 NEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFL 1532
               + +AE +    +  ++ S     +A  A         + AA  +  S + ++  E +
Sbjct: 159  ARADQQAESLEEVASAIEELSETVKEVAFNA--------KEAAALASEASQVAEEGGEEV 210

Query: 1533 NAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTET------- 1585
                  +  ++ I      + +    +++ E + EI  +      I++I  +T       
Sbjct: 211  RQ---AVEQMQEI-----AEELAEVVKKLSESSQEIEEITSV---INSIAEQTNLLALNA 259

Query: 1586 ------AGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKD 1639
                  AG+  +       A+  +  A+ + ++ ++I  ++ E +     A   ++ +  
Sbjct: 260  AIEAARAGEAGRG--FAVVADEVRKLAERSADSAKEIGLLIEEIQNEAADAVEHMEESAS 317

Query: 1640 DISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVE---NE 1696
            ++S   K + +  + L +      E +  +  +    +   A   +   S  +++    E
Sbjct: 318  EVSEGVKLVEETGSSLGEIAAAIEEVSQLISEIAAATEEQTAVLEEINASIEELDDVTQE 377

Query: 1697 EKNLEKEVALA----QKQASGLR 1715
                 +E+A A    ++ A  L 
Sbjct: 378  NAAAVEELAAASEELKELAEKLL 400


>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
            division and chromosome partitioning].
          Length = 570

 Score = 50.1 bits (120), Expect = 1e-05
 Identities = 102/551 (18%), Positives = 183/551 (33%), Gaps = 115/551 (20%)

Query: 1273 RINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSL-----KL 1327
            +  L  L +ND   ++KKL+ TG+              E   E  RQ  D  +      +
Sbjct: 37   KNELLNLPVNDEISKVKKLHLTGQ-------------TETKFEEWRQKWDDIVTNSFADV 83

Query: 1328 KEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDL 1387
            +E   E E  + D  R   + ++ +     Q T  +E+ E+ L+ LNE + S EKN    
Sbjct: 84   EEHLFEAE-ALADKFR-FNKAKHEIDDIEQQLTLIEEDIEQILEDLNELVESEEKN---- 137

Query: 1388 NELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKL---INEKESK 1444
                  +  D    L                         +D      +      E E K
Sbjct: 138  -----SEEIDHVLEL--------------------YEELRRDVLANRHQYGEAAPELEKK 172

Query: 1445 AEELRKHMIILSESNSAKDYAETSKKLINEKE------AKAEEVFRGITTAKQESHAANI 1498
             E + + +    E  S+ DY E  + L   +E      +  E +   +   + E      
Sbjct: 173  LENIEEELSQFVELTSSGDYIEAREVLEEAEEHMIALRSIMERIPSLLAELQTELPGQLQ 232

Query: 1499 LAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNP 1558
              K  Y        K   Y     NI  +L+            +  +             
Sbjct: 233  DLKAGYR-----DMKEEGYHLEHVNIDSRLERLKEQLVENSELLTQL-----------EL 276

Query: 1559 EQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVD 1618
            ++  E    I   I+SL D+  +  E     AK    +    L           ++K  +
Sbjct: 277  DEAEEELGLIQEKIESLYDL--LEREVE---AKNVVEENLPILPDY--------LEKAKE 323

Query: 1619 VLTEARTAQDMAEV--AIQTAKDDISAAR---KDLSQITNDLDDAQQKSNETNVKVKLLQ 1673
                    +++  V  + + A+ ++ + R   K+L ++ + LD+  +      V    LQ
Sbjct: 324  NNEH--LKEEIERVKESYRLAETELGSVRKFEKELKELESVLDEILENIEAQEVAYSELQ 381

Query: 1674 ERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSG 1733
            + L+ I+            V+    +L K+   A++    L+S+  E   + + K+   G
Sbjct: 382  DNLEEIEKALTDIEDEQEKVQEHLTSLRKDELEARENLERLKSKLHEIK-RYMEKSNLPG 440

Query: 1734 LKRARGQMLLNKASQLSVNTTAKLKQLNE---------------TEAMFNSQESELTELS 1778
            L       L   A          +K+L+E               TE M N+ E E  E+ 
Sbjct: 441  LPE-TFLSLFFTAGH---EIQDLMKELSEVPINMEAVSALVDIATEDM-NTLEDETEEVV 495

Query: 1779 KNIAELQKRIQ 1789
            +N    ++ IQ
Sbjct: 496  ENAVLAEQLIQ 506



 Score = 34.3 bits (79), Expect = 0.69
 Identities = 90/491 (18%), Positives = 173/491 (35%), Gaps = 125/491 (25%)

Query: 1217 ELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQRINL 1276
            E +E+ +ED+  LVE+  ++S ++  +L   +++++ ++ N    G         +    
Sbjct: 118  EDIEQILEDLNELVESEEKNSEEIDHVLELYEELRRDVLANRHQYG---------EAAPE 168

Query: 1277 ARLALNDLTEEIKK---LNKTGEMLKENATLLQENNVEGALE-LTRQAHDTSLKLKEQST 1332
                L ++ EE+ +   L  +G+ ++    L +      AL  +  +    SL L E  T
Sbjct: 169  LEKKLENIEEELSQFVELTSSGDYIEAREVLEEAEEHMIALRSIMERI--PSL-LAELQT 225

Query: 1333 ETEKQINDAERQCKRTENLVTKT--FPQFTQGQEENEKALQRLNEKITSLEKNLPDLNEL 1390
            E   Q+ D +   +    +  +              +  L+RL E+   L +N   L +L
Sbjct: 226  ELPGQLQDLKAGYRD---MKEEGYHLEHV-----NIDSRLERLKEQ---LVENSELLTQL 274

Query: 1391 ICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRK 1450
              D+                                    AE    LI EK     +L +
Sbjct: 275  ELDE------------------------------------AEEELGLIQEKIESLYDLLE 298

Query: 1451 HMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAY----DH 1506
              +       AK+  E +  ++ +   KA+E        K+E        KE+Y      
Sbjct: 299  REV------EAKNVVEENLPILPDYLEKAKEN---NEHLKEEIE----RVKESYRLAETE 345

Query: 1507 AASVR---NKTAAYVASTSNITKQLDEFLNAPGA---TLADIRNISGLAVNKNIQSNPEQ 1560
              SVR    +     +    I + ++    A       L +I     L    +I+   E+
Sbjct: 346  LGSVRKFEKELKELESVLDEILENIEAQEVAYSELQDNLEEIEK--AL---TDIEDEQEK 400

Query: 1561 IRERANEINNVIKSLTDIDTILTETAGDLAKA-NDLKR---KANLTKAGADST-----KN 1611
            ++E        + SL   +    E    L    +++KR   K+NL   G   T       
Sbjct: 401  VQEH-------LTSLRKDELEARENLERLKSKLHEIKRYMEKSNLP--GLPETFLSLFFT 451

Query: 1612 TVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKL 1671
               +I D++ E      ++EV I     ++ A    +   T D++  + ++ E      L
Sbjct: 452  AGHEIQDLMKE------LSEVPI-----NMEAVSALVDIATEDMNTLEDETEEVVENAVL 500

Query: 1672 LQERLKSIQAG 1682
             +   + IQ G
Sbjct: 501  AE---QLIQYG 508


>gnl|CDD|224428 COG1511, COG1511, Predicted membrane protein [Function unknown].
          Length = 780

 Score = 49.8 bits (119), Expect = 1e-05
 Identities = 55/393 (13%), Positives = 113/393 (28%), Gaps = 37/393 (9%)

Query: 1422 SESNSAKDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEE 1481
                 A D AE  K   +E  +  ++L          N+  + + T    +N   +    
Sbjct: 178  GGVKGAADGAEKLKDGTDEASNGNKKLS------DLLNTLNNSSATFSDGLNALTSGLTT 231

Query: 1482 VFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLAD 1541
            +  G+               E    A  + N+     +S  +      + L A    L  
Sbjct: 232  LTDGLNQLDSGLGTLAAGIGELKQGAEQL-NEGIGEFSSGLSELNSGVQDLAAGVPQLNQ 290

Query: 1542 IRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANL 1601
               IS LA   ++  +      + + +   +  +       T ++ + A+   L    ++
Sbjct: 291  G--ISALAAGLSLPDSLGD---QFSSLQEALTQIAQGLKQKTSSSLEAAQ-GSLSSLQSM 344

Query: 1602 TKAGADSTKNTVQKIVDVLTEARTAQDMAEVA---------IQTAKDDISAARKDLSQIT 1652
                           VD L      Q + E           + T  D +S     L Q +
Sbjct: 345  LALSKSLDLTAEGATVDALGAPDGVQWLDESQKTLATLSELLSTGIDGVSEGLDALEQAS 404

Query: 1653 NDLDDAQQKSNETNVKV----KLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQ 1708
              L  +  K      ++      L      +          +  ++     L   +A   
Sbjct: 405  AQLAKSLAKLKTAVAQIAASIAQLLPGASEVL-----KTLKSKGLDKLLNQLNGALAKGS 459

Query: 1709 KQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFN 1768
                   S   ++   + S    +GL       L + ++ LS +    L QL +   +  
Sbjct: 460  NALVQGLSDANDSFRSITSAQLKAGLNT-----LADGSNDLS-SLGPGLGQLADGSKLLA 513

Query: 1769 SQESELTELSKNIAELQKRIQSCINFIVDKSNN 1801
               SEL   S  + +    +   +  + D   +
Sbjct: 514  DGLSELNTGSAQLRDGLGELSDGLTELADSLQD 546



 Score = 42.1 bits (99), Expect = 0.003
 Identities = 56/389 (14%), Positives = 118/389 (30%), Gaps = 56/389 (14%)

Query: 1424 SNSAKDYAETSKKLINEKESKAEELRK-----HMIILSESNSAKDYAETSKKLINEK--- 1475
            +   KD  + +     +       L          + + ++      +   +L +     
Sbjct: 187  AEKLKDGTDEASNGNKKLSDLLNTLNNSSATFSDGLNALTSGLTTLTDGLNQLDSGLGTL 246

Query: 1476 EAKAEEVFRGITTAKQESHA-------ANILAKEAYDHAASVRNKTAAYVASTSNITKQL 1528
             A   E+ +G     +            N   ++       +    +A  A  S      
Sbjct: 247  AAGIGELKQGAEQLNEGIGEFSSGLSELNSGVQDLAAGVPQLNQGISALAAGLSLPDSLG 306

Query: 1529 DEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDID------TIL 1582
            D+F +        +  I+     K   S         + + +++     +D      T+ 
Sbjct: 307  DQFSS----LQEALTQIAQGLKQKTSSSLEAAQGSL-SSLQSMLALSKSLDLTAEGATVD 361

Query: 1583 TETAGDLAKANDLKRKANLTKAGADST-KNTVQKIVDVLTEARTAQDMAEVAIQTAKDDI 1641
               A D  +  D  +K   T +   ST  + V + +D L +A      +   ++TA   I
Sbjct: 362  ALGAPDGVQWLDESQKTLATLSELLSTGIDGVSEGLDALEQASAQLAKSLAKLKTAVAQI 421

Query: 1642 SAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLE 1701
            +A+   L    +++    +      +  +L     K   A  L  G S            
Sbjct: 422  AASIAQLLPGASEVLKTLKSKGLDKLLNQLNGALAKGSNA--LVQGLSD----------- 468

Query: 1702 KEVALAQKQASGLRSRYQETDNKLLSKAE--SSGLKRARGQMLLNKASQLSVNTTAKLK- 1758
               A    + S   ++ +   N L   +   SS          L   S+L  +  ++L  
Sbjct: 469  ---ANDSFR-SITSAQLKAGLNTLADGSNDLSSLGPGLGQ---LADGSKLLADGLSELNT 521

Query: 1759 ---QLNETEAMFNSQESELTELSKNIAEL 1784
               QL +     +     LTEL+ ++ + 
Sbjct: 522  GSAQLRDGLGELS---DGLTELADSLQDA 547



 Score = 35.6 bits (82), Expect = 0.28
 Identities = 30/169 (17%), Positives = 59/169 (34%), Gaps = 27/169 (15%)

Query: 1626 AQDMAEVAIQTAKDDISAARKDLSQITNDLD--DAQQKSNETNVKVKLLQERLKSIQAGF 1683
            A  + E A     ++IS       ++T       A     +    VK   +  + ++ G 
Sbjct: 141  APKITEKAADKLLNEIS------KELTETYTKVVAFPTIYDLGGGVKGAADGAEKLKDGT 194

Query: 1684 LQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAE--SSGLKR--ARG 1739
                       +E  N  K+++      +   + + +  N L S     + GL +  +  
Sbjct: 195  -----------DEASNGNKKLSDLLNTLNNSSATFSDGLNALTSGLTTLTDGLNQLDSGL 243

Query: 1740 QMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRI 1788
              L     +L        +QLNE    F+S  SEL    +++A    ++
Sbjct: 244  GTLAAGIGELK----QGAEQLNEGIGEFSSGLSELNSGVQDLAAGVPQL 288


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
            found at the N terminus of SMC proteins. The SMC
            (structural maintenance of chromosomes) superfamily
            proteins have ATP-binding domains at the N- and
            C-termini, and two extended coiled-coil domains separated
            by a hinge in the middle. The eukaryotic SMC proteins
            form two kind of heterodimers: the SMC1/SMC3 and the
            SMC2/SMC4 types. These heterodimers constitute an
            essential part of higher order complexes, which are
            involved in chromatin and DNA dynamics. This family also
            includes the RecF and RecN proteins that are involved in
            DNA metabolism and recombination.
          Length = 1162

 Score = 49.2 bits (117), Expect = 2e-05
 Identities = 92/608 (15%), Positives = 169/608 (27%), Gaps = 122/608 (20%)

Query: 1286 EEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQINDAERQC 1345
            E+ KK  +  ++++E   L +       L+L        LK + +      Q+ +     
Sbjct: 168  EKRKKKERLKKLIEETENLAELIIDLEELKLQEL----KLKEQAKKALEYYQLKEKLE-- 221

Query: 1346 KRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDKRGDPCDNLCGG 1405
               ENL+   + +  + + +  + L R  ++     K   +  E I  +           
Sbjct: 222  LEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKE---- 277

Query: 1406 AGCGKCGGMWCSNGTLSESNSAKDYAETSKKL-INEKESKAEELRKHMIILSESNSAKDY 1464
                             +    ++    +K+    + E    E RK         S K+ 
Sbjct: 278  -------------EEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEL 324

Query: 1465 AETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNI 1524
             +  K+L  EKE   E     +    +E        +E  +    ++ K          +
Sbjct: 325  KKLEKELKKEKEEIEE-----LEKELKELEIKREAEEEEEEQLEKLQEK--LEQLEEELL 377

Query: 1525 TKQLDEFLNAPGATLAD------IRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDI 1578
             K+  E      +          ++N         ++   EQ  +   E       + + 
Sbjct: 378  AKKKLES-ERLSSAAKLKEEELELKNEEEKEAKLLLE-LSEQEEDLLKEEKKEELKIVEE 435

Query: 1579 DTILTETAG------------DLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTA 1626
                 ET                 K    K +   ++     TK         L   R  
Sbjct: 436  LEESLETKQGKLTEEKEELEKQALKLLKDKLELKKSEDLLKETKLVKLLEQLELLLLRQK 495

Query: 1627 QDMA------------------------------------EVAIQTAKDDISAARKDLSQ 1650
             + A                                     VA+   K  IS A      
Sbjct: 496  LEEASQKESKAREGLAVLLALIKDGVGLRIISAAGRGGDLGVAVANYKVAISVAVIVEVS 555

Query: 1651 ITNDLDDAQQKS-----------NETNVKVKLLQERLKSIQAG-------FLQNGRSAMD 1692
               D  D +QK                +   LL+  LKSI            Q  ++ ++
Sbjct: 556  AVADEVDERQKLVRALTELPLPARTFVLLDPLLKLPLKSIAVLEIDPALNLAQLNKATLE 615

Query: 1693 VENEEKN------------LEKEVALAQKQASGLR---SRYQETDNKLLSKAESSGLKRA 1737
             + ++K             L K +  A+ + SGLR   S  +    K   KA  S L + 
Sbjct: 616  ADEDDKRAKVVEGILKDTELTKLLESAKAKESGLRKGVSLEEGLAEKSELKASLSELTKE 675

Query: 1738 RGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCINFIVD 1797
                   +    S    AK + L   E +   ++    EL K   E ++ +   +    D
Sbjct: 676  LLAEQELQEKAESEL--AKNEILRRQEEIKKKEQRIKEELKKLKLEKEELLADKVQEAQD 733

Query: 1798 KSNNYKNC 1805
            K N     
Sbjct: 734  KINEELKL 741


>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
            Provisional.
          Length = 569

 Score = 48.7 bits (117), Expect = 3e-05
 Identities = 94/568 (16%), Positives = 192/568 (33%), Gaps = 153/568 (26%)

Query: 1271 TQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQ 1330
             ++  L  L +ND  E++KKLN TG+  ++              E  RQ  D      E 
Sbjct: 36   ERKQELENLPVNDELEKVKKLNLTGQSEEK-------------FEEWRQKWD------EI 76

Query: 1331 STET----EKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKN--- 1383
             T +    E+Q+ +AE             F +F + +      +  +   +  +E++   
Sbjct: 77   VTNSLPDIEEQLFEAEEL--------NDKF-RFRKAK----HEINEIESLLDLIEEDIEQ 123

Query: 1384 -LPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKE 1442
             L +L EL+                               ES       E +++ + + +
Sbjct: 124  ILEELQELL-------------------------------ESE------EKNREEVEQLK 146

Query: 1443 SKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKE 1502
                ELRK   +L+   S   +     +L    E + E +    +   + + + + +  E
Sbjct: 147  DLYRELRKS--LLANRFS---FGPALDEL----EKQLENLEEEFSQFVELTESGDYV--E 195

Query: 1503 AYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLA---------VNKN 1553
            A +    +  + AA       I + L E        L +++   G            + +
Sbjct: 196  AREILDQLEEELAALEQIMEEIPELLKELQTELPDQLQELKA--GYRELVEEGYHLDHLD 253

Query: 1554 IQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKAND--------LKRKANLTKAG 1605
            I+   + ++E   +I+  +  L ++D  L E      +  +        L+R+    KA 
Sbjct: 254  IEKEIQDLKE---QIDENLALLEELD--LDEAEEKNEEIQERIDQLYDILEREV---KA- 304

Query: 1606 ADSTKNTVQKIVDVLTE--ARTAQDMAEVAIQT---------AKDDISAAR---KDLSQI 1651
                +  V+K  D L +      +   E+  +           + ++ + R   K L  +
Sbjct: 305  ----RKYVEKNSDTLPDFLEHAKEQNKELKEEIDRVKQSYTLNESELESVRQLEKQLESL 360

Query: 1652 TNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQA 1711
                D+  ++  E  +    LQE L+ I     +  +    +    + L K+   A+++ 
Sbjct: 361  EKQYDEITERIAEQEIAYSELQEELEEILKQLEEIEKEQEKLSEMLQGLRKDELEAREKL 420

Query: 1712 SGLRSRYQETDNKLLSKAESSGL-----------KRARGQM--LLNKASQLSVNTTAKLK 1758
               R++  E    L    E S L                ++  L  +  +  +N  A  +
Sbjct: 421  ERYRNKLHEIKRYL----EKSNLPGLPEDYLEMFFEVSDEIEALAEELEEKPINMEAVNR 476

Query: 1759 QLNETEAMFNSQESELTEL--SKNIAEL 1784
             L E      + E E  EL  +  + E 
Sbjct: 477  LLEEATEDVETLEEETEELVENATLTEQ 504



 Score = 48.7 bits (117), Expect = 3e-05
 Identities = 46/255 (18%), Positives = 100/255 (39%), Gaps = 51/255 (20%)

Query: 1559 EQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGA--DSTKNTVQKI 1616
            E+ R++ +EI     SL DI+  L E      K    K K  + +  +  D  +  +++I
Sbjct: 67   EEWRQKWDEIVTN--SLPDIEEQLFEAEELNDKFRFRKAKHEINEIESLLDLIEEDIEQI 124

Query: 1617 VDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERL 1676
            ++ L E   +++     ++  KD     RK L                    +  L+++L
Sbjct: 125  LEELQELLESEEKNREEVEQLKDLYRELRKSLL----------ANRFSFGPALDELEKQL 174

Query: 1677 KSIQAGFLQNGRSAMDVENEEKN----------LEKEVALAQKQASGLRSRYQETDNKL- 1725
            ++++  F     S      E  +          LE+E+A  ++    +    +E   +L 
Sbjct: 175  ENLEEEF-----SQFVELTESGDYVEAREILDQLEEELAALEQIMEEIPELLKELQTELP 229

Query: 1726 --LSKAESSGLKRARGQML--------LNKASQLSVNTTAKLKQLNETEAMFNSQESELT 1775
              L + + +G +    +++        L+   ++        +Q++E  A+   +E +L 
Sbjct: 230  DQLQELK-AGYR----ELVEEGYHLDHLDIEKEI----QDLKEQIDENLALL--EELDLD 278

Query: 1776 ELSKNIAELQKRIQS 1790
            E  +   E+Q+RI  
Sbjct: 279  EAEEKNEEIQERIDQ 293



 Score = 44.4 bits (106), Expect = 5e-04
 Identities = 92/538 (17%), Positives = 191/538 (35%), Gaps = 169/538 (31%)

Query: 1217 ELLEKKIEDVKA----LVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQ 1272
            +L+E+ IE +      L+E+  ++  ++  +     +++K ++ N        + G    
Sbjct: 115  DLIEEDIEQILEELQELLESEEKNREEVEQLKDLYRELRKSLLANRF------SFGPALD 168

Query: 1273 RINLARLA-LNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQS 1331
             +   +L  L +   +  +L ++G+ +                    +A +   +L+E+ 
Sbjct: 169  ELE-KQLENLEEEFSQFVELTESGDYV--------------------EAREILDQLEEEL 207

Query: 1332 TETEKQINDAE---RQCKRTENLVTKTFP-QFTQ---G----QEEN--------EKALQR 1372
               E+ + +     ++ +          P Q  +   G     EE         EK +Q 
Sbjct: 208  AALEQIMEEIPELLKELQ-------TELPDQLQELKAGYRELVEEGYHLDHLDIEKEIQD 260

Query: 1373 LNEKITSLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAE 1432
            L E+I   ++NL  L EL  D+                            E N  ++  E
Sbjct: 261  LKEQI---DENLALLEELDLDE--------------------------AEEKN--EEIQE 289

Query: 1433 TSKKL--INEKESKAE-ELRKHMIILSE--SNSAKDYAETSKKL--------INEKEAKA 1479
               +L  I E+E KA   + K+   L +   ++ +   E  +++        +NE E   
Sbjct: 290  RIDQLYDILEREVKARKYVEKNSDTLPDFLEHAKEQNKELKEEIDRVKQSYTLNESEL-- 347

Query: 1480 EEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATL 1539
             E  R +   KQ       L K+  +    +  +  AY    S + ++L+E         
Sbjct: 348  -ESVRQLE--KQL----ESLEKQYDEITERIAEQEIAY----SELQEELEE--------- 387

Query: 1540 ADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKA-NDLKR- 1597
                          I    E+I +   +++ +++ L   +    E         +++KR 
Sbjct: 388  --------------ILKQLEEIEKEQEKLSEMLQGLRKDELEAREKLERYRNKLHEIKRY 433

Query: 1598 --KANLTKAGAD--STKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITN 1653
              K+NL     D       V   ++ L     A+++ E  I     ++ A  + L + T 
Sbjct: 434  LEKSNLPGLPEDYLEMFFEVSDEIEAL-----AEELEEKPI-----NMEAVNRLLEEATE 483

Query: 1654 DLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQA 1711
            D++  ++++ E       L E+L  IQ              N  ++  +EVA A  +A
Sbjct: 484  DVETLEEETEELVENA-TLTEQL--IQYA------------NRYRSDNEEVAEALNEA 526



 Score = 37.1 bits (87), Expect = 0.11
 Identities = 60/345 (17%), Positives = 122/345 (35%), Gaps = 74/345 (21%)

Query: 1471 LINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHA----ASVRNKTAAYVASTSNITK 1526
            L++  E   E++   +     ES   N   +E  +        +R    A   S      
Sbjct: 113  LLDLIEEDIEQILEELQELL-ESEEKN---REEVEQLKDLYRELRKSLLANRFSFGPALD 168

Query: 1527 QLDEFLNAPGATLADIRNISGL---AVNKNIQSNPEQIRE---RANEINNVIKSLTD-ID 1579
            +L++ L           N+       V      +  + RE   +  E    ++ + + I 
Sbjct: 169  ELEKQLE----------NLEEEFSQFVELTESGDYVEAREILDQLEEELAALEQIMEEIP 218

Query: 1580 TILTETAGDL-AKANDLK---RKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQ 1635
             +L E   +L  +  +LK   R+  L + G       +                 E  IQ
Sbjct: 219  ELLKELQTELPDQLQELKAGYRE--LVEEGYHLDHLDI-----------------EKEIQ 259

Query: 1636 TAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERL-KSIQAGFLQNGRSAMDVE 1694
              K+ I      L ++  DLD+A++K+ E   ++  L + L + ++A           VE
Sbjct: 260  DLKEQIDENLALLEEL--DLDEAEEKNEEIQERIDQLYDILEREVKAR--------KYVE 309

Query: 1695 NEEKNLEKEVALAQKQASGLRSRYQETD----NKLLSKAESSGLKRARG-----QMLLNK 1745
                 L   +  A++Q   L+   +E D    +  L+++E   L+  R      + L  +
Sbjct: 310  KNSDTLPDFLEHAKEQNKELK---EEIDRVKQSYTLNESE---LESVRQLEKQLESLEKQ 363

Query: 1746 ASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQS 1790
              +++     +    +E +        +L E+ K   +L + +Q 
Sbjct: 364  YDEITERIAEQEIAYSELQEELEEILKQLEEIEKEQEKLSEMLQG 408



 Score = 35.2 bits (82), Expect = 0.38
 Identities = 42/188 (22%), Positives = 75/188 (39%), Gaps = 51/188 (27%)

Query: 1214 KEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQI------MNNE--DPNGVGK 1265
            KE + L+++I++  AL+E       DL       ++++++I      +  E      V K
Sbjct: 256  KEIQDLKEQIDENLALLEEL-----DLDEAEEKNEEIQERIDQLYDILEREVKARKYVEK 310

Query: 1266 TLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSL 1325
                    +  A+    +L EEI       + +K++ TL      E  LE  RQ      
Sbjct: 311  NSDTLPDFLEHAKEQNKELKEEI-------DRVKQSYTL-----NESELESVRQ------ 352

Query: 1326 KLKEQSTETEKQINDAERQCKR-TENLVTKTFPQFTQGQEENEKALQRL----------N 1374
                     EKQ+   E+Q    TE +  +    +++ QEE E+ L++L          +
Sbjct: 353  --------LEKQLESLEKQYDEITERIAEQEIA-YSELQEELEEILKQLEEIEKEQEKLS 403

Query: 1375 EKITSLEK 1382
            E +  L K
Sbjct: 404  EMLQGLRK 411


>gnl|CDD|217843 pfam04012, PspA_IM30, PspA/IM30 family.  This family includes PspA a
            protein that suppresses sigma54-dependent transcription.
            The PspA protein, a negative regulator of the Escherichia
            coli phage shock psp operon, is produced when virulence
            factors are exported through secretins in many
            Gram-negative pathogenic bacteria and its homologue in
            plants, VIPP1, plays a critical role in thylakoid
            biogenesis, essential for photosynthesis. Activation of
            transcription by the enhancer-dependent bacterial
            sigma(54) containing RNA polymerase occurs through ATP
            hydrolysis-driven protein conformational changes enabled
            by activator proteins that belong to the large AAA(+)
            mechanochemical protein family. It has been shown that
            PspA directly and specifically acts upon and binds to the
            AAA(+) domain of the PspF transcription activator.
          Length = 220

 Score = 47.0 bits (112), Expect = 3e-05
 Identities = 38/213 (17%), Positives = 75/213 (35%), Gaps = 51/213 (23%)

Query: 1595 LKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITND 1654
             KR   L +A        + K  D        + M E AI+  + ++  AR+ L+Q+   
Sbjct: 3    FKRLLRLLRANIHEG---LDKAED-------PEKMLEQAIRDMQSELGKARQALAQVI-- 50

Query: 1655 LDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEE---------KNLEK--- 1702
               A+QK  E   K++  +E+ K ++       R+A+   NEE           LEK   
Sbjct: 51   ---ARQKQLER--KLEEQKEQAKKLE----NKARAALTKGNEELAREALAEIATLEKQAE 101

Query: 1703 ----EVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLK 1758
                ++   +     LR +    + K+          +A+   L  +            +
Sbjct: 102  ALETQLTQQRSAVEQLRKQLAALETKIQQ-------LKAKKTALKARLKAAKAQ-----E 149

Query: 1759 QLNETEAMFNSQE--SELTELSKNIAELQKRIQ 1789
             +N +    +++        + + I E + R  
Sbjct: 150  AVNTSLGSASTESATDSFERIEEKIEEREARAD 182



 Score = 43.1 bits (102), Expect = 5e-04
 Identities = 29/122 (23%), Positives = 40/122 (32%), Gaps = 16/122 (13%)

Query: 1277 ARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEK 1336
            AR AL    EE+ +     E L E ATL  E   E       Q      +L++Q    E 
Sbjct: 74   ARAALTKGNEELAR-----EALAEIATL--EKQAEALETQLTQQRSAVEQLRKQLAALET 126

Query: 1337 QINDAERQ---CKRTENLVTKTFPQFTQ-----GQEENEKALQRLNEKITSLEKNLPDLN 1388
            +I   + +    K       K              E    + +R+ EKI   E       
Sbjct: 127  KIQQLKAKKTALKARLK-AAKAQEAVNTSLGSASTESATDSFERIEEKIEEREARADAAA 185

Query: 1389 EL 1390
            EL
Sbjct: 186  EL 187


>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein.  The sequences
            featured in this family are similar to a region of human
            TPR protein and to yeast myosin-like proteins 1 (MLP1)
            and 2 (MLP2). These proteins share a number of features;
            for example, they all have coiled-coil regions and all
            three are associated with nuclear pores. TPR is thought
            to be a component of nuclear pore complex- attached
            intra-nuclear filaments, and is implicated in nuclear
            protein import. Moreover, its N-terminal region is
            involved in the activation of oncogenic kinases, possibly
            by mediating the dimerisation of kinase domains or by
            targeting these kinases to the nuclear pore complex. MLP1
            and MLP2 are involved in the process of telomere length
            regulation, where they are thought to interact with
            proteins such as Tel1p and modulate their activity.
          Length = 132

 Score = 44.6 bits (106), Expect = 5e-05
 Identities = 25/127 (19%), Positives = 47/127 (37%), Gaps = 36/127 (28%)

Query: 1628 DMAEVAIQTAKDDISAARKDLSQITNDLDD-------AQQK------------------- 1661
               +  ++   +    A + + ++  DL++       AQQK                   
Sbjct: 6    SSLQAELKRLLEKKEDAEEKIQKLQEDLEEQAEIANEAQQKYERELVKHAEDIEELQALR 65

Query: 1662 --SNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQ 1719
               NE   ++  L+   +S QA   +   S    E ++K LE E++  +K+   L  +  
Sbjct: 66   KQLNELKKEIAQLKAEAESAQAELSEAEESW---EEQKKMLEDELSELEKRIEELNEQ-- 120

Query: 1720 ETDNKLL 1726
               NKLL
Sbjct: 121  ---NKLL 124


>gnl|CDD|189476 pfam00261, Tropomyosin, Tropomyosin. 
          Length = 237

 Score = 45.7 bits (109), Expect = 8e-05
 Identities = 42/192 (21%), Positives = 81/192 (42%), Gaps = 29/192 (15%)

Query: 1608 STKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNV 1667
              +  + K  + L EA+   + AE   + A+ ++++  + +  +  DL+ ++++      
Sbjct: 5    QLEEELDKAEERLEEAQEKLEEAEKRAEKAEAEVASLNRRIQLLEEDLERSEERLATALE 64

Query: 1668 KVKLLQERLK----SIQA-GFLQNGRSAMDVENEEK--NLEKEVALAQKQASGLRSRYQE 1720
            K   L+E  K    S +    L+N RS  D   +EK   LE ++  A++ A     +Y+E
Sbjct: 65   K---LEEAEKAADESERGRKVLEN-RSLKD---DEKMEQLEAQLKEAKEIAEEADRKYEE 117

Query: 1721 TDNKL------LSKAESSGLKRARGQMLLNKASQLS---VNTTAKLKQLNETEAMFNSQE 1771
               KL      L +AE       R +   +K  +L          LK L  +E   + +E
Sbjct: 118  VARKLVVVEGDLERAE------ERAEAAESKIVELEEELKVVGNNLKSLEVSEEKASQRE 171

Query: 1772 SELTELSKNIAE 1783
                E  +++ E
Sbjct: 172  DSYEEKIRDLTE 183


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
            conserved protein present in eukaryotes.
          Length = 680

 Score = 47.2 bits (112), Expect = 8e-05
 Identities = 108/528 (20%), Positives = 186/528 (35%), Gaps = 97/528 (18%)

Query: 1290 KLNKTGEMLKENA---TLLQEN-NVEGALELTRQAHDTSLKLKEQSTETEK----QINDA 1341
            +L K  E+ KEN     LLQ+    E    L+    + S    +  T T++    Q +  
Sbjct: 179  RLRKQREVQKENEFYMQLLQQALPKEENHTLSVTDKEKSEASSKGLTSTKELVPVQNSGG 238

Query: 1342 ERQCKRTENLVTKTFPQFTQGQEENE------KALQRLNEKITSLEKNLPDLNELICDKR 1395
                 ++ N  T       +G++ +        ++   N      +  L  + E+I +  
Sbjct: 239  NHSLSKSSNSQTPELEYSEKGKDHHHSHNHQHHSIGINNHHSKHADSKLQTI-EVIENHS 297

Query: 1396 G--DPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMI 1453
                P  +   G           S  T S S+SA   +  SK   + K S   +      
Sbjct: 298  NKSRPSSSSTNG-----------SKETTSNSSSAAAGSIGSKSSKSAKHSNRNKSNSSPK 346

Query: 1454 ILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNK 1513
              S +N +   +  S    NE + K        +++       +     A      V N 
Sbjct: 347  SHSSANGSVPSSSVSD---NESKQKRA----SKSSSGARDSKKDASGMSANGT---VENC 396

Query: 1514 TAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIK 1573
                  ST +  ++L+           DI         K +Q+  +Q R+  +E+ N I 
Sbjct: 397  IPENKISTPSAIERLE----------QDI---------KKLQAELQQARQNESELRNQIS 437

Query: 1574 SLTDIDTILTETAGDLAKAND-LKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEV 1632
             LT ++  L    G L K ND L+ K N   +     K ++Q +   L     ++  AE 
Sbjct: 438  LLTSLERSLKSDLGQLKKENDMLQTKLNSMVSAKQKDKQSMQSMEKRLKSEADSRVNAEK 497

Query: 1633 AIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMD 1692
             +   K      RK   + T     AQ  ++         +E  +S++        +  D
Sbjct: 498  QLAEEK-----KRKKEEEETAARAAAQAAASR--------EECAESLKQ-------AKQD 537

Query: 1693 VENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRA------RGQMLLNKA 1746
            +E E K LE ++ L +++   L    QE      S+ E+  L  A      +  ML N  
Sbjct: 538  LEMEIKKLEHDLKLKEEECRMLEKEAQELRKYQESEKETEVLMSALQAMQDKNLMLEN-- 595

Query: 1747 SQLSVNTTAKL----------KQLNETEAMFNSQESELTELSKNIAEL 1784
              LS  T  KL          +QL         +E+E+ +L   IA++
Sbjct: 596  -SLSAETRLKLDLFSALGDVKRQLEIAHGAIYKREAEIVDLKSKIADI 642



 Score = 32.6 bits (74), Expect = 2.4
 Identities = 28/115 (24%), Positives = 44/115 (38%), Gaps = 15/115 (13%)

Query: 1277 ARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEK 1336
            +R+       E KK  K  E     A      + E   E        SLK  +Q  E E 
Sbjct: 491  SRVNAEKQLAEEKKRKKEEEETAARAAAQAAASREECAE--------SLKQAKQDLEMEI 542

Query: 1337 QINDAERQCKRTENLVTKTFPQFTQGQEENEK-------ALQRLNEKITSLEKNL 1384
            +  + + + K  E  + +   Q  +  +E+EK       ALQ + +K   LE +L
Sbjct: 543  KKLEHDLKLKEEECRMLEKEAQELRKYQESEKETEVLMSALQAMQDKNLMLENSL 597


>gnl|CDD|205692 pfam13514, AAA_27, AAA domain.  This domain is found in a number of
            double-strand DNA break proteins. This domain contains a
            P-loop motif.
          Length = 1118

 Score = 46.7 bits (111), Expect = 1e-04
 Identities = 68/412 (16%), Positives = 133/412 (32%), Gaps = 80/412 (19%)

Query: 1405 GAGCGKCGGM-----------WCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMI 1453
             AG G  G +           W   G   E N A    +  K+L  E E +  +L+    
Sbjct: 122  AAGVGSLGSVRDELEKEADELWKPRGRKPEINVAL---KELKEL--EAEIREVQLK---- 172

Query: 1454 ILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNK 1513
                    + + +  K L    EA+ E     +   ++E        K+  +     R +
Sbjct: 173  -------TRTWKDLVKAL---DEAEEE-----LANLRKELRQLEKE-KQRLE-----RLR 211

Query: 1514 TAA-YVASTSNITKQLDEFLNAP------GATLADIRNISGLAVNKNIQSNPEQIRERAN 1566
                 +A    + +QL                  + R     A  +N+    E + ER  
Sbjct: 212  RLLPLLAERKALEQQLAALGEVIDLPPDAVERYEEARAELR-AARRNL----ELLTERLE 266

Query: 1567 EINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIV------DVL 1620
             +   +  ++ +D  L   A  +   +   ++     A  D      +          +L
Sbjct: 267  ALQAELDEIS-LDEELLAQAAAIEALHQ--QRGEYRNAEQDLPDREGEIANAREAAAALL 323

Query: 1621 TEARTAQDMAEVAIQTAKDDISAARKDLS-------QITNDLDDAQQKSNETNVKVKLLQ 1673
             +     D   V    +     AA++ ++        +   L  A+    E   ++K ++
Sbjct: 324  AQIGPDADEEAVE---SLRPSLAAKETVTELEKRKEALDQALKSARDALEERERELKQVR 380

Query: 1674 ERLKSIQA--------GFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKL 1725
             +L ++            L +   A D++     LE+EVA+A+++ +   SR       L
Sbjct: 381  AQLAALPTVTVPEPLRAALADALRAGDIDAALAALEQEVAVAKRELAQALSRLGLWRGDL 440

Query: 1726 LSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTEL 1777
                        + Q  L +A ++      K  +L E E    + E +L  L
Sbjct: 441  EELVALPVPSAEQVQAFLKEAEEIMQAKRQKRDRLLELEEDLETLELQLRHL 492


>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD.  This
            model describes the DndB protein encoded by an operon
            associated with a sulfur-containing modification to DNA.
            The operon is sporadically distributed in bacteria, much
            like some restriction enzyme operons. DndD is described
            as a putative ATPase. The small number of examples known
            so far include species from among the Firmicutes,
            Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
            metabolism, Restriction/modification].
          Length = 650

 Score = 46.2 bits (110), Expect = 2e-04
 Identities = 61/320 (19%), Positives = 113/320 (35%), Gaps = 43/320 (13%)

Query: 1472 INEKEAKAEEVFRGITTAKQE-SHAANIL--AKEAYDHAASVRNKTAAYVASTSNITK-Q 1527
            I   EA+ +E         QE +H  N L  A+ + +            +       + Q
Sbjct: 211  IEALEAELKEQSEKYEDLAQEIAHLRNELEEAQRSLESLEKKFRSEGGDLFEEREQLERQ 270

Query: 1528 LDEFLNAPGATLADIRNISG----LAVNKN-IQSNPEQIR--ERANEINNVIKSLTDIDT 1580
            L E   A  A  A +R ++     L +  N + S   Q++  E++ +     + L + D 
Sbjct: 271  LKEIEAARKANRAQLRELAADPLPLLLIPNLLDSTKAQLQKEEQSQQNQLTQEELEERDK 330

Query: 1581 ILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDD 1640
             L E+   LA   +  ++     A  D    T  +I   L+ +   Q   EV IQ  K +
Sbjct: 331  ELLESLPKLALPAEHVKEIAAELAEIDKPATTDSEIPHRLSGSELTQ--LEVLIQQVKRE 388

Query: 1641 ISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQA-GFLQNGRSAM-DVENEEK 1698
            +  A+  L +   +L++              + +++ +I +   +      + + +NE  
Sbjct: 389  LQDAKSQLLKELRELEEELA----------EVDKKISTIPSEEQIAQLLEELGEAQNELF 438

Query: 1699 NLEKEVALAQKQASGLRSRYQETDNKLLSKAESSG------------------LKRARGQ 1740
              E E+    +Q   L+   +     L  K +                     LK  R +
Sbjct: 439  RSEAEIEELLRQLETLKEAIEALRKTLDEKTKQKINAFELERAITIADKAKKTLKEFREK 498

Query: 1741 MLLNKASQLSVNTTAKLKQL 1760
            +L  K  QL    T   K+L
Sbjct: 499  LLERKLQQLEEEITKSFKKL 518



 Score = 39.7 bits (93), Expect = 0.017
 Identities = 43/185 (23%), Positives = 72/185 (38%), Gaps = 27/185 (14%)

Query: 1215 EFELLEKKIEDVK-ALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQR 1273
            E   LE  I+ VK  L +  ++   +L  +   + ++ K+I        + + L      
Sbjct: 374  ELTQLEVLIQQVKRELQDAKSQLLKELRELEEELAEVDKKISTIPSEEQIAQLLE----- 428

Query: 1274 INLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTE 1333
                   L +   E+ +     E   E      E   E A+E  R+  D   K K  + E
Sbjct: 429  ------ELGEAQNELFRS----EAEIEELLRQLETLKE-AIEALRKTLDEKTKQKINAFE 477

Query: 1334 TEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICD 1393
             E+ I  A++          KT  +F   ++  E+ LQ+L E+IT   K L   + LI  
Sbjct: 478  LERAITIADK--------AKKTLKEF--REKLLERKLQQLEEEITKSFKKLMRKHNLISR 527

Query: 1394 KRGDP 1398
             + DP
Sbjct: 528  LKIDP 532


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 45.8 bits (109), Expect = 2e-04
 Identities = 31/183 (16%), Positives = 66/183 (36%), Gaps = 26/183 (14%)

Query: 1559 EQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVD 1618
            +++ E A  I   I+ LTD    L     D +    L  K N   A   S     QK++ 
Sbjct: 223  DELVEEAKTIKAEIEELTDELLNLVMDIEDPS--AALN-KLNTAAAKIKSKIEQFQKVIK 279

Query: 1619 VLTEART----AQDMAE------------VAIQTAKDDISAARKDLSQITNDLDDAQQKS 1662
            +  +        Q ++E              +Q + + +  A  +L +I ++ ++  +K 
Sbjct: 280  MYEKGGVCPTCTQQISEGPDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKL 339

Query: 1663 NETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETD 1722
             E   K+   ++ L ++          A  V+   + L+ E     ++ + L+    +  
Sbjct: 340  LELKNKISTNKQSLITLV-------DKAKKVKAAIEELQAEFVDNAEELAKLQDELDKIV 392

Query: 1723 NKL 1725
               
Sbjct: 393  KTK 395


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 45.4 bits (108), Expect = 3e-04
 Identities = 61/379 (16%), Positives = 136/379 (35%), Gaps = 65/379 (17%)

Query: 1436 KLINEKESKAEELRKHMIILSESNSAKDYA----ETSKKLINEKEAKAEEVFRGITTAK- 1490
            + +N  ES+  EL +    +      ++ A    + + +++ E E + EE    + T + 
Sbjct: 206  ERLNGLESELAELDE---EIERYEEQREQARETRDEADEVLEEHEERREE----LETLEA 258

Query: 1491 -----QESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNI 1545
                 +E+ A     +E          +          + ++ D+ L   G   AD   +
Sbjct: 259  EIEDLRETIAETEREREELAEEVRDLRERL------EELEEERDDLLAEAGLDDADAEAV 312

Query: 1546 SGL-----AVNKNIQSNPEQIRERANEINNVIKSLT-DIDTILTETAGDLAKANDLKRKA 1599
                      ++ ++   E+ R  A   N   +SL  D D +         +A +L+ + 
Sbjct: 313  EARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESEL 372

Query: 1600 NLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQ 1659
               +   +  +  ++++ + + E R     A V +  A+D       +   +  + D+ +
Sbjct: 373  EEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAED-----FLEE--LREERDELR 425

Query: 1660 QKSNETNVKVKLLQERLKSIQAGFLQNGR--------------SAMDVENEEKNLEKEVA 1705
            ++  E    ++  +ER++  +A  L+ G+                ++ E+ E+  E E  
Sbjct: 426  EREAELEATLRTARERVEEAEA-LLEAGKCPECGQPVEGSPHVETIE-EDRERVEELEAE 483

Query: 1706 LAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEA 1765
            L       L    +E + + L +AE       R + L  +   L        + + E   
Sbjct: 484  LED-----LEEEVEEVEER-LERAEDLVEAEDRIERLEERREDLE-------ELIAERRE 530

Query: 1766 MFNSQESELTELSKNIAEL 1784
                +     EL +  AEL
Sbjct: 531  TIEEKRERAEELRERAAEL 549



 Score = 43.5 bits (103), Expect = 0.001
 Identities = 94/548 (17%), Positives = 183/548 (33%), Gaps = 109/548 (19%)

Query: 1248 DDMKKQIMNNEDPNGVGKTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQE 1307
            D +K QI   E+ +          +R+N     L +L EEI++  +  E  +E  T  + 
Sbjct: 190  DQLKAQIEEKEEKDLH--------ERLNGLESELAELDEEIERYEEQREQARE--TRDEA 239

Query: 1308 NNVEGALELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENE 1367
            + V    E  R+  +T   L+ +  +  + I + ER  +R E                  
Sbjct: 240  DEVLEEHEERREELET---LEAEIEDLRETIAETER--EREE------------------ 276

Query: 1368 KALQRLNEKITSLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSA 1427
                 L E++  L + L +L E   D            A  G             +   A
Sbjct: 277  -----LAEEVRDLRERLEELEEERDDLL----------AEAGL------------DDADA 309

Query: 1428 KDYAETSKKLINEKESKAEELRKHMIILSESNS-AKDYAETSKKLINEKEAKAEEVFRGI 1486
            +      ++L +  E   + L +  +     N  A+   E +  L    E +AEE     
Sbjct: 310  EAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDL----EERAEE----- 360

Query: 1487 TTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNIS 1546
                +E   A  L  E  +   +V ++          I +  + F +AP     D+ N  
Sbjct: 361  ---LRE--EAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAP----VDLGNAE 411

Query: 1547 GLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAG- 1605
                        E++RE  +E+      L      L      + +A  L       + G 
Sbjct: 412  DFL---------EELREERDELREREAELE---ATLRTARERVEEAEALLEAGKCPECGQ 459

Query: 1606 --ADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSN 1663
                S        V+ + E R   +  E  ++  ++++    + L +   DL +A+ +  
Sbjct: 460  PVEGS------PHVETIEEDRERVEELEAELEDLEEEVEEVEERLERAE-DLVEAEDRIE 512

Query: 1664 ETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEK-NLEKEVALAQKQASGLRSRYQETD 1722
                + + L+E +   +   ++  R   +   E    LE E    ++ A+      +E  
Sbjct: 513  RLEERREDLEELIAERRET-IEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAR 571

Query: 1723 NKLLSKAESSGLKRARGQMLLNKASQLSV--NTTAKLKQLNETEAMFNSQESE----LTE 1776
             ++          + R + L    + L+   +   ++++L E          E    L E
Sbjct: 572  EEVAELNSKLAELKERIESLERIRTLLAAIADAEDEIERLREKREALAELNDERRERLAE 631

Query: 1777 LSKNIAEL 1784
              +   EL
Sbjct: 632  KRERKREL 639



 Score = 43.5 bits (103), Expect = 0.001
 Identities = 102/555 (18%), Positives = 190/555 (34%), Gaps = 135/555 (24%)

Query: 1215 EFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQRI 1274
            E E LE +IED++  +  T R   +L      + D+++++   E+               
Sbjct: 252  ELETLEAEIEDLRETIAETEREREELA---EEVRDLRERLEELEEERDDL---------- 298

Query: 1275 NLARLALNDLTEEI---------KKLNKTGEMLKENATLLQE--NNVEGALELTRQAHDT 1323
             LA   L+D   E           +  +  + L+E     Q      E   E      + 
Sbjct: 299  -LAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEER 357

Query: 1324 SLKLKEQSTETEKQINDAERQC-KRTENL---------VTKTFP----QFTQGQEENEKA 1369
            + +L+E++ E E ++ +A      R E +         + + F          ++  E+ 
Sbjct: 358  AEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLEEL 417

Query: 1370 LQ---RLNEKITSLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCG----------GMWC 1416
             +    L E+   LE  L    E + +        L     C +CG           +  
Sbjct: 418  REERDELREREAELEATLRTARERVEEAE-----ALLEAGKCPECGQPVEGSPHVETIEE 472

Query: 1417 SNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKE 1476
                + E  +  +  E   + + E+  +AE+L +           ++  E  ++LI E+ 
Sbjct: 473  DRERVEELEAELEDLEEEVEEVEERLERAEDLVE---AEDRIERLEERREDLEELIAERR 529

Query: 1477 AKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPG 1536
               EE          +   A  L + A +  A    K  A   +     +  +E      
Sbjct: 530  ETIEE----------KRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREE------ 573

Query: 1537 ATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLK 1596
              +A++   S LA  K    + E+IR     I +      +I+  L E    LA+ ND +
Sbjct: 574  --VAELN--SKLAELKERIESLERIRTLLAAIADAED---EIER-LREKREALAELNDER 625

Query: 1597 RKANLTKAGADSTKNTVQKIVDVLTEART-----AQDMAEVAIQTAKDDISAARKDLSQI 1651
            R                    + L E R        +  E  I+ A++D   A + L Q+
Sbjct: 626  R--------------------ERLAEKRERKRELEAEFDEARIEEAREDKERAEEYLEQV 665

Query: 1652 TNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLE----KEVALA 1707
               LD+ +++ ++       LQ  + +              VENE + LE    +  AL 
Sbjct: 666  EEKLDELREERDD-------LQAEIGA--------------VENELEELEELRERREALE 704

Query: 1708 QKQASGLRSRYQETD 1722
             +    L + Y E +
Sbjct: 705  NRV-EALEALYDEAE 718



 Score = 42.3 bits (100), Expect = 0.003
 Identities = 86/437 (19%), Positives = 158/437 (36%), Gaps = 107/437 (24%)

Query: 1219 LEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDP-NGVGKTLGNTTQRINLA 1277
            LE ++E+ +  VE   R           I++++++I    +        LGN    +   
Sbjct: 368  LESELEEAREAVE--DRRE--------EIEELEEEIEELRERFGDAPVDLGNAEDFLEEL 417

Query: 1278 RLALNDLTEEIKKLNKT----GEMLKENATLLQENN-------VEGA--LELTRQAHDTS 1324
            R   ++L E   +L  T     E ++E   LL+          VEG+  +E   +  +  
Sbjct: 418  REERDELREREAELEATLRTARERVEEAEALLEAGKCPECGQPVEGSPHVETIEEDRERV 477

Query: 1325 LKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNL 1384
             +L+ +  + E+++ + E + +R E+LV            E E  ++RL E+        
Sbjct: 478  EELEAELEDLEEEVEEVEERLERAEDLV------------EAEDRIERLEER-------R 518

Query: 1385 PDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESK 1444
             DL ELI ++R                                    ET    I EK  +
Sbjct: 519  EDLEELIAERR------------------------------------ET----IEEKRER 538

Query: 1445 AEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAY 1504
            AEELR+       +   +  AE  ++   E E +AEE    +     +  A      E+ 
Sbjct: 539  AEELRE------RAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSK-LAELKERIESL 591

Query: 1505 DHAASVRNKTAAYVASTSNIT---KQLDEFLNAPGATLADIRN-ISGLA--VNKNIQSNP 1558
            +   ++    A        +    + L E  +     LA+ R     L    ++      
Sbjct: 592  ERIRTLLAAIADAEDEIERLREKREALAELNDERRERLAEKRERKRELEAEFDEARIEEA 651

Query: 1559 EQIRERANE-INNVIKSLTDIDTILTETAGDLAKA-----NDLKRKANLTKAGADSTKNT 1612
             + +ERA E +  V + L +    L E   DL        N+L+    L +   ++ +N 
Sbjct: 652  REDKERAEEYLEQVEEKLDE----LREERDDLQAEIGAVENELEELEEL-RERREALENR 706

Query: 1613 VQKIVDVLTEARTAQDM 1629
            V+ +  +  EA   + M
Sbjct: 707  VEALEALYDEAEELESM 723



 Score = 40.0 bits (94), Expect = 0.013
 Identities = 43/229 (18%), Positives = 93/229 (40%), Gaps = 23/229 (10%)

Query: 1559 EQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVD 1618
             QI E+  E  ++ + L  +++ L E   ++ +  + + +A  T+  AD        +++
Sbjct: 194  AQIEEK--EEKDLHERLNGLESELAELDEEIERYEEQREQARETRDEADE-------VLE 244

Query: 1619 VLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKS 1678
               E R   +  E  I+  ++ I+   ++  ++  ++ D +++  E       L+E    
Sbjct: 245  EHEERREELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEE-------LEEERDD 297

Query: 1679 IQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRAR 1738
            + A    +   A  VE   + LE      + +    R   Q  + +  S  E +     R
Sbjct: 298  LLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEER 357

Query: 1739 GQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKR 1787
             + L  +A++L         +L E       +  E+ EL + I EL++R
Sbjct: 358  AEELREEAAELE-------SELEEAREAVEDRREEIEELEEEIEELRER 399


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division and
            chromosome partitioning].
          Length = 420

 Score = 45.1 bits (107), Expect = 3e-04
 Identities = 36/213 (16%), Positives = 81/213 (38%), Gaps = 27/213 (12%)

Query: 1602 TKAGADSTKNTVQKIVDV---LTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDA 1658
              A     K   ++I  +   + E +  +   E  +++ + +I++    L +  +DL   
Sbjct: 33   AAADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKL 92

Query: 1659 QQKSNETNVKVKLL------QERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQAS 1712
            +++  + N ++  L      Q R  + Q   LQ  RS  +         ++   + + A 
Sbjct: 93   RKQIADLNARLNALEVQEREQRRRLAEQLAALQ--RSGRNPPPALLVSPEDAQRSVRLAI 150

Query: 1713 GLR----SRYQETDN-----KLLSK------AESSGLKRARGQMLLNKASQLSVNTTAKL 1757
                   +R +  D      K L+       AE + L     +    + ++L+     + 
Sbjct: 151  YYGALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSE-QRAQQAKLAQLLEERK 209

Query: 1758 KQLNETEAMFNSQESELTELSKNIAELQKRIQS 1790
            K L +  +  ++ + +L EL  N + L+  I S
Sbjct: 210  KTLAQLNSELSADQKKLEELRANESRLKNEIAS 242



 Score = 36.6 bits (85), Expect = 0.10
 Identities = 48/268 (17%), Positives = 89/268 (33%), Gaps = 60/268 (22%)

Query: 1549 AVNKNIQSNPEQIRERANEINNVIKSL-TDIDTI---LTETAGDLAKAND---------- 1594
            A+ K I+    + +++  ++   +KSL T+I ++   L ETA DL K             
Sbjct: 49   ALEKKIR----EQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIADLNARLN 104

Query: 1595 ------LKRKANLTK-AGAD--STKNTVQKIV----DVLTEARTAQDMAEV--AIQTAKD 1639
                   +++  L +   A   S +N    ++    D     R A     +  A     D
Sbjct: 105  ALEVQEREQRRRLAEQLAALQRSGRNPPPALLVSPEDAQRSVRLAIYYGALNPARAERID 164

Query: 1640 DISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKN 1699
             + A  K L+ +  ++  A +++             L S Q    Q  + A  +E  +K 
Sbjct: 165  ALKATLKQLAAVRAEI--AAEQAE---------LTTLLSEQRA--QQAKLAQLLEERKKT 211

Query: 1700 ---LEKEVALAQK-------QASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQL 1749
               L  E++  QK         S L++     +       E++    A         +  
Sbjct: 212  LAQLNSELSADQKKLEELRANESRLKNEIASAEAAAAKAREAAAAAEAAAARARAAEA-- 269

Query: 1750 SVNTTAKLKQLNETEAMFNSQESELTEL 1777
                T +  +    E M  S       L
Sbjct: 270  --KRTGETYKPTAPEKMLISSTGGFGAL 295


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
            [Function unknown].
          Length = 294

 Score = 44.3 bits (105), Expect = 3e-04
 Identities = 39/266 (14%), Positives = 89/266 (33%), Gaps = 62/266 (23%)

Query: 1552 KNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKN 1611
            K ++   E+++E+ +E+      L               K ++L  K             
Sbjct: 16   KQLKEEIEELKEKRDELRKEASELA-------------EKRDELNAK------------- 49

Query: 1612 TVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKL 1671
             V+++ +   E R  +D     +Q  K+        L ++  +  + ++K NE N+  + 
Sbjct: 50   -VRELREKAQELREERDEINEEVQELKEKRDEINAKLQELRKEYRELKEKRNEFNLGGRS 108

Query: 1672 LQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAES 1731
            ++   + I+       R     +      E+E  L QK    LR   ++    L    + 
Sbjct: 109  IKSLEREIE-------RLEKKQQTSVLTPEEERELVQK-IKELRKELEDAKKALEENEKL 160

Query: 1732 SGLK------RARGQMLLNKASQLSVNTTAKLKQLNET---------------------E 1764
              LK      + + + +  K  +L+       +++ +                       
Sbjct: 161  KELKAEIDELKKKAREIHEKIQELANEAQEYHEEMIKLFEEADELRKEADELHEEFVELS 220

Query: 1765 AMFNSQESELTELSKNIAELQKRIQS 1790
               +    E   L   + EL+K+I++
Sbjct: 221  KKIDELHEEFRNLQNELRELEKKIKA 246



 Score = 44.3 bits (105), Expect = 4e-04
 Identities = 53/279 (18%), Positives = 101/279 (36%), Gaps = 43/279 (15%)

Query: 1435 KKLINEKESKAEELRKHMIILSE-----SNSAKDYAETSKKLINEKEAKAEEVFRGITTA 1489
            K+ I E + K +ELRK    L+E     +   ++  E +++L  E++   EEV       
Sbjct: 19   KEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREERDEINEEV----QEL 74

Query: 1490 KQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLA 1549
            K++    N   +E       ++              ++ +EF     +  +  R I  L 
Sbjct: 75   KEKRDEINAKLQELRKEYRELK--------------EKRNEFNLGGRSIKSLEREIERLE 120

Query: 1550 -VNKNIQSNPEQIRERANEINNVIKSLTDIDTILT---ETAGDLAKANDLKRKANLTKAG 1605
               +     PE+ RE   +I  + K L D    L    +     A+ ++LK+KA      
Sbjct: 121  KKQQTSVLTPEEERELVQKIKELRKELEDAKKALEENEKLKELKAEIDELKKKAREIHEK 180

Query: 1606 ADSTKNTVQKIVDVLTEA-------RTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDA 1658
                 N  Q+  + + +        R   D            I    ++   + N+L + 
Sbjct: 181  IQELANEAQEYHEEMIKLFEEADELRKEADELHEEFVELSKKIDELHEEFRNLQNELREL 240

Query: 1659 QQK-----SNETNVKVK----LLQERLKSIQAGFLQNGR 1688
            ++K     + E   K +     L+ER + I   F +  +
Sbjct: 241  EKKIKALRAKEKAAKRREKREELKERAEEIYEKFKRGEK 279



 Score = 31.6 bits (72), Expect = 3.7
 Identities = 24/116 (20%), Positives = 46/116 (39%), Gaps = 12/116 (10%)

Query: 1684 LQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLL 1743
            L+  R  +  E EE   +++    +K+AS L  +  E + K+    E +   R     + 
Sbjct: 11   LELKRKQLKEEIEELKEKRDEL--RKEASELAEKRDELNAKVRELREKAQELREERDEIN 68

Query: 1744 NKASQL-----SVNTTA-----KLKQLNETEAMFNSQESELTELSKNIAELQKRIQ 1789
             +  +L      +N        + ++L E    FN     +  L + I  L+K+ Q
Sbjct: 69   EEVQELKEKRDEINAKLQELRKEYRELKEKRNEFNLGGRSIKSLEREIERLEKKQQ 124


>gnl|CDD|214599 smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis
            sensory transducer).  Thought to undergo reversible
            methylation in response to attractants or repellants
            during bacterial chemotaxis.
          Length = 262

 Score = 44.2 bits (105), Expect = 3e-04
 Identities = 48/264 (18%), Positives = 95/264 (35%), Gaps = 51/264 (19%)

Query: 1436 KLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHA 1495
            + + E  + AEE         E     +  E     I E  A A+E+     +A + +  
Sbjct: 4    EAVEEIAAGAEEQA------EELEELAERMEELSASIEEVAANADEIAATAQSAAEAAEE 57

Query: 1496 ANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISG----LAVN 1551
                 ++A      +R      V    +  ++L+E  +  G  ++ I +I+     LA+N
Sbjct: 58   GREAVEDAITAMDQIREV----VEEAVSAVEELEESSDEIGEIVSVIDDIADQTNLLALN 113

Query: 1552 KNI-------------------QSNPEQIRERANEINNVIK--------SLTDIDTILTE 1584
              I                   +   E+  E A EI ++IK        ++  ++   +E
Sbjct: 114  AAIEAARAGEAGRGFAVVADEVRKLAERSAESAKEIESLIKEIQEETNEAVAAMEESSSE 173

Query: 1585 TAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTE--------ARTAQDMAEVAIQT 1636
                +    +            +   + VQ+I     E             ++A+V  +T
Sbjct: 174  VEEGVELVEETGDALEEIVDSVEEIADLVQEIAAATDEQAAGSEEVNAAIDEIAQVTQET 233

Query: 1637 AK--DDISAARKDLSQITNDLDDA 1658
            A   ++ISAA ++LS +  +LD+ 
Sbjct: 234  AAMSEEISAAAEELSGLAEELDEL 257



 Score = 36.9 bits (86), Expect = 0.071
 Identities = 41/217 (18%), Positives = 75/217 (34%), Gaps = 38/217 (17%)

Query: 1606 ADSTKNTVQKIVDVLTE-ARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNE 1664
             +     ++++   + E A  A ++A  A   A+      R+ +      +D  ++   E
Sbjct: 20   LEELAERMEELSASIEEVAANADEIAATAQSAAEA-AEEGREAVEDAITAMDQIREVVEE 78

Query: 1665 TNVKVKLLQERLKSIQ--------------------------AGFLQNGRSAMDVENEEK 1698
                V+ L+E    I                           AG  + GR    V +E +
Sbjct: 79   AVSAVEELEESSDEIGEIVSVIDDIADQTNLLALNAAIEAARAG--EAGRGFAVVADEVR 136

Query: 1699 NLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQL------SVN 1752
             L +  A + K+   L    QE  N+ ++  E S  +   G  L+ +          SV 
Sbjct: 137  KLAERSAESAKEIESLIKEIQEETNEAVAAMEESSSEVEEGVELVEETGDALEEIVDSVE 196

Query: 1753 TTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQ 1789
              A L Q     A  + Q +   E++  I E+ +  Q
Sbjct: 197  EIADLVQ-EIAAAT-DEQAAGSEEVNAAIDEIAQVTQ 231


>gnl|CDD|221776 pfam12795, MscS_porin, Mechanosensitive ion channel porin domain.
            The small mechanosensitive channel, MscS, is a part of
            the turgor-driven solute efflux system that protects
            bacteria from lysis in the event of osmotic shock. The
            MscS protein alone is sufficient to form a functional
            mechanosensitive channel gated directly by tension in the
            lipid bilayer. The MscS proteins are heptamers of three
            transmembrane subunits with seven converging M3 domains,
            and this MscS_porin is towards the N-terminal of the
            molecules. The high concentration of negative charges at
            the extracellular entrance of the pore helps select the
            cations for efflux.
          Length = 239

 Score = 43.8 bits (104), Expect = 4e-04
 Identities = 41/216 (18%), Positives = 81/216 (37%), Gaps = 42/216 (19%)

Query: 1551 NKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTK 1610
             K I   P+++RE   E+  + K+   +   L          + L+++   T +     +
Sbjct: 44   QKQIDDAPKELRELRQELEALKKTDAPVFPELANL-----SLSQLEQRLAQTLSQLQELQ 98

Query: 1611 NTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVK 1670
              +Q+    L E +T         + A+  +S AR+ L +I N L               
Sbjct: 99   EQLQQENSQLIELQTR-------PERAQQQLSEARRRLQEIRNRLQALSPGGTP------ 145

Query: 1671 LLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVA-----LAQKQASGLRSRYQETD--- 1722
            L Q +   +QA       +A+  + EE  LE         L + Q   L+ R +  +   
Sbjct: 146  LAQAQRTLLQA-----ELAALKAQIEELELELLSNNNRQELLRLQRDLLKKRIERLEAEL 200

Query: 1723 --------NKLLSKAESSGLKRARGQMLLNKASQLS 1750
                     K L+++E + ++ A  + L  +++ L 
Sbjct: 201  QALQNAINRKRLAESEQA-VEEA--ERLAEQSAGLP 233



 Score = 41.5 bits (98), Expect = 0.002
 Identities = 43/215 (20%), Positives = 79/215 (36%), Gaps = 32/215 (14%)

Query: 1593 NDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQIT 1652
            N L     L K    + +   + ++  L     A +  +  I  +K      +K +    
Sbjct: 1    NQLD---QLNKRKLPTPEQ--KALIQDL---EQALNFLD-EIDKSKQKADQYQKQIDDAP 51

Query: 1653 NDLDDAQQ-----KSNETNVKVKL-------LQERLKSIQAGFLQNGRSAMDVENEEKNL 1700
             +L + +Q     K  +  V  +L       L++RL    +   +         ++   L
Sbjct: 52   KELRELRQELEALKKTDAPVFPELANLSLSQLEQRLAQTLSQLQELQEQLQQENSQLIEL 111

Query: 1701 EKEVALAQKQASGLRSRYQETDNKLLS-KAESSGLKRARGQMLLNKASQLSVNTTAKLKQ 1759
            +     AQ+Q S  R R QE  N+L +     + L +A+ + LL           A++ +
Sbjct: 112  QTRPERAQQQLSEARRRLQEIRNRLQALSPGGTPLAQAQ-RTLLQAELAA---LKAQI-E 166

Query: 1760 LNETEAMFNSQESELT-----ELSKNIAELQKRIQ 1789
              E E + N+   EL       L K I  L+  +Q
Sbjct: 167  ELELELLSNNNRQELLRLQRDLLKKRIERLEAELQ 201



 Score = 31.1 bits (71), Expect = 4.7
 Identities = 20/109 (18%), Positives = 41/109 (37%), Gaps = 17/109 (15%)

Query: 1691 MDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLS 1750
            +D  N+ K    E     +         ++  N  L + + S  K  + Q  ++ A +  
Sbjct: 3    LDQLNKRKLPTPEQKALIQD-------LEQALN-FLDEIDKSKQKADQYQKQIDDAPKEL 54

Query: 1751 VNTTAKLKQLNETEAM----FNSQ-----ESELTELSKNIAELQKRIQS 1790
                 +L+ L +T+A       +      E  L +    + ELQ+++Q 
Sbjct: 55   RELRQELEALKKTDAPVFPELANLSLSQLEQRLAQTLSQLQELQEQLQQ 103


>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
            Provisional.
          Length = 2722

 Score = 45.2 bits (107), Expect = 4e-04
 Identities = 116/660 (17%), Positives = 212/660 (32%), Gaps = 132/660 (20%)

Query: 1180 DNWDSILRGLADQTTKVITDASNIKQTGASGAYTKEFELLEKKIEDVKALVE-------N 1232
            +  +   + +     ++     NIK TG         +++E  +  V  +         N
Sbjct: 209  NKKEDYDKKIKKINNEIRKLLKNIKCTGNMCKTDTYVDMVELYLLRVNEVPSNNYDNYLN 268

Query: 1233 TTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVG---KTLGNTTQRINLARLALNDLTEEIK 1289
              +   +       I+ +KK++ +N+    +    + +G+  +R N            +K
Sbjct: 269  RAKELLESGS--DLINKIKKELGDNKTIYSINFIQEEIGDIIKRYNF----------HLK 316

Query: 1290 KLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTE 1349
            K+ K     KE    +Q NN+   ++           LK++  E+ K     +   +   
Sbjct: 317  KIEKG----KEYIKRIQNNNIPPQVKKDE--------LKKKYFESAKHYASFKFSLEMLS 364

Query: 1350 NLVTKTFPQFTQGQEENEKALQRLNEKI-----TSLEKNLPDLNELICDKR---GDPCDN 1401
                           + EK L  L  K+       +E  L D    I             
Sbjct: 365  --------MLDSLLIKKEKILNNLFNKLFGDLKEKIETLL-DSEYFISKYTNIISLSEHT 415

Query: 1402 LCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIILSESNSA 1461
            L       K             S    DYA  S   I E + K +E         + N  
Sbjct: 416  LKAAEDVLK-----------ENSQKIADYALYSNLEIIEIKKKYDE---------KINEL 455

Query: 1462 KDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVAST 1521
            K      K LI+  ++                +   I  K++ D   S   K     +S 
Sbjct: 456  KKSINQLKTLISIMKSF---------------YDLIISEKDSMD---SKEKK----ESSD 493

Query: 1522 SNITKQLDEFLNAPGATLADIRNISGLAVN--KNIQSNPEQIRERANEINNVIKSLTDID 1579
            SN  +++DE L      +  I+  + +  N  KNI+     I    NEI  +I+ +    
Sbjct: 494  SNYQEKVDELLQ----IINSIKEKNNIVNNNFKNIEDYYITIEGLKNEIEGLIELIKY-- 547

Query: 1580 TILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKI---VDVLTEARTAQDMAEVAIQT 1636
                     L     L +   L ++  +  KN ++ I   VD + +  +  D  +  IQ 
Sbjct: 548  --------YLQSIETLIKDEKLKRSMKNDIKNKIKYIEENVDHIKDIISLNDEIDNIIQQ 599

Query: 1637 AKDDISAARKDLSQITNDLDDAQQK--SNETNVKVKLLQERLKSI------QAGFLQNGR 1688
             ++ I+ A  +  +  N+ +D Q+K            LQE L  +              +
Sbjct: 600  IEELINEALFNKEKFINEKNDLQEKVKYILNKFYKGDLQELLDELSHFLDDHKYLYHEAK 659

Query: 1689 SAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQ 1748
            S  D++      + E    +   S            L  K E   L   +  ++     Q
Sbjct: 660  SKEDLQTLLNTSKNEYEKLEFMKS---DNIDNIIKNL--KKELQNLLSLKENII---KKQ 711

Query: 1749 LSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCIN----FIVDKSNNYKN 1804
            L+         LN+    +N  +S + E  +   +L+      IN    FI+    N K+
Sbjct: 712  LNNIEQDISNSLNQYTIKYNDLKSSIEEYKEEEEKLEVYKHQIINRKNEFILHLYENDKD 771



 Score = 42.9 bits (101), Expect = 0.002
 Identities = 114/673 (16%), Positives = 237/673 (35%), Gaps = 126/673 (18%)

Query: 1194 TKVITDASNIKQTGASGAYTKEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQ 1253
             ++ T+A  +K          E + ++     V + ++   + ++ +   L  I +M + 
Sbjct: 1234 EELTTEAKGLKGEANRSTNVDELKEIKL---QVFSYLQQVIKENNKMENALHEIKNMYEF 1290

Query: 1254 IMNNEDPNGVGKTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKE-NATLLQ----EN 1308
            +++  D   + K + N+T++      A     +  K+L KT  ++K+  A + Q    +N
Sbjct: 1291 LISI-DSEKILKEILNSTKK------AEEFSNDAKKELEKTDNLIKQVEAKIEQAKEHKN 1343

Query: 1309 NVEGALELTRQAHDTSL--KLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEEN 1366
             + G+LE  +   +     ++KE+ +   K+IN            ++       +     
Sbjct: 1344 KIYGSLEDKQIDDEIKKIEQIKEEISNKRKEIN----------KYLSNIKSNKEKCDLHV 1393

Query: 1367 EKALQRLNEKITSLEKNLP----DLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLS 1422
              A  R  +KI  L K+      +  E+   K  D  +         KC          +
Sbjct: 1394 RNA-SRGKDKIDFLNKHEAIEPSNSKEVNIIKITDNIN---------KC------KQYSN 1437

Query: 1423 ESNSAKDYAETSKKLINEKESKAEELRKHMIIL----------SESNSAKDYAETSKKLI 1472
            E+   ++ A+ +   I + E +   +  +  IL           E+ +  D       +I
Sbjct: 1438 EAMETENKADENNDSIIKYEKEITNILNNSSILGKKTKLEKKKKEATNIMDDINGEHSII 1497

Query: 1473 NEKEAKAEEVFRGITTAKQESHAANILA----KEAYDHAASVRNKTAAYVASTSNITKQL 1528
              K  K+ E    +          ++L       AY+       +    + +  NI  ++
Sbjct: 1498 KTKLTKSSEKLNQLNEQPNIKREGDVLNNDKSTIAYETIQYNLGRVKHNLLNILNIKDEI 1557

Query: 1529 DEFLNAPGATLADIRNISGLAVNKN-----------------IQSNPEQIRERANEINNV 1571
            +  LN     + DI  IS +  NKN                 I    + +     ++N +
Sbjct: 1558 ETILNKAQDLMRDISKISKIVENKNLENLNDKEADYVKYLDNILKEKQLMEAEYKKLNEI 1617

Query: 1572 IKSLTDIDTILTETAG--DLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDM 1629
               + +I+  L +     ++     +       K   DSTK ++  +V+  +       +
Sbjct: 1618 YSDVDNIEKELKKHKKNYEIGLLEKVIEINKNIKLYMDSTKESLNSLVNNFSSLFNNFYL 1677

Query: 1630 AEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRS 1689
             +  I    +++   +K L++I           NE      ++QE++K +         S
Sbjct: 1678 NKYNI---NENLEKYKKKLNEI----------YNEFMESYNIIQEKMKEV---------S 1715

Query: 1690 AMDVE-NEEKNLEKEVALAQKQASGLRSRYQETDNKL--LSKAESSGLKRARGQMLLN-- 1744
              DV+ NE K L +E   AQK+   L ++ +E    L  + K ES        + L    
Sbjct: 1716 NDDVDYNEAKTLREE---AQKEEVNLNNKEEEAKKYLNDIKKQESFRFILYMKEKLDELS 1772

Query: 1745 ---KASQLSVNTTAK-LKQLNETEAMFNSQESELTELSKNIAELQKRIQSCINF------ 1794
               K     V+     +K+  E     N + +    L     +  K + +  ++      
Sbjct: 1773 KMCKQQYNIVDEGYNYIKKKIEYIKTLNDENNLSDSL-NQAEDKNKEVANLTHYTNKNEA 1831

Query: 1795 -----IVDKSNNY 1802
                  V KS N+
Sbjct: 1832 KNLLGHVVKSANF 1844



 Score = 39.0 bits (91), Expect = 0.027
 Identities = 120/619 (19%), Positives = 220/619 (35%), Gaps = 120/619 (19%)

Query: 1195 KVITDASNIKQTG---ASGAYTKEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMK 1251
              + +  NIK+ G    +   T  +E ++  +  VK  + N      ++  +L    D+ 
Sbjct: 1509 NQLNEQPNIKREGDVLNNDKSTIAYETIQYNLGRVKHNLLNILNIKDEIETILNKAQDLM 1568

Query: 1252 KQIMNNEDPNGVGKTLGNTTQRINLARLA--LNDLTEEIKKLNKTGEMLKENATLLQE-- 1307
            + I            +    +  NL  L     D  + +  + K  ++++     L E  
Sbjct: 1569 RDI----------SKISKIVENKNLENLNDKEADYVKYLDNILKEKQLMEAEYKKLNEIY 1618

Query: 1308 ---NNVEGALELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQE 1364
               +N+E  L+  ++ ++  L         EK I   +      +               
Sbjct: 1619 SDVDNIEKELKKHKKNYEIGL--------LEKVIEINKNIKLYMD--------------- 1655

Query: 1365 ENEKALQRLNEKITSLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSES 1424
              +++L  L    +SL  N   LN+       +  +NL                    + 
Sbjct: 1656 STKESLNSLVNNFSSLFNNF-YLNKY------NINENL---------------EKYKKKL 1693

Query: 1425 NSAKDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFR 1484
            N   +    S  +I EK  +        +  +E+ + ++ A+  +  +N KE +A++   
Sbjct: 1694 NEIYNEFMESYNIIQEKMKEVSN---DDVDYNEAKTLREEAQKEEVNLNNKEEEAKKYLN 1750

Query: 1485 GITTAKQESHAANILAKEAYDHAASVRNKTAAYVA--STSNITKQLDEFLNAPGATLADI 1542
             I   KQES    +  KE  D   S   K    +     + I K+++        TL D 
Sbjct: 1751 DIK--KQESFRFILYMKEKLDEL-SKMCKQQYNIVDEGYNYIKKKIEYI-----KTLNDE 1802

Query: 1543 RNISGL---AVNKNIQSNPEQIRERANEINN----VIKSLTDID-TILTETAG-DLAKAN 1593
             N+S     A +KN +          NE  N    V+KS   I   I+T     +L    
Sbjct: 1803 NNLSDSLNQAEDKNKEVANLTHYTNKNEAKNLLGHVVKSANFIGIKIMTGLQPTELTPDA 1862

Query: 1594 DLKRKANLT---KAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAK--DDISAARKDL 1648
             L+    LT   +  +D   + +    + L   +  QD  + ++Q  K  DDI   ++D 
Sbjct: 1863 SLETAPELTFESENNSDLELDHLSSNKNELDVYKNIQDAYKSSLQILKYSDDIDKKQRDC 1922

Query: 1649 SQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQ 1708
            +++  D        NE  +K   + E    I             V+N+E  +  ++    
Sbjct: 1923 NKLVED-------GNEIYLKSTAINELKNMIN-----------SVKNKESAISNKIDNVS 1964

Query: 1709 KQASGLRSRYQETDNK---LLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEA 1765
             + S L ++    D     +L K E   LK  R      KA       T    +LN+ + 
Sbjct: 1965 NKLSEL-NKITCNDESYDEILEKEEYEELKDLRNSFNQEKA------ETLNNLKLNKIKE 2017

Query: 1766 MFNSQESELTELSKNIAEL 1784
             FNS ++ L EL K++  L
Sbjct: 2018 DFNSYKNLLDELEKSVKTL 2036



 Score = 39.0 bits (91), Expect = 0.027
 Identities = 78/515 (15%), Positives = 176/515 (34%), Gaps = 92/515 (17%)

Query: 1230 VENTTRSSHDLTVMLTT----IDDMKKQIMNNEDPNGVGKTLGNTTQRINLARLALNDLT 1285
            ++   + S +L+  +T     I+++KK+I+   +         NT               
Sbjct: 2152 IDKANKLSSELSEAVTNSEEIIENIKKEIIEINE---------NT--------------- 2187

Query: 1286 EEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTS--LKLKEQSTETEKQINDAER 1343
             E+  L  T + LKE        N++    +    +     +KL+E    +EK  + ++ 
Sbjct: 2188 -EMNTLENTADKLKELY-----ENLKKKKNIINNIYKKINFIKLQEIENSSEKYNDISK- 2240

Query: 1344 QCKRTENLVTKTFPQFTQGQE--ENEKALQRLNEKITSLEKNLPDLNELICDKRGDPCDN 1401
                  N V +T     Q ++  +N+  +  + +KI   EK L +++     +     + 
Sbjct: 2241 ----LFNNVVET-----QKKKLLDNKNKINNIKDKINDKEKELINVDSSFTLESIKTFNE 2291

Query: 1402 LCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIILSES--N 1459
            +                   S         E +     +K     E   H++    +  N
Sbjct: 2292 IYDD--------------IKSNIGDLYKL-EDTNNDELKKVKLYIENITHLLNRINTLIN 2336

Query: 1460 SAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVA 1519
               +Y + +       E   E      +   +     N L +E      ++  K    + 
Sbjct: 2337 DLDNYQDENYGKDKNIELNNENN----SYIIKTKEKINNLKEEFSKLLKNI--KRNNTLC 2390

Query: 1520 STSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTD-- 1577
            + +NI   +         ++  I+     + N   +    QI E  NEI N++       
Sbjct: 2391 NNNNIKDFISNIGK----SVETIK--QRFSSNLPEKEKLHQIEENLNEIKNIMNETKRIS 2444

Query: 1578 -----IDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEV 1632
                  + IL +   +  K N+    A       ++  +  +KI   L     A      
Sbjct: 2445 NVDAFTNKILQDIDNEKNKENNN-MNAEKIDDLIENVTSHNEKIKSELLIINDALR---- 2499

Query: 1633 AIQTAKDDISAARKDLSQITNDLDD-AQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAM 1691
             ++  KD+++     L++  N+ ++ A+   + +   +  L+  +  +            
Sbjct: 2500 RVKEKKDEMNKLFNSLTENNNNNNNSAKNIVDNSTYIINELESHVSKLNELLSYIDNEIK 2559

Query: 1692 DVENEEKNLEK--EVALAQKQASGLRSRYQETDNK 1724
            ++ENE+  L +  ++  ++K+   + S  QE +  
Sbjct: 2560 ELENEKLKLLEKAKIEESRKERERIESETQEDNTD 2594



 Score = 34.8 bits (80), Expect = 0.53
 Identities = 71/417 (17%), Positives = 150/417 (35%), Gaps = 70/417 (16%)

Query: 1432 ETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAK- 1490
               K+  N  ++  +EL K +  L  S + K   E  K  I+      E++ + I +   
Sbjct: 2013 NKIKEDFNSYKNLLDELEKSVKTLKASENIKKIVENKKTSIDAINTNIEDIEKEIESINP 2072

Query: 1491 ------QESHAANI---------LAKEAYDHAASVR---NKTAAYVA-------STSNIT 1525
                  ++ H   I         +  +  + + ++     K   Y+A       S     
Sbjct: 2073 SLDELLKKGHKIEISRYTSIIDNVQTKISNDSKNINDIEKKAQIYLAYIKNNYNSIKKDI 2132

Query: 1526 KQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSL------TDID 1579
              L+E+ +    +   + NI     NK      E +      I N+ K +      T+++
Sbjct: 2133 STLNEYFDEKQVSNYILTNID--KANKLSSELSEAVTNSEEIIENIKKEIIEINENTEMN 2190

Query: 1580 TILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDV-LTEARTAQ----DMAEV-- 1632
            T+            +LK+K N          N  +KI  + L E   +     D++++  
Sbjct: 2191 TLENTADKLKELYENLKKKKN-------IINNIYKKINFIKLQEIENSSEKYNDISKLFN 2243

Query: 1633 -AIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQ-ERLKSIQAGFLQNGRSA 1690
              ++T K  +   +  ++ I + ++D +++    +    L   +    I      N    
Sbjct: 2244 NVVETQKKKLLDNKNKINNIKDKINDKEKELINVDSSFTLESIKTFNEIYDDIKSNIGDL 2303

Query: 1691 MDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLS 1750
              +E+   +  K+V L  +  + L +R     N L +  + +  K        +K  +L+
Sbjct: 2304 YKLEDTNNDELKKVKLYIENITHLLNRINTLINDLDNYQDENYGK--------DKNIELN 2355

Query: 1751 VNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCINFIVDKSNNYKNCVS 1807
                + + +  E     N+ + E ++L KNI           N  +  +NN K+ +S
Sbjct: 2356 NENNSYIIKTKEK---INNLKEEFSKLLKNIKR---------NNTLCNNNNIKDFIS 2400



 Score = 33.3 bits (76), Expect = 1.6
 Identities = 109/599 (18%), Positives = 202/599 (33%), Gaps = 119/599 (19%)

Query: 1247 IDDMKKQIMN-NEDPNGVGKTLGNTTQRINLARLALNDLTEE----IKKLN------KTG 1295
            +D++ K I N N++ N + KTL     R N  +  +  L         KL        T 
Sbjct: 865  VDNIIKDIENMNKNINII-KTLNIAINRSNSNKQLVEHLLNNKIDLKNKLEQHMKIINTD 923

Query: 1296 EMLKENATLLQENNV-EGALELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTENLVTK 1354
             ++++N  L   NN+ +   ++ +Q  DT  K+     + EK +   ++  K   N    
Sbjct: 924  NIIQKNEKLNLLNNLNKEKEKIEKQLSDT--KINNLKMQIEKTLEYYDKS-KENINGNDG 980

Query: 1355 TFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGM 1414
            T  +     ++ +   +    +I  L  N   LN+ I D      D              
Sbjct: 981  THLE---KLDKEKDEWEHFKSEIDKLNVNYNILNKKIDDLIKKQHD-------------- 1023

Query: 1415 WCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIILSE------------SNSAK 1462
                          D  E   KLI EK  + EE     I L E             +  K
Sbjct: 1024 --------------DIIELIDKLIKEKGKEIEEKVDQYISLLEKMKTKLSSFHFNIDIKK 1069

Query: 1463 DYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTS 1522
                  K+ I   E K E + + I               E  +    ++NK+  +V +  
Sbjct: 1070 YKNPKIKEEIKLLEEKVEALLKKI--------------DENKNKLIEIKNKSHEHVVNAD 1115

Query: 1523 NITKQLDEFLNAPGATLADI-RNISGLAVNKNIQS--NPEQIRERANEINNVIKSLTD-- 1577
                +  E  N    +L  I + +      K ++   N        NE+N  I+   +  
Sbjct: 1116 KEKNKQTEHYNKKKKSLEKIYKQME-----KTLKELENMNLEDITLNEVNE-IEIEYERI 1169

Query: 1578 -IDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQT 1636
             ID I+ +   +  K+  +  +    K   D  K  + K             +       
Sbjct: 1170 LIDHIVEQINNEAKKSKTIMEEIESYKKDIDQVKKNMSK--------ERNDHLTTFEYNA 1221

Query: 1637 AKDDISAARKDLSQITNDLDDAQQKSN-ETNVKVKLLQERLKSIQAGFLQNGRSAMDVEN 1695
              D  +A+ +++ ++T +    + ++N  TNV        LK I+       +  +   N
Sbjct: 1222 YYDKATASYENIEELTTEAKGLKGEANRSTNVD------ELKEIKLQVFSYLQQVIKENN 1275

Query: 1696 EEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRAR-GQMLLNKASQLSVNTT 1754
            + +N   E+   +     L S   E   K +  +     + +   +  L K   L     
Sbjct: 1276 KMENALHEI---KNMYEFLISIDSEKILKEILNSTKKAEEFSNDAKKELEKTDNLIKQVE 1332

Query: 1755 AKLKQLNETE-AMFNSQESE--------LTELSKNIAELQKRIQSCINFIVDKSNNYKN 1804
            AK++Q  E +  ++ S E +        + ++ + I+  +K I   +      SN   N
Sbjct: 1333 AKIEQAKEHKNKIYGSLEDKQIDDEIKKIEQIKEEISNKRKEINKYL------SNIKSN 1385



 Score = 32.5 bits (74), Expect = 2.7
 Identities = 115/630 (18%), Positives = 212/630 (33%), Gaps = 106/630 (16%)

Query: 1212 YTKEFELLEKKIEDVKALVENTTRSSHDLTVMLT-TIDDMKKQIMNNEDPNGVGKTLGNT 1270
            Y ++   L+K I  +K L+ +  +S +DL +    ++D  +K+  ++ +       L   
Sbjct: 448  YDEKINELKKSINQLKTLI-SIMKSFYDLIISEKDSMDSKEKKESSDSNYQEKVDELLQI 506

Query: 1271 TQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQ 1330
               I      +N+  + I+    T E LK        N +EG +EL +    +   L + 
Sbjct: 507  INSIKEKNNIVNNNFKNIEDYYITIEGLK--------NEIEGLIELIKYYLQSIETLIKD 558

Query: 1331 STETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNEL 1390
                    ND + + K  E  V               K +  LN++I ++ + + +L   
Sbjct: 559  EKLKRSMKNDIKNKIKYIEENVDHI------------KDIISLNDEIDNIIQQIEELINE 606

Query: 1391 ICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSE-SNSAKDYAETSKKLINEKESKAEELR 1449
                +    +         K        G L E  +    + +  K L +E +SK +   
Sbjct: 607  ALFNKEKFINEKNDLQEKVKYILNKFYKGDLQELLDELSHFLDDHKYLYHEAKSKEDLQT 666

Query: 1450 KHMIILSE------------SNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAAN 1497
                  +E             N  K+  +  + L++ KE   ++    I      S   N
Sbjct: 667  LLNTSKNEYEKLEFMKSDNIDNIIKNLKKELQNLLSLKENIIKKQLNNIEQDISNS--LN 724

Query: 1498 ILAKEAYDHAASVRNKTAAYVASTS-------NITKQLDEF---LNAPGATLADIRNISG 1547
                +  D    +++    Y             I  + +EF   L      L D +N   
Sbjct: 725  QYTIKYND----LKSSIEEYKEEEEKLEVYKHQIINRKNEFILHLYENDKDLPDGKNTY- 779

Query: 1548 LAVNKNIQSNPEQIRERANEINNVIKSLTD-IDTILTETAGDLAKANDL--KRKANLTKA 1604
                +      + I  + N+I+N I  L +            L   N L  K +A+  K 
Sbjct: 780  ----EEFLQYKDTILNKENKISNDINILKENKKNNQDL----LNSYNILIQKLEAHTEKN 831

Query: 1605 GADSTKNTVQKIVDVLTEARTAQDMAEV--AIQTAKD---DISAARKDLSQI-------- 1651
                 K  +QK           +   E     Q   +   DI    K+++ I        
Sbjct: 832  DE-ELKQLLQKFPTEDENLNLKELEKEFNENNQIVDNIIKDIENMNKNINIIKTLNIAIN 890

Query: 1652 ---TNDLDDAQQKSNETNVKVKLLQERLKSIQA-GFLQNGRSAMDVENEEKNLEKEVALA 1707
               +N        +N+ ++K KL Q  +K I     +Q      +  N   NL KE    
Sbjct: 891  RSNSNKQLVEHLLNNKIDLKNKLEQ-HMKIINTDNIIQKN----EKLNLLNNLNKEKEKI 945

Query: 1708 QKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAK----LKQLNET 1763
            +KQ               LS  + + LK  + +  L    +   N        L++L++ 
Sbjct: 946  EKQ---------------LSDTKINNLK-MQIEKTLEYYDKSKENINGNDGTHLEKLDKE 989

Query: 1764 EAMFNSQESELTELSKNIAELQKRIQSCIN 1793
            +  +   +SE+ +L+ N   L K+I   I 
Sbjct: 990  KDEWEHFKSEIDKLNVNYNILNKKIDDLIK 1019



 Score = 31.0 bits (70), Expect = 8.8
 Identities = 74/542 (13%), Positives = 177/542 (32%), Gaps = 101/542 (18%)

Query: 1266 TLGNTTQRINLARLALNDLTEEIKKLN----KTGEMLKENATLLQENNVEGALELTRQAH 1321
             + +    I   + +++   ++    +    K  E+L+   ++ ++NN+       +   
Sbjct: 468  IMKSFYDLIISEKDSMDSKEKKESSDSNYQEKVDELLQIINSIKEKNNI--VNNNFKNIE 525

Query: 1322 DTSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLE 1381
            D  + ++    E E  I   +   +  E L+          ++        +  KI  +E
Sbjct: 526  DYYITIEGLKNEIEGLIELIKYYLQSIETLIKD--------EKLKRSMKNDIKNKIKYIE 577

Query: 1382 KNLPDLNELI-----CDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKK 1436
            +N+  + ++I      D      + L                  ++E+   K+     K 
Sbjct: 578  ENVDHIKDIISLNDEIDNIIQQIEEL------------------INEALFNKEKFINEKN 619

Query: 1437 LINEK-ESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHA 1495
             + EK +    +  K  +       +    +  K L +E ++K E++   + T+K E   
Sbjct: 620  DLQEKVKYILNKFYKGDLQELLDELSHFLDDH-KYLYHEAKSK-EDLQTLLNTSKNEY-- 675

Query: 1496 ANILAKEAYDHAASVRNKTAAYVASTSNITKQL-DEFLNAPGATLADIRNISGLAVNKNI 1554
               L     D+  ++       + +  ++ + +  + LN                + ++I
Sbjct: 676  -EKLEFMKSDNIDNIIKNLKKELQNLLSLKENIIKKQLNN---------------IEQDI 719

Query: 1555 QSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQ 1614
             ++  Q   + N++ + I                     + K +    +   +  K+ + 
Sbjct: 720  SNSLNQYTIKYNDLKSSI--------------------EEYKEE----EEKLEVYKHQII 755

Query: 1615 KIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQE 1674
               +         D           D+   +    +     D    K N+ +  + +L+E
Sbjct: 756  NRKNEFILHLYEND----------KDLPDGKNTYEEFLQYKDTILNKENKISNDINILKE 805

Query: 1675 RLKSIQAGFLQNGRSAMDVENE-EKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSG 1733
              K+ Q            +E   EKN E+   L QK  +       E +N  L + E   
Sbjct: 806  NKKNNQDLLNSYNILIQKLEAHTEKNDEELKQLLQKFPT-------EDENLNLKELEKEF 858

Query: 1734 LKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCIN 1793
             +  +    + K  +        +K LN      NS +  +  L  N  +L+ +++  + 
Sbjct: 859  NENNQIVDNIIKDIENMNKNINIIKTLNIAINRSNSNKQLVEHLLNNKIDLKNKLEQHMK 918

Query: 1794 FI 1795
             I
Sbjct: 919  II 920


>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein.  This
            model represents a group of paralogous families in
            plasmodium species alternately annotated as reticulocyte
            binding protein, 235-kDa family protein and rhoptry
            protein. Rhoptry protein is localized on the cell surface
            and is extremely large (although apparently lacking in
            repeat structure) and is important for the process of
            invasion of the RBCs by the parasite. These proteins are
            found in P. falciparum, P. vivax and P. yoelii.
          Length = 2757

 Score = 44.7 bits (105), Expect = 6e-04
 Identities = 114/638 (17%), Positives = 248/638 (38%), Gaps = 125/638 (19%)

Query: 1202 NIKQTGASGAYTKEFELLEKKIE-DVKALVENTTRSSHDLTVMLTTIDDMKK---QIMNN 1257
            NIK          + + L++KI+  +KAL E   +S + +  +   I+D++    + ++N
Sbjct: 1106 NIKYADEINKIKDDIKNLDQKIDHHIKALEEIKKKSENYIDEIKAQINDLEDVADKAISN 1165

Query: 1258 EDPNGVGKTLGNTTQRINLARLALNDLTEEIKKL-NKTGEMLKENATLLQENNVEGALEL 1316
            +DP  + K + N   +I+  +    ++ +EIKKL N+  E+ K+  +L +   +      
Sbjct: 1166 DDPEEIEKKIENIVTKIDKKK----NIYDEIKKLLNEIAEIEKDKTSLEEVKGIN----- 1216

Query: 1317 TRQAHDTSL-KLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNE 1375
               ++  +L KL  +  + EK         K++E++                        
Sbjct: 1217 --LSYGKNLGKLFLEKIDEEK---------KKSEHM------------------------ 1241

Query: 1376 KITSLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSA-KDYAETS 1434
             I ++E  + DL+E+   K   P      G        M     T + S+   KD+   S
Sbjct: 1242 -IKAMEAYIEDLDEI---KEKSPEIENEMGIEMDIKAEM----ETFNISHDDDKDHHIIS 1293

Query: 1435 KKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESH 1494
            KK     +    ++R+  + + E  S +      KK + +    A++    I     E  
Sbjct: 1294 KK----HDENISDIREKSLKIIEDFSEESDINDIKKELQKNLLDAQKHNSDINLYLNE-- 1347

Query: 1495 AANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKN- 1553
             ANI      +    + ++   Y        K + + L+     +  I++   L   K+ 
Sbjct: 1348 IANIYNILKLNKIKKIIDEVKEYTKEIEENNKNIKDELDKSEKLIKKIKDDINLEECKSK 1407

Query: 1554 IQSNPEQIRERANEINNVIKSLTDIDT-ILTETAG--------------------DLAKA 1592
            I+S  +       +I+  IK + ++   IL+E +                     ++  A
Sbjct: 1408 IESTLD-----DKDIDECIKKIKELKNHILSEESNIDTYFKNADENNENVLLLFKNIEMA 1462

Query: 1593 ND-----LKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKD 1647
            ++     LK K +      D   N +++ +D     +   D    AI+  K+     +KD
Sbjct: 1463 DNKSQHILKIKKDNATNDHDFNINELKEHIDKSKGCKDEADKNAKAIEKNKELFEQYKKD 1522

Query: 1648 LSQITNDL------DDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAM---DVENEEK 1698
            ++++ N        +   +   ++ + +K +++  K      L+  +S     +++ E+ 
Sbjct: 1523 VTELLNKYSALAIKNKFAKTKKDSEIIIKEIKDAHKKF---ILEAEKSEQKIKEIKKEKF 1579

Query: 1699 NLEKEVA---LAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTA 1755
             +E + A    + K A  ++   +  +NK L  ++           +  K +     T +
Sbjct: 1580 RIEDDAAKNDKSNKAAIDIQLSLENFENKFLKISD-----------IKKKINDCLKETES 1628

Query: 1756 KLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCIN 1793
              K+++      +SQ++EL E   N+  LQ+ ++S  +
Sbjct: 1629 IEKKISSFS--IDSQDTELKENGDNLNSLQEFLESLKD 1664



 Score = 42.7 bits (100), Expect = 0.002
 Identities = 101/495 (20%), Positives = 192/495 (38%), Gaps = 89/495 (17%)

Query: 1220 EKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQRINLARL 1279
            EKK  D K+L+    +S          I++  + I   +  +   K   NT + I     
Sbjct: 889  EKKFNDSKSLINEINKS----------IEEEYQNINTLKKVDEYIKICENTKESIEKFHN 938

Query: 1280 ALNDLTEEIKKLNKTGEMLKENATLLQ--ENNVEGAL-----ELTRQAHDTSLKLKE-QS 1331
              N L E    LNK  + +KE+  + +  ++  +  L     EL +   D SL   E ++
Sbjct: 939  KQNILKE---ILNKNIDTIKESNLIEKSYKDKFDNTLIDKINELDKAFKDASLNDYEAKN 995

Query: 1332 TETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELI 1391
             E  K  ND +       NL            +E EKA   + +KI    KN+P++   I
Sbjct: 996  NELIKYFNDLK------ANLGKNKENMLYHQFDEKEKATNDIEQKIEDANKNIPNIEIAI 1049

Query: 1392 CD---KRGDPCDNLCG-GAGCGKCGGMWCSNGTLSESNSAKD-----------------Y 1430
                    D  +   G          +  +   ++  N  K+                 Y
Sbjct: 1050 HTSIYNIIDEIEKEIGKNIELLNKEILEEAEINITNFNEIKEKLKHYNFDDFGKEENIKY 1109

Query: 1431 AETSKKLINEKESKAEELRKHMIILSE-SNSAKDYAETSKKLINEKEAKAEEVFRGITTA 1489
            A+   K+ ++ ++  +++  H+  L E    +++Y +  K  IN+ E  A++        
Sbjct: 1110 ADEINKIKDDIKNLDQKIDHHIKALEEIKKKSENYIDEIKAQINDLEDVADKAISNDDPE 1169

Query: 1490 KQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEF--LNAPGATLADIRNISG 1547
            + E    NI+ K        +  K   Y      I K L+E   +     +L +++ I+ 
Sbjct: 1170 EIEKKIENIVTK--------IDKKKNIY----DEIKKLLNEIAEIEKDKTSLEEVKGIN- 1216

Query: 1548 LAVNKNIQS-NPEQIRERANEINNVIKSL----TDIDTILT---ETAGDLAKANDLKRKA 1599
            L+  KN+     E+I E   +  ++IK++     D+D I     E   ++    D+K + 
Sbjct: 1217 LSYGKNLGKLFLEKIDEEKKKSEHMIKAMEAYIEDLDEIKEKSPEIENEMGIEMDIKAEM 1276

Query: 1600 NLTKAGAD-------STKNTVQKIVDVLTEA-RTAQDMAEVAIQTAKDDISAARKDLSQI 1651
                   D        +K   + I D+  ++ +  +D +E      + DI+  +K+L + 
Sbjct: 1277 ETFNISHDDDKDHHIISKKHDENISDIREKSLKIIEDFSE------ESDINDIKKELQK- 1329

Query: 1652 TNDLDDAQQKSNETN 1666
              +L DAQ+ +++ N
Sbjct: 1330 --NLLDAQKHNSDIN 1342



 Score = 41.2 bits (96), Expect = 0.007
 Identities = 97/472 (20%), Positives = 182/472 (38%), Gaps = 97/472 (20%)

Query: 1219 LEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNN-EDPNGVGKTLGNTTQRINLA 1277
            +E+KIED    + N   + H  T +   ID+++K+I  N E  N          +   + 
Sbjct: 1031 IEQKIEDANKNIPNIEIAIH--TSIYNIIDEIEKEIGKNIELLNK------EILEEAEIN 1082

Query: 1278 RLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQ 1337
                N++ E++K  N             +E N++ A E+ +   D    +K    + +  
Sbjct: 1083 ITNFNEIKEKLKHYNFD--------DFGKEENIKYADEINKIKDD----IKNLDQKIDHH 1130

Query: 1338 INDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDKRGD 1397
            I   E   K++EN + +   Q    ++  +KA+   N+    +EK + ++   I DK+ +
Sbjct: 1131 IKALEEIKKKSENYIDEIKAQINDLEDVADKAIS--NDDPEEIEKKIENIVTKI-DKKKN 1187

Query: 1398 PCDNLCGGAGCGKCGGMWCSNGTLSE-SNSAKD-------------YAETSKKL----IN 1439
              D +                  L+E +   KD             Y +   KL    I+
Sbjct: 1188 IYDEI---------------KKLLNEIAEIEKDKTSLEEVKGINLSYGKNLGKLFLEKID 1232

Query: 1440 EKESKAEELRKHM--IILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAAN 1497
            E++ K+E + K M   I       +   E   ++  E + KAE     I+    + H  +
Sbjct: 1233 EEKKKSEHMIKAMEAYIEDLDEIKEKSPEIENEMGIEMDIKAEMETFNISHDDDKDH--H 1290

Query: 1498 ILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSN 1557
            I++K+  ++ + +R K+   +   S                 +DI +I      K +Q N
Sbjct: 1291 IISKKHDENISDIREKSLKIIEDFSE---------------ESDINDI-----KKELQKN 1330

Query: 1558 PEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIV 1617
                ++  ++IN  +  + +I  IL        K N +K+  +  K      +   + I 
Sbjct: 1331 LLDAQKHNSDINLYLNEIANIYNIL--------KLNKIKKIIDEVKEYTKEIEENNKNIK 1382

Query: 1618 DVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKV 1669
            D L       D +E  I+  KDDI+   +  S+I + LDD         +K 
Sbjct: 1383 DEL-------DKSEKLIKKIKDDIN-LEECKSKIESTLDDKDIDECIKKIKE 1426


>gnl|CDD|202204 pfam02321, OEP, Outer membrane efflux protein.  The OEP family (Outer
            membrane efflux protein) form trimeric channels that
            allow export of a variety of substrates in Gram negative
            bacteria. Each member of this family is composed of two
            repeats. The trimeric channel is composed of a 12
            stranded all beta sheet barrel that spans the outer
            membrane, and a long all helical barrel that spans the
            periplasm.
          Length = 186

 Score = 42.5 bits (100), Expect = 6e-04
 Identities = 20/112 (17%), Positives = 47/112 (41%), Gaps = 7/112 (6%)

Query: 1615 KIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQE 1674
            K    +  A+   + AE  ++ A+  +   R +++Q   +L  A+++       ++L +E
Sbjct: 78   KRRARVKAAKAQLEAAEAELEQARRQL---RLEVAQAYFNLLAAKEQLELAKQALELAEE 134

Query: 1675 RLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLL 1726
             L+  +A +     S +DV   E  L +    A+ +     +  +    +L 
Sbjct: 135  ALELAEARYEAGLISLLDVLQAEVELLE----ARLELLNAEADLELARAQLE 182


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
            and conversion].
          Length = 161

 Score = 41.1 bits (97), Expect = 0.001
 Identities = 22/137 (16%), Positives = 57/137 (41%), Gaps = 16/137 (11%)

Query: 1614 QKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQ 1673
             KI D L EA   ++ A+  +   + ++  AR+  S+I   ++ A++++       ++ +
Sbjct: 40   AKIADDLAEAERLKEEAQALLAEYEQELEEAREQASEI---IEQAKKEAE------QIAE 90

Query: 1674 ERLKSIQAGFLQN-GRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESS 1732
            E     +    +    +  ++E E++   +E+   + + + L     E   KLL K    
Sbjct: 91   EIKAEAEEELERIKEAAEAEIEAEKERALEEL---RAEVAELAVAIAE---KLLGKKVDE 144

Query: 1733 GLKRARGQMLLNKASQL 1749
              ++      + +  + 
Sbjct: 145  AAQKDLIDAFIAELGEN 161



 Score = 37.7 bits (88), Expect = 0.018
 Identities = 23/114 (20%), Positives = 46/114 (40%), Gaps = 11/114 (9%)

Query: 1670 KLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKA 1729
            K L ER   I     +  R   + +      E+E+  A++QAS +  + ++   ++  + 
Sbjct: 33   KALDERQAKIADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEI 92

Query: 1730 ESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAE 1783
            +      A  ++   K +       A+ +   E E       +E+ EL+  IAE
Sbjct: 93   K----AEAEEELERIKEA-------AEAEIEAEKERALEELRAEVAELAVAIAE 135


>gnl|CDD|218661 pfam05622, HOOK, HOOK protein.  This family consists of several
            HOOK1, 2 and 3 proteins from different eukaryotic
            organisms. The different members of the human gene family
            are HOOK1, HOOK2 and HOOK3. Different domains have been
            identified in the three human HOOK proteins, and it was
            demonstrated that the highly conserved NH2-domain
            mediates attachment to microtubules, whereas the central
            coiled-coil motif mediates homodimerisation and the more
            divergent C-terminal domains are involved in binding to
            specific organelles (organelle-binding domains). It has
            been demonstrated that endogenous HOOK3 binds to Golgi
            membranes, whereas both HOOK1 and HOOK2 are localised to
            discrete but unidentified cellular structures. In mice
            the Hook1 gene is predominantly expressed in the testis.
            Hook1 function is necessary for the correct positioning
            of microtubular structures within the haploid germ cell.
            Disruption of Hook1 function in mice causes abnormal
            sperm head shape and fragile attachment of the flagellum
            to the sperm head.
          Length = 713

 Score = 43.6 bits (103), Expect = 0.001
 Identities = 56/239 (23%), Positives = 94/239 (39%), Gaps = 57/239 (23%)

Query: 1281 LNDLTEEIKKLNKTGEMLKENATLLQE-----NNVEGALEL-TRQAHDTSLKLKEQSTET 1334
            LNDL  ++K L +   M  +N   L+E     N   G LE   RQ  +   KL E+S + 
Sbjct: 327  LNDLRRQVKLLEERNAMYMQNTVQLEEELKKANAARGQLETYKRQVQELHAKLSEESKKA 386

Query: 1335 EKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLN-ELICD 1393
            +K     E + KR               +E+ E ALQ+  E++ +   +L + N EL C 
Sbjct: 387  DK----LEFEYKRL--------------EEKLE-ALQKEKERLLAERDSLRETNEELRC- 426

Query: 1394 KRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDY--AETSKKLIN-EKESKA----- 1445
                       G           +  + S  N A +   +E  +KLI  + E+K      
Sbjct: 427  -----------GQAQQDQLTQADAGTSPSGDNLAAELLPSEYREKLIRLQHENKMLRLGQ 475

Query: 1446 --------EELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAA 1496
                     EL++    L ++N   +  ET  +L N++  + ++    +  A QE  + 
Sbjct: 476  EGSENERITELQQL---LEDANRRNNELETQLRLANQRILELQQQVEDLQKALQEQGSK 531



 Score = 36.3 bits (84), Expect = 0.17
 Identities = 33/143 (23%), Positives = 62/143 (43%), Gaps = 41/143 (28%)

Query: 1647 DLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVAL 1706
            +L Q+   L+DA +++NE   +++L  +R+  +Q            VE+ +K L      
Sbjct: 485  ELQQL---LEDANRRNNELETQLRLANQRILELQQ----------QVEDLQKAL------ 525

Query: 1707 AQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAM 1766
             Q+Q                SK+E S L +++ +  L +  + +     K +Q+ E E  
Sbjct: 526  -QEQG---------------SKSEDSSLLKSKLEEHLEQLHEANEELQKKREQIEELEPD 569

Query: 1767 FNSQESELTELSKNIAELQKRIQ 1789
             +        LS+ IAEL+  +Q
Sbjct: 570  QDQN------LSRKIAELEAALQ 586


>gnl|CDD|214369 CHL00118, atpG, ATP synthase CF0 B' subunit; Validated.
          Length = 156

 Score = 41.1 bits (97), Expect = 0.001
 Identities = 23/102 (22%), Positives = 45/102 (44%), Gaps = 17/102 (16%)

Query: 1281 LNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELT--RQAHDTSLKLKEQSTETEKQI 1338
              +LT+  + L K  E+ K+    L +   E  LE+T  ++     +         E ++
Sbjct: 59   RKNLTKASEILAKANELTKQYEQELSKARKEAQLEITQSQKEAKEIV---------ENEL 109

Query: 1339 NDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSL 1380
              A+   K  ++L+ +   Q      + EKAL+ L E++ +L
Sbjct: 110  KQAQ---KYIDSLLNEATKQLEA---QKEKALKSLEEQVDTL 145


>gnl|CDD|113902 pfam05149, Flagellar_rod, Paraflagellar rod protein.  This family
            consists of several eukaryotic paraflagellar rod
            component proteins. The eukaryotic flagellum represents
            one of the most complex macromolecular structures found
            in any organism and contains more than 250 proteins. In
            addition to its locomotive role, the flagellum is
            probably involved in nutrient uptake since receptors for
            host low-density lipoproteins are localised on the
            flagellar membrane as well as on the flagellar pocket
            membrane.
          Length = 289

 Score = 42.3 bits (100), Expect = 0.002
 Identities = 46/229 (20%), Positives = 82/229 (35%), Gaps = 58/229 (25%)

Query: 1290 KLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTE 1349
            +  ++ + L++N    Q+       EL R+  D +    E+  E E++I   ER+ +R  
Sbjct: 46   QRRESEKFLQQN-VEQQQACWRAIQELERELRDLAE---ERREEVERRIEAVEREEQRRT 101

Query: 1350 NLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELI---CDKRGDPCDNLCGGA 1406
                  +  F     ++++ L+R  E   +  +    L E +   CD      D L    
Sbjct: 102  -----DYESFLNFAAQHKQRLRRTLENCDAALELARSLEEYVQEGCDHIPAKQDKL---- 152

Query: 1407 GCGKCGGMWCSNGTLSESNSA---------KDYAETSKKLINEKESKAEELRKHMI---- 1453
                          LSE   A         +    T  +L  +KE + EEL + +     
Sbjct: 153  -----------QQALSELLLAVRKERLEAFRALYLTLGELTYKKERRLEELDRQIRYYEH 201

Query: 1454 ---ILSES--NSAKDYAETSKKL-------------INEKEAKAEEVFR 1484
                  E+   +AK+Y+E  K+L             +  K+  A E F 
Sbjct: 202  QRESAMETLDPNAKEYSEAKKELLEQRAEVEGEINALRAKQDAALEDFE 250


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
            family is found in eukaryotes. It is a coiled-coil domain
            of unknwon function.
          Length = 126

 Score = 39.5 bits (93), Expect = 0.002
 Identities = 23/115 (20%), Positives = 50/115 (43%), Gaps = 21/115 (18%)

Query: 1696 EEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTA 1755
            EE   ++E  L +K+   L+    +  +K L + E+   KR R +    +  +L      
Sbjct: 27   EELLKQREEELEKKEEE-LQESLIKF-DKFLKENEA---KRRRAEKKAEEEKKLRKEKEE 81

Query: 1756 KLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCINF------IVDKSNNYKN 1804
            ++K+L            EL EL   I +L+++++    +      +V++S  ++ 
Sbjct: 82   EIKELKA----------ELEELKAEIEKLEEKLEEYQPYEEFLESVVERSPEFEE 126



 Score = 35.3 bits (82), Expect = 0.072
 Identities = 21/108 (19%), Positives = 50/108 (46%), Gaps = 7/108 (6%)

Query: 1620 LTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKV-KLLQE-RLK 1677
            L E R   +  ++A+   +++     + L Q   +L+  +++  E+ +K  K L+E   K
Sbjct: 2    LLEKRREMEEVQLALDAKREEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAK 61

Query: 1678 SIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKL 1725
              +A      + A + +   K  E+E+   + +   L++  ++ + KL
Sbjct: 62   RRRA-----EKKAEEEKKLRKEKEEEIKELKAELEELKAEIEKLEEKL 104


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 42.6 bits (101), Expect = 0.002
 Identities = 50/289 (17%), Positives = 92/289 (31%), Gaps = 72/289 (24%)

Query: 1506 HAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERA 1565
             +A  R  +   + + +++  QLD         L   + +   A +K +Q + EQ     
Sbjct: 23   SSAFARAASNGDLPTEADVQAQLD--------ALNKQKLLE--AEDKLVQQDLEQTLALL 72

Query: 1566 NEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEART 1625
            ++I+   +    +   L +    L +A        L     + T+ T+          R 
Sbjct: 73   DKIDRQKEETEQLKQQLAQAPAKLRQAQ--AELEALKDDNDEETRETLST-----LSLRQ 125

Query: 1626 AQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQ 1685
             +                    L+Q  + L +AQ    E N ++  LQ + +  QA    
Sbjct: 126  LESR------------------LAQTLDQLQNAQNDLAEYNSQLVSLQTQPERAQA---- 163

Query: 1686 NGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNK 1745
                               AL          R Q+  N L          R   ++LL  
Sbjct: 164  -------------------ALYANSQ-----RLQQIRNLLKGGKVGGKALRPSQRVLLQ- 198

Query: 1746 ASQLSVNTTAKLKQLNETEAMFNSQ-----ESELTELSKNIAELQKRIQ 1789
            A Q  +N  A+   L       N+Q     + +   L+  I  L+ ++Q
Sbjct: 199  AEQALLN--AQN-DLQRKSLEGNTQLQDLLQKQRDYLTARIQRLEHQLQ 244



 Score = 33.0 bits (76), Expect = 1.9
 Identities = 21/119 (17%), Positives = 48/119 (40%), Gaps = 15/119 (12%)

Query: 1282 NDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQINDA 1341
             D+  ++  LNK   +  E+  + Q  ++E  L L  +        KE++ + ++Q+  A
Sbjct: 39   ADVQAQLDALNKQKLLEAEDKLVQQ--DLEQTLALLDKIDRQ----KEETEQLKQQLAQA 92

Query: 1342 ERQCKR-TENLVTKTFPQFTQGQEENEKA----LQ-RLNEKITSLE---KNLPDLNELI 1391
              + ++    L         + +E         L+ RL + +  L+    +L + N  +
Sbjct: 93   PAKLRQAQAELEALKDDNDEETRETLSTLSLRQLESRLAQTLDQLQNAQNDLAEYNSQL 151


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
            cell division proteins which are required for
            kinetochore-spindle association.
          Length = 321

 Score = 41.6 bits (98), Expect = 0.003
 Identities = 27/173 (15%), Positives = 63/173 (36%), Gaps = 20/173 (11%)

Query: 1572 IKSLTDIDTILTETAGDLAKAND-LKRKANLTKAGADSTKNTVQKIVDVL-TEARTAQDM 1629
            ++ L  +   L E    + +  + L +  NL     +S K  ++K +  L  E  + + +
Sbjct: 142  MQLLEGLKEGLEENLEGMKRDEELLNKDLNL----INSIKPKLRKKLQALKEEIASLRQL 197

Query: 1630 AEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRS 1689
            A+        ++  AR++L  ++  + + +++  E       LQ+ L+ +          
Sbjct: 198  ADELNLCDPLELEKARQELRSLSVKISEKRKQLEE-------LQQELQELTIAIEALTNK 250

Query: 1690 AMDVENEEKNLEKEVALAQ----KQASGLRSRYQETDNKL---LSKAESSGLK 1735
              ++  E    EK     +    K+ S L+++                 S L 
Sbjct: 251  KSELLEEIAEAEKIREECRGWSAKEISKLKAKVSLLQKLTGWKFLSLSGSTLS 303



 Score = 31.5 bits (72), Expect = 3.9
 Identities = 35/180 (19%), Positives = 62/180 (34%), Gaps = 48/180 (26%)

Query: 1213 TKEFELLEK---KIEDVK-ALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLG 1268
             ++ ELL K    I  +K  L +       ++  +    D++     N  DP        
Sbjct: 160  KRDEELLNKDLNLINSIKPKLRKKLQALKEEIASLRQLADEL-----NLCDPL------- 207

Query: 1269 NTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSL-KL 1327
                 +  AR  L  L+ +I +  K    L+E   L QE           Q    ++  L
Sbjct: 208  ----ELEKARQELRSLSVKISEKRK---QLEE---LQQE----------LQELTIAIEAL 247

Query: 1328 KEQSTETEKQINDAER---QCKRTENLVTKTFPQFTQGQEENEKALQRL-NEKITSLEKN 1383
              + +E  ++I +AE+   +C+                 +     LQ+L   K  SL  +
Sbjct: 248  TNKKSELLEEIAEAEKIREECRGWSAKEISKL-------KAKVSLLQKLTGWKFLSLSGS 300


>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein.  This model
            represents a relatively rare but broadly distributed
            uncharacterized protein family, distributed in 1-2
            percent of bacterial genomes, all of which have outer
            membranes. In many of these genomes, it is part of a
            two-gene pair.
          Length = 555

 Score = 41.6 bits (98), Expect = 0.003
 Identities = 55/278 (19%), Positives = 101/278 (36%), Gaps = 53/278 (19%)

Query: 1519 ASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDI 1578
            ++ S+I +Q    L    + L D R +      +++++      E A EI    KSL  +
Sbjct: 128  SAPSSIKEQKSSELKKVDSQLPDPRAL---LKGEDLKT-----VETAEEI---EKSLKAM 176

Query: 1579 DTILTETAGDLAKANDL----KRKANLTKAGADSTKN---TVQKIVDVLTEARTAQDMAE 1631
                 +   DL    DL    KR   + K    +        ++   +  E +  +    
Sbjct: 177  QQKWKKRKKDLPNKQDLEEYKKRLEAIKKKDIKNPLELQKIKEEFDKLKKEGKADKQK-- 234

Query: 1632 VAIQTAKDDISAARKDL--------SQITNDLDDAQQKS--NETNVK---VKLLQERLKS 1678
              I++AK+D+   +K L            NDL   + K      ++K   V L    ++ 
Sbjct: 235  --IKSAKNDLQNDKKQLKADLAELKKAPQNDLKRLENKYAIKSGDLKNFAVDLFGPEIRK 292

Query: 1679 IQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNK---LLSKAESSGLK 1735
                FL+    A  + N+ K   KE A  +  A+G    +++ +     L+ KA+ SG  
Sbjct: 293  YLQKFLKYYDQAEPLLNKSKKEPKEEA-VETLANGEDYDFKDDNPYPDFLIKKADVSGKM 351

Query: 1736 RARG--------------QMLLNKASQLSVNTTAKLKQ 1759
                                LL K + ++++  A  K 
Sbjct: 352  PDGDVGEIKGEVSNITNDHDLLGKPTTINLSAEASGKG 389


>gnl|CDD|224455 COG1538, TolC, Outer membrane protein [Cell envelope biogenesis,
            outer membrane / Intracellular trafficking and
            secretion].
          Length = 457

 Score = 41.7 bits (98), Expect = 0.004
 Identities = 35/239 (14%), Positives = 69/239 (28%), Gaps = 42/239 (17%)

Query: 1580 TILTETAG---DLAKANDLKRKANLTKAGADSTKNTVQK--------IVDVLTEARTAQD 1628
            ++  E A    DL  A +    A  T A A+      +K         +DVL +A     
Sbjct: 151  SLAAEVATAYFDLLAAQEQLALAEETLAAAEEQLELAEKRYDAGLATRLDVL-QAEAQLA 209

Query: 1629 MAEVAIQTAKDDISAARKDLSQITN-DLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNG 1687
             A   +  A+  ++ AR  L+++   +  +         +                 +  
Sbjct: 210  SARAQLAAAQAQLAQARNALARLLGLEPGELLDAPEPEALPALPPVLPDGLPSEALARRP 269

Query: 1688 --RSAMDVENEEKNLEKEVALAQKQ------ASGLRSRYQETDNKLLSKAESSGL----- 1734
               +A   E +       +  A+         +    R     + L   +  S       
Sbjct: 270  DILAA---EAQLAAANANIGAARAAFLPTLSLTASYGRSSTNLSGLFGSSSRSWSVGPGL 326

Query: 1735 -------KRARGQMLLNKASQLSVNTTAKLKQLNETEA-MFNSQESELTELSKNIAELQ 1785
                    R R ++   + ++   +  A L Q  +T           L  L   + +LQ
Sbjct: 327  SLPIFDGGRLRARV---RQAEAQYD--AALAQYEQTVLTARQEVADALAALEAALEQLQ 380



 Score = 37.8 bits (88), Expect = 0.046
 Identities = 22/105 (20%), Positives = 42/105 (40%), Gaps = 12/105 (11%)

Query: 1620 LTEARTAQDMA----EVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQER 1675
            + +A    D A    E  + TA+ +++ A   L      L   +Q        V+  QE 
Sbjct: 340  VRQAEAQYDAALAQYEQTVLTARQEVADALAALEAALEQLQALRQ-------AVEAAQEA 392

Query: 1676 LKSIQAGFLQNGRSAMDVENEEKNLEK-EVALAQKQASGLRSRYQ 1719
            L+  +  +    R+ +DV + ++ L +   AL Q +   L +   
Sbjct: 393  LELARERYQAGVRTLLDVLDAQRTLLQARQALLQARYDYLVALVN 437


>gnl|CDD|222469 pfam13949, ALIX_LYPXL_bnd, ALIX V-shaped domain binding to HIV.  The
            binding of the LYPxL motif of late HIV p6Gag and EIAV
            p9Gag to this domain is necessary for viral budding.This
            domain is generally central between an N-terminal Bro1
            domain, pfam03097 and a C-terminal proline-rich domain.
            The retroviruses thus used this domain to hijack the
            ESCRT system of the cell.
          Length = 292

 Score = 41.1 bits (97), Expect = 0.004
 Identities = 45/281 (16%), Positives = 90/281 (32%), Gaps = 65/281 (23%)

Query: 1558 PEQIRERANEI------NNVIKSLTDIDTILTETAGDLAKANDL------KRKANLTKAG 1605
            P  + E+A EI        + +SL D+  +       L +  +L      + +    K G
Sbjct: 8    PPSLLEKAEEIRSEGGIERLERSLDDLPELSQRNEEILDEIKELLDEEESEDEQLRAKYG 67

Query: 1606 ADST-----------KNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITND 1654
               T           +  ++K    L +A  +       ++  ++++        ++   
Sbjct: 68   ERWTRPPSSELTAELREELRKYRGYLEQASESDAQLRSKLEEHEENLELLSGPEEELEAL 127

Query: 1655 LDDAQQ-KSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASG 1713
            L  +   K+ E + ++  L+E L  +             ++ E + L +E          
Sbjct: 128  LPSSSPSKTPEVSEQISRLRELLNKLNE-----------LKAEREKLLEE---------- 166

Query: 1714 LRSRYQETDN-KLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQL-NETEAMFNSQE 1771
            L+ + Q+ D  KLL        K            QL      K   L    E   + QE
Sbjct: 167  LKEKAQDDDISKLLIALN----KLGSSNE-----EQLFEEELKKFDPLQERVEQNLSKQE 217

Query: 1772 SELTEL---------SKNIAELQKRIQSCINFIVDKSNNYK 1803
              L E+          +     QK  +  +  + +  + YK
Sbjct: 218  ELLKEIQEANEEFLQERKSDSEQKEREKVLQKLENAYDKYK 258


>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC.  All proteins in this family for
            which functions are known are part of an exonuclease
            complex with sbcD homologs. This complex is involved in
            the initiation of recombination to regulate the levels of
            palindromic sequences in DNA. This family is based on the
            phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
            Stanford University) [DNA metabolism, DNA replication,
            recombination, and repair].
          Length = 1042

 Score = 41.5 bits (97), Expect = 0.005
 Identities = 82/560 (14%), Positives = 154/560 (27%), Gaps = 87/560 (15%)

Query: 1270 TTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKE 1329
              ++  L +  L  L E +++  ++   L +     +E      L+  +       +++E
Sbjct: 218  HERKQVLEKE-LKHLREALQQTQQSHAYLTQKREAQEE-----QLKKQQLLKQLRARIEE 271

Query: 1330 ---QSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPD 1386
               Q    E+      R  K    L        TQ +++ ++    L  K+ S  K L  
Sbjct: 272  LRAQEAVLEETQERINRARKA-APLAAHI-KAVTQIEQQAQRIHTELQSKMRSRAKLLMK 329

Query: 1387 LNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAE 1446
                +  +                           S+    +D  E     I E   +  
Sbjct: 330  RAAHVKQQSSIEEQR-------------RLLQTLHSQEIHIRDAHE-VATSIREISCQQH 375

Query: 1447 ELRKHMIILSESNSA---KDYAETSKKLINEKE--------------------AKAEEVF 1483
             L +H+  L +  +    K  +   +  I ++E                    AK ++  
Sbjct: 376  TLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDTRTSAFRDLQGQLAHAKKQQEL 435

Query: 1484 RGITTAKQESHAANIL--AKEAYDHAASVRNKTAA---YVASTSNITKQLDEFLNAPGAT 1538
            +        +         K    H              + +   I  Q         A 
Sbjct: 436  QQRYAELCAAAITCTAQCEKLEKIHLQESAQSLKEREQQLQTKEQIHLQETRKKAVVLAR 495

Query: 1539 LADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRK 1598
            L              +Q  P  +       N    +  DID     T             
Sbjct: 496  L------------LELQEEPCPLCGSCIHPN---PARQDIDNPGPLTRRMQRGEQ---TY 537

Query: 1599 ANLTKAGAD------STKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQIT 1652
            A L  +  D      S +     + + + E +  Q  + +      +       +L  IT
Sbjct: 538  AQLETSEEDVYHQLTSERKQRASLKEQMQEIQ--QSFSILTQC--DNRSKEDIPNLQNIT 593

Query: 1653 NDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQ--KQ 1710
              L D  +K +E    +   Q  L       LQ  +   DV    +   +E+AL      
Sbjct: 594  VRLQDLTEKLSEAEDMLACEQHALLRK----LQPEQDLQDVRLHLQQCSQELALKLTALH 649

Query: 1711 ASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQ 1770
            A  L    +      LS         A  Q+ L K        T   + L + + +    
Sbjct: 650  ALQLTLTQERVREHALSIRVLPKELLASRQLALQKMQSEKEQLTYWKEMLAQCQTLLREL 709

Query: 1771 ESELTELSKNIAELQKRIQS 1790
            E+ + E  +   E++    S
Sbjct: 710  ETHIEEYDREFNEIENASSS 729



 Score = 36.5 bits (84), Expect = 0.15
 Identities = 62/395 (15%), Positives = 118/395 (29%), Gaps = 66/395 (16%)

Query: 1218 LLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQRINLA 1277
             +++  +    L  +     H LT        +K+Q+   +       +    TQ  N +
Sbjct: 529  RMQRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQ------SFSILTQCDNRS 582

Query: 1278 RLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQ 1337
            +  + +L     +L    E L E   +L         +L  +     ++L  Q    E  
Sbjct: 583  KEDIPNLQNITVRLQDLTEKLSEAEDMLACEQHALLRKLQPEQDLQDVRLHLQQCSQELA 642

Query: 1338 INDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDKRGD 1397
            +            L      Q T  QE   +        I  L K L    +L   K   
Sbjct: 643  L-----------KLTALHALQLTLTQERVREHALS----IRVLPKELLASRQLALQKMQS 687

Query: 1398 PCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIILSE 1457
              + L                         K+     + L+ E E+  EE  +    +  
Sbjct: 688  EKEQLTY----------------------WKEMLAQCQTLLRELETHIEEYDREFNEIEN 725

Query: 1458 -SNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILA--------KEAYDHAA 1508
             S+S        +  +N+   +     R +  A+ E+H  N            E    AA
Sbjct: 726  ASSSLGSDLAAREDALNQSLKELMHQARTVLKARTEAHFNNNEEVTAALQTGAELSHLAA 785

Query: 1509 SVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEI 1568
             ++         T    K L+  +     +  DI N+    + +  +    ++ E++   
Sbjct: 786  EIQFFNRLREEDTHL-LKTLEAEIGQEIPSDEDILNLQCETLVQEEEQFLSRLEEKS--- 841

Query: 1569 NNVIKSLTDIDTILTETAGDLAKANDLKRKANLTK 1603
                 +L +I   L +           K+ A LT+
Sbjct: 842  ----ATLGEITHQLLKY------EECSKQLAQLTQ 866



 Score = 34.9 bits (80), Expect = 0.54
 Identities = 70/508 (13%), Positives = 145/508 (28%), Gaps = 101/508 (19%)

Query: 1286 EEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQINDAERQC 1345
             + K L+   E+L   + LL           T    DT  + K+   +  K + +A +Q 
Sbjct: 187  AKKKSLHGKAELLTLRSQLLTL--------CTPCMPDTYHERKQVLEKELKHLREALQQT 238

Query: 1346 KRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDKRGDPCDNLCGG 1405
            +++   +T    Q  + QEE  K  Q L +    +E+                       
Sbjct: 239  QQSHAYLT----QKREAQEEQLKKQQLLKQLRARIEELRAQEAV---------------- 278

Query: 1406 AGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYA 1465
                           L E+    + A  +  L    ++  +  ++   I +E        
Sbjct: 279  ---------------LEETQERINRARKAAPLAAHIKAVTQIEQQAQRIHTE-------- 315

Query: 1466 ETSKKLINEKEAKAEEVF-RGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNI 1524
                  +  K     ++  +     KQ+S            H+  +  + A  VA     
Sbjct: 316  ------LQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVA----- 364

Query: 1525 TKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTE 1584
            T   +             ++I  L           Q +    +    +    DI      
Sbjct: 365  TSIREISCQQ----HTLTQHIHTLQ----------QQKTTLTQKLQSLCKELDILQREQA 410

Query: 1585 TAG-DLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISA 1643
            T     +   DL+ +     A A   +   Q+  ++             AI         
Sbjct: 411  TIDTRTSAFRDLQGQ----LAHAKKQQELQQRYAELC----------AAAITCTAQCEKL 456

Query: 1644 ARKDLSQITNDLDDAQQKSNETNVKVKLLQE-RLKSIQAGFLQNGRSAMDVENEEKNLEK 1702
             +  L +    L + +Q+      +   LQE R K++    L               +  
Sbjct: 457  EKIHLQESAQSLKEREQQ--LQTKEQIHLQETRKKAVVLARLLE-LQEEPCPLCGSCIHP 513

Query: 1703 EVALAQKQASG-LRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLN 1761
              A       G L  R Q  +        S      +      + + L      +++++ 
Sbjct: 514  NPARQDIDNPGPLTRRMQRGEQTYAQLETSEEDVYHQLTSERKQRASLK----EQMQEIQ 569

Query: 1762 ETEAMFNSQESELTELSKNIAELQKRIQ 1789
            ++ ++    ++   E   N+  +  R+Q
Sbjct: 570  QSFSILTQCDNRSKEDIPNLQNITVRLQ 597



 Score = 31.9 bits (72), Expect = 4.0
 Identities = 27/143 (18%), Positives = 49/143 (34%), Gaps = 23/143 (16%)

Query: 1276 LARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALEL---TRQAHDTSLKLKEQST 1332
             AR  L   TE     N   E+     T  + +++   ++     R+     LK  E   
Sbjct: 751  QARTVLKARTEA--HFNNNEEVTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEI 808

Query: 1333 ETEK----QINDA--ERQCKRTENLVTK------TFPQFTQGQEENEKALQRLNE----- 1375
              E      I +   E   +  E  +++      T  + T    + E+  ++L +     
Sbjct: 809  GQEIPSDEDILNLQCETLVQEEEQFLSRLEEKSATLGEITHQLLKYEECSKQLAQLTQEQ 868

Query: 1376 -KITSLEKNLPDLNELICDKRGD 1397
             KI  L   L  +N++     GD
Sbjct: 869  AKIIQLSDKLNGINQIKIQFDGD 891


>gnl|CDD|218852 pfam06008, Laminin_I, Laminin Domain I.  coiled-coil structure. It
            has been suggested that the domains I and II from laminin
            A, B1 and B2 may come together to form a triple helical
            coiled-coil structure.
          Length = 263

 Score = 40.4 bits (95), Expect = 0.005
 Identities = 41/174 (23%), Positives = 67/174 (38%), Gaps = 27/174 (15%)

Query: 1219 LEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQRINLAR 1278
            L K  +D + L +            L     +   I          +TLG+        +
Sbjct: 53   LSKLAQDAENLQK-------KANQALAKAQKVNAAIE---------RTLGHAKTLAEAIK 96

Query: 1279 LALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKE-QSTETEKQ 1337
               +++ E ++KL      L ENA  L  + +   L    Q       L E +  + +KQ
Sbjct: 97   NLRDNIKEILEKL----AGLGENAFALPGSALSRMLAEA-QRM-----LGEIRERDFQKQ 146

Query: 1338 INDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELI 1391
            + +AE + K  E L+ +    F   QEENE   + L + +   E  L DL EL+
Sbjct: 147  LQNAEAELKAAEELLNRVQTWFQTPQEENEGLFEALRDDLNEYEAKLSDLRELL 200



 Score = 35.1 bits (81), Expect = 0.27
 Identities = 50/247 (20%), Positives = 98/247 (39%), Gaps = 34/247 (13%)

Query: 1523 NITKQLDEFLNAPGATLADI----RNISGLAVN-KNIQSNPEQIRERANEINNVIKSLTD 1577
            N+T+QL   L+   AT   +    + +S LA + +N+Q    Q   +A ++N  I+    
Sbjct: 27   NLTQQLSGRLSPISATKRQLEDLEKALSKLAQDAENLQKKANQALAKAQKVNAAIERTLG 86

Query: 1578 IDTILTETAGDL-AKANDLKRKA-----NLTKAGADSTKNTVQKIVDVLTEARTAQDMAE 1631
                L E   +L     ++  K      N       +    + +   +L E R      +
Sbjct: 87   HAKTLAEAIKNLRDNIKEILEKLAGLGENAFALPGSALSRMLAEAQRMLGEIRERDFQKQ 146

Query: 1632 VAIQTAKDDISAARKDLSQI----------TNDL-DDAQQKSNETNVKVKLLQERLKSIQ 1680
              +Q A+ ++ AA + L+++             L +  +   NE   K+  L+E L   Q
Sbjct: 147  --LQNAEAELKAAEELLNRVQTWFQTPQEENEGLFEALRDDLNEYEAKLSDLRELLDEAQ 204

Query: 1681 AGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQ 1740
            A      R A  +    +   +E    +++ S L+++ +E     L+    S L +A   
Sbjct: 205  A----KTRDAERLNLANQKRLREFQEKKQEVSELKNQAEEH----LATGRDS-LDQANLL 255

Query: 1741 -MLLNKA 1746
              L+++A
Sbjct: 256  LQLMDQA 262



 Score = 34.7 bits (80), Expect = 0.32
 Identities = 49/241 (20%), Positives = 81/241 (33%), Gaps = 50/241 (20%)

Query: 1454 ILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNK 1513
            IL    +           I+  + + E++ + ++   Q+   A  L K+A    A    K
Sbjct: 21   ILYNIENLTQQLSGRLSPISATKRQLEDLEKALSKLAQD---AENLQKKANQALAKA-QK 76

Query: 1514 TAAYVASTSNITKQLDEFLNAPGATLADIRN-ISGLAVN-------------KNIQSNPE 1559
              A +  T    K L E +      + +I   ++GL  N                Q    
Sbjct: 77   VNAAIERTLGHAKTLAEAIKNLRDNIKEILEKLAGLGENAFALPGSALSRMLAEAQRMLG 136

Query: 1560 QIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDV 1619
            +IRER            D    L     +L  A +L               N VQ     
Sbjct: 137  EIRER------------DFQKQLQNAEAELKAAEELL--------------NRVQTWF-- 168

Query: 1620 LTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSI 1679
                +T Q+  E   +  +DD++     LS +   LD+AQ K+ +        Q+RL+  
Sbjct: 169  ----QTPQEENEGLFEALRDDLNEYEAKLSDLRELLDEAQAKTRDAERLNLANQKRLREF 224

Query: 1680 Q 1680
            Q
Sbjct: 225  Q 225



 Score = 32.4 bits (74), Expect = 1.9
 Identities = 34/217 (15%), Positives = 67/217 (30%), Gaps = 41/217 (18%)

Query: 1189 LADQTTKVITDASNIKQTGASGAYTKEFELLEKKIEDVKALVENTTRSSH-----DLTVM 1243
            L  +  + +  A  +    A        + L + I++++  ++               + 
Sbjct: 63   LQKKANQALAKAQKVNA--AIERTLGHAKTLAEAIKNLRDNIKEILEKLAGLGENAFALP 120

Query: 1244 LTTIDDMKKQIMNNEDPNGVGKTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENAT 1303
             + +  M  +           + LG   +R    +L   +   E+K   +    ++    
Sbjct: 121  GSALSRMLAEAQ---------RMLGEIRERDFQKQLQNAE--AELKAAEELLNRVQTWFQ 169

Query: 1304 LLQENNVEGALE-LTRQAHDTSLKLK---EQSTETEKQINDAERQCKRTENLVTKTFPQF 1359
              QE N EG  E L    ++   KL    E   E + +  DAER                
Sbjct: 170  TPQEEN-EGLFEALRDDLNEYEAKLSDLRELLDEAQAKTRDAERL--------------- 213

Query: 1360 TQGQEENEKALQRLNEKITSLEKNLPDLNELICDKRG 1396
                  N+K L+   EK   + +      E +   R 
Sbjct: 214  ---NLANQKRLREFQEKKQEVSELKNQAEEHLATGRD 247


>gnl|CDD|184061 PRK13454, PRK13454, F0F1 ATP synthase subunit B'; Provisional.
          Length = 181

 Score = 39.4 bits (92), Expect = 0.006
 Identities = 29/80 (36%), Positives = 39/80 (48%), Gaps = 14/80 (17%)

Query: 1574 SLTDIDTILTETAG----DLAKANDLKRKANL-----TKAGADSTKNTVQKIVDVLTEAR 1624
            +L  I  +L E  G    DLA A +LK+KA        KA AD+ +   Q+IV    E R
Sbjct: 52   ALPRIGAVLAERQGTITNDLAAAEELKQKAVEAEKAYNKALADA-RAEAQRIV---AETR 107

Query: 1625 TA-QDMAEVAIQTAKDDISA 1643
               Q   +VAI  A  +I+A
Sbjct: 108  AEIQAELDVAIAKADAEIAA 127


>gnl|CDD|147988 pfam06120, Phage_HK97_TLTM, Tail length tape measure protein.  This
            family consists of the tail length tape measure protein
            from bacteriophage HK97 and related sequences from
            Escherichia coli O157:H7.
          Length = 311

 Score = 40.6 bits (94), Expect = 0.007
 Identities = 49/224 (21%), Positives = 83/224 (37%), Gaps = 29/224 (12%)

Query: 1460 SAKDYAETSKKLINE-KEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYV 1518
            SA DYA+T +++ ++ KE    E       A +   A N   ++     A +++  + Y 
Sbjct: 52   SAIDYADTIEEIRDKLKEMSLPEASANEGKANEALDAQNRAIEDQEQKIAEIKDAISGYN 111

Query: 1519 ASTSNITKQLDEFL-----NAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIK 1573
            A  +        +L     +   AT         LAV    +   EQ+  +  EI  +I+
Sbjct: 112  AKLAANGASEGGWLVNHLTSMDDATEEAADFADALAVE---EERLEQMLAKRKEIQQLIE 168

Query: 1574 SLTD--IDTI-------------LTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVD 1618
             + D  ID I             L E  G+ A+ + L    N   A      N   K   
Sbjct: 169  DIEDQAIDLIRNKAAEMAGAVQSLLEMNGEHAEFDRLLGLGNKLLAARQGLANIPMKYAG 228

Query: 1619 VLTEARTAQDMAEVAIQTAKDDISAAR-----KDLSQITNDLDD 1657
             +  A    +  + AI+ A+ D+  +      K ++Q   D DD
Sbjct: 229  PVIPAADLDNKQKDAIEKAQRDLELSGLKGEAKAIAQAEFDADD 272


>gnl|CDD|185272 PRK15374, PRK15374, pathogenicity island 1 effector protein SipB;
            Provisional.
          Length = 593

 Score = 40.7 bits (95), Expect = 0.007
 Identities = 43/196 (21%), Positives = 73/196 (37%), Gaps = 36/196 (18%)

Query: 1634 IQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSI----------QAGF 1683
             QTA  +   A           D A+   +    K+   Q +L+S+          +A  
Sbjct: 129  FQTALGEAQEATDLYEASIKKTDTAKSVYDAAEKKLTQAQNKLQSLDPADPGYAQAEAAV 188

Query: 1684 LQNGRSAMDVEN--EEKNLEKEVALAQK-----QASGLRSRYQETDNKLLSKAESSGLKR 1736
             Q G+ A + +   ++       A         +A  + +++Q T N       S G + 
Sbjct: 189  EQAGKEATEAKEALDKATDATVKAGTDAKAKAEKADNILTKFQGTANAASQNQVSQGEQD 248

Query: 1737 -----ARGQMLLNKASQL-SVNTTAKLKQLNETEAMFNS-QESELTELSKNIAELQK--- 1786
                 AR  ML+    ++   NT   L+  N+  A+FN+ QE    E+ K  AE Q+   
Sbjct: 249  NLSNVARLTMLMAMFIEIVGKNTEESLQ--NDL-ALFNALQEGRQAEMEKKSAEFQEETR 305

Query: 1787 ------RIQSCINFIV 1796
                  RI  CI  ++
Sbjct: 306  KAEETNRIMGCIGKVL 321



 Score = 39.2 bits (91), Expect = 0.023
 Identities = 48/283 (16%), Positives = 91/283 (32%), Gaps = 41/283 (14%)

Query: 1476 EAKAEEVFRGITTAKQESHAAN-ILAKEAYDHAASVRNKTAAYVASTSNITKQLD---EF 1531
            EA  E V +     K    A   ++A +A D  A  ++  +A   +T  +    D   E 
Sbjct: 21   EAAFEGVRKNTDFLKAADKAFKDVVATKAGDLKAGTKSGESAI--NTVGLKPPTDAAREK 78

Query: 1532 LNAPGATLADIRNISGLAVNKN--------------IQSNPEQIRERANEINNVIKSLTD 1577
            L++ G     +  +  L  + +              I+S  E   + + E    +    +
Sbjct: 79   LSSEGQLTLLLGKLMTLLGDVSLSQLESRLAVWQAMIESQKEMGIQVSKEFQTALGEAQE 138

Query: 1578 IDTILTETAGDL----AKANDLKRKANLTKA---GADSTKNTVQKIVDVLTEARTAQDMA 1630
               +   +        +  +  ++K    +      D       +    + +A      A
Sbjct: 139  ATDLYEASIKKTDTAKSVYDAAEKKLTQAQNKLQSLDPADPGYAQAEAAVEQAGKEATEA 198

Query: 1631 EVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRS- 1689
            + A+  A D    A  D        D+   K   T       Q ++   +   L N    
Sbjct: 199  KEALDKATDATVKAGTDAKAKAEKADNILTKFQGTANAAS--QNQVSQGEQDNLSNVARL 256

Query: 1690 ----AMDVE----NEEKNLEKEVALAQKQASGLRSRYQETDNK 1724
                AM +E    N E++L+ ++AL      G   R  E + K
Sbjct: 257  TMLMAMFIEIVGKNTEESLQNDLALFNALQEG---RQAEMEKK 296


>gnl|CDD|109486 pfam00430, ATP-synt_B, ATP synthase B/B' CF(0).  Part of the CF(0)
            (base unit) of the ATP synthase. The base unit is thought
            to translocate protons through membrane (inner membrane
            in mitochondria, thylakoid membrane in plants,
            cytoplasmic membrane in bacteria). The B subunits are
            thought to interact with the stalk of the CF(1) subunits.
            This domain should not be confused with the ab CF(1)
            proteins (in the head of the ATP synthase) which are
            found in pfam00006.
          Length = 132

 Score = 38.4 bits (90), Expect = 0.007
 Identities = 24/106 (22%), Positives = 45/106 (42%), Gaps = 13/106 (12%)

Query: 1614 QKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQ 1673
            +KI + + EA      A   +  A+  ++ AR + S+I N+     QK  E    +   Q
Sbjct: 33   EKIANNIKEAEERLKQAAALLAEAEQQLAQARAEASEIINNAKKEAQKLKEE--ILAEAQ 90

Query: 1674 ERLKSIQAGFLQNGRSAMDVENEE----KNLEKEV-ALAQKQASGL 1714
            +  + +         +  ++E E+      L ++V ALA + A  L
Sbjct: 91   KDAERLLE------SARAEIEQEKEQALAELRQQVAALAVQIAEKL 130



 Score = 36.1 bits (84), Expect = 0.038
 Identities = 21/114 (18%), Positives = 44/114 (38%), Gaps = 11/114 (9%)

Query: 1670 KLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKA 1729
            K+L ER + I     +               E+++A A+ +AS + +  ++   KL  + 
Sbjct: 26   KILDERKEKIANNIKEAEERLKQAAALLAEAEQQLAQARAEASEIINNAKKEAQKLKEEI 85

Query: 1730 ESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAE 1783
             +   K A  ++L +  +++            E E        ++  L+  IAE
Sbjct: 86   LAEAQKDAE-RLLESARAEIEQ----------EKEQALAELRQQVAALAVQIAE 128


>gnl|CDD|215974 pfam00529, HlyD, HlyD family secretion protein. 
          Length = 304

 Score = 40.2 bits (94), Expect = 0.007
 Identities = 29/193 (15%), Positives = 60/193 (31%), Gaps = 36/193 (18%)

Query: 1552 KNIQSNPEQ--IRERANEINNVIK---SLTDIDTILTETAGDLAKANDLKRKANLTKAGA 1606
              +Q       +     +  + +K    L  +D    + A D A+A     +A+     A
Sbjct: 3    AEVQP-QVSGIVIRILVKEGDRVKAGDVLVRLDPTPYQAALDRAEAELAAAQADAQARLA 61

Query: 1607 DSTKNTVQKIV------------DVLTEARTAQD----MAEVA-IQTAKDDISAARKDLS 1649
             + ++ +Q +V                  R A       A  A ++ A+D++ +A   L 
Sbjct: 62   RAERDRLQALVAQAAIQSEQEADAARAALREALAASRGAASKAQLEAARDNLRSALALLP 121

Query: 1650 QITNDLDDAQ-----QKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEV 1704
                 +  +Q       +N    +    + +L        Q          E   L +  
Sbjct: 122  ISLGRIGASQATLGALLNNLQA-QGLAARAQLDEAYLQLAQ-------TAAELLRLIRAP 173

Query: 1705 ALAQKQASGLRSR 1717
            A  ++  S  + R
Sbjct: 174  AQGERVQSEAQLR 186


>gnl|CDD|147523 pfam05384, DegS, Sensor protein DegS.  This is small family of
            Bacillus DegS proteins. The DegS-DegU two-component
            regulatory system of Bacillus subtilis controls various
            processes that characterize the transition from the
            exponential to the stationary growth phase, including the
            induction of extracellular degradative enzymes,
            expression of late competence genes and down-regulation
            of the sigma D regulon. The family also contains one
            sequence from Thermoanaerobacter tengcongensis which is
            described as a sensory transduction histidine kinase.
          Length = 159

 Score = 38.7 bits (91), Expect = 0.007
 Identities = 26/167 (15%), Positives = 70/167 (41%), Gaps = 33/167 (19%)

Query: 1633 AIQTAKDDI----SAARKDLSQITNDLDDAQQKSNETNVKVKLLQE-------RLKSIQA 1681
             ++ +K++I      AR++  ++  +L++ +++ +E   +V  L++       RL  +  
Sbjct: 10   TVEESKEEIFEIAEQARQEYERLEQELEEVKEEVSEVIKEVDKLEKQERRARKRLAEVSR 69

Query: 1682 GFLQNGRSAM-DVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQ 1740
             F +     + +   +  +L+ ++AL +++   LR R  E + +L        LK     
Sbjct: 70   NFDKYSEEDIREAYEKAHDLQVKLALLRQEEKQLRKRRDELERRLK------NLKE---- 119

Query: 1741 MLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKR 1787
              + +A  L       L  L            +L ++ + + + +++
Sbjct: 120  -TIERAENLVSQVGVVLNYLT----------GDLKQVGEQLEDAKQK 155


>gnl|CDD|226055 COG3524, KpsE, Capsule polysaccharide export protein [Cell envelope
            biogenesis, outer membrane].
          Length = 372

 Score = 40.2 bits (94), Expect = 0.009
 Identities = 28/143 (19%), Positives = 53/143 (37%), Gaps = 30/143 (20%)

Query: 1610 KNTVQKIVDVLTEARTA-------------QDMAEVAIQTAKDDISAARKDLSQITNDLD 1656
            +  VQK  + + +A                +  AEV +      +S    +L  I   LD
Sbjct: 185  EEEVQKAEERVKKASNDLTDYRIKNGVFDPKAQAEVQMSL----VSKLEDELIVIQAQLD 240

Query: 1657 DAQQKSNETNVKVKLLQERLKSIQAGFLQ---------NGRSAMDVENEEKNLEKEVALA 1707
              +   N  N ++  L+ R++S++   LQ         + +S  +   E + L  E   A
Sbjct: 241  TVKSVMNPENPQIPGLKARIESLRKQLLQEKQAISAGGSSQSLSNQAAEFQRLYLENTFA 300

Query: 1708 QKQASGLRSRYQ----ETDNKLL 1726
            +K  +   +  +    E D + L
Sbjct: 301  EKAYAAALTSLESARIEADRQQL 323


>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein.  Members of this
            protein family belong to a conserved gene four-gene
            neighborhood found sporadically in a phylogenetically
            broad range of bacteria: Nocardia farcinica,
            Symbiobacterium thermophilum, and Streptomyces
            avermitilis (Actinobacteria), Geobacillus kaustophilus
            (Firmicutes), Azoarcus sp. EbN1 and Ralstonia
            solanacearum (Betaproteobacteria). Proteins in this
            family average over 1400 amino acids in length
            [Hypothetical proteins, Conserved].
          Length = 1353

 Score = 40.6 bits (95), Expect = 0.010
 Identities = 43/230 (18%), Positives = 83/230 (36%), Gaps = 27/230 (11%)

Query: 1562 RERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLT 1621
            R R   I  +   L  +D  L E A +L      +R      AGA S ++          
Sbjct: 738  RARLRRIAELDARLAAVDDELAELARELRALGARQRALADELAGAPSDRSLRAAHRRA-A 796

Query: 1622 EARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQA 1681
            EA    + AE  +  A    +AA     Q   +L+     + + ++      + L+++  
Sbjct: 797  EAERQAESAERELARAARKAAAAAAAWKQARRELER---DAADLDLPTDP--DALEAVGL 851

Query: 1682 GFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQM 1741
               + G     +E       +E+  A  +A+  R+R    ++     AE +   RA    
Sbjct: 852  ALKRFGDHLHTLE----VAVRELRHAATRAAEQRARAARAESDAREAAEDAAEARAE--- 904

Query: 1742 LLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSC 1791
               +A       + +L+ L E+        + + E+   +AE +  + S 
Sbjct: 905  -AEEA-------SLRLRTLEESVG------AMVDEIRARLAETRAALASG 940


>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
          Length = 343

 Score = 39.2 bits (92), Expect = 0.015
 Identities = 26/133 (19%), Positives = 59/133 (44%), Gaps = 17/133 (12%)

Query: 1590 AKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLS 1649
            ++  +L    +L + G    +++V  +   L+ A   +   +  +       +AA     
Sbjct: 60   SQIAELADLLSLERQGNQDLQDSVANLRASLSAAEAERSRLQALLAELAGAGAAAEGRAG 119

Query: 1650 QITNDLDDAQQKSNETNVKVKLLQERLKSI--QAGFLQNGRSAMDVENEEKNLEKE---- 1703
            ++  +LD  +Q S     +V+LL +++ ++  Q   L+   +A+D   E+++ E +    
Sbjct: 120  ELAQELDSEKQVSARALAQVELLNQQIAALRRQLAALE---AALDAS-EKRDRESQAKIA 175

Query: 1704 -------VALAQK 1709
                   VALAQ+
Sbjct: 176  DLGRRLNVALAQR 188


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
            protein; Reviewed.
          Length = 782

 Score = 39.8 bits (94), Expect = 0.015
 Identities = 27/121 (22%), Positives = 54/121 (44%), Gaps = 13/121 (10%)

Query: 1559 EQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVD 1618
            + I E   ++N +I SL +++  L +      +A  L ++       A+  K  +++  +
Sbjct: 509  KLIGEDKEKLNELIASLEELERELEQ---KAEEAEALLKE-------AEKLKEELEEKKE 558

Query: 1619 VLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKS 1678
             L E    + + E A + A+  I  A+K+  +I  +L   Q+     +VK   L E  K 
Sbjct: 559  KLQEEE-DKLLEE-AEKEAQQAIKEAKKEADEIIKELRQLQKGGY-ASVKAHELIEARKR 615

Query: 1679 I 1679
            +
Sbjct: 616  L 616


>gnl|CDD|235136 PRK03598, PRK03598, putative efflux pump membrane fusion protein;
            Provisional.
          Length = 331

 Score = 39.2 bits (92), Expect = 0.016
 Identities = 35/160 (21%), Positives = 67/160 (41%), Gaps = 27/160 (16%)

Query: 1590 AKANDLKRKANLTK--AGADSTKNTVQKIVDV---LTEARTAQDMAEVAIQTAKDDISAA 1644
            AKAN    +A L    AG        ++I      + +A+ A D A+      +      
Sbjct: 86   AKANVSVAQAQLDLMLAGYRD-----EEIAQARAAVKQAQAAYDYAQNFYNRQQG--LWK 138

Query: 1645 RKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEV 1704
             + +S   NDL++A+   ++    +K  Q++L   + G              +   + + 
Sbjct: 139  SRTISA--NDLENARSSRDQAQATLKSAQDKLSQYREGN-----------RPQDIAQAKA 185

Query: 1705 ALAQKQASGLRSRYQETDNKLLSKAESSGLKRAR--GQML 1742
            +LAQ QA+  ++     D +L++ ++ + L RA   G ML
Sbjct: 186  SLAQAQAALAQAELNLQDTELIAPSDGTILTRAVEPGTML 225


>gnl|CDD|193488 cd04083, CBM6_discoidin-like, Carbohydrate Binding Module 6 (CBM6)
           domains; many are found in association with discoidin
           domain(s).  This family includes carbohydrate binding
           module 6 (CBM6) domains that are appended to several
           carbohydrate binding enzymes; many are found in
           association with carbohydrate-binding coagulation factor
           5/8 C-terminal (discoidin) domain(s), for example the
           CBM domain of Agl, an alpha-1,3-glucanase from Bacillus
           circulans KA-304. Some CBM6 domains belonging to this
           family are appended to glycoside hydrolase (GH) family
           domains, including glycoside hydrolase family 31 (GH31),
           for example the CBM domain of CtsY, a
           3-alpha-isomaltosyltranasferase from Arthrobacter
           globiformis A19. These CBM6s are non-catalytic
           carbohydrate binding domains that facilitate the strong
           binding of the GH catalytic modules with their
           dedicated, insoluble substrates. CBM6 is an unusual CBM
           as it represents a chimera of two distinct binding sites
           with different modes of binding: binding site I within
           the loop regions and binding site II on the concave face
           of the beta-sandwich fold. GH31 has a wide range of
           hydrolytic activities such as alpha-glucosidase,
           alpha-xylosidase, 6-alpha-glucosyltransferase, or
           alpha-1,4-glucan lyase, cleaving a terminal carbohydrate
           moiety from a substrate that may be a starch or a
           glycoprotein. Most characterized GH31 enzymes are
           alpha-glucosidases.
          Length = 124

 Score = 37.0 bits (86), Expect = 0.016
 Identities = 21/68 (30%), Positives = 28/68 (41%), Gaps = 19/68 (27%)

Query: 561 YEAENANCKTDKCVVEIRQPLGGG----NENTW-TGPGFMKGFENTG-LVFEIDNIPTPM 614
           YEAE+A              L GG     E T  +G G++ GF   G  V    N+P   
Sbjct: 3   YEAEDAT-------------LSGGATVATEATGYSGTGYVVGFTAVGASVTFTVNVPEAG 49

Query: 615 DYDIVVRY 622
            Y + +RY
Sbjct: 50  TYSLTLRY 57


>gnl|CDD|130214 TIGR01144, ATP_synt_b, ATP synthase, F0 subunit b.  This model
            describes the F1/F0 ATP synthase b subunit in bacteria
            only. Scoring just below the trusted cutoff are the
            N-terminal domains of Mycobacterial b/delta fusion
            proteins and a subunit from an archaeon, Methanosarcina
            barkeri, in which the ATP synthase homolog differs in
            architecture and is not experimentally confirmed. This
            model helps resolve b from the related b' subunit. Within
            the family is an example from a sodium-translocating
            rather than proton-translocating ATP synthase [Energy
            metabolism, ATP-proton motive force interconversion].
          Length = 147

 Score = 37.4 bits (87), Expect = 0.017
 Identities = 26/104 (25%), Positives = 49/104 (47%), Gaps = 17/104 (16%)

Query: 1615 KIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNE--TNVKVKLL 1672
            KI D L  A  A+  A +A + A+  +  A+ +  +I   +++A ++ +E     K +  
Sbjct: 30   KIADGLASAERAKKEAALAQKKAQVILKEAKDEAQEI---IENANKRGSEILEEAKAEAR 86

Query: 1673 QERLKSIQAGFLQNGRSAMDVENEE--KNLEKEVA-----LAQK 1709
            +ER K          R+ ++ E E+  + L K+VA      A+K
Sbjct: 87   EEREKIKAQ-----ARAEIEAEKEQAREELRKQVADLSVLGAEK 125



 Score = 31.6 bits (72), Expect = 1.5
 Identities = 26/110 (23%), Positives = 45/110 (40%), Gaps = 15/110 (13%)

Query: 1698 KNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKL 1757
            +  +KE ALAQK+A  +    ++   +++  A        RG  +L +A   +     K+
Sbjct: 39   ERAKKEAALAQKKAQVILKEAKDEAQEIIENANK------RGSEILEEAKAEAREEREKI 92

Query: 1758 KQLNETE--AMFNSQESEL----TELSKNIAE--LQKRIQSCIN-FIVDK 1798
            K     E  A       EL     +LS   AE  +++ I       ++DK
Sbjct: 93   KAQARAEIEAEKEQAREELRKQVADLSVLGAEKIIERNIDKQAQKDLIDK 142


>gnl|CDD|114524 pfam05802, EspB, Enterobacterial EspB protein.  EspB is a
            type-III-secreted pore-forming protein of
            enteropathogenic Escherichia coli (EPEC) which is
            essential for EPEC pathogenesis. EspB is also found in
            Citrobacter rodentium.
          Length = 317

 Score = 38.9 bits (90), Expect = 0.022
 Identities = 23/100 (23%), Positives = 44/100 (44%), Gaps = 3/100 (3%)

Query: 1568 INNVIKSLTDIDTILTETAGDLA-KANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTA 1626
            IN+  K  +DI    T  +      A+++  KA L KA  +S  +  ++    + +A   
Sbjct: 119  INSATKGASDIAQKATSASSKAVNAASEVATKA-LVKA-TESVADAAEEASSTMQQAMAT 176

Query: 1627 QDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETN 1666
               A        DD++ + +  SQ+  +  DA QK++  +
Sbjct: 177  ATKAASRTSGVADDVATSAQKASQVAEEAADAAQKASRLS 216


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
            [General function prediction only].
          Length = 239

 Score = 38.1 bits (89), Expect = 0.022
 Identities = 31/171 (18%), Positives = 69/171 (40%), Gaps = 23/171 (13%)

Query: 1217 ELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQRINL 1276
              +++  + +K         +  L  +   ++D++ Q+   E        +    +RI  
Sbjct: 24   PRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLE------SEIQEIRERIKR 77

Query: 1277 ARLALNDLT--EEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTET 1334
            A   L+ +    E++ LN   ++ KE     + N++E  L           +L E+  + 
Sbjct: 78   AEEKLSAVKDERELRALNIEIQIAKE-----RINSLEDELA----------ELMEEIEKL 122

Query: 1335 EKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLP 1385
            EK+I D + + +R E  + +   +  +   E  +  Q L+ K   L++ L 
Sbjct: 123  EKEIEDLKERLERLEKNLAEAEARLEEEVAEIREEGQELSSKREELKEKLD 173



 Score = 38.1 bits (89), Expect = 0.025
 Identities = 29/176 (16%), Positives = 72/176 (40%), Gaps = 18/176 (10%)

Query: 1559 EQIRERANEINNVIKS----LTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQ 1614
            +++  R  EI   +K     L  ++  L     +L    +   +               +
Sbjct: 20   DRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAE 79

Query: 1615 KIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQE 1674
            + +  + + R  + +  + IQ AK+ I++   +L+    +L +  +K  +   +++ L+E
Sbjct: 80   EKLSAVKDERELRAL-NIEIQIAKERINSLEDELA----ELMEEIEKLEK---EIEDLKE 131

Query: 1675 RLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAE 1730
            RL+ ++    +   +   +E E   + +E      +   L+    + D +LLS+ E
Sbjct: 132  RLERLEKNLAEA-EAR--LEEEVAEIREEGQELSSKREELKE---KLDPELLSEYE 181



 Score = 36.2 bits (84), Expect = 0.12
 Identities = 20/141 (14%), Positives = 56/141 (39%), Gaps = 4/141 (2%)

Query: 1639 DDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDV---EN 1695
              +  A+ +L  +   L+  + +  +   +V  L+  ++ I+   ++     +     E 
Sbjct: 31   KALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRER-IKRAEEKLSAVKDER 89

Query: 1696 EEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTA 1755
            E + L  E+ +A+++ + L     E   ++    +     + R + L    ++       
Sbjct: 90   ELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLEE 149

Query: 1756 KLKQLNETEAMFNSQESELTE 1776
            ++ ++ E     +S+  EL E
Sbjct: 150  EVAEIREEGQELSSKREELKE 170



 Score = 34.6 bits (80), Expect = 0.37
 Identities = 27/118 (22%), Positives = 54/118 (45%), Gaps = 12/118 (10%)

Query: 1673 QERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESS 1732
            ++ LK  +A      ++   +E E ++LE +V+  + +   +R R +  + KL      S
Sbjct: 30   RKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKL------S 83

Query: 1733 GLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQS 1790
             +K  R    LN   Q      AK +  +  + +    E E+ +L K I +L++R++ 
Sbjct: 84   AVKDERELRALNIEIQ-----IAKERINSLEDELAELME-EIEKLEKEIEDLKERLER 135


>gnl|CDD|147975 pfam06103, DUF948, Bacterial protein of unknown function (DUF948).
            This family consists of bacterial sequences several of
            which are thought to be general stress proteins.
          Length = 90

 Score = 35.6 bits (83), Expect = 0.028
 Identities = 14/37 (37%), Positives = 20/37 (54%), Gaps = 1/37 (2%)

Query: 1560 QIRERANEINNVIKSLT-DIDTILTETAGDLAKANDL 1595
             + +  +E+N  +  LT  +D I TET   LAK N L
Sbjct: 23   SLSKTLDEVNKTLAGLTKQVDGITTETEDLLAKTNVL 59


>gnl|CDD|213754 TIGR02971, heterocyst_DevB, ABC exporter membrane fusion protein,
            DevB family.  Members of this protein family are found
            mostly in the Cyanobacteria, but also in the
            Planctomycetes. DevB from Anabaena sp. strain PCC 7120 is
            partially characterized as a membrane fusion protein of
            the DevBCA ABC exporter, probably a glycolipid exporter,
            required for heterocyst formation. Most Cyanobacteria
            have one member only, but Nostoc sp. PCC 7120 has seven
            members.
          Length = 327

 Score = 38.3 bits (89), Expect = 0.028
 Identities = 27/141 (19%), Positives = 49/141 (34%), Gaps = 20/141 (14%)

Query: 1559 EQIRERANEINNVIKSLTDIDTILTETAGDLAKAN-DLKRKANLTKAGA------DSTKN 1611
               +  A     + K +      L     +L  A  ++ R  +L + GA      DS   
Sbjct: 83   IAAQRAARAAAKLFKDVAAQQATLNRLEAELETAQREVDRYRSLFRDGAVSASDLDSKAL 142

Query: 1612 TVQKIVDVLTEAR----TAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNV 1667
             ++   + L EA        D A  A+ +  +++     DL+Q       A+ KS     
Sbjct: 143  KLRTAEEELEEALASRSEQIDGARAALASLAEEVRETDVDLAQ-------AEVKSALEA- 194

Query: 1668 KVKLLQERLKSIQAGFLQNGR 1688
             V+  +  L+        +GR
Sbjct: 195  -VQQAEALLELTYVKAPIDGR 214


>gnl|CDD|218489 pfam05192, MutS_III, MutS domain III.  This domain is found in
            proteins of the MutS family (DNA mismatch repair
            proteins) and is found associated with pfam00488,
            pfam05188, pfam01624 and pfam05190. The MutS family of
            proteins is named after the Salmonella typhimurium MutS
            protein involved in mismatch repair; other members of the
            family included the eukaryotic MSH 1,2,3, 4,5 and 6
            proteins. These have various roles in DNA repair and
            recombination. Human MSH has been implicated in
            non-polyposis colorectal carcinoma (HNPCC) and is a
            mismatch binding protein. The aligned region corresponds
            with domain III, which is central to the structure of
            Thermus aquaticus MutS as characterized in.
          Length = 290

 Score = 38.2 bits (89), Expect = 0.031
 Identities = 47/260 (18%), Positives = 95/260 (36%), Gaps = 56/260 (21%)

Query: 1557 NPEQIRER-----------ANEINNVIKSLTDIDTILTETAGDLAKANDLKR-KANLTKA 1604
            + ++I  R             E+   +K + D++ +L+  A   A   DL   +++L   
Sbjct: 47   DLDEINARLDAVEELLEDLREELREALKGIPDLERLLSRLALGRASPRDLLALRSSLEAL 106

Query: 1605 ----GADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKD----LSQITNDLD 1656
                   ++   +  + D L       ++ E AI    +D   + +D          +LD
Sbjct: 107  PALRKLLASAPLLADLADQL-PLPELLELLERAID---EDPPLSLRDGGVIKDGYDPELD 162

Query: 1657 DAQQKSNETNVKVKLLQER------LKSIQAGF-LQNGRSAMDVENEEKNLEKEVALAQK 1709
            + +   +E   K+  L ER      +KS++ G+    G   ++V+  + +      + ++
Sbjct: 163  ELRALLDELREKLAELLERERERTGIKSLKVGYNRVFGYYVIEVKASKADKVPGDYI-RR 221

Query: 1710 QASGLRSRYQ-----ETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETE 1764
              +    R+      E + KLL                  +A  L       L++L   E
Sbjct: 222  STTKNAVRFTTPELKELERKLLE--------------AEERALALE---KEILEEL--LE 262

Query: 1765 AMFNSQESELTELSKNIAEL 1784
             + +     L EL+  IAEL
Sbjct: 263  RVLSEYAELLQELADAIAEL 282


>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552).  This
            presumed domain is functionally uncharacterized. This
            domain is found in bacteria, archaea and eukaryotes. This
            domain is about 200 amino acids in length. This domain is
            found associated with pfam00013, pfam01966. This domain
            has a single completely conserved residue A that may be
            functionally important.
          Length = 201

 Score = 37.5 bits (88), Expect = 0.032
 Identities = 37/147 (25%), Positives = 63/147 (42%), Gaps = 34/147 (23%)

Query: 1641 ISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNL 1700
            IS+A ++  QI   +++A+ K  E   K  LL+ + +  +       R+  + E E K  
Sbjct: 29   ISSAEEEAKQI---IEEAK-KEAEALKKEALLEAKEEIHKL------RA--EAERELKER 76

Query: 1701 EKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQL 1760
              E+   Q+Q           + +LL K E+  L R + + L  K   L      K K+L
Sbjct: 77   RNEL---QRQ-----------EKRLLQKEET--LDR-KDESLEKKEESLE----EKEKEL 115

Query: 1761 NETEAMFNSQESELTEL-SKNIAELQK 1786
               +     +E EL EL  +   EL++
Sbjct: 116  AARQQQLEEKEEELEELIEEQQQELER 142


>gnl|CDD|227109 COG4768, COG4768, Uncharacterized protein containing a divergent
            version of the methyl-accepting chemotaxis-like domain
            [General function prediction only].
          Length = 139

 Score = 36.3 bits (84), Expect = 0.035
 Identities = 22/83 (26%), Positives = 32/83 (38%), Gaps = 5/83 (6%)

Query: 1567 EINNVIKSLT-DIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTV----QKIVDVLT 1621
            E+   +K LT  +D I  ET   L K N L        A  D   + V    Q + D+  
Sbjct: 35   EVAKTLKGLTSQVDGITHETEELLHKTNTLAEDVQGKVATLDPVFDAVKDLGQSVSDLNQ 94

Query: 1622 EARTAQDMAEVAIQTAKDDISAA 1644
              R     A  A++  +  I+ A
Sbjct: 95   SVRHLATRATNAVEKNEKKIAQA 117


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
            protein family have a signal peptide, a strongly
            conserved SH3 domain, a variable region, and then a
            C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 37.3 bits (87), Expect = 0.039
 Identities = 24/115 (20%), Positives = 50/115 (43%), Gaps = 8/115 (6%)

Query: 1679 IQAGFLQNGRSAMD----VENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGL 1734
            + + +L +  SA +    ++ E   L++E+A  Q+Q + L+   QE   +L S  E+   
Sbjct: 53   VLSRYLSDTPSARERLPELQQELAELQEELAELQEQLAELQQENQELKQEL-STLEAELE 111

Query: 1735 KRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQ 1789
            +  +    + + S  ++    + ++L E  A     + E   L      LQ+  Q
Sbjct: 112  RLQKELARIKQLSANAIELDEENRELREELAE---LKQENEALEAENERLQENEQ 163


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
            consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 38.4 bits (89), Expect = 0.040
 Identities = 38/236 (16%), Positives = 87/236 (36%), Gaps = 35/236 (14%)

Query: 1541 DIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLA-----KANDL 1595
            D  +IS   V + ++ + E+      ++ ++ +  +  D    +   +        A+  
Sbjct: 173  DTDSISDKKVVEALREDNEKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKA 232

Query: 1596 KRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDL 1655
            ++KA+  +  AD  ++ V++      EA+     A+    + K+D   A     +I    
Sbjct: 233  QQKADFAQDNADKQRDEVRQ---KQQEAKNLPKPADT--SSPKEDKQVAENQKREIEKAQ 287

Query: 1656 DDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLR 1715
             + ++   E           LK              + +  EK  E +   AQK+   + 
Sbjct: 288  IEIKKNDEEALKAKDHKAFDLKQ-------------ESKASEKEAEDKELEAQKKREPVA 334

Query: 1716 SRYQETDNKLLSKA--------ESS----GLKRARGQMLLNKASQLSVNTTAKLKQ 1759
               Q+T  ++ ++         +SS    GLK       L++   + + T  +L++
Sbjct: 335  EDLQKTKPQVEAQPTSLNEDAIDSSNPVYGLKVVDPITNLSELVLIDLKTEVRLRE 390



 Score = 31.9 bits (72), Expect = 3.9
 Identities = 20/83 (24%), Positives = 34/83 (40%), Gaps = 4/83 (4%)

Query: 1427 AKDYAETSKKLINEKESKAEELRKHM-IILSESNSAKDYAETSKKLINEKEAKAEEVFRG 1485
              D  +  +K  +  +  A++ R  +     E+ +    A+TS     E +  AE   R 
Sbjct: 226  QIDADKAQQKA-DFAQDNADKQRDEVRQKQQEAKNLPKPADTSSP--KEDKQVAENQKRE 282

Query: 1486 ITTAKQESHAANILAKEAYDHAA 1508
            I  A+ E    +  A +A DH A
Sbjct: 283  IEKAQIEIKKNDEEALKAKDHKA 305


>gnl|CDD|200948 pfam00038, Filament, Intermediate filament protein. 
          Length = 312

 Score = 37.6 bits (88), Expect = 0.044
 Identities = 31/133 (23%), Positives = 64/133 (48%), Gaps = 23/133 (17%)

Query: 1670 KLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEV-ALAQKQA---SGLRSRYQETDNKL 1725
            + L +RL    A ++   R    +E + K LE ++  L QK++   S L S Y++   +L
Sbjct: 7    QNLNDRL----ASYIDKVRF---LEQQNKELEAKIEELRQKKSAEPSRLYSLYEQEIREL 59

Query: 1726 LSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETE-AMFNSQESELTELSKNI--- 1781
              + +    +RAR Q+ ++   +L+       ++  E E  +  S E+++  L K++   
Sbjct: 60   RKQLDELTNERARLQLEIDNL-RLAA---EDFREKYEDELNLRQSAENDIVGLRKDLDEA 115

Query: 1782 ----AELQKRIQS 1790
                 +L+ +I+S
Sbjct: 116  TLARVDLEMKIES 128


>gnl|CDD|216100 pfam00754, F5_F8_type_C, F5/8 type C domain.  This domain is also
           known as the discoidin (DS) domain family.
          Length = 128

 Score = 35.6 bits (82), Expect = 0.051
 Identities = 12/62 (19%), Positives = 23/62 (37%), Gaps = 5/62 (8%)

Query: 112 KTISWWQSENGVENVTIQLDMEAEFHFTHLIIYF----KTFRPAAMLIERSHDFGKTWQV 167
              + W S+ G     +Q+D+      T ++             +  IE S+D G+ W  
Sbjct: 23  NGSTAWSSKWGDAPQWLQVDLGKPKKITGVVTQGRQDGGNGYVKSYKIEYSND-GENWTE 81

Query: 168 YR 169
            +
Sbjct: 82  VK 83


>gnl|CDD|146106 pfam03302, VSP, Giardia variant-specific surface protein. 
          Length = 397

 Score = 37.6 bits (87), Expect = 0.056
 Identities = 76/360 (21%), Positives = 108/360 (30%), Gaps = 69/360 (19%)

Query: 851  CDQCSPGYWNYPTCQRCICNGHADLCDSKTGTCISCRNSTDGENCERCINGYYGDPRLGI 910
            CD+C PGY       +C  +        KT  C +C N    E CE C +  Y  P    
Sbjct: 1    CDECKPGYELSADKTKCTSSAPC-----KTENCKACSND-KREVCEECNSNNYLTPTSQC 54

Query: 911  DIPCRPCPCPGTVESGLSYADTCQLDPRS--QDVICECKEGYRGTRCDGCLDNYFGNPQE 968
               C           G  Y  T   + +   +  +  CK      +C  C D ++ +   
Sbjct: 55   IDDCAKI--------GNYYYTTNANNKKICKECTVANCKTCEDQGQCQACNDGFYKSGDA 106

Query: 969  RGGSCEPCDCSNNTDLALPGNCDTLTGKCLQCRYDTEGDHCQVCKAGFFGNALEQNCTEC 1028
                 E C   +    +    C  LTGK L  RY  +G     C  G         C  C
Sbjct: 107  CSPCHESCKTCSGGTASDCTEC--LTGKAL--RYGNDGTKGT-CGEGCTTGTGAGACKTC 161

Query: 1029 TCNILGIDHSK-----------GPCD----RTTGQCPCLPNVIGVSCDRCLKNHWKIASG 1073
               I G  +             G C     R T  C       G+ C  C   ++++  G
Sbjct: 162  GLTIDGTSYCSECATETEYPQNGVCTSTAARATATCKASSVANGM-CSSCANGYFRMNGG 220

Query: 1074 TGCEPCDCDPIGSVDVTECNEYTGQCQCKSGFGGRQCDQCQTNFWGNPN--VECFPCDCD 1131
                             E  ++ G+  C+    G  C +    +  N    V C P  C 
Sbjct: 221  ---------------CYETTKFPGKSVCEEANSGGTCQKEAPGYKLNNGDLVTCSP-GCK 264

Query: 1132 PSGSAT-------------SQCHQNNGTCVCHQGIGGVRCDTCARGYIGTAPDCSPCGEC 1178
               S T               C + + +C    G     C TCA GY  +   C  C   
Sbjct: 265  TCTSNTVCTTCMDGYVKTSDSCTKCDSSCETCTG-ATTTCKTCATGYYKSGTGCVSCTSS 323


>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
            [Function unknown].
          Length = 548

 Score = 37.9 bits (88), Expect = 0.060
 Identities = 47/271 (17%), Positives = 86/271 (31%), Gaps = 44/271 (16%)

Query: 1559 EQIRERANEINN-------VIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKN 1611
             Q+ + A    N       +  +  + D  L E   +   A     +    K     T+ 
Sbjct: 217  AQVLQDAEIAENEAEKETEIAIAEANRDAKLVELEVEQQPAGKTAEQTREVKIILAETEA 276

Query: 1612 TVQKIVDVL-TEARTAQDMAEVAIQTAK----------DDISAARKDLSQITNDLD---D 1657
             V         EA  A+ +AE AIQ  K            + A    +  I    +   +
Sbjct: 277  EVAAWKAETRREAEQAEILAEQAIQEEKAQAEQEVQHAKALEAREMRVGLIERQKETELE 336

Query: 1658 AQQKSNETNVKVKLLQERLKSI-----QAGFLQNGRSAMDVENEEKNLEKEVALAQKQAS 1712
             Q++S   N   +  QE  K+        G           E EE    ++ AL     +
Sbjct: 337  PQERSYFINAAQRQAQEEAKAAANIAEAIGAQAEAAVETARETEEAERAEQAALVAAAEA 396

Query: 1713 GLRSRYQETDNKLLSKAESSGLKRARG----QMLLNKASQLSVNTTAKLKQLNET-EAMF 1767
              + + +       +KAE+   + A      + +  K            + L E  + + 
Sbjct: 397  AEQEQVEIAVRAEAAKAEAE-AQAAEIKAEAEAIREKGKA----EAEAKRALAEAIQVLG 451

Query: 1768 NSQESE--------LTELSKNIAELQKRIQS 1790
            ++  +E        L E+++  A+  K I S
Sbjct: 452  DAAAAELFKALVQALPEVAEEAAQPMKNIDS 482


>gnl|CDD|131526 TIGR02473, flagell_FliJ, flagellar export protein FliJ.  Members of
            this family are the FliJ protein found, in nearly every
            case, in the midst of other flagellar biosynthesis genes
            in bacgterial genomes. Typically the fliJ gene is found
            adjacent to the gene for the flagellum-specific ATPase
            FliI. Sequence scoring in the gray zone between trusted
            and noise cutoffs include both probable FliJ proteins and
            components of bacterial type III secretion systems.
          Length = 141

 Score = 35.8 bits (83), Expect = 0.061
 Identities = 18/114 (15%), Positives = 44/114 (38%), Gaps = 17/114 (14%)

Query: 1610 KNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNET---N 1666
            +  +QK++D+       ++ A++ +  A+ +       L Q+    ++ +Q++ E     
Sbjct: 1    EFRLQKLLDL---REKEEEQAKLELAKAQAEFERLETQLQQLIKYREEYEQQALEKVGAG 57

Query: 1667 VKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQE 1720
                 L          F++     +         ++E+AL Q++    R R  E
Sbjct: 58   TSALELSN-----YQRFIRQLDQRIQ------QQQQELALLQQEVEAKRERLLE 100


>gnl|CDD|224176 COG1256, FlgK, Flagellar hook-associated protein [Cell motility and
            secretion].
          Length = 552

 Score = 37.8 bits (88), Expect = 0.062
 Identities = 43/189 (22%), Positives = 70/189 (37%), Gaps = 35/189 (18%)

Query: 1623 ARTAQDMAEVAIQTAKDDISAA------RKDLSQITNDLDDAQQKSNETNVKVK------ 1670
            A +  + A+ A+     +IS A      R+ + Q TN        +  T V V       
Sbjct: 10   ALSGLNAAQAALDVTGHNISNANTPGYSRQRVIQTTNIPYLGGGLNVGTGVNVVSIQRLR 69

Query: 1671 ----LLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNK-- 1724
                  Q R  + Q+ +L    S +         + E  L++   S L +   +  N   
Sbjct: 70   DEFLTNQYRNANSQSSYLDTRASQLS--------QLESLLSEPSESSLSTLLNDFFNSLQ 121

Query: 1725 -LLSKAESSGLKRARGQMLLNKASQLSV---NTTAKLKQL-NETEAMFNSQESELTELSK 1779
             L S    +  +    Q +L+KA  L     NT  +L  L  +  A   +   E+  L K
Sbjct: 122  ELASNPSDTAAR----QAVLSKAQTLVNQINNTYEQLTDLRKDINAEIAATVDEVNSLLK 177

Query: 1780 NIAELQKRI 1788
             IA+L K+I
Sbjct: 178  QIADLNKQI 186


>gnl|CDD|114219 pfam05483, SCP-1, Synaptonemal complex protein 1 (SCP-1).
            Synaptonemal complex protein 1 (SCP-1) is the major
            component of the transverse filaments of the synaptonemal
            complex. Synaptonemal complexes are structures that are
            formed between homologous chromosomes during meiotic
            prophase.
          Length = 787

 Score = 37.8 bits (87), Expect = 0.072
 Identities = 137/648 (21%), Positives = 255/648 (39%), Gaps = 104/648 (16%)

Query: 1188 GLADQTTKVITDASNIKQTGASGAYTKEFELLEKKIEDVKALVENTTRSSHDLTVMLTTI 1247
            GL+   +K+  +A  IK+   S     E +  E K+++ + ++E   ++  +L      +
Sbjct: 75   GLSRLYSKLYKEAEKIKKWKVS--IEAELKQKENKLQENRKIIEAQRKAIQELQFENEKV 132

Query: 1248 D-DMKKQIMNNEDPNGVGKTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQ 1306
               ++++I  N+D   + K    T    NL +       E+ KK     E  ++    L 
Sbjct: 133  SLKLEEEIQENKD---LIKENNATRHLCNLLKETCARSAEKTKKYEYEREETRQVYMDLN 189

Query: 1307 ENNVEGAL----ELTRQAHDTSL----KLKEQ-------STETEKQINDAERQCKRTENL 1351
             NN+E  +    EL  QA +  L    KLKE          E +K+IND E+Q       
Sbjct: 190  -NNIEKMILAFEELRVQAENARLEMHFKLKEDHEKIQHLEEEYKKEINDKEKQVSLLLIQ 248

Query: 1352 VTK---TFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDKRGDPCDNLCGGAGC 1408
            +T+        T   EE+     +L EK    ++NL +L E          D+L      
Sbjct: 249  ITEKENKMKDLTFLLEESRDKANQLEEKTKLQDENLKELIE--------KKDHLTKELED 300

Query: 1409 GKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRK----HMIILSESNSAKDY 1464
             K       +   +     +   +T  +L  EKE++ EEL K    H  +++E  +    
Sbjct: 301  IKMSLQRSMSTQKALEEDLQIATKTICQLTEEKEAQMEELNKAKAAHSFVVTEFEATTCS 360

Query: 1465 AETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNI 1524
             E   +   ++  K E+  + IT   Q+                             S+ 
Sbjct: 361  LEELLRTEQQRLEKNEDQLKIITMELQKK----------------------------SSE 392

Query: 1525 TKQLDEFLNAPGATLADIRNISG-----LAVNKNIQSNPEQIRERANEINNVI----KSL 1575
             +++ +F N     L +++ I       L   K  +   E+++ +  E+  ++    K +
Sbjct: 393  LEEMTKFKNNKEVELEELKKILAEDEKLLDEKKQFEKIAEELKGKEQELIFLLQAREKEI 452

Query: 1576 TDIDTILTETAGD----LAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAE 1631
             D++  LT         L +  DL  K  L K    + + T      +L      Q+ ++
Sbjct: 453  HDLEIQLTAIKTSEEHYLKEVEDL--KTELEKEKLKNIELTAHCDKLLLENKELTQEASD 510

Query: 1632 VAIQTAK--DDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNG-- 1687
            + ++  K  +DI   +K   ++   +++ ++K  E N     L++ L+S++  F+Q G  
Sbjct: 511  MTLELKKHQEDIINCKKQEERMLKQIENLEEK--EMN-----LRDELESVREEFIQKGDE 563

Query: 1688 -RSAMD-VENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNK 1745
             +  +D  E   +++E EV   +KQ   L ++     N L  + E+   K    + L  +
Sbjct: 564  VKCKLDKSEENARSIEYEVLKKEKQMKILENKC----NNLKKQIEN---KNKNIEELHQE 616

Query: 1746 ASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAEL----QKRIQ 1789
               L    +A+ KQLN  E   N  E EL    +   E+    QK I+
Sbjct: 617  NKALKKKGSAENKQLNAYEIKVNKLELELASAKQKFEEIIDNYQKEIE 664



 Score = 36.6 bits (84), Expect = 0.14
 Identities = 113/586 (19%), Positives = 225/586 (38%), Gaps = 95/586 (16%)

Query: 1248 DDMKKQIMNNEDPNGVGKTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQE 1307
            ++ KK+I + E    V   L   T++ N     + DLT  +++       L+E   L  E
Sbjct: 229  EEYKKEINDKE--KQVSLLLIQITEKEN----KMKDLTFLLEESRDKANQLEEKTKLQDE 282

Query: 1308 N---NVEGALELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQE 1364
            N    +E    LT++  D  + L ++S  T+K + +  +       + TKT  Q T   E
Sbjct: 283  NLKELIEKKDHLTKELEDIKMSL-QRSMSTQKALEEDLQ-------IATKTICQLT---E 331

Query: 1365 ENEKALQRLNEK-------ITSLEKNLPDLNELI------CDKRGDPCDNLCGGAGCGKC 1411
            E E  ++ LN+        +T  E     L EL+       +K  D    +         
Sbjct: 332  EKEAQMEELNKAKAAHSFVVTEFEATTCSLEELLRTEQQRLEKNEDQLKII--------- 382

Query: 1412 GGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKL 1471
                    T+     + +  E +K   N KE + EEL+    IL+E     D  +  +K+
Sbjct: 383  --------TMELQKKSSELEEMTK-FKNNKEVELEELK---KILAEDEKLLDEKKQFEKI 430

Query: 1472 INEKEAKAEEVFRGITTAKQESHAANI-----------LAKEAYDHAASVRNKTAAYVAS 1520
              E + K +E+   +   ++E H   I             KE  D    +  +    +  
Sbjct: 431  AEELKGKEQELIFLLQAREKEIHDLEIQLTAIKTSEEHYLKEVEDLKTELEKEKLKNIEL 490

Query: 1521 TSNITKQLDEFLNAPGATLADIRNISGLAVN-KNIQSNPEQIRERANEINNVIKSLTDID 1579
            T++  K L E            +  S + +  K  Q +    +++   +   I++L + +
Sbjct: 491  TAHCDKLLLE-------NKELTQEASDMTLELKKHQEDIINCKKQEERMLKQIENLEEKE 543

Query: 1580 TILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKD 1639
              L +    + +  +  +K +  K   D ++   + I   + +      + E      K 
Sbjct: 544  MNLRDELESVRE--EFIQKGDEVKCKLDKSEENARSIEYEVLKKEKQMKILENKCNNLKK 601

Query: 1640 DISAARKDLSQITNDLDDAQQKSNETN-------VKVKLLQERLKSIQAGF---LQNGRS 1689
             I    K++ ++  +    ++K +  N       +KV  L+  L S +  F   + N + 
Sbjct: 602  QIENKNKNIEELHQENKALKKKGSAENKQLNAYEIKVNKLELELASAKQKFEEIIDNYQK 661

Query: 1690 AMDVEN-EEKNLEKEVALAQKQASGLRSRYQETDNKLLSK-AESSGL----KRARGQMLL 1743
             ++ +   E+ L +EV  A+  A       +E D +   K AE   L    K    +++ 
Sbjct: 662  EIEDKKISEEKLLEEVEKAKAIADEAVKLQKEIDKRCQHKIAEMVALMEKHKHQYDKIIE 721

Query: 1744 NKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQ 1789
             + S+L +    + +Q +   A+    E EL+ +   +  L+K+++
Sbjct: 722  ERDSELGLYKNKEQEQSSAKAAL----EIELSNIKAELLSLKKQLE 763



 Score = 33.2 bits (75), Expect = 1.5
 Identities = 59/290 (20%), Positives = 112/290 (38%), Gaps = 32/290 (11%)

Query: 1217 ELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQRINL 1276
            E L+ K +++  L++   +  HDL + LT I   ++  +                     
Sbjct: 432  EELKGKEQELIFLLQAREKEIHDLEIQLTAIKTSEEHYLKE------------------- 472

Query: 1277 ARLALNDLTEEIKKLN-KTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETE 1335
                + DL  E++K   K  E+      LL EN      ELT++A D +L+LK+   +  
Sbjct: 473  ----VEDLKTELEKEKLKNIELTAHCDKLLLENK-----ELTQEASDMTLELKKHQEDII 523

Query: 1336 KQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDKR 1395
                  ER  K+ ENL  K      + +   E+ +Q+ +E    L+K+  +   +  +  
Sbjct: 524  NCKKQEERMLKQIENLEEKEMNLRDELESVREEFIQKGDEVKCKLDKSEENARSIEYEVL 583

Query: 1396 GDPCDNLCGGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIIL 1455
                        C        +     E    ++ A   K     K+  A E++ + + L
Sbjct: 584  KKEKQMKILENKCNNLKKQIENKNKNIEELHQENKALKKKGSAENKQLNAYEIKVNKLEL 643

Query: 1456 SESNSAKDYAE---TSKKLINEKEAKAEEVFRGITTAKQESHAANILAKE 1502
              +++ + + E     +K I +K+   E++   +  AK  +  A  L KE
Sbjct: 644  ELASAKQKFEEIIDNYQKEIEDKKISEEKLLEEVEKAKAIADEAVKLQKE 693


>gnl|CDD|224380 COG1463, Ttg2C, ABC-type transport system involved in resistance to
            organic solvents, periplasmic component [Secondary
            metabolites biosynthesis, transport, and catabolism].
          Length = 359

 Score = 37.1 bits (86), Expect = 0.074
 Identities = 39/217 (17%), Positives = 69/217 (31%), Gaps = 34/217 (15%)

Query: 1191 DQTTKVITDAS-NIKQTGASGA--------YTKEFELLEK---KIEDVK-ALVENTTRSS 1237
            D    +  D++ +I+ TG  G              +L       +E  +  +        
Sbjct: 88   DSDYPIPADSTASIRTTGLLGEKYIELTPGDAGGPKLRLGATIPLERTQVPIDLEDLLG- 146

Query: 1238 HDLTVMLTTIDDMK-KQIMN--NEDPNGVGKTLGNTTQRINLARL--ALNDLTEEIKKLN 1292
             DL ++L  +D  +   I+N       G G  L       NLA+   ALN    +I  L 
Sbjct: 147  -DLLLLLGGLDPDRLNAILNEAAAALAGTGPQLNALLD--NLAQFTDALNARDGDIGAL- 202

Query: 1293 KTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQINDA----ERQCKRT 1348
                 +     LL  +++  A +   +  D    L          ++DA           
Sbjct: 203  -----IANLNQLL--DSLAAASDQLDRLLDNLATLTAALAARRDALDDALAALSALAATV 255

Query: 1349 ENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLP 1385
             +L+ +  P   Q           L + +  LE+ L 
Sbjct: 256  NDLLAENRPNLNQALANLRPLATLLVDYLPGLEQLLH 292


>gnl|CDD|214826 smart00806, AIP3, Actin interacting protein 3.  Aip3p/Bud6p is a
            regulator of cell and cytoskeletal polarity in
            Saccharomyces cerevisiae that was previously identified
            as an actin-interacting protein. Actin-interacting
            protein 3 (Aip3p) localizes at the cell cortex where
            cytoskeleton assembly must be achieved to execute
            polarized cell growth, and deletion of AIP3 causes gross
            defects in cell and cytoskeletal polarity. Aip3p
            localization is mediated by the secretory pathway,
            mutations in early- or late-acting components of the
            secretory apparatus lead to Aip3p mislocalization.
          Length = 426

 Score = 37.0 bits (86), Expect = 0.091
 Identities = 31/198 (15%), Positives = 75/198 (37%), Gaps = 12/198 (6%)

Query: 1469 KKLINEKEAKAEEVFRGITTA--KQES---HAANILAKEAYDHAASVRNKTAAYVASTSN 1523
            KK I+++    +     +  A   Q              A + A    + +    +S+S 
Sbjct: 80   KKHIDDEIDTLQNELDEVKQALESQREAIQRLKERQQNSAANIARPAASPSPVLASSSSA 139

Query: 1524 ITKQLDEFLNAPGATLADIRNIS-GLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTIL 1582
            I+   +          A+++++   LAV +   +      E    I ++++ +    +  
Sbjct: 140  ISLA-NNPDKLNKEQRAELKSLQRELAVLRQTHN--SFFTEIKESIKDILEKIDKFKSSS 196

Query: 1583 TETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDIS 1642
               +G   +A     K  L+   +DS    V  + D++   R  +D+A+  ++ +K  + 
Sbjct: 197  LSASGSSNRAYVESSKKKLS-EDSDSLLTKVDDLQDIIEALR--KDVAQRGVRPSKKQLE 253

Query: 1643 AARKDLSQITNDLDDAQQ 1660
              +K+L     +L   ++
Sbjct: 254  TVQKELETARKELKKMEE 271



 Score = 30.8 bits (70), Expect = 7.2
 Identities = 29/185 (15%), Positives = 68/185 (36%), Gaps = 34/185 (18%)

Query: 1219 LEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQRINLAR 1278
            ++ +I+ ++  ++   ++       +  + + ++                      N+AR
Sbjct: 83   IDDEIDTLQNELDEVKQALESQREAIQRLKERQQNS------------------AANIAR 124

Query: 1279 LALNDL------TEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQST 1332
             A +        +  I   N   ++ KE    L+  +++  L + RQ H+         T
Sbjct: 125  PAASPSPVLASSSSAISLANNPDKLNKEQRAELK--SLQRELAVLRQTHN------SFFT 176

Query: 1333 ETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELIC 1392
            E ++ I D   +  + ++          +   E+ K   +L+E   SL   + DL ++I 
Sbjct: 177  EIKESIKDILEKIDKFKSSSLSASGSSNRAYVESSKK--KLSEDSDSLLTKVDDLQDIIE 234

Query: 1393 DKRGD 1397
              R D
Sbjct: 235  ALRKD 239


>gnl|CDD|217933 pfam04156, IncA, IncA protein.  Chlamydia trachomatis is an obligate
            intracellular bacterium that develops within a
            parasitophorous vacuole termed an inclusion. The
            inclusion is non-fusogenic with lysosomes but intercepts
            lipids from a host cell exocytic pathway. Initiation of
            chlamydial development is concurrent with modification of
            the inclusion membrane by a set of C. trachomatis-encoded
            proteins collectively designated Incs. One of these Incs,
            IncA, is functionally associated with the homotypic
            fusion of inclusions. This family probably includes
            members of the wider Inc family rather than just IncA.
          Length = 186

 Score = 35.9 bits (83), Expect = 0.098
 Identities = 20/113 (17%), Positives = 50/113 (44%), Gaps = 5/113 (4%)

Query: 1634 IQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDV 1693
            ++  + ++S  ++ LS++  +L+D +++  E   +++ L+E L+ ++            +
Sbjct: 74   LEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEERLESL 133

Query: 1694 ENEEKNLEKEVALAQK----QASGLRSRYQETDNKLLSKAES-SGLKRARGQM 1741
            E   K L KE+   ++    +   LR   +     L    E+   L+    Q+
Sbjct: 134  EESIKELAKELRELRQDLREEVEELREELERLQENLQRLQEAIQELQSLLEQL 186



 Score = 32.8 bits (75), Expect = 0.99
 Identities = 14/66 (21%), Positives = 34/66 (51%), Gaps = 4/66 (6%)

Query: 1326 KLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLP 1385
            +LK+Q +E ++++ D E +    E           +  +   + L+ L E++ SLE+++ 
Sbjct: 83   ELKQQLSELQEELEDLEERIAELE----SELEDLKEDLQLLRELLKSLEERLESLEESIK 138

Query: 1386 DLNELI 1391
            +L + +
Sbjct: 139  ELAKEL 144


>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation, interacts
            with SMC proteins [Cell division and chromosome
            partitioning].
          Length = 622

 Score = 36.9 bits (85), Expect = 0.12
 Identities = 60/343 (17%), Positives = 121/343 (35%), Gaps = 30/343 (8%)

Query: 1472 INEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEF 1531
            I   + + + ++  I  A + S     L +E +    S  NK   Y    + + ++  E+
Sbjct: 273  IANLKTQNDNLYEKIQEAMKISQKIKTL-REKWRALKSDSNK---YENYVNAMKQKSQEW 328

Query: 1532 LNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAK 1591
                         +  L     I+   E+I+   + I+ + K L        +      +
Sbjct: 329  PGK----------LEKLKSE--IELKEEEIKALQSNIDELHKQLRKQGISTEQFELMNQE 376

Query: 1592 ANDLKRKANLTKAGADS-------TKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAA 1644
               L R+ +     +D         K   Q I   L +     D     I  ++  I   
Sbjct: 377  REKLTRELDKINIQSDKLTKSVKSRKLEAQGIFKSLEKTLRQYDSLIQNITRSRSQIGHN 436

Query: 1645 RKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEV 1704
              D S   N      + S       K + E    IQ         ++ +E + KNL+ ++
Sbjct: 437  VNDSSLKINIEQLFPKGSGINESIKKSILELNDEIQERIKTEENKSITLEEDIKNLKHDI 496

Query: 1705 ALAQKQASGLRSRYQETDNK-LLSKAESSGLKRA-RGQM--LLNKASQLSVNTTAKLKQL 1760
                +    L     E ++K  LSK E+     A R ++  L  + + L++ +   +   
Sbjct: 497  NELTQILEKLELELSEANSKFELSKEENERELVAQRIEIEKLEKELNDLNLLSKTSILDA 556

Query: 1761 NETEAMFNSQESELTELSKNIAELQKRIQSCINFIVDKSNNYK 1803
               E +  S E +L EL  ++   + +I   +  ++D ++ +K
Sbjct: 557  ---EQLVQSTEIKLDELKVDLNRKRYKIHKQVIHVIDITSKFK 596



 Score = 36.5 bits (84), Expect = 0.16
 Identities = 41/173 (23%), Positives = 70/173 (40%), Gaps = 35/173 (20%)

Query: 1216 FELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQRIN 1275
            F+ LEK +    +L++N TRS   +      ++D   +I N E     G           
Sbjct: 409  FKSLEKTLRQYDSLIQNITRSRSQIG---HNVNDSSLKI-NIEQLFPKG----------- 453

Query: 1276 LARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETE 1335
                  + + E IKK              + E N E    +  + +  S+ L+E     +
Sbjct: 454  ------SGINESIKKS-------------ILELNDEIQERIKTEENK-SITLEEDIKNLK 493

Query: 1336 KQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLN 1388
              IN+  +  ++ E  +++   +F   +EENE+ L     +I  LEK L DLN
Sbjct: 494  HDINELTQILEKLELELSEANSKFELSKEENERELVAQRIEIEKLEKELNDLN 546


>gnl|CDD|218762 pfam05816, TelA, Toxic anion resistance protein (TelA).  This family
            consists of several prokaryotic TelA like proteins. TelA
            and KlA are associated with tellurite resistance and
            plasmid fertility inhibition.
          Length = 333

 Score = 36.4 bits (85), Expect = 0.12
 Identities = 30/119 (25%), Positives = 50/119 (42%), Gaps = 36/119 (30%)

Query: 1266 TLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSL 1325
             L       +  + A+ND T E         +L +NA LL++N++E A    ++A    +
Sbjct: 236  ALKRQKLAADAQK-AVNDTTNE---------LLLKNAELLKQNSIEIA----KEAERGVI 281

Query: 1326 KLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEEN-------EKALQRLNEKI 1377
             +     ET K+            NL+  T  +  Q QEE        EK L++L E++
Sbjct: 282  DI-----ETLKKAFQ---------NLIE-TIDETLQIQEEGRQKRKQAEKELEQLEEEL 325


>gnl|CDD|219690 pfam07996, T4SS, Type IV secretion system proteins.  Members of this
            family are components of the type IV secretion system.
            They mediate intracellular transfer of macromolecules via
            a mechanism ancestrally related to that of bacterial
            conjugation machineries.
          Length = 195

 Score = 35.8 bits (83), Expect = 0.12
 Identities = 31/168 (18%), Positives = 61/168 (36%), Gaps = 22/168 (13%)

Query: 1623 ARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAG 1682
             +     A+  +   K  I  A++  + IT         +N     V L ++  +S+   
Sbjct: 18   LQQLAQWAQ-QLDQLKQQIQQAKQQYNSITGSRGLGALLNNPALRSV-LPKD-WQSVYDL 74

Query: 1683 FLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQML 1742
             L      +   +  K+L  +  L    +    SR           A+      A G+  
Sbjct: 75   VLDGY-GGLS--SSAKSLYSKEKLFDTCSYPSGSRASICQRSADKAAQDK----AVGEQA 127

Query: 1743 LNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQS 1790
             ++A+    N  ++++QL          + +  +  K IA+LQ RIQ+
Sbjct: 128  YDQAT----NRLSQIEQL--------LSQIDSAKDPKEIADLQARIQA 163


>gnl|CDD|182106 PRK09841, PRK09841, cryptic autophosphorylating protein tyrosine
            kinase Etk; Provisional.
          Length = 726

 Score = 36.8 bits (85), Expect = 0.13
 Identities = 41/189 (21%), Positives = 82/189 (43%), Gaps = 46/189 (24%)

Query: 1600 NLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQ 1659
             LT  G D     + +I++ +      Q++A  A Q ++  +   ++ L ++ ++LD A+
Sbjct: 231  ELTMTGDDP--QLITRILNSIANNYLQQNIARQAAQDSQS-LEFLQRQLPEVRSELDQAE 287

Query: 1660 QKSN---------ETNVKVKLLQERLKSI----------QAGFLQ-------NGRSAMD- 1692
            +K N         + N++ K + E++ ++          +A   Q         R+ ++ 
Sbjct: 288  EKLNVYRQQRDSVDLNLEAKAVLEQIVNVDNQLNELTFREAEISQLYKKDHPTYRALLEK 347

Query: 1693 ---VENEEKNLEKEV-ALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQ 1748
               +E E K L K V A+   Q   LR          LS+   +G  RA    LLN+  +
Sbjct: 348  RQTLEQERKRLNKRVSAMPSTQQEVLR----------LSRDVEAG--RAVYLQLLNRQQE 395

Query: 1749 LSVNTTAKL 1757
            LS++ ++ +
Sbjct: 396  LSISKSSAI 404


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein TolA;
            Provisional.
          Length = 387

 Score = 36.3 bits (84), Expect = 0.13
 Identities = 33/130 (25%), Positives = 41/130 (31%), Gaps = 22/130 (16%)

Query: 1427 AKDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGI 1486
             K  AE  KK   E   KA    K         +AK  AE  KK   E + KA       
Sbjct: 161  KKAAAEAKKKAEAEAAKKAAAEAKKK--AEAEAAAKAAAEAKKKAEAEAKKKAAAE---- 214

Query: 1487 TTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQ---LDEFLNAPGATLADIR 1543
               K  + A    AK A +  A+     AA  A  +   K    +D+             
Sbjct: 215  AKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEVDDLFG---------- 264

Query: 1544 NISGLAVNKN 1553
               GL   KN
Sbjct: 265  ---GLDSGKN 271



 Score = 32.1 bits (73), Expect = 3.0
 Identities = 22/84 (26%), Positives = 37/84 (44%), Gaps = 5/84 (5%)

Query: 1657 DAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRS 1716
              QQ++ E   K    QERLK ++   L     A + + + +   K+ AL QKQA    +
Sbjct: 84   KEQQQAEELQQKQAAEQERLKQLEKERLA----AQEQKKQAEEAAKQAALKQKQAEEAAA 139

Query: 1717 RYQETDNKLLSKAESSGLKRARGQ 1740
            +      K  ++AE+     A  +
Sbjct: 140  K-AAAAAKAKAEAEAKRAAAAAKK 162


>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).  This
            family consists of a number of sequences found in
            Arabidopsis thaliana, Oryza sativa and Lycopersicon
            esculentum (Tomato). The function of this family is
            unknown.
          Length = 767

 Score = 36.9 bits (85), Expect = 0.14
 Identities = 54/385 (14%), Positives = 125/385 (32%), Gaps = 38/385 (9%)

Query: 1417 SNGTLSESNSAKDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKL--INE 1474
            S      S+     +  +     E E   E      +     +  K   E    +  I+E
Sbjct: 399  SADGSKVSSPKCSDSSVALTGPVEHEDNLESKLSDRLPEVLQSVLKLVME-KHIISEISE 457

Query: 1475 KEAKAEEVFRGITTAKQESHAANILAKEAYD----HAASVRNKT-AAYVASTSNITKQLD 1529
                 E     +  +         +A          A   R++     V+  S  T  L+
Sbjct: 458  ILEDIEAELDSLERSSNGDDENEEVAMVGSLVEESSAVIERSQELEGAVSRISEFTSVLE 517

Query: 1530 EFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDL 1589
              +      L             +++   ++       + N  KSL D+    +E   + 
Sbjct: 518  HEVTVCQDLLDG---------KADLEKFIQEFSLTLEWVVNQEKSLQDVSVEASEIKKNF 568

Query: 1590 AKANDLKRKANLTKAGADSTKNTVQKIVDVL----TEARTAQDMAEVAIQTAKDDISAAR 1645
                  +++ N       +   +  + ++ L              +  ++ +K+ +    
Sbjct: 569  LGLKSSEKEINSPDEVKGAVCISTLEELETLKSEKENLDGELSKCKDDLEESKNKLQETE 628

Query: 1646 KDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVA 1705
            K L ++ ++LD +Q+ ++    ++K + E  +S++            ++ +  +LE E+ 
Sbjct: 629  KKLEELKSELDASQESNSLAETQLKCMVESYESLELRAKDLEAEHKSLQEKISSLEDELE 688

Query: 1706 LAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEA 1765
              ++    L ++ +E + K+    +              +  Q       K+KQ  E  A
Sbjct: 689  KERQNHQELIAKCRELEEKIERAEQ-------------EENMQKLDEEEQKIKQEKEIAA 735

Query: 1766 MFNSQESELTELSKNIAELQKRIQS 1790
                   +L E  + I  L K++QS
Sbjct: 736  ----AAEKLAECQETILSLGKQLQS 756


>gnl|CDD|172947 PRK14472, PRK14472, F0F1 ATP synthase subunit B; Provisional.
          Length = 175

 Score = 35.2 bits (81), Expect = 0.14
 Identities = 32/133 (24%), Positives = 57/133 (42%), Gaps = 17/133 (12%)

Query: 1561 IRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADST------KNTVQ 1614
            + ER   I + I            +A D A+A   K +  L KA A++       K   +
Sbjct: 47   LEEREKGIQSSIDR--------AHSAKDEAEAILRKNRELLAKADAEADKIIREGKEYAE 98

Query: 1615 KIVDVLTEARTAQDMAEVAIQTAKDDI-SAARKDLSQITNDLDDAQQKSNETNVKVKLLQ 1673
            K+   +TE   A   A+  I +AK++I    R+ L  + N++ D   K  E  ++  L  
Sbjct: 99   KLRAEITE--KAHTEAKKMIASAKEEIEQEKRRALDVLRNEVADLAVKGAEKIIRTSLDA 156

Query: 1674 ERLKSIQAGFLQN 1686
            ++ K +    +Q+
Sbjct: 157  DKQKKVVDSMIQD 169


>gnl|CDD|117396 pfam08826, DMPK_coil, DMPK coiled coil domain like.  This domain is
            found in the myotonic dystrophy protein kinase (DMPK) and
            adopts a coiled coil structure. It plays a role in
            dimerisation.
          Length = 61

 Score = 32.5 bits (74), Expect = 0.15
 Identities = 17/59 (28%), Positives = 30/59 (50%), Gaps = 1/59 (1%)

Query: 1660 QKSNETNVKVK-LLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSR 1717
            Q + E  ++ K  LQE L+ ++A  +       + E + + LE EV   +K+   LR+R
Sbjct: 3    QSALEAEIRAKQSLQEELEKVKAANINFESKLQEAEAKNRELEAEVRQLKKRMEELRAR 61


>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
            HlyD family.  Type I secretion is an ABC transport
            process that exports proteins, without cleavage of any
            signal sequence, from the cytosol to extracellular medium
            across both inner and outer membranes. The secretion
            signal is found in the C-terminus of the transported
            protein. This model represents the adaptor protein
            between the ATP-binding cassette (ABC) protein of the
            inner membrane and the outer membrane protein, and is
            called the membrane fusion protein. This model selects a
            subfamily closely related to HlyD; it is defined narrowly
            and excludes, for example, colicin V secretion protein
            CvaA and multidrug efflux proteins [Protein fate, Protein
            and peptide secretion and trafficking].
          Length = 423

 Score = 36.1 bits (84), Expect = 0.15
 Identities = 33/175 (18%), Positives = 63/175 (36%), Gaps = 17/175 (9%)

Query: 1621 TEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQ 1680
             EA  A+  ++V    A+     AR  L    +     +   +  + +   + E +K  Q
Sbjct: 79   VEADAAELESQVLRLEAE----VAR--LRAEADSQAAIEFPDDLLSAEDPAVPELIKGQQ 132

Query: 1681 AGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQ 1740
            + F    +S +  + E    +    + Q +A     + Q    +   +  S  L+ AR +
Sbjct: 133  SLFESR-KSTLRAQLELILAQ----IKQLEAELAGLQAQLQALRQQLEVISEELE-ARRK 186

Query: 1741 MLLNKASQL--SVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCIN 1793
            +   K   L   +      ++  E +      E+EL  L + I ELQ   Q    
Sbjct: 187  L---KEKGLVSRLELLELERERAEAQGELGRLEAELEVLKRQIDELQLERQQIEQ 238


>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
          Length = 428

 Score = 36.2 bits (84), Expect = 0.17
 Identities = 35/138 (25%), Positives = 57/138 (41%), Gaps = 30/138 (21%)

Query: 1610 KNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAAR-KDLSQITNDLDDAQQ-----KSN 1663
            K + QK   +L E +  Q   E A           R K L+ + + L   QQ     ++N
Sbjct: 192  KQSQQK--TLLYEQQAQQQKLEQA--------RNERKKTLTGLESSLQKDQQQLSELRAN 241

Query: 1664 ETNVKVKLLQ-ERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETD 1722
            E+ ++  + + ER    +A            E E +   + V   QKQA    S Y+ T+
Sbjct: 242  ESRLRDSIARAEREAKARA------------EREAREAAR-VRDKQKQAKRKGSTYKPTE 288

Query: 1723 NKLLSKAESSGLKRARGQ 1740
            ++    + + GL R RGQ
Sbjct: 289  SERSLMSRTGGLGRPRGQ 306



 Score = 31.6 bits (72), Expect = 4.9
 Identities = 23/115 (20%), Positives = 49/115 (42%), Gaps = 42/115 (36%)

Query: 1673 QERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESS 1732
            +++LKSIQ           D+    K  EK V   Q+Q + L ++ ++ +  +   +++S
Sbjct: 46   RDQLKSIQQ----------DIA--AK--EKSVRQQQQQRASLLAQLKKQEEAI---SQAS 88

Query: 1733 GLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKR 1787
                                     ++L ET+   N    ++ EL+ +IA+L+++
Sbjct: 89   -------------------------RKLRETQNTLNQLNKQIDELNASIAKLEQQ 118


>gnl|CDD|236798 PRK10929, PRK10929, putative mechanosensitive channel protein;
            Provisional.
          Length = 1109

 Score = 36.6 bits (85), Expect = 0.17
 Identities = 29/112 (25%), Positives = 47/112 (41%), Gaps = 19/112 (16%)

Query: 1278 RLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAH----------DTSLKL 1327
            R  LN+  +E + +            +LQ ++    LE +RQA           D+  +L
Sbjct: 85   RQQLNNERDEPRSVPPNMSTDALEQEILQVSS--QLLEKSRQAQQEQDRAREISDSLSQL 142

Query: 1328 KEQSTETEKQINDAERQCKRTENLVTKTFP----QFTQGQEENEKALQRLNE 1375
             +Q TE  +Q+N+ ER   R + L T   P    Q T  Q E+      ++E
Sbjct: 143  PQQQTEARRQLNEIER---RLQTLGTPNTPLAQAQLTALQAESAALKALVDE 191


>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
            This domain is found associated with the Pfam tRNA
            synthetase class II domain (pfam00587) and represents the
            N-terminal domain of seryl-tRNA synthetase.
          Length = 108

 Score = 33.7 bits (78), Expect = 0.18
 Identities = 22/110 (20%), Positives = 46/110 (41%), Gaps = 17/110 (15%)

Query: 1552 KNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKA-GADSTK 1610
            K I+ NPE ++E+   +      + D+D +L           D +R+    +     + +
Sbjct: 5    KLIRENPEAVKEK---LRKRGVDVLDVDELL---------ELDEERRELQVELEELQAER 52

Query: 1611 NTVQKIVDVL--TEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDA 1658
            N + K +      +      +AEV  +  KD++ A   +L ++  +LD  
Sbjct: 53   NELSKEIGKAKKKKEDAEALIAEV--KELKDELKALEAELRELEAELDKL 100


>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
            [Posttranslational modification, protein turnover,
            chaperones].
          Length = 193

 Score = 35.0 bits (81), Expect = 0.19
 Identities = 18/88 (20%), Positives = 39/88 (44%), Gaps = 7/88 (7%)

Query: 1629 MAEVAIQTA-KDDISAARKDLSQITNDLDDAQQKSNETNV---KVKLLQERLKSIQAGFL 1684
            M++   +T   D       + S+     ++  ++ NE      ++  L+ +L+ ++  +L
Sbjct: 1    MSDKEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKYL 60

Query: 1685 QNGRSAMDVENEEKNLEKEVALAQKQAS 1712
               R+  + EN  K  E+E   A+K A 
Sbjct: 61   ---RAQAEFENLRKRTEREREEAKKYAI 85


>gnl|CDD|163494 TIGR03782, Bac_Flav_CT_J, Bacteroides conjugative transposon TraJ
            protein.  Members of this protein family are designated
            TraM and are found in a proposed transfer region of a
            class of conjugative transposon found in the Bacteroides
            lineage. This family is related conjugation system
            proteins in the Proteobacteria, including TrbL of
            Agrobacterium Ti plasmids and VirB6 [Cellular processes,
            DNA transformation].
          Length = 322

 Score = 35.8 bits (83), Expect = 0.19
 Identities = 20/86 (23%), Positives = 45/86 (52%), Gaps = 8/86 (9%)

Query: 1501 KEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQ 1560
            K+  ++ A VR+   AY+ S     ++LDE     G + +D++ ++G+ + +   +  + 
Sbjct: 108  KDKLEYEAMVRDPETAYLVSDEEFDRKLDEL----GWSPSDMKTMAGMYMERAAYNMKKS 163

Query: 1561 IRERANEINNVI---KSLTDIDTILT 1583
            +R+   E+  ++    +L  IDT+ T
Sbjct: 164  VRDWFRELLELLFQAAALV-IDTLRT 188


>gnl|CDD|217135 pfam02601, Exonuc_VII_L, Exonuclease VII, large subunit.  This family
            consist of exonuclease VII, large subunit EC:3.1.11.6
            This enzyme catalyzes exonucleolytic cleavage in either
            5'->3' or 3'->5' direction to yield
            5'-phosphomononucleotides. This exonuclease VII enzyme is
            composed of one large subunit and 4 small ones.
          Length = 295

 Score = 35.7 bits (83), Expect = 0.20
 Identities = 32/133 (24%), Positives = 53/133 (39%), Gaps = 21/133 (15%)

Query: 1630 AEVAIQTAKDDISAARKDLSQITNDLDDA-QQKSNETNVKVKLLQERLKSIQAGFLQNGR 1688
            AE+ +    +      + L  +   L  A + +  +   ++ LL+ERLKS+    L+   
Sbjct: 138  AELLVPDRTE----LLQKLEGLEQRLSRALKNRLEKEQDRLNLLRERLKSLSRRLLEQ-- 191

Query: 1689 SAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQ 1748
                  +EE+  E    L       L SR Q       S+ E   L R   +  L +   
Sbjct: 192  ------HEERLAELRDRLISS-IQNLLSRKQ-------SRLERLILNRELEKNSLLENKL 237

Query: 1749 LSVNTTAKLKQLN 1761
             + N TA+LK L+
Sbjct: 238  ATANLTAQLKALS 250


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
            includes outer membrane proteins such as OmpH among
            others. Skp (OmpH) has been characterized as a molecular
            chaperone that interacts with unfolded proteins as they
            emerge in the periplasm from the Sec translocation
            machinery.
          Length = 157

 Score = 34.5 bits (80), Expect = 0.21
 Identities = 30/148 (20%), Positives = 59/148 (39%), Gaps = 32/148 (21%)

Query: 1654 DLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASG 1713
            D+     +S       K L++  K +QA          +++ +EK L+KE    QKQA+ 
Sbjct: 22   DVQKVLSESPAGKAAQKQLEKEFKKLQA----------ELQKKEKELQKEEQKLQKQAA- 70

Query: 1714 LRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQ-ES 1772
                        LS+      ++A+ Q L  K  +L     A  ++L + +         
Sbjct: 71   -----------TLSEEA----RKAKQQELQQKQQELQQKQQAAQQELQQKQQELLQPIYD 115

Query: 1773 ELTELSKNIAELQKRIQSCINFIVDKSN 1800
            ++ +  K +A+ +       + ++DKS 
Sbjct: 116  KIDKAIKEVAKEKG-----YDLVLDKSA 138


>gnl|CDD|220306 pfam09599, IpaC_SipC, Salmonella-Shigella invasin protein C
            (IpaC_SipC).  This entry represents a family of proteins
            associated with bacterial type III secretion systems,
            which are injection machines for virulence factors into
            host cell cytoplasm. Characterized members of this
            protein family are known to be secreted and are described
            as invasins, including IpaC from Shigella flexneri and
            SipC from Salmonella typhimurium. Members may be referred
            to as invasins, pathogenicity island effectors, and cell
            invasion proteins.
          Length = 337

 Score = 35.6 bits (82), Expect = 0.22
 Identities = 45/265 (16%), Positives = 78/265 (29%), Gaps = 45/265 (16%)

Query: 1498 ILAKEAYDHAAS--VRNKTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKN-- 1553
            +L+ +A   AA+  VR   AA  +S      Q+             +   SG A+ KN  
Sbjct: 110  LLSFDATKSAAASMVREGKAALSSSIIGSATQVGITGVGAKLQAKGLSKQSG-ALKKNLK 168

Query: 1554 -IQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNT 1612
                   +  +   ++N                        D  +   + +   D+TK  
Sbjct: 169  PSNKLSAEAAQLKLQLNGQNAP---------------KLVADELKHVKIKRTTGDATKML 213

Query: 1613 VQKIVDVLTEARTAQDMAEVAIQTAKDDISA-ARKDLSQITND--LDDAQQKSNETNVKV 1669
                      A      AE   Q ++   S   + DL Q   +     AQ          
Sbjct: 214  ----QKNKITASNKALSAEHKAQLSQRVESLQDKIDLHQQAYEQNTLKAQAMQM----IG 265

Query: 1670 KLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKA 1729
             L+     S  AG +  G        EE   ++    + + A    +   ET        
Sbjct: 266  DLIMR--MSANAGNISGGSGQYAATQEEAE-QQISQASSRTA---STASNETR------- 312

Query: 1730 ESSGLKRARGQMLLNKASQLSVNTT 1754
            E+S   +   Q LL     ++ + +
Sbjct: 313  EASRKTKQLIQELLKILESINQSKS 337


>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
            binding.  This is the C-terminal conserved coiled coil
            region of a family of TATA element modulatory factor 1
            proteins conserved in eukaryotes. The proteins bind to
            the TATA element of some RNA polymerase II promoters and
            repress their activity. by competing with the binding of
            TATA binding protein. TMF1_TATA_bd is the most conserved
            part of the TMFs. TMFs are evolutionarily conserved
            golgins that bind Rab6, a ubiquitous ras-like GTP-binding
            Golgi protein, and contribute to Golgi organisation in
            animal and plant cells. The Rab6-binding domain appears
            to be the same region as this C-terminal family.
          Length = 121

 Score = 33.7 bits (78), Expect = 0.23
 Identities = 23/68 (33%), Positives = 37/68 (54%), Gaps = 5/68 (7%)

Query: 1276 LARLAL--NDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLK-LKEQST 1332
            LARL    ++  +EI KL +  E L+E    ++E  +E  LE   Q ++T+L+ L E+S 
Sbjct: 40   LARLEAERDEARQEIVKLTEENEELRELKKEIEE--LEKELEDLEQRYETTLELLGEKSE 97

Query: 1333 ETEKQIND 1340
              E+   D
Sbjct: 98   RVEELRAD 105


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner membrane
            complex of itself with TolQ and TolR to the outer
            membrane complex of TolB and OprL (also called Pal). Most
            of the length of the protein consists of low-complexity
            sequence that may differ in both length and composition
            from one species to another, complicating efforts to
            discriminate TolA (the most divergent gene in the tol-pal
            system) from paralogs such as TonB. Selection of members
            of the seed alignment and criteria for setting scoring
            cutoffs are based largely conserved operon struction.
            //The Tol-Pal complex is required for maintaining outer
            membrane integrity. Also involved in transport (uptake)
            of colicins and filamentous DNA, and implicated in
            pathogenesis. Transport is energized by the proton motive
            force. TolA is an inner membrane protein that interacts
            with periplasmic TolB and with outer membrane porins
            ompC, phoE and lamB [Transport and binding proteins,
            Other, Cellular processes, Pathogenesis].
          Length = 346

 Score = 35.6 bits (82), Expect = 0.23
 Identities = 41/135 (30%), Positives = 56/135 (41%), Gaps = 11/135 (8%)

Query: 1423 ESNSAKDYAETSKKLINEKESKA-EELRKHMIILSESNSAKDYAETSKKLINEKEAKAEE 1481
            E   AK  AE   K   E E KA EE +K     +E  +    A  +KK   E + KAE 
Sbjct: 122  EEAKAKQAAEAKAKAEAEAEKKAKEEAKKQ----AEEEAKAKAAAEAKKKAAEAKKKAEA 177

Query: 1482 VFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNA-PGATLA 1540
                   AK E+  A   A+EA   A + + K AA  A+ +           A   A  A
Sbjct: 178  E----AKAKAEA-KAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAAAAEAERKADEA 232

Query: 1541 DIRNISGLAVNKNIQ 1555
            ++ +I GLA   N +
Sbjct: 233  ELGDIFGLASGSNAE 247



 Score = 34.0 bits (78), Expect = 0.67
 Identities = 27/86 (31%), Positives = 35/86 (40%), Gaps = 15/86 (17%)

Query: 1431 AETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITTAK 1490
            AE + K   EK+ +AEE +           AK  AE   K   E E KA+E  +     +
Sbjct: 107  AEQAAKQAEEKQKQAEEAK-----------AKQAAEAKAKAEAEAEKKAKEEAK----KQ 151

Query: 1491 QESHAANILAKEAYDHAASVRNKTAA 1516
             E  A    A EA   AA  + K  A
Sbjct: 152  AEEEAKAKAAAEAKKKAAEAKKKAEA 177


>gnl|CDD|180240 PRK05759, PRK05759, F0F1 ATP synthase subunit B; Validated.
          Length = 156

 Score = 34.4 bits (80), Expect = 0.24
 Identities = 21/103 (20%), Positives = 43/103 (41%), Gaps = 13/103 (12%)

Query: 1614 QKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNET--NVKVKL 1671
            +KI D L  A  A+   E+A    +  ++ AR + ++I   ++ A++++ +     K + 
Sbjct: 38   KKIADGLAAAERAKKELELAQAKYEAQLAEARAEAAEI---IEQAKKRAAQIIEEAKAEA 94

Query: 1672 LQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVA-----LAQK 1709
              E  +       +  +       E   L K+VA      A+K
Sbjct: 95   EAEAARIKAQAQAEIEQERKRAREE---LRKQVADLAVAGAEK 134


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are RNase
            Y, an endoribonuclease. The member from Bacillus
            subtilis, YmdA, has been shown to be involved in turnover
            of yitJ riboswitch [Transcription, Degradation of RNA].
          Length = 514

 Score = 35.7 bits (83), Expect = 0.26
 Identities = 31/170 (18%), Positives = 60/170 (35%), Gaps = 58/170 (34%)

Query: 1624 RTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGF 1683
            R A+     A + AK  I  A+K+         +  +K      K ++ + R        
Sbjct: 21   RIAEKKLGSAEELAKRIIEEAKKEA--------ETLKKEALLEAKEEVHKLR-------- 64

Query: 1684 LQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLL 1743
                            LE+E+   ++     R+  Q  + +LL + E+  L R       
Sbjct: 65   --------------AELERELK--ER-----RNELQRLERRLLQREET--LDR------- 94

Query: 1744 NKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCIN 1793
                        K++ L++ E     +E EL+   KN+ E ++ ++  I 
Sbjct: 95   ------------KMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIA 132


>gnl|CDD|233228 TIGR00998, 8a0101, efflux pump membrane protein (multidrug resistance
            protein A).  [Transport and binding proteins, Other].
          Length = 334

 Score = 35.2 bits (81), Expect = 0.29
 Identities = 45/191 (23%), Positives = 72/191 (37%), Gaps = 42/191 (21%)

Query: 1577 DIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQT 1636
            +   + ++ +G + + N      +  K G            DVL   R     AE+A+  
Sbjct: 41   NQLQVSSQVSGSVIEVN--VDDTDYVKQG------------DVL--VRLDPTNAELALAK 84

Query: 1637 AKDDISAARK--DLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVE 1694
            A+ +++A  +     +IT       Q   E+  K+KL Q R K +QA             
Sbjct: 85   AEANLAALVRQTKQLEITV---QQLQAKVESL-KIKLEQAREKLLQA------------- 127

Query: 1695 NEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTT 1754
              E +L + V L +K   GL SR +    +    +  + L  A  +  LN    L   T 
Sbjct: 128  --ELDLRRRVPLFKK---GLISREELDHARKALLSAKAALNAAIQEQ-LNANQALVRGTP 181

Query: 1755 AKLKQLNETEA 1765
             K KQ    EA
Sbjct: 182  LK-KQPAVQEA 191



 Score = 34.4 bits (79), Expect = 0.52
 Identities = 29/137 (21%), Positives = 47/137 (34%), Gaps = 23/137 (16%)

Query: 1594 DLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITN 1653
            DL+R+  L K G  S +         L  AR A   A+ A+  A  +   A + L + T 
Sbjct: 130  DLRRRVPLFKKGLISREE--------LDHARKALLSAKAALNAAIQEQLNANQALVRGTP 181

Query: 1654 DLDDAQQKSNETNVKVKLLQ-ERLK------------SIQAG-FLQNGRSAMDVENEEKN 1699
                   +  +  +K   L  +R               +Q G  +  G+  M V   E  
Sbjct: 182  LKKQPAVQEAKERLKTAWLALKRTVIRAPFDGYVARRFVQVGQVVSPGQPLMAVVPAE-Q 240

Query: 1700 LEKEVALAQKQASGLRS 1716
            +  E    + Q   +R 
Sbjct: 241  MYVEANFKETQLKNVRI 257


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This family
            consists of several eukaryotic proteins of unknown
            function.
          Length = 536

 Score = 35.1 bits (81), Expect = 0.35
 Identities = 41/234 (17%), Positives = 87/234 (37%), Gaps = 21/234 (8%)

Query: 1579 DTILTETAGDLAKANDLKRKANLTKA-GADSTKNTVQ--KIVDVLTEARTAQDMAEVAIQ 1635
            + +          ++     A+L      DS     +  +   V   A  A ++    + 
Sbjct: 200  EYLRQAAQAGTTGSSSADDLADLLSKFRLDSWDKGSRFTRTEKVEQAATAAAEVPPAEMD 259

Query: 1636 TAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQ-NGR-SAMDV 1693
            T +D       +L  +       Q++ +E    ++ +   +K++ +   Q N   + +  
Sbjct: 260  TEEDRTKEREAELEAL-------QEQIDELESSIEEVLSEIKALASKIKQVNEELTTVRQ 312

Query: 1694 ENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNT 1753
            ENEE  LE+E  +  K+   L     + +N +   A+   L  A  + LL  A Q   + 
Sbjct: 313  ENEE--LEEEYKIK-KRTVELLP---DAENNV---AKLQALVVASSERLLELAQQWEAHR 363

Query: 1754 TAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCINFIVDKSNNYKNCVS 1807
            T  + +    +    ++E E       I +L+ +I+   + +  K   YK  + 
Sbjct: 364  TPLIDEYRSLKEKNRNKEDETQRQLDEIKKLRNKIEELESELQTKEQLYKQLLD 417


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
            envelope biogenesis, outer membrane].
          Length = 387

 Score = 34.9 bits (80), Expect = 0.36
 Identities = 30/110 (27%), Positives = 41/110 (37%), Gaps = 15/110 (13%)

Query: 1427 AKDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGI 1486
            AK  AE + K   E ++KAE             +AK  AE   K    ++AKAE   +  
Sbjct: 169  AKKKAEEAAKAAEEAKAKAEAA-----------AAKKKAEAEAKA-AAEKAKAEAEAKAK 216

Query: 1487 TTAKQESHAANILAKE---AYDHAASVRNKTAAYVASTSNITKQLDEFLN 1533
               K E+ A    A E   A   A + +   AA  A        LD+   
Sbjct: 217  AEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERKAAAAALDDIFG 266


>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
          Length = 430

 Score = 35.0 bits (80), Expect = 0.37
 Identities = 26/136 (19%), Positives = 47/136 (34%), Gaps = 6/136 (4%)

Query: 1423 ESNSAKDYAETSKKLINEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEEV 1482
              N      E  ++     ++KA  L K     +E  + ++ A+   K     +AKA  +
Sbjct: 49   PKNDDMTIEEAKRRAAAAAKAKAAALAKQKREGTEEVTEEEKAKAKAKAAAAAKAKAAAL 108

Query: 1483 FR----GITTAKQESHAA--NILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPG 1536
             +    G     +E  AA     A  A   AA++  +        +   ++ D+      
Sbjct: 109  AKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAK 168

Query: 1537 ATLADIRNISGLAVNK 1552
            A  A     + LA  K
Sbjct: 169  AAAAAKAKAAALAKQK 184


>gnl|CDD|226447 COG3937, COG3937, Uncharacterized conserved protein [Function
            unknown].
          Length = 108

 Score = 32.8 bits (75), Expect = 0.37
 Identities = 25/138 (18%), Positives = 44/138 (31%), Gaps = 35/138 (25%)

Query: 1254 IMNNEDPNGVGKTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGA 1313
             M            G               + + + +L K GE+  E A    +      
Sbjct: 3    YMGEGLRKLALIGAGLAA-------ETAEKVQKLVDELVKKGELNAEEAKRFVD------ 49

Query: 1314 LELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRL 1373
             +L RQA       KE   E E++I     +      +  +               +  L
Sbjct: 50   -DLLRQA-------KEAQGELEEKIPRKIEEMLSDLEVARQ-------------SEMDEL 88

Query: 1374 NEKITSLEKNLPDL-NEL 1390
             E++ +LE+ + DL N+L
Sbjct: 89   TERVDALERQVADLENKL 106


>gnl|CDD|227608 COG5283, COG5283, Phage-related tail protein [Function unknown].
          Length = 1213

 Score = 35.3 bits (81), Expect = 0.44
 Identities = 27/145 (18%), Positives = 44/145 (30%), Gaps = 26/145 (17%)

Query: 1663 NETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETD 1722
            N+T     +    LKS          S  D     K LEK+  L +   S  + +Y+   
Sbjct: 16   NDTIAVKNI--NVLKS----------SIKDSTQFWKMLEKQQKLTKDGLSASKGKYEGLS 63

Query: 1723 NKLLS--------KAESSGLKRA------RGQMLLNKASQLSVNTTAKLKQLNETEAMFN 1768
              +          K E   + RA        Q    + +Q      +   Q         
Sbjct: 64   EAMEKQKKAYEDLKQEVKEVNRATQASKKAYQEYNAQYTQAENKLRSLSGQFGVASEQLM 123

Query: 1769 SQESELTELSKNIAELQKRIQSCIN 1793
             Q+ E+  L   I+ L K + +   
Sbjct: 124  LQQKEIQRLQYAISTLNKSMAAQAR 148


>gnl|CDD|217869 pfam04065, Not3, Not1 N-terminal domain, CCR4-Not complex component. 
          Length = 233

 Score = 34.2 bits (79), Expect = 0.49
 Identities = 34/177 (19%), Positives = 59/177 (33%), Gaps = 53/177 (29%)

Query: 1244 LTTIDDMKKQIMNNEDPNGVGKTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLK---- 1299
            L + D + +++ N            N +Q+  L     +DL  EIKKL +  + +K    
Sbjct: 21   LESFDSIYEKLENAT----------NASQKEKLE----SDLKREIKKLQRLRDQIKTWLS 66

Query: 1300 ----ENATLLQENN--VEGALE---------------------------LTRQAHDTSLK 1326
                ++   L EN   +E A+E                             ++  +    
Sbjct: 67   SSDIKDKDSLLENRKLIETAMERFKAVEKEMKTKAFSKEGLSAASKLDPKEKEKAEVCDF 126

Query: 1327 LKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQ--EENEKALQRLNEKITSLE 1381
            L     E  KQI   E + +       K      + +   E E+ L+R    I+ LE
Sbjct: 127  LSNTIDELNKQIELLEAEVESLSVQRKKRKADAAKQERIAELEELLERHKFHISKLE 183


>gnl|CDD|226371 COG3853, TelA, Uncharacterized protein involved in tellurite
            resistance [Inorganic ion transport and metabolism].
          Length = 386

 Score = 34.7 bits (80), Expect = 0.50
 Identities = 36/143 (25%), Positives = 60/143 (41%), Gaps = 32/143 (22%)

Query: 1244 LTTIDDMKKQIMNNEDPNGVGKTLGNTTQRINLARL-ALNDLTEEIKKLNKTGEMLKENA 1302
            +TTI   K Q+        +  TL    Q+  L    A+N+          T E+L  NA
Sbjct: 268  ITTIPIWKNQVA-------IALTLKR--QKKVLKAQQAVNE---------TTNELLLSNA 309

Query: 1303 TLLQENNVEGALELTRQAHDT-SLKLKEQST-----ETEKQINDAERQCKRTENLVTKTF 1356
             +L+EN++E A +  R   D  +LK  +QS      ET +   +  ++   T        
Sbjct: 310  EMLKENSIEIAKQAERPTIDIETLKKTQQSVVAAIEETSQIQEEGRKKRADTIR------ 363

Query: 1357 PQFTQGQEENEKALQRLNEKITS 1379
             +  + +EE +K L  L + I+ 
Sbjct: 364  -ELDKLEEEIKKKLILLKQNISQ 385



 Score = 33.5 bits (77), Expect = 1.1
 Identities = 40/242 (16%), Positives = 70/242 (28%), Gaps = 66/242 (27%)

Query: 1190 ADQTTKVITDASNIKQTG---ASGAYTKEFELLEKKIEDVKALVEN-------------- 1232
            A    KV TDA   +             E +L  +  +    L +               
Sbjct: 11   ATINDKV-TDADLPEPENSEIVQKEKKAERDLQLENTKKADELAKQIDVSNQQSLLDFGA 69

Query: 1233 -----TTRSSHDL--TVMLTTIDDMKK------QIMNNEDPN---------------GVG 1264
                     S+ +   V +T + D+ K       I+   DP+                  
Sbjct: 70   EAQEKAAAFSNRMLNEVKITDVGDLGKMLEQLMFILKRFDPDELPSDKKGFLTKIFGRSK 129

Query: 1265 KTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENATL--LQENNVEGALELTRQAHD 1322
             ++     +       ++ + E + K  K  E+ ++N  L  L E N E    L +    
Sbjct: 130  SSIQEIFSKYQTIGAQIDRIIESLSK-GKD-ELTRDNKMLELLYEKNREYFEHLEKYIAA 187

Query: 1323 TSLKLKEQSTET---EKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITS 1379
              LK +E  TE     K   ++  Q                Q   E    + RL +++  
Sbjct: 188  GELKDEELETEIIPELKTKAESGNQMD-------------VQQVNELTLFINRLEQRVYD 234

Query: 1380 LE 1381
            L 
Sbjct: 235  LL 236


>gnl|CDD|238453 cd00890, Prefoldin, Prefoldin is a hexameric molecular chaperone
            complex, found in both eukaryotes and archaea, that binds
            and stabilizes newly synthesized polypeptides allowing
            them to fold correctly.  The complex contains two alpha
            and four beta subunits, the two subunits being
            evolutionarily related. In archaea, there is usually only
            one gene for each subunit while in eukaryotes there two
            or more paralogous genes encoding each subunit adding
            heterogeneity to the structure of the hexamer. The
            structure of the complex consists of a double beta barrel
            assembly with six protruding coiled-coils.
          Length = 129

 Score = 32.7 bits (75), Expect = 0.51
 Identities = 20/133 (15%), Positives = 46/133 (34%), Gaps = 17/133 (12%)

Query: 1558 PEQIRERAN----EINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTV 1613
             +++  +      ++  + + L  ++  LTE          LK+     +          
Sbjct: 1    LQELAAQLQQLQQQLEALQQQLQKLEAQLTEYEKAKETLETLKKAEEEKELLVPLGAGLF 60

Query: 1614 QK---------IVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNE 1664
             K         +VD+ T     + + E AI+  K  +    K + ++   L+  Q +  E
Sbjct: 61   VKAEVKDDDKVLVDLGTGVYVEKSLEE-AIEFLKKRLETLEKQIEKLEKQLEKLQDQITE 119

Query: 1665 TNVKVKLLQERLK 1677
               +   LQ+  +
Sbjct: 120  LQEE---LQQLQQ 129


>gnl|CDD|222160 pfam13476, AAA_23, AAA domain. 
          Length = 204

 Score = 33.7 bits (77), Expect = 0.58
 Identities = 31/146 (21%), Positives = 48/146 (32%), Gaps = 16/146 (10%)

Query: 1260 PNGVGKT---------LGNTTQRINLARLALNDLTE-EIKKLNKTGEMLKENATLLQENN 1309
            PNG GKT         L   T R+  ++       + EI+K     +   E  T      
Sbjct: 27   PNGSGKTTILDAIRWALYGKTSRLKKSKGRGIVKGDIEIEKDEGKKKTYVE-ITFENNKG 85

Query: 1310 VEGALELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQ----GQEE 1365
                  +      T  K K+      + +   E Q +  + L+             GQE 
Sbjct: 86   KLKLRLIEESRELTKKKGKKVKKSILEIVEIDELQ-EFIDELLKSDKEILPLLLYLGQER 144

Query: 1366 NEKALQRLNEKITSLEKNLPDLNELI 1391
             E+   +  EK   LE+   +L EL 
Sbjct: 145  LEELKFKRKEKKERLEELEKELEELE 170


>gnl|CDD|237582 PRK14011, PRK14011, prefoldin subunit alpha; Provisional.
          Length = 144

 Score = 32.8 bits (75), Expect = 0.61
 Identities = 33/170 (19%), Positives = 57/170 (33%), Gaps = 51/170 (30%)

Query: 1552 KNIQSNPEQIRERANEINNVIKSL----TDIDTILTETAGDLAKANDLKRKANLTKAGAD 1607
            + +Q     I     E+   I+S+    T  + ++    G   KA               
Sbjct: 20   QKLQEELSSIDMMKMELLKSIESMEGLKTSEEILIPLGPGAFLKA--------------- 64

Query: 1608 STKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNV 1667
                   KIVD            + AI     DI    KD+S++  D     +KS E   
Sbjct: 65   -------KIVD-----------PDKAILGVGSDI-YLEKDVSEVIEDF----KKSVEELD 101

Query: 1668 KVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSR 1717
            K K  +E  K I+           ++    K LEK     +++ + ++ +
Sbjct: 102  KTK--KEGNKKIE-------ELNKEITKLRKELEKRAQAIEQRQAQMKPK 142


>gnl|CDD|148617 pfam07111, HCR, Alpha helical coiled-coil rod protein (HCR).  This
            family consists of several mammalian alpha helical
            coiled-coil rod HCR proteins. The function of HCR is
            unknown but it has been implicated in psoriasis in humans
            and is thought to affect keratinocyte proliferation.
          Length = 739

 Score = 34.6 bits (79), Expect = 0.64
 Identities = 28/112 (25%), Positives = 55/112 (49%), Gaps = 11/112 (9%)

Query: 1265 KTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTS 1324
            ++L +   ++  AR    + TEE   L +          L Q+  V G   L  +  +  
Sbjct: 535  ESLASAGAQLEAARAGQQESTEEAASLRQE---------LTQQQEVYGQA-LQEKVAEVE 584

Query: 1325 LKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEK 1376
             +L+EQ ++TE+++N+A R+  +    + +   Q  Q +E N++ L+RL E+
Sbjct: 585  TRLREQLSDTERRLNEARREHAKAVVSLRQIQRQAAQEKERNQE-LRRLQEE 635


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This family
            includes outer membrane proteins such as OmpH among
            others. Skp (OmpH) has been characterized as a molecular
            chaperone that interacts with unfolded proteins as they
            emerge in the periplasm from the Sec translocation
            machinery.
          Length = 140

 Score = 32.6 bits (75), Expect = 0.73
 Identities = 17/108 (15%), Positives = 42/108 (38%), Gaps = 11/108 (10%)

Query: 1694 ENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNT 1753
            E E K  + E+   +K+   L+ + Q  D   LS+A     +  + + L  K  +     
Sbjct: 24   EKEFKKRQAELEKLEKELQKLKEKLQ-KDAATLSEAA----REKKEKELQKKVQEFQRKQ 78

Query: 1754 TAKLKQLNETEA-MFNSQESELTELSKNIAELQKRIQSCINFIVDKSN 1800
                + L + +         ++ +  K +A+ +       + ++D++ 
Sbjct: 79   QKLQQDLQKRQQEELQKILDKINKAIKEVAKKKG-----YDLVLDRNA 121


>gnl|CDD|149682 pfam08702, Fib_alpha, Fibrinogen alpha/beta chain family.  Fibrinogen
            is a protein involved in platelet aggregation and is
            essential for the coagulation of blood. This domain forms
            part of the central coiled coiled region of the protein
            which is formed from two sets of three non-identical
            chains (alpha, beta and gamma).
          Length = 146

 Score = 32.7 bits (75), Expect = 0.79
 Identities = 9/42 (21%), Positives = 18/42 (42%)

Query: 1645 RKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQN 1686
             K +  + N LD     ++  +  VK +++ L+  Q     N
Sbjct: 35   DKRIEDLENLLDQLANSTSSAHQYVKHIKDSLRGDQKQAQPN 76


>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein.  Taxilin
            contains an extraordinarily long coiled-coil domain in
            its C-terminal half and is ubiquitously expressed. It is
            a novel binding partner of several syntaxin family
            members and is possibly involved in Ca2+-dependent
            exocytosis in neuroendocrine cells. Gamma-taxilin,
            described as leucine zipper protein Factor Inhibiting
            ATF4-mediated Transcription (FIAT), localises to the
            nucleus in osteoblasts and dimerises with ATF4 to form
            inactive dimers, thus inhibiting ATF4-mediated
            transcription.
          Length = 309

 Score = 33.5 bits (77), Expect = 0.82
 Identities = 39/166 (23%), Positives = 82/166 (49%), Gaps = 24/166 (14%)

Query: 1644 ARKDLS---QIT-NDLDDAQQKSNETNVKVK----LLQERLKSIQAGFLQNGRSAMDVEN 1695
             RK+L+   Q T  D+ D  ++ +  N K+      L+E+LK +   +    +   +   
Sbjct: 104  KRKELTEKFQSTLKDIQDQMEEHSNPNHKLCEENEELREKLKELIEQYELREQH-FEKLL 162

Query: 1696 EEKNLEKEVALA-----QKQASGLRSRY-QETDNKLLSKAESSGLKRARGQMLLNKASQL 1749
            + K+LE+++  A     Q+QA   + ++ +E +  L   A+ S LK    ++     +QL
Sbjct: 163  KTKDLEQQLNDAKLEQAQEQAEQEQEKHKREKEILLKEAAQVSTLKETEKEL----RAQL 218

Query: 1750 SVNTTAKLKQ----LNETEAMFNSQESELTELSKNIAELQKRIQSC 1791
            ++ +  K K+    LN++  +F + + E+ ++SK I +L+K   + 
Sbjct: 219  NLYSD-KFKEFQDTLNKSNEVFLTFKQEMEKMSKKIKKLEKENLAW 263


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 33.9 bits (78), Expect = 0.82
 Identities = 27/96 (28%), Positives = 44/96 (45%), Gaps = 7/96 (7%)

Query: 1696 EEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLS--VNT 1753
            +E   + E AL + +   LR R Q+   +  ++ E   L+R   + L+ K  QL      
Sbjct: 44   KEAEEKLEAALLEAKELLLRERNQQ---RQEARREREELQREEER-LVQKEEQLDARAEK 99

Query: 1754 TAKLK-QLNETEAMFNSQESELTELSKNIAELQKRI 1788
               L+ QL E E   +++E EL EL K +     R+
Sbjct: 100  LDNLENQLEEREKALSARELELEELEKQLDNELYRV 135


>gnl|CDD|153289 cd07605, I-BAR_IMD, Inverse (I)-BAR, also known as the IRSp53/MIM
            homology Domain (IMD), a dimerization module that binds
            and bends membranes.  Inverse (I)-BAR (or IMD) is a
            member of the Bin/Amphiphysin/Rvs (BAR) domain family. It
            is a dimerization and lipid-binding module that bends
            membranes and induces membrane protrusions in the
            opposite direction compared to classical BAR and F-BAR
            domains, which produce membrane invaginations. IMD
            domains are found in Insulin Receptor tyrosine kinase
            Substrate p53 (IRSp53), Missing in Metastasis (MIM), and
            Brain-specific Angiogenesis Inhibitor 1-Associated
            Protein 2-like (BAIAP2L) proteins. These are multi-domain
            proteins that act as scaffolding proteins and transducers
            of a variety of signaling pathways that link membrane
            dynamics and the underlying actin cytoskeleton. Most
            members contain an N-terminal IMD, an SH3 domain, and a
            WASP homology 2 (WH2) actin-binding motif at the
            C-terminus, exccept for MIM which does not carry an SH3
            domain. Some members contain additional domains and
            motifs. The IMD domain binds and bundles actin filaments,
            binds membranes and produces membrane protrusions, and
            interacts with the small GTPase Rac.
          Length = 223

 Score = 33.1 bits (76), Expect = 0.83
 Identities = 42/215 (19%), Positives = 79/215 (36%), Gaps = 52/215 (24%)

Query: 1589 LAKANDLKRKAN-LTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKD 1647
            +      ++    L++A      + + KI ++ +++R +Q++ E   Q     I    K 
Sbjct: 26   IKAGKKYQKALQALSQAAKVFF-DALAKIGELASQSRGSQELGEALKQ-----IVDTHKS 79

Query: 1648 LSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALA 1707
            +           + S E   K     E +  +                 EK LE +    
Sbjct: 80   I-----------EASLEQVAK-AFHGELILPL-----------------EKKLELD---- 106

Query: 1708 QKQASGLRSRYQETDNKLLSKAESSGLKRARGQML-LNKASQLSVNTTAKLKQLNETEAM 1766
            QK  +     Y+  + K     +   L +AR ++  L K SQ    T    ++L++    
Sbjct: 107  QKVINKFEKDYK-KEYK----QKREDLDKARSELKKLQKKSQ-KSGTGKYQEKLDQALEE 160

Query: 1767 FNSQESELTELSKNI---AELQKRIQSCINFIVDK 1798
             N ++ EL          A L++R + C  F+VDK
Sbjct: 161  LNDKQKELEAFVSQGLRDALLEERRRYC--FLVDK 193


>gnl|CDD|112039 pfam03207, OspD, Borrelia outer surface protein D (OspD). 
          Length = 264

 Score = 33.3 bits (75), Expect = 0.87
 Identities = 47/205 (22%), Positives = 87/205 (42%), Gaps = 22/205 (10%)

Query: 1424 SNSAKDYAETSKKL--INEKESKAEELRKHMIILSESNSAKDYAETSKKLINEKEAKAEE 1481
            +NS K+   T+  L   NE     E +   + ++S   SA D  + ++K +++    AE 
Sbjct: 69   TNSLKEAKNTTDNLNASNEANKVVEAVINAVNLIS---SAADQVKGAQKNMHDLAQMAEI 125

Query: 1482 VFRGITTAKQESHAANILAKEAYDHAASV-RNKTAAY--------VASTSNITKQLDEFL 1532
                I  +  ++  A  LAKEAY+   +  +N    Y          S S+  +Q+ + +
Sbjct: 126  DLEKIKESSDKAIFAANLAKEAYNLTKAAEQNMQKLYKEQEEILESESDSDEIEQVSDEI 185

Query: 1533 NAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKA 1592
                A +   +    +A    +++  E I     ++ N +K    +D I TETA +  K 
Sbjct: 186  KQDSAEIKQAKEAVEIAWKATVEAKDELI-----DVENAVKEA--LDKIKTETANN-TKL 237

Query: 1593 NDLKRKANLTKAGADSTKNTVQKIV 1617
             D++  A L    A +     Q++V
Sbjct: 238  ADIEEAAELVLQIAKNAAEIAQEVV 262


>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA binding. 
            This is the middle region of a family of TATA element
            modulatory factor 1 proteins conserved in eukaryotes that
            contains at its N-terminal section a number of leucine
            zippers that could potentially form coiled coil
            structures. The whole proteins bind to the TATA element
            of some RNA polymerase II promoters and repress their
            activity. by competing with the binding of TATA binding
            protein. TMFs are evolutionarily conserved golgins that
            bind Rab6, a ubiquitous ras-like GTP-binding Golgi
            protein, and contribute to Golgi organisation in animal
            and plant cells.
          Length = 74

 Score = 30.7 bits (70), Expect = 0.93
 Identities = 18/70 (25%), Positives = 29/70 (41%), Gaps = 2/70 (2%)

Query: 1719 QETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQ-LNETEAMFNSQESELTEL 1777
               + KL  K E        G+ L  K  + + NT  KL+    E E      + +L +L
Sbjct: 1    SSLEKKLAEKDEQIAQLMEEGEKLSKKELKHN-NTIKKLRAKNKELEKEIAELKKKLEKL 59

Query: 1778 SKNIAELQKR 1787
             K +  L++R
Sbjct: 60   EKELENLEER 69


>gnl|CDD|163506 TIGR03794, NHLM_micro_HlyD, NHLM bacteriocin system secretion
            protein.  Members of this protein family are homologs of
            the HlyD membrane fusion protein of type I secretion
            systems. Their occurrence in prokaryotic genomes is
            associated with the occurrence of a novel class of
            microcin (small bacteriocins) with a leader peptide
            region related to nitrile hydratase. We designate the
            class of bacteriocin as Nitrile Hydratase Leader
            Microcin, or NHLM. This family, therefore, is designated
            as NHLM bacteriocin system secretion protein. Some but
            not all NHLM-class putative microcins belong to the TOMM
            (thiazole/oxazole modified microcin) class as assessed by
            the presence of the scaffolding protein and/or
            cyclodehydratase in the same gene clusters [Transport and
            binding proteins, Amino acids, peptides and amines,
            Cellular processes, Biosynthesis of natural products].
          Length = 421

 Score = 33.7 bits (77), Expect = 1.0
 Identities = 22/132 (16%), Positives = 42/132 (31%), Gaps = 17/132 (12%)

Query: 1267 LGNTTQRINLARLALNDLTEEIKKLNKTGEM-LKENATLLQENNVEGALELTRQAHDTSL 1325
            L  T  R+     AL+    + + L   G    K +  L Q+   E        A D + 
Sbjct: 137  LEETIGRLREELAALSREVGKQRGLLSRGLATFKRDRILQQQWREEQE---KYDAADKAR 193

Query: 1326 KLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLP 1385
             +    T+ +++           E ++        Q     EK L+ +  +I      + 
Sbjct: 194  AIYALQTKADERNL---------ETVLQSLSQADFQLAGVAEKELETVEARIKEARYEIE 244

Query: 1386 DLNELICDKRGD 1397
            +L     +K   
Sbjct: 245  ELE----NKLNL 252


>gnl|CDD|218602 pfam05478, Prominin, Prominin.  The prominins are an emerging family
            of proteins that among the multispan membrane proteins
            display a novel topology. Mouse prominin and human
            prominin (mouse)-like 1 (PROML1) are predicted to contain
            five membrane spanning domains, with an N-terminal domain
            exposed to the extracellular space followed by four,
            alternating small cytoplasmic and large extracellular,
            loops and a cytoplasmic C-terminal domain. The exact
            function of prominin is unknown although in humans
            defects in PROM1, the gene coding for prominin, cause
            retinal degeneration.
          Length = 807

 Score = 33.8 bits (78), Expect = 1.0
 Identities = 32/157 (20%), Positives = 59/157 (37%), Gaps = 31/157 (19%)

Query: 1607 DSTKNTVQKIVDVLT--EARTAQDMAEVAIQTAKDDISA-----------ARKDLSQITN 1653
            +S K  +  I  +LT    R  +D+A   +    DDI+               +L+    
Sbjct: 576  ESLKVNLSTI-TLLTPEARRNLEDLASSGL----DDINFSNFLKQIQKPLTEVNLNSFAE 630

Query: 1654 DLDDAQQKSNETNVKVKLLQE--RLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQA 1711
            +LD          +KV L  E   L+SIQ  F+    S ++       L   V    + +
Sbjct: 631  ELDALANNLPNGKLKVALKNEALDLRSIQQNFVPPMESLLE------KLNSNVKTLDESS 684

Query: 1712 SGLRSRYQETDNKLLSKAESS-GLKRARGQMLLNKAS 1747
            + L    Q     +L++ +++ G        ++N+ S
Sbjct: 685  NDL----QNAVKDVLAEVQAAQGFLNNNASEIVNEES 717


>gnl|CDD|202101 pfam02050, FliJ, Flagellar FliJ protein. 
          Length = 122

 Score = 31.8 bits (73), Expect = 1.1
 Identities = 18/94 (19%), Positives = 41/94 (43%), Gaps = 11/94 (11%)

Query: 1628 DMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQ-ERLKSIQAGFLQN 1686
            D A   +  A+ ++  A + L ++     + +Q+ +      + +    L++ QA F+  
Sbjct: 1    DKAARNLAEAQQELEQAEEKLEELQQYRLEYRQQLSG---SGQGISAAELRNYQA-FISA 56

Query: 1687 GRSAMDVENEEKNLEKEVALAQKQASGLRSRYQE 1720
               A+         ++E+  A+KQ    R ++QE
Sbjct: 57   LDEAIA------QQQQELEQAEKQVEQAREQWQE 84


>gnl|CDD|217789 pfam03915, AIP3, Actin interacting protein 3. 
          Length = 424

 Score = 33.5 bits (77), Expect = 1.1
 Identities = 37/195 (18%), Positives = 78/195 (40%), Gaps = 10/195 (5%)

Query: 1469 KKLINEKEAKAEEVFRGITTA--KQESHAANILAKEAYDHAASVRNKTAAYVASTS-NIT 1525
            KK I++   +  E  + +  A   Q+S     L++   + A  V    AA   S+  N +
Sbjct: 80   KKHIDDSLGELSEKIKSLKQALDDQQSTILQRLSELQQNTAKEVARPAAAPPPSSLPNSS 139

Query: 1526 KQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTET 1585
               D    A    L  +R    LAV +   S+     E    I+++++ +    +     
Sbjct: 140  SSPDSINPAGLKELQSLRR--ELAVLRQTHSD--FKSEIQASISSILEKINAFKSASLNA 195

Query: 1586 AGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAAR 1645
            +G   +A     K  L+   +DS    V  + D++   R  +D+A+  ++     +   +
Sbjct: 196  SGSSNRAYMESSKKKLS-EDSDSLLTKVDDLQDIIEALR--KDVAQRGVRPGPKQLETVQ 252

Query: 1646 KDLSQITNDLDDAQQ 1660
            K++ +   +L   ++
Sbjct: 253  KEIQKAEKELKKMEE 267


>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
          Length = 535

 Score = 33.7 bits (77), Expect = 1.1
 Identities = 27/97 (27%), Positives = 49/97 (50%), Gaps = 9/97 (9%)

Query: 1692 DVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSV 1751
              + E K L+KE+ L  K+ +  R   +E + +   K+E   LK+   + L  +A+ L  
Sbjct: 61   TAKRESKALKKELLLEAKEEA--RKYREEIEQEF--KSERQELKQIESR-LTERATSLD- 114

Query: 1752 NTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRI 1788
                K + L+  E    S+E  LT+ SK+I E ++++
Sbjct: 115  ---RKDENLSSKEKTLESKEQSLTDKSKHIDEREEQV 148


>gnl|CDD|179335 PRK01773, hscB, co-chaperone HscB; Provisional.
          Length = 173

 Score = 32.4 bits (74), Expect = 1.1
 Identities = 33/131 (25%), Positives = 58/131 (44%), Gaps = 30/131 (22%)

Query: 1621 TEARTA-QDMAEV--AIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQ---- 1673
             E R A Q  AEV  A+Q  KD I  A   ++  T +  + ++KS +      L+Q    
Sbjct: 43   QEQRLAMQKSAEVNDALQILKDPILRAEAIIALNTGEQQNLEEKSTQD--MAFLMQQMEW 100

Query: 1674 -ERLKSIQ--------AGFLQNGRSAMDVENEEKNLEKEV--ALAQKQ---ASGLRSRYQ 1719
             E+L+ I+          F +      +++ E++ +  E+  AL  +Q   AS +  R +
Sbjct: 101  REQLEEIEQQQDEDALTAFSK------EIKQEQQAILTELSTALNSQQWQQASQINDRLR 154

Query: 1720 ETDNKLLSKAE 1730
                KL+ + E
Sbjct: 155  FI-KKLIIEIE 164


>gnl|CDD|145995 pfam03148, Tektin, Tektin family.  Tektins are cytoskeletal proteins.
            They have been demonstrated in such cellular sites as
            centrioles, basal bodies, and along ciliary and flagellar
            doublet microtubules. Tektins form unique protofilaments,
            organised as longitudinal polymers of tektin heterodimers
            with axial periodicity matching tubulin. Tektin
            polypeptides consist of several alpha-helical regions
            that are predicted to form coiled coils. Indeed, tektins
            share considerable structural similarities with
            intermediate filament proteins. Possible functional roles
            for tektins are: stabilisation of tubulin protofilaments;
            attachment of A and B-tubules in ciliary/flagellar
            microtubule doublets and C-tubules in centrioles; binding
            of axonemal components.
          Length = 384

 Score = 33.3 bits (77), Expect = 1.1
 Identities = 14/58 (24%), Positives = 26/58 (44%), Gaps = 2/58 (3%)

Query: 1328 KEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKAL-QRLNEKITSLEKNL 1384
             +Q  + E Q + AER    +  L+ +T       Q +  K L +R+++ I   +  L
Sbjct: 6    YQQYNQAEAQRDLAERLRHESRRLINETDATTKWTQADVTKKLGERISD-INFWKSEL 62


>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
            recombination, and repair].
          Length = 557

 Score = 33.3 bits (77), Expect = 1.2
 Identities = 50/237 (21%), Positives = 84/237 (35%), Gaps = 47/237 (19%)

Query: 1548 LAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKAND-LKRKANLTKAGA 1606
            L   +   SN E++ E    I N ++ L+  D    +T   L+     L+   +L  +  
Sbjct: 210  LEEERKRLSNSEKLAE---AIQNALELLSGED----DTVSALSLLGRALEALEDL--SEY 260

Query: 1607 DST-KNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNET 1665
            D       + + + L E   A +     ++   D++      L ++   L   +  + + 
Sbjct: 261  DGKLSELAELLEEALYELEEASE----ELRAYLDELEFDPNRLEEVEERLFALKSLARKY 316

Query: 1666 NVKVKLLQERLKSIQAGF--LQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDN 1723
             V ++ L E L  I+     L N       E   + LEKEV   + +             
Sbjct: 317  GVTIEDLLEYLDKIKEELAQLDNS------EESLEALEKEVKKLKAE------------- 357

Query: 1724 KLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMF----NSQESELTE 1776
             LL  AE+  L   R       A +L    TA+LK L   +A F       E   T 
Sbjct: 358  -LLEAAEA--LSAIR----KKAAKELEKEVTAELKALAMEKARFTVELKPLEESPTA 407



 Score = 32.2 bits (74), Expect = 3.0
 Identities = 37/154 (24%), Positives = 60/154 (38%), Gaps = 16/154 (10%)

Query: 1637 AKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENE 1696
            A++    A +   Q   +L+D Q+K  E   +  LLQ +L+ ++   LQ G    ++E E
Sbjct: 155  AQEAYQEAYQAWKQARRELEDLQEKERERAQRADLLQFQLEELEELNLQPGEDE-ELEEE 213

Query: 1697 EKNLEKEVALAQKQASGLRSRYQETDNK-LLSKAESSGLKRARGQMLLNKASQLSVNTTA 1755
             K L     LA+   + L     E D    LS      L RA     L   S+       
Sbjct: 214  RKRLSNSEKLAEAIQNALELLSGEDDTVSALSL-----LGRALEA--LEDLSEYD----- 261

Query: 1756 KLKQLNETEAMFNSQESELTELSKNIAELQKRIQ 1789
               +L+E   +      EL E S+ +      ++
Sbjct: 262  --GKLSELAELLEEALYELEEASEELRAYLDELE 293


>gnl|CDD|184064 PRK13461, PRK13461, F0F1 ATP synthase subunit B; Provisional.
          Length = 159

 Score = 32.3 bits (74), Expect = 1.2
 Identities = 22/68 (32%), Positives = 36/68 (52%), Gaps = 10/68 (14%)

Query: 1436 KLINEKESKAEELR-KHMIILSESNSAKDYAETSKKLINEKEAKAEEVFRGITT-AKQES 1493
            +  +E + KA EL+ K+   L    +AK   E  KK++ E ++KAE V+  I   A +E 
Sbjct: 46   EKADEDQKKARELKLKNERELK---NAK---EEGKKIVEEYKSKAENVYEEIVKEAHEE- 98

Query: 1494 HAANILAK 1501
             A  I+ +
Sbjct: 99   -ADLIIER 105


>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
            [DNA replication, recombination, and repair].
          Length = 417

 Score = 33.1 bits (76), Expect = 1.3
 Identities = 66/288 (22%), Positives = 96/288 (33%), Gaps = 50/288 (17%)

Query: 1529 DEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGD 1588
            +EF   P         I  +   +      EQ +E   + N+V K  ++ D  LT+    
Sbjct: 109  EEFFQDPMLNELLYSKIGKVRYQRLKLRFEEQGKEITRDFNDVFKLNSESD--LTQNPHF 166

Query: 1589 LAKANDL-KRKANLTKAGADSTKNTVQKIVDVLTE-----ARTAQDMAEVAIQTA--KDD 1640
              ++ D    K  L K G      T QK    +         T   M  +   +    D+
Sbjct: 167  TNESFDFELFKKLLVKQGLPRLAIT-QKDKLTIYSLAEKYKLTWDQMLNLIKNSLVNNDE 225

Query: 1641 IS--AARKDLSQI----TNDLDDAQQKSNETNVKVK---LLQERLKSIQAGFLQNGRSAM 1691
            I     RK LSQI     N       KSN  +       L  E  K I            
Sbjct: 226  IDTEELRKLLSQIYKNEHNTKPTLSTKSNPEDFTEANQNLNTESKKVIS----------- 274

Query: 1692 DVENEEKNLEK-------EVALAQKQA-SGLRSRYQETD---NKLLSKAESSGLKRARGQ 1740
            D+E     LEK       ++ L   +    L S     D   N LL   +   LK     
Sbjct: 275  DLEL-TFPLEKLKSIRDGKITLEDLKLVEELISDQNLLDGVINVLL---DYVLLKN---D 327

Query: 1741 MLLNKASQLSV-NTTAKLKQLNETEAMFNSQESELTELSKNIAELQKR 1787
              LNK   L + N  A+ K     EA+  +++ +   L+K  +   KR
Sbjct: 328  SKLNKKYILKIANDWARKKVKTAYEAIKENRKRKELALAKKQSSTTKR 375


>gnl|CDD|148614 pfam07106, TBPIP, Tat binding protein 1(TBP-1)-interacting protein
            (TBPIP).  This family consists of several eukaryotic
            TBP-1 interacting protein (TBPIP) sequences. TBP-1 has
            been demonstrated to interact with the human
            immunodeficiency virus type 1 (HIV-1) viral protein Tat,
            then modulate the essential replication process of HIV.
            In addition, TBP-1 has been shown to be a component of
            the 26S proteasome, a basic multiprotein complex that
            degrades ubiquitinated proteins in an ATP-dependent
            fashion. Human TBPIP interacts with human TBP-1 then
            modulates the inhibitory action of human TBP-1 on
            HIV-Tat-mediated transactivation.
          Length = 169

 Score = 32.3 bits (74), Expect = 1.3
 Identities = 29/130 (22%), Positives = 56/130 (43%), Gaps = 35/130 (26%)

Query: 1609 TKNTVQKIVDVL-----------------------TEARTAQDMAEVAI---------QT 1636
             K  VQK +D L                        E  + +++ ++ +         Q 
Sbjct: 31   GKTAVQKALDELVDEGKIICKEYGKQKIYLCNQDQFELPSDEELNKLDMEIEELREEVQL 90

Query: 1637 AKDDISAARKDLSQITNDL--DDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMD-V 1693
             K D S    +L  +T+DL  ++ Q++  E   +V+ ++E+L+S++ G+       M+ V
Sbjct: 91   LKQDCSTLEIELKSLTSDLTTEELQEEIQELKKEVREIEEKLESLEEGWKPVTPEEMEKV 150

Query: 1694 ENEEKNLEKE 1703
            + E K+L KE
Sbjct: 151  KKEYKDLHKE 160


>gnl|CDD|227635 COG5325, COG5325, t-SNARE complex subunit, syntaxin [Intracellular
            trafficking and secretion].
          Length = 283

 Score = 32.9 bits (75), Expect = 1.3
 Identities = 30/183 (16%), Positives = 72/183 (39%), Gaps = 17/183 (9%)

Query: 1214 KEFELLEKKIEDV-KALVENTTRSSHDLTVMLTTIDDMKKQI--MNNEDPNGVGKTLGNT 1270
            +E   + + I  + K   ++T  S  D +     ID++ K++          +     N 
Sbjct: 45   QELTAVRRSISRLGKVYAKHTEPSFSDKSEKEDEIDELSKKVNQDLQRCEKILKTKYKNL 104

Query: 1271 TQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQ 1330
                  ++L  +  TE ++     G+ +++ +   ++  V  A  L  + +D     +E+
Sbjct: 105  QSSFLQSKLLRDLNTECME-----GQRIQQKSAQFRKYQVLQAKFLRNKNNDQHPLEEEE 159

Query: 1331 STETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNEL 1390
              E+        +Q  + + L  +         E  +  +   +E+I +L + + +LNE+
Sbjct: 160  DEESLSS--LGSQQTLQQQGLSNEEL-------EYQQILITERDEEIKNLARGIYELNEI 210

Query: 1391 ICD 1393
              D
Sbjct: 211  FRD 213


>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D.  BicD
            proteins consist of three coiled-coiled domains and are
            involved in dynein-mediated minus end-directed transport
            from the Golgi apparatus to the endoplasmic reticulum
            (ER). For full functioning they bind with GSK-3beta
            pfam05350 to maintain the anchoring of microtubules to
            the centromere. It appears that amino-acid residues
            437-617 of BicD and the kinase activity of GSK-3 are
            necessary for the formation of a complex between BicD and
            GSK-3beta in intact cells.
          Length = 711

 Score = 33.3 bits (76), Expect = 1.3
 Identities = 81/402 (20%), Positives = 148/402 (36%), Gaps = 74/402 (18%)

Query: 1429 DYAE------TSKKLI-NEKESKAE-ELRKHMI-ILSES----NSAKDYAETSKKLINEK 1475
            DY+E      + +K +   ++S+ E E  KH I  L E     NS  + A   K+ I EK
Sbjct: 98   DYSELEEENISLQKQVSVLRQSQVEFEGLKHEIRRLEEETELLNSQLEEAARLKE-IAEK 156

Query: 1476 EAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAP 1535
            +   EE    + T K E    N L KE   H             S SN++  LD      
Sbjct: 157  QL--EE---ALETLKTEREQKNALRKELSHH------LNLEDFYSLSNLSISLD------ 199

Query: 1536 GATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTIL-TETAGDLAK--- 1591
                       GL  +++ +++ E   +   + N  +     +   L T   G+L     
Sbjct: 200  -----------GLKFSEDPEASTEPNND-GEDENGHLNGGPGLAKSLGTPRKGELFPPAP 247

Query: 1592 -ANDLKRKANLT-----KAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDI---- 1641
              +DL  + NL+     K   +  +     ++  L E +T  + +E A+    + +    
Sbjct: 248  GVSDLFSELNLSEIQKLKQQLEQVEREKASLLANLQELQTQLEQSEGALSEQHEKVNRLT 307

Query: 1642 ---SAAR---KDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVEN 1695
               +A R       Q      +  + S+E     ++     + ++  +        +++ 
Sbjct: 308  EHLNALRNLQASKEQQDGLDSEKDRGSHEDGDYYEVDINGPEILECKYRVAVSEVGELKA 367

Query: 1696 EEKNLEKEVALAQKQAS----GLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSV 1751
            E K L+      Q++       LR+  Q    KLLS  +SS   + R   L     +  +
Sbjct: 368  ELKALKARYNELQERYEDEKDRLRNEIQNLAEKLLSLEKSSHEDQERSGDL-----EKEL 422

Query: 1752 NTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCIN 1793
                 +    E++   N+ + EL   S+ +A+L   +  C N
Sbjct: 423  RKLTHVAG--ESQGSLNAAQDELVTFSEELAQLYHHVCMCNN 462


>gnl|CDD|235558 PRK05683, flgK, flagellar hook-associated protein FlgK; Validated.
          Length = 676

 Score = 33.5 bits (77), Expect = 1.3
 Identities = 19/53 (35%), Positives = 28/53 (52%), Gaps = 4/53 (7%)

Query: 1740 QMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELT----ELSKNIAELQKRI 1788
            Q+LL +A  LS    +   QLN+  +  NSQ S +T     L+ +IA   K+I
Sbjct: 130  QLLLTQAQGLSKRFNSLSSQLNQQNSNINSQLSAMTDQVNNLTTSIASYNKQI 182


>gnl|CDD|219780 pfam08286, Spc24, Spc24 subunit of Ndc80.  Spc24 is a component of
            the evolutionarily conserved kinetochore-associated Ndc80
            complex and is involved in chromosome segregation.
          Length = 117

 Score = 31.5 bits (72), Expect = 1.3
 Identities = 17/45 (37%), Positives = 22/45 (48%), Gaps = 1/45 (2%)

Query: 1271 TQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALE 1315
             ++  LA+  LNDL  E+  L    E LKE    L+E  VEG   
Sbjct: 6    NEKFRLAK-ELNDLESELNNLQSELEELKERLEELEEQEVEGDEV 49


>gnl|CDD|219934 pfam08614, ATG16, Autophagy protein 16 (ATG16).  Autophagy is a
            ubiquitous intracellular degradation system for
            eukaryotic cells. During autophagy, cytoplasmic
            components are enclosed in autophagosomes and delivered
            to lysosomes/vacuoles. ATG16 (also known as Apg16) has
            been shown to be bind to Apg5 and is required for the
            function of the Apg12p-Apg5p conjugate in the yeast
            autophagy pathway.
          Length = 194

 Score = 32.5 bits (74), Expect = 1.4
 Identities = 19/70 (27%), Positives = 29/70 (41%)

Query: 1719 QETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELS 1778
            Q  D   L K E+  L+  + +   + +S  S +       + E E        ELTEL 
Sbjct: 28   QLADRTRLLKVENDALQAEKYEQQSSHSSSPSADGPGSDAAIAEMEQKLAKLREELTELH 87

Query: 1779 KNIAELQKRI 1788
            K   EL +R+
Sbjct: 88   KKRGELAQRL 97



 Score = 32.1 bits (73), Expect = 1.7
 Identities = 25/111 (22%), Positives = 43/111 (38%), Gaps = 4/111 (3%)

Query: 1284 LTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQINDAER 1343
            L  E   L       + + +     +  G+     +      KL+E+ TE  K     E 
Sbjct: 36   LKVENDALQAEKYEQQSSHSSSPSADGPGSDAAIAEMEQKLAKLREELTELHK--KRGEL 93

Query: 1344 QCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDK 1394
               +   L+     Q  +  ++ EK +  L  +ITSLE  + DL E + +K
Sbjct: 94   A--QRLLLLNDELEQLRREIQQLEKTIAELRSEITSLETEIRDLREELQEK 142



 Score = 31.7 bits (72), Expect = 2.6
 Identities = 19/101 (18%), Positives = 37/101 (36%), Gaps = 19/101 (18%)

Query: 1281 LNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQIND 1340
            L  L EE+ +L+K    L +   LL +   +   E+ +        + E       +I  
Sbjct: 76   LAKLREELTELHKKRGELAQRLLLLNDELEQLRREIQQLEK----TIAEL----RSEITS 127

Query: 1341 AERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLE 1381
             E + +     +           +E EK  + L +++ SL 
Sbjct: 128  LETEIRDLREEL-----------QEKEKDNETLQDELISLN 157


>gnl|CDD|218979 pfam06295, DUF1043, Protein of unknown function (DUF1043).  This
            family consists of several hypothetical bacterial
            proteins of unknown function.
          Length = 128

 Score = 31.7 bits (72), Expect = 1.4
 Identities = 15/56 (26%), Positives = 25/56 (44%), Gaps = 7/56 (12%)

Query: 1430 YAETSKKLINEKESKAEELRKHMIILSESNSA-------KDYAETSKKLINEKEAK 1478
             A++SK L+ E  ++     K +    ESN         +DY+E +  L   K+ K
Sbjct: 73   MAKSSKFLLPELPAQDNPFIKKLSKNEESNENEEVKEQPRDYSEGASGLFRNKKKK 128


>gnl|CDD|219186 pfam06810, Phage_GP20, Phage minor structural protein GP20.  This
            family consists of several phage minor structural protein
            GP20 sequences of around 180 residues in length. The
            function of this family is unknown.
          Length = 156

 Score = 31.9 bits (73), Expect = 1.4
 Identities = 27/130 (20%), Positives = 54/130 (41%), Gaps = 18/130 (13%)

Query: 1264 GKTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDT 1323
            GK + N   +++ A    + L +++K  +K  E LK+     +  + E   EL ++    
Sbjct: 12   GKDIQNPKSKLDKANEERDSLKKQLKNRDKQIEDLKK-----KVKDNE---ELQKKIEKL 63

Query: 1324 SLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKALQRL--NEKITSLE 1381
              + K    E E ++ D      +  N +     +    +  N KA++ L   +K+   +
Sbjct: 64   KQQNKTAKEEYEAKLAD-----TKLNNAIELALKKA---KARNAKAVKALLDKDKLKLDD 115

Query: 1382 KNLPDLNELI 1391
              L  L+E I
Sbjct: 116  DGLKGLDEQI 125


>gnl|CDD|222790 PHA00430, PHA00430, tail fiber protein.
          Length = 568

 Score = 33.3 bits (76), Expect = 1.4
 Identities = 29/130 (22%), Positives = 52/130 (40%), Gaps = 7/130 (5%)

Query: 1422 SESNSAKDYAETSKKLINEKESKAEELRKHMI-ILSESNSAKDYAETSKKLINEKEAKAE 1480
            +E++ A++ AE          +  ++ R       SE+N  K YA++    +   + +AE
Sbjct: 173  NEADRARNQAERFNNESGASATNTKQWRSEADGSNSEANRFKGYADSMTSSVEAAKGQAE 232

Query: 1481 EVFRGITTAKQESHAANILAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLNAPGATLA 1540
               +   TA   +  A   A  A+    +  N   A   S +   +Q D        T A
Sbjct: 233  SSSKEANTAGDYATKAAASASAAHASEVNAANSATAAATSANRAKQQADR-----AKTEA 287

Query: 1541 D-IRNISGLA 1549
            D + N++G A
Sbjct: 288  DKLGNMNGFA 297


>gnl|CDD|214016 cd12923, iSH2_PI3K_IA_R, Inter-Src homology 2 (iSH2) helical domain
            of Class IA Phosphoinositide 3-kinase Regulatory
            subunits.  PI3Ks catalyze the transfer of the
            gamma-phosphoryl group from ATP to the 3-hydroxyl of the
            inositol ring of D-myo-phosphatidylinositol (PtdIns) or
            its derivatives. They play an important role in a variety
            of fundamental cellular processes, including cell
            motility, the Ras pathway, vesicle trafficking and
            secretion, immune cell activation, and apoptosis. They
            are classified according to their substrate specificity,
            regulation, and domain structure. Class IA PI3Ks are
            heterodimers of a p110 catalytic (C) subunit and a
            p85-related regulatory (R) subunit. The R subunit
            down-regulates PI3K basal activity, stabilizes the C
            subunit, and plays a role in the activation downstream of
            tyrosine kinases. All R subunits contain two SH2 domains
            that flank an intervening helical domain (iSH2), which
            binds to the N-terminal adaptor-binding domain (ABD) of
            the catalytic subunit. In vertebrates, there are three
            genes (PIK3R1, PIK3R2, and PIK3R3) that encode for
            different Class IA PI3K R subunits.
          Length = 152

 Score = 31.8 bits (73), Expect = 1.5
 Identities = 19/92 (20%), Positives = 31/92 (33%), Gaps = 22/92 (23%)

Query: 1714 LRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAK---LKQLNETEAMFNSQ 1770
            L  + +E + + L K+            L  K ++LS     K   L+   E   MF  Q
Sbjct: 6    LAKKLKEINKEYLDKSR-------EYDELYEKYNKLSQEIQLKRQALEAFEEAVKMFEEQ 58

Query: 1771 ------------ESELTELSKNIAELQKRIQS 1790
                          E   L +N   L+ R++ 
Sbjct: 59   LRTQEKFQKEAQPHEKQRLMENNELLKSRLKE 90


>gnl|CDD|129130 TIGR00019, prfA, peptide chain release factor 1.  This model
            describes peptide chain release factor 1 (PrfA, RF-1),
            and excludes the related peptide chain release factor 2
            (PrfB, RF-2). RF-1 helps recognize and terminate
            translation at UAA and UAG stop codons. The mitochondrial
            release factors are prfA-like, although not included
            above the trusted cutoff for this model. RF-1 does not
            have a translational frameshift [Protein synthesis,
            Translation factors].
          Length = 360

 Score = 33.1 bits (76), Expect = 1.5
 Identities = 28/119 (23%), Positives = 51/119 (42%), Gaps = 21/119 (17%)

Query: 1703 EVALAQKQASGLRSRYQETDNKLLSKAE-SSGLKRARGQMLLNKASQLS-VNTTAK---- 1756
            + +L +K  S L  RY+E    LLS  E  S   + R   L  + SQL  +    +    
Sbjct: 2    KPSLLEKLES-LLERYEEL-EALLSDPEVISDQDKLRK--LSKEYSQLEEIVDCYREYQQ 57

Query: 1757 -LKQLNETEAMFNSQ--------ESELTELSKNIAELQKRIQSCINFIVDKSNNYKNCV 1806
              + + E + +            + EL EL + I EL+++++  +  +    N+ KN +
Sbjct: 58   AQEDIKEAKEILEESDPEMREMAKEELEELEEKIEELEEQLK--VLLLPKDPNDEKNVI 114


>gnl|CDD|213765 TIGR03060, PS_II_psb29, photosystem II biogenesis protein Psp29.
            Psp29, originally designated sll1414 in Synechocystis
            6803, is found universally in Cyanobacteria and in
            Arabidopsis. It was isolated and partially sequenced from
            purified photosystem II (PS II) in Synechocystis. While
            its function is unknown, mutant studies show an
            impairment in photosystem II biogenesis and/or stability,
            rather than in PS II core function [Energy metabolism,
            Photosynthesis].
          Length = 214

 Score = 32.3 bits (74), Expect = 1.5
 Identities = 28/132 (21%), Positives = 54/132 (40%), Gaps = 19/132 (14%)

Query: 1557 NPEQIRERANEINNVIKSLT--DIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQ 1614
            +PEQ+RE A ++    K     +I + LT+ A       D  +          S    + 
Sbjct: 87   DPEQLREDAKQLLEQAKGKGLDEILSWLTQ-ANLSNGGGDTLQGIAGRHKFKYSRLFAI- 144

Query: 1615 KIVDVLTEARTAQDMAEVAIQTAKDDISAA--------RKDLSQITNDLDDAQQKSNETN 1666
             +  +L EA   +D+ E  +     ++S A         KDL    ++L+  +Q      
Sbjct: 145  GLYSLLEEAAPDKDIDEEDLNEILKELSEALGLSYDRVEKDLDLYKSNLEKMKQ------ 198

Query: 1667 VKVKLLQERLKS 1678
              ++L++E L++
Sbjct: 199  -ALELMEETLEA 209


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 33.0 bits (76), Expect = 1.5
 Identities = 47/298 (15%), Positives = 87/298 (29%), Gaps = 39/298 (13%)

Query: 1456 SESNSAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILAKEAYDHAASVRNKTA 1515
            +E  + ++  +  KKL  + ++K       I  A +          +     + +   T 
Sbjct: 8    AELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTD 67

Query: 1516 AYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSL 1575
                S     K       A     A  +    L  +K  +      ++        I  L
Sbjct: 68   DATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVL 127

Query: 1576 TDIDTILTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQ 1635
               D    +   D    +D+    +      D   + V    +   E           ++
Sbjct: 128  NQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKE--------AKELE 179

Query: 1636 TAKDDISAARKDLSQITNDLDDAQQKSNET----NVKVKLLQERLKSIQAGFLQNGRSAM 1691
               DD      +       L  A++ +  T     VK       LK I       G+  +
Sbjct: 180  KLSDDDDFVWDEDDSE--ALRQARKDAKLTATADPVK-----AYLKQI-------GKVKL 225

Query: 1692 DVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGL-------KRARGQML 1742
                   N E+EV LA++  +GL +     + + L       L       KRA+  +L
Sbjct: 226  ------LNAEEEVELAKRIEAGLYAEELLAEGEKLDPELRRDLQWIGRDGKRAKNHLL 277


>gnl|CDD|128778 smart00502, BBC, B-Box C-terminal domain.  Coiled coil region
            C-terminal to (some) B-Box domains.
          Length = 127

 Score = 31.5 bits (72), Expect = 1.7
 Identities = 21/113 (18%), Positives = 37/113 (32%), Gaps = 25/113 (22%)

Query: 1691 MDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLS-----KAESSGLKRA---RGQML 1742
              +E     L K+ A  +     L S  QE +          KA    L+ A   R + L
Sbjct: 3    EALEELLTKLRKKAAELEDALKQLISIIQEVEENAADVEAQIKAAFDELRNALNKRKKQL 62

Query: 1743 LNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCINFI 1795
            L       +    + K               L +  +++ + Q+++   INF 
Sbjct: 63   LED-----LEEQKENKLKV------------LEQQLESLTQKQEKLSHAINFT 98


>gnl|CDD|223294 COG0216, PrfA, Protein chain release factor A [Translation, ribosomal
            structure and biogenesis].
          Length = 363

 Score = 33.0 bits (76), Expect = 1.7
 Identities = 24/93 (25%), Positives = 38/93 (40%), Gaps = 21/93 (22%)

Query: 1714 LRSRYQETDNKLLSKAE-SSGLKRARGQMLLNKA-SQLS--VNTTAKLKQ----LNETEA 1765
            L  RY+E    LLS  E  S     R    L+K  ++L   V    + K+    L + + 
Sbjct: 12   LLERYEEL-EALLSDPEVISDPDEYRK---LSKEYAELEPIVEKYREYKKAQEDLEDAKE 67

Query: 1766 MFNSQ---------ESELTELSKNIAELQKRIQ 1789
            M   +         E E+ EL   I EL++ ++
Sbjct: 68   MLAEEKDPEMREMAEEEIKELEAKIEELEEELK 100


>gnl|CDD|218625 pfam05531, NPV_P10, Nucleopolyhedrovirus P10 protein.  This family
            consists of several nucleopolyhedrovirus P10 proteins
            which are thought to be involved in the morphogenesis of
            the polyhedra.
          Length = 75

 Score = 30.1 bits (68), Expect = 1.8
 Identities = 17/63 (26%), Positives = 28/63 (44%), Gaps = 7/63 (11%)

Query: 1619 VLTEARTAQDMAEV-----AIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQ 1673
            +L + R   D+  V     A+QT  DD+ A   D++++   LD    +      KV  +Q
Sbjct: 5    ILLQIR--ADIKAVDDKVDALQTQVDDLRANLPDVTELNEKLDAQSAQLTTLQTKVTNIQ 62

Query: 1674 ERL 1676
              L
Sbjct: 63   SIL 65


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This family
            of bacterial proteins has no known function. The proteins
            are in the region of 500-600 amino acid residues in
            length.
          Length = 450

 Score = 32.6 bits (75), Expect = 1.8
 Identities = 15/73 (20%), Positives = 36/73 (49%), Gaps = 8/73 (10%)

Query: 1325 LKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEENEKA--------LQRLNEK 1376
             +LKE+  E E+++ + E + ++ + L+ K   +        ++          ++L+E+
Sbjct: 330  PELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLSEE 389

Query: 1377 ITSLEKNLPDLNE 1389
            +  LE+ L +L E
Sbjct: 390  LEELEEELKELKE 402



 Score = 32.6 bits (75), Expect = 2.4
 Identities = 21/118 (17%), Positives = 43/118 (36%), Gaps = 13/118 (11%)

Query: 1676 LKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLK 1735
             K ++A  L +              E EV +   +   L+   +E + +L    E   L+
Sbjct: 305  GKEVKAKELGSPA--------GTKTEIEVGVDFPE---LKEELKELEEELKELEEE--LE 351

Query: 1736 RARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQSCIN 1793
            + +  +              K +QL +          EL EL + + EL++ ++S  +
Sbjct: 352  KIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLSEELEELEEELKELKEELESLYS 409


>gnl|CDD|191111 pfam04849, HAP1_N, HAP1 N-terminal conserved region.  This family
            represents an N-terminal conserved region found in
            several huntingtin-associated protein 1 (HAP1)
            homologues. HAP1 binds to huntingtin in a polyglutamine
            repeat-length-dependent manner. However, its possible
            role in the pathogenesis of Huntington's disease is
            unclear. This family also includes a similar N-terminal
            conserved region from hypothetical protein products of
            ALS2CR3 genes found in the human juvenile amyotrophic
            lateral sclerosis critical region 2q33-2q34.
          Length = 307

 Score = 32.5 bits (74), Expect = 2.0
 Identities = 27/117 (23%), Positives = 46/117 (39%), Gaps = 11/117 (9%)

Query: 1281 LNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQIND 1340
            L  L EE + L      LK      +E   +   +  +Q  + + ++   S E  K+  D
Sbjct: 170  LKLLEEENEHLRSEASHLKTETVTYEEKEQQLVNDCVKQLREANDQIASLSEELAKKTED 229

Query: 1341 AERQCKRTENLVTKTFPQFTQGQE-------ENEKALQRLNEKITSLEKNLPDLNEL 1390
             ERQ +   +L++    Q    Q+       ENE+  Q L     +  +   +L EL
Sbjct: 230  LERQQEEITHLLS----QIVDLQKKCKSYALENEELQQHLAAAKDAQRQLQAELQEL 282



 Score = 30.2 bits (68), Expect = 9.5
 Identities = 57/290 (19%), Positives = 104/290 (35%), Gaps = 77/290 (26%)

Query: 1540 ADIRNISGLAVNKNIQSNPEQIRE-------RANEINNVIKSLTDIDT---ILTETAGDL 1589
             D       + +  +  + EQ  E        AN +  + K+  DID    +L E   DL
Sbjct: 20   DDDWKTPRASPDARLDLSTEQRAETLKYLALCANRVGQMTKTYNDIDAVTRLLEEKERDL 79

Query: 1590 AKAND-----LKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAA 1644
              A       LK+   L        KN                +  E  +  A+D+I   
Sbjct: 80   ELAARIGQSLLKQNKVL------MEKN----------------EKLEEQLGKARDEILQL 117

Query: 1645 RKDLSQ-------ITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEE 1697
            R +L+         ++  ++++ +S+E+          L+  ++    +G   ++   E+
Sbjct: 118  RHELNLKDELLQFYSDADEESEDESSES--------TPLRPQESSSSSHGCFQLEALQEK 169

Query: 1698 -KNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAK 1756
             K LE+E    + +AS L+    ET      + E         Q L+N            
Sbjct: 170  LKLLEEENEHLRSEASHLK---TETVT--YEEKE---------QQLVNDC---------- 205

Query: 1757 LKQLNETEAMFNSQESELTELSKNIAELQKRIQSCINFIVDKSNNYKNCV 1806
            +KQL E      S   EL + ++++   Q+ I   ++ IVD     K+  
Sbjct: 206  VKQLREANDQIASLSEELAKKTEDLERQQEEITHLLSQIVDLQKKCKSYA 255


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 32.9 bits (76), Expect = 2.0
 Identities = 49/225 (21%), Positives = 78/225 (34%), Gaps = 29/225 (12%)

Query: 1427 AKDYAETSKKLINEKESKAEELRKHMIILSES------------NSAKDYAETSKKLINE 1474
            AK+   +SK+L+ + +    E++ H   + E               AK  AE +     E
Sbjct: 10   AKELGVSSKELLEKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAEAE 69

Query: 1475 KEAKAEEVFRGITTAKQESHAANI-LAKEAYDHAASVRNKTAAYVASTSNITKQLDEFLN 1533
            +EAKAE           E+ AA    A+ A D AA      A    +     K+      
Sbjct: 70   EEAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAAARRPKAKKAAKKKKGPKPK 129

Query: 1534 APGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILTE--TAGDLA- 1590
                     R        K  +    + R R  +           + ++ E  T  +LA 
Sbjct: 130  KKKPKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKPTEKIPREVVIPETITVAELAE 189

Query: 1591 ----KANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAE 1631
                KA ++ +K  L K G  +T N        L +  TA+ +AE
Sbjct: 190  KMAVKAAEVIKK--LFKLGVMATIN------QSL-DQETAELLAE 225


>gnl|CDD|237687 PRK14357, glmU, bifunctional N-acetylglucosamine-1-phosphate
            uridyltransferase/glucosamine-1-phosphate
            acetyltransferase; Provisional.
          Length = 448

 Score = 32.8 bits (75), Expect = 2.0
 Identities = 25/103 (24%), Positives = 44/103 (42%), Gaps = 23/103 (22%)

Query: 1224 EDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNTTQ-RINLARLALN 1282
              +K L+E   R   D+T+++  +          EDP G G+ + +  + RI    +   
Sbjct: 105  NTLKRLIEEHNRKGADVTILVADL----------EDPTGYGRIIRDGGKYRI----VEDK 150

Query: 1283 DLTEEIKKLNK--------TGEMLKENATLLQENNVEGALELT 1317
            D  EE KK+ +        +G+ L E    ++  N +G   LT
Sbjct: 151  DAPEEEKKIKEINTGIYVFSGDFLLEVLPKIKNENAKGEYYLT 193


>gnl|CDD|112704 pfam03904, DUF334, Domain of unknown function (DUF334).
            Staphylococcus aureus plasmid proteins with no
            characterized function.
          Length = 229

 Score = 32.3 bits (73), Expect = 2.1
 Identities = 26/129 (20%), Positives = 53/129 (41%), Gaps = 31/129 (24%)

Query: 1594 DLKRKANLTKAGADSTKNTVQK------------IVDVLTEARTAQDMAEVAIQTAKDDI 1641
            +++  + LT   +   + +++             I+  + E +  QD+ E  ++  K ++
Sbjct: 20   NMQMNSKLTGTHSQKIQKSLENEELQELKKQNKLIIKYIAEIKENQDIREKELKAIKSEL 79

Query: 1642 SAARKDLS----QITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQN--GRSAMDVEN 1695
              A +D      +I ND              V LLQE+LK + A  +++  G+    V  
Sbjct: 80   KEATEDFQDKGKKIHNDF-------------VDLLQEKLKHVDAEDIEDELGKDIYGVRE 126

Query: 1696 EEKNLEKEV 1704
            E   + +EV
Sbjct: 127  EIDRMLQEV 135


>gnl|CDD|236491 PRK09377, tsf, elongation factor Ts; Provisional.
          Length = 290

 Score = 32.4 bits (75), Expect = 2.3
 Identities = 18/62 (29%), Positives = 29/62 (46%), Gaps = 4/62 (6%)

Query: 1582 LTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQT---AK 1638
            LTE  GD+ KA +  RK  L KA   + +   + +V    +      + EV  +T   AK
Sbjct: 27   LTEADGDIEKAIEWLRKKGLAKAAKKAGRVAAEGLVAAKVDGNKGV-LVEVNSETDFVAK 85

Query: 1639 DD 1640
            ++
Sbjct: 86   NE 87


>gnl|CDD|213361 cd12827, EcCorA_ZntB-like_u2, uncharacterized bacterial subfamily of
            the Escherichia coli CorA-Salmonella typhimurium ZntB
            family.  A uncharacterized subfamily of the Escherichia
            coli CorA-Salmonella typhimurium ZntB (EcCorA-ZntB_like)
            family of the MIT superfamily of essential membrane
            proteins involved in transporting divalent cations
            (uptake or efflux) across membranes.The EcCorA-ZntB-like
            family includes the Mg2+ transporters Escherichia coli
            and Salmonella typhimurium CorAs, which can also
            transport Co2+, and Ni2+. Structures of the intracellular
            domain of EcCorA-ZntB-like family members, Vibrio
            parahaemolyticus and Salmonella typhimurium ZntB, form
            funnel-shaped homopentamers, the tip of the funnel is
            formed from two C-terminal transmembrane (TM) helices
            from each monomer, and the large opening of the funnel
            from the N-terminal cytoplasmic domains. The GMN
            signature motif of the MIT superfamily occurs just after
            TM1, mutation within this motif is known to abolish Mg2+
            transport through Salmonella typhimurium CorA. Natural
            variants such as GVN and GIN, such as occur in some ZntB
            family proteins, may be associated with the transport of
            different divalent cations, such as zinc and cadmium. The
            functional diversity of MIT transporters may also be due
            to minor structural differences regulating gating,
            substrate selection, and transport.
          Length = 289

 Score = 32.1 bits (74), Expect = 2.3
 Identities = 26/116 (22%), Positives = 44/116 (37%), Gaps = 39/116 (33%)

Query: 1559 EQIRERANEINNVIKS---------LTDIDTILT--ETAGDLAKAND-----LKRKANLT 1602
             QI +  +E+   ++          L D++  L    T+    K+N+     LKR  +L 
Sbjct: 125  RQINKERDELEQELRKSTKNKELFQLLDLEKSLVYFSTS---LKSNEIVLEKLKRSRSLK 181

Query: 1603 KAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDA 1658
                D      + + DV+ E + A +MAE+                S I + L DA
Sbjct: 182  LDEEDK-----ELLEDVIIENKQAIEMAEIY---------------SDILSGLMDA 217


>gnl|CDD|224236 COG1317, FliH, Flagellar biosynthesis/type III secretory pathway
            protein [Cell motility and secretion / Intracellular
            trafficking and secretion].
          Length = 234

 Score = 32.0 bits (73), Expect = 2.4
 Identities = 20/103 (19%), Positives = 37/103 (35%), Gaps = 13/103 (12%)

Query: 1694 ENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNT 1753
            E  E+ LE +    +  A  L+   +E   +   +    G +         +  Q     
Sbjct: 40   EELEQALEAKEEELESAAQELQEGIEEGAREGYEEGFQLGYEEGF------EEGQEEGRV 93

Query: 1754 TAKLKQL-----NETEAMFNSQESELTELSKNIAE--LQKRIQ 1789
              +L +L      E EA+    E +L +L   IA   + K ++
Sbjct: 94   LERLAKLIAEFQAELEALKEVVEKQLVQLVLEIARKVIGKELE 136


>gnl|CDD|206121 pfam13950, Epimerase_Csub, UDP-glucose 4-epimerase C-term subunit. 
           This domain is the very C-terminal subunit of
           UDP-glucose 4-epimerase.
          Length = 62

 Score = 29.3 bits (67), Expect = 2.4
 Identities = 9/17 (52%), Positives = 11/17 (64%), Gaps = 1/17 (5%)

Query: 442 DMCKDGFWNFNETNPNG 458
           DMC+D  W +   NPNG
Sbjct: 46  DMCRDS-WRWQSKNPNG 61


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
            eukaryotic snRNP [Transcription].
          Length = 564

 Score = 32.4 bits (74), Expect = 2.4
 Identities = 22/114 (19%), Positives = 44/114 (38%), Gaps = 10/114 (8%)

Query: 1278 RLALNDLTEEIKKLNKTGEMLKENATLLQEN--NVEGALELTRQAHDTSLK---LKEQST 1332
               L    +E+++L K  E L++   LL  N   +E  L+  R A     +   ++   +
Sbjct: 299  ENKLEKQEDELEELEKAAEELRQKGELLYANLQLIEEGLKSVRLADFYGNEEIKIELDKS 358

Query: 1333 ETEKQINDAE-RQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLP 1385
            +T  +      ++ K+ +        Q      E ++A+       T+LEK   
Sbjct: 359  KTPSENAQRYFKKYKKLKGAKVNLDRQL----SELKEAIAYYESAKTALEKAEG 408


>gnl|CDD|223342 COG0264, Tsf, Translation elongation factor Ts [Translation,
            ribosomal structure and biogenesis].
          Length = 296

 Score = 32.3 bits (74), Expect = 2.5
 Identities = 20/107 (18%), Positives = 40/107 (37%), Gaps = 7/107 (6%)

Query: 1582 LTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDV-LTEARTAQDMAEVAIQT---A 1637
            L E  GD+ KA +  R+  + KA   + +   + ++   +        + EV  +T   A
Sbjct: 27   LEEANGDIEKAIEWLREKGIAKAAKKAGRIAAEGLIAAKVDGDGKKAVLVEVNCETDFVA 86

Query: 1638 KDD--ISAARKDLSQI-TNDLDDAQQKSNETNVKVKLLQERLKSIQA 1681
            K+      A K           D ++         K ++E + ++ A
Sbjct: 87   KNAEFQELANKIAKAALEKKPADVEELKAAFEPGGKTVEEEIAALIA 133


>gnl|CDD|213755 TIGR02977, phageshock_pspA, phage shock protein A.  Members of this
            family are the phage shock protein PspA, from the phage
            shock operon. This is a narrower family than the set of
            PspA and its homologs, sometimes several in a genome, as
            described by pfam04012. PspA appears to maintain the
            protonmotive force under stress conditions that include
            overexpression of certain phage secretins, heat shock,
            ethanol, and protein export defects [Cellular processes,
            Adaptations to atypical conditions].
          Length = 219

 Score = 31.9 bits (73), Expect = 2.5
 Identities = 15/62 (24%), Positives = 31/62 (50%), Gaps = 4/62 (6%)

Query: 1620 LTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSI 1679
            L E + AQ++AE      + +++A  + L+++  D+   Q K  E   + K L  R ++ 
Sbjct: 91   LIEKQKAQELAE----ALERELAAVEETLAKLQEDIAKLQAKLAEARARQKALAIRHQAA 146

Query: 1680 QA 1681
             +
Sbjct: 147  SS 148


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 32.4 bits (75), Expect = 2.7
 Identities = 51/233 (21%), Positives = 87/233 (37%), Gaps = 64/233 (27%)

Query: 1285 TEEIKKLNKTGEMLKENATLLQENNVEGA-LELTRQAHDTSLKLKEQSTETEKQINDAER 1343
              +IK+  +  + + E A    E   + A LE   + H    + +++  E   ++   E+
Sbjct: 30   EAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEK 89

Query: 1344 QCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELICDKRGDPCDNLC 1403
            +  + E  + +      + +EE EK  + L +K   LEK   +L ELI            
Sbjct: 90   RLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELI------------ 137

Query: 1404 GGAGCGKCGGMWCSNGTLSESNSAKDYAETSKKLINEKESK--AEELRKHMIILS--ESN 1459
                                        E  ++L  E+ S   AEE ++  I+L   E  
Sbjct: 138  ---------------------------EEQLQEL--ERISGLTAEEAKE--ILLEKVEEE 166

Query: 1460 SAKDYAETSKKLINEKEAKAEEVFRGITTAKQESHAANILA----KEAYDHAA 1508
            +  + A     LI E E +A+E       A ++  A  ILA    + A DH A
Sbjct: 167  ARHEAA----VLIKEIEEEAKE------EADKK--AKEILAQAIQRCAADHVA 207


>gnl|CDD|203372 pfam06009, Laminin_II, Laminin Domain II.  It has been suggested that
            the domains I and II from laminin A, B1 and B2 may come
            together to form a triple helical coiled-coil structure.
          Length = 138

 Score = 30.9 bits (70), Expect = 2.7
 Identities = 19/98 (19%), Positives = 37/98 (37%), Gaps = 18/98 (18%)

Query: 1698 KNLEKEVALAQKQASGLRSRYQETDNKL-----LSKAESSGLKRARGQM--LLNKASQLS 1750
             N+ K++       S  +++  + +  L     L  + +  +  A   +  L   A  L 
Sbjct: 13   ANVLKQLEPISTNLSNTKAKLSKLNGSLEETNELVNSANKAVDDAGRSVRKLEKLAPDL- 71

Query: 1751 VNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRI 1788
                 KLK L +        E     LS+NI+ +++ I
Sbjct: 72   ---LDKLKPLKQ-------LEVANQSLSRNISRIKELI 99



 Score = 30.1 bits (68), Expect = 4.3
 Identities = 16/114 (14%), Positives = 43/114 (37%), Gaps = 4/114 (3%)

Query: 1602 TKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQK 1661
            +K  A     T   ++  L    T     +  +      +    + ++     +DDA + 
Sbjct: 1    SKELAREANETAANVLKQLEPISTNLSNTKAKLSKLNGSLEETNELVNSANKAVDDAGRS 60

Query: 1662 SNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLR 1715
              +       L ++LK ++   L+    ++        +++ +A A+K A+ ++
Sbjct: 61   VRKLEKLAPDLLDKLKPLKQ--LEVANQSLSRNISR--IKELIAQARKAANSIK 110


>gnl|CDD|226396 COG3879, COG3879, Uncharacterized protein conserved in bacteria
            [Function unknown].
          Length = 247

 Score = 32.0 bits (73), Expect = 2.7
 Identities = 19/86 (22%), Positives = 35/86 (40%), Gaps = 4/86 (4%)

Query: 1629 MAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGR 1688
            M     QT+K + S  R     +  +L   Q+K N    +V+ L+ +L S++   +    
Sbjct: 34   MLAAVFQTSKGE-SVRRARDLDLVKELRSLQKKVNTLAAEVEDLENKLDSVRRS-VLTDD 91

Query: 1689 SAMDVENEEKNLEKEVALAQKQASGL 1714
            +A+  E+  + L            GL
Sbjct: 92   AAL--EDRLEKLRMLAGSVPVTGPGL 115


>gnl|CDD|217457 pfam03255, ACCA, Acetyl co-enzyme A carboxylase carboxyltransferase
            alpha subunit.  Acetyl co-enzyme A carboxylase
            carboxyltransferase is composed of an alpha and beta
            subunit.
          Length = 145

 Score = 31.1 bits (71), Expect = 2.8
 Identities = 20/69 (28%), Positives = 33/69 (47%), Gaps = 15/69 (21%)

Query: 1215 EFEL----LEKKIEDVKALVE-NTTRSSHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGN 1269
            EFE     LE KI++++ L   N    S ++  +   +D +K++I +N  P         
Sbjct: 4    EFEKPLAELEAKIDELRKLARKNDVDVSDEIHRLEKKLDKLKREIYSNLTP--------- 54

Query: 1270 TTQRINLAR 1278
              QR+ LAR
Sbjct: 55   -WQRVQLAR 62


>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
            PFL_4705 family.  Members of this protein family are
            found occasionally on plasmids such as the Pseudomonas
            putida toluene catabolic TOL plasmid pWWO_p085. Usually,
            however, they are found on the bacterial main chromosome
            in regions flanked by markers of conjugative transfer
            and/or transposition [Mobile and extrachromosomal element
            functions, Plasmid functions].
          Length = 472

 Score = 32.3 bits (74), Expect = 2.8
 Identities = 23/108 (21%), Positives = 42/108 (38%), Gaps = 27/108 (25%)

Query: 1281 LNDLTEEIKKLNK-TGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQIN 1339
            L  L  E+K+L K   +++ EN  L  EN                 +L+++    ++QI 
Sbjct: 61   LRTLVAEVKELRKRLAKLISENEALKAENE----------------RLQKREQSIDQQIQ 104

Query: 1340 DAERQCKRTENLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDL 1387
             A          V     + T+  E+ +   Q+L   I  L++ L  +
Sbjct: 105  QA----------VQSETQELTKEIEQLKSERQQLQGLIDQLQRRLAGV 142


>gnl|CDD|205039 pfam12718, Tropomyosin_1, Tropomyosin like.  This family is a set of
            eukaryotic tropomyosins. Within the yeast Tmp1 and Tmp2,
            biochemical and sequence analyses indicate that Tpm2p
            spans four actin monomers along a filament, whereas Tpmlp
            spans five. Despite its shorter length, Tpm2p can compete
            with Tpm1p for binding to F-actin. Over-expression of
            Tpm2p in vivo alters the axial budding of haploids to a
            bipolar pattern, and this can be partially suppressed by
            co-over-expression of Tpm1p. This suggests distinct
            functions for the two tropomyosins, and indicates that
            the ratio between them is important for correct
            morphogenesis. The family also contains higher eukaryote
            Tmp3 members.
          Length = 143

 Score = 30.6 bits (70), Expect = 3.0
 Identities = 42/178 (23%), Positives = 65/178 (36%), Gaps = 53/178 (29%)

Query: 1311 EGALELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQ---EENE 1367
            E A E   +  +   +L++++ E E++I   +++ ++ E  V K   Q  + +   EE+E
Sbjct: 10   ENAQERAEELEEKLKELEQENLEKEQEITSLQKKNQQLEEEVEKLEEQLKEAKEKLEESE 69

Query: 1368 KALQR---LNEKITSLEKNLPDLNELICDKRGDPCDNLCGGAGCGKCGGMWCSNGTLSES 1424
            K       L  +I  LE       EL                                  
Sbjct: 70   KLATNAEALTRRIQLLE------EEL---------------------------------E 90

Query: 1425 NSAKDYAETSKKLINEKESKAEEL---RKHMIILSESNSAKDYAETSKKLINEKEAKA 1479
             S K   ET++KL  E + KAEE     K +    +    K Y E  KK    KEAK 
Sbjct: 91   ESEKRLKETTEKL-READKKAEESERKVKALENERDEWEEK-YEELEKKY---KEAKK 143



 Score = 29.9 bits (68), Expect = 6.7
 Identities = 25/137 (18%), Positives = 55/137 (40%), Gaps = 39/137 (28%)

Query: 1622 EARTAQDMAEVAIQTAKD----------DISAARKDLSQITNDLD-------DAQQKSNE 1664
            EA  AQ+ AE   +  K+          +I++ +K   Q+  +++       +A++K  E
Sbjct: 8    EAENAQERAEELEEKLKELEQENLEKEQEITSLQKKNQQLEEEVEKLEEQLKEAKEKLEE 67

Query: 1665 T----------NVKVKLLQERLKSIQAGFLQNGRSAM------DVENEEKNLEKEVALAQ 1708
            +            +++LL+E L+  +       +         D + EE   E++V   +
Sbjct: 68   SEKLATNAEALTRRIQLLEEELEESEK----RLKETTEKLREADKKAEE--SERKVKALE 121

Query: 1709 KQASGLRSRYQETDNKL 1725
             +      +Y+E + K 
Sbjct: 122  NERDEWEEKYEELEKKY 138


>gnl|CDD|225559 COG3014, COG3014, Uncharacterized protein conserved in bacteria
            [Function unknown].
          Length = 449

 Score = 32.2 bits (73), Expect = 3.0
 Identities = 19/83 (22%), Positives = 29/83 (34%), Gaps = 9/83 (10%)

Query: 1653 NDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKNLEKEVA----LAQ 1708
            ND   A+ + N  N +    Q R K      +Q     +D      N+E+  A    +  
Sbjct: 139  NDSAKARVEFNRANER----QRRAKEFYYEEVQKAIKEIDSSKHNINMERSRAEVSEILN 194

Query: 1709 KQASGLRSRYQETDNKLLSKAES 1731
               S    +Y E    LL+   S
Sbjct: 195  NTYSNYLDKY-EAYQGLLNPYVS 216


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
            [Function unknown].
          Length = 438

 Score = 32.1 bits (73), Expect = 3.1
 Identities = 25/105 (23%), Positives = 44/105 (41%), Gaps = 13/105 (12%)

Query: 1284 LTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQINDAER 1343
              ++I++ +++  +L       +E N        ++A   S K KE S   E+ IN  + 
Sbjct: 28   RYKQIEQEDQS-RILNTLEEFEKEAN-------EKRAQYRSAKKKELSQLEEQLINQKKE 79

Query: 1344 QCKRTENLVTKTFPQFTQGQEENEKALQRLNE----KITSLEKNL 1384
            Q       + K F    Q +    +AL+ LN     ++  LEK L
Sbjct: 80   QKNLFNEQI-KQFELALQDEIAKLEALELLNLEKDKELELLEKEL 123



 Score = 30.6 bits (69), Expect = 8.2
 Identities = 35/158 (22%), Positives = 67/158 (42%), Gaps = 27/158 (17%)

Query: 1640 DISAARKDLSQITNDLDDAQQK-SNETNVKVK----LLQERLKSIQAGFLQNGRSAMDVE 1694
              SA +K+LSQ+   L + +++  N  N ++K     LQ+ +  ++A  L N    ++ +
Sbjct: 58   YRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQFELALQDEIAKLEALELLN----LEKD 113

Query: 1695 NEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTT 1754
             E + LEKE          L    +E   +L + AE    KR       N  ++  +   
Sbjct: 114  KELELLEKE----------LDELSKELQKQLQNTAEIIEKKRE------NNKNEERLKFE 157

Query: 1755 AKLKQLNETEAMFNSQESELTELSKNI--AELQKRIQS 1790
             + K     E      E +L E + ++   E +++ +S
Sbjct: 158  NEKKLEESLELEREKFEEQLHEANLDLEFKENEEQRES 195


>gnl|CDD|232995 TIGR00487, IF-2, translation initiation factor IF-2.  This model
            discriminates eubacterial (and mitochondrial) translation
            initiation factor 2 (IF-2), encoded by the infB gene in
            bacteria, from similar proteins in the Archaea and
            Eukaryotes. In the bacteria and in organelles, the
            initiator tRNA is charged with N-formyl-Met instead of
            Met. This translation factor acts in delivering the
            initator tRNA to the ribosome. It is one of a number of
            GTP-binding translation factors recognized by the pfam
            model GTP_EFTU [Protein synthesis, Translation factors].
          Length = 587

 Score = 32.0 bits (73), Expect = 3.4
 Identities = 20/78 (25%), Positives = 32/78 (41%), Gaps = 3/78 (3%)

Query: 1276 LARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTETE 1335
            LA      +++ IKKL   G M+  N  L +E     A E         +++  + TE E
Sbjct: 17   LANKMNIKVSDIIKKLMLLGVMVTINQVLDKETAELVAEEF---GVKVEVRVTLEETEAE 73

Query: 1336 KQINDAERQCKRTENLVT 1353
            +Q  D+         +VT
Sbjct: 74   EQDEDSGDLLVERPPVVT 91


>gnl|CDD|206779 cd11386, MCP_signal, Methyl-accepting chemotaxis protein (MCP),
            signaling domain.  Methyl-accepting chemotaxis proteins
            (MCPs or chemotaxis receptors) are an integral part of
            the transmembrane protein complex that controls bacterial
            chemotaxis, together with the histidine kinase CheA, the
            receptor-coupling protein CheW, receptor-modification
            enzymes, and localized phosphatases. MCPs contain a four
            helix trans membrane region, an N-terminal periplasmic
            ligand binding domain, and a C-terminal HAMP domain
            followed by a cytoplasmic signaling domain. This
            C-terminal signaling domain dimerizes into a four-helix
            bundle and interacts with CheA through the adaptor
            protein CheW.
          Length = 200

 Score = 31.1 bits (71), Expect = 3.6
 Identities = 13/57 (22%), Positives = 24/57 (42%), Gaps = 1/57 (1%)

Query: 1623 ARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSI 1679
            A +A  +AE + Q A +     R+      N ++   +  +E    V+ L+E    I
Sbjct: 11   AASADQVAETS-QQAAELAEKGREAAEDAINQMNQIDESVDEAVSAVEELEESSAEI 66


>gnl|CDD|220964 pfam11068, DUF2869, Protein of unknown function (DUF2869).  This
            bacterial family of proteins has no known function.
          Length = 131

 Score = 30.3 bits (69), Expect = 3.6
 Identities = 15/65 (23%), Positives = 31/65 (47%), Gaps = 1/65 (1%)

Query: 1327 LKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQG-QEENEKALQRLNEKITSLEKNLP 1385
            L+EQ T+ E+++   E Q ++    + K   Q  +  Q++ E+   +  E+   L   + 
Sbjct: 25   LQEQLTQLEQELQQLEFQGQKAIKEIRKQSAQQIEQIQQQVEQERAKRLEQKNQLLFQIE 84

Query: 1386 DLNEL 1390
             + EL
Sbjct: 85   QVQEL 89


>gnl|CDD|183114 PRK11388, PRK11388, DNA-binding transcriptional regulator DhaR;
            Provisional.
          Length = 638

 Score = 32.0 bits (73), Expect = 3.6
 Identities = 30/129 (23%), Positives = 55/129 (42%), Gaps = 16/129 (12%)

Query: 1216 FEL----LEKKIEDVKALVENTTRS-SHDLTVMLTTIDDMKKQIMNNEDPNGVGKTLGNT 1270
            FE+    L  + ED+ ALV N  RS     +  L   DD   ++++   P    + L + 
Sbjct: 488  FEITIPPLRMRREDIPALVNNKLRSLEKRFSTRLKIDDDALARLVSYRWPGNDFE-LRSV 546

Query: 1271 TQRINLA----RLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALE---LTRQAHDT 1323
             + + L+    R+ L+DL E +     T ++   +AT L  +     LE   +   A   
Sbjct: 547  IENLALSSDNGRIRLSDLPEHLFTEQATDDV---SATRLSTSLSLAELEKEAIINAAQVC 603

Query: 1324 SLKLKEQST 1332
              +++E + 
Sbjct: 604  GGRIQEMAA 612


>gnl|CDD|112355 pfam03532, OMS28_porin, OMS28 porin. 
          Length = 253

 Score = 31.3 bits (70), Expect = 3.7
 Identities = 38/210 (18%), Positives = 86/210 (40%), Gaps = 17/210 (8%)

Query: 1198 TDASNIKQTGASGAYTKEFELLEKKIEDVKALVENTTRSSHDLT-------------VML 1244
            +D  + K+        +  + + K  EDV + +E    SS +L               M 
Sbjct: 40   SDQKDNKKLDQKDQVNQALDTINKVTEDVSSKLEGVRESSLELVESNDAGVVKKFVGSMS 99

Query: 1245 TTIDDMKKQIMNNEDPNGVGKTLGNTTQRIN-LARLALNDLTEEIKKLNKTGE--MLKEN 1301
               D  K  ++ +++   V K  G   +  N +  ++   + E  K ++  GE   L E 
Sbjct: 100  LMSDVAKGTVVASQEATIVAKCSGMVAEGANKVVEMSKKAVQETQKAVSVAGEATFLIEK 159

Query: 1302 ATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQ 1361
              +L ++     LELT++      ++KE    +E+ +++  ++ ++  N+V    P   +
Sbjct: 160  QIMLNKSPNNKELELTKEEFAKVDEVKETLMASERALDETVQEAQKVLNMVNGLNPS-NK 218

Query: 1362 GQEENEKALQRLNEKITSLEKNLPDLNELI 1391
             Q   +K + +    +  + +   DL +++
Sbjct: 219  DQVLAKKDVAKAISNVVKVAQGARDLTKVM 248


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found in
            cell division proteins which are required for
            kinetochore-spindle association.
          Length = 312

 Score = 31.5 bits (72), Expect = 3.7
 Identities = 30/113 (26%), Positives = 46/113 (40%), Gaps = 10/113 (8%)

Query: 1284 LTEEIKK-LNKTGEMLKENATLL--QENNVEGALELTRQAHDTSLKLKEQSTETEKQIND 1340
            L E +K+ L++  E LKE+  LL  +   +       R   D    L+E+  +  KQ+ D
Sbjct: 141  LLEGLKEGLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDA---LEEELRQ-LKQLED 196

Query: 1341 AERQCKRTE--NLVTKTFPQFTQGQEENEKALQRLNEKITSLEKNLPDLNELI 1391
                C  TE      K   +  Q      K L+ L E++  LE  + DL    
Sbjct: 197  ELEDCDPTELDRAKEK-LKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKK 248


>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB.  Members of
            this protein family are the bacterial ATP-dependent
            chaperone ClpB. This protein belongs to the AAA family,
            ATPases associated with various cellular activities
            (pfam00004). This molecular chaperone does not act as a
            protease, but rather serves to disaggregate misfolded and
            aggregated proteins [Protein fate, Protein folding and
            stabilization].
          Length = 852

 Score = 31.9 bits (73), Expect = 4.0
 Identities = 18/83 (21%), Positives = 38/83 (45%), Gaps = 21/83 (25%)

Query: 1698 KNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQML--------------L 1743
            ++LEKE+A  +++ + L  +++        KA   G+++ + ++               L
Sbjct: 437  EDLEKELAELEEEYADLEEQWK------AEKAAIQGIQQIKEEIEQVRLELEQAEREGDL 490

Query: 1744 NKASQLSVNTTAKL-KQLNETEA 1765
             KA++L      +L K+L   EA
Sbjct: 491  AKAAELQYGKLPELEKRLQAAEA 513


>gnl|CDD|237624 PRK14143, PRK14143, heat shock protein GrpE; Provisional.
          Length = 238

 Score = 31.2 bits (71), Expect = 4.2
 Identities = 19/83 (22%), Positives = 38/83 (45%), Gaps = 1/83 (1%)

Query: 1307 ENNVEGALELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQFTQGQEEN 1366
               V+ A+    +A D S +  E+ TE E ++ + E      E+         ++   +N
Sbjct: 7    PLEVKLAVISESEAEDNSPESSEEVTEQEAELTNPEGDAAEAESSPDSGSAA-SETAADN 65

Query: 1367 EKALQRLNEKITSLEKNLPDLNE 1389
               L +L +++ SL++ L +LN 
Sbjct: 66   AARLAQLEQELESLKQELEELNS 88


>gnl|CDD|220404 pfam09789, DUF2353, Uncharacterized coiled-coil protein (DUF2353).
            Members of this family of uncharacterized proteins have
            no known function.
          Length = 319

 Score = 31.4 bits (71), Expect = 4.3
 Identities = 15/82 (18%), Positives = 36/82 (43%), Gaps = 2/82 (2%)

Query: 1635 QTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVE 1694
            Q     +   R++  Q++ D+ + +Q+  E    +KLL++ +   +  F   G    +  
Sbjct: 68   QDLSQLLEKYREENQQLSTDVQELRQRKAELQGDIKLLRQTIAQQRVEF--MGGGVRESF 125

Query: 1695 NEEKNLEKEVALAQKQASGLRS 1716
             E  +L  ++   +++   L S
Sbjct: 126  EERNDLVSQLEKLREKCKQLES 147



 Score = 31.4 bits (71), Expect = 4.6
 Identities = 42/261 (16%), Positives = 93/261 (35%), Gaps = 30/261 (11%)

Query: 1551 NKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGD--LAKANDLKRKANLTKAGADS 1608
             +++    E+ RE   +++  ++ L        E  GD  L +    +++      G   
Sbjct: 67   RQDLSQLLEKYREENQQLSTDVQELRQR---KAELQGDIKLLRQTIAQQRVEFMGGGVRE 123

Query: 1609 TKNTVQKIVDVLTEARTAQDMAEVAIQTA---KDDISAARKDLSQITNDLDDAQQKSNET 1665
            +      +V  L + R      E  +++    K+++   R +  +  + L      +NE 
Sbjct: 124  SFEERNDLVSQLEKLREKCKQLESDLRSLLDEKEEVLMERDEYQRKAHRL------NNEL 177

Query: 1666 NVKVKLLQERLKSIQAGFLQNG--RSAMDVENEEKNLEKEVALAQKQA-------SGLRS 1716
            +  +     R+  + A  ++N   +  ++   EE  L K      KQ        + +++
Sbjct: 178  SYLLGGDPRRVVDVDALVMENRYLKEKINTLEEESELIKMTLAKYKQMLEAKNKKTMVKA 237

Query: 1717 RYQETDN-------KLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNS 1769
               +  N       K + +  +S        +     S L    TA L   N+ +     
Sbjct: 238  GEGDDKNSTGVISMKQVKELLASEAIELELPVKAATISDLKSLATALLDLCNDKQMALAH 297

Query: 1770 QESELTELSKNIAELQKRIQS 1790
            Q      L   +AEL+K++ +
Sbjct: 298  QRRTNKILGNRVAELEKKLAT 318


>gnl|CDD|236270 PRK08471, flgK, flagellar hook-associated protein FlgK; Validated.
          Length = 613

 Score = 31.6 bits (72), Expect = 4.5
 Identities = 32/167 (19%), Positives = 63/167 (37%), Gaps = 30/167 (17%)

Query: 1455 LSESNSAKDYAETSKKLINEKEAKAEEV-FRGITTAKQESHAA-NILAKEAYDHAASVRN 1512
              ++++  +Y +     + E      ++   GI    Q+   A N  A    D A     
Sbjct: 78   YKKASTELEYTDYEFSTLQEASQYFPDLDDTGILKDLQDYFNAWNDFASNPKDSAQKQ-- 135

Query: 1513 KTAAYVASTSNITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVI 1572
                   + +  T+ L   +      L  ++      VN       E+++   +EIN++ 
Sbjct: 136  -------ALAQKTETLTNNIKDTRERLDTLQK----KVN-------EELKVTVDEINSLG 177

Query: 1573 KSLTDIDTIL--TETAGDLAKANDLKRKA-----NLTK-AGADSTKN 1611
            K + +I+  +   E    L  AN+L+ K       L+K  GA+  K+
Sbjct: 178  KQIAEINKQIKEVEAGKTLKHANELRDKRDELELTLSKLVGANVFKS 224


>gnl|CDD|234801 PRK00591, prfA, peptide chain release factor 1; Validated.
          Length = 359

 Score = 31.2 bits (72), Expect = 4.9
 Identities = 27/94 (28%), Positives = 41/94 (43%), Gaps = 21/94 (22%)

Query: 1713 GLRSRYQETDNKLLSKAES-SGLKRARGQMLLNKA-SQLS--VNTTAKLKQ----LNETE 1764
             L  RY+E    LLS  E  S  KR R    L+K  ++L   V    + KQ    L E +
Sbjct: 10   ALEERYEEL-EALLSDPEVISDQKRFR---KLSKEYAELEPIVEAYREYKQAQEDLEEAK 65

Query: 1765 AMFNSQ---------ESELTELSKNIAELQKRIQ 1789
             M   +         + EL EL + + EL++ ++
Sbjct: 66   EMLEEESDPEMREMAKEELKELEERLEELEEELK 99


>gnl|CDD|218651 pfam05600, DUF773, Protein of unknown function (DUF773).  This family
            contains several eukaryotic sequences which are thought
            to be CDK5 activator-binding proteins, however, the
            function of this family is unknown.
          Length = 506

 Score = 31.6 bits (72), Expect = 5.1
 Identities = 28/142 (19%), Positives = 53/142 (37%), Gaps = 23/142 (16%)

Query: 1550 VNKNIQSNPEQIRERANEINNVIKSLTDIDTILTETAGDLAKANDLKRKANLTKAGADST 1609
             NK+ Q     IRE+   IN  I+ + + + I+   +G         +   + K     T
Sbjct: 26   CNKDWQKAVLAIREK---INAAIQDMPEHEEIVQLLSGAYINYFHCLQIVEILKGTEADT 82

Query: 1610 KNTV-----------QKIV-----DVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITN 1653
            KN             Q+IV     D +  A  AQ +    ++    +I A RK +++   
Sbjct: 83   KNIFGRYSSQRMKDWQEIVRLYEKDNVYLAEAAQIL----VRNVNYEIPALRKQIAKCQQ 138

Query: 1654 DLDDAQQKSNETNVKVKLLQER 1675
               + ++K  +        +E+
Sbjct: 139  LQQELERKEEDCKRSAADSREQ 160


>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding.  This
            family is the highly conserved central region of a number
            of metazoan proteins referred to as growth-arrest
            proteins. In mouse, Gas8 is predominantly a testicular
            protein, whose expression is developmentally regulated
            during puberty and spermatogenesis. In humans, it is
            absent in infertile males who lack the ability to
            generate gametes. The localisation of Gas8 in the
            motility apparatus of post-meiotic gametocytes and mature
            spermatozoa, together with the detection of Gas8 also in
            cilia at the apical surfaces of epithelial cells lining
            the pulmonary bronchi and Fallopian tubes suggests that
            the Gas8 protein may have a role in the functioning of
            motile cellular appendages. Gas8 is a microtubule-binding
            protein localised to regions of dynein regulation in
            mammalian cells.
          Length = 201

 Score = 30.7 bits (70), Expect = 5.1
 Identities = 34/181 (18%), Positives = 80/181 (44%), Gaps = 38/181 (20%)

Query: 1634 IQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQER---LKSIQAGFLQNGRSA 1690
            I++ K++I+  +K+       + +  Q++      +K  +E    L+     + ++ +S 
Sbjct: 29   IKSLKEEIAEMKKNEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSL 88

Query: 1691 MDVENEEKNLEKEVALAQKQASGLRSRYQ--ETD-NKLLSKAESS--------GLKRARG 1739
             +++   K LEKE+   + ++  L  R++  E + ++L  K E++        GLK    
Sbjct: 89   KNLKARLKELEKELKNLKWESEVLEQRFEKVERERDELYDKFEAAIQDVQQKTGLKN--- 145

Query: 1740 QMLLNKASQLSVNTTAKLKQLNETEAMFNSQESELTEL--SKNI-----AELQKRIQSCI 1792
             +LL +          KL+ LNE       +E++L E+  + N+       + +++   +
Sbjct: 146  -LLLEQ----------KLEALNE---ELEKKEAQLNEVLAAANLDPAALQAVSEKLDDVL 191

Query: 1793 N 1793
             
Sbjct: 192  E 192


>gnl|CDD|233894 TIGR02492, flgK_ends, flagellar hook-associated protein FlgK.  The
            flagellar hook-associated protein FlgK of bacterial
            flagella has conserved N- and C-terminal domains. The
            central region is highly variable in length and sequence,
            and often contains substantial runs of low-complexity
            sequence. This model is built from an alignment of FlgK
            sequences with the central region excised. Note that
            several other proteins of the flagellar apparatus also
            are homologous in the N- and C-terminal regions to FlgK,
            but are excluded from this model [Cellular processes,
            Chemotaxis and motility].
          Length = 324

 Score = 31.2 bits (71), Expect = 5.1
 Identities = 38/191 (19%), Positives = 68/191 (35%), Gaps = 37/191 (19%)

Query: 1623 ARTAQDMAEVAIQTAKDDISAA------RKDLSQITNDLDDAQQKSNETNVKVK------ 1670
            A++    +++A+    ++I+ A      R+              K   T VKV       
Sbjct: 6    AKSGLSASQIALSVTSNNIANANTPGYSRQRAVLTAVPAGGQGGKQIGTGVKVTSIRRIQ 65

Query: 1671 ----LLQERLKSIQAGFLQNGRSAM----DVENEEKNLEKEVALAQKQASGLRSRYQETD 1722
                  Q R  +  AG+  +  S +     + N+        +      +   +  QE  
Sbjct: 66   DQFLDNQYRRANSSAGYADSRASGLSKIEQIFNDLDG-----SGLSTYLNNFFNALQE-- 118

Query: 1723 NKLLSKAESSGLKRARGQMLLNKASQLSV---NTTAKLKQL-NETEAMFNSQESELTELS 1778
              L    +S  L+    Q +L  A  L+     T+++L+ L     A   S  +E+  L 
Sbjct: 119  --LAKNPDSEALR----QAVLESAQALANQFNQTSSELQDLRKNINAEIKSAVTEINSLL 172

Query: 1779 KNIAELQKRIQ 1789
            K IA L K I+
Sbjct: 173  KQIASLNKEIK 183


>gnl|CDD|237212 PRK12808, PRK12808, flagellin; Provisional.
          Length = 476

 Score = 31.5 bits (71), Expect = 5.5
 Identities = 43/249 (17%), Positives = 82/249 (32%), Gaps = 28/249 (11%)

Query: 1550 VNKNIQSNPEQIRERANEINNVIKSL------TDIDTILTETAGDLAKANDLKRKAN--- 1600
            +  N Q N + +    + IN  I++L        I   L     +    NDL   AN   
Sbjct: 124  IADNTQFNDKNLLNSNSSIN--IQTLDSSDGDKQIGIDLKSATLEALGINDLSIGANTDI 181

Query: 1601 --LTKAGADSTKNTVQKIVDVLTEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDA 1658
              ++   A   KNT+ K      EA+ A D A+   +    +I  A  D  +     DDA
Sbjct: 182  EAVSTKIAALDKNTIAK---ATVEAKAAFDKAKDDTKAEDSNILDAAADGFK-DGKADDA 237

Query: 1659 QQKSNETNVKVKLLQ-----ERLKSIQAGFLQNGRSAMDVENEEKNLEKEVALAQKQASG 1713
             +        +         E  ++ +  F    +  +D   ++  L K         + 
Sbjct: 238  AKDVEAIKTALSAFTGAATLEEAEAAKTAFEVAQKDLVDTYTKKAALTK------DAVAD 291

Query: 1714 LRSRYQETDNKLLSKAESSGLKRARGQMLLNKASQLSVNTTAKLKQLNETEAMFNSQESE 1773
            L +    +     +K   +   +A+     N  ++  +      ++ N      +  E+ 
Sbjct: 292  LDTAKTTSTRSKAAKDLVAAYDKAKSGAKPNDVAKAYLEAKMAYEKDNNAIDGKSKLEAA 351

Query: 1774 LTELSKNIA 1782
               L K+  
Sbjct: 352  DDALEKDAI 360


>gnl|CDD|185750 cd09237, V_ScBro1_like, Protein-interacting V-domain of Saccharomyces
            cerevisiae Bro1 and related domains.  This family
            contains the V-shaped (V) domain of Saccharomyces
            cerevisiae Bro1, and related domains. It belongs to the
            V_Alix_like superfamily which also includes the V-domain
            of Saccharomyces cerevisiae Rim20 (also known as PalA),
            mammalian Alix (apoptosis-linked gene-2 interacting
            protein X), His-Domain type N23 protein tyrosine
            phosphatase (HD-PTP, also known as PTPN23), and related
            domains. Bro1 interacts with the ESCRT (Endosomal Sorting
            Complexes Required for Transport) system, and
            participates in endosomal trafficking. The mammalian Alix
            V-domain (belonging to a different family) contains a
            binding site, partially conserved in the superfamily, for
            the retroviral late assembly (L) domain YPXnL motif. The
            Alix V-domain is also a dimerization domain. Bro1 also
            has an N-terminal Bro1-like domain, which binds Snf7, a
            component of the ESCRT-III complex, and a C-terminal
            proline-rich region (PRR). The C-terminal portion
            (V-domain and PRR) of S. cerevisiae Bro1 interacts with
            Doa4, a ubiquitin thiolesterase needed to remove
            ubiquitin from MVB cargoes. It interacts with a YPxL
            motif in the Doa4s catalytic domain to stimulate its
            deubiquitination activity.
          Length = 356

 Score = 31.1 bits (71), Expect = 5.8
 Identities = 37/195 (18%), Positives = 63/195 (32%), Gaps = 48/195 (24%)

Query: 1203 IKQTGASGAYTKEFELLEKKIEDVKALVENTTRSSHDLTVMLTTIDDMKKQIMNNEDPNG 1262
            I     S +   + ELL  +       ++    S +DL   +  I+ M+K+I+       
Sbjct: 62   IVSGLKSSSVDSQLELLRPQSASWVNEID---SSYNDLDEEMKEIEKMRKKILA------ 112

Query: 1263 VGKTLGNTTQRINLARLALNDLTEEIKKLNKTGEMLKEN----ATLLQENNVEGALELTR 1318
                    TQ    + L    L E++ KL K+      +     +L+     + AL L  
Sbjct: 113  ------KWTQSP-SSSLT-ASLREDLVKLKKSLVEASASDEKLFSLVDPVKEDIALLLNG 164

Query: 1319 QAHDTSLKLKEQSTETE----------------KQINDAERQCKRTENLVTKTFPQFTQG 1362
             +    L     S   E                KQI   E   +    +           
Sbjct: 165  GSLWEELFGFSSSGSPEPSLLDLDDSQNEQTVLKQIKQLEELLEDLNLIK---------- 214

Query: 1363 QEENEKALQRLNEKI 1377
             EE ++ L+ L +KI
Sbjct: 215  -EERQRVLKDLKQKI 228


>gnl|CDD|215452 PLN02843, PLN02843, isoleucyl-tRNA synthetase.
          Length = 974

 Score = 31.3 bits (71), Expect = 6.2
 Identities = 19/64 (29%), Positives = 27/64 (42%), Gaps = 9/64 (14%)

Query: 344 NNHSTKCHFDEAVY-NSTGRISGGVCDSCEHNTEGRNCERCKQYFYKDPNKEFSD-PEVC 401
           ++ +  C      Y    GR+  GV  +     +G  CERC  + Y      FSD P +C
Sbjct: 903 DDETAACGSYTGEYVEGAGRVWVGVSRA-----DGSKCERC--WNYSPAVGSFSDHPTLC 955

Query: 402 LRCD 405
            RC 
Sbjct: 956 ERCY 959


>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
          Length = 281

 Score = 30.8 bits (70), Expect = 6.3
 Identities = 23/112 (20%), Positives = 43/112 (38%), Gaps = 20/112 (17%)

Query: 1300 ENATLLQENNVEGALELTRQAHDTSLKLKEQSTETEKQINDAERQCKRTENLVTKTFPQ- 1358
            E     +E   E  L+ T +A     KL+ Q    E++  + E + +R          + 
Sbjct: 70   EIVEAAEEEAKEELLKKTDEASSIIEKLQMQ---IEREQEEWEEELERLIEEAKAEGYEE 126

Query: 1359 -FTQGQEEN---------------EKALQRLNEKITSLEKNLPDLNELICDK 1394
             + +G+EE                EK +++  E + S E+ + +L   I  K
Sbjct: 127  GYEKGREEGLEEVRELIEQLNKIIEKLIKKREEILESSEEEIVELALDIAKK 178


>gnl|CDD|238105 cd00179, SynN, Syntaxin N-terminus domain; syntaxins are nervous
            system-specific proteins implicated in the docking of
            synaptic vesicles with the presynaptic plasma membrane;
            they are a family of receptors for intracellular
            transport vesicles; each target membrane may be
            identified by a specific member of the syntaxin family;
            syntaxins contain a moderately well conserved
            amino-terminal domain, called Habc, whose structure is an
            antiparallel three-helix bundle; a linker of about 30
            amino acids connects this to the carboxy-terminal region,
            designated H3 (t_SNARE), of the syntaxin cytoplasmic
            domain; the highly conserved H3 region forms a single,
            long alpha-helix when it is part of the core SNARE
            complex and anchors the protein on the cytoplasmic
            surface of cellular membranes; H3 is not included in
            defining this domain.
          Length = 151

 Score = 29.9 bits (68), Expect = 6.4
 Identities = 23/116 (19%), Positives = 43/116 (37%), Gaps = 14/116 (12%)

Query: 1524 ITKQLDEFLNAPGATLADIRNISGLAVNKNIQSNPEQIRERANEINNVIKSLTDIDTILT 1583
            I++ ++E        L        L   + ++S  ++I++ A EI   +K L + +    
Sbjct: 18   ISEDVEELQKLHSQLLTAPDADPELK--QELESLVQEIKKLAKEIKGKLKELEESNEQNE 75

Query: 1584 ETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVDVLTEARTAQ-DMAEVAIQTAK 1638
               G    + D  RK               +K V+V+TE   AQ    E   +  +
Sbjct: 76   ALNGS---SVDRIRKT--------QHSGLSKKFVEVMTEFNKAQRKYRERYKERIQ 120


>gnl|CDD|221919 pfam13094, CENP-Q, CENP-A-nucleosome distal (CAD) centromere subunit.
             CENP-Q is one of the components that assembles onto the
            CENP-A-nucleosome distal (CAD) centromere. The
            centromere, which is the basic element of chromosome
            inheritance, is epigenetically determined in mammals.
            CENP-A, the centromere-specific histone H3 variant,
            assembles an array of nucleosomes and it is this that
            seems to be the prime candidate for specifying centromere
            identity. CENP-A nucleosomes directly recruit a proximal
            CENP-A nucleosome associated complex (NAC) comprised of
            CENP-M, CENP-N and CENP-T, CENP-U(50), CENP-C and CENP-H.
            Assembly of the CENP-A NAC at centromeres is dependent on
            CENP-M, CENP-N and CENP-T. Additionally, there are seven
            other subunits which make up the CENP-A-nucleosome distal
            (CAD) centromere, CENP-K, CENP-L, CENP-O, CENP-P, CENP-Q,
            CENP-R and CENP-S, also assembling on the CENP-A NAC.
            Fta7 is the equivalent component of the fission yeast
            Sim4 complex.
          Length = 159

 Score = 30.0 bits (68), Expect = 6.5
 Identities = 15/48 (31%), Positives = 24/48 (50%)

Query: 1742 LLNKASQLSVNTTAKLKQLNETEAMFNSQESELTELSKNIAELQKRIQ 1789
            LL +   L     A+LKQL   E     +E+EL +  + + EL+K  +
Sbjct: 25   LLRRKRALERQLAAELKQLELLEEEIRREEAELEKDLEYLQELEKNAK 72


>gnl|CDD|236274 PRK08485, PRK08485, DNA polymerase III subunit delta'; Validated.
          Length = 206

 Score = 30.4 bits (69), Expect = 6.7
 Identities = 16/60 (26%), Positives = 29/60 (48%), Gaps = 7/60 (11%)

Query: 1274 INLARLALNDLTEEIKKLNKTGEMLKENATLLQENNVEGALELTRQAHDTSLKLKEQSTE 1333
            ++L +L L D+ E +K+L K  ++ KE    L E+       L ++     + L E+  E
Sbjct: 121  LDLKKLDLKDIYEFLKELEKENKLSKEELKELIES-------LLKECVKYKIPLNEEELE 173


>gnl|CDD|214362 CHL00098, tsf, elongation factor Ts.
          Length = 200

 Score = 30.0 bits (68), Expect = 7.5
 Identities = 40/181 (22%), Positives = 74/181 (40%), Gaps = 36/181 (19%)

Query: 1582 LTETAGDLAKANDLKRKANLTKAGADSTKNTVQKIVD--VLTEARTAQDMAEVAIQTAKD 1639
            L E  GD  KA +  R+  L  A   ST+ T + +++  + T  +    + E+  +T   
Sbjct: 23   LQEANGDFEKALESLRQKGLASANKKSTRITTEGLIESYIHTGGKLGV-LVEINCET--- 78

Query: 1640 DISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQAGFLQNGRSAMDVENEEKN 1699
            D  A R++  ++  ++  A Q +   NVK                    S  D+  E  N
Sbjct: 79   DFVARREEFQKLAKNI--AMQIAACPNVKYV------------------SLEDIPEEIIN 118

Query: 1700 LEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQMLLNKA----SQLSVNTTA 1755
            LEK++   +     L+++ +E   K++   E    KR +   LL++       ++V    
Sbjct: 119  LEKKIESEKDD---LQNKPEEIKEKIV---EGRIKKRLKELSLLDQPFIRDQSITVEELI 172

Query: 1756 K 1756
            K
Sbjct: 173  K 173


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
            family of proteins are involved in 60S ribosomal
            biogenesis. They are specifically involved in the
            processing beyond the 27S stage of 25S rRNA maturation.
            This family contains sequences that bear similarity to
            the glioma tumour suppressor candidate region gene 2
            protein (p60). This protein has been found to interact
            with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 30.4 bits (69), Expect = 8.6
 Identities = 26/129 (20%), Positives = 49/129 (37%), Gaps = 10/129 (7%)

Query: 1621 TEARTAQDMAEVAIQTAKDDISAARKDLSQITNDLDDAQQKSNETNVKVKLLQERLKSIQ 1680
             EA    +M+E  ++ + DD      D S       + +  +     K K   +R K   
Sbjct: 229  PEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKE-- 286

Query: 1681 AGFLQNGRSAMDVENEEKNLEKEVALAQKQASGLRSRYQETDNKLLSKAESSGLKRARGQ 1740
                   +   ++E E K  EK++     Q + L+   +E   K  ++A     ++ RG+
Sbjct: 287  -------KRRKELEREAK-EEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKERGE 338

Query: 1741 MLLNKASQL 1749
                K  +L
Sbjct: 339  KKKLKRRKL 347


>gnl|CDD|235588 PRK05743, ileS, isoleucyl-tRNA synthetase; Reviewed.
          Length = 912

 Score = 30.8 bits (71), Expect = 9.4
 Identities = 12/30 (40%), Positives = 15/30 (50%), Gaps = 1/30 (3%)

Query: 375 TEGRNCERCKQYFYKDPNKEFSDPEVCLRC 404
            EG  CERC  Y  +D   +   P +C RC
Sbjct: 878 AEGEKCERCWHYR-EDVGSDAEHPTLCGRC 906


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.317    0.134    0.422 

Gapped
Lambda     K      H
   0.267   0.0788    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 86,377,504
Number of extensions: 8177261
Number of successful extensions: 9534
Number of sequences better than 10.0: 1
Number of HSP's gapped: 9088
Number of HSP's successfully gapped: 453
Length of query: 1807
Length of database: 10,937,602
Length adjustment: 111
Effective length of query: 1696
Effective length of database: 6,014,308
Effective search space: 10200266368
Effective search space used: 10200266368
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 66 (29.1 bits)