RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy4474
         (2174 letters)



>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
            bacterial type.  SMC (structural maintenance of
            chromosomes) proteins bind DNA and act in organizing and
            segregating chromosomes for partition. SMC proteins are
            found in bacteria, archaea, and eukaryotes. This family
            represents the SMC protein of most bacteria. The smc gene
            is often associated with scpB (TIGR00281) and scpA genes,
            where scp stands for segregation and condensation
            protein. SMC was shown (in Caulobacter crescentus) to be
            induced early in S phase but present and bound to DNA
            throughout the cell cycle [Cellular processes, Cell
            division, DNA metabolism, Chromosome-associated
            proteins].
          Length = 1179

 Score = 69.3 bits (170), Expect = 2e-11
 Identities = 117/800 (14%), Positives = 297/800 (37%), Gaps = 132/800 (16%)

Query: 732  LERSKHIVDDIEKKKDDLKPIQKEAQ----------QLSEWDSSIVPKVSDLETRWAAVK 781
            L+R + I++++E++   LK ++++A+          +L E + ++      L  R   ++
Sbjct: 188  LDRLEDILNELERQ---LKSLERQAEKAERYKELKAELRELELAL------LVLRLEELR 238

Query: 782  SAWDAKHSDLSSEIAQYSSYQNAVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQHHE 841
               +    +L     +       +Q+ E+ L ++         L ++  ++ +E++Q   
Sbjct: 239  EELEELQEELKEAEEELEELTAELQELEEKLEELR--------LEVSELEEEIEELQK-- 288

Query: 842  KLLQDYSSYQNAVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQHHEKLLQDVILSPD 901
                +  +  N +   E+    +  +L           ++   Q++  E           
Sbjct: 289  ----ELYALANEISRLEQQKQILRERLANLE----RQLEELEAQLEELES---------- 330

Query: 902  NKLAPCFSAEEGSPIKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAK 961
                             K+D L +E     +++++  + L  L    +E E ++E +E++
Sbjct: 331  -----------------KLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESR 373

Query: 962  LKELEVAVEGDVKTNNVLMLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLT 1021
            L+ELE  +E        L LQ       ++      L               + +  +  
Sbjct: 374  LEELEEQLETLRSKVAQLELQIASLN-NEIERLEARLER--LEDRRERLQQEIEELLKKL 430

Query: 1022 VNDSVSSMQKRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVST 1081
                +  +Q    +L   +E+    +    ++++ L+ ++EEA+  L  AE +LA+ +  
Sbjct: 431  EEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQ-LQA 489

Query: 1082 QAEDISEALTVYQKLLDEINSWKNSQSDEDLVH--LADSIKP-------LDEVV-ERIEN 1131
            + + +       +   + + +   +QS    +   L++ I         ++  +  R++ 
Sbjct: 490  RLDSLERLQENLEGFSEGVKALLKNQSGLSGILGVLSELISVDEGYEAAIEAALGGRLQA 549

Query: 1132 HAVPKLKSLLLLREQFTTLIMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQE 1191
              V  L +            +  V F+   + +  E+ GN   + + I+ +  +  D+ +
Sbjct: 550  VVVENLNAAKKAIAFLKQNELGRVTFLPLDSIKGTEIQGNDREILKNIEGFLGVAKDLVK 609

Query: 1192 YEA-----------------ILATA--------------SDKGDQLSSDGTISDRNEITE 1220
            ++                   L  A              +  GD +   G I+  +  T 
Sbjct: 610  FDPKLRKALSYLLGGVLVVDDLDNALELAKKLRPGYRIVTLDGDLVRPGGVITGGSAKTN 669

Query: 1221 Q-LQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSR-LH 1278
              +  R++++  L + +E+L ++  +      +L  ELEE+ E L      +   SR + 
Sbjct: 670  SSILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQIS 729

Query: 1279 NLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVELDK 1338
             LRK + +L  + E+      +L  EL E+   +   E  ++     L E  +   EL+ 
Sbjct: 730  ALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEA 789

Query: 1339 QRLELATKEVQPYVQRYETLKKNIAEAIDKFETQANE-HEAYKQAYNEAYDWLRKAKLGA 1397
            Q            +++ +   K + EA+D+   +    +E             R+     
Sbjct: 790  Q------------IEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATE 837

Query: 1398 QANADCHGEQQTTKDKADKIKQITKSLPEGQKLIDKTVALKNNVLESTGPIGKENINQEI 1457
            +   D    ++  ++ ++ I+ +   + E ++LI++  +    +L       + ++ + +
Sbjct: 838  RRLEDL---EEQIEELSEDIESLAAEIEELEELIEELESELEALLNE-----RASLEEAL 889

Query: 1458 NQITLDWTNLQNTLQDIDKH 1477
              +  +   L   L++++  
Sbjct: 890  ALLRSELEELSEELRELESK 909



 Score = 62.4 bits (152), Expect = 3e-09
 Identities = 128/818 (15%), Positives = 308/818 (37%), Gaps = 101/818 (12%)

Query: 635  MTVLRDSWDRLNRDLKSTSTQLKSYIARWNELDDLYNRFNMWLSGVENKLNEPSPVFTEI 694
            +  LR+  + L  +LK    +L+   A   EL++      + +S +E ++ E        
Sbjct: 234  LEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEE-------- 285

Query: 695  DLNTKWQQYVNDELEWENHSQNTLQWLDNIRSDMKTLLERSKHIVDDIEKKKDDLKPIQK 754
             L  +     N+    E   Q   + L N+   ++ L  + + +   +++  ++L  +++
Sbjct: 286  -LQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEE 344

Query: 755  EAQQLSEWDSSIVPKVSDLETRWAAVKSAWDAKHSDLSSEIAQYSSYQNAVQDAEKWLLQ 814
            + ++L E   S+  ++ +LE     ++S  +                +  ++     + Q
Sbjct: 345  KLEELKEELESLEAELEELEAELEELESRLEEL--------------EEQLETLRSKVAQ 390

Query: 815  ISFQLMAHNSLYINNRQQTLEQIQHH-EKLLQDYSSYQNAVQDAEKWLLQISF----QLM 869
            +  Q+ + N+       + LE+++   E+L Q+       +++AE   LQ       + +
Sbjct: 391  LELQIASLNNEIERLEAR-LERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEEL 449

Query: 870  AHNSLYINNRQQTLEQIQHHEKLLQDVILSPDNKL----APCFSAEEGSPIKDKIDRLEK 925
                  +   ++ LE+++   +  +  + + + +L    A   S E      +      K
Sbjct: 450  EELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVK 509

Query: 926  ETKAQSDRIQDALKTLNQLIGHRKEFEVDVENV-EAKLKELEV--------AVEGDVKTN 976
                    +   L  L++LI   + +E  +E     +L+ + V        A+    +  
Sbjct: 510  ALLKNQSGLSGILGVLSELISVDEGYEAAIEAALGGRLQAVVVENLNAAKKAIAFLKQNE 569

Query: 977  N--VLMLQELLNKYAQLNEEAGE-LNMMISNVSIATEHMNLNDADRLTVN---------- 1023
               V  L     K  ++     E L  +   + +A + +  +   R  ++          
Sbjct: 570  LGRVTFLPLDSIKGTEIQGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYLLGGVLVVD 629

Query: 1024 --DSVSSMQKRYAQLSRTI-----------------EDRISTVNNKLQSVKRLQHKVEEA 1064
              D+   + K+     R +                     S++  + + ++ L+ K+EE 
Sbjct: 630  DLDNALELAKKLRPGYRIVTLDGDLVRPGGVITGGSAKTNSSILERRREIEELEEKIEEL 689

Query: 1065 KALLQKAESQLARPVSTQAEDISEALTVYQKLLDEINSWKNSQSDEDLVHLADSIKPLDE 1124
            +  + + E  LA  +  + E++ E L   +K L+E++          +  L   +  L+ 
Sbjct: 690  EEKIAELEKALAE-LRKELEELEEELEQLRKELEELS--------RQISALRKDLARLEA 740

Query: 1125 VVERIENHAVPKLKSLLLLREQFTTLIMQIVGFITETTARVGEVDGNSATVQEKIDKY-- 1182
             VE++E     ++  L     +    I ++   + E    + E +     ++ +I++   
Sbjct: 741  EVEQLEE----RIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKE 796

Query: 1183 --DKIIADIQEYEAILATASDKGDQLSSDGTISDRN--EITEQLQSRKQQLHNLRKTVEK 1238
                +   + E  A L   +++   L       +R       +L+  ++Q+  L + +E 
Sbjct: 797  ELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIES 856

Query: 1239 LRQQNEKRAAESEKLGAELEEIIEALHARETAVPS-SSRLHNLRKTVEKLRQQNEKRAAE 1297
            L  + E+     E+L +ELE ++    + E A+    S L  L + + +L  +  +   E
Sbjct: 857  LAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRE 916

Query: 1298 SEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVELDKQRLELATKEVQPYVQRYET 1357
             E+L  +L ++   L   E  +  L   L E  S+ +E  +        + +   +R + 
Sbjct: 917  LEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKR 976

Query: 1358 LKKNIAE-------AIDKFETQANEHEAYKQAYNEAYD 1388
            L+  I E       AI+++E     ++       +  +
Sbjct: 977  LENKIKELGPVNLAAIEEYEELKERYDFLTAQKEDLTE 1014



 Score = 47.0 bits (112), Expect = 1e-04
 Identities = 62/279 (22%), Positives = 117/279 (41%), Gaps = 34/279 (12%)

Query: 1771 ELKTTLDEKKAQLQVYRTIYQDATSHQQDLLQLKDKIESLHQPSEQSKQQLATITTRHGN 1830
            EL+  + E +  L   R   ++    +++L QL+ ++E L +     ++ LA +      
Sbjct: 688  ELEEKIAELEKALAELRKELEEL---EEELEQLRKELEELSRQISALRKDLARLEAEVEQ 744

Query: 1831 VLKR-AQKKIE-AEKDIGDCTNLDDLEKYKALLQEVISHNSDMETLNDKCEALMELAAHS 1888
            + +R AQ   E  E +       + LE+ +  L E  +    +E L  + E L E     
Sbjct: 745  LEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAE---IEELEAQIEQLKE--ELK 799

Query: 1889 PIREETLRLQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECDE 1948
             +RE    L+A    L                   E    +E +E+   +  AT +  ++
Sbjct: 800  ALREALDELRAELTLL-----------------NEEAANLRERLESLERRIAATERRLED 842

Query: 1949 LEAGIKEEQKNIETQYLQWNSFRETLAQMTAWLDSVEKNIKQEIATPWTTTQELRSKLLK 2008
            LE  I+E  ++IE+   +     E + ++ + L+++      E A+       LRS+L +
Sbjct: 843  LEEQIEELSEDIESLAAEIEELEELIEELESELEALLN----ERASLEEALALLRSELEE 898

Query: 2009 LKEILKKCEKGVGDHEAFVDKYEECSQKLAQVEAKYAKL 2047
            L E L++ E    +      + EE  +KLAQ+E +   L
Sbjct: 899  LSEELRELESKRSELR---RELEELREKLAQLELRLEGL 934



 Score = 47.0 bits (112), Expect = 1e-04
 Identities = 50/298 (16%), Positives = 118/298 (39%), Gaps = 20/298 (6%)

Query: 1109 DEDLVHLADSIKPLDEVVERIENHAVPKLKSLLLLREQFTTLIMQIVGFITETTARVGEV 1168
             E+L  L D +  L+  ++ +E  A  K +    L+ +   L  ++   +        E+
Sbjct: 185  RENLDRLEDILNELERQLKSLERQAE-KAERYKELKAELREL--ELALLVLRLEELREEL 241

Query: 1169 DGNSATVQEKIDKYDKIIADIQEYEAILATASDK------------GDQLSSDGTISDRN 1216
            +     ++E  ++ +++ A++QE E  L     +             +  +    IS   
Sbjct: 242  EELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLE 301

Query: 1217 EITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSR 1276
            +  + L+ R   L    + +E   ++ E +  E  +  AELEE +E L     ++   + 
Sbjct: 302  QQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESL--EAE 359

Query: 1277 LHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVEL 1336
            L  L   +E+L  + E+   + E L +   ++ +      +    +  L   +  ++   
Sbjct: 360  LEELEAELEELESRLEELEEQLETLRS---KVAQLELQIASLNNEIERLEARLERLEDRR 416

Query: 1337 DKQRLELATKEVQPYVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNEAYDWLRKAK 1394
            ++ + E+     +      + L+  + E  ++ E    E E  ++A  E  + L +A+
Sbjct: 417  ERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAE 474



 Score = 40.0 bits (94), Expect = 0.016
 Identities = 66/379 (17%), Positives = 151/379 (39%), Gaps = 31/379 (8%)

Query: 731  LLERSKHIVD---DIEKKKDDLKPIQKEAQQLSEWDSSIVPKVSDLETRWAAVKSAWDAK 787
            +LER + I +    IE+ ++ +  ++K   +L +    +  ++  L      +     A 
Sbjct: 672  ILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISAL 731

Query: 788  HSDLSSEIAQYSSYQNAVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQHHEKLLQDY 847
              DL+   A+    +  +    K L ++  ++           ++  E     E+L    
Sbjct: 732  RKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQI 791

Query: 848  SSYQNAVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQHHEKLLQDVILSPDNKLAPC 907
               +  ++   + L ++  +L   N     N ++ LE ++      +  +   + ++   
Sbjct: 792  EQLKEELKALREALDELRAELTLLNEEA-ANLRERLESLERRIAATERRLEDLEEQIEE- 849

Query: 908  FSAEEGSPIKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELEV 967
              +E+   +  +I+ LE+  +     ++  L     L         ++E +  +L+ELE 
Sbjct: 850  -LSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELES 908

Query: 968  AVEGDVKTNNVLMLQELLNKYAQLNEEAGELNMMISNV-SIATEH--MNLNDADRLTVN- 1023
                +++      L+EL  K AQL      L + I N+    +E   + L +A+ L    
Sbjct: 909  KR-SELRRE----LEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKI 963

Query: 1024 -DSVSSMQKRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQ 1082
             D     ++R  +L   I   +  VN  L +       +EE + L ++ +      ++ Q
Sbjct: 964  EDDEEEARRRLKRLENKI-KELGPVN--LAA-------IEEYEELKERYDF-----LTAQ 1008

Query: 1083 AEDISEALTVYQKLLDEIN 1101
             ED++EA    ++ ++EI+
Sbjct: 1009 KEDLTEAKETLEEAIEEID 1027



 Score = 37.7 bits (88), Expect = 0.087
 Identities = 62/325 (19%), Positives = 127/325 (39%), Gaps = 29/325 (8%)

Query: 1732 KKQQRELKGKLSSWCGYEESVATLLDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRTIYQ 1791
            +++  EL+ +L      EE +  L   L+  E+KL    EL+  + E + +++  +    
Sbjct: 238  REELEELQEELKE---AEEELEELTAELQELEEKLE---ELRLEVSELEEEIEELQKELY 291

Query: 1792 DATSHQQDLLQLKDKIESLHQPSEQSKQQLATITTRHGNVLKRAQKKIEA-EKDIGDCTN 1850
               +    L Q K  +       E+  ++L        + L    +++   E+ + +   
Sbjct: 292  ALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKE 351

Query: 1851 LDDLEKYKALLQEVISHNSDMETLNDKCEALMELAAHSPIREETLRLQALYA---ALVTS 1907
              +LE  +A L+E+ +   ++E+  ++ E  +E    S + +  L++ +L      L   
Sbjct: 352  --ELESLEAELEELEAELEELESRLEELEEQLE-TLRSKVAQLELQIASLNNEIERLEAR 408

Query: 1908 VQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECDELEAGIKEEQKNIETQYLQW 1967
            ++ L  + E+   +  E L K EE      +      E +ELE  ++E Q+ +E      
Sbjct: 409  LERLEDRRERLQQEIEELLKKLEE-----AELKELQAELEELEEELEELQEELERLEEAL 463

Query: 1968 NSFRETLAQMTAWLDSVEKNIKQ---EIATPWTTTQELRSKLLKLKEILKKCEKGVGDHE 2024
               RE L +    LD+ E+ + Q    + +     + L      +K +LK      G   
Sbjct: 464  EELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGILG 523

Query: 2025 AFVDKYEECSQKLAQVEAKYAKLIE 2049
               +           V+  Y   IE
Sbjct: 524  VLSELIS--------VDEGYEAAIE 540



 Score = 35.0 bits (81), Expect = 0.52
 Identities = 42/281 (14%), Positives = 106/281 (37%), Gaps = 23/281 (8%)

Query: 1888 SPIREETLRLQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECD 1947
              ++EE    +     L   +Q L  ++E+   + +E   + EE++  L+     +   +
Sbjct: 242  EELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLE 301

Query: 1948 ELEAGIKEEQKNIETQYLQWNSFRETLAQMTAWLDSVEKNIKQEIATPWTTTQELRSKLL 2007
            + +  ++E   N+E Q  +  +  E L            +   E        ++L     
Sbjct: 302  QQKQILRERLANLERQLEELEAQLEELESK--------LDELAEELAEL--EEKLEELKE 351

Query: 2008 KLKEILKKCEKGVGDHEAFVDKYEECSQKLAQVEAKYAKLIEPETSYEELQKRQGQLTAL 2067
            +L+ +  + E+   + E    + EE  ++L  + +K A+L        ++     ++  L
Sbjct: 352  ELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQL------ELQIASLNNEIERL 405

Query: 2068 LTEKVDMDVLLNACVDLCDKVYESTSEPGHEPLRVQMEKLQQAVEALYDKITVTESLILS 2127
                  ++          +++ +   E   +    ++++LQ  +E L +++   +  +  
Sbjct: 406  EARLERLEDRR-------ERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELER 458

Query: 2128 QVEAKYAKLIEPETSYEELQKRQGQLTALLAEKVDMDVLLN 2168
              EA      E E + + L   + +L  L A    ++ L  
Sbjct: 459  LEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQE 499



 Score = 35.0 bits (81), Expect = 0.54
 Identities = 70/376 (18%), Positives = 153/376 (40%), Gaps = 55/376 (14%)

Query: 1517 NLDDLEKYKALLQEGLHKEADNKKVLIDSFNKGIIGMTELESEEVK-----DLIQVGVHG 1571
             +++LE+    L+E   K A+ +K L +   +    + ELE E  +     + +   +  
Sbjct: 678  EIEELEEKIEELEE---KIAELEKALAELRKE----LEELEEELEQLRKELEELSRQISA 730

Query: 1572 LQVELNALLQNIEAEINKVANAAQERKAMQDKISKLQSWLKQYIDYSGNKQALQSRLDKV 1631
            L+ +L  L   +E    ++A  ++E   ++ +I +L+  L++       ++ L     ++
Sbjct: 731  LRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEE------AEEELAEAEAEI 784

Query: 1632 NEIQESFPEISTKLQTLSDHIENSSRKLPSRTNEAMYRDLANLRYDFEKCVSSLSDVKQG 1691
             E++    ++  +L+ L + ++    +L +  NE    +L       E+ +++     + 
Sbjct: 785  EELEAQIEQLKEELKALREALDELRAEL-TLLNEE-AANLRERLESLERRIAATERRLED 842

Query: 1692 LESRLVQWNEYETSLGKLITWLNDTENVLKNFTLQPTLNEKKQQRELKGKLSSWCGYEES 1751
            LE ++ + +E   SL   I  L +    L++  L+  LNE+    E    L S    EE 
Sbjct: 843  LEEQIEELSEDIESLAAEIEELEELIEELES-ELEALLNERASLEEALALLRS--ELEEL 899

Query: 1752 VATLLDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRTIYQDATSHQQDLLQLKDKIESLH 1811
               L +     E K     EL+  L+E + +L       +     +  +  L++++   +
Sbjct: 900  SEELREL----ESKR---SELRRELEELREKLAQLELRLEGL---EVRIDNLQERLSEEY 949

Query: 1812 QPSEQSKQQLATITTRHGNVLKRAQKKIEAE-KDIGDCTNL------------------- 1851
              + +  + L           +R  K++E + K++G   NL                   
Sbjct: 950  SLTLEEAEALENKIEDDEEEARRRLKRLENKIKELGP-VNLAAIEEYEELKERYDFLTAQ 1008

Query: 1852 -DDLEKYKALLQEVIS 1866
             +DL + K  L+E I 
Sbjct: 1009 KEDLTEAKETLEEAIE 1024



 Score = 33.1 bits (76), Expect = 2.1
 Identities = 49/272 (18%), Positives = 102/272 (37%), Gaps = 13/272 (4%)

Query: 417 ENINQEINQITLDWTNLQNTLQVIDKHHAKCLSIWNDFLSSKNTLEKWIEGFQKKIE-AE 475
           E + ++I ++      L+  L  + K   +              L + I   +K +   E
Sbjct: 680 EELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLE 739

Query: 476 KDIGDCTN-LDDLEKYKALLQEVISHNSDMETLNDKCEALMELAAHSPIREETLRLQALY 534
            ++      +  L K    L+  I    +   L +  E L E  A   I E   +++ L 
Sbjct: 740 AEVEQLEERIAQLSKELTELEAEIEELEER--LEEAEEELAEAEAE--IEELEAQIEQLK 795

Query: 535 AALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECAGNGSKNVLKERLDT 594
             L    + L   +   L+   E  A   E    L +  A  +    +  +  ++E  + 
Sbjct: 796 EELKALREALDE-LRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQ-IEELSED 853

Query: 595 VNMVAERIPEGQHLMSVLQDTFTKALDTTPSDQQDSLREAMTVLRDSWDRLNRDLKSTST 654
           +  +A  I E + L+  L+      L+     ++ SL EA+ +LR   + L+ +L+   +
Sbjct: 854 IESLAAEIEELEELIEELESELEALLN-----ERASLEEALALLRSELEELSEELRELES 908

Query: 655 QLKSYIARWNELDDLYNRFNMWLSGVENKLNE 686
           +         EL +   +  + L G+E +++ 
Sbjct: 909 KRSELRRELEELREKLAQLELRLEGLEVRIDN 940


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
            chromosome partitioning].
          Length = 1163

 Score = 62.0 bits (151), Expect = 4e-09
 Identities = 143/822 (17%), Positives = 321/822 (39%), Gaps = 109/822 (13%)

Query: 1177 EKIDKYDKIIADIQEYEAILATASDKGDQLSSDGTISDRNEITEQLQSRKQQLHNLRKTV 1236
            EK ++Y ++ A+++E E  L               ++   E+ ++L+  +++L  L + +
Sbjct: 210  EKAERYQELKAELRELELALL--------------LAKLKELRKELEELEEELSRLEEEL 255

Query: 1237 EKLRQQNEKRAAESEKLGAELEEIIEALHARETAV-PSSSRLHNLRKTVEKLRQQNEKRA 1295
            E+L+++ E+   E E+L +ELEE+ E L   +  +      +  L   +  LR++ E+  
Sbjct: 256  EELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELE 315

Query: 1296 AESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVELDKQRLELATKEVQPYVQRY 1355
             E E+L   LEE+ E + A +  ++    LL E+  +  EL++ + EL  K     ++  
Sbjct: 316  NELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSA-LLEEL 374

Query: 1356 ETLKKNIAEAIDKFETQANEHEAYKQAYNEAYDWLRKAKLGAQANADCHGEQQTTKDKAD 1415
            E L + + E + + E +  E     +      + L +         +   E+   K+   
Sbjct: 375  EELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEEL--KELEA 432

Query: 1416 KIKQITKSLPEGQKLIDKTVALKNNVLESTGPIGKENINQEINQITLDWTNLQNTLQDID 1475
            +++++   L E  + +++       + +       + + +E+ ++  +   L+  L  ++
Sbjct: 433  ELEELQTELEELNEELEELEEQLEELRDR-----LKELERELAELQEELQRLEKELSSLE 487

Query: 1476 KHHAKCLSLWNDFLSSKNTLEKWIEGFQKKIESEKDIGDCTNLDDLEKYKALLQEGLHKE 1535
                +  +        +  LE    G         ++         EKY+  L+  L   
Sbjct: 488  ARLDRLEAEQRASQGVRAVLEALESGLPGVYGPVAELIKVK-----EKYETALEAALGNR 542

Query: 1536 ADN--------KKVLIDSFNKGIIG-MTELESEEVKDLIQV------GVHGLQVEL---- 1576
                        K  I+   +   G  T L  + +K L  +      G  GL  +L    
Sbjct: 543  LQAVVVENEEVAKKAIEFLKENKAGRATFLPLDRIKPLRSLKSDAAPGFLGLASDLIDFD 602

Query: 1577 -------NALLQN-IEAEINKVANAAQERKAMQDKISKLQS--WLKQYIDYSGNKQALQS 1626
                     +L + +  +  + A     +  ++ +I  L             G++    S
Sbjct: 603  PKYEPAVRFVLGDTLVVDDLEQARRLARKLRIKYRIVTLDGDLVEPSGSITGGSRNKRSS 662

Query: 1627 --RLDKVNEIQESFPEISTKLQTLSDHIENSSRKLPSRTNEAMYRDLANLRYDFEKCVSS 1684
              +  ++ E++E   E+  +L+ L + +++   +L S  +     +L     + E+ +  
Sbjct: 663  LAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDL--LEELRRQLEELERQLEE 720

Query: 1685 LSDVKQGLESRLVQWNEYETSLGKLITWLNDTENVLKNFTLQPTLNEKKQQRELKGKLSS 1744
            L      LE  L Q       L + +  L +         LQ  L E +++ E       
Sbjct: 721  LKRELAALEEELEQLQSRLEELEEELEELEEELE-----ELQERLEELEEELE------- 768

Query: 1745 WCGYEESVATLLDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRTIYQDATSHQQDLLQLK 1804
                            + E     + E++   ++++A  +    + ++    ++ L  L+
Sbjct: 769  ---------------SLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALE 813

Query: 1805 DKIESLHQPSEQSKQQLATITTRHGNVLKRAQKKIEAEKDIGDCTNLDDLEKYKALLQEV 1864
             ++ESL Q  E+ +Q++  +       ++  ++K++  ++      L++LEK    L+E 
Sbjct: 814  RELESLEQRRERLEQEIEELEEE----IEELEEKLDELEE-----ELEELEKELEELKEE 864

Query: 1865 IS-HNSDMETLNDKCEALMELAAHSPIREETLRLQALYAALVTSVQGLVSQVEKNLSDQT 1923
            +    ++ E L D+ + L          EE   L+     L + +  L  ++EK      
Sbjct: 865  LEELEAEKEELEDELKEL---------EEEKEELEEELRELESELAELKEEIEKLRERLE 915

Query: 1924 EFLAKKEEVEAWLHKAHATVQEC--DELEAGIKEEQKNIETQ 1963
            E  AK E +E  L +    ++E   D LE  ++ E + +E +
Sbjct: 916  ELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEE 957



 Score = 57.4 bits (139), Expect = 9e-08
 Identities = 64/339 (18%), Positives = 149/339 (43%), Gaps = 14/339 (4%)

Query: 1055 KRLQHKVEEAKALLQKAESQLARPVSTQAEDISEALTVYQKLLDEINSWKNSQSDEDLVH 1114
            K L+ ++ E +A L+K E +L   +  +   + + L   ++ L+E+   +  +   +L  
Sbjct: 670  KELEEELAELEAQLEKLEEELKS-LKNELRSLEDLLEELRRQLEEL-ERQLEELKRELAA 727

Query: 1115 LADSIKPLDEVVERIENHAVPKLKSLLLLREQFTTLIMQIVGF---ITETTARVGEVDGN 1171
            L + ++ L   +E +E       + L  L+E+   L  ++      + +    + E++  
Sbjct: 728  LEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEK 787

Query: 1172 SATVQEKIDKYDKIIADIQEYEAILATASDKGDQLSSDGTISDRNEITEQLQSRKQQLHN 1231
               +QE++++ ++ + + +           +  +   +    +  E+ E+++  +++L  
Sbjct: 788  RQALQEELEELEEELEEAER-RLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDE 846

Query: 1232 LRKTVEKLRQQNEKRAAESEKLGAELEEIIEAL-HARETAVPSSSRLHNLRKTVEKLRQQ 1290
            L + +E+L ++ E+   E E+L AE EE+ + L    E        L  L   + +L+++
Sbjct: 847  LEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEE 906

Query: 1291 NEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVELDKQRLELATKEVQP 1350
             EK     E+L A+LE +   L   E  ++      LE    ++E + +RLE   + + P
Sbjct: 907  IEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLE---TELEREIERLEEEIEALGP 963

Query: 1351 Y----VQRYETLKKNIAEAIDKFETQANEHEAYKQAYNE 1385
                 ++ YE +++   E   + E      E   +   E
Sbjct: 964  VNLRAIEEYEEVEERYEELKSQREDLEEAKEKLLEVIEE 1002



 Score = 55.5 bits (134), Expect = 3e-07
 Identities = 134/774 (17%), Positives = 308/774 (39%), Gaps = 95/774 (12%)

Query: 627  QQDSLREAMTVLRDSWDRLNRDLKSTSTQLKSYIARWNELDDLYNRFNMWLSGVENKLNE 686
            + + L E ++ L +  + L  +L+    +++   +   EL +        L  ++ ++ E
Sbjct: 240  ELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEE 299

Query: 687  PSPVFTEIDLNTKWQQYVNDELEWENHSQNTLQWLDNIRSDMKTLLERSKHIVDDIEKKK 746
               +  EI L  +  + + +ELE     +  L+ L      +K  LE  + +++++E+  
Sbjct: 300  ---LEGEISLLRERLEELENELE---ELEERLEELKEKIEALKEELEERETLLEELEQLL 353

Query: 747  DDLKPIQKEAQQ-LSEWDSSIVPKVSDLETRWAAVKSA---WDAKHSDLSSEIAQYSSYQ 802
             +L+  ++E ++ LS     +      L    A +++       +  +L  EI       
Sbjct: 354  AELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERL 413

Query: 803  N----AVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQHHEKLL-------QDYSSYQ 851
                  ++D ++ L ++  +L    +      ++  E  +  E+L        ++ +  Q
Sbjct: 414  ERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQ 473

Query: 852  NAVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQHHEKLLQDVI------LSPDNKLA 905
              +Q  EK L  +  +L    +      Q     ++  E  L  V       +    K  
Sbjct: 474  EELQRLEKELSSLEARLDRLEAE-QRASQGVRAVLEALESGLPGVYGPVAELIKVKEKYE 532

Query: 906  PCFSAEEGSPIKDKI---------------------------DRLEKETKAQSDRIQDAL 938
                A  G+ ++  +                           DR++     +SD     L
Sbjct: 533  TALEAALGNRLQAVVVENEEVAKKAIEFLKENKAGRATFLPLDRIKPLRSLKSDAAPGFL 592

Query: 939  KTLNQLIGHRKEFEVDVENV------------------EAKLKELEVAVEGDV------K 974
               + LI    ++E  V  V                  + ++K   V ++GD+       
Sbjct: 593  GLASDLIDFDPKYEPAVRFVLGDTLVVDDLEQARRLARKLRIKYRIVTLDGDLVEPSGSI 652

Query: 975  TNNVLMLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTVNDSVSSMQKRYA 1034
            T      +  L +  +L E   EL  + + +    E +     +  ++ D +  ++++  
Sbjct: 653  TGGSRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLE 712

Query: 1035 QLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISEALTVYQ 1094
            +L R +E+    +    + +++LQ ++EE +  L++ E +L   +  + E++ E L   +
Sbjct: 713  ELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEE-LQERLEELEEELESLE 771

Query: 1095 KLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIENHAVP---KLKSLLLLREQFTTLI 1151
            + L ++   +  + +E    L + ++ L+E +E  E        +L+SL   RE+    I
Sbjct: 772  EALAKLKE-EIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEI 830

Query: 1152 MQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASDKGDQLSSDGT 1211
             ++   I E   ++ E++     +++++   +++  +++E EA      D+  +L     
Sbjct: 831  EELEEEIEELEEKLDELEEELEELEKEL---EELKEELEELEAEKEELEDELKELE---- 883

Query: 1212 ISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAV 1271
              ++ E+ E+L+  + +L  L++ +EKLR++ E+  A+ E+L  EL E+ E L       
Sbjct: 884  -EEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDT 942

Query: 1272 PSSS---RLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTL 1322
              +     +  L + +E L   N +   E E++    EE+       E A + L
Sbjct: 943  LETELEREIERLEEEIEALGPVNLRAIEEYEEVEERYEELKSQREDLEEAKEKL 996



 Score = 54.3 bits (131), Expect = 8e-07
 Identities = 67/357 (18%), Positives = 144/357 (40%), Gaps = 34/357 (9%)

Query: 1051 LQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISEALTVYQKLLDEINSWKNSQSDE 1110
             + ++R +  +E  + LL++ E QL   +  QAE        YQ+L  E+   + +    
Sbjct: 178  ERKLERTEENLERLEDLLEELEKQL-EKLERQAEKAER----YQELKAELRELELALLLA 232

Query: 1111 DLVHLADSIKPLDEVVERIENH----------AVPKLKSLLLLREQFTTLIMQIVGFITE 1160
             L  L   ++ L+E + R+E            A  +++ L    E+    + ++   + E
Sbjct: 233  KLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLE 292

Query: 1161 TTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASDKGDQLSSDGTISDRNEITE 1220
                + E++G  + ++E++++ +    +++E E  L    +K + L  +       E   
Sbjct: 293  LKEEIEELEGEISLLRERLEELEN---ELEELEERLEELKEKIEALKEELE-----ERET 344

Query: 1221 QLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHNL 1280
             L+  +Q L  L +  E+L ++      E E+L   L E +  L A    +   + L  L
Sbjct: 345  LLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEI--RNELEEL 402

Query: 1281 RKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSV-------- 1332
            ++ +E L ++ E+ +   E L  EL+E+   L   +T ++ L   L E+           
Sbjct: 403  KREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRL 462

Query: 1333 -DVELDKQRLELATKEVQPYVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNEAYD 1388
             ++E +   L+   + ++  +   E     +       +      EA +      Y 
Sbjct: 463  KELERELAELQEELQRLEKELSSLEARLDRLEAEQRASQGVRAVLEALESGLPGVYG 519



 Score = 53.6 bits (129), Expect = 1e-06
 Identities = 82/421 (19%), Positives = 167/421 (39%), Gaps = 70/421 (16%)

Query: 1653 ENSSRKLPSRTNEAMYRDLANLRYDFEKCVSSLSDVKQGLESRLVQWNEYETSLGKLITW 1712
               S     R  + +  +LA L    EK    L  +K  L S      E    L +L   
Sbjct: 658  NKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEEL--- 714

Query: 1713 LNDTENVLKNFTLQPTLNEKKQQRELKGKLSSWCGYEESVATLLDWLKVTEKKLGDEMEL 1772
                               ++Q  ELK +L++    EE +  L   L+  E++L +  E 
Sbjct: 715  -------------------ERQLEELKRELAA---LEEELEQLQSRLEELEEELEELEEE 752

Query: 1773 KTTLDEKKAQLQVYRTIYQDATSHQQDLLQLKDKIESLHQPSEQSKQQLATITTRHGNVL 1832
               L E+  +L+      ++  S ++ L +LK++IE L +  +  +++L  +       L
Sbjct: 753  LEELQERLEELE------EELESLEEALAKLKEEIEELEEKRQALQEELEELEE----EL 802

Query: 1833 KRAQKKIEAEKDIGDCTNLDDLEKYKALLQEVISHNSDMETLNDKCEALMELAAHSPIRE 1892
            + A+++++                  AL +E+ S     E L  + E L          E
Sbjct: 803  EEAERRLD------------------ALERELESLEQRRERLEQEIEEL---------EE 835

Query: 1893 ETLRLQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECDELEAG 1952
            E   L+     L   ++ L  ++E+   +  E  A+KEE+E  L +     +E +E    
Sbjct: 836  EIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRE 895

Query: 1953 IKEEQKNIETQYLQWNSFRETLAQMTAWLDSVEKNIKQEIATPW--TTTQELRSKLLKLK 2010
            ++ E   ++ +  +     E L      L+     +++E+   +  T   EL  ++ +L+
Sbjct: 896  LESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLE 955

Query: 2011 EILKKCEKGVGDHEAFVDKYEECSQKLAQVEAKYAKLIEPETSYEELQKRQGQLTALLTE 2070
            E ++     V      +++YEE  ++  +++++   L E   + E+L +   +L     E
Sbjct: 956  EEIEALGP-VN--LRAIEEYEEVEERYEELKSQREDLEE---AKEKLLEVIEELDKEKRE 1009

Query: 2071 K 2071
            +
Sbjct: 1010 R 1010



 Score = 48.6 bits (116), Expect = 5e-05
 Identities = 46/237 (19%), Positives = 102/237 (43%), Gaps = 10/237 (4%)

Query: 1158 ITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASDKGDQLSSDGTISDRNE 1217
            + +    +  +     ++++ +++  + + +++     L       ++       S   E
Sbjct: 683  LEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQ-LQSRLEE 741

Query: 1218 ITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSS-- 1275
            + E+L+  +++L  L++ +E+L ++ E       KL  E+EE+ E   A +  +      
Sbjct: 742  LEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEE 801

Query: 1276 ------RLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEV 1329
                  RL  L + +E L Q+ E+   E E+L  E+EE+ E L   E  ++ L   L E+
Sbjct: 802  LEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEEL 861

Query: 1330 TSVDVELDKQRLELATKEVQPYVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNEA 1386
                 EL+ ++ EL  +  +   +  E L++ + E   +      E E  ++   E 
Sbjct: 862  KEELEELEAEKEELEDELKE-LEEEKEELEEELRELESELAELKEEIEKLRERLEEL 917



 Score = 44.3 bits (105), Expect = 8e-04
 Identities = 128/855 (14%), Positives = 315/855 (36%), Gaps = 123/855 (14%)

Query: 788  HSDLSSEIAQYSSYQNAVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQHHEKLLQDY 847
             + L     +    +  +   E+ L ++  +L            + L+     E+L ++ 
Sbjct: 231  LAKLKELRKELEELEEELSRLEEELEELQEELEE-----AEKEIEELKSE--LEELREEL 283

Query: 848  SSYQNAVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQHHEKLLQDVILSPDNKLAPC 907
               Q  + + ++ + ++  ++ +     +   +  LE+++   + L++ I +   +L   
Sbjct: 284  EELQEELLELKEEIEELEGEI-SLLRERLEELENELEELEERLEELKEKIEALKEEL--- 339

Query: 908  FSAEEGSPIKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELEV 967
               EE   + +++++L  E +   + +++ L  L + +      E   E +  +L ELE 
Sbjct: 340  ---EERETLLEELEQLLAELEEAKEELEEKLSALLEEL------EELFEALREELAELE- 389

Query: 968  AVEGDVKTNNVLMLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTVNDSVS 1027
                           EL     +L E   E+  +   +   +E +     +   +   + 
Sbjct: 390  --------------AELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELE 435

Query: 1028 SMQKRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDIS 1087
             +Q    +L+  +E+    +      +K L+ ++ E +  LQ+ E +L    S+    + 
Sbjct: 436  ELQTELEELNEELEELEEQLEELRDRLKELERELAELQEELQRLEKEL----SSLEARLD 491

Query: 1088 EALTVYQKLLDEINSWKNSQSDEDLVH--LADSIKPLDEVVERIENHAVPKLKSLLLLRE 1145
                  +         +  +S    V+  +A+ IK  ++    +E     +L+++++  E
Sbjct: 492  RLEAEQRASQGVRAVLEALESGLPGVYGPVAELIKVKEKYETALEAALGNRLQAVVVENE 551

Query: 1146 QFTTLIMQI--------VGFI---TETTARVGEVDGNSATVQEKID--KYDK-------- 1184
            +     ++           F+        R  + D     +    D   +D         
Sbjct: 552  EVAKKAIEFLKENKAGRATFLPLDRIKPLRSLKSDAAPGFLGLASDLIDFDPKYEPAVRF 611

Query: 1185 ------IIADIQEYEAILATASDK-------GDQLSSDGTISDRNEITEQLQSRKQQLHN 1231
                  ++ D+++   +      K       GD +   G+I+  +       ++K++L  
Sbjct: 612  VLGDTLVVDDLEQARRLARKLRIKYRIVTLDGDLVEPSGSITGGSRNKRSSLAQKRELKE 671

Query: 1232 LRKTVEKLRQQ---NEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHNLRKTVEKLR 1288
            L + + +L  Q    E+     +     LE+++E L           +L  L + +E+L+
Sbjct: 672  LEEELAELEAQLEKLEEELKSLKNELRSLEDLLEEL---------RRQLEELERQLEELK 722

Query: 1289 QQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVELDKQRLELATKEV 1348
            ++      E E+L + LEE+ E L   E  ++ L   L E+      L++   +L     
Sbjct: 723  RELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKE--- 779

Query: 1349 QPYVQRYETLKKNIAEAIDKFETQANEHE-AYKQAYNEAYDWLRKAKLGAQANADCHGEQ 1407
               ++  E  ++ + E +++ E +  E E        E     ++ +   Q   +    +
Sbjct: 780  --EIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEEL---E 834

Query: 1408 QTTKDKADKIKQITKSLPEGQKLIDKTVALKNNVLESTGPIGKENINQEINQITLDWTNL 1467
            +  ++  +K+ ++ + L E +K ++    LK  + E      KE +  E+ ++  +   L
Sbjct: 835  EEIEELEEKLDELEEELEELEKELE---ELKEELEELE--AEKEELEDELKELEEEKEEL 889

Query: 1468 QNTLQDIDKHHAKCLSLWNDFLSSKNTLEKWIEGFQKKIESEKDIGDCTNLDDLEKYKAL 1527
            +  L++++   A+              LE        K+E  +        +  E+Y+  
Sbjct: 890  EEELRELESELAELKEEIEKLRERLEELE-------AKLERLEVELPELEEELEEEYEDT 942

Query: 1528 LQEGLHKEADNKKVLIDSF---NKGIIGMTELESEEVKDLIQVGVHGLQVELNALLQNIE 1584
            L+  L +E +  +  I++    N   I   E   E  +            EL +  +++E
Sbjct: 943  LETELEREIERLEEEIEALGPVNLRAIEEYEEVEERYE------------ELKSQREDLE 990

Query: 1585 AEINKVANAAQERKA 1599
                K+    +E   
Sbjct: 991  EAKEKLLEVIEELDK 1005



 Score = 43.5 bits (103), Expect = 0.001
 Identities = 66/347 (19%), Positives = 146/347 (42%), Gaps = 20/347 (5%)

Query: 1832 LKRAQKKIEAEKDIGDC--TNLDDLEKYKALLQEVISHNSDMETLNDKCEALMELAAHSP 1889
            L+R ++ +E  +D+ +     L+ LE+     +      +++  L    E  + LA    
Sbjct: 181  LERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELREL----ELALLLAKLKE 236

Query: 1890 IREETLRLQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECDEL 1949
            +R+E   L+   + L   ++ L  ++E+   +  E  ++ EE+   L +    + E  E 
Sbjct: 237  LRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEE 296

Query: 1950 EAGIKEEQKNIETQYLQWNSFRETLAQMTAWLDSVEKNIKQEIATPWTTTQELRSKLLKL 2009
               ++ E   +  +  +  +  E L +    L    + +K+E+    T  +EL   L +L
Sbjct: 297  IEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAEL 356

Query: 2010 KEILKKCEKGVGDHEAFVD-KYEECSQKLAQVEAKYAKLIEPETSYEELQKRQGQLTALL 2068
            +E  ++ E+ +      ++  +E   ++LA++EA+ A++       EEL++    L   L
Sbjct: 357  EEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRN---ELEELKREIESLEERL 413

Query: 2069 TEKVDMDVLLNACV-DLCDKVYESTSEPGHEPLRVQMEKLQQAVEALYDKITVTESLILS 2127
                +    L   + +L  ++ E  +E   E L  ++E+L++ +E L D++   E  +  
Sbjct: 414  ERLSERLEDLKEELKELEAELEELQTEL--EELNEELEELEEQLEELRDRLKELERELAE 471

Query: 2128 QVEAKYAKLIEPETSYEELQKRQGQLTALLAEKVDMDVLLNASVKVN 2174
              E             +EL   + +L  L AE+     +      + 
Sbjct: 472  LQEELQRL-------EKELSSLEARLDRLEAEQRASQGVRAVLEALE 511



 Score = 42.4 bits (100), Expect = 0.004
 Identities = 67/352 (19%), Positives = 148/352 (42%), Gaps = 30/352 (8%)

Query: 1575 ELNALLQNIEAEINK---VANAAQERKAMQDKISKLQSWLKQYIDYSGNKQALQSRLDKV 1631
             L  LL+ +E ++ K    A  A+  + ++ ++ +L+  L         ++ L+   +++
Sbjct: 190  RLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLA-KLKELRKELEELEEEL 248

Query: 1632 NEIQESFPEISTKLQTLSDHIENSSRKLPSRTNEAM--YRDLANLRYDFEKCVSSLSDVK 1689
            + ++E   E+  +L+     IE    +L     E      +L  L+ + E+    +S ++
Sbjct: 249  SRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLR 308

Query: 1690 QGLESRLVQWNEYETSLGKLITWLNDTEN-VLKNFTLQPTLNE-----KKQQRELKGKLS 1743
            + LE    +  E E  L +L   +   +  + +  TL   L +     ++ + EL+ KLS
Sbjct: 309  ERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLS 368

Query: 1744 SWCGYEESVATLLDWLKVTEKKLGDEM-ELKTTLDEKKAQLQVYRTIYQDATSHQQDLLQ 1802
                  E +  L + L+    +L  E+ E++  L+E K +++      +      + L  
Sbjct: 369  ---ALLEELEELFEALREELAELEAELAEIRNELEELKREIESLE---ERLERLSERLED 422

Query: 1803 LKDKIESLHQPSEQSKQQLATITTRHGNVLKRAQKKIEAEKDIGDCTNLDDLEKYKALLQ 1862
            LK++++ L    E+ + +L  +       L+  ++++E  +D      L +LE+  A LQ
Sbjct: 423  LKEELKELEAELEELQTELEELNEE----LEELEEQLEELRD-----RLKELERELAELQ 473

Query: 1863 EVISHNSDMETLNDKCEALMELAAHSPIREETLRLQALYAALVTSVQGLVSQ 1914
            E +        L+     L  L A     +    +     + +  V G V++
Sbjct: 474  EELQRLEKE--LSSLEARLDRLEAEQRASQGVRAVLEALESGLPGVYGPVAE 523



 Score = 42.0 bits (99), Expect = 0.004
 Identities = 47/333 (14%), Positives = 134/333 (40%), Gaps = 24/333 (7%)

Query: 916  IKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEGDVKT 975
            ++D ++ L ++ +    ++++  + L  L    ++ +  +E +E +L+ELE         
Sbjct: 700  LEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELE--------E 751

Query: 976  NNVLMLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTVNDSVSSMQKRYAQ 1035
                + + L     +L      L  +   +    E       +   + + +   ++R   
Sbjct: 752  ELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDA 811

Query: 1036 LSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISEALTVYQK 1095
            L R +E          Q ++ L+ ++EE +  L + E +L   +  + E++ E L   + 
Sbjct: 812  LERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEE-LEKELEELKEELEELEA 870

Query: 1096 LLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIENHAVPKLKSLLLLREQFTTLIMQI- 1154
              +E+         ++L  L +  + L+E +  +E+      + +  LRE+   L  ++ 
Sbjct: 871  EKEELE--------DELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLE 922

Query: 1155 VGFITETTARVGEVDGNSATVQEKIDKYDKII-ADIQEYEAILATASDKGDQLSSDGTIS 1213
               +          +    T++ ++++  + +  +I+    +   A ++ +++       
Sbjct: 923  RLEVELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRAIEEYEEVE-----E 977

Query: 1214 DRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKR 1246
               E+  Q +  ++    L + +E+L ++  +R
Sbjct: 978  RYEELKSQREDLEEAKEKLLEVIEELDKEKRER 1010



 Score = 40.5 bits (95), Expect = 0.011
 Identities = 59/317 (18%), Positives = 123/317 (38%), Gaps = 32/317 (10%)

Query: 1229 LHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHNLRKTVEKLR 1288
            L  L K +EKL +Q     AE  +   EL+  +  L               L +  E+L 
Sbjct: 195  LEELEKQLEKLERQ--AEKAERYQ---ELKAELRELELALLLAKLKELRKELEELEEELS 249

Query: 1289 QQNEKRAAESEKLGA---ELEEIIEALHARETAVKTLPVLLLEVTS--VDVELDKQRLEL 1343
            +  E+     E+L     E+EE+   L      ++ L   LLE+     ++E +   L  
Sbjct: 250  RLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRE 309

Query: 1344 ATKEVQPYVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNEAYDWLRKAKLGAQANADC 1403
              +E++  ++  E   + + E I+  + +  E E   +   +       A+L        
Sbjct: 310  RLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQ-----LLAELEEAKEE-- 362

Query: 1404 HGEQQTTKDKADKIKQITKSLPEGQKLIDKTVALKNNVLESTGPIGKENINQEINQITLD 1463
               ++      ++++++ ++L E    ++  +A   N L        E + +EI  +   
Sbjct: 363  --LEEKLSALLEELEELFEALREELAELEAELAEIRNEL--------EELKREIESLEER 412

Query: 1464 WTNLQNTLQDIDKHHAKCLSLWNDFLSSKNTLEKWIEGFQKKIESEKDIGDCTNLDDLEK 1523
               L   L+D+ +   +  +   +  +    L + +E  ++++E  +D      L +LE+
Sbjct: 413  LERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRD-----RLKELER 467

Query: 1524 YKALLQEGLHKEADNKK 1540
              A LQE L +      
Sbjct: 468  ELAELQEELQRLEKELS 484



 Score = 35.8 bits (83), Expect = 0.36
 Identities = 50/237 (21%), Positives = 103/237 (43%), Gaps = 42/237 (17%)

Query: 1924 EFLAKKEEVEAWLHKAHATVQECDELEAGIKEEQKNIETQYLQWNSFRETLAQMTAWLDS 1983
            ++  +KEE E  L +    ++  ++L   ++++ + +E Q  +   ++E  A++      
Sbjct: 169  KYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELR----- 223

Query: 1984 VEKNIKQEIATPWTTTQELRSKLLKLKEILKKCEKGVGDHEAFVDKYEECSQKLAQVEAK 2043
                 + E+A      +ELR +L +L+E L + E+ + + +  +++ E+      ++E  
Sbjct: 224  -----ELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEK------EIEEL 272

Query: 2044 YAKLIEPETSYEELQKRQGQLTALLTEKVDMDVLLNACVDLCDKVYESTSEPGHEPLRVQ 2103
             ++L E     EELQ+   +L   + E      LL           E   E     L  +
Sbjct: 273  KSELEELREELEELQEELLELKEEIEELEGEISLLR----------ERLEE-----LENE 317

Query: 2104 MEKLQQAVEALYDKITVTESLILSQVEAKYAKLIEPETSYEELQKRQGQLTALLAEK 2160
            +E+L++ +E L +KI   +            +L E ET  EEL++   +L     E 
Sbjct: 318  LEELEERLEELKEKIEALK-----------EELEERETLLEELEQLLAELEEAKEEL 363



 Score = 35.5 bits (82), Expect = 0.38
 Identities = 73/374 (19%), Positives = 156/374 (41%), Gaps = 47/374 (12%)

Query: 182 KRYEDLVDGILKTISQLEESLDIFQQFQQLQKAYQEDQKQL-WDKLSSLTER--ELKGKL 238
           +R EDL++ + K + +LE   +  +++Q+L+   +E +  L   KL  L +   EL+ +L
Sbjct: 189 ERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKELEELEEEL 248

Query: 239 SSWCGYEESVATLLDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRTIYQDATSHQQDLLQ 298
           S     EE +  L + L+  EK++    ELK+ L+E + +L+            Q++LL+
Sbjct: 249 SR---LEEELEELQEELEEAEKEIE---ELKSELEELREELEEL----------QEELLE 292

Query: 299 LKDKIESLHQPSEQSKQQLATITTRHGNVLKRAQKFTEAYEGIVSIHQAYTKAVLDTQEW 358
           LK++IE L       + +++ +  R   +    ++  E  E +    +A  + + + +  
Sbjct: 293 LKEEIEEL-------EGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETL 345

Query: 359 IDATYNAVNMWGDLTLERVSLHSNL-----ERLKNLEKELGRIGHKTVALKNNVLESTGP 413
           ++     +    +   E     S L     E  + L +EL  +  +   ++N        
Sbjct: 346 LEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNE------- 398

Query: 414 IGKENINQEINQITLDWTNLQNTLQVIDKHHAKCLSIWNDFLSSKNTLEKWIEGFQKKIE 473
              E + +EI  +      L   L+ + +   +  +   +  +    L + +E  ++++E
Sbjct: 399 --LEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLE 456

Query: 474 AEKDIGDCTNLDDLEKYKALLQEVISHNSDMETLNDKCEALMELAAHSPIREETLRLQAL 533
             +D      L +LE+  A LQE +        L+     L  L A     +    +   
Sbjct: 457 ELRD-----RLKELERELAELQEELQRLEKE--LSSLEARLDRLEAEQRASQGVRAVLEA 509

Query: 534 YAALVTSVQGLVSQ 547
             + +  V G V++
Sbjct: 510 LESGLPGVYGPVAE 523



 Score = 35.5 bits (82), Expect = 0.41
 Identities = 39/274 (14%), Positives = 110/274 (40%), Gaps = 16/274 (5%)

Query: 143 TLQEVLSHKRIIESLLDKSKSLPQINKDQALEKAIVSVNKRYEDLVDGILKTISQLEESL 202
            L+  L    +   L    +   ++ + +     +    +  ++ ++   K I +L+  L
Sbjct: 217 ELKAELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSEL 276

Query: 203 ----DIFQQFQQLQKAYQEDQKQLWDKLSSLTERELKGKLSSWCGYEESVATLLDWLKVT 258
               +  ++ Q+     +E+ ++L  ++S L ER L+   +     EE +  L + ++  
Sbjct: 277 EELREELEELQEELLELKEEIEELEGEISLLRER-LEELENELEELEERLEELKEKIEAL 335

Query: 259 EKKLGDEMELKTTLDEKKAQL-----QVYRTIYQDATSHQQDLLQLKDKIESLHQPSEQS 313
           +++L +   L   L++  A+L     ++   +       ++    L++++  L     + 
Sbjct: 336 KEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEI 395

Query: 314 KQQLATITTRHGNVLKRAQKFTEAYEGIVSIHQAYTKAVLDTQEWIDATYNAVNMWGDLT 373
           + +L  +     ++ +R ++ +E  E +    +     + + Q  ++     +    +  
Sbjct: 396 RNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEE-- 453

Query: 374 LERVSLHSNLERLKNLEKELGRIGHKTVALKNNV 407
                L    +RLK LE+EL  +  +   L+  +
Sbjct: 454 ----QLEELRDRLKELERELAELQEELQRLEKEL 483



 Score = 35.1 bits (81), Expect = 0.56
 Identities = 69/358 (19%), Positives = 149/358 (41%), Gaps = 17/358 (4%)

Query: 1773 KTTLDEKKAQLQVYRTIYQDATSHQQDLLQLKDKIESLHQPSEQSKQQLATITTRHGNVL 1832
            K  L E + +L       +      +    LK+++ SL    E+ ++QL  +  +   + 
Sbjct: 666  KRELKELEEELAELEAQLEKLEEELK---SLKNELRSLEDLLEELRRQLEELERQLEELK 722

Query: 1833 KRAQKKIEAEKDI-GDCTNLDDLEKYKALLQEVISHNSDMETLNDKCEALMELAAHSPIR 1891
            +      E  + +      L+  E+ + L +E+      +E L ++ E+L E  A   ++
Sbjct: 723  RELAALEEELEQLQSRLEELE--EELEELEEELEELQERLEELEEELESLEEALAK--LK 778

Query: 1892 EETLRLQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECDELEA 1951
            EE   L+    AL   ++ L  ++E+          + E +E    +    ++E +E   
Sbjct: 779  EEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIE 838

Query: 1952 GIKEEQKNIETQYLQWNSFRETLAQMTAWLDSVEKNIKQEIATPWTTTQELRSKLLKLKE 2011
             ++E+   +E +  +     E L +    L++ ++ ++ E+       +EL  +L +L+ 
Sbjct: 839  ELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELES 898

Query: 2012 ILKKCEKGVGDHEAFVDKYEECSQKLAQVEAKYAKLIEPETSYEE--LQKRQGQLTALLT 2069
             L + ++   + E   ++ EE   KL ++E +  +L E      E  L+    +    L 
Sbjct: 899  ELAELKE---EIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLE 955

Query: 2070 EKVDMDVLLNACVDLCDKVYESTSEPGHEPLRVQMEKLQQAVEALYDKITVTESLILS 2127
            E+++    +N       + YE   E  +E L+ Q E L++A E L + I   +     
Sbjct: 956  EEIEALGPVNLRAI---EEYEEVEER-YEELKSQREDLEEAKEKLLEVIEELDKEKRE 1009



 Score = 34.3 bits (79), Expect = 0.96
 Identities = 65/331 (19%), Positives = 139/331 (41%), Gaps = 42/331 (12%)

Query: 1855 EKYKALLQEVISHNSDMETLNDKCEALMEL------------AAHSPIREETLRLQALYA 1902
             + + L +E+ S  +++ +L D  E L                  + + EE  +LQ+   
Sbjct: 681  AQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLE 740

Query: 1903 ALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECDELEAGIKEEQKNIET 1962
             L   ++ L  ++E+      E   + E +E  L K    ++E +E    ++EE + +E 
Sbjct: 741  ELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEE 800

Query: 1963 QYLQWNSFRETLAQMTAWLDSVEKNIKQEIATPWTTTQELRSKLLKLKEILKKCEKGVGD 2022
            +  +     + L +    L+   + ++QEI       +EL  KL +L+E L++ EK    
Sbjct: 801  ELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEK---- 856

Query: 2023 HEAFVDKYEECSQKLAQVEAKYAKLIEPETSYEELQKRQGQLTALLTEKVDMDVLLNACV 2082
                  + EE  ++L ++EA+  +L   E   +EL++ + +L   L E       L + +
Sbjct: 857  ------ELEELKEELEELEAEKEEL---EDELKELEEEKEELEEELRE-------LESEL 900

Query: 2083 DLCDKVYESTSEPGHEPLRVQMEKLQQAVEALYDKITVTESLILSQVEAKYAKLIEPETS 2142
                +  E   E   E L  ++E+L+  +  L +++       L     +  + +E E  
Sbjct: 901  AELKEEIEKLRERLEE-LEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEEIE 959

Query: 2143 Y---------EELQKRQGQLTALLAEKVDMD 2164
                      EE ++ + +   L +++ D++
Sbjct: 960  ALGPVNLRAIEEYEEVEERYEELKSQREDLE 990



 Score = 32.0 bits (73), Expect = 4.7
 Identities = 63/352 (17%), Positives = 149/352 (42%), Gaps = 21/352 (5%)

Query: 175 KAIVSVNKRYEDLVDGILKTISQLEESLDIFQQFQQLQKAYQEDQKQLWDKLSSLTEREL 234
           ++ ++  +  ++L + + +  +QLE+  +  +  +   ++ ++  ++L  +L  L ER+L
Sbjct: 660 RSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEEL-ERQL 718

Query: 235 KGKLSSWCGYEESVATLLDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRTIYQDATSHQQ 294
           +         EE +  L   L+  E++L +  E    L E+  +L+      ++  S ++
Sbjct: 719 EELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELE------EELESLEE 772

Query: 295 DLLQLKDKIESLHQPSEQSKQQLATITTRHGNVLKRAQKFTEAYEGIVSIHQAYTKAVLD 354
            L +LK++IE L +  +  +++L  +        +R        E +    +   + + +
Sbjct: 773 ALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEE 832

Query: 355 TQEWIDATYNAVNMWGDLTLERVSLHSNLERLKN----LEKELGRIGHKTVALKNNVLES 410
            +E I+     ++   +L  E   L   LE LK     LE E   +  +   L+    E 
Sbjct: 833 LEEEIEELEEKLD---ELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEEL 889

Query: 411 TGPIGKENINQEINQITLDWTNLQNTLQVIDKHHAKCLSIWNDFLSS-----KNTLEKWI 465
              +    +  E+ ++  +   L+  L+ ++    +      +         ++TLE  +
Sbjct: 890 EEEL--RELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETEL 947

Query: 466 EGFQKKIEAEKDIGDCTNLDDLEKYKALLQEVISHNSDMETLNDKCEALMEL 517
           E   +++E E +     NL  +E+Y+ + +      S  E L +  E L+E+
Sbjct: 948 EREIERLEEEIEALGPVNLRAIEEYEEVEERYEELKSQREDLEEAKEKLLEV 999



 Score = 31.6 bits (72), Expect = 5.8
 Identities = 46/271 (16%), Positives = 111/271 (40%), Gaps = 45/271 (16%)

Query: 1893 ETLRLQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECDELEAG 1952
            E   L+   A L   ++ L  +++   ++        EE+   L +    ++E     A 
Sbjct: 668  ELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAA 727

Query: 1953 IKEEQKNIETQYLQWNSFRETLAQMTAWLDSVEKNIKQEIATPWTTTQELRSKLLKLKEI 2012
            ++EE + ++++                      + +++E+       +EL+ +L +L+E 
Sbjct: 728  LEEELEQLQSRL---------------------EELEEELEELEEELEELQERLEELEEE 766

Query: 2013 LKKCEKGVGDHEAFVDKYEECSQKLAQVEAKYAKLIEPETSYEELQKRQGQLTALLTEKV 2072
            L+  E+ +   +  +++ EE  Q L +   +       E   EE ++R   L      + 
Sbjct: 767  LESLEEALAKLKEEIEELEEKRQALQEELEEL------EEELEEAERRLDAL------ER 814

Query: 2073 DMDVLLNACVDLCDKVYESTSEPGHEPLRVQMEKLQQAVEALYDKITVTESLI---LSQV 2129
            +++ L      L  ++ E         L  ++E+L++ ++ L +++   E  +     ++
Sbjct: 815  ELESLEQRRERLEQEIEE---------LEEEIEELEEKLDELEEELEELEKELEELKEEL 865

Query: 2130 EAKYAKLIEPETSYEELQKRQGQLTALLAEK 2160
            E   A+  E E   +EL++ + +L   L E 
Sbjct: 866  EELEAEKEELEDELKELEEEKEELEEELREL 896


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
            recombination, and repair].
          Length = 908

 Score = 55.2 bits (133), Expect = 4e-07
 Identities = 81/512 (15%), Positives = 202/512 (39%), Gaps = 62/512 (12%)

Query: 1186 IADIQEYEAILATASDKGDQLSS-----DGTISDRNEITEQLQSRKQQLHNLRKTVEKLR 1240
            +  +++YE +     +   +  +     +G +S+  E  E L    ++     K +E+++
Sbjct: 163  LFGLEKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQ 222

Query: 1241 QQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEK 1300
            ++ E+   E E    E E + E    +E      +RL  +     +  +  E+   E E+
Sbjct: 223  EEQEEEELEQEIEALE-ERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELER 281

Query: 1301 LGAELEEIIEALHARETAVKTLPVLLLEVTSVDVELDK--QRLELATKEVQPYVQRYETL 1358
            L  ELEE IE L   E  ++ L   L  + ++  EL++  ++L+   + ++   ++ E L
Sbjct: 282  LLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKL 341

Query: 1359 KKNIAEAIDKFETQANEHEAYKQAYNEAYDWLRKAKLGAQANADCHGEQQTTKDKADKIK 1418
            +  + E  ++    A   E   +   E  + L K    A          +  K   + I+
Sbjct: 342  ESELEELAEEKNELAKLLEERLKELEERLEELEKELEKAL---------ERLKQLEEAIQ 392

Query: 1419 QITKSLPEGQKLIDKTVALKNNVLESTGPIGKENINQEINQITLDWTNLQ---------- 1468
            ++ + L E    +++       + +       E + +E+ ++  +   L+          
Sbjct: 393  ELKEELAELSAALEEIQEELEELEKE-----LEELERELEELEEEIKKLEEQINQLESKE 447

Query: 1469 NTLQDIDKHHAKCLSLWNDFL-----SSKNTLEKWIEGFQKKIESEKDIGD-CTNLDDLE 1522
              + ++     KC     +             E  +E  ++++  EK+  +    +++LE
Sbjct: 448  LMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSREKEEAELREEIEELE 507

Query: 1523 KYKALLQEGLHKEADNKKVLIDSFNKGIIGMTELESEEVKDLIQVGVHGLQVELNALLQN 1582
            K    L+E L +  + ++ L +              E+++ L           L   L+ 
Sbjct: 508  KELRELEEELIELLELEEALKEEL-----------EEKLEKLEN---------LLEELEE 547

Query: 1583 IEAEINKVANAAQERKAMQDKISKLQSWLKQYIDYSGNKQALQSRLDKVNEIQESFPEIS 1642
            ++ ++ ++    +E + ++D++ +L+  L++       K+ L+   +++ E+++   E+ 
Sbjct: 548  LKEKL-QLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELE 606

Query: 1643 ---TKLQTLSDHIENSSRKLPSRTNEAMYRDL 1671
               ++L+ L   +E S  +      E      
Sbjct: 607  ERLSQLEELLQSLELSEAENELEEAEEELESE 638



 Score = 45.9 bits (109), Expect = 3e-04
 Identities = 111/592 (18%), Positives = 240/592 (40%), Gaps = 45/592 (7%)

Query: 1507 ESEKDIGDCTNLDDLEKYKALLQEGLHKEADNKKVL---IDSFNKGIIGMTELESEEVKD 1563
            E ++ + +   L+  EK   LL+E + +     + L   +    + I  + E   EE+K+
Sbjct: 155  ERKEILDELFGLEKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKE 214

Query: 1564 LIQVGVHGLQVELNALLQNIEAEINKVANAAQERKAMQDKISKLQSWLKQYIDYSGNKQA 1623
            L ++     + E   L Q IEA   ++A   +E++ +++  ++L       +      +A
Sbjct: 215  LKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLEL------EA 268

Query: 1624 LQSRLDKVNEIQESFPEISTKLQTLSDHIENSSRKLPSRTN--------EAMYRDLANLR 1675
            L+ R +++ E++    E+  K++ L +                      E +   L +L 
Sbjct: 269  LKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLE 328

Query: 1676 YDFEKCVSSLSDVKQGLESRLVQWNEYETSLGKLITWLNDTENVLKNFT---LQPTLNEK 1732
               EK    L  ++  LE    + NE    L + +  L +    L+      L+     +
Sbjct: 329  ERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLE 388

Query: 1733 KQQRELKGKLSSWCGYEESVATLLDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRTIYQD 1792
            +  +ELK +L       E  A L +  +  E+   +  EL+  L+E + +++        
Sbjct: 389  EAIQELKEEL------AELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQ 442

Query: 1793 ATSHQQDLLQLKDKIE-----SLHQPSEQSKQQLATITTRHGNVLKRAQKKIEAEKDIGD 1847
              S +  + +L    E         P E  K+ L          L+  ++++  EK+  +
Sbjct: 443  LESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELE----LEELEEELSREKEEAE 498

Query: 1848 -CTNLDDLEK-YKALLQEVISHNSDMETLND----KCEALMELAAHSPIREETLRLQALY 1901
                +++LEK  + L +E+I      E L +    K E L  L       +E L+LQ L 
Sbjct: 499  LREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLK 558

Query: 1902 AALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECDELEAGIKEEQKNIE 1961
               +  ++  + ++++ L +      +KEE+E    +     ++  ELE  + + ++ ++
Sbjct: 559  E-ELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQ 617

Query: 1962 TQYL--QWNSFRETLAQMTAWLDSVEKNIKQEIATPWTTTQELRSKLLKLKEILKKCEKG 2019
            +  L    N   E   ++ + L+ +    + E        +EL  K+ +L+  +++  + 
Sbjct: 618  SLELSEAENELEEAEEELESELEKLNLQAELEELL-QAALEELEEKVEELEAEIRRELQR 676

Query: 2020 VGDHEAFVDKYEECSQKLAQVEAKYAKLIEPETSYEELQKRQGQLTALLTEK 2071
            + + E   +K EE  Q   ++E    +L E      E+++   +L +   E 
Sbjct: 677  IENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQLIEELESRKAEL 728



 Score = 44.4 bits (105), Expect = 8e-04
 Identities = 65/511 (12%), Positives = 188/511 (36%), Gaps = 48/511 (9%)

Query: 835  EQIQHHEKLLQDYSSYQNAVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQHHEKLLQ 894
            E++   E+  +     +  + + E   L+            +   ++ LE+++   + L+
Sbjct: 239  ERLAELEEEKERLEELKARLLEIESLELEALKIREEE----LRELERLLEELEEKIERLE 294

Query: 895  DVILSPDNKLAPCFSAEEGSPIKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVD 954
            ++    +         E    + ++++ L ++ K+  +R++   + L +L    +E   +
Sbjct: 295  ELEREIEELEE---ELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEE 351

Query: 955  VENVEAKLKELEVAVE-------------GDVKTNNVLMLQELLNKYAQLNEEAGELNMM 1001
               +   L+E    +E              +        +QEL  + A+L+    E+   
Sbjct: 352  KNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEE 411

Query: 1002 ISNVSIATEHMNLNDADRLTVNDSVSSMQKRYAQLSRTIEDRISTVNNKLQ--------- 1052
            +  +            +   +   +  +++   +L   I    S      +         
Sbjct: 412  LEELE----------KELEELERELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCP 461

Query: 1053 --SVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISEALTVYQKLLDEINSWKNSQSDE 1110
                +  +   +E   L +    +L   +S + ++ +E     ++L  E+   +    + 
Sbjct: 462  VCGQELPEEHEKELLELYELELEELEEELS-REKEEAELREEIEELEKELRELEEELIEL 520

Query: 1111 DLVHLADSIKPLDEVVERIENHAVPKLKSLLLLREQFTTLIMQI-VGFITETTARVGEVD 1169
                     + L E +E         L+ L  L+E+     ++  +  + +    + E+ 
Sbjct: 521  LE-----LEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELL 575

Query: 1170 GNSATVQEKIDKYDKIIADIQEYEAILATASDKGDQLSSDGTISDRNEITEQLQSRKQQL 1229
                 ++ + ++ +++   ++E +  L    ++  QL       + +E   +L+  +++L
Sbjct: 576  EELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEEL 635

Query: 1230 HNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHNLRKTVEKLRQ 1289
             +  + +    +  E   A  E+L  ++EE+   +      + +  +L    + +E+L +
Sbjct: 636  ESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEE 695

Query: 1290 QNEKRAAESEKLGAELEEIIEALHARETAVK 1320
            + E+   E E+L  +L EI + +   E+   
Sbjct: 696  ELEQLREELEELLKKLGEIEQLIEELESRKA 726


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily
            archaeal type.  SMC (structural maintenance of
            chromosomes) proteins bind DNA and act in organizing and
            segregating chromosomes for partition. SMC proteins are
            found in bacteria, archaea, and eukaryotes. It is found
            in a single copy and is homodimeric in prokaryotes, but
            six paralogs (excluded from this family) are found in
            eukarotes, where SMC proteins are heterodimeric. This
            family represents the SMC protein of archaea and a few
            bacteria (Aquifex, Synechocystis, etc); the SMC of other
            bacteria is described by TIGR02168. The N- and C-terminal
            domains of this protein are well conserved, but the
            central hinge region is skewed in composition and highly
            divergent [Cellular processes, Cell division, DNA
            metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 49.7 bits (119), Expect = 2e-05
 Identities = 70/366 (19%), Positives = 151/366 (41%), Gaps = 49/366 (13%)

Query: 1022 VNDSVSSMQKRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVST 1081
            + + +  +++  + L   +    + ++   Q +     K+ E    ++K   QL +    
Sbjct: 679  LRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGE----IEKEIEQLEQEEEK 734

Query: 1082 QAEDISEALTVYQKLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIENHAVPKLKSLL 1141
              E + E       L  EI + K      +L  L   I+ L+E + ++E  A+  L++ L
Sbjct: 735  LKERLEELEEDLSSLEQEIENVK-----SELKELEARIEELEEDLHKLEE-ALNDLEARL 788

Query: 1142 LLREQFTTLIMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASD 1201
                                 +R+ E+    + ++E++ + +  + +I   E  L   + 
Sbjct: 789  S-------------------HSRIPEIQAELSKLEEEVSRIEARLREI---EQKLNRLTL 826

Query: 1202 KGDQLSSD--GTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEE 1259
            + + L  +       R ++ EQ++S ++++ NL    E+L ++ E+  A    L + L +
Sbjct: 827  EKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGD 886

Query: 1260 I---IEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARE 1316
            +    + L A+   +    ++  L   +EK R++  +  A+ E L  EL EI +     E
Sbjct: 887  LKKERDELEAQLREL--ERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDE 944

Query: 1317 TAVKTLPVLLLEVTSVDVELDKQRLELATKEVQPY----VQRYETLKKNIAEAIDKFETQ 1372
               +    L       DV+ + QR+E   + ++P     +Q YE + K + E  +K    
Sbjct: 945  EIPEEELSL------EDVQAELQRVEEEIRALEPVNMLAIQEYEEVLKRLDELKEKRAKL 998

Query: 1373 ANEHEA 1378
              E +A
Sbjct: 999  EEERKA 1004



 Score = 47.4 bits (113), Expect = 1e-04
 Identities = 50/300 (16%), Positives = 113/300 (37%), Gaps = 18/300 (6%)

Query: 1032 RYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISEALT 1091
            RY  L +   +        L+  + L+ + E  +  L   E +L + ++ +  ++ + L 
Sbjct: 212  RYQALLKEKREYEGYE--LLKEKEALERQKEAIERQLASLEEELEK-LTEEISELEKRLE 268

Query: 1092 VYQKLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIENHAVPKLKSLLLLREQFTTLI 1151
              ++LL+E+N       +E+ + + + I  L+  +  +E     K + L    E+   L 
Sbjct: 269  EIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLE 328

Query: 1152 MQI-------VGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASDKGD 1204
             +I            E        D  +    E  ++ + + A+++E +   A   D+  
Sbjct: 329  AEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELK 388

Query: 1205 QLSS--DGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIE 1262
                  +    + NE+  +L   +++L  L + +  L        A+  +L  E E+   
Sbjct: 389  DYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKAL 448

Query: 1263 ALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTL 1322
             +  +E       +L  L   + K  Q+      E +++  EL ++   L   E   +  
Sbjct: 449  EIKKQEW------KLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARAS 502



 Score = 45.4 bits (108), Expect = 4e-04
 Identities = 59/377 (15%), Positives = 150/377 (39%), Gaps = 62/377 (16%)

Query: 916  IKDKIDRLEKETKAQSDRIQDALKTLNQ-----LIGHRKEFEVDVENVEAKLKELEVAVE 970
             + +++RL +E + +++R Q  LK   +     L+  ++  E   E +E +L  LE  +E
Sbjct: 196  KRQQLERLRRE-REKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELE 254

Query: 971  GDVKTNNVLMLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTVNDSVSSMQ 1030
                      L E ++   +L +   E+  ++    +  +  +L + ++L V + +  ++
Sbjct: 255  K---------LTEEIS---ELEKRLEEIEQLLE--ELNKKIKDLGEEEQLRVKEKIGELE 300

Query: 1031 KRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISEAL 1090
               A L R+I +       K + ++  + ++ + +A + K  +++        E++   +
Sbjct: 301  AEIASLERSIAE-------KERELEDAEERLAKLEAEIDKLLAEI--------EELEREI 345

Query: 1091 TVYQKLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIENHAVPKLKSLLLLREQFTTL 1150
               +K  D+               L +    L E +E +        K     R++    
Sbjct: 346  EEERKRRDK---------------LTEEYAELKEELEDLRAELEEVDKEFAETRDEL--- 387

Query: 1151 IMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASDKGDQLSSDG 1210
                   + +    + E+      +QE++ +  + +AD+    A +A    K ++L    
Sbjct: 388  -KDYREKLEKLKREINELKRELDRLQEELQRLSEELADL---NAAIAGIEAKINEL---- 439

Query: 1211 TISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETA 1270
               ++ +   +++ ++ +L  L   + K  Q+      E +++  EL ++   L   E  
Sbjct: 440  -EEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQ 498

Query: 1271 VPSSSRLHNLRKTVEKL 1287
              +S       + VE++
Sbjct: 499  ARASEERVRGGRAVEEV 515



 Score = 43.5 bits (103), Expect = 0.001
 Identities = 64/329 (19%), Positives = 132/329 (40%), Gaps = 39/329 (11%)

Query: 1557 ESEEVKDLIQVGVHGLQVELNALLQNIEAEINKVANAAQERKAMQDKISKLQSWLKQYID 1616
            E + +++ ++    GL+ EL++L   +    N++   +QE      KI +++  +     
Sbjct: 675  ELQRLRERLE----GLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEI----- 725

Query: 1617 YSGNKQALQSRLDKVNEIQESFPEISTKLQTLSDHIENSSRKLPSRTNE--AMYRDLANL 1674
                 + L+   +K+ E  E   E+   L +L   IEN   +L         +  DL  L
Sbjct: 726  -----EQLEQEEEKLKERLE---ELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKL 777

Query: 1675 RYDFEKCVSSLSDVKQGLESRLVQWNEYETSLGKLITWLNDTENVLKNFTLQPTLNEKK- 1733
                    + LS     +     + ++ E  + ++   L + E  L   TL+    EK+ 
Sbjct: 778  EEALNDLEARLSH--SRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEI 835

Query: 1734 -----QQRELKGKLSSWCGYEESVATLLDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRT 1788
                 Q+ +LK ++ S     E++    + L+   ++L   +     L+ +   L+    
Sbjct: 836  QELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALR---DLESRLGDLK---- 888

Query: 1789 IYQDATSHQQDLLQLKDKIESLHQPSEQSKQQLATITTRHGNVLKRAQKKIEAEK--DIG 1846
              ++    +  L +L+ KIE L    E+ +++L+ +  +   + +    +IE  K  D  
Sbjct: 889  --KERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEE-LSEIEDPKGEDEE 945

Query: 1847 DCTNLDDLEKYKALLQEVISHNSDMETLN 1875
                   LE  +A LQ V      +E +N
Sbjct: 946  IPEEELSLEDVQAELQRVEEEIRALEPVN 974



 Score = 42.4 bits (100), Expect = 0.004
 Identities = 52/256 (20%), Positives = 101/256 (39%), Gaps = 35/256 (13%)

Query: 1175 VQEKIDKYDKIIADIQEYEAILATASDKGDQLSSDGTIS------DRNEITEQLQSRKQQ 1228
            ++E+++   + ++ +Q     +    D+  Q  SD +        +  ++ ++ +  K++
Sbjct: 679  LRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKER 738

Query: 1229 LHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAV------PSSSRLHNLRK 1282
            L  L + +  L Q+ E   +E ++L A +EE+ E LH  E A+       S SR+  ++ 
Sbjct: 739  LEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQA 798

Query: 1283 TVEKL---RQQNEKRAAE--------------SEKLGAELEEIIEALHARETAVKTLPVL 1325
             + KL     + E R  E               EK   EL+E    L  +  +++     
Sbjct: 799  ELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKE--- 855

Query: 1326 LLEVTSVDVELDKQRLELATKEVQPYVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNE 1385
             +E  +   E  ++ LE     ++    R   LKK   E   +      + E  +    +
Sbjct: 856  -IENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEK 914

Query: 1386 AYDWL--RKAKLGAQA 1399
                L   KAKL A  
Sbjct: 915  KRKRLSELKAKLEALE 930



 Score = 41.2 bits (97), Expect = 0.007
 Identities = 50/282 (17%), Positives = 102/282 (36%), Gaps = 48/282 (17%)

Query: 1177 EKIDKYDKIIADIQEYEAILATASDKGDQLSSDGTISDRNEITEQLQSRKQQLHNLRKTV 1236
            EK ++Y  ++ + +EYE        +  +   +        + E+L+   +++  L K +
Sbjct: 208  EKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRL 267

Query: 1237 EKLRQQNEKRAAESEKLGAE------------------LEEIIEALH-----ARETAVPS 1273
            E++ Q  E+   + + LG E                  LE  I         A E     
Sbjct: 268  EEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKL 327

Query: 1274 SSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVD 1333
             + +  L   +E+L ++ E+     +KL  E  E            + L  L  E+  VD
Sbjct: 328  EAEIDKLLAEIEELEREIEEERKRRDKLTEEYAE----------LKEELEDLRAELEEVD 377

Query: 1334 VELDKQRLELAT--KEVQPYVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNEAYDWLR 1391
             E  + R EL    ++++   +    LK+ +    ++ +  + E      A        +
Sbjct: 378  KEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEA--K 435

Query: 1392 KAKLGAQANADCHGEQQTTKDKADKIKQITKSLPEGQKLIDK 1433
              +L           ++  +DKA +IK+    L +    + K
Sbjct: 436  INEL-----------EEEKEDKALEIKKQEWKLEQLAADLSK 466



 Score = 39.3 bits (92), Expect = 0.033
 Identities = 62/336 (18%), Positives = 135/336 (40%), Gaps = 47/336 (13%)

Query: 1733 KQQRELKGKLSSWCGYEESVATLLDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRTIYQD 1792
             ++R++  +++    ++      L+ L+  E+ +     L   +DEK+ QL+  R   + 
Sbjct: 153  VERRKIIDEIAGVAEFDRKKEKALEELEEVEENIE---RLDLIIDEKRQQLERLRREREK 209

Query: 1793 ATSHQQDLLQLKDKIESLHQPSEQSKQQLATITTRHGNVLKRAQKKIEAEKDIGDCTNLD 1852
            A  +Q  LL+ K + E                  +    L+R ++ IE +        L 
Sbjct: 210  AERYQ-ALLKEKREYE-------------GYELLKEKEALERQKEAIERQ--------LA 247

Query: 1853 DLEKYKALLQEVISHNSDMETLNDKCEALMELAAHSPIREETLRLQALYAALVTSVQGLV 1912
             LE+    L E IS     + L +  + L EL     I++     Q      +  ++  +
Sbjct: 248  SLEEELEKLTEEISELE--KRLEEIEQLLEELNKK--IKDLGEEEQLRVKEKIGELEAEI 303

Query: 1913 SQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECDELEAGIKEEQKNIETQYLQWNSFRE 1972
            + +E++++       K+ E+E     A   + + +     +  E + +E +  +    R+
Sbjct: 304  ASLERSIA------EKERELE----DAEERLAKLEAEIDKLLAEIEELEREIEEERKRRD 353

Query: 1973 TLAQMTAWLDSVEKNIKQEIATPWTTTQELRSKLLKLKEILKKCEKGVGDHEAFVDKYEE 2032
             L +  A L    ++++ E+        E R +L   +E L+K ++ + + +  +D+ +E
Sbjct: 354  KLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQE 413

Query: 2033 CSQKL--------AQVEAKYAKLIEPETSYEELQKR 2060
              Q+L        A +    AK+ E E   E+    
Sbjct: 414  ELQRLSEELADLNAAIAGIEAKINELEEEKEDKALE 449



 Score = 35.8 bits (83), Expect = 0.31
 Identities = 49/307 (15%), Positives = 119/307 (38%), Gaps = 38/307 (12%)

Query: 718  LQWLDNIRSDMKTLLERSKHIVDDIEKKKDDLK----PIQKEAQQLSEWDSSIVPKVSDL 773
            L+ L    S +++ L R ++ +D++ ++  D       I+KE +QL + +  +  ++ +L
Sbjct: 683  LEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEEL 742

Query: 774  ETRWAAVK---SAWDAKHSDLSSEIAQYSSYQNAVQDAEKWLLQISFQLMAHNSLYINNR 830
            E   ++++       ++  +L + I +     + +++A              N L     
Sbjct: 743  EEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEAL-------------NDLEARLS 789

Query: 831  QQTLEQIQHH-EKLLQDYSSYQNAVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQHH 889
               + +IQ    KL ++ S  +  +++ E+ L +++ +        I   Q+    ++  
Sbjct: 790  HSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKE-IQELQEQRIDLKEQ 848

Query: 890  EKLLQDVILSPDNKLAPCFS------------AEEGSPIKDKIDRLEKETKAQSDRIQDA 937
             K ++  I + + K                        +K + D LE + +    +I++ 
Sbjct: 849  IKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEEL 908

Query: 938  LKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEGDVKTNNVLMLQELLNKYAQLNEEA-- 995
               + +      E +  +E +E +L E+E     D +     +  E +    Q  EE   
Sbjct: 909  EAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEEIR 968

Query: 996  --GELNM 1000
                +NM
Sbjct: 969  ALEPVNM 975



 Score = 35.0 bits (81), Expect = 0.60
 Identities = 63/294 (21%), Positives = 130/294 (44%), Gaps = 42/294 (14%)

Query: 1791 QDATSHQQDLLQLKDKIESLHQPSEQSKQQLATITTRHGNVLKRAQKKIEAEKDIGDCTN 1850
            Q+    ++ L +L++ + SL Q  E  K +L  +  R    ++  ++ +   ++      
Sbjct: 730  QEEEKLKERLEELEEDLSSLEQEIENVKSELKELEAR----IEELEEDLHKLEE-----A 780

Query: 1851 LDDLEKYKA--LLQEVISHNSDMETLNDKCEALMELAAHSPIREETLRLQALYAALVTSV 1908
            L+DLE   +   + E+ +  S +E    + EA +       I ++  RL      L   +
Sbjct: 781  LNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLR-----EIEQKLNRLTLEKEYLEKEI 835

Query: 1909 QGLVSQV------EKNLSDQTEFL-AKKEEVEAWLHKAHATVQECDELEAGIKEE----- 1956
            Q L  Q        K++  + E L  KKEE+E  L +  A +++ +     +K+E     
Sbjct: 836  QELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELE 895

Query: 1957 ------QKNIETQYLQWNSFRETLAQMTAWLDSVEKNIKQEIATPWTTTQELRSKLLKLK 2010
                  ++ IE    Q    R+ L+++ A L+++E+ +  EI  P    +E+  + L L+
Sbjct: 896  AQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELS-EIEDPKGEDEEIPEEELSLE 954

Query: 2011 EILKKCEKGVGDHEAF-------VDKYEECSQKLAQVEAKYAKLIEPETSYEEL 2057
            ++  + ++   +  A        + +YEE  ++L +++ K AKL E   +  E 
Sbjct: 955  DVQAELQRVEEEIRALEPVNMLAIQEYEEVLKRLDELKEKRAKLEEERKAILER 1008



 Score = 32.7 bits (75), Expect = 2.7
 Identities = 58/337 (17%), Positives = 116/337 (34%), Gaps = 55/337 (16%)

Query: 1729 LNEKKQQRELKGKLSSWCGYEESVATLLDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRT 1788
            L ++K++ E    L      E     +   L   E++L     L   + E + +L+    
Sbjct: 216  LLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEK---LTEEISELEKRLEEIEQ 272

Query: 1789 IYQDATSHQQDL-----LQLKDKIESLHQPSEQSKQQLATITTRHGNVLKRAQKKIEAEK 1843
            + ++     +DL     L++K+KI  L       ++ +A          +R  +  E   
Sbjct: 273  LLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIA--------EKERELEDAEERL 324

Query: 1844 DIGDCTNLDDLEKYKALLQEVISHNSDMETLNDKCEALMELAAHSPIREETL--RLQALY 1901
               +        +   LL E+     ++E    + + L E  A      E L   L+ + 
Sbjct: 325  AKLE-------AEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVD 377

Query: 1902 AALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECDELEAGIKEEQKNIE 1961
                 +   L     + L        +  E++  L +    +Q   E  A +      IE
Sbjct: 378  KEFAETRDEL-KDYREKLEKLKR---EINELKRELDRLQEELQRLSEELADLNAAIAGIE 433

Query: 1962 TQYLQWNSFRETLAQMTAWLDSVEKNIKQEIATPWTTTQELRSKLLKLKEILKKCEKGVG 2021
             +  +              L+  +++   EI        +   KL +L   L K E+ + 
Sbjct: 434  AKINE--------------LEEEKEDKALEIK-------KQEWKLEQLAADLSKYEQELY 472

Query: 2022 DHEAFVDKYE----ECSQKLAQVEAKYAKLIEPETSY 2054
            D +   D+ E    +  ++LA+ EA+ A+  E     
Sbjct: 473  DLKEEYDRVEKELSKLQRELAEAEAQ-ARASEERVRG 508



 Score = 32.7 bits (75), Expect = 2.9
 Identities = 56/299 (18%), Positives = 118/299 (39%), Gaps = 32/299 (10%)

Query: 912  EGSPIKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEG 971
            EG  +  + + LE++ +A   ++    + L +L     E E  +E +E  L+EL   ++ 
Sbjct: 224  EGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIK- 282

Query: 972  DVKTNNVLMLQELLNKYAQLNEEAGELNMMISNV--SIATEHMNLNDADRLTVNDSV--S 1027
                           +  ++ E+ GEL   I+++  SIA +   L DA+           
Sbjct: 283  ----------DLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEID 332

Query: 1028 SMQKRYAQLSRTIEDRISTVNNKLQSVKR--LQHKVEEAKALLQKAESQLARPVSTQAED 1085
             +     +L R IE+         +  +R  L  +  E K  L+   ++L   V  +  +
Sbjct: 333  KLLAEIEELEREIEE---------ERKRRDKLTEEYAELKEELEDLRAELEE-VDKEFAE 382

Query: 1086 ISEALTVYQKLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIENHAVPKLKSLLLLRE 1145
              + L  Y++ L+++    N    E L  L + ++ L E +  +          +  L E
Sbjct: 383  TRDELKDYREKLEKLKREINELKRE-LDRLQEELQRLSEELADLNAAIAGIEAKINELEE 441

Query: 1146 QFTTL---IMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQ-EYEAILATAS 1200
            +       I +    + +  A + + +     ++E+ D+ +K ++ +Q E     A A 
Sbjct: 442  EKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQAR 500



 Score = 32.0 bits (73), Expect = 5.0
 Identities = 42/219 (19%), Positives = 88/219 (40%), Gaps = 19/219 (8%)

Query: 1495 LEKWIEGFQKKIESEKDIGDCTNLDDLEKYKALLQEGLHKEADNKKVLIDSFNKGIIGMT 1554
            LE+ +   +++IE+ K       L +LE     L+E LHK  +    L    +   I   
Sbjct: 742  LEEDLSSLEQEIENVKS-----ELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEI 796

Query: 1555 ELESEEVKDLIQVGVHGLQVELNALLQNIEAEINKV-ANAAQERKAMQDKISKLQSWLKQ 1613
            + E  ++++ +          + A L+ IE ++N++        K +Q+   +     +Q
Sbjct: 797  QAELSKLEEEVS--------RIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQ 848

Query: 1614 YIDYSGNKQALQSRLDKVNEIQESFPEISTKLQTLSDHIENSSRKLPSRTNE--AMYRDL 1671
                    + L  + +++ E  E   E+   L+ L   + +  ++      +   + R +
Sbjct: 849  IKSIEKEIENLNGKKEELEEELE---ELEAALRDLESRLGDLKKERDELEAQLRELERKI 905

Query: 1672 ANLRYDFEKCVSSLSDVKQGLESRLVQWNEYETSLGKLI 1710
              L    EK    LS++K  LE+   + +E E   G+  
Sbjct: 906  EELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDE 944


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 46.2 bits (110), Expect = 2e-04
 Identities = 96/503 (19%), Positives = 204/503 (40%), Gaps = 74/503 (14%)

Query: 1217 EITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHA---------- 1266
            E  E+L+   + L   ++ +E+  ++ E+R  E +K   ELEE ++ L            
Sbjct: 238  EEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIK 297

Query: 1267 -RETAVPSSSRLHNLRKTVEKLRQQN---EKRAAESEKLGAELEEI---IEALHARETAV 1319
              E        L  + K + +L ++    E+R  E E+    LEE+   ++ L  R   +
Sbjct: 298  LSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEEL 357

Query: 1320 KTLPVLLLEVTSVDVELDKQRLELATKEVQPYVQRYETL---KKNIAEAIDKFETQANEH 1376
            +    L  E  +   EL++ +  L     +   +  E L   K+ I E I K   +  E 
Sbjct: 358  EERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGEL 417

Query: 1377 EAYKQAYNEAYDWLRKAKLGAQANADCHGEQQTTKDKADKIKQITKSLPEGQKLIDKTVA 1436
            +   +   +A + L+KAK          G + T + + + +++ T  L            
Sbjct: 418  KKEIKELKKAIEELKKAK----GKCPVCGRELTEEHRKELLEEYTAEL------------ 461

Query: 1437 LKNNVLESTGPIGKENINQEINQITLDWTNLQNTLQDIDKHHAKCLSLWNDFLSSKNTLE 1496
                          + I +E+ +I      L+  L++++    K L   ++ +  K   E
Sbjct: 462  --------------KRIEKELKEIEEKERKLRKELRELE----KVLKKESELIKLKELAE 503

Query: 1497 KWIEGFQKKIESEKDIGDCTNLDDLEKYKALLQEGLHKEADNKKVLIDSFNKGIIGMTEL 1556
            + ++  ++K++         NL++LEK KA   E L ++    K  I S  K +  + EL
Sbjct: 504  Q-LKELEEKLKK-------YNLEELEK-KAEEYEKLKEKLIKLKGEIKSLKKELEKLEEL 554

Query: 1557 ESEEVKDLIQVGVHGLQVELNALLQNIEAEINKVANAAQERKAMQDKISKLQSWLKQYID 1616
            + +     ++  +  L+ EL  LL+ +E    +     +ER      + +L+ +  +Y++
Sbjct: 555  KKK--LAELEKKLDELEEELAELLKELEELGFESVEELEER------LKELEPFYNEYLE 606

Query: 1617 YSGNKQALQSRLDKVNEIQESFPEISTKLQTLSDHIENSSRK---LPSRTNEAMYRDLAN 1673
                ++ L+    ++ +++E   +   +L      +E   ++   L  + +E  Y +L  
Sbjct: 607  LKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELRE 666

Query: 1674 LRYDFEKCVSSLSDVKQGLESRL 1696
               +  + ++ L    + LE R 
Sbjct: 667  EYLELSRELAGLRAELEELEKRR 689



 Score = 40.0 bits (94), Expect = 0.017
 Identities = 99/494 (20%), Positives = 212/494 (42%), Gaps = 39/494 (7%)

Query: 919  KIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEGDVKTNNV 978
            +I  L+    A    + + +K + + I   ++F    EN+E  +KE E  +E  ++    
Sbjct: 153  QILGLDDYENAY-KNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLRE--- 208

Query: 979  LMLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTVNDSVSSMQKRYAQLSR 1038
              + E+ ++  +L EE  +L   +  +    E +   + +  ++  S   ++++  +L  
Sbjct: 209  --INEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEE 266

Query: 1039 TIED---RISTVNNKLQSVKRLQHKVEE---AKALLQKAESQLARPVSTQAEDISEALTV 1092
             IE+    I  +  K++ +K L+ K EE        ++   +L   +  +   + E +  
Sbjct: 267  RIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELRE-IEKRLSRLEEEING 325

Query: 1093 YQKLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIE--NHAVPKLKSLLLLREQFTTL 1150
             ++ + E+   K  + +E    L +  K L+E+ ER E    A  K + L  L+++ T L
Sbjct: 326  IEERIKELEE-KEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGL 384

Query: 1151 -IMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASDK----GDQ 1205
               ++   + E      E++   + +  +I +  K I ++++    L  A  K    G +
Sbjct: 385  TPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRE 444

Query: 1206 LSSDGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQ---------NEKRAAESEKLGAE 1256
            L+ +       E T +L+  +++L  + +   KLR++          E    + ++L  +
Sbjct: 445  LTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQ 504

Query: 1257 LEEIIEAL--HARETAVPSSSRLHNLRKTVEKLRQQN---EKRAAESEKLGAELEEIIEA 1311
            L+E+ E L  +  E     +     L++ + KL+ +    +K   + E+L  +L E+ + 
Sbjct: 505  LKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKK 564

Query: 1312 LHARETAVKTLPVLLLEVTSVDVELDKQRLELATKEVQPYVQRYETLKKNIAEAIDKFET 1371
            L   E  +  L   L E+    VE  ++RL    KE++P+   Y  LK    E   + + 
Sbjct: 565  LDELEEELAELLKELEELGFESVEELEERL----KELEPFYNEYLELKDAEKELEREEKE 620

Query: 1372 QANEHEAYKQAYNE 1385
                 E   +A+ E
Sbjct: 621  LKKLEEELDKAFEE 634



 Score = 35.8 bits (83), Expect = 0.29
 Identities = 92/499 (18%), Positives = 205/499 (41%), Gaps = 73/499 (14%)

Query: 1582 NIEAEINKVANAAQERKAMQDKISKLQSWLKQYIDY----SGNKQALQSRLDKVNEIQES 1637
             ++   N   N  +  K ++ +I +L+ ++K+  +        ++ L+  L ++NEI   
Sbjct: 156  GLDDYENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSE 215

Query: 1638 FPEISTKLQTLSDHIENSSRKLPSRTNEAMYRDLANLRYDFEKCVSSLSDVKQGLESRLV 1697
             PE+  +L+ L   ++    +L     E      +      E     L +  + LE R+ 
Sbjct: 216  LPELREELEKLEKEVK-ELEELKEEIEELEKELES-----LEGSKRKLEEKIRELEERIE 269

Query: 1698 QWNEYETSLGKLITWLNDTENVLKNF-TLQPTLNE-KKQQRELKGKLSSWCGYEESVATL 1755
            +  +    L + +  L + +   + +  L     E   + RE++ +LS        +   
Sbjct: 270  ELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEER 329

Query: 1756 LDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRTIYQDATSHQQDLLQLKDKIESLHQPSE 1815
            +  L+  E++L +  +    L+++  +L+    +Y++A + +++L +LK ++  L    E
Sbjct: 330  IKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGL--TPE 387

Query: 1816 QSKQQLATITTRHGNVLKRAQKKIEAEKDIGDCTNLDDLEKYKALLQEVISHNSDMETLN 1875
            + +++L          L++A+++IE              E+   +   +     +++ L 
Sbjct: 388  KLEKELEE--------LEKAKEEIE--------------EEISKITARIGELKKEIKELK 425

Query: 1876 DKCEALMELAAHSPI--REETLRLQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVE 1933
               E L +     P+  RE                  L  +  K L          EE  
Sbjct: 426  KAIEELKKAKGKCPVCGRE------------------LTEEHRKEL---------LEEYT 458

Query: 1934 AWLHKAHATVQECDELEAGIKEEQKNIETQYLQWNSF---RETLAQMTAWLDSVEKNIKQ 1990
            A L +    ++E +E E  +++E + +E    + +     +E   Q+    + ++K   +
Sbjct: 459  AELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLE 518

Query: 1991 EIATPWTTTQELRSKLLKLKEILKKCEKGVGDHEAFVDKYEECSQKLAQVEAKYAKLI-- 2048
            E+       ++L+ KL+KLK  +K  +K +   E    K  E  +KL ++E + A+L+  
Sbjct: 519  ELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKE 578

Query: 2049 ---EPETSYEELQKRQGQL 2064
                   S EEL++R  +L
Sbjct: 579  LEELGFESVEELEERLKEL 597



 Score = 33.1 bits (76), Expect = 2.1
 Identities = 76/424 (17%), Positives = 172/424 (40%), Gaps = 41/424 (9%)

Query: 911  EEGSPIKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFE-VDVENVEAKLKELEVAV 969
            ++   ++ +++ LE+  +      ++A     +L   +K    +  E +E +L+ELE A 
Sbjct: 345  KKLKELEKRLEELEERHEL----YEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKA- 399

Query: 970  EGDVKTNNVLMLQELLNKYAQLNEEAGELNMMISNVSIA-----------TEHMNLNDAD 1018
                K      + ++  +  +L +E  EL   I  +  A           TE       +
Sbjct: 400  ----KEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLE 455

Query: 1019 RLTVN-DSVSSMQKRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLAR 1077
              T     +    K   +  R +   +  +   L+    L  K++E    L++ E +L  
Sbjct: 456  EYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELI-KLKELAEQLKELEEKLK- 513

Query: 1078 PVSTQAEDISEALTVYQKLLDEINSWKNSQS--DEDLVHLADSIKPLDEVVERIENHAVP 1135
                  E++ +    Y+KL +++   K       ++L  L +  K L E+ ++++     
Sbjct: 514  --KYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDE-LEE 570

Query: 1136 KLKSLL--LLREQFTTL--IMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQE 1191
            +L  LL  L    F ++  + + +  +        E+      ++ +  +  K+  ++ +
Sbjct: 571  ELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDK 630

Query: 1192 YEAILATASDKGDQLSSDGTISDRNEITEQLQSRKQQLHNLRKTVEKLR---QQNEKR-- 1246
                LA    + ++L  +    ++    E+ +  +++   L + +  LR   ++ EKR  
Sbjct: 631  AFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRRE 690

Query: 1247 --AAESEKLGAELEEIIEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRA-AESEKLGA 1303
                  EKL  ELEE  +A    E    +  R+  LR+ V+K +   ++RA ++  ++ +
Sbjct: 691  EIKKTLEKLKEELEEREKAKKELEKLEKALERVEELREKVKKYKALLKERALSKVGEIAS 750

Query: 1304 ELEE 1307
            E+ E
Sbjct: 751  EIFE 754


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 45.1 bits (106), Expect = 5e-04
 Identities = 48/209 (22%), Positives = 89/209 (42%), Gaps = 9/209 (4%)

Query: 1216 NEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSS 1275
            +E  ++ +  K++    +K  E  ++ +E + AE  K   E ++  EA  A E       
Sbjct: 1486 DEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEK 1545

Query: 1276 RLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVE 1335
            +  +  K  E+L++  EK+ AE  K     E+   AL   E A K     + EV  +  E
Sbjct: 1546 KKADELKKAEELKKAEEKKKAEEAK--KAEEDKNMALRKAEEAKKAEEARIEEVMKLYEE 1603

Query: 1336 LDKQRLELATKEVQP-----YVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNEAYDWL 1390
              K + E A K  +       +++ E  KK + +   K   +  + E  K+A  E  + +
Sbjct: 1604 EKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKA--EEENKI 1661

Query: 1391 RKAKLGAQANADCHGEQQTTKDKADKIKQ 1419
            + A+   +A  D    ++  K + D+ K 
Sbjct: 1662 KAAEEAKKAEEDKKKAEEAKKAEEDEKKA 1690



 Score = 35.1 bits (80), Expect = 0.51
 Identities = 42/203 (20%), Positives = 76/203 (37%), Gaps = 28/203 (13%)

Query: 1227 QQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHNLRKTVEK 1286
            ++   ++K  E  +   E + AE E+   E+ +  EA  A      ++ +    RK  ++
Sbjct: 1224 KKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKA-DE 1282

Query: 1287 LRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVELDKQRLELATK 1346
            L++  EK+ A+  K   E ++  EA    E                    + ++ + A K
Sbjct: 1283 LKKAEEKKKADEAKKAEEKKKADEAKKKAE--------------------EAKKADEAKK 1322

Query: 1347 EVQPYVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNEAYDWLRKAKLGAQANADCHGE 1406
            + +   ++ +  KK   EA    E    E EA         D    A+  A+A      E
Sbjct: 1323 KAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAA-------DEAEAAEEKAEAAEKKKEE 1375

Query: 1407 QQTTKDKADKIKQITKSLPEGQK 1429
             +   D A K  +  K   E +K
Sbjct: 1376 AKKKADAAKKKAEEKKKADEAKK 1398



 Score = 34.0 bits (77), Expect = 1.4
 Identities = 43/206 (20%), Positives = 75/206 (36%), Gaps = 7/206 (3%)

Query: 1214 DRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPS 1273
            D  +  E  ++   +   + +  E  R+  E R AE  K      +  E   A E     
Sbjct: 1138 DARKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRKAE 1197

Query: 1274 SSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVD 1333
             +R     +  E+ R+  E R AE  K    +++  EA    E A K       E  + +
Sbjct: 1198 DARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKA-----EEERNNE 1252

Query: 1334 VELDKQRLELATKEVQPYVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNEAYDWLRKA 1393
                 +   +A    +    + E  +K  A+ + K E +    EA K    +  D  +K 
Sbjct: 1253 EIRKFEEARMAHFARRQAAIKAEEARK--ADELKKAEEKKKADEAKKAEEKKKADEAKKK 1310

Query: 1394 KLGAQANADCHGEQQTTKDKADKIKQ 1419
               A+   +   + +  K KAD  K+
Sbjct: 1311 AEEAKKADEAKKKAEEAKKKADAAKK 1336



 Score = 31.6 bits (71), Expect = 6.8
 Identities = 25/105 (23%), Positives = 46/105 (43%), Gaps = 4/105 (3%)

Query: 1216 NEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAEL----EEIIEALHARETAV 1271
            +E  ++ +  K+     +K  E  + +  K+ AE  K   E     EE  +A  A++ A 
Sbjct: 1434 DEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAE 1493

Query: 1272 PSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARE 1316
             +  +    +K  E  ++ +E + AE  K   E ++  EA  A E
Sbjct: 1494 EAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADE 1538


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 44.5 bits (106), Expect = 6e-04
 Identities = 49/286 (17%), Positives = 107/286 (37%), Gaps = 36/286 (12%)

Query: 1048 NNKLQSVKRLQHKVEEAKALLQKAESQL----ARPVSTQAEDISEALTVYQKLLDEINSW 1103
            N +L+ ++ L  K+ EA   L+    +L            + + E +   ++ L++I   
Sbjct: 42   NERLRKLRSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKI--- 98

Query: 1104 KNSQSDEDLVHLADSIKPLDEVVERIENH--AVPKLKSL--LLLREQFTTLIMQIVGFIT 1159
                 ++++  L + I  L+  ++ +E     +    +    L        +   VG + 
Sbjct: 99   -----EKEIKELEEEISELENEIKELEQEIERLEPWGNFDLDLSLLLGFKYVSVFVGTVP 153

Query: 1160 ETTARVGEVDGNSATVQEKIDKYDKIIA-------DIQEYEAILATASDKGDQLSSDGTI 1212
            E      +++ +   V+        +            E E  L     +  +L  +GT 
Sbjct: 154  EDKLEELKLESDVENVEYISTDKGYVYVVVVVLKELSDEVEEELKKLGFERLELEEEGTP 213

Query: 1213 SDR-NEITEQLQSRKQQLHNLRKTVEKLRQQNEKR-AAESEKLGAELEEIIEALHARETA 1270
            S+   EI E+L+  +++  +L + +++L ++  +   A  E L  ELE         +T 
Sbjct: 214  SELIREIKEELEEIEKERESLLEELKELAKKYLEELLALYEYLEIELERAEALSKFLKTD 273

Query: 1271 --------VPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEI 1308
                    VP   R+  L++ ++K          E  +   E EE+
Sbjct: 274  KTFAIEGWVP-EDRVKKLKELIDKA--TGGSAYVEFVEPDEEEEEV 316


>gnl|CDD|238103 cd00176, SPEC, Spectrin repeats, found in several proteins involved
           in cytoskeletal structure; family members include
           spectrin, alpha-actinin and dystrophin; the spectrin
           repeat forms a three helix bundle with the second helix
           interrupted by proline in some sequences; the repeats
           are independent folding units; tandem repeats are found
           in differing numbers and arrange in an antiparallel
           manner to form dimers; the repeats are defined by a
           characteristic tryptophan (W) residue in helix A and a
           leucine (L) at the carboxyl end of helix C and separated
           by a linker of 5 residues; two copies of the repeat are
           present here.
          Length = 213

 Score = 42.8 bits (101), Expect = 8e-04
 Identities = 40/200 (20%), Positives = 83/200 (41%), Gaps = 4/200 (2%)

Query: 451 WNDFLSSKNTLEKWIEGFQKKIEAEKDIGDCTNLDDL-EKYKALLQEVISHNSDMETLND 509
              FL   + LE W+   ++ + +     D  +++ L +K++AL  E+ +H   +E LN+
Sbjct: 2   LQQFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNE 61

Query: 510 KCEALMELAAH--SPIREETLRLQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEA 567
             E L+E        I+E    L   +  L    +    ++E+ L  Q +F    +++E 
Sbjct: 62  LGEQLIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQ-QFFRDADDLEQ 120

Query: 568 WLHKAHATVQECAGNGSKNVLKERLDTVNMVAERIPEGQHLMSVLQDTFTKALDTTPSDQ 627
           WL +  A +           ++E L     + E +   +  +  L +   + L+    D 
Sbjct: 121 WLEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAEELLEEGHPDA 180

Query: 628 QDSLREAMTVLRDSWDRLNR 647
            + + E +  L + W+ L  
Sbjct: 181 DEEIEEKLEELNERWEELLE 200



 Score = 42.4 bits (100), Expect = 0.001
 Identities = 24/193 (12%), Positives = 78/193 (40%), Gaps = 9/193 (4%)

Query: 699 KWQQYVNDELEWENHSQNTLQWLDNIRS--DMKTLLERSKHIVDDIEKKKDDLKPIQKEA 756
           ++ +  ++   W +  +  L   D       ++ LL++ + +  ++   ++ ++ + +  
Sbjct: 4   QFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNELG 63

Query: 757 QQLSEWDS----SIVPKVSDLETRWAAVKSAWDAKHSDLSSEIAQYSSYQNAVQDAEKWL 812
           +QL E        I  ++ +L  RW  ++   + +   L  E      +     D E+WL
Sbjct: 64  EQLIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRL-EEALDLQQFFRDADDLEQWL 122

Query: 813 LQISFQLMAHNSLYINNRQQTLEQIQHHEKLLQDYSSYQNAVQDAEKWLLQISFQLMAHN 872
            +    L + +     + +   E ++ H++L ++  +++  ++   +   ++  +     
Sbjct: 123 EEKEAALASED--LGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAEELLEEGHPDA 180

Query: 873 SLYINNRQQTLEQ 885
              I  + + L +
Sbjct: 181 DEEIEEKLEELNE 193



 Score = 36.7 bits (85), Expect = 0.086
 Identities = 39/212 (18%), Positives = 88/212 (41%), Gaps = 6/212 (2%)

Query: 346 QAYTKAVLDTQEWIDATYNAVNMWGDLTLERVSLHSNLERLKNLEKELGRIGHK--TVAL 403
           Q + +   + + W+      ++          S+ + L++ + LE EL     +   +  
Sbjct: 3   QQFLRDADELEAWLSEKEELLSSTDYGDDLE-SVEALLKKHEALEAELAAHEERVEALNE 61

Query: 404 KNNVLESTGPIGKENINQEINQITLDWTNLQNTLQVIDKHHAKCLSIWNDFLSSKNTLEK 463
               L   G    E I + + ++   W  L+   +   +   + L +   F    + LE+
Sbjct: 62  LGEQLIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQ-FFRDADDLEQ 120

Query: 464 WIEGFQKKIEAEKDIGDCTNLDDL-EKYKALLQEVISHNSDMETLNDKCEALMELAAHSP 522
           W+E  +  + +E    D  ++++L +K+K L +E+ +H   +++LN+  E L+E      
Sbjct: 121 WLEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAEELLEEGHPDA 180

Query: 523 IREETLRLQALYAALVTSVQGLVSQVEKNLSD 554
             E   +L+ L       +  L  + +K L +
Sbjct: 181 DEEIEEKLEELNERW-EELLELAEERQKKLEE 211



 Score = 33.6 bits (77), Expect = 0.85
 Identities = 32/161 (19%), Positives = 63/161 (39%), Gaps = 20/161 (12%)

Query: 1442 LESTGPIGKENINQEINQITLDWTNLQNTLQDIDKHHAKCLSLWNDFLSSKNTLEKWIEG 1501
            L   G    E I + + ++   W  L+   ++  +   + L L   F    + LE+W+E 
Sbjct: 66   LIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQ-FFRDADDLEQWLE- 123

Query: 1502 FQKKIESEKDIGDCTNLDDLEKYKALLQ--EGLHKEADNKKVLIDSFNKGIIGMTELESE 1559
                 E E  +       DLE  + LL+  + L +E +  +  + S N           E
Sbjct: 124  -----EKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLN-----------E 167

Query: 1560 EVKDLIQVGVHGLQVELNALLQNIEAEINKVANAAQERKAM 1600
              ++L++ G      E+   L+ +     ++   A+ER+  
Sbjct: 168  LAEELLEEGHPDADEEIEEKLEELNERWEELLELAEERQKK 208



 Score = 32.8 bits (75), Expect = 1.5
 Identities = 17/99 (17%), Positives = 49/99 (49%), Gaps = 2/99 (2%)

Query: 1146 QFTTLIMQIVGFITETTARVG--EVDGNSATVQEKIDKYDKIIADIQEYEAILATASDKG 1203
            QF      +  ++ E  A +   ++  +  +V+E + K+ ++  +++ +E  L + ++  
Sbjct: 110  QFFRDADDLEQWLEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNELA 169

Query: 1204 DQLSSDGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQ 1242
            ++L  +G      EI E+L+   ++   L +  E+ +++
Sbjct: 170  EELLEEGHPDADEEIEEKLEELNERWEELLELAEERQKK 208



 Score = 32.0 bits (73), Expect = 2.4
 Identities = 37/179 (20%), Positives = 78/179 (43%), Gaps = 14/179 (7%)

Query: 141 KTTLQEVLSHKRIIESLLDKSKSLPQINKDQA--LEKAIVSVNKRYEDLVDGILKTISQL 198
           +    E+ +H+  +E+L +  + L +     A  +++ +  +N+R+E+L +   +   +L
Sbjct: 43  EALEAELAAHEERVEALNELGEQLIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRL 102

Query: 199 EESLDIFQQFQQLQKAYQEDQKQLWDKLSSLTERELKGKLSSWCGYEESVATLLDWLKVT 258
           EE+LD+ Q F+                   L E+E            ESV  LL   K  
Sbjct: 103 EEALDLQQFFRDADDL-----------EQWLEEKEAALASEDLGKDLESVEELLKKHKEL 151

Query: 259 EKKLGDEMELKTTLDEKKAQLQVYRTIYQDATSHQQDLLQLKDKIESLHQPSEQSKQQL 317
           E++L        +L+E   +L +           ++ L +L ++ E L + +E+ +++L
Sbjct: 152 EEELEAHEPRLKSLNELAEEL-LEEGHPDADEEIEEKLEELNERWEELLELAEERQKKL 209



 Score = 31.3 bits (71), Expect = 4.3
 Identities = 25/108 (23%), Positives = 50/108 (46%), Gaps = 4/108 (3%)

Query: 1832 LKRAQKKIEAEKDIGDCTNLDDL-EKYKALLQEVISHNSDMETLNDKCEALMELAAH--S 1888
            L   ++ + +     D  +++ L +K++AL  E+ +H   +E LN+  E L+E       
Sbjct: 16   LSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNELGEQLIEEGHPDAE 75

Query: 1889 PIREETLRLQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWL 1936
             I+E    L   +  L    +    ++E+ L  Q +F    +++E WL
Sbjct: 76   EIQERLEELNQRWEELRELAEERRQRLEEALDLQ-QFFRDADDLEQWL 122



 Score = 30.5 bits (69), Expect = 7.1
 Identities = 25/158 (15%), Positives = 65/158 (41%), Gaps = 1/158 (0%)

Query: 950  EFEVDVENVEAKLKELEVAVEGDVKTNNVLMLQELLNKYAQLNEEAGELNMMISNV-SIA 1008
            +F  D + +EA L E E  +      +++  ++ LL K+  L  E       +  +  + 
Sbjct: 4    QFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNELG 63

Query: 1009 TEHMNLNDADRLTVNDSVSSMQKRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALL 1068
             + +     D   + + +  + +R+ +L    E+R   +   L   +  +   +  + L 
Sbjct: 64   EQLIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQWLE 123

Query: 1069 QKAESQLARPVSTQAEDISEALTVYQKLLDEINSWKNS 1106
            +K  +  +  +    E + E L  +++L +E+ + +  
Sbjct: 124  EKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPR 161


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 41.6 bits (98), Expect = 0.005
 Identities = 108/525 (20%), Positives = 208/525 (39%), Gaps = 68/525 (12%)

Query: 926  ETKAQSDRIQDA--LKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEGDVKTNNVLMLQE 983
            + KAQ +  ++    + LN L     E + ++E  E + +E       +          E
Sbjct: 191  QLKAQIEEKEEKDLHERLNGLESELAELDEEIERYEEQ-REQARETRDE--------ADE 241

Query: 984  LLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTVNDSVSSMQKRYAQLSRTIEDR 1043
            +L ++ +  EE   L   I ++    E +   + +R  + + V  +++R  +L    +D 
Sbjct: 242  VLEEHEERREELETLEAEIEDL---RETIAETEREREELAEEVRDLRERLEELEEERDDL 298

Query: 1044 ISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISEALTVYQKL------L 1097
            ++         + ++ + EE    L+  + +L   +              + L      L
Sbjct: 299  LAEAGLDDADAEAVEARREE----LEDRDEELRDRLEECRVAAQAHNEEAESLREDADDL 354

Query: 1098 DEINSWKNSQSDEDLVHLADSIKPLDEVVERIENHAVPKLKSLLLLREQFTTLIMQIVGF 1157
            +E       ++ E    L ++ + +++  E IE            L E+   L  +  G 
Sbjct: 355  EERAEELREEAAELESELEEAREAVEDRREEIEE-----------LEEEIEEL-RERFGD 402

Query: 1158 ITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASD---KGDQLSSDGTISD 1214
                   +G  +     ++E+ D+   +     E EA L TA +   + + L   G    
Sbjct: 403  ---APVDLGNAEDFLEELREERDE---LREREAELEATLRTARERVEEAEALLEAG---- 452

Query: 1215 RNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSS 1274
              +  E  Q  +   H   +T+E+ R++ E+  AE E L  E+EE+ E L   E  V + 
Sbjct: 453  --KCPECGQPVEGSPH--VETIEEDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAE 508

Query: 1275 SRLHNL---RKTVEKLRQQNEKRAAESEKLGAELEEIIEALHA----RETAVKTLPVLLL 1327
             R+  L   R+ +E+L  +  +   E  +   EL E    L A    +  A         
Sbjct: 509  DRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAE 568

Query: 1328 EVTSVDVELDKQRLELATKEVQPYVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNEAY 1387
            E      EL+ +  EL  KE    ++R  TL   IA+A D+ E    + EA  +  +E  
Sbjct: 569  EAREEVAELNSKLAEL--KERIESLERIRTLLAAIADAEDEIERLREKREALAELNDERR 626

Query: 1388 DWL-----RKAKLGAQANADCHGEQQTTKDKADK-IKQITKSLPE 1426
            + L     RK +L A+ +     E +  K++A++ ++Q+ + L E
Sbjct: 627  ERLAEKRERKRELEAEFDEARIEEAREDKERAEEYLEQVEEKLDE 671



 Score = 36.2 bits (84), Expect = 0.28
 Identities = 50/233 (21%), Positives = 103/233 (44%), Gaps = 23/233 (9%)

Query: 911  EEGSPIKDKIDRLE--KETKAQSDRIQDALKTLNQLIG-HRKEFEVDVENVEAKLK---E 964
            EE   ++++++R E   E + + +R+++  + L +LI   R+  E   E  E   +   E
Sbjct: 489  EEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAE 548

Query: 965  LEVAVEGDVKTNNVLML--QELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTV 1022
            LE   E   +         +E   + A+LN +  EL   I ++    E +    A     
Sbjct: 549  LEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIESL----ERIRTLLAAIADA 604

Query: 1023 NDSVSSMQKRYAQLSRTIEDRISTVNNKLQSVKRLQHK-----VEEAKALLQKAESQLAR 1077
             D +  ++++   L+   ++R   +  K +  + L+ +     +EEA+   ++AE  L +
Sbjct: 605  EDEIERLREKREALAELNDERRERLAEKRERKRELEAEFDEARIEEAREDKERAEEYLEQ 664

Query: 1078 PVSTQAEDISEALTVYQKLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIE 1130
             V  + +++ E       L  EI + +N    E+L  L +  + L+  VE +E
Sbjct: 665  -VEEKLDELREERD---DLQAEIGAVENEL--EELEELRERREALENRVEALE 711



 Score = 33.5 bits (77), Expect = 1.6
 Identities = 70/400 (17%), Positives = 153/400 (38%), Gaps = 72/400 (18%)

Query: 917  KDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEGDVKTN 976
            +++I+ LE+E +   +R  DA   L       +E   + + +  +  ELE  +       
Sbjct: 383  REEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRT----- 437

Query: 977  NVLMLQELLNKYAQLNEEAG--------ELNMMISNVSIATEHMNLNDADRLTVNDSVSS 1028
                 +E + +   L E           E +  +  +    E +   +A+   + + V  
Sbjct: 438  ----ARERVEEAEALLEAGKCPECGQPVEGSPHVETIEEDRERVEELEAELEDLEEEVEE 493

Query: 1029 MQKRYAQLS--RTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDI 1086
            +++R  +       EDRI  +  + + ++ L   + E +  +++   +    +  +A ++
Sbjct: 494  VEERLERAEDLVEAEDRIERLEERREDLEEL---IAERRETIEEKRERAEE-LRERAAEL 549

Query: 1087 SEALTVYQKLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIENHAVPKLKSLLLLREQ 1146
                   ++   E       +++E    +A+    L E+ ERIE  ++ ++++LL     
Sbjct: 550  EAEAEEKREAAAE----AEEEAEEAREEVAELNSKLAELKERIE--SLERIRTLLAAIA- 602

Query: 1147 FTTLIMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASDKGDQL 1206
                         +    +  +      + E  D          E    LA   ++  +L
Sbjct: 603  -------------DAEDEIERLREKREALAELND----------ERRERLAEKRERKREL 639

Query: 1207 SSDGTISDRNEITEQLQSRKQQLHN-LRKTVEKLRQQNEKRAAESEKLGA------ELEE 1259
             ++    D   I E+ +  K++    L +  EKL +  E+R     ++GA      ELEE
Sbjct: 640  EAE---FDEARI-EEAREDKERAEEYLEQVEEKLDELREERDDLQAEIGAVENELEELEE 695

Query: 1260 IIEALHARETAVPSSSRLHNLRKTVE--------KLRQQN 1291
            + E   A E  V +   L++  + +E        +LRQ+N
Sbjct: 696  LRERREALENRVEALEALYDEAEELESMYGDLRAELRQRN 735


>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein.  This model
            represents a relatively rare but broadly distributed
            uncharacterized protein family, distributed in 1-2
            percent of bacterial genomes, all of which have outer
            membranes. In many of these genomes, it is part of a
            two-gene pair.
          Length = 555

 Score = 40.5 bits (95), Expect = 0.010
 Identities = 40/212 (18%), Positives = 80/212 (37%), Gaps = 29/212 (13%)

Query: 1212 ISDRNEITEQLQSR-KQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETA 1270
             S  + I EQ  S  K+    L      L+ ++ K    +E++   L+ + +    R+  
Sbjct: 127  ASAPSSIKEQKSSELKKVDSQLPDPRALLKGEDLKTVETAEEIEKSLKAMQQKWKKRKKD 186

Query: 1271 VPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEE-----------IIEALHARETAV 1319
            +P+   L   +K +E +++++ K   E +K+  E ++           I  A +  +   
Sbjct: 187  LPNKQDLEEYKKRLEAIKKKDIKNPLELQKIKEEFDKLKKEGKADKQKIKSAKNDLQNDK 246

Query: 1320 KTLPVLLLEVTSVDVELDKQRLE----------------LATKEVQPYVQRYETLKKNIA 1363
            K L   L E+     + D +RLE                L   E++ Y+Q++        
Sbjct: 247  KQLKADLAELKKA-PQNDLKRLENKYAIKSGDLKNFAVDLFGPEIRKYLQKFLKYYDQAE 305

Query: 1364 EAIDKFETQANEHEAYKQAYNEAYDWLRKAKL 1395
              ++K + +  E      A  E YD+      
Sbjct: 306  PLLNKSKKEPKEEAVETLANGEDYDFKDDNPY 337



 Score = 31.2 bits (71), Expect = 6.8
 Identities = 18/99 (18%), Positives = 39/99 (39%), Gaps = 8/99 (8%)

Query: 141 KTTLQEVLSHKRIIESLLDKSKSLPQINKDQALEKAIVSVNKRYEDLVDGILKTISQLEE 200
              L+ V + + I +SL    +   +  KD   ++ +    KR E +    +K   +L++
Sbjct: 157 GEDLKTVETAEEIEKSLKAMQQKWKKRKKDLPNKQDLEEYKKRLEAIKKKDIKNPLELQK 216

Query: 201 SLDIFQQ--------FQQLQKAYQEDQKQLWDKLSSLTE 231
             + F +         Q+++ A  + Q       + L E
Sbjct: 217 IKEEFDKLKKEGKADKQKIKSAKNDLQNDKKQLKADLAE 255


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 40.0 bits (94), Expect = 0.013
 Identities = 46/245 (18%), Positives = 108/245 (44%), Gaps = 29/245 (11%)

Query: 1034 AQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISEALTVY 1093
            +++ +  +D+I  +N ++Q++      +++      K   +         E+I+     Y
Sbjct: 166  SEMDKLNKDKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQR---KKNGENIARKQNKY 222

Query: 1094 QKLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIE--NHAVPKLKSLLLLREQFTTLI 1151
             +L++E  + K ++ +E    L + +  +++    +   N A  K+KS +   EQF  +I
Sbjct: 223  DELVEEAKTIK-AEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKI---EQFQKVI 278

Query: 1152 ------------MQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATA 1199
                         Q    I+E   R+ ++      +Q  ++K D  I +++E        
Sbjct: 279  KMYEKGGVCPTCTQQ---ISEGPDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQ 335

Query: 1200 SDKGDQLSSDGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEE 1259
            S K  +L +  + + ++ IT   +++K     ++  +E+L+ +    A E  KL  EL++
Sbjct: 336  SKKLLELKNKISTNKQSLITLVDKAKK-----VKAAIEELQAEFVDNAEELAKLQDELDK 390

Query: 1260 IIEAL 1264
            I++  
Sbjct: 391  IVKTK 395



 Score = 35.0 bits (81), Expect = 0.56
 Identities = 26/152 (17%), Positives = 52/152 (34%), Gaps = 12/152 (7%)

Query: 1570 HGLQVELNALLQNIEAEINKVANA----AQERKAMQDKISKLQSWLKQYIDYS---GNKQ 1622
                 EL   L N+  +I   + A          ++ KI + Q  +K Y          Q
Sbjct: 233  KAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQKVIKMYEKGGVCPTCTQ 292

Query: 1623 ALQSRLDKVNEIQESFPEISTKLQTLSDHIENSSRKL-----PSRTNEAMYRDLANLRYD 1677
             +    D++ +I++   E+   L+ L   I+     +      S+    +   ++  +  
Sbjct: 293  QISEGPDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQS 352

Query: 1678 FEKCVSSLSDVKQGLESRLVQWNEYETSLGKL 1709
                V     VK  +E    ++ +    L KL
Sbjct: 353  LITLVDKAKKVKAAIEELQAEFVDNAEELAKL 384



 Score = 33.4 bits (77), Expect = 1.4
 Identities = 34/211 (16%), Positives = 76/211 (36%), Gaps = 39/211 (18%)

Query: 917  KDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEGDVKTN 976
               I+   K+      R Q+    L +     K  + ++E +  +L  L +  + +  + 
Sbjct: 201  NKNIEEQRKKNGENIARKQNKYDELVEEA---KTIKAEIEELTDELLNLVM--DIEDPSA 255

Query: 977  NVLMLQELLNKYAQLNEE----AGELNMMISNVSIATEHMNLNDADRL--TVNDSVSSMQ 1030
                L +L    A++  +       + M        T    +++       + D +  +Q
Sbjct: 256  A---LNKLNTAAAKIKSKIEQFQKVIKMYEKGGVCPTCTQQISEGPDRITKIKDKLKELQ 312

Query: 1031 KRYAQLSRTIED-------------RISTVNNKL----QSVKRLQHKVEEAKALLQKAES 1073
                +L   I++             ++  + NK+    QS+  L  K ++ KA +++ ++
Sbjct: 313  HSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQA 372

Query: 1074 QLARPVSTQAEDISEALTVYQKLLDEINSWK 1104
            +          D +E L   Q  LD+I   K
Sbjct: 373  EFV--------DNAEELAKLQDELDKIVKTK 395



 Score = 31.9 bits (73), Expect = 5.1
 Identities = 40/205 (19%), Positives = 85/205 (41%), Gaps = 20/205 (9%)

Query: 1762 TEKKLGDEMELKTTLDEKKAQLQ-VYRTIYQDATSHQQDLLQLKDKIESLHQPSEQ---S 1817
            T  K  +E   K    E  A+ Q  Y  + ++A + + ++ +L D++ +L    E    +
Sbjct: 199  TYNKNIEEQRKKN--GENIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSAA 256

Query: 1818 KQQLATITTRHGNVLKRAQKKIEAEKDIGDC-TNLDDLEKYKALLQEVISHNSDMET--- 1873
              +L T   +  + +++ QK I+  +  G C T    + +    + ++     +++    
Sbjct: 257  LNKLNTAAAKIKSKIEQFQKVIKMYEKGGVCPTCTQQISEGPDRITKIKDKLKELQHSLE 316

Query: 1874 -LNDKCEALMELAAHSPIREETLRLQALYAALVTSVQGLVSQVEKNLSD-------QTEF 1925
             L+   + L E+       E++ +L  L   + T+ Q L++ V+K           Q EF
Sbjct: 317  KLDTAIDELEEIM--DEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEF 374

Query: 1926 LAKKEEVEAWLHKAHATVQECDELE 1950
            +   EE+     +    V+   EL 
Sbjct: 375  VDNAEELAKLQDELDKIVKTKSELV 399



 Score = 31.1 bits (71), Expect = 8.8
 Identities = 40/261 (15%), Positives = 104/261 (39%), Gaps = 51/261 (19%)

Query: 1065 KALLQKAESQLARPVSTQAEDISEALTVYQKLLDEINSWKNSQSDEDLVHLADSIKPLDE 1124
            K  +++   Q+   +  + + I + +  Y K ++E       ++ E++    +     DE
Sbjct: 173  KDKIRELNQQIQT-LDMKIDHIQQQIKTYNKNIEEQRK----KNGENIARKQNKY---DE 224

Query: 1125 VVERIENHAVPKLKSLLLLREQFTTLIMQIVGFITETTARVGEVDGNSATVQEKIDKYDK 1184
            +VE  +       K+ +   E+ T  ++ +V  I + +A + +++  +A ++ KI+++ K
Sbjct: 225  LVEEAKTI-----KAEI---EELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQK 276

Query: 1185 II-------------ADIQEYEAILATASDKGDQLSSDGTISDRNEITEQLQSRKQQLHN 1231
            +I               I E    +    DK  +L    ++   +   ++L+    + + 
Sbjct: 277  VIKMYEKGGVCPTCTQQISEGPDRITKIKDKLKELQ--HSLEKLDTAIDELEEIMDEFNE 334

Query: 1232 LRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHNLRKTVEKLRQQN 1291
              K + +L+ +                  I         +   ++   ++  +E+L+ + 
Sbjct: 335  QSKKLLELKNK------------------ISTNKQSLITLVDKAK--KVKAAIEELQAEF 374

Query: 1292 EKRAAESEKLGAELEEIIEAL 1312
               A E  KL  EL++I++  
Sbjct: 375  VDNAEELAKLQDELDKIVKTK 395


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 40.1 bits (94), Expect = 0.015
 Identities = 26/170 (15%), Positives = 62/170 (36%), Gaps = 9/170 (5%)

Query: 72   EKDLDTLKNTT---DVEAKNSLLQKLLTEKDQAGHKLTYLTSSG---EKLYLDTAAKGRE 125
            +K    LK  T       +  L ++ L E       ++ L       E   ++       
Sbjct: 858  KKRFSLLKKETIYLQSAQRVELAERQLQELKIDVKSISSLKLVNLELESEIIELKKSLSS 917

Query: 126  VVRQQLRALRDSKCSKTTLQEVLSHKRIIESLLDKSKSLPQINKDQALEKAIVSVNKRYE 185
             + + L    +       L+++L++  + E    +   LP++NK   +E  +   ++ YE
Sbjct: 918  DLIENLEFKTELI---ARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETSEEYE 974

Query: 186  DLVDGILKTISQLEESLDIFQQFQQLQKAYQEDQKQLWDKLSSLTERELK 235
            DL+      + +  ++    + F++      +    L +    L E  ++
Sbjct: 975  DLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQLKELPVE 1024



 Score = 31.2 bits (71), Expect = 7.8
 Identities = 82/468 (17%), Positives = 161/468 (34%), Gaps = 75/468 (16%)

Query: 729  KTLLERSKHIVDDIEK----KKDDLKPIQKEAQQLSEWDSSIVPKVSDLE---------- 774
            +T+  +S   V+  E+     K D+K I        E +S I+     L           
Sbjct: 867  ETIYLQSAQRVELAERQLQELKIDVKSISSLKLVNLELESEIIELKKSLSSDLIENLEFK 926

Query: 775  ----TRWAAVKSAWDAKHSDLSSEIAQYSSYQNAVQDAEKWLLQISFQLMA---HNSLYI 827
                 R   + +  D +    S E  +       + + E  L + S +       +++ +
Sbjct: 927  TELIARLKKLLNNIDLE-EGPSIEYVKLPELNK-LHEVESKLKETSEEYEDLLKKSTILV 984

Query: 828  NNRQQTLEQIQHHEKLLQDYSSYQNAVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQ 887
                +   ++++ +K L + S    A+Q++ K L ++  ++    S        +     
Sbjct: 985  REGNKANSELKNFKKELAELSKQYGALQESTKQLKELPVEVAELQSAS--KIISSESTEL 1042

Query: 888  HHEKLLQDVILSPDNKLAPCFSAEEGSPIKDKIDRLEKETKAQSDRIQDALKTLNQLIGH 947
               K LQ +                   +  + ++L+   KA   R +++L    QL   
Sbjct: 1043 SILKPLQKLK----------------GLLLLENNQLQARYKALKLRRENSLLDDKQL--- 1083

Query: 948  RKEFEVDVENVEAKLKELEVAVEGDVKTNNVLMLQELLN-------KYAQLNEEAGELNM 1000
                   +E+ E  LK + V  + +V   N++    +L        K   L E +  L+ 
Sbjct: 1084 -----YQLESTENLLKTINVK-DLEVTNRNLVKPANVLQFIVAQMIKLNLLQEISKFLSQ 1137

Query: 1001 MISNVSIATEHMNLNDAD-----RLTVNDSVSSMQKRYAQL-SRTIEDRISTVNNKLQSV 1054
            +++ +    + +++   +          +++ S     A    R  +  +    +KL S 
Sbjct: 1138 LVNTLEPVFQKLSVLQLELDGLFWEANLEALPSPPPFAALSEKRLYQSALYDEKSKL-SS 1196

Query: 1055 KRLQHKVEEAKALLQKAESQLARPVSTQAEDISEA--LTVYQKLLDEINSWKN------S 1106
              +     E  AL  K  S   R    + + ISE    T Y   L   N+         S
Sbjct: 1197 SEVNDLKNELIALFSKIFSGWPRGDKLK-KLISEGWVPTEYSTSLKGFNNLNKKFDTPAS 1255

Query: 1107 QSDEDLVHLADSIKPLDEVVERIENHAVPKLKSLL--LLREQFTTLIM 1152
             S+E L+ L +SI  L    +  E      + SLL  +    F  L  
Sbjct: 1256 MSNEKLLSLLNSIDNLLSSYKLEEEVLPATINSLLQYINVGLFNALRT 1303


>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein.  Members of this
            protein family belong to a conserved gene four-gene
            neighborhood found sporadically in a phylogenetically
            broad range of bacteria: Nocardia farcinica,
            Symbiobacterium thermophilum, and Streptomyces
            avermitilis (Actinobacteria), Geobacillus kaustophilus
            (Firmicutes), Azoarcus sp. EbN1 and Ralstonia
            solanacearum (Betaproteobacteria). Proteins in this
            family average over 1400 amino acids in length
            [Hypothetical proteins, Conserved].
          Length = 1353

 Score = 39.8 bits (93), Expect = 0.018
 Identities = 60/300 (20%), Positives = 110/300 (36%), Gaps = 49/300 (16%)

Query: 1108 SDEDLVHLADSIKPLDEVVERIENHAVPKLKSLLLLREQFTTLIMQIVGFITETTARVGE 1167
             D++L  +AD+++ LDE  + +E     +L++L      F     +   +   T  R   
Sbjct: 218  DDDELTDVADALEQLDEYRDELE-----RLEALERALRNFLQ---RYRRY-ARTMLRRRA 268

Query: 1168 VDGNSATVQEKIDKYDKIIADIQEYEAILATASDKGDQLSSDGTISDRNEITEQLQSRKQ 1227
                SA  Q     YD++  D+      L TA ++  +L +     +R    + L++R +
Sbjct: 269  TRLRSAQTQ-----YDQLSRDLGRARDELETAREEERELDARTEALEREA--DALRTRLE 321

Query: 1228 QLHNLR-----KTVEKLRQQNE---KRAAESEKLGAELEEIIEALHARETAVPSSSRLHN 1279
             L         + +E+ R   E     AA++ +   E E  +E    R      + R   
Sbjct: 322  ALQGSPAYQDAEELERARADAEALQAAAADARQAIREAESRLEE--ERRRLDEEAGR--- 376

Query: 1280 LRKTVEKLRQQNEKRAAESEKLG---AELEEIIEALHARETAVKTLPVLLLEVTSVDVEL 1336
            L     +LR   E+ A  +E+ G   A   E   AL A+E         L E+ ++D   
Sbjct: 377  LDDAERELRAAREQLARAAERAGLSPAHTAEPDAALAAQE---------LQELGALDARR 427

Query: 1337 DK-----QRLELATKEVQPYVQ---RYETLKKNIAEAIDKFETQANEHEAYKQAYNEAYD 1388
                    +       ++       R E          D+ + +A +  A  +  +EA  
Sbjct: 428  QDADRVIAQRSEQVALLRRRDDVADRAEATHAAARARRDELDEEAEQAAARAELADEAVH 487


>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
          Length = 895

 Score = 39.5 bits (92), Expect = 0.025
 Identities = 81/447 (18%), Positives = 179/447 (40%), Gaps = 70/447 (15%)

Query: 714  SQNTLQWLDNIRSDMKTLLERSKHIVDD-IEKKKDDLKPIQKEAQQLSEWDSSIVPKVSD 772
            +++ L       +  K L ER   I++D + K ++ +    K    +      +    ++
Sbjct: 261  AESDLSMELEKNNYYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKKQILSNIDAE 320

Query: 773  LETRWAAVKSAWDAKHSDLSSEIAQYSSYQNAVQDAEKWLLQISFQLMAHNSLYINNRQQ 832
            +    A +K     K S L  +   Y   ++   D    +L++    M +NS Y+ + + 
Sbjct: 321  INKYHAIIK-----KLSVLQKDYNDYIKKKSRYDDLNNQILELEGYEMDYNS-YLKSIES 374

Query: 833  TLEQIQHHEKLLQDYSSYQNAVQDAE--------KWLLQISFQLMAHNSLYINNRQQTLE 884
              ++I+ + K ++  S++ + +   +        K L +I+ +L   +S   +  Q+   
Sbjct: 375  LKKKIEEYSKNIERMSAFISEILKIQEIDPDAIKKELNEINVKLQDISSKVSSLNQRIRA 434

Query: 885  QIQHHEKLLQDV-ILSPDNKLAPCFS--------------AEEGSPIKDKIDRLEKETKA 929
              ++ ++L +++ +L+  +    C +               E+ S +++KI  +E E K 
Sbjct: 435  LRENLDELSRNMEMLNGQSVCPVCGTTLGEEKSNHIINHYNEKKSRLEEKIREIEIEVKD 494

Query: 930  QSDRIQDALKTL---------------NQLIGHRKEFEVDVENVEAKLKELEVAVEGDVK 974
              ++I D  K                 N++   R + E D++    +LK+     E    
Sbjct: 495  IDEKIVDLKKRKEYLESEEINKSINEYNKIESARADLE-DIKIKINELKDKHDKYEEIKN 553

Query: 975  TNNVLMLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTVNDSVSSMQ---- 1030
                L L++L +K             +IS + I T     N+  +  +ND  S +Q    
Sbjct: 554  RYKSLKLEDLDSKRTSWL----NALAVISLIDIETNRSRSNEIKK-QLNDLESRLQEIEI 608

Query: 1031 -----KRYAQLS-RTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLA-------- 1076
                 K Y   S R IE+  + +NNK   ++  +  +E+ +  +   + Q+A        
Sbjct: 609  GFPDDKSYIDKSIREIENEANNLNNKYNEIQENKILIEKLRGKIDNYKKQIAEIDSIIPD 668

Query: 1077 -RPVSTQAEDISEALTVYQKLLDEINS 1102
             + ++++  DI + L   +K LD+  +
Sbjct: 669  LKEITSRINDIEDNLKKSRKALDDAKA 695



 Score = 38.0 bits (88), Expect = 0.062
 Identities = 102/585 (17%), Positives = 217/585 (37%), Gaps = 49/585 (8%)

Query: 1157 FITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASDKGDQLSS-DGTISDR 1215
            F+       GE+D   + +     +  KI+ +I E  ++        D +      IS+ 
Sbjct: 129  FLNSIFVGQGEMD---SLISGDPAQRKKILDEILEINSLERNYDKLKDVIDMLRAEISNI 185

Query: 1216 NEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSS 1275
            + + E+L+S   +L N++K +      +EK  + + K    L        A +      S
Sbjct: 186  DYLEEKLKSSNLELENIKKQIA----DDEKSHSITLKEIERLSIEYNN--AMDDYNNLKS 239

Query: 1276 RLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVE 1335
             L+ L    +   +   +       L  ELE+            K L    +++ +  V 
Sbjct: 240  ALNELSSLEDMKNRYESEIKTAESDLSMELEKNNY--------YKELEERHMKIINDPVY 291

Query: 1336 LDKQRLELATKEVQPYVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNEAYDWLRKAKL 1395
             ++  +    K       + + L  NI   I+K+     +    ++ YN+      +   
Sbjct: 292  KNRNYINDYFKYKNDIENKKQIL-SNIDAEINKYHAIIKKLSVLQKDYNDYIKKKSRYDD 350

Query: 1396 GAQANADCHGEQQTTKDKADKIKQITKSLPEGQKLIDKTVALKNNVLESTGPIGKENINQ 1455
                  +  G +         I+ + K + E  K I++  A  + +L     I  + I +
Sbjct: 351  LNNQILELEGYEMDYNSYLKSIESLKKKIEEYSKNIERMSAFISEIL-KIQEIDPDAIKK 409

Query: 1456 EINQITLDWTNLQNTLQDIDKHHAKCLSLWNDFLSSKNTLEKWIEGFQKKIESEKDIGDC 1515
            E+N+I +    LQ+    +   + +  +L  +       +E  + G          +G+ 
Sbjct: 410  ELNEINVK---LQDISSKVSSLNQRIRALRENLDELSRNMEM-LNGQSVCPVCGTTLGEE 465

Query: 1516 TNLDDLEKY---KALLQEGL-HKEADNKKVLIDSFNKGIIGMTE-LESEEVKDLIQVGVH 1570
             +   +  Y   K+ L+E +   E + K   ID     +    E LESEE+   I    +
Sbjct: 466  KSNHIINHYNEKKSRLEEKIREIEIEVKD--IDEKIVDLKKRKEYLESEEINKSIN--EY 521

Query: 1571 GLQVELNALLQNIEAEINKVANAAQERKAMQDKI---------SKLQSWLK-----QYID 1616
                   A L++I+ +IN++ +   + + ++++          SK  SWL        ID
Sbjct: 522  NKIESARADLEDIKIKINELKDKHDKYEEIKNRYKSLKLEDLDSKRTSWLNALAVISLID 581

Query: 1617 YSGNKQALQSRLDKVNEIQESFPEISTKLQTLSDHIENSSRKLPSRTNEAMYRDLANLRY 1676
               N+        ++N+++    EI         +I+ S R++ +  N     +  N   
Sbjct: 582  IETNRSRSNEIKKQLNDLESRLQEIEIGFPDDKSYIDKSIREIENEAN--NLNNKYNEIQ 639

Query: 1677 DFEKCVSSLSDVKQGLESRLVQWNEYETSLGKLITWLNDTENVLK 1721
            + +  +  L       + ++ + +     L ++ + +ND E+ LK
Sbjct: 640  ENKILIEKLRGKIDNYKKQIAEIDSIIPDLKEITSRINDIEDNLK 684



 Score = 33.3 bits (76), Expect = 1.7
 Identities = 87/540 (16%), Positives = 196/540 (36%), Gaps = 59/540 (10%)

Query: 1530 EGLHKEADNKKVLIDSFNKGIIGMTELESEEVKDLIQVGVHGLQVELNALLQNIEAEINK 1589
              L +  D  K +ID     I  +  LE +         +    +EL  + + I  +   
Sbjct: 162  NSLERNYDKLKDVIDMLRAEISNIDYLEEK---------LKSSNLELENIKKQIADDEKS 212

Query: 1590 VANAAQERKAMQDKISKLQSWLKQYIDYSGNKQALQSRLDKVNEIQESFPEISTKLQTL- 1648
             +   +E + +  +     + +  Y +       L S  D  N  +       + L    
Sbjct: 213  HSITLKEIERLSIEY---NNAMDDYNNLKSALNELSSLEDMKNRYESEIKTAESDLSMEL 269

Query: 1649 --SDHIENSSRKLPSRTNEAMYRDLANLRYDFEKCVSSLSDVKQGLESRLVQWNEYETSL 1706
              +++ +    +     N+ +Y++   +  D+ K  + + + KQ L +   + N+Y   +
Sbjct: 270  EKNNYYKELEERHMKIINDPVYKNRNYIN-DYFKYKNDIENKKQILSNIDAEINKYHAII 328

Query: 1707 GKLITWLNDTENVLKNFTLQPTLNEKKQQRELKGKLSSWCGYEESVATLLDWLKVTEKKL 1766
             KL     D  + +K  +    LN   Q  EL+G    +  Y +S+ +L   ++   K +
Sbjct: 329  KKLSVLQKDYNDYIKKKSRYDDLN--NQILELEGYEMDYNSYLKSIESLKKKIEEYSKNI 386

Query: 1767 GD-EMELKTTLDEKKAQLQVYRTIYQDATSHQQDLLQLKDKIESLHQPSEQSKQQLATIT 1825
                  +   L  ++      +    +       L  +  K+ SL+Q     ++ L  ++
Sbjct: 387  ERMSAFISEILKIQEIDPDAIKKELNEINV---KLQDISSKVSSLNQRIRALRENLDELS 443

Query: 1826 TRHGNVLKRAQKKIEAEKDIGDCTNLDDLEKY---KALLQEVISH-NSDMETLNDKCEAL 1881
             R+  +L            +G+  +   +  Y   K+ L+E I     +++ +++K   L
Sbjct: 444  -RNMEMLNGQSVCPVCGTTLGEEKSNHIINHYNEKKSRLEEKIREIEIEVKDIDEKIVDL 502

Query: 1882 MELAAHSPIREETLRLQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHA 1941
             +   +    EE  +    Y                 +      L   +     L   H 
Sbjct: 503  KKRKEYLE-SEEINKSINEY---------------NKIESARADLEDIKIKINELKDKHD 546

Query: 1942 TVQECDELEAGIKEEQKNIETQYLQWNSFRETLAQMTAWLDSVEKNIKQEIATPWTTTQE 2001
              +E       +K E  +++++   W +             +V   I  +I T  + + E
Sbjct: 547  KYEEIKNRYKSLKLE--DLDSKRTSWLN-----------ALAVISLI--DIETNRSRSNE 591

Query: 2002 LRSKLLKLKEILKKCEKGVGDHEAFVDKY-EECSQKLAQVEAKYAKLIEPETSYEELQKR 2060
            ++ +L  L+  L++ E G  D ++++DK   E   +   +  KY ++ E +   E+L+ +
Sbjct: 592  IKKQLNDLESRLQEIEIGFPDDKSYIDKSIREIENEANNLNNKYNEIQENKILIEKLRGK 651


>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
            autophagy-related subunit 14.  The Atg14 or Apg14
            proteins are hydrophilic proteins with a predicted
            molecular mass of 40.5 kDa, and have a coiled-coil motif
            at the N terminus region. Yeast cells with mutant Atg14
            are defective not only in autophagy but also in sorting
            of carboxypeptidase Y (CPY), a vacuolar-soluble
            hydrolase, to the vacuole. Subcellular fractionation
            indicate that Apg14p and Apg6p are peripherally
            associated with a membrane structure(s). Apg14p was
            co-immunoprecipitated with Apg6p, suggesting that they
            form a stable protein complex. These results imply that
            Apg6/Vps30p has two distinct functions: in the autophagic
            process and in the vacuolar protein sorting pathway.
            Apg14p may be a component specifically required for the
            function of Apg6/Vps30p through the autophagic pathway.
            There are 17 auto-phagosomal component proteins which are
            categorized into six functional units, one of which is
            the AS-PI3K complex (Vps30/Atg6 and Atg14). The AS-PI3K
            complex and the Atg2-Atg18 complex are essential for
            nucleation, and the specific function of the AS-PI3K
            apparently is to produce phosphatidylinositol 3-phosphate
            (PtdIns(3)P) at the pre-autophagosomal structure (PAS).
            The localisation of this complex at the PAS is controlled
            by Atg14. Autophagy mediates the cellular response to
            nutrient deprivation, protein aggregation, and pathogen
            invasion in humans, and malfunction of autophagy has been
            implicated in multiple human diseases including cancer.
            This effect seems to be mediated through direct
            interaction of the human Atg14 with Beclin 1 in the human
            phosphatidylinositol 3-kinase class III complex.
          Length = 307

 Score = 38.5 bits (90), Expect = 0.030
 Identities = 22/116 (18%), Positives = 50/116 (43%), Gaps = 1/116 (0%)

Query: 1190 QEYEAILATASDKGDQLSSDGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAE 1249
            Q+ E  L  A+++  +L++D    +     E+L   + ++  L++ +E+ R++ E+    
Sbjct: 41   QKVEEALEGATNEDGKLAADLLKLEVARKKERLNQIRARISQLKEEIEQKRERIEELKRA 100

Query: 1250 SEKLGAELEEIIEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAEL 1305
              +  ++L      L  R  +         +++T  KL   +   A +   L  EL
Sbjct: 101  LAQRRSDLSSASYQLEKRRASQL-EKLQDEIKRTRSKLNALHSLLAEKRSFLCREL 155


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
            protein family have a signal peptide, a strongly
            conserved SH3 domain, a variable region, and then a
            C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 37.7 bits (88), Expect = 0.032
 Identities = 26/95 (27%), Positives = 46/95 (48%), Gaps = 10/95 (10%)

Query: 1216 NEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSS 1275
             E+ E+L   ++QL  L++  ++L+Q+     AE E+L  EL  I +           S+
Sbjct: 76   AELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQL----------SA 125

Query: 1276 RLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIE 1310
                L +   +LR++  +   E+E L AE E + E
Sbjct: 126  NAIELDEENRELREELAELKQENEALEAENERLQE 160



 Score = 37.3 bits (87), Expect = 0.044
 Identities = 17/80 (21%), Positives = 31/80 (38%), Gaps = 3/80 (3%)

Query: 1213 SDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVP 1272
             +  E+ EQL   +Q+   L++ +  L  + E+   E  ++       IE          
Sbjct: 80   EELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEENREL-- 137

Query: 1273 SSSRLHNLRKTVEKLRQQNE 1292
                L  L++  E L  +NE
Sbjct: 138  -REELAELKQENEALEAENE 156



 Score = 33.8 bits (78), Expect = 0.64
 Identities = 23/101 (22%), Positives = 48/101 (47%), Gaps = 7/101 (6%)

Query: 1220 EQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHN 1279
            E+L   +Q+L  L++ + +L++Q  +   E+++L  EL  +   L   +      +R+  
Sbjct: 66   ERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQK---ELARIKQ 122

Query: 1280 LRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVK 1320
            L     +L ++N +   E      EL++  EAL A    ++
Sbjct: 123  LSANAIELDEENRELREELA----ELKQENEALEAENERLQ 159


>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
            production and conversion].
          Length = 660

 Score = 38.5 bits (90), Expect = 0.047
 Identities = 48/258 (18%), Positives = 79/258 (30%), Gaps = 42/258 (16%)

Query: 1109 DEDLVHLAD---SIKPLDEVVER-IENHAVPKLKSLLLLREQFTTLIMQIVGFITETTAR 1164
            D  LVHL D     K L E+ +  +   A   L SLL     +   +  + G +      
Sbjct: 27   DFGLVHLEDLEEGEKGLKELEKLKVAEVAQISLSSLLSEVLDYLRSVKGLEGRLFILPEE 86

Query: 1165 VGEVDGNSATVQEKIDKYDKIIADIQEYEAILATA--------SDKGDQLSSDGTISDRN 1216
            V +++    +++E I   +K  ++++E    L            D  D L     ++  +
Sbjct: 87   VEKLEAELKSLEEVIKPAEKFSSEVEELTRKLEERLSELDEELEDLEDLLEELEPLAYLD 146

Query: 1217 EITEQLQSRK-----QQLHNLRKTVEKLRQQNEKRAA-ESEKLGAELEEIIEALHARETA 1270
                 L+  K       L    K  E L    E   A   E + A +  ++         
Sbjct: 147  FDLSLLRGLKFLLVRLGLVRREKL-EALVGVIEDEVALYGENVEASVVIVVAHGAEDLDK 205

Query: 1271 V--------------------PSSSRLHNLRKTVEK---LRQQNEKRAAESEKLGAELEE 1307
            V                    PS           E    L     +  A +EK+  EL  
Sbjct: 206  VSKILNELGFELYEVPEFDGGPSELISELEEVIAEIQDELESLRSELEALAEKIAEELLA 265

Query: 1308 IIEALHARETAVKTLPVL 1325
            + E L   +     L  L
Sbjct: 266  VREILEIEKALGDVLSKL 283


>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
            (CALCOCO1) like.  Proteins found in this family are
            similar to the coiled-coil transcriptional coactivator
            protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
            This protein binds to a highly conserved N-terminal
            domain of p160 coactivators, such as GRIP1, and thus
            enhances transcriptional activation by a number of
            nuclear receptors. CALCOCO1 has a central coiled-coil
            region with three leucine zipper motifs, which is
            required for its interaction with GRIP1 and may regulate
            the autonomous transcriptional activation activity of the
            C-terminal region.
          Length = 546

 Score = 38.4 bits (89), Expect = 0.049
 Identities = 34/124 (27%), Positives = 55/124 (44%), Gaps = 11/124 (8%)

Query: 1160 ETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASDKGDQLSSDGTISDRNEIT 1219
            E   R+ E++ +  T+ +K  + D+++   Q+ EA L     +  Q     TI  R+E T
Sbjct: 217  ENQQRILELEQDIQTLTQKKQENDRVLEGTQDIEAELERMKGELKQRLKKMTIQRRDEET 276

Query: 1220 E--QLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRL 1277
            E   LQ   +QLH      E LR   E+  +  +K G   EE+      R+  +   + L
Sbjct: 277  ERIDLQLENEQLH------EDLRTLQERLESSQQKAGLLGEELASLGSLRDHTI---AEL 327

Query: 1278 HNLR 1281
            H  R
Sbjct: 328  HQSR 331


>gnl|CDD|235449 PRK05415, PRK05415, hypothetical protein; Provisional.
          Length = 341

 Score = 37.2 bits (87), Expect = 0.085
 Identities = 23/93 (24%), Positives = 38/93 (40%), Gaps = 15/93 (16%)

Query: 1050 KLQSVKRLQHKVEEAKALLQKAESQLARPVSTQ------AEDISEALTVYQKLLDEINSW 1103
            +L+ +++  H  +EA+ALL   +   AR    +         +  AL  +Q  L E    
Sbjct: 124  RLRRLRQRAHLRDEARALLHSHDVGEARAFCEKLAKQAGIPQLHPALQRWQASLHET--- 180

Query: 1104 KNSQSDEDLVHLADSI--KPLDEVVER-IENHA 1133
                +D +L+ L +     PLD    R I   A
Sbjct: 181  ---HNDAELLRLYEREVLPPLDAQARRLISRAA 210


>gnl|CDD|221489 pfam12252, SidE, Dot/Icm substrate protein.  This family of proteins
            is found in bacteria. Proteins in this family are
            typically between 397 and 1543 amino acids in length.
            This family is the SidE protein in the Dot/Icm pathway of
            Legionella pneumophila bacteria. There is little
            literature describing the family.
          Length = 1443

 Score = 37.6 bits (87), Expect = 0.093
 Identities = 44/142 (30%), Positives = 69/142 (48%), Gaps = 19/142 (13%)

Query: 1210 GTISDRNEITEQLQSRKQQLHNLRKTV--------EKLRQQNEKRAAESEKLGAELEEII 1261
            GTIS   E+ EQ+QS +Q+L +L++ V        EK+R++ +     + K   ELE+  
Sbjct: 1066 GTIS--RELREQIQSVRQELESLQRAVVTPVVTDEEKVRERYDALIENTSKRITELEKAK 1123

Query: 1262 EA-LHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHAR--ETA 1318
             A L   + A+   S L+NL++ V  LR  NEK    +     +  + IE L  +     
Sbjct: 1124 LANLDVVKKAI---SNLNNLQQEVTLLR--NEKIRMHTGTDKVDFSD-IEKLEKQLQVID 1177

Query: 1319 VKTLPVLLLEVTSVDVELDKQR 1340
             K     LLEVT     L+K++
Sbjct: 1178 TKLADAYLLEVTKQISALEKEK 1199


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
            protein is found in bacteria and eukaryotes. Proteins in
            this family are typically between 943 to 1234 amino acids
            in length. This family contains a P-loop motif suggesting
            it is a nucleotide binding protein. It may be involved in
            replication.
          Length = 1198

 Score = 37.4 bits (87), Expect = 0.11
 Identities = 45/286 (15%), Positives = 103/286 (36%), Gaps = 28/286 (9%)

Query: 1025 SVSSMQKRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAE 1084
             +        +  R   +  + +      ++RLQ++ +  K  L+ A ++  +   TQ  
Sbjct: 629  QLVQANAELEEQKRAEAEARTALKQARLDLQRLQNEQQSLKDKLELAIAERKQQAETQLR 688

Query: 1085 DISEALTVYQKLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIENHAVPKLKSLLLLR 1144
             +   L     LL++  ++  +  D+      + +     V   ++N             
Sbjct: 689  QLDAQLKQ---LLEQQQAFLEALKDDFRELRTERLAKWQVVEGELDN------------- 732

Query: 1145 EQFTTLIMQIVGFITETTARVGEVDGNSATVQEKID----KYDKIIADIQEYEAILATAS 1200
             Q   L   I    T+  AR+ E+          +D       ++   I+E E  +   +
Sbjct: 733  -QLAQLSAAIEAARTQAKARLKELKKQYDRELASLDVDPNTVKELKRQIEELETTIERIA 791

Query: 1201 DKGDQLSS-----DGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGA 1255
             +  ++         T   R+ + E+  +   QL  L  + E+L+Q+  +   +++    
Sbjct: 792  VRRPEVREYRAFMQETWLHRDSLREERPNLAIQLRELESSAEELQQELTRLIKDTKLRRK 851

Query: 1256 ELEEIIEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKL 1301
            +LE+  +AL  +   +     L  LR  + +L +  E   A   + 
Sbjct: 852  KLEQERKALEKQLDQL--DELLRGLRDEMRQLAELKEPANANQAEG 895


>gnl|CDD|205692 pfam13514, AAA_27, AAA domain.  This domain is found in a number of
            double-strand DNA break proteins. This domain contains a
            P-loop motif.
          Length = 1118

 Score = 37.1 bits (86), Expect = 0.12
 Identities = 43/212 (20%), Positives = 74/212 (34%), Gaps = 45/212 (21%)

Query: 1209 DGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRA---------AESEKLGAELEE 1259
                    ++ + L   +++L NLRK + +L ++ ++           AE + L  +L  
Sbjct: 170  QLKTRTWKDLVKALDEAEEELANLRKELRQLEKEKQRLERLRRLLPLLAERKALEQQLAA 229

Query: 1260 IIEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEI----------- 1308
            + E +     AV    R    R  +   R+  E      E L AEL+EI           
Sbjct: 230  LGEVIDLPPDAV---ERYEEARAELRAARRNLELLTERLEALQAELDEISLDEELLAQAA 286

Query: 1309 -IEALHAR----ETAVKTLPVLLLEVTS-----------------VDVELDKQRLELATK 1346
             IEALH +      A + LP    E+ +                  +     +    A +
Sbjct: 287  AIEALHQQRGEYRNAEQDLPDREGEIANAREAAAALLAQIGPDADEEAVESLRPSLAAKE 346

Query: 1347 EVQPYVQRYETLKKNIAEAIDKFETQANEHEA 1378
             V    +R E L + +  A D  E +  E + 
Sbjct: 347  TVTELEKRKEALDQALKSARDALEERERELKQ 378



 Score = 35.9 bits (83), Expect = 0.29
 Identities = 90/596 (15%), Positives = 195/596 (32%), Gaps = 80/596 (13%)

Query: 732  LERSKHIVDDIEKKKDDLKPIQKEAQQLSEWDS-----SIVPKVSDLETRWAAVKSAWDA 786
            L   +  +  +EK+K  L+ +++    L+E  +     + + +V DL            A
Sbjct: 190  LANLRKELRQLEKEKQRLERLRRLLPLLAERKALEQQLAALGEVIDLPPDAVERYEEARA 249

Query: 787  KHSDLSSEIAQYSSYQNAVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQHHEKLLQD 846
            +       +   +    A+Q     L +IS              ++ L Q    E L Q 
Sbjct: 250  ELRAARRNLELLTERLEALQAE---LDEISLD------------EELLAQAAAIEALHQQ 294

Query: 847  YSSYQNAVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQHHEKLLQDVILSPDNKLAP 906
               Y+NA QD      +                    E       LL    + PD     
Sbjct: 295  RGEYRNAEQDLPD--REGE-------------IANAREAA---AALLAQ--IGPDADEEA 334

Query: 907  CFSAEEGSPIKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELE 966
              S       K+ +  LEK  +A    ++ A   L       +E E +++ V A+L  L 
Sbjct: 335  VESLRPSLAAKETVTELEKRKEALDQALKSARDAL-------EERERELKQVRAQLAALP 387

Query: 967  VAVEGDVKTNNVLMLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTVNDSV 1026
                 +     +       +  A L     E+ +    ++ A   + L   D   +    
Sbjct: 388  TVTVPEPLRAALADALRAGDIDAALAALEQEVAVAKRELAQALSRLGLWRGDLEELVALP 447

Query: 1027 SSMQKRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDI 1086
                ++     +  E+ +     K   +  L+  +E  +  L+  ++  A  V T+ E +
Sbjct: 448  VPSAEQVQAFLKEAEEIMQAKRQKRDRLLELEEDLETLELQLRHLDA--AGAVPTEEE-V 504

Query: 1087 SEALTVYQKLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIENHAVPKLKSLLLLREQ 1146
            + A      L  +I             +  ++++  D++ +R+   A             
Sbjct: 505  AAARARRDALWQDIRRGYALLEPTASAYE-EAVESADQLADRLLREA------------- 550

Query: 1147 FTTLIMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASDKGDQL 1206
                         +    +  +       + ++++ +K +  ++   A L   + +    
Sbjct: 551  -------------QLVGELQSLRQQEEAARRRLEQLEKELEVLELALAAL-REAWQAQWA 596

Query: 1207 SSDGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHA 1266
            ++   ++   E+ + L  R      +R    K R + +       +L A L   ++A+  
Sbjct: 597  AAGLPLTPA-EMEDWLAERATAREQVR-AYFKARAELDALLDRRARLRAALRAALKAVAI 654

Query: 1267 RETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTL 1322
                   +  L   R+ +E+  +Q  ++A+  E+L      + EA    + A   L
Sbjct: 655  VLPGEELAELLELARQLLEEAEKQAARKASLDERLRDAERALEEAEERHDEAQSAL 710



 Score = 35.5 bits (82), Expect = 0.40
 Identities = 40/195 (20%), Positives = 81/195 (41%), Gaps = 18/195 (9%)

Query: 1981 LDSVEKNIKQEIATPWT---TTQELRSKLLKLKEI---LKKCEKGVGDHEAFVDKYEECS 2034
            L SV   +++E    W       E+   L +LKE+   +++ +      +  V   +E  
Sbjct: 128  LGSVRDELEKEADELWKPRGRKPEINVALKELKELEAEIREVQLKTRTWKDLVKALDEAE 187

Query: 2035 QKLAQVEAKYAKLIEPETSYEELQKRQGQLTALLTEKVDMDVL--LNACVDLCDKVYES- 2091
            ++LA +  +  +L   E   + L++ +  L  L   K     L  L   +DL     E  
Sbjct: 188  EELANLRKELRQL---EKEKQRLERLRRLLPLLAERKALEQQLAALGEVIDLPPDAVERY 244

Query: 2092 -TSEPGHEPLRVQMEKLQ---QAVEALYDKITVTESLI--LSQVEAKYAKLIEPETSYEE 2145
              +       R  +E L    +A++A  D+I++ E L+   + +EA + +  E   + ++
Sbjct: 245  EEARAELRAARRNLELLTERLEALQAELDEISLDEELLAQAAAIEALHQQRGEYRNAEQD 304

Query: 2146 LQKRQGQLTALLAEK 2160
            L  R+G++       
Sbjct: 305  LPDREGEIANAREAA 319


>gnl|CDD|233885 TIGR02479, FliA_WhiG, RNA polymerase sigma factor, FliA/WhiG family. 
            Most members of this family are the flagellar operon
            sigma factor FliA, controlling transcription of bacterial
            flagellar genes by RNA polymerase. An exception is the
            sigma factor WhiG in the genus Streptomyces, involved in
            the production of sporulating aerial mycelium.
          Length = 224

 Score = 36.1 bits (84), Expect = 0.13
 Identities = 27/110 (24%), Positives = 50/110 (45%), Gaps = 11/110 (10%)

Query: 1224 SRKQQLHNLRKTVEKLRQQNEKRAAESE---KLGAELEEIIEALHAR--------ETAVP 1272
            S +Q+   L + + +L  +  +   E E   +LG +L+E  +AL+          +  + 
Sbjct: 77   SLRQKARKLERAIRELEARLGREPTEEEIAEELGMDLKEYRQALNEINALSLVSLDELLE 136

Query: 1273 SSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTL 1322
            S     +L   +E  + ++ +   E E+L   L E IE+L  RE  V +L
Sbjct: 137  SGDDGGSLIDRIEDDKSEDPEEELEREELREALAEAIESLSEREQLVLSL 186


>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
          Length = 1486

 Score = 36.9 bits (86), Expect = 0.15
 Identities = 23/92 (25%), Positives = 35/92 (38%), Gaps = 14/92 (15%)

Query: 1220 EQLQSRKQQLHNLRK------TVEKLRQQNEKR-------AAESEKLGAELEEIIEALHA 1266
            EQLQ  + +L  L +        E+L  +  KR         E E+L  ELE  +E+L  
Sbjct: 513  EQLQQLRMRLSELEQRLRQQQRAERLLAEFCKRLGKNLDDEDELEQLQEELEARLESLSE 572

Query: 1267 R-ETAVPSSSRLHNLRKTVEKLRQQNEKRAAE 1297
                A      L    + ++   Q+   RA  
Sbjct: 573  SVSEARERRMALRQQLEQLQARIQRLAARAPA 604



 Score = 34.9 bits (81), Expect = 0.67
 Identities = 87/426 (20%), Positives = 161/426 (37%), Gaps = 73/426 (17%)

Query: 1244 EKRAAESEKLGA---ELEEIIEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEK 1300
             + AAE  +L     EL E+ EA    E    ++S   +L      LRQQ      + E+
Sbjct: 300  RQLAAEQYRLVEMARELAELNEAESDLEQDYQAAS--DHLNLVQTALRQQE-----KIER 352

Query: 1301 LGAELEEIIEALHARETAVKTLPVLLLEVTSVDVELDKQRLELATKEVQPYVQRYETLKK 1360
              A+LEE+ E L  +   V+       E         + R E A +EV       + LK 
Sbjct: 353  YQADLEELEERLEEQNEVVEEADEQQEEN--------EARAEAAEEEV-------DELKS 397

Query: 1361 NIA---EAIDKFETQANEHEAYKQAYNEAYDWLRKAKLGAQANADCHGEQQTTKDKADKI 1417
             +A   +A+D  +T+A +++   QA   A        L A  NA+     +  + K  + 
Sbjct: 398  QLADYQQALDVQQTRAIQYQQAVQALERAKQLCGLPDLTA-DNAE--DWLEEFQAKEQEA 454

Query: 1418 KQITKSLPEGQKLIDKTVALK------NNVLESTGPIGKENINQEINQITLDWTNLQNTL 1471
             +   SL   QKL     A          V +  G + +        ++       ++  
Sbjct: 455  TEELLSL--EQKLSVAQAAHSQFEQAYQLVRKIAGEVSRSEAWDVARELLRRLREQRHLA 512

Query: 1472 QDIDKHHAKCLSLWNDFLSSKNTLEKWIEGFQKKIESEKDIGDCTNLDDLEKYKALLQEG 1531
            + + +   +   L    L  +   E+ +  F K++       +  + D+LE+ +  L+  
Sbjct: 513  EQLQQLRMRLSELEQR-LRQQQRAERLLAEFCKRLGK-----NLDDEDELEQLQEELEAR 566

Query: 1532 LHKEADNKKVLIDSFNKGIIGMTELESEEVKDLIQVGVHGLQVELNALLQNIEAEINKVA 1591
            L                      E  SE V +  +      ++ L   L+ ++A I ++A
Sbjct: 567  L----------------------ESLSESVSEARER-----RMALRQQLEQLQARIQRLA 599

Query: 1592 NAAQERKAMQDKISKLQSWLKQYIDYS-GNKQALQSRLDKVNEIQESFPEISTKLQTLSD 1650
              A    A QD +++L+    +  + S    + +Q  L++  E+     E++ + Q L +
Sbjct: 600  ARAPAWLAAQDALARLREQSGEEFEDSQDVTEYMQQLLERERELTVERDELAARKQALDE 659

Query: 1651 HIENSS 1656
             IE  S
Sbjct: 660  EIERLS 665



 Score = 31.1 bits (71), Expect = 9.5
 Identities = 28/133 (21%), Positives = 58/133 (43%), Gaps = 23/133 (17%)

Query: 193  KTISQLEESLDIFQ----QFQQLQKAYQE---DQKQLWDKLSSLTE-RELKGKLSSWCGY 244
              ++QLE  + + Q    QF+QL++ YQ+    Q+    +  +LTE  + +   S    Y
Sbjct: 918  NALAQLEPIVSVLQSDPEQFEQLKQDYQQAQQTQRDAKQQAFALTEVVQRRAHFS----Y 973

Query: 245  EESVATLLDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRTIYQDATSHQQDLLQLKDKIE 304
            E++            + L    +L   L ++  Q +  RT  ++     Q  L   +++ 
Sbjct: 974  EDAA-----------EMLAKNSDLNEKLRQRLEQAEQERTRAREQLRQAQAQLAQYNQVL 1022

Query: 305  SLHQPSEQSKQQL 317
            +  + S  +K+Q+
Sbjct: 1023 ASLKSSYDAKRQM 1035


>gnl|CDD|219176 pfam06785, UPF0242, Uncharacterized protein family (UPF0242). 
          Length = 401

 Score = 36.4 bits (84), Expect = 0.15
 Identities = 30/154 (19%), Positives = 67/154 (43%), Gaps = 34/154 (22%)

Query: 1240 RQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESE 1299
            R++ + + A  + L  ++ ++ E              L N+R+++E+ ++++++   +++
Sbjct: 60   RRERQLKTAAGQLLQTKIRKLTE----------QDEGLRNIRESIEERQKESDQLKLQNQ 109

Query: 1300 KLGAELEEIIEALHARETAVKT------LPVLLLEVTSVDVELDKQRLELATKEVQPYVQ 1353
            KL  +L       HAR   +KT      L  L   +   +  L  Q L+   +E    ++
Sbjct: 110  KLVNQL------GHARGVFIKTKGRYQKLEELSRHLKEENQCLQIQ-LDALVQECNEKIE 162

Query: 1354 RYETLKKNIAEAIDKFETQANEHEAYKQAYNEAY 1387
              + L + +AE +           AY+Q  N+ Y
Sbjct: 163  ENQELNRELAETL-----------AYQQELNDEY 185



 Score = 33.3 bits (76), Expect = 1.4
 Identities = 22/90 (24%), Positives = 47/90 (52%), Gaps = 1/90 (1%)

Query: 1222 LQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEAL-HARETAVPSSSRLHNL 1280
            LQ++ ++L    + +  +R+  E+R  ES++L  + ++++  L HAR   + +  R   L
Sbjct: 73   LQTKIRKLTEQDEGLRNIRESIEERQKESDQLKLQNQKLVNQLGHARGVFIKTKGRYQKL 132

Query: 1281 RKTVEKLRQQNEKRAAESEKLGAELEEIIE 1310
             +    L+++N+    + + L  E  E IE
Sbjct: 133  EELSRHLKEENQCLQIQLDALVQECNEKIE 162


>gnl|CDD|223910 COG0840, Tar, Methyl-accepting chemotaxis protein [Cell motility and
            secretion / Signal transduction mechanisms].
          Length = 408

 Score = 36.5 bits (84), Expect = 0.17
 Identities = 36/281 (12%), Positives = 106/281 (37%), Gaps = 25/281 (8%)

Query: 980  MLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTVNDSVSSMQKRYAQLSRT 1039
            ++  + +    L+  + E+    + +S   +    +  +   V  ++  + +   +++  
Sbjct: 132  IIDAVQDNAEALSGASEEIAASATELSARADQQAESLEE---VASAIEELSETVKEVAFN 188

Query: 1040 IEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPV---STQAEDISEALTVYQKL 1096
             ++  +  +   Q  +    +V +A   +Q+   +LA  V   S  +++I E  +V   +
Sbjct: 189  AKEAAALASEASQVAEEGGEEVRQAVEQMQEIAEELAEVVKKLSESSQEIEEITSVINSI 248

Query: 1097 LDEIN---------SWKNSQSDEDLVHLADSIKPL--------DEVVERIENHAVPKLKS 1139
             ++ N         + +  ++      +AD ++ L         E+   IE     +   
Sbjct: 249  AEQTNLLALNAAIEAARAGEAGRGFAVVADEVRKLAERSADSAKEIGLLIEE-IQNEAAD 307

Query: 1140 LLLLREQFTTLIMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATA 1199
             +   E+  + + + V  + ET + +GE+      V + I +      +       +  +
Sbjct: 308  AVEHMEESASEVSEGVKLVEETGSSLGEIAAAIEEVSQLISEIAAATEEQTAVLEEINAS 367

Query: 1200 SDKGDQLSSDGTISDRNEITEQLQSRKQQLHNLRKTVEKLR 1240
             ++ D ++ +   +   E+    +  K+    L + V K +
Sbjct: 368  IEELDDVTQE-NAAAVEELAAASEELKELAEKLLELVAKFK 407


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division and
            chromosome partitioning].
          Length = 420

 Score = 35.9 bits (83), Expect = 0.24
 Identities = 36/218 (16%), Positives = 79/218 (36%), Gaps = 27/218 (12%)

Query: 916  IKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEGDVKT 975
             +D+  +LEK+ K+           +  L     E   D++ +  ++ +L   +      
Sbjct: 57   QQDQRAKLEKQLKSLE-------TEIASLEAQLIETADDLKKLRKQIADLNARLNA---- 105

Query: 976  NNVLMLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTVNDSVSSMQKRYAQ 1035
                +  +   +  +L E+   L     N   A   ++  DA R         +   Y  
Sbjct: 106  ----LEVQEREQRRRLAEQLAALQRSGRNPPPA-LLVSPEDAQR------SVRLAIYYGA 154

Query: 1036 LSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISEALTVYQK 1095
            L+    +RI  +   L+ +  ++ ++   +A L    S+       Q   +++ L   +K
Sbjct: 155  LNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRA----QQAKLAQLLEERKK 210

Query: 1096 LLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIENHA 1133
             L ++NS  ++   + L  L  +   L   +   E  A
Sbjct: 211  TLAQLNSELSADQKK-LEELRANESRLKNEIASAEAAA 247



 Score = 31.2 bits (71), Expect = 6.8
 Identities = 20/99 (20%), Positives = 39/99 (39%), Gaps = 2/99 (2%)

Query: 1222 LQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHN-L 1280
              +R +++  L+ T+++L     + AAE  +L   L E           +    +    L
Sbjct: 156  NPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQL 215

Query: 1281 RKTVEKLRQQNEKRAAESEKLGAELEEII-EALHARETA 1318
               +   +++ E+  A   +L  E+      A  ARE A
Sbjct: 216  NSELSADQKKLEELRANESRLKNEIASAEAAAAKAREAA 254


>gnl|CDD|234078 TIGR02980, SigBFG, RNA polymerase sigma-70 factor, sigma-B/F/G
            subfamily.  This group of similar sigma-70 factors
            includes clades found in Bacilli (including the
            sporulation factors SigF:TIGR02885 and SigG:TIGR02850 as
            well as SigB:TIGR02941), and the high GC gram positive
            bacteria (Actinobacteria) where a variable number of them
            are found depending on the lineage.
          Length = 227

 Score = 35.3 bits (82), Expect = 0.26
 Identities = 28/111 (25%), Positives = 45/111 (40%), Gaps = 24/111 (21%)

Query: 1225 RKQQLHN-LRKTVEKLRQQNEKR--AAE-SEKLGAELEEIIEALHARETAVPSSSRLHNL 1280
            R ++L   + K  E+L Q+  +    AE +E+LG   EE++EAL A       S  L   
Sbjct: 84   RLKELGVKINKATEELTQRLGRSPTIAEIAERLGVSEEEVVEALEAGNAYQALS--LD-- 139

Query: 1281 RKTVEKLRQQNEKRAAESEKLGAE------------LEEIIEALHARETAV 1319
                  +   +    A  + LG E            L+ ++ AL  RE  +
Sbjct: 140  ----ASIEDDDGDPIALLDTLGDEDDALETVEDRLALKPLLAALPERERRI 186


>gnl|CDD|168831 PRK07122, PRK07122, RNA polymerase sigma factor SigF; Reviewed.
          Length = 264

 Score = 34.8 bits (80), Expect = 0.38
 Identities = 29/100 (29%), Positives = 51/100 (51%), Gaps = 10/100 (10%)

Query: 1073 SQLARPVSTQAEDISEAL---TVYQKLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERI 1129
            S+LA  +    E++ E L   + Y      I+S   S  D+    +AD++  +D  +++I
Sbjct: 147  SELAAELGMDREEVVEGLVAGSAYN--TLSIDSGGGSGDDDAR-AIADTLGDVDAGLDQI 203

Query: 1130 ENH-AVPKLKSLLLLREQFTTLIMQIVGFITET--TARVG 1166
            EN  A+  L + L  RE+ T L+++    +T+T    RVG
Sbjct: 204  ENREALRPLLAALPERER-TVLVLRFFESMTQTQIAERVG 242


>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
            recombination, and repair].
          Length = 557

 Score = 35.3 bits (82), Expect = 0.44
 Identities = 32/142 (22%), Positives = 57/142 (40%), Gaps = 24/142 (16%)

Query: 1213 SDRNEITEQLQSRKQQLHNLRKTVEKLR------------QQNEKRAAESEKLGAELEEI 1260
             +  ++ E+ + R Q+   L+  +E+L             ++  KR + SEKL   ++  
Sbjct: 171  RELEDLQEKERERAQRADLLQFQLEELEELNLQPGEDEELEEERKRLSNSEKLAEAIQNA 230

Query: 1261 IEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVK 1320
            +E L   +  V + S L    + +E L + + K +  +E L   L E+ EA       + 
Sbjct: 231  LELLSGEDDTVSALSLLGRALEALEDLSEYDGKLSELAELLEEALYELEEASEELRAYLD 290

Query: 1321 TLPVLLLEVTSVDVELDKQRLE 1342
             L            E D  RLE
Sbjct: 291  EL------------EFDPNRLE 300



 Score = 31.8 bits (73), Expect = 5.2
 Identities = 47/281 (16%), Positives = 101/281 (35%), Gaps = 86/281 (30%)

Query: 841  EKLLQDYSSYQNAVQDAEKWLLQ----------ISFQLMAHNSLYI--------NNRQQT 882
            E   + Y +++ A ++ E    +          + FQL     L +           ++ 
Sbjct: 157  EAYQEAYQAWKQARRELEDLQEKERERAQRADLLQFQLEELEELNLQPGEDEELEEERKR 216

Query: 883  LEQIQHHEKLLQDV-----ILSPDNKLAPCFSAEEGSPIKDKIDRLEKETKAQSDRIQDA 937
            L    + EKL + +     +LS ++                       +T +    +  A
Sbjct: 217  L---SNSEKLAEAIQNALELLSGED-----------------------DTVSALSLLGRA 250

Query: 938  LKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEGDVKTNNVLMLQELLNKYAQLNEEAGE 997
            L+ L  L  +  +     E +E  L ELE A E            EL     +L  +   
Sbjct: 251  LEALEDLSEYDGKLSELAELLEEALYELEEASE------------ELRAYLDELEFDPNR 298

Query: 998  LNMMISNVSIATEHMNLNDADRLTVNDSVSSMQKRYA-------QLSRTIEDRISTVNNK 1050
            L  +                +RL    ++ S+ ++Y        +    I++ ++ ++N 
Sbjct: 299  LEEV---------------EERL---FALKSLARKYGVTIEDLLEYLDKIKEELAQLDNS 340

Query: 1051 LQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISEALT 1091
             +S++ L+ +V++ KA L +A   L+      A+++ + +T
Sbjct: 341  EESLEALEKEVKKLKAELLEAAEALSAIRKKAAKELEKEVT 381


>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family.  This
            family consists of the 116kDa V-type ATPase (vacuolar
            (H+)-ATPases) subunits, as well as V-type ATP synthase
            subunit i. The V-type ATPases family are proton pumps
            that acidify intracellular compartments in eukaryotic
            cells for example yeast central vacuoles, clathrin-coated
            and synaptic vesicles. They have important roles in
            membrane trafficking processes. The 116kDa subunit
            (subunit a) in the V-type ATPase is part of the V0
            functional domain responsible for proton transport. The a
            subunit is a transmembrane glycoprotein with multiple
            putative transmembrane helices it has a hydrophilic amino
            terminal and a hydrophobic carboxy terminal. It has roles
            in proton transport and assembly of the V-type ATPase
            complex. This subunit is encoded by two homologous gene
            in yeast VPH1 and STV1.
          Length = 707

 Score = 35.0 bits (81), Expect = 0.47
 Identities = 44/256 (17%), Positives = 87/256 (33%), Gaps = 27/256 (10%)

Query: 913  GSPIKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEGD 972
            G P+KD   + +     +   +++ +  L   I   KE E ++E++E ++ ELE     +
Sbjct: 51   GIPLKDTGGKPDVPPSKEFLDLEEEILDLEAEI---KEVEENLESLEKEINELE--EWLN 105

Query: 973  VKTNNVLMLQELLN---KYAQLNEEAGEL-NMMISNVSIATEHMNLNDADRLTVNDSVSS 1028
            V       L E L    + + L+ +   L       +      +N    +       +  
Sbjct: 106  VLDEEKSFLDENLEELSELSNLDIDFKYLRGAEGLKLGFVAGVINREKLE--AFERELWR 163

Query: 1029 MQKRYAQLSRTIEDRISTVNNKLQSV---KRLQHKVEEAKALLQKAESQLARPVSTQAED 1085
              + Y + +   E                K    KV++   +L     +L   V     +
Sbjct: 164  ACRGYIRQAEIEEPLEDPKKTVFIIFFVGKEDLDKVKK---ILDSFGFELY-DVPETEGE 219

Query: 1086 ISEALTVYQKLLDEINSWKNSQSDE---DLVHLADSIKPLDEVVERIENHAVPKLKSLLL 1142
             SE ++   K ++E+              LV +AD +   DE V +       K     L
Sbjct: 220  RSELISKVNKRIEELQRVLEQTESHLEKVLVKIADELLAWDEQVSKE------KAVYETL 273

Query: 1143 LREQFTTLIMQIVGFI 1158
                + T  +   G++
Sbjct: 274  NLFNYDTKTLIAEGWV 289


>gnl|CDD|177326 PHA01817, PHA01817, hypothetical protein.
          Length = 479

 Score = 35.1 bits (80), Expect = 0.48
 Identities = 22/74 (29%), Positives = 37/74 (50%), Gaps = 6/74 (8%)

Query: 670 YNRFNMWLSGVENKLNEPSPVFTEIDLN-TKWQQYVNDE-----LEWENHSQNTLQWLDN 723
           Y RFN+WL G E+K+ +      EI ++ T +  Y  +E     L W NH+     +L +
Sbjct: 33  YGRFNVWLGGNESKIRQVLKAVKEIGVSPTLFAVYEKNEGFSSGLGWLNHTSARGDYLTD 92

Query: 724 IRSDMKTLLERSKH 737
            +   + L+ +SK 
Sbjct: 93  AKFIARKLVSQSKQ 106


>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
            system, R (restriction) subunit and related helicases
            [Defense mechanisms].
          Length = 962

 Score = 35.1 bits (81), Expect = 0.48
 Identities = 29/199 (14%), Positives = 70/199 (35%), Gaps = 22/199 (11%)

Query: 1175 VQEKIDKYDKIIADIQEYEAILATAS---------DKGDQLSSDGTISDRN-EITEQLQS 1224
             ++   KY  +I ++ +   I    S          + + +       D   E+ +    
Sbjct: 768  EKDYQSKYLDLIDNLNDAREIKDKESKAIDLDDIDFELELIGKQEINIDYILELLQTFND 827

Query: 1225 RKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHNLRKTV 1284
            +     +L++ +E++ ++  +   + +KL   L E I    A++           L +  
Sbjct: 828  KNGAYESLKELIERIIKEWIEDLRQKKKLIERLIEAINQYRAKKLD-----TAEKLEELY 882

Query: 1285 EKLRQQNE-KRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVT------SVDVELD 1337
               +++ E K+ AE E L  E     + L      +      L+E            + D
Sbjct: 883  ILAKKEEEFKQFAEEEGLNEEELAFYDDLALNGGKLPENGTELVEKLAKEKSLREKNKDD 942

Query: 1338 KQRLELATKEVQPYVQRYE 1356
             +  E    +++ + +R+ 
Sbjct: 943  WKAKEEVEAKLKAFFRRFI 961


>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
          Length = 865

 Score = 35.1 bits (82), Expect = 0.49
 Identities = 18/83 (21%), Positives = 43/83 (51%), Gaps = 8/83 (9%)

Query: 1213 SDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEI--IEALHARETA 1270
            +  +E+ E++++  ++L  L K +E+L+ +    AA +  L A+ +E+  ++ L A+   
Sbjct: 712  AKPSELPERVEALLEELKELEKELEQLKAK--LAAAAAGDLLAQAKEVNGVKVLAAQVEG 769

Query: 1271 VPSSSRLHNLRKTVEKLRQQNEK 1293
            V + +    LR   + L+ +   
Sbjct: 770  VDAKA----LRTLADDLKDKLGS 788


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria with
            the myosin-like domain [Function unknown].
          Length = 499

 Score = 35.0 bits (80), Expect = 0.50
 Identities = 40/258 (15%), Positives = 98/258 (37%), Gaps = 14/258 (5%)

Query: 1139 SLLLLREQFTTLIMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILAT 1198
             L  LR + T +++ ++  +  + A +  +   +  ++  + + D I   ++     L T
Sbjct: 33   RLFGLRPKQTAVLVTVLTGMLISAATLAILFLLNRNLRSGVFQLDDIRPQLRALRTELGT 92

Query: 1199 ASDKGDQLSSDGTISDRNEIT------EQLQSRKQQLH-NLRKTVEKLRQQNEKRAAESE 1251
            A  +     ++   + R+E+       E ++        NL K  ++L +  ++      
Sbjct: 93   AQGEKRAAETERE-AARSELQKARQEREAVRQELAAARQNLAKAQQELARLTKQAQDLQT 151

Query: 1252 KLGAELEEIIEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEA 1311
            +L    E+  +     ++   S  +L      ++      + R+A+ E+    L     A
Sbjct: 152  RLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLATRANA 211

Query: 1312 LHARETAVKTLPVLL----LEVTSVDVELDKQRLELATKEVQPYVQRYETLKKNIAEAID 1367
              AR   +             +   D ++ ++  ++A +  Q  ++  E   + +  A  
Sbjct: 212  AQARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQ--IRERERQLQRLETAQA 269

Query: 1368 KFETQANEHEAYKQAYNE 1385
            + E +  + EAY QAY  
Sbjct: 270  RLEQEVAQLEAYYQAYVR 287



 Score = 32.3 bits (73), Expect = 3.7
 Identities = 33/159 (20%), Positives = 63/159 (39%), Gaps = 10/159 (6%)

Query: 1892 EETLRLQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECDELEA 1951
            +E  RL      L T ++ L  Q  +  +      A +++++A   +  + V +     A
Sbjct: 137  QELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSA 196

Query: 1952 GIKEEQKNIETQYLQWNSFRETLAQMTA----------WLDSVEKNIKQEIATPWTTTQE 2001
             I++E +N+ T+     +  E LA+  A            D+      Q+IA      +E
Sbjct: 197  QIEQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRE 256

Query: 2002 LRSKLLKLKEILKKCEKGVGDHEAFVDKYEECSQKLAQV 2040
               +L +L+    + E+ V   EA+   Y    Q+ A  
Sbjct: 257  RERQLQRLETAQARLEQEVAQLEAYYQAYVRLRQQAAAT 295


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
            P-loop motif that is characteristic of the AAA
            superfamily. Many of the proteins in this family are
            conjugative transfer proteins. This family includes the
            PrrC protein that is thought to be the active component
            of the anticodon nuclease.
          Length = 713

 Score = 35.0 bits (81), Expect = 0.55
 Identities = 28/177 (15%), Positives = 69/177 (38%), Gaps = 11/177 (6%)

Query: 1023 NDSVSSMQKRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQ 1082
            ++    + ++  +L    E  I     +L+S+   + +  E K  +++ ++ L   +   
Sbjct: 277  DEEYQELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALL-EALEEI 335

Query: 1083 AEDISEALTVYQKLLDEINSWKNSQSDEDLVHLADSIKPLDEVV----ERIENHA--VPK 1136
             E   + L   +KL D   S +     + +  + D I  ++E++    E+I+N      K
Sbjct: 336  LEKNLQKLE--EKLKDPSTSIELESITDLIESINDIIDAINELIREHNEKIDNLKKEKNK 393

Query: 1137 LKSLL--LLREQFTTLIMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQE 1191
             K  L   L  +    I             +  ++     ++ +I   +K I ++++
Sbjct: 394  AKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALEKEIKELEK 450



 Score = 31.1 bits (71), Expect = 7.8
 Identities = 80/412 (19%), Positives = 161/412 (39%), Gaps = 70/412 (16%)

Query: 1211 TISDRN-EITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHAR-- 1267
            T+ + N EI  Q++  K++L  L + +E+L  + EK+  E EK   +  +      A+  
Sbjct: 85   TLGEENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKY 144

Query: 1268 ETAVPSSSRLHNLRKT-VEKLRQQNEKRAAES------EKLGAELEEIIEALHARETAVK 1320
            ++ +  + +  N +K   EKL ++ +     +      E+L A+++ +  +    E A+ 
Sbjct: 145  DSNLSEALKGLNYKKNFKEKLLKELKSVILNASSLLSLEELKAKIKTLFSSNKP-ELALL 203

Query: 1321 TLPVLLLE------------VTSVDVELDK--QRLELAT--KEVQPY------------- 1351
            TL V+  +            + S DV + +    L  +   KE   Y             
Sbjct: 204  TLSVIDFDEIEQAEILEKSIIGSSDVPISELINNLGNSDWVKEGLEYHEEGDTCPFCQQT 263

Query: 1352 --VQRYETLKKNIAEAIDKFETQANEH-EAYKQAYNEAYDWLRKAKLGAQANADCHGEQQ 1408
               +R   L+ +  E   +   Q  E  + Y+    +A + L       + N++   + +
Sbjct: 264  ITEERKAELEAHFDEEYQELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVE 323

Query: 1409 TTKDKADKIKQITKSLPEGQKLIDKTVALKNNV-LESTGPIGKE------NINQEI---N 1458
              K   + +++I +     QKL +K      ++ LES   + +        IN+ I   N
Sbjct: 324  ELKALLEALEEILEKNL--QKLEEKLKDPSTSIELESITDLIESINDIIDAINELIREHN 381

Query: 1459 QITLDWTNLQNTLQDIDKHH--AKCLSLWNDFLSSKNTLEKWIEGFQKKIESEKDIGDCT 1516
            +   +    +N  +     H  A+     + +   K  LEK I   +K+I+  +      
Sbjct: 382  EKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLE------ 435

Query: 1517 NLDDLEKYKALLQEGLHKEADNKKVLIDSFNKGI----IGMTELESEEVKDL 1564
               +++  +  ++E L K+  N +   D  NK +     G  ELE E     
Sbjct: 436  --AEIKALEKEIKE-LEKQLTNIEPTADEINKLLKAYGFGNFELEKEAEDKG 484


>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
            envelope biogenesis, outer membrane].
          Length = 835

 Score = 35.0 bits (81), Expect = 0.58
 Identities = 26/117 (22%), Positives = 43/117 (36%), Gaps = 16/117 (13%)

Query: 1213 SDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAEL----EEIIEALHARE 1268
            S R  +T +    +  L  L++ V   RQ+  + AAE  +L AEL      I + L    
Sbjct: 15   SRRELLTAESAQLEAALQLLQEAVNSKRQEEAEPAAEEAELQAELIQQELAINDQL---- 70

Query: 1269 TAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGA---ELEEIIEALHARETAVKTL 1322
                 S  L+   + +  L   + + A    +L      + E I  L       + L
Sbjct: 71   -----SQALNQQTERLNALASDDRQLANLLLQLLQSSRTIREQIAVLRGSLLLSRIL 122


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
            protein; Reviewed.
          Length = 782

 Score = 34.8 bits (81), Expect = 0.59
 Identities = 30/151 (19%), Positives = 62/151 (41%), Gaps = 21/151 (13%)

Query: 1220 EQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHN 1279
            + +   K++L+ L  ++E+L ++ E++A E+E L  E E++ E L               
Sbjct: 509  KLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELE-------------- 554

Query: 1280 LRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVELDKQ 1339
                 EK  +  E+     E+   E ++ I+   A++ A + +  L          +   
Sbjct: 555  -----EKKEKLQEEEDKLLEEAEKEAQQAIKE--AKKEADEIIKELRQLQKGGYASVKAH 607

Query: 1340 RLELATKEVQPYVQRYETLKKNIAEAIDKFE 1370
             L  A K +    ++ E  KK   E  ++ +
Sbjct: 608  ELIEARKRLNKANEKKEKKKKKQKEKQEELK 638



 Score = 33.6 bits (78), Expect = 1.6
 Identities = 26/101 (25%), Positives = 45/101 (44%), Gaps = 22/101 (21%)

Query: 1216 NEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSS 1275
            NE+   L+  +++L    +  E L +       E+EKL  ELEE  E L   E       
Sbjct: 519  NELIASLEELERELEQKAEEAEALLK-------EAEKLKEELEEKKEKLQEEE------- 564

Query: 1276 RLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARE 1316
                  K +E+  ++ ++   E++K   E +EII+ L   +
Sbjct: 565  -----DKLLEEAEKEAQQAIKEAKK---EADEIIKELRQLQ 597



 Score = 31.3 bits (72), Expect = 6.3
 Identities = 28/134 (20%), Positives = 48/134 (35%), Gaps = 30/134 (22%)

Query: 1256 ELEEIIEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHAR 1315
            +L E+I               L  L + +E+  ++ E    E+EKL  ELEE  E L   
Sbjct: 517  KLNELIA-------------SLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEE 563

Query: 1316 ETAVKTLPVLLLEVTSVDVELDK---QRLELATKEVQPYVQRYETLKKNIAEAIDKFETQ 1372
            E  +               E +K   Q ++ A KE    ++    L+K    ++   E  
Sbjct: 564  EDKLLE-------------EAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELI 610

Query: 1373 ANEHEAYKQAYNEA 1386
                +   +A  + 
Sbjct: 611  -EARKRLNKANEKK 623


>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
            synthetase/uroporphyrin-III C-methyltransferase;
            Reviewed.
          Length = 656

 Score = 34.7 bits (80), Expect = 0.67
 Identities = 35/125 (28%), Positives = 59/125 (47%), Gaps = 24/125 (19%)

Query: 1028 SMQKRYAQLSRT--------IEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPV 1079
            +++++Y  LSR         +E  +S+ + +LQ    L   V+ A   LQ A+++LA   
Sbjct: 403  ALEQQYQDLSRNRDDWMIAEVEQMLSSASQQLQ----LTGNVQLALIALQNADARLATSD 458

Query: 1080 STQAEDISEALTVYQKLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIENHAVPKLKS 1139
            S QA  + +A+        +I   K + S  DL  LA  IK LD+ + +I+  A+P    
Sbjct: 459  SPQAVAVRKAIA------QDIERLKAAPS-ADLTGLA--IK-LDDAIAKID--ALPLSGE 506

Query: 1140 LLLLR 1144
             L   
Sbjct: 507  ALPPH 511


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists of
            several eukaryotic mitotic checkpoint (Mitotic arrest
            deficient or MAD) proteins. The mitotic spindle
            checkpoint monitors proper attachment of the bipolar
            spindle to the kinetochores of aligned sister chromatids
            and causes a cell cycle arrest in prometaphase when
            failures occur. Multiple components of the mitotic
            spindle checkpoint have been identified in yeast and
            higher eukaryotes. In S.cerevisiae, the existence of a
            Mad1-dependent complex containing Mad2, Mad3, Bub3 and
            Cdc20 has been demonstrated.
          Length = 722

 Score = 34.5 bits (79), Expect = 0.82
 Identities = 82/397 (20%), Positives = 151/397 (38%), Gaps = 43/397 (10%)

Query: 938  LKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEGDVKTNNVLMLQ--ELLNKYAQLNEEA 995
            L++L        +  V ++++E +LK  E   E  VK+    +LQ  EL  + A L EE 
Sbjct: 215  LQSLTSEQASSADNSVKIKHLEEELKRYEQDAE-VVKSMKEQLLQIPELERELAALREEN 273

Query: 996  GELNMMISNVSIATEHMNLNDADRLTVNDSVSSMQKRYAQLSRTIEDRISTVNNKLQSVK 1055
             +L  M  +  +  E   L D                  +L R  + R    + +L+  K
Sbjct: 274  RKLRSMKEDNELLKE--ELEDLQS---------------RLERFEKMREKLADLELEKEK 316

Query: 1056 RLQHKVEEAKALLQKAESQLARPVSTQAEDISEALTVYQKLLDEINSWKNSQSDEDLVHL 1115
             L+++++  K+LLQ     L  P     +D+S  + V Q   +     KN         L
Sbjct: 317  -LENELKSWKSLLQDIGLNLRTP-----DDLSRRIVVLQNE-ELQLKEKNGSISSSAKQL 369

Query: 1116 ADSIKPLDEVVERIENHAVPKLKSLLLLREQFTTLIMQIVGFITET---TARVGEVDGNS 1172
              +++ L    ++  +  +   K L  L+     L  ++     E     A +   D   
Sbjct: 370  ETTLQQLQLERQKAVSEILELKKKLEALKALVRRLQRRLTLVTKERDGLRAILNSYDKEL 429

Query: 1173 ATVQEKIDKYDKIIAD---IQEYEAILATASDKGDQLSSDGTI-SDRNE--------ITE 1220
                       ++      +Q+ ++ LA   ++  +L  D     DRN         + E
Sbjct: 430  TETSVSGQLMKRLEEAEDLVQKVQSHLAKMENQLSELEEDVGQQKDRNNTLETEIKLLKE 489

Query: 1221 QLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEI-IEALHARETAVPSSSRLHN 1279
            QL S ++ L  +++    LR + E    E ++L  E   + ++  H       ++SR   
Sbjct: 490  QLVSNERLLSFVKEATNALRLKIETLERERDRLRQEKSLLEMKLEHLCLQGDYNASRTKV 549

Query: 1280 LRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARE 1316
            L  ++    +  +      E L AE E++ E L A E
Sbjct: 550  LHMSLNPASEAEQIAKQTIEALQAECEKLKERLQALE 586


>gnl|CDD|224428 COG1511, COG1511, Predicted membrane protein [Function unknown].
          Length = 780

 Score = 34.4 bits (79), Expect = 0.82
 Identities = 40/264 (15%), Positives = 72/264 (27%), Gaps = 29/264 (10%)

Query: 1410 TKDKADKIKQITKSLPEGQKLIDKTVALKNNVLESTGPIGKENINQEINQITLDWTNLQN 1469
             K  AD  +++     E      K   L N     T        +  +N +T   T L +
Sbjct: 180  VKGAADGAEKLKDGTDEASNGNKKLSDLLN-----TLNNSSATFSDGLNALTSGLTTLTD 234

Query: 1470 TLQDIDKHHAKCLSLWNDFLSSKNTLEKWIEGFQKKIESEKDIGDCTNLDDLEKYKALLQ 1529
             L  +D       +   +       L + I  F   +           + DL      L 
Sbjct: 235  GLNQLDSGLGTLAAGIGELKQGAEQLNEGIGEFSSGLSELNSG-----VQDLAAGVPQLN 289

Query: 1530 EGLHKEADNKKVL-------------IDSFNKGIIGMTELESEEVKDLIQVGVHGLQVEL 1576
            +G+   A    +              +    +G+   T    E  +  +      LQ  L
Sbjct: 290  QGISALAAGLSLPDSLGDQFSSLQEALTQIAQGLKQKTSSSLEAAQGSLS----SLQSML 345

Query: 1577 NALLQNIEAEINKVANAAQERKAMQDKISKLQSWLKQYIDYSGN-KQALQSRLDKVNEIQ 1635
                           +A      +Q  + + Q  L    +        +   LD + +  
Sbjct: 346  ALSKSLDLTAEGATVDALGAPDGVQW-LDESQKTLATLSELLSTGIDGVSEGLDALEQAS 404

Query: 1636 ESFPEISTKLQTLSDHIENSSRKL 1659
                +   KL+T    I  S  +L
Sbjct: 405  AQLAKSLAKLKTAVAQIAASIAQL 428


>gnl|CDD|225638 COG3096, MukB, Uncharacterized protein involved in chromosome
           partitioning [Cell division and chromosome
           partitioning].
          Length = 1480

 Score = 34.5 bits (79), Expect = 0.84
 Identities = 33/178 (18%), Positives = 76/178 (42%), Gaps = 20/178 (11%)

Query: 795 IAQYSSYQNAVQDAEKWLLQISFQLMAHNSLYINNRQQTLEQIQHHEKLLQDYSSYQNAV 854
           + +    ++  +  +   +++S              +Q L Q Q  E+LL D+   Q   
Sbjct: 502 LREGPDQRHLAEQVQPLRMRLS------------ELEQRLRQQQSAERLLADFCKRQGKN 549

Query: 855 QDAEKWLLQISFQ--LMAHNSLYINNRQQTLEQIQHHEKLLQDVILSPDNKLAPCFSAEE 912
            DAE+          L+   S  ++N ++    ++  ++ LQ  I S   + AP + A +
Sbjct: 550 LDAEELEALHQELEALIESLSDSVSNAREQRMALRQEQEQLQSRIQSLMQR-APVWLAAQ 608

Query: 913 GSPIKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELEVAVE 970
                + +++L +++  +    QD  + + QL+   +E  V+ + + A+   L+  +E
Sbjct: 609 -----NALEQLSEQSGEEFTDSQDVTEYMQQLLEREREATVERDELGARKNALDEEIE 661


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
            functions are known are involvedin recombination,
            recombinational repair, and/or non-homologous end
            joining.They are components of an exonuclease complex
            with MRE11 homologs. This family is distantly related to
            the SbcC family of bacterial proteins.This family is
            based on the phylogenomic analysis of JA Eisen (1999,
            Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 34.6 bits (79), Expect = 0.86
 Identities = 66/395 (16%), Positives = 156/395 (39%), Gaps = 38/395 (9%)

Query: 916  IKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEGDVKT 975
            ++D +    KE     DR+    K L  L  ++     ++E+ E +L   E       K 
Sbjct: 575  LEDWLHSKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYED------KL 628

Query: 976  NNVLMLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTVNDSVSSMQKRYAQ 1035
             +V   Q+  +   +L EE  + +   + ++ AT   +         N S   + +R  Q
Sbjct: 629  FDVCGSQDEESDLERLKEEIEKSSKQRAMLAGATAVYSQFITQLTDENQSCCPVCQRVFQ 688

Query: 1036 LSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISEALTVYQK 1095
                +++ IS + +KL+         E      +K   ++      +   I     + +K
Sbjct: 689  TEAELQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIID----LKEK 744

Query: 1096 LLDEINSWKNSQSDEDLVHLADSIKPLDEVVERI--ENHAVPKLKSLLLLREQFTTLIMQ 1153
             + E+ + K  + + D+  L + I+  + ++  I  E  +     + + + E+F   +  
Sbjct: 745  EIPELRN-KLQKVNRDIQRLKNDIEEQETLLGTIMPEEESAKVCLTDVTIMERFQMELKD 803

Query: 1154 IVGFITETTARVGEVDGN------SATVQEKIDKYDKIIADIQEYEAILATASDKGDQLS 1207
            +   I +  A++   D +      +   QEK  + D +++ I+    +            
Sbjct: 804  VERKIAQQAAKLQGSDLDRTVQQVNQEKQEKQHELDTVVSKIELNRKL------------ 851

Query: 1208 SDGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEAL-HA 1266
                I D+ E  + L+S+  +L + +  +    Q+ ++   +  +L  E++ +I  +  A
Sbjct: 852  ----IQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQFEEQLVELSTEVQSLIREIKDA 907

Query: 1267 RETAVPSSSRLHNLRKTVEKL--RQQNEKRAAESE 1299
            +E   P  + L   ++  E+L   ++   + A+ +
Sbjct: 908  KEQDSPLETFLEKDQQEKEELISSKETSNKKAQDK 942



 Score = 32.7 bits (74), Expect = 3.3
 Identities = 81/522 (15%), Positives = 189/522 (36%), Gaps = 61/522 (11%)

Query: 468 FQKKIEAEKDIGDCTNLDDLEKYKALLQEVIS-HNSDMETLNDKC---EALMELAAHSPI 523
            +   E  K   + +  +     + L +EV S  N   +         + + +L  H+  
Sbjct: 474 LELDQELRKAERELSKAEKNSLTETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTT 533

Query: 524 REETLRLQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECAGNG 583
           R +   L          ++ + S+    L+    +   K+++E WLH           + 
Sbjct: 534 RTQMEMLTKDKMDKDEQIRKIKSRHSDELTSLLGYFPNKKQLEDWLHSK---------SK 584

Query: 584 SKNVLKERLDTVNMVAERIPEGQHLMSVLQDTFTKALDTTPSDQQDSLREAMTV--LRDS 641
             N  ++RL  +N     + + ++ ++   ++  + L    S  +D L +          
Sbjct: 585 EINQTRDRLAKLNKELASLEQNKNHINNELESKEEQL----SSYEDKLFDVCGSQDEESD 640

Query: 642 WDRLNRDLKSTSTQLKSYIARWNELDDLYNRFNMWLSGVENKLNEPSPVFTEIDLNTKWQ 701
            +RL  +++ +S Q             +Y++F   ++ + ++     PV   +       
Sbjct: 641 LERLKEEIEKSSKQRAMLAGATA----VYSQF---ITQLTDENQSCCPVCQRVFQTEAEL 693

Query: 702 QYVNDELEWENHS--------QNTLQWLDNIRSDMKTLLERSKHIVDDIEKKKDDLK-PI 752
           Q    +L+ +           ++ L+  +  R +M  L    + I+D  EK+  +L+  +
Sbjct: 694 QEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPELRNKL 753

Query: 753 QKEAQQLSEWDSSIVPKVSDLETRWAAVKSAWDAKHSDLSSEIAQYSSYQNAVQDAEKWL 812
           QK  + +    + I  + + L T     +SA         +++     +Q  ++D E+ +
Sbjct: 754 QKVNRDIQRLKNDIEEQETLLGTIMPEEESA-----KVCLTDVTIMERFQMELKDVERKI 808

Query: 813 LQISFQLMAHNSL--YINNRQQTLEQIQHHEKLLQDYSSYQNAVQDAEKWLLQISFQLMA 870
            Q + +L   +         Q+  E+    + ++      +  +QD ++ +  +  +   
Sbjct: 809 AQQAAKLQGSDLDRTVQQVNQEKQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSKTNE 868

Query: 871 HNSLYIN------NRQQTLEQIQHHEKLLQDVI---------LSPDNKLAPCFSAEEGSP 915
             S  +        RQQ  EQ+      +Q +I          SP          E+   
Sbjct: 869 LKSEKLQIGTNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEEL 928

Query: 916 IKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVEN 957
           I  K    E   K   D++ D  + +  + G+ K+ E  +++
Sbjct: 929 ISSK----ETSNKKAQDKVNDIKEKVKNIHGYMKDIENKIQD 966


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
            unknown].
          Length = 652

 Score = 34.3 bits (79), Expect = 0.90
 Identities = 26/108 (24%), Positives = 48/108 (44%), Gaps = 5/108 (4%)

Query: 1258 EEIIEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARET 1317
            E+       R        R+  L +TVE+L ++N +   E E+L  E+E++   L     
Sbjct: 405  EKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRR 464

Query: 1318 AVKTLPVLLLEVTSVDVELDKQRLELATKEVQPYVQRYETLKKNIAEA 1365
             V+       E+ + D  +++   EL  K+     +R E L++ +AE 
Sbjct: 465  EVRDKVRKDREIRARDRRIERLEKELEEKK-----KRVEELERKLAEL 507


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of proteins
            contain a band 4.1 domain (pfam00373), at their amino
            terminus. This family represents the rest of these
            proteins.
          Length = 244

 Score = 33.6 bits (77), Expect = 0.98
 Identities = 27/107 (25%), Positives = 44/107 (41%), Gaps = 16/107 (14%)

Query: 1208 SDGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHAR 1267
             + T  +  E  +Q +   Q L      +E+  ++ E+ AA SE+    LE  ++   A 
Sbjct: 31   YEETALELEEKLKQEEEEAQLLEKKADELEEENRRLEEEAAASEEERERLEAEVDEATAE 90

Query: 1268 ETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHA 1314
                            V KL ++ EK+ AE+ +L  EL E  EA   
Sbjct: 91   ----------------VAKLEEEREKKEAETRQLQQELREAQEAHER 121


>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction endonuclease-like
            fold, RmuC family [General function prediction only].
          Length = 448

 Score = 33.9 bits (78), Expect = 1.1
 Identities = 26/101 (25%), Positives = 48/101 (47%), Gaps = 12/101 (11%)

Query: 1215 RNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAES-EKLGAELEEIIEALHARETAVPS 1273
            +  + +QL   ++QL  L +++ +L  + ++ A E  E+L   L E+ +          +
Sbjct: 79   KARLQQQLLQSREQLQLLIESLAQLSSEFQELANEIFEELNRRLAELNQQ---------N 129

Query: 1274 SSRLHNLRKTV-EKLRQQNEKR-AAESEKLGAELEEIIEAL 1312
              +L    + V EK R+Q E+R    +E+    LEEI   L
Sbjct: 130  LKQLLKPLREVLEKFREQLEQRIHESAEERSTLLEEIDRLL 170


>gnl|CDD|226081 COG3551, COG3551, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 402

 Score = 33.8 bits (77), Expect = 1.1
 Identities = 22/147 (14%), Positives = 44/147 (29%), Gaps = 11/147 (7%)

Query: 90  LLQKLLTEKDQAGHKLTYLTSSGEKLYLDTAAKGREVVRQQLRALRDSKCSKTTLQEVLS 149
            L  L    D+    +           +  A K R +   +L      K   + L     
Sbjct: 190 HLSDLRQHPDRLVAAVN--------QRVGRAPKKRFLDDVRLFLDHSLKHFASALAAAAG 241

Query: 150 HKRIIESLLDKSKSLPQI---NKDQALEKAIVSVNKRYEDLVDGILKTISQLEESLDIFQ 206
             + +  + +  + L       + + LE+         E + + + ++ S   ES+  F 
Sbjct: 242 SFQALLEVHEALQQLAVDRFDIEWKGLEREFSRAMPLLEYVGELLWQSASSHNESMTRFS 301

Query: 207 QFQQLQKAYQEDQKQLWDKLSSLTERE 233
           +           Q+QL D L       
Sbjct: 302 EQVADLAMSYTAQRQLNDGLRDRLLEA 328


>gnl|CDD|129992 TIGR00914, 2A0601, heavy metal efflux pump, CzcA family.  This model
            represents a family of H+/heavy metal cation antiporters.
            This family is one of several subfamilies within the
            scope of Pfam model pfam00873 [Cellular processes,
            Detoxification, Transport and binding proteins, Cations
            and iron carrying compounds].
          Length = 1051

 Score = 34.0 bits (78), Expect = 1.2
 Identities = 15/40 (37%), Positives = 23/40 (57%), Gaps = 10/40 (25%)

Query: 1414 ADKIKQITKSLPEGQK---------LIDKTVA-LKNNVLE 1443
             DK++ I K+LPEG +         L+D  +A +K N+LE
Sbjct: 310  GDKLETINKTLPEGVEIVTTYDRSQLVDAAIATVKKNLLE 349


>gnl|CDD|197544 smart00150, SPEC, Spectrin repeats. 
          Length = 101

 Score = 31.5 bits (72), Expect = 1.2
 Identities = 15/99 (15%), Positives = 42/99 (42%), Gaps = 6/99 (6%)

Query: 699 KWQQYVNDELEWENHSQNTLQWLDNIRS--DMKTLLERSKHIVDDIEKKKDDLKPIQKEA 756
           ++ +  ++   W    +  L   D  +    ++ LL++ +    ++E  ++ ++ + +  
Sbjct: 2   QFLRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNELG 61

Query: 757 QQLSEWD----SSIVPKVSDLETRWAAVKSAWDAKHSDL 791
           +QL E        I  ++ +L  RW  +K   + +   L
Sbjct: 62  EQLIEEGHPDAEEIEERLEELNERWEELKELAEERRQKL 100



 Score = 29.6 bits (67), Expect = 5.4
 Identities = 19/95 (20%), Positives = 44/95 (46%), Gaps = 5/95 (5%)

Query: 1748 YEESVATLLDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRTIYQDATSHQQDLLQLKDKI 1807
            +      L  WL+  E+ L  E +L   L+  +A L+ +     +  +H++ +  L +  
Sbjct: 3    FLRDADELEAWLEEKEQLLASE-DLGKDLESVEALLKKHEAFEAELEAHEERVEALNELG 61

Query: 1808 ESL----HQPSEQSKQQLATITTRHGNVLKRAQKK 1838
            E L    H  +E+ +++L  +  R   + + A+++
Sbjct: 62   EQLIEEGHPDAEEIEERLEELNERWEELKELAEER 96


>gnl|CDD|217653 pfam03639, Glyco_hydro_81, Glycosyl hydrolase family 81.  Family of
           eukaryotic beta-1,3-glucanases. Within the Aspergillus
           fumigatus protein two perfectly conserved Glu residues
           (E550 or E554) have been proposed as putative
           nucleophiles of the active site of the Engl1
           endoglucanase, while the proton donor would be D475. The
           endo-beta-1,3-glucanase activity is essential for
           efficient spore release.
          Length = 694

 Score = 33.7 bits (77), Expect = 1.2
 Identities = 21/102 (20%), Positives = 32/102 (31%), Gaps = 6/102 (5%)

Query: 530 LQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECAGNGSKNVLK 589
           LQ +  A    V    S  E NL D   F  K     A +       +       + + K
Sbjct: 348 LQEIREAANKDVNVSDSSAESNL-DSMYFSGKIIAKYARI--LLVASEIL---YDEVLTK 401

Query: 590 ERLDTVNMVAERIPEGQHLMSVLQDTFTKALDTTPSDQQDSL 631
             L  + M   R+        +L DT  K + ++      S 
Sbjct: 402 ITLYKLKMAMPRLLGNFQPYPLLYDTKWKGIISSGWGSTSSQ 443


>gnl|CDD|225711 COG3170, FimV, Tfp pilus assembly protein FimV [Cell motility and
            secretion / Intracellular trafficking and secretion].
          Length = 755

 Score = 33.7 bits (77), Expect = 1.4
 Identities = 20/125 (16%), Positives = 34/125 (27%), Gaps = 23/125 (18%)

Query: 1220 EQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHN 1279
             +L +R  +L   +  + K + +  ++AA      A L+   E   A     P  S    
Sbjct: 298  AELPARVAEL---QAQLNKAQHELAQKAAPLAAAQAALDAPAETATAPSAPAPQVS---- 350

Query: 1280 LRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVELDKQ 1339
                            AES         +     A    +         +    V   + 
Sbjct: 351  ----------------AESSPAQPGSYLLAAPGDAPLGELAQAQSARERLAEESVPAAEP 394

Query: 1340 RLELA 1344
            R  LA
Sbjct: 395  RSRLA 399


>gnl|CDD|223092 COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal
            structure and biogenesis].
          Length = 879

 Score = 33.8 bits (78), Expect = 1.5
 Identities = 19/97 (19%), Positives = 40/97 (41%), Gaps = 13/97 (13%)

Query: 1029 MQKRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISE 1088
            +Q++   L   I   +      L  V+RL  +++E +  L++ + +LA          ++
Sbjct: 710  LQEQEDLLKE-IASLLKVPPELLPKVERLLEELKELEKELERLKKKLAAAELADLLSNAK 768

Query: 1089 AL-----TVYQKLLDEINSWKNSQSDEDLVHLADSIK 1120
            A       V  K +D  +        ++L  +AD +K
Sbjct: 769  AEEIGGVKVLAKEVDGADM-------KELREIADDLK 798


>gnl|CDD|185747 cd09234, V_HD-PTP_like, Protein-interacting V-domain of mammalian
            His-Domain type N23 protein tyrosine phosphatase and
            related domains.  This family contains the V-shaped (V)
            domain of mammalian His-Domain type N23 protein tyrosine
            phosphatase (HD-PTP, also known as PTPN23) and related
            domains. It belongs to the V_Alix_like superfamily which
            includes the V domains of  Bro1 and Rim20 (also known as
            PalA) from Saccharomyces cerevisiae, mammalian Alix
            (apoptosis-linked gene-2 interacting protein X/ also
            known as apoptosis-linked gene-2 interacting protein 1,
            AIP1), and related domains. HD_PTP interacts with the
            ESCRT (Endosomal Sorting Complexes Required for
            Transport) system, and participates in cell migration and
            endosomal trafficking. The related Alix V-domain
            (belonging to a different family in this superfamily)
            contains a binding site, partially conserved in the
            superfamily, for the retroviral late assembly (L) domain
            YPXnL motif. The Alix V-domain is also a dimerization
            domain. In addition to the V-domain, HD_PTP also has an
            N-terminal Bro1-like domain, a proline-rich region (PRR),
            a catalytically inactive tyrosine phosphatase domain, and
            a region containing a PEST motif. Bro1-like domains bind
            components of the ESCRT-III complex, specifically to
            CHMP4 in the case of HD-PTP. The Bro1-like domain of
            HD-PTP can also bind human immunodeficiency virus type 1
            (HIV-1) nucleocapsid. HD-PTP is encoded by the PTPN23
            gene, a tumor suppressor gene candidate frequently absent
            in human kidney, breast, lung, and cervical tumors. This
            family also contains Drosophila Myopic, which promotes
            epidermal growth factor receptor (EGFR) signaling, and
            Caenorhabditis elegans (enhancer of glp-1) EGO-2 which
            promotes Notch signaling.
          Length = 337

 Score = 33.0 bits (76), Expect = 1.5
 Identities = 39/171 (22%), Positives = 68/171 (39%), Gaps = 29/171 (16%)

Query: 935  QDALKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEGDVK---------------TNNVL 979
             D +K L + +G   +   DVE   A L E+E  +E +                   +V 
Sbjct: 71   PDTIKNLVEAMGELSDVYQDVE---AMLNEIESLLEEEELQEKEFQEAVGKRGSSIAHVT 127

Query: 980  MLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTVNDSVSSMQKRYAQLSRT 1039
             L+  L KY + +E+A +     SN  +    MNL+ A+   +   +  +QK   +L   
Sbjct: 128  ELKRELKKYKEAHEKASQ-----SNTELHKA-MNLHIANLKLLAGPLDELQK---KLPSP 178

Query: 1040 IEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISEAL 1090
                        + +KR+ +KV E +   +  E QL   +    +DI+  L
Sbjct: 179  SLLDRPEDEAIEKELKRILNKVNEMRKQRRSLEQQLRDAI--HEDDITSKL 227


>gnl|CDD|153280 cd07596, BAR_SNX, The Bin/Amphiphysin/Rvs (BAR) domain of Sorting
            Nexins.  BAR domains are dimerization, lipid binding and
            curvature sensing modules found in many different
            proteins with diverse functions. Sorting nexins (SNXs)
            are Phox homology (PX) domain containing proteins that
            are involved in regulating membrane traffic and protein
            sorting in the endosomal system. SNXs differ from each
            other in their lipid-binding specificity, subcellular
            localization and specific function in the endocytic
            pathway. A subset of SNXs also contain BAR domains. The
            PX-BAR structural unit determines the specific membrane
            targeting of SNXs. BAR domains form dimers that bind to
            membranes, induce membrane bending and curvature, and may
            also be involved in protein-protein interactions.
          Length = 218

 Score = 32.7 bits (75), Expect = 1.6
 Identities = 37/188 (19%), Positives = 72/188 (38%), Gaps = 29/188 (15%)

Query: 1217 EITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHA-----RETAV 1271
            E  ++ +  K  +  L + ++KL +Q ++      +LG+ L E  +AL        E   
Sbjct: 1    EEDQEFEEAKDYILKLEEQLKKLSKQAQRLVKRRRELGSALGEFGKALIKLAKCEEEVGG 60

Query: 1272 PSSSRLHNLRKTVEKLRQQNEKRA-AESEKLGAELEEIIEALHARETAVKTLPVLLLEVT 1330
                 L  L K  E+L   +E +A  E  KL   L+E +    A +  +      LL + 
Sbjct: 61   ELGEALSKLGKAAEELSSLSEAQANQELVKLLEPLKEYLRYCQAVKETLDDRADALLTLQ 120

Query: 1331 SVDVELDKQRLELATKEVQPYV-----------------------QRYETLKKNIAEAID 1367
            S+  +L  ++ +L   +  P +                       +RYE + + + E + 
Sbjct: 121  SLKKDLASKKAQLEKLKAAPGIKPAKVEELEEELEEAESALEEARKRYEEISERLKEELK 180

Query: 1368 KFETQANE 1375
            +F  +   
Sbjct: 181  RFHEERAR 188


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 33.1 bits (76), Expect = 1.6
 Identities = 28/127 (22%), Positives = 56/127 (44%), Gaps = 2/127 (1%)

Query: 1190 QEYEAILATASDKGDQLSSDGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAE 1249
            QE +       +     + +  + +RN+  ++ +  +++L    + + +  +Q + RA +
Sbjct: 40   QEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEERLVQKEEQLDARAEK 99

Query: 1250 SEKLGAELEEIIEALHARETAVPS-SSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEI 1308
             + L  +LEE  +AL ARE  +     +L N    V  L  + + R    + L AELEE 
Sbjct: 100  LDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPE-QARKLLLKLLDAELEEE 158

Query: 1309 IEALHAR 1315
                  +
Sbjct: 159  KAQRVKK 165


>gnl|CDD|189014 cd09607, M3B_PepF_2, Peptidase family M3B Oligopeptidase F (PepF).
            Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase
            B; EC 3.4.24.-) is mostly bacterial and includes
            oligoendopeptidase F from Lactococcus lactis. This enzyme
            hydrolyzes peptides containing between 7 and 17 amino
            acids with fairly broad specificity. The PepF gene is
            duplicated in L. lactis on the plasmid that bears it,
            while a shortened second copy is found in Bacillus
            subtilis. Most bacterial PepFs are cytoplasmic
            endopeptidases; however, the PepF Bacillus
            amyloliquefaciens oligopeptidase is a secreted protein
            and may facilitate the process of sporulation.
            Specifically, the yjbG gene encoding the homolog of the
            PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis
            has been identified in Bacillus subtilis as an inhibitor
            of sporulation initiation when over expressed from a
            multicopy plasmid.
          Length = 581

 Score = 33.3 bits (77), Expect = 1.6
 Identities = 25/141 (17%), Positives = 59/141 (41%), Gaps = 17/141 (12%)

Query: 949  KEFEVDVENVEAKLKELEVAVEGDVKTNN--VLMLQELLNKYAQLNEEAGELNMMISNVS 1006
             EF+ D++ ++  + EL+  ++  + ++   V  L+ L+    +L E    L   +S + 
Sbjct: 13   PEFQEDLKKLKQLIDELKSLLDALLPSDEDAVETLEALITLLQELLELLRTLGAFVSCLL 72

Query: 1007 IATEHMNLNDADRLTVNDSVSSMQKRYAQLSRTIED---RISTVN-NKLQSVKRLQ---- 1058
             A    +  D +   +   ++ +     QL   ++     +S  + + L +   L+    
Sbjct: 73   SA----DTTDEEAQALLSQLNQLSASLEQLLTLLDKKLAALSEEDLDALLADPELKEHAF 128

Query: 1059 ---HKVEEAKALLQKAESQLA 1076
                +  +AK LL + E +L 
Sbjct: 129  FLEERRRQAKHLLSEEEEELI 149



 Score = 31.4 bits (72), Expect = 7.4
 Identities = 47/253 (18%), Positives = 88/253 (34%), Gaps = 61/253 (24%)

Query: 1145 EQFTTLIMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASDKGD 1204
             +F   + ++   I E  + +  +  +     E ++    +I  +QE   +L T      
Sbjct: 13   PEFQEDLKKLKQLIDELKSLLDALLPSDEDAVETLEA---LITLLQELLELLRTLGAFVS 69

Query: 1205 QLSSDGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEAL 1264
             L S  T    +E  + L S+  QL    + +  L  +     +E      +L+ ++   
Sbjct: 70   CLLSADT---TDEEAQALLSQLNQLSASLEQLLTLLDKKLAALSE-----EDLDALLA-- 119

Query: 1265 HARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEAL-----HA----R 1315
                        L      +E+ R+Q          L  E EE+I  L     HA     
Sbjct: 120  ---------DPELKEHAFFLEERRRQA------KHLLSEEEEELIAKLSVDGLHAWGKLY 164

Query: 1316 ETAVKTLPVLLLEVTSVDVELDKQRLELATKEVQPYVQRYETLKKNIAEAIDKFETQANE 1375
            +T   TL         + +E D +  EL+  + +           N+A   D    +A  
Sbjct: 165  DTLTSTL--------KIPIEEDGEEKELSAGQAR-----------NLAYDPD----RAVR 201

Query: 1376 HEAYKQAYNEAYD 1388
              AY+ A  +A++
Sbjct: 202  KAAYE-AELKAWE 213


>gnl|CDD|220180 pfam09325, Vps5, Vps5 C terminal like.  Vps5 is a sorting nexin that
            functions in membrane trafficking. This is the C terminal
            dimerisation domain.
          Length = 236

 Score = 32.6 bits (75), Expect = 1.7
 Identities = 39/182 (21%), Positives = 77/182 (42%), Gaps = 30/182 (16%)

Query: 1216 NEITEQLQSRKQQLHNLRKTVEKL-RQQNEKRAAESEKLGAELEEIIEALHARETAVPSS 1274
             +  + + S + QL  L K +E L  Q+ E  +A       E  + +  L + E +   S
Sbjct: 27   EDKRQYIDSLESQLKKLYKALELLVNQRKELASAT-----GEFAKSLAMLASLELSTGLS 81

Query: 1275 SRLHNLRKTVEKLRQQNEKRA-AESEKLGAELEEII-------EALHARETAVKTLPVLL 1326
              L  L +  E++++  E++A  +   LG  ++E +          + R  A +      
Sbjct: 82   RALSQLAEVEERIKELLERQALQDVLTLGETIDEYLRLIGSVKAVFNQRVKAWQQWQNAE 141

Query: 1327 LEVTSVDVELDK---------QRLELATKEVQPYVQRYETLKK---NIAEAI----DKFE 1370
             +++    +L K          +L+ A KEV    ++ +  KK   +I+E I    ++FE
Sbjct: 142  QDLSKKRAQLAKLKGAGKSRQDKLQQAKKEVDEAERKVQQAKKEFEDISETIKKELERFE 201

Query: 1371 TQ 1372
            T+
Sbjct: 202  TE 203


>gnl|CDD|218915 pfam06156, DUF972, Protein of unknown function (DUF972).  This family
            consists of several hypothetical bacterial sequences. The
            function of this family is unknown.
          Length = 106

 Score = 31.1 bits (71), Expect = 1.8
 Identities = 20/77 (25%), Positives = 35/77 (45%), Gaps = 8/77 (10%)

Query: 1214 DRNEITEQLQSRKQQLHNLRKTVEKLRQQ-------NEKRAAESEKLGAELEEIIEALHA 1266
            D+ E+ + L   +QQL+ L   + +L+QQ       N +   E+E L   LEE+ +    
Sbjct: 2    DKKELFDALSELEQQLNQLLAELGELKQQLAELLEENAELRIENEHLRERLEELEQEQKE 61

Query: 1267 RETAVPSSSRLH-NLRK 1282
            +E         + NL +
Sbjct: 62   KEKGKKKLGEGYDNLAR 78


>gnl|CDD|234801 PRK00591, prfA, peptide chain release factor 1; Validated.
          Length = 359

 Score = 32.7 bits (76), Expect = 2.2
 Identities = 26/104 (25%), Positives = 49/104 (47%), Gaps = 22/104 (21%)

Query: 1057 LQHKVEEAKALLQKAESQLARP--VSTQA------------EDISEALTVYQKLLDEINS 1102
            +  K+E  +   ++ E+ L+ P  +S Q             E I EA   Y++  +++  
Sbjct: 4    MLDKLEALEERYEELEALLSDPEVISDQKRFRKLSKEYAELEPIVEAYREYKQAQEDLEE 63

Query: 1103 WK---NSQSDEDLVHLA-DSIKPLDEVVERIENHAVPKLKSLLL 1142
             K     +SD ++  +A + +K L+E +E +E     +LK LLL
Sbjct: 64   AKEMLEEESDPEMREMAKEELKELEERLEELEE----ELKILLL 103


>gnl|CDD|172376 PRK13855, PRK13855, type IV secretion system protein VirB10;
           Provisional.
          Length = 376

 Score = 32.6 bits (74), Expect = 2.6
 Identities = 13/25 (52%), Positives = 15/25 (60%)

Query: 675 MWLSGVENKLNEPSPVFTEIDLNTK 699
           +WL G   K NEP+P  T I  NTK
Sbjct: 47  IWLGGRSKKENEPAPPSTMIATNTK 71


>gnl|CDD|236729 PRK10636, PRK10636, putative ABC transporter ATP-binding protein;
            Provisional.
          Length = 638

 Score = 32.8 bits (75), Expect = 2.7
 Identities = 32/148 (21%), Positives = 58/148 (39%), Gaps = 21/148 (14%)

Query: 1178 KIDKYDKIIADIQEYEAILATASDKGDQLSSDGTISDRNEITEQLQSRKQQLH---NLRK 1234
            K++ +D    D+++Y+  L+    + +Q         +       Q+RK Q      LR 
Sbjct: 504  KVEPFD---GDLEDYQQWLSDVQKQENQTDEA----PKENNANSAQARKDQKRREAELRT 556

Query: 1235 TVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHNLRKT--VEKLRQQNE 1292
              + LR++  +   E EKL A+L +        E  +  S      RK      L+QQ  
Sbjct: 557  QTQPLRKEIARLEKEMEKLNAQLAQA-------EEKLGDSELYDQSRKAELTACLQQQAS 609

Query: 1293 KRAA--ESEKLGAELEEIIEALHARETA 1318
             ++   E E    E +E +E +     +
Sbjct: 610  AKSGLEECEMAWLEAQEQLEQMLLEGQS 637


>gnl|CDD|213754 TIGR02971, heterocyst_DevB, ABC exporter membrane fusion protein,
            DevB family.  Members of this protein family are found
            mostly in the Cyanobacteria, but also in the
            Planctomycetes. DevB from Anabaena sp. strain PCC 7120 is
            partially characterized as a membrane fusion protein of
            the DevBCA ABC exporter, probably a glycolipid exporter,
            required for heterocyst formation. Most Cyanobacteria
            have one member only, but Nostoc sp. PCC 7120 has seven
            members.
          Length = 327

 Score = 32.5 bits (74), Expect = 2.7
 Identities = 46/242 (19%), Positives = 94/242 (38%), Gaps = 26/242 (10%)

Query: 961  KLKELEVAVEGDVKTNNVLMLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRL 1020
            ++K+L VA    V+   VL   +   +         EL++  + +  A          RL
Sbjct: 26   RIKKLLVAEGDRVQAGQVLAELDSRPERTA------ELDVARTQLDEAKA--------RL 71

Query: 1021 T-VNDSVSSMQKRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARP- 1078
              V       +    + +R        V  +  ++ RL+ ++E A+  + +  S      
Sbjct: 72   AQVRAGAKKGEIAAQRAARAAAKLFKDVAAQQATLNRLEAELETAQREVDRYRSLFRDGA 131

Query: 1079 VSTQAEDISEALTVYQKLLDEINSWKNSQSDE------DLVHLADSIKPLDEVVERIENH 1132
            VS    D S+AL + +   +E+     S+S++       L  LA+ ++  D  + + E  
Sbjct: 132  VSASDLD-SKALKL-RTAEEELEEALASRSEQIDGARAALASLAEEVRETDVDLAQAEVK 189

Query: 1133 AVPKLKSLLLLREQFTTLIMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIADIQEY 1192
            +  +         + T +   I G + +  AR GEV G+   ++  +    ++ A  + Y
Sbjct: 190  SALEAVQQAEALLELTYVKAPIDGRVLKIHAREGEVIGSEGILE--MGDTSQMYAVAEVY 247

Query: 1193 EA 1194
            E 
Sbjct: 248  ET 249


>gnl|CDD|181087 PRK07701, flgL, flagellar hook-associated protein FlgL; Validated.
          Length = 298

 Score = 32.3 bits (74), Expect = 2.8
 Identities = 20/100 (20%), Positives = 34/100 (34%), Gaps = 23/100 (23%)

Query: 337 AYEGIVSIHQAYTKAVLDTQEWIDATYNAVNMWGDLTLERVSLHSNLERLKNLEKELGRI 396
            Y   ++  + Y K   D + W++ T +A++   D+                    L R 
Sbjct: 51  RYRTDLAEVEQYQKNASDAKSWLENTESALDQATDI--------------------LQRA 90

Query: 397 GHKTVALKNNVLESTGPIGKENINQEINQITLDWTNLQNT 436
               V   N     T    ++ I QEI Q+      + NT
Sbjct: 91  RELAVQAANGTNTQT---DRQAIAQEIEQLKEQLIQIANT 127


>gnl|CDD|233003 TIGR00512, salvage_mtnA, S-methyl-5-thioribose-1-phosphate isomerase.
             The delineation of this family was based in part on a
            discussion and neighbor-joining phylogenetic study, by
            Kyrpides and Woese, of archaeal and other proteins
            homologous to the alpha, beta, and delta subunits of
            eukaryotic initiation factor 2B (eIF-2B), a five-subunit
            molecule that catalyzes GTP recycling for eIF-2. This
            clade is now recognized to include the methionine salvage
            pathway enzyme MtnA [Amino acid biosynthesis, Aspartate
            family].
          Length = 331

 Score = 32.2 bits (74), Expect = 2.8
 Identities = 19/67 (28%), Positives = 30/67 (44%), Gaps = 8/67 (11%)

Query: 1245 KRAAESEKLGAELEEIIEALH-ARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGA 1303
            + A E E+  A LEE ++ L  +R TAV       NL   ++++R   E     ++   A
Sbjct: 61   READEREEFKALLEEKLQYLVSSRPTAV-------NLSWALDRMRAALEAAKTVADIKEA 113

Query: 1304 ELEEIIE 1310
             L E   
Sbjct: 114  LLAEAER 120


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
            potential nuclease [General function prediction only].
          Length = 290

 Score = 31.8 bits (72), Expect = 3.3
 Identities = 35/171 (20%), Positives = 64/171 (37%), Gaps = 33/171 (19%)

Query: 1226 KQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHNLRKTVE 1285
            K+    L++ +E+L+++ E+   E E+L AE EE+ E             RL  L     
Sbjct: 134  KEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQE-------------RLKRLEVENS 180

Query: 1286 KLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVELDKQRLELAT 1345
            +L +  +K   E   L    +E+   +   E  + +             +L K+ L LA 
Sbjct: 181  RLEEMLKKLPGEVYDLKKRWDELEPGVELPEEELIS-------------DLVKETLNLAP 227

Query: 1346 KEVQPYVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNEAYDWLRKAKLG 1396
            K+++     Y   +K +   +      A   E       +A + L   K  
Sbjct: 228  KDIEGQGYIYAEDEKEVEILLGTVYIAAPSRE-------DAVEELEIIKEA 271



 Score = 30.6 bits (69), Expect = 7.6
 Identities = 13/60 (21%), Positives = 30/60 (50%)

Query: 911 EEGSPIKDKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELEVAVE 970
           +E   +  +++ LE E +   +R++      ++L    K+   +V +++ +  ELE  VE
Sbjct: 149 KEKEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKKLPGEVYDLKKRWDELEPGVE 208


>gnl|CDD|181620 PRK09040, PRK09040, hypothetical protein; Provisional.
          Length = 214

 Score = 31.5 bits (72), Expect = 3.3
 Identities = 17/38 (44%), Positives = 25/38 (65%), Gaps = 3/38 (7%)

Query: 1564 LIQVGVHGLQVELNALLQNIEAEINKVANAAQERKAMQ 1601
            LI VGV G+Q+EL++ L   EAE+ +    AQ RK ++
Sbjct: 31   LILVGVIGVQLELSSKL---EAEVKQRQAEAQRRKTLE 65


>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC.  All proteins in this family for
            which functions are known are part of an exonuclease
            complex with sbcD homologs. This complex is involved in
            the initiation of recombination to regulate the levels of
            palindromic sequences in DNA. This family is based on the
            phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
            Stanford University) [DNA metabolism, DNA replication,
            recombination, and repair].
          Length = 1042

 Score = 32.2 bits (73), Expect = 3.6
 Identities = 56/310 (18%), Positives = 97/310 (31%), Gaps = 32/310 (10%)

Query: 1776 LDEKKAQLQVYRTIYQDATSHQQDLLQLKDKIESLHQPSEQSKQQLATITTRHGNVLKRA 1835
            L E +  L   +         +QDL  +      LH      +  L  +T  H   L   
Sbjct: 603  LSEAEDMLACEQHALLRKLQPEQDLQDV-----RLHLQQCSQELALK-LTALHALQLTLT 656

Query: 1836 QKKIEAEKDIGDCTNLDDLEKYKALLQEVISH----NSDMETLNDKCEALMELAAHSPIR 1891
            Q+++            + L   +  LQ++ S         E L      L EL  H    
Sbjct: 657  QERVREHALSIRVLPKELLASRQLALQKMQSEKEQLTYWKEMLAQCQTLLRELETH---- 712

Query: 1892 EETLRLQALYAALVTSVQGLVSQVEKNLSDQTEFLAKKEEVEAWLHKAHATVQECDELEA 1951
             E    +       +S  G  S +        + L +       + KA       +  E 
Sbjct: 713  IEEYDREFNEIENASSSLG--SDLAAREDALNQSLKELMHQARTVLKARTEAHFNNNEEV 770

Query: 1952 GIKEEQKNIETQYLQWN--SFRETLAQMTAWLDSVEKNIKQEIATPWTTTQELRSKLLKL 2009
                 Q   E  +L      F     + T  L ++E  I QEI +               
Sbjct: 771  TA-ALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPS-------------DE 816

Query: 2010 KEILKKCEKGVGDHEAFVDKYEECSQKLAQVEAKYAKLIEPETSYEELQKRQGQLTALLT 2069
              +  +CE  V + E F+ + EE S  L ++  +  K  E      +L + Q ++  L  
Sbjct: 817  DILNLQCETLVQEEEQFLSRLEEKSATLGEITHQLLKYEECSKQLAQLTQEQAKIIQLSD 876

Query: 2070 EKVDMDVLLN 2079
            +   ++ +  
Sbjct: 877  KLNGINQIKI 886


>gnl|CDD|222290 pfam13654, AAA_32, AAA domain.  This family includes a wide variety
           of AAA domains including some that have lost essential
           nucleotide binding residues in the P-loop.
          Length = 509

 Score = 32.1 bits (74), Expect = 3.6
 Identities = 10/44 (22%), Positives = 27/44 (61%)

Query: 185 EDLVDGILKTISQLEESLDIFQQFQQLQKAYQEDQKQLWDKLSS 228
           E+LV+ + K I    ES +   + +++++ +QE +++ +++L  
Sbjct: 94  EELVEELKKEIPAAFESEEYEARKEEIEEEFQEKREEAFEELEE 137



 Score = 31.7 bits (73), Expect = 4.7
 Identities = 11/39 (28%), Positives = 25/39 (64%), Gaps = 1/39 (2%)

Query: 185 EDLVDGILKTISQLEESL-DIFQQFQQLQKAYQEDQKQL 222
           E+  + + + I +LEE L +I +Q ++L++  +E  ++L
Sbjct: 174 EEEREELEEKIDELEEELQEILRQLRELEREAREKLREL 212


>gnl|CDD|112953 pfam04163, Tht1, Tht1-like nuclear fusion protein. 
          Length = 554

 Score = 32.0 bits (72), Expect = 3.8
 Identities = 41/226 (18%), Positives = 81/226 (35%), Gaps = 34/226 (15%)

Query: 918  DKIDRL------EKETKAQSDRIQDALKTLNQLIGHRKEFEVDVENVEAKLKELEVAVEG 971
            D           E     ++ RI   +  L++   H++E +V+ + +   +++LE  +  
Sbjct: 202  DDFAGKFRNSDIELRFADENFRIILEIMDLSEFQNHQQEGKVNQDEILETIEQLESEI-- 259

Query: 972  DVKTNNVLMLQELLNKYAQLNEEAGELNMMISNVSIATEHMNLNDADRLTVN----DSVS 1027
                N+             + E AG  N +I +  +A E  + N      +     D ++
Sbjct: 260  -FDINSFFA--------NFIEESAGVHNHIIED--LADEFASENAIALSAIGDLKDDFLA 308

Query: 1028 SMQKRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDIS 1087
             ++ R   L    ED +       +SV+R    V      L+     L   +    +D+S
Sbjct: 309  FLEDRIKNLIEMSEDSLK------ESVQRNIDFVNSGFQELEDFSIGLKEELGGLKKDLS 362

Query: 1088 EALTVYQKLLDEINSWKNSQSDEDLVHLA--DSIKPLDEVVERIEN 1131
            E   +     +EI    N    +   HL   + +  L +  E+  N
Sbjct: 363  EQQNLEA---EEILQGFNDFLHDSKDHLKTDEILPELIDFKEKFLN 405


>gnl|CDD|163497 TIGR03785, marine_sort_HK, proteobacterial dedicated sortase system
            histidine kinase.  This histidine kinase protein is
            paired with an adjacent response regulator (TIGR03787)
            gene. It co-occurs with a variant sortase enzyme
            (TIGR03784), usually in the same gene neighborhood, in
            proteobacterial species most of which are marine, and
            with an LPXTG motif-containing sortase target conserved
            protein (TIGR03788). Sortases and LPXTG proteins are far
            more common in Gram-positive bacteria, where sortase
            systems mediate attachment to the cell wall or
            cross-linking of pilin structures. We give this predicted
            sensor histidine kinase the gene symbol psdS, for
            Proteobacterial Dedicated Sortase system Sensor histidine
            kinase.
          Length = 703

 Score = 32.0 bits (73), Expect = 3.9
 Identities = 17/61 (27%), Positives = 33/61 (54%), Gaps = 5/61 (8%)

Query: 1603 KISKLQSWLKQYID----YSGNKQALQSRLDKVNEIQESFPEISTKLQTLSDHIENSSRK 1658
            +I +L    +  ID     SG   A +SR D++ ++  SF ++  +L+  + ++EN S +
Sbjct: 433  RIRRLSDDAEAAIDSQGRISGAIPASRSR-DEIGDLSRSFAQMVARLRQYTHYLENMSSR 491

Query: 1659 L 1659
            L
Sbjct: 492  L 492


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
            Provisional.
          Length = 695

 Score = 32.2 bits (74), Expect = 4.0
 Identities = 36/215 (16%), Positives = 63/215 (29%), Gaps = 27/215 (12%)

Query: 1237 EKLRQQNEKRAAESEKLGAELEEIIEALHARET--------AVPSSSRLHNLRKTVEKLR 1288
             + ++  E RAA+ +   A     ++A  A  T        A P +S +   R+  +   
Sbjct: 471  ARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARPDNSAVIAAREARKAQA 530

Query: 1289 QQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVELDKQRLELATKEV 1348
            +  +     +     +   +  A+ AR  A K            +V+  K  +  A    
Sbjct: 531  RARQAEKQAAAAADPKKAAVAAAI-ARAKAKKAAQQAANAEAEEEVDPKKAAVAAAIARA 589

Query: 1349 QPYVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNEAYDWLRKAKLGAQANADCHGEQQ 1408
            +         KK   +A      +       K+A   A     KAK   Q       E  
Sbjct: 590  KA--------KKAAQQAASAEPEEQVAEVDPKKAAVAAAIARAKAKKAEQQANAEPEEPV 641

Query: 1409 TTKD----------KADKIKQITKSLPEGQKLIDK 1433
              +           KA K  Q   +    +    K
Sbjct: 642  DPRKAAVAAAIARAKARKAAQQQANAEPEEAEDPK 676


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This family
            of bacterial proteins has no known function. The proteins
            are in the region of 500-600 amino acid residues in
            length.
          Length = 450

 Score = 31.8 bits (73), Expect = 4.2
 Identities = 24/99 (24%), Positives = 45/99 (45%), Gaps = 17/99 (17%)

Query: 1214 DRNEITEQLQSRKQQLHNLRKTVEKLRQQNEK-RAAESEKLGAELEEIIEALHARETAVP 1272
            D  E+ E+L+  +++L  L + +EK+++  +K       +L  E  E +E          
Sbjct: 328  DFPELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLE---------- 377

Query: 1273 SSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEA 1311
                   L +T EKL ++ E+   E ++L  ELE +   
Sbjct: 378  ------KLLETKEKLSEELEELEEELKELKEELESLYSE 410


>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation, interacts
            with SMC proteins [Cell division and chromosome
            partitioning].
          Length = 622

 Score = 31.9 bits (72), Expect = 4.3
 Identities = 26/124 (20%), Positives = 56/124 (45%), Gaps = 10/124 (8%)

Query: 1540 KVLIDSFNKGIIGMTELES------EEVKDLIQVGVHGLQVELNAL---LQNIEAEINKV 1590
            K+L D F +      +LE       +E+K   +  VH +  ++  L     N+  +I + 
Sbjct: 231  KLLFDYFTESYKSFLKLEDNYEPSEQELKLGFEKFVHIINTDIANLKTQNDNLYEKIQEA 290

Query: 1591 ANAAQERKAMQDKISKLQSWLKQYIDY-SGNKQALQSRLDKVNEIQESFPEISTKLQTLS 1649
               +Q+ K +++K   L+S   +Y +Y +  KQ  Q    K+ +++        +++ L 
Sbjct: 291  MKISQKIKTLREKWRALKSDSNKYENYVNAMKQKSQEWPGKLEKLKSEIELKEEEIKALQ 350

Query: 1650 DHIE 1653
             +I+
Sbjct: 351  SNID 354


>gnl|CDD|215918 pfam00435, Spectrin, Spectrin repeat.  Spectrin repeat-domains are
           found in several proteins involved in cytoskeletal
           structure. These include spectrin, alpha-actinin and
           dystrophin. The sequence repeat used in this family is
           taken from the structural repeat in reference. The
           spectrin domain- repeat forms a three helix bundle. The
           second helix is interrupted by proline in some
           sequences. The repeats are defined by a characteristic
           tryptophan (W) residue at position 17 in helix A and a
           leucine (L) at 2 residues from the carboxyl end of helix
           C. Although the domain occurs in ultiple repeats along
           sequences, the domains are actually stable on their own
           - ie they act, biophysically, like domains rather than
           repeats that along function when aggregated.
          Length = 105

 Score = 30.0 bits (68), Expect = 4.4
 Identities = 15/99 (15%), Positives = 40/99 (40%), Gaps = 6/99 (6%)

Query: 699 KWQQYVNDELEWENHSQNTLQWLDNIRS--DMKTLLERSKHIVDDIEKKKDDLKPIQKEA 756
           ++ +  +D   W    +  L   D  +    ++ LL++ K +  ++   +D ++ + + A
Sbjct: 5   QFFRDADDLESWIEEKEALLSSEDYGKDLESVQALLKKHKALEAELAAHQDRVEALNELA 64

Query: 757 QQLSEWD----SSIVPKVSDLETRWAAVKSAWDAKHSDL 791
           ++L          I  ++ +L  RW  +      +   L
Sbjct: 65  EKLIAEGHYASEEIQERLEELNERWEQLLELAAERKQKL 103


>gnl|CDD|133400 cd04773, HTH_TioE_rpt2, Second Helix-Turn-Helix DNA binding domain of
            the regulatory protein TioE.  Putative helix-turn-helix
            (HTH) regulatory protein, TioE, and related proteins.
            TioE is part of the thiocoraline gene cluster, which is
            involved in the biosynthesis of the antitumor
            thiocoraline from the marine actinomycete,
            Micromonospora. These proteins share the N-terminal DNA
            binding domain with other transcription regulators of the
            MerR superfamily that promote transcription by
            reconfiguring the spacer between the -35 and -10 promoter
            elements. Proteins in this family are unique within the
            MerR superfamily in that they are composed of just two
            adjacent MerR-like N-terminal domains; this CD mainly
            contains the C-terminal or second repeat (rpt2) of these
            tandem MerR-like domain proteins.
          Length = 108

 Score = 30.0 bits (68), Expect = 4.6
 Identities = 14/43 (32%), Positives = 18/43 (41%), Gaps = 5/43 (11%)

Query: 1276 RLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETA 1318
             L  +   VE+LR         +E L A LE+   AL  R  A
Sbjct: 59   LLEQIATVVEQLRHAGG-----TEALAAALEQRRVALTQRGRA 96



 Score = 29.6 bits (67), Expect = 5.0
 Identities = 15/54 (27%), Positives = 21/54 (38%), Gaps = 5/54 (9%)

Query: 1217 EITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETA 1270
             +   L+     L  +   VE+LR         +E L A LE+   AL  R  A
Sbjct: 48   RLIHLLRRGGYLLEQIATVVEQLRHAGG-----TEALAAALEQRRVALTQRGRA 96


>gnl|CDD|222097 pfam13393, tRNA-synt_His, Histidyl-tRNA synthetase.  This is a family
            of class II aminoacyl-tRNA synthetase-like and ATP
            phosphoribosyltransferase regulatory subunits.
          Length = 308

 Score = 31.4 bits (72), Expect = 5.4
 Identities = 21/82 (25%), Positives = 29/82 (35%), Gaps = 3/82 (3%)

Query: 1241 QQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEK 1300
            Q+   R A   K   ELEE++  L        +   L  L    E L  +  +    S  
Sbjct: 169  QEEALRDALQRKDLPELEELLAELGLDPALADALLALPELYGDPEVL-DEALELLPGSPA 227

Query: 1301 LGA--ELEEIIEALHARETAVK 1320
              A  ELE +   L A    V+
Sbjct: 228  AAALDELEALAALLEALGPGVR 249


>gnl|CDD|215618 PLN03184, PLN03184, chloroplast Hsp70; Provisional.
          Length = 673

 Score = 31.7 bits (72), Expect = 5.6
 Identities = 28/83 (33%), Positives = 38/83 (45%), Gaps = 18/83 (21%)

Query: 918 DKIDRLEKETKAQSDRIQDALKTLNQLIGHRKEFEVDV-----ENVEAKLKELEVAVEGD 972
           DK  R   +TK Q+D +    +T  QL    KE    V     E VEAKLKEL+ A+   
Sbjct: 558 DKEKRDAVDTKNQADSV--VYQTEKQL----KELGDKVPADVKEKVEAKLKELKDAIASG 611

Query: 973 VKTNNVLMLQELLNKYAQLNEEA 995
                    Q++ +  A LN+E 
Sbjct: 612 -------STQKMKDAMAALNQEV 627


>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57.  Eukaryotic
            cilia and flagella are specialised organelles found at
            the periphery of cells of diverse organisms.
            Intra-flagellar transport (IFT) is required for the
            assembly and maintenance of eukaryotic cilia and
            flagella, and consists of the bidirectional movement of
            large protein particles between the base and the distal
            tip of the organelle. IFT particles contain multiple
            copies of two distinct protein complexes, A and B, which
            contain at least 6 and 11 protein subunits. IFT57 is part
            of complex B but is not, however, required for the core
            subunits to stay associated. This protein is known as
            Huntington-interacting protein-1 in humans.
          Length = 355

 Score = 31.2 bits (71), Expect = 5.6
 Identities = 17/79 (21%), Positives = 38/79 (48%), Gaps = 7/79 (8%)

Query: 1218 ITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHA-------RETA 1270
            +  QL+   Q+    R+T+ +++++  + +    +L  EL EI E L         R  +
Sbjct: 260  LNNQLEQLVQEYREARRTLSQVQEKYNQASQGVSELTRELNEISEELEQVKQEMEERGAS 319

Query: 1271 VPSSSRLHNLRKTVEKLRQ 1289
            +   S L  +++ + KL++
Sbjct: 320  MSDGSPLVKIKQAITKLKE 338


>gnl|CDD|217331 pfam03036, Perilipin, Perilipin family.  The perilipin family
           includes lipid droplet-associated protein (perilipin)
           and adipose differentiation-related protein
           (adipophilin).
          Length = 390

 Score = 31.3 bits (71), Expect = 5.6
 Identities = 15/45 (33%), Positives = 22/45 (48%)

Query: 505 ETLNDKCEALMELAAHSPIREETLRLQALYAALVTSVQGLVSQVE 549
              +D     +E    S  R  T +LQ    +LV+SVQGL + V+
Sbjct: 290 NQEDDARPEQLESRTLSLTRSLTQQLQTTCLSLVSSVQGLPANVQ 334



 Score = 31.3 bits (71), Expect = 5.6
 Identities = 15/45 (33%), Positives = 22/45 (48%)

Query: 1872 ETLNDKCEALMELAAHSPIREETLRLQALYAALVTSVQGLVSQVE 1916
               +D     +E    S  R  T +LQ    +LV+SVQGL + V+
Sbjct: 290  NQEDDARPEQLESRTLSLTRSLTQQLQTTCLSLVSSVQGLPANVQ 334


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
            Transcription of the anti-viral guanylate-binding protein
            (GBP) is induced by interferon-gamma during macrophage
            induction. This family contains GBP1 and GPB2, both
            GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 31.1 bits (71), Expect = 5.7
 Identities = 23/89 (25%), Positives = 42/89 (47%), Gaps = 5/89 (5%)

Query: 1927 AKKEEVEAWLHKAHATVQECDELEAGIKEEQKNIETQYLQWNSFRETLAQMTAWLDSVEK 1986
            AK++ +EA   KA A   E + L    KEE++ +E Q     S++E + Q+   +++  +
Sbjct: 201  AKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQE---RSYQEHVKQLIEKMEAERE 257

Query: 1987 NIKQEIATPWTTTQELRSKLLKLKEILKK 2015
             +  E         +L+ +   LKE  K 
Sbjct: 258  KLLAEQE--RMLEHKLQEQEELLKEGFKT 284


>gnl|CDD|218632 pfam05549, Allexi_40kDa, Allexivirus 40kDa protein. 
          Length = 271

 Score = 31.3 bits (71), Expect = 5.8
 Identities = 27/119 (22%), Positives = 48/119 (40%), Gaps = 23/119 (19%)

Query: 706 DELEWENHS--QNTLQWLDNIRSD---------------MKTLLER----SKHIVDDIEK 744
           DEL  + H+  QN+L+WL +I  +                 T L R     + +   ++ 
Sbjct: 37  DELYGQLHALHQNSLEWLTHISHNVDSIINKLNPVNLTSQGTPLSRLRDALRTLTRTVDS 96

Query: 745 KKDDLKPIQKEAQQLSEWDSSIVPKVSDLETRWAAVKSAWDAKHSDLSSEIAQYSSYQN 803
                + I+   Q  S   S ++ K++ +ET   A+ +  D   S L+S +A  S    
Sbjct: 97  IYSVEQKIELSTQTPS--SSKLLKKLTSIETSLEALHAKLDELTSSLTSTLAGPSVSPT 153


>gnl|CDD|234541 TIGR04320, Surf_Exclu_PgrA, SEC10/PgrA surface exclusion domain.
            This model describes a conserved domain found in surface
            proteins of a number of Firmutes. Many members have LPXTG
            C-terminal anchoring motifs and a substantial number have
            the KxYKxGKxW putative sorting signal at the N-terminus.
            The tetracycline resistance plasmid pCF10 in Enterococcus
            faecalis promotes conjugal plasmid transfer in response
            to sex pheromones, but PgrA/Sec10 encoded by that
            plasmid, a member of this family, specifically inhibits
            the ability of cells to receive homologous plasmids. The
            phenomenon is called surface exclusion.
          Length = 356

 Score = 31.2 bits (71), Expect = 5.9
 Identities = 17/116 (14%), Positives = 38/116 (32%)

Query: 992  NEEAGELNMMISNVSIATEHMNLNDADRLTVNDSVSSMQKRYAQLSRTIEDRISTVNNKL 1051
            N+    ++ +  N S   +    +        +S+ ++Q   A     +    + +N   
Sbjct: 222  NDGGVNIHFVNFNDSYIADGNKFDKTPIPNTPNSLKALQAALATAQADLAAAQTALNTAQ 281

Query: 1052 QSVKRLQHKVEEAKALLQKAESQLARPVSTQAEDISEALTVYQKLLDEINSWKNSQ 1107
             ++   Q     A+A L  A+ +LA   +   +     L   Q  L    +     
Sbjct: 282  AALTTAQTAYAAAQAALATAQKELANAQAQALQTAQNNLATAQAALANAEARLAKA 337


>gnl|CDD|214880 smart00880, CHAD, The CHAD domain is an alpha-helical domain
            functionally associated with some members of the
            adenylate cyclase family.  It has conserved histidines
            that may chelate metals.
          Length = 262

 Score = 31.2 bits (71), Expect = 6.1
 Identities = 20/94 (21%), Positives = 30/94 (31%), Gaps = 14/94 (14%)

Query: 1228 QLHNLRKTVEKLRQ-----QNEKRAAESEKLGAELEEIIEALH-ARETAVPSSSRLHNLR 1281
             LH LR  + +LR      +       +  L AEL  +   L   R+  V        L 
Sbjct: 24   ALHQLRVALRRLRSALRLFRPVLPREAAAALRAELRWLARELGPLRDLDV--------LL 75

Query: 1282 KTVEKLRQQNEKRAAESEKLGAELEEIIEALHAR 1315
            + + +L           + L A LE    A    
Sbjct: 76   ERLLELLAALLPELPALDALVAALEARRAAARRA 109


>gnl|CDD|220381 pfam09753, Use1, Membrane fusion protein Use1.  This entry is of a
           family of proteins all approximately 300 residues in
           length. The proteins have a single C-terminal
           trans-membrane domain and a SNARE [soluble NSF
           (N-ethylmaleimide-sensitive fusion protein) attachment
           protein receptor] domain of approximately 60 residues.
           The SNARE domains are essential for membrane fusion and
           are conserved from yeasts to humans. Use1 is one of the
           three protein subunits that make up the SNARE complex
           and it is specifically required for Golgi-endoplasmic
           reticulum retrograde transport.
          Length = 251

 Score = 30.9 bits (70), Expect = 6.6
 Identities = 41/177 (23%), Positives = 69/177 (38%), Gaps = 25/177 (14%)

Query: 64  TDDWLTGLEKDLDTLKNTTDVEAKNSLLQKLLTEKDQAGHKLTYLTSSGEKLYLDTAAKG 123
           + D LT   + +D LK   + E  +S      +EK  A  KL    S G      T  + 
Sbjct: 51  SVDVLTEYRRRVDFLKGLLEAEKPSSP-----SEKALANQKL----SRGRA---PTVDEP 98

Query: 124 REVVRQQLRALRDSKCSKTTLQEVLSHKRIIESLLD--KSKSLPQINKDQALEKAIVSVN 181
           R    +++     S+ +    +E+LS    +    D     S   IN D+A   +I  V 
Sbjct: 99  RSPASKEIHQQTKSRYTSELRKELLSGSTSLRRSTDAKDRDSSGPINDDEASAASIDQVL 158

Query: 182 KRY----EDLVDGILKTISQLEESLDIFQQFQQLQKAYQEDQKQLWDKLSSLTEREL 234
           + +    E L + +L     L+E      Q    Q+  +ED + L    + L ++ L
Sbjct: 159 QYHQNLQEKLAEEMLLLARNLKE------QSLAAQQIIKEDTEIL-SSSAKLADQNL 208


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
            envelope biogenesis, outer membrane].
          Length = 387

 Score = 31.1 bits (70), Expect = 6.6
 Identities = 41/209 (19%), Positives = 79/209 (37%), Gaps = 24/209 (11%)

Query: 1209 DGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARE 1268
            D  + D   + +Q    + Q  + +K  ++ +++ E+ A E +   A  +E ++ L    
Sbjct: 51   DAVMVDPGAVVQQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKER 110

Query: 1269 TAVPSSSRLHNLRKTVEKLRQQNEKRAAESE-KLGAELEEIIEALHARETAVKTLPVLLL 1327
                   +     +  EK  Q  +K+  E   K  AE ++  EA  A+  A         
Sbjct: 111  LKAQEQQK---QAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAA--------- 158

Query: 1328 EVTSVDVELDKQRLELATKEVQPYVQRYETLKKNIAEAIDKFETQANEHEAYKQAYNEAY 1387
            E   +    +      A K+ +   +  E  K     A  K + +A    A ++A  EA 
Sbjct: 159  EAAKLKAAAE------AKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEA- 211

Query: 1388 DWLRKAKLGAQANADCHGEQQTTKDKADK 1416
                +AK  A+  A+   E++   +K   
Sbjct: 212  ----EAKAKAEKKAEAAAEEKAAAEKKKA 236


>gnl|CDD|224702 COG1788, AtoD, Acyl CoA:acetate/3-ketoacid CoA transferase, alpha
           subunit [Lipid metabolism].
          Length = 220

 Score = 30.7 bits (70), Expect = 6.7
 Identities = 10/32 (31%), Positives = 15/32 (46%), Gaps = 1/32 (3%)

Query: 690 VFTEIDLNTK-WQQYVNDELEWENHSQNTLQW 720
           + + I  N +  +Q +  ELE E   Q TL  
Sbjct: 74  IASYIGSNPEFGRQMLAGELEVELVPQGTLAE 105


>gnl|CDD|224486 COG1570, XseA, Exonuclease VII, large subunit [DNA replication,
            recombination, and repair].
          Length = 440

 Score = 31.1 bits (71), Expect = 6.8
 Identities = 27/119 (22%), Positives = 52/119 (43%), Gaps = 9/119 (7%)

Query: 1751 SVATLLDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRTIYQDA------TSHQQDLLQLK 1804
              A LL  L   +++L     L+  LD+KK +L+      Q        +  QQ L +L 
Sbjct: 262  DSAELLQQLDQLQRRL--HRALRRLLDQKKQRLEHLARRLQFRSPERLLSEQQQRLDELA 319

Query: 1805 DKIE-SLHQPSEQSKQQLATITTRHGNVLKRAQKKIEAEKDIGDCTNLDDLEKYKALLQ 1862
             ++  +L       KQ+L  +T R    ++R Q++++  +   D      L++ +  L+
Sbjct: 320  IRLRRALENQLALKKQRLERLTQRLNPQIQRQQQRLQQLERRLDKALRRQLKRKRERLE 378


>gnl|CDD|115307 pfam06637, PV-1, PV-1 protein (PLVAP).  This family consists of
            several PV-1 (PLVAP) proteins which seem to be specific
            to mammals. PV-1 is a novel protein component of the
            endothelial fenestral and stomatal diaphragms. The
            function of this family is unknown.
          Length = 442

 Score = 31.2 bits (70), Expect = 7.1
 Identities = 52/238 (21%), Positives = 98/238 (41%), Gaps = 31/238 (13%)

Query: 1888 SPIREETLRLQALYAALVTSVQGLVSQVEKNLSDQTEF--LAKKEEVEAWLHKAHATVQE 1945
            S ++   +R   LY+ L+      +S  + NLS +  F  LAK   ++  L+      ++
Sbjct: 58   SNLQATEIRADGLYSQLLG-----LSASQANLSKELNFSLLAKDAIMQMLLN----ARRD 108

Query: 1946 CDELEAGIKEEQKNIETQYLQWNSFRETLAQMTAWLDSVEKNIKQEIATPWTTTQELRSK 2005
             D + A  ++ Q ++   Y+  N F      + A + S EK  + +      +   L   
Sbjct: 109  LDRINASFRQCQGDLII-YINNNRF------IAAIILS-EKQCQDQFKDMNKSCDAL--- 157

Query: 2006 LLKLKEILKKCEKGVGDHEAFVDKYEEC---------SQKLAQVEAKYAKLIEPETSYEE 2056
            L KL E +K  E  +   +A   K +E           Q  A  +A+  +  E + + E 
Sbjct: 158  LFKLGEKVKTLEMEIAKEKAICSKDKESLLAGKRQAEEQLEACGKARELQHQEQQLAEEN 217

Query: 2057 LQKRQGQLTALLTEKVDMDVLLNACVDLCDKVYESTSEPGHEPLRVQMEKLQQAVEAL 2114
            LQK Q     L  +K + D+L      +  +  ++     H PL  ++  +++A + L
Sbjct: 218  LQKVQALCIPLDKDKFEADLLNAWRDSIIPRSLDNLGYNLHHPLGPELASIRRACDHL 275


>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
            unknown].
          Length = 984

 Score = 31.4 bits (71), Expect = 7.2
 Identities = 37/186 (19%), Positives = 63/186 (33%), Gaps = 29/186 (15%)

Query: 1223 QSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPS-----SSRL 1277
              R  Q++ L + +++ R + ++   E       LE    A HAR   + S        +
Sbjct: 177  SGRNPQINQLLEKLKQERNEIDEAEKEYATYHKLLESR-RAEHARLAELRSELRADRDHI 235

Query: 1278 HNLRKTVEKLRQQNEKRAAESE------------KLGA-ELEEIIEALHARETAVKTLPV 1324
              LR  VE   +  E +  E E            + G   LE+    L   E  +  L V
Sbjct: 236  RALRDAVELWPRLQEWKQLEQELTRRREELATFPRDGVLRLEKREAHLQKTEAEIDALLV 295

Query: 1325 ----------LLLEVTSVDVELDKQRLELATKEVQPYVQRYETLKKNIAEAIDKFETQAN 1374
                       L+      ++   +  +  ++      +  ETL    A+  DK E   N
Sbjct: 296  RLAELKDLASQLIPAKEAVLQALVRLHQQLSEIKASAFELTETLAGIEADLRDKEEAAGN 355

Query: 1375 EHEAYK 1380
              EA +
Sbjct: 356  GFEAER 361


>gnl|CDD|185746 cd08915, V_Alix_like, Protein-interacting V-domain of mammalian Alix
            and related domains.  This superfamily contains the
            V-shaped (V) domain of mammalian Alix (apoptosis-linked
            gene-2 interacting protein X), His-Domain type N23
            protein tyrosine phosphatase (HD-PTP, also known as
            PTPN23), Bro1 and Rim20 (also known as PalA) from
            Saccharomyces cerevisiae, and related domains. Alix,
            HD-PTP, Bro1, and Rim20 all interact with the ESCRT
            (Endosomal Sorting Complexes Required for Transport)
            system. Alix, also known as apoptosis-linked gene-2
            interacting protein 1 (AIP1), participates in membrane
            remodeling processes during the budding of enveloped
            viruses, vesicle budding inside late endosomal
            multivesicular bodies (MVBs), and the abscission
            reactions of mammalian cell division. It also functions
            in apoptosis. HD-PTP functions in cell migration and
            endosomal trafficking, Bro1 in endosomal trafficking, and
            Rim20 in the response to the external pH via the Rim101
            pathway. The Alix V-domain contains a binding site,
            partially conserved in this superfamily, for the
            retroviral late assembly (L) domain YPXnL motif. The Alix
            V-domain is also a dimerization domain. Members of this
            superfamily have an N-terminal Bro1-like domain, which
            binds components of the ESCRT-III complex. The Bro1-like
            domains of Alix and HD-PTP can also bind human
            immunodeficiency virus type 1 (HIV-1) nucleocapsid. Many
            members, including Alix, HD-PTP, and Bro1, also have a
            proline-rich region (PRR), which binds multiple partners
            in Alix, including Tsg101 (tumor susceptibility gene 101,
            a component of ESCRT-1) and the apoptotic protein ALG-2.
            The C-terminal portion (V-domain and PRR) of Bro1
            interacts with Doa4, a ubiquitin thiolesterase needed to
            remove ubiquitin from MVB cargoes; it interacts with a
            YPxL motif in Doa4s catalytic domain to stimulate its
            deubiquitination activity. Rim20 may bind the ESCRT-III
            subunit Snf7, bringing the protease Rim13 (a
            YPxL-containing transcription factor) into proximity with
            Rim101, and promoting the proteolytic activation of
            Rim101. HD-PTP is encoded by the PTPN23 gene, a tumor
            suppressor gene candidate often absent in human kidney,
            breast, lung, and cervical tumors. HD-PTP has a
            C-terminal catalytically inactive tyrosine phosphatase
            domain.
          Length = 342

 Score = 31.2 bits (71), Expect = 7.3
 Identities = 34/151 (22%), Positives = 57/151 (37%), Gaps = 8/151 (5%)

Query: 1227 QQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHN-LRKTVE 1285
            ++L  LR+ VE+L Q+ E+   E      +L      L  R    PSS      L + V 
Sbjct: 80   KELSKLRQNVEELLQECEELLEEEAAEDDQLRAKFGTLRWR---RPSSDEAAKELYEKVT 136

Query: 1286 KLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPVLLLEVTSVDVELDKQRLELAT 1345
            KLR   E+ +    ++    E I   L            L   + S    LD +  E+  
Sbjct: 137  KLRGYLEQASNSDNEVLQCYESIDPNL---VLLCGGYKELKAFIPSPYPALDPEVSEVV- 192

Query: 1346 KEVQPYVQRYETLKKNIAEAIDKFETQANEH 1376
              ++P +     L+K     I + E ++  +
Sbjct: 193  SSLRPLLNEVSELEKERERFISELEIKSRNN 223


>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
            PFL_4705 family.  Members of this protein family are
            found occasionally on plasmids such as the Pseudomonas
            putida toluene catabolic TOL plasmid pWWO_p085. Usually,
            however, they are found on the bacterial main chromosome
            in regions flanked by markers of conjugative transfer
            and/or transposition [Mobile and extrachromosomal element
            functions, Plasmid functions].
          Length = 472

 Score = 31.2 bits (71), Expect = 7.3
 Identities = 20/100 (20%), Positives = 43/100 (43%), Gaps = 10/100 (10%)

Query: 1228 QLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVPSSSRLHNLRKTVEKL 1287
            ++  LRK + KL  +NE   AE+E+L    + I + +            + +  + + K 
Sbjct: 67   EVKELRKRLAKLISENEALKAENERLQKREQSIDQQI---------QQAVQSETQELTKE 117

Query: 1288 RQQNEKRAAESEKLGAELEEIIEALHARETAVKT-LPVLL 1326
             +Q +    + + L  +L+  +  +    +   + LPV L
Sbjct: 118  IEQLKSERQQLQGLIDQLQRRLAGVLTGPSGGGSDLPVGL 157


>gnl|CDD|135648 PRK05898, dnaE, DNA polymerase III DnaE; Validated.
          Length = 971

 Score = 31.4 bits (71), Expect = 7.8
 Identities = 22/97 (22%), Positives = 45/97 (46%), Gaps = 14/97 (14%)

Query: 413 PIGKENINQEINQITLDWTNLQNTLQVIDKH---------HAKCLSIWNDFLSSKNTLEK 463
           PI  +N+N+ +N++ ++  +L   +   DK             C+S  N  L++ +   K
Sbjct: 192 PIQLDNLNKVLNELKVEIHDLPINIIKYDKQNSIISSEILKQLCISGLNKRLNANDGQVK 251

Query: 464 WIEGFQKKIEAEKDIGDCTNLDDLEKYKALLQEVISH 500
            I  + K+++ E DI +    DD   Y  ++ + I+ 
Sbjct: 252 KI--YVKRLKYELDIINEKQFDD---YFLIVYDFINF 283


>gnl|CDD|225747 COG3206, GumC, Uncharacterized protein involved in exopolysaccharide
            biosynthesis [Cell envelope biogenesis, outer membrane].
          Length = 458

 Score = 30.9 bits (70), Expect = 8.2
 Identities = 42/227 (18%), Positives = 75/227 (33%), Gaps = 29/227 (12%)

Query: 1143 LREQFTTLIMQIVGFITETTARVGEVDGNSATVQEKIDK---YDKIIADIQEYEAILATA 1199
            LR +      Q+  F  +         G   + Q+            A + + EA LA+ 
Sbjct: 207  LRARLQEAEAQVEDFRAQHGL-TDAARGQLLSEQQLSALNTQLQSARARLAQAEARLASL 265

Query: 1200 SDKGDQLSSDGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEE 1259
                        + +  E +  +Q  +QQ   +R+ +  L  +   +  +   L A+L E
Sbjct: 266  LQLLPLGREAAALREVLE-SPTIQDLRQQYAQVRQQIADLSTELGAKHPQLVALEAQLAE 324

Query: 1260 IIEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAV 1319
             +    A E                 ++        A  E+  A LE+ +  L  R + +
Sbjct: 325  -LRQQIAAELR---------------QILASLPNELALLEQQEAALEKELAQLKGRLSKL 368

Query: 1320 KTLPVLLLEVTSVDVELDKQRLELATKEVQPYVQRYETLKKNIAEAI 1366
              L V L E+   + E  +   E         +QRY+ L    A  I
Sbjct: 369  PKLQVQLRELER-EAEAARSLYET-------LLQRYQELSIQEASPI 407


>gnl|CDD|203573 pfam07071, DUF1341, Protein of unknown function (DUF1341).  This
            family consists of several hypothetical bacterial
            proteins of around 220 residues in length. The function
            of this family is unknown.
          Length = 218

 Score = 30.4 bits (69), Expect = 8.2
 Identities = 17/43 (39%), Positives = 20/43 (46%), Gaps = 2/43 (4%)

Query: 1421 TKSLPEGQKLIDKTVALKNNVLESTGPIGKENINQEINQITLD 1463
             K L E   +  K  A     LE TG I  +N  +EI QI LD
Sbjct: 160  LKHLDEYAAVA-KACAEHGFYLEPTGGIDLDNF-EEIVQIALD 200


>gnl|CDD|183293 PRK11730, fadB, multifunctional fatty acid oxidation complex
           subunit alpha; Reviewed.
          Length = 715

 Score = 31.0 bits (71), Expect = 8.2
 Identities = 25/80 (31%), Positives = 34/80 (42%), Gaps = 26/80 (32%)

Query: 503 DMETLNDKCEALMELAAHSPIREETLRLQALYAALVTSVQGLVSQVEKNL----SDQTEF 558
           D  TL    EAL  L A S ++            L+TS         K+     +D TEF
Sbjct: 32  DRATLASLGEALDALEAQSDLK----------GLLLTS--------AKDAFIVGADITEF 73

Query: 559 L----AKKEEVEAWLHKAHA 574
           L    A +EE+  WLH A++
Sbjct: 74  LSLFAAPEEELSQWLHFANS 93



 Score = 31.0 bits (71), Expect = 8.2
 Identities = 25/80 (31%), Positives = 34/80 (42%), Gaps = 26/80 (32%)

Query: 1870 DMETLNDKCEALMELAAHSPIREETLRLQALYAALVTSVQGLVSQVEKNL----SDQTEF 1925
            D  TL    EAL  L A S ++            L+TS         K+     +D TEF
Sbjct: 32   DRATLASLGEALDALEAQSDLK----------GLLLTS--------AKDAFIVGADITEF 73

Query: 1926 L----AKKEEVEAWLHKAHA 1941
            L    A +EE+  WLH A++
Sbjct: 74   LSLFAAPEEELSQWLHFANS 93


>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail.  The myosin molecule is a
            multi-subunit complex made up of two heavy chains and
            four light chains it is a fundamental contractile protein
            found in all eukaryote cell types. This family consists
            of the coiled-coil myosin heavy chain tail region. The
            coiled-coil is composed of the tail from two molecules of
            myosin. These can then assemble into the macromolecular
            thick filament. The coiled-coil region provides the
            structural backbone the thick filament.
          Length = 859

 Score = 31.2 bits (71), Expect = 8.3
 Identities = 84/494 (17%), Positives = 188/494 (38%), Gaps = 113/494 (22%)

Query: 916  IKDKIDRLEK-ETKAQSDRIQ------DALKTLNQLIGHRKEFEVDVENVEAKLKELEVA 968
            + ++I++L+K + KA+ ++ Q      D L  L+Q+   +   E   + +E++L EL+V 
Sbjct: 136  LSEQIEQLQKQKAKAEKEKSQLQAEVDDLLAQLDQIAKAKLNAEKKAKQLESQLSELQVK 195

Query: 969  VEGDVKTNNVLMLQELLNKYAQLNEEAGEL------------------NMMISNVSIATE 1010
            ++   +      L +L ++ ++L  E  +L                  + + S +  A  
Sbjct: 196  LDELQRQ-----LNDLTSQKSRLQSENSDLTRQLEEAEAQVSNLSKLKSQLESQLEEAKR 250

Query: 1011 HMNLNDADRLTVNDSVSSMQKRYAQLSRTIEDRISTVNNKLQSVKRLQHKVEEAKALLQK 1070
             +     +R  +   +  ++     L   +E+              L+ ++ +A A +Q+
Sbjct: 251  SLEEESRERANLQAQLRQLEHDLDSLREQLEEESEAKAE-------LERQLSKANAEIQQ 303

Query: 1071 AESQLARPVSTQAEDISEALTVYQKLLDEINSWKNSQSDEDLVHLADSIKPLDEVVERIE 1130
              S+     + +AE++ E      + + E                      L+E  E   
Sbjct: 304  WRSKFESEGALRAEELEELKKKLNQKISE----------------------LEEAAEAAN 341

Query: 1131 --NHAVPKLKSLLLLREQFTTLIMQIVGFITETTARVGEVDGNSATVQEKIDKYDKIIAD 1188
                ++ K KS             ++   + +    +   +  ++ +++K   +DKI+A+
Sbjct: 342  AKCDSLEKTKS-------------RLQSELEDLQIELERANAAASELEKKQKNFDKILAE 388

Query: 1189 IQEYEAILATASDKGDQLSSDGTISDRNEITEQLQSRKQQLHNLRKTVEKLRQQNEKRAA 1248
             +                  D   ++ +    + ++   +L  L+  +E+L+ Q E    
Sbjct: 389  WK---------------RKVDELQAELDTAQREARNLSTELFRLKNELEELKDQVEALRR 433

Query: 1249 ESEKLGAELEEIIEALHARETAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEI 1308
            E++ L  E+ ++ + L            +H L K   +L        AE ++L A LEE 
Sbjct: 434  ENKNLQDEIHDLTDQLGE------GGRNVHELEKARRRL-------EAEKDELQAALEEA 480

Query: 1309 IEALHARETAVKTLPVLLLEVTSVDVELDKQRLELATKEVQPYVQRYETLKKNIAEAIDK 1368
              AL   E+ V       +E++ +  E++++   LA KE     + +E  +KN   AI+ 
Sbjct: 481  EAALELEESKVLR---AQVELSQIRSEIERR---LAEKE-----EEFENTRKNHQRAIES 529

Query: 1369 FETQANEHEAYKQA 1382
             +         K  
Sbjct: 530  LQATLEAEAKGKAE 543


>gnl|CDD|189016 cd09609, M3B_PepF_4, Peptidase family M3B Oligopeptidase F (PepF).
            Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase
            B; EC 3.4.24.-) is mostly bacterial and includes
            oligoendopeptidase F from Lactococcus lactis. This enzyme
            hydrolyzes peptides containing between 7 and 17 amino
            acids with fairly broad specificity. The PepF gene is
            duplicated in L. lactis on the plasmid that bears it,
            while a shortened second copy is found in Bacillus
            subtilis. Most bacterial PepFs are cytoplasmic
            endopeptidases; however, the PepF Bacillus
            amyloliquefaciens oligopeptidase is a secreted protein
            and may facilitate the process of sporulation.
            Specifically, the yjbG gene encoding the homolog of the
            PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis
            has been identified in Bacillus subtilis as an inhibitor
            of sporulation initiation when over expressed from a
            multicopy plasmid.
          Length = 586

 Score = 31.0 bits (71), Expect = 8.4
 Identities = 29/124 (23%), Positives = 54/124 (43%), Gaps = 16/124 (12%)

Query: 1215 RNEITEQLQSRKQQLHNLRKT-------VEKLRQQNEKRAAESEKLGAELEEIIEALHAR 1267
            R+E+  +L      L +L KT       ++KL++  +    + +    + E I+ AL   
Sbjct: 1    RSEVPVEL---TWDLTDLFKTEEAFEAALKKLKELAKDFKKKYKGKLTDAETILSALDDY 57

Query: 1268 ETAVPSSSRLHN---LRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARETAVKTLPV 1324
            E  +   SRL +   L ++ +    +N+ RAA   K  +   +I   L   E+ +  L  
Sbjct: 58   EEILELISRLSHYASLPQSTDGTDPENQARAA---KFDSLAAKINAKLSFFESELLALDE 114

Query: 1325 LLLE 1328
             +LE
Sbjct: 115  KILE 118


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 30.6 bits (69), Expect = 8.6
 Identities = 29/134 (21%), Positives = 56/134 (41%), Gaps = 13/134 (9%)

Query: 185 EDLVDGILKTISQLEESLDIFQQFQQLQKAYQEDQKQLWDKLSSLTERELKGKLSSWCGY 244
           + L++ +   +   E+  D ++Q+    ++   +        S L E + +         
Sbjct: 28  DILLERLDSELRDAEKERDTYKQYLSKLESQNVEISNYEALDSELDELKKE--------- 78

Query: 245 EESVATLLDWLKVTEKKLGDEMELKTTLDEKKAQLQVYRTIYQ-DATSHQQDLLQLKDKI 303
           EE +   L+ L+  +  L  E+     L E+K QL+     Y  +     ++ LQL+D +
Sbjct: 79  EERLLDELEELEKEDDDLDGELV---ELQEEKEQLENEELQYLREYNLFDRNNLQLEDNL 135

Query: 304 ESLHQPSEQSKQQL 317
           +SL    E S  QL
Sbjct: 136 QSLELQYEYSLNQL 149


>gnl|CDD|222679 pfam14322, SusD-like_3, Starch-binding associating with outer
            membrane.  SusD is a secreted starch-binding protein with
            an N-terminal lipid tail that allows it to associate with
            the outer membrane.
          Length = 190

 Score = 30.0 bits (68), Expect = 8.6
 Identities = 16/44 (36%), Positives = 21/44 (47%), Gaps = 3/44 (6%)

Query: 1161 TTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASDKGD 1204
            TTA   E +   ATVQE    YD+II D++E   +L        
Sbjct: 131  TTATEPEQNLPRATVQEV---YDQIIKDLEEAIPLLPDEIPSVP 171


>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD.  This
            model describes the DndB protein encoded by an operon
            associated with a sulfur-containing modification to DNA.
            The operon is sporadically distributed in bacteria, much
            like some restriction enzyme operons. DndD is described
            as a putative ATPase. The small number of examples known
            so far include species from among the Firmicutes,
            Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
            metabolism, Restriction/modification].
          Length = 650

 Score = 30.8 bits (70), Expect = 9.2
 Identities = 60/291 (20%), Positives = 117/291 (40%), Gaps = 21/291 (7%)

Query: 1042 DRISTVNNKLQSVKRLQHKVEEAKALLQKAESQLAR--PVSTQAEDISEALTVYQKLLDE 1099
            D I  +   L +V R + K E   ++L + E+  A     S + ED+++ +   +  L+E
Sbjct: 182  DLIDRLAGDLTNVLRRRKKSELPSSILSEIEALEAELKEQSEKYEDLAQEIAHLRNELEE 241

Query: 1100 INSWKNSQSDEDLVHLADSIKPLDEVVERIENHAVPKLKSLLLLREQFTTLIMQIVGFI- 1158
                  S   +           L E  E++E        +    R Q   L    +  + 
Sbjct: 242  AQRSLESLEKK----FRSEGGDLFEEREQLERQLKEIEAARKANRAQLRELAADPLPLLL 297

Query: 1159 -------TETTARVGEVDGNSATVQEKIDKYDKIIADIQEYEAILATASDK--GDQLSSD 1209
                   T+   +  E    +   QE++++ DK + +     A+ A    +   +    D
Sbjct: 298  IPNLLDSTKAQLQKEEQSQQNQLTQEELEERDKELLESLPKLALPAEHVKEIAAELAEID 357

Query: 1210 GTISDRNEITEQL-QSRKQQLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEALHARE 1268
               +  +EI  +L  S   QL  L + V++   Q+ K     E    ELEE +  +  + 
Sbjct: 358  KPATTDSEIPHRLSGSELTQLEVLIQQVKR-ELQDAKSQLLKEL--RELEEELAEVDKKI 414

Query: 1269 TAVPSSSRLHNLRKTVEKLRQQNEKRAAESEKLGAELEEIIEAL-HARETA 1318
            + +PS  ++  L + + + + +  +  AE E+L  +LE + EA+   R+T 
Sbjct: 415  STIPSEEQIAQLLEELGEAQNELFRSEAEIEELLRQLETLKEAIEALRKTL 465



 Score = 30.8 bits (70), Expect = 9.5
 Identities = 53/294 (18%), Positives = 96/294 (32%), Gaps = 48/294 (16%)

Query: 1759 LKVTEKKLGDEMELKTTLDE-------KKAQLQVYRTIYQDATSHQQDLLQLKDKIESLH 1811
            L    ++        + L E        K Q + Y  + Q+    + +L + +  +ESL 
Sbjct: 191  LTNVLRRRKKSELPSSILSEIEALEAELKEQSEKYEDLAQEIAHLRNELEEAQRSLESLE 250

Query: 1812 QP-----------SEQSKQQLATITTRHGNVLKRAQKKIEAEKDIGDCTNL--------- 1851
            +             EQ ++QL  I         + ++       +    NL         
Sbjct: 251  KKFRSEGGDLFEEREQLERQLKEIEAARKANRAQLRELAADPLPLLLIPNLLDSTKAQLQ 310

Query: 1852 -DDLEKYKALLQEVISHNSDMETLNDKCEALMELAAHSPIREETLRLQALYAALVTSVQG 1910
             ++  +   L QE +      E   +  E+L +LA  +        ++ + A L    + 
Sbjct: 311  KEEQSQQNQLTQEELE-----ERDKELLESLPKLALPAEH------VKEIAAELAEIDKP 359

Query: 1911 LVSQVEKNL----SDQTEFLAKKEEVEAWLHKAHATV-QECDELEAGIKEEQKNIET--Q 1963
              +  E       S+ T+     ++V+  L  A + + +E  ELE  + E  K I T   
Sbjct: 360  ATTDSEIPHRLSGSELTQLEVLIQQVKRELQDAKSQLLKELRELEEELAEVDKKISTIPS 419

Query: 1964 YLQWNSFRETLAQMTAWLDSVEKNIKQ--EIATPWTTTQELRSKLLKLKEILKK 2015
              Q     E L +    L   E  I++            E   K L  K   K 
Sbjct: 420  EEQIAQLLEELGEAQNELFRSEAEIEELLRQLETLKEAIEALRKTLDEKTKQKI 473


>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
            protein, PEP-CTERM locus subfamily.  Members of this
            protein family belong to the family of polysaccharide
            chain length determinant proteins (pfam02706). All are
            found in species that encode the PEP-CTERM/exosortase
            system predicted to act in protein sorting in a number of
            Gram-negative bacteria, and are found near the epsH
            homolog that is the putative exosortase gene [Cell
            envelope, Biosynthesis and degradation of surface
            polysaccharides and lipopolysaccharides].
          Length = 498

 Score = 30.8 bits (70), Expect = 9.3
 Identities = 18/79 (22%), Positives = 42/79 (53%), Gaps = 6/79 (7%)

Query: 2000 QELRSKLLKLKEILKKCEKGVGDHEAFVDKYEECSQKLAQVEAKYAKLI---EPETS-YE 2055
            Q+L+ +L + +  +   E  V +  A +++ E   + + +VEA+  +L    E   S YE
Sbjct: 313  QQLQIELAEAEAEIASLEARVAELTARIERLESLLRTIPEVEAELTQLNRDYEVNKSNYE 372

Query: 2056 ELQKRQGQLTALLTEKVDM 2074
            +L  R+    A +++++++
Sbjct: 373  QLLTRRES--AEVSKQMEV 389


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.311    0.127    0.349 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 106,788,951
Number of extensions: 10696749
Number of successful extensions: 15350
Number of sequences better than 10.0: 1
Number of HSP's gapped: 13895
Number of HSP's successfully gapped: 1006
Length of query: 2174
Length of database: 10,937,602
Length adjustment: 112
Effective length of query: 2062
Effective length of database: 5,969,954
Effective search space: 12310045148
Effective search space used: 12310045148
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 67 (29.7 bits)