RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy15959
         (393 letters)



>gnl|CDD|150430 pfam09755, DUF2046, Uncharacterized conserved protein H4 (DUF2046).
            This is the conserved N-terminal 350 residues of a
           family of proteins of unknown function possibly
           containing a coiled-coil domain.
          Length = 308

 Score =  348 bits (895), Expect = e-119
 Identities = 217/310 (70%), Positives = 261/310 (84%), Gaps = 6/310 (1%)

Query: 2   ADSASESDSNSMDGVALMMPPSPVTREQLQKRIESLQQQNRVLKVELETYKLQLRALQEE 61
           A+  SESDS+     +     S  +RE+L+ RI SL+Q+N+VLK+EL+T+KL+ ++LQEE
Sbjct: 1   AECPSESDSDG----STDEGSSTASRERLRGRIHSLKQENKVLKMELDTFKLKCKSLQEE 56

Query: 62  NKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLN 121
           N+ LRQ SV+IQAKAEQEEEFISNTL+KKIQALKKEKETLA +YEQEEEFLTNDLSRKL 
Sbjct: 57  NRALRQASVSIQAKAEQEEEFISNTLLKKIQALKKEKETLAMNYEQEEEFLTNDLSRKLT 116

Query: 122 QLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQ 181
           QL +EK +LEQT+EQEQE  VNKLMRKI+KLEA+T AKQ +LEQLRREKV+LENTLEQEQ
Sbjct: 117 QLRQEKIQLEQTLEQEQEYQVNKLMRKIDKLEADTLAKQTSLEQLRREKVDLENTLEQEQ 176

Query: 182 EALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDM--NNGDTASNLSTHIHSLRSEV 239
           EALVN+LWKRMDKLEAEKR+LQ KLDQPVS+P SP D+     DT   +++HI SLRSEV
Sbjct: 177 EALVNRLWKRMDKLEAEKRMLQEKLDQPVSEPPSPRDIFTEAEDTVGGIASHITSLRSEV 236

Query: 240 MRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKLQREVERREALCRHLSESESSL 299
            RLR  LA ++ ++TEKME+Y K+E+  REEN+RLQRKL REVERREALCR LSESESSL
Sbjct: 237 RRLRQNLAKSEADYTEKMEQYAKEERQTREENIRLQRKLLREVERREALCRQLSESESSL 296

Query: 300 EMEEERHYNE 309
           EM++ER+YNE
Sbjct: 297 EMDDERYYNE 306


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 64.7 bits (158), Expect = 3e-11
 Identities = 64/291 (21%), Positives = 115/291 (39%), Gaps = 36/291 (12%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
           E+L+++IE L+++   L+  L   + +L  L+EE + LR        K  +E     + L
Sbjct: 680 EELEEKIEELEEKIAELEKALAELRKELEELEEELEQLR--------KELEELSRQISAL 731

Query: 88  MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMR 147
            K +  L+ E E L     Q  +    +L+    ++ E + RLE+  E+  E        
Sbjct: 732 RKDLARLEAEVEQLEERIAQLSK----ELTELEAEIEELEERLEEAEEELAE-----AEA 782

Query: 148 KIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLD 207
           +IE+LEA+ +  +  L+ LR    EL   L    E   N   +R++ LE      + +L+
Sbjct: 783 EIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLR-ERLESLERRIAATERRLE 841

Query: 208 QPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNV 267
                          +    LS  I SL +E+  L   +   + E    +      E+ +
Sbjct: 842 DLE------------EQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEAL 889

Query: 268 REENLRLQ------RKLQREVERREALCRHLSESESSLEMEEERHYNEIAH 312
                 L+      R+L+ +          L E  + LE+  E     I +
Sbjct: 890 ALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDN 940



 Score = 52.8 bits (127), Expect = 2e-07
 Identities = 45/171 (26%), Positives = 78/171 (45%), Gaps = 13/171 (7%)

Query: 26  TREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISN 85
              + +  IE L+ Q   LK EL+  +  L  L+ E   L + + N++ + E  E  I+ 
Sbjct: 776 ELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAA 835

Query: 86  T------LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIE---- 135
           T      L ++I+ L ++ E+LA   E+ EE +  +L  +L  L  E+  LE+ +     
Sbjct: 836 TERRLEDLEEQIEELSEDIESLAAEIEELEELI-EELESELEALLNERASLEEALALLRS 894

Query: 136 --QEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEAL 184
             +E    + +L  K  +L  E +  +  L QL      LE  ++  QE L
Sbjct: 895 ELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERL 945



 Score = 50.4 bits (121), Expect = 1e-06
 Identities = 57/258 (22%), Positives = 98/258 (37%), Gaps = 27/258 (10%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
             QL K +  L+ +   L+  LE  + +L   + E + L      ++ + +   E +   
Sbjct: 749 IAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDE- 807

Query: 87  LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
           L  ++  L +E   L    E   E       R+L  L E+   L + IE      + +L 
Sbjct: 808 LRAELTLLNEEAANLRERLES-LERRIAATERRLEDLEEQIEELSEDIE-SLAAEIEELE 865

Query: 147 RKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKL 206
             IE+LE+E +A  N    L      L + LE+  E L  +L  +  +L  E   L+ KL
Sbjct: 866 ELIEELESELEALLNERASLEEALALLRSELEELSEEL-RELESKRSELRRELEELREKL 924

Query: 207 DQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLAN----TQQEHTEKMERYVK 262
            Q                   L   +  L   +  L+ +L+     T +E      +   
Sbjct: 925 AQ-------------------LELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIED 965

Query: 263 DEKNVREENLRLQRKLQR 280
           DE+  R    RL+ K++ 
Sbjct: 966 DEEEARRRLKRLENKIKE 983



 Score = 47.0 bits (112), Expect = 1e-05
 Identities = 54/260 (20%), Positives = 105/260 (40%), Gaps = 37/260 (14%)

Query: 57  ALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDL 116
            L E  + L+    +++ +AE+ E +         + LK E   L              L
Sbjct: 194 ILNELERQLK----SLERQAEKAERY---------KELKAELRELELAL----------L 230

Query: 117 SRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENT 176
             +L +L EE   L++ +++ +         ++E+L AE    +  LE+LR E  ELE  
Sbjct: 231 VLRLEELREELEELQEELKEAE--------EELEELTAELQELEEKLEELRLEVSELEEE 282

Query: 177 LEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPS--SPHDMNNGDTASNLSTHIHS 234
           +E+ Q+ L   L   + +LE +K+IL+ +L                       L+  +  
Sbjct: 283 IEELQKEL-YALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAE 341

Query: 235 LRSEVMRLRNQLANTQQEHTEKMERYVKDE---KNVREENLRLQRKLQREVERREALCRH 291
           L  ++  L+ +L + + E  E      + E   + + E+   L+ K+ +   +  +L   
Sbjct: 342 LEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNE 401

Query: 292 LSESESSLEMEEERHYNEIA 311
           +   E+ LE  E+R      
Sbjct: 402 IERLEARLERLEDRRERLQQ 421



 Score = 46.2 bits (110), Expect = 2e-05
 Identities = 50/192 (26%), Positives = 100/192 (52%), Gaps = 16/192 (8%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEE----NKNLRQTSVNIQAKAEQEEEF- 82
           + L++R+ +L++Q   L+ +LE  + +L  L EE     + L +    +++   + EE  
Sbjct: 305 QILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELE 364

Query: 83  -ISNTLMKKIQALKKEKETLAHHY---EQEEEFLTNDLSR---KLNQLTEEKYRLEQTIE 135
                L  +++ L+++ ETL       E +   L N++ R   +L +L + + RL+Q IE
Sbjct: 365 AELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIE 424

Query: 136 QEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKL 195
           +  +++      ++++L+AE +  +  LE+L+ E   LE  LE+ +E L  +  + +D  
Sbjct: 425 ELLKKLEEA---ELKELQAELEELEEELEELQEELERLEEALEELREEL-EEAEQALDAA 480

Query: 196 EAEKRILQIKLD 207
           E E   LQ +LD
Sbjct: 481 ERELAQLQARLD 492



 Score = 43.9 bits (104), Expect = 1e-04
 Identities = 32/174 (18%), Positives = 77/174 (44%), Gaps = 8/174 (4%)

Query: 26  TREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISN 85
             E+L+ ++E L+ +   L  EL   + +L  L+EE ++L       + +  + E     
Sbjct: 317 QLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEA-----ELEELEAELEELE 371

Query: 86  TLMKKIQALKKEKETLAHHYEQEEEFLTNDLSR---KLNQLTEEKYRLEQTIEQEQERVV 142
           + +++++   +   +     E +   L N++ R   +L +L + + RL+Q IE+  +++ 
Sbjct: 372 SRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLE 431

Query: 143 NKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLE 196
              +++++    E + +   L++      E    L +E E     L     +L 
Sbjct: 432 EAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELA 485



 Score = 37.0 bits (86), Expect = 0.021
 Identities = 42/181 (23%), Positives = 72/181 (39%), Gaps = 11/181 (6%)

Query: 136 QEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQ------EQEALVNKLW 189
            E+ R + +L  KIE+LE +    +  L +LR+E  ELE  LEQ      E    ++ L 
Sbjct: 673 LERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALR 732

Query: 190 KRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTA--SNLSTHIHSLRSEVMRLRNQLA 247
           K + +LEAE   L+ ++ Q   + +                   +    +E+  L  Q+ 
Sbjct: 733 KDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIE 792

Query: 248 NTQQEHTEKMERYVKDEKNVREEN---LRLQRKLQREVERREALCRHLSESESSLEMEEE 304
             ++E     E   +    +   N     L+ +L+    R  A  R L + E  +E   E
Sbjct: 793 QLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSE 852

Query: 305 R 305
            
Sbjct: 853 D 853


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 57.4 bits (139), Expect = 7e-09
 Identities = 61/277 (22%), Positives = 121/277 (43%), Gaps = 31/277 (11%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
            +L++ +E LQ++    + E+E  K +L  L+EE + L++  + ++ + E+ E  IS   
Sbjct: 249 SRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLR 308

Query: 88  MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLT------EEKYRLEQTIEQEQERV 141
            +  +   + +E      E+  E L   +     +L       EE  +L   +E+ +E +
Sbjct: 309 ERLEELENELEE-----LEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEEL 363

Query: 142 VNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRI 201
             KL   +E+LE   +A +  L +L  E  E+ N LE E +  +  L +R+++L      
Sbjct: 364 EEKLSALLEELEELFEALREELAELEAELAEIRNELE-ELKREIESLEERLERLSERLED 422

Query: 202 LQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYV 261
           L+ +L +                   L   +  L++E+  L  +L   +++  E  +R  
Sbjct: 423 LKEELKE-------------------LEAELEELQTELEELNEELEELEEQLEELRDRLK 463

Query: 262 KDEKNVREENLRLQRKLQREVERREALCRHLSESESS 298
           + E+ + E    LQR  +        L R  +E  +S
Sbjct: 464 ELERELAELQEELQRLEKELSSLEARLDRLEAEQRAS 500



 Score = 55.9 bits (135), Expect = 2e-08
 Identities = 57/273 (20%), Positives = 128/273 (46%), Gaps = 19/273 (6%)

Query: 27   REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
             E+L++ +E L+++   L+  LE  + +L +L+E    L++    ++ K +  +E +   
Sbjct: 739  LEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEE- 797

Query: 87   LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVN--- 143
            L ++++  ++  + L    E  E+     L +++ +L EE   LE+ +++ +E +     
Sbjct: 798  LEEELEEAERRLDALERELESLEQRR-ERLEQEIEELEEEIEELEEKLDELEEELEELEK 856

Query: 144  ---KLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWK------RMDK 194
               +L  ++E+LEAE +  ++ L++L  EK ELE  L + +  L     +      R+++
Sbjct: 857  ELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEE 916

Query: 195  LEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLR--NQLANTQQE 252
            LEA+   L+++L +   +    ++             I  L  E+  L   N  A  + E
Sbjct: 917  LEAKLERLEVELPELEEELEEEYEDTLETELER---EIERLEEEIEALGPVNLRAIEEYE 973

Query: 253  HTEKMERYVKDEKNVREENLRLQRKLQREVERR 285
              E+    +K ++   EE      ++  E+++ 
Sbjct: 974  EVEERYEELKSQREDLEEAKEKLLEVIEELDKE 1006



 Score = 53.9 bits (130), Expect = 9e-08
 Identities = 67/288 (23%), Positives = 124/288 (43%), Gaps = 25/288 (8%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
           ++L++ +  L+ Q   L+ EL+       +L+ E ++L      ++ + E+ E  +    
Sbjct: 670 KELEEELAELEAQLEKLEEELK-------SLKNELRSLEDLLEELRRQLEELERQLEELK 722

Query: 88  MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMR 147
            +     ++ ++  +   E EEE    +L  +L +L E    LE+ +E  +E  + KL  
Sbjct: 723 RELAALEEELEQLQSRLEELEEEL--EELEEELEELQERLEELEEELESLEEA-LAKLKE 779

Query: 148 KIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEAL------VNKLWKRMDKLEAEKRI 201
           +IE+LE +  A Q  LE+L  E  E E  L+  +  L        +L + +++LE E   
Sbjct: 780 EIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEE 839

Query: 202 LQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYV 261
           L+ KLD+   +               L   +  L +E   L ++L   ++E  E  E   
Sbjct: 840 LEEKLDELEEELEELEKELE-----ELKEELEELEAEKEELEDELKELEEEKEELEEEL- 893

Query: 262 KDEKNVREENLRLQRKLQREVERREALCRHLSESESSLEMEEERHYNE 309
              + +  E   L+ ++++  ER E L   L   E  L   EE    E
Sbjct: 894 ---RELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEE 938



 Score = 45.5 bits (108), Expect = 4e-05
 Identities = 57/282 (20%), Positives = 119/282 (42%), Gaps = 25/282 (8%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQEE---NKNLRQTSVNIQAKAEQEEEFI 83
           +E+ ++++E  ++    L+  LE  + QL  L+ +    +  ++    ++          
Sbjct: 174 KEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAK 233

Query: 84  SNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVN 143
              L K+++ L++E   L    E+ +E L  +  +++ +L  E   L + +E+ QE +  
Sbjct: 234 LKELRKELEELEEELSRLEEELEELQEEL-EEAEKEIEELKSELEELREELEELQEEL-L 291

Query: 144 KLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQ 203
           +L  +IE+LE E    +  LE+L  E  ELE  LE+ +E +     + +++ E     L+
Sbjct: 292 ELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALK-EELEERETLLEELE 350

Query: 204 IKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKD 263
             L +                   L      L  ++  L  +L    +   E++     +
Sbjct: 351 QLLAE-------------------LEEAKEELEEKLSALLEELEELFEALREELAELEAE 391

Query: 264 EKNVREENLRLQRKLQREVERREALCRHLSESESSLEMEEER 305
              +R E   L+R+++   ER E L   L + +  L+  E  
Sbjct: 392 LAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAE 433



 Score = 45.1 bits (107), Expect = 5e-05
 Identities = 55/241 (22%), Positives = 107/241 (44%), Gaps = 14/241 (5%)

Query: 73  QAKAEQEEEFISNTLMKKIQALKKEKETLAH---HYEQEEEFLTNDLSRKLNQLTEEKYR 129
             ++   ++     L +++  L+ + E L       + E   L + L     QL E + +
Sbjct: 658 NKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQ 717

Query: 130 LEQTIEQEQ--ERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNK 187
           LE+   +    E  + +L  ++E+LE E +  +  LE+L+    ELE  LE  +EAL  K
Sbjct: 718 LEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEAL-AK 776

Query: 188 LWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLA 247
           L + +++LE +++ LQ +L++   +               L   + SL     RL  ++ 
Sbjct: 777 LKEEIEELEEKRQALQEELEELEEELEE-----AERRLDALERELESLEQRRERLEQEIE 831

Query: 248 NTQQ---EHTEKMERYVKDEKNVREENLRLQRKLQREVERREALCRHLSESESSLEMEEE 304
             ++   E  EK++   ++ + + +E   L+ +L+     +E L   L E E   E  EE
Sbjct: 832 ELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEE 891

Query: 305 R 305
            
Sbjct: 892 E 892



 Score = 40.9 bits (96), Expect = 0.001
 Identities = 30/137 (21%), Positives = 64/137 (46%), Gaps = 9/137 (6%)

Query: 27   REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKA-EQEEEFISN 85
            +E+L++ +  L+ +   LK E+E  + +L  L+ + + L      ++ +  E+ E+ +  
Sbjct: 886  KEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLET 945

Query: 86   TLMKKIQALKKEKETLA-------HHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQ 138
             L ++I+ L++E E L          YE+ EE    +L  +   L E K +L + IE+  
Sbjct: 946  ELEREIERLEEEIEALGPVNLRAIEEYEEVEERY-EELKSQREDLEEAKEKLLEVIEELD 1004

Query: 139  ERVVNKLMRKIEKLEAE 155
            +    +     +K+   
Sbjct: 1005 KEKRERFKETFDKINEN 1021



 Score = 35.8 bits (83), Expect = 0.037
 Identities = 41/165 (24%), Positives = 72/165 (43%), Gaps = 8/165 (4%)

Query: 142 VNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQ-EQEALVNKLWKRMDKLEAEKR 200
           V+K   + E+ E + +  + NLE+L     ELE  LE+ E++A      +R  +L+AE R
Sbjct: 167 VSKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKA---ERYQELKAELR 223

Query: 201 ILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERY 260
            L++ L   +            +    LS     L      L  +     +E   ++E  
Sbjct: 224 ELELAL---LLAKLKELRKELEELEEELSRLEEELEELQEELE-EAEKEIEELKSELEEL 279

Query: 261 VKDEKNVREENLRLQRKLQREVERREALCRHLSESESSLEMEEER 305
            ++ + ++EE L L+ +++        L   L E E+ LE  EER
Sbjct: 280 REELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEER 324



 Score = 31.6 bits (72), Expect = 0.89
 Identities = 27/130 (20%), Positives = 54/130 (41%), Gaps = 6/130 (4%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
             +++  +E L+++   L+  LE    +L  L+EE K L      +Q + E+  E +   
Sbjct: 392 LAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEE- 450

Query: 87  LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
           L ++++ L+   +      E+E   L  +L R   +L+  + RL+  +E EQ        
Sbjct: 451 LEEQLEELRDRLK----ELERELAELQEELQRLEKELSSLEARLD-RLEAEQRASQGVRA 505

Query: 147 RKIEKLEAET 156
                     
Sbjct: 506 VLEALESGLP 515


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 50.5 bits (121), Expect = 1e-06
 Identities = 59/280 (21%), Positives = 123/280 (43%), Gaps = 26/280 (9%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
           ++L++R+E L++              +L +LQ E + +      +  +       I   +
Sbjct: 677 QRLRERLEGLKR--------------ELSSLQSELRRIENRLDELSQELSDASRKIG-EI 721

Query: 88  MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMR 147
            K+I+ L++E+E L    E+ EE L + L +++  +  E   LE  IE E E  ++KL  
Sbjct: 722 EKEIEQLEQEEEKLKERLEELEEDL-SSLEQEIENVKSELKELEARIE-ELEEDLHKLEE 779

Query: 148 KIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLD 207
            +  LEA      + + +++ E  +LE  +    EA + ++ +++++L  EK  L+ ++ 
Sbjct: 780 ALNDLEARLS--HSRIPEIQAELSKLEEEV-SRIEARLREIEQKLNRLTLEKEYLEKEIQ 836

Query: 208 QPVSDPSSPHDM--NNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEK 265
           +         +   +      NL+     L  E+  L   L     +   ++    K+  
Sbjct: 837 ELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALR----DLESRLGDLKKERD 892

Query: 266 NVREENLRLQRKLQREVERREALCRHLSESESSLEMEEER 305
            +  +   L+RK++    + E   + LSE ++ LE  EE 
Sbjct: 893 ELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEE 932



 Score = 49.7 bits (119), Expect = 2e-06
 Identities = 39/186 (20%), Positives = 84/186 (45%), Gaps = 17/186 (9%)

Query: 26  TREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISN 85
              +L+  I+ L  +   L+ E+E  + +   L EE   L++   +++A+ E+ ++    
Sbjct: 323 RLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDK-EFA 381

Query: 86  TLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKL 145
               +++  +++ E L            N+L R+L++L EE  RL     +E   +   +
Sbjct: 382 ETRDELKDYREKLEKLKREI--------NELKRELDRLQEELQRL----SEELADLNAAI 429

Query: 146 MRKIEK---LEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRIL 202
                K   LE E + K   +++   +  +L   L + ++ L + L +  D++E E   L
Sbjct: 430 AGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYD-LKEEYDRVEKELSKL 488

Query: 203 QIKLDQ 208
           Q +L +
Sbjct: 489 QRELAE 494



 Score = 48.5 bits (116), Expect = 5e-06
 Identities = 46/226 (20%), Positives = 99/226 (43%), Gaps = 13/226 (5%)

Query: 29  QLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLM 88
           +++++I  L+ +   L+  +   + +L   +E    L +    I     + EE     L 
Sbjct: 291 RVKEKIGELEAEIASLERSIAEKERELEDAEER---LAKLEAEIDKLLAEIEE-----LE 342

Query: 89  KKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRK 148
           ++I+  +K ++ L   Y + +E L  DL  +L ++ +E       ++  +E+ + KL R+
Sbjct: 343 REIEEERKRRDKLTEEYAELKEEL-EDLRAELEEVDKEFAETRDELKDYREK-LEKLKRE 400

Query: 149 IEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQ 208
           I +L+ E D  Q  L++L  E  +L   +    EA +N+L +  +    E +  + KL+Q
Sbjct: 401 INELKRELDRLQEELQRLSEELADLNAAIA-GIEAKINELEEEKEDKALEIKKQEWKLEQ 459

Query: 209 PVSDPSSPHDMNNGDTA--SNLSTHIHSLRSEVMRLRNQLANTQQE 252
             +D S               +   +  L+ E+     Q   +++ 
Sbjct: 460 LAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEER 505



 Score = 47.8 bits (114), Expect = 9e-06
 Identities = 54/286 (18%), Positives = 112/286 (39%), Gaps = 31/286 (10%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
              LQ  +  ++ +   L  EL     ++  +++E + L Q    ++ + E+ EE +S +
Sbjct: 690 LSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLS-S 748

Query: 87  LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
           L ++I+ +K E + L    E+ EE         L++L E    LE  +   +   +   +
Sbjct: 749 LEQEIENVKSELKELEARIEELEE--------DLHKLEEALNDLEARLSHSRIPEIQAEL 800

Query: 147 RKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKL----------WKRMDKLE 196
            K+E+  +  +A+   +EQ        +  LE+E + L  +            K ++ L 
Sbjct: 801 SKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLN 860

Query: 197 AEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEK 256
            +K  L+ +L+                   +L + +  L+ E   L  QL   +++  E 
Sbjct: 861 GKKEELEEELE------------ELEAALRDLESRLGDLKKERDELEAQLRELERKIEEL 908

Query: 257 MERYVKDEKNVREENLRLQRKLQREVERREALCRHLSESESSLEME 302
             +  K  K + E   +L+   +   E  +         E  L +E
Sbjct: 909 EAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLE 954



 Score = 46.2 bits (110), Expect = 3e-05
 Identities = 57/276 (20%), Positives = 111/276 (40%), Gaps = 49/276 (17%)

Query: 36  SLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALK 95
           S   + + L+  LE  K +L +LQ E + +      +  +       I   + K+I+ L+
Sbjct: 671 SEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIG-EIEKEIEQLE 729

Query: 96  KEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAE 155
           +E+E L    E+ EE L + L +++  +  E   LE  IE E E  ++KL   +  LEA 
Sbjct: 730 QEEEKLKERLEELEEDL-SSLEQEIENVKSELKELEARIE-ELEEDLHKLEEALNDLEAR 787

Query: 156 TDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSS 215
                + + +++ E  +LE  +    EA + ++ +++++L  EK  L+            
Sbjct: 788 LS--HSRIPEIQAELSKLEEEV-SRIEARLREIEQKLNRLTLEKEYLE------------ 832

Query: 216 PHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQ 275
                                 E+  L+ Q  + +++                 ENL  +
Sbjct: 833 ---------------------KEIQELQEQRIDLKEQ---------IKSIEKEIENLNGK 862

Query: 276 -RKLQREVERREALCRHLSESESSLEMEEERHYNEI 310
             +L+ E+E  EA  R L      L+ E +    ++
Sbjct: 863 KEELEEELEELEAALRDLESRLGDLKKERDELEAQL 898



 Score = 45.1 bits (107), Expect = 5e-05
 Identities = 62/264 (23%), Positives = 121/264 (45%), Gaps = 37/264 (14%)

Query: 54  QLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLT 113
           QL  L+ E    R+ +   QA  +++ E+    L+K+ +AL+++KE +       EE L 
Sbjct: 199 QLERLRRE----REKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELE 254

Query: 114 NDLSRKLNQLTEEKYRLEQTIEQEQERV-------VNKLMRKIEKLEAETDAKQNNLEQL 166
             L+ ++++L +    +EQ +E+  +++         ++  KI +LEAE  + + ++ + 
Sbjct: 255 -KLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEK 313

Query: 167 RREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTAS 226
            RE  + E  L  + EA ++KL   +++LE E    + + D+                  
Sbjct: 314 ERELEDAEERL-AKLEAEIDKLLAEIEELEREIEEERKRRDK------------------ 354

Query: 227 NLSTHIHSLRSEVMRLRNQLANTQQEHT----EKMERYVKDEKNVREENLRLQRKLQREV 282
            L+     L+ E+  LR +L    +E      E  +   K EK  RE N  L+R+L R  
Sbjct: 355 -LTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREIN-ELKRELDRLQ 412

Query: 283 ERREALCRHLSESESSLEMEEERH 306
           E  + L   L++  +++   E + 
Sbjct: 413 EELQRLSEELADLNAAIAGIEAKI 436



 Score = 40.8 bits (96), Expect = 0.001
 Identities = 32/141 (22%), Positives = 73/141 (51%), Gaps = 8/141 (5%)

Query: 26  TREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISN 85
             E L+  +E + ++    + EL+ Y+ +L  L+ E   L++    +Q + ++  E +++
Sbjct: 365 ELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELAD 424

Query: 86  TLMK------KIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQE 139
                     KI  L++EKE  A   +++E  L   L+  L++  +E Y L++  ++ ++
Sbjct: 425 LNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQ-LAADLSKYEQELYDLKEEYDRVEK 483

Query: 140 RVVNKLMRKIEKLEAETDAKQ 160
            + +KL R++ + EA+  A +
Sbjct: 484 EL-SKLQRELAEAEAQARASE 503



 Score = 39.7 bits (93), Expect = 0.003
 Identities = 40/170 (23%), Positives = 76/170 (44%), Gaps = 28/170 (16%)

Query: 27   REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAE--------- 77
             + ++K IE+L  +   L+ ELE  +  LR L+    +L++    ++A+           
Sbjct: 849  IKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEEL 908

Query: 78   ----QEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQT 133
                +++    + L  K++AL++E   +     ++EE    +LS  L  +  E  R+E+ 
Sbjct: 909  EAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELS--LEDVQAELQRVEEE 966

Query: 134  IE----------QEQERVV---NKLMRKIEKLEAETDAKQNNLEQLRREK 170
            I           QE E V+   ++L  K  KLE E  A    +E+  ++K
Sbjct: 967  IRALEPVNMLAIQEYEEVLKRLDELKEKRAKLEEERKAILERIEEYEKKK 1016



 Score = 37.0 bits (86), Expect = 0.019
 Identities = 45/191 (23%), Positives = 79/191 (41%), Gaps = 32/191 (16%)

Query: 126 EKYRLEQTIEQEQE-----RVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQE 180
           E+Y+     ++E E     +    L R+ E +E +  + +  LE+L  E  ELE  LE E
Sbjct: 211 ERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLE-E 269

Query: 181 QEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVM 240
            E L+ +L K++  L  E+  L++K                          I  L +E+ 
Sbjct: 270 IEQLLEELNKKIKDL-GEEEQLRVK------------------------EKIGELEAEIA 304

Query: 241 RLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKLQREVERREALCRHLSESESSLE 300
            L   +A  ++E  E  E  +   +   ++ L    +L+RE+E        L+E  + L+
Sbjct: 305 SLERSIAEKERE-LEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELK 363

Query: 301 MEEERHYNEIA 311
            E E    E+ 
Sbjct: 364 EELEDLRAELE 374



 Score = 34.7 bits (80), Expect = 0.10
 Identities = 45/227 (19%), Positives = 89/227 (39%), Gaps = 27/227 (11%)

Query: 90  KIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKI 149
           + QAL KEK      YE  E      L ++   L  +K  +E+ +    E  + KL  +I
Sbjct: 212 RYQALLKEKR----EYEGYE------LLKEKEALERQKEAIERQL-ASLEEELEKLTEEI 260

Query: 150 EKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQP 209
            +LE   +  +  LE+L ++   +++  E+EQ  +  K+ +   ++ + +R +  K  + 
Sbjct: 261 SELEKRLEEIEQLLEELNKK---IKDLGEEEQLRVKEKIGELEAEIASLERSIAEKEREL 317

Query: 210 VSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVRE 269
                   D       + L   I  L +E+  L  ++   ++   +  E Y      ++E
Sbjct: 318 -------EDAEE--RLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEY----AELKE 364

Query: 270 ENLRLQRKLQREVERREALCRHLSESESSLEMEEERHYNEIAHSMRL 316
           E   L+ +L+   +        L +    LE  +           RL
Sbjct: 365 ELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRL 411



 Score = 31.6 bits (72), Expect = 0.88
 Identities = 38/184 (20%), Positives = 71/184 (38%), Gaps = 30/184 (16%)

Query: 147 RKIEKLEAETDAKQNNLEQLRREKVELENTLE-----------------QEQEALVNKLW 189
             IE+L+   D K+  LE+LRRE+ + E                     +  E     + 
Sbjct: 184 ENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIE 243

Query: 190 KRMDKLEAEKRILQIKLD----------QPVSDPSSPHDMNNGDTASNLSTHIHSLRSEV 239
           +++  LE E   L  ++           Q + + +        +    +   I  L +E+
Sbjct: 244 RQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEI 303

Query: 240 MRLRNQLANTQQEHTEKMERYVKDE---KNVREENLRLQRKLQREVERREALCRHLSESE 296
             L   +A  ++E  +  ER  K E     +  E   L+R+++ E +RR+ L    +E +
Sbjct: 304 ASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELK 363

Query: 297 SSLE 300
             LE
Sbjct: 364 EELE 367


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 47.4 bits (113), Expect = 9e-06
 Identities = 58/286 (20%), Positives = 111/286 (38%), Gaps = 14/286 (4%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
           + +L +      +  ++ + EL   +  L  L+E+ + L +    I+   E+ E      
Sbjct: 255 KARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGL---- 310

Query: 87  LMKKIQALKKEKETLAHHYEQEEEFLT--NDLSRKLNQLTEEKYRLEQTIEQEQERVVNK 144
                +AL +E E L    +  EE L    +   KL    EE    +  + +  E  + +
Sbjct: 311 -----RALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKE 365

Query: 145 LMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQI 204
           L  ++E+LE E +     L+QL     EL+  L  E  A + ++ + +++LE E   L+ 
Sbjct: 366 LEERLEELEKELEKALERLKQLEEAIQELKEELA-ELSAALEEIQEELEELEKELEELER 424

Query: 205 KLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDE 264
           +L++   +     +  N     +    I  L     +         +EH +++    + E
Sbjct: 425 ELEELEEEIKKLEEQINQ--LESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELE 482

Query: 265 KNVREENLRLQRKLQREVERREALCRHLSESESSLEMEEERHYNEI 310
               EE L  +++     E  E L + L E E  L    E      
Sbjct: 483 LEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALK 528



 Score = 47.4 bits (113), Expect = 9e-06
 Identities = 62/283 (21%), Positives = 117/283 (41%), Gaps = 34/283 (12%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQE----ENKNLRQTSVNIQAKAEQEEEFI 83
           E+ +K  E L++  +  K ++E  + QL  L E      + L +    ++   E +EE  
Sbjct: 167 EKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQE 226

Query: 84  SNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVN 143
              L ++I+AL++    L    E+ EE              + +    +++E E  ++  
Sbjct: 227 EEELEQEIEALEERLAELEEEKERLEE-------------LKARLLEIESLELEALKIRE 273

Query: 144 KLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQ 203
           + +R++E+L  E + K   LE+L RE  ELE  LE  +  L     + ++KL++ +  L+
Sbjct: 274 EELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELE-ELLEKLKSLEERLE 332

Query: 204 IKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKD 263
              ++                   L + +  L  E   L   L    +E  E++E   K+
Sbjct: 333 KLEEK----------------LEKLESELEELAEEKNELAKLLEERLKELEERLEELEKE 376

Query: 264 EKNVREENLRLQRKLQREVERREALCRHLSESESSLEMEEERH 306
            +   E   +L+  +Q   E    L   L E +  LE  E+  
Sbjct: 377 LEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKEL 419



 Score = 43.2 bits (102), Expect = 2e-04
 Identities = 46/255 (18%), Positives = 104/255 (40%), Gaps = 23/255 (9%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
              L + +E L ++ + L+  LE  + +L  L+ E + L +                   
Sbjct: 310 LRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKN---------------- 353

Query: 87  LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
             +  + L++  + L    E+ E+ L   L R      EE  +  +    E    + ++ 
Sbjct: 354 --ELAKLLEERLKELEERLEELEKELEKALERLKQ--LEEAIQELKEELAELSAALEEIQ 409

Query: 147 RKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQ--EALVNKLWKRMDKLEAEKRILQI 204
            ++E+LE E +  +  LE+L  E  +LE  + Q +  E ++ +L    +K     + L  
Sbjct: 410 EELEELEKELEELERELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCPVCGQELPE 469

Query: 205 KLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDE 264
           + ++ + +     ++   +   +       LR E+  L  +L   ++E  E +E     +
Sbjct: 470 EHEKELLELYE-LELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALK 528

Query: 265 KNVREENLRLQRKLQ 279
           + + E+  +L+  L+
Sbjct: 529 EELEEKLEKLENLLE 543



 Score = 42.4 bits (100), Expect = 4e-04
 Identities = 56/292 (19%), Positives = 113/292 (38%), Gaps = 16/292 (5%)

Query: 26  TREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQA------KAEQE 79
             E+  +R+E L+++   L+ ELE  +  L  L+E  + L+     ++       K E E
Sbjct: 285 ELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESE 344

Query: 80  EEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQE 139
            E ++    +  + L++  + L    E+ E+ L   L R            E   E ++E
Sbjct: 345 LEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQ-------LEEAIQELKEE 397

Query: 140 RVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEK 199
               +L   +E+++ E +  +  LE+L RE  ELE  +++ +E +     K +   E   
Sbjct: 398 LA--ELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQLESKELMIAELAG 455

Query: 200 RILQIKL-DQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKME 258
              +  +  Q + +      +   +          S   E   LR ++   ++E  E  E
Sbjct: 456 AGEKCPVCGQELPEEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEE 515

Query: 259 RYVKDEKNVREENLRLQRKLQREVERREALCRHLSESESSLEMEEERHYNEI 310
             ++  +        L+ KL++     E L     + +     EE R   + 
Sbjct: 516 ELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDR 567



 Score = 40.9 bits (96), Expect = 0.001
 Identities = 65/290 (22%), Positives = 122/290 (42%), Gaps = 13/290 (4%)

Query: 26  TREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISN 85
             +QL++ I+ L+++   L   LE  + +L  L++E + L +    ++ + ++ EE I+ 
Sbjct: 383 RLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQ 442

Query: 86  TLMKK--IQALKKEKET----LAHHYEQEEEFLTNDLSRKLNQLTEE--KYRLEQTIEQE 137
              K+  I  L    E          E+ E+ L      +L +L EE  + + E  + +E
Sbjct: 443 LESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSREKEEAELREE 502

Query: 138 QERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEA 197
            E +  +L R++E+   E    +  L++   EK+E    L +E E L  KL     +L+ 
Sbjct: 503 IEELEKEL-RELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQL--QQLKE 559

Query: 198 EKRILQIKLDQPVSDPSSPHDMNN-GDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEK 256
           E R L+ +L +          +    +    L   +  L+ ++  L  +L+  ++     
Sbjct: 560 ELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSL 619

Query: 257 MERYVKDEKNVREENLR-LQRKLQREVERREALCRHLSESESSLEMEEER 305
                ++E    EE L     KL  + E  E L   L E E  +E  E  
Sbjct: 620 ELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAE 669



 Score = 39.7 bits (93), Expect = 0.002
 Identities = 63/280 (22%), Positives = 122/280 (43%), Gaps = 14/280 (5%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQE--ENKNLRQTSVNIQAKAEQEEEFIS 84
            E+L+ ++  L +    L   LE    +L+ L+E  E +   +    I+A  E+  E   
Sbjct: 187 IEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEE 246

Query: 85  --NTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIE--QEQER 140
               L +    L + +       +  EE L  +L R L +L E+  RLE+     +E E 
Sbjct: 247 EKERLEELKARLLEIESLELEALKIREEEL-RELERLLEELEEKIERLEELEREIEELEE 305

Query: 141 VVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKR 200
            +  L   +E+LE   +  ++  E+L + + +LE  LE E E L  +  +    LE   +
Sbjct: 306 ELEGLRALLEELEELLEKLKSLEERLEKLEEKLE-KLESELEELAEEKNELAKLLEERLK 364

Query: 201 ILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERY 260
            L+ +L++   +          +    L   I  L+ E+  L   L   Q+E  E++E+ 
Sbjct: 365 ELEERLEELEKELE-----KALERLKQLEEAIQELKEELAELSAALEEIQEE-LEELEKE 418

Query: 261 VKDEKNVREENLRLQRKLQREVERREALCRHLSESESSLE 300
           +++ +   EE     +KL+ ++ + E+    ++E   + E
Sbjct: 419 LEELERELEELEEEIKKLEEQINQLESKELMIAELAGAGE 458



 Score = 36.3 bits (84), Expect = 0.026
 Identities = 29/157 (18%), Positives = 69/157 (43%), Gaps = 1/157 (0%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEE-EFISNT 86
              ++ +E L+++ + LK +L+  + +L  L+E  ++L  +    + +  +EE E     
Sbjct: 582 RTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELEK 641

Query: 87  LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
           L  + +  +  +  L    E+ EE              EE+   +    ++ E  + +L 
Sbjct: 642 LNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLR 701

Query: 147 RKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEA 183
            ++E+L  +    +  +E+L   K ELE   ++ ++ 
Sbjct: 702 EELEELLKKLGEIEQLIEELESRKAELEELKKELEKL 738


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 45.5 bits (109), Expect = 3e-05
 Identities = 35/125 (28%), Positives = 66/125 (52%), Gaps = 15/125 (12%)

Query: 74  AKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEF--LTNDLSRKLNQLTEEKYRLE 131
            +AE+E + I     K+ +A+KKE        E +EE   L N+  ++L +   E  +LE
Sbjct: 34  KEAEEEAKRILEEAKKEAEAIKKEALL-----EAKEEIHKLRNEFEKELRERRNELQKLE 88

Query: 132 QTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKR 191
           + + Q++E     L RK+E LE     ++  LE+  +E  + +  LE+++E L   + ++
Sbjct: 89  KRLLQKEEN----LDRKLELLEK----REEELEKKEKELEQKQQELEKKEEELEELIEEQ 140

Query: 192 MDKLE 196
           + +LE
Sbjct: 141 LQELE 145



 Score = 32.8 bits (76), Expect = 0.30
 Identities = 21/94 (22%), Positives = 46/94 (48%), Gaps = 7/94 (7%)

Query: 122 QLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREK----VELENTL 177
           ++ E +   ++ +E+ ++         + + + E    +N  E+  RE+     +LE  L
Sbjct: 32  KIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRL 91

Query: 178 EQEQEALVNK---LWKRMDKLEAEKRILQIKLDQ 208
            Q++E L  K   L KR ++LE +++ L+ K  +
Sbjct: 92  LQKEENLDRKLELLEKREEELEKKEKELEQKQQE 125



 Score = 32.8 bits (76), Expect = 0.32
 Identities = 43/139 (30%), Positives = 70/139 (50%), Gaps = 16/139 (11%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
           +E++ K     +++ R  + EL+  KL+ R LQ+E  NL +    ++ K E+E E     
Sbjct: 63  KEEIHKLRNEFEKELRERRNELQ--KLEKRLLQKEE-NLDRKLELLE-KREEELEKKEKE 118

Query: 87  LMKKIQAL-KKEKETLAHHYEQEEEFLTNDLSRKLNQLTEE--KYRLEQTIEQEQERVVN 143
           L +K Q L KKE+E      EQ +E        +++ LT E  K  L + +E+E      
Sbjct: 119 LEQKQQELEKKEEELEELIEEQLQELE------RISGLTAEEAKEILLEKVEEEARHEAA 172

Query: 144 KLMRKIE---KLEAETDAK 159
            L+++IE   K EA+  AK
Sbjct: 173 VLIKEIEEEAKEEADKKAK 191



 Score = 30.5 bits (70), Expect = 1.8
 Identities = 13/56 (23%), Positives = 26/56 (46%), Gaps = 4/56 (7%)

Query: 251 QEHTEKMERYVKD-EKNVREENLRLQRKLQREVERREALCRHLSESESSLEMEEER 305
           +E  ++ E   K+     +EE  +L+ + ++E+  R      L + E  L  +EE 
Sbjct: 45  EEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERR---NELQKLEKRLLQKEEN 97


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 45.1 bits (107), Expect = 6e-05
 Identities = 57/275 (20%), Positives = 121/275 (44%), Gaps = 29/275 (10%)

Query: 47  ELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYE 106
           ELE  K ++  L++E ++L  +   ++ K  + EE I   L K+I+ L+++ + L    E
Sbjct: 232 ELEELKEEIEELEKELESLEGSKRKLEEKIRELEERI-EELKKEIEELEEKVKELKELKE 290

Query: 107 QEEEFLTNDLSRKLNQLTEEKYRLEQTIE--QEQERVVNKLMRKIEKLEAETDAKQNNLE 164
           + EE++   LS    +  +E   +E+ +   +E+   + + ++++E+ E   +  +  L+
Sbjct: 291 KAEEYIK--LSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLK 348

Query: 165 QLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDT 224
           +L +   ELE   E  +EA   K  + +++L+     L  +  +   +          + 
Sbjct: 349 ELEKRLEELEERHELYEEAKAKK--EELERLKKRLTGLTPEKLEKELEELEKAKEEIEEE 406

Query: 225 ASNLSTHIHSLRSEVMRLRNQLAN---------------TQQEHTEKMERYVKDEKNVRE 269
            S ++  I  L+ E+  L+  +                 T++   E +E Y  + K + +
Sbjct: 407 ISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEEYTAELKRIEK 466

Query: 270 ENLRLQRKLQREVERREALCRHLSESESSLEMEEE 304
           E       L+   E+   L + L E E  L+ E E
Sbjct: 467 E-------LKEIEEKERKLRKELRELEKVLKKESE 494



 Score = 38.9 bits (91), Expect = 0.005
 Identities = 53/262 (20%), Positives = 116/262 (44%), Gaps = 22/262 (8%)

Query: 47  ELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISN-TLMKKIQALKKEKETLAHHY 105
           EL+  + +L+ ++E+ + LR+    ++   ++E E I    L ++++ L+++ +      
Sbjct: 460 ELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEE 519

Query: 106 EQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQE--RVVNKLMRKIEKLEAETDAKQNNL 163
            +++      L  KL +L  E   L++ +E+ +E  + + +L +K+++LE E       L
Sbjct: 520 LEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKEL 579

Query: 164 EQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGD 223
           E+L  E VE      +E E   N+ +  +   E E    + +L +              +
Sbjct: 580 EELGFESVEELEERLKELEPFYNE-YLELKDAEKELEREEKELKK------------LEE 626

Query: 224 TASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKLQREVE 283
                   +      +  LR +L   +++++E+      + + +REE L L R+L     
Sbjct: 627 ELDKAFEELAETEKRLEELRKELEELEKKYSEE------EYEELREEYLELSRELAGLRA 680

Query: 284 RREALCRHLSESESSLEMEEER 305
             E L +   E + +LE  +E 
Sbjct: 681 ELEELEKRREEIKKTLEKLKEE 702



 Score = 36.2 bits (84), Expect = 0.028
 Identities = 34/179 (18%), Positives = 82/179 (45%), Gaps = 30/179 (16%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
            E+ ++R+E L+++ + L+  LE  + +    +E     +     ++   ++        
Sbjct: 333 LEEKEERLEELKKKLKELEKRLEELEERHELYEE----AKAKKEELERLKKRLTGLTPEK 388

Query: 87  LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTI------------ 134
           L K+++ L+K KE +      EEE   + ++ ++ +L +E   L++ I            
Sbjct: 389 LEKELEELEKAKEEI------EEEI--SKITARIGELKKEIKELKKAIEELKKAKGKCPV 440

Query: 135 ------EQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNK 187
                 E+ ++ ++ +   +++++E E    +    +LR+E  ELE  L++E E +  K
Sbjct: 441 CGRELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLK 499



 Score = 33.5 bits (77), Expect = 0.20
 Identities = 63/275 (22%), Positives = 119/275 (43%), Gaps = 34/275 (12%)

Query: 32  KRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKI 91
           + +E + ++   L    E  + QL+ L+E+ K       N++   ++ EE+    L +K+
Sbjct: 483 RELEKVLKKESELIKLKELAE-QLKELEEKLK-----KYNLEELEKKAEEY--EKLKEKL 534

Query: 92  QALKKEKETLAHHYEQEEEFLTN--DLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKI 149
             LK E ++L    E+ EE      +L +KL++L EE   L + +E+     V +L  ++
Sbjct: 535 IKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERL 594

Query: 150 EKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQP 209
           ++LE     +   L+   +E    E  L++ +E L     + + + E     L+ +L++ 
Sbjct: 595 KELE-PFYNEYLELKDAEKELEREEKELKKLEEELDKAF-EELAETEKRLEELRKELEEL 652

Query: 210 VSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVRE 269
               S        +    L      L  E+  LR +L               ++ +  RE
Sbjct: 653 EKKYSE-------EEYEELREEYLELSRELAGLRAEL---------------EELEKRRE 690

Query: 270 ENLRLQRKLQREVERREALCRHLSESESSLEMEEE 304
           E  +   KL+ E+E RE   + L + E +LE  EE
Sbjct: 691 EIKKTLEKLKEELEEREKAKKELEKLEKALERVEE 725



 Score = 31.2 bits (71), Expect = 1.1
 Identities = 45/222 (20%), Positives = 87/222 (39%), Gaps = 46/222 (20%)

Query: 85  NTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNK 144
             L + I+ +K+  E L    ++ E             + E++  LE+ +     R +N+
Sbjct: 165 KNLGEVIKEIKRRIERLEKFIKRTENIE--------ELIKEKEKELEEVL-----REINE 211

Query: 145 LMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQI 204
           +  ++ +L  E       LE+L +E  ELE   E      + +L K ++ LE  KR L+ 
Sbjct: 212 ISSELPELREE-------LEKLEKEVKELEELKE-----EIEELEKELESLEGSKRKLEE 259

Query: 205 KLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQ--QEHTEKMERYVK 262
           K+                     L   I  L+ E+  L  ++   +  +E  E+  +  +
Sbjct: 260 KI-------------------RELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSE 300

Query: 263 DEKNVREENLRLQRKLQREVERREALCRHLSESESSLEMEEE 304
             +   +E   ++++L R  E    +   + E E   E  EE
Sbjct: 301 FYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEE 342



 Score = 31.2 bits (71), Expect = 1.3
 Identities = 41/163 (25%), Positives = 77/163 (47%), Gaps = 10/163 (6%)

Query: 53  LQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT--LMKKIQALKKEKETLAHHYEQEEE 110
           L L   +   KNL +    I+ + E+ E+FI  T  + + I+  +KE E +     +   
Sbjct: 155 LGLDDYENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISS 214

Query: 111 FLTNDLSRKLNQLTEEKYRLEQTIEQ--EQERVVNKLMRKIEKLEAETDAKQNNLEQLRR 168
            L  +L  +L +L +E   LE+  E+  E E+ +  L     KLE +    +  +E+L++
Sbjct: 215 ELP-ELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKK 273

Query: 169 EKVELENTLE-----QEQEALVNKLWKRMDKLEAEKRILQIKL 206
           E  ELE  ++     +E+     KL +  ++   E R ++ +L
Sbjct: 274 EIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRL 316


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 40.8 bits (96), Expect = 4e-04
 Identities = 31/93 (33%), Positives = 49/93 (52%), Gaps = 9/93 (9%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
             +LQ+++  LQQ+N+ LK EL T + +L  LQ+E   ++Q S N     E+  E     
Sbjct: 82  LAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEENRE----- 136

Query: 87  LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRK 119
           L +++  LK+E E L    E E E L  +  R+
Sbjct: 137 LREELAELKQENEAL----EAENERLQENEQRR 165



 Score = 38.1 bits (89), Expect = 0.004
 Identities = 27/127 (21%), Positives = 50/127 (39%), Gaps = 21/127 (16%)

Query: 39  QQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEK 98
                 +  L   + +L  LQEE   L++    +Q +           L +++  L+ E 
Sbjct: 59  SDTPSARERLPELQQELAELQEELAELQEQLAELQQEN--------QELKQELSTLEAEL 110

Query: 99  ETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDA 158
           E L     + ++     LS    +L EE   L + + +        L ++ E LEAE + 
Sbjct: 111 ERLQKELARIKQ-----LSANAIELDEENRELREELAE--------LKQENEALEAENER 157

Query: 159 KQNNLEQ 165
            Q N ++
Sbjct: 158 LQENEQR 164



 Score = 31.9 bits (73), Expect = 0.38
 Identities = 26/130 (20%), Positives = 48/130 (36%), Gaps = 28/130 (21%)

Query: 52  KLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEF 111
           + +L  LQ+E   L++    +Q +  +        L ++ Q LK+E  TL          
Sbjct: 65  RERLPELQQELAELQEELAELQEQLAE--------LQQENQELKQELSTL---------- 106

Query: 112 LTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKV 171
                  +L +L +E  R++Q      E     L  +  +L  E    +   E L  E  
Sbjct: 107 -----EAELERLQKELARIKQLSANAIE-----LDEENRELREELAELKQENEALEAENE 156

Query: 172 ELENTLEQEQ 181
            L+   ++  
Sbjct: 157 RLQENEQRRW 166


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 42.1 bits (99), Expect = 5e-04
 Identities = 35/140 (25%), Positives = 68/140 (48%), Gaps = 8/140 (5%)

Query: 69  SVNIQAK---AEQEEEFISNTLMKKIQALKKEKETLAHHYE---QEEEFLTNDLSRKLNQ 122
            +  QAK    E +EE   N L++K+ AL+KE E    H E   +E+E L  +L +++ +
Sbjct: 497 FIIEQAKTFYGEFKEEI--NVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEE 554

Query: 123 LTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQE 182
           L E +   +  +E+E +  +  L +++E +  E   K+ +  +  +   +L    E +Q+
Sbjct: 555 LKERERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKLKETKQK 614

Query: 183 ALVNKLWKRMDKLEAEKRIL 202
                   + DK+  + RI 
Sbjct: 615 IPQKPTNFQADKIGDKVRIR 634



 Score = 29.8 bits (67), Expect = 3.4
 Identities = 16/76 (21%), Positives = 36/76 (47%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
           E++   IE L    + L+ + E  +  L+  ++  K L Q    ++ +   ++  +    
Sbjct: 511 EEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEA 570

Query: 88  MKKIQALKKEKETLAH 103
            + ++ALKKE E++  
Sbjct: 571 QEALKALKKEVESIIR 586



 Score = 29.4 bits (66), Expect = 4.3
 Identities = 21/72 (29%), Positives = 41/72 (56%), Gaps = 11/72 (15%)

Query: 137 EQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLE 196
           E +  +N L+ K+  LE E + K  +LE+L +E+ +L+  LEQE           M++L+
Sbjct: 508 EFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQE-----------MEELK 556

Query: 197 AEKRILQIKLDQ 208
             +R  +++L++
Sbjct: 557 ERERNKKLELEK 568


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 41.4 bits (98), Expect = 6e-04
 Identities = 33/123 (26%), Positives = 56/123 (45%), Gaps = 9/123 (7%)

Query: 66  RQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTE 125
           ++ +      AE+  + I     K+ + LKKE    A   ++E   L  +L R+L +   
Sbjct: 20  KRIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLEA---KEEVHKLRAELERELKERRN 76

Query: 126 EKYRLEQTIEQEQERV------VNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQ 179
           E  RLE+ + Q +E +      ++K    +EK E E   K+ NL++   E  EL     +
Sbjct: 77  ELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQRE 136

Query: 180 EQE 182
           E E
Sbjct: 137 ELE 139



 Score = 30.7 bits (70), Expect = 1.4
 Identities = 35/138 (25%), Positives = 61/138 (44%), Gaps = 32/138 (23%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
           R +LQ+    L Q+   L  ++E+   +   L+++ K L     N+  K E+ EE I+  
Sbjct: 75  RNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIA-- 132

Query: 87  LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYR--LEQTIEQEQERVVNK 144
                              EQ EE        +++ LT+E+ +  L + +E+E      K
Sbjct: 133 -------------------EQREEL------ERISGLTQEEAKEILLEEVEEEARHEAAK 167

Query: 145 LMRKIE---KLEAETDAK 159
           L+++IE   K EA+  AK
Sbjct: 168 LIKEIEEEAKEEADKKAK 185



 Score = 30.7 bits (70), Expect = 1.5
 Identities = 19/73 (26%), Positives = 37/73 (50%), Gaps = 6/73 (8%)

Query: 236 RSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKLQREVERREALCR---HL 292
           + EV +LR +L   ++E  E+     + E+ + +    L RK++   ++ E L +    L
Sbjct: 57  KEEVHKLRAEL---ERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKEL 113

Query: 293 SESESSLEMEEER 305
           S  E +L+ +EE 
Sbjct: 114 SNKEKNLDEKEEE 126



 Score = 29.9 bits (68), Expect = 2.7
 Identities = 22/67 (32%), Positives = 34/67 (50%), Gaps = 4/67 (5%)

Query: 236 RSEVMRLRNQLANTQQEHT--EKMERYVKDEKNVREENLRLQRKLQREVERREALCRHLS 293
           R+E+ RL  +L   Q+E T   KME   K E+N+ ++   L  K +   E+ E L   ++
Sbjct: 75  RNELQRLERRLL--QREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIA 132

Query: 294 ESESSLE 300
           E    LE
Sbjct: 133 EQREELE 139


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 41.0 bits (97), Expect = 0.001
 Identities = 31/139 (22%), Positives = 57/139 (41%), Gaps = 11/139 (7%)

Query: 71  NIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRL 130
            I    E+  E I++    + +  +K +E  A    +E E L  +L  K  +L EE    
Sbjct: 510 LIGEDKEKLNELIASLEELERELEQKAEE--AEALLKEAEKLKEELEEKKEKLQEE---- 563

Query: 131 EQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWK 190
           E  + +E E+   +    I++ + E D     L QL++           E      +L K
Sbjct: 564 EDKLLEEAEKEAQQ---AIKEAKKEADEIIKELRQLQKGGYASVK--AHELIEARKRLNK 618

Query: 191 RMDKLEAEKRILQIKLDQP 209
             +K E +K+  + K ++ 
Sbjct: 619 ANEKKEKKKKKQKEKQEEL 637



 Score = 32.9 bits (76), Expect = 0.36
 Identities = 27/127 (21%), Positives = 53/127 (41%), Gaps = 22/127 (17%)

Query: 28  EQLQKRIESLQQQNR----------VLKVELETYKLQLRALQEENKNLRQTSVNIQAKAE 77
           E+L + I SL++  R           L  E E  K +L   +E+ +         +   E
Sbjct: 516 EKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEED-----KLLEE 570

Query: 78  QEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQE 137
            E+E       + I+  KKE + +     Q ++      S K ++L E + RL +  E++
Sbjct: 571 AEKEA-----QQAIKEAKKEADEIIKELRQLQK--GGYASVKAHELIEARKRLNKANEKK 623

Query: 138 QERVVNK 144
           +++   +
Sbjct: 624 EKKKKKQ 630


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 40.3 bits (95), Expect = 0.002
 Identities = 37/184 (20%), Positives = 68/184 (36%), Gaps = 30/184 (16%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
             +L+  I+ L+Q+   L+     + L L  L    K +      +    ++ EE    +
Sbjct: 109 ISELENEIKELEQEIERLEP-WGNFDLDLSLLLG-FKYVSVFVGTV--PEDKLEELKLES 164

Query: 87  LMKKIQALKKEKETLA--------HHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQ 138
            ++ ++ +  +K  +            E EEE       R   +L EE     + I + +
Sbjct: 165 DVENVEYISTDKGYVYVVVVVLKELSDEVEEELKKLGFER--LELEEEG-TPSELIREIK 221

Query: 139 ERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAE 198
           E         +E++E E        E L  E  EL     +E  AL   L   +++ EA 
Sbjct: 222 EE--------LEEIEKE-------RESLLEELKELAKKYLEELLALYEYLEIELERAEAL 266

Query: 199 KRIL 202
            + L
Sbjct: 267 SKFL 270



 Score = 31.4 bits (72), Expect = 0.79
 Identities = 29/128 (22%), Positives = 56/128 (43%), Gaps = 10/128 (7%)

Query: 41  NRVLKVELETYKLQ-LRALQEENKNLRQTSVNIQAKAEQEE-EFISNTLMKKIQALKKEK 98
            +VL V L++YK + L AL E         V+I+   E+   E +   L   +  L +  
Sbjct: 7   KKVLIVTLKSYKDEVLEALHELG------VVHIEDLKEELSNERLRK-LRSLLTKLSEAL 59

Query: 99  ETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDA 158
           + L   Y  +   L  +  +   +  EE  +  +   ++ E+ + +L  +I +LE E   
Sbjct: 60  DKL-RSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKE 118

Query: 159 KQNNLEQL 166
            +  +E+L
Sbjct: 119 LEQEIERL 126



 Score = 31.4 bits (72), Expect = 0.84
 Identities = 30/129 (23%), Positives = 57/129 (44%), Gaps = 17/129 (13%)

Query: 88  MKKIQ--ALKKEKETL--AHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVN 143
           MKK+    LK  K+ +  A H          DL  +L+     K R   T   +    ++
Sbjct: 6   MKKVLIVTLKSYKDEVLEALHELGVVHI--EDLKEELSNERLRKLRSLLT---KLSEALD 60

Query: 144 KLMRKIEKLEAETDAKQ----NNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEK 199
           KL   + KL    + K+     +LE+L ++  E    +E+E + L  ++     +LE E 
Sbjct: 61  KLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEI----SELENEI 116

Query: 200 RILQIKLDQ 208
           + L+ ++++
Sbjct: 117 KELEQEIER 125


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 40.4 bits (95), Expect = 0.002
 Identities = 50/268 (18%), Positives = 102/268 (38%), Gaps = 20/268 (7%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLR-ALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
              ++R+   Q++ +  K  L      L   L+E    L Q      AK   +   +   
Sbjct: 273 VADEERLAEEQEERQEAKNRLRQQLRTLEDQLKEARDELNQELSAANAKLAADRSELELL 332

Query: 87  LMKKIQALKKEKETLAHHYEQEEEFLT--NDLSRKLNQLTEEKYRLEQTIEQEQERVVNK 144
             +K      + E L    +Q     +   ++  +L+ LT +   +++  E+ ++++  +
Sbjct: 333 EDQKGAFEDADIEQLQADLDQLPSIRSELEEVEARLDALTGKHQDVQRKYERLKQKIKEQ 392

Query: 145 LMRKIEKLEAETDAKQNNLEQLRREKVE----LENTLEQEQEALVNKLWKRMDKLEAEKR 200
           L R +EK      A +   ++ +    E    LE+ L Q+ EA   +  +   +LE    
Sbjct: 393 LERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLEAGKLEFNEEEYELELRLG 452

Query: 201 ILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERY 260
            L+ +LD   + P     +   D A               +   + A    E  +   R 
Sbjct: 453 RLKQRLDSATATPEELEQLEINDEALEK-----------AQEEQEQAEANVEQLQSELRQ 501

Query: 261 VKDEKNVREENLRLQRKLQREVERREAL 288
           ++  ++  E    LQR  +R ++ R+AL
Sbjct: 502 LRKRRD--EALEALQRAERRLLQLRQAL 527


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 39.7 bits (93), Expect = 0.002
 Identities = 24/84 (28%), Positives = 44/84 (52%), Gaps = 2/84 (2%)

Query: 125 EEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEAL 184
           E     E+      E+ + KL   +E+LE E    +  LE+L+RE  +LE+ LE+ +  +
Sbjct: 407 EGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRREV 466

Query: 185 VNKLWKRMDKLEAEKRI--LQIKL 206
            +K+ K  +    ++RI  L+ +L
Sbjct: 467 RDKVRKDREIRARDRRIERLEKEL 490



 Score = 36.6 bits (85), Expect = 0.024
 Identities = 30/150 (20%), Positives = 63/150 (42%), Gaps = 27/150 (18%)

Query: 153 EAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSD 212
           EA +  K+    + +    E E       E  + KL + +++LE E   L+ +L++    
Sbjct: 392 EALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEE---- 447

Query: 213 PSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENL 272
                                 L+ E+ +L ++L   ++E  +K  R  ++ +       
Sbjct: 448 ----------------------LKREIEKLESELERFRREVRDK-VRKDREIRARDRRIE 484

Query: 273 RLQRKLQREVERREALCRHLSESESSLEME 302
           RL+++L+ + +R E L R L+E     ++E
Sbjct: 485 RLEKELEEKKKRVEELERKLAELRKMRKLE 514



 Score = 35.1 bits (81), Expect = 0.075
 Identities = 35/142 (24%), Positives = 68/142 (47%), Gaps = 15/142 (10%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
           ++L++ +E L+++N  LK ELE  K ++  L+ E +  R             E       
Sbjct: 425 KKLEETVERLEEENSELKRELEELKREIEKLESELERFR------------REVRDKVRK 472

Query: 88  MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKL-M 146
            ++I+A  +  E L    E++++    +L RKL +L  +  +LE + +    +VV KL +
Sbjct: 473 DREIRARDRRIERLEKELEEKKK-RVEELERKLAEL-RKMRKLELSGKGTPVKVVEKLTL 530

Query: 147 RKIEKLEAETDAKQNNLEQLRR 168
             IE+ E E   K+ ++  +  
Sbjct: 531 EAIEEAEEEYGIKEGDVILVED 552



 Score = 34.7 bits (80), Expect = 0.086
 Identities = 26/112 (23%), Positives = 47/112 (41%), Gaps = 21/112 (18%)

Query: 42  RVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETL 101
            V +  ++  +  +  L+EEN  L         K E EE      L ++I+ L+ E E  
Sbjct: 418 TVYEKRIKKLEETVERLEEENSEL---------KRELEE------LKREIEKLESELERF 462

Query: 102 AHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLE 153
                 +         R++        RLE+ +E E+++ V +L RK+ +L 
Sbjct: 463 RREVRDKVR-----KDREIRARDRRIERLEKELE-EKKKRVEELERKLAELR 508



 Score = 32.4 bits (74), Expect = 0.45
 Identities = 18/67 (26%), Positives = 35/67 (52%), Gaps = 2/67 (2%)

Query: 115 DLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM--RKIEKLEAETDAKQNNLEQLRREKVE 172
           +L R+L +L  E  +LE  +E+ +  V +K+   R+I   +   +  +  LE+ ++   E
Sbjct: 440 ELKRELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLEKELEEKKKRVEE 499

Query: 173 LENTLEQ 179
           LE  L +
Sbjct: 500 LERKLAE 506


>gnl|CDD|220402 pfam09787, Golgin_A5, Golgin subfamily A member 5.  Members of this
           family of proteins are involved in maintaining Golgi
           structure. They stimulate the formation of Golgi stacks
           and ribbons, and are involved in intra-Golgi retrograde
           transport. Two main interactions have been
           characterized: one with RAB1A that has been activated by
           GTP-binding and another with isoform CASP of CUTL1.
          Length = 509

 Score = 39.0 bits (91), Expect = 0.003
 Identities = 44/270 (16%), Positives = 95/270 (35%), Gaps = 19/270 (7%)

Query: 32  KRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKI 91
           +    L+ +   LK+ L+    +LR L+ E ++LR       +  E  +      +    
Sbjct: 100 QSSSDLEDELAALKIRLQEAAQELRELKSELEDLRSERSRDLSDEESIKRLQRGAVRSLQ 159

Query: 92  QALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEK 151
             L      +  H       +   L R L     E    E    Q   R ++K +++  +
Sbjct: 160 DKLLLLDAAVKRHDGNVITAVLIFLKRTLK---LEILFQELAERQTALRHLSKHLQEEGE 216

Query: 152 LEAETDAKQNNLEQLRREKVELENTLE---QEQEALVNKLWKRMDKLEAEKRILQIKLDQ 208
           L+      +   E  ++E  +         Q +E  +N L +       +    QI+L  
Sbjct: 217 LQQLLKLLRAEGESEKQELQQYRQKAHRILQSKEKRINFLKEGCLFEGLDSSTAQIEL-- 274

Query: 209 PVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVR 268
                   H+      + ++   I  L  ++++LR++  + + E + + E + K  + + 
Sbjct: 275 ----EELKHE------SEHVQEEITKLEGQIIQLRSEAQDREAEASGEAESFRKQPRELS 324

Query: 269 EENLRLQRKLQREVERREALCRHLSESESS 298
           ++        Q E+E       H  E   +
Sbjct: 325 QQIAPQITTKQ-ELELYYQELYHYREDLGN 353


>gnl|CDD|218661 pfam05622, HOOK, HOOK protein.  This family consists of several
           HOOK1, 2 and 3 proteins from different eukaryotic
           organisms. The different members of the human gene
           family are HOOK1, HOOK2 and HOOK3. Different domains
           have been identified in the three human HOOK proteins,
           and it was demonstrated that the highly conserved
           NH2-domain mediates attachment to microtubules, whereas
           the central coiled-coil motif mediates homodimerisation
           and the more divergent C-terminal domains are involved
           in binding to specific organelles (organelle-binding
           domains). It has been demonstrated that endogenous HOOK3
           binds to Golgi membranes, whereas both HOOK1 and HOOK2
           are localised to discrete but unidentified cellular
           structures. In mice the Hook1 gene is predominantly
           expressed in the testis. Hook1 function is necessary for
           the correct positioning of microtubular structures
           within the haploid germ cell. Disruption of Hook1
           function in mice causes abnormal sperm head shape and
           fragile attachment of the flagellum to the sperm head.
          Length = 713

 Score = 39.0 bits (91), Expect = 0.005
 Identities = 67/288 (23%), Positives = 131/288 (45%), Gaps = 38/288 (13%)

Query: 21  PPSPVTRE--QLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQ 78
           P    +++   LQ ++E LQ++N  L+   + Y+++   L++E   L+  +  + + A +
Sbjct: 233 PNQLGSKKYNLLQSQLEQLQEENFRLEAARDDYRIKCEELEKELAELQHRNDELTSLAAE 292

Query: 79  EEEFISNTLMKKIQALK------KEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYR-LE 131
                S  L  +I  L+      K+ E     Y+++ E L NDL R++  L E     ++
Sbjct: 293 -----SQALKDEIDVLRESSDKAKKLEAQVETYKKKLEDL-NDLRRQVKLLEERNAMYMQ 346

Query: 132 QTIEQEQE-RVVNKLMRKIEKLEAETDAKQNNL--EQLRREKVELE-NTLEQEQEALVNK 187
            T++ E+E +  N    ++E  + +       L  E  + +K+E E   LE++ EAL   
Sbjct: 347 NTVQLEEELKKANAARGQLETYKRQVQELHAKLSEESKKADKLEFEYKRLEEKLEALQ-- 404

Query: 188 LWKRMDKLEAEKRIL-----QIKLDQPVSDPSSPHDMNNGDTASNLSTHIHS--LRSEVM 240
             K  ++L AE+  L     +++  Q   D  +  D     +  NL+  +     R +++
Sbjct: 405 --KEKERLLAERDSLRETNEELRCGQAQQDQLTQADAGTSPSGDNLAAELLPSEYREKLI 462

Query: 241 RLR--NQLANTQQEHTEKMERYVK-----DEKNVREENLRLQRKLQRE 281
           RL+  N++    QE +E  ER  +     ++ N R   L  Q +L  +
Sbjct: 463 RLQHENKMLRLGQEGSEN-ERITELQQLLEDANRRNNELETQLRLANQ 509



 Score = 33.6 bits (77), Expect = 0.19
 Identities = 53/250 (21%), Positives = 107/250 (42%), Gaps = 46/250 (18%)

Query: 53  LQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFL 112
            Q+  LQEE  +L+Q +  +Q +  Q E     + +   Q   K+               
Sbjct: 200 KQVLLLQEEKNSLQQENEKLQERLAQLE----GSSLGPNQLGSKK--------------- 240

Query: 113 TNDLSRKLNQLTEEKYRLEQTIE------QEQERVVNKLMRKIEKLEAETDAKQNNLEQL 166
            N L  +L QL EE +RLE   +      +E E+ + +L  + ++L +     Q     L
Sbjct: 241 YNLLQSQLEQLQEENFRLEAARDDYRIKCEELEKELAELQHRNDELTSLAAESQ----AL 296

Query: 167 RREKVELENTLEQEQ--EALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGD- 223
           + E   L  + ++ +  EA V    K+++ L   +R  Q+KL +   + ++ +  N    
Sbjct: 297 KDEIDVLRESSDKAKKLEAQVETYKKKLEDLNDLRR--QVKLLE---ERNAMYMQNTVQL 351

Query: 224 -----TASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKL 278
                 A+     + + + +V  L  +L     E ++K ++   + K + E+   LQ++ 
Sbjct: 352 EEELKKANAARGQLETYKRQVQELHAKL----SEESKKADKLEFEYKRLEEKLEALQKEK 407

Query: 279 QREVERREAL 288
           +R +  R++L
Sbjct: 408 ERLLAERDSL 417



 Score = 32.1 bits (73), Expect = 0.56
 Identities = 38/168 (22%), Positives = 75/168 (44%), Gaps = 15/168 (8%)

Query: 5   ASESDSNSMDGVALMMPPSPVTREQLQKRIESLQQQNRVLKVELE-TYKLQLRALQE--E 61
           A    S S D +A  + PS     + ++++  LQ +N++L++  E +   ++  LQ+  E
Sbjct: 437 ADAGTSPSGDNLAAELLPS-----EYREKLIRLQHENKMLRLGQEGSENERITELQQLLE 491

Query: 62  NKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLN 121
           + N R   +  Q +   +       L ++++ L+K  +       ++   L + L   L 
Sbjct: 492 DANRRNNELETQLRLANQR---ILELQQQVEDLQKALQE-QGSKSEDSSLLKSKLEEHLE 547

Query: 122 QLTE---EKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQL 166
           QL E   E  +  + IE+ +      L RKI +LEA    K  ++  +
Sbjct: 548 QLHEANEELQKKREQIEELEPDQDQNLSRKIAELEAALQKKDEDMRAM 595


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 38.1 bits (89), Expect = 0.005
 Identities = 33/122 (27%), Positives = 55/122 (45%), Gaps = 9/122 (7%)

Query: 86  TLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKL 145
             +K+   L  ++  L +  + +     + L  +L QL + +  LE     E +R   KL
Sbjct: 154 EGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKEKL 213

Query: 146 MRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKL-EAEKRILQI 204
               +KL  E   K   LE+L  E  ELE+ +E     L NK  +   ++ EAEK++ Q 
Sbjct: 214 ----KKLLQEIMIKVKKLEELEEELQELESKIED----LTNKKSELNTEIAEAEKKLEQC 265

Query: 205 KL 206
           + 
Sbjct: 266 RG 267



 Score = 28.8 bits (65), Expect = 4.4
 Identities = 33/150 (22%), Positives = 59/150 (39%), Gaps = 21/150 (14%)

Query: 115 DLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELE 174
            L   L  L E+   L + +E      +N +  K+   +   DA +  L QL++ + ELE
Sbjct: 148 GLDENLEGLKEDYKLLMKELEL-----LNSIKPKLRDRK---DALEEELRQLKQLEDELE 199

Query: 175 NTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHS 234
           +    E +    KL K + ++  + + L+   ++     S   D+ N    S L+T I  
Sbjct: 200 DCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTN--KKSELNTEIAE 257

Query: 235 L-----------RSEVMRLRNQLANTQQEH 253
                         E+ +L+ QL   Q   
Sbjct: 258 AEKKLEQCRGFTFKEIEKLKEQLKLLQSLT 287


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 38.5 bits (90), Expect = 0.006
 Identities = 15/53 (28%), Positives = 27/53 (50%)

Query: 147 RKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEK 199
            +IE+L+ E    +  +EQL  E  + E  LE+ +   ++K WK++ K     
Sbjct: 95  AQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSN 147



 Score = 38.1 bits (89), Expect = 0.008
 Identities = 40/211 (18%), Positives = 75/211 (35%), Gaps = 39/211 (18%)

Query: 27  REQLQKRI-ESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEF--- 82
           + +L+    E  Q+    L+  ++ Y+  +    EE +++  T           EE    
Sbjct: 269 KAELEAHFDEEYQELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKAL 328

Query: 83  ---ISNTLMKKIQALKKEKETLAHHYE----QEEEFLTNDLSRKLNQLTEEKYRLEQTIE 135
              +   L K +Q L+++ +  +   E     +     ND+   +N+L  E       ++
Sbjct: 329 LEALEEILEKNLQKLEEKLKDPSTSIELESITDLIESINDIIDAINELIREHNEKIDNLK 388

Query: 136 QEQERVVNKLMR----------------------KIEKLEAETDAKQNNLEQLRREKVEL 173
           +E+ +   KL                         I  LE E    +  ++ L +E  EL
Sbjct: 389 KEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALEKEIKEL 448

Query: 174 ENTLEQEQEA--LVNKLWKR----MDKLEAE 198
           E  L   +     +NKL K       +LE E
Sbjct: 449 EKQLTNIEPTADEINKLLKAYGFGNFELEKE 479



 Score = 33.4 bits (77), Expect = 0.22
 Identities = 38/256 (14%), Positives = 88/256 (34%), Gaps = 42/256 (16%)

Query: 31  QKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVN-----IQAKAEQEEEFISN 85
           + +IE L+++ + L+ ++E  + ++   +EE +  +   ++     +  K +        
Sbjct: 94  EAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEALK 153

Query: 86  TLMKKIQALKKEKETLAHHYEQEEEFLTN-DLSRKLNQL-TEEKYRLE---------QTI 134
            L  K    +K  + L          L+  +L  K+  L +  K  L            I
Sbjct: 154 GLNYKKNFKEKLLKELKSVILNASSLLSLEELKAKIKTLFSSNKPELALLTLSVIDFDEI 213

Query: 135 EQE---QERVVNKLMRKIEKLEAETDAK-------------------QNNLEQLRREKVE 172
           EQ    ++ ++      I +L                          Q  + + R+   E
Sbjct: 214 EQAEILEKSIIGSSDVPISELINNLGNSDWVKEGLEYHEEGDTCPFCQQTITEERKA--E 271

Query: 173 LENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHI 232
           LE   ++E + L+ +L + +DK E+       +L+  +       +         L   +
Sbjct: 272 LEAHFDEEYQELIEQLEELIDKYESHIEKALEELESILDTEKENSEFK--LDVEELKALL 329

Query: 233 HSLRSEVMRLRNQLAN 248
            +L   + +   +L  
Sbjct: 330 EALEEILEKNLQKLEE 345


>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
           autophagy-related subunit 14.  The Atg14 or Apg14
           proteins are hydrophilic proteins with a predicted
           molecular mass of 40.5 kDa, and have a coiled-coil motif
           at the N terminus region. Yeast cells with mutant Atg14
           are defective not only in autophagy but also in sorting
           of carboxypeptidase Y (CPY), a vacuolar-soluble
           hydrolase, to the vacuole. Subcellular fractionation
           indicate that Apg14p and Apg6p are peripherally
           associated with a membrane structure(s). Apg14p was
           co-immunoprecipitated with Apg6p, suggesting that they
           form a stable protein complex. These results imply that
           Apg6/Vps30p has two distinct functions: in the
           autophagic process and in the vacuolar protein sorting
           pathway. Apg14p may be a component specifically required
           for the function of Apg6/Vps30p through the autophagic
           pathway. There are 17 auto-phagosomal component proteins
           which are categorized into six functional units, one of
           which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
           AS-PI3K complex and the Atg2-Atg18 complex are essential
           for nucleation, and the specific function of the AS-PI3K
           apparently is to produce phosphatidylinositol
           3-phosphate (PtdIns(3)P) at the pre-autophagosomal
           structure (PAS). The localisation of this complex at the
           PAS is controlled by Atg14. Autophagy mediates the
           cellular response to nutrient deprivation, protein
           aggregation, and pathogen invasion in humans, and
           malfunction of autophagy has been implicated in multiple
           human diseases including cancer. This effect seems to be
           mediated through direct interaction of the human Atg14
           with Beclin 1 in the human phosphatidylinositol 3-kinase
           class III complex.
          Length = 307

 Score = 37.7 bits (88), Expect = 0.006
 Identities = 23/120 (19%), Positives = 51/120 (42%), Gaps = 7/120 (5%)

Query: 89  KKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRK 148
             +  L  E E L    E+        L    N+  +    L +     ++  +N++  +
Sbjct: 27  LDLARLLLENEELKQKVEEA-------LEGATNEDGKLAADLLKLEVARKKERLNQIRAR 79

Query: 149 IEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQ 208
           I +L+ E + K+  +E+L+R   +  + L      L  +   +++KL+ E +  + KL+ 
Sbjct: 80  ISQLKEEIEQKRERIEELKRALAQRRSDLSSASYQLEKRRASQLEKLQDEIKRTRSKLNA 139



 Score = 30.8 bits (70), Expect = 1.2
 Identities = 13/86 (15%), Positives = 34/86 (39%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
             + ++R+  ++ +   LK E+E  + ++  L+      R    +   + E+        
Sbjct: 66  VARKKERLNQIRARISQLKEEIEQKRERIEELKRALAQRRSDLSSASYQLEKRRASQLEK 125

Query: 87  LMKKIQALKKEKETLAHHYEQEEEFL 112
           L  +I+  + +   L     ++  FL
Sbjct: 126 LQDEIKRTRSKLNALHSLLAEKRSFL 151



 Score = 30.0 bits (68), Expect = 1.9
 Identities = 33/160 (20%), Positives = 62/160 (38%), Gaps = 28/160 (17%)

Query: 48  LETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQ 107
           L   +L L  L  EN+ L+Q         E  E   +         LK E          
Sbjct: 22  LYELRLDLARLLLENEELKQKV------EEALEGATNEDGKLAADLLKLEVA-------- 67

Query: 108 EEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLR 167
                      +LNQ+     +L++ IEQ++ER + +L R + +  ++  +    LE+ R
Sbjct: 68  -------RKKERLNQIRARISQLKEEIEQKRER-IEELKRALAQRRSDLSSASYQLEKRR 119

Query: 168 -REKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKL 206
             +  +L++ +++ +          +  L AEKR    + 
Sbjct: 120 ASQLEKLQDEIKRTRS-----KLNALHSLLAEKRSFLCRE 154


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 37.8 bits (88), Expect = 0.008
 Identities = 40/211 (18%), Positives = 79/211 (37%), Gaps = 30/211 (14%)

Query: 28  EQLQKRIESLQQQNR----VLKVELETYK-------LQLRALQEENKNLRQTSVNIQAKA 76
             L  R+ +L+ Q R     L  +L   +         L    E+ +   + ++   A  
Sbjct: 97  ADLNARLNALEVQEREQRRRLAEQLAALQRSGRNPPPALLVSPEDAQRSVRLAIYYGA-- 154

Query: 77  EQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQ 136
                 ++    ++I ALK   + LA     E      +L+  L++   ++ +L Q +E+
Sbjct: 155 ------LNPARAERIDALKATLKQLAAV-RAEIAAEQAELTTLLSEQRAQQAKLAQLLEE 207

Query: 137 EQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLE--QEQEALVNKLWKRMDK 194
            ++ +         +L +E  A Q  LE+LR  +  L+N +   +   A   +     + 
Sbjct: 208 RKKTL--------AQLNSELSADQKKLEELRANESRLKNEIASAEAAAAKAREAAAAAEA 259

Query: 195 LEAEKRILQIKLDQPVSDPSSPHDMNNGDTA 225
             A  R  + K       P++P  M    T 
Sbjct: 260 AAARARAAEAKRTGETYKPTAPEKMLISSTG 290



 Score = 36.6 bits (85), Expect = 0.021
 Identities = 31/210 (14%), Positives = 72/210 (34%), Gaps = 25/210 (11%)

Query: 116 LSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELEN 175
           L +K+ +  +++ +LE+ + +  E  +  L  ++ +   +    +  +  L      LE 
Sbjct: 50  LEKKIREQQDQRAKLEKQL-KSLETEIASLEAQLIETADDLKKLRKQIADLNARLNALEV 108

Query: 176 TLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSL 235
              +++  L  +L     +         + +        SP D       +     ++  
Sbjct: 109 QEREQRRRLAEQL--AALQRSGRNPPPALLV--------SPEDAQRSVRLAIYYGALNPA 158

Query: 236 RSEVMR-----------LRNQLANTQQEHTEKMERYVKDEKN---VREENLRLQRKLQRE 281
           R+E +            +R ++A  Q E T  +      +     + EE  +   +L  E
Sbjct: 159 RAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSE 218

Query: 282 VERREALCRHLSESESSLEMEEERHYNEIA 311
           +   +     L  +ES L+ E        A
Sbjct: 219 LSADQKKLEELRANESRLKNEIASAEAAAA 248



 Score = 35.9 bits (83), Expect = 0.036
 Identities = 34/170 (20%), Positives = 54/170 (31%), Gaps = 32/170 (18%)

Query: 23  SPVTREQLQKRIESLQQQNRVLKVEL----ETYKLQLRALQEENKNLRQTSVNIQAKAEQ 78
           +    ++L+K+I  L  +   L+V+          QL ALQ   +N              
Sbjct: 85  TADDLKKLRKQIADLNARLNALEVQEREQRRRLAEQLAALQRSGRN-----PPPALLVSP 139

Query: 79  EEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQ 138
           E+   S  L     AL   +       +   + L                 +   I  EQ
Sbjct: 140 EDAQRSVRLAIYYGALNPARAERIDALKATLKQLAA---------------VRAEIAAEQ 184

Query: 139 ERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKL 188
                    ++  L +E  A+Q  L QL  E+ +    L  E  A   KL
Sbjct: 185 A--------ELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKL 226



 Score = 28.5 bits (64), Expect = 6.5
 Identities = 40/235 (17%), Positives = 88/235 (37%), Gaps = 14/235 (5%)

Query: 57  ALQEENKNLRQTSVNIQAK----AEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFL 112
           +   ++K L+Q    I A      EQ+++     L K++++L+ E  +L     +  + L
Sbjct: 32  SAAADDKQLKQIQKEIAALEKKIREQQDQR--AKLEKQLKSLETEIASLEAQLIETADDL 89

Query: 113 TNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVE 172
              L +++  L      LE   E+EQ R + + +  +++              +  E  +
Sbjct: 90  KK-LRKQIADLNARLNALEVQ-EREQRRRLAEQLAALQRSGRNPPPA----LLVSPEDAQ 143

Query: 173 LENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHI 232
               L     AL     +R+D L+A  + L   +   ++   +       +  +  +  +
Sbjct: 144 RSVRLAIYYGALNPARAERIDALKATLKQLA-AVRAEIAAEQAELTTLLSEQRAQQAK-L 201

Query: 233 HSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKLQREVERREA 287
             L  E  +   QL +      +K+E    +E  ++ E    +    +  E   A
Sbjct: 202 AQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAAAAKAREAAAA 256


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 37.2 bits (86), Expect = 0.011
 Identities = 33/141 (23%), Positives = 55/141 (39%), Gaps = 20/141 (14%)

Query: 24  PVTREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFI 83
           P+  E     +E L  + R  + E +TYK  L  L+ +N  +           E      
Sbjct: 21  PLCEECADILLERLDSELRDAEKERDTYKQYLSKLESQNVEISNYEALDSELDE------ 74

Query: 84  SNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVN 143
              L K+ + L  E E L        E   +DL  +L +L EEK +LE    +E + +  
Sbjct: 75  ---LKKEEERLLDELEEL--------EKEDDDLDGELVELQEEKEQLEN---EELQYLRE 120

Query: 144 KLMRKIEKLEAETDAKQNNLE 164
             +     L+ E + +   L+
Sbjct: 121 YNLFDRNNLQLEDNLQSLELQ 141



 Score = 36.0 bits (83), Expect = 0.030
 Identities = 27/128 (21%), Positives = 53/128 (41%), Gaps = 17/128 (13%)

Query: 81  EFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQE- 139
           + +   L  +++  +KE++T   +              KL     E    E    +  E 
Sbjct: 28  DILLERLDSELRDAEKERDTYKQYLS------------KLESQNVEISNYEALDSELDEL 75

Query: 140 -RVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAE 198
            +   +L+ ++E+LE E D     L +L+ EK +LEN   +E + L        + L+ E
Sbjct: 76  KKEEERLLDELEELEKEDDDLDGELVELQEEKEQLEN---EELQYLREYNLFDRNNLQLE 132

Query: 199 KRILQIKL 206
             +  ++L
Sbjct: 133 DNLQSLEL 140


>gnl|CDD|237560 PRK13922, PRK13922, rod shape-determining protein MreC;
           Provisional.
          Length = 276

 Score = 36.9 bits (86), Expect = 0.011
 Identities = 15/60 (25%), Positives = 29/60 (48%), Gaps = 7/60 (11%)

Query: 25  VTREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFIS 84
            +   L++  E L+++      ELE+   +L  L+ EN  LR+    +  K   + +FI+
Sbjct: 69  ASLFDLREENEELKKELL----ELESRLQELEQLEAENARLREL---LNLKESLDYQFIT 121



 Score = 31.1 bits (71), Expect = 0.76
 Identities = 14/72 (19%), Positives = 28/72 (38%), Gaps = 7/72 (9%)

Query: 220 NNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRK-- 277
               + S +   +  + S V R+ N          E     +    ++REEN  L+++  
Sbjct: 31  RRLGSLSPVRQVVGDVVSPVQRVVNAPREFVSGVFE----SLASLFDLREENEELKKELL 86

Query: 278 -LQREVERREAL 288
            L+  ++  E L
Sbjct: 87  ELESRLQELEQL 98


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 37.6 bits (87), Expect = 0.013
 Identities = 45/278 (16%), Positives = 95/278 (34%), Gaps = 25/278 (8%)

Query: 45  KVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHH 104
            +E      +L    EE K         QAK   E          K +   +E+  L   
Sbjct: 179 LIEETENLAELIIDLEELKLQELKLK-EQAKKALEYY------QLKEKLELEEENLLYLD 231

Query: 105 YEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQE---------RVVNKLMRKIEKLEAE 155
           Y +  E   + L   L    EE    +Q +E+E+E         +   K  +  E+    
Sbjct: 232 YLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKL 291

Query: 156 TDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSS 215
              ++  L+    +    +   E++ +    +L K   +L+ EK   +I+  +       
Sbjct: 292 LAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKE--EIEELEKELKELE 349

Query: 216 PHDMNNGDTASNLS-------THIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVR 268
                  +    L             L ++      +L++  +   E++E   ++EK  +
Sbjct: 350 IKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAK 409

Query: 269 EENLRLQRKLQREVERREALCRHLSESESSLEMEEERH 306
                 +++     E ++   + + E E SLE ++ + 
Sbjct: 410 LLLELSEQEEDLLKEEKKEELKIVEELEESLETKQGKL 447



 Score = 36.1 bits (83), Expect = 0.033
 Identities = 62/321 (19%), Positives = 124/321 (38%), Gaps = 24/321 (7%)

Query: 7   ESDSNSMDGVALMMPPSPVTREQLQK----RIESLQQQNRVLKVELETYKLQL------- 55
                     + +     + R++  K    RI+   ++ ++ K EL   K+Q        
Sbjct: 678 AEQELQEKAESELAKNEILRRQEEIKKKEQRIKEELKKLKLEKEELLADKVQEAQDKINE 737

Query: 56  --RALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLT 113
             + L+++ K   +     + K E+EEE  S   +K+ +  ++E++T     E+E+E   
Sbjct: 738 ELKLLEQKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKL 797

Query: 114 NDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKL--------EAETDAKQNNLEQ 165
                +L  L EE     + +E+EQ  +  +   K E+L        E +   K    E 
Sbjct: 798 KAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEEL 857

Query: 166 LRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTA 225
            R E+   +  L QE      +L ++  K E E +  + K ++   +  S  D    +  
Sbjct: 858 ERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKDNLLEEKE 917

Query: 226 SNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKLQREVERR 285
           + +   I      +++  ++     +   E+ +   K+E N  EE  R +R L  + E  
Sbjct: 918 NEIEERIAEEAIILLKYESEPE---ELLLEEADEKEKEEDNKEEEEERNKRLLLAKEELG 974

Query: 286 EALCRHLSESESSLEMEEERH 306
                 ++E E   E   +  
Sbjct: 975 NVNLMAIAEFEEKEERYNKDE 995



 Score = 31.9 bits (72), Expect = 0.76
 Identities = 31/182 (17%), Positives = 76/182 (41%), Gaps = 11/182 (6%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
           +  ++ ++ L ++   LK EL   + +    +E+ K   +    ++ + ++E+E I    
Sbjct: 283 KLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEE-- 340

Query: 88  MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTE--EKYRLEQTIEQEQERVVNKL 145
              ++   KE E      E+EEE L     +      E   K +LE        ++  + 
Sbjct: 341 ---LEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEE 397

Query: 146 MRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIK 205
           +    + E E       L +L  ++ +L    ++E+  +V +L + ++  + +    + +
Sbjct: 398 LELKNEEEKE----AKLLLELSEQEEDLLKEEKKEELKIVEELEESLETKQGKLTEEKEE 453

Query: 206 LD 207
           L+
Sbjct: 454 LE 455


>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
          Length = 535

 Score = 37.2 bits (86), Expect = 0.014
 Identities = 35/130 (26%), Positives = 63/130 (48%), Gaps = 11/130 (8%)

Query: 64  NLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKE------KETLAHHYEQEEEFLT--ND 115
           N  Q +VN++ KAE++ E I  T  ++ +ALKKE      +E   +  E E+EF +   +
Sbjct: 39  NAEQEAVNLRGKAERDAEHIKKTAKRESKALKKELLLEAKEEARKYREEIEQEFKSERQE 98

Query: 116 LSRKLNQLTEEKYRLEQTIEQ--EQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKV-E 172
           L +  ++LTE    L++  E    +E+ +    + +       D ++  +E+L  +K  E
Sbjct: 99  LKQIESRLTERATSLDRKDENLSSKEKTLESKEQSLTDKSKHIDEREEQVEKLEEQKKAE 158

Query: 173 LENTLEQEQE 182
           LE      Q 
Sbjct: 159 LERVAALSQA 168


>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC.  All proteins in this family for
           which functions are known are part of an exonuclease
           complex with sbcD homologs. This complex is involved in
           the initiation of recombination to regulate the levels
           of palindromic sequences in DNA. This family is based on
           the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1042

 Score = 37.3 bits (86), Expect = 0.014
 Identities = 48/277 (17%), Positives = 90/277 (32%), Gaps = 23/277 (8%)

Query: 30  LQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQE-EEFISNTLM 88
            +K+  SL++Q + ++             +E+  NL+  +V +Q   E+  E        
Sbjct: 554 ERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLSEAEDMLACE 613

Query: 89  KKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKL-MR 147
           +     K + E          +  + +L+ KL  L   +  L Q   +E    +  L   
Sbjct: 614 QHALLRKLQPEQDLQDVRLHLQQCSQELALKLTALHALQLTLTQERVREHALSIRVLPKE 673

Query: 148 KIEKLEAETDAKQNNLEQLRREKVELE--NTLEQEQEALVNKLWKRMDKLEAEKRILQIK 205
            +   +      Q+  EQL   K  L    TL +E E  + +  +  +++E     L   
Sbjct: 674 LLASRQLALQKMQSEKEQLTYWKEMLAQCQTLLRELETHIEEYDREFNEIENASSSLGSD 733

Query: 206 LDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEK 265
           L                           SL+  + + R  L    + H    E      +
Sbjct: 734 LAA------------------REDALNQSLKELMHQARTVLKARTEAHFNNNEEVTAALQ 775

Query: 266 NVREENLRLQRKLQREVERREALCRHLSESESSLEME 302
              E +  L  ++Q     RE     L   E+ +  E
Sbjct: 776 TGAELS-HLAAEIQFFNRLREEDTHLLKTLEAEIGQE 811



 Score = 29.2 bits (65), Expect = 5.4
 Identities = 33/186 (17%), Positives = 69/186 (37%), Gaps = 11/186 (5%)

Query: 26  TREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAE--QEEEFI 83
             E+  K +    QQ +    +   Y  Q R  QEE    +Q    ++A+ E  + +E +
Sbjct: 223 VLEKELKHLREALQQTQ----QSHAYLTQKREAQEEQLKKQQLLKQLRARIEELRAQEAV 278

Query: 84  SNTLMKKIQALKKEKETLAH-----HYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQ 138
                ++I   +K     AH       EQ+ + +  +L  K+    +   +    ++Q+ 
Sbjct: 279 LEETQERINRARKAAPLAAHIKAVTQIEQQAQRIHTELQSKMRSRAKLLMKRAAHVKQQS 338

Query: 139 ERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAE 198
                + + +    +       + +    RE    ++TL Q    L  +      KL++ 
Sbjct: 339 SIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHTLTQHIHTLQQQKTTLTQKLQSL 398

Query: 199 KRILQI 204
            + L I
Sbjct: 399 CKELDI 404


>gnl|CDD|215593 PLN03128, PLN03128, DNA topoisomerase 2; Provisional.
          Length = 1135

 Score = 37.4 bits (87), Expect = 0.015
 Identities = 39/213 (18%), Positives = 75/213 (35%), Gaps = 31/213 (14%)

Query: 61   ENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKK---------EKETLAHHYEQEEEF 111
            EN         +    E+    ++  L KK +             +K+     Y+  E+ 
Sbjct: 921  ENHTDTTVHFTVTLSEEKMARALAEGLEKKFKLSTTISTSNMHLFDKDGKIKKYDSPEDI 980

Query: 112  LTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKV 171
            L      +L+   + K  L + +E E +++ NK  R I  +          +    R++ 
Sbjct: 981  LEEFFHLRLDLYVKRKEVLLENLEYELKKLDNK-ARFILAVVK------GEIVVNNRKRA 1033

Query: 172  ELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASN---- 227
            EL   LE        K + +  K         +       + S   + ++ + A +    
Sbjct: 1034 ELLAELE-------EKGFDKFPKTAKISETNVVGDRDG--EASEEEEASDNELAKSYDYL 1084

Query: 228  LSTHIHSLRSE-VMRLRNQLANTQQEHTEKMER 259
            L   I SL  E V  LR + A  + E  E++++
Sbjct: 1085 LGMPISSLTLEKVDELRAERAKKETE-VEELKK 1116


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 36.5 bits (84), Expect = 0.019
 Identities = 49/244 (20%), Positives = 99/244 (40%), Gaps = 21/244 (8%)

Query: 33  RIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQ 92
           +++ ++ Q R L+ EL T + + RA + E +  R        KA QE E +   L    Q
Sbjct: 75  QLDDIRPQLRALRTELGTAQGEKRAAETEREAARSEL----QKARQEREAVRQELAAARQ 130

Query: 93  ALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKL 152
            L K ++ LA   +Q +     DL  +L  L E++ +LE      Q + +    ++++  
Sbjct: 131 NLAKAQQELARLTKQAQ-----DLQTRLKTLAEQRRQLE-----AQAQSLQASQKQLQAS 180

Query: 153 EAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSD 212
             +  ++  +L +LR  ++E E      +        + + +  A  +     + Q  + 
Sbjct: 181 ATQLKSQVLDL-KLRSAQIEQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQ 239

Query: 213 PSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENL 272
            S           +  +  I     ++ RL    A  +QE   ++E Y +    +R++  
Sbjct: 240 IS-----QKAQQIAARAEQIRERERQLQRLETAQARLEQE-VAQLEAYYQAYVRLRQQAA 293

Query: 273 RLQR 276
             QR
Sbjct: 294 ATQR 297



 Score = 35.0 bits (80), Expect = 0.068
 Identities = 36/157 (22%), Positives = 65/157 (41%), Gaps = 17/157 (10%)

Query: 31  QKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKK 90
           Q+ +  L +Q + L+  L+T   Q R L+ + ++L+ +   +QA A Q        L  +
Sbjct: 136 QQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQ--------LKSQ 187

Query: 91  IQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIE 150
           +  LK     +    EQE + L    +      TEE  R     +Q  +  + +   +I 
Sbjct: 188 VLDLKLRSAQI----EQEAQNLATR-ANAAQARTEELARRAAAAQQTAQA-IQQRDAQIS 241

Query: 151 KLEAETDAKQNNLEQLRREKVELENT---LEQEQEAL 184
           +   +  A+   + +  R+   LE     LEQE   L
Sbjct: 242 QKAQQIAARAEQIRERERQLQRLETAQARLEQEVAQL 278



 Score = 32.3 bits (73), Expect = 0.42
 Identities = 27/142 (19%), Positives = 54/142 (38%), Gaps = 21/142 (14%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
           R QL+ + +SLQ   + L+      K Q+  L+  +  + Q + N+  +A          
Sbjct: 160 RRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLATRANA-------- 211

Query: 87  LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
                QA  +E    A   +Q         ++ + Q   +  +  Q I    E  + +  
Sbjct: 212 ----AQARTEELARRAAAAQQ--------TAQAIQQRDAQISQKAQQIAARAE-QIRERE 258

Query: 147 RKIEKLEAETDAKQNNLEQLRR 168
           R++++LE      +  + QL  
Sbjct: 259 RQLQRLETAQARLEQEVAQLEA 280


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 36.4 bits (84), Expect = 0.022
 Identities = 47/209 (22%), Positives = 89/209 (42%), Gaps = 15/209 (7%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
              QK  +S+Q   + LK E ++     + L EE K  ++     +  A +     + + 
Sbjct: 469 SAKQKDKQSMQSMEKRLKSEADSRVNAEKQLAEEKKRKKEE----EETAARAAAQAAASR 524

Query: 88  MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTE-EKYRLEQTIEQEQERVVNKLM 146
            +  ++LK+ K+ L    E E + L +DL  K  +    EK   E    QE E+    LM
Sbjct: 525 EECAESLKQAKQDL----EMEIKKLEHDLKLKEEECRMLEKEAQELRKYQESEKETEVLM 580

Query: 147 RKIEKLEAETDAKQNNLEQLRREKVELENTL---EQEQEALVNKLWKR-MDKLEAEKRIL 202
             ++ ++ +    +N+L    R K++L + L   +++ E     ++KR  + ++ + +I 
Sbjct: 581 SALQAMQDKNLMLENSLSAETRLKLDLFSALGDVKRQLEIAHGAIYKREAEIVDLKSKIA 640

Query: 203 QIKLDQPVSD--PSSPHDMNNGDTASNLS 229
            I    P S     +PH   N      L 
Sbjct: 641 DILAVMPESRIRSVTPHYSANFLEKPPLV 669



 Score = 31.0 bits (70), Expect = 1.2
 Identities = 42/212 (19%), Positives = 91/212 (42%), Gaps = 40/212 (18%)

Query: 129 RLEQTIE------QEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTL----- 177
           RLEQ I+      Q+  +  ++L  +I  L +   + +++L QL++E   L+  L     
Sbjct: 410 RLEQDIKKLQAELQQARQNESELRNQISLLTSLERSLKSDLGQLKKENDMLQTKLNSMVS 469

Query: 178 -EQEQEALVNKLWKRMDKLEAEKRILQIKL-----------DQPVSDPSSPHDMNNGDTA 225
            +Q+ +  +  + KR+ K EA+ R+   K            ++  +  ++    +  + A
Sbjct: 470 AKQKDKQSMQSMEKRL-KSEADSRVNAEKQLAEEKKRKKEEEETAARAAAQAAASREECA 528

Query: 226 SNLSTHIHSLRSEVMRLRNQLANTQQE------HTEKMERYVKDEKN----------VRE 269
            +L      L  E+ +L + L   ++E        +++ +Y + EK           +++
Sbjct: 529 ESLKQAKQDLEMEIKKLEHDLKLKEEECRMLEKEAQELRKYQESEKETEVLMSALQAMQD 588

Query: 270 ENLRLQRKLQREVERREALCRHLSESESSLEM 301
           +NL L+  L  E   +  L   L + +  LE+
Sbjct: 589 KNLMLENSLSAETRLKLDLFSALGDVKRQLEI 620



 Score = 28.7 bits (64), Expect = 6.4
 Identities = 18/84 (21%), Positives = 38/84 (45%), Gaps = 3/84 (3%)

Query: 224 TASNLSTHIHSLRSEVMRLRNQLANTQQEHTE---KMERYVKDEKNVREENLRLQRKLQR 280
             S L   I  L S    L++ L   ++E+     K+   V  ++  ++    ++++L+ 
Sbjct: 428 NESELRNQISLLTSLERSLKSDLGQLKKENDMLQTKLNSMVSAKQKDKQSMQSMEKRLKS 487

Query: 281 EVERREALCRHLSESESSLEMEEE 304
           E + R    + L+E +   + EEE
Sbjct: 488 EADSRVNAEKQLAEEKKRKKEEEE 511


>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
          Length = 1486

 Score = 36.9 bits (86), Expect = 0.022
 Identities = 43/196 (21%), Positives = 75/196 (38%), Gaps = 31/196 (15%)

Query: 24  PVTREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFI 83
            V+R +       L ++ R  +   E    QL+ L+     L Q     Q       EF 
Sbjct: 488 EVSRSEAWDVARELLRRLREQRHLAE----QLQQLRMRLSELEQRLRQQQRAERLLAEF- 542

Query: 84  SNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVN 143
                +  + L  E E      E E       LS  +++  E +  L Q +EQ Q R+  
Sbjct: 543 ---CKRLGKNLDDEDELEQLQEELEARL--ESLSESVSEARERRMALRQQLEQLQARI-- 595

Query: 144 KLMRKIEKLEAETDAKQNNLEQLR---------REKVE--LENTLEQEQEALVNKLWKRM 192
              +++        A Q+ L +LR          + V   ++  LE+E+E  V +     
Sbjct: 596 ---QRLAARAPAWLAAQDALARLREQSGEEFEDSQDVTEYMQQLLERERELTVER----- 647

Query: 193 DKLEAEKRILQIKLDQ 208
           D+L A K+ L  ++++
Sbjct: 648 DELAARKQALDEEIER 663



 Score = 31.5 bits (72), Expect = 1.1
 Identities = 30/152 (19%), Positives = 59/152 (38%), Gaps = 26/152 (17%)

Query: 28   EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
            E+L++R+E  +Q+    + +L   + QL    + N+ L     +  AK +          
Sbjct: 988  EKLRQRLEQAEQERTRAREQLRQAQAQLA---QYNQVLASLKSSYDAKRQM--------- 1035

Query: 88   MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMR 147
               +Q LK+E + L    +   E           +    +  L   +   + R  N+L +
Sbjct: 1036 ---LQELKQELQDLGVPADSGAE----------ERARARRDELHARLSANRSR-RNQLEK 1081

Query: 148  KIEKLEAETDAKQNNLEQLRREKVELENTLEQ 179
            ++   EAE D     L +L R+  E+   +  
Sbjct: 1082 QLTFCEAEMDNLTKKLRKLERDYHEMREQVVN 1113


>gnl|CDD|214660 smart00434, TOP4c, DNA Topoisomerase IV.  Bacterial DNA
           topoisomerase IV, GyrA, ParC.
          Length = 444

 Score = 36.0 bits (84), Expect = 0.032
 Identities = 18/116 (15%), Positives = 49/116 (42%), Gaps = 13/116 (11%)

Query: 98  KETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETD 157
           KE L    +   E  T      L +L  E+          +   +   +  I+++     
Sbjct: 325 KEILKEFLDHRLEVYTRRKEYLLGKLEAERL------HILEGLFI--ALSIIDEIIVLIR 376

Query: 158 AKQNNLEQLRREKVELENTLEQEQEALVN-KLWK----RMDKLEAEKRILQIKLDQ 208
           + ++  ++ + + +E     E + +A+++ +L +     ++KLE E + L+ +++ 
Sbjct: 377 SSKDLAKEAKEKLMERFELSEIQADAILDMRLRRLTKLEVEKLEKELKELEKEIED 432


>gnl|CDD|215969 pfam00521, DNA_topoisoIV, DNA gyrase/topoisomerase IV, subunit A. 
          Length = 427

 Score = 35.6 bits (83), Expect = 0.037
 Identities = 40/219 (18%), Positives = 79/219 (36%), Gaps = 50/219 (22%)

Query: 25  VTREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFIS 84
           V  E+L + IE L          ++  K++   +++E+   R   + I+ K     E + 
Sbjct: 229 VWTEKLIESIEEL----------VKNKKIKGSDIRDESD--RGVRIVIELKRGANPEVVL 276

Query: 85  NTLMKKIQ----------ALKKE-------KETLAHHYEQEEEFLTNDLSRKLNQLTEEK 127
           N L KK +           L          KE L    E   E         L +L E  
Sbjct: 277 NKLYKKTKLQSSFSTNNLVLFDGGPKYLNLKEILKEFLEHRLEVYKRRKEYLLEKLEERL 336

Query: 128 YRLEQTIEQ-EQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQ-EALV 185
           + LE  ++   +   V +++R       +    +  L +           L + Q + L+
Sbjct: 337 HILEGLLKALNKIDFVIEVIRG----SIDLKKAKKELIE----------ELSEIQADYLL 382

Query: 186 N-KLWK----RMDKLEAEKRILQIKLDQPVSDPSSPHDM 219
           + +L +     ++KLE E   L+ ++ +     +S   +
Sbjct: 383 DMRLRRLTKEEIEKLEKEIEELEKEIAELEKILASEKKL 421


>gnl|CDD|218602 pfam05478, Prominin, Prominin.  The prominins are an emerging
           family of proteins that among the multispan membrane
           proteins display a novel topology. Mouse prominin and
           human prominin (mouse)-like 1 (PROML1) are predicted to
           contain five membrane spanning domains, with an
           N-terminal domain exposed to the extracellular space
           followed by four, alternating small cytoplasmic and
           large extracellular, loops and a cytoplasmic C-terminal
           domain. The exact function of prominin is unknown
           although in humans defects in PROM1, the gene coding for
           prominin, cause retinal degeneration.
          Length = 807

 Score = 35.8 bits (83), Expect = 0.039
 Identities = 19/92 (20%), Positives = 43/92 (46%), Gaps = 5/92 (5%)

Query: 10  SNSMDGVALMMPPSPVTREQLQKRIESLQ--QQNRVLKVE--LETYKLQLRALQEENKNL 65
           +  +D +A  +P   + +  L+     L+  QQN V  +E  LE     ++ L E + +L
Sbjct: 629 AEELDALANNLPNGKL-KVALKNEALDLRSIQQNFVPPMESLLEKLNSNVKTLDESSNDL 687

Query: 66  RQTSVNIQAKAEQEEEFISNTLMKKIQALKKE 97
           +    ++ A+ +  + F++N   + +    K+
Sbjct: 688 QNAVKDVLAEVQAAQGFLNNNASEIVNEESKK 719


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 35.6 bits (82), Expect = 0.043
 Identities = 31/161 (19%), Positives = 60/161 (37%), Gaps = 2/161 (1%)

Query: 26  TREQLQKRIESLQQQNRV-LKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFIS 84
           T E+ +K     + Q R   K EL   + QL   ++E KNL    +     A Q+E    
Sbjct: 43  TLEEFEKEANEKRAQYRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQFELALQDEIAKL 102

Query: 85  NTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLE-QTIEQEQERVVN 143
             L        KE E L    ++  + L   L      + +++   + +   + +     
Sbjct: 103 EALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEERLKFENEKKL 162

Query: 144 KLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEAL 184
           +   ++E+ + E    + NL+   +E  E   +     + L
Sbjct: 163 EESLELEREKFEEQLHEANLDLEFKENEEQRESKWAILKKL 203


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 35.4 bits (82), Expect = 0.045
 Identities = 36/210 (17%), Positives = 84/210 (40%), Gaps = 49/210 (23%)

Query: 30  LQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMK 89
            + +I  L QQ + L ++++  + Q+     +N    Q   N +  A ++ ++  + L++
Sbjct: 172 NKDKIRELNQQIQTLDMKIDHIQQQI-KTYNKNIE-EQRKKNGENIARKQNKY--DELVE 227

Query: 90  KIQALKKEKETLAHH---YEQEEEFLTNDLSR----------KLNQLTEEKYRLE----- 131
           + + +K E E L         + E  +  L++          K+ Q  +     E     
Sbjct: 228 EAKTIKAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQKVIKMYEKGGVC 287

Query: 132 ----QTIEQEQERVV-------------NKLMRKIEKLEAETDAKQNNLEQLRREKVELE 174
               Q I +  +R+               KL   I++LE       +   +  ++ +EL+
Sbjct: 288 PTCTQQISEGPDRITKIKDKLKELQHSLEKLDTAIDELE----EIMDEFNEQSKKLLELK 343

Query: 175 NTLEQEQEALVN------KLWKRMDKLEAE 198
           N +   +++L+       K+   +++L+AE
Sbjct: 344 NKISTNKQSLITLVDKAKKVKAAIEELQAE 373



 Score = 33.8 bits (78), Expect = 0.16
 Identities = 33/175 (18%), Positives = 70/175 (40%), Gaps = 27/175 (15%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQ---EEEF-- 82
           ++L +  ++++ +   L  EL    + +         L   +  I++K EQ     +   
Sbjct: 223 DELVEEAKTIKAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQKVIKMYE 282

Query: 83  -----------ISNTLMKKIQALKKEKETLAH-------HYEQEEEFLT--NDLSRKLNQ 122
                      IS     +I  +K + + L H         ++ EE +   N+ S+KL +
Sbjct: 283 KGGVCPTCTQQISEG-PDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLE 341

Query: 123 LTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTL 177
           L  +    +Q++    ++   K+   IE+L+AE       L +L+ E  ++  T 
Sbjct: 342 LKNKISTNKQSLITLVDKA-KKVKAAIEELQAEFVDNAEELAKLQDELDKIVKTK 395


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 35.3 bits (82), Expect = 0.057
 Identities = 21/96 (21%), Positives = 46/96 (47%), Gaps = 13/96 (13%)

Query: 87  LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEK-YRLEQTIEQEQERVV--- 142
           L ++++ L++E + L        E     + + L +L ++   +L     ++ E+++   
Sbjct: 332 LKEELKELEEELKEL--------EEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETK 383

Query: 143 NKLMRKIEKLEAETDAKQNNLEQLRRE-KVELENTL 177
            KL  ++E+LE E    +  LE L  E K+ +  T+
Sbjct: 384 EKLSEELEELEEELKELKEELESLYSEGKISVNKTI 419



 Score = 32.6 bits (75), Expect = 0.38
 Identities = 19/74 (25%), Positives = 37/74 (50%), Gaps = 4/74 (5%)

Query: 114 NDLSRKLNQLTEEKYRLEQTIEQEQERVVNKL-MRKIEKLEAETDAKQN---NLEQLRRE 169
            +L  +L +L EE  ++++ +++  ++   +L   K E+LE   + K+     LE+L  E
Sbjct: 337 KELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLSEELEELEEE 396

Query: 170 KVELENTLEQEQEA 183
             EL+  LE     
Sbjct: 397 LKELKEELESLYSE 410


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 34.1 bits (79), Expect = 0.058
 Identities = 22/112 (19%), Positives = 51/112 (45%), Gaps = 15/112 (13%)

Query: 55  LRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTN 114
           LR      + L ++SV    +   + E   + L ++I+ L++EKE               
Sbjct: 93  LRMTIAAYQTLYESSVAFGMRKALQAEQGKSELEQEIKKLEEEKE--------------- 137

Query: 115 DLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQL 166
           +L +++ +L  +   +E+  E+E++    +   +I  L+ +    ++ LEQ+
Sbjct: 138 ELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIAFLKKQNQQLKSQLEQI 189



 Score = 31.0 bits (71), Expect = 0.66
 Identities = 17/82 (20%), Positives = 44/82 (53%), Gaps = 4/82 (4%)

Query: 128 YRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNK 187
           + + + ++ EQ +  ++L ++I+KLE E +  +  + +L  +   +E   E+E++    K
Sbjct: 110 FGMRKALQAEQGK--SELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQI-EEK 166

Query: 188 LWK-RMDKLEAEKRILQIKLDQ 208
                +  L+ + + L+ +L+Q
Sbjct: 167 RHADEIAFLKKQNQQLKSQLEQ 188



 Score = 29.9 bits (68), Expect = 1.6
 Identities = 15/79 (18%), Positives = 37/79 (46%), Gaps = 1/79 (1%)

Query: 30  LQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMK 89
           ++K +++ Q ++ + + E++  + +   L++    L      I+ + E+E +        
Sbjct: 112 MRKALQAEQGKSELEQ-EIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHAD 170

Query: 90  KIQALKKEKETLAHHYEQE 108
           +I  LKK+ + L    EQ 
Sbjct: 171 EIAFLKKQNQQLKSQLEQI 189


>gnl|CDD|238111 cd00187, TOP4c, DNA Topoisomerase, subtype IIA; domain A';
           bacterial DNA topoisomerase IV (C subunit, ParC),
           bacterial DNA gyrases (A subunit, GyrA),mammalian DNA
           toposiomerases II. DNA topoisomerases are essential
           enzymes that regulate the conformational changes in DNA
           topology by catalysing the concerted breakage and
           rejoining of DNA strands during normal cellular growth.
          Length = 445

 Score = 35.2 bits (82), Expect = 0.060
 Identities = 21/115 (18%), Positives = 43/115 (37%), Gaps = 13/115 (11%)

Query: 98  KETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETD 157
           KE L    +   E  T     +L +     + LE  +     + +  +   I  + +  +
Sbjct: 318 KEILQEFLDHRLEVYTRRKEYELGKAEARLHILEGLL-----KAILNIDEVINLIRSSDE 372

Query: 158 AKQNNLEQLRREKVELENTLEQEQEALVNKLWK----RMDKLEAEKRILQIKLDQ 208
           AK    + L  E  +L  +  Q    L  +L +      +KL  E + L+ +++ 
Sbjct: 373 AK----KALIEELEKLGFSEIQADAILDMRLRRLTKLEREKLLKELKELEAEIED 423


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 34.9 bits (80), Expect = 0.075
 Identities = 46/177 (25%), Positives = 70/177 (39%), Gaps = 9/177 (5%)

Query: 119 KLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLE 178
           KL QL  E  + E   ++ Q  +  K     E  E E D       +L+  + ELE   E
Sbjct: 69  KLIQLENELMQKELEHKRAQIELERKASTLAENYERELDRNLELEVRLKALE-ELEKKAE 127

Query: 179 QEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGD------TASNLSTHI 232
            E      +     DKL+AE   LQ + +  + +          D       A N  T +
Sbjct: 128 NEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESISRIKNDLSEMQCRAQNADTEL 187

Query: 233 HSLRSEVMRLRNQLANTQQEHTE--KMERYVKDEKNVREENLRLQRKLQREVERREA 287
             L SE+  LR QL   Q+E  E  K  + +  E+    +N    + L+ E++R E 
Sbjct: 188 KLLESELEELREQLEECQKELAEAEKKLQSLTSEQASSADNSVKIKHLEEELKRYEQ 244


>gnl|CDD|225747 COG3206, GumC, Uncharacterized protein involved in
           exopolysaccharide biosynthesis [Cell envelope
           biogenesis, outer membrane].
          Length = 458

 Score = 34.7 bits (80), Expect = 0.078
 Identities = 35/185 (18%), Positives = 71/185 (38%), Gaps = 16/185 (8%)

Query: 30  LQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQ-------AKAEQEEEF 82
           L +R+E L+ + +  + ++E ++ Q          L               A+A   +  
Sbjct: 200 LDERLEELRARLQEAEAQVEDFRAQHGLTDAARGQLLSEQQLSALNTQLQSARARLAQAE 259

Query: 83  ISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVV 142
                + ++  L +E   L    E  E     DL ++  Q+ ++   L   +  +  ++V
Sbjct: 260 ARLASLLQLLPLGREAAALR---EVLESPTIQDLRQQYAQVRQQIADLSTELGAKHPQLV 316

Query: 143 NKLMRKIEKLEA-ETDAKQNNLEQLRREKVEL---ENTLEQEQEALVNKLWKRMDKLEAE 198
             L  ++ +L        +  L  L  E   L   E  LE+E   L  +L  ++ KL+ +
Sbjct: 317 A-LEAQLAELRQQIAAELRQILASLPNELALLEQQEAALEKELAQLKGRL-SKLPKLQVQ 374

Query: 199 KRILQ 203
            R L+
Sbjct: 375 LRELE 379


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
           and conversion].
          Length = 161

 Score = 33.4 bits (77), Expect = 0.093
 Identities = 34/124 (27%), Positives = 56/124 (45%), Gaps = 15/124 (12%)

Query: 83  ISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVV 142
           I++ L  + + LK+E + L   YEQE E      S  + Q  +E  ++ + I+ E E  +
Sbjct: 42  IADDL-AEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEAEEEL 100

Query: 143 NKLMRKIEKLEAETDA-KQNNLEQLRREKVEL----------ENTLEQEQEALVNKLWKR 191
               R  E  EAE +A K+  LE+LR E  EL          +   E  Q+ L++     
Sbjct: 101 E---RIKEAAEAEIEAEKERALEELRAEVAELAVAIAEKLLGKKVDEAAQKDLIDAFIAE 157

Query: 192 MDKL 195
           + + 
Sbjct: 158 LGEN 161


>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
           PFL_4705 family.  Members of this protein family are
           found occasionally on plasmids such as the Pseudomonas
           putida toluene catabolic TOL plasmid pWWO_p085. Usually,
           however, they are found on the bacterial main chromosome
           in regions flanked by markers of conjugative transfer
           and/or transposition [Mobile and extrachromosomal
           element functions, Plasmid functions].
          Length = 472

 Score = 34.6 bits (80), Expect = 0.095
 Identities = 20/85 (23%), Positives = 38/85 (44%), Gaps = 8/85 (9%)

Query: 40  QNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKE 99
           + + L+  L     +  AL+ EN+ L++   +I       ++ I   +  + Q L KE E
Sbjct: 67  EVKELRKRLAKLISENEALKAENERLQKREQSI-------DQQIQQAVQSETQELTKEIE 119

Query: 100 TLAHHYEQEEEFLTNDLSRKLNQLT 124
            L    +Q +  L + L R+L  + 
Sbjct: 120 QLKSERQQLQG-LIDQLQRRLAGVL 143



 Score = 30.8 bits (70), Expect = 1.4
 Identities = 18/74 (24%), Positives = 38/74 (51%)

Query: 113 TNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVE 172
             +L ++L +L  E   L+   E+ Q+R  +   +  + +++ET      +EQL+ E+ +
Sbjct: 68  VKELRKRLAKLISENEALKAENERLQKREQSIDQQIQQAVQSETQELTKEIEQLKSERQQ 127

Query: 173 LENTLEQEQEALVN 186
           L+  ++Q Q  L  
Sbjct: 128 LQGLIDQLQRRLAG 141


>gnl|CDD|233791 TIGR02231, TIGR02231, conserved hypothetical protein.  This family
           consists of proteins over 500 amino acids long in
           Caenorhabditis elegans and several bacteria (Pseudomonas
           aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans,
           etc.). The function is unknown.
          Length = 525

 Score = 34.3 bits (79), Expect = 0.10
 Identities = 26/134 (19%), Positives = 47/134 (35%), Gaps = 12/134 (8%)

Query: 64  NLRQTSVNIQA--KAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLN 121
             R     +    K  +E E     L  +  ALK   + L    E   E + +   R   
Sbjct: 65  TSRPDPERLAELRKQIRELEAELRDLEDRGDALKALAKFLEDIREGLTEPIKDSAKRNEP 124

Query: 122 ---------QLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVE 172
                         +     T ++E ER + +L +++ +L+ E +A      Q R   V 
Sbjct: 125 DLKEWFQAFDFNGSEIERLLTEDREAERRIRELEKQLSELQNELNALLTGKSQ-RSHTVL 183

Query: 173 LENTLEQEQEALVN 186
           +     ++ EA +N
Sbjct: 184 VRLEAPEDAEAELN 197


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 34.3 bits (79), Expect = 0.12
 Identities = 22/110 (20%), Positives = 46/110 (41%), Gaps = 7/110 (6%)

Query: 87  LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
           L K+ + + +E +  A    +       +L  +      ++ R E+   +E +R   +L+
Sbjct: 31  LAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARRER---EELQREEERLV 87

Query: 147 RKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLE 196
           +K E+L    DA+   L+ L  +  E E  L   +  L     +  ++L 
Sbjct: 88  QKEEQL----DARAEKLDNLENQLEEREKALSARELELEELEKQLDNELY 133



 Score = 33.5 bits (77), Expect = 0.19
 Identities = 27/153 (17%), Positives = 56/153 (36%), Gaps = 8/153 (5%)

Query: 53  LQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFL 112
           + L+  Q   K   +     Q +AE++ E       + +   + ++   A    +E +  
Sbjct: 23  VLLKKRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQRE 82

Query: 113 TNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVE 172
              L +K  QL     +L+    Q        L  + + L A     +   +QL  E   
Sbjct: 83  EERLVQKEEQLDARAEKLDNLENQ--------LEEREKALSARELELEELEKQLDNELYR 134

Query: 173 LENTLEQEQEALVNKLWKRMDKLEAEKRILQIK 205
           +     ++   L+ KL     + E  +R+ +I+
Sbjct: 135 VAGLTPEQARKLLLKLLDAELEEEKAQRVKKIE 167



 Score = 30.1 bits (68), Expect = 2.4
 Identities = 21/94 (22%), Positives = 35/94 (37%), Gaps = 4/94 (4%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
           EQL  R E L      L+   +    +   L+E  K L      +   A    E     L
Sbjct: 91  EQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRV---AGLTPEQARKLL 147

Query: 88  MKKIQA-LKKEKETLAHHYEQEEEFLTNDLSRKL 120
           +K + A L++EK       E+E +      ++ +
Sbjct: 148 LKLLDAELEEEKAQRVKKIEEEADLEAERKAQNI 181


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 33.0 bits (76), Expect = 0.12
 Identities = 25/107 (23%), Positives = 56/107 (52%), Gaps = 8/107 (7%)

Query: 57  ALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDL 116
           A +   K L +    +QA+ +++E+     L K+ Q L+K+  TL+   E+  +    +L
Sbjct: 32  AGKAAQKQLEKEFKKLQAELQKKEK----ELQKEEQKLQKQAATLS---EEARKAKQQEL 84

Query: 117 SRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNL 163
            +K  +L +++   +Q ++Q+Q+ ++  +  KI+K   +  AK+   
Sbjct: 85  QQKQQELQQKQQAAQQELQQKQQELLQPIYDKIDKA-IKEVAKEKGY 130



 Score = 30.3 bits (69), Expect = 1.0
 Identities = 17/79 (21%), Positives = 35/79 (44%), Gaps = 3/79 (3%)

Query: 87  LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
             K+++   K+ +      E+E +     L ++   L+EE     +  +QE ++   +L 
Sbjct: 36  AQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEAR---KAKQQELQQKQQELQ 92

Query: 147 RKIEKLEAETDAKQNNLEQ 165
           +K +  + E   KQ  L Q
Sbjct: 93  QKQQAAQQELQQKQQELLQ 111



 Score = 29.1 bits (66), Expect = 2.0
 Identities = 17/67 (25%), Positives = 34/67 (50%), Gaps = 2/67 (2%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
            ++LQK  + LQ+Q   L  E    K + + LQ++ + L+Q     Q + +Q+++ +   
Sbjct: 55  EKELQKEEQKLQKQAATLSEE--ARKAKQQELQQKQQELQQKQQAAQQELQQKQQELLQP 112

Query: 87  LMKKIQA 93
           +  KI  
Sbjct: 113 IYDKIDK 119



 Score = 28.8 bits (65), Expect = 3.1
 Identities = 22/100 (22%), Positives = 52/100 (52%), Gaps = 13/100 (13%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
           ++QL+K  + LQ + +  + EL+    + + LQ++   L +       KA+Q+E      
Sbjct: 37  QKQLEKEFKKLQAELQKKEKELQK---EEQKLQKQAATLSEE----ARKAKQQE------ 83

Query: 87  LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEE 126
           L +K Q L+++++      +Q+++ L   +  K+++  +E
Sbjct: 84  LQQKQQELQQKQQAAQQELQQKQQELLQPIYDKIDKAIKE 123



 Score = 27.6 bits (62), Expect = 6.9
 Identities = 17/82 (20%), Positives = 38/82 (46%), Gaps = 1/82 (1%)

Query: 132 QTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELE-NTLEQEQEALVNKLWK 190
           +  +++ E+   KL  +++K E E   ++  L++      E      +QE +    +L +
Sbjct: 34  KAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQ 93

Query: 191 RMDKLEAEKRILQIKLDQPVSD 212
           +    + E +  Q +L QP+ D
Sbjct: 94  KQQAAQQELQQKQQELLQPIYD 115


>gnl|CDD|223497 COG0420, SbcD, DNA repair exonuclease [DNA replication,
           recombination, and repair].
          Length = 390

 Score = 34.0 bits (78), Expect = 0.12
 Identities = 23/152 (15%), Positives = 48/152 (31%), Gaps = 16/152 (10%)

Query: 60  EENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRK 119
           EE +      V          E +   L ++++      E      + E+E L  D    
Sbjct: 236 EEGERKGVVLVEFSGGKLWRFEELFVPLFERLEVDVLNGE------DLEDELLKLDREED 289

Query: 120 LNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQ 179
                  +  +E   +   ER+  +++   E++E      +     L  E    E     
Sbjct: 290 AKDEPVVRLTIEGETDILDERLKERILALYERIERLELLLEERATDLAEEPESSE----- 344

Query: 180 EQEALVNKLWKRMDKLEAEKRILQIKLDQPVS 211
                + +L   +D  EA + +L  ++     
Sbjct: 345 -----LEELLSELDVEEAFEELLADEVLLDRE 371


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 34.2 bits (79), Expect = 0.13
 Identities = 22/104 (21%), Positives = 42/104 (40%), Gaps = 12/104 (11%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAK-AEQEEEFISNT 86
             LQ+ + +L+QQ  +   E    +    A Q+E   L   +  ++ K  E E +     
Sbjct: 145 HALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQL--EQ 202

Query: 87  LMKKIQALKKEKETLAHHYEQEEEF-----LTNDLSRKL--NQL 123
           L +  +A +  +E      E  ++      L+ + +R L   QL
Sbjct: 203 LQE--KAAETSQERKQKRKEITDQAAKRLELSEEETRILIDQQL 244



 Score = 33.0 bits (76), Expect = 0.28
 Identities = 26/120 (21%), Positives = 46/120 (38%), Gaps = 23/120 (19%)

Query: 83  ISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVV 142
           + + L +++  LK++ E  A    Q +               +E   LE     E E   
Sbjct: 143 LLHALQQEVLTLKQQLELQAREKAQSQA--------LAEAQQQELVALEGLA-AELEEKQ 193

Query: 143 NKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRIL 202
            +L  ++E+L+ +        +Q R+E          +Q A      KR++  E E RIL
Sbjct: 194 QELEAQLEQLQEKAAETSQERKQKRKEI--------TDQAA------KRLELSEEETRIL 239



 Score = 31.8 bits (73), Expect = 0.64
 Identities = 18/61 (29%), Positives = 24/61 (39%), Gaps = 2/61 (3%)

Query: 120 LNQLTEEKYRLEQTIEQEQ--ERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTL 177
           L QL  E    +     E   E+++ KL RK+ K     +     L QLRR  V  E   
Sbjct: 838 LEQLVNELTDADGESHAEHVREQLIAKLQRKLRKATDNAERSFEILAQLRRAGVTPEEFA 897

Query: 178 E 178
            
Sbjct: 898 S 898



 Score = 30.3 bits (69), Expect = 2.0
 Identities = 26/114 (22%), Positives = 57/114 (50%), Gaps = 15/114 (13%)

Query: 46  VELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHY 105
           V  E  +  L ALQ+E   L+Q  + +QA+ + + + ++    +++ AL+     LA   
Sbjct: 135 VPPEDPENLLHALQQEVLTLKQQ-LELQAREKAQSQALAEAQQQELVALEG----LAAEL 189

Query: 106 EQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKL---EAET 156
           E++++    +L  +L QL E+     +T ++ +++      +  ++L   E ET
Sbjct: 190 EEKQQ----ELEAQLEQLQEKA---AETSQERKQKRKEITDQAAKRLELSEEET 236



 Score = 28.4 bits (64), Expect = 9.1
 Identities = 16/78 (20%), Positives = 35/78 (44%), Gaps = 10/78 (12%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
           + Q Q   E+ QQ+   L+      + + + L+ + + L++     +   E++++    T
Sbjct: 165 KAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQE--KAAETSQERKQKRKEIT 222

Query: 87  LMKKIQALKK----EKET 100
                QA K+    E+ET
Sbjct: 223 D----QAAKRLELSEEET 236


>gnl|CDD|217135 pfam02601, Exonuc_VII_L, Exonuclease VII, large subunit.  This
           family consist of exonuclease VII, large subunit
           EC:3.1.11.6 This enzyme catalyzes exonucleolytic
           cleavage in either 5'->3' or 3'->5' direction to yield
           5'-phosphomononucleotides. This exonuclease VII enzyme
           is composed of one large subunit and 4 small ones.
          Length = 295

 Score = 33.8 bits (78), Expect = 0.13
 Identities = 22/100 (22%), Positives = 43/100 (43%), Gaps = 2/100 (2%)

Query: 104 HYEQEEEFLTNDLSRKL-NQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNN 162
              Q+ E L   LSR L N+L +E+ RL    E+ +      L +  E+L    D   ++
Sbjct: 147 ELLQKLEGLEQRLSRALKNRLEKEQDRLNLLRERLKSLSRRLLEQHEERLAELRDRLISS 206

Query: 163 LEQ-LRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRI 201
           ++  L R++  LE  +   +    + L  ++       ++
Sbjct: 207 IQNLLSRKQSRLERLILNRELEKNSLLENKLATANLTAQL 246


>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase.  This model represents the
           seryl-tRNA synthetase found in most organisms. This
           protein is a class II tRNA synthetase, and is recognized
           by the pfam model tRNA-synt_2b. The seryl-tRNA
           synthetases of two archaeal species, Methanococcus
           jannaschii and Methanobacterium thermoautotrophicum,
           differ considerably and are included in a different
           model [Protein synthesis, tRNA aminoacylation].
          Length = 418

 Score = 33.9 bits (78), Expect = 0.13
 Identities = 34/141 (24%), Positives = 57/141 (40%), Gaps = 20/141 (14%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
           R       ESL+ +   + ++LE    +L AL +E K L      +QAK         N 
Sbjct: 8   RNNPDLVKESLKARGLSVDIDLE----KLIALDDERKKLLSEIEELQAK--------RNE 55

Query: 87  LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
           L K+I   K +K+      ++E      +L  +L +L+     LE  ++ +   + N   
Sbjct: 56  LSKQIGKAKGQKKDKIEEIKKE----LKELKEELTELSAALKALEAELQDKLLSIPNIPH 111

Query: 147 RKIEKLEAETDAKQNNLEQLR 167
             +   + E D    NLE  R
Sbjct: 112 ESVPVGKDEED----NLEVKR 128



 Score = 28.9 bits (65), Expect = 5.0
 Identities = 23/94 (24%), Positives = 46/94 (48%), Gaps = 7/94 (7%)

Query: 126 EKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALV 185
               LE+ I  + ER   KL+ +IE+L+A    K+N L +   +    +    +E +  +
Sbjct: 25  VDIDLEKLIALDDERK--KLLSEIEELQA----KRNELSKQIGKAKGQKKDKIEEIKKEL 78

Query: 186 NKLWKRMDKLEAEKRILQIKLDQPVSD-PSSPHD 218
            +L + + +L A  + L+ +L   +   P+ PH+
Sbjct: 79  KELKEELTELSAALKALEAELQDKLLSIPNIPHE 112


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 34.3 bits (79), Expect = 0.14
 Identities = 33/170 (19%), Positives = 60/170 (35%), Gaps = 20/170 (11%)

Query: 30   LQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMK 89
            L+K + ++  +      E      +L  L E    L++TS   +   ++    +     K
Sbjct: 933  LKKLLNNIDLEEGPSI-EYVKLP-ELNKLHEVESKLKETSEEYEDLLKKSTILVR-EGNK 989

Query: 90   KIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRL-----------EQTIEQEQ 138
                LK  K+ LA   +Q        L     QL E    +            ++ E   
Sbjct: 990  ANSELKNFKKELAELSKQYG-----ALQESTKQLKELPVEVAELQSASKIISSESTELSI 1044

Query: 139  ERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKL 188
             + + KL   +     +  A+   L +LRRE   L++    + E+  N L
Sbjct: 1045 LKPLQKLKGLLLLENNQLQARYKAL-KLRRENSLLDDKQLYQLESTENLL 1093



 Score = 32.4 bits (74), Expect = 0.45
 Identities = 40/285 (14%), Positives = 95/285 (33%), Gaps = 17/285 (5%)

Query: 29   QLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLM 88
            +  K+I+ +Q   R+ ++     K +L    +   +L  +    ++      + +  T+ 
Sbjct: 769  KRIKKIQVIQHGFRLRRLVDYELKWRLFIKLQPLLSLLGSRKEYRSYLACIIK-LQKTIK 827

Query: 89   KKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQ---TIEQEQERVVNKL 145
            ++ +  + E+   +       E L     R L    ++++ L +      Q  +RV    
Sbjct: 828  REKKLRETEEVEFSLK----AEVLIQKFGRSLK--AKKRFSLLKKETIYLQSAQRVELAE 881

Query: 146  MRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQ-I 204
             +  E            L  L  E   +E   +     L+  L  + + +   K++L  I
Sbjct: 882  RQLQELKIDVKSISSLKLVNLELESEIIELK-KSLSSDLIENLEFKTELIARLKKLLNNI 940

Query: 205  KLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQE---HTEKMERYV 261
             L++  S         N        + +     E   L  +     +E      +++ + 
Sbjct: 941  DLEEGPSIEYVKLPELNKLHEVE--SKLKETSEEYEDLLKKSTILVREGNKANSELKNFK 998

Query: 262  KDEKNVREENLRLQRKLQREVERREALCRHLSESESSLEMEEERH 306
            K+   + ++   LQ   ++  E    +    S S+       E  
Sbjct: 999  KELAELSKQYGALQESTKQLKELPVEVAELQSASKIISSESTELS 1043



 Score = 31.2 bits (71), Expect = 1.1
 Identities = 25/169 (14%), Positives = 61/169 (36%), Gaps = 16/169 (9%)

Query: 26   TREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISN 85
               +L+  I  L++   +    +E  + +   +    K L     NI  +     E++  
Sbjct: 900  VNLELESEIIELKKS--LSSDLIENLEFKTELIARLKKLLN----NIDLEEGPSIEYVKL 953

Query: 86   TLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEK--------YRLEQTIEQE 137
              + K+  ++ + +  +  YE   +  T  +       +E K           +    QE
Sbjct: 954  PELNKLHEVESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQE 1013

Query: 138  QERVVNKLMRKIEKLEAETDAKQNNLEQLRREK--VELENTLEQEQEAL 184
              + + +L  ++ +L++ +    +   +L   K   +L+  L  E   L
Sbjct: 1014 STKQLKELPVEVAELQSASKIISSESTELSILKPLQKLKGLLLLENNQL 1062


>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family.  This
           family consists of the 116kDa V-type ATPase (vacuolar
           (H+)-ATPases) subunits, as well as V-type ATP synthase
           subunit i. The V-type ATPases family are proton pumps
           that acidify intracellular compartments in eukaryotic
           cells for example yeast central vacuoles,
           clathrin-coated and synaptic vesicles. They have
           important roles in membrane trafficking processes. The
           116kDa subunit (subunit a) in the V-type ATPase is part
           of the V0 functional domain responsible for proton
           transport. The a subunit is a transmembrane glycoprotein
           with multiple putative transmembrane helices it has a
           hydrophilic amino terminal and a hydrophobic carboxy
           terminal. It has roles in proton transport and assembly
           of the V-type ATPase complex. This subunit is encoded by
           two homologous gene in yeast VPH1 and STV1.
          Length = 707

 Score = 33.9 bits (78), Expect = 0.14
 Identities = 18/50 (36%), Positives = 25/50 (50%), Gaps = 3/50 (6%)

Query: 138 QERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELE---NTLEQEQEAL 184
             +    L  +I  LEAE    + NLE L +E  ELE   N L++E+  L
Sbjct: 65  PSKEFLDLEEEILDLEAEIKEVEENLESLEKEINELEEWLNVLDEEKSFL 114



 Score = 31.2 bits (71), Expect = 1.0
 Identities = 38/221 (17%), Positives = 80/221 (36%), Gaps = 56/221 (25%)

Query: 22  PSPVTREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEE 81
           P       L++ I  L+ + + ++  LE+ + ++  L+E    L +    +    E+  E
Sbjct: 64  PPSKEFLDLEEEILDLEAEIKEVEENLESLEKEINELEEWLNVLDEEKSFLDENLEELSE 123

Query: 82  FISNTLM----------------------KKIQALKKEKETLAHHY------EQEEEF-- 111
             SN  +                      +K++A ++E       Y      E+  E   
Sbjct: 124 L-SNLDIDFKYLRGAEGLKLGFVAGVINREKLEAFERELWRACRGYIRQAEIEEPLEDPK 182

Query: 112 ---------LTNDLSRKLNQLTEEKYRLEQTIEQEQER--VVNKLMRKIEKLEAETDAKQ 160
                       DL +    L    + L    E E ER  +++K+ ++IE+L+       
Sbjct: 183 KTVFIIFFVGKEDLDKVKKILDSFGFELYDVPETEGERSELISKVNKRIEELQ------- 235

Query: 161 NNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRI 201
                  R   + E+ LE+    + ++L    +++  EK +
Sbjct: 236 -------RVLEQTESHLEKVLVKIADELLAWDEQVSKEKAV 269


>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
           envelope biogenesis, outer membrane].
          Length = 835

 Score = 33.9 bits (78), Expect = 0.15
 Identities = 31/162 (19%), Positives = 63/162 (38%), Gaps = 37/162 (22%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
            Q  +R+ +L   +R L        LQL       + +R+    ++          S  L
Sbjct: 75  NQQTERLNALASDDRQLA----NLLLQLLQSS---RTIREQIAVLRGSLLL-----SRIL 122

Query: 88  MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMR 147
           ++++  L +  +        +E+F       + + L  EK              +N L  
Sbjct: 123 LQQLGPLPEAGQ-------PQEQFEVTQ---ERDALQAEK------------AYINALEG 160

Query: 148 KIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLW 189
           + E+L   T   ++ L+Q+   + EL N+L  ++EA+  +L 
Sbjct: 161 QAEQL---TAEVRDILDQILDTRRELLNSLLSQREAISLQLN 199


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 34.0 bits (77), Expect = 0.17
 Identities = 37/178 (20%), Positives = 73/178 (41%), Gaps = 18/178 (10%)

Query: 28   EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
            E+ +K  E+  +   + K E E  K++    +E  +  +      + K  +EE  I    
Sbjct: 1610 EEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAE----ELKKAEEENKIKAAE 1665

Query: 88   MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMR 147
              K     K+K   A   E++E+     L ++     EE  + E+  ++E E       +
Sbjct: 1666 EAKKAEEDKKKAEEAKKAEEDEKKAAEALKKE----AEEAKKAEELKKKEAEEK-----K 1716

Query: 148  KIEKLEAETDAKQNNLEQLRREKVE-----LENTLEQEQEALVNKLWKRMDKLEAEKR 200
            K E+L+   +  +   E+ ++E  E      E   ++E++  +  L K  +K   E R
Sbjct: 1717 KAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIR 1774



 Score = 32.4 bits (73), Expect = 0.45
 Identities = 37/192 (19%), Positives = 81/192 (42%), Gaps = 18/192 (9%)

Query: 32   KRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQE------EEFISN 85
            K+ + L++   + K E E  K +     EE+KN+         KAE+       + +   
Sbjct: 1546 KKADELKKAEELKKAE-EKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEE 1604

Query: 86   TLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKL 145
              MK  +A K E+  +     ++ E     + +   +  EEK + E+  + E+E  +   
Sbjct: 1605 KKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAA 1664

Query: 146  M---------RKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLE 196
                      +K E+ +   + ++   E L++E  E +   E +++    K  K+ ++L+
Sbjct: 1665 EEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEK--KKAEELK 1722

Query: 197  AEKRILQIKLDQ 208
              +   +IK ++
Sbjct: 1723 KAEEENKIKAEE 1734



 Score = 29.0 bits (64), Expect = 6.7
 Identities = 37/181 (20%), Positives = 82/181 (45%), Gaps = 15/181 (8%)

Query: 28   EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
            EQL+K+    +++   LK   E  K+  +A +E  K         +AK  +E+E  +   
Sbjct: 1636 EQLKKKEAEEKKKAEELKKAEEENKI--KAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEA 1693

Query: 88   MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMR 147
            +KK     K+ E L     +E+        +K  +L + +   +   E+ + +   +  +
Sbjct: 1694 LKKEAEEAKKAEELKKKEAEEK--------KKAEELKKAEEENKIKAEEAK-KEAEEDKK 1744

Query: 148  KIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMD---KLEAEKRILQI 204
            K E+ + + + K   +  L++E+ +    + +E+EA++ +     D   ++E +K+I  I
Sbjct: 1745 KAEEAKKDEEEK-KKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDI 1803

Query: 205  K 205
             
Sbjct: 1804 F 1804


>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 559

 Score = 33.7 bits (78), Expect = 0.19
 Identities = 29/180 (16%), Positives = 74/180 (41%), Gaps = 24/180 (13%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQE--EEFISN 85
           E L +R+E L+Q+ + LK +      + +  ++E K  ++    +      +  +E    
Sbjct: 372 EVLLQRVEQLEQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKVLKEATRQ 431

Query: 86  TL--MKK-----IQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLT---EEKYRLEQTIE 135
            L  +K      +++LK ++++L       E    ++     + +    E +   E+ + 
Sbjct: 432 DLELLKNVWGEILESLKAQRKSLRALLVNSEPVAASE-----DTVVLAFEYEIHFEKAML 486

Query: 136 QEQER-VVNKLMRKIEK------LEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKL 188
            ++    +  ++ K+           E + ++   E L++ K E E   E+E+  L+ + 
Sbjct: 487 DKELNDTIENILSKLLGKSVEIVAVPEDEWQKIREEFLQKHKNEEEEEEEEEELPLIPEA 546


>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 548

 Score = 33.7 bits (77), Expect = 0.20
 Identities = 25/175 (14%), Positives = 58/175 (33%), Gaps = 10/175 (5%)

Query: 40  QNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKE 99
           Q    K   +T ++++   + E +     +   + + E E+  I      + +  + E+E
Sbjct: 254 QQPAGKTAEQTREVKIILAETEAEVAAWKA---ETRREAEQAEILAEQAIQEEKAQAEQE 310

Query: 100 TLAHHYEQEEEFLTNDLSRKLNQLTEEKYR--LEQTIEQEQERVVNKLMRKIEKLEAETD 157
                  +  E     + R+     E + R       +++ +          E + A+ +
Sbjct: 311 VQHAKALEAREMRVGLIERQKETELEPQERSYFINAAQRQAQEEAKAAANIAEAIGAQAE 370

Query: 158 AKQNNLEQLRREKVELENTLEQEQEALVNKL-----WKRMDKLEAEKRILQIKLD 207
           A      +    +   +  L    EA   +           K EAE +  +IK +
Sbjct: 371 AAVETARETEEAERAEQAALVAAAEAAEQEQVEIAVRAEAAKAEAEAQAAEIKAE 425



 Score = 31.3 bits (71), Expect = 1.1
 Identities = 26/157 (16%), Positives = 52/157 (33%), Gaps = 11/157 (7%)

Query: 32  KRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKI 91
           +RI  + Q   + + E E    +   +     N     V ++ + +   +    T   KI
Sbjct: 214 RRIAQVLQDAEIAENEAE----KETEIAIAEANRDAKLVELEVEQQPAGKTAEQTREVKI 269

Query: 92  QALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEK 151
              + E E  A   E   E      + +   L E+  + E+   +++ +    L  +  +
Sbjct: 270 ILAETEAEVAAWKAETRRE------AEQAEILAEQAIQEEKAQAEQEVQHAKALEAREMR 323

Query: 152 LEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKL 188
           +      K+  LE   R    +     Q QE      
Sbjct: 324 VGLIERQKETELEPQERS-YFINAAQRQAQEEAKAAA 359


>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
            This is a family of proteins that form part of the CAZ
           (cytomatrix at the active zone) complex which is
           involved in determining the site of synaptic vesicle
           fusion. The C-terminus is a PDZ-binding motif that binds
           directly to RIM (a small G protein Rab-3A effector). The
           family also contains four coiled-coil domains.
          Length = 774

 Score = 33.4 bits (76), Expect = 0.20
 Identities = 44/210 (20%), Positives = 81/210 (38%), Gaps = 8/210 (3%)

Query: 4   SASESDSNSMDGVALMMPPSPVTREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENK 63
           SA  +     D  A  +         LQ  +E+L  Q+  ++  L+  K  L   ++E  
Sbjct: 280 SALAAMKAKCDRAAQELSRKKTELLGLQTELETLANQDSDMRQHLDKLKEDLTRAEQEKA 339

Query: 64  NLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFL------TNDLS 117
            L QT V+      + +          +QA ++EK T A   E   +           L 
Sbjct: 340 IL-QTEVDALRYELERKHNTLTKKTASLQAAQEEKATYAGEIEDMRDRYEKTERKLRVLQ 398

Query: 118 RKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAET-DAKQNNLEQLRREKVELENT 176
           +K+  L E   R E+ +++E+ER+ +           E  +      E++     E  + 
Sbjct: 399 KKIENLQETFRRKERRLKEEKERLRSLQTDTNTDTALEKLEKALAEKERIIERLKEQRDR 458

Query: 177 LEQEQEALVNKLWKRMDKLEAEKRILQIKL 206
            E+ ++       K  + L+ E + LQ+KL
Sbjct: 459 DERYEQEEFETYKKEFEDLKEEVQNLQLKL 488


>gnl|CDD|236842 PRK11091, PRK11091, aerobic respiration control sensor protein
           ArcB; Provisional.
          Length = 779

 Score = 33.4 bits (77), Expect = 0.22
 Identities = 18/74 (24%), Positives = 35/74 (47%), Gaps = 4/74 (5%)

Query: 227 NLSTHIHSLRSEVMRLRNQLANTQ-QEHTEKMERYVKDEKNVREENLRLQRKLQREVERR 285
            LS  +  L  E MR R+   N Q +++  ++ + + + +   E       +L+ E++ R
Sbjct: 86  RLSRLVAKL--EEMRERDLELNVQLKDNIAQLNQEIAEREKAEEARQEAFEQLKNEIKER 143

Query: 286 EALCRHLSESESSL 299
           E     L + +SSL
Sbjct: 144 EETQIELEQ-QSSL 156


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 33.5 bits (77), Expect = 0.22
 Identities = 34/183 (18%), Positives = 70/183 (38%), Gaps = 21/183 (11%)

Query: 115 DLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELE 174
           DL +  N+  +EK+  +  I+Q    +  KL ++++KLE       N LE+   E  ELE
Sbjct: 262 DLEKLFNEALDEKFERD-KIKQLASELEKKLEKELKKLE-------NKLEKQEDELEELE 313

Query: 175 NTLEQEQ---EALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTH 231
              E+ +   E L   L   +++     R+     ++ +            + A      
Sbjct: 314 KAAEELRQKGELLYANLQL-IEEGLKSVRLADFYGNEEIKIELDK-SKTPSENAQRYFKK 371

Query: 232 IHSLRSEVMRLRNQLANTQQE--------HTEKMERYVKDEKNVREENLRLQRKLQREVE 283
              L+   + L  QL+  ++            +     K  + +REE +       ++ +
Sbjct: 372 YKKLKGAKVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKK 431

Query: 284 RRE 286
           R++
Sbjct: 432 RKK 434



 Score = 32.7 bits (75), Expect = 0.30
 Identities = 32/168 (19%), Positives = 59/168 (35%), Gaps = 27/168 (16%)

Query: 71  NIQAKAEQEEEFISNT---LMKKIQALKKEKETL---AHHYEQEEEFLTNDLS------- 117
            I+  A + E+ +      L  K++  + E E L   A    Q+ E L  +L        
Sbjct: 279 KIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQKGELLYANLQLIEEGLK 338

Query: 118 --RKLN--QLTEEKYRLEQ-----TIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRR 168
             R  +     E K  L++        Q   +   KL      L+ +    +  +     
Sbjct: 339 SVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYYES 398

Query: 169 -----EKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVS 211
                EK E +  +E+ +E L+ +   +  K + +K+    K    VS
Sbjct: 399 AKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEKFRWFVS 446


>gnl|CDD|236270 PRK08471, flgK, flagellar hook-associated protein FlgK; Validated.
          Length = 613

 Score = 33.5 bits (77), Expect = 0.22
 Identities = 21/81 (25%), Positives = 41/81 (50%), Gaps = 16/81 (19%)

Query: 107 QEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVN----------KLMRKIEKLEAET 156
           Q+ E LTN++     +L      L++ + +E +  V+          ++ ++I+++EA  
Sbjct: 139 QKTETLTNNIKDTRERL----DTLQKKVNEELKVTVDEINSLGKQIAEINKQIKEVEAGK 194

Query: 157 DAKQNNLEQLRREKVELENTL 177
             K  N  +LR ++ ELE TL
Sbjct: 195 TLKHAN--ELRDKRDELELTL 213


>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD.  This
           model describes the DndB protein encoded by an operon
           associated with a sulfur-containing modification to DNA.
           The operon is sporadically distributed in bacteria, much
           like some restriction enzyme operons. DndD is described
           as a putative ATPase. The small number of examples known
           so far include species from among the Firmicutes,
           Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
           metabolism, Restriction/modification].
          Length = 650

 Score = 33.1 bits (76), Expect = 0.24
 Identities = 26/99 (26%), Positives = 41/99 (41%), Gaps = 9/99 (9%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
           EQ+ + +E L +    L       +  LR L+   + +      +  K +Q+    +  L
Sbjct: 421 EQIAQLLEELGEAQNELFRSEAEIEELLRQLETLKEAIEALRKTLDEKTKQKIN--AFEL 478

Query: 88  MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEE 126
            + I    K K+TL       +EF    L RKL QL EE
Sbjct: 479 ERAITIADKAKKTL-------KEFREKLLERKLQQLEEE 510


>gnl|CDD|227605 COG5280, COG5280, Phage-related minor tail protein [Function
           unknown].
          Length = 634

 Score = 33.3 bits (76), Expect = 0.25
 Identities = 13/108 (12%), Positives = 38/108 (35%), Gaps = 6/108 (5%)

Query: 85  NTLMKKIQALKKEKETL---AHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERV 141
             L ++IQ L  +            +++     D      ++ EE  RL     +  E  
Sbjct: 54  EALAQRIQLLTGQVTLTVKNLDDLAKQQA--QVDAQFGSGEIGEEAERLFTREIRVTEGG 111

Query: 142 VNKLMRKIEKLEAETDAKQNNLEQLRREKVEL-ENTLEQEQEALVNKL 188
           +   + ++ +           L+      V   ++++ +  E++ +++
Sbjct: 112 LTGSLSELNEPTVSVREHGKGLKDEELRFVTGGDSSIAEVFESIKDRV 159


>gnl|CDD|237619 PRK14135, recX, recombination regulator RecX; Provisional.
          Length = 263

 Score = 32.9 bits (76), Expect = 0.25
 Identities = 24/95 (25%), Positives = 44/95 (46%), Gaps = 17/95 (17%)

Query: 126 EKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRR------------EKVEL 173
            +Y  E  IE  Q ++  KL++K +KL  +   KQ  ++ L              E+++L
Sbjct: 147 SEYTEEDQIEVAQ-KLAEKLLKKYQKLPFKA-LKQKIIQSLLTKGFSYEVIKAALEELDL 204

Query: 174 ENTLEQEQEALV---NKLWKRMDKLEAEKRILQIK 205
           E   E+EQE L     K +++  K +  +   ++K
Sbjct: 205 EQDEEEEQELLQKELEKAYRKYSKYDGYELKQKLK 239


>gnl|CDD|217933 pfam04156, IncA, IncA protein.  Chlamydia trachomatis is an
           obligate intracellular bacterium that develops within a
           parasitophorous vacuole termed an inclusion. The
           inclusion is non-fusogenic with lysosomes but intercepts
           lipids from a host cell exocytic pathway. Initiation of
           chlamydial development is concurrent with modification
           of the inclusion membrane by a set of C.
           trachomatis-encoded proteins collectively designated
           Incs. One of these Incs, IncA, is functionally
           associated with the homotypic fusion of inclusions. This
           family probably includes members of the wider Inc family
           rather than just IncA.
          Length = 186

 Score = 32.4 bits (74), Expect = 0.27
 Identities = 28/121 (23%), Positives = 53/121 (43%), Gaps = 9/121 (7%)

Query: 54  QLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLT 113
           +L  LQ E   L+Q    +Q + E  EE I+  L  +++ LK++        +   E L 
Sbjct: 73  KLEELQGELSELKQQLSELQEELEDLEERIA-ELESELEDLKED-------LQLLRELLK 124

Query: 114 NDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVEL 173
             L  +L  L E    L + + + ++ +  ++    E+LE   +  Q   E ++  +  L
Sbjct: 125 -SLEERLESLEESIKELAKELRELRQDLREEVEELREELERLQENLQRLQEAIQELQSLL 183

Query: 174 E 174
           E
Sbjct: 184 E 184



 Score = 30.1 bits (68), Expect = 1.5
 Identities = 27/142 (19%), Positives = 57/142 (40%), Gaps = 19/142 (13%)

Query: 159 KQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHD 218
            +  ++ +R +K+E       E +  +++L + ++ LE     L+ +L+    D     +
Sbjct: 62  LKAPVQSVRPQKLEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRE 121

Query: 219 MNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKL 278
           +              SL   +  L   +    +E  E         +++REE   L+ +L
Sbjct: 122 LLK------------SLEERLESLEESIKELAKELRE-------LRQDLREEVEELREEL 162

Query: 279 QREVERREALCRHLSESESSLE 300
           +R  E  + L   + E +S LE
Sbjct: 163 ERLQENLQRLQEAIQELQSLLE 184



 Score = 29.3 bits (66), Expect = 2.3
 Identities = 31/118 (26%), Positives = 58/118 (49%), Gaps = 6/118 (5%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
           E+LQ  +  L+QQ   L+ ELE  + ++  L+ E ++L++    ++   +  EE +  +L
Sbjct: 75  EELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEERLE-SL 133

Query: 88  MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKL 145
            + I+ L KE   L     +E E L  +L R      +E  +  Q   QE + ++ +L
Sbjct: 134 EESIKELAKELRELRQDLREEVEELREELER-----LQENLQRLQEAIQELQSLLEQL 186


>gnl|CDD|218434 pfam05103, DivIVA, DivIVA protein.  The Bacillus subtilis divIVA1
           mutation causes misplacement of the septum during cell
           division, resulting in the formation of small, circular,
           anucleate mini-cells. Inactivation of divIVA produces a
           mini-cell phenotype, whereas overproduction of DivIVA
           results in a filamentation phenotype. These proteins
           appear to contain coiled-coils.
          Length = 131

 Score = 31.5 bits (72), Expect = 0.27
 Identities = 22/95 (23%), Positives = 44/95 (46%), Gaps = 4/95 (4%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
           E L K  E L+++   L+ +LE YK     LQ+     ++T+  ++A A++E E I    
Sbjct: 35  EALYKENEELKEEIERLEEKLEEYKELEETLQKTLVVAQETAEEVKANAQKEAELIIKEA 94

Query: 88  MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQ 122
             K + +  +    A+   ++     ++L R+   
Sbjct: 95  EAKAERIVND----ANEEAKKLATEYDELKRQSRL 125



 Score = 28.8 bits (65), Expect = 2.4
 Identities = 28/114 (24%), Positives = 52/114 (45%), Gaps = 11/114 (9%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
           +Q+ K  E+L ++N  LK E+E  + +L   +E  + L++T V  Q  AE+ +       
Sbjct: 28  DQVIKDYEALYKENEELKEEIERLEEKLEEYKELEETLQKTLVVAQETAEEVK------- 80

Query: 88  MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERV 141
                  +KE E +    E + E + ND + +  +L  E   L++     + R 
Sbjct: 81  ----ANAQKEAELIIKEAEAKAERIVNDANEEAKKLATEYDELKRQSRLFRTRF 130


>gnl|CDD|191309 pfam05597, Phasin, Poly(hydroxyalcanoate) granule associated
           protein (phasin).  Polyhydroxyalkanoates (PHAs) are
           storage polyesters synthesised by various bacteria as
           intracellular carbon and energy reserve material. PHAs
           are accumulated as water-insoluble inclusions within the
           cells. This family consists of the phasins PhaF and PhaI
           which act as a transcriptional regulator of PHA
           biosynthesis genes. PhaF has been proposed to repress
           expression of the phaC1 gene and the phaIF operon.
          Length = 132

 Score = 31.5 bits (72), Expect = 0.27
 Identities = 24/102 (23%), Positives = 49/102 (48%), Gaps = 5/102 (4%)

Query: 74  AKAEQEEEFISNTLMKKIQALKKE-KETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQ 132
           AKA++E   +   L+K+ + L+K  ++      E   E + + +S   ++   +  +LEQ
Sbjct: 31  AKAQEEGGKVFEALVKEGEELEKRTRKLAEEQVEAVRESVKSRVSEVKDKAEGQWDKLEQ 90

Query: 133 TIEQEQERVVNKL----MRKIEKLEAETDAKQNNLEQLRREK 170
             ++   + +N+L     +++E L A  D     LE+L   K
Sbjct: 91  AFDERVAKALNRLGVPSRKEVEALSARIDELTAQLEELAEAK 132


>gnl|CDD|236798 PRK10929, PRK10929, putative mechanosensitive channel protein;
           Provisional.
          Length = 1109

 Score = 33.1 bits (76), Expect = 0.29
 Identities = 43/224 (19%), Positives = 91/224 (40%), Gaps = 35/224 (15%)

Query: 73  QAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLT-NDLSRKL----NQLTEEK 127
             +A+Q ++ I N   K    L+++              ++ + L +++    +QL E+ 
Sbjct: 64  LERAKQYQQVIDN-FPKLSAELRQQLNNERDEPRSVPPNMSTDALEQEILQVSSQLLEKS 122

Query: 128 YRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNK 187
            + +Q  EQ++ R ++  + ++ + + E   + N +E+  +        L Q Q      
Sbjct: 123 RQAQQ--EQDRAREISDSLSQLPQQQTEARRQLNEIERRLQTLGTPNTPLAQAQLTA--- 177

Query: 188 LWKRMDKLEAEKRILQIKLDQ-PVSDPSSPHDMNNGDTASNLSTHIHSLRSE-----VMR 241
                  L+AE   L+  +D+  ++  S+    NN    + L + +   RS+     +  
Sbjct: 178 -------LQAESAALKALVDELELAQLSA----NNRQELARLRSELAKKRSQQLDAYLQA 226

Query: 242 LRNQLANTQQ-------EHTEKMERYVKDEKNVREENLRLQRKL 278
           LRNQL + +Q       E TE +     D         ++ R+L
Sbjct: 227 LRNQLNSQRQREAERALESTELLAEQSGDLPKSIVAQFKINREL 270


>gnl|CDD|234172 TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family.
           Members of this protein family are homologs of ClpB, an
           ATPase associated with chaperone-related functions.
           These ClpB homologs, designated ClpV1, are a key
           component of the bacterial pathogenicity-associated type
           VI secretion system [Protein fate, Protein and peptide
           secretion and trafficking, Cellular processes,
           Pathogenesis].
          Length = 852

 Score = 33.0 bits (76), Expect = 0.30
 Identities = 34/105 (32%), Positives = 51/105 (48%), Gaps = 7/105 (6%)

Query: 87  LMKKIQALKKEKETL---AHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVN 143
           L ++I AL+ E + L   A      +E L  +L  +L  L  E   LE   +QE+E +V 
Sbjct: 424 LRRRIAALELELDALEREAALGADHDERLA-ELRAELAALEAELAALEARWQQEKE-LVE 481

Query: 144 KLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQ--EALVN 186
            ++    +LEA+ DA  ++   LR +  ELE  L   Q  E LV 
Sbjct: 482 AILALRAELEADADAPADDDAALRAQLAELEAALASAQGEEPLVF 526



 Score = 29.5 bits (67), Expect = 3.6
 Identities = 33/126 (26%), Positives = 55/126 (43%), Gaps = 19/126 (15%)

Query: 16  VALMMPPSPVTREQLQKRIESLQQQNRVLKVELET---YKLQLRALQEENKNLRQTSVNI 72
           VAL    +P   E L++RI +L+ +   L+ E      +  +L  L+ E   L      +
Sbjct: 410 VALSQNATPAALEDLRRRIAALELELDALEREAALGADHDERLAELRAELAALEAELAAL 469

Query: 73  QAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQ 132
           +A+ +QE+E     L++ I AL+ E E  A   +   +    D +    QL E    LE 
Sbjct: 470 EARWQQEKE-----LVEAILALRAELEADA---DAPAD----DDAALRAQLAE----LEA 513

Query: 133 TIEQEQ 138
            +   Q
Sbjct: 514 ALASAQ 519



 Score = 28.8 bits (65), Expect = 6.6
 Identities = 23/119 (19%), Positives = 40/119 (33%), Gaps = 27/119 (22%)

Query: 147 RKIEKLEAETDAKQNNLEQLRREK----------VELENTLEQEQEAL--VNKLWKRMDK 194
             +E L     A +  L+ L RE            EL   L   +  L  +   W++  +
Sbjct: 419 AALEDLRRRIAALELELDALEREAALGADHDERLAELRAELAALEAELAALEARWQQEKE 478

Query: 195 LEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEH 253
           L      L+ +L+     P+                   +LR+++  L   LA+ Q E 
Sbjct: 479 LVEAILALRAELEADADAPADD---------------DAALRAQLAELEAALASAQGEE 522


>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail.  The myosin molecule is a
           multi-subunit complex made up of two heavy chains and
           four light chains it is a fundamental contractile
           protein found in all eukaryote cell types. This family
           consists of the coiled-coil myosin heavy chain tail
           region. The coiled-coil is composed of the tail from two
           molecules of myosin. These can then assemble into the
           macromolecular thick filament. The coiled-coil region
           provides the structural backbone the thick filament.
          Length = 859

 Score = 33.1 bits (76), Expect = 0.30
 Identities = 49/166 (29%), Positives = 74/166 (44%), Gaps = 11/166 (6%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
            +  Q+   +L  +   LK ELE  K Q+ AL+ ENKNL+     I    +Q  E   N 
Sbjct: 400 LDTAQREARNLSTELFRLKNELEELKDQVEALRRENKNLQD---EIHDLTDQLGEGGRNV 456

Query: 87  --LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKL------NQLTEEKYRLEQTIEQEQ 138
             L K  + L+ EK+ L    E+ E  L  + S+ L      +Q+  E  R     E+E 
Sbjct: 457 HELEKARRRLEAEKDELQAALEEAEAALELEESKVLRAQVELSQIRSEIERRLAEKEEEF 516

Query: 139 ERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEAL 184
           E       R IE L+A  +A+     +  R K +LE  + + + AL
Sbjct: 517 ENTRKNHQRAIESLQATLEAEAKGKAEASRLKKKLEGDINELEIAL 562



 Score = 28.9 bits (65), Expect = 6.3
 Identities = 44/202 (21%), Positives = 85/202 (42%), Gaps = 29/202 (14%)

Query: 26  TREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISN 85
              + QK ++  QQQ + L+ ++E  +      +E+     +       +AE EE  + +
Sbjct: 568 ANAEAQKNVKKYQQQVKELQTQVEEEQRAREDAREQLAVAERR--ATALEAELEE--LRS 623

Query: 86  TLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERV---- 141
            L +  +A K+ +  LA   E+      N+L+ + + L  +K +LE  +   Q  +    
Sbjct: 624 ALEQAERARKQAETELAEASER-----VNELTAQNSSLIAQKRKLEGELAALQSDLDEAV 678

Query: 142 ---------VNKLMRKIEKLEAETDAKQNNLEQLRREKVELENT-------LEQEQEALV 185
                      K      +L  E   +Q + + L R + +LE+        L++ + A +
Sbjct: 679 NELKAAEERAKKAQADAARLAEELRQEQEHSQHLERLRKQLESQVKELQVRLDEAEAAAL 738

Query: 186 NKLWKRMDKLEAEKRILQIKLD 207
               K + KLEA  R L+ +LD
Sbjct: 739 KGGKKMIQKLEARVRELEAELD 760


>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein.  The sequences
           featured in this family are similar to a region of human
           TPR protein and to yeast myosin-like proteins 1 (MLP1)
           and 2 (MLP2). These proteins share a number of features;
           for example, they all have coiled-coil regions and all
           three are associated with nuclear pores. TPR is thought
           to be a component of nuclear pore complex- attached
           intra-nuclear filaments, and is implicated in nuclear
           protein import. Moreover, its N-terminal region is
           involved in the activation of oncogenic kinases,
           possibly by mediating the dimerisation of kinase domains
           or by targeting these kinases to the nuclear pore
           complex. MLP1 and MLP2 are involved in the process of
           telomere length regulation, where they are thought to
           interact with proteins such as Tel1p and modulate their
           activity.
          Length = 132

 Score = 31.5 bits (72), Expect = 0.31
 Identities = 26/115 (22%), Positives = 56/115 (48%), Gaps = 4/115 (3%)

Query: 30  LQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMK 89
           L++   SLQ + + L  + E  + +++ LQE+ +   + +   Q K E+E    +   ++
Sbjct: 1   LEEEKSSLQAELKRLLEKKEDAEEKIQKLQEDLEEQAEIANEAQQKYERELVKHA-EDIE 59

Query: 90  KIQALKKE---KETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERV 141
           ++QAL+K+    +      + E E    +LS       E+K  LE  + + ++R+
Sbjct: 60  ELQALRKQLNELKKEIAQLKAEAESAQAELSEAEESWEEQKKMLEDELSELEKRI 114


>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
           protein, PEP-CTERM locus subfamily.  Members of this
           protein family belong to the family of polysaccharide
           chain length determinant proteins (pfam02706). All are
           found in species that encode the PEP-CTERM/exosortase
           system predicted to act in protein sorting in a number
           of Gram-negative bacteria, and are found near the epsH
           homolog that is the putative exosortase gene [Cell
           envelope, Biosynthesis and degradation of surface
           polysaccharides and lipopolysaccharides].
          Length = 498

 Score = 32.7 bits (75), Expect = 0.34
 Identities = 62/362 (17%), Positives = 118/362 (32%), Gaps = 83/362 (22%)

Query: 36  SLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALK 95
           ++ Q+ R++   L +     + +       R   +++ AK+  + E +   L K I    
Sbjct: 65  NVDQKIRIMSRTLLSRPNLEKVI-------RMLDLDLGAKSPAQLEALITKLRKNISISL 117

Query: 96  KEKETL------------AHHYEQE--EEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERV 141
             ++ L            A    Q     F+   L  K       +  +++ I+  ++++
Sbjct: 118 AGRDNLFTISYEDKDPELAKDVVQTLLTIFVEETLGSKRQDSDSAQRFIDEQIKTYEKKL 177

Query: 142 V---NKLMR-KIEK----------LEAETDAKQNNLEQLRREKVELENTLEQEQEAL--- 184
               N+L   K E             +E    Q  LE  R E  E     +  +  L   
Sbjct: 178 EAAENRLKAFKQENGGILPDQEGDYYSEISEAQEELEAARLELNEAIAQRDALKRQLGGE 237

Query: 185 ------------------VNKLWKRMDKL-----------EAEKRILQIKLDQPVSDPSS 215
                             +  L K++D L            A KR +    +Q   + S+
Sbjct: 238 EPVLLAGSSVANSELDGRIEALEKQLDALRLRYTDKHPDVIATKREIAQLEEQKEEEGSA 297

Query: 216 PHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVRE------ 269
            +     +     +     L+ E+     ++A+ +    E   R  + E  +R       
Sbjct: 298 KN--GGPERGEIANPVYQQLQIELAEAEAEIASLEARVAELTARIERLESLLRTIPEVEA 355

Query: 270 ENLRLQRKLQREVERR--EALCRHLSESESSLEMEEERHYNEIAHSMRLVGSSPSSSRPL 327
           E  +L R    EV +   E L      +E S +ME +      A S R++      S+P 
Sbjct: 356 ELTQLNR--DYEVNKSNYEQLLTRRESAEVSKQMEVQDK----AVSFRIIDPPIVPSKPS 409

Query: 328 SP 329
            P
Sbjct: 410 GP 411


>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
          Length = 865

 Score = 32.8 bits (76), Expect = 0.34
 Identities = 11/38 (28%), Positives = 21/38 (55%)

Query: 25  VTREQLQKRIESLQQQNRVLKVELETYKLQLRALQEEN 62
               +L +R+E+L ++ + L+ ELE  K +L A    +
Sbjct: 712 AKPSELPERVEALLEELKELEKELEQLKAKLAAAAAGD 749


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 32.7 bits (75), Expect = 0.36
 Identities = 38/175 (21%), Positives = 72/175 (41%), Gaps = 21/175 (12%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
           +QL K++  L+ +      E E  +  L+  +EE      + +  +  A  E +  +   
Sbjct: 173 DQLSKKLAELKAE------EEEELERALKEKREELL----SKLEEELLARLESK-EAALE 221

Query: 88  MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMR 147
            +     ++EKE L   YE++   L  +L  +  +  E+K +  +   Q  E        
Sbjct: 222 KQLRLEFEREKEELRKKYEEK---LRQEL-ERQAEAHEQKLK-NELALQAIELQREFNKE 276

Query: 148 KIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEAL-----VNKLWKRMDKLEA 197
             EK+E E + +   L +L      LE  L+   EA      V +LW  ++ L++
Sbjct: 277 IKEKVEEERNGRLAKLAELNSRLKGLEKALDSRSEAEDENHKVQQLWLAVEALKS 331



 Score = 28.5 bits (64), Expect = 7.0
 Identities = 45/239 (18%), Positives = 90/239 (37%), Gaps = 52/239 (21%)

Query: 57  ALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKE----TLAHHYEQEEEF- 111
              +  +    +    +A+A+        T  K IQ L +  E     L      +    
Sbjct: 80  KQVKIPRQSGLSEEVAEAEAKATSVAAEATTPKSIQELVEALEELLEELLKETASDPVVQ 139

Query: 112 ----LTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAK-QNNLEQL 166
               + NDL   + +    K  LE  I   +E + ++L +K+ +L+AE + + +  L++ 
Sbjct: 140 ELVSIFNDLIDSIKE-DNLKDDLESLIASAKEEL-DQLSKKLAELKAEEEEELERALKEK 197

Query: 167 RREKVE-LENTLEQEQEALVNKLWKRMD-KLEAEKRILQIKLDQPVSDPSSPHDMNNGDT 224
           R E +  LE  L    E+    L K++  + E EK  L+ K ++                
Sbjct: 198 REELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEE---------------- 241

Query: 225 ASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKLQREVE 283
                           +LR +L    + H +K+      +  +  + + LQR+  +E++
Sbjct: 242 ----------------KLRQELERQAEAHEQKL------KNELALQAIELQREFNKEIK 278


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 265

 Score = 32.0 bits (73), Expect = 0.39
 Identities = 19/66 (28%), Positives = 40/66 (60%), Gaps = 1/66 (1%)

Query: 119 KLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLE 178
           KL++L +EK  ++  IE    ++  ++  KI++L+ E D  +  +++L++E  EL+  + 
Sbjct: 39  KLSELQKEKKNIQNEIESLDNQI-EEIQSKIDELQKEIDQSKAEIKKLQKEIAELKENIV 97

Query: 179 QEQEAL 184
           + QE L
Sbjct: 98  ERQELL 103


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 32.7 bits (75), Expect = 0.42
 Identities = 66/299 (22%), Positives = 118/299 (39%), Gaps = 34/299 (11%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
            E+L+ R+E  +   +    E E+ +     L+E  + LR+ +  ++++ E+  E +   
Sbjct: 323 DEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVE-D 381

Query: 87  LMKKIQALKKEKETLAHHYEQEEEFLTN------DLSRKLNQLTEEKYRLEQTIEQEQER 140
             ++I+ L++E E L   +      L N      +L  + ++L E +  LE T+   +ER
Sbjct: 382 RREEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARER 441

Query: 141 VVNKLMRKIEKLEAETDAKQ-----------NNLEQLRREKVELENTLEQ---EQEALVN 186
           V      + E L       +             +E+ R    ELE  LE    E E +  
Sbjct: 442 V-----EEAEALLEAGKCPECGQPVEGSPHVETIEEDRERVEELEAELEDLEEEVEEVEE 496

Query: 187 KLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQL 246
           +L +  D +EAE RI   +L++   D          +T          LR     L  + 
Sbjct: 497 RLERAEDLVEAEDRIE--RLEERREDLEELIA-ERRETIEEKRERAEELRERAAELEAEA 553

Query: 247 ANTQQEHTEKMERYVKDEKNVREENLRLQRKLQREVERREALCRHLSESESSLEMEEER 305
              ++   E  E      +  REE   L  KL    ER E+L R +    +++   E+ 
Sbjct: 554 EEKREAAAEAEEE----AEEAREEVAELNSKLAELKERIESLER-IRTLLAAIADAEDE 607



 Score = 30.4 bits (69), Expect = 2.3
 Identities = 43/185 (23%), Positives = 77/185 (41%), Gaps = 37/185 (20%)

Query: 26  TREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISN 85
             E+L++ IE L+++     V+L   +  L  L+EE   LR+    ++A      E +  
Sbjct: 385 EIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARERV-- 442

Query: 86  TLMKKIQALKKEK---------------ETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRL 130
              ++ +AL +                 ET+    E+ EE     L  +L  L EE   +
Sbjct: 443 ---EEAEALLEAGKCPECGQPVEGSPHVETIEEDRERVEE-----LEAELEDLEEEVEEV 494

Query: 131 EQTIEQ-----EQERVVNKLMRK-------IEKLEAETDAKQNNLEQLRREKVELENTLE 178
           E+ +E+     E E  + +L  +       I +     + K+   E+LR    ELE   E
Sbjct: 495 EERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAE 554

Query: 179 QEQEA 183
           +++EA
Sbjct: 555 EKREA 559



 Score = 30.0 bits (68), Expect = 2.3
 Identities = 60/279 (21%), Positives = 117/279 (41%), Gaps = 49/279 (17%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
            E ++ R E L+ ++  L+  LE  ++  +A  EE ++LR+ + +++ +AE+  E     
Sbjct: 309 AEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELRE----- 363

Query: 87  LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
                     E   L    E+  E     +  +  ++ E    LE+ IE+ +ER  +  +
Sbjct: 364 ----------EAAELESELEEARE----AVEDRREEIEE----LEEEIEELRERFGDAPV 405

Query: 147 RKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKL 206
             +   E   +  +   ++LR  + ELE TL   +E +     +    LEA K     + 
Sbjct: 406 D-LGNAEDFLEELREERDELREREAELEATLRTARERVE----EAEALLEAGK---CPEC 457

Query: 207 DQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKN 266
            QPV    SPH     +T       +  L +E+  L  ++   ++               
Sbjct: 458 GQPVEG--SPH----VETIEEDRERVEELEAELEDLEEEVEEVEERLERA---------- 501

Query: 267 VREENLRLQRKLQREVERREALCRHLSESESSLEMEEER 305
             E+ +  + +++R  ERRE L   ++E   ++E + ER
Sbjct: 502 --EDLVEAEDRIERLEERREDLEELIAERRETIEEKRER 538


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 32.3 bits (74), Expect = 0.45
 Identities = 16/79 (20%), Positives = 35/79 (44%), Gaps = 1/79 (1%)

Query: 94  LKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYR-LEQTIEQEQERVVNKLMRKIEKL 152
           L   K +     ++  E   N+  R+L +   EK R  E+ +E+ ++ ++ +      KL
Sbjct: 292 LSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKL 351

Query: 153 EAETDAKQNNLEQLRREKV 171
           E      ++  ++L + K 
Sbjct: 352 EEIQKKLEDLEKRLEKLKS 370



 Score = 29.6 bits (67), Expect = 3.2
 Identities = 20/94 (21%), Positives = 39/94 (41%), Gaps = 9/94 (9%)

Query: 115 DLSRKLNQLTEEKYRLEQ-TIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKV-E 172
           +L      L  E YR E+ +  +       K +   E    E    +  L++   EK+ E
Sbjct: 274 ELKETTENLLYENYRTEKLSGLKNSGEPSLKEIH--EARLNEE---ERELKKKFTEKIRE 328

Query: 173 LENTLEQEQEALVNKLWKRMDKLEAEKRILQIKL 206
            E  LE+ ++ L+ +  +   KL  E+   +++ 
Sbjct: 329 KEKRLEELEQNLIEERKELNSKL--EEIQKKLED 360


>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
           HlyD family.  Type I secretion is an ABC transport
           process that exports proteins, without cleavage of any
           signal sequence, from the cytosol to extracellular
           medium across both inner and outer membranes. The
           secretion signal is found in the C-terminus of the
           transported protein. This model represents the adaptor
           protein between the ATP-binding cassette (ABC) protein
           of the inner membrane and the outer membrane protein,
           and is called the membrane fusion protein. This model
           selects a subfamily closely related to HlyD; it is
           defined narrowly and excludes, for example, colicin V
           secretion protein CvaA and multidrug efflux proteins
           [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 423

 Score = 32.3 bits (74), Expect = 0.49
 Identities = 31/155 (20%), Positives = 60/155 (38%), Gaps = 20/155 (12%)

Query: 26  TREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISN 85
            +  L+ ++E +  Q + L+ EL   + QL+AL++            Q +   EE     
Sbjct: 138 RKSTLRAQLELILAQIKQLEAELAGLQAQLQALRQ------------QLEVISEELEARR 185

Query: 86  TLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKL 145
            L +K   + +         E E E    +   +L +L  E   L++ I++ Q       
Sbjct: 186 KLKEK-GLVSR-----LELLELERER--AEAQGELGRLEAELEVLKRQIDELQLERQQIE 237

Query: 146 MRKIEKLEAETDAKQNNLEQLRREKVELENTLEQE 180
               E++  E    Q  L +LR    +  + L++ 
Sbjct: 238 QTFREEVLEELTEAQARLAELRERLNKARDRLQRL 272


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 140

 Score = 31.0 bits (71), Expect = 0.51
 Identities = 24/99 (24%), Positives = 49/99 (49%), Gaps = 7/99 (7%)

Query: 53  LQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFL 112
            +  A +   K L +     QA+ E+ E+     L K  + L+K+  TL+   E   E  
Sbjct: 11  QESPAGKAAQKQLEKEFKKRQAELEKLEK----ELQKLKEKLQKDAATLS---EAAREKK 63

Query: 113 TNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEK 151
             +L +K+ +   ++ +L+Q +++ Q+  + K++ KI K
Sbjct: 64  EKELQKKVQEFQRKQQKLQQDLQKRQQEELQKILDKINK 102



 Score = 29.1 bits (66), Expect = 2.0
 Identities = 21/110 (19%), Positives = 49/110 (44%), Gaps = 18/110 (16%)

Query: 87  LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLT--EEKYRLEQTIEQEQERVVNK 144
           ++++  A K  ++ L    E+E +    +L +   +L   +EK + +     E  R    
Sbjct: 9   ILQESPAGKAAQKQL----EKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAR---- 60

Query: 145 LMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDK 194
                EK E E   K    ++ ++   +L+  L++ Q+  + K+  +++K
Sbjct: 61  -----EKKEKELQKKVQEFQRKQQ---KLQQDLQKRQQEELQKILDKINK 102


>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction
           endonuclease-like fold, RmuC family [General function
           prediction only].
          Length = 448

 Score = 32.0 bits (73), Expect = 0.55
 Identities = 29/150 (19%), Positives = 62/150 (41%), Gaps = 6/150 (4%)

Query: 37  LQQQNRVLKVELETYKLQLRALQEEN----KNLRQTSVNIQAKAEQEEEFISNTLMKKIQ 92
                   + E E  +   R+LQ  N    + L +    +Q +  Q  E +   L++ + 
Sbjct: 43  QLLLLLAFRAEAEQLRTFARSLQALNLELIQELNELKARLQQQLLQSREQLQ-LLIESLA 101

Query: 93  ALKKEKETLAHHYEQEEEFLTNDLSRK-LNQLTEEKYRLEQTIEQEQERVVNKLMRKIEK 151
            L  E + LA+   +E      +L+++ L QL +    + +   ++ E+ +++   +   
Sbjct: 102 QLSSEFQELANEIFEELNRRLAELNQQNLKQLLKPLREVLEKFREQLEQRIHESAEERST 161

Query: 152 LEAETDAKQNNLEQLRREKVELENTLEQEQ 181
           L  E D     ++QL +E   L   L+  +
Sbjct: 162 LLEEIDRLLGEIQQLAQEAGNLTAALKGNK 191



 Score = 31.2 bits (71), Expect = 1.1
 Identities = 25/163 (15%), Positives = 54/163 (33%), Gaps = 11/163 (6%)

Query: 44  LKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEE-FISNTLMKKIQALKKEKETLA 102
           L V L   +  L  L    + L + +  ++           +  L    ++L+     L 
Sbjct: 13  LLVNLAFIRQLLLRLGRLEQMLGELAAVLEQLLLLLAFRAEAEQLRTFARSLQALNLELI 72

Query: 103 HHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNN 162
               + +  L   L +   QL      L Q   + QE           ++  E + +   
Sbjct: 73  QELNELKARLQQQLLQSREQLQLLIESLAQLSSEFQEL--------ANEIFEELNRRLAE 124

Query: 163 LEQLRREKVE--LENTLEQEQEALVNKLWKRMDKLEAEKRILQ 203
           L Q   +++   L   LE+ +E L  ++ +  ++       + 
Sbjct: 125 LNQQNLKQLLKPLREVLEKFREQLEQRIHESAEERSTLLEEID 167


>gnl|CDD|147082 pfam04740, LXG, LXG domain of WXG superfamily.  This domain is
           present is the N-terminal region of a group of
           polymorphic toxin proteins in bacteria. It is predicted
           to use Type VII secretion pathway to mediate export of
           bacterial toxins.
          Length = 204

 Score = 31.5 bits (72), Expect = 0.55
 Identities = 21/103 (20%), Positives = 48/103 (46%), Gaps = 15/103 (14%)

Query: 108 EEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKL----EAETDAKQNNL 163
           +EEFL ++L   L        + E+  E+ ++  +N ++  I  +    +  T A Q  L
Sbjct: 96  DEEFLEHELKNGLK-------KAEEKTEELKDA-INSILASISDIVHLPKFSTSAFQEQL 147

Query: 164 EQLRREK---VELENTLEQEQEALVNKLWKRMDKLEAEKRILQ 203
           ++ ++++   +E     +QE    + +L   + +++   R LQ
Sbjct: 148 DKAKKKRNDTLEKLYEFDQELTNELAELEALIQEIQQYVRQLQ 190


>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein.  Members of this
           protein family belong to a conserved gene four-gene
           neighborhood found sporadically in a phylogenetically
           broad range of bacteria: Nocardia farcinica,
           Symbiobacterium thermophilum, and Streptomyces
           avermitilis (Actinobacteria), Geobacillus kaustophilus
           (Firmicutes), Azoarcus sp. EbN1 and Ralstonia
           solanacearum (Betaproteobacteria). Proteins in this
           family average over 1400 amino acids in length
           [Hypothetical proteins, Conserved].
          Length = 1353

 Score = 32.1 bits (73), Expect = 0.56
 Identities = 31/157 (19%), Positives = 51/157 (32%), Gaps = 9/157 (5%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
             +   +E L +       ELE  +   RAL   N   R           +     S   
Sbjct: 223 TDVADALEQLDEYRD----ELERLEALERAL--RNFLQRYRRYARTMLRRRATRLRSAQ- 275

Query: 88  MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMR 147
             +   L ++        E   E      +R    L  E   L   +E  Q     +   
Sbjct: 276 -TQYDQLSRDLGRARDELETAREEERELDAR-TEALEREADALRTRLEALQGSPAYQDAE 333

Query: 148 KIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEAL 184
           ++E+  A+ +A Q      R+   E E+ LE+E+  L
Sbjct: 334 ELERARADAEALQAAAADARQAIREAESRLEEERRRL 370



 Score = 30.2 bits (68), Expect = 2.7
 Identities = 43/206 (20%), Positives = 76/206 (36%), Gaps = 31/206 (15%)

Query: 115 DLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLE-------AETDAKQNNLEQLR 167
           D++  L QL E +  LE+ +E  +  + N L R                 + Q   +QL 
Sbjct: 224 DVADALEQLDEYRDELER-LEALERALRNFLQRYRRYARTMLRRRATRLRSAQTQYDQLS 282

Query: 168 REKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASN 227
           R+     + LE  +E    +L  R + LE E   L+ +L+     P+      + +    
Sbjct: 283 RDLGRARDELETAREEE-RELDARTEALEREADALRTRLEALQGSPA----YQDAEE--- 334

Query: 228 LSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKLQREVERREA 287
               +   R++   L+   A+ +Q   E   R  ++           +R+L  E  R + 
Sbjct: 335 ----LERARADAEALQAAAADARQAIREAESRLEEE-----------RRRLDEEAGRLDD 379

Query: 288 LCRHLSESESSLEMEEERHYNEIAHS 313
             R L  +   L    ER     AH+
Sbjct: 380 AERELRAAREQLARAAERAGLSPAHT 405


>gnl|CDD|238103 cd00176, SPEC, Spectrin repeats, found in several proteins involved
           in cytoskeletal structure; family members include
           spectrin, alpha-actinin and dystrophin; the spectrin
           repeat forms a three helix bundle with the second helix
           interrupted by proline in some sequences; the repeats
           are independent folding units; tandem repeats are found
           in differing numbers and arrange in an antiparallel
           manner to form dimers; the repeats are defined by a
           characteristic tryptophan (W) residue in helix A and a
           leucine (L) at the carboxyl end of helix C and separated
           by a linker of 5 residues; two copies of the repeat are
           present here.
          Length = 213

 Score = 31.3 bits (71), Expect = 0.57
 Identities = 36/180 (20%), Positives = 81/180 (45%), Gaps = 15/180 (8%)

Query: 34  IESLQQQNRVLKVELETYKLQLRALQEENKNLRQT----SVNIQAKAEQEEEFISNTLMK 89
           +E+L +++  L+ EL  ++ ++ AL E  + L +     +  IQ + E+      N   +
Sbjct: 35  VEALLKKHEALEAELAAHEERVEALNELGEQLIEEGHPDAEEIQERLEE-----LNQRWE 89

Query: 90  KIQALKKEKET-LAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRK 148
           +++ L +E+   L    + ++ F   D       L E++  L      +    V +L++K
Sbjct: 90  ELRELAEERRQRLEEALDLQQFF--RDADDLEQWLEEKEAALASEDLGKDLESVEELLKK 147

Query: 149 IEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQ 208
            ++LE E +A +  L+ L     EL   LE+       ++ +++++L      L    ++
Sbjct: 148 HKELEEELEAHEPRLKSLNELAEEL---LEEGHPDADEEIEEKLEELNERWEELLELAEE 204


>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
          Provisional.
          Length = 389

 Score = 31.7 bits (73), Expect = 0.63
 Identities = 12/36 (33%), Positives = 26/36 (72%)

Query: 32 KRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQ 67
          +R+E+L+++N  L+ ++   +L+LR L+ EN+ L +
Sbjct: 1  ERLEALEERNSELEEQIRQLELKLRDLEAENEKLER 36



 Score = 30.2 bits (69), Expect = 1.9
 Identities = 13/49 (26%), Positives = 25/49 (51%), Gaps = 7/49 (14%)

Query: 28 EQLQKRIESLQQQNRVLKVEL-------ETYKLQLRALQEENKNLRQTS 69
          E L++R   L++Q R L+++L       E  + +L  L+ E + L+   
Sbjct: 4  EALEERNSELEEQIRQLELKLRDLEAENEKLERELERLKSELEKLKSPP 52



 Score = 28.6 bits (65), Expect = 6.1
 Identities = 17/56 (30%), Positives = 27/56 (48%), Gaps = 13/56 (23%)

Query: 119 KLNQLTEEKYRLEQTIEQ------EQERVVNKLMRKIEKLEAETDAKQNNLEQLRR 168
           +L  L E    LE+ I Q      + E    KL R++E+L++E       LE+L+ 
Sbjct: 2   RLEALEERNSELEEQIRQLELKLRDLEAENEKLERELERLKSE-------LEKLKS 50


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 31.1 bits (71), Expect = 0.66
 Identities = 18/71 (25%), Positives = 33/71 (46%), Gaps = 2/71 (2%)

Query: 113 TNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVE 172
            N L  +L +L +E   L+Q I + Q ++      + E    E       L+QL +E  +
Sbjct: 64  LNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREE--TEERTELLEELKQLEKELKK 121

Query: 173 LENTLEQEQEA 183
           L+  LE+ ++ 
Sbjct: 122 LKAELEKYEKN 132



 Score = 29.1 bits (66), Expect = 3.0
 Identities = 17/63 (26%), Positives = 35/63 (55%), Gaps = 2/63 (3%)

Query: 136 QEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKL 195
            + +  + KL +++E+L+      Q  +E+L++ + E E   E  +E  + +L K + KL
Sbjct: 65  NKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEE--LKQLEKELKKL 122

Query: 196 EAE 198
           +AE
Sbjct: 123 KAE 125



 Score = 28.7 bits (65), Expect = 3.9
 Identities = 17/76 (22%), Positives = 36/76 (47%), Gaps = 4/76 (5%)

Query: 26  TREQLQKRIESLQQQNRVLKVELETYKL----QLRALQEENKNLRQTSVNIQAKAEQEEE 81
            R +L + ++ L+++ + LK ELE Y+     ++  L+EE K  ++ +          + 
Sbjct: 104 ERTELLEELKQLEKELKKLKAELEKYEKNDPERIEKLKEETKVAKEAANRWTDNIFSLKS 163

Query: 82  FISNTLMKKIQALKKE 97
           ++S         L+KE
Sbjct: 164 YLSKKFGLDEAQLRKE 179



 Score = 27.6 bits (62), Expect = 8.6
 Identities = 23/83 (27%), Positives = 42/83 (50%), Gaps = 10/83 (12%)

Query: 52  KLQLRALQEENKNLRQTSVNIQA-----KAEQEEEFISNTLMKKIQALKKEKETLAHHYE 106
           K +L  L++E + L+Q    +QA     K  +EE      L+++++ L+KE + L     
Sbjct: 68  KTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKKLK---A 124

Query: 107 QEEEFLTNDLSR--KLNQLTEEK 127
           + E++  ND  R  KL + T+  
Sbjct: 125 ELEKYEKNDPERIEKLKEETKVA 147


>gnl|CDD|200948 pfam00038, Filament, Intermediate filament protein. 
          Length = 312

 Score = 31.4 bits (72), Expect = 0.72
 Identities = 40/154 (25%), Positives = 66/154 (42%), Gaps = 16/154 (10%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
           R  L+ +IESLQ++   LK   E    ++R LQ + +      ++   K +         
Sbjct: 119 RVDLEMKIESLQEELAFLKKNHEE---EVRELQSQIQGQVNVEMDAARKLD--------- 166

Query: 87  LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
           L K +  ++ + E LA    QE E        KL +L +   R    +   +E +  +L 
Sbjct: 167 LTKALAEIRAQYEELAEKNRQEAE---EWYKSKLEELQQAAARNGDALRSAKEEI-TELR 222

Query: 147 RKIEKLEAETDAKQNNLEQLRREKVELENTLEQE 180
           R+I+ LE E  + +     L R+  ELE   E E
Sbjct: 223 RQIQSLEIELQSLKKQKASLERQLAELEERYELE 256



 Score = 30.2 bits (69), Expect = 1.9
 Identities = 23/101 (22%), Positives = 47/101 (46%), Gaps = 8/101 (7%)

Query: 114 NDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIE-------KLEAETDAKQNNLEQL 166
            +L  K+ +L ++K      +    E+ + +L ++++       +L+ E D  +   E  
Sbjct: 28  KELEAKIEELRQKKSAEPSRLYSLYEQEIRELRKQLDELTNERARLQLEIDNLRLAAEDF 87

Query: 167 RREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLD 207
            REK E E  L Q  E  +  L K +D+    +  L++K++
Sbjct: 88  -REKYEDELNLRQSAENDIVGLRKDLDEATLARVDLEMKIE 127


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 31.5 bits (72), Expect = 0.73
 Identities = 17/97 (17%), Positives = 34/97 (35%), Gaps = 8/97 (8%)

Query: 112 LTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKV 171
           L   L   L  +  ++  L      +   ++N +  K+ K       +  +L QL  E  
Sbjct: 148 LKEGLEENLEGMKRDEELLN-----KDLNLINSIKPKLRKKLQALKEEIASLRQLADE-- 200

Query: 172 ELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQ 208
            L      E E    +L     K+  +++ L+    +
Sbjct: 201 -LNLCDPLELEKARQELRSLSVKISEKRKQLEELQQE 236


>gnl|CDD|235570 PRK05703, flhF, flagellar biosynthesis regulator FlhF; Validated.
          Length = 424

 Score = 31.4 bits (72), Expect = 0.76
 Identities = 28/124 (22%), Positives = 50/124 (40%)

Query: 89  KKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRK 148
           KK   L++EK   A      +  L    SR  +Q    +        +++    ++   +
Sbjct: 58  KKNPVLREEKRKPAKSILSLQALLEKRPSRTNSQDALLQAENALPEWKKELEKPSEPKEE 117

Query: 149 IEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQ 208
             K  AE+   Q  L++LR E  EL+N LE +   L        +  E  KR+ +  L  
Sbjct: 118 EPKAAAESKVVQKELDELRDELKELKNLLEDQLSGLRQVERIPPEFAELYKRLKRSGLSP 177

Query: 209 PVSD 212
            +++
Sbjct: 178 EIAE 181


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 31.9 bits (73), Expect = 0.77
 Identities = 54/374 (14%), Positives = 113/374 (30%), Gaps = 53/374 (14%)

Query: 28   EQLQKRIESLQQQNRVLKV----ELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFI 83
             +L++ +  L  + R +K     EL     + + L +E K L           ++ E+  
Sbjct: 1002 GKLERELARLSNKVRFIKHVINGELVITNAKKKDLVKELKKLGYVRF-KDIIKKKSEKIT 1060

Query: 84   SNTLMKKIQALKKEKETLAHHYEQEEEFLTND--LSRKLNQLTEEKYRLEQTIEQEQERV 141
            +       +  + + E           +   D  LS  +  LT+EK              
Sbjct: 1061 AEEEEGAEEDDEADDEDDEEELGAAVSY---DYLLSMPIWSLTKEK-------------- 1103

Query: 142  VNKLMRKIEKLEAETDAKQNNLEQLRREKVE---------LENTLEQEQEALVNKLWKRM 192
                   +EKL AE + K+  LE+L+    +          E  LE+++E    ++ K  
Sbjct: 1104 -------VEKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQ 1156

Query: 193  DKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQE 252
                  K          +         ++ D +   S   +S R +    R        +
Sbjct: 1157 RLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNK 1216

Query: 253  HTEKMERYVKDEKNVREENLRLQRKLQREVERREALCRHLSESESSLEMEEERHYNEIAH 312
             +       +D++         Q+   ++   +    +  + S+SS + +E    +    
Sbjct: 1217 KSNSSGSDQEDDEE--------QKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKE 1268

Query: 313  SMRLVGSSPSSSRPLSPGASMLTTKEPRCFACGQTTVMPIPPPVHSRPQRIKTPLLSVEQ 372
                      S+   SP                     P  P      +R++  L ++++
Sbjct: 1269 GKPKNAPKRVSAVQYSPPPPSKRPDGES-----NGGSKPSSPTKKKVKKRLEGSLAALKK 1323

Query: 373  PTSPEGAPTPTLGS 386
                E        S
Sbjct: 1324 KKKSEKKTARKKKS 1337



 Score = 31.6 bits (72), Expect = 0.80
 Identities = 32/163 (19%), Positives = 61/163 (37%), Gaps = 42/163 (25%)

Query: 159  KQNNLEQLRREKVELENTLE------QEQ--------EALVNKLWKR-------MDKLEA 197
            K+  L +L RE   L N +         +        + LV +L K        + K ++
Sbjct: 997  KEYLLGKLERELARLSNKVRFIKHVINGELVITNAKKKDLVKELKKLGYVRFKDIIKKKS 1056

Query: 198  EKRILQIKLDQPVSDPSSPHDMNNGDTASN-----LSTHIHSLRSE-VMRLRNQLANTQQ 251
            EK   + +      D +   D      A+      LS  I SL  E V +L  +L   ++
Sbjct: 1057 EKITAEEEEGAEEDDEADDEDDEEELGAAVSYDYLLSMPIWSLTKEKVEKLNAELEKKEK 1116

Query: 252  EHTE---------------KMERYVKDEKNVREENLRLQRKLQ 279
            E  +               K E  +++++ V E+ +  +++L+
Sbjct: 1117 ELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLK 1159


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 31.8 bits (73), Expect = 0.85
 Identities = 43/254 (16%), Positives = 97/254 (38%), Gaps = 32/254 (12%)

Query: 24  PVTREQLQKRIESLQQQ------NRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAE 77
             T   +Q ++++L +Q      +++++ +LE     L  +  + +   Q    ++ +  
Sbjct: 35  LPTEADVQAQLDALNKQKLLEAEDKLVQQDLEQTLALLDKIDRQKEETEQ----LKQQLA 90

Query: 78  QEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTN-DLSRKLNQLTEEKYRLEQTIEQ 136
           Q    +      +++ALK + +       +    L+   L  +L Q  ++    +  + +
Sbjct: 91  QAPAKLRQA-QAELEALKDDNDEET---RETLSTLSLRQLESRLAQTLDQLQNAQNDLAE 146

Query: 137 EQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLE 196
              ++V+ L  + E+ +A   A    L+Q+R          +   +AL      +   L+
Sbjct: 147 YNSQLVS-LQTQPERAQAALYANSQRLQQIRNLL----KGGKVGGKALRP---SQRVLLQ 198

Query: 197 AEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSE-VMRLRNQLANTQQEHTE 255
           AE+ +L  + D           +       +L        +  + RL +QL   Q+    
Sbjct: 199 AEQALLNAQNDL------QRKSLEGNTQLQDLLQKQRDYLTARIQRLEHQLQLLQEAINS 252

Query: 256 KMERYVKDEKNVRE 269
           K  R    EK V+E
Sbjct: 253 K--RLTLSEKTVQE 264


>gnl|CDD|234714 PRK00286, xseA, exodeoxyribonuclease VII large subunit; Reviewed.
          Length = 438

 Score = 31.3 bits (72), Expect = 0.87
 Identities = 27/135 (20%), Positives = 59/135 (43%), Gaps = 6/135 (4%)

Query: 23  SPVTREQLQKRIESLQQQ-NRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEE 81
            P   E LQ R++ LQQ+  R ++  LE  + +L  L    +   Q+   + A+ +Q  +
Sbjct: 259 VPDRAELLQ-RLQQLQQRLARAMRRRLEQKRQRLDQLA--RRLKFQSPERLLAQQQQRLD 315

Query: 82  FISNTLMKKIQALKKEKETLAHHYEQEEEFLT--NDLSRKLNQLTEEKYRLEQTIEQEQE 139
            +   L + ++   +  +       Q  +       + R   +L + + RL + + ++ +
Sbjct: 316 RLQQRLQRALERRLRLAKQRLERLSQRLQQQNPQRRIERAQQRLEQLEQRLRRAMRRQLK 375

Query: 140 RVVNKLMRKIEKLEA 154
           R   +L    ++LEA
Sbjct: 376 RKRQRLEALAQQLEA 390



 Score = 31.3 bits (72), Expect = 0.91
 Identities = 34/187 (18%), Positives = 69/187 (36%), Gaps = 58/187 (31%)

Query: 110 EFLTNDLSRKLNQLTEEKYRLEQ----TIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQ 165
           E    D +  L +L + + RL +     +EQ+++R+ ++L R++          Q+    
Sbjct: 256 ELAVPDRAELLQRLQQLQQRLARAMRRRLEQKRQRL-DQLARRL--------KFQSPERL 306

Query: 166 LRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTA 225
           L +++  L+   ++ Q AL  +L     +LE   + LQ                      
Sbjct: 307 LAQQQQRLDRLQQRLQRALERRLRLAKQRLERLSQRLQ---------------------Q 345

Query: 226 SNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKLQREVERR 285
            N    I   +  + +L  +L                           ++R+L+R+ +R 
Sbjct: 346 QNPQRRIERAQQRLEQLEQRLRRA------------------------MRRQLKRKRQRL 381

Query: 286 EALCRHL 292
           EAL + L
Sbjct: 382 EALAQQL 388


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 31.3 bits (71), Expect = 0.88
 Identities = 28/110 (25%), Positives = 51/110 (46%), Gaps = 7/110 (6%)

Query: 92  QALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEK 151
           +A K ++E      ++ EE L  D  +KL   T +  R  + +E+ +E      +++++ 
Sbjct: 109 EADKLDEEQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEE------LKELQS 162

Query: 152 LEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRI 201
             A+ D     LE L R + + E   E+E EAL+  L    +  E  +R 
Sbjct: 163 RRADVDVNSM-LEALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRA 211


>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
            Provisional.
          Length = 2722

 Score = 31.7 bits (72), Expect = 0.90
 Identities = 29/160 (18%), Positives = 65/160 (40%), Gaps = 9/160 (5%)

Query: 28   EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
            E++   IE++   N  +K EL      LR ++E+   + +   ++            N +
Sbjct: 2471 EKIDDLIENVTSHNEKIKSELLIINDALRRVKEKKDEMNKLFNSLTENNNNNNNSAKNIV 2530

Query: 88   MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMR 147
                  + +    L  H  +  E L+  +  ++ +L  EK +L +  + E+ R   +   
Sbjct: 2531 DNSTYIINE----LESHVSKLNELLSY-IDNEIKELENEKLKLLEKAKIEESRKERE--- 2582

Query: 148  KIEKLEAETDAKQNNLEQLRREKV-ELENTLEQEQEALVN 186
            +IE    E +  +  + + ++E++ + E      QE L  
Sbjct: 2583 RIESETQEDNTDEEQINRQQQERLQKEEEQKAYSQERLNR 2622


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 31.0 bits (70), Expect = 1.0
 Identities = 32/120 (26%), Positives = 53/120 (44%), Gaps = 8/120 (6%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
            E+L++++E LQ++   L  ELE  + +   +QE  K L          +  EE      
Sbjct: 137 YEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLE------VENSRLEEML--KK 188

Query: 87  LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
           L  ++  LKK  + L    E  EE L +DL ++   L  +    +  I  E E+ V  L+
Sbjct: 189 LPGEVYDLKKRWDELEPGVELPEEELISDLVKETLNLAPKDIEGQGYIYAEDEKEVEILL 248



 Score = 29.5 bits (66), Expect = 3.2
 Identities = 21/77 (27%), Positives = 36/77 (46%), Gaps = 1/77 (1%)

Query: 122 QLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQ 181
           +L +E   LE   E+ QER+  +L  +  +LE         +  L++   ELE  +E  +
Sbjct: 153 ELLKELEELEAEYEEVQERL-KRLEVENSRLEEMLKKLPGEVYDLKKRWDELEPGVELPE 211

Query: 182 EALVNKLWKRMDKLEAE 198
           E L++ L K    L  +
Sbjct: 212 EELISDLVKETLNLAPK 228


>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding.  This
           family is the highly conserved central region of a
           number of metazoan proteins referred to as growth-arrest
           proteins. In mouse, Gas8 is predominantly a testicular
           protein, whose expression is developmentally regulated
           during puberty and spermatogenesis. In humans, it is
           absent in infertile males who lack the ability to
           generate gametes. The localisation of Gas8 in the
           motility apparatus of post-meiotic gametocytes and
           mature spermatozoa, together with the detection of Gas8
           also in cilia at the apical surfaces of epithelial cells
           lining the pulmonary bronchi and Fallopian tubes
           suggests that the Gas8 protein may have a role in the
           functioning of motile cellular appendages. Gas8 is a
           microtubule-binding protein localised to regions of
           dynein regulation in mammalian cells.
          Length = 201

 Score = 30.3 bits (69), Expect = 1.1
 Identities = 32/145 (22%), Positives = 72/145 (49%), Gaps = 25/145 (17%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKN---LRQTSVNIQAKAEQEEEFIS 84
           E  +K +  + Q+N+ L   L+  + ++  L+++ K+    +Q+  N++A+ ++ E    
Sbjct: 44  EHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSLKNLKARLKELE---- 99

Query: 85  NTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNK 144
               K+++ LK E E L   +E+        + R+ ++L +   + E  I+  Q++   K
Sbjct: 100 ----KELKNLKWESEVLEQRFEK--------VERERDELYD---KFEAAIQDVQQKTGLK 144

Query: 145 ---LMRKIEKLEAETDAKQNNLEQL 166
              L +K+E L  E + K+  L ++
Sbjct: 145 NLLLEQKLEALNEELEKKEAQLNEV 169


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 30.4 bits (69), Expect = 1.1
 Identities = 37/166 (22%), Positives = 72/166 (43%), Gaps = 11/166 (6%)

Query: 132 QTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKR 191
           Q ++ E++R    L  +I+++       +  LE L +    LE  LE + E  V++L   
Sbjct: 13  QKLDLEKDR----LEPRIKEIRKALKKAKAELEALNKALEALEIELE-DLENQVSQLESE 67

Query: 192 MDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQ 251
           + ++    +  + KL   V D      +N     +     I+SL  E+  L  ++   ++
Sbjct: 68  IQEIRERIKRAEEKLSA-VKDERELRALNIEIQIAK--ERINSLEDELAELMEEIEKLEK 124

Query: 252 EHTEKMERYVKDEKNVREENLRLQ---RKLQREVERREALCRHLSE 294
           E  +  ER  + EKN+ E   RL+    +++ E +   +    L E
Sbjct: 125 EIEDLKERLERLEKNLAEAEARLEEEVAEIREEGQELSSKREELKE 170


>gnl|CDD|226834 COG4399, COG4399, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 376

 Score = 30.8 bits (70), Expect = 1.1
 Identities = 36/209 (17%), Positives = 69/209 (33%), Gaps = 37/209 (17%)

Query: 15  GVALMMPPSPVTREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQ------- 67
            +  ++    +T E +QK++     Q RV +   +  +  L++   + + L Q       
Sbjct: 60  QIGKVVVNHLLTPEGIQKKLRRELFQERVTEAIDQLLQKLLKSEVTDKEQLHQQIFADIE 119

Query: 68  -------------------TSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQE 108
                              T        E     +  +L + + +     E+ A  Y   
Sbjct: 120 KDLIGNSERWLEKELAEKFTEAQSNTIFELIPLELVESLEQSLPSADWLLESRARPYVAS 179

Query: 109 EEFLTNDLSRKLNQLTEEKYRLEQ--TIEQEQERVVNKLMRKIEKLEAETDAKQ------ 160
                  L   L+Q  EE  +L     +    E + ++L +++ KL    + K+      
Sbjct: 180 LA-GKIQLGDMLDQFIEEHGKLGTMVQMFVVNESLADRLQQELLKLLNREETKKILQQLL 238

Query: 161 -NNLEQLRREKVELENTLEQEQEALVNKL 188
            N      +EK   E T E  + AL   L
Sbjct: 239 TNEWPTF-KEKPLNELTDEVVRNALAESL 266


>gnl|CDD|225638 COG3096, MukB, Uncharacterized protein involved in chromosome
           partitioning [Cell division and chromosome
           partitioning].
          Length = 1480

 Score = 31.0 bits (70), Expect = 1.3
 Identities = 46/174 (26%), Positives = 73/174 (41%), Gaps = 27/174 (15%)

Query: 54  QLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLT 113
           +  A Q +   +R + +  + + +Q  E +     K+ Q    + E L     QE E L 
Sbjct: 509 RHLAEQVQPLRMRLSELEQRLRQQQSAERLLADFCKR-QGKNLDAEEL-EALHQELEALI 566

Query: 114 NDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVE- 172
             LS  ++   E++  L Q  EQ Q R+ + + R    L A     QN LEQL  +  E 
Sbjct: 567 ESLSDSVSNAREQRMALRQEQEQLQSRIQSLMQRAPVWLAA-----QNALEQLSEQSGEE 621

Query: 173 ----------LENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSP 216
                     ++  LE+E+EA V +     D+L A K      LD+ +   S P
Sbjct: 622 FTDSQDVTEYMQQLLEREREATVER-----DELGARKN----ALDEEIERLSQP 666



 Score = 29.1 bits (65), Expect = 5.0
 Identities = 50/246 (20%), Positives = 96/246 (39%), Gaps = 26/246 (10%)

Query: 27   REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQ-----TSVNIQAKAEQEEE 81
            R   ++R+ESL  +  VL     T    ++  Q  ++   +      +V  +A  E E  
Sbjct: 780  RAAREQRLESLHAERDVLSERHATLSFDVQKTQRLHQAFSRFIGSHLAVAFEADPEAEIR 839

Query: 82   FIS---NTLMKKIQALKKEKETLAHHYEQEEEFLT--NDLSRKLNQLTEE-------KYR 129
             ++   N L + +   + + +     ++Q +E +T  N L  +LN L +E       + R
Sbjct: 840  QLNSRRNELERALSNHENDNQQQRIQFDQAKEGVTALNRLIPQLNLLADESLADRVEEIR 899

Query: 130  LEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLW 189
                  QE  R + +    + KLE      Q++ EQ  + K +     + +++A      
Sbjct: 900  ERLDEAQEAARFIQQHGNTLSKLEPIASVLQSDPEQFEQLKEDYAQAQQMQRQA------ 953

Query: 190  KRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANT 249
                +  A   ++Q +     SD S+     N D    L   +    +E  R R QL   
Sbjct: 954  --RQQAFALTEVVQRRAHFSYSD-SAEMLSENSDLNEKLRQRLEQAEAERTRAREQLRQH 1010

Query: 250  QQEHTE 255
            Q + ++
Sbjct: 1011 QAQLSQ 1016


>gnl|CDD|220555 pfam10079, DUF2317, Uncharacterized protein conserved in bacteria
           (DUF2317).  Members of this family of hypothetical
           bacterial proteins have no known function.
          Length = 541

 Score = 31.0 bits (71), Expect = 1.3
 Identities = 21/128 (16%), Positives = 50/128 (39%), Gaps = 18/128 (14%)

Query: 30  LQKRIESLQQQNRVLKVELETYKLQLRALQEENKNL--RQTSVNIQAKAEQEEEFISNTL 87
           ++ ++E         K+ L    + L  L+   + +   + +  I    E+ +  +    
Sbjct: 384 IEPKVEKKLD-----KLGLSIEDVFLHGLELLKEKVLEEKANSEIDIDFEEAKAELEAQF 438

Query: 88  MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMR 147
            +  +   K   +L       E  L  + ++ L QL      LE+ + + Q R   +L+R
Sbjct: 439 KRLREEAAKIDPSL-------EGALEANEAKILKQLE----FLEKRLLKAQRRKHEELLR 487

Query: 148 KIEKLEAE 155
           + ++L+  
Sbjct: 488 QFDRLQTA 495


>gnl|CDD|233547 TIGR01730, RND_mfp, RND family efflux transporter, MFP subunit.
           This model represents the MFP (membrane fusion protein)
           component of the RND family of transporters. RND refers
           to Resistance, Nodulation, and cell Division. It is, in
           part, a subfamily of pfam00529 (Pfam release 7.5) but
           hits substantial numbers of proteins missed by that
           model. The related HlyD secretion protein, for which
           pfam00529 is named, is outside the scope of this model.
           Attributed functions imply outward transport. These
           functions include nodulation, acriflavin resistance,
           heavy metal efflux, and multidrug resistance proteins.
           Most members of this family are found in Gram-negative
           bacteria. The proposed function of MFP proteins is to
           bring the inner and outer membranes together and enable
           transport to the outside of the outer membrane. Note,
           however, that a few members of this family are found in
           Gram-positive bacteria, where there is no outer membrane
           [Transport and binding proteins, Unknown substrate].
          Length = 322

 Score = 30.7 bits (70), Expect = 1.3
 Identities = 11/81 (13%), Positives = 24/81 (29%), Gaps = 1/81 (1%)

Query: 128 YRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNK 187
            RL+    Q   +     +   E             E+L +     +  L+  + A+   
Sbjct: 55  ARLDDDDYQLALQAALAQLAAAEAQLELAQRSFERAERLVKRNAVSQADLDDAKAAVEAA 114

Query: 188 LWKRMDKLEAEKRILQIKLDQ 208
               ++  +A     Q+ L  
Sbjct: 115 Q-ADLEAAKASLASAQLNLRY 134


>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 557

 Score = 30.6 bits (70), Expect = 1.3
 Identities = 43/215 (20%), Positives = 85/215 (39%), Gaps = 46/215 (21%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEEN-------------------KNLRQT 68
           +Q ++ +E LQ++ R      +  + QL  L+E N                   + L + 
Sbjct: 167 KQARRELEDLQEKERERAQRADLLQFQLEELEELNLQPGEDEELEEERKRLSNSEKLAEA 226

Query: 69  SVNIQAKAEQEEEFIS-----NTLMKKIQA-------LKKEKETLAHHYEQEEEFLTNDL 116
             N       E++ +S        ++ ++        L +  E L     + EE  + +L
Sbjct: 227 IQNALELLSGEDDTVSALSLLGRALEALEDLSEYDGKLSELAELLEEALYELEE-ASEEL 285

Query: 117 SRKLNQLTEEKYRLEQTIEQEQERV----------VNKLMRKIEKLEAET---DAKQNNL 163
              L++L  +  RLE+ +E+    +          +  L+  ++K++ E    D  + +L
Sbjct: 286 RAYLDELEFDPNRLEE-VEERLFALKSLARKYGVTIEDLLEYLDKIKEELAQLDNSEESL 344

Query: 164 EQLRREKVELENTLEQEQEALVNKLWKRMDKLEAE 198
           E L +E  +L+  L +  EAL     K   +LE E
Sbjct: 345 EALEKEVKKLKAELLEAAEALSAIRKKAAKELEKE 379


>gnl|CDD|236489 PRK09375, PRK09375, quinolinate synthetase; Provisional.
          Length = 319

 Score = 30.4 bits (70), Expect = 1.4
 Identities = 11/25 (44%), Positives = 18/25 (72%), Gaps = 2/25 (8%)

Query: 87  LMKKIQALKKEKE--TLAHHYEQEE 109
           L ++I+ LKKE+    LAH+Y++ E
Sbjct: 21  LKERIKRLKKERNAVILAHYYQRPE 45


>gnl|CDD|217038 pfam02445, NadA, Quinolinate synthetase A protein.  Quinolinate
           synthetase catalyzes the second step of the de novo
           biosynthetic pathway of pyridine nucleotide formation.
           In particular, quinolinate synthetase is involved in the
           condensation of dihydroxyacetone phosphate and
           iminoaspartate to form quinolinic acid. This synthesis
           requires two enzymes, a FAD-containing "B protein" and
           an "A protein".
          Length = 296

 Score = 30.5 bits (70), Expect = 1.5
 Identities = 11/24 (45%), Positives = 18/24 (75%), Gaps = 2/24 (8%)

Query: 88  MKKIQALKKEKET--LAHHYEQEE 109
           +++I+ LKKEK    LAH+Y++ E
Sbjct: 1   IEEIKRLKKEKNAVILAHNYQRPE 24


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 30.6 bits (69), Expect = 1.7
 Identities = 24/124 (19%), Positives = 50/124 (40%), Gaps = 7/124 (5%)

Query: 72  IQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLE 131
           IQ     +E+  ++T    + A++K            +    ++    + ++  E    E
Sbjct: 27  IQQNNGDKEDSSTSTSSLSVSAVEKTSNAKEE----IQVDFQHNSESAVEEVEAED---E 79

Query: 132 QTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKR 191
             +EQ Q  V+       E+  +      ++    R+ K+E EN  ++E E L  + + R
Sbjct: 80  IEVEQNQSDVLKSSSIVKEESISTDMDGIDDDSLDRKLKLERENLRKREIEELAEENFSR 139

Query: 192 MDKL 195
            +KL
Sbjct: 140 GNKL 143


>gnl|CDD|236993 PRK11820, PRK11820, hypothetical protein; Provisional.
          Length = 288

 Score = 30.1 bits (69), Expect = 1.8
 Identities = 31/144 (21%), Positives = 59/144 (40%), Gaps = 17/144 (11%)

Query: 33  RIESLQQQNRVLKVE---LETYKLQLRALQEENKNLRQTSVN-------IQAKAEQEEEF 82
           R+E  +     L +     + Y   L  L+ E     + S++       +    E++ E 
Sbjct: 67  RLERDESAQGELSLNEDLAKQYLEALEELKAELPEAGEISLDDLLRWPGVLEAEEEDLEA 126

Query: 83  ISNTLMKKI-QALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERV 141
           +   L+  + +AL      L    E+E   L  DL ++L+ +     ++E    +  E  
Sbjct: 127 LWAALLAALDEALDD----LIEMREREGAALKADLLQRLDAIEALVAKIEALAPEILEEY 182

Query: 142 VNKLMRKIEKLEAETDAKQNNLEQ 165
             +L  ++E+L  E D  +N LEQ
Sbjct: 183 RERLRERLEELLGELD--ENRLEQ 204


>gnl|CDD|224337 COG1419, FlhF, Flagellar GTP-binding protein [Cell motility and
           secretion].
          Length = 407

 Score = 30.0 bits (68), Expect = 2.1
 Identities = 20/114 (17%), Positives = 38/114 (33%), Gaps = 9/114 (7%)

Query: 103 HHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQ--------ERVVNKLMRKIEKL-E 153
             ++  E     D   +L +   EK R E+   ++         ER   ++    E++ E
Sbjct: 62  RKFQTREAAKVKDAEAQLLKDPAEKKREERKAAKKIERSTPSLIERKTQEVKDSGEEIAE 121

Query: 154 AETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLD 207
              D K    E        +    ++ Q      L   + +   E  IL +K +
Sbjct: 122 MMRDEKVPIRELEEIPPEFVALYKQEIQSPTRLNLINELLRAGLELEILDMKDE 175


>gnl|CDD|226614 COG4129, COG4129, Predicted membrane protein [Function unknown].
          Length = 332

 Score = 30.0 bits (68), Expect = 2.2
 Identities = 21/129 (16%), Positives = 44/129 (34%), Gaps = 19/129 (14%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRAL-QEENKNLRQTSVNIQAKAEQEEEFISN 85
               ++++  L++  RVL   L         L     + L Q   ++ A     +     
Sbjct: 222 FSMRRRQVRILRRMLRVLNS-LHRTSNLSNLLADLLQELLEQLLESLTAAHLLID----- 275

Query: 86  TLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKL 145
                ++  +K  E L    +  EEF           L++    LE+ +E + +     L
Sbjct: 276 -----LEETRKVFEALPLP-KTREEF------ESRAVLSQLLQELEEFLEVKTDFSRVIL 323

Query: 146 MRKIEKLEA 154
           ++    + A
Sbjct: 324 LKAKSSVGA 332


>gnl|CDD|217414 pfam03189, Otopetrin, Protein of unknown function, DUF270. 
          Length = 413

 Score = 30.0 bits (68), Expect = 2.2
 Identities = 11/48 (22%), Positives = 19/48 (39%), Gaps = 5/48 (10%)

Query: 189 WKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGD-----TASNLSTH 231
           W  +   E++  IL+      V D  S    ++ D     T++  ST 
Sbjct: 82  WFSLLLAESKHEILKKAQKHIVFDHHSALSHDSSDSSKEQTSTEESTE 129


>gnl|CDD|182730 PRK10787, PRK10787, DNA-binding ATP-dependent protease La;
           Provisional.
          Length = 784

 Score = 30.3 bits (68), Expect = 2.2
 Identities = 29/110 (26%), Positives = 57/110 (51%), Gaps = 7/110 (6%)

Query: 110 EFLTNDLSRKLNQLTEEKY---RLEQTIEQEQ-ERVVNKLMRKIEKLEAETDAKQNNLEQ 165
           E+L   +  +++ L  EK    R+++ +E+ Q E  +N+ M+ I+K   E D   +  E 
Sbjct: 194 EYLMAMMESEIDLLQVEKRIRNRVKKQMEKSQREYYLNEQMKAIQKELGEMDDAPDENEA 253

Query: 166 LRR--EKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDP 213
           L+R  +  ++    +++ EA + KL K M  + AE  +++  +D  V  P
Sbjct: 254 LKRKIDAAKMPKEAKEKAEAELQKL-KMMSPMSAEATVVRGYIDWMVQVP 302


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 30.2 bits (69), Expect = 2.3
 Identities = 25/135 (18%), Positives = 52/135 (38%), Gaps = 29/135 (21%)

Query: 34  IESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISN----TLMK 89
           IE  ++ + + K  L+  K+++    E NK      + I+   E E + +       L+K
Sbjct: 100 IEKAKKNSPLFKSLLKKQKVEV----EGNK------LIIKVNNEIERDHLKKKHLPKLIK 149

Query: 90  KIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKI 149
           + +        +    +  +E L                  E   E+E E++  + +  +
Sbjct: 150 QYEKFGFGILKIDFEIDDSKEELEK---------------FEAQKEEEDEKLAKEALEAM 194

Query: 150 EKLEAETDAKQNNLE 164
           +KLEAE   +  N +
Sbjct: 195 KKLEAEKKKQSKNFD 209


>gnl|CDD|217092 pfam02538, Hydantoinase_B, Hydantoinase B/oxoprolinase.  This
           family includes N-methylhydaintoinase B which converts
           hydantoin to N-carbamyl-amino acids, and 5-oxoprolinase
           EC:3.5.2.9 which catalyzes the formation of L-glutamate
           from 5-oxo-L-proline. These enzymes are part of the
           oxoprolinase family and are related to pfam01968.
          Length = 527

 Score = 30.1 bits (68), Expect = 2.3
 Identities = 15/98 (15%), Positives = 31/98 (31%), Gaps = 18/98 (18%)

Query: 129 RLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVE-LENTLEQEQEALVNK 187
           R+ + I++     V   M  I          Q+N E+  + K+  L+         + ++
Sbjct: 197 RIGKLIDEYGLETVLGYMAHI----------QDNAEEAIKRKIAKLDEPYGAYDYGVCDR 246

Query: 188 LWKRMD-----KLEAEKRILQIKLDQPVSDPSSPHDMN 220
           L           +  +K       D   +     H+ N
Sbjct: 247 LDTGKLIKVEITINIKKG--TAIFDFDGTSRQGDHNFN 282


>gnl|CDD|239428 cd03312, CIMS_N_terminal_like, CIMS - Cobalamine-independent
          methonine synthase, or MetE, N-terminal domain_like.
          Many members have been characterized as
          5-methyltetrahydropteroyltriglutamate-homocysteine
          methyltransferases, EC:2.1.1.14, mostly from bacteria
          and plants. This enzyme catalyses the last step in the
          production of methionine by transferring a methyl group
          from 5-methyltetrahydrofolate to L-homocysteine without
          using an intermediate methyl carrier. The active enzyme
          has a dual (beta-alpha)8-barrel structure, and this
          model covers the N-terminal barrel, and a few
          single-barrel sequences most similar to the N-terminal
          barrel. It is assumed that the homologous N-terminal
          barrel has evolved from the C-terminus via gene
          duplication and has subsequently lost binding sites,
          and it seems as if the two barrels forming the active
          enzyme may sometimes reside on different polypeptides.
          The C-terminal domain incorporates the Zinc ion, which
          binds and activates homocysteine. Side chains from both
          barrels contribute to the binding of the folate
          substrate.
          Length = 360

 Score = 29.8 bits (68), Expect = 2.3
 Identities = 14/39 (35%), Positives = 18/39 (46%), Gaps = 3/39 (7%)

Query: 41 NRVLKVELETY---KLQLRALQEENKNLRQTSVNIQAKA 76
          NR LK  LE+Y   K+    L    K LR     +Q +A
Sbjct: 13 NRELKKALESYWKGKISEEELLATAKELRLRHWKLQKEA 51


>gnl|CDD|163064 TIGR02894, DNA_bind_RsfA, transcription factor, RsfA family.  In a
           subset of endospore-forming members of the Firmcutes,
           members of this protein family are found, several to a
           genome. Two very strongly conserved sequences regions
           are separated by a highly variable linker region. Much
           of the linker region was excised from the seed alignment
           for this model. A characterized member is the
           prespore-specific transcription RsfA from Bacillus
           subtilis, previously called YwfN, which is controlled by
           sigma factor F and seems to fine-tune expression of some
           genes in the sigma-F regulon. A paralog in Bacillus
           subtilis is designated YlbO [Regulatory functions, DNA
           interactions, Cellular processes, Sporulation and
           germination].
          Length = 161

 Score = 29.0 bits (65), Expect = 2.4
 Identities = 16/52 (30%), Positives = 32/52 (61%), Gaps = 4/52 (7%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQ 78
            E+L+ + ESLQ++N  L+ ELE  + +L  ++E+ + L    ++I  +A +
Sbjct: 106 NERLKNQNESLQKRNEELEKELEKLRQRLSTIEEDYQTL----IDIMDRARK 153


>gnl|CDD|163506 TIGR03794, NHLM_micro_HlyD, NHLM bacteriocin system secretion
           protein.  Members of this protein family are homologs of
           the HlyD membrane fusion protein of type I secretion
           systems. Their occurrence in prokaryotic genomes is
           associated with the occurrence of a novel class of
           microcin (small bacteriocins) with a leader peptide
           region related to nitrile hydratase. We designate the
           class of bacteriocin as Nitrile Hydratase Leader
           Microcin, or NHLM. This family, therefore, is designated
           as NHLM bacteriocin system secretion protein. Some but
           not all NHLM-class putative microcins belong to the TOMM
           (thiazole/oxazole modified microcin) class as assessed
           by the presence of the scaffolding protein and/or
           cyclodehydratase in the same gene clusters [Transport
           and binding proteins, Amino acids, peptides and amines,
           Cellular processes, Biosynthesis of natural products].
          Length = 421

 Score = 29.8 bits (67), Expect = 2.4
 Identities = 26/167 (15%), Positives = 56/167 (33%), Gaps = 15/167 (8%)

Query: 87  LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQE------R 140
           L +++Q   ++   L    E+   +       +     + K  LE+TI + +E      R
Sbjct: 94  LRERLQESYQKLTQLQEQLEEVRNYTGRLKEGRERHFQKSKEALEETIGRLREELAALSR 153

Query: 141 VVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKR 200
            V K    + +  A     +   +Q R E+ + +   +         L  + D+   E  
Sbjct: 154 EVGKQRGLLSRGLATFKRDRILQQQWREEQEKYDAADKARAI---YALQTKADERNLETV 210

Query: 201 ILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLA 247
           +  +          +  ++        +   I   R E+  L N+L 
Sbjct: 211 LQSLSQADFQLAGVAEKEL------ETVEARIKEARYEIEELENKLN 251



 Score = 29.8 bits (67), Expect = 2.7
 Identities = 35/162 (21%), Positives = 70/162 (43%), Gaps = 11/162 (6%)

Query: 43  VLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLA 102
            L+  L+    +L  LQE+ + +R  +  ++   E+  +     L + I  L++E   L+
Sbjct: 93  ELRERLQESYQKLTQLQEQLEEVRNYTGRLKEGRERHFQKSKEALEETIGRLREELAALS 152

Query: 103 HHYEQEEEFLTNDLSRKLNQLTEEKYRLEQT-IEQEQERVVNKLMRKIEKLEAETDAKQN 161
               +E       LSR L     ++   +Q   EQE+    +K  R I  L+ + D +  
Sbjct: 153 ----REVGKQRGLLSRGLATFKRDRILQQQWREEQEKYDAADKA-RAIYALQTKADERNL 207

Query: 162 N--LEQLRREKVELENTLEQEQEAL---VNKLWKRMDKLEAE 198
              L+ L +   +L    E+E E +   + +    +++LE +
Sbjct: 208 ETVLQSLSQADFQLAGVAEKELETVEARIKEARYEIEELENK 249


>gnl|CDD|190336 pfam02520, DUF148, Domain of unknown function DUF148.  This domain
           has no known function nor do any of the proteins that
           possess it. In one member of this family the aligned
           region is repeated twice.
          Length = 114

 Score = 28.4 bits (64), Expect = 2.5
 Identities = 23/93 (24%), Positives = 38/93 (40%), Gaps = 3/93 (3%)

Query: 94  LKKEKETLAHHYEQEEEFL-TNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKL 152
           LK E  T+A    Q E +     ++ +  +  +    L+   E+    V++ L     KL
Sbjct: 11  LKNETLTIAEIDAQLEAWAEKYGVADQYEEFKKNVKALKAEAEKNVTAVISNLSSVDAKL 70

Query: 153 EAETDAKQNNLEQLRREKV-ELENTLEQEQEAL 184
            A  D K     Q ++E +  L+  L QE   L
Sbjct: 71  SAILDNKDLTRRQ-KKEAIDALKKQLPQEVSTL 102


>gnl|CDD|153328 cd07644, I-BAR_IMD_BAIAP2L2, Inverse (I)-BAR, also known as the
           IRSp53/MIM homology Domain (IMD), of Brain-specific
           Angiogenesis Inhibitor 1-Associated Protein 2-Like 2.
           The IMD domain, also called Inverse-Bin/Amphiphysin/Rvs
           (I-BAR) domain, is a dimerization and lipid-binding
           module that bends membranes and induces membrane
           protrusions. This group is composed of uncharacterized
           proteins known as BAIAP2L2 (Brain-specific Angiogenesis
           Inhibitor 1-Associated Protein 2-Like 2). They contain
           an N-terminal IMD, an SH3 domain, and a WASP homology 2
           (WH2) actin-binding motif at the C-terminus. The related
           proteins, BAIAP2L1 and IRSp53, function as regulators of
           membrane dynamics and the actin cytoskeleton. The IMD
           domain binds and bundles actin filaments, binds
           membranes and produces membrane protrusions, and
           interacts with the small GTPase Rac.
          Length = 215

 Score = 29.5 bits (66), Expect = 2.6
 Identities = 21/80 (26%), Positives = 40/80 (50%), Gaps = 11/80 (13%)

Query: 243 RNQLANTQQEHTEKMERYVKDEKNVRE--ENLRLQRKLQREVERREALCRHLSESESSLE 300
           R++ AN ++  +E      + ++NVRE  EN+   R+  +           L ES+ + E
Sbjct: 122 RHRAANLEKCMSELWRMERQRDRNVREMKENVNRLRQSMQAF---------LKESQRAAE 172

Query: 301 MEEERHYNEIAHSMRLVGSS 320
           +EE+R Y  +A    L+ ++
Sbjct: 173 LEEKRRYRFLAEKHYLLNNT 192


>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
          Length = 895

 Score = 29.9 bits (67), Expect = 2.6
 Identities = 29/205 (14%), Positives = 74/205 (36%), Gaps = 39/205 (19%)

Query: 110 EFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRRE 169
           ++L   L     +L   K ++        E+  +  +++IE+L  E +   ++   L+  
Sbjct: 186 DYLEEKLKSSNLELENIKKQIAD-----DEKSHSITLKEIERLSIEYNNAMDDYNNLKSA 240

Query: 170 KVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLS 229
                                 +  LE  K   + ++    SD S   + NN        
Sbjct: 241 -------------------LNELSSLEDMKNRYESEIKTAESDLSMELEKNN-------- 273

Query: 230 THIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKLQREVERREALC 289
            +   L    M++ N      + +     +Y  D +N ++    +   +  E+ +  A+ 
Sbjct: 274 -YYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKKQ----ILSNIDAEINKYHAII 328

Query: 290 RHLSESES--SLEMEEERHYNEIAH 312
           + LS  +   +  ++++  Y+++ +
Sbjct: 329 KKLSVLQKDYNDYIKKKSRYDDLNN 353



 Score = 29.9 bits (67), Expect = 3.1
 Identities = 51/324 (15%), Positives = 125/324 (38%), Gaps = 50/324 (15%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
           E ++K+I   ++ + +   E+E   ++     ++  NL+     + +  + +  +   + 
Sbjct: 200 ENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLEDMKNRY--ESE 257

Query: 88  MKKIQALKKEKETLAHHYEQEEEF---LTNDLSRKLNQLTEEKYRLEQTIEQEQERVVN- 143
           +K  ++    +    ++Y++ EE    + ND   K      + ++ +  IE +++ + N 
Sbjct: 258 IKTAESDLSMELEKNNYYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKKQILSNI 317

Query: 144 --------KLMRKIEKLEAETD---AKQNNLEQLRREKVELENTLEQEQEALVN----KL 188
                    +++K+  L+ + +    K++  + L  + +ELE         L +    K 
Sbjct: 318 DAEINKYHAIIKKLSVLQKDYNDYIKKKSRYDDLNNQILELEGYEMDYNSYLKSIESLKK 377

Query: 189 WKRMDKLEAEK------RILQIKLDQPVSDPSSPHDMN------NGDTASNLSTHIHSLR 236
                    E+       IL+I+   P +     +++N      +   +S L+  I +LR
Sbjct: 378 KIEEYSKNIERMSAFISEILKIQEIDPDAIKKELNEINVKLQDISSKVSS-LNQRIRALR 436

Query: 237 SEVMRLRNQLANTQQEH----------TEKMERYVKD---EKNVREENLRLQRKLQREVE 283
             +  L   +     +            EK    +     +K+  EE +   R+++ EV+
Sbjct: 437 ENLDELSRNMEMLNGQSVCPVCGTTLGEEKSNHIINHYNEKKSRLEEKI---REIEIEVK 493

Query: 284 RREALCRHLSESESSLEMEEERHY 307
             +     L + +  LE EE    
Sbjct: 494 DIDEKIVDLKKRKEYLESEEINKS 517



 Score = 28.7 bits (64), Expect = 5.8
 Identities = 31/158 (19%), Positives = 69/158 (43%), Gaps = 6/158 (3%)

Query: 114 NDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVEL 173
           N++ ++LN L      +E     + +  ++K +R+IE      + K N +++ +    +L
Sbjct: 590 NEIKKQLNDLESRLQEIEIGFP-DDKSYIDKSIREIENEANNLNNKYNEIQENKILIEKL 648

Query: 174 ENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIH 233
              ++  ++ +        D  E   RI  I+ +   S      D    + A   ST   
Sbjct: 649 RGKIDNYKKQIAEIDSIIPDLKEITSRINDIEDNLKKS--RKALDDAKANRARLESTI-E 705

Query: 234 SLRSEVMRLRNQLANTQQ--EHTEKMERYVKDEKNVRE 269
            LR+ +  L +++ +  +  E  +K+++ + D K +RE
Sbjct: 706 ILRTRINELSDRINDINETLESMKKIKKAIGDLKRLRE 743


>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
          Length = 455

 Score = 29.8 bits (67), Expect = 2.7
 Identities = 22/84 (26%), Positives = 41/84 (48%), Gaps = 15/84 (17%)

Query: 129 RLEQTIEQ----------EQERVVNKLMRKIEKLEAETDAKQNNLEQ-LRREKVELEN-- 175
           RLE+ +E+          + E  + K + K+ K E E+  K+  LE+ L+  + + EN  
Sbjct: 365 RLEKDVEEGEKTIVKEARQIEEELEKEVEKLGK-EEESLFKRVALEEGLKELEQDEENFL 423

Query: 176 -TLEQEQEALVNKLWKRMDKLEAE 198
             L +E++ L+ KL     ++E  
Sbjct: 424 KELSKEEKELLEKLKMEASEVEKL 447


>gnl|CDD|148614 pfam07106, TBPIP, Tat binding protein 1(TBP-1)-interacting protein
           (TBPIP).  This family consists of several eukaryotic
           TBP-1 interacting protein (TBPIP) sequences. TBP-1 has
           been demonstrated to interact with the human
           immunodeficiency virus type 1 (HIV-1) viral protein Tat,
           then modulate the essential replication process of HIV.
           In addition, TBP-1 has been shown to be a component of
           the 26S proteasome, a basic multiprotein complex that
           degrades ubiquitinated proteins in an ATP-dependent
           fashion. Human TBPIP interacts with human TBP-1 then
           modulates the inhibitory action of human TBP-1 on
           HIV-Tat-mediated transactivation.
          Length = 169

 Score = 29.2 bits (66), Expect = 2.7
 Identities = 24/89 (26%), Positives = 46/89 (51%), Gaps = 9/89 (10%)

Query: 22  PSPVTREQLQKRIESLQQQNRVLKVELETYKLQLRAL---------QEENKNLRQTSVNI 72
           PS     +L   IE L+++ ++LK +  T +++L++L         QEE + L++    I
Sbjct: 69  PSDEELNKLDMEIEELREEVQLLKQDCSTLEIELKSLTSDLTTEELQEEIQELKKEVREI 128

Query: 73  QAKAEQEEEFISNTLMKKIQALKKEKETL 101
           + K E  EE       ++++ +KKE + L
Sbjct: 129 EEKLESLEEGWKPVTPEEMEKVKKEYKDL 157


>gnl|CDD|201540 pfam00992, Troponin, Troponin.  Troponin (Tn) contains three
           subunits, Ca2+ binding (TnC), inhibitory (TnI), and
           tropomyosin binding (TnT). this Pfam contains members of
           the TnT subunit. Troponin is a complex of three
           proteins, Ca2+ binding (TnC), inhibitory (TnI), and
           tropomyosin binding (TnT). The troponin complex
           regulates Ca++ induced muscle contraction. This family
           includes troponin T and troponin I. Troponin I binds to
           actin and troponin T binds to tropomyosin.
          Length = 131

 Score = 28.7 bits (65), Expect = 2.9
 Identities = 15/60 (25%), Positives = 28/60 (46%), Gaps = 4/60 (6%)

Query: 144 KLMRKIEKLEAETDAKQNNLEQLR--REKVE-LENTLE-QEQEALVNKLWKRMDKLEAEK 199
            L+      E E + ++   E+ +   E+   L  +L   E + L  KL  R+D+L+ E+
Sbjct: 6   SLLLLKAAEELEFEQRKKEEEKEKYLAERCPPLRLSLSRAELQELCKKLHARIDRLDEER 65


>gnl|CDD|236861 PRK11147, PRK11147, ABC transporter ATPase component; Reviewed.
          Length = 635

 Score = 29.9 bits (68), Expect = 3.0
 Identities = 29/101 (28%), Positives = 41/101 (40%), Gaps = 11/101 (10%)

Query: 92  QALKKEKETLAHHYEQEEEFLTNDLS----RKLNQLTEEKYRLEQTIEQEQERVVNKLMR 147
           Q   K+KE  A    +  +  +  LS    R+L QL +    LE  IE  Q +V +    
Sbjct: 538 QPAVKKKEEAAAPKAETVKRSSKKLSYKLQRELEQLPQLLEDLEAEIEALQAQVADA--- 594

Query: 148 KIEKLEAETDAKQNNLEQLRREKVELENTLE--QEQEALVN 186
             +      +  Q  L  L   + ELE   E  +E EAL N
Sbjct: 595 --DFFSQPHEQTQKVLADLADAEQELEVAFERWEELEALKN 633


>gnl|CDD|131526 TIGR02473, flagell_FliJ, flagellar export protein FliJ.  Members of
           this family are the FliJ protein found, in nearly every
           case, in the midst of other flagellar biosynthesis genes
           in bacgterial genomes. Typically the fliJ gene is found
           adjacent to the gene for the flagellum-specific ATPase
           FliI. Sequence scoring in the gray zone between trusted
           and noise cutoffs include both probable FliJ proteins
           and components of bacterial type III secretion systems.
          Length = 141

 Score = 28.4 bits (64), Expect = 3.0
 Identities = 23/107 (21%), Positives = 48/107 (44%)

Query: 76  AEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIE 135
            E+EEE     L K     ++ +  L    +  EE+    L +     +  +    Q   
Sbjct: 11  REKEEEQAKLELAKAQAEFERLETQLQQLIKYREEYEQQALEKVGAGTSALELSNYQRFI 70

Query: 136 QEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQE 182
           ++ ++ + +  +++  L+ E +AK+  L + RRE   LE   E++Q+
Sbjct: 71  RQLDQRIQQQQQELALLQQEVEAKRERLLEARRELKALEKLKEKKQK 117


>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
          Length = 1021

 Score = 29.7 bits (66), Expect = 3.1
 Identities = 29/107 (27%), Positives = 53/107 (49%), Gaps = 20/107 (18%)

Query: 126 EKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALV 185
           EK R+E+   +E+ER+  + M +IE+   E        E+L RE++E +  LE++     
Sbjct: 456 EKKRIERLEREERERLERERMERIERERLER-------ERLERERLERDR-LERD----- 502

Query: 186 NKLWKRMDKLEAEK--RILQIKLDQPVSDPSSPHDMNNGDTASNLST 230
                R+D+LE E+  R+ + +L++   +      M NG +A     
Sbjct: 503 -----RLDRLERERVDRLERDRLEKARRNSYFLKGMENGLSAGGGPG 544


>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
           system, R (restriction) subunit and related helicases
           [Defense mechanisms].
          Length = 962

 Score = 29.7 bits (67), Expect = 3.2
 Identities = 39/207 (18%), Positives = 69/207 (33%), Gaps = 27/207 (13%)

Query: 98  KETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQ-------ERVVNKLMRKIE 150
           K+ L  +  + EE L   L   + +L  EK        +EQ       E     L     
Sbjct: 657 KKALKLYSNEGEEDLKEALEEAIEEL-VEKLEDVLNEFKEQSEDIGYLEDAGELLRDFAP 715

Query: 151 KLEAETDAKQNNLEQLRREKVELEN----------TLEQEQEALVNKLWKRMDKLEAEKR 200
           +LE + +     L +L    ++  N          TLE+E + L + L  R++K    K 
Sbjct: 716 ELEIKVERGNKILAKLFGRFLKAFNALKKLYEFAYTLEEEIQYLEDLLKARLEKDYQSKY 775

Query: 201 ILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRN------QLANTQQEHT 254
           +  I       D     D  +     +       L  +     +      Q  N +    
Sbjct: 776 LDLIDNLN---DAREIKDKESKAIDLDDIDFELELIGKQEINIDYILELLQTFNDKNGAY 832

Query: 255 EKMERYVKDEKNVREENLRLQRKLQRE 281
           E ++  ++       E+LR ++KL   
Sbjct: 833 ESLKELIERIIKEWIEDLRQKKKLIER 859


>gnl|CDD|227731 COG5444, COG5444, Uncharacterized conserved protein [Function
           unknown].
          Length = 565

 Score = 29.7 bits (67), Expect = 3.3
 Identities = 24/133 (18%), Positives = 45/133 (33%), Gaps = 19/133 (14%)

Query: 26  TREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKN----LRQTSVNIQAKAEQEEE 81
           T E+L K  +        ++ +++  K  +R L+E         +   ++  AKA     
Sbjct: 158 TLEKLYKLDQEGMTLMAAVESKMQELKAIIRQLEEWTIKGGATKKGVPIHYVAKA----- 212

Query: 82  FISNTLMKKIQALKKEKET--LAHHYEQEEEFLTNDLSRKL--------NQLTEEKYRLE 131
           F   T+ KK   +  + ET         +E     DL + L          L  E     
Sbjct: 213 FAEVTIHKKAAEVALQSETYLDIKTELAKERPDMRDLDKPLESANKTGYEGLAGEDIVPF 272

Query: 132 QTIEQEQERVVNK 144
              E+  +   ++
Sbjct: 273 FLAEETGQLAYSR 285


>gnl|CDD|219342 pfam07227, DUF1423, Protein of unknown function (DUF1423).  This
           family represents a conserved region approximately 500
           residues long within a number of Arabidopsis thaliana
           proteins of unknown function.
          Length = 446

 Score = 29.7 bits (67), Expect = 3.3
 Identities = 25/98 (25%), Positives = 46/98 (46%), Gaps = 15/98 (15%)

Query: 56  RALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQA---------LKKEKETL----- 101
           R L+E+ + + +  +  Q K  Q +E  S   +K+ +A          ++E E L     
Sbjct: 329 RELEEKAREVSELKMERQRKKPQIDELESIVRLKQAEADMFQLKADEARREAERLQRIAL 388

Query: 102 AHHYEQEEEFLTNDLSRKLNQLTEEK-YRLEQTIEQEQ 138
           A   + EEE+ +  L  +L++  EE+ Y  E+   QE+
Sbjct: 389 AKTEKSEEEYASKYLKLRLSEAEEERQYLFEELKLQEE 426


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 29.4 bits (66), Expect = 3.4
 Identities = 41/159 (25%), Positives = 69/159 (43%), Gaps = 19/159 (11%)

Query: 98  KETLAHHYEQEEEFLTNDLSRKLNQLTEE----KYRLEQTIEQEQERVVNKLMRKIEKLE 153
           K  L  + ++EE+F    L R+ N   EE     Y L   I+      +    +KIE  +
Sbjct: 76  KRVLKIYNKREEDF--PSL-REYNDYLEEVEDIVYNLTNNID------LENTKKKIETYQ 126

Query: 154 AET-DAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRM--DKLEAEKRILQIKLDQPV 210
            E  D  Q N E+  RE+ ELE  LE E+E    +  +R+   K E E+++ + K  Q +
Sbjct: 127 KENKDVIQKNKEKSTREQEELEEALEFEKE---EEEQRRLLLQKEEEEQQMNKRKNKQAL 183

Query: 211 SDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANT 249
            D      +   +  +    +   L  +V + + +  NT
Sbjct: 184 LDELETSTLPAAELIAQHKKNSVKLEMQVEKPKPEKPNT 222


>gnl|CDD|218038 pfam04350, PilO, Pilus assembly protein, PilO.  PilO proteins are
           involved in the assembly of pilin. However, the precise
           function of this family of proteins is not known.
          Length = 144

 Score = 28.6 bits (65), Expect = 3.5
 Identities = 13/52 (25%), Positives = 21/52 (40%), Gaps = 5/52 (9%)

Query: 147 RKIEKLEAETDAKQN---NLEQLRREKVELENTLEQEQEALVNKLWKRMDKL 195
            K   L+ E + K     NL   +++  ELE       + L +K  K +  L
Sbjct: 6   AKEATLKQEFEEKARQAANLPAYKKQLEELEERFGALLKQLPSK--KEIPSL 55


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 29.6 bits (67), Expect = 3.5
 Identities = 16/131 (12%), Positives = 42/131 (32%), Gaps = 7/131 (5%)

Query: 59  QEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEF-----LT 113
                 +    +N     ++ +E I   +  + + LK  +        + EE        
Sbjct: 784 ILIITEILSDGINNSDINDRPQELIDQIIESEEERLKALRIQREEMLMRPEELELINEEQ 843

Query: 114 NDLSRKLNQLTEEKYRLEQTIEQEQE--RVVNKLMRKIEKLEAETDAKQNNLEQLRREKV 171
            +L +++     E    E+ I+   E      +     +KL     AK   ++ +     
Sbjct: 844 KNLKQEIKLELSEIQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSDE 903

Query: 172 ELENTLEQEQE 182
             ++    +++
Sbjct: 904 IKQDEKTTKKK 914


>gnl|CDD|219780 pfam08286, Spc24, Spc24 subunit of Ndc80.  Spc24 is a component of
           the evolutionarily conserved kinetochore-associated
           Ndc80 complex and is involved in chromosome segregation.
          Length = 117

 Score = 28.1 bits (63), Expect = 3.6
 Identities = 18/48 (37%), Positives = 27/48 (56%), Gaps = 3/48 (6%)

Query: 136 QEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEA 183
           QE E    +L +++  LE+E +  Q+ LE+L+    ELE   EQE E 
Sbjct: 2   QELENEKFRLAKELNDLESELNNLQSELEELKERLEELE---EQEVEG 46


>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria, archaea and eukaryotes.
           This domain is about 200 amino acids in length. This
           domain is found associated with pfam00013, pfam01966.
           This domain has a single completely conserved residue A
           that may be functionally important.
          Length = 201

 Score = 28.7 bits (65), Expect = 3.7
 Identities = 36/124 (29%), Positives = 62/124 (50%), Gaps = 13/124 (10%)

Query: 74  AKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEF-LTNDLSRKLNQLTEEKYRLEQ 132
           + AE+E + I     K+ +ALKKE    A    +EE   L  +  R+L +   E  R E+
Sbjct: 30  SSAEEEAKQIIEEAKKEAEALKKEALLEA----KEEIHKLRAEAERELKERRNELQRQEK 85

Query: 133 TIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRM 192
            + Q++E     L RK E L    + K+ +LE+  +E    +  LE+++E L   + ++ 
Sbjct: 86  RLLQKEET----LDRKDESL----EKKEESLEEKEKELAARQQQLEEKEEELEELIEEQQ 137

Query: 193 DKLE 196
            +LE
Sbjct: 138 QELE 141


>gnl|CDD|237449 PRK13616, PRK13616, lipoprotein LpqB; Provisional.
          Length = 591

 Score = 29.6 bits (67), Expect = 3.8
 Identities = 14/36 (38%), Positives = 16/36 (44%), Gaps = 4/36 (11%)

Query: 355 PVHSRPQRIKTPLLSVEQPTSPEGAPTPTLGSPMDI 390
           P  S PQ I T    VE+P      P PT G   D+
Sbjct: 27  PSSSAPQAIGT----VERPAPSNEVPKPTPGMDPDL 58


>gnl|CDD|216603 pfam01610, DDE_Tnp_ISL3, Transposase.  Transposase proteins are
           necessary for efficient DNA transposition. Contains
           transposases for IS204, IS1001, IS1096 and IS1165.
          Length = 244

 Score = 29.1 bits (66), Expect = 3.8
 Identities = 23/93 (24%), Positives = 36/93 (38%), Gaps = 15/93 (16%)

Query: 17  ALMMPPSPVTREQLQKRIESLQQQNRVLKVELETY--KLQLRALQEENKNLRQTSVNIQA 74
            L+  P  +T +Q Q+  + L   N  LK     Y  K +LR +       R        
Sbjct: 120 LLLKNPENLTEKQKQRLEDLLLDLNPELK---RAYELKEELRDI------YRYKDK---E 167

Query: 75  KAEQE-EEFISNTLMKKIQALKKEKETLAHHYE 106
           +A +  +E+I       I   KK  +TL  H +
Sbjct: 168 EARRLLKEWIEKARRSGIPPFKKLAKTLKRHLD 200


>gnl|CDD|225519 COG2972, COG2972, Predicted signal transduction protein with a
           C-terminal ATPase domain [Signal transduction
           mechanisms].
          Length = 456

 Score = 29.3 bits (66), Expect = 3.8
 Identities = 30/116 (25%), Positives = 51/116 (43%), Gaps = 16/116 (13%)

Query: 26  TREQLQKRIESLQQQNRVLKVELETYKLQLRALQEEN-----KNLRQTSVNIQAKAE-QE 79
           T  ++++R+E L ++N  L  E E  + +LRALQ +       N  +T   I+  AE  +
Sbjct: 231 TINEMEERLEYLIEENYSL--EQEQLEAELRALQSQINPHFLYNTLET---IRMLAEEDD 285

Query: 80  EEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIE 135
            E  +  +    + L+     L +    E E L   L  K   L  +K R+   +E
Sbjct: 286 PEEAAKVVKALSKLLRYSLSNLDNIVTLEIELL---LIEKY--LEIQKLRIGDRLE 336


>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407).  This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 366 and 597 amino
           acids in length. There is a single completely conserved
           residue R that may be functionally important.
          Length = 297

 Score = 29.1 bits (66), Expect = 3.9
 Identities = 30/136 (22%), Positives = 52/136 (38%), Gaps = 9/136 (6%)

Query: 83  ISNTLMKKIQALKKEKETLAHHYEQEEEF---LTNDLSRKLNQLTEEKYRLEQTIEQEQE 139
           IS  L  KI   + +   L    E++      +      K+  LT E   L+  I++ Q 
Sbjct: 97  ISEPLELKIFEKEIDTVLLEIQQEEQAAAQAQVAAGFRPKIAALTAEIAALQAEIDEAQA 156

Query: 140 RVVNKLMRKIEKLEAETDA---KQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLE 196
            V      +  + EAE             + +EK E  +  +   E L  +L   + +LE
Sbjct: 157 EV--NAAYQEAQCEAEGTGGTGVAGK-GPVYKEKREKLDAAQARLETLKARLDAAIAQLE 213

Query: 197 AEKRILQIKLDQPVSD 212
           A+K  L+      V++
Sbjct: 214 AQKAALERNRQAAVAE 229


>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
          Length = 281

 Score = 29.2 bits (66), Expect = 3.9
 Identities = 24/112 (21%), Positives = 44/112 (39%), Gaps = 4/112 (3%)

Query: 78  QEEEFISNTLMKKIQALKKEKETL---AHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTI 134
           +    I+   +K  +  K   + L        +EEE    +  R+     E K  +E+  
Sbjct: 7   KRSNVINKEKLKTHEIQKYRFKVLSIKEKERLREEEEEQVEQLREEAN-DEAKEIIEEAE 65

Query: 135 EQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVN 186
           E   E V        E+L  +TD   + +E+L+ +    +   E+E E L+ 
Sbjct: 66  EDAFEIVEAAEEEAKEELLKKTDEASSIIEKLQMQIEREQEEWEEELERLIE 117


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 29.3 bits (66), Expect = 3.9
 Identities = 31/172 (18%), Positives = 79/172 (45%), Gaps = 6/172 (3%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
           R++L  ++  L+++ + L+ E +    +++ L+E+   +      ++ +  + +E  +  
Sbjct: 43  RDELNAKVRELREKAQELREERDEINEEVQELKEKRDEINAKLQELRKEYRELKEKRNEF 102

Query: 87  LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
            +        E+E      +Q+   LT +  R+L Q  +E  +  +  ++       +  
Sbjct: 103 NLGGRSIKSLEREIERLEKKQQTSVLTPEEERELVQKIKELRKELEDAKK-----ALEEN 157

Query: 147 RKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAE 198
            K+++L+AE D  +    ++  +  EL N  ++  E +  KL++  D+L  E
Sbjct: 158 EKLKELKAEIDELKKKAREIHEKIQELANEAQEYHEEM-IKLFEEADELRKE 208


>gnl|CDD|220646 pfam10240, DUF2464, Protein of unknown function (DUF2464).  This is
           a family of proteins conserved from worms to humans.
           Members have been annotated as FAM125A proteins, but
           their function is unknown.
          Length = 251

 Score = 28.9 bits (65), Expect = 4.1
 Identities = 16/52 (30%), Positives = 24/52 (46%), Gaps = 5/52 (9%)

Query: 344 CGQTTVMPIPPPVHSRPQRIKTPL--LSVEQPTSPEGAPTPTLGSPMDISKS 393
             +   +P PPPV  +P+ I      LS++  + P  AP   L S +   KS
Sbjct: 139 WCKKGSVPAPPPV-PKPRSISQGSQSLSLDSASRP--APGLPLLSSLSRGKS 187


>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D.  BicD
           proteins consist of three coiled-coiled domains and are
           involved in dynein-mediated minus end-directed transport
           from the Golgi apparatus to the endoplasmic reticulum
           (ER). For full functioning they bind with GSK-3beta
           pfam05350 to maintain the anchoring of microtubules to
           the centromere. It appears that amino-acid residues
           437-617 of BicD and the kinase activity of GSK-3 are
           necessary for the formation of a complex between BicD
           and GSK-3beta in intact cells.
          Length = 711

 Score = 29.4 bits (66), Expect = 4.2
 Identities = 61/296 (20%), Positives = 105/296 (35%), Gaps = 51/296 (17%)

Query: 36  SLQQQNRVLK---VELETYKLQLRALQEENKNLRQT---SVNIQAKAEQEEEFISNTLMK 89
           SLQ+Q  VL+   VE E  K ++R L+EE + L      +  ++  AE++ E    TL  
Sbjct: 108 SLQKQVSVLRQSQVEFEGLKHEIRRLEEETELLNSQLEEAARLKEIAEKQLEEALETLKT 167

Query: 90  KIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVN------ 143
           + +     ++ L+HH   E+ +  ++LS  L+ L   +     T         N      
Sbjct: 168 EREQKNALRKELSHHLNLEDFYSLSNLSISLDGLKFSEDPEASTEPNNDGEDENGHLNGG 227

Query: 144 KLMRKIEKLEAETDAK---------------------QNNLEQLRREKVELENTLE---- 178
             + K      + +                       +  LEQ+ REK  L   L+    
Sbjct: 228 PGLAKSLGTPRKGELFPPAPGVSDLFSELNLSEIQKLKQQLEQVEREKASLLANLQELQT 287

Query: 179 ---------QEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLS 229
                     EQ   VN+L + ++ L   +   + +           H+  +        
Sbjct: 288 QLEQSEGALSEQHEKVNRLTEHLNALRNLQASKEQQDGLDSEKDRGSHEDGDYYEVDING 347

Query: 230 THI-----HSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKLQR 280
             I         SEV  L+ +L   +  + E  ERY  ++  +R E   L  KL  
Sbjct: 348 PEILECKYRVAVSEVGELKAELKALKARYNELQERYEDEKDRLRNEIQNLAEKLLS 403


>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein.  Taxilin
           contains an extraordinarily long coiled-coil domain in
           its C-terminal half and is ubiquitously expressed. It is
           a novel binding partner of several syntaxin family
           members and is possibly involved in Ca2+-dependent
           exocytosis in neuroendocrine cells. Gamma-taxilin,
           described as leucine zipper protein Factor Inhibiting
           ATF4-mediated Transcription (FIAT), localises to the
           nucleus in osteoblasts and dimerises with ATF4 to form
           inactive dimers, thus inhibiting ATF4-mediated
           transcription.
          Length = 309

 Score = 28.9 bits (65), Expect = 4.3
 Identities = 29/132 (21%), Positives = 61/132 (46%), Gaps = 7/132 (5%)

Query: 26  TREQLQKRIESLQQQNRVLKVELETY-KLQLRALQEENKNLRQTSVNIQAKAEQEEEFIS 84
            R +L+     LQ+ N+ LK E +     +    +E  +  + T  +IQ + E+     +
Sbjct: 72  ARSKLESLCRELQKHNKTLKEENKARSDEEEEKRKELTEKFQSTLKDIQDQMEEHSN-PN 130

Query: 85  NTLMKKIQALKKEKETLAHHYEQ-----EEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQE 139
           + L ++ + L+++ + L   YE      E+   T DL ++LN    E+ + +   EQE+ 
Sbjct: 131 HKLCEENEELREKLKELIEQYELREQHFEKLLKTKDLEQQLNDAKLEQAQEQAEQEQEKH 190

Query: 140 RVVNKLMRKIEK 151
           +   +++ K   
Sbjct: 191 KREKEILLKEAA 202


>gnl|CDD|216249 pfam01025, GrpE, GrpE. 
          Length = 165

 Score = 28.3 bits (64), Expect = 4.5
 Identities = 13/40 (32%), Positives = 24/40 (60%)

Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQ 67
          E+L+   E+L+++   L+ E+E  K +L  L  E +N R+
Sbjct: 7  EELEDEEEALEEELEELEEEIEELKDRLLRLLAEFENYRK 46


>gnl|CDD|235551 PRK05667, dnaG, DNA primase; Validated.
          Length = 580

 Score = 29.0 bits (66), Expect = 4.6
 Identities = 25/156 (16%), Positives = 53/156 (33%), Gaps = 8/156 (5%)

Query: 66  RQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAH--HYEQEEEFLTNDLSRKLNQL 123
            +  +   A+ EQ         +K  +    E+E LA    + +  E + + L  +  + 
Sbjct: 412 ERLGILDDAQLEQLVPKAQEPQLKAERPRTAERELLALLLQHPELAEEVRDALDEEDFEG 471

Query: 124 TEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQ----NNLEQLRREKVELENTLEQ 179
                 L + I  +        +  +E L      +      +L        E    LE+
Sbjct: 472 LPLFRALLEAILAQPGLTTGSQL--LEHLRDAGLEELAALLESLAVWEEISEEDIAALEK 529

Query: 180 EQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSS 215
           E +  + KL  ++ +   E+ I + +L +     S 
Sbjct: 530 ELKDALEKLRDQLLEERLEELIAKERLLEGHGLSSE 565


>gnl|CDD|183173 PRK11519, PRK11519, tyrosine kinase; Provisional.
          Length = 719

 Score = 29.4 bits (66), Expect = 4.7
 Identities = 25/99 (25%), Positives = 46/99 (46%), Gaps = 10/99 (10%)

Query: 85  NTLMKKIQALKKEKET--LAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIE---QEQE 139
           N L   +   +  K+T  L+  Y  E+     D+   LN +T     LEQ IE   +E  
Sbjct: 212 NNLQNNLTVTENGKDTGVLSLTYTGEDREQIRDI---LNSITRN--YLEQNIERKSEEAS 266

Query: 140 RVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLE 178
           + +  L +++ ++ +  D  +N L   R++K  ++  LE
Sbjct: 267 KSLAFLAQQLPEVRSRLDVAENKLNAFRQDKDSVDLPLE 305


>gnl|CDD|240334 PTZ00258, PTZ00258, GTP-binding protein; Provisional.
          Length = 390

 Score = 28.8 bits (65), Expect = 4.9
 Identities = 19/73 (26%), Positives = 32/73 (43%), Gaps = 5/73 (6%)

Query: 132 QTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKR 191
           + I  E   ++ K +  +EK   E   K+   ++ + EKVEL+  L++  E L      R
Sbjct: 145 EIISSE---LILKDLEFVEKRLDELTKKRKKKKKKKEEKVELD-VLKKVLEWLEEGKPVR 200

Query: 192 MDKL-EAEKRILQ 203
                + E  IL 
Sbjct: 201 DGDWTDKEIEILN 213


>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
          Length = 943

 Score = 29.3 bits (65), Expect = 5.0
 Identities = 12/49 (24%), Positives = 19/49 (38%), Gaps = 2/49 (4%)

Query: 337 KEPRCFACGQTTVMPIPPPVHSRPQRIKTP--LLSVEQPTSPEGAPTPT 383
           +EP+     ++   P  P     P+ +  P      E P SP+  P P 
Sbjct: 592 EEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQ 640


>gnl|CDD|233067 TIGR00641, acid_CoA_mut_N, methylmalonyl-CoA mutase N-terminal
           domain.  Methylmalonyl-CoA mutase (EC 5.4.99.2)
           catalyzes a reversible isomerization between
           L-methylmalonyl-CoA and succinyl-CoA. The enzyme uses an
           adenosylcobalamin cofactor. It may be a homodimer, as in
           mitochondrion, or a heterodimer with partially
           homologous beta chain that does not bind the
           adenosylcobalamin cofactor, as in Propionibacterium
           freudenreichii. The most similar archaeal sequences are
           separate chains, such as AF2215 abd AF2219 of
           Archaeoglobus fulgidus, that correspond roughly to the
           first 500 and last 130 residues, respectively of known
           methylmalonyl-CoA mutases. This model describes the
           N-terminal domain subfamily. In a neighbor-joining tree,
           AF2215 branches with a bacterial isobutyryl-CoA mutase,
           which is also the same length. Scoring between the noise
           and trusted cutoffs are the non-catalytic, partially
           homologous beta chains from certain heterodimeric
           examples of 5.4.99.2.
          Length = 524

 Score = 29.0 bits (65), Expect = 5.1
 Identities = 15/49 (30%), Positives = 25/49 (51%), Gaps = 7/49 (14%)

Query: 127 KYRLEQTIEQE-----QERVVNKLMRKIEKLEAETDAK--QNNLEQLRR 168
           KY+LE+  E E        V  + + K++KL AE D +  +  L+ L +
Sbjct: 433 KYQLEEEDEVEVLKVDNSSVREEQIAKLKKLRAERDQEKVEAALDALTK 481


>gnl|CDD|236729 PRK10636, PRK10636, putative ABC transporter ATP-binding protein;
           Provisional.
          Length = 638

 Score = 29.0 bits (65), Expect = 5.3
 Identities = 33/107 (30%), Positives = 49/107 (45%), Gaps = 20/107 (18%)

Query: 47  ELETYKLQLRALQ----EENKNLRQTSVN-IQAKAEQ---EEEFISNT--LMKKIQALKK 96
           +LE Y+  L  +Q    + ++  ++ + N  QA+ +Q   E E  + T  L K+I  L+K
Sbjct: 511 DLEDYQQWLSDVQKQENQTDEAPKENNANSAQARKDQKRREAELRTQTQPLRKEIARLEK 570

Query: 97  EKETLAHHYEQEEEFLTN----DLSRK------LNQLTEEKYRLEQT 133
           E E L     Q EE L +    D SRK      L Q    K  LE+ 
Sbjct: 571 EMEKLNAQLAQAEEKLGDSELYDQSRKAELTACLQQQASAKSGLEEC 617


>gnl|CDD|184896 PRK14903, PRK14903, 16S rRNA methyltransferase B; Provisional.
          Length = 431

 Score = 28.7 bits (64), Expect = 5.4
 Identities = 22/95 (23%), Positives = 42/95 (44%), Gaps = 10/95 (10%)

Query: 110 EFLTNDLSRKLNQLTEE-----KYRLEQTIEQEQERVVNKLMRKIEKLEAETDAK----- 159
           + L  D+SR+  QL E+     K    +    + ER+   +    +++  +         
Sbjct: 264 KILAVDISREKIQLVEKHAKRLKLSSIEIKIADAERLTEYVQDTFDRILVDAPCTSLGTA 323

Query: 160 QNNLEQLRREKVELENTLEQEQEALVNKLWKRMDK 194
           +N+ E LRR   E    L + Q  +V++ WK ++K
Sbjct: 324 RNHPEVLRRVNKEDFKKLSEIQLRIVSQAWKLLEK 358


>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
           division and chromosome partitioning].
          Length = 570

 Score = 28.9 bits (65), Expect = 5.5
 Identities = 33/176 (18%), Positives = 72/176 (40%), Gaps = 21/176 (11%)

Query: 29  QLQKRIESLQQQNRVLKVELETYKL-----QLRALQEENKNLRQTSVNIQAKAEQEEEFI 83
            +  R+E L++Q       L   +L     +L  +QE+ ++L    +  + +A+   E  
Sbjct: 252 NIDSRLERLKEQLVENSELLTQLELDEAEEELGLIQEKIESLYDL-LEREVEAKNVVEEN 310

Query: 84  SNTLMKKIQALKKEKETLAHHYE--QEEEFLTNDLSRKLNQLTEEKYRLEQTIEQ----- 136
              L   ++  K+  E L    E  +E   L       + +  +E   LE  +++     
Sbjct: 311 LPILPDYLEKAKENNEHLKEEIERVKESYRLAETELGSVRKFEKELKELESVLDEILENI 370

Query: 137 --------EQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEAL 184
                   E +  + ++ + +  +E E +  Q +L  LR++++E    LE+ +  L
Sbjct: 371 EAQEVAYSELQDNLEEIEKALTDIEDEQEKVQEHLTSLRKDELEARENLERLKSKL 426


>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 1104

 Score = 28.8 bits (64), Expect = 5.5
 Identities = 33/195 (16%), Positives = 65/195 (33%), Gaps = 22/195 (11%)

Query: 25  VTREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFIS 84
           + +E+ +  +     +    K ++E+ K++    +E           I            
Sbjct: 295 IKKEEQETLVRQFTVEQTQAKSKVESAKIETDRAREMETLAHDNVKQIVGAQHGI----- 349

Query: 85  NTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNK 144
             L  K +    ++ T++      +  +       L     E       +       V++
Sbjct: 350 --LSAKREGAVDKRRTISTARAGLDALVK-----GLGGAAPESAEELLELNNAARLTVDE 402

Query: 145 LMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQI 204
                E LE    A Q N+E   R  V+     +QE    ++ L K    +E   R+LQ+
Sbjct: 403 YPAAREALE---SAGQRNVEDRTR-AVDEFKAADQE----LSSLSKGSSNIEY--RLLQV 452

Query: 205 KLDQPVSDPSSPHDM 219
           + +       SP DM
Sbjct: 453 RENLCQDLGVSPRDM 467


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 28.8 bits (64), Expect = 5.6
 Identities = 22/128 (17%), Positives = 57/128 (44%), Gaps = 3/128 (2%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
           ++ Q   +  QQ+    +   +  + ++R  Q+E KNL + +     K   E++ ++   
Sbjct: 223 DKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPK---EDKQVAENQ 279

Query: 88  MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMR 147
            ++I+  + E +       + ++    DL ++     +E    E   ++++E V   L +
Sbjct: 280 KREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAEDLQK 339

Query: 148 KIEKLEAE 155
              ++EA+
Sbjct: 340 TKPQVEAQ 347


>gnl|CDD|150420 pfam09744, Jnk-SapK_ap_N, JNK_SAPK-associated protein-1.  This is
           the N-terminal 200 residues of a set of proteins
           conserved from yeasts to humans. Most of the proteins in
           this entry have an RhoGEF pfam00621 domain at their
           C-terminal end.
          Length = 158

 Score = 28.2 bits (63), Expect = 5.7
 Identities = 39/152 (25%), Positives = 62/152 (40%), Gaps = 20/152 (13%)

Query: 128 YRLEQTIEQEQERVVN------------KLMRKIEKLEAETDAKQN---NLEQLRREKVE 172
           Y L  +I QE ER+++             ++R +E L++     Q     LE LR +   
Sbjct: 2   YDLANSIYQEFERLIDNYGEEVVKNLMPLVIRVLENLDSLASENQELEVELELLREDNER 61

Query: 173 LENTLEQEQE---ALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSS--PHDMNNGDTASN 227
           L    E+E+E       KL +  D+LE EK+ LQ K++    +         N  D  S 
Sbjct: 62  LSTQYEREKELRKQAEQKLLEFEDELEQEKKELQKKIEDLEENVRQLELKAKNLSDQVSR 121

Query: 228 LSTHIHSLRSEVMRLRNQLANTQQEHTEKMER 259
           L      L+ E   L  +     + + E +ER
Sbjct: 122 LEERETELKKEYNSLHERYTKLLKNYVEYVER 153


>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
           (CALCOCO1) like.  Proteins found in this family are
           similar to the coiled-coil transcriptional coactivator
           protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
           This protein binds to a highly conserved N-terminal
           domain of p160 coactivators, such as GRIP1, and thus
           enhances transcriptional activation by a number of
           nuclear receptors. CALCOCO1 has a central coiled-coil
           region with three leucine zipper motifs, which is
           required for its interaction with GRIP1 and may regulate
           the autonomous transcriptional activation activity of
           the C-terminal region.
          Length = 546

 Score = 28.8 bits (64), Expect = 5.7
 Identities = 50/197 (25%), Positives = 87/197 (44%), Gaps = 37/197 (18%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQEE---NKNLRQTSVNIQAKAEQEEEFI 83
           R  LQ   E L +  R L+  LE+ + +   L EE     +LR  ++    ++  E   +
Sbjct: 278 RIDLQLENEQLHEDLRTLQERLESSQQKAGLLGEELASLGSLRDHTIAELHQSRLESAQM 337

Query: 84  SNTLMKKIQALK-------KEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQ 136
           S  L +   ALK       +E+ETL    E +++        ++ +L+ E  +LE+ +++
Sbjct: 338 SLQLSQLNLALKEGQSQWAQERETLRQSAEADKD--------RIQKLSAELLKLEEWLQE 389

Query: 137 EQ-ERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTL-----EQEQEAL------ 184
           E+ +R   ++     +L  E D  +  L + RRE  EL + L     E+EQ         
Sbjct: 390 ERSQREKLEV-----ELGIEKDCNRVQLSENRRELSELRSALRVLQKEKEQLQEEKQELL 444

Query: 185 --VNKLWKRMDKLEAEK 199
             +  L  R+DK   EK
Sbjct: 445 DYIRVLELRLDKEADEK 461


>gnl|CDD|218891 pfam06090, Ins_P5_2-kin, Inositol-pentakisphosphate 2-kinase.  This
           is a family of inositol-pentakisphosphate 2-kinases (EC
           2.7.1.158) (also known as inositol
           1,3,4,5,6-pentakisphosphate 2-kinase, Ins(1,3,4,5,6)P5
           2-kinase) and InsP5 2-kinase). This enzyme
           phosphorylates Ins(1,3,4,5,6)P5 to form
           Ins(1,2,3,4,5,6)P6 (also known as InsP6 or phytate).
           InsP6 is involved in many processes such as mRNA export,
           nonhomologous end-joining, endocytosis and ion channel
           regulation.
          Length = 301

 Score = 28.5 bits (64), Expect = 5.9
 Identities = 10/30 (33%), Positives = 19/30 (63%)

Query: 234 SLRSEVMRLRNQLANTQQEHTEKMERYVKD 263
            LR +++RLR    N+Q + T+++  Y+K 
Sbjct: 19  YLRGKLLRLRKLRPNSQNDSTQEIYDYIKR 48


>gnl|CDD|151277 pfam10828, DUF2570, Protein of unknown function (DUF2570).  This is
           a family of proteins with unknown function.
          Length = 110

 Score = 27.5 bits (61), Expect = 5.9
 Identities = 17/50 (34%), Positives = 29/50 (58%), Gaps = 1/50 (2%)

Query: 148 KIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEA 197
           KI+ L AE  A+   +EQ ++    L + LEQE++A V +  K  +++E 
Sbjct: 26  KIDSLRAENKAQAQTIEQQQKANQRLTDALEQERQA-VEEQQKIANEIER 74


>gnl|CDD|240227 PTZ00009, PTZ00009, heat shock 70 kDa protein; Provisional.
          Length = 653

 Score = 29.0 bits (65), Expect = 6.0
 Identities = 24/80 (30%), Positives = 38/80 (47%), Gaps = 17/80 (21%)

Query: 123 LTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQE 182
           +T +K RL +    + +R+VN+     EK +AE +A         RE+VE +N LE    
Sbjct: 505 ITNDKGRLSK---ADIDRMVNE----AEKYKAEDEA--------NRERVEAKNGLENYCY 549

Query: 183 ALVNKLW--KRMDKLEAEKR 200
           ++ N L   K   KL    +
Sbjct: 550 SMKNTLQDEKVKGKLSDSDK 569


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 28.9 bits (65), Expect = 6.2
 Identities = 53/238 (22%), Positives = 93/238 (39%), Gaps = 45/238 (18%)

Query: 77  EQEEEFISNTLMKKIQALKK--------EKETLAHHYEQEEEFLT-------NDLSR-KL 120
           E+++  +SN  + +I+AL K        EK T   + E ++E+L        N   R ++
Sbjct: 410 EEKKNNLSNEFLNRIEALDKESSSLNILEKRTRLCNDETKKEYLPKIYDPFLNGPYRGRI 469

Query: 121 NQLTEEKYRLEQTIEQEQERV-VNK----LMRKIEKLEAETDAKQNNLEQLRREKVELEN 175
            +L       E +I+   E V +NK    L+   +  E E      N + L  E  E   
Sbjct: 470 KKLFSPSIINETSIKNNIEGVWINKIHGILLNNTDYQEFEQKIDTFNKKSLSTEINESLT 529

Query: 176 TLEQEQEALVNKLWKRMDKL-----------EAEKRILQIKLDQPVSDPSSPHDMNNGDT 224
            + +      + L  +   L           E E +I +   D  ++DP       N  T
Sbjct: 530 LINKFGAEPKSSLNLKGLSLFSEPEQEKINSEEEIKIFKFLFDAVITDP-------NDQT 582

Query: 225 ASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKLQREV 282
               S  I  +  +V R   +L +      E++E   ++E+NV  ++    RK +R V
Sbjct: 583 IIKKSIGIKEISKKVPRWSYKLID----ELEQLEG--ENEENVPSDHQIRSRKAKRVV 634


>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La.  This protein is induced
           by heat shock and other stresses in E. coli, B.
           subtilis, and other species. The yeast member,
           designated PIM1, is located in the mitochondrial matrix,
           required for mitochondrial function, and also induced by
           heat shock [Protein fate, Degradation of proteins,
           peptides, and glycopeptides].
          Length = 775

 Score = 28.8 bits (65), Expect = 6.2
 Identities = 31/155 (20%), Positives = 67/155 (43%), Gaps = 20/155 (12%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENK--NLRQTSVNIQAKAEQEEEF--- 82
            ++++    L   +++ + +       L  + E  +  +    S+ ++ K E +E     
Sbjct: 127 REIKETFRELISLSKLFREQPALLS-ALEDIDEPGRLADFVAASLQLKEKDELQEVLETV 185

Query: 83  -ISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQT-------- 133
            I   L K ++ LKKE E L     + +  +T  +  K+ +   E Y  EQ         
Sbjct: 186 NIEKRLKKALELLKKELELL-----KLQNKITKKVEEKMEKTQREYYLREQLKAIKKELG 240

Query: 134 IEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRR 168
           IE++ +  + KL  K+E+L+   + K+   ++L +
Sbjct: 241 IEKDDKDELEKLKEKLEELKLPEEVKKVIEKELTK 275


>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein.  Hsp70 chaperones help to fold
           many proteins. Hsp70 assisted folding involves repeated
           cycles of substrate binding and release. Hsp70 activity
           is ATP dependent. Hsp70 proteins are made up of two
           regions: the amino terminus is the ATPase domain and the
           carboxyl terminus is the substrate binding region.
          Length = 598

 Score = 28.7 bits (65), Expect = 6.3
 Identities = 15/53 (28%), Positives = 28/53 (52%), Gaps = 5/53 (9%)

Query: 149 IEKL--EAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEK 199
           IE++  +AE  A +   ++ R+E++E +N  E+   +L   L +  DKL    
Sbjct: 506 IERMVKDAEEYAAE---DKKRKERIEAKNEAEEYVYSLEKSLKEEGDKLPEAD 555


>gnl|CDD|220716 pfam10359, Fmp27_WPPW, RNA pol II promoter Fmp27 protein domain.
           Fmp27_WPPW is a conserved domain of a family of proteins
           involved in RNA polymerase II transcription initiation.
           It contains characteristic HQR and WPPW sequence motifs.
           and is towards the C-terminal in members which contain
           Fmp27_SW pfam10305.
          Length = 470

 Score = 28.8 bits (65), Expect = 6.3
 Identities = 20/93 (21%), Positives = 43/93 (46%), Gaps = 1/93 (1%)

Query: 138 QERVVNKLMRK-IEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLE 196
             RV  +L+ + +++LE +    +  L+ L     E   +  +E E  ++ L +R++ LE
Sbjct: 158 PRRVQRELLEERLKQLEEQIKKLEEKLDDLELNDTEELQSDLEELEEELSVLKERLEFLE 217

Query: 197 AEKRILQIKLDQPVSDPSSPHDMNNGDTASNLS 229
                L+   +      S+  + +  D+ S+LS
Sbjct: 218 KLLEDLERSEESSDRSSSTDTESSIADSLSSLS 250


>gnl|CDD|185657 PTZ00479, PTZ00479, RAP Superfamily; Provisional.
          Length = 435

 Score = 28.6 bits (64), Expect = 6.4
 Identities = 24/96 (25%), Positives = 38/96 (39%), Gaps = 13/96 (13%)

Query: 48  LETYKLQLRALQEENKNLRQTSVN--IQAKAEQEEEFISNTLMKKIQALKK--EKETLAH 103
            E+ KLQ R L     N+R+      +Q   + E          K+Q ++K  E E L  
Sbjct: 349 TESVKLQHRILSNLGWNIRRVVWIDWVQLGDDTEA---------KVQYVRKLRESEPLGD 399

Query: 104 HYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQE 139
                      ++  KL ++ + K + EQ   Q QE
Sbjct: 400 TLTDGPSLSPQEIKEKLKKIKQYKIKKEQQKVQSQE 435


>gnl|CDD|130728 TIGR01667, YCCS_YHJK, integral membrane protein, YccS/YhfK family. 
           This model represents two clades of putative
           transmembrane proteins including the E. coli YccS and
           YhfK proteins. The YccS hypothetical equivalog
           (TIGR01666) is found in beta and gamma proteobacteria,
           while the smaller YhfK group is only found in E. coli,
           Salmonella and Yersinia. TMHMM on the 19 hits to this
           model shows a consensus of 11 transmembrane helices
           separated into two clusters, an N-terminal cluster of 6
           and a central cluster of 5. This would indicate two
           non-membrane domains one on each side of the membrane.
          Length = 701

 Score = 28.7 bits (64), Expect = 6.4
 Identities = 35/173 (20%), Positives = 64/173 (36%), Gaps = 18/173 (10%)

Query: 39  QQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQA---KAEQEEEFISNTLMKKIQALK 95
              ++L   LE  +  LR +  +    +   +  +     A   +  +S TL   +Q   
Sbjct: 531 LLRKMLHDALEANQRYLRLILSQYPQGKPDDLAYRIARRNAHNTDAALSTTLSNMMQ--- 587

Query: 96  KEKETLAHHYEQEEEFLTND--LSRKLNQLTEEKYRL---EQTIEQEQ--ERVVNKLMRK 148
            E    +H+ E     LT    L   ++ L   + RL   E   E  Q  E V   + R 
Sbjct: 588 -EPAFNSHYLEDGFRLLTLSHTLLSYISALGAHRERLLNPELAAELLQACEIVAKAIQRC 646

Query: 149 IEKLEAETDAKQNNLEQLRREKVELE----NTLEQEQEALVNKLWKRMDKLEA 197
             +LE + +A   +   +  ++++       TL  +Q  L+  L   M  L +
Sbjct: 647 QARLEYDEEALAGDANIMEAQEMQPHEDAAGTLVYQQLQLIIGLLPTMHTLSS 699


>gnl|CDD|220410 pfam09798, LCD1, DNA damage checkpoint protein.  This is a family
           of proteins which regulate checkpoint kinases. In
           Schizosaccharomyces pombe this protein is called Rad26
           and in Saccharomyces cerevisiae it is called LCD1.
          Length = 648

 Score = 28.6 bits (64), Expect = 6.9
 Identities = 21/119 (17%), Positives = 55/119 (46%), Gaps = 7/119 (5%)

Query: 152 LEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVS 211
           L  + D  Q   ++ R ++    N L+++ +  + KL + +  LE E++ L ++     +
Sbjct: 2   LRDKLDMLQQQKQEERNKQKSRVNELKEKHDQELQKLKQELQSLEDERKFLVLEQRGLSA 61

Query: 212 -----DPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEK 265
                + S P ++     AS+++    S  + + + + +++  ++       R +KD+K
Sbjct: 62  NDLRTELSPPSNLLKTSDASHIA--DESQPNSIKQKKREISPVKELVPLNPNRIIKDDK 118


>gnl|CDD|206687 cd01900, YchF, YchF GTPase.  YchF is a member of the Obg family,
           which includes four other subfamilies of GTPases: Obg,
           DRG, Ygr210, and NOG1. Obg is an essential gene that is
           involved in DNA replication in C. crescentus and
           Streptomyces griseus and is associated with the
           ribosome. Several members of the family, including YchF,
           possess the TGS domain related to the RNA-binding
           proteins. Experimental data and genomic analysis suggest
           that YchF may be part of a nucleoprotein complex and may
           function as a GTP-dependent translational factor.
          Length = 274

 Score = 28.2 bits (64), Expect = 7.1
 Identities = 14/62 (22%), Positives = 31/62 (50%), Gaps = 10/62 (16%)

Query: 28  EQLQKRIESLQQQNR----VLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFI 83
           E ++KR+E L+++ +      K ELE  + +++   EE K  R   +      ++E + +
Sbjct: 134 ETIEKRLERLEKKAKSGDKEAKEELELLE-KIKEHLEEGKPARTLELT-----DEEIKIL 187

Query: 84  SN 85
            +
Sbjct: 188 KS 189


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
           family is found in eukaryotes. It is a coiled-coil
           domain of unknwon function.
          Length = 126

 Score = 27.2 bits (61), Expect = 7.1
 Identities = 27/107 (25%), Positives = 53/107 (49%), Gaps = 3/107 (2%)

Query: 30  LQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMK 89
           L+KR E  + Q   L  + E ++ +   L++  + L +    +Q    + ++F+     K
Sbjct: 3   LEKRREMEEVQ-LALDAKREEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAK 61

Query: 90  KIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQ 136
           + +A KK +E      E+EEE    +L  +L +L  E  +LE+ +E+
Sbjct: 62  RRRAEKKAEEEKKLRKEKEEEI--KELKAELEELKAEIEKLEEKLEE 106


>gnl|CDD|132552 TIGR03513, GldL_gliding, gliding motility-associated protein GldL. 
           This protein family, GldL, is named for the member from
           Flavobacterium johnsoniae, which is required for a type
           of rapid gliding motility found in certain members of
           the Bacteriodetes. However, members are found also in
           several members of the Bacteriodetes that appear not to
           be motile [Cellular processes, Chemotaxis and motility].
          Length = 202

 Score = 27.9 bits (62), Expect = 7.1
 Identities = 14/84 (16%), Positives = 29/84 (34%), Gaps = 10/84 (11%)

Query: 83  ISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVV 142
            + TL     +  ++K+     Y ++   L  ++   LN +    Y  +           
Sbjct: 118 AAKTLAPMTDSYAQQKK-----YIEQMSSLAANME-GLNTI----YEAQLKGASSHADAN 167

Query: 143 NKLMRKIEKLEAETDAKQNNLEQL 166
           N++      L+ E +    NL  L
Sbjct: 168 NEIAINSSSLKEEMEKMAANLTSL 191


>gnl|CDD|205039 pfam12718, Tropomyosin_1, Tropomyosin like.  This family is a set
           of eukaryotic tropomyosins. Within the yeast Tmp1 and
           Tmp2, biochemical and sequence analyses indicate that
           Tpm2p spans four actin monomers along a filament,
           whereas Tpmlp spans five. Despite its shorter length,
           Tpm2p can compete with Tpm1p for binding to F-actin.
           Over-expression of Tpm2p in vivo alters the axial
           budding of haploids to a bipolar pattern, and this can
           be partially suppressed by co-over-expression of Tpm1p.
           This suggests distinct functions for the two
           tropomyosins, and indicates that the ratio between them
           is important for correct morphogenesis. The family also
           contains higher eukaryote Tmp3 members.
          Length = 143

 Score = 27.6 bits (62), Expect = 7.1
 Identities = 30/98 (30%), Positives = 46/98 (46%), Gaps = 5/98 (5%)

Query: 90  KIQALKKEKETLAHHYEQEEE---FLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
           K+ ALK E E      E+ EE    L  +   K  ++T  + + +Q +E+E E++  +L 
Sbjct: 1   KMNALKLEAENAQERAEELEEKLKELEQENLEKEQEITSLQKKNQQ-LEEEVEKLEEQLK 59

Query: 147 RKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEAL 184
              EKLE       N  E L R    LE  LE+ ++ L
Sbjct: 60  EAKEKLEESEKLATNA-EALTRRIQLLEEELEESEKRL 96


>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding.  This family
           is found in mammals where it is localised at cell-cell
           adherens junctions, and in Sch. pombe and other fungi
           where it anchors spindle-pole bodies to spindle
           microtubules. It is a coiled-coil structure, and in
           pombe, it is required for anchoring the minus end of
           spindle microtubules to the centrosome equivalent, the
           spindle-pole body. The name ADIP derives from the family
           being composed of Afadin- and alpha -Actinin-Binding
           Proteins Localised at Cell-Cell Adherens Junctions.
          Length = 149

 Score = 27.6 bits (62), Expect = 7.2
 Identities = 15/41 (36%), Positives = 23/41 (56%)

Query: 28  EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQT 68
           E+LQ  IE L+ Q   L+ EL   + + R L+++ K L Q 
Sbjct: 67  ERLQNTIERLKTQLEDLERELALLQAKERQLEKKLKTLEQK 107



 Score = 27.3 bits (61), Expect = 8.7
 Identities = 23/80 (28%), Positives = 38/80 (47%), Gaps = 9/80 (11%)

Query: 105 YEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLE 164
            +++ EF  + L   L +L  E  RL+ TIE+ + ++        E LE E    Q    
Sbjct: 45  RDRDLEFRES-LEETLRKLEAEIERLQNTIERLKTQL--------EDLERELALLQAKER 95

Query: 165 QLRREKVELENTLEQEQEAL 184
           QL ++   LE  L+ E+E +
Sbjct: 96  QLEKKLKTLEQKLKNEKEEV 115


>gnl|CDD|204784 pfam11932, DUF3450, Protein of unknown function (DUF3450).  This
           family of proteins are functionally uncharacterized.
           This protein is found in bacteria and eukaryotes.
           Proteins in this family are about 260 amino acids in
           length.
          Length = 250

 Score = 28.0 bits (63), Expect = 7.4
 Identities = 19/80 (23%), Positives = 34/80 (42%), Gaps = 10/80 (12%)

Query: 26  TREQLQKRIE-------SLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQ 78
              Q QK+I+        L  + R L  E+E  ++    LQ    N +Q   ++Q + EQ
Sbjct: 37  AAAQSQKKIDKWADQTQELLAEIRQLLKEIENLRVYNDQLQRLVANQQQEIASLQQQIEQ 96

Query: 79  EEEF---ISNTLMKKIQALK 95
            E+    +   + + I  L+
Sbjct: 97  IEKTRQGLVPLMYRMIDGLE 116


>gnl|CDD|235339 PRK05084, xerS, site-specific tyrosine recombinase XerS; Reviewed.
          Length = 357

 Score = 28.3 bits (64), Expect = 7.4
 Identities = 29/119 (24%), Positives = 49/119 (41%), Gaps = 29/119 (24%)

Query: 45  KVELETYK--LQLRALQEEN---KNLRQTSVN--------------IQAKAEQEEEFISN 85
           K ++E +   L+ R L   +   K   QT++N               +A+ E  E +   
Sbjct: 76  KKDVEAFILYLRERPLLNGHSTKKGNSQTTINRTLSALKSLFKYLTEEAENEDGEPYFYR 135

Query: 86  TLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNK 144
            +MKKI+  KK+KETLA      ++ L            +E Y     I+ E E+ ++ 
Sbjct: 136 NVMKKIEL-KKKKETLAARAHNLKQKLFLG---------DEDYEFLDFIDNEYEQKLSN 184


>gnl|CDD|239815 cd04320, AspRS_cyto_N, AspRS_cyto_N: N-terminal, anticodon
          recognition domain of the type found in Saccharomyces
          cerevisiae and human cytoplasmic aspartyl-tRNA
          synthetase (AspRS). This domain is a beta-barrel domain
          (OB fold) involved in binding the tRNA anticodon
          stem-loop. The enzymes in this group are homodimeric
          class2b aminoacyl-tRNA synthetases (aaRSs). aaRSs
          catalyze the specific attachment of amino acids (AAs)
          to their cognate tRNAs during protein biosynthesis.
          This 2-step reaction involves i) the activation of the
          AA by ATP in the presence of magnesium ions, followed
          by ii) the transfer of  the activated AA to the
          terminal ribose of tRNA.  In the case of the class2b
          aaRSs, the activated AA is attached to the 3'OH of the
          terminal ribose. Eukaryotes contain 2 sets of aaRSs,
          both of which are encoded by the nuclear genome. One
          set concerns with cytoplasmic protein synthesis,
          whereas the other exclusively with mitochondrial
          protein synthesis.
          Length = 102

 Score = 26.8 bits (60), Expect = 7.6
 Identities = 11/33 (33%), Positives = 15/33 (45%)

Query: 65 LRQTSVNIQAKAEQEEEFISNTLMKKIQALKKE 97
          LRQ    IQ       E +S  ++K   +L KE
Sbjct: 23 LRQQGYTIQGVLAASAEGVSKQMVKWAGSLSKE 55


>gnl|CDD|149105 pfam07851, TMPIT, TMPIT-like protein.  A number of members of
          this family are annotated as being transmembrane
          proteins induced by tumour necrosis factor alpha, but
          no literature was found to support this.
          Length = 330

 Score = 28.2 bits (63), Expect = 7.8
 Identities = 16/50 (32%), Positives = 31/50 (62%), Gaps = 4/50 (8%)

Query: 28 EQLQKRIESLQQQNRVLKVELE-TYKLQLR---ALQEENKNLRQTSVNIQ 73
          ++L+K  + LQ+ +R+ K +LE   KLQ +   A+  + K L+Q  V+++
Sbjct: 7  KELEKEFQQLQETHRLYKQKLEEVEKLQEQCTSAIARQRKRLKQLIVSLK 56


>gnl|CDD|221489 pfam12252, SidE, Dot/Icm substrate protein.  This family of proteins
            is found in bacteria. Proteins in this family are
            typically between 397 and 1543 amino acids in length.
            This family is the SidE protein in the Dot/Icm pathway of
            Legionella pneumophila bacteria. There is little
            literature describing the family.
          Length = 1443

 Score = 28.7 bits (64), Expect = 8.0
 Identities = 51/253 (20%), Positives = 105/253 (41%), Gaps = 51/253 (20%)

Query: 76   AEQEEEFISNTLMKKIQALKKEKETL-----------AHHYEQEEEFLTNDLSRKLNQLT 124
            +E+E   IS  L ++IQ++++E E+L                +  + L  + S+++ +L 
Sbjct: 1061 SEKEMGTISRELREQIQSVRQELESLQRAVVTPVVTDEEKVRERYDALIENTSKRITEL- 1119

Query: 125  EEKYRLEQ------------TIEQEQERVVNKLMR-----------KIEKLEAETDAKQN 161
             EK +L               ++QE   + N+ +R            IEKLE +      
Sbjct: 1120 -EKAKLANLDVVKKAISNLNNLQQEVTLLRNEKIRMHTGTDKVDFSDIEKLEKQLQVIDT 1178

Query: 162  NL-----EQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSP 216
             L      ++ ++   LE    + Q  L +K+ K  D     + +   ++ +  S    P
Sbjct: 1179 KLADAYLLEVTKQISALEKEKPKNQSELKSKIAKFFDTTADIEVLRNERIKKHGSS-KDP 1237

Query: 217  HDMNNGDTAS-NLSTHIHSLRSEVMR-LRNQLANTQQEHTEKMERYVKDEKNVREENLRL 274
             D+++ D  S NL     SL S ++  +R+ L   + +  E  E+ +       ++N  L
Sbjct: 1238 LDLSDLDKLSGNLQGVNQSLVSILITTIRSSLNQMKPKTFEMQEKEI-------QQNFEL 1290

Query: 275  QRKLQREVERREA 287
              KL++ +++ + 
Sbjct: 1291 LAKLEKTLDKSDT 1303


>gnl|CDD|224381 COG1464, NlpA, ABC-type metal ion transport system, periplasmic
           component/surface antigen [Inorganic ion transport and
           metabolism].
          Length = 268

 Score = 28.0 bits (63), Expect = 8.1
 Identities = 19/84 (22%), Positives = 35/84 (41%), Gaps = 2/84 (2%)

Query: 187 KLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDM-NNGDTASNLSTHIHSLRSEVMRLRNQ 245
           ++ + + K   +K+ L +K+    +D   P++   +GD  +N   H   L         +
Sbjct: 42  EILEVVVKPALKKKGLDLKI-VEFTDYVQPNEALADGDIDANAFQHKPYLDQFNKEHGGK 100

Query: 246 LANTQQEHTEKMERYVKDEKNVRE 269
           L      H E M  Y K  K++ E
Sbjct: 101 LVAVGNTHIEPMGLYSKKYKSLAE 124


>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein.  This model
           represents a relatively rare but broadly distributed
           uncharacterized protein family, distributed in 1-2
           percent of bacterial genomes, all of which have outer
           membranes. In many of these genomes, it is part of a
           two-gene pair.
          Length = 555

 Score = 28.5 bits (64), Expect = 8.1
 Identities = 17/112 (15%), Positives = 44/112 (39%), Gaps = 23/112 (20%)

Query: 22  PSPVTREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEE 81
           P+    E+ +KR+E++++++      ++      +  +E      +     +A  ++ + 
Sbjct: 188 PNKQDLEEYKKRLEAIKKKD------IKNPLELQKIKEE----FDKLKKEGKADKQKIKS 237

Query: 82  FISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQT 133
                       L+ +K+ L     + ++   NDL R      E KY ++  
Sbjct: 238 A--------KNDLQNDKKQLKADLAELKKAPQNDLKR-----LENKYAIKSG 276


>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
           protein 2.  THO and TREX form a eukaryotic complex which
           functions in messenger ribonucleoprotein metabolism and
           plays a role in preventing the transcription-associated
           genetic instability. Tho2, along with four other
           subunits forms THO.
          Length = 296

 Score = 28.1 bits (63), Expect = 8.1
 Identities = 17/73 (23%), Positives = 36/73 (49%), Gaps = 5/73 (6%)

Query: 46  VELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHY 105
           V  E Y  ++  L+++ K L  +S  I  K ++++      L   I+ L++E +    H 
Sbjct: 24  VPKELYDKEIERLEKQIKELDSSSSGIDKKKKEKKR-----LKSLIKKLEEELKKHIEHN 78

Query: 106 EQEEEFLTNDLSR 118
           E+ ++ L+ + S 
Sbjct: 79  EKTKKRLSEEKSS 91


>gnl|CDD|193553 cd05678, M20_dipept_like_1, Uncharacterized M20 Dipeptidases.
           Peptidase M20 family, unknown dipeptidase-like subfamily
           1 (inferred by homology to be dipeptidases). M20
           dipeptidases include a large variety of bacterial
           enzymes including cytosolic nonspecific dipeptidase
           (CNDP), Xaa-methyl-His dipeptidase (anserinase),and
           canosinase. These dipeptidases have been shown to act on
           a wide range of dipeptides, but not larger peptides. For
           example, anserinase mainly catalyzes the hydrolysis of
           N-alpha-acetylhistidine while carnosinase degrades
           beta-alanyl-L-histidine.
          Length = 466

 Score = 28.4 bits (64), Expect = 8.3
 Identities = 16/48 (33%), Positives = 22/48 (45%), Gaps = 6/48 (12%)

Query: 55  LRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLA 102
           L AL+         +VNI+   + EEE  S  L    Q +K+ KE L 
Sbjct: 135 LDALKAAGIPP---AVNIKVILDSEEEKGSPHLP---QVVKEYKELLK 176


>gnl|CDD|205692 pfam13514, AAA_27, AAA domain.  This domain is found in a number of
           double-strand DNA break proteins. This domain contains a
           P-loop motif.
          Length = 1118

 Score = 28.2 bits (63), Expect = 8.6
 Identities = 28/123 (22%), Positives = 48/123 (39%), Gaps = 20/123 (16%)

Query: 106 EQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVN--KLMRKIEKLEAETDAKQNNL 163
           E +E +       ++N   +E   LE  I + Q +      L++ +++ E E    +  L
Sbjct: 138 EADELWKPRGRKPEINVALKELKELEAEIREVQLKTRTWKDLVKALDEAEEELANLRKEL 197

Query: 164 EQLRREKVELE------------NTLEQEQEALVN------KLWKRMDKLEAEKRILQIK 205
            QL +EK  LE              LEQ+  AL           +R ++  AE R  +  
Sbjct: 198 RQLEKEKQRLERLRRLLPLLAERKALEQQLAALGEVIDLPPDAVERYEEARAELRAARRN 257

Query: 206 LDQ 208
           L+ 
Sbjct: 258 LEL 260


>gnl|CDD|237551 PRK13902, alaS, alanyl-tRNA synthetase; Provisional.
          Length = 900

 Score = 28.3 bits (64), Expect = 8.8
 Identities = 18/78 (23%), Positives = 30/78 (38%)

Query: 25  VTREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFIS 84
           V  EQL K +E   ++ +  K E+E  + +L  L       +   V       +E +   
Sbjct: 754 VPPEQLPKTVERFFEEWKEQKKEIEKLRKELAELLASELLSKAEEVGGVKVVVEEVDADE 813

Query: 85  NTLMKKIQALKKEKETLA 102
           + L K    L KE +   
Sbjct: 814 DELRKIANELAKEGKVAL 831


>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
          Length = 425

 Score = 28.1 bits (64), Expect = 8.9
 Identities = 20/81 (24%), Positives = 35/81 (43%), Gaps = 20/81 (24%)

Query: 86  TLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKL 145
            L  +++ L+ E+               N LS+++ Q   +    E  I +     V +L
Sbjct: 39  ELQTELEELQAER---------------NALSKEIGQAKRKGEDAEALIAE-----VKEL 78

Query: 146 MRKIEKLEAETDAKQNNLEQL 166
             +I+ LEAE D  +  LE+L
Sbjct: 79  KEEIKALEAELDELEAELEEL 99


>gnl|CDD|236583 PRK09601, PRK09601, GTP-binding protein YchF; Reviewed.
          Length = 364

 Score = 28.1 bits (64), Expect = 9.0
 Identities = 16/62 (25%), Positives = 32/62 (51%), Gaps = 10/62 (16%)

Query: 28  EQLQKRIESLQQQ----NRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFI 83
           E ++KR+E L+++    ++  K ELE  + +L    EE K  R   +      ++EE+ +
Sbjct: 138 ETVEKRLERLEKKAKGGDKEAKAELELLE-KLLEHLEEGKPARTLELT-----DEEEKLL 191

Query: 84  SN 85
            +
Sbjct: 192 KS 193


>gnl|CDD|224705 COG1792, MreC, Cell shape-determining protein [Cell envelope
           biogenesis, outer membrane].
          Length = 284

 Score = 28.1 bits (63), Expect = 9.1
 Identities = 12/37 (32%), Positives = 21/37 (56%), Gaps = 3/37 (8%)

Query: 35  ESLQQQNRVLKVELETYK---LQLRALQEENKNLRQT 68
           + L  +N  LK EL   +    ++ +L+EENK L++ 
Sbjct: 69  KDLALENEELKKELAELEQLLEEVESLEEENKRLKEL 105


>gnl|CDD|222160 pfam13476, AAA_23, AAA domain. 
          Length = 204

 Score = 27.5 bits (61), Expect = 9.2
 Identities = 24/114 (21%), Positives = 50/114 (43%), Gaps = 7/114 (6%)

Query: 56  RALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTND 115
           R L ++     + S+    + ++ +EFI   L    + L      L +  ++  E L   
Sbjct: 96  RELTKKKGKKVKKSILEIVEIDELQEFIDELLKSDKEIL----PLLLYLGQERLEELKFK 151

Query: 116 LSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRRE 169
              K  +L  E+   E    ++++ ++ KL+ + EK + E +  +  LE+L  E
Sbjct: 152 RKEKKERL--EELEKELEELEDEKDLLEKLLEEKEKKK-ELEELKEELEELLEE 202


>gnl|CDD|220404 pfam09789, DUF2353, Uncharacterized coiled-coil protein (DUF2353). 
           Members of this family of uncharacterized proteins have
           no known function.
          Length = 319

 Score = 27.9 bits (62), Expect = 9.2
 Identities = 40/179 (22%), Positives = 80/179 (44%), Gaps = 18/179 (10%)

Query: 27  REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAE----QEEEF 82
           R+ L + +E  +++N+ L  +++  + +   LQ + K LRQT    + +      +E   
Sbjct: 67  RQDLSQLLEKYREENQQLSTDVQELRQRKAELQGDIKLLRQTIAQQRVEFMGGGVRESFE 126

Query: 83  ISNTLMKKIQALKKEKETLAH-----HYEQEEEFLTND-LSRKLNQLTEEKYRLEQTIEQ 136
             N L+ +++ L+++ + L         E+EE  +  D   RK ++L  E   L   +  
Sbjct: 127 ERNDLVSQLEKLREKCKQLESDLRSLLDEKEEVLMERDEYQRKAHRLNNE---LSYLLGG 183

Query: 137 EQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKL 195
           +  RVV+     ++ L  E    +  +  L  E   ++ TL + ++ L  K  K M K 
Sbjct: 184 DPRRVVD-----VDALVMENRYLKEKINTLEEESELIKMTLAKYKQMLEAKNKKTMVKA 237


>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA.  During
           the bacterial cell cycle, the tubulin-like cell-division
           protein FtsZ polymerises into a ring structure that
           establishes the location of the nascent division site.
           EzrA modulates the frequency and position of FtsZ ring
           formation.
          Length = 559

 Score = 28.3 bits (64), Expect = 9.4
 Identities = 42/220 (19%), Positives = 82/220 (37%), Gaps = 63/220 (28%)

Query: 114 NDLSRKLNQLTEEKYRLEQT-IEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVE 172
            +L     ++TEE Y  +   IE+E           ++ L+ + D     LE+L  ++ E
Sbjct: 229 EELKAGYREMTEEGYHFDHLDIEKE-----------LQDLKEQIDQNLALLEELDLDEAE 277

Query: 173 LEN------------TLEQEQEA------LVNKLWKRMDKLEAEKRILQIKLDQPVSDPS 214
            EN             LE+E +A       ++KL   ++    + + L ++LD+      
Sbjct: 278 EENEEIEERIDTLYDILEKEVKAKKFVEKNIDKLTDFLEHAREQNKQLLLELDR----LQ 333

Query: 215 SPHDMNNGD--TASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERY------------ 260
             + +N  +  T   L   +  L ++  +L  ++A  +  ++E  E              
Sbjct: 334 QSYTLNEDELETVRELEKQLEELEAQYDQLVERIAEKKVAYSELQEELEEIEKQLEEIEK 393

Query: 261 ------------VKDEKNVREENLRLQRKLQ---REVERR 285
                        KDE   RE+    ++KL    R +E+ 
Sbjct: 394 EQEELSESLQGLRKDELEAREKLQEYRQKLHEIKRYLEKS 433


>gnl|CDD|233255 TIGR01061, parC_Gpos, DNA topoisomerase IV, A subunit,
           Gram-positive.  Operationally, topoisomerase IV is a
           type II topoisomerase required for the decatenation of
           chromosome segregation. Not every bacterium has both a
           topo II and a topo IV. The topo IV families of the
           Gram-positive bacteria and the Gram-negative bacteria
           appear not to represent a single clade among the type II
           topoisomerases, and are represented by separate models
           for this reason [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 738

 Score = 28.2 bits (63), Expect = 9.8
 Identities = 26/143 (18%), Positives = 62/143 (43%), Gaps = 6/143 (4%)

Query: 23  SPVTREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEF 82
           S    E+  KR+E ++   + + +  E  KL +R+ ++++        N +    Q E  
Sbjct: 358 SKYELEKASKRLEIVEGLIKAISIIDEIIKL-IRSSEDKSDAKENLIDNFKFTENQAEAI 416

Query: 83  ISNTLMK----KIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQ 138
           +S  L +     I  LK+E+  L       E+ + ++ +R    L ++    ++   Q++
Sbjct: 417 VSLRLYRLTNTDIFELKEEQNELEKKIISLEQIIASEKARN-KLLKKQLEEYKKQFAQQR 475

Query: 139 ERVVNKLMRKIEKLEAETDAKQN 161
              +   + +I+  E+E    ++
Sbjct: 476 RSQIEDFINQIKINESELIENED 498


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.308    0.123    0.326 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 19,350,691
Number of extensions: 1893153
Number of successful extensions: 6266
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4692
Number of HSP's successfully gapped: 1197
Length of query: 393
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 294
Effective length of database: 6,546,556
Effective search space: 1924687464
Effective search space used: 1924687464
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 60 (27.0 bits)