RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy7684
         (332 letters)



>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 57.8 bits (140), Expect = 4e-09
 Identities = 44/225 (19%), Positives = 82/225 (36%), Gaps = 10/225 (4%)

Query: 8   LRNRPYPSSSTLKAADLLTEGVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQH 67
           LR      S  + A       +E  + +LE+   +   ++      I   E+RL   ++ 
Sbjct: 717 LRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEE 776

Query: 68  EARH-QQVAAEHERLRRLRDRVEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALF 126
            A    ++     ++ +L++ ++        L  LR ++        +L   LES+    
Sbjct: 777 LAEAEAEIEELEAQIEQLKEELKALR---EALDELRAELTLLNEEAANLRERLESLERRI 833

Query: 127 NEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNV 186
              E+ L     ++EEL+  +E L            L   +E L  EL            
Sbjct: 834 AATERRLEDLEEQIEELSEDIESL------AAEIEELEELIEELESELEALLNERASLEE 887

Query: 187 RLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQLN 231
            L   R  L +   E+  ++ + SEL+  L+  R    QL  +L 
Sbjct: 888 ALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLE 932



 Score = 54.3 bits (131), Expect = 5e-08
 Identities = 58/249 (23%), Positives = 108/249 (43%), Gaps = 21/249 (8%)

Query: 16  SSTLKAADLLTEGVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVA 75
           ++ L+  +   E + L + ELE+     Q ++ +    I+  EQ+ + L++   R   + 
Sbjct: 259 TAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRE---RLANLE 315

Query: 76  AEHERLRRLRDRVEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCM 135
            + E L    + +E +  +L +  A   +  +       L  +LES+ A   E E EL  
Sbjct: 316 RQLEELEAQLEELESKLDELAEELAELEEKLE------ELKEELESLEAELEELEAELEE 369

Query: 136 AVAKVEELTHQLEDLKRGRT-HQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRET 194
             +++EEL  QLE L+      +     L+ E+E L   L    +  E +  RL Q+ E 
Sbjct: 370 LESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARL----ERLEDRRERLQQEIEE 425

Query: 195 LTKKQ--AEVNSVDLRISELQERL----QRKRLLNQQLASQLNSNINSSQHQLPPGKQQL 248
           L KK   AE+  +   + EL+E L    +    L + L       +  ++  L   +++L
Sbjct: 426 LLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREE-LEEAEQALDAAEREL 484

Query: 249 VAQNSQLTS 257
               ++L S
Sbjct: 485 AQLQARLDS 493



 Score = 45.0 bits (107), Expect = 4e-05
 Identities = 47/204 (23%), Positives = 83/204 (40%), Gaps = 11/204 (5%)

Query: 16  SSTLKAADLLTEGVELTLGELEDMAVRQQAQIDSQRQLIA-AKEQRLRFLKQHEARHQQV 74
           S  L   +   E +E  L E E+     +A+I+     I   KE+     +  +    ++
Sbjct: 753 SKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAEL 812

Query: 75  AAEHERLRRLRDRVEQQE----LKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKE 130
              +E    LR+R+E  E       R+L  L  Q+++   +  SL++++E +  L  E E
Sbjct: 813 TLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELE 872

Query: 131 KELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQ 190
            EL    A + E     E L   R+       LS EL  L  +     +  E+   +L Q
Sbjct: 873 SEL---EALLNERASLEEALALLRSELEE---LSEELRELESKRSELRRELEELREKLAQ 926

Query: 191 QRETLTKKQAEVNSVDLRISELQE 214
               L   +  ++++  R+SE   
Sbjct: 927 LELRLEGLEVRIDNLQERLSEEYS 950



 Score = 43.9 bits (104), Expect = 1e-04
 Identities = 43/205 (20%), Positives = 85/205 (41%), Gaps = 24/205 (11%)

Query: 29  VELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEAR-HQQVAAEHERLRRLRDR 87
           +E  + ELE+     +  +   R+ +   E+ L  L++      +Q++A  + L RL   
Sbjct: 682 LEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAE 741

Query: 88  VEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQL 147
           VEQ E ++ +L     +++        L   LE       E E E+    A++E+L  +L
Sbjct: 742 VEQLEERIAQLSKELTELEAEIE---ELEERLEEAEEELAEAEAEIEELEAQIEQLKEEL 798

Query: 148 EDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDL 207
           + L+               L+ LR EL          N      RE L   +  + + + 
Sbjct: 799 KALRE-------------ALDELRAEL-------TLLNEEAANLRERLESLERRIAATER 838

Query: 208 RISELQERLQRKRLLNQQLASQLNS 232
           R+ +L+E+++      + LA+++  
Sbjct: 839 RLEDLEEQIEELSEDIESLAAEIEE 863



 Score = 41.6 bits (98), Expect = 5e-04
 Identities = 31/160 (19%), Positives = 66/160 (41%), Gaps = 10/160 (6%)

Query: 80  RLRRLRDRVEQQELKLRKLR----ALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCM 135
            +  L +++E+ E K+ +L      LR ++++ +     L  +LE +    +   K+L  
Sbjct: 678 EIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLAR 737

Query: 136 AVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETL 195
             A+VE+L  ++  L +  T       L AE+E L   L    +   +    + +    +
Sbjct: 738 LEAEVEQLEERIAQLSKELT------ELEAEIEELEERLEEAEEELAEAEAEIEELEAQI 791

Query: 196 TKKQAEVNSVDLRISELQERLQRKRLLNQQLASQLNSNIN 235
            + + E+ ++   + EL+  L         L  +L S   
Sbjct: 792 EQLKEELKALREALDELRAELTLLNEEAANLRERLESLER 831



 Score = 30.0 bits (68), Expect = 2.0
 Identities = 33/150 (22%), Positives = 65/150 (43%), Gaps = 3/150 (2%)

Query: 40  AVRQQAQIDSQRQLIAAKEQRLRFLKQH-EARHQQVAAEHERLRRLRDRVEQQELKLRKL 98
              + AQ++ Q   +  + +RL    +  E R +++  E E L +  +  E +EL+  +L
Sbjct: 384 LRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQ-AEL 442

Query: 99  RALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQP 158
             L  ++++ +     L   LE +R    E E+ L  A  ++ +L  +L+ L+R + +  
Sbjct: 443 EELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLE 502

Query: 159 SHPHLSAELENLRREL-MYHNKLSEQQNVR 187
                   L   +  L      LSE  +V 
Sbjct: 503 GFSEGVKALLKNQSGLSGILGVLSELISVD 532


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 45.1 bits (107), Expect = 3e-05
 Identities = 36/176 (20%), Positives = 82/176 (46%), Gaps = 8/176 (4%)

Query: 45  AQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAE----HERLRRLRDRVEQQELKLRKLRA 100
            +I+   + I   E     +K+ E   ++V  E       L  LR+ +E+ E ++++L  
Sbjct: 176 RRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEE 235

Query: 101 LRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSH 160
           L+ ++++ +    SL      +     E E+ +     ++EEL  ++++LK  +     +
Sbjct: 236 LKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEY 295

Query: 161 PHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERL 216
             LS   E    EL    ++ E++  RL ++   + ++  E+   + R+ EL+++L
Sbjct: 296 IKLSEFYEEYLDEL---REI-EKRLSRLEEEINGIEERIKELEEKEERLEELKKKL 347



 Score = 36.6 bits (85), Expect = 0.019
 Identities = 45/220 (20%), Positives = 88/220 (40%), Gaps = 35/220 (15%)

Query: 28  GVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDR 87
           G ELT    +++     A++    + +   E++ R L++     ++V  +   L +L++ 
Sbjct: 442 GRELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKEL 501

Query: 88  VEQ---------------QELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEK--- 129
            EQ                E K  +   L+ ++ + K    SL  +LE +  L  +    
Sbjct: 502 AEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAEL 561

Query: 130 EKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNV--- 186
           EK+L     ++ EL  +LE+L        S   L   L+ L     ++N+  E ++    
Sbjct: 562 EKKLDELEEELAELLKELEELGFE-----SVEELEERLKELEP---FYNEYLELKDAEKE 613

Query: 187 ------RLHQQRETLTKKQAEVNSVDLRISELQERLQRKR 220
                  L +  E L K   E+   + R+ EL++ L+   
Sbjct: 614 LEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELE 653



 Score = 33.1 bits (76), Expect = 0.23
 Identities = 37/187 (19%), Positives = 81/187 (43%), Gaps = 21/187 (11%)

Query: 51  RQLIAAKEQRLR-FLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALRGQVDQNK 109
            + I  + +RL  F+K+ E   + +  + + L  +   + +   +L +LR    ++++  
Sbjct: 171 IKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEV 230

Query: 110 MNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELEN 169
                   +LE ++    E EKEL        +L  ++ +L+              ELE 
Sbjct: 231 -------KELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKE---IEELEE 280

Query: 170 LRREL----------MYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRK 219
             +EL          +  ++  E+    L +  + L++ + E+N ++ RI EL+E+ +R 
Sbjct: 281 KVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERL 340

Query: 220 RLLNQQL 226
             L ++L
Sbjct: 341 EELKKKL 347



 Score = 32.7 bits (75), Expect = 0.29
 Identities = 36/197 (18%), Positives = 80/197 (40%), Gaps = 15/197 (7%)

Query: 27  EGVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRD 86
           E +E  L  LE    + + +I    + I   ++ +  L++     +++  + E   +L +
Sbjct: 241 EELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSE 300

Query: 87  RVEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNE---KEKELCMAVAKVEEL 143
             E+   +LR++     ++++          +LE       E   K KEL   + ++EE 
Sbjct: 301 FYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEER 360

Query: 144 THQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVN 203
               E+ K              ELE L++ L       E+    L +  +   + + E++
Sbjct: 361 HELYEEAKA----------KKEELERLKKRLTGLTP--EKLEKELEELEKAKEEIEEEIS 408

Query: 204 SVDLRISELQERLQRKR 220
            +  RI EL++ ++  +
Sbjct: 409 KITARIGELKKEIKELK 425



 Score = 28.5 bits (64), Expect = 6.4
 Identities = 23/95 (24%), Positives = 45/95 (47%), Gaps = 1/95 (1%)

Query: 119 LESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHN 178
           L+ +    ++  +EL     ++EEL  +LE+L++ +  +  +  L  E   L REL    
Sbjct: 621 LKKLEEELDKAFEELAETEKRLEELRKELEELEK-KYSEEEYEELREEYLELSRELAGLR 679

Query: 179 KLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQ 213
              E+   R  + ++TL K + E+   +    EL+
Sbjct: 680 AELEELEKRREEIKKTLEKLKEELEEREKAKKELE 714


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 44.7 bits (106), Expect = 5e-05
 Identities = 46/225 (20%), Positives = 94/225 (41%), Gaps = 26/225 (11%)

Query: 33  LGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLK-QHEARHQQVAAEHERLRRLRDRVEQQ 91
           L EL +     Q ++   ++ I   E  +  L+ + E    ++    ERL  L++++E  
Sbjct: 276 LEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEAL 335

Query: 92  ELKLRKLRALRGQVDQNKMNNISLSSDLESIR-ALFNEKEKELCMAVAKVEELTHQLEDL 150
           + +L +   L  +++Q          +LE    AL  E E+       ++ EL  +L ++
Sbjct: 336 KEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEI 395

Query: 151 KRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRIS 210
           +              ELE L+RE+     L E+   RL ++ E L ++  E+ +    + 
Sbjct: 396 RN-------------ELEELKREI---ESLEERLE-RLSERLEDLKEELKELEA---ELE 435

Query: 211 ELQERLQRKRLLNQQLASQLNSNINSSQHQLPPGKQQLVAQNSQL 255
           ELQ  L+       +   +L   +   + +L   +++L     +L
Sbjct: 436 ELQTELEELN----EELEELEEQLEELRDRLKELERELAELQEEL 476



 Score = 43.2 bits (102), Expect = 2e-04
 Identities = 40/212 (18%), Positives = 90/212 (42%), Gaps = 12/212 (5%)

Query: 50  QRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALRGQVDQNK 109
           + +L   + Q  +  ++ ++   ++ +  + L  LR ++E+ E   R+L  L+ ++   +
Sbjct: 673 EEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELE---RQLEELKRELAALE 729

Query: 110 MNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELEN 169
                L S LE +     E E+EL     ++EEL  +LE L+           L  E+E 
Sbjct: 730 EELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALA------KLKEEIEE 783

Query: 170 LRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQ 229
           L  +     +  E+    L +    L   + E+ S++ R   L++ ++      ++L  +
Sbjct: 784 LEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEK 843

Query: 230 LNS---NINSSQHQLPPGKQQLVAQNSQLTSA 258
           L+     +   + +L   K++L    ++    
Sbjct: 844 LDELEEELEELEKELEELKEELEELEAEKEEL 875



 Score = 42.8 bits (101), Expect = 2e-04
 Identities = 48/227 (21%), Positives = 101/227 (44%), Gaps = 5/227 (2%)

Query: 16  SSTLKAADLLTEGVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVA 75
              LK+       +E  L EL       + Q++  ++ +AA E+ L   +Q ++R +++ 
Sbjct: 687 EEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEEL---EQLQSRLEELE 743

Query: 76  AEHERLRRLRDRVEQQELKLR-KLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELC 134
            E E L    + ++++  +L  +L +L   + + K     L    ++++    E E+EL 
Sbjct: 744 EELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELE 803

Query: 135 MAVAKVEELTHQLEDLK-RGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRE 193
            A  +++ L  +LE L+ R    +     L  E+E L  +L    +  E+    L + +E
Sbjct: 804 EAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKE 863

Query: 194 TLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQLNSNINSSQHQ 240
            L + +AE   ++  + EL+E  +      ++L S+L       +  
Sbjct: 864 ELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKL 910



 Score = 42.4 bits (100), Expect = 3e-04
 Identities = 50/237 (21%), Positives = 104/237 (43%), Gaps = 15/237 (6%)

Query: 27  EGVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLK-QHEARHQQVAAEHERLRRLR 85
             +E  L +LE+     + ++ S   L+    ++L  L+ Q E   +++AA  E L +L+
Sbjct: 677 AELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQ 736

Query: 86  DRVEQQELKLRKLRA----LRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVE 141
            R+E+ E +L +L      L+ ++++ +    SL   L  ++    E E++      ++E
Sbjct: 737 SRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELE 796

Query: 142 ELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAE 201
           EL  +LE+ +R          L  ELE+L +         EQ+   L ++ E L +K  E
Sbjct: 797 ELEEELEEAERRLDA------LERELESLEQRRERL----EQEIEELEEEIEELEEKLDE 846

Query: 202 VNSVDLRISELQERLQRKRLLNQQLASQLNSNINSSQHQLPPGKQQLVAQNSQLTSA 258
           +      + +  E L+ +    +    +L   +   + +    +++L    S+L   
Sbjct: 847 LEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAEL 903



 Score = 41.2 bits (97), Expect = 6e-04
 Identities = 43/196 (21%), Positives = 81/196 (41%), Gaps = 10/196 (5%)

Query: 30  ELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVE 89
           +  + ELE+     Q +++   + +   E+RL  L++     +Q     E+   + +  E
Sbjct: 778 KEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQ--EIEELEE 835

Query: 90  QQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLED 149
           + E    KL  L  ++++ +     L  +LE + A   E E EL     + EEL  +L +
Sbjct: 836 EIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRE 895

Query: 150 LKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTK--KQAEVNSVDL 207
           L+           L  E+E LR  L       E+  V L +  E L +  +      ++ 
Sbjct: 896 LESELA------ELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELER 949

Query: 208 RISELQERLQRKRLLN 223
            I  L+E ++    +N
Sbjct: 950 EIERLEEEIEALGPVN 965



 Score = 38.9 bits (91), Expect = 0.003
 Identities = 40/212 (18%), Positives = 82/212 (38%), Gaps = 32/212 (15%)

Query: 27  EGVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQ--------VAAEH 78
           E +E  L EL++     + +++ +  L+   EQ L  L++ +   ++        +    
Sbjct: 319 EELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELF 378

Query: 79  ERLR-RLRDRVEQQELKLRKLRALRGQVDQNKM--------------NNISLSSDLESIR 123
           E LR  L +   +      +L  L+ +++  +                   L ++LE ++
Sbjct: 379 EALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQ 438

Query: 124 ALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKL--- 180
               E  +EL     ++EEL  +L++L+R          L  EL+ L +EL         
Sbjct: 439 TELEELNEELEELEEQLEELRDRLKELER------ELAELQEELQRLEKELSSLEARLDR 492

Query: 181 SEQQNVRLHQQRETLTKKQAEVNSVDLRISEL 212
            E +       R  L   ++ +  V   ++EL
Sbjct: 493 LEAEQRASQGVRAVLEALESGLPGVYGPVAEL 524



 Score = 37.4 bits (87), Expect = 0.009
 Identities = 41/229 (17%), Positives = 85/229 (37%), Gaps = 19/229 (8%)

Query: 50  QRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALRGQVDQNK 109
             +L   +++     ++     +++    E L      +E+ + +L +LR    ++ +  
Sbjct: 231 LAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEEL 290

Query: 110 MNNISLSSDLESIRALFNEKEKELC----MAVAKVEELTHQLEDLKRGRTHQPS------ 159
           +       +LE   +L  E+ +EL         ++EEL  ++E LK     + +      
Sbjct: 291 LELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELE 350

Query: 160 --HPHLSAELENLRRELMYHNK----LSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQ 213
                L    E L  +L    +    L E     L +    L + + E+  +   I  L+
Sbjct: 351 QLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLE 410

Query: 214 ERLQRKRLLNQQLASQ---LNSNINSSQHQLPPGKQQLVAQNSQLTSAH 259
           ERL+R     + L  +   L + +   Q +L    ++L     QL    
Sbjct: 411 ERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELR 459



 Score = 35.5 bits (82), Expect = 0.037
 Identities = 35/144 (24%), Positives = 64/144 (44%), Gaps = 10/144 (6%)

Query: 115 LSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRREL 174
           L  +L  + A   + E+EL     ++  L   LE+L+R          L  +LE L+REL
Sbjct: 672 LEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRR------QLEELERQLEELKREL 725

Query: 175 MYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQLNSNI 234
               +  EQ   RL +  E L + + E+  +  R+ EL+E L+      ++  ++L   I
Sbjct: 726 AALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESL----EEALAKLKEEI 781

Query: 235 NSSQHQLPPGKQQLVAQNSQLTSA 258
              + +    +++L     +L  A
Sbjct: 782 EELEEKRQALQEELEELEEELEEA 805


>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
           protein, PEP-CTERM locus subfamily.  Members of this
           protein family belong to the family of polysaccharide
           chain length determinant proteins (pfam02706). All are
           found in species that encode the PEP-CTERM/exosortase
           system predicted to act in protein sorting in a number
           of Gram-negative bacteria, and are found near the epsH
           homolog that is the putative exosortase gene [Cell
           envelope, Biosynthesis and degradation of surface
           polysaccharides and lipopolysaccharides].
          Length = 498

 Score = 44.3 bits (105), Expect = 5e-05
 Identities = 52/219 (23%), Positives = 90/219 (41%), Gaps = 27/219 (12%)

Query: 24  LLTEGVELTLGEL-EDMAVRQQ---AQIDSQRQLIAAKEQRLRFLKQH--EARHQQVAAE 77
           LLT  VE TLG   +D    Q+    QI +  + + A E RL+  KQ        Q    
Sbjct: 143 LLTIFVEETLGSKRQDSDSAQRFIDEQIKTYEKKLEAAENRLKAFKQENGGILPDQEGDY 202

Query: 78  HERLRRLRDRVEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAV 137
           +  +   ++ +E   L+L +  A R  + +       L  +   + A  +    EL    
Sbjct: 203 YSEISEAQEELEAARLELNEAIAQRDALKRQ------LGGEEPVLLAGSSVANSELDG-- 254

Query: 138 AKVEELTHQLEDLKRGRTHQPSHPHLSA---ELENLRRELMYHNKL-------SEQQNVR 187
            ++E L  QL+ L+   T +  HP + A   E+  L  +               E  N  
Sbjct: 255 -RIEALEKQLDALRLRYTDK--HPDVIATKREIAQLEEQKEEEGSAKNGGPERGEIANPV 311

Query: 188 LHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQL 226
             Q +  L + +AE+ S++ R++EL  R++R   L + +
Sbjct: 312 YQQLQIELAEAEAEIASLEARVAELTARIERLESLLRTI 350


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 43.1 bits (102), Expect = 2e-04
 Identities = 49/207 (23%), Positives = 85/207 (41%), Gaps = 22/207 (10%)

Query: 30  ELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVE 89
           E  L ELE   V +   I+    +I  K Q+L  L++          + ER + L     
Sbjct: 173 EKALEELE--EVEEN--IERLDLIIDEKRQQLERLRRERE-------KAERYQALLKEKR 221

Query: 90  QQE--LKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQL 147
           + E    L++  AL  Q +  +    SL  +LE +    +E EK L      +EEL  ++
Sbjct: 222 EYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKI 281

Query: 148 EDLKRGR--THQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSV 205
           +DL        +     L AE+ +L R +        ++   L    E L K +AE++ +
Sbjct: 282 KDLGEEEQLRVKEKIGELEAEIASLERSI-------AEKERELEDAEERLAKLEAEIDKL 334

Query: 206 DLRISELQERLQRKRLLNQQLASQLNS 232
              I EL+  ++ +R    +L  +   
Sbjct: 335 LAEIEELEREIEEERKRRDKLTEEYAE 361



 Score = 39.7 bits (93), Expect = 0.002
 Identities = 30/195 (15%), Positives = 76/195 (38%), Gaps = 25/195 (12%)

Query: 40  AVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLR 99
           A +++   D++ +L   + +  + L + E   +++  E +R  +L +   + + +L  LR
Sbjct: 311 AEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLR 370

Query: 100 A-----------LRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLE 148
           A            R ++   +     L  ++  ++   +  ++EL     ++ +L   + 
Sbjct: 371 AELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIA 430

Query: 149 DLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKK----QAEVNS 204
            ++           L  E E+   E+    K  E +  +L        ++    + E + 
Sbjct: 431 GIEAK------INELEEEKEDKALEI----KKQEWKLEQLAADLSKYEQELYDLKEEYDR 480

Query: 205 VDLRISELQERLQRK 219
           V+  +S+LQ  L   
Sbjct: 481 VEKELSKLQRELAEA 495



 Score = 39.7 bits (93), Expect = 0.002
 Identities = 45/235 (19%), Positives = 102/235 (43%), Gaps = 27/235 (11%)

Query: 33  LGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQE 92
           +GE+E    + + + +  ++ +   E+ L  L+      Q++      L+ L  R+E+ E
Sbjct: 718 IGEIEKEIEQLEQEEEKLKERLEELEEDLSSLE------QEIENVKSELKELEARIEELE 771

Query: 93  LKLRKLRALRGQVDQNKMNNISLS---SDLESIRALFNEKEKELCMAVAKVEELTHQLED 149
             L KL           +N++      S +  I+A  ++ E+E+    A++ E+  +L  
Sbjct: 772 EDLHKLEE--------ALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNR 823

Query: 150 LKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQ----NVRLHQQRETLTKKQAEVNSV 205
           L   + +         EL+  R +L    K  E++    N +  +  E L + +A +  +
Sbjct: 824 LTLEKEYLEK---EIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDL 880

Query: 206 DLRISELQ---ERLQRKRLLNQQLASQLNSNINSSQHQLPPGKQQLVAQNSQLTS 257
           + R+ +L+   + L+ +    ++   +L + I   + +L   K +L A   +L+ 
Sbjct: 881 ESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSE 935



 Score = 38.9 bits (91), Expect = 0.004
 Identities = 41/198 (20%), Positives = 80/198 (40%), Gaps = 16/198 (8%)

Query: 27  EGVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLR--FLKQHEARHQQVAAEHERLRRL 84
           E V+  L ELE      +  +    + +   E RL    + + +A   ++  E  R+   
Sbjct: 754 ENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEAR 813

Query: 85  RDRVEQQELKLRKLRALRGQVDQNKMNNI--------SLSSDLESIRALFNEKEKELCMA 136
              +EQ+  +L   +    +  Q              S+  ++E++     E E+EL   
Sbjct: 814 LREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEEL 873

Query: 137 VAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLT 196
            A + +L  +L DLK+ R        L A+L  L R++       E++  RL + +  L 
Sbjct: 874 EAALRDLESRLGDLKKERD------ELEAQLRELERKIEELEAQIEKKRKRLSELKAKLE 927

Query: 197 KKQAEVNSVDLRISELQE 214
             + E++ ++    E +E
Sbjct: 928 ALEEELSEIEDPKGEDEE 945



 Score = 37.0 bits (86), Expect = 0.014
 Identities = 41/170 (24%), Positives = 79/170 (46%), Gaps = 16/170 (9%)

Query: 75  AAEHERLRRLRDRVEQQELKL----RKLRALRGQVDQNKMNNISLSSDLESIRALFNEKE 130
            +E   L+RLR+R+E  + +L     +LR +  ++D+        S  +  I     + E
Sbjct: 670 RSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLE 729

Query: 131 KELCMAVAKVEELTHQLEDLKRGRTHQPSH-PHLSAELENLRRELMYHNKLSEQQN---- 185
           +E      ++EEL   L  L++   +  S    L A +E L  +L   +KL E  N    
Sbjct: 730 QEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDL---HKLEEALNDLEA 786

Query: 186 ----VRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQLN 231
                R+ + +  L+K + EV+ ++ R+ E++++L R  L  + L  ++ 
Sbjct: 787 RLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQ 836



 Score = 37.0 bits (86), Expect = 0.014
 Identities = 42/246 (17%), Positives = 91/246 (36%), Gaps = 47/246 (19%)

Query: 19  LKAADLLTEGVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLK------------Q 66
            +A +   E +E  L  LE+   +   +I    + +   EQ L  L             +
Sbjct: 232 KEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLR 291

Query: 67  HEARHQQVAAEHERLRR----LRDRVEQQELKLRKL-----------RALRGQVDQNKMN 111
            + +  ++ AE   L R        +E  E +L KL             L  ++++ +  
Sbjct: 292 VKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKR 351

Query: 112 NISLSSDLESIRALFN-------EKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLS 164
              L+ +   ++           E +KE      ++++   +LE LKR            
Sbjct: 352 RDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKR------------ 399

Query: 165 AELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQ 224
            E+  L+REL    +  ++ +  L      +   +A++N ++    +    ++++    +
Sbjct: 400 -EINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLE 458

Query: 225 QLASQL 230
           QLA+ L
Sbjct: 459 QLAADL 464



 Score = 35.4 bits (82), Expect = 0.039
 Identities = 34/207 (16%), Positives = 92/207 (44%), Gaps = 20/207 (9%)

Query: 35  ELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELK 94
                  R + +++  ++ +++ +  LR ++      +      +  R++ +  ++ E  
Sbjct: 671 SEPAELQRLRERLEGLKRELSSLQSELRRIENR--LDELSQELSDASRKIGEIEKEIEQL 728

Query: 95  LRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGR 154
            ++   L+ ++++ + +  SL  ++E++++   E E  +      + +L   L DL+   
Sbjct: 729 EQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARL 788

Query: 155 THQPSHPHLSAELENLRRELMYHNKLS------EQQNVRLHQQRETLTKK----QAEVNS 204
           +H    P + AEL  L  E    +++       EQ+  RL  ++E L K+    Q +   
Sbjct: 789 SHSRI-PEIQAELSKLEEE---VSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRID 844

Query: 205 VDLRISELQERLQ----RKRLLNQQLA 227
           +  +I  +++ ++    +K  L ++L 
Sbjct: 845 LKEQIKSIEKEIENLNGKKEELEEELE 871



 Score = 32.0 bits (73), Expect = 0.53
 Identities = 20/113 (17%), Positives = 44/113 (38%), Gaps = 9/113 (7%)

Query: 128 EKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPH--------LSAELENLRRELMYHNK 179
           E  +   +   ++E L  +L  L+       +            S ++  + +E+    +
Sbjct: 672 EPAELQRLR-ERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQ 730

Query: 180 LSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQLNS 232
             E+   RL +  E L+  + E+ +V   + EL+ R++       +L   LN 
Sbjct: 731 EEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALND 783



 Score = 28.9 bits (65), Expect = 4.3
 Identities = 56/246 (22%), Positives = 107/246 (43%), Gaps = 29/246 (11%)

Query: 19   LKAADLLTEGVELTLGELEDMAVRQQAQIDSQRQ----LIAAKEQRLRFLKQHEARHQQV 74
            L    L  E +E  + EL++  +  + QI S  +    L   KE+    L++ EA  + +
Sbjct: 821  LNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDL 880

Query: 75   AAEHERLRRLRDRVEQQELKL-RKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKEL 133
             +    L++ RD +E Q  +L RK+  L  Q+++ +       S+L++      E+  E+
Sbjct: 881  ESRLGDLKKERDELEAQLRELERKIEELEAQIEKKR----KRLSELKAKLEALEEELSEI 936

Query: 134  CMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYH---NKLSEQQ----NV 186
                 + EE+  +            S   + AEL+ +  E+      N L+ Q+      
Sbjct: 937  EDPKGEDEEIPEEE----------LSLEDVQAELQRVEEEIRALEPVNMLAIQEYEEVLK 986

Query: 187  RLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQLNSNINSSQHQLPPGKQ 246
            RL + +E   K + E  ++  RI E ++   +KR +  +    +N N N    +L  G  
Sbjct: 987  RLDELKEKRAKLEEERKAILERIEEYEK---KKREVFMEAFEAINENFNEIFAELSGGTG 1043

Query: 247  QLVAQN 252
            +L+ +N
Sbjct: 1044 ELILEN 1049


>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
           HlyD family.  Type I secretion is an ABC transport
           process that exports proteins, without cleavage of any
           signal sequence, from the cytosol to extracellular
           medium across both inner and outer membranes. The
           secretion signal is found in the C-terminus of the
           transported protein. This model represents the adaptor
           protein between the ATP-binding cassette (ABC) protein
           of the inner membrane and the outer membrane protein,
           and is called the membrane fusion protein. This model
           selects a subfamily closely related to HlyD; it is
           defined narrowly and excludes, for example, colicin V
           secretion protein CvaA and multidrug efflux proteins
           [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 423

 Score = 39.6 bits (93), Expect = 0.002
 Identities = 40/191 (20%), Positives = 77/191 (40%), Gaps = 20/191 (10%)

Query: 36  LEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEAR-HQQVAAEHERLRRLRDRVEQQELK 94
           LE    R +A+ DSQ  +    +               Q +    R   LR ++E    +
Sbjct: 93  LEAEVARLRAEADSQAAIEFPDDLLSAEDPAVPELIKGQQSLFESRKSTLRAQLELILAQ 152

Query: 95  LRKLRALRG--QVDQNKMNNIS--LSSDLESIRALFNEKEKELCMAVAKVE--ELTHQLE 148
           +++L A     Q     +      +S +LE+ R L  +        V+++E  EL  +  
Sbjct: 153 IKQLEAELAGLQAQLQALRQQLEVISEELEARRKLKEKG------LVSRLELLELERERA 206

Query: 149 DLKRGRTHQPSHPHLSAELENLRRELM-YHNKLSEQQNVRLHQQRETLTKKQAEVNSVDL 207
           + +             AELE L+R++     +  + +     +  E LT+ QA +  +  
Sbjct: 207 EAQGELGRL------EAELEVLKRQIDELQLERQQIEQTFREEVLEELTEAQARLAELRE 260

Query: 208 RISELQERLQR 218
           R+++ ++RLQR
Sbjct: 261 RLNKARDRLQR 271


>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
          Length = 895

 Score = 37.2 bits (86), Expect = 0.012
 Identities = 35/195 (17%), Positives = 82/195 (42%), Gaps = 22/195 (11%)

Query: 47  IDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALRGQVD 106
           I+   ++ +A+      L+  + +  ++  +H++   +++R   + LKL  L + R    
Sbjct: 518 INEYNKIESARAD----LEDIKIKINELKDKHDKYEEIKNRY--KSLKLEDLDSKR--TS 569

Query: 107 QNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAE 166
                 +    D+E+ R+  NE +K+L    ++++E+     D K              E
Sbjct: 570 WLNALAVISLIDIETNRSRSNEIKKQLNDLESRLQEIEIGFPDDKS------YIDKSIRE 623

Query: 167 LENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQL 226
           +EN       +NK +E Q        E L  K   +++   +I+E+   +   + +  ++
Sbjct: 624 IENEAN--NLNNKYNEIQE--NKILIEKLRGK---IDNYKKQIAEIDSIIPDLKEITSRI 676

Query: 227 ASQLNSNINSSQHQL 241
            + +  N+  S+  L
Sbjct: 677 -NDIEDNLKKSRKAL 690



 Score = 30.3 bits (68), Expect = 2.0
 Identities = 15/100 (15%), Positives = 41/100 (41%), Gaps = 4/100 (4%)

Query: 125 LFNEKEKELCMAVAKVEELTHQLEDLKRG----RTHQPSHPHLSAELENLRRELMYHNKL 180
             N K  E+      +E+L  ++++ K+      +  P    +++ + ++   L    K 
Sbjct: 630 NLNNKYNEIQENKILIEKLRGKIDNYKKQIAEIDSIIPDLKEITSRINDIEDNLKKSRKA 689

Query: 181 SEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKR 220
            +       +   T+   +  +N +  RI+++ E L+  +
Sbjct: 690 LDDAKANRARLESTIEILRTRINELSDRINDINETLESMK 729



 Score = 29.1 bits (65), Expect = 3.9
 Identities = 30/190 (15%), Positives = 74/190 (38%), Gaps = 19/190 (10%)

Query: 33  LGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQ-----VAAEHERLRRLRDR 87
           L  LEDM  R +++I +    ++ + ++  + K+ E RH +     V      +      
Sbjct: 244 LSSLEDMKNRYESEIKTAESDLSMELEKNNYYKELEERHMKIINDPVYKNRNYINDYFKY 303

Query: 88  VEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQL 147
               E K + L  +  ++       I   S L+     + +K+          ++L +Q+
Sbjct: 304 KNDIENKKQILSNIDAEI-NKYHAIIKKLSVLQKDYNDYIKKKSRY-------DDLNNQI 355

Query: 148 EDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDL 207
            +L     ++  +      +E+L+       + S+          E L  ++ + +++  
Sbjct: 356 LEL---EGYEMDYNSYLKSIESLK---KKIEEYSKNIERMSAFISEILKIQEIDPDAIKK 409

Query: 208 RISELQERLQ 217
            ++E+  +LQ
Sbjct: 410 ELNEINVKLQ 419


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 36.6 bits (85), Expect = 0.017
 Identities = 45/204 (22%), Positives = 82/204 (40%), Gaps = 36/204 (17%)

Query: 50  QRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALRGQVDQNK 109
           +RQ +A + +R+    Q EA  +  AA  E    L     QQ  + R+ R    +  Q +
Sbjct: 27  KRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARRER----EELQRE 82

Query: 110 MNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELEN 169
              +              +KE++L     K++ L +QLE+ ++  + +        ELE 
Sbjct: 83  EERL-------------VQKEEQLDARAEKLDNLENQLEEREKALSAR------ELELEE 123

Query: 170 LRRELMYHNKLSEQQNVRLHQQRETLTK---------KQAEVNSVDLRISELQERLQRKR 220
           L ++    N+L     +   Q R+ L K         K   V  ++       ER  +  
Sbjct: 124 LEKQ--LDNELYRVAGLTPEQARKLLLKLLDAELEEEKAQRVKKIEEEADLEAERKAQNI 181

Query: 221 LLN--QQLASQLNSNINSSQHQLP 242
           L    Q++AS+  S+++ S   +P
Sbjct: 182 LAQAMQRIASETASDLSVSVVPIP 205


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 36.2 bits (84), Expect = 0.019
 Identities = 44/207 (21%), Positives = 73/207 (35%), Gaps = 39/207 (18%)

Query: 25  LTEGVELTLGEL---EDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERL 81
           L EG+E  L  +   E++  +    I+S +  +  K Q L+              E   L
Sbjct: 148 LKEGLEENLEGMKRDEELLNKDLNLINSIKPKLRKKLQALK-------------EEIASL 194

Query: 82  RRLRDRVEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVE 141
           R+L D  E       +L   R ++        SLS  +   R    E ++EL      +E
Sbjct: 195 RQLAD--ELNLCDPLELEKARQEL-------RSLSVKISEKRKQLEELQQELQELTIAIE 245

Query: 142 ELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAE 201
            LT++  +L              AE E +R E     +    + +   + + +L +K   
Sbjct: 246 ALTNKKSEL-LEEI---------AEAEKIREE----CRGWSAKEISKLKAKVSLLQKLTG 291

Query: 202 VNSVDLRISELQERLQRKRLLNQQLAS 228
              + L  S L     R   L    AS
Sbjct: 292 WKFLSLSGSTLSMSYNRTIELVFDPAS 318


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 35.1 bits (81), Expect = 0.040
 Identities = 37/197 (18%), Positives = 75/197 (38%), Gaps = 22/197 (11%)

Query: 64  LKQHEARHQQVAAEHERLRRLRDRVEQQELKLRK--------LRALRGQVDQNKMNNISL 115
               + + +Q+  E   L +     + Q  KL K        + +L  Q+ +   +   L
Sbjct: 33  AAADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKL 92

Query: 116 SSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAE-LENLRREL 174
              +  + A  N  E +      +   L  QL  L+R   + P    +S E  +   R  
Sbjct: 93  RKQIADLNARLNALEVQE---REQRRRLAEQLAALQRSGRNPPPALLVSPEDAQRSVRLA 149

Query: 175 MYHNKLS----------EQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQ 224
           +Y+  L+          +    +L   R  +  +QAE+ ++       Q +L +     +
Sbjct: 150 IYYGALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERK 209

Query: 225 QLASQLNSNINSSQHQL 241
           +  +QLNS +++ Q +L
Sbjct: 210 KTLAQLNSELSADQKKL 226



 Score = 28.1 bits (63), Expect = 7.2
 Identities = 17/72 (23%), Positives = 27/72 (37%), Gaps = 4/72 (5%)

Query: 35  ELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEH----ERLRRLRDRVEQ 90
           EL  +   Q+AQ     QL+  +++ L  L    +  Q+   E      RL+      E 
Sbjct: 186 ELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEA 245

Query: 91  QELKLRKLRALR 102
              K R+  A  
Sbjct: 246 AAAKAREAAAAA 257


>gnl|CDD|234714 PRK00286, xseA, exodeoxyribonuclease VII large subunit; Reviewed.
          Length = 438

 Score = 35.2 bits (82), Expect = 0.047
 Identities = 23/122 (18%), Positives = 46/122 (37%), Gaps = 17/122 (13%)

Query: 145 HQLEDLKRGRTHQPSHPHLSAELENLRR---ELMYHNKLSEQQNVRLHQQRETLTKKQAE 201
            +L+ L      +         L   ++    L    + + ++ +RL +QR     ++ +
Sbjct: 289 QRLDQLA----RRLKFQSPERLLAQQQQRLDRLQQRLQRALERRLRLAKQRLERLSQRLQ 344

Query: 202 VNSVDLRISELQERLQRKRLLNQQLASQLNSNINSSQHQLPPGKQQLVAQNSQLTSAHIN 261
             +   RI   Q+RL       +QL  +L   +   + QL   +Q+L A   QL +    
Sbjct: 345 QQNPQRRIERAQQRL-------EQLEQRLRRAM---RRQLKRKRQRLEALAQQLEALSPL 394

Query: 262 NV 263
             
Sbjct: 395 AT 396



 Score = 33.2 bits (77), Expect = 0.15
 Identities = 28/129 (21%), Positives = 55/129 (42%), Gaps = 17/129 (13%)

Query: 42  RQQAQIDSQRQLIAAKEQRLRFLKQH-EARH--QQVAAEHERLRRLRDRVEQQELKLRKL 98
            QQ    + R+ +  K QRL  L +  + +   + +A + +RL RL+ R+++   +  +L
Sbjct: 272 LQQRLARAMRRRLEQKRQRLDQLARRLKFQSPERLLAQQQQRLDRLQQRLQRALERRLRL 331

Query: 99  RALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAV--------AKVEELTHQLED- 149
              R +    ++   +    +E  +    + E+ L  A+         ++E L  QLE  
Sbjct: 332 AKQRLERLSQRLQQQNPQRRIERAQQRLEQLEQRLRRAMRRQLKRKRQRLEALAQQLEAL 391

Query: 150 -----LKRG 153
                L RG
Sbjct: 392 SPLATLARG 400


>gnl|CDD|236508 PRK09426, PRK09426, methylmalonyl-CoA mutase; Reviewed.
          Length = 714

 Score = 34.8 bits (81), Expect = 0.065
 Identities = 24/75 (32%), Positives = 37/75 (49%), Gaps = 14/75 (18%)

Query: 36  LEDMAVRQQAQIDSQRQLI-------AAKEQRLRFLK--QHEARHQQVAAEHERLRRLRD 86
           +E+ A R QA+IDS +Q+I         KE  +  L+      R +Q+A    RL RLR 
Sbjct: 436 IEEAAARTQARIDSGKQVIVGVNKYRLDKEDPIDVLEVDNTAVRAEQIA----RLERLRA 491

Query: 87  RVEQQELKLRKLRAL 101
             ++  ++   L AL
Sbjct: 492 ERDEAAVE-AALAAL 505


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 34.7 bits (80), Expect = 0.079
 Identities = 42/204 (20%), Positives = 86/204 (42%), Gaps = 23/204 (11%)

Query: 35  ELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELK 94
           ++E++  +    ++    L+ A E+ L+ LK+ E    Q   E E L +  + +E++  +
Sbjct: 186 KIEELEGQLSELLEDIEDLLEALEEELKELKKLE--EIQEEQEEEELEQEIEALEERLAE 243

Query: 95  LRKLRALRGQVDQNKMNNISLSSD-LESIRALFNEKEKELCMAVAKVEELTHQLEDLKRG 153
           L + +    ++    +   SL  + L+       E E+ L     ++EE   +LE+L+R 
Sbjct: 244 LEEEKERLEELKARLLEIESLELEALKIREEELRELERLLE----ELEEKIERLEELER- 298

Query: 154 RTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQR---ETLTKKQAEVNSVDLRIS 210
                       E+E L  EL     L E+    L + +   E L K + ++  ++  + 
Sbjct: 299 ------------EIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELE 346

Query: 211 ELQERLQRKRLLNQQLASQLNSNI 234
           EL E       L ++   +L   +
Sbjct: 347 ELAEEKNELAKLLEERLKELEERL 370



 Score = 32.0 bits (73), Expect = 0.53
 Identities = 45/211 (21%), Positives = 86/211 (40%), Gaps = 21/211 (9%)

Query: 27  EGVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHER------ 80
           E ++  L ELE      + +++   + I   E+++  L+  E    ++A   E+      
Sbjct: 406 EEIQEELEELEKELEELERELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCPVCGQ 465

Query: 81  -LRRLRDR--VEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAV 137
            L    ++  +E  EL+L +L     +  +       +    + +R L  E  + L +  
Sbjct: 466 ELPEEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEE 525

Query: 138 AKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTK 197
           A  EEL  +LE L+           L  ELE L+ +L       E +   L  + + L +
Sbjct: 526 ALKEELEEKLEKLEN----------LLEELEELKEKLQLQQLKEELRQ--LEDRLQELKE 573

Query: 198 KQAEVNSVDLRISELQERLQRKRLLNQQLAS 228
              E+  +  R  EL+E  +R + L ++L  
Sbjct: 574 LLEELRLLRTRKEELEELRERLKELKKKLKE 604



 Score = 32.0 bits (73), Expect = 0.54
 Identities = 43/197 (21%), Positives = 83/197 (42%), Gaps = 16/197 (8%)

Query: 35  ELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELK 94
           EL +     + +++   + +   E  L  L + +  ++      ERL+ L +R+E+ E +
Sbjct: 319 ELLEKLKSLEERLEKLEEKLEKLESELEELAEEK--NELAKLLEERLKELEERLEELEKE 376

Query: 95  LRKLRALRGQVDQNKMNNIS----LSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDL 150
           L K      Q+++           LS+ LE I+    E EKEL     ++EEL  +++ L
Sbjct: 377 LEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEIKKL 436

Query: 151 KRGRTHQPSHPHLSAELE---------NLRRELMYHNKLSEQQNVRLHQQRETLTKKQAE 201
           +       S   + AEL                 +  +L E   + L +  E L++++ E
Sbjct: 437 EEQINQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSREK-E 495

Query: 202 VNSVDLRISELQERLQR 218
              +   I EL++ L+ 
Sbjct: 496 EAELREEIEELEKELRE 512



 Score = 30.5 bits (69), Expect = 1.4
 Identities = 35/173 (20%), Positives = 69/173 (39%), Gaps = 21/173 (12%)

Query: 64  LKQHEARHQQVAAEHERLRRLRDRVEQQ-----ELKLRKLRALRGQVDQNKMNNISLSSD 118
           L+++E   + +    +  +   + +E Q     E     L AL  ++ + K         
Sbjct: 166 LEKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLE---EIQ 222

Query: 119 LESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHN 178
            E       ++ + L   +A++EE   +LE+LK             A L  +    +   
Sbjct: 223 EEQEEEELEQEIEALEERLAELEEEKERLEELK-------------ARLLEIESLELEAL 269

Query: 179 KLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQLN 231
           K+ E++   L +  E L +K   +  ++  I EL+E L+  R L ++L   L 
Sbjct: 270 KIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLE 322


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 34.2 bits (79), Expect = 0.085
 Identities = 31/186 (16%), Positives = 68/186 (36%), Gaps = 26/186 (13%)

Query: 47  IDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALRGQVD 106
                + +   E  L   K++      V      L  L + +E+   KL           
Sbjct: 295 ESHIEKALEELESILDTEKENSEFKLDVEELKALLEALEEILEKNLQKL----------- 343

Query: 107 QNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAE 166
           + K+ + S S +LESI  L       +      + E   ++++LK              E
Sbjct: 344 EEKLKDPSTSIELESITDLIESINDIIDAINELIREHNEKIDNLK-------------KE 390

Query: 167 LENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQL 226
               +++L  H     ++++  +Q  +     +  +NS++  I +L+  ++      ++L
Sbjct: 391 KNKAKKKLWLHLVAELKEDIDAYQ--KEKKGLEKAINSLEKEIKQLEAEIKALEKEIKEL 448

Query: 227 ASQLNS 232
             QL +
Sbjct: 449 EKQLTN 454


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 33.4 bits (77), Expect = 0.100
 Identities = 22/103 (21%), Positives = 44/103 (42%), Gaps = 7/103 (6%)

Query: 97  KLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTH 156
           +L  L+ ++ + +     L   L  ++    E ++EL    A++E L  +L  +K+    
Sbjct: 67  RLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSA- 125

Query: 157 QPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQ 199
                  + EL+   REL       +Q+N  L  + E L + +
Sbjct: 126 ------NAIELDEENRELREELAELKQENEALEAENERLQENE 162



 Score = 32.3 bits (74), Expect = 0.20
 Identities = 18/105 (17%), Positives = 41/105 (39%), Gaps = 14/105 (13%)

Query: 62  RFLKQHEARHQQVAA---EHERLRRLRDRVEQQELKLRK-LRALRGQVDQNKMNNISLSS 117
           R+L    +  +++     E   L+     +++Q  +L++  + L+ ++   +     L  
Sbjct: 56  RYLSDTPSARERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQK 115

Query: 118 DLESIR-------ALFNEKE---KELCMAVAKVEELTHQLEDLKR 152
           +L  I+        L  E     +EL     + E L  + E L+ 
Sbjct: 116 ELARIKQLSANAIELDEENRELREELAELKQENEALEAENERLQE 160


>gnl|CDD|225519 COG2972, COG2972, Predicted signal transduction protein with a
           C-terminal ATPase domain [Signal transduction
           mechanisms].
          Length = 456

 Score = 34.0 bits (78), Expect = 0.11
 Identities = 29/112 (25%), Positives = 44/112 (39%), Gaps = 19/112 (16%)

Query: 79  ERLRRLRDRVEQQELKLRK--LRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMA 136
           ERL  L +     E +  +  LRAL+ Q     +N   L + LE+IR L  E + E    
Sbjct: 237 ERLEYLIEENYSLEQEQLEAELRALQSQ-----INPHFLYNTLETIRMLAEEDDPE---- 287

Query: 137 VAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRL 188
             +  ++   L  L R              +  L  EL+   K  E Q +R+
Sbjct: 288 --EAAKVVKALSKLLRYSL------SNLDNIVTLEIELLLIEKYLEIQKLRI 331


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 33.9 bits (78), Expect = 0.11
 Identities = 37/167 (22%), Positives = 74/167 (44%), Gaps = 15/167 (8%)

Query: 75  AAEHERLRRLRDRVEQQELKLRKLRALRGQVDQNKMNNI---SLSSDLESIRALFNEKEK 131
           AA ++     + ++E+ E KL +L  +   V++ K   I    L+  L  ++     +EK
Sbjct: 348 AAAYKAYLAYKPKLEKVERKLPEL-GIWKDVERIKALVIRGYPLAEALSKVKEEERPREK 406

Query: 132 ELCMAVAKVEELTHQLEDLKRGRT---HQPSHPHLSAELENLRREL-MYHNKLSE-QQNV 186
           E      + E   ++    K   T    +  +  L  ELE L+RE+    ++L   ++ V
Sbjct: 407 EGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRREV 466

Query: 187 RLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQLNSN 233
           R   +++       E+ + D RI  L++ L+ K+   ++L  +L   
Sbjct: 467 RDKVRKDR------EIRARDRRIERLEKELEEKKKRVEELERKLAEL 507



 Score = 28.1 bits (63), Expect = 7.7
 Identities = 17/74 (22%), Positives = 29/74 (39%), Gaps = 7/74 (9%)

Query: 79  ERLRRLRDRVEQQELKLRKLRALRGQVD------QNKMNNIS-LSSDLESIRALFNEKEK 131
             L R R  V  +  K R++RA   +++      + K   +  L   L  +R +   +  
Sbjct: 457 SELERFRREVRDKVRKDREIRARDRRIERLEKELEEKKKRVEELERKLAELRKMRKLELS 516

Query: 132 ELCMAVAKVEELTH 145
                V  VE+LT 
Sbjct: 517 GKGTPVKVVEKLTL 530


>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
          Length = 1486

 Score = 33.8 bits (78), Expect = 0.16
 Identities = 40/194 (20%), Positives = 72/194 (37%), Gaps = 19/194 (9%)

Query: 30  ELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVE 89
           +LT    ED     QA+     + + + EQ+L      +A H Q    ++ +R++   V 
Sbjct: 434 DLTADNAEDWLEEFQAKEQEATEELLSLEQKLSV---AQAAHSQFEQAYQLVRKIAGEVS 490

Query: 90  QQE---------LKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKV 140
           + E          +LR+ R L  Q+ Q +M    L   L   +        E C  + K 
Sbjct: 491 RSEAWDVARELLRRLREQRHLAEQLQQLRMRLSELEQRLRQQQRA-ERLLAEFCKRLGKN 549

Query: 141 EELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQA 200
            +   +LE L+     +     LS  +   R   M       QQ  +L  + + L  +  
Sbjct: 550 LDDEDELEQLQEELEARLE--SLSESVSEARERRMAL----RQQLEQLQARIQRLAARAP 603

Query: 201 EVNSVDLRISELQE 214
              +    ++ L+E
Sbjct: 604 AWLAAQDALARLRE 617



 Score = 31.5 bits (72), Expect = 0.70
 Identities = 45/242 (18%), Positives = 94/242 (38%), Gaps = 34/242 (14%)

Query: 6    MTLRNRPYPSSSTLKAADLLTEGVELTLGELEDMAVRQQAQ--IDSQRQLIAAKEQRLRF 63
            ++  NR  P  + L    L     E+     E +   ++A+  +      +A  E  +  
Sbjct: 874  LSALNRLLPRLNLLADETLADRVEEIR----EQLDEAEEAKRFVQQHGNALAQLEPIVSV 929

Query: 64   LKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALR---------GQVDQNKMNNIS 114
            L+    + +Q+  ++++ ++ +   +QQ   L ++   R           + +N   N  
Sbjct: 930  LQSDPEQFEQLKQDYQQAQQTQRDAKQQAFALTEVVQRRAHFSYEDAAEMLAKNSDLNEK 989

Query: 115  LSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLK-RGRTHQPSHPHLSAELENL--- 170
            L   LE          ++L  A A++ +    L  LK      +     L  EL++L   
Sbjct: 990  LRQRLEQAEQERTRAREQLRQAQAQLAQYNQVLASLKSSYDAKRQMLQELKQELQDLGVP 1049

Query: 171  ------------RRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQR 218
                        R EL  H +LS  ++ R +Q  + LT  +AE++++  ++ +L+     
Sbjct: 1050 ADSGAEERARARRDEL--HARLSANRS-RRNQLEKQLTFCEAEMDNLTKKLRKLERDYHE 1106

Query: 219  KR 220
             R
Sbjct: 1107 MR 1108


>gnl|CDD|161783 TIGR00237, xseA, exodeoxyribonuclease VII, large subunit.  This
           family consist of exodeoxyribonuclease VII, large
           subunit XseA which catalyses exonucleolytic cleavage in
           either the 5'->3' or 3'->5' direction to yield
           5'-phosphomononucleotides. Exonuclease VII consists of
           one large subunit and four small subunits [DNA
           metabolism, Degradation of DNA].
          Length = 432

 Score = 32.9 bits (75), Expect = 0.20
 Identities = 18/102 (17%), Positives = 42/102 (41%), Gaps = 9/102 (8%)

Query: 162 HLSAELENLRRELMYHNKLSEQQNVRLHQQRETL-------TKKQAEVNSVDLRISELQE 214
            L A L+    +      L + Q  +L ++++         T+++    +  L  +    
Sbjct: 288 QLVASLQRQHPQNKLA--LQQLQFEKLEKRKQAALNKQLERTRQKKTRLTKRLTQTNPSP 345

Query: 215 RLQRKRLLNQQLASQLNSNINSSQHQLPPGKQQLVAQNSQLT 256
           ++ R +   +QL  +LN+  N+  +   P    LV + + L+
Sbjct: 346 QILRAQTRTEQLNRRLNALKNAQANLKLPQFGTLVGKLNALS 387



 Score = 30.2 bits (68), Expect = 1.7
 Identities = 25/124 (20%), Positives = 52/124 (41%), Gaps = 18/124 (14%)

Query: 42  RQQAQIDS-QRQLIAAKEQRLRFLKQHEARHQQVAAE---HERLRRLRDRVEQQELKLRK 97
            QQ Q +  +++  AA  ++L   +Q + R  +   +     ++ R + R EQ   +L  
Sbjct: 304 LQQLQFEKLEKRKQAALNKQLERTRQKKTRLTKRLTQTNPSPQILRAQTRTEQLNRRLNA 363

Query: 98  LRALRGQVDQ---------NKMNNISLSSDLESIRALFNEKEKELCMAVAKVEE---LTH 145
           L     Q +           K+N +S    L    ++   ++ +   +V +V+    LT 
Sbjct: 364 L--KNAQANLKLPQFGTLVGKLNALSPLEVLARGYSIALNEKGKAIKSVKQVDRGDRLTT 421

Query: 146 QLED 149
           +L+D
Sbjct: 422 KLKD 425


>gnl|CDD|239651 cd03679, MM_CoA_mutase_alpha_like, Coenzyme
           B12-dependent-methylmalonyl coenzyme A (CoA) mutase
           (MCM) family, Alpha subunit-like subfamily; contains
           proteins similar to the alpha subunit of
           Propionbacterium shermanni MCM, as well as human and E.
           coli MCM. Members of this subfamily contain an
           N-terminal MCM domain and a C-terminal coenzyme B12
           binding domain. MCM catalyzes the isomerization of
           methylmalonyl-CoA to succinyl-CoA. The reaction proceeds
           via radical intermediates beginning with a
           substrate-induced homolytic cleavage of the Co-C bond of
           coenzyme B12 to produce cob(II)alamin and the
           deoxyadenosyl radical. MCM plays an important role in
           the conversion of propionyl-CoA to succinyl-CoA during
           the degradation of propionate for the Krebs cycle. In
           higher animals, MCM is involved in the breakdown of
           odd-chain fatty acids, several amino acids, and
           cholesterol. Methylobacterium extorquens MCM
           participates in the glyoxylate regeneration pathway. In
           M. extorquens, MCM forms a complex with MeaB; MeaB may
           protect MCM from irreversible inactivation. In some
           bacteria, MCM is involved in the reverse metabolic
           reaction, the rearrangement of succinyl-CoA to
           methylmalonyl-CoA. Examples include P. shermanni MCM
           during propionic acid fermentation, E.coli MCM in a
           pathway for the conversion of succinate to propionate
           and Streptomyces MCM in polyketide biosynthesis.
           Sinorhizobium meliloti strain SU47 MCM plays a role in
           the polyhydroxyalkanoate degradation pathway. P.
           shermanni and Streptomyces cinnamonensis MCMs are
           alpha/beta heterodimers. It has been shown for P.
           shermanni MCM that only the alpha subunit binds coenzyme
           B12 and substrates. Human MCM is a homodimer with two
           active sites. Mouse and E.coli MCMs are also homodimers.
           In humans, impaired activity of MCM results in
           methylmalonic aciduria, a disorder of propionic acid
           metabolism.
          Length = 536

 Score = 33.1 bits (76), Expect = 0.22
 Identities = 22/66 (33%), Positives = 33/66 (50%), Gaps = 10/66 (15%)

Query: 36  LEDMAVRQQAQIDSQRQLI-------AAKEQRLRFLK--QHEARHQQVAAEHERLRRLRD 86
           +E+ A R+QA+IDS R++I          E+ L  LK      R +Q+A   ++LR  RD
Sbjct: 430 IEEAAARRQARIDSGREVIVGVNKYRLDHEEPLDVLKIDNTAVRAEQIAR-LKKLRAERD 488

Query: 87  RVEQQE 92
               Q 
Sbjct: 489 PEAVQA 494


>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
           PFL_4705 family.  Members of this protein family are
           found occasionally on plasmids such as the Pseudomonas
           putida toluene catabolic TOL plasmid pWWO_p085. Usually,
           however, they are found on the bacterial main chromosome
           in regions flanked by markers of conjugative transfer
           and/or transposition [Mobile and extrachromosomal
           element functions, Plasmid functions].
          Length = 472

 Score = 32.7 bits (75), Expect = 0.24
 Identities = 27/103 (26%), Positives = 43/103 (41%), Gaps = 21/103 (20%)

Query: 163 LSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLL 222
           L AE++ LR+ L    KL   +N  L  + E L K++    S+D +I   Q+ +Q +   
Sbjct: 64  LVAEVKELRKRL---AKL-ISENEALKAENERLQKRE---QSIDQQI---QQAVQSET-- 111

Query: 223 NQQLASQLNSNINSSQHQLPPGKQQLVAQNSQLTSAHINNVNS 265
            Q+L  +          QL   +QQL     QL       +  
Sbjct: 112 -QELTKE--------IEQLKSERQQLQGLIDQLQRRLAGVLTG 145


>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57.  Eukaryotic
           cilia and flagella are specialised organelles found at
           the periphery of cells of diverse organisms.
           Intra-flagellar transport (IFT) is required for the
           assembly and maintenance of eukaryotic cilia and
           flagella, and consists of the bidirectional movement of
           large protein particles between the base and the distal
           tip of the organelle. IFT particles contain multiple
           copies of two distinct protein complexes, A and B, which
           contain at least 6 and 11 protein subunits. IFT57 is
           part of complex B but is not, however, required for the
           core subunits to stay associated. This protein is known
           as Huntington-interacting protein-1 in humans.
          Length = 355

 Score = 32.4 bits (74), Expect = 0.29
 Identities = 33/172 (19%), Positives = 66/172 (38%), Gaps = 35/172 (20%)

Query: 46  QIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALRGQV 105
            ++   QL  A  + L   KQ+             L++L   + +   K+          
Sbjct: 217 HLEQMHQLKKAISKSLPETKQY-------------LKKLSQDLSKALEKISSREKHLNNQ 263

Query: 106 DQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSA 165
            +       L  +    R   ++ +++   A   V ELT +L ++             S 
Sbjct: 264 LE------QLVQEYREARRTLSQVQEKYNQASQGVSELTRELNEI-------------SE 304

Query: 166 ELENLRRELMYH-NKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERL 216
           ELE +++E+      +S+   +   + ++ +TK + E+  +DLRI  LQ  L
Sbjct: 305 ELEQVKQEMEERGASMSDGSPL--VKIKQAITKLKEEIKQMDLRIGVLQHTL 354


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 32.4 bits (75), Expect = 0.30
 Identities = 18/92 (19%), Positives = 41/92 (44%), Gaps = 1/92 (1%)

Query: 42  RQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERL-RRLRDRVEQQELKLRKLRA 100
           +++A+   +  L+ AKE+  +   + E   ++   E ++L +RL  + E  + KL  L  
Sbjct: 48  KKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEK 107

Query: 101 LRGQVDQNKMNNISLSSDLESIRALFNEKEKE 132
              ++++ +        +LE       E  +E
Sbjct: 108 REEELEKKEKELEQKQQELEKKEEELEELIEE 139



 Score = 32.1 bits (74), Expect = 0.44
 Identities = 17/81 (20%), Positives = 36/81 (44%), Gaps = 6/81 (7%)

Query: 166 ELENLRREL-----MYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKR 220
           E+  LR E         N+L + +  RL Q+ E L +K   +   +  + + ++ L++K+
Sbjct: 65  EIHKLRNEFEKELRERRNELQKLEK-RLLQKEENLDRKLELLEKREEELEKKEKELEQKQ 123

Query: 221 LLNQQLASQLNSNINSSQHQL 241
              ++   +L   I     +L
Sbjct: 124 QELEKKEEELEELIEEQLQEL 144



 Score = 29.7 bits (68), Expect = 2.1
 Identities = 39/176 (22%), Positives = 70/176 (39%), Gaps = 42/176 (23%)

Query: 49  SQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALRGQVDQN 108
           ++ ++  A+E+  R L+  EA+ +  A + E L   ++ + +          LR + ++ 
Sbjct: 29  AEAKIKEAEEEAKRILE--EAKKEAEAIKKEALLEAKEEIHK----------LRNEFEK- 75

Query: 109 KMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELE 168
                    +L   R    + EK L   + K E L  +LE L++             ELE
Sbjct: 76  ---------ELRERRNELQKLEKRL---LQKEENLDRKLELLEK----------REEELE 113

Query: 169 NLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQ 224
              +EL       EQ+   L ++ E L +   E      RIS L     ++ LL +
Sbjct: 114 KKEKEL-------EQKQQELEKKEEELEELIEEQLQELERISGLTAEEAKEILLEK 162


>gnl|CDD|163506 TIGR03794, NHLM_micro_HlyD, NHLM bacteriocin system secretion
           protein.  Members of this protein family are homologs of
           the HlyD membrane fusion protein of type I secretion
           systems. Their occurrence in prokaryotic genomes is
           associated with the occurrence of a novel class of
           microcin (small bacteriocins) with a leader peptide
           region related to nitrile hydratase. We designate the
           class of bacteriocin as Nitrile Hydratase Leader
           Microcin, or NHLM. This family, therefore, is designated
           as NHLM bacteriocin system secretion protein. Some but
           not all NHLM-class putative microcins belong to the TOMM
           (thiazole/oxazole modified microcin) class as assessed
           by the presence of the scaffolding protein and/or
           cyclodehydratase in the same gene clusters [Transport
           and binding proteins, Amino acids, peptides and amines,
           Cellular processes, Biosynthesis of natural products].
          Length = 421

 Score = 32.5 bits (74), Expect = 0.32
 Identities = 45/194 (23%), Positives = 74/194 (38%), Gaps = 21/194 (10%)

Query: 79  ERLRRLRDRVEQQELKLRKLRALRGQVDQNKMNNISLSSD-LESIRALFNEKEKELCMAV 137
           ERL+    ++ Q + +L ++R   G++ + +  +   S + LE       E+   L   V
Sbjct: 96  ERLQESYQKLTQLQEQLEEVRNYTGRLKEGRERHFQKSKEALEETIGRLREELAALSREV 155

Query: 138 AKVEE-LTHQLEDLKRGRTHQ--------PSHPHLSAELEN-LRRELMYHNKLSEQQNVR 187
            K    L+  L   KR R  Q               A     L+ +    N  +  Q   
Sbjct: 156 GKQRGLLSRGLATFKRDRILQQQWREEQEKYDAADKARAIYALQTKADERNLETVLQ--S 213

Query: 188 LHQQRETL-TKKQAEVNSVDLRISELQER---LQRKRLLNQQLASQLNSNINSSQHQLPP 243
           L Q    L    + E+ +V+ RI E +     L+ K  LN ++ SQ +  +      L  
Sbjct: 214 LSQADFQLAGVAEKELETVEARIKEARYEIEELENKLNLNTRIVSQHSGRVIE----LNY 269

Query: 244 GKQQLVAQNSQLTS 257
              QLVA  + L S
Sbjct: 270 TPGQLVAAGAPLAS 283


>gnl|CDD|224486 COG1570, XseA, Exonuclease VII, large subunit [DNA replication,
           recombination, and repair].
          Length = 440

 Score = 32.2 bits (74), Expect = 0.41
 Identities = 37/140 (26%), Positives = 57/140 (40%), Gaps = 16/140 (11%)

Query: 135 MAVAKVEELTHQLEDLKRGRTHQPSHPHLSAE---LENLRRELM---YHNKLSEQQNVRL 188
           + V    EL  QL+ L+R R H+     L  +   LE+L R L        LSEQQ  RL
Sbjct: 258 LVVPDSAELLQQLDQLQR-RLHRALRRLLDQKKQRLEHLARRLQFRSPERLLSEQQQ-RL 315

Query: 189 HQQRETLTKKQAEV-----NSVDLRISELQERLQRKRLLNQQLASQLNSNINSSQHQLPP 243
            +    L +            ++     L  ++QR++   QQL  +L+  +     QL  
Sbjct: 316 DELAIRLRRALENQLALKKQRLERLTQRLNPQIQRQQQRLQQLERRLDKALR---RQLKR 372

Query: 244 GKQQLVAQNSQLTSAHINNV 263
            +++L A   QL S      
Sbjct: 373 KRERLEALVEQLESLSPLAT 392



 Score = 28.8 bits (65), Expect = 4.1
 Identities = 32/144 (22%), Positives = 65/144 (45%), Gaps = 11/144 (7%)

Query: 20  KAADLLTEGVELTLGELEDMAVR----QQAQIDSQRQLIAAKEQRLRFL---KQHEARHQ 72
            AA+L+       L +L+ +  R     +  +D ++Q +    +RL+F    +    + Q
Sbjct: 254 AAAELVVPDSAELLQQLDQLQRRLHRALRRLLDQKKQRLEHLARRLQFRSPERLLSEQQQ 313

Query: 73  QVAAEHERLRR-LRDRVEQQELKLRKLR-ALRGQVDQNKMNNISLSSDL-ESIRALFNEK 129
           ++     RLRR L +++  ++ +L +L   L  Q+ + +     L   L +++R     K
Sbjct: 314 RLDELAIRLRRALENQLALKKQRLERLTQRLNPQIQRQQQRLQQLERRLDKALRRQLKRK 373

Query: 130 EKELCMAVAKVEELTHQLEDLKRG 153
            + L   V ++E L+     L RG
Sbjct: 374 RERLEALVEQLESLSPLAT-LARG 396


>gnl|CDD|234172 TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family.
           Members of this protein family are homologs of ClpB, an
           ATPase associated with chaperone-related functions.
           These ClpB homologs, designated ClpV1, are a key
           component of the bacterial pathogenicity-associated type
           VI secretion system [Protein fate, Protein and peptide
           secretion and trafficking, Cellular processes,
           Pathogenesis].
          Length = 852

 Score = 32.2 bits (74), Expect = 0.47
 Identities = 23/104 (22%), Positives = 45/104 (43%), Gaps = 11/104 (10%)

Query: 34  GELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQ----QVAAEHERLRRLRDRVE 89
             LED+  R+ A ++ +   +   E+       H+ R      ++AA    L  L  R +
Sbjct: 419 AALEDLR-RRIAALELELDAL---EREAALGADHDERLAELRAELAALEAELAALEARWQ 474

Query: 90  QQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKEL 133
           Q++  +  + ALR +++ +       + D  ++RA   E E  L
Sbjct: 475 QEKELVEAILALRAELEADADAP---ADDDAALRAQLAELEAAL 515


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 31.9 bits (73), Expect = 0.49
 Identities = 43/197 (21%), Positives = 80/197 (40%), Gaps = 22/197 (11%)

Query: 27  EGVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRD 86
           E +E  L +LE+     + +++    L+ A+++  R  ++ E   + +A   E +   R+
Sbjct: 478 EELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRE 537

Query: 87  RVEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQ 146
           R E+          LR +  +       L ++ E  R    E E+E   A  +V EL  +
Sbjct: 538 RAEE----------LRERAAE-------LEAEAEEKREAAAEAEEEAEEAREEVAELNSK 580

Query: 147 LEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLT-----KKQAE 201
           L +LK           L A + +   E+    +  E       ++RE L      K++ E
Sbjct: 581 LAELKERIESLERIRTLLAAIADAEDEIERLREKREALAELNDERRERLAEKRERKRELE 640

Query: 202 VNSVDLRISELQERLQR 218
               + RI E +E  +R
Sbjct: 641 AEFDEARIEEAREDKER 657


>gnl|CDD|240016 cd04658, Piwi_piwi-like_Euk, Piwi_piwi-like_Euk: PIWI domain,
           Piwi-like subfamily found in eukaryotes. This domain is
           found in Piwi and closely related proteins, where it is
           believed to perform a crucial role in germline cells,
           via RNA silencing. RNA silencing refers to a group of
           related gene-silencing mechanisms mediated by short RNA
           molecules, including siRNAs, miRNAs, and
           heterochromatin-related guide RNAs. The mechanism in
           Piwi is believed to be similar to that in Argonaute, the
           central component of the RNA-induced silencing complex
           (RISC). The PIWI domain is the C-terminal portion of
           Argonaute and consists of two subdomains, one of which
           provides the 5' anchoring of the guide RNA and the
           other, the catalytic site for slicing.
          Length = 448

 Score = 31.9 bits (73), Expect = 0.53
 Identities = 19/103 (18%), Positives = 36/103 (34%), Gaps = 2/103 (1%)

Query: 213 QERLQRKRLLNQQLASQLNSNINSSQHQLPPGKQQLVAQNSQLTSAHINNVNSLFRSMTS 272
           +ER    R   Q++    +      +  +      L  Q   L    I   N    + ++
Sbjct: 15  KERYDTIRQFIQRIQKNPSVQELLKKWGIELDSNPLKIQGRVLPPEQIIMGNVFVYANSN 74

Query: 273 SNIAAVAPYLHVPSKPNLDNTHAIF--QNQRRIESGHNQLNKV 313
           ++         +    NL+N   I+  ++QR  ES    L +V
Sbjct: 75  ADWKREIRNQPLYDAVNLNNWVLIYPSRDQREAESFLQTLKQV 117


>gnl|CDD|213368 cd12834, ZntB_u1, Uncharacterized bacterial subgroup of the
           Salmonella typhimurium Zn2+ transporter ZntB-like
           subfamily.  The MIT superfamily of essential membrane
           proteins is involved in transporting divalent cations
           (uptake or efflux) across membranes. The ZntB-like
           subfamily includes the Zn2+ transporter Salmonella
           typhimurium ZntB which mediates the efflux of Zn2+ (and
           Cd2+). Structures of the intracellular domain of Vibrio
           parahaemolyticus and Salmonella typhimurium ZntB form
           funnel-shaped homopentamers, the tip of the funnel is
           formed from two C-terminal transmembrane (TM) helices
           from each monomer, and the large opening of the funnel
           from the N-terminal cytoplasmic domains. The GMN
           signature motif of the MIT superfamily occurs just after
           TM1, mutation within this motif is known to abolish Mg2+
           transport through Salmonella typhimurium CorA, and
           Mrs2p. Natural variants such as GVN and GIN which occur
           in proteins belonging to this subfamily, may be
           associated with the transport of different divalent
           cations, such as zinc and cadmium. The functional
           diversity of MIT transporters may also be due to minor
           structural differences regulating gating, substrate
           selection, and transport.
          Length = 290

 Score = 31.0 bits (71), Expect = 0.66
 Identities = 20/78 (25%), Positives = 35/78 (44%), Gaps = 9/78 (11%)

Query: 160 HPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRK 219
           H  L+ E   L R L    +  +       +    LT+    ++++D  +  LQER    
Sbjct: 161 HRLLAPEPAALFRLL---RRPPDWLPDADLETLRRLTE---RLDALDRDLVALQERA--- 211

Query: 220 RLLNQQLASQLNSNINSS 237
           RLL +++A++L    N S
Sbjct: 212 RLLQEEIAARLAEQTNRS 229


>gnl|CDD|221016 pfam11180, DUF2968, Protein of unknown function (DUF2968).  This
           family of proteins has no known function.
          Length = 192

 Score = 30.5 bits (69), Expect = 0.76
 Identities = 24/81 (29%), Positives = 40/81 (49%), Gaps = 5/81 (6%)

Query: 32  TLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQ----HEARHQQVAAEHERLRRLRDR 87
            L ++E    + +AQ     + IAA E R   L+      +A+ QQVAA  ++ R+    
Sbjct: 99  QLADVEIRRAQLEAQKAQLERQIAASEARAERLQADLQLAQAQEQQVAARQQQARQEAVA 158

Query: 88  VEQQELKLR-KLRALRGQVDQ 107
           +E Q    + +LR L+ Q+ Q
Sbjct: 159 LEAQRQAAQAQLRKLQRQIRQ 179


>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
          Length = 428

 Score = 31.2 bits (71), Expect = 0.82
 Identities = 50/214 (23%), Positives = 86/214 (40%), Gaps = 32/214 (14%)

Query: 40  AVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLR 99
           A   + Q+ S +Q IAAKE+ +R         QQ       L +L+ + E      RKLR
Sbjct: 42  ASDNRDQLKSIQQDIAAKEKSVR---------QQQQQRASLLAQLKKQEEAISQASRKLR 92

Query: 100 ALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPS 159
             +  ++Q       L++ +  +      +E+           L  QL+   R   H   
Sbjct: 93  ETQNTLNQLNKQIDELNASIAKLEQQQAAQER----------LLAAQLDAAFRQGEHTGL 142

Query: 160 HPHLSAELENLRRE--LMYHNKLSE----------QQNVRLHQQRETLTKKQAEVNSVDL 207
              LS E E+ R E  L Y   L++          Q    L  Q+  L +KQ++  ++  
Sbjct: 143 QLILSGE-ESQRGERILAYFGYLNQARQETIAELKQTREELAAQKAELEEKQSQQKTLLY 201

Query: 208 RISELQERLQRKRLLNQQLASQLNSNINSSQHQL 241
                Q++L++ R   ++  + L S++   Q QL
Sbjct: 202 EQQAQQQKLEQARNERKKTLTGLESSLQKDQQQL 235


>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
           envelope biogenesis, outer membrane].
          Length = 835

 Score = 31.2 bits (71), Expect = 0.93
 Identities = 51/261 (19%), Positives = 86/261 (32%), Gaps = 64/261 (24%)

Query: 7   TLRNRPYPSSSTLKAADLLTEGVELTLGELEDMAVRQQAQIDSQRQ---------LIAAK 57
            +      S   L  A+     +E  L  L++ AV  + Q +++           LI  +
Sbjct: 7   NVLQELLQSRRELLTAESAQ--LEAALQLLQE-AVNSKRQEEAEPAAEEAELQAELIQQE 63

Query: 58  EQRLR----FLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALRGQVDQNKMNNI 113
                     L Q   R   +A++  +L  L  ++ Q        R +R Q         
Sbjct: 64  LAINDQLSQALNQQTERLNALASDDRQLANLLLQLLQSS------RTIREQ-------IA 110

Query: 114 SLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRE 173
            L   L   R L  +        + +  +   Q E              ++ E + L+ E
Sbjct: 111 VLRGSLLLSRILLQQLG-----PLPEAGQPQEQFE--------------VTQERDALQAE 151

Query: 174 LMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQLNSN 233
             Y N L  Q         E LT   AEV      I +     +R+ LLN  L+ +   +
Sbjct: 152 KAYINALEGQA--------EQLT---AEVR----DILDQILDTRRE-LLNSLLSQREAIS 195

Query: 234 INSSQHQLPPGKQQLVAQNSQ 254
           +  +Q QL     +L +   Q
Sbjct: 196 LQLNQQQLSAASDELRSLLHQ 216


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 30.4 bits (69), Expect = 0.94
 Identities = 29/135 (21%), Positives = 63/135 (46%), Gaps = 12/135 (8%)

Query: 23  DLLTEGVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLR 82
           DL  + +E  + E+     + +A++++  + + A E  L  L+   ++ +        ++
Sbjct: 16  DLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLES------EIQ 69

Query: 83  RLRDRVEQQELKL------RKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMA 136
            +R+R+++ E KL      R+LRAL  ++   K    SL  +L  +     + EKE+   
Sbjct: 70  EIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDL 129

Query: 137 VAKVEELTHQLEDLK 151
             ++E L   L + +
Sbjct: 130 KERLERLEKNLAEAE 144


>gnl|CDD|233067 TIGR00641, acid_CoA_mut_N, methylmalonyl-CoA mutase N-terminal
           domain.  Methylmalonyl-CoA mutase (EC 5.4.99.2)
           catalyzes a reversible isomerization between
           L-methylmalonyl-CoA and succinyl-CoA. The enzyme uses an
           adenosylcobalamin cofactor. It may be a homodimer, as in
           mitochondrion, or a heterodimer with partially
           homologous beta chain that does not bind the
           adenosylcobalamin cofactor, as in Propionibacterium
           freudenreichii. The most similar archaeal sequences are
           separate chains, such as AF2215 abd AF2219 of
           Archaeoglobus fulgidus, that correspond roughly to the
           first 500 and last 130 residues, respectively of known
           methylmalonyl-CoA mutases. This model describes the
           N-terminal domain subfamily. In a neighbor-joining tree,
           AF2215 branches with a bacterial isobutyryl-CoA mutase,
           which is also the same length. Scoring between the noise
           and trusted cutoffs are the non-catalytic, partially
           homologous beta chains from certain heterodimeric
           examples of 5.4.99.2.
          Length = 524

 Score = 30.9 bits (70), Expect = 1.1
 Identities = 27/85 (31%), Positives = 43/85 (50%), Gaps = 11/85 (12%)

Query: 35  ELEDMAVRQQAQIDSQRQLIAA-------KEQRLRFLK--QHEARHQQVAAEHERLRRLR 85
            +E+ A R QA+IDS RQ+I         +E  +  LK      R +Q+ A+ ++LR  R
Sbjct: 409 RIEEAAARTQARIDSGRQVIVGVNKYQLEEEDEVEVLKVDNSSVREEQI-AKLKKLRAER 467

Query: 86  DRVEQQELKLRKLRALRGQVDQNKM 110
           D+ E+ E  L  L     + D+N +
Sbjct: 468 DQ-EKVEAALDALTKAAEKEDENLL 491


>gnl|CDD|217551 pfam03428, RP-C, Replication protein C N-terminal domain.
           Replication protein C is involved in the early stages of
           viral DNA replication.
          Length = 177

 Score = 29.8 bits (68), Expect = 1.3
 Identities = 15/34 (44%), Positives = 19/34 (55%), Gaps = 3/34 (8%)

Query: 68  EARHQQVAAEHERLRRLRDRVEQQELKLRKLRAL 101
            A   QV AE + LRRLR+R+    L  R +R L
Sbjct: 140 AALAAQVRAERKALRRLRERL---TLLRRDIRKL 170


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 30.7 bits (69), Expect = 1.3
 Identities = 33/198 (16%), Positives = 73/198 (36%), Gaps = 13/198 (6%)

Query: 37  EDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLR 96
           E +   Q+    S+++L   +E   + LK+++   ++   + E L+ L    E+ + +L 
Sbjct: 245 ELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELL 304

Query: 97  KLR----ALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKR 152
           KL         ++ +++     L  +L+  +    E EKEL     K E    + E L++
Sbjct: 305 KLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEK 364

Query: 153 GRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISEL 212
            +             +     L      SE+ +     + E L  K  E     L +   
Sbjct: 365 LQEK---------LEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKLLLELS 415

Query: 213 QERLQRKRLLNQQLASQL 230
           ++     +   ++    +
Sbjct: 416 EQEEDLLKEEKKEELKIV 433


>gnl|CDD|115307 pfam06637, PV-1, PV-1 protein (PLVAP).  This family consists of
           several PV-1 (PLVAP) proteins which seem to be specific
           to mammals. PV-1 is a novel protein component of the
           endothelial fenestral and stomatal diaphragms. The
           function of this family is unknown.
          Length = 442

 Score = 30.4 bits (68), Expect = 1.4
 Identities = 19/76 (25%), Positives = 39/76 (51%), Gaps = 12/76 (15%)

Query: 43  QQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHER-----------LRRLRDRVEQQ 91
           Q+ ++++++   AA+E + +  K+ +AR  ++ AE  R           LR  RD + ++
Sbjct: 305 QRQKLEAEQAAQAAQEAKAKAEKEAQAREAKLQAECARQTQLALEEKAALRAERDNLAKE 364

Query: 92  -ELKLRKLRALRGQVD 106
            E K R+   LR ++ 
Sbjct: 365 LEAKKREAEQLRMELA 380


>gnl|CDD|217372 pfam03105, SPX, SPX domain.  We have named this region the SPX
           domain after (SYG1, Pho81 and XPR1). This 180 residue
           length domain is found at the amino terminus of a
           variety of proteins. In the yeast protein SYG1, the
           N-terminus directly binds to the G- protein beta subunit
           and inhibits transduction of the mating pheromone
           signal. This finding suggests that all the members of
           this family are involved in G-protein associated signal
           transduction. The N-termini of several proteins involved
           in the regulation of phosphate transport, including the
           putative phosphate level sensors PHO81 from
           Saccharomyces cerevisiae and NUC-2 from Neurospora
           crassa, are also members of this family. The SPX domain
           of S. cerevisiae low-affinity phosphate transporters
           Pho87 and Pho90 auto-regulates uptake and prevents
           efflux. This SPX dependent inhibition is mediated by the
           physical interaction with Spl2 NUC-2 contains several
           ankyrin repeats pfam00023. Several members of this
           family are annotated as XPR1 proteins: the xenotropic
           and polytropic retrovirus receptor confers
           susceptibility to infection with murine leukaemia
           viruses (MLV). The similarity between SYG1, phosphate
           regulators and XPR1 sequences has been previously noted,
           as has the additional similarity to several predicted
           proteins, of unknown function, from Drosophila
           melanogaster, Arabidopsis thaliana, Caenorhabditis
           elegans, Schizosaccharomyces pombe, and Saccharomyces
           cerevisiae. In addition, given the similarities between
           XPR1 and SYG1 and phosphate regulatory proteins, it has
           been proposed that XPR1 might be involved in G-protein
           associated signal transduction and may itself function
           as a phosphate sensor.
          Length = 176

 Score = 29.7 bits (67), Expect = 1.4
 Identities = 20/68 (29%), Positives = 32/68 (47%), Gaps = 3/68 (4%)

Query: 115 LSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRREL 174
           L  +LE + + + EKE EL   + ++EEL  QLE+L+  R    S     ++   L+   
Sbjct: 69  LDKELEKVNSFYKEKEAEL---LERLEELEKQLEELEERRDETSSAKSSPSDKNLLKAFE 125

Query: 175 MYHNKLSE 182
                L E
Sbjct: 126 ELKKALLE 133


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 30.2 bits (68), Expect = 1.4
 Identities = 24/124 (19%), Positives = 49/124 (39%), Gaps = 4/124 (3%)

Query: 125 LFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQ 184
           L    + EL  A  + +     L  L+       ++  L +EL+ L++E     +L   +
Sbjct: 30  LLERLDSELRDAEKERDTYKQYLSKLESQNVEISNYEALDSELDELKKE---EERL-LDE 85

Query: 185 NVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQLNSNINSSQHQLPPG 244
              L ++ + L  +  E+     ++   + +  R+  L  +   QL  N+ S + Q    
Sbjct: 86  LEELEKEDDDLDGELVELQEEKEQLENEELQYLREYNLFDRNNLQLEDNLQSLELQYEYS 145

Query: 245 KQQL 248
             QL
Sbjct: 146 LNQL 149



 Score = 27.9 bits (62), Expect = 7.6
 Identities = 23/106 (21%), Positives = 46/106 (43%), Gaps = 13/106 (12%)

Query: 33  LGELEDMAV--RQQAQIDSQRQLIAAKEQRL-RFLKQHEARHQQVAAEHERLRRLRDRVE 89
           L +LE   V       +DS+   +  +E+RL   L++ E     +  E   L+  ++++E
Sbjct: 52  LSKLESQNVEISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLE 111

Query: 90  QQELKLRKLRALRGQVDQNKM--NNISLSSDLESIRALFNEKEKEL 133
            +EL+           + N    NN+ L  +L+S+   +     +L
Sbjct: 112 NEELQY--------LREYNLFDRNNLQLEDNLQSLELQYEYSLNQL 149


>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
           unknown].
          Length = 984

 Score = 30.6 bits (69), Expect = 1.5
 Identities = 36/172 (20%), Positives = 65/172 (37%), Gaps = 21/172 (12%)

Query: 76  AEHERLRRLRDRVEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCM 135
            E + L +   R  Q    L KL+  R ++D+ +        +  +   L   +  E   
Sbjct: 168 KEADSLYKPSGRNPQINQLLEKLKQERNEIDEAE-------KEYATYHKLLESRRAEHAR 220

Query: 136 AVAKVEELTHQLEDLKRGRTHQPSHPHL--SAELEN-LRRELMYHNKLSEQQNVRLHQQR 192
                 EL    + ++  R      P L    +LE  L R             +RL ++ 
Sbjct: 221 LAELRSELRADRDHIRALRDAVELWPRLQEWKQLEQELTRRREELATFPRDGVLRLEKRE 280

Query: 193 ETLTKKQAEVNSVDLRISELQER-----------LQRKRLLNQQLASQLNSN 233
             L K +AE++++ +R++EL++            LQ    L+QQL+    S 
Sbjct: 281 AHLQKTEAEIDALLVRLAELKDLASQLIPAKEAVLQALVRLHQQLSEIKASA 332


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 30.0 bits (68), Expect = 1.6
 Identities = 15/75 (20%), Positives = 31/75 (41%), Gaps = 3/75 (4%)

Query: 80  RLRRLRDRVEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEK---ELCMA 136
           R  +L       E  L+++   R   ++ ++         E  + L   ++    E    
Sbjct: 288 RTEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKEL 347

Query: 137 VAKVEELTHQLEDLK 151
            +K+EE+  +LEDL+
Sbjct: 348 NSKLEEIQKKLEDLE 362


>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
           ATPase domains [General function prediction only].
          Length = 530

 Score = 30.3 bits (69), Expect = 1.6
 Identities = 10/101 (9%), Positives = 35/101 (34%), Gaps = 5/101 (4%)

Query: 51  RQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALRGQVDQNKM 110
            Q      Q     ++ +    +      R +    + ++ + ++++L  L  ++ + + 
Sbjct: 240 EQKAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKRLEKLEARLAEERP 299

Query: 111 NNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLK 151
                  + + +   F    K L   V + E ++   +  +
Sbjct: 300 V-----EEGKPLAFRFPPPGKRLGKLVLEFENVSKGYDGGR 335


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 30.3 bits (69), Expect = 1.7
 Identities = 25/121 (20%), Positives = 48/121 (39%), Gaps = 6/121 (4%)

Query: 141 EELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKL--SEQQNVRLHQQRETLTKK 198
            ++  QL+ L + +  +     +  +LE     L   +K+   +++  +L QQ      K
Sbjct: 39  ADVQAQLDALNKQKLLEAEDKLVQQDLEQ---TLALLDKIDRQKEETEQLKQQLAQAPAK 95

Query: 199 QAEVNSVDLRISELQERLQRKRLLNQQLASQLNSNINSSQHQLPPGKQQLVAQNSQLTSA 258
             +  +    + +  +   R+ L    L  QL S +  +  QL   +  L   NSQL S 
Sbjct: 96  LRQAQAELEALKDDNDEETRETLSTLSLR-QLESRLAQTLDQLQNAQNDLAEYNSQLVSL 154

Query: 259 H 259
            
Sbjct: 155 Q 155



 Score = 29.5 bits (67), Expect = 3.3
 Identities = 27/135 (20%), Positives = 50/135 (37%), Gaps = 25/135 (18%)

Query: 130 EKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLH 189
           +++L  A AK+ +   +LE LK     +      +  L  L  E      L + QN    
Sbjct: 86  KQQLAQAPAKLRQAQAELEALKDDNDEETRETLSTLSLRQL--ESRLAQTLDQLQNA--- 140

Query: 190 QQRETLTKKQAEVNSVDLRISELQERLQR---------KRLLNQQLASQLNSNINSSQHQ 240
                    Q ++   + ++  LQ + +R         +RL  QQ+ + L       +  
Sbjct: 141 ---------QNDLAEYNSQLVSLQTQPERAQAALYANSQRL--QQIRNLLKGGKVGGKAL 189

Query: 241 LPPGKQQLVAQNSQL 255
            P  +  L A+ + L
Sbjct: 190 RPSQRVLLQAEQALL 204


>gnl|CDD|184349 PRK13824, PRK13824, replication initiation protein RepC;
           Provisional.
          Length = 404

 Score = 29.8 bits (68), Expect = 1.8
 Identities = 11/17 (64%), Positives = 15/17 (88%)

Query: 72  QQVAAEHERLRRLRDRV 88
           +QVAAE + LRRLR+R+
Sbjct: 156 EQVAAERKALRRLRERL 172


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 30.0 bits (68), Expect = 2.0
 Identities = 52/244 (21%), Positives = 94/244 (38%), Gaps = 29/244 (11%)

Query: 32  TLGELEDMAVR--QQAQIDSQRQLIAAKEQRLRFL-KQHEARHQQVAAEHERLRRLRDR- 87
            L  LEDM            QR  I  +  R R+L      +  QV     RLRRL D  
Sbjct: 732 VLAALEDMRDAKLDNIATRIQR-AIRGRYLRRRYLQALKRIKKIQVIQHGFRLRRLVDYE 790

Query: 88  -VEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQ 146
              +  +KL+ L +L G   + +       S L  I  L    ++E  +   +  E + +
Sbjct: 791 LKWRLFIKLQPLLSLLGSRKEYR-------SYLACIIKLQKTIKREKKLRETEEVEFSLK 843

Query: 147 LEDLKR--GRTHQPSHPHLSAEL-ENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVN 203
            E L +  GR+ +      S    E +  +     +L+E+Q   L    ++++  +    
Sbjct: 844 AEVLIQKFGRSLKA-KKRFSLLKKETIYLQSAQRVELAERQLQELKIDVKSISSLKLVNL 902

Query: 204 SVDLRISELQERLQRKRLLNQQLASQ-------LNSNINSS-----QHQLPPGKQQLVAQ 251
            ++  I EL++ L    + N +  ++       L +NI+       ++   P   +L   
Sbjct: 903 ELESEIIELKKSLSSDLIENLEFKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEV 962

Query: 252 NSQL 255
            S+L
Sbjct: 963 ESKL 966


>gnl|CDD|221313 pfam11917, DUF3435, Protein of unknown function (DUF3435).  This
           family of proteins are functionally uncharacterized.
           This protein is found in eukaryotes. Proteins in this
           family are typically between 435 to 791 amino acids in
           length. This family is related to pfam00589 suggesting
           it may be an integrase enzyme.
          Length = 418

 Score = 29.6 bits (67), Expect = 2.2
 Identities = 19/73 (26%), Positives = 34/73 (46%), Gaps = 7/73 (9%)

Query: 43  QQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVA-----AEHERLRRLRDRV--EQQELKL 95
           Q+A ++   +L     +R    K+  A + QVA       +ERL + R  V  E+Q L+ 
Sbjct: 280 QKASVEEDPELQELIRKRDHLKKEIIALYGQVAKAKGTPLYERLEKRRREVRNERQRLRR 339

Query: 96  RKLRALRGQVDQN 108
              + +R + D+ 
Sbjct: 340 ELKKKIREEFDEE 352


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 29.9 bits (67), Expect = 2.4
 Identities = 28/181 (15%), Positives = 63/181 (34%), Gaps = 10/181 (5%)

Query: 65  KQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRA 124
           +   ++ + +  E+E +++  +    Q    RK   L    ++    N+ L   L+++  
Sbjct: 62  ELIRSKSKLIQLENELMQKELEHKRAQIELERKASTLAENYERELDRNLELEVRLKALEE 121

Query: 125 LFNEKEKELCMAVAKVE---------ELTHQLEDLKRGRTHQPSHPHLSAELENLRRELM 175
           L  + E E   A  + +          L  Q E   + +  + S   +  +L  ++    
Sbjct: 122 LEKKAENEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESISRIKNDLSEMQCRAQ 181

Query: 176 YHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQLNSNIN 235
             +   +     L + RE L + Q E+   + ++  L    Q     N      L   + 
Sbjct: 182 NADTELKLLESELEELREQLEECQKELAEAEKKLQSLTS-EQASSADNSVKIKHLEEELK 240

Query: 236 S 236
            
Sbjct: 241 R 241


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 29.5 bits (67), Expect = 2.8
 Identities = 42/203 (20%), Positives = 75/203 (36%), Gaps = 45/203 (22%)

Query: 20  KAADLLTEGVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHE 79
           K+ + L + VE  L ++E                I   E+ +  L+      +Q   E E
Sbjct: 82  KSLEELIKDVEEELEKIEKE--------------IKELEEEISELENEIKELEQ---EIE 124

Query: 80  RLRRLRDRVEQQELKLRKLRALR------GQVDQNKMNNISLSSDLESIRALFNEKEKEL 133
           RL    +     +L L  L   +      G V ++K+  + L SD+E++  +  +K    
Sbjct: 125 RLEPWGNF----DLDLSLLLGFKYVSVFVGTVPEDKLEELKLESDVENVEYISTDKGYVY 180

Query: 134 CMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRE 193
            + V   E      E+LK+               E L  EL      SE     + + +E
Sbjct: 181 VVVVVLKELSDEVEEELKK------------LGFERL--ELEEEGTPSE----LIREIKE 222

Query: 194 TLTKKQAEVNSVDLRISELQERL 216
            L + + E  S+   + EL ++ 
Sbjct: 223 ELEEIEKERESLLEELKELAKKY 245


>gnl|CDD|151391 pfam10944, DUF2630, Protein of unknown function (DUF2630).  This
           bacterial family of proteins have no known function.
          Length = 81

 Score = 27.3 bits (61), Expect = 3.1
 Identities = 17/47 (36%), Positives = 22/47 (46%), Gaps = 2/47 (4%)

Query: 56  AKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALR 102
           A+E  LR   Q        + EH RL RL   ++Q    LR+ RA R
Sbjct: 15  AEEHELRAKLQQ--GEIDESEEHARLARLEVALDQCWDLLRQRRARR 59


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 29.2 bits (66), Expect = 3.3
 Identities = 25/151 (16%), Positives = 64/151 (42%), Gaps = 9/151 (5%)

Query: 30  ELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVE 89
           +L    L  +  ++ A     R     +++ LR   + + R +         ++L++ + 
Sbjct: 204 KLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQELERQAEAHEQKLKNELA 263

Query: 90  QQELKLRK--LRALRGQVDQNKMNNIS----LSSDLESIRALFNEKEKELCMAVAKVEEL 143
            Q ++L++   + ++ +V++ +   ++    L+S L+ +    + + +       KV++L
Sbjct: 264 LQAIELQREFNKEIKEKVEEERNGRLAKLAELNSRLKGLEKALDSRSEAE-DENHKVQQL 322

Query: 144 THQLEDLKRG--RTHQPSHPHLSAELENLRR 172
              +E LK         S   L  EL+ L+ 
Sbjct: 323 WLAVEALKSALKSGSAGSPRPLVKELDALKE 353


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 29.1 bits (65), Expect = 3.3
 Identities = 25/106 (23%), Positives = 42/106 (39%), Gaps = 9/106 (8%)

Query: 40  AVRQQAQIDSQRQLIAAKEQRLRFLKQH---EARHQQVAAEHERLRRLRDRVEQQELKLR 96
           AV QQ      +Q  A K ++ R  K+    E    + AAE ERL+      + ++ +L+
Sbjct: 59  AVVQQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLK------QLEKERLK 112

Query: 97  KLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEE 142
                +   +  K   +      E  R    E++K+   A AK   
Sbjct: 113 AQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAA 158


>gnl|CDD|204379 pfam10023, DUF2265, Predicted aminopeptidase (DUF2265).  Members of
           this family of bacterial proteins comprise various
           hypothetical proteins and putative aminopeptidases.
           Their exact function, has not, as yet, been defined.
          Length = 338

 Score = 28.8 bits (65), Expect = 3.9
 Identities = 21/96 (21%), Positives = 32/96 (33%), Gaps = 7/96 (7%)

Query: 26  TEGVELTLGELEDMAVRQQAQIDSQR-----QLIAAKEQRLRFL-KQHEARHQQVAAEHE 79
            EGV   L    D A R +     +R      L+ A   RL  L     A  ++ AA+  
Sbjct: 197 REGVRRWLAARGDPAARARYAAFDERRQQFRALLLATRDRLEALYASPLADERKRAAKAA 256

Query: 80  RLRRLRDRVEQQELKLRKLRALRGQVDQNKMNNISL 115
             ++LR      + +               +NN  L
Sbjct: 257 IFQQLRADYAALKARWGGYAGYDRWFAAP-LNNAKL 291


>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC.  All proteins in this family for
           which functions are known are part of an exonuclease
           complex with sbcD homologs. This complex is involved in
           the initiation of recombination to regulate the levels
           of palindromic sequences in DNA. This family is based on
           the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1042

 Score = 29.2 bits (65), Expect = 4.0
 Identities = 40/188 (21%), Positives = 70/188 (37%), Gaps = 6/188 (3%)

Query: 44  QAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALRG 103
            A +  +R+    + ++ + LKQ  AR +++ A+   L   ++R+ +       L A   
Sbjct: 242 HAYLTQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERINRARKAAP-LAAHIK 300

Query: 104 QVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHL 163
            V Q +     + ++L+S       K      A  K +    +   L +    Q  H   
Sbjct: 301 AVTQIEQQAQRIHTELQSKMRSR-AKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRD 359

Query: 164 SAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQE----RLQRK 219
           + E+    RE+        Q    L QQ+ TLT+K   +      +   Q     R    
Sbjct: 360 AHEVATSIREISCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDTRTSAF 419

Query: 220 RLLNQQLA 227
           R L  QLA
Sbjct: 420 RDLQGQLA 427


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 28.8 bits (65), Expect = 4.6
 Identities = 18/76 (23%), Positives = 36/76 (47%), Gaps = 5/76 (6%)

Query: 76  AEHERLRRLRDRVEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCM 135
              E L+ L + +++ E +L K++ L  ++ +     +      E +  L   KEK    
Sbjct: 331 ELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLP-PEKREQLEKLLETKEK---- 385

Query: 136 AVAKVEELTHQLEDLK 151
              ++EEL  +L++LK
Sbjct: 386 LSEELEELEEELKELK 401


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 28.8 bits (65), Expect = 4.7
 Identities = 20/90 (22%), Positives = 36/90 (40%), Gaps = 3/90 (3%)

Query: 14  PSSSTLKAADLLTEGVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFL-KQHEARHQ 72
           P         L  E V     +LE  A R++AQ  +  +    +   L  L  + E + Q
Sbjct: 137 PEDPENLLHALQQE-VLTLKQQLELQA-REKAQSQALAEAQQQELVALEGLAAELEEKQQ 194

Query: 73  QVAAEHERLRRLRDRVEQQELKLRKLRALR 102
           ++ A+ E+L+       Q+  + RK    +
Sbjct: 195 ELEAQLEQLQEKAAETSQERKQKRKEITDQ 224


>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 429

 Score = 28.7 bits (65), Expect = 5.0
 Identities = 13/82 (15%), Positives = 32/82 (39%), Gaps = 14/82 (17%)

Query: 77  EHERLRRLRDRVEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMA 136
           E E L+  R+ +  +E+             + + +   L ++++ ++    E E  L   
Sbjct: 44  ELEELQAERNEL-SKEIGRAL--------KRGEDDAEELIAEVKELKEKLKELEAALDEL 94

Query: 137 VAKVEELTHQL-----EDLKRG 153
            A+++ L   +     E +  G
Sbjct: 95  EAELDTLLLTIPNIPHESVPVG 116


>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
          Length = 343

 Score = 28.4 bits (64), Expect = 5.0
 Identities = 28/140 (20%), Positives = 52/140 (37%), Gaps = 25/140 (17%)

Query: 129 KEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRL 188
           K+  L    +++ EL   L  L+R          L   + NLR  L      +E +  RL
Sbjct: 51  KDSALDRLNSQIAELADLLS-LERQGNQD-----LQDSVANLRASL----SAAEAERSRL 100

Query: 189 HQQRETLTKKQAEVNSVDLRISELQERLQRKR-----------LLNQQLASQLNSNINSS 237
                 L +      + + R  EL + L  ++           LLNQQ+A  L   + + 
Sbjct: 101 QAL---LAELAGAGAAAEGRAGELAQELDSEKQVSARALAQVELLNQQIA-ALRRQLAAL 156

Query: 238 QHQLPPGKQQLVAQNSQLTS 257
           +  L   +++     +++  
Sbjct: 157 EAALDASEKRDRESQAKIAD 176


>gnl|CDD|237560 PRK13922, PRK13922, rod shape-determining protein MreC;
           Provisional.
          Length = 276

 Score = 28.4 bits (64), Expect = 5.1
 Identities = 12/55 (21%), Positives = 23/55 (41%)

Query: 40  AVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELK 94
                A +   R+     ++ L  L+      +Q+ AE+ RLR L +  E  + +
Sbjct: 64  VFESLASLFDLREENEELKKELLELESRLQELEQLEAENARLRELLNLKESLDYQ 118


>gnl|CDD|215312 PLN02574, PLN02574, 4-coumarate--CoA ligase-like.
          Length = 560

 Score = 28.7 bits (64), Expect = 5.3
 Identities = 12/47 (25%), Positives = 18/47 (38%), Gaps = 1/47 (2%)

Query: 256 TSAHINNVNSLFRSMTSSNIAAVAPYLHVPSKPNLDNTHAIFQNQRR 302
                NN    +   T    ++  P + +PS PNLD    IF +   
Sbjct: 7   AQTSHNNPPFWYSPETGI-YSSKHPPVPLPSDPNLDAVSFIFSHHNH 52


>gnl|CDD|213754 TIGR02971, heterocyst_DevB, ABC exporter membrane fusion protein,
           DevB family.  Members of this protein family are found
           mostly in the Cyanobacteria, but also in the
           Planctomycetes. DevB from Anabaena sp. strain PCC 7120
           is partially characterized as a membrane fusion protein
           of the DevBCA ABC exporter, probably a glycolipid
           exporter, required for heterocyst formation. Most
           Cyanobacteria have one member only, but Nostoc sp. PCC
           7120 has seven members.
          Length = 327

 Score = 28.3 bits (63), Expect = 5.9
 Identities = 25/118 (21%), Positives = 48/118 (40%), Gaps = 9/118 (7%)

Query: 38  DMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRK 97
           D A  + AQ+ +  +      QR           + VAA+   L RL   +E  + ++ +
Sbjct: 65  DEAKARLAQVRAGAKKGEIAAQRAARAAAK--LFKDVAAQQATLNRLEAELETAQREVDR 122

Query: 98  LRALR--GQVDQNKMNNISLSSD-----LESIRALFNEKEKELCMAVAKVEELTHQLE 148
            R+L   G V  + +++ +L        LE   A  +E+      A+A + E   + +
Sbjct: 123 YRSLFRDGAVSASDLDSKALKLRTAEEELEEALASRSEQIDGARAALASLAEEVRETD 180


>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein.  Members of this
           protein family belong to a conserved gene four-gene
           neighborhood found sporadically in a phylogenetically
           broad range of bacteria: Nocardia farcinica,
           Symbiobacterium thermophilum, and Streptomyces
           avermitilis (Actinobacteria), Geobacillus kaustophilus
           (Firmicutes), Azoarcus sp. EbN1 and Ralstonia
           solanacearum (Betaproteobacteria). Proteins in this
           family average over 1400 amino acids in length
           [Hypothetical proteins, Conserved].
          Length = 1353

 Score = 28.6 bits (64), Expect = 6.1
 Identities = 43/201 (21%), Positives = 77/201 (38%), Gaps = 20/201 (9%)

Query: 23  DLLTEGV-ELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAE--HE 79
           D LTE +  L   EL D+A     Q+D  R  +   E   R L+    R+++ A      
Sbjct: 208 DALTEALPPLDDDELTDVA-DALEQLDEYRDELERLEALERALRNFLQRYRRYARTMLRR 266

Query: 80  RLRRLRDRVEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAK 139
           R  RLR    Q +   R L   R +++           +   + A     E+E       
Sbjct: 267 RATRLRSAQTQYDQLSRDLGRARDELET-------AREEERELDARTEALEREA------ 313

Query: 140 VEELTHQLEDLKRGRTHQPSHP--HLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTK 197
            + L  +LE L+    +Q +       A+ E L+       +   +   RL ++R  L +
Sbjct: 314 -DALRTRLEALQGSPAYQDAEELERARADAEALQAAAADARQAIREAESRLEEERRRLDE 372

Query: 198 KQAEVNSVDLRISELQERLQR 218
           +   ++  +  +   +E+L R
Sbjct: 373 EAGRLDDAERELRAAREQLAR 393


>gnl|CDD|220964 pfam11068, DUF2869, Protein of unknown function (DUF2869).  This
          bacterial family of proteins has no known function.
          Length = 131

 Score = 27.2 bits (61), Expect = 6.1
 Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 6/59 (10%)

Query: 42 RQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAE----HERLRRLRDRVEQ-QELKL 95
          ++  Q++ Q Q  A KE R +  +Q E   QQV  E     E+  +L  ++EQ QEL L
Sbjct: 34 QELQQLEFQGQK-AIKEIRKQSAQQIEQIQQQVEQERAKRLEQKNQLLFQIEQVQELPL 91


>gnl|CDD|237481 PRK13723, PRK13723, conjugal transfer pilus assembly protein TraH;
           Provisional.
          Length = 451

 Score = 28.2 bits (63), Expect = 6.1
 Identities = 13/61 (21%), Positives = 26/61 (42%), Gaps = 6/61 (9%)

Query: 204 SVDLRISELQERLQRKRLL------NQQLASQLNSNINSSQHQLPPGKQQLVAQNSQLTS 257
             D+ +  +QE +Q+ R +       + +   L  N+N +Q Q+   + Q+  Q   L  
Sbjct: 364 GYDILLQYIQELIQQARAMLATGNYPEAVMDHLRENLNQAQRQIAAFQSQVQVQQDALLV 423

Query: 258 A 258
            
Sbjct: 424 V 424


>gnl|CDD|225115 COG2205, KdpD, Osmosensitive K+ channel histidine kinase [Signal
           transduction mechanisms].
          Length = 890

 Score = 28.4 bits (64), Expect = 6.2
 Identities = 15/45 (33%), Positives = 24/45 (53%), Gaps = 2/45 (4%)

Query: 39  MAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRR 83
           +A  Q+  +D+    IA   +R+   +  EA   ++AAE ERLR 
Sbjct: 619 LAPEQRRLLDAVLTQIALALERVTLAE--EAEQARLAAERERLRS 661


>gnl|CDD|227515 COG5188, PRP9, Splicing factor 3a, subunit 3 [RNA processing and
          modification].
          Length = 470

 Score = 28.4 bits (63), Expect = 6.3
 Identities = 14/59 (23%), Positives = 23/59 (38%), Gaps = 7/59 (11%)

Query: 36 LEDMAVRQQAQIDSQRQLIAAKEQRLR-------FLKQHEARHQQVAAEHERLRRLRDR 87
          LE+M + + A  +  ++        LR       F       +Q    EHER   L+D+
Sbjct: 11 LEEMEIIENAIAERIQRNPKLPRDELRLERQIRIFENMERISNQIWLVEHERPTGLKDQ 69


>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
           (CALCOCO1) like.  Proteins found in this family are
           similar to the coiled-coil transcriptional coactivator
           protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
           This protein binds to a highly conserved N-terminal
           domain of p160 coactivators, such as GRIP1, and thus
           enhances transcriptional activation by a number of
           nuclear receptors. CALCOCO1 has a central coiled-coil
           region with three leucine zipper motifs, which is
           required for its interaction with GRIP1 and may regulate
           the autonomous transcriptional activation activity of
           the C-terminal region.
          Length = 546

 Score = 28.4 bits (63), Expect = 6.4
 Identities = 31/143 (21%), Positives = 63/143 (44%), Gaps = 14/143 (9%)

Query: 113 ISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRR 172
           ++L  D+  +R+   + E  L  +  K EEL  Q ++ +       S   +SAE   L  
Sbjct: 160 LALEGDVSDMRSRIEQLETALRHSTEKTEELEEQHKEAQS------SSESMSAERNALLA 213

Query: 173 ELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQLNS 232
           +   +    +Q+ + L Q  +TLT+K+ E + V     +++  L+R +   +Q   ++  
Sbjct: 214 QRAEN----QQRILELEQDIQTLTQKKQENDRVLEGTQDIEAELERMKGELKQRLKKMTI 269

Query: 233 NINSSQHQLPPGKQQLVAQNSQL 255
                + +    +  L  +N QL
Sbjct: 270 QRRDEETE----RIDLQLENEQL 288


>gnl|CDD|218661 pfam05622, HOOK, HOOK protein.  This family consists of several
           HOOK1, 2 and 3 proteins from different eukaryotic
           organisms. The different members of the human gene
           family are HOOK1, HOOK2 and HOOK3. Different domains
           have been identified in the three human HOOK proteins,
           and it was demonstrated that the highly conserved
           NH2-domain mediates attachment to microtubules, whereas
           the central coiled-coil motif mediates homodimerisation
           and the more divergent C-terminal domains are involved
           in binding to specific organelles (organelle-binding
           domains). It has been demonstrated that endogenous HOOK3
           binds to Golgi membranes, whereas both HOOK1 and HOOK2
           are localised to discrete but unidentified cellular
           structures. In mice the Hook1 gene is predominantly
           expressed in the testis. Hook1 function is necessary for
           the correct positioning of microtubular structures
           within the haploid germ cell. Disruption of Hook1
           function in mice causes abnormal sperm head shape and
           fragile attachment of the flagellum to the sperm head.
          Length = 713

 Score = 28.2 bits (63), Expect = 6.5
 Identities = 52/233 (22%), Positives = 97/233 (41%), Gaps = 15/233 (6%)

Query: 33  LGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARH----QQVAAEHERLRRLRDRV 88
           L E  D A + +AQ+++ ++ +       R +K  E R+    Q      E L++     
Sbjct: 303 LRESSDKAKKLEAQVETYKKKLEDLNDLRRQVKLLEERNAMYMQNTVQLEEELKKANAAR 362

Query: 89  EQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLE 148
            Q E   R+++ L  ++ +       L  + + +       +KE    +A+ + L    E
Sbjct: 363 GQLETYKRQVQELHAKLSEESKKADKLEFEYKRLEEKLEALQKEKERLLAERDSLRETNE 422

Query: 149 DLKRGRTHQ----PSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNS 204
           +L+ G+  Q     +    S   +NL  EL+     SE +   +  Q E    +  +  S
Sbjct: 423 ELRCGQAQQDQLTQADAGTSPSGDNLAAELLP----SEYREKLIRLQHENKMLRLGQEGS 478

Query: 205 VDLRISELQERLQRKRLLNQQLASQLNSN---INSSQHQLPPGKQQLVAQNSQ 254
            + RI+ELQ+ L+     N +L +QL      I   Q Q+   ++ L  Q S+
Sbjct: 479 ENERITELQQLLEDANRRNNELETQLRLANQRILELQQQVEDLQKALQEQGSK 531


>gnl|CDD|153183 cd04911, ACT_AKiii-YclM-BS_1, ACT domains located C-terminal to the
           catalytic domain of the lysine plus threonine-sensitive
           aspartokinase isoenzyme AKIII.  This CD includes the
           first of two ACT domains located C-terminal to the
           catalytic domain of the lysine plus threonine-sensitive
           aspartokinase isoenzyme AKIII, a monofunctional class
           enzyme found in Bacilli (Bacillus subtilis (BS) YclM)
           and Clostridia species. Aspartokinase is the first
           enzyme in the aspartate metabolic pathway and catalyzes
           the conversion of aspartate and ATP to aspartylphosphate
           and ADP. Bacillus subtilis YclM is reported to be a
           single polypeptide of 50 kD. AKIII from Bacillus
           subtilis strain 168 is induced by lysine and repressed
           by threonine and it is synergistically inhibited by
           lysine and threonine. Members of this CD belong to the
           superfamily of ACT regulatory domains.
          Length = 76

 Score = 26.4 bits (59), Expect = 6.6
 Identities = 15/51 (29%), Positives = 22/51 (43%), Gaps = 10/51 (19%)

Query: 281 YLHVPSKPNLDNTHAIFQNQRRIESGHNQLNKVVCFDFVPEISEKCAVDEI 331
           Y H+PS   +D+   I ++        NQL        + EI E+   DEI
Sbjct: 32  YEHMPS--GIDDISIIIRD--------NQLTDEKEQKILAEIKEELHPDEI 72


>gnl|CDD|224120 COG1199, DinG, Rad3-related DNA helicases [Transcription / DNA
           replication, recombination, and repair].
          Length = 654

 Score = 28.2 bits (63), Expect = 6.7
 Identities = 23/137 (16%), Positives = 47/137 (34%), Gaps = 13/137 (9%)

Query: 35  ELEDMAVRQQAQID-SQRQLIAAKEQRLRFLKQHEARHQQVAAE----HERLRRLRDRVE 89
            L D   R    I  S+R L    ++     +  E   +++        + L  LR+ + 
Sbjct: 231 NLPD-IARSALSIRLSERTLERLLKEIQALGETLEKDLKRLEDLADRLEKALEDLRELLI 289

Query: 90  QQ-----ELKLRKLRALRGQVDQNKMNNI--SLSSDLESIRALFNEKEKELCMAVAKVEE 142
                   L+ R    L  +  +  +  +  +L   L+++  L    + EL    + +E 
Sbjct: 290 FDVDELGNLRERLREQLSSEEAKEALGKLEEALLEKLKNLSELLGLSQNELDRPTSILER 349

Query: 143 LTHQLEDLKRGRTHQPS 159
           L  +L+ L         
Sbjct: 350 LKEELDRLLSRELLLSD 366


>gnl|CDD|203795 pfam07889, DUF1664, Protein of unknown function (DUF1664).  The
           members of this family are hypothetical plant proteins
           of unknown function. The region featured in this family
           is approximately 100 amino acids long.
          Length = 125

 Score = 27.0 bits (60), Expect = 6.8
 Identities = 16/57 (28%), Positives = 28/57 (49%)

Query: 162 HLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQR 218
           HLS  ++NL  +L    ++SE     + + RE L+    +V SV   +  L+ +L  
Sbjct: 64  HLSQRIDNLDDKLDEQKEISESTRDEVTEIREDLSNIGEDVKSVHQAVEGLEGKLDS 120


>gnl|CDD|213363 cd12829, Alr1p-like, Saccharomyces cerevisiae Alr1p-like subfamily.
            This eukaryotic subfamily belongs to the Thermotoga
           maritima CorA (TmCorA)-family of the MIT superfamily of
           essential membrane proteins involved in transporting
           divalent cations (uptake or efflux) across membranes.
           This subfamily includes three Saccharomyces cerevisiae
           members: two plasma membrane proteins, the Mg2+
           transporter Alr1p/Swc3p and the putative Mg2+
           transporter, Alr2p, and the vacuole membrane protein
           Mnr2p, a putative Mg2+ transporter. Thermotoga maritima
           CorA forms funnel-shaped homopentamers, the tip of the
           funnel is formed from two C-terminal transmembrane (TM)
           helices from each monomer, and the large opening of the
           funnel from the N-terminal cytoplasmic domains. The GMN
           signature motif of the MIT superfamily occurs just after
           TM1, mutation within this motif is known to abolish Mg2+
           transport by Alr1p. Natural variants in this signature
           sequence may be associated with the transport of
           different divalent cations. The functional diversity of
           the MIT superfamily may also be due to minor structural
           differences regulating gating, substrate selection, and
           transport.
          Length = 305

 Score = 28.0 bits (63), Expect = 8.0
 Identities = 10/41 (24%), Positives = 19/41 (46%)

Query: 46  QIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRD 86
           ++ S  +L+ AK   L+ L +   +   V+ + E    L D
Sbjct: 160 RVTSLLRLLFAKSDVLKQLLKRLMKKTFVSPDSEVAIYLGD 200


>gnl|CDD|202101 pfam02050, FliJ, Flagellar FliJ protein. 
          Length = 122

 Score = 26.8 bits (60), Expect = 8.2
 Identities = 20/111 (18%), Positives = 38/111 (34%), Gaps = 5/111 (4%)

Query: 119 LESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHN 178
           L   +    + E++L        E   QL    +G +      +  A +  L   +    
Sbjct: 7   LAEAQQELEQAEEKLEELQQYRLEYRQQLSGSGQGISAAELR-NYQAFISALDEAIAQQQ 65

Query: 179 KLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQ 229
           +  EQ   ++ Q RE   +   E   ++     L ER ++K    +    Q
Sbjct: 66  QELEQAEKQVEQAREQWQEANQERKKLEK----LLEREKKKEQKEENRREQ 112


>gnl|CDD|219176 pfam06785, UPF0242, Uncharacterized protein family (UPF0242). 
          Length = 401

 Score = 27.9 bits (62), Expect = 8.3
 Identities = 51/221 (23%), Positives = 95/221 (42%), Gaps = 47/221 (21%)

Query: 50  QRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALRGQVDQNK 109
           +RQL  A  Q L      + + +++  + E LR +R+ +E+++           + DQ K
Sbjct: 62  ERQLKTAAGQLL------QTKIRKLTEQDEGLRNIRESIEERQK----------ESDQLK 105

Query: 110 MNNISLSSDLESIRALF----NEKEKELCMAVAKVEE---LTHQLEDLKRGRTHQPSHPH 162
           + N  L + L   R +F       +K   ++    EE   L  QL+ L +    +     
Sbjct: 106 LQNQKLVNQLGHARGVFIKTKGRYQKLEELSRHLKEENQCLQIQLDALVQECNEK----- 160

Query: 163 LSAELENLRREL----MYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQR 218
              E + L REL     Y  +L+++      +Q   L K+QA +  ++ ++ +L   +  
Sbjct: 161 -IEENQELNRELAETLAYQQELNDEYQATFVEQHNMLDKRQAYIGKLEAKVQDLMCEI-- 217

Query: 219 KRLLNQQLASQLNSNINSSQHQLPPGKQQLVAQN--SQLTS 257
           + LL  QL S +  N+        PGK    +++  +QL S
Sbjct: 218 RNLL--QLESGIKENL--------PGKPVAASRDVVAQLVS 248


>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family.  This
           family consists of the 116kDa V-type ATPase (vacuolar
           (H+)-ATPases) subunits, as well as V-type ATP synthase
           subunit i. The V-type ATPases family are proton pumps
           that acidify intracellular compartments in eukaryotic
           cells for example yeast central vacuoles,
           clathrin-coated and synaptic vesicles. They have
           important roles in membrane trafficking processes. The
           116kDa subunit (subunit a) in the V-type ATPase is part
           of the V0 functional domain responsible for proton
           transport. The a subunit is a transmembrane glycoprotein
           with multiple putative transmembrane helices it has a
           hydrophilic amino terminal and a hydrophobic carboxy
           terminal. It has roles in proton transport and assembly
           of the V-type ATPase complex. This subunit is encoded by
           two homologous gene in yeast VPH1 and STV1.
          Length = 707

 Score = 28.1 bits (63), Expect = 9.0
 Identities = 11/73 (15%), Positives = 28/73 (38%), Gaps = 5/73 (6%)

Query: 163 LSAELENLRRELMYHNKLSEQQNVRLHQQR-ETLTKKQAEVNSVDLRISELQERLQRKRL 221
           L ++++ L   L       +    +      E +   +AE+  V+    E  E L+++  
Sbjct: 43  LESKIKKLGIPLKDTGGKPDVPPSKEFLDLEEEILDLEAEIKEVE----ENLESLEKEIN 98

Query: 222 LNQQLASQLNSNI 234
             ++  + L+   
Sbjct: 99  ELEEWLNVLDEEK 111


>gnl|CDD|218184 pfam04632, FUSC, Fusaric acid resistance protein family.  This
           family includes a conserved region found in two proteins
           associated with fusaric acid resistance,from
           Burkholderia cepacia and Klebsiella oxytoca. These
           proteins are likely to be membrane transporter proteins.
          Length = 649

 Score = 28.0 bits (63), Expect = 9.2
 Identities = 16/89 (17%), Positives = 23/89 (25%), Gaps = 8/89 (8%)

Query: 20  KAADLLTEGV-ELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFL------KQHEARHQ 72
                L +G+  L +G    + +R+           AA    L  L        H     
Sbjct: 545 ARRRALRDGLAALRVG-RAVLRLRRALAAALPAPARAALRAVLAALARLFRAAAHGRALA 603

Query: 73  QVAAEHERLRRLRDRVEQQELKLRKLRAL 101
             AA    L      +          RAL
Sbjct: 604 APAALLRALDAALAALAADPAPAALRRAL 632


>gnl|CDD|218827 pfam05960, DUF885, Bacterial protein of unknown function (DUF885). 
           This family consists of several hypothetical bacterial
           proteins several of which are putative membrane
           proteins.
          Length = 539

 Score = 27.7 bits (62), Expect = 9.2
 Identities = 15/80 (18%), Positives = 34/80 (42%), Gaps = 7/80 (8%)

Query: 26  TEGVELTLGELEDMAVRQQAQIDSQRQLIAAK-------EQRLRFLKQHEARHQQVAAEH 78
               +LT  E+  + + + A+I ++ + IA +       ++ L  L+  +  +       
Sbjct: 232 YTTTDLTPEEIHQLGLAEVARIRAEMEAIARQVGFTGDLQEFLAALRTDQFYYPTDPGRD 291

Query: 79  ERLRRLRDRVEQQELKLRKL 98
           E L   R+ ++  + KL  L
Sbjct: 292 ELLDEARELLDAIDAKLPDL 311


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 27.7 bits (62), Expect = 9.6
 Identities = 52/239 (21%), Positives = 79/239 (33%), Gaps = 32/239 (13%)

Query: 40  AVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLR 99
           AV +Q Q + Q     A+   L   K+ EA  +  A +  RL   R + EQQ LK +   
Sbjct: 619 AVAKQKQAEEQLVQANAE---LEEQKRAEAEART-ALKQARLDLQRLQNEQQSLKDKLEL 674

Query: 100 AL-------RGQVD----QNKMNNISLSSDLESIRALFNE---KEKELCMAVAKVEELTH 145
           A+         Q+     Q K       + LE+++  F E   +       V    EL +
Sbjct: 675 AIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRTERLAKWQVV--EGELDN 732

Query: 146 QLEDLKRG-RTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKK------ 198
           QL  L       +        EL+      +    +       L +Q E L         
Sbjct: 733 QLAQLSAAIEAARTQAKARLKELKKQYDRELASLDVDPNTVKELKRQIEELETTIERIAV 792

Query: 199 -QAEVNSVDLRISELQERLQRKRLLNQQLASQLNSNINSSQHQLPPGKQQLVAQNSQLT 256
            + EV      + E        R     LA QL   + SS  +L   +Q+L        
Sbjct: 793 RRPEVREYRAFMQETWLHRDSLREERPNLAIQL-RELESSAEEL---QQELTRLIKDTK 847


>gnl|CDD|119423 cd05163, TRRAP, TRansformation/tRanscription domain-Associated
           Protein (TRRAP), pseudokinase domain; The TRRAP
           catalytic domain is part of a larger superfamily that
           includes the catalytic domains of other kinases such as
           the typical serine/threonine/tyrosine protein kinases
           (PKs), aminoglycoside phosphotransferase, choline
           kinase, and RIO kinases. TRRAP shows some similarity to
           members of the phosphoinositide 3-kinase-related protein
           kinase (PIKK) subfamily in that it contains a FATC
           (FRAP, ATM and TRRAP, C-terminal) domain and has a large
           molecular weight. Unlike PIKK proteins, however, it
           contains an inactive PI3K-like pseudokinase domain,
           which lacks the conserved residues necessary for ATP
           binding and catalytic activity. TRRAP also contains many
           motifs that may be critical for protein-protein
           interactions. TRRAP is a common component of many
           histone acetyltransferase (HAT) complexes, and is
           responsible for the recruitment of these complexes to
           chromatin during transcription, replication, and DNA
           repair. TRRAP also exists in non-HAT complexes such as
           the p400 and MRN complexes, which are implicated in
           ATP-dependent remodeling and DNA repair, respectively.
          Length = 253

 Score = 27.6 bits (62), Expect = 9.7
 Identities = 12/37 (32%), Positives = 17/37 (45%), Gaps = 1/37 (2%)

Query: 208 RISELQER-LQRKRLLNQQLASQLNSNINSSQHQLPP 243
           R +  +ER LQ  R LN  L+    +   + Q  LP 
Sbjct: 41  RQARREERVLQLFRTLNSVLSKNKETRRRNLQFTLPL 77


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.314    0.127    0.341 

Gapped
Lambda     K      H
   0.267   0.0689    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 16,422,126
Number of extensions: 1589139
Number of successful extensions: 3992
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3420
Number of HSP's successfully gapped: 606
Length of query: 332
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 235
Effective length of database: 6,635,264
Effective search space: 1559287040
Effective search space used: 1559287040
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 59 (26.6 bits)