RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy7684
(332 letters)
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 57.8 bits (140), Expect = 4e-09
Identities = 44/225 (19%), Positives = 82/225 (36%), Gaps = 10/225 (4%)
Query: 8 LRNRPYPSSSTLKAADLLTEGVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQH 67
LR S + A +E + +LE+ + ++ I E+RL ++
Sbjct: 717 LRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEE 776
Query: 68 EARH-QQVAAEHERLRRLRDRVEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALF 126
A ++ ++ +L++ ++ L LR ++ +L LES+
Sbjct: 777 LAEAEAEIEELEAQIEQLKEELKALR---EALDELRAELTLLNEEAANLRERLESLERRI 833
Query: 127 NEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNV 186
E+ L ++EEL+ +E L L +E L EL
Sbjct: 834 AATERRLEDLEEQIEELSEDIESL------AAEIEELEELIEELESELEALLNERASLEE 887
Query: 187 RLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQLN 231
L R L + E+ ++ + SEL+ L+ R QL +L
Sbjct: 888 ALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLE 932
Score = 54.3 bits (131), Expect = 5e-08
Identities = 58/249 (23%), Positives = 108/249 (43%), Gaps = 21/249 (8%)
Query: 16 SSTLKAADLLTEGVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVA 75
++ L+ + E + L + ELE+ Q ++ + I+ EQ+ + L++ R +
Sbjct: 259 TAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRE---RLANLE 315
Query: 76 AEHERLRRLRDRVEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCM 135
+ E L + +E + +L + A + + L +LES+ A E E EL
Sbjct: 316 RQLEELEAQLEELESKLDELAEELAELEEKLE------ELKEELESLEAELEELEAELEE 369
Query: 136 AVAKVEELTHQLEDLKRGRT-HQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRET 194
+++EEL QLE L+ + L+ E+E L L + E + RL Q+ E
Sbjct: 370 LESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARL----ERLEDRRERLQQEIEE 425
Query: 195 LTKKQ--AEVNSVDLRISELQERL----QRKRLLNQQLASQLNSNINSSQHQLPPGKQQL 248
L KK AE+ + + EL+E L + L + L + ++ L +++L
Sbjct: 426 LLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREE-LEEAEQALDAAEREL 484
Query: 249 VAQNSQLTS 257
++L S
Sbjct: 485 AQLQARLDS 493
Score = 45.0 bits (107), Expect = 4e-05
Identities = 47/204 (23%), Positives = 83/204 (40%), Gaps = 11/204 (5%)
Query: 16 SSTLKAADLLTEGVELTLGELEDMAVRQQAQIDSQRQLIA-AKEQRLRFLKQHEARHQQV 74
S L + E +E L E E+ +A+I+ I KE+ + + ++
Sbjct: 753 SKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAEL 812
Query: 75 AAEHERLRRLRDRVEQQE----LKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKE 130
+E LR+R+E E R+L L Q+++ + SL++++E + L E E
Sbjct: 813 TLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELE 872
Query: 131 KELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQ 190
EL A + E E L R+ LS EL L + + E+ +L Q
Sbjct: 873 SEL---EALLNERASLEEALALLRSELEE---LSEELRELESKRSELRRELEELREKLAQ 926
Query: 191 QRETLTKKQAEVNSVDLRISELQE 214
L + ++++ R+SE
Sbjct: 927 LELRLEGLEVRIDNLQERLSEEYS 950
Score = 43.9 bits (104), Expect = 1e-04
Identities = 43/205 (20%), Positives = 85/205 (41%), Gaps = 24/205 (11%)
Query: 29 VELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEAR-HQQVAAEHERLRRLRDR 87
+E + ELE+ + + R+ + E+ L L++ +Q++A + L RL
Sbjct: 682 LEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAE 741
Query: 88 VEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQL 147
VEQ E ++ +L +++ L LE E E E+ A++E+L +L
Sbjct: 742 VEQLEERIAQLSKELTELEAEIE---ELEERLEEAEEELAEAEAEIEELEAQIEQLKEEL 798
Query: 148 EDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDL 207
+ L+ L+ LR EL N RE L + + + +
Sbjct: 799 KALRE-------------ALDELRAEL-------TLLNEEAANLRERLESLERRIAATER 838
Query: 208 RISELQERLQRKRLLNQQLASQLNS 232
R+ +L+E+++ + LA+++
Sbjct: 839 RLEDLEEQIEELSEDIESLAAEIEE 863
Score = 41.6 bits (98), Expect = 5e-04
Identities = 31/160 (19%), Positives = 66/160 (41%), Gaps = 10/160 (6%)
Query: 80 RLRRLRDRVEQQELKLRKLR----ALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCM 135
+ L +++E+ E K+ +L LR ++++ + L +LE + + K+L
Sbjct: 678 EIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLAR 737
Query: 136 AVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETL 195
A+VE+L ++ L + T L AE+E L L + + + + +
Sbjct: 738 LEAEVEQLEERIAQLSKELT------ELEAEIEELEERLEEAEEELAEAEAEIEELEAQI 791
Query: 196 TKKQAEVNSVDLRISELQERLQRKRLLNQQLASQLNSNIN 235
+ + E+ ++ + EL+ L L +L S
Sbjct: 792 EQLKEELKALREALDELRAELTLLNEEAANLRERLESLER 831
Score = 30.0 bits (68), Expect = 2.0
Identities = 33/150 (22%), Positives = 65/150 (43%), Gaps = 3/150 (2%)
Query: 40 AVRQQAQIDSQRQLIAAKEQRLRFLKQH-EARHQQVAAEHERLRRLRDRVEQQELKLRKL 98
+ AQ++ Q + + +RL + E R +++ E E L + + E +EL+ +L
Sbjct: 384 LRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQ-AEL 442
Query: 99 RALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQP 158
L ++++ + L LE +R E E+ L A ++ +L +L+ L+R + +
Sbjct: 443 EELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLE 502
Query: 159 SHPHLSAELENLRREL-MYHNKLSEQQNVR 187
L + L LSE +V
Sbjct: 503 GFSEGVKALLKNQSGLSGILGVLSELISVD 532
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 45.1 bits (107), Expect = 3e-05
Identities = 36/176 (20%), Positives = 82/176 (46%), Gaps = 8/176 (4%)
Query: 45 AQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAE----HERLRRLRDRVEQQELKLRKLRA 100
+I+ + I E +K+ E ++V E L LR+ +E+ E ++++L
Sbjct: 176 RRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEE 235
Query: 101 LRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSH 160
L+ ++++ + SL + E E+ + ++EEL ++++LK + +
Sbjct: 236 LKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEY 295
Query: 161 PHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERL 216
LS E EL ++ E++ RL ++ + ++ E+ + R+ EL+++L
Sbjct: 296 IKLSEFYEEYLDEL---REI-EKRLSRLEEEINGIEERIKELEEKEERLEELKKKL 347
Score = 36.6 bits (85), Expect = 0.019
Identities = 45/220 (20%), Positives = 88/220 (40%), Gaps = 35/220 (15%)
Query: 28 GVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDR 87
G ELT +++ A++ + + E++ R L++ ++V + L +L++
Sbjct: 442 GRELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKEL 501
Query: 88 VEQ---------------QELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEK--- 129
EQ E K + L+ ++ + K SL +LE + L +
Sbjct: 502 AEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAEL 561
Query: 130 EKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNV--- 186
EK+L ++ EL +LE+L S L L+ L ++N+ E ++
Sbjct: 562 EKKLDELEEELAELLKELEELGFE-----SVEELEERLKELEP---FYNEYLELKDAEKE 613
Query: 187 ------RLHQQRETLTKKQAEVNSVDLRISELQERLQRKR 220
L + E L K E+ + R+ EL++ L+
Sbjct: 614 LEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELE 653
Score = 33.1 bits (76), Expect = 0.23
Identities = 37/187 (19%), Positives = 81/187 (43%), Gaps = 21/187 (11%)
Query: 51 RQLIAAKEQRLR-FLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALRGQVDQNK 109
+ I + +RL F+K+ E + + + + L + + + +L +LR ++++
Sbjct: 171 IKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEV 230
Query: 110 MNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELEN 169
+LE ++ E EKEL +L ++ +L+ ELE
Sbjct: 231 -------KELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKE---IEELEE 280
Query: 170 LRREL----------MYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRK 219
+EL + ++ E+ L + + L++ + E+N ++ RI EL+E+ +R
Sbjct: 281 KVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERL 340
Query: 220 RLLNQQL 226
L ++L
Sbjct: 341 EELKKKL 347
Score = 32.7 bits (75), Expect = 0.29
Identities = 36/197 (18%), Positives = 80/197 (40%), Gaps = 15/197 (7%)
Query: 27 EGVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRD 86
E +E L LE + + +I + I ++ + L++ +++ + E +L +
Sbjct: 241 EELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSE 300
Query: 87 RVEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNE---KEKELCMAVAKVEEL 143
E+ +LR++ ++++ +LE E K KEL + ++EE
Sbjct: 301 FYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEER 360
Query: 144 THQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVN 203
E+ K ELE L++ L E+ L + + + + E++
Sbjct: 361 HELYEEAKA----------KKEELERLKKRLTGLTP--EKLEKELEELEKAKEEIEEEIS 408
Query: 204 SVDLRISELQERLQRKR 220
+ RI EL++ ++ +
Sbjct: 409 KITARIGELKKEIKELK 425
Score = 28.5 bits (64), Expect = 6.4
Identities = 23/95 (24%), Positives = 45/95 (47%), Gaps = 1/95 (1%)
Query: 119 LESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHN 178
L+ + ++ +EL ++EEL +LE+L++ + + + L E L REL
Sbjct: 621 LKKLEEELDKAFEELAETEKRLEELRKELEELEK-KYSEEEYEELREEYLELSRELAGLR 679
Query: 179 KLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQ 213
E+ R + ++TL K + E+ + EL+
Sbjct: 680 AELEELEKRREEIKKTLEKLKEELEEREKAKKELE 714
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 44.7 bits (106), Expect = 5e-05
Identities = 46/225 (20%), Positives = 94/225 (41%), Gaps = 26/225 (11%)
Query: 33 LGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLK-QHEARHQQVAAEHERLRRLRDRVEQQ 91
L EL + Q ++ ++ I E + L+ + E ++ ERL L++++E
Sbjct: 276 LEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEAL 335
Query: 92 ELKLRKLRALRGQVDQNKMNNISLSSDLESIR-ALFNEKEKELCMAVAKVEELTHQLEDL 150
+ +L + L +++Q +LE AL E E+ ++ EL +L ++
Sbjct: 336 KEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEI 395
Query: 151 KRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRIS 210
+ ELE L+RE+ L E+ RL ++ E L ++ E+ + +
Sbjct: 396 RN-------------ELEELKREI---ESLEERLE-RLSERLEDLKEELKELEA---ELE 435
Query: 211 ELQERLQRKRLLNQQLASQLNSNINSSQHQLPPGKQQLVAQNSQL 255
ELQ L+ + +L + + +L +++L +L
Sbjct: 436 ELQTELEELN----EELEELEEQLEELRDRLKELERELAELQEEL 476
Score = 43.2 bits (102), Expect = 2e-04
Identities = 40/212 (18%), Positives = 90/212 (42%), Gaps = 12/212 (5%)
Query: 50 QRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALRGQVDQNK 109
+ +L + Q + ++ ++ ++ + + L LR ++E+ E R+L L+ ++ +
Sbjct: 673 EEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELE---RQLEELKRELAALE 729
Query: 110 MNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELEN 169
L S LE + E E+EL ++EEL +LE L+ L E+E
Sbjct: 730 EELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALA------KLKEEIEE 783
Query: 170 LRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQ 229
L + + E+ L + L + E+ S++ R L++ ++ ++L +
Sbjct: 784 LEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEK 843
Query: 230 LNS---NINSSQHQLPPGKQQLVAQNSQLTSA 258
L+ + + +L K++L ++
Sbjct: 844 LDELEEELEELEKELEELKEELEELEAEKEEL 875
Score = 42.8 bits (101), Expect = 2e-04
Identities = 48/227 (21%), Positives = 101/227 (44%), Gaps = 5/227 (2%)
Query: 16 SSTLKAADLLTEGVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVA 75
LK+ +E L EL + Q++ ++ +AA E+ L +Q ++R +++
Sbjct: 687 EEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEEL---EQLQSRLEELE 743
Query: 76 AEHERLRRLRDRVEQQELKLR-KLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELC 134
E E L + ++++ +L +L +L + + K L ++++ E E+EL
Sbjct: 744 EELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELE 803
Query: 135 MAVAKVEELTHQLEDLK-RGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRE 193
A +++ L +LE L+ R + L E+E L +L + E+ L + +E
Sbjct: 804 EAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKE 863
Query: 194 TLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQLNSNINSSQHQ 240
L + +AE ++ + EL+E + ++L S+L +
Sbjct: 864 ELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKL 910
Score = 42.4 bits (100), Expect = 3e-04
Identities = 50/237 (21%), Positives = 104/237 (43%), Gaps = 15/237 (6%)
Query: 27 EGVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLK-QHEARHQQVAAEHERLRRLR 85
+E L +LE+ + ++ S L+ ++L L+ Q E +++AA E L +L+
Sbjct: 677 AELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQ 736
Query: 86 DRVEQQELKLRKLRA----LRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVE 141
R+E+ E +L +L L+ ++++ + SL L ++ E E++ ++E
Sbjct: 737 SRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELE 796
Query: 142 ELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAE 201
EL +LE+ +R L ELE+L + EQ+ L ++ E L +K E
Sbjct: 797 ELEEELEEAERRLDA------LERELESLEQRRERL----EQEIEELEEEIEELEEKLDE 846
Query: 202 VNSVDLRISELQERLQRKRLLNQQLASQLNSNINSSQHQLPPGKQQLVAQNSQLTSA 258
+ + + E L+ + + +L + + + +++L S+L
Sbjct: 847 LEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAEL 903
Score = 41.2 bits (97), Expect = 6e-04
Identities = 43/196 (21%), Positives = 81/196 (41%), Gaps = 10/196 (5%)
Query: 30 ELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVE 89
+ + ELE+ Q +++ + + E+RL L++ +Q E+ + + E
Sbjct: 778 KEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQ--EIEELEE 835
Query: 90 QQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLED 149
+ E KL L ++++ + L +LE + A E E EL + EEL +L +
Sbjct: 836 EIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRE 895
Query: 150 LKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTK--KQAEVNSVDL 207
L+ L E+E LR L E+ V L + E L + + ++
Sbjct: 896 LESELA------ELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELER 949
Query: 208 RISELQERLQRKRLLN 223
I L+E ++ +N
Sbjct: 950 EIERLEEEIEALGPVN 965
Score = 38.9 bits (91), Expect = 0.003
Identities = 40/212 (18%), Positives = 82/212 (38%), Gaps = 32/212 (15%)
Query: 27 EGVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQ--------VAAEH 78
E +E L EL++ + +++ + L+ EQ L L++ + ++ +
Sbjct: 319 EELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELF 378
Query: 79 ERLR-RLRDRVEQQELKLRKLRALRGQVDQNKM--------------NNISLSSDLESIR 123
E LR L + + +L L+ +++ + L ++LE ++
Sbjct: 379 EALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQ 438
Query: 124 ALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKL--- 180
E +EL ++EEL +L++L+R L EL+ L +EL
Sbjct: 439 TELEELNEELEELEEQLEELRDRLKELER------ELAELQEELQRLEKELSSLEARLDR 492
Query: 181 SEQQNVRLHQQRETLTKKQAEVNSVDLRISEL 212
E + R L ++ + V ++EL
Sbjct: 493 LEAEQRASQGVRAVLEALESGLPGVYGPVAEL 524
Score = 37.4 bits (87), Expect = 0.009
Identities = 41/229 (17%), Positives = 85/229 (37%), Gaps = 19/229 (8%)
Query: 50 QRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALRGQVDQNK 109
+L +++ ++ +++ E L +E+ + +L +LR ++ +
Sbjct: 231 LAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEEL 290
Query: 110 MNNISLSSDLESIRALFNEKEKELC----MAVAKVEELTHQLEDLKRGRTHQPS------ 159
+ +LE +L E+ +EL ++EEL ++E LK + +
Sbjct: 291 LELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELE 350
Query: 160 --HPHLSAELENLRRELMYHNK----LSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQ 213
L E L +L + L E L + L + + E+ + I L+
Sbjct: 351 QLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLE 410
Query: 214 ERLQRKRLLNQQLASQ---LNSNINSSQHQLPPGKQQLVAQNSQLTSAH 259
ERL+R + L + L + + Q +L ++L QL
Sbjct: 411 ERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELR 459
Score = 35.5 bits (82), Expect = 0.037
Identities = 35/144 (24%), Positives = 64/144 (44%), Gaps = 10/144 (6%)
Query: 115 LSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRREL 174
L +L + A + E+EL ++ L LE+L+R L +LE L+REL
Sbjct: 672 LEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRR------QLEELERQLEELKREL 725
Query: 175 MYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQLNSNI 234
+ EQ RL + E L + + E+ + R+ EL+E L+ ++ ++L I
Sbjct: 726 AALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESL----EEALAKLKEEI 781
Query: 235 NSSQHQLPPGKQQLVAQNSQLTSA 258
+ + +++L +L A
Sbjct: 782 EELEEKRQALQEELEELEEELEEA 805
>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
protein, PEP-CTERM locus subfamily. Members of this
protein family belong to the family of polysaccharide
chain length determinant proteins (pfam02706). All are
found in species that encode the PEP-CTERM/exosortase
system predicted to act in protein sorting in a number
of Gram-negative bacteria, and are found near the epsH
homolog that is the putative exosortase gene [Cell
envelope, Biosynthesis and degradation of surface
polysaccharides and lipopolysaccharides].
Length = 498
Score = 44.3 bits (105), Expect = 5e-05
Identities = 52/219 (23%), Positives = 90/219 (41%), Gaps = 27/219 (12%)
Query: 24 LLTEGVELTLGEL-EDMAVRQQ---AQIDSQRQLIAAKEQRLRFLKQH--EARHQQVAAE 77
LLT VE TLG +D Q+ QI + + + A E RL+ KQ Q
Sbjct: 143 LLTIFVEETLGSKRQDSDSAQRFIDEQIKTYEKKLEAAENRLKAFKQENGGILPDQEGDY 202
Query: 78 HERLRRLRDRVEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAV 137
+ + ++ +E L+L + A R + + L + + A + EL
Sbjct: 203 YSEISEAQEELEAARLELNEAIAQRDALKRQ------LGGEEPVLLAGSSVANSELDG-- 254
Query: 138 AKVEELTHQLEDLKRGRTHQPSHPHLSA---ELENLRRELMYHNKL-------SEQQNVR 187
++E L QL+ L+ T + HP + A E+ L + E N
Sbjct: 255 -RIEALEKQLDALRLRYTDK--HPDVIATKREIAQLEEQKEEEGSAKNGGPERGEIANPV 311
Query: 188 LHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQL 226
Q + L + +AE+ S++ R++EL R++R L + +
Sbjct: 312 YQQLQIELAEAEAEIASLEARVAELTARIERLESLLRTI 350
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 43.1 bits (102), Expect = 2e-04
Identities = 49/207 (23%), Positives = 85/207 (41%), Gaps = 22/207 (10%)
Query: 30 ELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVE 89
E L ELE V + I+ +I K Q+L L++ + ER + L
Sbjct: 173 EKALEELE--EVEEN--IERLDLIIDEKRQQLERLRRERE-------KAERYQALLKEKR 221
Query: 90 QQE--LKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQL 147
+ E L++ AL Q + + SL +LE + +E EK L +EEL ++
Sbjct: 222 EYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKI 281
Query: 148 EDLKRGR--THQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSV 205
+DL + L AE+ +L R + ++ L E L K +AE++ +
Sbjct: 282 KDLGEEEQLRVKEKIGELEAEIASLERSI-------AEKERELEDAEERLAKLEAEIDKL 334
Query: 206 DLRISELQERLQRKRLLNQQLASQLNS 232
I EL+ ++ +R +L +
Sbjct: 335 LAEIEELEREIEEERKRRDKLTEEYAE 361
Score = 39.7 bits (93), Expect = 0.002
Identities = 30/195 (15%), Positives = 76/195 (38%), Gaps = 25/195 (12%)
Query: 40 AVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLR 99
A +++ D++ +L + + + L + E +++ E +R +L + + + +L LR
Sbjct: 311 AEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLR 370
Query: 100 A-----------LRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLE 148
A R ++ + L ++ ++ + ++EL ++ +L +
Sbjct: 371 AELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIA 430
Query: 149 DLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKK----QAEVNS 204
++ L E E+ E+ K E + +L ++ + E +
Sbjct: 431 GIEAK------INELEEEKEDKALEI----KKQEWKLEQLAADLSKYEQELYDLKEEYDR 480
Query: 205 VDLRISELQERLQRK 219
V+ +S+LQ L
Sbjct: 481 VEKELSKLQRELAEA 495
Score = 39.7 bits (93), Expect = 0.002
Identities = 45/235 (19%), Positives = 102/235 (43%), Gaps = 27/235 (11%)
Query: 33 LGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQE 92
+GE+E + + + + ++ + E+ L L+ Q++ L+ L R+E+ E
Sbjct: 718 IGEIEKEIEQLEQEEEKLKERLEELEEDLSSLE------QEIENVKSELKELEARIEELE 771
Query: 93 LKLRKLRALRGQVDQNKMNNISLS---SDLESIRALFNEKEKELCMAVAKVEELTHQLED 149
L KL +N++ S + I+A ++ E+E+ A++ E+ +L
Sbjct: 772 EDLHKLEE--------ALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNR 823
Query: 150 LKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQ----NVRLHQQRETLTKKQAEVNSV 205
L + + EL+ R +L K E++ N + + E L + +A + +
Sbjct: 824 LTLEKEYLEK---EIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDL 880
Query: 206 DLRISELQ---ERLQRKRLLNQQLASQLNSNINSSQHQLPPGKQQLVAQNSQLTS 257
+ R+ +L+ + L+ + ++ +L + I + +L K +L A +L+
Sbjct: 881 ESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSE 935
Score = 38.9 bits (91), Expect = 0.004
Identities = 41/198 (20%), Positives = 80/198 (40%), Gaps = 16/198 (8%)
Query: 27 EGVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLR--FLKQHEARHQQVAAEHERLRRL 84
E V+ L ELE + + + + E RL + + +A ++ E R+
Sbjct: 754 ENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEAR 813
Query: 85 RDRVEQQELKLRKLRALRGQVDQNKMNNI--------SLSSDLESIRALFNEKEKELCMA 136
+EQ+ +L + + Q S+ ++E++ E E+EL
Sbjct: 814 LREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEEL 873
Query: 137 VAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLT 196
A + +L +L DLK+ R L A+L L R++ E++ RL + + L
Sbjct: 874 EAALRDLESRLGDLKKERD------ELEAQLRELERKIEELEAQIEKKRKRLSELKAKLE 927
Query: 197 KKQAEVNSVDLRISELQE 214
+ E++ ++ E +E
Sbjct: 928 ALEEELSEIEDPKGEDEE 945
Score = 37.0 bits (86), Expect = 0.014
Identities = 41/170 (24%), Positives = 79/170 (46%), Gaps = 16/170 (9%)
Query: 75 AAEHERLRRLRDRVEQQELKL----RKLRALRGQVDQNKMNNISLSSDLESIRALFNEKE 130
+E L+RLR+R+E + +L +LR + ++D+ S + I + E
Sbjct: 670 RSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLE 729
Query: 131 KELCMAVAKVEELTHQLEDLKRGRTHQPSH-PHLSAELENLRRELMYHNKLSEQQN---- 185
+E ++EEL L L++ + S L A +E L +L +KL E N
Sbjct: 730 QEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDL---HKLEEALNDLEA 786
Query: 186 ----VRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQLN 231
R+ + + L+K + EV+ ++ R+ E++++L R L + L ++
Sbjct: 787 RLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQ 836
Score = 37.0 bits (86), Expect = 0.014
Identities = 42/246 (17%), Positives = 91/246 (36%), Gaps = 47/246 (19%)
Query: 19 LKAADLLTEGVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLK------------Q 66
+A + E +E L LE+ + +I + + EQ L L +
Sbjct: 232 KEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLR 291
Query: 67 HEARHQQVAAEHERLRR----LRDRVEQQELKLRKL-----------RALRGQVDQNKMN 111
+ + ++ AE L R +E E +L KL L ++++ +
Sbjct: 292 VKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKR 351
Query: 112 NISLSSDLESIRALFN-------EKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLS 164
L+ + ++ E +KE ++++ +LE LKR
Sbjct: 352 RDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKR------------ 399
Query: 165 AELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQ 224
E+ L+REL + ++ + L + +A++N ++ + ++++ +
Sbjct: 400 -EINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLE 458
Query: 225 QLASQL 230
QLA+ L
Sbjct: 459 QLAADL 464
Score = 35.4 bits (82), Expect = 0.039
Identities = 34/207 (16%), Positives = 92/207 (44%), Gaps = 20/207 (9%)
Query: 35 ELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELK 94
R + +++ ++ +++ + LR ++ + + R++ + ++ E
Sbjct: 671 SEPAELQRLRERLEGLKRELSSLQSELRRIENR--LDELSQELSDASRKIGEIEKEIEQL 728
Query: 95 LRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGR 154
++ L+ ++++ + + SL ++E++++ E E + + +L L DL+
Sbjct: 729 EQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARL 788
Query: 155 THQPSHPHLSAELENLRRELMYHNKLS------EQQNVRLHQQRETLTKK----QAEVNS 204
+H P + AEL L E +++ EQ+ RL ++E L K+ Q +
Sbjct: 789 SHSRI-PEIQAELSKLEEE---VSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRID 844
Query: 205 VDLRISELQERLQ----RKRLLNQQLA 227
+ +I +++ ++ +K L ++L
Sbjct: 845 LKEQIKSIEKEIENLNGKKEELEEELE 871
Score = 32.0 bits (73), Expect = 0.53
Identities = 20/113 (17%), Positives = 44/113 (38%), Gaps = 9/113 (7%)
Query: 128 EKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPH--------LSAELENLRRELMYHNK 179
E + + ++E L +L L+ + S ++ + +E+ +
Sbjct: 672 EPAELQRLR-ERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQ 730
Query: 180 LSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQLNS 232
E+ RL + E L+ + E+ +V + EL+ R++ +L LN
Sbjct: 731 EEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALND 783
Score = 28.9 bits (65), Expect = 4.3
Identities = 56/246 (22%), Positives = 107/246 (43%), Gaps = 29/246 (11%)
Query: 19 LKAADLLTEGVELTLGELEDMAVRQQAQIDSQRQ----LIAAKEQRLRFLKQHEARHQQV 74
L L E +E + EL++ + + QI S + L KE+ L++ EA + +
Sbjct: 821 LNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDL 880
Query: 75 AAEHERLRRLRDRVEQQELKL-RKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKEL 133
+ L++ RD +E Q +L RK+ L Q+++ + S+L++ E+ E+
Sbjct: 881 ESRLGDLKKERDELEAQLRELERKIEELEAQIEKKR----KRLSELKAKLEALEEELSEI 936
Query: 134 CMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYH---NKLSEQQ----NV 186
+ EE+ + S + AEL+ + E+ N L+ Q+
Sbjct: 937 EDPKGEDEEIPEEE----------LSLEDVQAELQRVEEEIRALEPVNMLAIQEYEEVLK 986
Query: 187 RLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQLNSNINSSQHQLPPGKQ 246
RL + +E K + E ++ RI E ++ +KR + + +N N N +L G
Sbjct: 987 RLDELKEKRAKLEEERKAILERIEEYEK---KKREVFMEAFEAINENFNEIFAELSGGTG 1043
Query: 247 QLVAQN 252
+L+ +N
Sbjct: 1044 ELILEN 1049
>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
HlyD family. Type I secretion is an ABC transport
process that exports proteins, without cleavage of any
signal sequence, from the cytosol to extracellular
medium across both inner and outer membranes. The
secretion signal is found in the C-terminus of the
transported protein. This model represents the adaptor
protein between the ATP-binding cassette (ABC) protein
of the inner membrane and the outer membrane protein,
and is called the membrane fusion protein. This model
selects a subfamily closely related to HlyD; it is
defined narrowly and excludes, for example, colicin V
secretion protein CvaA and multidrug efflux proteins
[Protein fate, Protein and peptide secretion and
trafficking].
Length = 423
Score = 39.6 bits (93), Expect = 0.002
Identities = 40/191 (20%), Positives = 77/191 (40%), Gaps = 20/191 (10%)
Query: 36 LEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEAR-HQQVAAEHERLRRLRDRVEQQELK 94
LE R +A+ DSQ + + Q + R LR ++E +
Sbjct: 93 LEAEVARLRAEADSQAAIEFPDDLLSAEDPAVPELIKGQQSLFESRKSTLRAQLELILAQ 152
Query: 95 LRKLRALRG--QVDQNKMNNIS--LSSDLESIRALFNEKEKELCMAVAKVE--ELTHQLE 148
+++L A Q + +S +LE+ R L + V+++E EL +
Sbjct: 153 IKQLEAELAGLQAQLQALRQQLEVISEELEARRKLKEKG------LVSRLELLELERERA 206
Query: 149 DLKRGRTHQPSHPHLSAELENLRRELM-YHNKLSEQQNVRLHQQRETLTKKQAEVNSVDL 207
+ + AELE L+R++ + + + + E LT+ QA + +
Sbjct: 207 EAQGELGRL------EAELEVLKRQIDELQLERQQIEQTFREEVLEELTEAQARLAELRE 260
Query: 208 RISELQERLQR 218
R+++ ++RLQR
Sbjct: 261 RLNKARDRLQR 271
>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
Length = 895
Score = 37.2 bits (86), Expect = 0.012
Identities = 35/195 (17%), Positives = 82/195 (42%), Gaps = 22/195 (11%)
Query: 47 IDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALRGQVD 106
I+ ++ +A+ L+ + + ++ +H++ +++R + LKL L + R
Sbjct: 518 INEYNKIESARAD----LEDIKIKINELKDKHDKYEEIKNRY--KSLKLEDLDSKR--TS 569
Query: 107 QNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAE 166
+ D+E+ R+ NE +K+L ++++E+ D K E
Sbjct: 570 WLNALAVISLIDIETNRSRSNEIKKQLNDLESRLQEIEIGFPDDKS------YIDKSIRE 623
Query: 167 LENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQL 226
+EN +NK +E Q E L K +++ +I+E+ + + + ++
Sbjct: 624 IENEAN--NLNNKYNEIQE--NKILIEKLRGK---IDNYKKQIAEIDSIIPDLKEITSRI 676
Query: 227 ASQLNSNINSSQHQL 241
+ + N+ S+ L
Sbjct: 677 -NDIEDNLKKSRKAL 690
Score = 30.3 bits (68), Expect = 2.0
Identities = 15/100 (15%), Positives = 41/100 (41%), Gaps = 4/100 (4%)
Query: 125 LFNEKEKELCMAVAKVEELTHQLEDLKRG----RTHQPSHPHLSAELENLRRELMYHNKL 180
N K E+ +E+L ++++ K+ + P +++ + ++ L K
Sbjct: 630 NLNNKYNEIQENKILIEKLRGKIDNYKKQIAEIDSIIPDLKEITSRINDIEDNLKKSRKA 689
Query: 181 SEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKR 220
+ + T+ + +N + RI+++ E L+ +
Sbjct: 690 LDDAKANRARLESTIEILRTRINELSDRINDINETLESMK 729
Score = 29.1 bits (65), Expect = 3.9
Identities = 30/190 (15%), Positives = 74/190 (38%), Gaps = 19/190 (10%)
Query: 33 LGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQ-----VAAEHERLRRLRDR 87
L LEDM R +++I + ++ + ++ + K+ E RH + V +
Sbjct: 244 LSSLEDMKNRYESEIKTAESDLSMELEKNNYYKELEERHMKIINDPVYKNRNYINDYFKY 303
Query: 88 VEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQL 147
E K + L + ++ I S L+ + +K+ ++L +Q+
Sbjct: 304 KNDIENKKQILSNIDAEI-NKYHAIIKKLSVLQKDYNDYIKKKSRY-------DDLNNQI 355
Query: 148 EDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDL 207
+L ++ + +E+L+ + S+ E L ++ + +++
Sbjct: 356 LEL---EGYEMDYNSYLKSIESLK---KKIEEYSKNIERMSAFISEILKIQEIDPDAIKK 409
Query: 208 RISELQERLQ 217
++E+ +LQ
Sbjct: 410 ELNEINVKLQ 419
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 36.6 bits (85), Expect = 0.017
Identities = 45/204 (22%), Positives = 82/204 (40%), Gaps = 36/204 (17%)
Query: 50 QRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALRGQVDQNK 109
+RQ +A + +R+ Q EA + AA E L QQ + R+ R + Q +
Sbjct: 27 KRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARRER----EELQRE 82
Query: 110 MNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELEN 169
+ +KE++L K++ L +QLE+ ++ + + ELE
Sbjct: 83 EERL-------------VQKEEQLDARAEKLDNLENQLEEREKALSAR------ELELEE 123
Query: 170 LRRELMYHNKLSEQQNVRLHQQRETLTK---------KQAEVNSVDLRISELQERLQRKR 220
L ++ N+L + Q R+ L K K V ++ ER +
Sbjct: 124 LEKQ--LDNELYRVAGLTPEQARKLLLKLLDAELEEEKAQRVKKIEEEADLEAERKAQNI 181
Query: 221 LLN--QQLASQLNSNINSSQHQLP 242
L Q++AS+ S+++ S +P
Sbjct: 182 LAQAMQRIASETASDLSVSVVPIP 205
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 36.2 bits (84), Expect = 0.019
Identities = 44/207 (21%), Positives = 73/207 (35%), Gaps = 39/207 (18%)
Query: 25 LTEGVELTLGEL---EDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERL 81
L EG+E L + E++ + I+S + + K Q L+ E L
Sbjct: 148 LKEGLEENLEGMKRDEELLNKDLNLINSIKPKLRKKLQALK-------------EEIASL 194
Query: 82 RRLRDRVEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVE 141
R+L D E +L R ++ SLS + R E ++EL +E
Sbjct: 195 RQLAD--ELNLCDPLELEKARQEL-------RSLSVKISEKRKQLEELQQELQELTIAIE 245
Query: 142 ELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAE 201
LT++ +L AE E +R E + + + + + +L +K
Sbjct: 246 ALTNKKSEL-LEEI---------AEAEKIREE----CRGWSAKEISKLKAKVSLLQKLTG 291
Query: 202 VNSVDLRISELQERLQRKRLLNQQLAS 228
+ L S L R L AS
Sbjct: 292 WKFLSLSGSTLSMSYNRTIELVFDPAS 318
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 35.1 bits (81), Expect = 0.040
Identities = 37/197 (18%), Positives = 75/197 (38%), Gaps = 22/197 (11%)
Query: 64 LKQHEARHQQVAAEHERLRRLRDRVEQQELKLRK--------LRALRGQVDQNKMNNISL 115
+ + +Q+ E L + + Q KL K + +L Q+ + + L
Sbjct: 33 AAADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKL 92
Query: 116 SSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAE-LENLRREL 174
+ + A N E + + L QL L+R + P +S E + R
Sbjct: 93 RKQIADLNARLNALEVQE---REQRRRLAEQLAALQRSGRNPPPALLVSPEDAQRSVRLA 149
Query: 175 MYHNKLS----------EQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQ 224
+Y+ L+ + +L R + +QAE+ ++ Q +L + +
Sbjct: 150 IYYGALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERK 209
Query: 225 QLASQLNSNINSSQHQL 241
+ +QLNS +++ Q +L
Sbjct: 210 KTLAQLNSELSADQKKL 226
Score = 28.1 bits (63), Expect = 7.2
Identities = 17/72 (23%), Positives = 27/72 (37%), Gaps = 4/72 (5%)
Query: 35 ELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEH----ERLRRLRDRVEQ 90
EL + Q+AQ QL+ +++ L L + Q+ E RL+ E
Sbjct: 186 ELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEA 245
Query: 91 QELKLRKLRALR 102
K R+ A
Sbjct: 246 AAAKAREAAAAA 257
>gnl|CDD|234714 PRK00286, xseA, exodeoxyribonuclease VII large subunit; Reviewed.
Length = 438
Score = 35.2 bits (82), Expect = 0.047
Identities = 23/122 (18%), Positives = 46/122 (37%), Gaps = 17/122 (13%)
Query: 145 HQLEDLKRGRTHQPSHPHLSAELENLRR---ELMYHNKLSEQQNVRLHQQRETLTKKQAE 201
+L+ L + L ++ L + + ++ +RL +QR ++ +
Sbjct: 289 QRLDQLA----RRLKFQSPERLLAQQQQRLDRLQQRLQRALERRLRLAKQRLERLSQRLQ 344
Query: 202 VNSVDLRISELQERLQRKRLLNQQLASQLNSNINSSQHQLPPGKQQLVAQNSQLTSAHIN 261
+ RI Q+RL +QL +L + + QL +Q+L A QL +
Sbjct: 345 QQNPQRRIERAQQRL-------EQLEQRLRRAM---RRQLKRKRQRLEALAQQLEALSPL 394
Query: 262 NV 263
Sbjct: 395 AT 396
Score = 33.2 bits (77), Expect = 0.15
Identities = 28/129 (21%), Positives = 55/129 (42%), Gaps = 17/129 (13%)
Query: 42 RQQAQIDSQRQLIAAKEQRLRFLKQH-EARH--QQVAAEHERLRRLRDRVEQQELKLRKL 98
QQ + R+ + K QRL L + + + + +A + +RL RL+ R+++ + +L
Sbjct: 272 LQQRLARAMRRRLEQKRQRLDQLARRLKFQSPERLLAQQQQRLDRLQQRLQRALERRLRL 331
Query: 99 RALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAV--------AKVEELTHQLED- 149
R + ++ + +E + + E+ L A+ ++E L QLE
Sbjct: 332 AKQRLERLSQRLQQQNPQRRIERAQQRLEQLEQRLRRAMRRQLKRKRQRLEALAQQLEAL 391
Query: 150 -----LKRG 153
L RG
Sbjct: 392 SPLATLARG 400
>gnl|CDD|236508 PRK09426, PRK09426, methylmalonyl-CoA mutase; Reviewed.
Length = 714
Score = 34.8 bits (81), Expect = 0.065
Identities = 24/75 (32%), Positives = 37/75 (49%), Gaps = 14/75 (18%)
Query: 36 LEDMAVRQQAQIDSQRQLI-------AAKEQRLRFLK--QHEARHQQVAAEHERLRRLRD 86
+E+ A R QA+IDS +Q+I KE + L+ R +Q+A RL RLR
Sbjct: 436 IEEAAARTQARIDSGKQVIVGVNKYRLDKEDPIDVLEVDNTAVRAEQIA----RLERLRA 491
Query: 87 RVEQQELKLRKLRAL 101
++ ++ L AL
Sbjct: 492 ERDEAAVE-AALAAL 505
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 34.7 bits (80), Expect = 0.079
Identities = 42/204 (20%), Positives = 86/204 (42%), Gaps = 23/204 (11%)
Query: 35 ELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELK 94
++E++ + ++ L+ A E+ L+ LK+ E Q E E L + + +E++ +
Sbjct: 186 KIEELEGQLSELLEDIEDLLEALEEELKELKKLE--EIQEEQEEEELEQEIEALEERLAE 243
Query: 95 LRKLRALRGQVDQNKMNNISLSSD-LESIRALFNEKEKELCMAVAKVEELTHQLEDLKRG 153
L + + ++ + SL + L+ E E+ L ++EE +LE+L+R
Sbjct: 244 LEEEKERLEELKARLLEIESLELEALKIREEELRELERLLE----ELEEKIERLEELER- 298
Query: 154 RTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQR---ETLTKKQAEVNSVDLRIS 210
E+E L EL L E+ L + + E L K + ++ ++ +
Sbjct: 299 ------------EIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELE 346
Query: 211 ELQERLQRKRLLNQQLASQLNSNI 234
EL E L ++ +L +
Sbjct: 347 ELAEEKNELAKLLEERLKELEERL 370
Score = 32.0 bits (73), Expect = 0.53
Identities = 45/211 (21%), Positives = 86/211 (40%), Gaps = 21/211 (9%)
Query: 27 EGVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHER------ 80
E ++ L ELE + +++ + I E+++ L+ E ++A E+
Sbjct: 406 EEIQEELEELEKELEELERELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCPVCGQ 465
Query: 81 -LRRLRDR--VEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAV 137
L ++ +E EL+L +L + + + + +R L E + L +
Sbjct: 466 ELPEEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEE 525
Query: 138 AKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTK 197
A EEL +LE L+ L ELE L+ +L E + L + + L +
Sbjct: 526 ALKEELEEKLEKLEN----------LLEELEELKEKLQLQQLKEELRQ--LEDRLQELKE 573
Query: 198 KQAEVNSVDLRISELQERLQRKRLLNQQLAS 228
E+ + R EL+E +R + L ++L
Sbjct: 574 LLEELRLLRTRKEELEELRERLKELKKKLKE 604
Score = 32.0 bits (73), Expect = 0.54
Identities = 43/197 (21%), Positives = 83/197 (42%), Gaps = 16/197 (8%)
Query: 35 ELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELK 94
EL + + +++ + + E L L + + ++ ERL+ L +R+E+ E +
Sbjct: 319 ELLEKLKSLEERLEKLEEKLEKLESELEELAEEK--NELAKLLEERLKELEERLEELEKE 376
Query: 95 LRKLRALRGQVDQNKMNNIS----LSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDL 150
L K Q+++ LS+ LE I+ E EKEL ++EEL +++ L
Sbjct: 377 LEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEIKKL 436
Query: 151 KRGRTHQPSHPHLSAELE---------NLRRELMYHNKLSEQQNVRLHQQRETLTKKQAE 201
+ S + AEL + +L E + L + E L++++ E
Sbjct: 437 EEQINQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSREK-E 495
Query: 202 VNSVDLRISELQERLQR 218
+ I EL++ L+
Sbjct: 496 EAELREEIEELEKELRE 512
Score = 30.5 bits (69), Expect = 1.4
Identities = 35/173 (20%), Positives = 69/173 (39%), Gaps = 21/173 (12%)
Query: 64 LKQHEARHQQVAAEHERLRRLRDRVEQQ-----ELKLRKLRALRGQVDQNKMNNISLSSD 118
L+++E + + + + + +E Q E L AL ++ + K
Sbjct: 166 LEKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLE---EIQ 222
Query: 119 LESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHN 178
E ++ + L +A++EE +LE+LK A L + +
Sbjct: 223 EEQEEEELEQEIEALEERLAELEEEKERLEELK-------------ARLLEIESLELEAL 269
Query: 179 KLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQLN 231
K+ E++ L + E L +K + ++ I EL+E L+ R L ++L L
Sbjct: 270 KIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLE 322
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 34.2 bits (79), Expect = 0.085
Identities = 31/186 (16%), Positives = 68/186 (36%), Gaps = 26/186 (13%)
Query: 47 IDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALRGQVD 106
+ + E L K++ V L L + +E+ KL
Sbjct: 295 ESHIEKALEELESILDTEKENSEFKLDVEELKALLEALEEILEKNLQKL----------- 343
Query: 107 QNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAE 166
+ K+ + S S +LESI L + + E ++++LK E
Sbjct: 344 EEKLKDPSTSIELESITDLIESINDIIDAINELIREHNEKIDNLK-------------KE 390
Query: 167 LENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQL 226
+++L H ++++ +Q + + +NS++ I +L+ ++ ++L
Sbjct: 391 KNKAKKKLWLHLVAELKEDIDAYQ--KEKKGLEKAINSLEKEIKQLEAEIKALEKEIKEL 448
Query: 227 ASQLNS 232
QL +
Sbjct: 449 EKQLTN 454
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 33.4 bits (77), Expect = 0.100
Identities = 22/103 (21%), Positives = 44/103 (42%), Gaps = 7/103 (6%)
Query: 97 KLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTH 156
+L L+ ++ + + L L ++ E ++EL A++E L +L +K+
Sbjct: 67 RLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSA- 125
Query: 157 QPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQ 199
+ EL+ REL +Q+N L + E L + +
Sbjct: 126 ------NAIELDEENRELREELAELKQENEALEAENERLQENE 162
Score = 32.3 bits (74), Expect = 0.20
Identities = 18/105 (17%), Positives = 41/105 (39%), Gaps = 14/105 (13%)
Query: 62 RFLKQHEARHQQVAA---EHERLRRLRDRVEQQELKLRK-LRALRGQVDQNKMNNISLSS 117
R+L + +++ E L+ +++Q +L++ + L+ ++ + L
Sbjct: 56 RYLSDTPSARERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQK 115
Query: 118 DLESIR-------ALFNEKE---KELCMAVAKVEELTHQLEDLKR 152
+L I+ L E +EL + E L + E L+
Sbjct: 116 ELARIKQLSANAIELDEENRELREELAELKQENEALEAENERLQE 160
>gnl|CDD|225519 COG2972, COG2972, Predicted signal transduction protein with a
C-terminal ATPase domain [Signal transduction
mechanisms].
Length = 456
Score = 34.0 bits (78), Expect = 0.11
Identities = 29/112 (25%), Positives = 44/112 (39%), Gaps = 19/112 (16%)
Query: 79 ERLRRLRDRVEQQELKLRK--LRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMA 136
ERL L + E + + LRAL+ Q +N L + LE+IR L E + E
Sbjct: 237 ERLEYLIEENYSLEQEQLEAELRALQSQ-----INPHFLYNTLETIRMLAEEDDPE---- 287
Query: 137 VAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRL 188
+ ++ L L R + L EL+ K E Q +R+
Sbjct: 288 --EAAKVVKALSKLLRYSL------SNLDNIVTLEIELLLIEKYLEIQKLRI 331
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 33.9 bits (78), Expect = 0.11
Identities = 37/167 (22%), Positives = 74/167 (44%), Gaps = 15/167 (8%)
Query: 75 AAEHERLRRLRDRVEQQELKLRKLRALRGQVDQNKMNNI---SLSSDLESIRALFNEKEK 131
AA ++ + ++E+ E KL +L + V++ K I L+ L ++ +EK
Sbjct: 348 AAAYKAYLAYKPKLEKVERKLPEL-GIWKDVERIKALVIRGYPLAEALSKVKEEERPREK 406
Query: 132 ELCMAVAKVEELTHQLEDLKRGRT---HQPSHPHLSAELENLRREL-MYHNKLSE-QQNV 186
E + E ++ K T + + L ELE L+RE+ ++L ++ V
Sbjct: 407 EGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRREV 466
Query: 187 RLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQLNSN 233
R +++ E+ + D RI L++ L+ K+ ++L +L
Sbjct: 467 RDKVRKDR------EIRARDRRIERLEKELEEKKKRVEELERKLAEL 507
Score = 28.1 bits (63), Expect = 7.7
Identities = 17/74 (22%), Positives = 29/74 (39%), Gaps = 7/74 (9%)
Query: 79 ERLRRLRDRVEQQELKLRKLRALRGQVD------QNKMNNIS-LSSDLESIRALFNEKEK 131
L R R V + K R++RA +++ + K + L L +R + +
Sbjct: 457 SELERFRREVRDKVRKDREIRARDRRIERLEKELEEKKKRVEELERKLAELRKMRKLELS 516
Query: 132 ELCMAVAKVEELTH 145
V VE+LT
Sbjct: 517 GKGTPVKVVEKLTL 530
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 33.8 bits (78), Expect = 0.16
Identities = 40/194 (20%), Positives = 72/194 (37%), Gaps = 19/194 (9%)
Query: 30 ELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVE 89
+LT ED QA+ + + + EQ+L +A H Q ++ +R++ V
Sbjct: 434 DLTADNAEDWLEEFQAKEQEATEELLSLEQKLSV---AQAAHSQFEQAYQLVRKIAGEVS 490
Query: 90 QQE---------LKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKV 140
+ E +LR+ R L Q+ Q +M L L + E C + K
Sbjct: 491 RSEAWDVARELLRRLREQRHLAEQLQQLRMRLSELEQRLRQQQRA-ERLLAEFCKRLGKN 549
Query: 141 EELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQA 200
+ +LE L+ + LS + R M QQ +L + + L +
Sbjct: 550 LDDEDELEQLQEELEARLE--SLSESVSEARERRMAL----RQQLEQLQARIQRLAARAP 603
Query: 201 EVNSVDLRISELQE 214
+ ++ L+E
Sbjct: 604 AWLAAQDALARLRE 617
Score = 31.5 bits (72), Expect = 0.70
Identities = 45/242 (18%), Positives = 94/242 (38%), Gaps = 34/242 (14%)
Query: 6 MTLRNRPYPSSSTLKAADLLTEGVELTLGELEDMAVRQQAQ--IDSQRQLIAAKEQRLRF 63
++ NR P + L L E+ E + ++A+ + +A E +
Sbjct: 874 LSALNRLLPRLNLLADETLADRVEEIR----EQLDEAEEAKRFVQQHGNALAQLEPIVSV 929
Query: 64 LKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALR---------GQVDQNKMNNIS 114
L+ + +Q+ ++++ ++ + +QQ L ++ R + +N N
Sbjct: 930 LQSDPEQFEQLKQDYQQAQQTQRDAKQQAFALTEVVQRRAHFSYEDAAEMLAKNSDLNEK 989
Query: 115 LSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLK-RGRTHQPSHPHLSAELENL--- 170
L LE ++L A A++ + L LK + L EL++L
Sbjct: 990 LRQRLEQAEQERTRAREQLRQAQAQLAQYNQVLASLKSSYDAKRQMLQELKQELQDLGVP 1049
Query: 171 ------------RRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQR 218
R EL H +LS ++ R +Q + LT +AE++++ ++ +L+
Sbjct: 1050 ADSGAEERARARRDEL--HARLSANRS-RRNQLEKQLTFCEAEMDNLTKKLRKLERDYHE 1106
Query: 219 KR 220
R
Sbjct: 1107 MR 1108
>gnl|CDD|161783 TIGR00237, xseA, exodeoxyribonuclease VII, large subunit. This
family consist of exodeoxyribonuclease VII, large
subunit XseA which catalyses exonucleolytic cleavage in
either the 5'->3' or 3'->5' direction to yield
5'-phosphomononucleotides. Exonuclease VII consists of
one large subunit and four small subunits [DNA
metabolism, Degradation of DNA].
Length = 432
Score = 32.9 bits (75), Expect = 0.20
Identities = 18/102 (17%), Positives = 42/102 (41%), Gaps = 9/102 (8%)
Query: 162 HLSAELENLRRELMYHNKLSEQQNVRLHQQRETL-------TKKQAEVNSVDLRISELQE 214
L A L+ + L + Q +L ++++ T+++ + L +
Sbjct: 288 QLVASLQRQHPQNKLA--LQQLQFEKLEKRKQAALNKQLERTRQKKTRLTKRLTQTNPSP 345
Query: 215 RLQRKRLLNQQLASQLNSNINSSQHQLPPGKQQLVAQNSQLT 256
++ R + +QL +LN+ N+ + P LV + + L+
Sbjct: 346 QILRAQTRTEQLNRRLNALKNAQANLKLPQFGTLVGKLNALS 387
Score = 30.2 bits (68), Expect = 1.7
Identities = 25/124 (20%), Positives = 52/124 (41%), Gaps = 18/124 (14%)
Query: 42 RQQAQIDS-QRQLIAAKEQRLRFLKQHEARHQQVAAE---HERLRRLRDRVEQQELKLRK 97
QQ Q + +++ AA ++L +Q + R + + ++ R + R EQ +L
Sbjct: 304 LQQLQFEKLEKRKQAALNKQLERTRQKKTRLTKRLTQTNPSPQILRAQTRTEQLNRRLNA 363
Query: 98 LRALRGQVDQ---------NKMNNISLSSDLESIRALFNEKEKELCMAVAKVEE---LTH 145
L Q + K+N +S L ++ ++ + +V +V+ LT
Sbjct: 364 L--KNAQANLKLPQFGTLVGKLNALSPLEVLARGYSIALNEKGKAIKSVKQVDRGDRLTT 421
Query: 146 QLED 149
+L+D
Sbjct: 422 KLKD 425
>gnl|CDD|239651 cd03679, MM_CoA_mutase_alpha_like, Coenzyme
B12-dependent-methylmalonyl coenzyme A (CoA) mutase
(MCM) family, Alpha subunit-like subfamily; contains
proteins similar to the alpha subunit of
Propionbacterium shermanni MCM, as well as human and E.
coli MCM. Members of this subfamily contain an
N-terminal MCM domain and a C-terminal coenzyme B12
binding domain. MCM catalyzes the isomerization of
methylmalonyl-CoA to succinyl-CoA. The reaction proceeds
via radical intermediates beginning with a
substrate-induced homolytic cleavage of the Co-C bond of
coenzyme B12 to produce cob(II)alamin and the
deoxyadenosyl radical. MCM plays an important role in
the conversion of propionyl-CoA to succinyl-CoA during
the degradation of propionate for the Krebs cycle. In
higher animals, MCM is involved in the breakdown of
odd-chain fatty acids, several amino acids, and
cholesterol. Methylobacterium extorquens MCM
participates in the glyoxylate regeneration pathway. In
M. extorquens, MCM forms a complex with MeaB; MeaB may
protect MCM from irreversible inactivation. In some
bacteria, MCM is involved in the reverse metabolic
reaction, the rearrangement of succinyl-CoA to
methylmalonyl-CoA. Examples include P. shermanni MCM
during propionic acid fermentation, E.coli MCM in a
pathway for the conversion of succinate to propionate
and Streptomyces MCM in polyketide biosynthesis.
Sinorhizobium meliloti strain SU47 MCM plays a role in
the polyhydroxyalkanoate degradation pathway. P.
shermanni and Streptomyces cinnamonensis MCMs are
alpha/beta heterodimers. It has been shown for P.
shermanni MCM that only the alpha subunit binds coenzyme
B12 and substrates. Human MCM is a homodimer with two
active sites. Mouse and E.coli MCMs are also homodimers.
In humans, impaired activity of MCM results in
methylmalonic aciduria, a disorder of propionic acid
metabolism.
Length = 536
Score = 33.1 bits (76), Expect = 0.22
Identities = 22/66 (33%), Positives = 33/66 (50%), Gaps = 10/66 (15%)
Query: 36 LEDMAVRQQAQIDSQRQLI-------AAKEQRLRFLK--QHEARHQQVAAEHERLRRLRD 86
+E+ A R+QA+IDS R++I E+ L LK R +Q+A ++LR RD
Sbjct: 430 IEEAAARRQARIDSGREVIVGVNKYRLDHEEPLDVLKIDNTAVRAEQIAR-LKKLRAERD 488
Query: 87 RVEQQE 92
Q
Sbjct: 489 PEAVQA 494
>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
PFL_4705 family. Members of this protein family are
found occasionally on plasmids such as the Pseudomonas
putida toluene catabolic TOL plasmid pWWO_p085. Usually,
however, they are found on the bacterial main chromosome
in regions flanked by markers of conjugative transfer
and/or transposition [Mobile and extrachromosomal
element functions, Plasmid functions].
Length = 472
Score = 32.7 bits (75), Expect = 0.24
Identities = 27/103 (26%), Positives = 43/103 (41%), Gaps = 21/103 (20%)
Query: 163 LSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLL 222
L AE++ LR+ L KL +N L + E L K++ S+D +I Q+ +Q +
Sbjct: 64 LVAEVKELRKRL---AKL-ISENEALKAENERLQKRE---QSIDQQI---QQAVQSET-- 111
Query: 223 NQQLASQLNSNINSSQHQLPPGKQQLVAQNSQLTSAHINNVNS 265
Q+L + QL +QQL QL +
Sbjct: 112 -QELTKE--------IEQLKSERQQLQGLIDQLQRRLAGVLTG 145
>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57. Eukaryotic
cilia and flagella are specialised organelles found at
the periphery of cells of diverse organisms.
Intra-flagellar transport (IFT) is required for the
assembly and maintenance of eukaryotic cilia and
flagella, and consists of the bidirectional movement of
large protein particles between the base and the distal
tip of the organelle. IFT particles contain multiple
copies of two distinct protein complexes, A and B, which
contain at least 6 and 11 protein subunits. IFT57 is
part of complex B but is not, however, required for the
core subunits to stay associated. This protein is known
as Huntington-interacting protein-1 in humans.
Length = 355
Score = 32.4 bits (74), Expect = 0.29
Identities = 33/172 (19%), Positives = 66/172 (38%), Gaps = 35/172 (20%)
Query: 46 QIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALRGQV 105
++ QL A + L KQ+ L++L + + K+
Sbjct: 217 HLEQMHQLKKAISKSLPETKQY-------------LKKLSQDLSKALEKISSREKHLNNQ 263
Query: 106 DQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSA 165
+ L + R ++ +++ A V ELT +L ++ S
Sbjct: 264 LE------QLVQEYREARRTLSQVQEKYNQASQGVSELTRELNEI-------------SE 304
Query: 166 ELENLRRELMYH-NKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERL 216
ELE +++E+ +S+ + + ++ +TK + E+ +DLRI LQ L
Sbjct: 305 ELEQVKQEMEERGASMSDGSPL--VKIKQAITKLKEEIKQMDLRIGVLQHTL 354
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 32.4 bits (75), Expect = 0.30
Identities = 18/92 (19%), Positives = 41/92 (44%), Gaps = 1/92 (1%)
Query: 42 RQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERL-RRLRDRVEQQELKLRKLRA 100
+++A+ + L+ AKE+ + + E ++ E ++L +RL + E + KL L
Sbjct: 48 KKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEK 107
Query: 101 LRGQVDQNKMNNISLSSDLESIRALFNEKEKE 132
++++ + +LE E +E
Sbjct: 108 REEELEKKEKELEQKQQELEKKEEELEELIEE 139
Score = 32.1 bits (74), Expect = 0.44
Identities = 17/81 (20%), Positives = 36/81 (44%), Gaps = 6/81 (7%)
Query: 166 ELENLRREL-----MYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKR 220
E+ LR E N+L + + RL Q+ E L +K + + + + ++ L++K+
Sbjct: 65 EIHKLRNEFEKELRERRNELQKLEK-RLLQKEENLDRKLELLEKREEELEKKEKELEQKQ 123
Query: 221 LLNQQLASQLNSNINSSQHQL 241
++ +L I +L
Sbjct: 124 QELEKKEEELEELIEEQLQEL 144
Score = 29.7 bits (68), Expect = 2.1
Identities = 39/176 (22%), Positives = 70/176 (39%), Gaps = 42/176 (23%)
Query: 49 SQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALRGQVDQN 108
++ ++ A+E+ R L+ EA+ + A + E L ++ + + LR + ++
Sbjct: 29 AEAKIKEAEEEAKRILE--EAKKEAEAIKKEALLEAKEEIHK----------LRNEFEK- 75
Query: 109 KMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELE 168
+L R + EK L + K E L +LE L++ ELE
Sbjct: 76 ---------ELRERRNELQKLEKRL---LQKEENLDRKLELLEK----------REEELE 113
Query: 169 NLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQ 224
+EL EQ+ L ++ E L + E RIS L ++ LL +
Sbjct: 114 KKEKEL-------EQKQQELEKKEEELEELIEEQLQELERISGLTAEEAKEILLEK 162
>gnl|CDD|163506 TIGR03794, NHLM_micro_HlyD, NHLM bacteriocin system secretion
protein. Members of this protein family are homologs of
the HlyD membrane fusion protein of type I secretion
systems. Their occurrence in prokaryotic genomes is
associated with the occurrence of a novel class of
microcin (small bacteriocins) with a leader peptide
region related to nitrile hydratase. We designate the
class of bacteriocin as Nitrile Hydratase Leader
Microcin, or NHLM. This family, therefore, is designated
as NHLM bacteriocin system secretion protein. Some but
not all NHLM-class putative microcins belong to the TOMM
(thiazole/oxazole modified microcin) class as assessed
by the presence of the scaffolding protein and/or
cyclodehydratase in the same gene clusters [Transport
and binding proteins, Amino acids, peptides and amines,
Cellular processes, Biosynthesis of natural products].
Length = 421
Score = 32.5 bits (74), Expect = 0.32
Identities = 45/194 (23%), Positives = 74/194 (38%), Gaps = 21/194 (10%)
Query: 79 ERLRRLRDRVEQQELKLRKLRALRGQVDQNKMNNISLSSD-LESIRALFNEKEKELCMAV 137
ERL+ ++ Q + +L ++R G++ + + + S + LE E+ L V
Sbjct: 96 ERLQESYQKLTQLQEQLEEVRNYTGRLKEGRERHFQKSKEALEETIGRLREELAALSREV 155
Query: 138 AKVEE-LTHQLEDLKRGRTHQ--------PSHPHLSAELEN-LRRELMYHNKLSEQQNVR 187
K L+ L KR R Q A L+ + N + Q
Sbjct: 156 GKQRGLLSRGLATFKRDRILQQQWREEQEKYDAADKARAIYALQTKADERNLETVLQ--S 213
Query: 188 LHQQRETL-TKKQAEVNSVDLRISELQER---LQRKRLLNQQLASQLNSNINSSQHQLPP 243
L Q L + E+ +V+ RI E + L+ K LN ++ SQ + + L
Sbjct: 214 LSQADFQLAGVAEKELETVEARIKEARYEIEELENKLNLNTRIVSQHSGRVIE----LNY 269
Query: 244 GKQQLVAQNSQLTS 257
QLVA + L S
Sbjct: 270 TPGQLVAAGAPLAS 283
>gnl|CDD|224486 COG1570, XseA, Exonuclease VII, large subunit [DNA replication,
recombination, and repair].
Length = 440
Score = 32.2 bits (74), Expect = 0.41
Identities = 37/140 (26%), Positives = 57/140 (40%), Gaps = 16/140 (11%)
Query: 135 MAVAKVEELTHQLEDLKRGRTHQPSHPHLSAE---LENLRRELM---YHNKLSEQQNVRL 188
+ V EL QL+ L+R R H+ L + LE+L R L LSEQQ RL
Sbjct: 258 LVVPDSAELLQQLDQLQR-RLHRALRRLLDQKKQRLEHLARRLQFRSPERLLSEQQQ-RL 315
Query: 189 HQQRETLTKKQAEV-----NSVDLRISELQERLQRKRLLNQQLASQLNSNINSSQHQLPP 243
+ L + ++ L ++QR++ QQL +L+ + QL
Sbjct: 316 DELAIRLRRALENQLALKKQRLERLTQRLNPQIQRQQQRLQQLERRLDKALR---RQLKR 372
Query: 244 GKQQLVAQNSQLTSAHINNV 263
+++L A QL S
Sbjct: 373 KRERLEALVEQLESLSPLAT 392
Score = 28.8 bits (65), Expect = 4.1
Identities = 32/144 (22%), Positives = 65/144 (45%), Gaps = 11/144 (7%)
Query: 20 KAADLLTEGVELTLGELEDMAVR----QQAQIDSQRQLIAAKEQRLRFL---KQHEARHQ 72
AA+L+ L +L+ + R + +D ++Q + +RL+F + + Q
Sbjct: 254 AAAELVVPDSAELLQQLDQLQRRLHRALRRLLDQKKQRLEHLARRLQFRSPERLLSEQQQ 313
Query: 73 QVAAEHERLRR-LRDRVEQQELKLRKLR-ALRGQVDQNKMNNISLSSDL-ESIRALFNEK 129
++ RLRR L +++ ++ +L +L L Q+ + + L L +++R K
Sbjct: 314 RLDELAIRLRRALENQLALKKQRLERLTQRLNPQIQRQQQRLQQLERRLDKALRRQLKRK 373
Query: 130 EKELCMAVAKVEELTHQLEDLKRG 153
+ L V ++E L+ L RG
Sbjct: 374 RERLEALVEQLESLSPLAT-LARG 396
>gnl|CDD|234172 TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family.
Members of this protein family are homologs of ClpB, an
ATPase associated with chaperone-related functions.
These ClpB homologs, designated ClpV1, are a key
component of the bacterial pathogenicity-associated type
VI secretion system [Protein fate, Protein and peptide
secretion and trafficking, Cellular processes,
Pathogenesis].
Length = 852
Score = 32.2 bits (74), Expect = 0.47
Identities = 23/104 (22%), Positives = 45/104 (43%), Gaps = 11/104 (10%)
Query: 34 GELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQ----QVAAEHERLRRLRDRVE 89
LED+ R+ A ++ + + E+ H+ R ++AA L L R +
Sbjct: 419 AALEDLR-RRIAALELELDAL---EREAALGADHDERLAELRAELAALEAELAALEARWQ 474
Query: 90 QQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKEL 133
Q++ + + ALR +++ + + D ++RA E E L
Sbjct: 475 QEKELVEAILALRAELEADADAP---ADDDAALRAQLAELEAAL 515
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 31.9 bits (73), Expect = 0.49
Identities = 43/197 (21%), Positives = 80/197 (40%), Gaps = 22/197 (11%)
Query: 27 EGVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRD 86
E +E L +LE+ + +++ L+ A+++ R ++ E + +A E + R+
Sbjct: 478 EELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRE 537
Query: 87 RVEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQ 146
R E+ LR + + L ++ E R E E+E A +V EL +
Sbjct: 538 RAEE----------LRERAAE-------LEAEAEEKREAAAEAEEEAEEAREEVAELNSK 580
Query: 147 LEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLT-----KKQAE 201
L +LK L A + + E+ + E ++RE L K++ E
Sbjct: 581 LAELKERIESLERIRTLLAAIADAEDEIERLREKREALAELNDERRERLAEKRERKRELE 640
Query: 202 VNSVDLRISELQERLQR 218
+ RI E +E +R
Sbjct: 641 AEFDEARIEEAREDKER 657
>gnl|CDD|240016 cd04658, Piwi_piwi-like_Euk, Piwi_piwi-like_Euk: PIWI domain,
Piwi-like subfamily found in eukaryotes. This domain is
found in Piwi and closely related proteins, where it is
believed to perform a crucial role in germline cells,
via RNA silencing. RNA silencing refers to a group of
related gene-silencing mechanisms mediated by short RNA
molecules, including siRNAs, miRNAs, and
heterochromatin-related guide RNAs. The mechanism in
Piwi is believed to be similar to that in Argonaute, the
central component of the RNA-induced silencing complex
(RISC). The PIWI domain is the C-terminal portion of
Argonaute and consists of two subdomains, one of which
provides the 5' anchoring of the guide RNA and the
other, the catalytic site for slicing.
Length = 448
Score = 31.9 bits (73), Expect = 0.53
Identities = 19/103 (18%), Positives = 36/103 (34%), Gaps = 2/103 (1%)
Query: 213 QERLQRKRLLNQQLASQLNSNINSSQHQLPPGKQQLVAQNSQLTSAHINNVNSLFRSMTS 272
+ER R Q++ + + + L Q L I N + ++
Sbjct: 15 KERYDTIRQFIQRIQKNPSVQELLKKWGIELDSNPLKIQGRVLPPEQIIMGNVFVYANSN 74
Query: 273 SNIAAVAPYLHVPSKPNLDNTHAIF--QNQRRIESGHNQLNKV 313
++ + NL+N I+ ++QR ES L +V
Sbjct: 75 ADWKREIRNQPLYDAVNLNNWVLIYPSRDQREAESFLQTLKQV 117
>gnl|CDD|213368 cd12834, ZntB_u1, Uncharacterized bacterial subgroup of the
Salmonella typhimurium Zn2+ transporter ZntB-like
subfamily. The MIT superfamily of essential membrane
proteins is involved in transporting divalent cations
(uptake or efflux) across membranes. The ZntB-like
subfamily includes the Zn2+ transporter Salmonella
typhimurium ZntB which mediates the efflux of Zn2+ (and
Cd2+). Structures of the intracellular domain of Vibrio
parahaemolyticus and Salmonella typhimurium ZntB form
funnel-shaped homopentamers, the tip of the funnel is
formed from two C-terminal transmembrane (TM) helices
from each monomer, and the large opening of the funnel
from the N-terminal cytoplasmic domains. The GMN
signature motif of the MIT superfamily occurs just after
TM1, mutation within this motif is known to abolish Mg2+
transport through Salmonella typhimurium CorA, and
Mrs2p. Natural variants such as GVN and GIN which occur
in proteins belonging to this subfamily, may be
associated with the transport of different divalent
cations, such as zinc and cadmium. The functional
diversity of MIT transporters may also be due to minor
structural differences regulating gating, substrate
selection, and transport.
Length = 290
Score = 31.0 bits (71), Expect = 0.66
Identities = 20/78 (25%), Positives = 35/78 (44%), Gaps = 9/78 (11%)
Query: 160 HPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRK 219
H L+ E L R L + + + LT+ ++++D + LQER
Sbjct: 161 HRLLAPEPAALFRLL---RRPPDWLPDADLETLRRLTE---RLDALDRDLVALQERA--- 211
Query: 220 RLLNQQLASQLNSNINSS 237
RLL +++A++L N S
Sbjct: 212 RLLQEEIAARLAEQTNRS 229
>gnl|CDD|221016 pfam11180, DUF2968, Protein of unknown function (DUF2968). This
family of proteins has no known function.
Length = 192
Score = 30.5 bits (69), Expect = 0.76
Identities = 24/81 (29%), Positives = 40/81 (49%), Gaps = 5/81 (6%)
Query: 32 TLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQ----HEARHQQVAAEHERLRRLRDR 87
L ++E + +AQ + IAA E R L+ +A+ QQVAA ++ R+
Sbjct: 99 QLADVEIRRAQLEAQKAQLERQIAASEARAERLQADLQLAQAQEQQVAARQQQARQEAVA 158
Query: 88 VEQQELKLR-KLRALRGQVDQ 107
+E Q + +LR L+ Q+ Q
Sbjct: 159 LEAQRQAAQAQLRKLQRQIRQ 179
>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
Length = 428
Score = 31.2 bits (71), Expect = 0.82
Identities = 50/214 (23%), Positives = 86/214 (40%), Gaps = 32/214 (14%)
Query: 40 AVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLR 99
A + Q+ S +Q IAAKE+ +R QQ L +L+ + E RKLR
Sbjct: 42 ASDNRDQLKSIQQDIAAKEKSVR---------QQQQQRASLLAQLKKQEEAISQASRKLR 92
Query: 100 ALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPS 159
+ ++Q L++ + + +E+ L QL+ R H
Sbjct: 93 ETQNTLNQLNKQIDELNASIAKLEQQQAAQER----------LLAAQLDAAFRQGEHTGL 142
Query: 160 HPHLSAELENLRRE--LMYHNKLSE----------QQNVRLHQQRETLTKKQAEVNSVDL 207
LS E E+ R E L Y L++ Q L Q+ L +KQ++ ++
Sbjct: 143 QLILSGE-ESQRGERILAYFGYLNQARQETIAELKQTREELAAQKAELEEKQSQQKTLLY 201
Query: 208 RISELQERLQRKRLLNQQLASQLNSNINSSQHQL 241
Q++L++ R ++ + L S++ Q QL
Sbjct: 202 EQQAQQQKLEQARNERKKTLTGLESSLQKDQQQL 235
>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
envelope biogenesis, outer membrane].
Length = 835
Score = 31.2 bits (71), Expect = 0.93
Identities = 51/261 (19%), Positives = 86/261 (32%), Gaps = 64/261 (24%)
Query: 7 TLRNRPYPSSSTLKAADLLTEGVELTLGELEDMAVRQQAQIDSQRQ---------LIAAK 57
+ S L A+ +E L L++ AV + Q +++ LI +
Sbjct: 7 NVLQELLQSRRELLTAESAQ--LEAALQLLQE-AVNSKRQEEAEPAAEEAELQAELIQQE 63
Query: 58 EQRLR----FLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALRGQVDQNKMNNI 113
L Q R +A++ +L L ++ Q R +R Q
Sbjct: 64 LAINDQLSQALNQQTERLNALASDDRQLANLLLQLLQSS------RTIREQ-------IA 110
Query: 114 SLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRE 173
L L R L + + + + Q E ++ E + L+ E
Sbjct: 111 VLRGSLLLSRILLQQLG-----PLPEAGQPQEQFE--------------VTQERDALQAE 151
Query: 174 LMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQLNSN 233
Y N L Q E LT AEV I + +R+ LLN L+ + +
Sbjct: 152 KAYINALEGQA--------EQLT---AEVR----DILDQILDTRRE-LLNSLLSQREAIS 195
Query: 234 INSSQHQLPPGKQQLVAQNSQ 254
+ +Q QL +L + Q
Sbjct: 196 LQLNQQQLSAASDELRSLLHQ 216
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 30.4 bits (69), Expect = 0.94
Identities = 29/135 (21%), Positives = 63/135 (46%), Gaps = 12/135 (8%)
Query: 23 DLLTEGVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLR 82
DL + +E + E+ + +A++++ + + A E L L+ ++ + ++
Sbjct: 16 DLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLES------EIQ 69
Query: 83 RLRDRVEQQELKL------RKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMA 136
+R+R+++ E KL R+LRAL ++ K SL +L + + EKE+
Sbjct: 70 EIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDL 129
Query: 137 VAKVEELTHQLEDLK 151
++E L L + +
Sbjct: 130 KERLERLEKNLAEAE 144
>gnl|CDD|233067 TIGR00641, acid_CoA_mut_N, methylmalonyl-CoA mutase N-terminal
domain. Methylmalonyl-CoA mutase (EC 5.4.99.2)
catalyzes a reversible isomerization between
L-methylmalonyl-CoA and succinyl-CoA. The enzyme uses an
adenosylcobalamin cofactor. It may be a homodimer, as in
mitochondrion, or a heterodimer with partially
homologous beta chain that does not bind the
adenosylcobalamin cofactor, as in Propionibacterium
freudenreichii. The most similar archaeal sequences are
separate chains, such as AF2215 abd AF2219 of
Archaeoglobus fulgidus, that correspond roughly to the
first 500 and last 130 residues, respectively of known
methylmalonyl-CoA mutases. This model describes the
N-terminal domain subfamily. In a neighbor-joining tree,
AF2215 branches with a bacterial isobutyryl-CoA mutase,
which is also the same length. Scoring between the noise
and trusted cutoffs are the non-catalytic, partially
homologous beta chains from certain heterodimeric
examples of 5.4.99.2.
Length = 524
Score = 30.9 bits (70), Expect = 1.1
Identities = 27/85 (31%), Positives = 43/85 (50%), Gaps = 11/85 (12%)
Query: 35 ELEDMAVRQQAQIDSQRQLIAA-------KEQRLRFLK--QHEARHQQVAAEHERLRRLR 85
+E+ A R QA+IDS RQ+I +E + LK R +Q+ A+ ++LR R
Sbjct: 409 RIEEAAARTQARIDSGRQVIVGVNKYQLEEEDEVEVLKVDNSSVREEQI-AKLKKLRAER 467
Query: 86 DRVEQQELKLRKLRALRGQVDQNKM 110
D+ E+ E L L + D+N +
Sbjct: 468 DQ-EKVEAALDALTKAAEKEDENLL 491
>gnl|CDD|217551 pfam03428, RP-C, Replication protein C N-terminal domain.
Replication protein C is involved in the early stages of
viral DNA replication.
Length = 177
Score = 29.8 bits (68), Expect = 1.3
Identities = 15/34 (44%), Positives = 19/34 (55%), Gaps = 3/34 (8%)
Query: 68 EARHQQVAAEHERLRRLRDRVEQQELKLRKLRAL 101
A QV AE + LRRLR+R+ L R +R L
Sbjct: 140 AALAAQVRAERKALRRLRERL---TLLRRDIRKL 170
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 30.7 bits (69), Expect = 1.3
Identities = 33/198 (16%), Positives = 73/198 (36%), Gaps = 13/198 (6%)
Query: 37 EDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLR 96
E + Q+ S+++L +E + LK+++ ++ + E L+ L E+ + +L
Sbjct: 245 ELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELL 304
Query: 97 KLR----ALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKR 152
KL ++ +++ L +L+ + E EKEL K E + E L++
Sbjct: 305 KLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEK 364
Query: 153 GRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISEL 212
+ + L SE+ + + E L K E L +
Sbjct: 365 LQEK---------LEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKLLLELS 415
Query: 213 QERLQRKRLLNQQLASQL 230
++ + ++ +
Sbjct: 416 EQEEDLLKEEKKEELKIV 433
>gnl|CDD|115307 pfam06637, PV-1, PV-1 protein (PLVAP). This family consists of
several PV-1 (PLVAP) proteins which seem to be specific
to mammals. PV-1 is a novel protein component of the
endothelial fenestral and stomatal diaphragms. The
function of this family is unknown.
Length = 442
Score = 30.4 bits (68), Expect = 1.4
Identities = 19/76 (25%), Positives = 39/76 (51%), Gaps = 12/76 (15%)
Query: 43 QQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHER-----------LRRLRDRVEQQ 91
Q+ ++++++ AA+E + + K+ +AR ++ AE R LR RD + ++
Sbjct: 305 QRQKLEAEQAAQAAQEAKAKAEKEAQAREAKLQAECARQTQLALEEKAALRAERDNLAKE 364
Query: 92 -ELKLRKLRALRGQVD 106
E K R+ LR ++
Sbjct: 365 LEAKKREAEQLRMELA 380
>gnl|CDD|217372 pfam03105, SPX, SPX domain. We have named this region the SPX
domain after (SYG1, Pho81 and XPR1). This 180 residue
length domain is found at the amino terminus of a
variety of proteins. In the yeast protein SYG1, the
N-terminus directly binds to the G- protein beta subunit
and inhibits transduction of the mating pheromone
signal. This finding suggests that all the members of
this family are involved in G-protein associated signal
transduction. The N-termini of several proteins involved
in the regulation of phosphate transport, including the
putative phosphate level sensors PHO81 from
Saccharomyces cerevisiae and NUC-2 from Neurospora
crassa, are also members of this family. The SPX domain
of S. cerevisiae low-affinity phosphate transporters
Pho87 and Pho90 auto-regulates uptake and prevents
efflux. This SPX dependent inhibition is mediated by the
physical interaction with Spl2 NUC-2 contains several
ankyrin repeats pfam00023. Several members of this
family are annotated as XPR1 proteins: the xenotropic
and polytropic retrovirus receptor confers
susceptibility to infection with murine leukaemia
viruses (MLV). The similarity between SYG1, phosphate
regulators and XPR1 sequences has been previously noted,
as has the additional similarity to several predicted
proteins, of unknown function, from Drosophila
melanogaster, Arabidopsis thaliana, Caenorhabditis
elegans, Schizosaccharomyces pombe, and Saccharomyces
cerevisiae. In addition, given the similarities between
XPR1 and SYG1 and phosphate regulatory proteins, it has
been proposed that XPR1 might be involved in G-protein
associated signal transduction and may itself function
as a phosphate sensor.
Length = 176
Score = 29.7 bits (67), Expect = 1.4
Identities = 20/68 (29%), Positives = 32/68 (47%), Gaps = 3/68 (4%)
Query: 115 LSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRREL 174
L +LE + + + EKE EL + ++EEL QLE+L+ R S ++ L+
Sbjct: 69 LDKELEKVNSFYKEKEAEL---LERLEELEKQLEELEERRDETSSAKSSPSDKNLLKAFE 125
Query: 175 MYHNKLSE 182
L E
Sbjct: 126 ELKKALLE 133
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 30.2 bits (68), Expect = 1.4
Identities = 24/124 (19%), Positives = 49/124 (39%), Gaps = 4/124 (3%)
Query: 125 LFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQ 184
L + EL A + + L L+ ++ L +EL+ L++E +L +
Sbjct: 30 LLERLDSELRDAEKERDTYKQYLSKLESQNVEISNYEALDSELDELKKE---EERL-LDE 85
Query: 185 NVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQLNSNINSSQHQLPPG 244
L ++ + L + E+ ++ + + R+ L + QL N+ S + Q
Sbjct: 86 LEELEKEDDDLDGELVELQEEKEQLENEELQYLREYNLFDRNNLQLEDNLQSLELQYEYS 145
Query: 245 KQQL 248
QL
Sbjct: 146 LNQL 149
Score = 27.9 bits (62), Expect = 7.6
Identities = 23/106 (21%), Positives = 46/106 (43%), Gaps = 13/106 (12%)
Query: 33 LGELEDMAV--RQQAQIDSQRQLIAAKEQRL-RFLKQHEARHQQVAAEHERLRRLRDRVE 89
L +LE V +DS+ + +E+RL L++ E + E L+ ++++E
Sbjct: 52 LSKLESQNVEISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLE 111
Query: 90 QQELKLRKLRALRGQVDQNKM--NNISLSSDLESIRALFNEKEKEL 133
+EL+ + N NN+ L +L+S+ + +L
Sbjct: 112 NEELQY--------LREYNLFDRNNLQLEDNLQSLELQYEYSLNQL 149
>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
unknown].
Length = 984
Score = 30.6 bits (69), Expect = 1.5
Identities = 36/172 (20%), Positives = 65/172 (37%), Gaps = 21/172 (12%)
Query: 76 AEHERLRRLRDRVEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCM 135
E + L + R Q L KL+ R ++D+ + + + L + E
Sbjct: 168 KEADSLYKPSGRNPQINQLLEKLKQERNEIDEAE-------KEYATYHKLLESRRAEHAR 220
Query: 136 AVAKVEELTHQLEDLKRGRTHQPSHPHL--SAELEN-LRRELMYHNKLSEQQNVRLHQQR 192
EL + ++ R P L +LE L R +RL ++
Sbjct: 221 LAELRSELRADRDHIRALRDAVELWPRLQEWKQLEQELTRRREELATFPRDGVLRLEKRE 280
Query: 193 ETLTKKQAEVNSVDLRISELQER-----------LQRKRLLNQQLASQLNSN 233
L K +AE++++ +R++EL++ LQ L+QQL+ S
Sbjct: 281 AHLQKTEAEIDALLVRLAELKDLASQLIPAKEAVLQALVRLHQQLSEIKASA 332
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 30.0 bits (68), Expect = 1.6
Identities = 15/75 (20%), Positives = 31/75 (41%), Gaps = 3/75 (4%)
Query: 80 RLRRLRDRVEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEK---ELCMA 136
R +L E L+++ R ++ ++ E + L ++ E
Sbjct: 288 RTEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKEL 347
Query: 137 VAKVEELTHQLEDLK 151
+K+EE+ +LEDL+
Sbjct: 348 NSKLEEIQKKLEDLE 362
>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
ATPase domains [General function prediction only].
Length = 530
Score = 30.3 bits (69), Expect = 1.6
Identities = 10/101 (9%), Positives = 35/101 (34%), Gaps = 5/101 (4%)
Query: 51 RQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALRGQVDQNKM 110
Q Q ++ + + R + + ++ + ++++L L ++ + +
Sbjct: 240 EQKAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKRLEKLEARLAEERP 299
Query: 111 NNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLK 151
+ + + F K L V + E ++ + +
Sbjct: 300 V-----EEGKPLAFRFPPPGKRLGKLVLEFENVSKGYDGGR 335
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 30.3 bits (69), Expect = 1.7
Identities = 25/121 (20%), Positives = 48/121 (39%), Gaps = 6/121 (4%)
Query: 141 EELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKL--SEQQNVRLHQQRETLTKK 198
++ QL+ L + + + + +LE L +K+ +++ +L QQ K
Sbjct: 39 ADVQAQLDALNKQKLLEAEDKLVQQDLEQ---TLALLDKIDRQKEETEQLKQQLAQAPAK 95
Query: 199 QAEVNSVDLRISELQERLQRKRLLNQQLASQLNSNINSSQHQLPPGKQQLVAQNSQLTSA 258
+ + + + + R+ L L QL S + + QL + L NSQL S
Sbjct: 96 LRQAQAELEALKDDNDEETRETLSTLSLR-QLESRLAQTLDQLQNAQNDLAEYNSQLVSL 154
Query: 259 H 259
Sbjct: 155 Q 155
Score = 29.5 bits (67), Expect = 3.3
Identities = 27/135 (20%), Positives = 50/135 (37%), Gaps = 25/135 (18%)
Query: 130 EKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLH 189
+++L A AK+ + +LE LK + + L L E L + QN
Sbjct: 86 KQQLAQAPAKLRQAQAELEALKDDNDEETRETLSTLSLRQL--ESRLAQTLDQLQNA--- 140
Query: 190 QQRETLTKKQAEVNSVDLRISELQERLQR---------KRLLNQQLASQLNSNINSSQHQ 240
Q ++ + ++ LQ + +R +RL QQ+ + L +
Sbjct: 141 ---------QNDLAEYNSQLVSLQTQPERAQAALYANSQRL--QQIRNLLKGGKVGGKAL 189
Query: 241 LPPGKQQLVAQNSQL 255
P + L A+ + L
Sbjct: 190 RPSQRVLLQAEQALL 204
>gnl|CDD|184349 PRK13824, PRK13824, replication initiation protein RepC;
Provisional.
Length = 404
Score = 29.8 bits (68), Expect = 1.8
Identities = 11/17 (64%), Positives = 15/17 (88%)
Query: 72 QQVAAEHERLRRLRDRV 88
+QVAAE + LRRLR+R+
Sbjct: 156 EQVAAERKALRRLRERL 172
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 30.0 bits (68), Expect = 2.0
Identities = 52/244 (21%), Positives = 94/244 (38%), Gaps = 29/244 (11%)
Query: 32 TLGELEDMAVR--QQAQIDSQRQLIAAKEQRLRFL-KQHEARHQQVAAEHERLRRLRDR- 87
L LEDM QR I + R R+L + QV RLRRL D
Sbjct: 732 VLAALEDMRDAKLDNIATRIQR-AIRGRYLRRRYLQALKRIKKIQVIQHGFRLRRLVDYE 790
Query: 88 -VEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQ 146
+ +KL+ L +L G + + S L I L ++E + + E + +
Sbjct: 791 LKWRLFIKLQPLLSLLGSRKEYR-------SYLACIIKLQKTIKREKKLRETEEVEFSLK 843
Query: 147 LEDLKR--GRTHQPSHPHLSAEL-ENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVN 203
E L + GR+ + S E + + +L+E+Q L ++++ +
Sbjct: 844 AEVLIQKFGRSLKA-KKRFSLLKKETIYLQSAQRVELAERQLQELKIDVKSISSLKLVNL 902
Query: 204 SVDLRISELQERLQRKRLLNQQLASQ-------LNSNINSS-----QHQLPPGKQQLVAQ 251
++ I EL++ L + N + ++ L +NI+ ++ P +L
Sbjct: 903 ELESEIIELKKSLSSDLIENLEFKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEV 962
Query: 252 NSQL 255
S+L
Sbjct: 963 ESKL 966
>gnl|CDD|221313 pfam11917, DUF3435, Protein of unknown function (DUF3435). This
family of proteins are functionally uncharacterized.
This protein is found in eukaryotes. Proteins in this
family are typically between 435 to 791 amino acids in
length. This family is related to pfam00589 suggesting
it may be an integrase enzyme.
Length = 418
Score = 29.6 bits (67), Expect = 2.2
Identities = 19/73 (26%), Positives = 34/73 (46%), Gaps = 7/73 (9%)
Query: 43 QQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVA-----AEHERLRRLRDRV--EQQELKL 95
Q+A ++ +L +R K+ A + QVA +ERL + R V E+Q L+
Sbjct: 280 QKASVEEDPELQELIRKRDHLKKEIIALYGQVAKAKGTPLYERLEKRRREVRNERQRLRR 339
Query: 96 RKLRALRGQVDQN 108
+ +R + D+
Sbjct: 340 ELKKKIREEFDEE 352
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists
of several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 29.9 bits (67), Expect = 2.4
Identities = 28/181 (15%), Positives = 63/181 (34%), Gaps = 10/181 (5%)
Query: 65 KQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRA 124
+ ++ + + E+E +++ + Q RK L ++ N+ L L+++
Sbjct: 62 ELIRSKSKLIQLENELMQKELEHKRAQIELERKASTLAENYERELDRNLELEVRLKALEE 121
Query: 125 LFNEKEKELCMAVAKVE---------ELTHQLEDLKRGRTHQPSHPHLSAELENLRRELM 175
L + E E A + + L Q E + + + S + +L ++
Sbjct: 122 LEKKAENEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESISRIKNDLSEMQCRAQ 181
Query: 176 YHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQLNSNIN 235
+ + L + RE L + Q E+ + ++ L Q N L +
Sbjct: 182 NADTELKLLESELEELREQLEECQKELAEAEKKLQSLTS-EQASSADNSVKIKHLEEELK 240
Query: 236 S 236
Sbjct: 241 R 241
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 29.5 bits (67), Expect = 2.8
Identities = 42/203 (20%), Positives = 75/203 (36%), Gaps = 45/203 (22%)
Query: 20 KAADLLTEGVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHE 79
K+ + L + VE L ++E I E+ + L+ +Q E E
Sbjct: 82 KSLEELIKDVEEELEKIEKE--------------IKELEEEISELENEIKELEQ---EIE 124
Query: 80 RLRRLRDRVEQQELKLRKLRALR------GQVDQNKMNNISLSSDLESIRALFNEKEKEL 133
RL + +L L L + G V ++K+ + L SD+E++ + +K
Sbjct: 125 RLEPWGNF----DLDLSLLLGFKYVSVFVGTVPEDKLEELKLESDVENVEYISTDKGYVY 180
Query: 134 CMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRE 193
+ V E E+LK+ E L EL SE + + +E
Sbjct: 181 VVVVVLKELSDEVEEELKK------------LGFERL--ELEEEGTPSE----LIREIKE 222
Query: 194 TLTKKQAEVNSVDLRISELQERL 216
L + + E S+ + EL ++
Sbjct: 223 ELEEIEKERESLLEELKELAKKY 245
>gnl|CDD|151391 pfam10944, DUF2630, Protein of unknown function (DUF2630). This
bacterial family of proteins have no known function.
Length = 81
Score = 27.3 bits (61), Expect = 3.1
Identities = 17/47 (36%), Positives = 22/47 (46%), Gaps = 2/47 (4%)
Query: 56 AKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALR 102
A+E LR Q + EH RL RL ++Q LR+ RA R
Sbjct: 15 AEEHELRAKLQQ--GEIDESEEHARLARLEVALDQCWDLLRQRRARR 59
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 29.2 bits (66), Expect = 3.3
Identities = 25/151 (16%), Positives = 64/151 (42%), Gaps = 9/151 (5%)
Query: 30 ELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVE 89
+L L + ++ A R +++ LR + + R + ++L++ +
Sbjct: 204 KLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQELERQAEAHEQKLKNELA 263
Query: 90 QQELKLRK--LRALRGQVDQNKMNNIS----LSSDLESIRALFNEKEKELCMAVAKVEEL 143
Q ++L++ + ++ +V++ + ++ L+S L+ + + + + KV++L
Sbjct: 264 LQAIELQREFNKEIKEKVEEERNGRLAKLAELNSRLKGLEKALDSRSEAE-DENHKVQQL 322
Query: 144 THQLEDLKRG--RTHQPSHPHLSAELENLRR 172
+E LK S L EL+ L+
Sbjct: 323 WLAVEALKSALKSGSAGSPRPLVKELDALKE 353
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 29.1 bits (65), Expect = 3.3
Identities = 25/106 (23%), Positives = 42/106 (39%), Gaps = 9/106 (8%)
Query: 40 AVRQQAQIDSQRQLIAAKEQRLRFLKQH---EARHQQVAAEHERLRRLRDRVEQQELKLR 96
AV QQ +Q A K ++ R K+ E + AAE ERL+ + ++ +L+
Sbjct: 59 AVVQQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLK------QLEKERLK 112
Query: 97 KLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEE 142
+ + K + E R E++K+ A AK
Sbjct: 113 AQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAA 158
>gnl|CDD|204379 pfam10023, DUF2265, Predicted aminopeptidase (DUF2265). Members of
this family of bacterial proteins comprise various
hypothetical proteins and putative aminopeptidases.
Their exact function, has not, as yet, been defined.
Length = 338
Score = 28.8 bits (65), Expect = 3.9
Identities = 21/96 (21%), Positives = 32/96 (33%), Gaps = 7/96 (7%)
Query: 26 TEGVELTLGELEDMAVRQQAQIDSQR-----QLIAAKEQRLRFL-KQHEARHQQVAAEHE 79
EGV L D A R + +R L+ A RL L A ++ AA+
Sbjct: 197 REGVRRWLAARGDPAARARYAAFDERRQQFRALLLATRDRLEALYASPLADERKRAAKAA 256
Query: 80 RLRRLRDRVEQQELKLRKLRALRGQVDQNKMNNISL 115
++LR + + +NN L
Sbjct: 257 IFQQLRADYAALKARWGGYAGYDRWFAAP-LNNAKL 291
>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC. All proteins in this family for
which functions are known are part of an exonuclease
complex with sbcD homologs. This complex is involved in
the initiation of recombination to regulate the levels
of palindromic sequences in DNA. This family is based on
the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1042
Score = 29.2 bits (65), Expect = 4.0
Identities = 40/188 (21%), Positives = 70/188 (37%), Gaps = 6/188 (3%)
Query: 44 QAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALRG 103
A + +R+ + ++ + LKQ AR +++ A+ L ++R+ + L A
Sbjct: 242 HAYLTQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERINRARKAAP-LAAHIK 300
Query: 104 QVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHL 163
V Q + + ++L+S K A K + + L + Q H
Sbjct: 301 AVTQIEQQAQRIHTELQSKMRSR-AKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRD 359
Query: 164 SAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQE----RLQRK 219
+ E+ RE+ Q L QQ+ TLT+K + + Q R
Sbjct: 360 AHEVATSIREISCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDTRTSAF 419
Query: 220 RLLNQQLA 227
R L QLA
Sbjct: 420 RDLQGQLA 427
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 28.8 bits (65), Expect = 4.6
Identities = 18/76 (23%), Positives = 36/76 (47%), Gaps = 5/76 (6%)
Query: 76 AEHERLRRLRDRVEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCM 135
E L+ L + +++ E +L K++ L ++ + + E + L KEK
Sbjct: 331 ELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLP-PEKREQLEKLLETKEK---- 385
Query: 136 AVAKVEELTHQLEDLK 151
++EEL +L++LK
Sbjct: 386 LSEELEELEEELKELK 401
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 28.8 bits (65), Expect = 4.7
Identities = 20/90 (22%), Positives = 36/90 (40%), Gaps = 3/90 (3%)
Query: 14 PSSSTLKAADLLTEGVELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFL-KQHEARHQ 72
P L E V +LE A R++AQ + + + L L + E + Q
Sbjct: 137 PEDPENLLHALQQE-VLTLKQQLELQA-REKAQSQALAEAQQQELVALEGLAAELEEKQQ 194
Query: 73 QVAAEHERLRRLRDRVEQQELKLRKLRALR 102
++ A+ E+L+ Q+ + RK +
Sbjct: 195 ELEAQLEQLQEKAAETSQERKQKRKEITDQ 224
>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 429
Score = 28.7 bits (65), Expect = 5.0
Identities = 13/82 (15%), Positives = 32/82 (39%), Gaps = 14/82 (17%)
Query: 77 EHERLRRLRDRVEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMA 136
E E L+ R+ + +E+ + + + L ++++ ++ E E L
Sbjct: 44 ELEELQAERNEL-SKEIGRAL--------KRGEDDAEELIAEVKELKEKLKELEAALDEL 94
Query: 137 VAKVEELTHQL-----EDLKRG 153
A+++ L + E + G
Sbjct: 95 EAELDTLLLTIPNIPHESVPVG 116
>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
Length = 343
Score = 28.4 bits (64), Expect = 5.0
Identities = 28/140 (20%), Positives = 52/140 (37%), Gaps = 25/140 (17%)
Query: 129 KEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRL 188
K+ L +++ EL L L+R L + NLR L +E + RL
Sbjct: 51 KDSALDRLNSQIAELADLLS-LERQGNQD-----LQDSVANLRASL----SAAEAERSRL 100
Query: 189 HQQRETLTKKQAEVNSVDLRISELQERLQRKR-----------LLNQQLASQLNSNINSS 237
L + + + R EL + L ++ LLNQQ+A L + +
Sbjct: 101 QAL---LAELAGAGAAAEGRAGELAQELDSEKQVSARALAQVELLNQQIA-ALRRQLAAL 156
Query: 238 QHQLPPGKQQLVAQNSQLTS 257
+ L +++ +++
Sbjct: 157 EAALDASEKRDRESQAKIAD 176
>gnl|CDD|237560 PRK13922, PRK13922, rod shape-determining protein MreC;
Provisional.
Length = 276
Score = 28.4 bits (64), Expect = 5.1
Identities = 12/55 (21%), Positives = 23/55 (41%)
Query: 40 AVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELK 94
A + R+ ++ L L+ +Q+ AE+ RLR L + E + +
Sbjct: 64 VFESLASLFDLREENEELKKELLELESRLQELEQLEAENARLRELLNLKESLDYQ 118
>gnl|CDD|215312 PLN02574, PLN02574, 4-coumarate--CoA ligase-like.
Length = 560
Score = 28.7 bits (64), Expect = 5.3
Identities = 12/47 (25%), Positives = 18/47 (38%), Gaps = 1/47 (2%)
Query: 256 TSAHINNVNSLFRSMTSSNIAAVAPYLHVPSKPNLDNTHAIFQNQRR 302
NN + T ++ P + +PS PNLD IF +
Sbjct: 7 AQTSHNNPPFWYSPETGI-YSSKHPPVPLPSDPNLDAVSFIFSHHNH 52
>gnl|CDD|213754 TIGR02971, heterocyst_DevB, ABC exporter membrane fusion protein,
DevB family. Members of this protein family are found
mostly in the Cyanobacteria, but also in the
Planctomycetes. DevB from Anabaena sp. strain PCC 7120
is partially characterized as a membrane fusion protein
of the DevBCA ABC exporter, probably a glycolipid
exporter, required for heterocyst formation. Most
Cyanobacteria have one member only, but Nostoc sp. PCC
7120 has seven members.
Length = 327
Score = 28.3 bits (63), Expect = 5.9
Identities = 25/118 (21%), Positives = 48/118 (40%), Gaps = 9/118 (7%)
Query: 38 DMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRK 97
D A + AQ+ + + QR + VAA+ L RL +E + ++ +
Sbjct: 65 DEAKARLAQVRAGAKKGEIAAQRAARAAAK--LFKDVAAQQATLNRLEAELETAQREVDR 122
Query: 98 LRALR--GQVDQNKMNNISLSSD-----LESIRALFNEKEKELCMAVAKVEELTHQLE 148
R+L G V + +++ +L LE A +E+ A+A + E + +
Sbjct: 123 YRSLFRDGAVSASDLDSKALKLRTAEEELEEALASRSEQIDGARAALASLAEEVRETD 180
>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein. Members of this
protein family belong to a conserved gene four-gene
neighborhood found sporadically in a phylogenetically
broad range of bacteria: Nocardia farcinica,
Symbiobacterium thermophilum, and Streptomyces
avermitilis (Actinobacteria), Geobacillus kaustophilus
(Firmicutes), Azoarcus sp. EbN1 and Ralstonia
solanacearum (Betaproteobacteria). Proteins in this
family average over 1400 amino acids in length
[Hypothetical proteins, Conserved].
Length = 1353
Score = 28.6 bits (64), Expect = 6.1
Identities = 43/201 (21%), Positives = 77/201 (38%), Gaps = 20/201 (9%)
Query: 23 DLLTEGV-ELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAE--HE 79
D LTE + L EL D+A Q+D R + E R L+ R+++ A
Sbjct: 208 DALTEALPPLDDDELTDVA-DALEQLDEYRDELERLEALERALRNFLQRYRRYARTMLRR 266
Query: 80 RLRRLRDRVEQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAK 139
R RLR Q + R L R +++ + + A E+E
Sbjct: 267 RATRLRSAQTQYDQLSRDLGRARDELET-------AREEERELDARTEALEREA------ 313
Query: 140 VEELTHQLEDLKRGRTHQPSHP--HLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTK 197
+ L +LE L+ +Q + A+ E L+ + + RL ++R L +
Sbjct: 314 -DALRTRLEALQGSPAYQDAEELERARADAEALQAAAADARQAIREAESRLEEERRRLDE 372
Query: 198 KQAEVNSVDLRISELQERLQR 218
+ ++ + + +E+L R
Sbjct: 373 EAGRLDDAERELRAAREQLAR 393
>gnl|CDD|220964 pfam11068, DUF2869, Protein of unknown function (DUF2869). This
bacterial family of proteins has no known function.
Length = 131
Score = 27.2 bits (61), Expect = 6.1
Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 6/59 (10%)
Query: 42 RQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAE----HERLRRLRDRVEQ-QELKL 95
++ Q++ Q Q A KE R + +Q E QQV E E+ +L ++EQ QEL L
Sbjct: 34 QELQQLEFQGQK-AIKEIRKQSAQQIEQIQQQVEQERAKRLEQKNQLLFQIEQVQELPL 91
>gnl|CDD|237481 PRK13723, PRK13723, conjugal transfer pilus assembly protein TraH;
Provisional.
Length = 451
Score = 28.2 bits (63), Expect = 6.1
Identities = 13/61 (21%), Positives = 26/61 (42%), Gaps = 6/61 (9%)
Query: 204 SVDLRISELQERLQRKRLL------NQQLASQLNSNINSSQHQLPPGKQQLVAQNSQLTS 257
D+ + +QE +Q+ R + + + L N+N +Q Q+ + Q+ Q L
Sbjct: 364 GYDILLQYIQELIQQARAMLATGNYPEAVMDHLRENLNQAQRQIAAFQSQVQVQQDALLV 423
Query: 258 A 258
Sbjct: 424 V 424
>gnl|CDD|225115 COG2205, KdpD, Osmosensitive K+ channel histidine kinase [Signal
transduction mechanisms].
Length = 890
Score = 28.4 bits (64), Expect = 6.2
Identities = 15/45 (33%), Positives = 24/45 (53%), Gaps = 2/45 (4%)
Query: 39 MAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRR 83
+A Q+ +D+ IA +R+ + EA ++AAE ERLR
Sbjct: 619 LAPEQRRLLDAVLTQIALALERVTLAE--EAEQARLAAERERLRS 661
>gnl|CDD|227515 COG5188, PRP9, Splicing factor 3a, subunit 3 [RNA processing and
modification].
Length = 470
Score = 28.4 bits (63), Expect = 6.3
Identities = 14/59 (23%), Positives = 23/59 (38%), Gaps = 7/59 (11%)
Query: 36 LEDMAVRQQAQIDSQRQLIAAKEQRLR-------FLKQHEARHQQVAAEHERLRRLRDR 87
LE+M + + A + ++ LR F +Q EHER L+D+
Sbjct: 11 LEEMEIIENAIAERIQRNPKLPRDELRLERQIRIFENMERISNQIWLVEHERPTGLKDQ 69
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of
the C-terminal region.
Length = 546
Score = 28.4 bits (63), Expect = 6.4
Identities = 31/143 (21%), Positives = 63/143 (44%), Gaps = 14/143 (9%)
Query: 113 ISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRR 172
++L D+ +R+ + E L + K EEL Q ++ + S +SAE L
Sbjct: 160 LALEGDVSDMRSRIEQLETALRHSTEKTEELEEQHKEAQS------SSESMSAERNALLA 213
Query: 173 ELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQLNS 232
+ + +Q+ + L Q +TLT+K+ E + V +++ L+R + +Q ++
Sbjct: 214 QRAEN----QQRILELEQDIQTLTQKKQENDRVLEGTQDIEAELERMKGELKQRLKKMTI 269
Query: 233 NINSSQHQLPPGKQQLVAQNSQL 255
+ + + L +N QL
Sbjct: 270 QRRDEETE----RIDLQLENEQL 288
>gnl|CDD|218661 pfam05622, HOOK, HOOK protein. This family consists of several
HOOK1, 2 and 3 proteins from different eukaryotic
organisms. The different members of the human gene
family are HOOK1, HOOK2 and HOOK3. Different domains
have been identified in the three human HOOK proteins,
and it was demonstrated that the highly conserved
NH2-domain mediates attachment to microtubules, whereas
the central coiled-coil motif mediates homodimerisation
and the more divergent C-terminal domains are involved
in binding to specific organelles (organelle-binding
domains). It has been demonstrated that endogenous HOOK3
binds to Golgi membranes, whereas both HOOK1 and HOOK2
are localised to discrete but unidentified cellular
structures. In mice the Hook1 gene is predominantly
expressed in the testis. Hook1 function is necessary for
the correct positioning of microtubular structures
within the haploid germ cell. Disruption of Hook1
function in mice causes abnormal sperm head shape and
fragile attachment of the flagellum to the sperm head.
Length = 713
Score = 28.2 bits (63), Expect = 6.5
Identities = 52/233 (22%), Positives = 97/233 (41%), Gaps = 15/233 (6%)
Query: 33 LGELEDMAVRQQAQIDSQRQLIAAKEQRLRFLKQHEARH----QQVAAEHERLRRLRDRV 88
L E D A + +AQ+++ ++ + R +K E R+ Q E L++
Sbjct: 303 LRESSDKAKKLEAQVETYKKKLEDLNDLRRQVKLLEERNAMYMQNTVQLEEELKKANAAR 362
Query: 89 EQQELKLRKLRALRGQVDQNKMNNISLSSDLESIRALFNEKEKELCMAVAKVEELTHQLE 148
Q E R+++ L ++ + L + + + +KE +A+ + L E
Sbjct: 363 GQLETYKRQVQELHAKLSEESKKADKLEFEYKRLEEKLEALQKEKERLLAERDSLRETNE 422
Query: 149 DLKRGRTHQ----PSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNS 204
+L+ G+ Q + S +NL EL+ SE + + Q E + + S
Sbjct: 423 ELRCGQAQQDQLTQADAGTSPSGDNLAAELLP----SEYREKLIRLQHENKMLRLGQEGS 478
Query: 205 VDLRISELQERLQRKRLLNQQLASQLNSN---INSSQHQLPPGKQQLVAQNSQ 254
+ RI+ELQ+ L+ N +L +QL I Q Q+ ++ L Q S+
Sbjct: 479 ENERITELQQLLEDANRRNNELETQLRLANQRILELQQQVEDLQKALQEQGSK 531
>gnl|CDD|153183 cd04911, ACT_AKiii-YclM-BS_1, ACT domains located C-terminal to the
catalytic domain of the lysine plus threonine-sensitive
aspartokinase isoenzyme AKIII. This CD includes the
first of two ACT domains located C-terminal to the
catalytic domain of the lysine plus threonine-sensitive
aspartokinase isoenzyme AKIII, a monofunctional class
enzyme found in Bacilli (Bacillus subtilis (BS) YclM)
and Clostridia species. Aspartokinase is the first
enzyme in the aspartate metabolic pathway and catalyzes
the conversion of aspartate and ATP to aspartylphosphate
and ADP. Bacillus subtilis YclM is reported to be a
single polypeptide of 50 kD. AKIII from Bacillus
subtilis strain 168 is induced by lysine and repressed
by threonine and it is synergistically inhibited by
lysine and threonine. Members of this CD belong to the
superfamily of ACT regulatory domains.
Length = 76
Score = 26.4 bits (59), Expect = 6.6
Identities = 15/51 (29%), Positives = 22/51 (43%), Gaps = 10/51 (19%)
Query: 281 YLHVPSKPNLDNTHAIFQNQRRIESGHNQLNKVVCFDFVPEISEKCAVDEI 331
Y H+PS +D+ I ++ NQL + EI E+ DEI
Sbjct: 32 YEHMPS--GIDDISIIIRD--------NQLTDEKEQKILAEIKEELHPDEI 72
>gnl|CDD|224120 COG1199, DinG, Rad3-related DNA helicases [Transcription / DNA
replication, recombination, and repair].
Length = 654
Score = 28.2 bits (63), Expect = 6.7
Identities = 23/137 (16%), Positives = 47/137 (34%), Gaps = 13/137 (9%)
Query: 35 ELEDMAVRQQAQID-SQRQLIAAKEQRLRFLKQHEARHQQVAAE----HERLRRLRDRVE 89
L D R I S+R L ++ + E +++ + L LR+ +
Sbjct: 231 NLPD-IARSALSIRLSERTLERLLKEIQALGETLEKDLKRLEDLADRLEKALEDLRELLI 289
Query: 90 QQ-----ELKLRKLRALRGQVDQNKMNNI--SLSSDLESIRALFNEKEKELCMAVAKVEE 142
L+ R L + + + + +L L+++ L + EL + +E
Sbjct: 290 FDVDELGNLRERLREQLSSEEAKEALGKLEEALLEKLKNLSELLGLSQNELDRPTSILER 349
Query: 143 LTHQLEDLKRGRTHQPS 159
L +L+ L
Sbjct: 350 LKEELDRLLSRELLLSD 366
>gnl|CDD|203795 pfam07889, DUF1664, Protein of unknown function (DUF1664). The
members of this family are hypothetical plant proteins
of unknown function. The region featured in this family
is approximately 100 amino acids long.
Length = 125
Score = 27.0 bits (60), Expect = 6.8
Identities = 16/57 (28%), Positives = 28/57 (49%)
Query: 162 HLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQR 218
HLS ++NL +L ++SE + + RE L+ +V SV + L+ +L
Sbjct: 64 HLSQRIDNLDDKLDEQKEISESTRDEVTEIREDLSNIGEDVKSVHQAVEGLEGKLDS 120
>gnl|CDD|213363 cd12829, Alr1p-like, Saccharomyces cerevisiae Alr1p-like subfamily.
This eukaryotic subfamily belongs to the Thermotoga
maritima CorA (TmCorA)-family of the MIT superfamily of
essential membrane proteins involved in transporting
divalent cations (uptake or efflux) across membranes.
This subfamily includes three Saccharomyces cerevisiae
members: two plasma membrane proteins, the Mg2+
transporter Alr1p/Swc3p and the putative Mg2+
transporter, Alr2p, and the vacuole membrane protein
Mnr2p, a putative Mg2+ transporter. Thermotoga maritima
CorA forms funnel-shaped homopentamers, the tip of the
funnel is formed from two C-terminal transmembrane (TM)
helices from each monomer, and the large opening of the
funnel from the N-terminal cytoplasmic domains. The GMN
signature motif of the MIT superfamily occurs just after
TM1, mutation within this motif is known to abolish Mg2+
transport by Alr1p. Natural variants in this signature
sequence may be associated with the transport of
different divalent cations. The functional diversity of
the MIT superfamily may also be due to minor structural
differences regulating gating, substrate selection, and
transport.
Length = 305
Score = 28.0 bits (63), Expect = 8.0
Identities = 10/41 (24%), Positives = 19/41 (46%)
Query: 46 QIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRD 86
++ S +L+ AK L+ L + + V+ + E L D
Sbjct: 160 RVTSLLRLLFAKSDVLKQLLKRLMKKTFVSPDSEVAIYLGD 200
>gnl|CDD|202101 pfam02050, FliJ, Flagellar FliJ protein.
Length = 122
Score = 26.8 bits (60), Expect = 8.2
Identities = 20/111 (18%), Positives = 38/111 (34%), Gaps = 5/111 (4%)
Query: 119 LESIRALFNEKEKELCMAVAKVEELTHQLEDLKRGRTHQPSHPHLSAELENLRRELMYHN 178
L + + E++L E QL +G + + A + L +
Sbjct: 7 LAEAQQELEQAEEKLEELQQYRLEYRQQLSGSGQGISAAELR-NYQAFISALDEAIAQQQ 65
Query: 179 KLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQRKRLLNQQLASQ 229
+ EQ ++ Q RE + E ++ L ER ++K + Q
Sbjct: 66 QELEQAEKQVEQAREQWQEANQERKKLEK----LLEREKKKEQKEENRREQ 112
>gnl|CDD|219176 pfam06785, UPF0242, Uncharacterized protein family (UPF0242).
Length = 401
Score = 27.9 bits (62), Expect = 8.3
Identities = 51/221 (23%), Positives = 95/221 (42%), Gaps = 47/221 (21%)
Query: 50 QRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLRALRGQVDQNK 109
+RQL A Q L + + +++ + E LR +R+ +E+++ + DQ K
Sbjct: 62 ERQLKTAAGQLL------QTKIRKLTEQDEGLRNIRESIEERQK----------ESDQLK 105
Query: 110 MNNISLSSDLESIRALF----NEKEKELCMAVAKVEE---LTHQLEDLKRGRTHQPSHPH 162
+ N L + L R +F +K ++ EE L QL+ L + +
Sbjct: 106 LQNQKLVNQLGHARGVFIKTKGRYQKLEELSRHLKEENQCLQIQLDALVQECNEK----- 160
Query: 163 LSAELENLRREL----MYHNKLSEQQNVRLHQQRETLTKKQAEVNSVDLRISELQERLQR 218
E + L REL Y +L+++ +Q L K+QA + ++ ++ +L +
Sbjct: 161 -IEENQELNRELAETLAYQQELNDEYQATFVEQHNMLDKRQAYIGKLEAKVQDLMCEI-- 217
Query: 219 KRLLNQQLASQLNSNINSSQHQLPPGKQQLVAQN--SQLTS 257
+ LL QL S + N+ PGK +++ +QL S
Sbjct: 218 RNLL--QLESGIKENL--------PGKPVAASRDVVAQLVS 248
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 28.1 bits (63), Expect = 9.0
Identities = 11/73 (15%), Positives = 28/73 (38%), Gaps = 5/73 (6%)
Query: 163 LSAELENLRRELMYHNKLSEQQNVRLHQQR-ETLTKKQAEVNSVDLRISELQERLQRKRL 221
L ++++ L L + + E + +AE+ V+ E E L+++
Sbjct: 43 LESKIKKLGIPLKDTGGKPDVPPSKEFLDLEEEILDLEAEIKEVE----ENLESLEKEIN 98
Query: 222 LNQQLASQLNSNI 234
++ + L+
Sbjct: 99 ELEEWLNVLDEEK 111
>gnl|CDD|218184 pfam04632, FUSC, Fusaric acid resistance protein family. This
family includes a conserved region found in two proteins
associated with fusaric acid resistance,from
Burkholderia cepacia and Klebsiella oxytoca. These
proteins are likely to be membrane transporter proteins.
Length = 649
Score = 28.0 bits (63), Expect = 9.2
Identities = 16/89 (17%), Positives = 23/89 (25%), Gaps = 8/89 (8%)
Query: 20 KAADLLTEGV-ELTLGELEDMAVRQQAQIDSQRQLIAAKEQRLRFL------KQHEARHQ 72
L +G+ L +G + +R+ AA L L H
Sbjct: 545 ARRRALRDGLAALRVG-RAVLRLRRALAAALPAPARAALRAVLAALARLFRAAAHGRALA 603
Query: 73 QVAAEHERLRRLRDRVEQQELKLRKLRAL 101
AA L + RAL
Sbjct: 604 APAALLRALDAALAALAADPAPAALRRAL 632
>gnl|CDD|218827 pfam05960, DUF885, Bacterial protein of unknown function (DUF885).
This family consists of several hypothetical bacterial
proteins several of which are putative membrane
proteins.
Length = 539
Score = 27.7 bits (62), Expect = 9.2
Identities = 15/80 (18%), Positives = 34/80 (42%), Gaps = 7/80 (8%)
Query: 26 TEGVELTLGELEDMAVRQQAQIDSQRQLIAAK-------EQRLRFLKQHEARHQQVAAEH 78
+LT E+ + + + A+I ++ + IA + ++ L L+ + +
Sbjct: 232 YTTTDLTPEEIHQLGLAEVARIRAEMEAIARQVGFTGDLQEFLAALRTDQFYYPTDPGRD 291
Query: 79 ERLRRLRDRVEQQELKLRKL 98
E L R+ ++ + KL L
Sbjct: 292 ELLDEARELLDAIDAKLPDL 311
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 27.7 bits (62), Expect = 9.6
Identities = 52/239 (21%), Positives = 79/239 (33%), Gaps = 32/239 (13%)
Query: 40 AVRQQAQIDSQRQLIAAKEQRLRFLKQHEARHQQVAAEHERLRRLRDRVEQQELKLRKLR 99
AV +Q Q + Q A+ L K+ EA + A + RL R + EQQ LK +
Sbjct: 619 AVAKQKQAEEQLVQANAE---LEEQKRAEAEART-ALKQARLDLQRLQNEQQSLKDKLEL 674
Query: 100 AL-------RGQVD----QNKMNNISLSSDLESIRALFNE---KEKELCMAVAKVEELTH 145
A+ Q+ Q K + LE+++ F E + V EL +
Sbjct: 675 AIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRTERLAKWQVV--EGELDN 732
Query: 146 QLEDLKRG-RTHQPSHPHLSAELENLRRELMYHNKLSEQQNVRLHQQRETLTKK------ 198
QL L + EL+ + + L +Q E L
Sbjct: 733 QLAQLSAAIEAARTQAKARLKELKKQYDRELASLDVDPNTVKELKRQIEELETTIERIAV 792
Query: 199 -QAEVNSVDLRISELQERLQRKRLLNQQLASQLNSNINSSQHQLPPGKQQLVAQNSQLT 256
+ EV + E R LA QL + SS +L +Q+L
Sbjct: 793 RRPEVREYRAFMQETWLHRDSLREERPNLAIQL-RELESSAEEL---QQELTRLIKDTK 847
>gnl|CDD|119423 cd05163, TRRAP, TRansformation/tRanscription domain-Associated
Protein (TRRAP), pseudokinase domain; The TRRAP
catalytic domain is part of a larger superfamily that
includes the catalytic domains of other kinases such as
the typical serine/threonine/tyrosine protein kinases
(PKs), aminoglycoside phosphotransferase, choline
kinase, and RIO kinases. TRRAP shows some similarity to
members of the phosphoinositide 3-kinase-related protein
kinase (PIKK) subfamily in that it contains a FATC
(FRAP, ATM and TRRAP, C-terminal) domain and has a large
molecular weight. Unlike PIKK proteins, however, it
contains an inactive PI3K-like pseudokinase domain,
which lacks the conserved residues necessary for ATP
binding and catalytic activity. TRRAP also contains many
motifs that may be critical for protein-protein
interactions. TRRAP is a common component of many
histone acetyltransferase (HAT) complexes, and is
responsible for the recruitment of these complexes to
chromatin during transcription, replication, and DNA
repair. TRRAP also exists in non-HAT complexes such as
the p400 and MRN complexes, which are implicated in
ATP-dependent remodeling and DNA repair, respectively.
Length = 253
Score = 27.6 bits (62), Expect = 9.7
Identities = 12/37 (32%), Positives = 17/37 (45%), Gaps = 1/37 (2%)
Query: 208 RISELQER-LQRKRLLNQQLASQLNSNINSSQHQLPP 243
R + +ER LQ R LN L+ + + Q LP
Sbjct: 41 RQARREERVLQLFRTLNSVLSKNKETRRRNLQFTLPL 77
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.314 0.127 0.341
Gapped
Lambda K H
0.267 0.0689 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 16,422,126
Number of extensions: 1589139
Number of successful extensions: 3992
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3420
Number of HSP's successfully gapped: 606
Length of query: 332
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 235
Effective length of database: 6,635,264
Effective search space: 1559287040
Effective search space used: 1559287040
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 59 (26.6 bits)