RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy15959
(393 letters)
>gnl|CDD|150430 pfam09755, DUF2046, Uncharacterized conserved protein H4 (DUF2046).
This is the conserved N-terminal 350 residues of a
family of proteins of unknown function possibly
containing a coiled-coil domain.
Length = 308
Score = 348 bits (895), Expect = e-119
Identities = 217/310 (70%), Positives = 261/310 (84%), Gaps = 6/310 (1%)
Query: 2 ADSASESDSNSMDGVALMMPPSPVTREQLQKRIESLQQQNRVLKVELETYKLQLRALQEE 61
A+ SESDS+ + S +RE+L+ RI SL+Q+N+VLK+EL+T+KL+ ++LQEE
Sbjct: 1 AECPSESDSDG----STDEGSSTASRERLRGRIHSLKQENKVLKMELDTFKLKCKSLQEE 56
Query: 62 NKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLN 121
N+ LRQ SV+IQAKAEQEEEFISNTL+KKIQALKKEKETLA +YEQEEEFLTNDLSRKL
Sbjct: 57 NRALRQASVSIQAKAEQEEEFISNTLLKKIQALKKEKETLAMNYEQEEEFLTNDLSRKLT 116
Query: 122 QLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQ 181
QL +EK +LEQT+EQEQE VNKLMRKI+KLEA+T AKQ +LEQLRREKV+LENTLEQEQ
Sbjct: 117 QLRQEKIQLEQTLEQEQEYQVNKLMRKIDKLEADTLAKQTSLEQLRREKVDLENTLEQEQ 176
Query: 182 EALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDM--NNGDTASNLSTHIHSLRSEV 239
EALVN+LWKRMDKLEAEKR+LQ KLDQPVS+P SP D+ DT +++HI SLRSEV
Sbjct: 177 EALVNRLWKRMDKLEAEKRMLQEKLDQPVSEPPSPRDIFTEAEDTVGGIASHITSLRSEV 236
Query: 240 MRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKLQREVERREALCRHLSESESSL 299
RLR LA ++ ++TEKME+Y K+E+ REEN+RLQRKL REVERREALCR LSESESSL
Sbjct: 237 RRLRQNLAKSEADYTEKMEQYAKEERQTREENIRLQRKLLREVERREALCRQLSESESSL 296
Query: 300 EMEEERHYNE 309
EM++ER+YNE
Sbjct: 297 EMDDERYYNE 306
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 64.7 bits (158), Expect = 3e-11
Identities = 64/291 (21%), Positives = 115/291 (39%), Gaps = 36/291 (12%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
E+L+++IE L+++ L+ L + +L L+EE + LR K +E + L
Sbjct: 680 EELEEKIEELEEKIAELEKALAELRKELEELEEELEQLR--------KELEELSRQISAL 731
Query: 88 MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMR 147
K + L+ E E L Q + +L+ ++ E + RLE+ E+ E
Sbjct: 732 RKDLARLEAEVEQLEERIAQLSK----ELTELEAEIEELEERLEEAEEELAE-----AEA 782
Query: 148 KIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLD 207
+IE+LEA+ + + L+ LR EL L E N +R++ LE + +L+
Sbjct: 783 EIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLR-ERLESLERRIAATERRLE 841
Query: 208 QPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNV 267
+ LS I SL +E+ L + + E + E+ +
Sbjct: 842 DLE------------EQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEAL 889
Query: 268 REENLRLQ------RKLQREVERREALCRHLSESESSLEMEEERHYNEIAH 312
L+ R+L+ + L E + LE+ E I +
Sbjct: 890 ALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDN 940
Score = 52.8 bits (127), Expect = 2e-07
Identities = 45/171 (26%), Positives = 78/171 (45%), Gaps = 13/171 (7%)
Query: 26 TREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISN 85
+ + IE L+ Q LK EL+ + L L+ E L + + N++ + E E I+
Sbjct: 776 ELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAA 835
Query: 86 T------LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIE---- 135
T L ++I+ L ++ E+LA E+ EE + +L +L L E+ LE+ +
Sbjct: 836 TERRLEDLEEQIEELSEDIESLAAEIEELEELI-EELESELEALLNERASLEEALALLRS 894
Query: 136 --QEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEAL 184
+E + +L K +L E + + L QL LE ++ QE L
Sbjct: 895 ELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERL 945
Score = 50.4 bits (121), Expect = 1e-06
Identities = 57/258 (22%), Positives = 98/258 (37%), Gaps = 27/258 (10%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
QL K + L+ + L+ LE + +L + E + L ++ + + E +
Sbjct: 749 IAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDE- 807
Query: 87 LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
L ++ L +E L E E R+L L E+ L + IE + +L
Sbjct: 808 LRAELTLLNEEAANLRERLES-LERRIAATERRLEDLEEQIEELSEDIE-SLAAEIEELE 865
Query: 147 RKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKL 206
IE+LE+E +A N L L + LE+ E L +L + +L E L+ KL
Sbjct: 866 ELIEELESELEALLNERASLEEALALLRSELEELSEEL-RELESKRSELRRELEELREKL 924
Query: 207 DQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLAN----TQQEHTEKMERYVK 262
Q L + L + L+ +L+ T +E +
Sbjct: 925 AQ-------------------LELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIED 965
Query: 263 DEKNVREENLRLQRKLQR 280
DE+ R RL+ K++
Sbjct: 966 DEEEARRRLKRLENKIKE 983
Score = 47.0 bits (112), Expect = 1e-05
Identities = 54/260 (20%), Positives = 105/260 (40%), Gaps = 37/260 (14%)
Query: 57 ALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDL 116
L E + L+ +++ +AE+ E + + LK E L L
Sbjct: 194 ILNELERQLK----SLERQAEKAERY---------KELKAELRELELAL----------L 230
Query: 117 SRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENT 176
+L +L EE L++ +++ + ++E+L AE + LE+LR E ELE
Sbjct: 231 VLRLEELREELEELQEELKEAE--------EELEELTAELQELEEKLEELRLEVSELEEE 282
Query: 177 LEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPS--SPHDMNNGDTASNLSTHIHS 234
+E+ Q+ L L + +LE +K+IL+ +L L+ +
Sbjct: 283 IEELQKEL-YALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAE 341
Query: 235 LRSEVMRLRNQLANTQQEHTEKMERYVKDE---KNVREENLRLQRKLQREVERREALCRH 291
L ++ L+ +L + + E E + E + + E+ L+ K+ + + +L
Sbjct: 342 LEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNE 401
Query: 292 LSESESSLEMEEERHYNEIA 311
+ E+ LE E+R
Sbjct: 402 IERLEARLERLEDRRERLQQ 421
Score = 46.2 bits (110), Expect = 2e-05
Identities = 50/192 (26%), Positives = 100/192 (52%), Gaps = 16/192 (8%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEE----NKNLRQTSVNIQAKAEQEEEF- 82
+ L++R+ +L++Q L+ +LE + +L L EE + L + +++ + EE
Sbjct: 305 QILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELE 364
Query: 83 -ISNTLMKKIQALKKEKETLAHHY---EQEEEFLTNDLSR---KLNQLTEEKYRLEQTIE 135
L +++ L+++ ETL E + L N++ R +L +L + + RL+Q IE
Sbjct: 365 AELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIE 424
Query: 136 QEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKL 195
+ +++ ++++L+AE + + LE+L+ E LE LE+ +E L + + +D
Sbjct: 425 ELLKKLEEA---ELKELQAELEELEEELEELQEELERLEEALEELREEL-EEAEQALDAA 480
Query: 196 EAEKRILQIKLD 207
E E LQ +LD
Sbjct: 481 ERELAQLQARLD 492
Score = 43.9 bits (104), Expect = 1e-04
Identities = 32/174 (18%), Positives = 77/174 (44%), Gaps = 8/174 (4%)
Query: 26 TREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISN 85
E+L+ ++E L+ + L EL + +L L+EE ++L + + + E
Sbjct: 317 QLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEA-----ELEELEAELEELE 371
Query: 86 TLMKKIQALKKEKETLAHHYEQEEEFLTNDLSR---KLNQLTEEKYRLEQTIEQEQERVV 142
+ +++++ + + E + L N++ R +L +L + + RL+Q IE+ +++
Sbjct: 372 SRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLE 431
Query: 143 NKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLE 196
+++++ E + + L++ E L +E E L +L
Sbjct: 432 EAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELA 485
Score = 37.0 bits (86), Expect = 0.021
Identities = 42/181 (23%), Positives = 72/181 (39%), Gaps = 11/181 (6%)
Query: 136 QEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQ------EQEALVNKLW 189
E+ R + +L KIE+LE + + L +LR+E ELE LEQ E ++ L
Sbjct: 673 LERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALR 732
Query: 190 KRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTA--SNLSTHIHSLRSEVMRLRNQLA 247
K + +LEAE L+ ++ Q + + + +E+ L Q+
Sbjct: 733 KDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIE 792
Query: 248 NTQQEHTEKMERYVKDEKNVREEN---LRLQRKLQREVERREALCRHLSESESSLEMEEE 304
++E E + + N L+ +L+ R A R L + E +E E
Sbjct: 793 QLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSE 852
Query: 305 R 305
Sbjct: 853 D 853
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 57.4 bits (139), Expect = 7e-09
Identities = 61/277 (22%), Positives = 121/277 (43%), Gaps = 31/277 (11%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
+L++ +E LQ++ + E+E K +L L+EE + L++ + ++ + E+ E IS
Sbjct: 249 SRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLR 308
Query: 88 MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLT------EEKYRLEQTIEQEQERV 141
+ + + +E E+ E L + +L EE +L +E+ +E +
Sbjct: 309 ERLEELENELEE-----LEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEEL 363
Query: 142 VNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRI 201
KL +E+LE +A + L +L E E+ N LE E + + L +R+++L
Sbjct: 364 EEKLSALLEELEELFEALREELAELEAELAEIRNELE-ELKREIESLEERLERLSERLED 422
Query: 202 LQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYV 261
L+ +L + L + L++E+ L +L +++ E +R
Sbjct: 423 LKEELKE-------------------LEAELEELQTELEELNEELEELEEQLEELRDRLK 463
Query: 262 KDEKNVREENLRLQRKLQREVERREALCRHLSESESS 298
+ E+ + E LQR + L R +E +S
Sbjct: 464 ELERELAELQEELQRLEKELSSLEARLDRLEAEQRAS 500
Score = 55.9 bits (135), Expect = 2e-08
Identities = 57/273 (20%), Positives = 128/273 (46%), Gaps = 19/273 (6%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
E+L++ +E L+++ L+ LE + +L +L+E L++ ++ K + +E +
Sbjct: 739 LEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEE- 797
Query: 87 LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVN--- 143
L ++++ ++ + L E E+ L +++ +L EE LE+ +++ +E +
Sbjct: 798 LEEELEEAERRLDALERELESLEQRR-ERLEQEIEELEEEIEELEEKLDELEEELEELEK 856
Query: 144 ---KLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWK------RMDK 194
+L ++E+LEAE + ++ L++L EK ELE L + + L + R+++
Sbjct: 857 ELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEE 916
Query: 195 LEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLR--NQLANTQQE 252
LEA+ L+++L + + ++ I L E+ L N A + E
Sbjct: 917 LEAKLERLEVELPELEEELEEEYEDTLETELER---EIERLEEEIEALGPVNLRAIEEYE 973
Query: 253 HTEKMERYVKDEKNVREENLRLQRKLQREVERR 285
E+ +K ++ EE ++ E+++
Sbjct: 974 EVEERYEELKSQREDLEEAKEKLLEVIEELDKE 1006
Score = 53.9 bits (130), Expect = 9e-08
Identities = 67/288 (23%), Positives = 124/288 (43%), Gaps = 25/288 (8%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
++L++ + L+ Q L+ EL+ +L+ E ++L ++ + E+ E +
Sbjct: 670 KELEEELAELEAQLEKLEEELK-------SLKNELRSLEDLLEELRRQLEELERQLEELK 722
Query: 88 MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMR 147
+ ++ ++ + E EEE +L +L +L E LE+ +E +E + KL
Sbjct: 723 RELAALEEELEQLQSRLEELEEEL--EELEEELEELQERLEELEEELESLEEA-LAKLKE 779
Query: 148 KIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEAL------VNKLWKRMDKLEAEKRI 201
+IE+LE + A Q LE+L E E E L+ + L +L + +++LE E
Sbjct: 780 EIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEE 839
Query: 202 LQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYV 261
L+ KLD+ + L + L +E L ++L ++E E E
Sbjct: 840 LEEKLDELEEELEELEKELE-----ELKEELEELEAEKEELEDELKELEEEKEELEEEL- 893
Query: 262 KDEKNVREENLRLQRKLQREVERREALCRHLSESESSLEMEEERHYNE 309
+ + E L+ ++++ ER E L L E L EE E
Sbjct: 894 ---RELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEE 938
Score = 45.5 bits (108), Expect = 4e-05
Identities = 57/282 (20%), Positives = 119/282 (42%), Gaps = 25/282 (8%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEE---NKNLRQTSVNIQAKAEQEEEFI 83
+E+ ++++E ++ L+ LE + QL L+ + + ++ ++
Sbjct: 174 KEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAK 233
Query: 84 SNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVN 143
L K+++ L++E L E+ +E L + +++ +L E L + +E+ QE +
Sbjct: 234 LKELRKELEELEEELSRLEEELEELQEEL-EEAEKEIEELKSELEELREELEELQEEL-L 291
Query: 144 KLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQ 203
+L +IE+LE E + LE+L E ELE LE+ +E + + +++ E L+
Sbjct: 292 ELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALK-EELEERETLLEELE 350
Query: 204 IKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKD 263
L + L L ++ L +L + E++ +
Sbjct: 351 QLLAE-------------------LEEAKEELEEKLSALLEELEELFEALREELAELEAE 391
Query: 264 EKNVREENLRLQRKLQREVERREALCRHLSESESSLEMEEER 305
+R E L+R+++ ER E L L + + L+ E
Sbjct: 392 LAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAE 433
Score = 45.1 bits (107), Expect = 5e-05
Identities = 55/241 (22%), Positives = 107/241 (44%), Gaps = 14/241 (5%)
Query: 73 QAKAEQEEEFISNTLMKKIQALKKEKETLAH---HYEQEEEFLTNDLSRKLNQLTEEKYR 129
++ ++ L +++ L+ + E L + E L + L QL E + +
Sbjct: 658 NKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQ 717
Query: 130 LEQTIEQEQ--ERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNK 187
LE+ + E + +L ++E+LE E + + LE+L+ ELE LE +EAL K
Sbjct: 718 LEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEAL-AK 776
Query: 188 LWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLA 247
L + +++LE +++ LQ +L++ + L + SL RL ++
Sbjct: 777 LKEEIEELEEKRQALQEELEELEEELEE-----AERRLDALERELESLEQRRERLEQEIE 831
Query: 248 NTQQ---EHTEKMERYVKDEKNVREENLRLQRKLQREVERREALCRHLSESESSLEMEEE 304
++ E EK++ ++ + + +E L+ +L+ +E L L E E E EE
Sbjct: 832 ELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEE 891
Query: 305 R 305
Sbjct: 892 E 892
Score = 40.9 bits (96), Expect = 0.001
Identities = 30/137 (21%), Positives = 64/137 (46%), Gaps = 9/137 (6%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKA-EQEEEFISN 85
+E+L++ + L+ + LK E+E + +L L+ + + L ++ + E+ E+ +
Sbjct: 886 KEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLET 945
Query: 86 TLMKKIQALKKEKETLA-------HHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQ 138
L ++I+ L++E E L YE+ EE +L + L E K +L + IE+
Sbjct: 946 ELEREIERLEEEIEALGPVNLRAIEEYEEVEERY-EELKSQREDLEEAKEKLLEVIEELD 1004
Query: 139 ERVVNKLMRKIEKLEAE 155
+ + +K+
Sbjct: 1005 KEKRERFKETFDKINEN 1021
Score = 35.8 bits (83), Expect = 0.037
Identities = 41/165 (24%), Positives = 72/165 (43%), Gaps = 8/165 (4%)
Query: 142 VNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQ-EQEALVNKLWKRMDKLEAEKR 200
V+K + E+ E + + + NLE+L ELE LE+ E++A +R +L+AE R
Sbjct: 167 VSKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKA---ERYQELKAELR 223
Query: 201 ILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERY 260
L++ L + + LS L L + +E ++E
Sbjct: 224 ELELAL---LLAKLKELRKELEELEEELSRLEEELEELQEELE-EAEKEIEELKSELEEL 279
Query: 261 VKDEKNVREENLRLQRKLQREVERREALCRHLSESESSLEMEEER 305
++ + ++EE L L+ +++ L L E E+ LE EER
Sbjct: 280 REELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEER 324
Score = 31.6 bits (72), Expect = 0.89
Identities = 27/130 (20%), Positives = 54/130 (41%), Gaps = 6/130 (4%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
+++ +E L+++ L+ LE +L L+EE K L +Q + E+ E +
Sbjct: 392 LAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEE- 450
Query: 87 LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
L ++++ L+ + E+E L +L R +L+ + RL+ +E EQ
Sbjct: 451 LEEQLEELRDRLK----ELERELAELQEELQRLEKELSSLEARLD-RLEAEQRASQGVRA 505
Query: 147 RKIEKLEAET 156
Sbjct: 506 VLEALESGLP 515
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 50.5 bits (121), Expect = 1e-06
Identities = 59/280 (21%), Positives = 123/280 (43%), Gaps = 26/280 (9%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
++L++R+E L++ +L +LQ E + + + + I +
Sbjct: 677 QRLRERLEGLKR--------------ELSSLQSELRRIENRLDELSQELSDASRKIG-EI 721
Query: 88 MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMR 147
K+I+ L++E+E L E+ EE L + L +++ + E LE IE E E ++KL
Sbjct: 722 EKEIEQLEQEEEKLKERLEELEEDL-SSLEQEIENVKSELKELEARIE-ELEEDLHKLEE 779
Query: 148 KIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLD 207
+ LEA + + +++ E +LE + EA + ++ +++++L EK L+ ++
Sbjct: 780 ALNDLEARLS--HSRIPEIQAELSKLEEEV-SRIEARLREIEQKLNRLTLEKEYLEKEIQ 836
Query: 208 QPVSDPSSPHDM--NNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEK 265
+ + + NL+ L E+ L L + ++ K+
Sbjct: 837 ELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALR----DLESRLGDLKKERD 892
Query: 266 NVREENLRLQRKLQREVERREALCRHLSESESSLEMEEER 305
+ + L+RK++ + E + LSE ++ LE EE
Sbjct: 893 ELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEE 932
Score = 49.7 bits (119), Expect = 2e-06
Identities = 39/186 (20%), Positives = 84/186 (45%), Gaps = 17/186 (9%)
Query: 26 TREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISN 85
+L+ I+ L + L+ E+E + + L EE L++ +++A+ E+ ++
Sbjct: 323 RLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDK-EFA 381
Query: 86 TLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKL 145
+++ +++ E L N+L R+L++L EE RL +E + +
Sbjct: 382 ETRDELKDYREKLEKLKREI--------NELKRELDRLQEELQRL----SEELADLNAAI 429
Query: 146 MRKIEK---LEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRIL 202
K LE E + K +++ + +L L + ++ L + L + D++E E L
Sbjct: 430 AGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYD-LKEEYDRVEKELSKL 488
Query: 203 QIKLDQ 208
Q +L +
Sbjct: 489 QRELAE 494
Score = 48.5 bits (116), Expect = 5e-06
Identities = 46/226 (20%), Positives = 99/226 (43%), Gaps = 13/226 (5%)
Query: 29 QLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLM 88
+++++I L+ + L+ + + +L +E L + I + EE L
Sbjct: 291 RVKEKIGELEAEIASLERSIAEKERELEDAEER---LAKLEAEIDKLLAEIEE-----LE 342
Query: 89 KKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRK 148
++I+ +K ++ L Y + +E L DL +L ++ +E ++ +E+ + KL R+
Sbjct: 343 REIEEERKRRDKLTEEYAELKEEL-EDLRAELEEVDKEFAETRDELKDYREK-LEKLKRE 400
Query: 149 IEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQ 208
I +L+ E D Q L++L E +L + EA +N+L + + E + + KL+Q
Sbjct: 401 INELKRELDRLQEELQRLSEELADLNAAIA-GIEAKINELEEEKEDKALEIKKQEWKLEQ 459
Query: 209 PVSDPSSPHDMNNGDTA--SNLSTHIHSLRSEVMRLRNQLANTQQE 252
+D S + + L+ E+ Q +++
Sbjct: 460 LAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEER 505
Score = 47.8 bits (114), Expect = 9e-06
Identities = 54/286 (18%), Positives = 112/286 (39%), Gaps = 31/286 (10%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
LQ + ++ + L EL ++ +++E + L Q ++ + E+ EE +S +
Sbjct: 690 LSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLS-S 748
Query: 87 LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
L ++I+ +K E + L E+ EE L++L E LE + + + +
Sbjct: 749 LEQEIENVKSELKELEARIEELEE--------DLHKLEEALNDLEARLSHSRIPEIQAEL 800
Query: 147 RKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKL----------WKRMDKLE 196
K+E+ + +A+ +EQ + LE+E + L + K ++ L
Sbjct: 801 SKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLN 860
Query: 197 AEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEK 256
+K L+ +L+ +L + + L+ E L QL +++ E
Sbjct: 861 GKKEELEEELE------------ELEAALRDLESRLGDLKKERDELEAQLRELERKIEEL 908
Query: 257 MERYVKDEKNVREENLRLQRKLQREVERREALCRHLSESESSLEME 302
+ K K + E +L+ + E + E L +E
Sbjct: 909 EAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLE 954
Score = 46.2 bits (110), Expect = 3e-05
Identities = 57/276 (20%), Positives = 111/276 (40%), Gaps = 49/276 (17%)
Query: 36 SLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALK 95
S + + L+ LE K +L +LQ E + + + + I + K+I+ L+
Sbjct: 671 SEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIG-EIEKEIEQLE 729
Query: 96 KEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAE 155
+E+E L E+ EE L + L +++ + E LE IE E E ++KL + LEA
Sbjct: 730 QEEEKLKERLEELEEDL-SSLEQEIENVKSELKELEARIE-ELEEDLHKLEEALNDLEAR 787
Query: 156 TDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSS 215
+ + +++ E +LE + EA + ++ +++++L EK L+
Sbjct: 788 LS--HSRIPEIQAELSKLEEEV-SRIEARLREIEQKLNRLTLEKEYLE------------ 832
Query: 216 PHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQ 275
E+ L+ Q + +++ ENL +
Sbjct: 833 ---------------------KEIQELQEQRIDLKEQ---------IKSIEKEIENLNGK 862
Query: 276 -RKLQREVERREALCRHLSESESSLEMEEERHYNEI 310
+L+ E+E EA R L L+ E + ++
Sbjct: 863 KEELEEELEELEAALRDLESRLGDLKKERDELEAQL 898
Score = 45.1 bits (107), Expect = 5e-05
Identities = 62/264 (23%), Positives = 121/264 (45%), Gaps = 37/264 (14%)
Query: 54 QLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLT 113
QL L+ E R+ + QA +++ E+ L+K+ +AL+++KE + EE L
Sbjct: 199 QLERLRRE----REKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELE 254
Query: 114 NDLSRKLNQLTEEKYRLEQTIEQEQERV-------VNKLMRKIEKLEAETDAKQNNLEQL 166
L+ ++++L + +EQ +E+ +++ ++ KI +LEAE + + ++ +
Sbjct: 255 -KLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEK 313
Query: 167 RREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTAS 226
RE + E L + EA ++KL +++LE E + + D+
Sbjct: 314 ERELEDAEERL-AKLEAEIDKLLAEIEELEREIEEERKRRDK------------------ 354
Query: 227 NLSTHIHSLRSEVMRLRNQLANTQQEHT----EKMERYVKDEKNVREENLRLQRKLQREV 282
L+ L+ E+ LR +L +E E + K EK RE N L+R+L R
Sbjct: 355 -LTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREIN-ELKRELDRLQ 412
Query: 283 ERREALCRHLSESESSLEMEEERH 306
E + L L++ +++ E +
Sbjct: 413 EELQRLSEELADLNAAIAGIEAKI 436
Score = 40.8 bits (96), Expect = 0.001
Identities = 32/141 (22%), Positives = 73/141 (51%), Gaps = 8/141 (5%)
Query: 26 TREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISN 85
E L+ +E + ++ + EL+ Y+ +L L+ E L++ +Q + ++ E +++
Sbjct: 365 ELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELAD 424
Query: 86 TLMK------KIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQE 139
KI L++EKE A +++E L L+ L++ +E Y L++ ++ ++
Sbjct: 425 LNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQ-LAADLSKYEQELYDLKEEYDRVEK 483
Query: 140 RVVNKLMRKIEKLEAETDAKQ 160
+ +KL R++ + EA+ A +
Sbjct: 484 EL-SKLQRELAEAEAQARASE 503
Score = 39.7 bits (93), Expect = 0.003
Identities = 40/170 (23%), Positives = 76/170 (44%), Gaps = 28/170 (16%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAE--------- 77
+ ++K IE+L + L+ ELE + LR L+ +L++ ++A+
Sbjct: 849 IKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEEL 908
Query: 78 ----QEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQT 133
+++ + L K++AL++E + ++EE +LS L + E R+E+
Sbjct: 909 EAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELS--LEDVQAELQRVEEE 966
Query: 134 IE----------QEQERVV---NKLMRKIEKLEAETDAKQNNLEQLRREK 170
I QE E V+ ++L K KLE E A +E+ ++K
Sbjct: 967 IRALEPVNMLAIQEYEEVLKRLDELKEKRAKLEEERKAILERIEEYEKKK 1016
Score = 37.0 bits (86), Expect = 0.019
Identities = 45/191 (23%), Positives = 79/191 (41%), Gaps = 32/191 (16%)
Query: 126 EKYRLEQTIEQEQE-----RVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQE 180
E+Y+ ++E E + L R+ E +E + + + LE+L E ELE LE E
Sbjct: 211 ERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLE-E 269
Query: 181 QEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVM 240
E L+ +L K++ L E+ L++K I L +E+
Sbjct: 270 IEQLLEELNKKIKDL-GEEEQLRVK------------------------EKIGELEAEIA 304
Query: 241 RLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKLQREVERREALCRHLSESESSLE 300
L +A ++E E E + + ++ L +L+RE+E L+E + L+
Sbjct: 305 SLERSIAEKERE-LEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELK 363
Query: 301 MEEERHYNEIA 311
E E E+
Sbjct: 364 EELEDLRAELE 374
Score = 34.7 bits (80), Expect = 0.10
Identities = 45/227 (19%), Positives = 89/227 (39%), Gaps = 27/227 (11%)
Query: 90 KIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKI 149
+ QAL KEK YE E L ++ L +K +E+ + E + KL +I
Sbjct: 212 RYQALLKEKR----EYEGYE------LLKEKEALERQKEAIERQL-ASLEEELEKLTEEI 260
Query: 150 EKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQP 209
+LE + + LE+L ++ +++ E+EQ + K+ + ++ + +R + K +
Sbjct: 261 SELEKRLEEIEQLLEELNKK---IKDLGEEEQLRVKEKIGELEAEIASLERSIAEKEREL 317
Query: 210 VSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVRE 269
D + L I L +E+ L ++ ++ + E Y ++E
Sbjct: 318 -------EDAEE--RLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEY----AELKE 364
Query: 270 ENLRLQRKLQREVERREALCRHLSESESSLEMEEERHYNEIAHSMRL 316
E L+ +L+ + L + LE + RL
Sbjct: 365 ELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRL 411
Score = 31.6 bits (72), Expect = 0.88
Identities = 38/184 (20%), Positives = 71/184 (38%), Gaps = 30/184 (16%)
Query: 147 RKIEKLEAETDAKQNNLEQLRREKVELENTLE-----------------QEQEALVNKLW 189
IE+L+ D K+ LE+LRRE+ + E + E +
Sbjct: 184 ENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIE 243
Query: 190 KRMDKLEAEKRILQIKLD----------QPVSDPSSPHDMNNGDTASNLSTHIHSLRSEV 239
+++ LE E L ++ Q + + + + + I L +E+
Sbjct: 244 RQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEI 303
Query: 240 MRLRNQLANTQQEHTEKMERYVKDE---KNVREENLRLQRKLQREVERREALCRHLSESE 296
L +A ++E + ER K E + E L+R+++ E +RR+ L +E +
Sbjct: 304 ASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELK 363
Query: 297 SSLE 300
LE
Sbjct: 364 EELE 367
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 47.4 bits (113), Expect = 9e-06
Identities = 58/286 (20%), Positives = 111/286 (38%), Gaps = 14/286 (4%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
+ +L + + ++ + EL + L L+E+ + L + I+ E+ E
Sbjct: 255 KARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGL---- 310
Query: 87 LMKKIQALKKEKETLAHHYEQEEEFLT--NDLSRKLNQLTEEKYRLEQTIEQEQERVVNK 144
+AL +E E L + EE L + KL EE + + + E + +
Sbjct: 311 -----RALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKE 365
Query: 145 LMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQI 204
L ++E+LE E + L+QL EL+ L E A + ++ + +++LE E L+
Sbjct: 366 LEERLEELEKELEKALERLKQLEEAIQELKEELA-ELSAALEEIQEELEELEKELEELER 424
Query: 205 KLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDE 264
+L++ + + N + I L + +EH +++ + E
Sbjct: 425 ELEELEEEIKKLEEQINQ--LESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELE 482
Query: 265 KNVREENLRLQRKLQREVERREALCRHLSESESSLEMEEERHYNEI 310
EE L +++ E E L + L E E L E
Sbjct: 483 LEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALK 528
Score = 47.4 bits (113), Expect = 9e-06
Identities = 62/283 (21%), Positives = 117/283 (41%), Gaps = 34/283 (12%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQE----ENKNLRQTSVNIQAKAEQEEEFI 83
E+ +K E L++ + K ++E + QL L E + L + ++ E +EE
Sbjct: 167 EKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQE 226
Query: 84 SNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVN 143
L ++I+AL++ L E+ EE + + +++E E ++
Sbjct: 227 EEELEQEIEALEERLAELEEEKERLEE-------------LKARLLEIESLELEALKIRE 273
Query: 144 KLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQ 203
+ +R++E+L E + K LE+L RE ELE LE + L + ++KL++ + L+
Sbjct: 274 EELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELE-ELLEKLKSLEERLE 332
Query: 204 IKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKD 263
++ L + + L E L L +E E++E K+
Sbjct: 333 KLEEK----------------LEKLESELEELAEEKNELAKLLEERLKELEERLEELEKE 376
Query: 264 EKNVREENLRLQRKLQREVERREALCRHLSESESSLEMEEERH 306
+ E +L+ +Q E L L E + LE E+
Sbjct: 377 LEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKEL 419
Score = 43.2 bits (102), Expect = 2e-04
Identities = 46/255 (18%), Positives = 104/255 (40%), Gaps = 23/255 (9%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
L + +E L ++ + L+ LE + +L L+ E + L +
Sbjct: 310 LRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKN---------------- 353
Query: 87 LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
+ + L++ + L E+ E+ L L R EE + + E + ++
Sbjct: 354 --ELAKLLEERLKELEERLEELEKELEKALERLKQ--LEEAIQELKEELAELSAALEEIQ 409
Query: 147 RKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQ--EALVNKLWKRMDKLEAEKRILQI 204
++E+LE E + + LE+L E +LE + Q + E ++ +L +K + L
Sbjct: 410 EELEELEKELEELERELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCPVCGQELPE 469
Query: 205 KLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDE 264
+ ++ + + ++ + + LR E+ L +L ++E E +E +
Sbjct: 470 EHEKELLELYE-LELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALK 528
Query: 265 KNVREENLRLQRKLQ 279
+ + E+ +L+ L+
Sbjct: 529 EELEEKLEKLENLLE 543
Score = 42.4 bits (100), Expect = 4e-04
Identities = 56/292 (19%), Positives = 113/292 (38%), Gaps = 16/292 (5%)
Query: 26 TREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQA------KAEQE 79
E+ +R+E L+++ L+ ELE + L L+E + L+ ++ K E E
Sbjct: 285 ELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESE 344
Query: 80 EEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQE 139
E ++ + + L++ + L E+ E+ L L R E E ++E
Sbjct: 345 LEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQ-------LEEAIQELKEE 397
Query: 140 RVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEK 199
+L +E+++ E + + LE+L RE ELE +++ +E + K + E
Sbjct: 398 LA--ELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQLESKELMIAELAG 455
Query: 200 RILQIKL-DQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKME 258
+ + Q + + + + S E LR ++ ++E E E
Sbjct: 456 AGEKCPVCGQELPEEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEE 515
Query: 259 RYVKDEKNVREENLRLQRKLQREVERREALCRHLSESESSLEMEEERHYNEI 310
++ + L+ KL++ E L + + EE R +
Sbjct: 516 ELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDR 567
Score = 40.9 bits (96), Expect = 0.001
Identities = 65/290 (22%), Positives = 122/290 (42%), Gaps = 13/290 (4%)
Query: 26 TREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISN 85
+QL++ I+ L+++ L LE + +L L++E + L + ++ + ++ EE I+
Sbjct: 383 RLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQ 442
Query: 86 TLMKK--IQALKKEKET----LAHHYEQEEEFLTNDLSRKLNQLTEE--KYRLEQTIEQE 137
K+ I L E E+ E+ L +L +L EE + + E + +E
Sbjct: 443 LESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSREKEEAELREE 502
Query: 138 QERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEA 197
E + +L R++E+ E + L++ EK+E L +E E L KL +L+
Sbjct: 503 IEELEKEL-RELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQL--QQLKE 559
Query: 198 EKRILQIKLDQPVSDPSSPHDMNN-GDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEK 256
E R L+ +L + + + L + L+ ++ L +L+ ++
Sbjct: 560 ELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSL 619
Query: 257 MERYVKDEKNVREENLR-LQRKLQREVERREALCRHLSESESSLEMEEER 305
++E EE L KL + E E L L E E +E E
Sbjct: 620 ELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAE 669
Score = 39.7 bits (93), Expect = 0.002
Identities = 63/280 (22%), Positives = 122/280 (43%), Gaps = 14/280 (5%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQE--ENKNLRQTSVNIQAKAEQEEEFIS 84
E+L+ ++ L + L LE +L+ L+E E + + I+A E+ E
Sbjct: 187 IEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEE 246
Query: 85 --NTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIE--QEQER 140
L + L + + + EE L +L R L +L E+ RLE+ +E E
Sbjct: 247 EKERLEELKARLLEIESLELEALKIREEEL-RELERLLEELEEKIERLEELEREIEELEE 305
Query: 141 VVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKR 200
+ L +E+LE + ++ E+L + + +LE LE E E L + + LE +
Sbjct: 306 ELEGLRALLEELEELLEKLKSLEERLEKLEEKLE-KLESELEELAEEKNELAKLLEERLK 364
Query: 201 ILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERY 260
L+ +L++ + + L I L+ E+ L L Q+E E++E+
Sbjct: 365 ELEERLEELEKELE-----KALERLKQLEEAIQELKEELAELSAALEEIQEE-LEELEKE 418
Query: 261 VKDEKNVREENLRLQRKLQREVERREALCRHLSESESSLE 300
+++ + EE +KL+ ++ + E+ ++E + E
Sbjct: 419 LEELERELEELEEEIKKLEEQINQLESKELMIAELAGAGE 458
Score = 36.3 bits (84), Expect = 0.026
Identities = 29/157 (18%), Positives = 69/157 (43%), Gaps = 1/157 (0%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEE-EFISNT 86
++ +E L+++ + LK +L+ + +L L+E ++L + + + +EE E
Sbjct: 582 RTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELEK 641
Query: 87 LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
L + + + + L E+ EE EE+ + ++ E + +L
Sbjct: 642 LNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLR 701
Query: 147 RKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEA 183
++E+L + + +E+L K ELE ++ ++
Sbjct: 702 EELEELLKKLGEIEQLIEELESRKAELEELKKELEKL 738
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 45.5 bits (109), Expect = 3e-05
Identities = 35/125 (28%), Positives = 66/125 (52%), Gaps = 15/125 (12%)
Query: 74 AKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEF--LTNDLSRKLNQLTEEKYRLE 131
+AE+E + I K+ +A+KKE E +EE L N+ ++L + E +LE
Sbjct: 34 KEAEEEAKRILEEAKKEAEAIKKEALL-----EAKEEIHKLRNEFEKELRERRNELQKLE 88
Query: 132 QTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKR 191
+ + Q++E L RK+E LE ++ LE+ +E + + LE+++E L + ++
Sbjct: 89 KRLLQKEEN----LDRKLELLEK----REEELEKKEKELEQKQQELEKKEEELEELIEEQ 140
Query: 192 MDKLE 196
+ +LE
Sbjct: 141 LQELE 145
Score = 32.8 bits (76), Expect = 0.30
Identities = 21/94 (22%), Positives = 46/94 (48%), Gaps = 7/94 (7%)
Query: 122 QLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREK----VELENTL 177
++ E + ++ +E+ ++ + + + E +N E+ RE+ +LE L
Sbjct: 32 KIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRL 91
Query: 178 EQEQEALVNK---LWKRMDKLEAEKRILQIKLDQ 208
Q++E L K L KR ++LE +++ L+ K +
Sbjct: 92 LQKEENLDRKLELLEKREEELEKKEKELEQKQQE 125
Score = 32.8 bits (76), Expect = 0.32
Identities = 43/139 (30%), Positives = 70/139 (50%), Gaps = 16/139 (11%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
+E++ K +++ R + EL+ KL+ R LQ+E NL + ++ K E+E E
Sbjct: 63 KEEIHKLRNEFEKELRERRNELQ--KLEKRLLQKEE-NLDRKLELLE-KREEELEKKEKE 118
Query: 87 LMKKIQAL-KKEKETLAHHYEQEEEFLTNDLSRKLNQLTEE--KYRLEQTIEQEQERVVN 143
L +K Q L KKE+E EQ +E +++ LT E K L + +E+E
Sbjct: 119 LEQKQQELEKKEEELEELIEEQLQELE------RISGLTAEEAKEILLEKVEEEARHEAA 172
Query: 144 KLMRKIE---KLEAETDAK 159
L+++IE K EA+ AK
Sbjct: 173 VLIKEIEEEAKEEADKKAK 191
Score = 30.5 bits (70), Expect = 1.8
Identities = 13/56 (23%), Positives = 26/56 (46%), Gaps = 4/56 (7%)
Query: 251 QEHTEKMERYVKD-EKNVREENLRLQRKLQREVERREALCRHLSESESSLEMEEER 305
+E ++ E K+ +EE +L+ + ++E+ R L + E L +EE
Sbjct: 45 EEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERR---NELQKLEKRLLQKEEN 97
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 45.1 bits (107), Expect = 6e-05
Identities = 57/275 (20%), Positives = 121/275 (44%), Gaps = 29/275 (10%)
Query: 47 ELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYE 106
ELE K ++ L++E ++L + ++ K + EE I L K+I+ L+++ + L E
Sbjct: 232 ELEELKEEIEELEKELESLEGSKRKLEEKIRELEERI-EELKKEIEELEEKVKELKELKE 290
Query: 107 QEEEFLTNDLSRKLNQLTEEKYRLEQTIE--QEQERVVNKLMRKIEKLEAETDAKQNNLE 164
+ EE++ LS + +E +E+ + +E+ + + ++++E+ E + + L+
Sbjct: 291 KAEEYIK--LSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLK 348
Query: 165 QLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDT 224
+L + ELE E +EA K + +++L+ L + + + +
Sbjct: 349 ELEKRLEELEERHELYEEAKAKK--EELERLKKRLTGLTPEKLEKELEELEKAKEEIEEE 406
Query: 225 ASNLSTHIHSLRSEVMRLRNQLAN---------------TQQEHTEKMERYVKDEKNVRE 269
S ++ I L+ E+ L+ + T++ E +E Y + K + +
Sbjct: 407 ISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEEYTAELKRIEK 466
Query: 270 ENLRLQRKLQREVERREALCRHLSESESSLEMEEE 304
E L+ E+ L + L E E L+ E E
Sbjct: 467 E-------LKEIEEKERKLRKELRELEKVLKKESE 494
Score = 38.9 bits (91), Expect = 0.005
Identities = 53/262 (20%), Positives = 116/262 (44%), Gaps = 22/262 (8%)
Query: 47 ELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISN-TLMKKIQALKKEKETLAHHY 105
EL+ + +L+ ++E+ + LR+ ++ ++E E I L ++++ L+++ +
Sbjct: 460 ELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEE 519
Query: 106 EQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQE--RVVNKLMRKIEKLEAETDAKQNNL 163
+++ L KL +L E L++ +E+ +E + + +L +K+++LE E L
Sbjct: 520 LEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKEL 579
Query: 164 EQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGD 223
E+L E VE +E E N+ + + E E + +L + +
Sbjct: 580 EELGFESVEELEERLKELEPFYNE-YLELKDAEKELEREEKELKK------------LEE 626
Query: 224 TASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKLQREVE 283
+ + LR +L +++++E+ + + +REE L L R+L
Sbjct: 627 ELDKAFEELAETEKRLEELRKELEELEKKYSEE------EYEELREEYLELSRELAGLRA 680
Query: 284 RREALCRHLSESESSLEMEEER 305
E L + E + +LE +E
Sbjct: 681 ELEELEKRREEIKKTLEKLKEE 702
Score = 36.2 bits (84), Expect = 0.028
Identities = 34/179 (18%), Positives = 82/179 (45%), Gaps = 30/179 (16%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
E+ ++R+E L+++ + L+ LE + + +E + ++ ++
Sbjct: 333 LEEKEERLEELKKKLKELEKRLEELEERHELYEE----AKAKKEELERLKKRLTGLTPEK 388
Query: 87 LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTI------------ 134
L K+++ L+K KE + EEE + ++ ++ +L +E L++ I
Sbjct: 389 LEKELEELEKAKEEI------EEEI--SKITARIGELKKEIKELKKAIEELKKAKGKCPV 440
Query: 135 ------EQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNK 187
E+ ++ ++ + +++++E E + +LR+E ELE L++E E + K
Sbjct: 441 CGRELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLK 499
Score = 33.5 bits (77), Expect = 0.20
Identities = 63/275 (22%), Positives = 119/275 (43%), Gaps = 34/275 (12%)
Query: 32 KRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKI 91
+ +E + ++ L E + QL+ L+E+ K N++ ++ EE+ L +K+
Sbjct: 483 RELEKVLKKESELIKLKELAE-QLKELEEKLK-----KYNLEELEKKAEEY--EKLKEKL 534
Query: 92 QALKKEKETLAHHYEQEEEFLTN--DLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKI 149
LK E ++L E+ EE +L +KL++L EE L + +E+ V +L ++
Sbjct: 535 IKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERL 594
Query: 150 EKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQP 209
++LE + L+ +E E L++ +E L + + + E L+ +L++
Sbjct: 595 KELE-PFYNEYLELKDAEKELEREEKELKKLEEELDKAF-EELAETEKRLEELRKELEEL 652
Query: 210 VSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVRE 269
S + L L E+ LR +L ++ + RE
Sbjct: 653 EKKYSE-------EEYEELREEYLELSRELAGLRAEL---------------EELEKRRE 690
Query: 270 ENLRLQRKLQREVERREALCRHLSESESSLEMEEE 304
E + KL+ E+E RE + L + E +LE EE
Sbjct: 691 EIKKTLEKLKEELEEREKAKKELEKLEKALERVEE 725
Score = 31.2 bits (71), Expect = 1.1
Identities = 45/222 (20%), Positives = 87/222 (39%), Gaps = 46/222 (20%)
Query: 85 NTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNK 144
L + I+ +K+ E L ++ E + E++ LE+ + R +N+
Sbjct: 165 KNLGEVIKEIKRRIERLEKFIKRTENIE--------ELIKEKEKELEEVL-----REINE 211
Query: 145 LMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQI 204
+ ++ +L E LE+L +E ELE E + +L K ++ LE KR L+
Sbjct: 212 ISSELPELREE-------LEKLEKEVKELEELKE-----EIEELEKELESLEGSKRKLEE 259
Query: 205 KLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQ--QEHTEKMERYVK 262
K+ L I L+ E+ L ++ + +E E+ + +
Sbjct: 260 KI-------------------RELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSE 300
Query: 263 DEKNVREENLRLQRKLQREVERREALCRHLSESESSLEMEEE 304
+ +E ++++L R E + + E E E EE
Sbjct: 301 FYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEE 342
Score = 31.2 bits (71), Expect = 1.3
Identities = 41/163 (25%), Positives = 77/163 (47%), Gaps = 10/163 (6%)
Query: 53 LQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT--LMKKIQALKKEKETLAHHYEQEEE 110
L L + KNL + I+ + E+ E+FI T + + I+ +KE E + +
Sbjct: 155 LGLDDYENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISS 214
Query: 111 FLTNDLSRKLNQLTEEKYRLEQTIEQ--EQERVVNKLMRKIEKLEAETDAKQNNLEQLRR 168
L +L +L +L +E LE+ E+ E E+ + L KLE + + +E+L++
Sbjct: 215 ELP-ELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKK 273
Query: 169 EKVELENTLE-----QEQEALVNKLWKRMDKLEAEKRILQIKL 206
E ELE ++ +E+ KL + ++ E R ++ +L
Sbjct: 274 EIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRL 316
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 40.8 bits (96), Expect = 4e-04
Identities = 31/93 (33%), Positives = 49/93 (52%), Gaps = 9/93 (9%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
+LQ+++ LQQ+N+ LK EL T + +L LQ+E ++Q S N E+ E
Sbjct: 82 LAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEENRE----- 136
Query: 87 LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRK 119
L +++ LK+E E L E E E L + R+
Sbjct: 137 LREELAELKQENEAL----EAENERLQENEQRR 165
Score = 38.1 bits (89), Expect = 0.004
Identities = 27/127 (21%), Positives = 50/127 (39%), Gaps = 21/127 (16%)
Query: 39 QQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEK 98
+ L + +L LQEE L++ +Q + L +++ L+ E
Sbjct: 59 SDTPSARERLPELQQELAELQEELAELQEQLAELQQEN--------QELKQELSTLEAEL 110
Query: 99 ETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDA 158
E L + ++ LS +L EE L + + + L ++ E LEAE +
Sbjct: 111 ERLQKELARIKQ-----LSANAIELDEENRELREELAE--------LKQENEALEAENER 157
Query: 159 KQNNLEQ 165
Q N ++
Sbjct: 158 LQENEQR 164
Score = 31.9 bits (73), Expect = 0.38
Identities = 26/130 (20%), Positives = 48/130 (36%), Gaps = 28/130 (21%)
Query: 52 KLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEF 111
+ +L LQ+E L++ +Q + + L ++ Q LK+E TL
Sbjct: 65 RERLPELQQELAELQEELAELQEQLAE--------LQQENQELKQELSTL---------- 106
Query: 112 LTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKV 171
+L +L +E R++Q E L + +L E + E L E
Sbjct: 107 -----EAELERLQKELARIKQLSANAIE-----LDEENRELREELAELKQENEALEAENE 156
Query: 172 ELENTLEQEQ 181
L+ ++
Sbjct: 157 RLQENEQRRW 166
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 42.1 bits (99), Expect = 5e-04
Identities = 35/140 (25%), Positives = 68/140 (48%), Gaps = 8/140 (5%)
Query: 69 SVNIQAK---AEQEEEFISNTLMKKIQALKKEKETLAHHYE---QEEEFLTNDLSRKLNQ 122
+ QAK E +EE N L++K+ AL+KE E H E +E+E L +L +++ +
Sbjct: 497 FIIEQAKTFYGEFKEEI--NVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEE 554
Query: 123 LTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQE 182
L E + + +E+E + + L +++E + E K+ + + + +L E +Q+
Sbjct: 555 LKERERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKLKETKQK 614
Query: 183 ALVNKLWKRMDKLEAEKRIL 202
+ DK+ + RI
Sbjct: 615 IPQKPTNFQADKIGDKVRIR 634
Score = 29.8 bits (67), Expect = 3.4
Identities = 16/76 (21%), Positives = 36/76 (47%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
E++ IE L + L+ + E + L+ ++ K L Q ++ + ++ +
Sbjct: 511 EEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEA 570
Query: 88 MKKIQALKKEKETLAH 103
+ ++ALKKE E++
Sbjct: 571 QEALKALKKEVESIIR 586
Score = 29.4 bits (66), Expect = 4.3
Identities = 21/72 (29%), Positives = 41/72 (56%), Gaps = 11/72 (15%)
Query: 137 EQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLE 196
E + +N L+ K+ LE E + K +LE+L +E+ +L+ LEQE M++L+
Sbjct: 508 EFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQE-----------MEELK 556
Query: 197 AEKRILQIKLDQ 208
+R +++L++
Sbjct: 557 ERERNKKLELEK 568
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 41.4 bits (98), Expect = 6e-04
Identities = 33/123 (26%), Positives = 56/123 (45%), Gaps = 9/123 (7%)
Query: 66 RQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTE 125
++ + AE+ + I K+ + LKKE A ++E L +L R+L +
Sbjct: 20 KRIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLEA---KEEVHKLRAELERELKERRN 76
Query: 126 EKYRLEQTIEQEQERV------VNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQ 179
E RLE+ + Q +E + ++K +EK E E K+ NL++ E EL +
Sbjct: 77 ELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQRE 136
Query: 180 EQE 182
E E
Sbjct: 137 ELE 139
Score = 30.7 bits (70), Expect = 1.4
Identities = 35/138 (25%), Positives = 61/138 (44%), Gaps = 32/138 (23%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
R +LQ+ L Q+ L ++E+ + L+++ K L N+ K E+ EE I+
Sbjct: 75 RNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIA-- 132
Query: 87 LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYR--LEQTIEQEQERVVNK 144
EQ EE +++ LT+E+ + L + +E+E K
Sbjct: 133 -------------------EQREEL------ERISGLTQEEAKEILLEEVEEEARHEAAK 167
Query: 145 LMRKIE---KLEAETDAK 159
L+++IE K EA+ AK
Sbjct: 168 LIKEIEEEAKEEADKKAK 185
Score = 30.7 bits (70), Expect = 1.5
Identities = 19/73 (26%), Positives = 37/73 (50%), Gaps = 6/73 (8%)
Query: 236 RSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKLQREVERREALCR---HL 292
+ EV +LR +L ++E E+ + E+ + + L RK++ ++ E L + L
Sbjct: 57 KEEVHKLRAEL---ERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKEL 113
Query: 293 SESESSLEMEEER 305
S E +L+ +EE
Sbjct: 114 SNKEKNLDEKEEE 126
Score = 29.9 bits (68), Expect = 2.7
Identities = 22/67 (32%), Positives = 34/67 (50%), Gaps = 4/67 (5%)
Query: 236 RSEVMRLRNQLANTQQEHT--EKMERYVKDEKNVREENLRLQRKLQREVERREALCRHLS 293
R+E+ RL +L Q+E T KME K E+N+ ++ L K + E+ E L ++
Sbjct: 75 RNELQRLERRLL--QREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIA 132
Query: 294 ESESSLE 300
E LE
Sbjct: 133 EQREELE 139
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 41.0 bits (97), Expect = 0.001
Identities = 31/139 (22%), Positives = 57/139 (41%), Gaps = 11/139 (7%)
Query: 71 NIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRL 130
I E+ E I++ + + +K +E A +E E L +L K +L EE
Sbjct: 510 LIGEDKEKLNELIASLEELERELEQKAEE--AEALLKEAEKLKEELEEKKEKLQEE---- 563
Query: 131 EQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWK 190
E + +E E+ + I++ + E D L QL++ E +L K
Sbjct: 564 EDKLLEEAEKEAQQ---AIKEAKKEADEIIKELRQLQKGGYASVK--AHELIEARKRLNK 618
Query: 191 RMDKLEAEKRILQIKLDQP 209
+K E +K+ + K ++
Sbjct: 619 ANEKKEKKKKKQKEKQEEL 637
Score = 32.9 bits (76), Expect = 0.36
Identities = 27/127 (21%), Positives = 53/127 (41%), Gaps = 22/127 (17%)
Query: 28 EQLQKRIESLQQQNR----------VLKVELETYKLQLRALQEENKNLRQTSVNIQAKAE 77
E+L + I SL++ R L E E K +L +E+ + + E
Sbjct: 516 EKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEED-----KLLEE 570
Query: 78 QEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQE 137
E+E + I+ KKE + + Q ++ S K ++L E + RL + E++
Sbjct: 571 AEKEA-----QQAIKEAKKEADEIIKELRQLQK--GGYASVKAHELIEARKRLNKANEKK 623
Query: 138 QERVVNK 144
+++ +
Sbjct: 624 EKKKKKQ 630
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 40.3 bits (95), Expect = 0.002
Identities = 37/184 (20%), Positives = 68/184 (36%), Gaps = 30/184 (16%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
+L+ I+ L+Q+ L+ + L L L K + + ++ EE +
Sbjct: 109 ISELENEIKELEQEIERLEP-WGNFDLDLSLLLG-FKYVSVFVGTV--PEDKLEELKLES 164
Query: 87 LMKKIQALKKEKETLA--------HHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQ 138
++ ++ + +K + E EEE R +L EE + I + +
Sbjct: 165 DVENVEYISTDKGYVYVVVVVLKELSDEVEEELKKLGFER--LELEEEG-TPSELIREIK 221
Query: 139 ERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAE 198
E +E++E E E L E EL +E AL L +++ EA
Sbjct: 222 EE--------LEEIEKE-------RESLLEELKELAKKYLEELLALYEYLEIELERAEAL 266
Query: 199 KRIL 202
+ L
Sbjct: 267 SKFL 270
Score = 31.4 bits (72), Expect = 0.79
Identities = 29/128 (22%), Positives = 56/128 (43%), Gaps = 10/128 (7%)
Query: 41 NRVLKVELETYKLQ-LRALQEENKNLRQTSVNIQAKAEQEE-EFISNTLMKKIQALKKEK 98
+VL V L++YK + L AL E V+I+ E+ E + L + L +
Sbjct: 7 KKVLIVTLKSYKDEVLEALHELG------VVHIEDLKEELSNERLRK-LRSLLTKLSEAL 59
Query: 99 ETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDA 158
+ L Y + L + + + EE + + ++ E+ + +L +I +LE E
Sbjct: 60 DKL-RSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKE 118
Query: 159 KQNNLEQL 166
+ +E+L
Sbjct: 119 LEQEIERL 126
Score = 31.4 bits (72), Expect = 0.84
Identities = 30/129 (23%), Positives = 57/129 (44%), Gaps = 17/129 (13%)
Query: 88 MKKIQ--ALKKEKETL--AHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVN 143
MKK+ LK K+ + A H DL +L+ K R T + ++
Sbjct: 6 MKKVLIVTLKSYKDEVLEALHELGVVHI--EDLKEELSNERLRKLRSLLT---KLSEALD 60
Query: 144 KLMRKIEKLEAETDAKQ----NNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEK 199
KL + KL + K+ +LE+L ++ E +E+E + L ++ +LE E
Sbjct: 61 KLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEI----SELENEI 116
Query: 200 RILQIKLDQ 208
+ L+ ++++
Sbjct: 117 KELEQEIER 125
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 40.4 bits (95), Expect = 0.002
Identities = 50/268 (18%), Positives = 102/268 (38%), Gaps = 20/268 (7%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLR-ALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
++R+ Q++ + K L L L+E L Q AK + +
Sbjct: 273 VADEERLAEEQEERQEAKNRLRQQLRTLEDQLKEARDELNQELSAANAKLAADRSELELL 332
Query: 87 LMKKIQALKKEKETLAHHYEQEEEFLT--NDLSRKLNQLTEEKYRLEQTIEQEQERVVNK 144
+K + E L +Q + ++ +L+ LT + +++ E+ ++++ +
Sbjct: 333 EDQKGAFEDADIEQLQADLDQLPSIRSELEEVEARLDALTGKHQDVQRKYERLKQKIKEQ 392
Query: 145 LMRKIEKLEAETDAKQNNLEQLRREKVE----LENTLEQEQEALVNKLWKRMDKLEAEKR 200
L R +EK A + ++ + E LE+ L Q+ EA + + +LE
Sbjct: 393 LERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLEAGKLEFNEEEYELELRLG 452
Query: 201 ILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERY 260
L+ +LD + P + D A + + A E + R
Sbjct: 453 RLKQRLDSATATPEELEQLEINDEALEK-----------AQEEQEQAEANVEQLQSELRQ 501
Query: 261 VKDEKNVREENLRLQRKLQREVERREAL 288
++ ++ E LQR +R ++ R+AL
Sbjct: 502 LRKRRD--EALEALQRAERRLLQLRQAL 527
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 39.7 bits (93), Expect = 0.002
Identities = 24/84 (28%), Positives = 44/84 (52%), Gaps = 2/84 (2%)
Query: 125 EEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEAL 184
E E+ E+ + KL +E+LE E + LE+L+RE +LE+ LE+ + +
Sbjct: 407 EGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRREV 466
Query: 185 VNKLWKRMDKLEAEKRI--LQIKL 206
+K+ K + ++RI L+ +L
Sbjct: 467 RDKVRKDREIRARDRRIERLEKEL 490
Score = 36.6 bits (85), Expect = 0.024
Identities = 30/150 (20%), Positives = 63/150 (42%), Gaps = 27/150 (18%)
Query: 153 EAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSD 212
EA + K+ + + E E E + KL + +++LE E L+ +L++
Sbjct: 392 EALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEE---- 447
Query: 213 PSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENL 272
L+ E+ +L ++L ++E +K R ++ +
Sbjct: 448 ----------------------LKREIEKLESELERFRREVRDK-VRKDREIRARDRRIE 484
Query: 273 RLQRKLQREVERREALCRHLSESESSLEME 302
RL+++L+ + +R E L R L+E ++E
Sbjct: 485 RLEKELEEKKKRVEELERKLAELRKMRKLE 514
Score = 35.1 bits (81), Expect = 0.075
Identities = 35/142 (24%), Positives = 68/142 (47%), Gaps = 15/142 (10%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
++L++ +E L+++N LK ELE K ++ L+ E + R E
Sbjct: 425 KKLEETVERLEEENSELKRELEELKREIEKLESELERFR------------REVRDKVRK 472
Query: 88 MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKL-M 146
++I+A + E L E++++ +L RKL +L + +LE + + +VV KL +
Sbjct: 473 DREIRARDRRIERLEKELEEKKK-RVEELERKLAEL-RKMRKLELSGKGTPVKVVEKLTL 530
Query: 147 RKIEKLEAETDAKQNNLEQLRR 168
IE+ E E K+ ++ +
Sbjct: 531 EAIEEAEEEYGIKEGDVILVED 552
Score = 34.7 bits (80), Expect = 0.086
Identities = 26/112 (23%), Positives = 47/112 (41%), Gaps = 21/112 (18%)
Query: 42 RVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETL 101
V + ++ + + L+EEN L K E EE L ++I+ L+ E E
Sbjct: 418 TVYEKRIKKLEETVERLEEENSEL---------KRELEE------LKREIEKLESELERF 462
Query: 102 AHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLE 153
+ R++ RLE+ +E E+++ V +L RK+ +L
Sbjct: 463 RREVRDKVR-----KDREIRARDRRIERLEKELE-EKKKRVEELERKLAELR 508
Score = 32.4 bits (74), Expect = 0.45
Identities = 18/67 (26%), Positives = 35/67 (52%), Gaps = 2/67 (2%)
Query: 115 DLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM--RKIEKLEAETDAKQNNLEQLRREKVE 172
+L R+L +L E +LE +E+ + V +K+ R+I + + + LE+ ++ E
Sbjct: 440 ELKRELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLEKELEEKKKRVEE 499
Query: 173 LENTLEQ 179
LE L +
Sbjct: 500 LERKLAE 506
>gnl|CDD|220402 pfam09787, Golgin_A5, Golgin subfamily A member 5. Members of this
family of proteins are involved in maintaining Golgi
structure. They stimulate the formation of Golgi stacks
and ribbons, and are involved in intra-Golgi retrograde
transport. Two main interactions have been
characterized: one with RAB1A that has been activated by
GTP-binding and another with isoform CASP of CUTL1.
Length = 509
Score = 39.0 bits (91), Expect = 0.003
Identities = 44/270 (16%), Positives = 95/270 (35%), Gaps = 19/270 (7%)
Query: 32 KRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKI 91
+ L+ + LK+ L+ +LR L+ E ++LR + E + +
Sbjct: 100 QSSSDLEDELAALKIRLQEAAQELRELKSELEDLRSERSRDLSDEESIKRLQRGAVRSLQ 159
Query: 92 QALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEK 151
L + H + L R L E E Q R ++K +++ +
Sbjct: 160 DKLLLLDAAVKRHDGNVITAVLIFLKRTLK---LEILFQELAERQTALRHLSKHLQEEGE 216
Query: 152 LEAETDAKQNNLEQLRREKVELENTLE---QEQEALVNKLWKRMDKLEAEKRILQIKLDQ 208
L+ + E ++E + Q +E +N L + + QI+L
Sbjct: 217 LQQLLKLLRAEGESEKQELQQYRQKAHRILQSKEKRINFLKEGCLFEGLDSSTAQIEL-- 274
Query: 209 PVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVR 268
H+ + ++ I L ++++LR++ + + E + + E + K + +
Sbjct: 275 ----EELKHE------SEHVQEEITKLEGQIIQLRSEAQDREAEASGEAESFRKQPRELS 324
Query: 269 EENLRLQRKLQREVERREALCRHLSESESS 298
++ Q E+E H E +
Sbjct: 325 QQIAPQITTKQ-ELELYYQELYHYREDLGN 353
>gnl|CDD|218661 pfam05622, HOOK, HOOK protein. This family consists of several
HOOK1, 2 and 3 proteins from different eukaryotic
organisms. The different members of the human gene
family are HOOK1, HOOK2 and HOOK3. Different domains
have been identified in the three human HOOK proteins,
and it was demonstrated that the highly conserved
NH2-domain mediates attachment to microtubules, whereas
the central coiled-coil motif mediates homodimerisation
and the more divergent C-terminal domains are involved
in binding to specific organelles (organelle-binding
domains). It has been demonstrated that endogenous HOOK3
binds to Golgi membranes, whereas both HOOK1 and HOOK2
are localised to discrete but unidentified cellular
structures. In mice the Hook1 gene is predominantly
expressed in the testis. Hook1 function is necessary for
the correct positioning of microtubular structures
within the haploid germ cell. Disruption of Hook1
function in mice causes abnormal sperm head shape and
fragile attachment of the flagellum to the sperm head.
Length = 713
Score = 39.0 bits (91), Expect = 0.005
Identities = 67/288 (23%), Positives = 131/288 (45%), Gaps = 38/288 (13%)
Query: 21 PPSPVTRE--QLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQ 78
P +++ LQ ++E LQ++N L+ + Y+++ L++E L+ + + + A +
Sbjct: 233 PNQLGSKKYNLLQSQLEQLQEENFRLEAARDDYRIKCEELEKELAELQHRNDELTSLAAE 292
Query: 79 EEEFISNTLMKKIQALK------KEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYR-LE 131
S L +I L+ K+ E Y+++ E L NDL R++ L E ++
Sbjct: 293 -----SQALKDEIDVLRESSDKAKKLEAQVETYKKKLEDL-NDLRRQVKLLEERNAMYMQ 346
Query: 132 QTIEQEQE-RVVNKLMRKIEKLEAETDAKQNNL--EQLRREKVELE-NTLEQEQEALVNK 187
T++ E+E + N ++E + + L E + +K+E E LE++ EAL
Sbjct: 347 NTVQLEEELKKANAARGQLETYKRQVQELHAKLSEESKKADKLEFEYKRLEEKLEALQ-- 404
Query: 188 LWKRMDKLEAEKRIL-----QIKLDQPVSDPSSPHDMNNGDTASNLSTHIHS--LRSEVM 240
K ++L AE+ L +++ Q D + D + NL+ + R +++
Sbjct: 405 --KEKERLLAERDSLRETNEELRCGQAQQDQLTQADAGTSPSGDNLAAELLPSEYREKLI 462
Query: 241 RLR--NQLANTQQEHTEKMERYVK-----DEKNVREENLRLQRKLQRE 281
RL+ N++ QE +E ER + ++ N R L Q +L +
Sbjct: 463 RLQHENKMLRLGQEGSEN-ERITELQQLLEDANRRNNELETQLRLANQ 509
Score = 33.6 bits (77), Expect = 0.19
Identities = 53/250 (21%), Positives = 107/250 (42%), Gaps = 46/250 (18%)
Query: 53 LQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFL 112
Q+ LQEE +L+Q + +Q + Q E + + Q K+
Sbjct: 200 KQVLLLQEEKNSLQQENEKLQERLAQLE----GSSLGPNQLGSKK--------------- 240
Query: 113 TNDLSRKLNQLTEEKYRLEQTIE------QEQERVVNKLMRKIEKLEAETDAKQNNLEQL 166
N L +L QL EE +RLE + +E E+ + +L + ++L + Q L
Sbjct: 241 YNLLQSQLEQLQEENFRLEAARDDYRIKCEELEKELAELQHRNDELTSLAAESQ----AL 296
Query: 167 RREKVELENTLEQEQ--EALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGD- 223
+ E L + ++ + EA V K+++ L +R Q+KL + + ++ + N
Sbjct: 297 KDEIDVLRESSDKAKKLEAQVETYKKKLEDLNDLRR--QVKLLE---ERNAMYMQNTVQL 351
Query: 224 -----TASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKL 278
A+ + + + +V L +L E ++K ++ + K + E+ LQ++
Sbjct: 352 EEELKKANAARGQLETYKRQVQELHAKL----SEESKKADKLEFEYKRLEEKLEALQKEK 407
Query: 279 QREVERREAL 288
+R + R++L
Sbjct: 408 ERLLAERDSL 417
Score = 32.1 bits (73), Expect = 0.56
Identities = 38/168 (22%), Positives = 75/168 (44%), Gaps = 15/168 (8%)
Query: 5 ASESDSNSMDGVALMMPPSPVTREQLQKRIESLQQQNRVLKVELE-TYKLQLRALQE--E 61
A S S D +A + PS + ++++ LQ +N++L++ E + ++ LQ+ E
Sbjct: 437 ADAGTSPSGDNLAAELLPS-----EYREKLIRLQHENKMLRLGQEGSENERITELQQLLE 491
Query: 62 NKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLN 121
+ N R + Q + + L ++++ L+K + ++ L + L L
Sbjct: 492 DANRRNNELETQLRLANQR---ILELQQQVEDLQKALQE-QGSKSEDSSLLKSKLEEHLE 547
Query: 122 QLTE---EKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQL 166
QL E E + + IE+ + L RKI +LEA K ++ +
Sbjct: 548 QLHEANEELQKKREQIEELEPDQDQNLSRKIAELEAALQKKDEDMRAM 595
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 38.1 bits (89), Expect = 0.005
Identities = 33/122 (27%), Positives = 55/122 (45%), Gaps = 9/122 (7%)
Query: 86 TLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKL 145
+K+ L ++ L + + + + L +L QL + + LE E +R KL
Sbjct: 154 EGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKEKL 213
Query: 146 MRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKL-EAEKRILQI 204
+KL E K LE+L E ELE+ +E L NK + ++ EAEK++ Q
Sbjct: 214 ----KKLLQEIMIKVKKLEELEEELQELESKIED----LTNKKSELNTEIAEAEKKLEQC 265
Query: 205 KL 206
+
Sbjct: 266 RG 267
Score = 28.8 bits (65), Expect = 4.4
Identities = 33/150 (22%), Positives = 59/150 (39%), Gaps = 21/150 (14%)
Query: 115 DLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELE 174
L L L E+ L + +E +N + K+ + DA + L QL++ + ELE
Sbjct: 148 GLDENLEGLKEDYKLLMKELEL-----LNSIKPKLRDRK---DALEEELRQLKQLEDELE 199
Query: 175 NTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHS 234
+ E + KL K + ++ + + L+ ++ S D+ N S L+T I
Sbjct: 200 DCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTN--KKSELNTEIAE 257
Query: 235 L-----------RSEVMRLRNQLANTQQEH 253
E+ +L+ QL Q
Sbjct: 258 AEKKLEQCRGFTFKEIEKLKEQLKLLQSLT 287
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 38.5 bits (90), Expect = 0.006
Identities = 15/53 (28%), Positives = 27/53 (50%)
Query: 147 RKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEK 199
+IE+L+ E + +EQL E + E LE+ + ++K WK++ K
Sbjct: 95 AQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSN 147
Score = 38.1 bits (89), Expect = 0.008
Identities = 40/211 (18%), Positives = 75/211 (35%), Gaps = 39/211 (18%)
Query: 27 REQLQKRI-ESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEF--- 82
+ +L+ E Q+ L+ ++ Y+ + EE +++ T EE
Sbjct: 269 KAELEAHFDEEYQELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKAL 328
Query: 83 ---ISNTLMKKIQALKKEKETLAHHYE----QEEEFLTNDLSRKLNQLTEEKYRLEQTIE 135
+ L K +Q L+++ + + E + ND+ +N+L E ++
Sbjct: 329 LEALEEILEKNLQKLEEKLKDPSTSIELESITDLIESINDIIDAINELIREHNEKIDNLK 388
Query: 136 QEQERVVNKLMR----------------------KIEKLEAETDAKQNNLEQLRREKVEL 173
+E+ + KL I LE E + ++ L +E EL
Sbjct: 389 KEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALEKEIKEL 448
Query: 174 ENTLEQEQEA--LVNKLWKR----MDKLEAE 198
E L + +NKL K +LE E
Sbjct: 449 EKQLTNIEPTADEINKLLKAYGFGNFELEKE 479
Score = 33.4 bits (77), Expect = 0.22
Identities = 38/256 (14%), Positives = 88/256 (34%), Gaps = 42/256 (16%)
Query: 31 QKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVN-----IQAKAEQEEEFISN 85
+ +IE L+++ + L+ ++E + ++ +EE + + ++ + K +
Sbjct: 94 EAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEALK 153
Query: 86 TLMKKIQALKKEKETLAHHYEQEEEFLTN-DLSRKLNQL-TEEKYRLE---------QTI 134
L K +K + L L+ +L K+ L + K L I
Sbjct: 154 GLNYKKNFKEKLLKELKSVILNASSLLSLEELKAKIKTLFSSNKPELALLTLSVIDFDEI 213
Query: 135 EQE---QERVVNKLMRKIEKLEAETDAK-------------------QNNLEQLRREKVE 172
EQ ++ ++ I +L Q + + R+ E
Sbjct: 214 EQAEILEKSIIGSSDVPISELINNLGNSDWVKEGLEYHEEGDTCPFCQQTITEERKA--E 271
Query: 173 LENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHI 232
LE ++E + L+ +L + +DK E+ +L+ + + L +
Sbjct: 272 LEAHFDEEYQELIEQLEELIDKYESHIEKALEELESILDTEKENSEFK--LDVEELKALL 329
Query: 233 HSLRSEVMRLRNQLAN 248
+L + + +L
Sbjct: 330 EALEEILEKNLQKLEE 345
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil motif
at the N terminus region. Yeast cells with mutant Atg14
are defective not only in autophagy but also in sorting
of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the
autophagic process and in the vacuolar protein sorting
pathway. Apg14p may be a component specifically required
for the function of Apg6/Vps30p through the autophagic
pathway. There are 17 auto-phagosomal component proteins
which are categorized into six functional units, one of
which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
AS-PI3K complex and the Atg2-Atg18 complex are essential
for nucleation, and the specific function of the AS-PI3K
apparently is to produce phosphatidylinositol
3-phosphate (PtdIns(3)P) at the pre-autophagosomal
structure (PAS). The localisation of this complex at the
PAS is controlled by Atg14. Autophagy mediates the
cellular response to nutrient deprivation, protein
aggregation, and pathogen invasion in humans, and
malfunction of autophagy has been implicated in multiple
human diseases including cancer. This effect seems to be
mediated through direct interaction of the human Atg14
with Beclin 1 in the human phosphatidylinositol 3-kinase
class III complex.
Length = 307
Score = 37.7 bits (88), Expect = 0.006
Identities = 23/120 (19%), Positives = 51/120 (42%), Gaps = 7/120 (5%)
Query: 89 KKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRK 148
+ L E E L E+ L N+ + L + ++ +N++ +
Sbjct: 27 LDLARLLLENEELKQKVEEA-------LEGATNEDGKLAADLLKLEVARKKERLNQIRAR 79
Query: 149 IEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQ 208
I +L+ E + K+ +E+L+R + + L L + +++KL+ E + + KL+
Sbjct: 80 ISQLKEEIEQKRERIEELKRALAQRRSDLSSASYQLEKRRASQLEKLQDEIKRTRSKLNA 139
Score = 30.8 bits (70), Expect = 1.2
Identities = 13/86 (15%), Positives = 34/86 (39%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
+ ++R+ ++ + LK E+E + ++ L+ R + + E+
Sbjct: 66 VARKKERLNQIRARISQLKEEIEQKRERIEELKRALAQRRSDLSSASYQLEKRRASQLEK 125
Query: 87 LMKKIQALKKEKETLAHHYEQEEEFL 112
L +I+ + + L ++ FL
Sbjct: 126 LQDEIKRTRSKLNALHSLLAEKRSFL 151
Score = 30.0 bits (68), Expect = 1.9
Identities = 33/160 (20%), Positives = 62/160 (38%), Gaps = 28/160 (17%)
Query: 48 LETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQ 107
L +L L L EN+ L+Q E E + LK E
Sbjct: 22 LYELRLDLARLLLENEELKQKV------EEALEGATNEDGKLAADLLKLEVA-------- 67
Query: 108 EEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLR 167
+LNQ+ +L++ IEQ++ER + +L R + + ++ + LE+ R
Sbjct: 68 -------RKKERLNQIRARISQLKEEIEQKRER-IEELKRALAQRRSDLSSASYQLEKRR 119
Query: 168 -REKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKL 206
+ +L++ +++ + + L AEKR +
Sbjct: 120 ASQLEKLQDEIKRTRS-----KLNALHSLLAEKRSFLCRE 154
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 37.8 bits (88), Expect = 0.008
Identities = 40/211 (18%), Positives = 79/211 (37%), Gaps = 30/211 (14%)
Query: 28 EQLQKRIESLQQQNR----VLKVELETYK-------LQLRALQEENKNLRQTSVNIQAKA 76
L R+ +L+ Q R L +L + L E+ + + ++ A
Sbjct: 97 ADLNARLNALEVQEREQRRRLAEQLAALQRSGRNPPPALLVSPEDAQRSVRLAIYYGA-- 154
Query: 77 EQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQ 136
++ ++I ALK + LA E +L+ L++ ++ +L Q +E+
Sbjct: 155 ------LNPARAERIDALKATLKQLAAV-RAEIAAEQAELTTLLSEQRAQQAKLAQLLEE 207
Query: 137 EQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLE--QEQEALVNKLWKRMDK 194
++ + +L +E A Q LE+LR + L+N + + A + +
Sbjct: 208 RKKTL--------AQLNSELSADQKKLEELRANESRLKNEIASAEAAAAKAREAAAAAEA 259
Query: 195 LEAEKRILQIKLDQPVSDPSSPHDMNNGDTA 225
A R + K P++P M T
Sbjct: 260 AAARARAAEAKRTGETYKPTAPEKMLISSTG 290
Score = 36.6 bits (85), Expect = 0.021
Identities = 31/210 (14%), Positives = 72/210 (34%), Gaps = 25/210 (11%)
Query: 116 LSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELEN 175
L +K+ + +++ +LE+ + + E + L ++ + + + + L LE
Sbjct: 50 LEKKIREQQDQRAKLEKQL-KSLETEIASLEAQLIETADDLKKLRKQIADLNARLNALEV 108
Query: 176 TLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSL 235
+++ L +L + + + SP D + ++
Sbjct: 109 QEREQRRRLAEQL--AALQRSGRNPPPALLV--------SPEDAQRSVRLAIYYGALNPA 158
Query: 236 RSEVMR-----------LRNQLANTQQEHTEKMERYVKDEKN---VREENLRLQRKLQRE 281
R+E + +R ++A Q E T + + + EE + +L E
Sbjct: 159 RAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSE 218
Query: 282 VERREALCRHLSESESSLEMEEERHYNEIA 311
+ + L +ES L+ E A
Sbjct: 219 LSADQKKLEELRANESRLKNEIASAEAAAA 248
Score = 35.9 bits (83), Expect = 0.036
Identities = 34/170 (20%), Positives = 54/170 (31%), Gaps = 32/170 (18%)
Query: 23 SPVTREQLQKRIESLQQQNRVLKVEL----ETYKLQLRALQEENKNLRQTSVNIQAKAEQ 78
+ ++L+K+I L + L+V+ QL ALQ +N
Sbjct: 85 TADDLKKLRKQIADLNARLNALEVQEREQRRRLAEQLAALQRSGRN-----PPPALLVSP 139
Query: 79 EEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQ 138
E+ S L AL + + + L + I EQ
Sbjct: 140 EDAQRSVRLAIYYGALNPARAERIDALKATLKQLAA---------------VRAEIAAEQ 184
Query: 139 ERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKL 188
++ L +E A+Q L QL E+ + L E A KL
Sbjct: 185 A--------ELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKL 226
Score = 28.5 bits (64), Expect = 6.5
Identities = 40/235 (17%), Positives = 88/235 (37%), Gaps = 14/235 (5%)
Query: 57 ALQEENKNLRQTSVNIQAK----AEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFL 112
+ ++K L+Q I A EQ+++ L K++++L+ E +L + + L
Sbjct: 32 SAAADDKQLKQIQKEIAALEKKIREQQDQR--AKLEKQLKSLETEIASLEAQLIETADDL 89
Query: 113 TNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVE 172
L +++ L LE E+EQ R + + + +++ + E +
Sbjct: 90 KK-LRKQIADLNARLNALEVQ-EREQRRRLAEQLAALQRSGRNPPPA----LLVSPEDAQ 143
Query: 173 LENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHI 232
L AL +R+D L+A + L + ++ + + + + +
Sbjct: 144 RSVRLAIYYGALNPARAERIDALKATLKQLA-AVRAEIAAEQAELTTLLSEQRAQQAK-L 201
Query: 233 HSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKLQREVERREA 287
L E + QL + +K+E +E ++ E + + E A
Sbjct: 202 AQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAAAAKAREAAAA 256
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 37.2 bits (86), Expect = 0.011
Identities = 33/141 (23%), Positives = 55/141 (39%), Gaps = 20/141 (14%)
Query: 24 PVTREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFI 83
P+ E +E L + R + E +TYK L L+ +N + E
Sbjct: 21 PLCEECADILLERLDSELRDAEKERDTYKQYLSKLESQNVEISNYEALDSELDE------ 74
Query: 84 SNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVN 143
L K+ + L E E L E +DL +L +L EEK +LE +E + +
Sbjct: 75 ---LKKEEERLLDELEEL--------EKEDDDLDGELVELQEEKEQLEN---EELQYLRE 120
Query: 144 KLMRKIEKLEAETDAKQNNLE 164
+ L+ E + + L+
Sbjct: 121 YNLFDRNNLQLEDNLQSLELQ 141
Score = 36.0 bits (83), Expect = 0.030
Identities = 27/128 (21%), Positives = 53/128 (41%), Gaps = 17/128 (13%)
Query: 81 EFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQE- 139
+ + L +++ +KE++T + KL E E + E
Sbjct: 28 DILLERLDSELRDAEKERDTYKQYLS------------KLESQNVEISNYEALDSELDEL 75
Query: 140 -RVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAE 198
+ +L+ ++E+LE E D L +L+ EK +LEN +E + L + L+ E
Sbjct: 76 KKEEERLLDELEELEKEDDDLDGELVELQEEKEQLEN---EELQYLREYNLFDRNNLQLE 132
Query: 199 KRILQIKL 206
+ ++L
Sbjct: 133 DNLQSLEL 140
>gnl|CDD|237560 PRK13922, PRK13922, rod shape-determining protein MreC;
Provisional.
Length = 276
Score = 36.9 bits (86), Expect = 0.011
Identities = 15/60 (25%), Positives = 29/60 (48%), Gaps = 7/60 (11%)
Query: 25 VTREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFIS 84
+ L++ E L+++ ELE+ +L L+ EN LR+ + K + +FI+
Sbjct: 69 ASLFDLREENEELKKELL----ELESRLQELEQLEAENARLREL---LNLKESLDYQFIT 121
Score = 31.1 bits (71), Expect = 0.76
Identities = 14/72 (19%), Positives = 28/72 (38%), Gaps = 7/72 (9%)
Query: 220 NNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRK-- 277
+ S + + + S V R+ N E + ++REEN L+++
Sbjct: 31 RRLGSLSPVRQVVGDVVSPVQRVVNAPREFVSGVFE----SLASLFDLREENEELKKELL 86
Query: 278 -LQREVERREAL 288
L+ ++ E L
Sbjct: 87 ELESRLQELEQL 98
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 37.6 bits (87), Expect = 0.013
Identities = 45/278 (16%), Positives = 95/278 (34%), Gaps = 25/278 (8%)
Query: 45 KVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHH 104
+E +L EE K QAK E K + +E+ L
Sbjct: 179 LIEETENLAELIIDLEELKLQELKLK-EQAKKALEYY------QLKEKLELEEENLLYLD 231
Query: 105 YEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQE---------RVVNKLMRKIEKLEAE 155
Y + E + L L EE +Q +E+E+E + K + E+
Sbjct: 232 YLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKL 291
Query: 156 TDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSS 215
++ L+ + + E++ + +L K +L+ EK +I+ +
Sbjct: 292 LAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKE--EIEELEKELKELE 349
Query: 216 PHDMNNGDTASNLS-------THIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVR 268
+ L L ++ +L++ + E++E ++EK +
Sbjct: 350 IKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAK 409
Query: 269 EENLRLQRKLQREVERREALCRHLSESESSLEMEEERH 306
+++ E ++ + + E E SLE ++ +
Sbjct: 410 LLLELSEQEEDLLKEEKKEELKIVEELEESLETKQGKL 447
Score = 36.1 bits (83), Expect = 0.033
Identities = 62/321 (19%), Positives = 124/321 (38%), Gaps = 24/321 (7%)
Query: 7 ESDSNSMDGVALMMPPSPVTREQLQK----RIESLQQQNRVLKVELETYKLQL------- 55
+ + + R++ K RI+ ++ ++ K EL K+Q
Sbjct: 678 AEQELQEKAESELAKNEILRRQEEIKKKEQRIKEELKKLKLEKEELLADKVQEAQDKINE 737
Query: 56 --RALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLT 113
+ L+++ K + + K E+EEE S +K+ + ++E++T E+E+E
Sbjct: 738 ELKLLEQKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKL 797
Query: 114 NDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKL--------EAETDAKQNNLEQ 165
+L L EE + +E+EQ + + K E+L E + K E
Sbjct: 798 KAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEEL 857
Query: 166 LRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTA 225
R E+ + L QE +L ++ K E E + + K ++ + S D +
Sbjct: 858 ERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKDNLLEEKE 917
Query: 226 SNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKLQREVERR 285
+ + I +++ ++ + E+ + K+E N EE R +R L + E
Sbjct: 918 NEIEERIAEEAIILLKYESEPE---ELLLEEADEKEKEEDNKEEEEERNKRLLLAKEELG 974
Query: 286 EALCRHLSESESSLEMEEERH 306
++E E E +
Sbjct: 975 NVNLMAIAEFEEKEERYNKDE 995
Score = 31.9 bits (72), Expect = 0.76
Identities = 31/182 (17%), Positives = 76/182 (41%), Gaps = 11/182 (6%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
+ ++ ++ L ++ LK EL + + +E+ K + ++ + ++E+E I
Sbjct: 283 KLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEE-- 340
Query: 88 MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTE--EKYRLEQTIEQEQERVVNKL 145
++ KE E E+EEE L + E K +LE ++ +
Sbjct: 341 ---LEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEE 397
Query: 146 MRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIK 205
+ + E E L +L ++ +L ++E+ +V +L + ++ + + + +
Sbjct: 398 LELKNEEEKE----AKLLLELSEQEEDLLKEEKKEELKIVEELEESLETKQGKLTEEKEE 453
Query: 206 LD 207
L+
Sbjct: 454 LE 455
>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
Length = 535
Score = 37.2 bits (86), Expect = 0.014
Identities = 35/130 (26%), Positives = 63/130 (48%), Gaps = 11/130 (8%)
Query: 64 NLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKE------KETLAHHYEQEEEFLT--ND 115
N Q +VN++ KAE++ E I T ++ +ALKKE +E + E E+EF + +
Sbjct: 39 NAEQEAVNLRGKAERDAEHIKKTAKRESKALKKELLLEAKEEARKYREEIEQEFKSERQE 98
Query: 116 LSRKLNQLTEEKYRLEQTIEQ--EQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKV-E 172
L + ++LTE L++ E +E+ + + + D ++ +E+L +K E
Sbjct: 99 LKQIESRLTERATSLDRKDENLSSKEKTLESKEQSLTDKSKHIDEREEQVEKLEEQKKAE 158
Query: 173 LENTLEQEQE 182
LE Q
Sbjct: 159 LERVAALSQA 168
>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC. All proteins in this family for
which functions are known are part of an exonuclease
complex with sbcD homologs. This complex is involved in
the initiation of recombination to regulate the levels
of palindromic sequences in DNA. This family is based on
the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1042
Score = 37.3 bits (86), Expect = 0.014
Identities = 48/277 (17%), Positives = 90/277 (32%), Gaps = 23/277 (8%)
Query: 30 LQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQE-EEFISNTLM 88
+K+ SL++Q + ++ +E+ NL+ +V +Q E+ E
Sbjct: 554 ERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLSEAEDMLACE 613
Query: 89 KKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKL-MR 147
+ K + E + + +L+ KL L + L Q +E + L
Sbjct: 614 QHALLRKLQPEQDLQDVRLHLQQCSQELALKLTALHALQLTLTQERVREHALSIRVLPKE 673
Query: 148 KIEKLEAETDAKQNNLEQLRREKVELE--NTLEQEQEALVNKLWKRMDKLEAEKRILQIK 205
+ + Q+ EQL K L TL +E E + + + +++E L
Sbjct: 674 LLASRQLALQKMQSEKEQLTYWKEMLAQCQTLLRELETHIEEYDREFNEIENASSSLGSD 733
Query: 206 LDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEK 265
L SL+ + + R L + H E +
Sbjct: 734 LAA------------------REDALNQSLKELMHQARTVLKARTEAHFNNNEEVTAALQ 775
Query: 266 NVREENLRLQRKLQREVERREALCRHLSESESSLEME 302
E + L ++Q RE L E+ + E
Sbjct: 776 TGAELS-HLAAEIQFFNRLREEDTHLLKTLEAEIGQE 811
Score = 29.2 bits (65), Expect = 5.4
Identities = 33/186 (17%), Positives = 69/186 (37%), Gaps = 11/186 (5%)
Query: 26 TREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAE--QEEEFI 83
E+ K + QQ + + Y Q R QEE +Q ++A+ E + +E +
Sbjct: 223 VLEKELKHLREALQQTQ----QSHAYLTQKREAQEEQLKKQQLLKQLRARIEELRAQEAV 278
Query: 84 SNTLMKKIQALKKEKETLAH-----HYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQ 138
++I +K AH EQ+ + + +L K+ + + ++Q+
Sbjct: 279 LEETQERINRARKAAPLAAHIKAVTQIEQQAQRIHTELQSKMRSRAKLLMKRAAHVKQQS 338
Query: 139 ERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAE 198
+ + + + + + RE ++TL Q L + KL++
Sbjct: 339 SIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHTLTQHIHTLQQQKTTLTQKLQSL 398
Query: 199 KRILQI 204
+ L I
Sbjct: 399 CKELDI 404
>gnl|CDD|215593 PLN03128, PLN03128, DNA topoisomerase 2; Provisional.
Length = 1135
Score = 37.4 bits (87), Expect = 0.015
Identities = 39/213 (18%), Positives = 75/213 (35%), Gaps = 31/213 (14%)
Query: 61 ENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKK---------EKETLAHHYEQEEEF 111
EN + E+ ++ L KK + +K+ Y+ E+
Sbjct: 921 ENHTDTTVHFTVTLSEEKMARALAEGLEKKFKLSTTISTSNMHLFDKDGKIKKYDSPEDI 980
Query: 112 LTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKV 171
L +L+ + K L + +E E +++ NK R I + + R++
Sbjct: 981 LEEFFHLRLDLYVKRKEVLLENLEYELKKLDNK-ARFILAVVK------GEIVVNNRKRA 1033
Query: 172 ELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASN---- 227
EL LE K + + K + + S + ++ + A +
Sbjct: 1034 ELLAELE-------EKGFDKFPKTAKISETNVVGDRDG--EASEEEEASDNELAKSYDYL 1084
Query: 228 LSTHIHSLRSE-VMRLRNQLANTQQEHTEKMER 259
L I SL E V LR + A + E E++++
Sbjct: 1085 LGMPISSLTLEKVDELRAERAKKETE-VEELKK 1116
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 36.5 bits (84), Expect = 0.019
Identities = 49/244 (20%), Positives = 99/244 (40%), Gaps = 21/244 (8%)
Query: 33 RIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQ 92
+++ ++ Q R L+ EL T + + RA + E + R KA QE E + L Q
Sbjct: 75 QLDDIRPQLRALRTELGTAQGEKRAAETEREAARSEL----QKARQEREAVRQELAAARQ 130
Query: 93 ALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKL 152
L K ++ LA +Q + DL +L L E++ +LE Q + + ++++
Sbjct: 131 NLAKAQQELARLTKQAQ-----DLQTRLKTLAEQRRQLE-----AQAQSLQASQKQLQAS 180
Query: 153 EAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSD 212
+ ++ +L +LR ++E E + + + + A + + Q +
Sbjct: 181 ATQLKSQVLDL-KLRSAQIEQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQ 239
Query: 213 PSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENL 272
S + + I ++ RL A +QE ++E Y + +R++
Sbjct: 240 IS-----QKAQQIAARAEQIRERERQLQRLETAQARLEQE-VAQLEAYYQAYVRLRQQAA 293
Query: 273 RLQR 276
QR
Sbjct: 294 ATQR 297
Score = 35.0 bits (80), Expect = 0.068
Identities = 36/157 (22%), Positives = 65/157 (41%), Gaps = 17/157 (10%)
Query: 31 QKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKK 90
Q+ + L +Q + L+ L+T Q R L+ + ++L+ + +QA A Q L +
Sbjct: 136 QQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQ--------LKSQ 187
Query: 91 IQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIE 150
+ LK + EQE + L + TEE R +Q + + + +I
Sbjct: 188 VLDLKLRSAQI----EQEAQNLATR-ANAAQARTEELARRAAAAQQTAQA-IQQRDAQIS 241
Query: 151 KLEAETDAKQNNLEQLRREKVELENT---LEQEQEAL 184
+ + A+ + + R+ LE LEQE L
Sbjct: 242 QKAQQIAARAEQIRERERQLQRLETAQARLEQEVAQL 278
Score = 32.3 bits (73), Expect = 0.42
Identities = 27/142 (19%), Positives = 54/142 (38%), Gaps = 21/142 (14%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
R QL+ + +SLQ + L+ K Q+ L+ + + Q + N+ +A
Sbjct: 160 RRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLATRANA-------- 211
Query: 87 LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
QA +E A +Q ++ + Q + + Q I E + +
Sbjct: 212 ----AQARTEELARRAAAAQQ--------TAQAIQQRDAQISQKAQQIAARAE-QIRERE 258
Query: 147 RKIEKLEAETDAKQNNLEQLRR 168
R++++LE + + QL
Sbjct: 259 RQLQRLETAQARLEQEVAQLEA 280
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 36.4 bits (84), Expect = 0.022
Identities = 47/209 (22%), Positives = 89/209 (42%), Gaps = 15/209 (7%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
QK +S+Q + LK E ++ + L EE K ++ + A + + +
Sbjct: 469 SAKQKDKQSMQSMEKRLKSEADSRVNAEKQLAEEKKRKKEE----EETAARAAAQAAASR 524
Query: 88 MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTE-EKYRLEQTIEQEQERVVNKLM 146
+ ++LK+ K+ L E E + L +DL K + EK E QE E+ LM
Sbjct: 525 EECAESLKQAKQDL----EMEIKKLEHDLKLKEEECRMLEKEAQELRKYQESEKETEVLM 580
Query: 147 RKIEKLEAETDAKQNNLEQLRREKVELENTL---EQEQEALVNKLWKR-MDKLEAEKRIL 202
++ ++ + +N+L R K++L + L +++ E ++KR + ++ + +I
Sbjct: 581 SALQAMQDKNLMLENSLSAETRLKLDLFSALGDVKRQLEIAHGAIYKREAEIVDLKSKIA 640
Query: 203 QIKLDQPVSD--PSSPHDMNNGDTASNLS 229
I P S +PH N L
Sbjct: 641 DILAVMPESRIRSVTPHYSANFLEKPPLV 669
Score = 31.0 bits (70), Expect = 1.2
Identities = 42/212 (19%), Positives = 91/212 (42%), Gaps = 40/212 (18%)
Query: 129 RLEQTIE------QEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTL----- 177
RLEQ I+ Q+ + ++L +I L + + +++L QL++E L+ L
Sbjct: 410 RLEQDIKKLQAELQQARQNESELRNQISLLTSLERSLKSDLGQLKKENDMLQTKLNSMVS 469
Query: 178 -EQEQEALVNKLWKRMDKLEAEKRILQIKL-----------DQPVSDPSSPHDMNNGDTA 225
+Q+ + + + KR+ K EA+ R+ K ++ + ++ + + A
Sbjct: 470 AKQKDKQSMQSMEKRL-KSEADSRVNAEKQLAEEKKRKKEEEETAARAAAQAAASREECA 528
Query: 226 SNLSTHIHSLRSEVMRLRNQLANTQQE------HTEKMERYVKDEKN----------VRE 269
+L L E+ +L + L ++E +++ +Y + EK +++
Sbjct: 529 ESLKQAKQDLEMEIKKLEHDLKLKEEECRMLEKEAQELRKYQESEKETEVLMSALQAMQD 588
Query: 270 ENLRLQRKLQREVERREALCRHLSESESSLEM 301
+NL L+ L E + L L + + LE+
Sbjct: 589 KNLMLENSLSAETRLKLDLFSALGDVKRQLEI 620
Score = 28.7 bits (64), Expect = 6.4
Identities = 18/84 (21%), Positives = 38/84 (45%), Gaps = 3/84 (3%)
Query: 224 TASNLSTHIHSLRSEVMRLRNQLANTQQEHTE---KMERYVKDEKNVREENLRLQRKLQR 280
S L I L S L++ L ++E+ K+ V ++ ++ ++++L+
Sbjct: 428 NESELRNQISLLTSLERSLKSDLGQLKKENDMLQTKLNSMVSAKQKDKQSMQSMEKRLKS 487
Query: 281 EVERREALCRHLSESESSLEMEEE 304
E + R + L+E + + EEE
Sbjct: 488 EADSRVNAEKQLAEEKKRKKEEEE 511
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 36.9 bits (86), Expect = 0.022
Identities = 43/196 (21%), Positives = 75/196 (38%), Gaps = 31/196 (15%)
Query: 24 PVTREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFI 83
V+R + L ++ R + E QL+ L+ L Q Q EF
Sbjct: 488 EVSRSEAWDVARELLRRLREQRHLAE----QLQQLRMRLSELEQRLRQQQRAERLLAEF- 542
Query: 84 SNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVN 143
+ + L E E E E LS +++ E + L Q +EQ Q R+
Sbjct: 543 ---CKRLGKNLDDEDELEQLQEELEARL--ESLSESVSEARERRMALRQQLEQLQARI-- 595
Query: 144 KLMRKIEKLEAETDAKQNNLEQLR---------REKVE--LENTLEQEQEALVNKLWKRM 192
+++ A Q+ L +LR + V ++ LE+E+E V +
Sbjct: 596 ---QRLAARAPAWLAAQDALARLREQSGEEFEDSQDVTEYMQQLLERERELTVER----- 647
Query: 193 DKLEAEKRILQIKLDQ 208
D+L A K+ L ++++
Sbjct: 648 DELAARKQALDEEIER 663
Score = 31.5 bits (72), Expect = 1.1
Identities = 30/152 (19%), Positives = 59/152 (38%), Gaps = 26/152 (17%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
E+L++R+E +Q+ + +L + QL + N+ L + AK +
Sbjct: 988 EKLRQRLEQAEQERTRAREQLRQAQAQLA---QYNQVLASLKSSYDAKRQM--------- 1035
Query: 88 MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMR 147
+Q LK+E + L + E + + L + + R N+L +
Sbjct: 1036 ---LQELKQELQDLGVPADSGAE----------ERARARRDELHARLSANRSR-RNQLEK 1081
Query: 148 KIEKLEAETDAKQNNLEQLRREKVELENTLEQ 179
++ EAE D L +L R+ E+ +
Sbjct: 1082 QLTFCEAEMDNLTKKLRKLERDYHEMREQVVN 1113
>gnl|CDD|214660 smart00434, TOP4c, DNA Topoisomerase IV. Bacterial DNA
topoisomerase IV, GyrA, ParC.
Length = 444
Score = 36.0 bits (84), Expect = 0.032
Identities = 18/116 (15%), Positives = 49/116 (42%), Gaps = 13/116 (11%)
Query: 98 KETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETD 157
KE L + E T L +L E+ + + + I+++
Sbjct: 325 KEILKEFLDHRLEVYTRRKEYLLGKLEAERL------HILEGLFI--ALSIIDEIIVLIR 376
Query: 158 AKQNNLEQLRREKVELENTLEQEQEALVN-KLWK----RMDKLEAEKRILQIKLDQ 208
+ ++ ++ + + +E E + +A+++ +L + ++KLE E + L+ +++
Sbjct: 377 SSKDLAKEAKEKLMERFELSEIQADAILDMRLRRLTKLEVEKLEKELKELEKEIED 432
>gnl|CDD|215969 pfam00521, DNA_topoisoIV, DNA gyrase/topoisomerase IV, subunit A.
Length = 427
Score = 35.6 bits (83), Expect = 0.037
Identities = 40/219 (18%), Positives = 79/219 (36%), Gaps = 50/219 (22%)
Query: 25 VTREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFIS 84
V E+L + IE L ++ K++ +++E+ R + I+ K E +
Sbjct: 229 VWTEKLIESIEEL----------VKNKKIKGSDIRDESD--RGVRIVIELKRGANPEVVL 276
Query: 85 NTLMKKIQ----------ALKKE-------KETLAHHYEQEEEFLTNDLSRKLNQLTEEK 127
N L KK + L KE L E E L +L E
Sbjct: 277 NKLYKKTKLQSSFSTNNLVLFDGGPKYLNLKEILKEFLEHRLEVYKRRKEYLLEKLEERL 336
Query: 128 YRLEQTIEQ-EQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQ-EALV 185
+ LE ++ + V +++R + + L + L + Q + L+
Sbjct: 337 HILEGLLKALNKIDFVIEVIRG----SIDLKKAKKELIE----------ELSEIQADYLL 382
Query: 186 N-KLWK----RMDKLEAEKRILQIKLDQPVSDPSSPHDM 219
+ +L + ++KLE E L+ ++ + +S +
Sbjct: 383 DMRLRRLTKEEIEKLEKEIEELEKEIAELEKILASEKKL 421
>gnl|CDD|218602 pfam05478, Prominin, Prominin. The prominins are an emerging
family of proteins that among the multispan membrane
proteins display a novel topology. Mouse prominin and
human prominin (mouse)-like 1 (PROML1) are predicted to
contain five membrane spanning domains, with an
N-terminal domain exposed to the extracellular space
followed by four, alternating small cytoplasmic and
large extracellular, loops and a cytoplasmic C-terminal
domain. The exact function of prominin is unknown
although in humans defects in PROM1, the gene coding for
prominin, cause retinal degeneration.
Length = 807
Score = 35.8 bits (83), Expect = 0.039
Identities = 19/92 (20%), Positives = 43/92 (46%), Gaps = 5/92 (5%)
Query: 10 SNSMDGVALMMPPSPVTREQLQKRIESLQ--QQNRVLKVE--LETYKLQLRALQEENKNL 65
+ +D +A +P + + L+ L+ QQN V +E LE ++ L E + +L
Sbjct: 629 AEELDALANNLPNGKL-KVALKNEALDLRSIQQNFVPPMESLLEKLNSNVKTLDESSNDL 687
Query: 66 RQTSVNIQAKAEQEEEFISNTLMKKIQALKKE 97
+ ++ A+ + + F++N + + K+
Sbjct: 688 QNAVKDVLAEVQAAQGFLNNNASEIVNEESKK 719
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 35.6 bits (82), Expect = 0.043
Identities = 31/161 (19%), Positives = 60/161 (37%), Gaps = 2/161 (1%)
Query: 26 TREQLQKRIESLQQQNRV-LKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFIS 84
T E+ +K + Q R K EL + QL ++E KNL + A Q+E
Sbjct: 43 TLEEFEKEANEKRAQYRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQFELALQDEIAKL 102
Query: 85 NTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLE-QTIEQEQERVVN 143
L KE E L ++ + L L + +++ + + + +
Sbjct: 103 EALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEERLKFENEKKL 162
Query: 144 KLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEAL 184
+ ++E+ + E + NL+ +E E + + L
Sbjct: 163 EESLELEREKFEEQLHEANLDLEFKENEEQRESKWAILKKL 203
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 35.4 bits (82), Expect = 0.045
Identities = 36/210 (17%), Positives = 84/210 (40%), Gaps = 49/210 (23%)
Query: 30 LQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMK 89
+ +I L QQ + L ++++ + Q+ +N Q N + A ++ ++ + L++
Sbjct: 172 NKDKIRELNQQIQTLDMKIDHIQQQI-KTYNKNIE-EQRKKNGENIARKQNKY--DELVE 227
Query: 90 KIQALKKEKETLAHH---YEQEEEFLTNDLSR----------KLNQLTEEKYRLE----- 131
+ + +K E E L + E + L++ K+ Q + E
Sbjct: 228 EAKTIKAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQKVIKMYEKGGVC 287
Query: 132 ----QTIEQEQERVV-------------NKLMRKIEKLEAETDAKQNNLEQLRREKVELE 174
Q I + +R+ KL I++LE + + ++ +EL+
Sbjct: 288 PTCTQQISEGPDRITKIKDKLKELQHSLEKLDTAIDELE----EIMDEFNEQSKKLLELK 343
Query: 175 NTLEQEQEALVN------KLWKRMDKLEAE 198
N + +++L+ K+ +++L+AE
Sbjct: 344 NKISTNKQSLITLVDKAKKVKAAIEELQAE 373
Score = 33.8 bits (78), Expect = 0.16
Identities = 33/175 (18%), Positives = 70/175 (40%), Gaps = 27/175 (15%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQ---EEEF-- 82
++L + ++++ + L EL + + L + I++K EQ +
Sbjct: 223 DELVEEAKTIKAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQKVIKMYE 282
Query: 83 -----------ISNTLMKKIQALKKEKETLAH-------HYEQEEEFLT--NDLSRKLNQ 122
IS +I +K + + L H ++ EE + N+ S+KL +
Sbjct: 283 KGGVCPTCTQQISEG-PDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLE 341
Query: 123 LTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTL 177
L + +Q++ ++ K+ IE+L+AE L +L+ E ++ T
Sbjct: 342 LKNKISTNKQSLITLVDKA-KKVKAAIEELQAEFVDNAEELAKLQDELDKIVKTK 395
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 35.3 bits (82), Expect = 0.057
Identities = 21/96 (21%), Positives = 46/96 (47%), Gaps = 13/96 (13%)
Query: 87 LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEK-YRLEQTIEQEQERVV--- 142
L ++++ L++E + L E + + L +L ++ +L ++ E+++
Sbjct: 332 LKEELKELEEELKEL--------EEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETK 383
Query: 143 NKLMRKIEKLEAETDAKQNNLEQLRRE-KVELENTL 177
KL ++E+LE E + LE L E K+ + T+
Sbjct: 384 EKLSEELEELEEELKELKEELESLYSEGKISVNKTI 419
Score = 32.6 bits (75), Expect = 0.38
Identities = 19/74 (25%), Positives = 37/74 (50%), Gaps = 4/74 (5%)
Query: 114 NDLSRKLNQLTEEKYRLEQTIEQEQERVVNKL-MRKIEKLEAETDAKQN---NLEQLRRE 169
+L +L +L EE ++++ +++ ++ +L K E+LE + K+ LE+L E
Sbjct: 337 KELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLSEELEELEEE 396
Query: 170 KVELENTLEQEQEA 183
EL+ LE
Sbjct: 397 LKELKEELESLYSE 410
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 34.1 bits (79), Expect = 0.058
Identities = 22/112 (19%), Positives = 51/112 (45%), Gaps = 15/112 (13%)
Query: 55 LRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTN 114
LR + L ++SV + + E + L ++I+ L++EKE
Sbjct: 93 LRMTIAAYQTLYESSVAFGMRKALQAEQGKSELEQEIKKLEEEKE--------------- 137
Query: 115 DLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQL 166
+L +++ +L + +E+ E+E++ + +I L+ + ++ LEQ+
Sbjct: 138 ELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIAFLKKQNQQLKSQLEQI 189
Score = 31.0 bits (71), Expect = 0.66
Identities = 17/82 (20%), Positives = 44/82 (53%), Gaps = 4/82 (4%)
Query: 128 YRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNK 187
+ + + ++ EQ + ++L ++I+KLE E + + + +L + +E E+E++ K
Sbjct: 110 FGMRKALQAEQGK--SELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQI-EEK 166
Query: 188 LWK-RMDKLEAEKRILQIKLDQ 208
+ L+ + + L+ +L+Q
Sbjct: 167 RHADEIAFLKKQNQQLKSQLEQ 188
Score = 29.9 bits (68), Expect = 1.6
Identities = 15/79 (18%), Positives = 37/79 (46%), Gaps = 1/79 (1%)
Query: 30 LQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMK 89
++K +++ Q ++ + + E++ + + L++ L I+ + E+E +
Sbjct: 112 MRKALQAEQGKSELEQ-EIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHAD 170
Query: 90 KIQALKKEKETLAHHYEQE 108
+I LKK+ + L EQ
Sbjct: 171 EIAFLKKQNQQLKSQLEQI 189
>gnl|CDD|238111 cd00187, TOP4c, DNA Topoisomerase, subtype IIA; domain A';
bacterial DNA topoisomerase IV (C subunit, ParC),
bacterial DNA gyrases (A subunit, GyrA),mammalian DNA
toposiomerases II. DNA topoisomerases are essential
enzymes that regulate the conformational changes in DNA
topology by catalysing the concerted breakage and
rejoining of DNA strands during normal cellular growth.
Length = 445
Score = 35.2 bits (82), Expect = 0.060
Identities = 21/115 (18%), Positives = 43/115 (37%), Gaps = 13/115 (11%)
Query: 98 KETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETD 157
KE L + E T +L + + LE + + + + I + + +
Sbjct: 318 KEILQEFLDHRLEVYTRRKEYELGKAEARLHILEGLL-----KAILNIDEVINLIRSSDE 372
Query: 158 AKQNNLEQLRREKVELENTLEQEQEALVNKLWK----RMDKLEAEKRILQIKLDQ 208
AK + L E +L + Q L +L + +KL E + L+ +++
Sbjct: 373 AK----KALIEELEKLGFSEIQADAILDMRLRRLTKLEREKLLKELKELEAEIED 423
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists
of several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 34.9 bits (80), Expect = 0.075
Identities = 46/177 (25%), Positives = 70/177 (39%), Gaps = 9/177 (5%)
Query: 119 KLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLE 178
KL QL E + E ++ Q + K E E E D +L+ + ELE E
Sbjct: 69 KLIQLENELMQKELEHKRAQIELERKASTLAENYERELDRNLELEVRLKALE-ELEKKAE 127
Query: 179 QEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGD------TASNLSTHI 232
E + DKL+AE LQ + + + + D A N T +
Sbjct: 128 NEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESISRIKNDLSEMQCRAQNADTEL 187
Query: 233 HSLRSEVMRLRNQLANTQQEHTE--KMERYVKDEKNVREENLRLQRKLQREVERREA 287
L SE+ LR QL Q+E E K + + E+ +N + L+ E++R E
Sbjct: 188 KLLESELEELREQLEECQKELAEAEKKLQSLTSEQASSADNSVKIKHLEEELKRYEQ 244
>gnl|CDD|225747 COG3206, GumC, Uncharacterized protein involved in
exopolysaccharide biosynthesis [Cell envelope
biogenesis, outer membrane].
Length = 458
Score = 34.7 bits (80), Expect = 0.078
Identities = 35/185 (18%), Positives = 71/185 (38%), Gaps = 16/185 (8%)
Query: 30 LQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQ-------AKAEQEEEF 82
L +R+E L+ + + + ++E ++ Q L A+A +
Sbjct: 200 LDERLEELRARLQEAEAQVEDFRAQHGLTDAARGQLLSEQQLSALNTQLQSARARLAQAE 259
Query: 83 ISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVV 142
+ ++ L +E L E E DL ++ Q+ ++ L + + ++V
Sbjct: 260 ARLASLLQLLPLGREAAALR---EVLESPTIQDLRQQYAQVRQQIADLSTELGAKHPQLV 316
Query: 143 NKLMRKIEKLEA-ETDAKQNNLEQLRREKVEL---ENTLEQEQEALVNKLWKRMDKLEAE 198
L ++ +L + L L E L E LE+E L +L ++ KL+ +
Sbjct: 317 A-LEAQLAELRQQIAAELRQILASLPNELALLEQQEAALEKELAQLKGRL-SKLPKLQVQ 374
Query: 199 KRILQ 203
R L+
Sbjct: 375 LRELE 379
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 33.4 bits (77), Expect = 0.093
Identities = 34/124 (27%), Positives = 56/124 (45%), Gaps = 15/124 (12%)
Query: 83 ISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVV 142
I++ L + + LK+E + L YEQE E S + Q +E ++ + I+ E E +
Sbjct: 42 IADDL-AEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEAEEEL 100
Query: 143 NKLMRKIEKLEAETDA-KQNNLEQLRREKVEL----------ENTLEQEQEALVNKLWKR 191
R E EAE +A K+ LE+LR E EL + E Q+ L++
Sbjct: 101 E---RIKEAAEAEIEAEKERALEELRAEVAELAVAIAEKLLGKKVDEAAQKDLIDAFIAE 157
Query: 192 MDKL 195
+ +
Sbjct: 158 LGEN 161
>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
PFL_4705 family. Members of this protein family are
found occasionally on plasmids such as the Pseudomonas
putida toluene catabolic TOL plasmid pWWO_p085. Usually,
however, they are found on the bacterial main chromosome
in regions flanked by markers of conjugative transfer
and/or transposition [Mobile and extrachromosomal
element functions, Plasmid functions].
Length = 472
Score = 34.6 bits (80), Expect = 0.095
Identities = 20/85 (23%), Positives = 38/85 (44%), Gaps = 8/85 (9%)
Query: 40 QNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKE 99
+ + L+ L + AL+ EN+ L++ +I ++ I + + Q L KE E
Sbjct: 67 EVKELRKRLAKLISENEALKAENERLQKREQSI-------DQQIQQAVQSETQELTKEIE 119
Query: 100 TLAHHYEQEEEFLTNDLSRKLNQLT 124
L +Q + L + L R+L +
Sbjct: 120 QLKSERQQLQG-LIDQLQRRLAGVL 143
Score = 30.8 bits (70), Expect = 1.4
Identities = 18/74 (24%), Positives = 38/74 (51%)
Query: 113 TNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVE 172
+L ++L +L E L+ E+ Q+R + + + +++ET +EQL+ E+ +
Sbjct: 68 VKELRKRLAKLISENEALKAENERLQKREQSIDQQIQQAVQSETQELTKEIEQLKSERQQ 127
Query: 173 LENTLEQEQEALVN 186
L+ ++Q Q L
Sbjct: 128 LQGLIDQLQRRLAG 141
>gnl|CDD|233791 TIGR02231, TIGR02231, conserved hypothetical protein. This family
consists of proteins over 500 amino acids long in
Caenorhabditis elegans and several bacteria (Pseudomonas
aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans,
etc.). The function is unknown.
Length = 525
Score = 34.3 bits (79), Expect = 0.10
Identities = 26/134 (19%), Positives = 47/134 (35%), Gaps = 12/134 (8%)
Query: 64 NLRQTSVNIQA--KAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLN 121
R + K +E E L + ALK + L E E + + R
Sbjct: 65 TSRPDPERLAELRKQIRELEAELRDLEDRGDALKALAKFLEDIREGLTEPIKDSAKRNEP 124
Query: 122 ---------QLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVE 172
+ T ++E ER + +L +++ +L+ E +A Q R V
Sbjct: 125 DLKEWFQAFDFNGSEIERLLTEDREAERRIRELEKQLSELQNELNALLTGKSQ-RSHTVL 183
Query: 173 LENTLEQEQEALVN 186
+ ++ EA +N
Sbjct: 184 VRLEAPEDAEAELN 197
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 34.3 bits (79), Expect = 0.12
Identities = 22/110 (20%), Positives = 46/110 (41%), Gaps = 7/110 (6%)
Query: 87 LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
L K+ + + +E + A + +L + ++ R E+ +E +R +L+
Sbjct: 31 LAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARRER---EELQREEERLV 87
Query: 147 RKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLE 196
+K E+L DA+ L+ L + E E L + L + ++L
Sbjct: 88 QKEEQL----DARAEKLDNLENQLEEREKALSARELELEELEKQLDNELY 133
Score = 33.5 bits (77), Expect = 0.19
Identities = 27/153 (17%), Positives = 56/153 (36%), Gaps = 8/153 (5%)
Query: 53 LQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFL 112
+ L+ Q K + Q +AE++ E + + + ++ A +E +
Sbjct: 23 VLLKKRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQRE 82
Query: 113 TNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVE 172
L +K QL +L+ Q L + + L A + +QL E
Sbjct: 83 EERLVQKEEQLDARAEKLDNLENQ--------LEEREKALSARELELEELEKQLDNELYR 134
Query: 173 LENTLEQEQEALVNKLWKRMDKLEAEKRILQIK 205
+ ++ L+ KL + E +R+ +I+
Sbjct: 135 VAGLTPEQARKLLLKLLDAELEEEKAQRVKKIE 167
Score = 30.1 bits (68), Expect = 2.4
Identities = 21/94 (22%), Positives = 35/94 (37%), Gaps = 4/94 (4%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
EQL R E L L+ + + L+E K L + A E L
Sbjct: 91 EQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRV---AGLTPEQARKLL 147
Query: 88 MKKIQA-LKKEKETLAHHYEQEEEFLTNDLSRKL 120
+K + A L++EK E+E + ++ +
Sbjct: 148 LKLLDAELEEEKAQRVKKIEEEADLEAERKAQNI 181
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 33.0 bits (76), Expect = 0.12
Identities = 25/107 (23%), Positives = 56/107 (52%), Gaps = 8/107 (7%)
Query: 57 ALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDL 116
A + K L + +QA+ +++E+ L K+ Q L+K+ TL+ E+ + +L
Sbjct: 32 AGKAAQKQLEKEFKKLQAELQKKEK----ELQKEEQKLQKQAATLS---EEARKAKQQEL 84
Query: 117 SRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNL 163
+K +L +++ +Q ++Q+Q+ ++ + KI+K + AK+
Sbjct: 85 QQKQQELQQKQQAAQQELQQKQQELLQPIYDKIDKA-IKEVAKEKGY 130
Score = 30.3 bits (69), Expect = 1.0
Identities = 17/79 (21%), Positives = 35/79 (44%), Gaps = 3/79 (3%)
Query: 87 LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
K+++ K+ + E+E + L ++ L+EE + +QE ++ +L
Sbjct: 36 AQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEAR---KAKQQELQQKQQELQ 92
Query: 147 RKIEKLEAETDAKQNNLEQ 165
+K + + E KQ L Q
Sbjct: 93 QKQQAAQQELQQKQQELLQ 111
Score = 29.1 bits (66), Expect = 2.0
Identities = 17/67 (25%), Positives = 34/67 (50%), Gaps = 2/67 (2%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
++LQK + LQ+Q L E K + + LQ++ + L+Q Q + +Q+++ +
Sbjct: 55 EKELQKEEQKLQKQAATLSEE--ARKAKQQELQQKQQELQQKQQAAQQELQQKQQELLQP 112
Query: 87 LMKKIQA 93
+ KI
Sbjct: 113 IYDKIDK 119
Score = 28.8 bits (65), Expect = 3.1
Identities = 22/100 (22%), Positives = 52/100 (52%), Gaps = 13/100 (13%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
++QL+K + LQ + + + EL+ + + LQ++ L + KA+Q+E
Sbjct: 37 QKQLEKEFKKLQAELQKKEKELQK---EEQKLQKQAATLSEE----ARKAKQQE------ 83
Query: 87 LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEE 126
L +K Q L+++++ +Q+++ L + K+++ +E
Sbjct: 84 LQQKQQELQQKQQAAQQELQQKQQELLQPIYDKIDKAIKE 123
Score = 27.6 bits (62), Expect = 6.9
Identities = 17/82 (20%), Positives = 38/82 (46%), Gaps = 1/82 (1%)
Query: 132 QTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELE-NTLEQEQEALVNKLWK 190
+ +++ E+ KL +++K E E ++ L++ E +QE + +L +
Sbjct: 34 KAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQ 93
Query: 191 RMDKLEAEKRILQIKLDQPVSD 212
+ + E + Q +L QP+ D
Sbjct: 94 KQQAAQQELQQKQQELLQPIYD 115
>gnl|CDD|223497 COG0420, SbcD, DNA repair exonuclease [DNA replication,
recombination, and repair].
Length = 390
Score = 34.0 bits (78), Expect = 0.12
Identities = 23/152 (15%), Positives = 48/152 (31%), Gaps = 16/152 (10%)
Query: 60 EENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRK 119
EE + V E + L ++++ E + E+E L D
Sbjct: 236 EEGERKGVVLVEFSGGKLWRFEELFVPLFERLEVDVLNGE------DLEDELLKLDREED 289
Query: 120 LNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQ 179
+ +E + ER+ +++ E++E + L E E
Sbjct: 290 AKDEPVVRLTIEGETDILDERLKERILALYERIERLELLLEERATDLAEEPESSE----- 344
Query: 180 EQEALVNKLWKRMDKLEAEKRILQIKLDQPVS 211
+ +L +D EA + +L ++
Sbjct: 345 -----LEELLSELDVEEAFEELLADEVLLDRE 371
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 34.2 bits (79), Expect = 0.13
Identities = 22/104 (21%), Positives = 42/104 (40%), Gaps = 12/104 (11%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAK-AEQEEEFISNT 86
LQ+ + +L+QQ + E + A Q+E L + ++ K E E +
Sbjct: 145 HALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQL--EQ 202
Query: 87 LMKKIQALKKEKETLAHHYEQEEEF-----LTNDLSRKL--NQL 123
L + +A + +E E ++ L+ + +R L QL
Sbjct: 203 LQE--KAAETSQERKQKRKEITDQAAKRLELSEEETRILIDQQL 244
Score = 33.0 bits (76), Expect = 0.28
Identities = 26/120 (21%), Positives = 46/120 (38%), Gaps = 23/120 (19%)
Query: 83 ISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVV 142
+ + L +++ LK++ E A Q + +E LE E E
Sbjct: 143 LLHALQQEVLTLKQQLELQAREKAQSQA--------LAEAQQQELVALEGLA-AELEEKQ 193
Query: 143 NKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRIL 202
+L ++E+L+ + +Q R+E +Q A KR++ E E RIL
Sbjct: 194 QELEAQLEQLQEKAAETSQERKQKRKEI--------TDQAA------KRLELSEEETRIL 239
Score = 31.8 bits (73), Expect = 0.64
Identities = 18/61 (29%), Positives = 24/61 (39%), Gaps = 2/61 (3%)
Query: 120 LNQLTEEKYRLEQTIEQEQ--ERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTL 177
L QL E + E E+++ KL RK+ K + L QLRR V E
Sbjct: 838 LEQLVNELTDADGESHAEHVREQLIAKLQRKLRKATDNAERSFEILAQLRRAGVTPEEFA 897
Query: 178 E 178
Sbjct: 898 S 898
Score = 30.3 bits (69), Expect = 2.0
Identities = 26/114 (22%), Positives = 57/114 (50%), Gaps = 15/114 (13%)
Query: 46 VELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHY 105
V E + L ALQ+E L+Q + +QA+ + + + ++ +++ AL+ LA
Sbjct: 135 VPPEDPENLLHALQQEVLTLKQQ-LELQAREKAQSQALAEAQQQELVALEG----LAAEL 189
Query: 106 EQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKL---EAET 156
E++++ +L +L QL E+ +T ++ +++ + ++L E ET
Sbjct: 190 EEKQQ----ELEAQLEQLQEKA---AETSQERKQKRKEITDQAAKRLELSEEET 236
Score = 28.4 bits (64), Expect = 9.1
Identities = 16/78 (20%), Positives = 35/78 (44%), Gaps = 10/78 (12%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
+ Q Q E+ QQ+ L+ + + + L+ + + L++ + E++++ T
Sbjct: 165 KAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQE--KAAETSQERKQKRKEIT 222
Query: 87 LMKKIQALKK----EKET 100
QA K+ E+ET
Sbjct: 223 D----QAAKRLELSEEET 236
>gnl|CDD|217135 pfam02601, Exonuc_VII_L, Exonuclease VII, large subunit. This
family consist of exonuclease VII, large subunit
EC:3.1.11.6 This enzyme catalyzes exonucleolytic
cleavage in either 5'->3' or 3'->5' direction to yield
5'-phosphomononucleotides. This exonuclease VII enzyme
is composed of one large subunit and 4 small ones.
Length = 295
Score = 33.8 bits (78), Expect = 0.13
Identities = 22/100 (22%), Positives = 43/100 (43%), Gaps = 2/100 (2%)
Query: 104 HYEQEEEFLTNDLSRKL-NQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNN 162
Q+ E L LSR L N+L +E+ RL E+ + L + E+L D ++
Sbjct: 147 ELLQKLEGLEQRLSRALKNRLEKEQDRLNLLRERLKSLSRRLLEQHEERLAELRDRLISS 206
Query: 163 LEQ-LRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRI 201
++ L R++ LE + + + L ++ ++
Sbjct: 207 IQNLLSRKQSRLERLILNRELEKNSLLENKLATANLTAQL 246
>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase. This model represents the
seryl-tRNA synthetase found in most organisms. This
protein is a class II tRNA synthetase, and is recognized
by the pfam model tRNA-synt_2b. The seryl-tRNA
synthetases of two archaeal species, Methanococcus
jannaschii and Methanobacterium thermoautotrophicum,
differ considerably and are included in a different
model [Protein synthesis, tRNA aminoacylation].
Length = 418
Score = 33.9 bits (78), Expect = 0.13
Identities = 34/141 (24%), Positives = 57/141 (40%), Gaps = 20/141 (14%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
R ESL+ + + ++LE +L AL +E K L +QAK N
Sbjct: 8 RNNPDLVKESLKARGLSVDIDLE----KLIALDDERKKLLSEIEELQAK--------RNE 55
Query: 87 LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
L K+I K +K+ ++E +L +L +L+ LE ++ + + N
Sbjct: 56 LSKQIGKAKGQKKDKIEEIKKE----LKELKEELTELSAALKALEAELQDKLLSIPNIPH 111
Query: 147 RKIEKLEAETDAKQNNLEQLR 167
+ + E D NLE R
Sbjct: 112 ESVPVGKDEED----NLEVKR 128
Score = 28.9 bits (65), Expect = 5.0
Identities = 23/94 (24%), Positives = 46/94 (48%), Gaps = 7/94 (7%)
Query: 126 EKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALV 185
LE+ I + ER KL+ +IE+L+A K+N L + + + +E + +
Sbjct: 25 VDIDLEKLIALDDERK--KLLSEIEELQA----KRNELSKQIGKAKGQKKDKIEEIKKEL 78
Query: 186 NKLWKRMDKLEAEKRILQIKLDQPVSD-PSSPHD 218
+L + + +L A + L+ +L + P+ PH+
Sbjct: 79 KELKEELTELSAALKALEAELQDKLLSIPNIPHE 112
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 34.3 bits (79), Expect = 0.14
Identities = 33/170 (19%), Positives = 60/170 (35%), Gaps = 20/170 (11%)
Query: 30 LQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMK 89
L+K + ++ + E +L L E L++TS + ++ + K
Sbjct: 933 LKKLLNNIDLEEGPSI-EYVKLP-ELNKLHEVESKLKETSEEYEDLLKKSTILVR-EGNK 989
Query: 90 KIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRL-----------EQTIEQEQ 138
LK K+ LA +Q L QL E + ++ E
Sbjct: 990 ANSELKNFKKELAELSKQYG-----ALQESTKQLKELPVEVAELQSASKIISSESTELSI 1044
Query: 139 ERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKL 188
+ + KL + + A+ L +LRRE L++ + E+ N L
Sbjct: 1045 LKPLQKLKGLLLLENNQLQARYKAL-KLRRENSLLDDKQLYQLESTENLL 1093
Score = 32.4 bits (74), Expect = 0.45
Identities = 40/285 (14%), Positives = 95/285 (33%), Gaps = 17/285 (5%)
Query: 29 QLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLM 88
+ K+I+ +Q R+ ++ K +L + +L + ++ + + T+
Sbjct: 769 KRIKKIQVIQHGFRLRRLVDYELKWRLFIKLQPLLSLLGSRKEYRSYLACIIK-LQKTIK 827
Query: 89 KKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQ---TIEQEQERVVNKL 145
++ + + E+ + E L R L ++++ L + Q +RV
Sbjct: 828 REKKLRETEEVEFSLK----AEVLIQKFGRSLK--AKKRFSLLKKETIYLQSAQRVELAE 881
Query: 146 MRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQ-I 204
+ E L L E +E + L+ L + + + K++L I
Sbjct: 882 RQLQELKIDVKSISSLKLVNLELESEIIELK-KSLSSDLIENLEFKTELIARLKKLLNNI 940
Query: 205 KLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQE---HTEKMERYV 261
L++ S N + + E L + +E +++ +
Sbjct: 941 DLEEGPSIEYVKLPELNKLHEVE--SKLKETSEEYEDLLKKSTILVREGNKANSELKNFK 998
Query: 262 KDEKNVREENLRLQRKLQREVERREALCRHLSESESSLEMEEERH 306
K+ + ++ LQ ++ E + S S+ E
Sbjct: 999 KELAELSKQYGALQESTKQLKELPVEVAELQSASKIISSESTELS 1043
Score = 31.2 bits (71), Expect = 1.1
Identities = 25/169 (14%), Positives = 61/169 (36%), Gaps = 16/169 (9%)
Query: 26 TREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISN 85
+L+ I L++ + +E + + + K L NI + E++
Sbjct: 900 VNLELESEIIELKKS--LSSDLIENLEFKTELIARLKKLLN----NIDLEEGPSIEYVKL 953
Query: 86 TLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEK--------YRLEQTIEQE 137
+ K+ ++ + + + YE + T + +E K + QE
Sbjct: 954 PELNKLHEVESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQE 1013
Query: 138 QERVVNKLMRKIEKLEAETDAKQNNLEQLRREK--VELENTLEQEQEAL 184
+ + +L ++ +L++ + + +L K +L+ L E L
Sbjct: 1014 STKQLKELPVEVAELQSASKIISSESTELSILKPLQKLKGLLLLENNQL 1062
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 33.9 bits (78), Expect = 0.14
Identities = 18/50 (36%), Positives = 25/50 (50%), Gaps = 3/50 (6%)
Query: 138 QERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELE---NTLEQEQEAL 184
+ L +I LEAE + NLE L +E ELE N L++E+ L
Sbjct: 65 PSKEFLDLEEEILDLEAEIKEVEENLESLEKEINELEEWLNVLDEEKSFL 114
Score = 31.2 bits (71), Expect = 1.0
Identities = 38/221 (17%), Positives = 80/221 (36%), Gaps = 56/221 (25%)
Query: 22 PSPVTREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEE 81
P L++ I L+ + + ++ LE+ + ++ L+E L + + E+ E
Sbjct: 64 PPSKEFLDLEEEILDLEAEIKEVEENLESLEKEINELEEWLNVLDEEKSFLDENLEELSE 123
Query: 82 FISNTLM----------------------KKIQALKKEKETLAHHY------EQEEEF-- 111
SN + +K++A ++E Y E+ E
Sbjct: 124 L-SNLDIDFKYLRGAEGLKLGFVAGVINREKLEAFERELWRACRGYIRQAEIEEPLEDPK 182
Query: 112 ---------LTNDLSRKLNQLTEEKYRLEQTIEQEQER--VVNKLMRKIEKLEAETDAKQ 160
DL + L + L E E ER +++K+ ++IE+L+
Sbjct: 183 KTVFIIFFVGKEDLDKVKKILDSFGFELYDVPETEGERSELISKVNKRIEELQ------- 235
Query: 161 NNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRI 201
R + E+ LE+ + ++L +++ EK +
Sbjct: 236 -------RVLEQTESHLEKVLVKIADELLAWDEQVSKEKAV 269
>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
envelope biogenesis, outer membrane].
Length = 835
Score = 33.9 bits (78), Expect = 0.15
Identities = 31/162 (19%), Positives = 63/162 (38%), Gaps = 37/162 (22%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
Q +R+ +L +R L LQL + +R+ ++ S L
Sbjct: 75 NQQTERLNALASDDRQLA----NLLLQLLQSS---RTIREQIAVLRGSLLL-----SRIL 122
Query: 88 MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMR 147
++++ L + + +E+F + + L EK +N L
Sbjct: 123 LQQLGPLPEAGQ-------PQEQFEVTQ---ERDALQAEK------------AYINALEG 160
Query: 148 KIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLW 189
+ E+L T ++ L+Q+ + EL N+L ++EA+ +L
Sbjct: 161 QAEQL---TAEVRDILDQILDTRRELLNSLLSQREAISLQLN 199
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 34.0 bits (77), Expect = 0.17
Identities = 37/178 (20%), Positives = 73/178 (41%), Gaps = 18/178 (10%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
E+ +K E+ + + K E E K++ +E + + + K +EE I
Sbjct: 1610 EEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAE----ELKKAEEENKIKAAE 1665
Query: 88 MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMR 147
K K+K A E++E+ L ++ EE + E+ ++E E +
Sbjct: 1666 EAKKAEEDKKKAEEAKKAEEDEKKAAEALKKE----AEEAKKAEELKKKEAEEK-----K 1716
Query: 148 KIEKLEAETDAKQNNLEQLRREKVE-----LENTLEQEQEALVNKLWKRMDKLEAEKR 200
K E+L+ + + E+ ++E E E ++E++ + L K +K E R
Sbjct: 1717 KAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIR 1774
Score = 32.4 bits (73), Expect = 0.45
Identities = 37/192 (19%), Positives = 81/192 (42%), Gaps = 18/192 (9%)
Query: 32 KRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQE------EEFISN 85
K+ + L++ + K E E K + EE+KN+ KAE+ + +
Sbjct: 1546 KKADELKKAEELKKAE-EKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEE 1604
Query: 86 TLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKL 145
MK +A K E+ + ++ E + + + EEK + E+ + E+E +
Sbjct: 1605 KKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAA 1664
Query: 146 M---------RKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLE 196
+K E+ + + ++ E L++E E + E +++ K K+ ++L+
Sbjct: 1665 EEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEK--KKAEELK 1722
Query: 197 AEKRILQIKLDQ 208
+ +IK ++
Sbjct: 1723 KAEEENKIKAEE 1734
Score = 29.0 bits (64), Expect = 6.7
Identities = 37/181 (20%), Positives = 82/181 (45%), Gaps = 15/181 (8%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
EQL+K+ +++ LK E K+ +A +E K +AK +E+E +
Sbjct: 1636 EQLKKKEAEEKKKAEELKKAEEENKI--KAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEA 1693
Query: 88 MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMR 147
+KK K+ E L +E+ +K +L + + + E+ + + + +
Sbjct: 1694 LKKEAEEAKKAEELKKKEAEEK--------KKAEELKKAEEENKIKAEEAK-KEAEEDKK 1744
Query: 148 KIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMD---KLEAEKRILQI 204
K E+ + + + K + L++E+ + + +E+EA++ + D ++E +K+I I
Sbjct: 1745 KAEEAKKDEEEK-KKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDI 1803
Query: 205 K 205
Sbjct: 1804 F 1804
>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
Validated.
Length = 559
Score = 33.7 bits (78), Expect = 0.19
Identities = 29/180 (16%), Positives = 74/180 (41%), Gaps = 24/180 (13%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQE--EEFISN 85
E L +R+E L+Q+ + LK + + + ++E K ++ + + +E
Sbjct: 372 EVLLQRVEQLEQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKVLKEATRQ 431
Query: 86 TL--MKK-----IQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLT---EEKYRLEQTIE 135
L +K +++LK ++++L E ++ + + E + E+ +
Sbjct: 432 DLELLKNVWGEILESLKAQRKSLRALLVNSEPVAASE-----DTVVLAFEYEIHFEKAML 486
Query: 136 QEQER-VVNKLMRKIEK------LEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKL 188
++ + ++ K+ E + ++ E L++ K E E E+E+ L+ +
Sbjct: 487 DKELNDTIENILSKLLGKSVEIVAVPEDEWQKIREEFLQKHKNEEEEEEEEEELPLIPEA 546
>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 548
Score = 33.7 bits (77), Expect = 0.20
Identities = 25/175 (14%), Positives = 58/175 (33%), Gaps = 10/175 (5%)
Query: 40 QNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKE 99
Q K +T ++++ + E + + + + E E+ I + + + E+E
Sbjct: 254 QQPAGKTAEQTREVKIILAETEAEVAAWKA---ETRREAEQAEILAEQAIQEEKAQAEQE 310
Query: 100 TLAHHYEQEEEFLTNDLSRKLNQLTEEKYR--LEQTIEQEQERVVNKLMRKIEKLEAETD 157
+ E + R+ E + R +++ + E + A+ +
Sbjct: 311 VQHAKALEAREMRVGLIERQKETELEPQERSYFINAAQRQAQEEAKAAANIAEAIGAQAE 370
Query: 158 AKQNNLEQLRREKVELENTLEQEQEALVNKL-----WKRMDKLEAEKRILQIKLD 207
A + + + L EA + K EAE + +IK +
Sbjct: 371 AAVETARETEEAERAEQAALVAAAEAAEQEQVEIAVRAEAAKAEAEAQAAEIKAE 425
Score = 31.3 bits (71), Expect = 1.1
Identities = 26/157 (16%), Positives = 52/157 (33%), Gaps = 11/157 (7%)
Query: 32 KRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKI 91
+RI + Q + + E E + + N V ++ + + + T KI
Sbjct: 214 RRIAQVLQDAEIAENEAE----KETEIAIAEANRDAKLVELEVEQQPAGKTAEQTREVKI 269
Query: 92 QALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEK 151
+ E E A E E + + L E+ + E+ +++ + L + +
Sbjct: 270 ILAETEAEVAAWKAETRRE------AEQAEILAEQAIQEEKAQAEQEVQHAKALEAREMR 323
Query: 152 LEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKL 188
+ K+ LE R + Q QE
Sbjct: 324 VGLIERQKETELEPQERS-YFINAAQRQAQEEAKAAA 359
>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
This is a family of proteins that form part of the CAZ
(cytomatrix at the active zone) complex which is
involved in determining the site of synaptic vesicle
fusion. The C-terminus is a PDZ-binding motif that binds
directly to RIM (a small G protein Rab-3A effector). The
family also contains four coiled-coil domains.
Length = 774
Score = 33.4 bits (76), Expect = 0.20
Identities = 44/210 (20%), Positives = 81/210 (38%), Gaps = 8/210 (3%)
Query: 4 SASESDSNSMDGVALMMPPSPVTREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENK 63
SA + D A + LQ +E+L Q+ ++ L+ K L ++E
Sbjct: 280 SALAAMKAKCDRAAQELSRKKTELLGLQTELETLANQDSDMRQHLDKLKEDLTRAEQEKA 339
Query: 64 NLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFL------TNDLS 117
L QT V+ + + +QA ++EK T A E + L
Sbjct: 340 IL-QTEVDALRYELERKHNTLTKKTASLQAAQEEKATYAGEIEDMRDRYEKTERKLRVLQ 398
Query: 118 RKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAET-DAKQNNLEQLRREKVELENT 176
+K+ L E R E+ +++E+ER+ + E + E++ E +
Sbjct: 399 KKIENLQETFRRKERRLKEEKERLRSLQTDTNTDTALEKLEKALAEKERIIERLKEQRDR 458
Query: 177 LEQEQEALVNKLWKRMDKLEAEKRILQIKL 206
E+ ++ K + L+ E + LQ+KL
Sbjct: 459 DERYEQEEFETYKKEFEDLKEEVQNLQLKL 488
>gnl|CDD|236842 PRK11091, PRK11091, aerobic respiration control sensor protein
ArcB; Provisional.
Length = 779
Score = 33.4 bits (77), Expect = 0.22
Identities = 18/74 (24%), Positives = 35/74 (47%), Gaps = 4/74 (5%)
Query: 227 NLSTHIHSLRSEVMRLRNQLANTQ-QEHTEKMERYVKDEKNVREENLRLQRKLQREVERR 285
LS + L E MR R+ N Q +++ ++ + + + + E +L+ E++ R
Sbjct: 86 RLSRLVAKL--EEMRERDLELNVQLKDNIAQLNQEIAEREKAEEARQEAFEQLKNEIKER 143
Query: 286 EALCRHLSESESSL 299
E L + +SSL
Sbjct: 144 EETQIELEQ-QSSL 156
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 33.5 bits (77), Expect = 0.22
Identities = 34/183 (18%), Positives = 70/183 (38%), Gaps = 21/183 (11%)
Query: 115 DLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELE 174
DL + N+ +EK+ + I+Q + KL ++++KLE N LE+ E ELE
Sbjct: 262 DLEKLFNEALDEKFERD-KIKQLASELEKKLEKELKKLE-------NKLEKQEDELEELE 313
Query: 175 NTLEQEQ---EALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTH 231
E+ + E L L +++ R+ ++ + + A
Sbjct: 314 KAAEELRQKGELLYANLQL-IEEGLKSVRLADFYGNEEIKIELDK-SKTPSENAQRYFKK 371
Query: 232 IHSLRSEVMRLRNQLANTQQE--------HTEKMERYVKDEKNVREENLRLQRKLQREVE 283
L+ + L QL+ ++ + K + +REE + ++ +
Sbjct: 372 YKKLKGAKVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKK 431
Query: 284 RRE 286
R++
Sbjct: 432 RKK 434
Score = 32.7 bits (75), Expect = 0.30
Identities = 32/168 (19%), Positives = 59/168 (35%), Gaps = 27/168 (16%)
Query: 71 NIQAKAEQEEEFISNT---LMKKIQALKKEKETL---AHHYEQEEEFLTNDLS------- 117
I+ A + E+ + L K++ + E E L A Q+ E L +L
Sbjct: 279 KIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQKGELLYANLQLIEEGLK 338
Query: 118 --RKLN--QLTEEKYRLEQ-----TIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRR 168
R + E K L++ Q + KL L+ + + +
Sbjct: 339 SVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYYES 398
Query: 169 -----EKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVS 211
EK E + +E+ +E L+ + + K + +K+ K VS
Sbjct: 399 AKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEKFRWFVS 446
>gnl|CDD|236270 PRK08471, flgK, flagellar hook-associated protein FlgK; Validated.
Length = 613
Score = 33.5 bits (77), Expect = 0.22
Identities = 21/81 (25%), Positives = 41/81 (50%), Gaps = 16/81 (19%)
Query: 107 QEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVN----------KLMRKIEKLEAET 156
Q+ E LTN++ +L L++ + +E + V+ ++ ++I+++EA
Sbjct: 139 QKTETLTNNIKDTRERL----DTLQKKVNEELKVTVDEINSLGKQIAEINKQIKEVEAGK 194
Query: 157 DAKQNNLEQLRREKVELENTL 177
K N +LR ++ ELE TL
Sbjct: 195 TLKHAN--ELRDKRDELELTL 213
>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD. This
model describes the DndB protein encoded by an operon
associated with a sulfur-containing modification to DNA.
The operon is sporadically distributed in bacteria, much
like some restriction enzyme operons. DndD is described
as a putative ATPase. The small number of examples known
so far include species from among the Firmicutes,
Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
metabolism, Restriction/modification].
Length = 650
Score = 33.1 bits (76), Expect = 0.24
Identities = 26/99 (26%), Positives = 41/99 (41%), Gaps = 9/99 (9%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
EQ+ + +E L + L + LR L+ + + + K +Q+ + L
Sbjct: 421 EQIAQLLEELGEAQNELFRSEAEIEELLRQLETLKEAIEALRKTLDEKTKQKIN--AFEL 478
Query: 88 MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEE 126
+ I K K+TL +EF L RKL QL EE
Sbjct: 479 ERAITIADKAKKTL-------KEFREKLLERKLQQLEEE 510
>gnl|CDD|227605 COG5280, COG5280, Phage-related minor tail protein [Function
unknown].
Length = 634
Score = 33.3 bits (76), Expect = 0.25
Identities = 13/108 (12%), Positives = 38/108 (35%), Gaps = 6/108 (5%)
Query: 85 NTLMKKIQALKKEKETL---AHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERV 141
L ++IQ L + +++ D ++ EE RL + E
Sbjct: 54 EALAQRIQLLTGQVTLTVKNLDDLAKQQA--QVDAQFGSGEIGEEAERLFTREIRVTEGG 111
Query: 142 VNKLMRKIEKLEAETDAKQNNLEQLRREKVEL-ENTLEQEQEALVNKL 188
+ + ++ + L+ V ++++ + E++ +++
Sbjct: 112 LTGSLSELNEPTVSVREHGKGLKDEELRFVTGGDSSIAEVFESIKDRV 159
>gnl|CDD|237619 PRK14135, recX, recombination regulator RecX; Provisional.
Length = 263
Score = 32.9 bits (76), Expect = 0.25
Identities = 24/95 (25%), Positives = 44/95 (46%), Gaps = 17/95 (17%)
Query: 126 EKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRR------------EKVEL 173
+Y E IE Q ++ KL++K +KL + KQ ++ L E+++L
Sbjct: 147 SEYTEEDQIEVAQ-KLAEKLLKKYQKLPFKA-LKQKIIQSLLTKGFSYEVIKAALEELDL 204
Query: 174 ENTLEQEQEALV---NKLWKRMDKLEAEKRILQIK 205
E E+EQE L K +++ K + + ++K
Sbjct: 205 EQDEEEEQELLQKELEKAYRKYSKYDGYELKQKLK 239
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an
obligate intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification
of the inclusion membrane by a set of C.
trachomatis-encoded proteins collectively designated
Incs. One of these Incs, IncA, is functionally
associated with the homotypic fusion of inclusions. This
family probably includes members of the wider Inc family
rather than just IncA.
Length = 186
Score = 32.4 bits (74), Expect = 0.27
Identities = 28/121 (23%), Positives = 53/121 (43%), Gaps = 9/121 (7%)
Query: 54 QLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLT 113
+L LQ E L+Q +Q + E EE I+ L +++ LK++ + E L
Sbjct: 73 KLEELQGELSELKQQLSELQEELEDLEERIA-ELESELEDLKED-------LQLLRELLK 124
Query: 114 NDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVEL 173
L +L L E L + + + ++ + ++ E+LE + Q E ++ + L
Sbjct: 125 -SLEERLESLEESIKELAKELRELRQDLREEVEELREELERLQENLQRLQEAIQELQSLL 183
Query: 174 E 174
E
Sbjct: 184 E 184
Score = 30.1 bits (68), Expect = 1.5
Identities = 27/142 (19%), Positives = 57/142 (40%), Gaps = 19/142 (13%)
Query: 159 KQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHD 218
+ ++ +R +K+E E + +++L + ++ LE L+ +L+ D +
Sbjct: 62 LKAPVQSVRPQKLEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRE 121
Query: 219 MNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKL 278
+ SL + L + +E E +++REE L+ +L
Sbjct: 122 LLK------------SLEERLESLEESIKELAKELRE-------LRQDLREEVEELREEL 162
Query: 279 QREVERREALCRHLSESESSLE 300
+R E + L + E +S LE
Sbjct: 163 ERLQENLQRLQEAIQELQSLLE 184
Score = 29.3 bits (66), Expect = 2.3
Identities = 31/118 (26%), Positives = 58/118 (49%), Gaps = 6/118 (5%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
E+LQ + L+QQ L+ ELE + ++ L+ E ++L++ ++ + EE + +L
Sbjct: 75 EELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEERLE-SL 133
Query: 88 MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKL 145
+ I+ L KE L +E E L +L R +E + Q QE + ++ +L
Sbjct: 134 EESIKELAKELRELRQDLREEVEELREELER-----LQENLQRLQEAIQELQSLLEQL 186
>gnl|CDD|218434 pfam05103, DivIVA, DivIVA protein. The Bacillus subtilis divIVA1
mutation causes misplacement of the septum during cell
division, resulting in the formation of small, circular,
anucleate mini-cells. Inactivation of divIVA produces a
mini-cell phenotype, whereas overproduction of DivIVA
results in a filamentation phenotype. These proteins
appear to contain coiled-coils.
Length = 131
Score = 31.5 bits (72), Expect = 0.27
Identities = 22/95 (23%), Positives = 44/95 (46%), Gaps = 4/95 (4%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
E L K E L+++ L+ +LE YK LQ+ ++T+ ++A A++E E I
Sbjct: 35 EALYKENEELKEEIERLEEKLEEYKELEETLQKTLVVAQETAEEVKANAQKEAELIIKEA 94
Query: 88 MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQ 122
K + + + A+ ++ ++L R+
Sbjct: 95 EAKAERIVND----ANEEAKKLATEYDELKRQSRL 125
Score = 28.8 bits (65), Expect = 2.4
Identities = 28/114 (24%), Positives = 52/114 (45%), Gaps = 11/114 (9%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
+Q+ K E+L ++N LK E+E + +L +E + L++T V Q AE+ +
Sbjct: 28 DQVIKDYEALYKENEELKEEIERLEEKLEEYKELEETLQKTLVVAQETAEEVK------- 80
Query: 88 MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERV 141
+KE E + E + E + ND + + +L E L++ + R
Sbjct: 81 ----ANAQKEAELIIKEAEAKAERIVNDANEEAKKLATEYDELKRQSRLFRTRF 130
>gnl|CDD|191309 pfam05597, Phasin, Poly(hydroxyalcanoate) granule associated
protein (phasin). Polyhydroxyalkanoates (PHAs) are
storage polyesters synthesised by various bacteria as
intracellular carbon and energy reserve material. PHAs
are accumulated as water-insoluble inclusions within the
cells. This family consists of the phasins PhaF and PhaI
which act as a transcriptional regulator of PHA
biosynthesis genes. PhaF has been proposed to repress
expression of the phaC1 gene and the phaIF operon.
Length = 132
Score = 31.5 bits (72), Expect = 0.27
Identities = 24/102 (23%), Positives = 49/102 (48%), Gaps = 5/102 (4%)
Query: 74 AKAEQEEEFISNTLMKKIQALKKE-KETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQ 132
AKA++E + L+K+ + L+K ++ E E + + +S ++ + +LEQ
Sbjct: 31 AKAQEEGGKVFEALVKEGEELEKRTRKLAEEQVEAVRESVKSRVSEVKDKAEGQWDKLEQ 90
Query: 133 TIEQEQERVVNKL----MRKIEKLEAETDAKQNNLEQLRREK 170
++ + +N+L +++E L A D LE+L K
Sbjct: 91 AFDERVAKALNRLGVPSRKEVEALSARIDELTAQLEELAEAK 132
>gnl|CDD|236798 PRK10929, PRK10929, putative mechanosensitive channel protein;
Provisional.
Length = 1109
Score = 33.1 bits (76), Expect = 0.29
Identities = 43/224 (19%), Positives = 91/224 (40%), Gaps = 35/224 (15%)
Query: 73 QAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLT-NDLSRKL----NQLTEEK 127
+A+Q ++ I N K L+++ ++ + L +++ +QL E+
Sbjct: 64 LERAKQYQQVIDN-FPKLSAELRQQLNNERDEPRSVPPNMSTDALEQEILQVSSQLLEKS 122
Query: 128 YRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNK 187
+ +Q EQ++ R ++ + ++ + + E + N +E+ + L Q Q
Sbjct: 123 RQAQQ--EQDRAREISDSLSQLPQQQTEARRQLNEIERRLQTLGTPNTPLAQAQLTA--- 177
Query: 188 LWKRMDKLEAEKRILQIKLDQ-PVSDPSSPHDMNNGDTASNLSTHIHSLRSE-----VMR 241
L+AE L+ +D+ ++ S+ NN + L + + RS+ +
Sbjct: 178 -------LQAESAALKALVDELELAQLSA----NNRQELARLRSELAKKRSQQLDAYLQA 226
Query: 242 LRNQLANTQQ-------EHTEKMERYVKDEKNVREENLRLQRKL 278
LRNQL + +Q E TE + D ++ R+L
Sbjct: 227 LRNQLNSQRQREAERALESTELLAEQSGDLPKSIVAQFKINREL 270
>gnl|CDD|234172 TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family.
Members of this protein family are homologs of ClpB, an
ATPase associated with chaperone-related functions.
These ClpB homologs, designated ClpV1, are a key
component of the bacterial pathogenicity-associated type
VI secretion system [Protein fate, Protein and peptide
secretion and trafficking, Cellular processes,
Pathogenesis].
Length = 852
Score = 33.0 bits (76), Expect = 0.30
Identities = 34/105 (32%), Positives = 51/105 (48%), Gaps = 7/105 (6%)
Query: 87 LMKKIQALKKEKETL---AHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVN 143
L ++I AL+ E + L A +E L +L +L L E LE +QE+E +V
Sbjct: 424 LRRRIAALELELDALEREAALGADHDERLA-ELRAELAALEAELAALEARWQQEKE-LVE 481
Query: 144 KLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQ--EALVN 186
++ +LEA+ DA ++ LR + ELE L Q E LV
Sbjct: 482 AILALRAELEADADAPADDDAALRAQLAELEAALASAQGEEPLVF 526
Score = 29.5 bits (67), Expect = 3.6
Identities = 33/126 (26%), Positives = 55/126 (43%), Gaps = 19/126 (15%)
Query: 16 VALMMPPSPVTREQLQKRIESLQQQNRVLKVELET---YKLQLRALQEENKNLRQTSVNI 72
VAL +P E L++RI +L+ + L+ E + +L L+ E L +
Sbjct: 410 VALSQNATPAALEDLRRRIAALELELDALEREAALGADHDERLAELRAELAALEAELAAL 469
Query: 73 QAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQ 132
+A+ +QE+E L++ I AL+ E E A + + D + QL E LE
Sbjct: 470 EARWQQEKE-----LVEAILALRAELEADA---DAPAD----DDAALRAQLAE----LEA 513
Query: 133 TIEQEQ 138
+ Q
Sbjct: 514 ALASAQ 519
Score = 28.8 bits (65), Expect = 6.6
Identities = 23/119 (19%), Positives = 40/119 (33%), Gaps = 27/119 (22%)
Query: 147 RKIEKLEAETDAKQNNLEQLRREK----------VELENTLEQEQEAL--VNKLWKRMDK 194
+E L A + L+ L RE EL L + L + W++ +
Sbjct: 419 AALEDLRRRIAALELELDALEREAALGADHDERLAELRAELAALEAELAALEARWQQEKE 478
Query: 195 LEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEH 253
L L+ +L+ P+ +LR+++ L LA+ Q E
Sbjct: 479 LVEAILALRAELEADADAPADD---------------DAALRAQLAELEAALASAQGEE 522
>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail. The myosin molecule is a
multi-subunit complex made up of two heavy chains and
four light chains it is a fundamental contractile
protein found in all eukaryote cell types. This family
consists of the coiled-coil myosin heavy chain tail
region. The coiled-coil is composed of the tail from two
molecules of myosin. These can then assemble into the
macromolecular thick filament. The coiled-coil region
provides the structural backbone the thick filament.
Length = 859
Score = 33.1 bits (76), Expect = 0.30
Identities = 49/166 (29%), Positives = 74/166 (44%), Gaps = 11/166 (6%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
+ Q+ +L + LK ELE K Q+ AL+ ENKNL+ I +Q E N
Sbjct: 400 LDTAQREARNLSTELFRLKNELEELKDQVEALRRENKNLQD---EIHDLTDQLGEGGRNV 456
Query: 87 --LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKL------NQLTEEKYRLEQTIEQEQ 138
L K + L+ EK+ L E+ E L + S+ L +Q+ E R E+E
Sbjct: 457 HELEKARRRLEAEKDELQAALEEAEAALELEESKVLRAQVELSQIRSEIERRLAEKEEEF 516
Query: 139 ERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEAL 184
E R IE L+A +A+ + R K +LE + + + AL
Sbjct: 517 ENTRKNHQRAIESLQATLEAEAKGKAEASRLKKKLEGDINELEIAL 562
Score = 28.9 bits (65), Expect = 6.3
Identities = 44/202 (21%), Positives = 85/202 (42%), Gaps = 29/202 (14%)
Query: 26 TREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISN 85
+ QK ++ QQQ + L+ ++E + +E+ + +AE EE + +
Sbjct: 568 ANAEAQKNVKKYQQQVKELQTQVEEEQRAREDAREQLAVAERR--ATALEAELEE--LRS 623
Query: 86 TLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERV---- 141
L + +A K+ + LA E+ N+L+ + + L +K +LE + Q +
Sbjct: 624 ALEQAERARKQAETELAEASER-----VNELTAQNSSLIAQKRKLEGELAALQSDLDEAV 678
Query: 142 ---------VNKLMRKIEKLEAETDAKQNNLEQLRREKVELENT-------LEQEQEALV 185
K +L E +Q + + L R + +LE+ L++ + A +
Sbjct: 679 NELKAAEERAKKAQADAARLAEELRQEQEHSQHLERLRKQLESQVKELQVRLDEAEAAAL 738
Query: 186 NKLWKRMDKLEAEKRILQIKLD 207
K + KLEA R L+ +LD
Sbjct: 739 KGGKKMIQKLEARVRELEAELD 760
>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein. The sequences
featured in this family are similar to a region of human
TPR protein and to yeast myosin-like proteins 1 (MLP1)
and 2 (MLP2). These proteins share a number of features;
for example, they all have coiled-coil regions and all
three are associated with nuclear pores. TPR is thought
to be a component of nuclear pore complex- attached
intra-nuclear filaments, and is implicated in nuclear
protein import. Moreover, its N-terminal region is
involved in the activation of oncogenic kinases,
possibly by mediating the dimerisation of kinase domains
or by targeting these kinases to the nuclear pore
complex. MLP1 and MLP2 are involved in the process of
telomere length regulation, where they are thought to
interact with proteins such as Tel1p and modulate their
activity.
Length = 132
Score = 31.5 bits (72), Expect = 0.31
Identities = 26/115 (22%), Positives = 56/115 (48%), Gaps = 4/115 (3%)
Query: 30 LQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMK 89
L++ SLQ + + L + E + +++ LQE+ + + + Q K E+E + ++
Sbjct: 1 LEEEKSSLQAELKRLLEKKEDAEEKIQKLQEDLEEQAEIANEAQQKYERELVKHA-EDIE 59
Query: 90 KIQALKKE---KETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERV 141
++QAL+K+ + + E E +LS E+K LE + + ++R+
Sbjct: 60 ELQALRKQLNELKKEIAQLKAEAESAQAELSEAEESWEEQKKMLEDELSELEKRI 114
>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
protein, PEP-CTERM locus subfamily. Members of this
protein family belong to the family of polysaccharide
chain length determinant proteins (pfam02706). All are
found in species that encode the PEP-CTERM/exosortase
system predicted to act in protein sorting in a number
of Gram-negative bacteria, and are found near the epsH
homolog that is the putative exosortase gene [Cell
envelope, Biosynthesis and degradation of surface
polysaccharides and lipopolysaccharides].
Length = 498
Score = 32.7 bits (75), Expect = 0.34
Identities = 62/362 (17%), Positives = 118/362 (32%), Gaps = 83/362 (22%)
Query: 36 SLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALK 95
++ Q+ R++ L + + + R +++ AK+ + E + L K I
Sbjct: 65 NVDQKIRIMSRTLLSRPNLEKVI-------RMLDLDLGAKSPAQLEALITKLRKNISISL 117
Query: 96 KEKETL------------AHHYEQE--EEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERV 141
++ L A Q F+ L K + +++ I+ ++++
Sbjct: 118 AGRDNLFTISYEDKDPELAKDVVQTLLTIFVEETLGSKRQDSDSAQRFIDEQIKTYEKKL 177
Query: 142 V---NKLMR-KIEK----------LEAETDAKQNNLEQLRREKVELENTLEQEQEAL--- 184
N+L K E +E Q LE R E E + + L
Sbjct: 178 EAAENRLKAFKQENGGILPDQEGDYYSEISEAQEELEAARLELNEAIAQRDALKRQLGGE 237
Query: 185 ------------------VNKLWKRMDKL-----------EAEKRILQIKLDQPVSDPSS 215
+ L K++D L A KR + +Q + S+
Sbjct: 238 EPVLLAGSSVANSELDGRIEALEKQLDALRLRYTDKHPDVIATKREIAQLEEQKEEEGSA 297
Query: 216 PHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVRE------ 269
+ + + L+ E+ ++A+ + E R + E +R
Sbjct: 298 KN--GGPERGEIANPVYQQLQIELAEAEAEIASLEARVAELTARIERLESLLRTIPEVEA 355
Query: 270 ENLRLQRKLQREVERR--EALCRHLSESESSLEMEEERHYNEIAHSMRLVGSSPSSSRPL 327
E +L R EV + E L +E S +ME + A S R++ S+P
Sbjct: 356 ELTQLNR--DYEVNKSNYEQLLTRRESAEVSKQMEVQDK----AVSFRIIDPPIVPSKPS 409
Query: 328 SP 329
P
Sbjct: 410 GP 411
>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
Length = 865
Score = 32.8 bits (76), Expect = 0.34
Identities = 11/38 (28%), Positives = 21/38 (55%)
Query: 25 VTREQLQKRIESLQQQNRVLKVELETYKLQLRALQEEN 62
+L +R+E+L ++ + L+ ELE K +L A +
Sbjct: 712 AKPSELPERVEALLEELKELEKELEQLKAKLAAAAAGD 749
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 32.7 bits (75), Expect = 0.36
Identities = 38/175 (21%), Positives = 72/175 (41%), Gaps = 21/175 (12%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
+QL K++ L+ + E E + L+ +EE + + + A E + +
Sbjct: 173 DQLSKKLAELKAE------EEEELERALKEKREELL----SKLEEELLARLESK-EAALE 221
Query: 88 MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMR 147
+ ++EKE L YE++ L +L + + E+K + + Q E
Sbjct: 222 KQLRLEFEREKEELRKKYEEK---LRQEL-ERQAEAHEQKLK-NELALQAIELQREFNKE 276
Query: 148 KIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEAL-----VNKLWKRMDKLEA 197
EK+E E + + L +L LE L+ EA V +LW ++ L++
Sbjct: 277 IKEKVEEERNGRLAKLAELNSRLKGLEKALDSRSEAEDENHKVQQLWLAVEALKS 331
Score = 28.5 bits (64), Expect = 7.0
Identities = 45/239 (18%), Positives = 90/239 (37%), Gaps = 52/239 (21%)
Query: 57 ALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKE----TLAHHYEQEEEF- 111
+ + + +A+A+ T K IQ L + E L +
Sbjct: 80 KQVKIPRQSGLSEEVAEAEAKATSVAAEATTPKSIQELVEALEELLEELLKETASDPVVQ 139
Query: 112 ----LTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAK-QNNLEQL 166
+ NDL + + K LE I +E + ++L +K+ +L+AE + + + L++
Sbjct: 140 ELVSIFNDLIDSIKE-DNLKDDLESLIASAKEEL-DQLSKKLAELKAEEEEELERALKEK 197
Query: 167 RREKVE-LENTLEQEQEALVNKLWKRMD-KLEAEKRILQIKLDQPVSDPSSPHDMNNGDT 224
R E + LE L E+ L K++ + E EK L+ K ++
Sbjct: 198 REELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEE---------------- 241
Query: 225 ASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKLQREVE 283
+LR +L + H +K+ + + + + LQR+ +E++
Sbjct: 242 ----------------KLRQELERQAEAHEQKL------KNELALQAIELQREFNKEIK 278
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 32.0 bits (73), Expect = 0.39
Identities = 19/66 (28%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 119 KLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLE 178
KL++L +EK ++ IE ++ ++ KI++L+ E D + +++L++E EL+ +
Sbjct: 39 KLSELQKEKKNIQNEIESLDNQI-EEIQSKIDELQKEIDQSKAEIKKLQKEIAELKENIV 97
Query: 179 QEQEAL 184
+ QE L
Sbjct: 98 ERQELL 103
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 32.7 bits (75), Expect = 0.42
Identities = 66/299 (22%), Positives = 118/299 (39%), Gaps = 34/299 (11%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
E+L+ R+E + + E E+ + L+E + LR+ + ++++ E+ E +
Sbjct: 323 DEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVE-D 381
Query: 87 LMKKIQALKKEKETLAHHYEQEEEFLTN------DLSRKLNQLTEEKYRLEQTIEQEQER 140
++I+ L++E E L + L N +L + ++L E + LE T+ +ER
Sbjct: 382 RREEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARER 441
Query: 141 VVNKLMRKIEKLEAETDAKQ-----------NNLEQLRREKVELENTLEQ---EQEALVN 186
V + E L + +E+ R ELE LE E E +
Sbjct: 442 V-----EEAEALLEAGKCPECGQPVEGSPHVETIEEDRERVEELEAELEDLEEEVEEVEE 496
Query: 187 KLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQL 246
+L + D +EAE RI +L++ D +T LR L +
Sbjct: 497 RLERAEDLVEAEDRIE--RLEERREDLEELIA-ERRETIEEKRERAEELRERAAELEAEA 553
Query: 247 ANTQQEHTEKMERYVKDEKNVREENLRLQRKLQREVERREALCRHLSESESSLEMEEER 305
++ E E + REE L KL ER E+L R + +++ E+
Sbjct: 554 EEKREAAAEAEEE----AEEAREEVAELNSKLAELKERIESLER-IRTLLAAIADAEDE 607
Score = 30.4 bits (69), Expect = 2.3
Identities = 43/185 (23%), Positives = 77/185 (41%), Gaps = 37/185 (20%)
Query: 26 TREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISN 85
E+L++ IE L+++ V+L + L L+EE LR+ ++A E +
Sbjct: 385 EIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARERV-- 442
Query: 86 TLMKKIQALKKEK---------------ETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRL 130
++ +AL + ET+ E+ EE L +L L EE +
Sbjct: 443 ---EEAEALLEAGKCPECGQPVEGSPHVETIEEDRERVEE-----LEAELEDLEEEVEEV 494
Query: 131 EQTIEQ-----EQERVVNKLMRK-------IEKLEAETDAKQNNLEQLRREKVELENTLE 178
E+ +E+ E E + +L + I + + K+ E+LR ELE E
Sbjct: 495 EERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAE 554
Query: 179 QEQEA 183
+++EA
Sbjct: 555 EKREA 559
Score = 30.0 bits (68), Expect = 2.3
Identities = 60/279 (21%), Positives = 117/279 (41%), Gaps = 49/279 (17%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
E ++ R E L+ ++ L+ LE ++ +A EE ++LR+ + +++ +AE+ E
Sbjct: 309 AEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELRE----- 363
Query: 87 LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
E L E+ E + + ++ E LE+ IE+ +ER + +
Sbjct: 364 ----------EAAELESELEEARE----AVEDRREEIEE----LEEEIEELRERFGDAPV 405
Query: 147 RKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKL 206
+ E + + ++LR + ELE TL +E + + LEA K +
Sbjct: 406 D-LGNAEDFLEELREERDELREREAELEATLRTARERVE----EAEALLEAGK---CPEC 457
Query: 207 DQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKN 266
QPV SPH +T + L +E+ L ++ ++
Sbjct: 458 GQPVEG--SPH----VETIEEDRERVEELEAELEDLEEEVEEVEERLERA---------- 501
Query: 267 VREENLRLQRKLQREVERREALCRHLSESESSLEMEEER 305
E+ + + +++R ERRE L ++E ++E + ER
Sbjct: 502 --EDLVEAEDRIERLEERREDLEELIAERRETIEEKRER 538
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 32.3 bits (74), Expect = 0.45
Identities = 16/79 (20%), Positives = 35/79 (44%), Gaps = 1/79 (1%)
Query: 94 LKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYR-LEQTIEQEQERVVNKLMRKIEKL 152
L K + ++ E N+ R+L + EK R E+ +E+ ++ ++ + KL
Sbjct: 292 LSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKL 351
Query: 153 EAETDAKQNNLEQLRREKV 171
E ++ ++L + K
Sbjct: 352 EEIQKKLEDLEKRLEKLKS 370
Score = 29.6 bits (67), Expect = 3.2
Identities = 20/94 (21%), Positives = 39/94 (41%), Gaps = 9/94 (9%)
Query: 115 DLSRKLNQLTEEKYRLEQ-TIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKV-E 172
+L L E YR E+ + + K + E E + L++ EK+ E
Sbjct: 274 ELKETTENLLYENYRTEKLSGLKNSGEPSLKEIH--EARLNEE---ERELKKKFTEKIRE 328
Query: 173 LENTLEQEQEALVNKLWKRMDKLEAEKRILQIKL 206
E LE+ ++ L+ + + KL E+ +++
Sbjct: 329 KEKRLEELEQNLIEERKELNSKL--EEIQKKLED 360
>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
HlyD family. Type I secretion is an ABC transport
process that exports proteins, without cleavage of any
signal sequence, from the cytosol to extracellular
medium across both inner and outer membranes. The
secretion signal is found in the C-terminus of the
transported protein. This model represents the adaptor
protein between the ATP-binding cassette (ABC) protein
of the inner membrane and the outer membrane protein,
and is called the membrane fusion protein. This model
selects a subfamily closely related to HlyD; it is
defined narrowly and excludes, for example, colicin V
secretion protein CvaA and multidrug efflux proteins
[Protein fate, Protein and peptide secretion and
trafficking].
Length = 423
Score = 32.3 bits (74), Expect = 0.49
Identities = 31/155 (20%), Positives = 60/155 (38%), Gaps = 20/155 (12%)
Query: 26 TREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISN 85
+ L+ ++E + Q + L+ EL + QL+AL++ Q + EE
Sbjct: 138 RKSTLRAQLELILAQIKQLEAELAGLQAQLQALRQ------------QLEVISEELEARR 185
Query: 86 TLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKL 145
L +K + + E E E + +L +L E L++ I++ Q
Sbjct: 186 KLKEK-GLVSR-----LELLELERER--AEAQGELGRLEAELEVLKRQIDELQLERQQIE 237
Query: 146 MRKIEKLEAETDAKQNNLEQLRREKVELENTLEQE 180
E++ E Q L +LR + + L++
Sbjct: 238 QTFREEVLEELTEAQARLAELRERLNKARDRLQRL 272
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 140
Score = 31.0 bits (71), Expect = 0.51
Identities = 24/99 (24%), Positives = 49/99 (49%), Gaps = 7/99 (7%)
Query: 53 LQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFL 112
+ A + K L + QA+ E+ E+ L K + L+K+ TL+ E E
Sbjct: 11 QESPAGKAAQKQLEKEFKKRQAELEKLEK----ELQKLKEKLQKDAATLS---EAAREKK 63
Query: 113 TNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEK 151
+L +K+ + ++ +L+Q +++ Q+ + K++ KI K
Sbjct: 64 EKELQKKVQEFQRKQQKLQQDLQKRQQEELQKILDKINK 102
Score = 29.1 bits (66), Expect = 2.0
Identities = 21/110 (19%), Positives = 49/110 (44%), Gaps = 18/110 (16%)
Query: 87 LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLT--EEKYRLEQTIEQEQERVVNK 144
++++ A K ++ L E+E + +L + +L +EK + + E R
Sbjct: 9 ILQESPAGKAAQKQL----EKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAR---- 60
Query: 145 LMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDK 194
EK E E K ++ ++ +L+ L++ Q+ + K+ +++K
Sbjct: 61 -----EKKEKELQKKVQEFQRKQQ---KLQQDLQKRQQEELQKILDKINK 102
>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction
endonuclease-like fold, RmuC family [General function
prediction only].
Length = 448
Score = 32.0 bits (73), Expect = 0.55
Identities = 29/150 (19%), Positives = 62/150 (41%), Gaps = 6/150 (4%)
Query: 37 LQQQNRVLKVELETYKLQLRALQEEN----KNLRQTSVNIQAKAEQEEEFISNTLMKKIQ 92
+ E E + R+LQ N + L + +Q + Q E + L++ +
Sbjct: 43 QLLLLLAFRAEAEQLRTFARSLQALNLELIQELNELKARLQQQLLQSREQLQ-LLIESLA 101
Query: 93 ALKKEKETLAHHYEQEEEFLTNDLSRK-LNQLTEEKYRLEQTIEQEQERVVNKLMRKIEK 151
L E + LA+ +E +L+++ L QL + + + ++ E+ +++ +
Sbjct: 102 QLSSEFQELANEIFEELNRRLAELNQQNLKQLLKPLREVLEKFREQLEQRIHESAEERST 161
Query: 152 LEAETDAKQNNLEQLRREKVELENTLEQEQ 181
L E D ++QL +E L L+ +
Sbjct: 162 LLEEIDRLLGEIQQLAQEAGNLTAALKGNK 191
Score = 31.2 bits (71), Expect = 1.1
Identities = 25/163 (15%), Positives = 54/163 (33%), Gaps = 11/163 (6%)
Query: 44 LKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEE-FISNTLMKKIQALKKEKETLA 102
L V L + L L + L + + ++ + L ++L+ L
Sbjct: 13 LLVNLAFIRQLLLRLGRLEQMLGELAAVLEQLLLLLAFRAEAEQLRTFARSLQALNLELI 72
Query: 103 HHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNN 162
+ + L L + QL L Q + QE ++ E + +
Sbjct: 73 QELNELKARLQQQLLQSREQLQLLIESLAQLSSEFQEL--------ANEIFEELNRRLAE 124
Query: 163 LEQLRREKVE--LENTLEQEQEALVNKLWKRMDKLEAEKRILQ 203
L Q +++ L LE+ +E L ++ + ++ +
Sbjct: 125 LNQQNLKQLLKPLREVLEKFREQLEQRIHESAEERSTLLEEID 167
>gnl|CDD|147082 pfam04740, LXG, LXG domain of WXG superfamily. This domain is
present is the N-terminal region of a group of
polymorphic toxin proteins in bacteria. It is predicted
to use Type VII secretion pathway to mediate export of
bacterial toxins.
Length = 204
Score = 31.5 bits (72), Expect = 0.55
Identities = 21/103 (20%), Positives = 48/103 (46%), Gaps = 15/103 (14%)
Query: 108 EEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKL----EAETDAKQNNL 163
+EEFL ++L L + E+ E+ ++ +N ++ I + + T A Q L
Sbjct: 96 DEEFLEHELKNGLK-------KAEEKTEELKDA-INSILASISDIVHLPKFSTSAFQEQL 147
Query: 164 EQLRREK---VELENTLEQEQEALVNKLWKRMDKLEAEKRILQ 203
++ ++++ +E +QE + +L + +++ R LQ
Sbjct: 148 DKAKKKRNDTLEKLYEFDQELTNELAELEALIQEIQQYVRQLQ 190
>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein. Members of this
protein family belong to a conserved gene four-gene
neighborhood found sporadically in a phylogenetically
broad range of bacteria: Nocardia farcinica,
Symbiobacterium thermophilum, and Streptomyces
avermitilis (Actinobacteria), Geobacillus kaustophilus
(Firmicutes), Azoarcus sp. EbN1 and Ralstonia
solanacearum (Betaproteobacteria). Proteins in this
family average over 1400 amino acids in length
[Hypothetical proteins, Conserved].
Length = 1353
Score = 32.1 bits (73), Expect = 0.56
Identities = 31/157 (19%), Positives = 51/157 (32%), Gaps = 9/157 (5%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
+ +E L + ELE + RAL N R + S
Sbjct: 223 TDVADALEQLDEYRD----ELERLEALERAL--RNFLQRYRRYARTMLRRRATRLRSAQ- 275
Query: 88 MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMR 147
+ L ++ E E +R L E L +E Q +
Sbjct: 276 -TQYDQLSRDLGRARDELETAREEERELDAR-TEALEREADALRTRLEALQGSPAYQDAE 333
Query: 148 KIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEAL 184
++E+ A+ +A Q R+ E E+ LE+E+ L
Sbjct: 334 ELERARADAEALQAAAADARQAIREAESRLEEERRRL 370
Score = 30.2 bits (68), Expect = 2.7
Identities = 43/206 (20%), Positives = 76/206 (36%), Gaps = 31/206 (15%)
Query: 115 DLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLE-------AETDAKQNNLEQLR 167
D++ L QL E + LE+ +E + + N L R + Q +QL
Sbjct: 224 DVADALEQLDEYRDELER-LEALERALRNFLQRYRRYARTMLRRRATRLRSAQTQYDQLS 282
Query: 168 REKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASN 227
R+ + LE +E +L R + LE E L+ +L+ P+ + +
Sbjct: 283 RDLGRARDELETAREEE-RELDARTEALEREADALRTRLEALQGSPA----YQDAEE--- 334
Query: 228 LSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKLQREVERREA 287
+ R++ L+ A+ +Q E R ++ +R+L E R +
Sbjct: 335 ----LERARADAEALQAAAADARQAIREAESRLEEE-----------RRRLDEEAGRLDD 379
Query: 288 LCRHLSESESSLEMEEERHYNEIAHS 313
R L + L ER AH+
Sbjct: 380 AERELRAAREQLARAAERAGLSPAHT 405
>gnl|CDD|238103 cd00176, SPEC, Spectrin repeats, found in several proteins involved
in cytoskeletal structure; family members include
spectrin, alpha-actinin and dystrophin; the spectrin
repeat forms a three helix bundle with the second helix
interrupted by proline in some sequences; the repeats
are independent folding units; tandem repeats are found
in differing numbers and arrange in an antiparallel
manner to form dimers; the repeats are defined by a
characteristic tryptophan (W) residue in helix A and a
leucine (L) at the carboxyl end of helix C and separated
by a linker of 5 residues; two copies of the repeat are
present here.
Length = 213
Score = 31.3 bits (71), Expect = 0.57
Identities = 36/180 (20%), Positives = 81/180 (45%), Gaps = 15/180 (8%)
Query: 34 IESLQQQNRVLKVELETYKLQLRALQEENKNLRQT----SVNIQAKAEQEEEFISNTLMK 89
+E+L +++ L+ EL ++ ++ AL E + L + + IQ + E+ N +
Sbjct: 35 VEALLKKHEALEAELAAHEERVEALNELGEQLIEEGHPDAEEIQERLEE-----LNQRWE 89
Query: 90 KIQALKKEKET-LAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRK 148
+++ L +E+ L + ++ F D L E++ L + V +L++K
Sbjct: 90 ELRELAEERRQRLEEALDLQQFF--RDADDLEQWLEEKEAALASEDLGKDLESVEELLKK 147
Query: 149 IEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQ 208
++LE E +A + L+ L EL LE+ ++ +++++L L ++
Sbjct: 148 HKELEEELEAHEPRLKSLNELAEEL---LEEGHPDADEEIEEKLEELNERWEELLELAEE 204
>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
Provisional.
Length = 389
Score = 31.7 bits (73), Expect = 0.63
Identities = 12/36 (33%), Positives = 26/36 (72%)
Query: 32 KRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQ 67
+R+E+L+++N L+ ++ +L+LR L+ EN+ L +
Sbjct: 1 ERLEALEERNSELEEQIRQLELKLRDLEAENEKLER 36
Score = 30.2 bits (69), Expect = 1.9
Identities = 13/49 (26%), Positives = 25/49 (51%), Gaps = 7/49 (14%)
Query: 28 EQLQKRIESLQQQNRVLKVEL-------ETYKLQLRALQEENKNLRQTS 69
E L++R L++Q R L+++L E + +L L+ E + L+
Sbjct: 4 EALEERNSELEEQIRQLELKLRDLEAENEKLERELERLKSELEKLKSPP 52
Score = 28.6 bits (65), Expect = 6.1
Identities = 17/56 (30%), Positives = 27/56 (48%), Gaps = 13/56 (23%)
Query: 119 KLNQLTEEKYRLEQTIEQ------EQERVVNKLMRKIEKLEAETDAKQNNLEQLRR 168
+L L E LE+ I Q + E KL R++E+L++E LE+L+
Sbjct: 2 RLEALEERNSELEEQIRQLELKLRDLEAENEKLERELERLKSE-------LEKLKS 50
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 31.1 bits (71), Expect = 0.66
Identities = 18/71 (25%), Positives = 33/71 (46%), Gaps = 2/71 (2%)
Query: 113 TNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVE 172
N L +L +L +E L+Q I + Q ++ + E E L+QL +E +
Sbjct: 64 LNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREE--TEERTELLEELKQLEKELKK 121
Query: 173 LENTLEQEQEA 183
L+ LE+ ++
Sbjct: 122 LKAELEKYEKN 132
Score = 29.1 bits (66), Expect = 3.0
Identities = 17/63 (26%), Positives = 35/63 (55%), Gaps = 2/63 (3%)
Query: 136 QEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKL 195
+ + + KL +++E+L+ Q +E+L++ + E E E +E + +L K + KL
Sbjct: 65 NKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEE--LKQLEKELKKL 122
Query: 196 EAE 198
+AE
Sbjct: 123 KAE 125
Score = 28.7 bits (65), Expect = 3.9
Identities = 17/76 (22%), Positives = 36/76 (47%), Gaps = 4/76 (5%)
Query: 26 TREQLQKRIESLQQQNRVLKVELETYKL----QLRALQEENKNLRQTSVNIQAKAEQEEE 81
R +L + ++ L+++ + LK ELE Y+ ++ L+EE K ++ + +
Sbjct: 104 ERTELLEELKQLEKELKKLKAELEKYEKNDPERIEKLKEETKVAKEAANRWTDNIFSLKS 163
Query: 82 FISNTLMKKIQALKKE 97
++S L+KE
Sbjct: 164 YLSKKFGLDEAQLRKE 179
Score = 27.6 bits (62), Expect = 8.6
Identities = 23/83 (27%), Positives = 42/83 (50%), Gaps = 10/83 (12%)
Query: 52 KLQLRALQEENKNLRQTSVNIQA-----KAEQEEEFISNTLMKKIQALKKEKETLAHHYE 106
K +L L++E + L+Q +QA K +EE L+++++ L+KE + L
Sbjct: 68 KTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKKLK---A 124
Query: 107 QEEEFLTNDLSR--KLNQLTEEK 127
+ E++ ND R KL + T+
Sbjct: 125 ELEKYEKNDPERIEKLKEETKVA 147
>gnl|CDD|200948 pfam00038, Filament, Intermediate filament protein.
Length = 312
Score = 31.4 bits (72), Expect = 0.72
Identities = 40/154 (25%), Positives = 66/154 (42%), Gaps = 16/154 (10%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
R L+ +IESLQ++ LK E ++R LQ + + ++ K +
Sbjct: 119 RVDLEMKIESLQEELAFLKKNHEE---EVRELQSQIQGQVNVEMDAARKLD--------- 166
Query: 87 LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
L K + ++ + E LA QE E KL +L + R + +E + +L
Sbjct: 167 LTKALAEIRAQYEELAEKNRQEAE---EWYKSKLEELQQAAARNGDALRSAKEEI-TELR 222
Query: 147 RKIEKLEAETDAKQNNLEQLRREKVELENTLEQE 180
R+I+ LE E + + L R+ ELE E E
Sbjct: 223 RQIQSLEIELQSLKKQKASLERQLAELEERYELE 256
Score = 30.2 bits (69), Expect = 1.9
Identities = 23/101 (22%), Positives = 47/101 (46%), Gaps = 8/101 (7%)
Query: 114 NDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIE-------KLEAETDAKQNNLEQL 166
+L K+ +L ++K + E+ + +L ++++ +L+ E D + E
Sbjct: 28 KELEAKIEELRQKKSAEPSRLYSLYEQEIRELRKQLDELTNERARLQLEIDNLRLAAEDF 87
Query: 167 RREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLD 207
REK E E L Q E + L K +D+ + L++K++
Sbjct: 88 -REKYEDELNLRQSAENDIVGLRKDLDEATLARVDLEMKIE 127
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 31.5 bits (72), Expect = 0.73
Identities = 17/97 (17%), Positives = 34/97 (35%), Gaps = 8/97 (8%)
Query: 112 LTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKV 171
L L L + ++ L + ++N + K+ K + +L QL E
Sbjct: 148 LKEGLEENLEGMKRDEELLN-----KDLNLINSIKPKLRKKLQALKEEIASLRQLADE-- 200
Query: 172 ELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQ 208
L E E +L K+ +++ L+ +
Sbjct: 201 -LNLCDPLELEKARQELRSLSVKISEKRKQLEELQQE 236
>gnl|CDD|235570 PRK05703, flhF, flagellar biosynthesis regulator FlhF; Validated.
Length = 424
Score = 31.4 bits (72), Expect = 0.76
Identities = 28/124 (22%), Positives = 50/124 (40%)
Query: 89 KKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRK 148
KK L++EK A + L SR +Q + +++ ++ +
Sbjct: 58 KKNPVLREEKRKPAKSILSLQALLEKRPSRTNSQDALLQAENALPEWKKELEKPSEPKEE 117
Query: 149 IEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQ 208
K AE+ Q L++LR E EL+N LE + L + E KR+ + L
Sbjct: 118 EPKAAAESKVVQKELDELRDELKELKNLLEDQLSGLRQVERIPPEFAELYKRLKRSGLSP 177
Query: 209 PVSD 212
+++
Sbjct: 178 EIAE 181
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 31.9 bits (73), Expect = 0.77
Identities = 54/374 (14%), Positives = 113/374 (30%), Gaps = 53/374 (14%)
Query: 28 EQLQKRIESLQQQNRVLKV----ELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFI 83
+L++ + L + R +K EL + + L +E K L ++ E+
Sbjct: 1002 GKLERELARLSNKVRFIKHVINGELVITNAKKKDLVKELKKLGYVRF-KDIIKKKSEKIT 1060
Query: 84 SNTLMKKIQALKKEKETLAHHYEQEEEFLTND--LSRKLNQLTEEKYRLEQTIEQEQERV 141
+ + + + E + D LS + LT+EK
Sbjct: 1061 AEEEEGAEEDDEADDEDDEEELGAAVSY---DYLLSMPIWSLTKEK-------------- 1103
Query: 142 VNKLMRKIEKLEAETDAKQNNLEQLRREKVE---------LENTLEQEQEALVNKLWKRM 192
+EKL AE + K+ LE+L+ + E LE+++E ++ K
Sbjct: 1104 -------VEKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQ 1156
Query: 193 DKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQE 252
K + ++ D + S +S R + R +
Sbjct: 1157 RLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNK 1216
Query: 253 HTEKMERYVKDEKNVREENLRLQRKLQREVERREALCRHLSESESSLEMEEERHYNEIAH 312
+ +D++ Q+ ++ + + + S+SS + +E +
Sbjct: 1217 KSNSSGSDQEDDEE--------QKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKE 1268
Query: 313 SMRLVGSSPSSSRPLSPGASMLTTKEPRCFACGQTTVMPIPPPVHSRPQRIKTPLLSVEQ 372
S+ SP P P +R++ L ++++
Sbjct: 1269 GKPKNAPKRVSAVQYSPPPPSKRPDGES-----NGGSKPSSPTKKKVKKRLEGSLAALKK 1323
Query: 373 PTSPEGAPTPTLGS 386
E S
Sbjct: 1324 KKKSEKKTARKKKS 1337
Score = 31.6 bits (72), Expect = 0.80
Identities = 32/163 (19%), Positives = 61/163 (37%), Gaps = 42/163 (25%)
Query: 159 KQNNLEQLRREKVELENTLE------QEQ--------EALVNKLWKR-------MDKLEA 197
K+ L +L RE L N + + + LV +L K + K ++
Sbjct: 997 KEYLLGKLERELARLSNKVRFIKHVINGELVITNAKKKDLVKELKKLGYVRFKDIIKKKS 1056
Query: 198 EKRILQIKLDQPVSDPSSPHDMNNGDTASN-----LSTHIHSLRSE-VMRLRNQLANTQQ 251
EK + + D + D A+ LS I SL E V +L +L ++
Sbjct: 1057 EKITAEEEEGAEEDDEADDEDDEEELGAAVSYDYLLSMPIWSLTKEKVEKLNAELEKKEK 1116
Query: 252 EHTE---------------KMERYVKDEKNVREENLRLQRKLQ 279
E + K E +++++ V E+ + +++L+
Sbjct: 1117 ELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLK 1159
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 31.8 bits (73), Expect = 0.85
Identities = 43/254 (16%), Positives = 97/254 (38%), Gaps = 32/254 (12%)
Query: 24 PVTREQLQKRIESLQQQ------NRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAE 77
T +Q ++++L +Q +++++ +LE L + + + Q ++ +
Sbjct: 35 LPTEADVQAQLDALNKQKLLEAEDKLVQQDLEQTLALLDKIDRQKEETEQ----LKQQLA 90
Query: 78 QEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTN-DLSRKLNQLTEEKYRLEQTIEQ 136
Q + +++ALK + + + L+ L +L Q ++ + + +
Sbjct: 91 QAPAKLRQA-QAELEALKDDNDEET---RETLSTLSLRQLESRLAQTLDQLQNAQNDLAE 146
Query: 137 EQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLE 196
++V+ L + E+ +A A L+Q+R + +AL + L+
Sbjct: 147 YNSQLVS-LQTQPERAQAALYANSQRLQQIRNLL----KGGKVGGKALRP---SQRVLLQ 198
Query: 197 AEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSE-VMRLRNQLANTQQEHTE 255
AE+ +L + D + +L + + RL +QL Q+
Sbjct: 199 AEQALLNAQNDL------QRKSLEGNTQLQDLLQKQRDYLTARIQRLEHQLQLLQEAINS 252
Query: 256 KMERYVKDEKNVRE 269
K R EK V+E
Sbjct: 253 K--RLTLSEKTVQE 264
>gnl|CDD|234714 PRK00286, xseA, exodeoxyribonuclease VII large subunit; Reviewed.
Length = 438
Score = 31.3 bits (72), Expect = 0.87
Identities = 27/135 (20%), Positives = 59/135 (43%), Gaps = 6/135 (4%)
Query: 23 SPVTREQLQKRIESLQQQ-NRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEE 81
P E LQ R++ LQQ+ R ++ LE + +L L + Q+ + A+ +Q +
Sbjct: 259 VPDRAELLQ-RLQQLQQRLARAMRRRLEQKRQRLDQLA--RRLKFQSPERLLAQQQQRLD 315
Query: 82 FISNTLMKKIQALKKEKETLAHHYEQEEEFLT--NDLSRKLNQLTEEKYRLEQTIEQEQE 139
+ L + ++ + + Q + + R +L + + RL + + ++ +
Sbjct: 316 RLQQRLQRALERRLRLAKQRLERLSQRLQQQNPQRRIERAQQRLEQLEQRLRRAMRRQLK 375
Query: 140 RVVNKLMRKIEKLEA 154
R +L ++LEA
Sbjct: 376 RKRQRLEALAQQLEA 390
Score = 31.3 bits (72), Expect = 0.91
Identities = 34/187 (18%), Positives = 69/187 (36%), Gaps = 58/187 (31%)
Query: 110 EFLTNDLSRKLNQLTEEKYRLEQ----TIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQ 165
E D + L +L + + RL + +EQ+++R+ ++L R++ Q+
Sbjct: 256 ELAVPDRAELLQRLQQLQQRLARAMRRRLEQKRQRL-DQLARRL--------KFQSPERL 306
Query: 166 LRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTA 225
L +++ L+ ++ Q AL +L +LE + LQ
Sbjct: 307 LAQQQQRLDRLQQRLQRALERRLRLAKQRLERLSQRLQ---------------------Q 345
Query: 226 SNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKLQREVERR 285
N I + + +L +L ++R+L+R+ +R
Sbjct: 346 QNPQRRIERAQQRLEQLEQRLRRA------------------------MRRQLKRKRQRL 381
Query: 286 EALCRHL 292
EAL + L
Sbjct: 382 EALAQQL 388
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 31.3 bits (71), Expect = 0.88
Identities = 28/110 (25%), Positives = 51/110 (46%), Gaps = 7/110 (6%)
Query: 92 QALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEK 151
+A K ++E ++ EE L D +KL T + R + +E+ +E +++++
Sbjct: 109 EADKLDEEQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEE------LKELQS 162
Query: 152 LEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRI 201
A+ D LE L R + + E E+E EAL+ L + E +R
Sbjct: 163 RRADVDVNSM-LEALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRA 211
>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
Provisional.
Length = 2722
Score = 31.7 bits (72), Expect = 0.90
Identities = 29/160 (18%), Positives = 65/160 (40%), Gaps = 9/160 (5%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
E++ IE++ N +K EL LR ++E+ + + ++ N +
Sbjct: 2471 EKIDDLIENVTSHNEKIKSELLIINDALRRVKEKKDEMNKLFNSLTENNNNNNNSAKNIV 2530
Query: 88 MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMR 147
+ + L H + E L+ + ++ +L EK +L + + E+ R +
Sbjct: 2531 DNSTYIINE----LESHVSKLNELLSY-IDNEIKELENEKLKLLEKAKIEESRKERE--- 2582
Query: 148 KIEKLEAETDAKQNNLEQLRREKV-ELENTLEQEQEALVN 186
+IE E + + + + ++E++ + E QE L
Sbjct: 2583 RIESETQEDNTDEEQINRQQQERLQKEEEQKAYSQERLNR 2622
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 31.0 bits (70), Expect = 1.0
Identities = 32/120 (26%), Positives = 53/120 (44%), Gaps = 8/120 (6%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
E+L++++E LQ++ L ELE + + +QE K L + EE
Sbjct: 137 YEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLE------VENSRLEEML--KK 188
Query: 87 LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
L ++ LKK + L E EE L +DL ++ L + + I E E+ V L+
Sbjct: 189 LPGEVYDLKKRWDELEPGVELPEEELISDLVKETLNLAPKDIEGQGYIYAEDEKEVEILL 248
Score = 29.5 bits (66), Expect = 3.2
Identities = 21/77 (27%), Positives = 36/77 (46%), Gaps = 1/77 (1%)
Query: 122 QLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQ 181
+L +E LE E+ QER+ +L + +LE + L++ ELE +E +
Sbjct: 153 ELLKELEELEAEYEEVQERL-KRLEVENSRLEEMLKKLPGEVYDLKKRWDELEPGVELPE 211
Query: 182 EALVNKLWKRMDKLEAE 198
E L++ L K L +
Sbjct: 212 EELISDLVKETLNLAPK 228
>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding. This
family is the highly conserved central region of a
number of metazoan proteins referred to as growth-arrest
proteins. In mouse, Gas8 is predominantly a testicular
protein, whose expression is developmentally regulated
during puberty and spermatogenesis. In humans, it is
absent in infertile males who lack the ability to
generate gametes. The localisation of Gas8 in the
motility apparatus of post-meiotic gametocytes and
mature spermatozoa, together with the detection of Gas8
also in cilia at the apical surfaces of epithelial cells
lining the pulmonary bronchi and Fallopian tubes
suggests that the Gas8 protein may have a role in the
functioning of motile cellular appendages. Gas8 is a
microtubule-binding protein localised to regions of
dynein regulation in mammalian cells.
Length = 201
Score = 30.3 bits (69), Expect = 1.1
Identities = 32/145 (22%), Positives = 72/145 (49%), Gaps = 25/145 (17%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKN---LRQTSVNIQAKAEQEEEFIS 84
E +K + + Q+N+ L L+ + ++ L+++ K+ +Q+ N++A+ ++ E
Sbjct: 44 EHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSLKNLKARLKELE---- 99
Query: 85 NTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNK 144
K+++ LK E E L +E+ + R+ ++L + + E I+ Q++ K
Sbjct: 100 ----KELKNLKWESEVLEQRFEK--------VERERDELYD---KFEAAIQDVQQKTGLK 144
Query: 145 ---LMRKIEKLEAETDAKQNNLEQL 166
L +K+E L E + K+ L ++
Sbjct: 145 NLLLEQKLEALNEELEKKEAQLNEV 169
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 30.4 bits (69), Expect = 1.1
Identities = 37/166 (22%), Positives = 72/166 (43%), Gaps = 11/166 (6%)
Query: 132 QTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKR 191
Q ++ E++R L +I+++ + LE L + LE LE + E V++L
Sbjct: 13 QKLDLEKDR----LEPRIKEIRKALKKAKAELEALNKALEALEIELE-DLENQVSQLESE 67
Query: 192 MDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQ 251
+ ++ + + KL V D +N + I+SL E+ L ++ ++
Sbjct: 68 IQEIRERIKRAEEKLSA-VKDERELRALNIEIQIAK--ERINSLEDELAELMEEIEKLEK 124
Query: 252 EHTEKMERYVKDEKNVREENLRLQ---RKLQREVERREALCRHLSE 294
E + ER + EKN+ E RL+ +++ E + + L E
Sbjct: 125 EIEDLKERLERLEKNLAEAEARLEEEVAEIREEGQELSSKREELKE 170
>gnl|CDD|226834 COG4399, COG4399, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 376
Score = 30.8 bits (70), Expect = 1.1
Identities = 36/209 (17%), Positives = 69/209 (33%), Gaps = 37/209 (17%)
Query: 15 GVALMMPPSPVTREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQ------- 67
+ ++ +T E +QK++ Q RV + + + L++ + + L Q
Sbjct: 60 QIGKVVVNHLLTPEGIQKKLRRELFQERVTEAIDQLLQKLLKSEVTDKEQLHQQIFADIE 119
Query: 68 -------------------TSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQE 108
T E + +L + + + E+ A Y
Sbjct: 120 KDLIGNSERWLEKELAEKFTEAQSNTIFELIPLELVESLEQSLPSADWLLESRARPYVAS 179
Query: 109 EEFLTNDLSRKLNQLTEEKYRLEQ--TIEQEQERVVNKLMRKIEKLEAETDAKQ------ 160
L L+Q EE +L + E + ++L +++ KL + K+
Sbjct: 180 LA-GKIQLGDMLDQFIEEHGKLGTMVQMFVVNESLADRLQQELLKLLNREETKKILQQLL 238
Query: 161 -NNLEQLRREKVELENTLEQEQEALVNKL 188
N +EK E T E + AL L
Sbjct: 239 TNEWPTF-KEKPLNELTDEVVRNALAESL 266
>gnl|CDD|225638 COG3096, MukB, Uncharacterized protein involved in chromosome
partitioning [Cell division and chromosome
partitioning].
Length = 1480
Score = 31.0 bits (70), Expect = 1.3
Identities = 46/174 (26%), Positives = 73/174 (41%), Gaps = 27/174 (15%)
Query: 54 QLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLT 113
+ A Q + +R + + + + +Q E + K+ Q + E L QE E L
Sbjct: 509 RHLAEQVQPLRMRLSELEQRLRQQQSAERLLADFCKR-QGKNLDAEEL-EALHQELEALI 566
Query: 114 NDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVE- 172
LS ++ E++ L Q EQ Q R+ + + R L A QN LEQL + E
Sbjct: 567 ESLSDSVSNAREQRMALRQEQEQLQSRIQSLMQRAPVWLAA-----QNALEQLSEQSGEE 621
Query: 173 ----------LENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSP 216
++ LE+E+EA V + D+L A K LD+ + S P
Sbjct: 622 FTDSQDVTEYMQQLLEREREATVER-----DELGARKN----ALDEEIERLSQP 666
Score = 29.1 bits (65), Expect = 5.0
Identities = 50/246 (20%), Positives = 96/246 (39%), Gaps = 26/246 (10%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQ-----TSVNIQAKAEQEEE 81
R ++R+ESL + VL T ++ Q ++ + +V +A E E
Sbjct: 780 RAAREQRLESLHAERDVLSERHATLSFDVQKTQRLHQAFSRFIGSHLAVAFEADPEAEIR 839
Query: 82 FIS---NTLMKKIQALKKEKETLAHHYEQEEEFLT--NDLSRKLNQLTEE-------KYR 129
++ N L + + + + + ++Q +E +T N L +LN L +E + R
Sbjct: 840 QLNSRRNELERALSNHENDNQQQRIQFDQAKEGVTALNRLIPQLNLLADESLADRVEEIR 899
Query: 130 LEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLW 189
QE R + + + KLE Q++ EQ + K + + +++A
Sbjct: 900 ERLDEAQEAARFIQQHGNTLSKLEPIASVLQSDPEQFEQLKEDYAQAQQMQRQA------ 953
Query: 190 KRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANT 249
+ A ++Q + SD S+ N D L + +E R R QL
Sbjct: 954 --RQQAFALTEVVQRRAHFSYSD-SAEMLSENSDLNEKLRQRLEQAEAERTRAREQLRQH 1010
Query: 250 QQEHTE 255
Q + ++
Sbjct: 1011 QAQLSQ 1016
>gnl|CDD|220555 pfam10079, DUF2317, Uncharacterized protein conserved in bacteria
(DUF2317). Members of this family of hypothetical
bacterial proteins have no known function.
Length = 541
Score = 31.0 bits (71), Expect = 1.3
Identities = 21/128 (16%), Positives = 50/128 (39%), Gaps = 18/128 (14%)
Query: 30 LQKRIESLQQQNRVLKVELETYKLQLRALQEENKNL--RQTSVNIQAKAEQEEEFISNTL 87
++ ++E K+ L + L L+ + + + + I E+ + +
Sbjct: 384 IEPKVEKKLD-----KLGLSIEDVFLHGLELLKEKVLEEKANSEIDIDFEEAKAELEAQF 438
Query: 88 MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMR 147
+ + K +L E L + ++ L QL LE+ + + Q R +L+R
Sbjct: 439 KRLREEAAKIDPSL-------EGALEANEAKILKQLE----FLEKRLLKAQRRKHEELLR 487
Query: 148 KIEKLEAE 155
+ ++L+
Sbjct: 488 QFDRLQTA 495
>gnl|CDD|233547 TIGR01730, RND_mfp, RND family efflux transporter, MFP subunit.
This model represents the MFP (membrane fusion protein)
component of the RND family of transporters. RND refers
to Resistance, Nodulation, and cell Division. It is, in
part, a subfamily of pfam00529 (Pfam release 7.5) but
hits substantial numbers of proteins missed by that
model. The related HlyD secretion protein, for which
pfam00529 is named, is outside the scope of this model.
Attributed functions imply outward transport. These
functions include nodulation, acriflavin resistance,
heavy metal efflux, and multidrug resistance proteins.
Most members of this family are found in Gram-negative
bacteria. The proposed function of MFP proteins is to
bring the inner and outer membranes together and enable
transport to the outside of the outer membrane. Note,
however, that a few members of this family are found in
Gram-positive bacteria, where there is no outer membrane
[Transport and binding proteins, Unknown substrate].
Length = 322
Score = 30.7 bits (70), Expect = 1.3
Identities = 11/81 (13%), Positives = 24/81 (29%), Gaps = 1/81 (1%)
Query: 128 YRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNK 187
RL+ Q + + E E+L + + L+ + A+
Sbjct: 55 ARLDDDDYQLALQAALAQLAAAEAQLELAQRSFERAERLVKRNAVSQADLDDAKAAVEAA 114
Query: 188 LWKRMDKLEAEKRILQIKLDQ 208
++ +A Q+ L
Sbjct: 115 Q-ADLEAAKASLASAQLNLRY 134
>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 557
Score = 30.6 bits (70), Expect = 1.3
Identities = 43/215 (20%), Positives = 85/215 (39%), Gaps = 46/215 (21%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEEN-------------------KNLRQT 68
+Q ++ +E LQ++ R + + QL L+E N + L +
Sbjct: 167 KQARRELEDLQEKERERAQRADLLQFQLEELEELNLQPGEDEELEEERKRLSNSEKLAEA 226
Query: 69 SVNIQAKAEQEEEFIS-----NTLMKKIQA-------LKKEKETLAHHYEQEEEFLTNDL 116
N E++ +S ++ ++ L + E L + EE + +L
Sbjct: 227 IQNALELLSGEDDTVSALSLLGRALEALEDLSEYDGKLSELAELLEEALYELEE-ASEEL 285
Query: 117 SRKLNQLTEEKYRLEQTIEQEQERV----------VNKLMRKIEKLEAET---DAKQNNL 163
L++L + RLE+ +E+ + + L+ ++K++ E D + +L
Sbjct: 286 RAYLDELEFDPNRLEE-VEERLFALKSLARKYGVTIEDLLEYLDKIKEELAQLDNSEESL 344
Query: 164 EQLRREKVELENTLEQEQEALVNKLWKRMDKLEAE 198
E L +E +L+ L + EAL K +LE E
Sbjct: 345 EALEKEVKKLKAELLEAAEALSAIRKKAAKELEKE 379
>gnl|CDD|236489 PRK09375, PRK09375, quinolinate synthetase; Provisional.
Length = 319
Score = 30.4 bits (70), Expect = 1.4
Identities = 11/25 (44%), Positives = 18/25 (72%), Gaps = 2/25 (8%)
Query: 87 LMKKIQALKKEKE--TLAHHYEQEE 109
L ++I+ LKKE+ LAH+Y++ E
Sbjct: 21 LKERIKRLKKERNAVILAHYYQRPE 45
>gnl|CDD|217038 pfam02445, NadA, Quinolinate synthetase A protein. Quinolinate
synthetase catalyzes the second step of the de novo
biosynthetic pathway of pyridine nucleotide formation.
In particular, quinolinate synthetase is involved in the
condensation of dihydroxyacetone phosphate and
iminoaspartate to form quinolinic acid. This synthesis
requires two enzymes, a FAD-containing "B protein" and
an "A protein".
Length = 296
Score = 30.5 bits (70), Expect = 1.5
Identities = 11/24 (45%), Positives = 18/24 (75%), Gaps = 2/24 (8%)
Query: 88 MKKIQALKKEKET--LAHHYEQEE 109
+++I+ LKKEK LAH+Y++ E
Sbjct: 1 IEEIKRLKKEKNAVILAHNYQRPE 24
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 30.6 bits (69), Expect = 1.7
Identities = 24/124 (19%), Positives = 50/124 (40%), Gaps = 7/124 (5%)
Query: 72 IQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLE 131
IQ +E+ ++T + A++K + ++ + ++ E E
Sbjct: 27 IQQNNGDKEDSSTSTSSLSVSAVEKTSNAKEE----IQVDFQHNSESAVEEVEAED---E 79
Query: 132 QTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKR 191
+EQ Q V+ E+ + ++ R+ K+E EN ++E E L + + R
Sbjct: 80 IEVEQNQSDVLKSSSIVKEESISTDMDGIDDDSLDRKLKLERENLRKREIEELAEENFSR 139
Query: 192 MDKL 195
+KL
Sbjct: 140 GNKL 143
>gnl|CDD|236993 PRK11820, PRK11820, hypothetical protein; Provisional.
Length = 288
Score = 30.1 bits (69), Expect = 1.8
Identities = 31/144 (21%), Positives = 59/144 (40%), Gaps = 17/144 (11%)
Query: 33 RIESLQQQNRVLKVE---LETYKLQLRALQEENKNLRQTSVN-------IQAKAEQEEEF 82
R+E + L + + Y L L+ E + S++ + E++ E
Sbjct: 67 RLERDESAQGELSLNEDLAKQYLEALEELKAELPEAGEISLDDLLRWPGVLEAEEEDLEA 126
Query: 83 ISNTLMKKI-QALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERV 141
+ L+ + +AL L E+E L DL ++L+ + ++E + E
Sbjct: 127 LWAALLAALDEALDD----LIEMREREGAALKADLLQRLDAIEALVAKIEALAPEILEEY 182
Query: 142 VNKLMRKIEKLEAETDAKQNNLEQ 165
+L ++E+L E D +N LEQ
Sbjct: 183 RERLRERLEELLGELD--ENRLEQ 204
>gnl|CDD|224337 COG1419, FlhF, Flagellar GTP-binding protein [Cell motility and
secretion].
Length = 407
Score = 30.0 bits (68), Expect = 2.1
Identities = 20/114 (17%), Positives = 38/114 (33%), Gaps = 9/114 (7%)
Query: 103 HHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQ--------ERVVNKLMRKIEKL-E 153
++ E D +L + EK R E+ ++ ER ++ E++ E
Sbjct: 62 RKFQTREAAKVKDAEAQLLKDPAEKKREERKAAKKIERSTPSLIERKTQEVKDSGEEIAE 121
Query: 154 AETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLD 207
D K E + ++ Q L + + E IL +K +
Sbjct: 122 MMRDEKVPIRELEEIPPEFVALYKQEIQSPTRLNLINELLRAGLELEILDMKDE 175
>gnl|CDD|226614 COG4129, COG4129, Predicted membrane protein [Function unknown].
Length = 332
Score = 30.0 bits (68), Expect = 2.2
Identities = 21/129 (16%), Positives = 44/129 (34%), Gaps = 19/129 (14%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRAL-QEENKNLRQTSVNIQAKAEQEEEFISN 85
++++ L++ RVL L L + L Q ++ A +
Sbjct: 222 FSMRRRQVRILRRMLRVLNS-LHRTSNLSNLLADLLQELLEQLLESLTAAHLLID----- 275
Query: 86 TLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKL 145
++ +K E L + EEF L++ LE+ +E + + L
Sbjct: 276 -----LEETRKVFEALPLP-KTREEF------ESRAVLSQLLQELEEFLEVKTDFSRVIL 323
Query: 146 MRKIEKLEA 154
++ + A
Sbjct: 324 LKAKSSVGA 332
>gnl|CDD|217414 pfam03189, Otopetrin, Protein of unknown function, DUF270.
Length = 413
Score = 30.0 bits (68), Expect = 2.2
Identities = 11/48 (22%), Positives = 19/48 (39%), Gaps = 5/48 (10%)
Query: 189 WKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGD-----TASNLSTH 231
W + E++ IL+ V D S ++ D T++ ST
Sbjct: 82 WFSLLLAESKHEILKKAQKHIVFDHHSALSHDSSDSSKEQTSTEESTE 129
>gnl|CDD|182730 PRK10787, PRK10787, DNA-binding ATP-dependent protease La;
Provisional.
Length = 784
Score = 30.3 bits (68), Expect = 2.2
Identities = 29/110 (26%), Positives = 57/110 (51%), Gaps = 7/110 (6%)
Query: 110 EFLTNDLSRKLNQLTEEKY---RLEQTIEQEQ-ERVVNKLMRKIEKLEAETDAKQNNLEQ 165
E+L + +++ L EK R+++ +E+ Q E +N+ M+ I+K E D + E
Sbjct: 194 EYLMAMMESEIDLLQVEKRIRNRVKKQMEKSQREYYLNEQMKAIQKELGEMDDAPDENEA 253
Query: 166 LRR--EKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDP 213
L+R + ++ +++ EA + KL K M + AE +++ +D V P
Sbjct: 254 LKRKIDAAKMPKEAKEKAEAELQKL-KMMSPMSAEATVVRGYIDWMVQVP 302
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 30.2 bits (69), Expect = 2.3
Identities = 25/135 (18%), Positives = 52/135 (38%), Gaps = 29/135 (21%)
Query: 34 IESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISN----TLMK 89
IE ++ + + K L+ K+++ E NK + I+ E E + + L+K
Sbjct: 100 IEKAKKNSPLFKSLLKKQKVEV----EGNK------LIIKVNNEIERDHLKKKHLPKLIK 149
Query: 90 KIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKI 149
+ + + + +E L E E+E E++ + + +
Sbjct: 150 QYEKFGFGILKIDFEIDDSKEELEK---------------FEAQKEEEDEKLAKEALEAM 194
Query: 150 EKLEAETDAKQNNLE 164
+KLEAE + N +
Sbjct: 195 KKLEAEKKKQSKNFD 209
>gnl|CDD|217092 pfam02538, Hydantoinase_B, Hydantoinase B/oxoprolinase. This
family includes N-methylhydaintoinase B which converts
hydantoin to N-carbamyl-amino acids, and 5-oxoprolinase
EC:3.5.2.9 which catalyzes the formation of L-glutamate
from 5-oxo-L-proline. These enzymes are part of the
oxoprolinase family and are related to pfam01968.
Length = 527
Score = 30.1 bits (68), Expect = 2.3
Identities = 15/98 (15%), Positives = 31/98 (31%), Gaps = 18/98 (18%)
Query: 129 RLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVE-LENTLEQEQEALVNK 187
R+ + I++ V M I Q+N E+ + K+ L+ + ++
Sbjct: 197 RIGKLIDEYGLETVLGYMAHI----------QDNAEEAIKRKIAKLDEPYGAYDYGVCDR 246
Query: 188 LWKRMD-----KLEAEKRILQIKLDQPVSDPSSPHDMN 220
L + +K D + H+ N
Sbjct: 247 LDTGKLIKVEITINIKKG--TAIFDFDGTSRQGDHNFN 282
>gnl|CDD|239428 cd03312, CIMS_N_terminal_like, CIMS - Cobalamine-independent
methonine synthase, or MetE, N-terminal domain_like.
Many members have been characterized as
5-methyltetrahydropteroyltriglutamate-homocysteine
methyltransferases, EC:2.1.1.14, mostly from bacteria
and plants. This enzyme catalyses the last step in the
production of methionine by transferring a methyl group
from 5-methyltetrahydrofolate to L-homocysteine without
using an intermediate methyl carrier. The active enzyme
has a dual (beta-alpha)8-barrel structure, and this
model covers the N-terminal barrel, and a few
single-barrel sequences most similar to the N-terminal
barrel. It is assumed that the homologous N-terminal
barrel has evolved from the C-terminus via gene
duplication and has subsequently lost binding sites,
and it seems as if the two barrels forming the active
enzyme may sometimes reside on different polypeptides.
The C-terminal domain incorporates the Zinc ion, which
binds and activates homocysteine. Side chains from both
barrels contribute to the binding of the folate
substrate.
Length = 360
Score = 29.8 bits (68), Expect = 2.3
Identities = 14/39 (35%), Positives = 18/39 (46%), Gaps = 3/39 (7%)
Query: 41 NRVLKVELETY---KLQLRALQEENKNLRQTSVNIQAKA 76
NR LK LE+Y K+ L K LR +Q +A
Sbjct: 13 NRELKKALESYWKGKISEEELLATAKELRLRHWKLQKEA 51
>gnl|CDD|163064 TIGR02894, DNA_bind_RsfA, transcription factor, RsfA family. In a
subset of endospore-forming members of the Firmcutes,
members of this protein family are found, several to a
genome. Two very strongly conserved sequences regions
are separated by a highly variable linker region. Much
of the linker region was excised from the seed alignment
for this model. A characterized member is the
prespore-specific transcription RsfA from Bacillus
subtilis, previously called YwfN, which is controlled by
sigma factor F and seems to fine-tune expression of some
genes in the sigma-F regulon. A paralog in Bacillus
subtilis is designated YlbO [Regulatory functions, DNA
interactions, Cellular processes, Sporulation and
germination].
Length = 161
Score = 29.0 bits (65), Expect = 2.4
Identities = 16/52 (30%), Positives = 32/52 (61%), Gaps = 4/52 (7%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQ 78
E+L+ + ESLQ++N L+ ELE + +L ++E+ + L ++I +A +
Sbjct: 106 NERLKNQNESLQKRNEELEKELEKLRQRLSTIEEDYQTL----IDIMDRARK 153
>gnl|CDD|163506 TIGR03794, NHLM_micro_HlyD, NHLM bacteriocin system secretion
protein. Members of this protein family are homologs of
the HlyD membrane fusion protein of type I secretion
systems. Their occurrence in prokaryotic genomes is
associated with the occurrence of a novel class of
microcin (small bacteriocins) with a leader peptide
region related to nitrile hydratase. We designate the
class of bacteriocin as Nitrile Hydratase Leader
Microcin, or NHLM. This family, therefore, is designated
as NHLM bacteriocin system secretion protein. Some but
not all NHLM-class putative microcins belong to the TOMM
(thiazole/oxazole modified microcin) class as assessed
by the presence of the scaffolding protein and/or
cyclodehydratase in the same gene clusters [Transport
and binding proteins, Amino acids, peptides and amines,
Cellular processes, Biosynthesis of natural products].
Length = 421
Score = 29.8 bits (67), Expect = 2.4
Identities = 26/167 (15%), Positives = 56/167 (33%), Gaps = 15/167 (8%)
Query: 87 LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQE------R 140
L +++Q ++ L E+ + + + K LE+TI + +E R
Sbjct: 94 LRERLQESYQKLTQLQEQLEEVRNYTGRLKEGRERHFQKSKEALEETIGRLREELAALSR 153
Query: 141 VVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKR 200
V K + + A + +Q R E+ + + + L + D+ E
Sbjct: 154 EVGKQRGLLSRGLATFKRDRILQQQWREEQEKYDAADKARAI---YALQTKADERNLETV 210
Query: 201 ILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLA 247
+ + + ++ + I R E+ L N+L
Sbjct: 211 LQSLSQADFQLAGVAEKEL------ETVEARIKEARYEIEELENKLN 251
Score = 29.8 bits (67), Expect = 2.7
Identities = 35/162 (21%), Positives = 70/162 (43%), Gaps = 11/162 (6%)
Query: 43 VLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLA 102
L+ L+ +L LQE+ + +R + ++ E+ + L + I L++E L+
Sbjct: 93 ELRERLQESYQKLTQLQEQLEEVRNYTGRLKEGRERHFQKSKEALEETIGRLREELAALS 152
Query: 103 HHYEQEEEFLTNDLSRKLNQLTEEKYRLEQT-IEQEQERVVNKLMRKIEKLEAETDAKQN 161
+E LSR L ++ +Q EQE+ +K R I L+ + D +
Sbjct: 153 ----REVGKQRGLLSRGLATFKRDRILQQQWREEQEKYDAADKA-RAIYALQTKADERNL 207
Query: 162 N--LEQLRREKVELENTLEQEQEAL---VNKLWKRMDKLEAE 198
L+ L + +L E+E E + + + +++LE +
Sbjct: 208 ETVLQSLSQADFQLAGVAEKELETVEARIKEARYEIEELENK 249
>gnl|CDD|190336 pfam02520, DUF148, Domain of unknown function DUF148. This domain
has no known function nor do any of the proteins that
possess it. In one member of this family the aligned
region is repeated twice.
Length = 114
Score = 28.4 bits (64), Expect = 2.5
Identities = 23/93 (24%), Positives = 38/93 (40%), Gaps = 3/93 (3%)
Query: 94 LKKEKETLAHHYEQEEEFL-TNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKL 152
LK E T+A Q E + ++ + + + L+ E+ V++ L KL
Sbjct: 11 LKNETLTIAEIDAQLEAWAEKYGVADQYEEFKKNVKALKAEAEKNVTAVISNLSSVDAKL 70
Query: 153 EAETDAKQNNLEQLRREKV-ELENTLEQEQEAL 184
A D K Q ++E + L+ L QE L
Sbjct: 71 SAILDNKDLTRRQ-KKEAIDALKKQLPQEVSTL 102
>gnl|CDD|153328 cd07644, I-BAR_IMD_BAIAP2L2, Inverse (I)-BAR, also known as the
IRSp53/MIM homology Domain (IMD), of Brain-specific
Angiogenesis Inhibitor 1-Associated Protein 2-Like 2.
The IMD domain, also called Inverse-Bin/Amphiphysin/Rvs
(I-BAR) domain, is a dimerization and lipid-binding
module that bends membranes and induces membrane
protrusions. This group is composed of uncharacterized
proteins known as BAIAP2L2 (Brain-specific Angiogenesis
Inhibitor 1-Associated Protein 2-Like 2). They contain
an N-terminal IMD, an SH3 domain, and a WASP homology 2
(WH2) actin-binding motif at the C-terminus. The related
proteins, BAIAP2L1 and IRSp53, function as regulators of
membrane dynamics and the actin cytoskeleton. The IMD
domain binds and bundles actin filaments, binds
membranes and produces membrane protrusions, and
interacts with the small GTPase Rac.
Length = 215
Score = 29.5 bits (66), Expect = 2.6
Identities = 21/80 (26%), Positives = 40/80 (50%), Gaps = 11/80 (13%)
Query: 243 RNQLANTQQEHTEKMERYVKDEKNVRE--ENLRLQRKLQREVERREALCRHLSESESSLE 300
R++ AN ++ +E + ++NVRE EN+ R+ + L ES+ + E
Sbjct: 122 RHRAANLEKCMSELWRMERQRDRNVREMKENVNRLRQSMQAF---------LKESQRAAE 172
Query: 301 MEEERHYNEIAHSMRLVGSS 320
+EE+R Y +A L+ ++
Sbjct: 173 LEEKRRYRFLAEKHYLLNNT 192
>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
Length = 895
Score = 29.9 bits (67), Expect = 2.6
Identities = 29/205 (14%), Positives = 74/205 (36%), Gaps = 39/205 (19%)
Query: 110 EFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRRE 169
++L L +L K ++ E+ + +++IE+L E + ++ L+
Sbjct: 186 DYLEEKLKSSNLELENIKKQIAD-----DEKSHSITLKEIERLSIEYNNAMDDYNNLKSA 240
Query: 170 KVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLS 229
+ LE K + ++ SD S + NN
Sbjct: 241 -------------------LNELSSLEDMKNRYESEIKTAESDLSMELEKNN-------- 273
Query: 230 THIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKLQREVERREALC 289
+ L M++ N + + +Y D +N ++ + + E+ + A+
Sbjct: 274 -YYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKKQ----ILSNIDAEINKYHAII 328
Query: 290 RHLSESES--SLEMEEERHYNEIAH 312
+ LS + + ++++ Y+++ +
Sbjct: 329 KKLSVLQKDYNDYIKKKSRYDDLNN 353
Score = 29.9 bits (67), Expect = 3.1
Identities = 51/324 (15%), Positives = 125/324 (38%), Gaps = 50/324 (15%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
E ++K+I ++ + + E+E ++ ++ NL+ + + + + + +
Sbjct: 200 ENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLEDMKNRY--ESE 257
Query: 88 MKKIQALKKEKETLAHHYEQEEEF---LTNDLSRKLNQLTEEKYRLEQTIEQEQERVVN- 143
+K ++ + ++Y++ EE + ND K + ++ + IE +++ + N
Sbjct: 258 IKTAESDLSMELEKNNYYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKKQILSNI 317
Query: 144 --------KLMRKIEKLEAETD---AKQNNLEQLRREKVELENTLEQEQEALVN----KL 188
+++K+ L+ + + K++ + L + +ELE L + K
Sbjct: 318 DAEINKYHAIIKKLSVLQKDYNDYIKKKSRYDDLNNQILELEGYEMDYNSYLKSIESLKK 377
Query: 189 WKRMDKLEAEK------RILQIKLDQPVSDPSSPHDMN------NGDTASNLSTHIHSLR 236
E+ IL+I+ P + +++N + +S L+ I +LR
Sbjct: 378 KIEEYSKNIERMSAFISEILKIQEIDPDAIKKELNEINVKLQDISSKVSS-LNQRIRALR 436
Query: 237 SEVMRLRNQLANTQQEH----------TEKMERYVKD---EKNVREENLRLQRKLQREVE 283
+ L + + EK + +K+ EE + R+++ EV+
Sbjct: 437 ENLDELSRNMEMLNGQSVCPVCGTTLGEEKSNHIINHYNEKKSRLEEKI---REIEIEVK 493
Query: 284 RREALCRHLSESESSLEMEEERHY 307
+ L + + LE EE
Sbjct: 494 DIDEKIVDLKKRKEYLESEEINKS 517
Score = 28.7 bits (64), Expect = 5.8
Identities = 31/158 (19%), Positives = 69/158 (43%), Gaps = 6/158 (3%)
Query: 114 NDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVEL 173
N++ ++LN L +E + + ++K +R+IE + K N +++ + +L
Sbjct: 590 NEIKKQLNDLESRLQEIEIGFP-DDKSYIDKSIREIENEANNLNNKYNEIQENKILIEKL 648
Query: 174 ENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIH 233
++ ++ + D E RI I+ + S D + A ST
Sbjct: 649 RGKIDNYKKQIAEIDSIIPDLKEITSRINDIEDNLKKS--RKALDDAKANRARLESTI-E 705
Query: 234 SLRSEVMRLRNQLANTQQ--EHTEKMERYVKDEKNVRE 269
LR+ + L +++ + + E +K+++ + D K +RE
Sbjct: 706 ILRTRINELSDRINDINETLESMKKIKKAIGDLKRLRE 743
>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
Length = 455
Score = 29.8 bits (67), Expect = 2.7
Identities = 22/84 (26%), Positives = 41/84 (48%), Gaps = 15/84 (17%)
Query: 129 RLEQTIEQ----------EQERVVNKLMRKIEKLEAETDAKQNNLEQ-LRREKVELEN-- 175
RLE+ +E+ + E + K + K+ K E E+ K+ LE+ L+ + + EN
Sbjct: 365 RLEKDVEEGEKTIVKEARQIEEELEKEVEKLGK-EEESLFKRVALEEGLKELEQDEENFL 423
Query: 176 -TLEQEQEALVNKLWKRMDKLEAE 198
L +E++ L+ KL ++E
Sbjct: 424 KELSKEEKELLEKLKMEASEVEKL 447
>gnl|CDD|148614 pfam07106, TBPIP, Tat binding protein 1(TBP-1)-interacting protein
(TBPIP). This family consists of several eukaryotic
TBP-1 interacting protein (TBPIP) sequences. TBP-1 has
been demonstrated to interact with the human
immunodeficiency virus type 1 (HIV-1) viral protein Tat,
then modulate the essential replication process of HIV.
In addition, TBP-1 has been shown to be a component of
the 26S proteasome, a basic multiprotein complex that
degrades ubiquitinated proteins in an ATP-dependent
fashion. Human TBPIP interacts with human TBP-1 then
modulates the inhibitory action of human TBP-1 on
HIV-Tat-mediated transactivation.
Length = 169
Score = 29.2 bits (66), Expect = 2.7
Identities = 24/89 (26%), Positives = 46/89 (51%), Gaps = 9/89 (10%)
Query: 22 PSPVTREQLQKRIESLQQQNRVLKVELETYKLQLRAL---------QEENKNLRQTSVNI 72
PS +L IE L+++ ++LK + T +++L++L QEE + L++ I
Sbjct: 69 PSDEELNKLDMEIEELREEVQLLKQDCSTLEIELKSLTSDLTTEELQEEIQELKKEVREI 128
Query: 73 QAKAEQEEEFISNTLMKKIQALKKEKETL 101
+ K E EE ++++ +KKE + L
Sbjct: 129 EEKLESLEEGWKPVTPEEMEKVKKEYKDL 157
>gnl|CDD|201540 pfam00992, Troponin, Troponin. Troponin (Tn) contains three
subunits, Ca2+ binding (TnC), inhibitory (TnI), and
tropomyosin binding (TnT). this Pfam contains members of
the TnT subunit. Troponin is a complex of three
proteins, Ca2+ binding (TnC), inhibitory (TnI), and
tropomyosin binding (TnT). The troponin complex
regulates Ca++ induced muscle contraction. This family
includes troponin T and troponin I. Troponin I binds to
actin and troponin T binds to tropomyosin.
Length = 131
Score = 28.7 bits (65), Expect = 2.9
Identities = 15/60 (25%), Positives = 28/60 (46%), Gaps = 4/60 (6%)
Query: 144 KLMRKIEKLEAETDAKQNNLEQLR--REKVE-LENTLE-QEQEALVNKLWKRMDKLEAEK 199
L+ E E + ++ E+ + E+ L +L E + L KL R+D+L+ E+
Sbjct: 6 SLLLLKAAEELEFEQRKKEEEKEKYLAERCPPLRLSLSRAELQELCKKLHARIDRLDEER 65
>gnl|CDD|236861 PRK11147, PRK11147, ABC transporter ATPase component; Reviewed.
Length = 635
Score = 29.9 bits (68), Expect = 3.0
Identities = 29/101 (28%), Positives = 41/101 (40%), Gaps = 11/101 (10%)
Query: 92 QALKKEKETLAHHYEQEEEFLTNDLS----RKLNQLTEEKYRLEQTIEQEQERVVNKLMR 147
Q K+KE A + + + LS R+L QL + LE IE Q +V +
Sbjct: 538 QPAVKKKEEAAAPKAETVKRSSKKLSYKLQRELEQLPQLLEDLEAEIEALQAQVADA--- 594
Query: 148 KIEKLEAETDAKQNNLEQLRREKVELENTLE--QEQEALVN 186
+ + Q L L + ELE E +E EAL N
Sbjct: 595 --DFFSQPHEQTQKVLADLADAEQELEVAFERWEELEALKN 633
>gnl|CDD|131526 TIGR02473, flagell_FliJ, flagellar export protein FliJ. Members of
this family are the FliJ protein found, in nearly every
case, in the midst of other flagellar biosynthesis genes
in bacgterial genomes. Typically the fliJ gene is found
adjacent to the gene for the flagellum-specific ATPase
FliI. Sequence scoring in the gray zone between trusted
and noise cutoffs include both probable FliJ proteins
and components of bacterial type III secretion systems.
Length = 141
Score = 28.4 bits (64), Expect = 3.0
Identities = 23/107 (21%), Positives = 48/107 (44%)
Query: 76 AEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIE 135
E+EEE L K ++ + L + EE+ L + + + Q
Sbjct: 11 REKEEEQAKLELAKAQAEFERLETQLQQLIKYREEYEQQALEKVGAGTSALELSNYQRFI 70
Query: 136 QEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQE 182
++ ++ + + +++ L+ E +AK+ L + RRE LE E++Q+
Sbjct: 71 RQLDQRIQQQQQELALLQQEVEAKRERLLEARRELKALEKLKEKKQK 117
>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
Length = 1021
Score = 29.7 bits (66), Expect = 3.1
Identities = 29/107 (27%), Positives = 53/107 (49%), Gaps = 20/107 (18%)
Query: 126 EKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALV 185
EK R+E+ +E+ER+ + M +IE+ E E+L RE++E + LE++
Sbjct: 456 EKKRIERLEREERERLERERMERIERERLER-------ERLERERLERDR-LERD----- 502
Query: 186 NKLWKRMDKLEAEK--RILQIKLDQPVSDPSSPHDMNNGDTASNLST 230
R+D+LE E+ R+ + +L++ + M NG +A
Sbjct: 503 -----RLDRLERERVDRLERDRLEKARRNSYFLKGMENGLSAGGGPG 544
>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
system, R (restriction) subunit and related helicases
[Defense mechanisms].
Length = 962
Score = 29.7 bits (67), Expect = 3.2
Identities = 39/207 (18%), Positives = 69/207 (33%), Gaps = 27/207 (13%)
Query: 98 KETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQ-------ERVVNKLMRKIE 150
K+ L + + EE L L + +L EK +EQ E L
Sbjct: 657 KKALKLYSNEGEEDLKEALEEAIEEL-VEKLEDVLNEFKEQSEDIGYLEDAGELLRDFAP 715
Query: 151 KLEAETDAKQNNLEQLRREKVELEN----------TLEQEQEALVNKLWKRMDKLEAEKR 200
+LE + + L +L ++ N TLE+E + L + L R++K K
Sbjct: 716 ELEIKVERGNKILAKLFGRFLKAFNALKKLYEFAYTLEEEIQYLEDLLKARLEKDYQSKY 775
Query: 201 ILQIKLDQPVSDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRN------QLANTQQEHT 254
+ I D D + + L + + Q N +
Sbjct: 776 LDLIDNLN---DAREIKDKESKAIDLDDIDFELELIGKQEINIDYILELLQTFNDKNGAY 832
Query: 255 EKMERYVKDEKNVREENLRLQRKLQRE 281
E ++ ++ E+LR ++KL
Sbjct: 833 ESLKELIERIIKEWIEDLRQKKKLIER 859
>gnl|CDD|227731 COG5444, COG5444, Uncharacterized conserved protein [Function
unknown].
Length = 565
Score = 29.7 bits (67), Expect = 3.3
Identities = 24/133 (18%), Positives = 45/133 (33%), Gaps = 19/133 (14%)
Query: 26 TREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKN----LRQTSVNIQAKAEQEEE 81
T E+L K + ++ +++ K +R L+E + ++ AKA
Sbjct: 158 TLEKLYKLDQEGMTLMAAVESKMQELKAIIRQLEEWTIKGGATKKGVPIHYVAKA----- 212
Query: 82 FISNTLMKKIQALKKEKET--LAHHYEQEEEFLTNDLSRKL--------NQLTEEKYRLE 131
F T+ KK + + ET +E DL + L L E
Sbjct: 213 FAEVTIHKKAAEVALQSETYLDIKTELAKERPDMRDLDKPLESANKTGYEGLAGEDIVPF 272
Query: 132 QTIEQEQERVVNK 144
E+ + ++
Sbjct: 273 FLAEETGQLAYSR 285
>gnl|CDD|219342 pfam07227, DUF1423, Protein of unknown function (DUF1423). This
family represents a conserved region approximately 500
residues long within a number of Arabidopsis thaliana
proteins of unknown function.
Length = 446
Score = 29.7 bits (67), Expect = 3.3
Identities = 25/98 (25%), Positives = 46/98 (46%), Gaps = 15/98 (15%)
Query: 56 RALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQA---------LKKEKETL----- 101
R L+E+ + + + + Q K Q +E S +K+ +A ++E E L
Sbjct: 329 RELEEKAREVSELKMERQRKKPQIDELESIVRLKQAEADMFQLKADEARREAERLQRIAL 388
Query: 102 AHHYEQEEEFLTNDLSRKLNQLTEEK-YRLEQTIEQEQ 138
A + EEE+ + L +L++ EE+ Y E+ QE+
Sbjct: 389 AKTEKSEEEYASKYLKLRLSEAEEERQYLFEELKLQEE 426
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 29.4 bits (66), Expect = 3.4
Identities = 41/159 (25%), Positives = 69/159 (43%), Gaps = 19/159 (11%)
Query: 98 KETLAHHYEQEEEFLTNDLSRKLNQLTEE----KYRLEQTIEQEQERVVNKLMRKIEKLE 153
K L + ++EE+F L R+ N EE Y L I+ + +KIE +
Sbjct: 76 KRVLKIYNKREEDF--PSL-REYNDYLEEVEDIVYNLTNNID------LENTKKKIETYQ 126
Query: 154 AET-DAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRM--DKLEAEKRILQIKLDQPV 210
E D Q N E+ RE+ ELE LE E+E + +R+ K E E+++ + K Q +
Sbjct: 127 KENKDVIQKNKEKSTREQEELEEALEFEKE---EEEQRRLLLQKEEEEQQMNKRKNKQAL 183
Query: 211 SDPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANT 249
D + + + + L +V + + + NT
Sbjct: 184 LDELETSTLPAAELIAQHKKNSVKLEMQVEKPKPEKPNT 222
>gnl|CDD|218038 pfam04350, PilO, Pilus assembly protein, PilO. PilO proteins are
involved in the assembly of pilin. However, the precise
function of this family of proteins is not known.
Length = 144
Score = 28.6 bits (65), Expect = 3.5
Identities = 13/52 (25%), Positives = 21/52 (40%), Gaps = 5/52 (9%)
Query: 147 RKIEKLEAETDAKQN---NLEQLRREKVELENTLEQEQEALVNKLWKRMDKL 195
K L+ E + K NL +++ ELE + L +K K + L
Sbjct: 6 AKEATLKQEFEEKARQAANLPAYKKQLEELEERFGALLKQLPSK--KEIPSL 55
>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
Length = 925
Score = 29.6 bits (67), Expect = 3.5
Identities = 16/131 (12%), Positives = 42/131 (32%), Gaps = 7/131 (5%)
Query: 59 QEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEF-----LT 113
+ +N ++ +E I + + + LK + + EE
Sbjct: 784 ILIITEILSDGINNSDINDRPQELIDQIIESEEERLKALRIQREEMLMRPEELELINEEQ 843
Query: 114 NDLSRKLNQLTEEKYRLEQTIEQEQE--RVVNKLMRKIEKLEAETDAKQNNLEQLRREKV 171
+L +++ E E+ I+ E + +KL AK ++ +
Sbjct: 844 KNLKQEIKLELSEIQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSDE 903
Query: 172 ELENTLEQEQE 182
++ +++
Sbjct: 904 IKQDEKTTKKK 914
>gnl|CDD|219780 pfam08286, Spc24, Spc24 subunit of Ndc80. Spc24 is a component of
the evolutionarily conserved kinetochore-associated
Ndc80 complex and is involved in chromosome segregation.
Length = 117
Score = 28.1 bits (63), Expect = 3.6
Identities = 18/48 (37%), Positives = 27/48 (56%), Gaps = 3/48 (6%)
Query: 136 QEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEA 183
QE E +L +++ LE+E + Q+ LE+L+ ELE EQE E
Sbjct: 2 QELENEKFRLAKELNDLESELNNLQSELEELKERLEELE---EQEVEG 46
>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria, archaea and eukaryotes.
This domain is about 200 amino acids in length. This
domain is found associated with pfam00013, pfam01966.
This domain has a single completely conserved residue A
that may be functionally important.
Length = 201
Score = 28.7 bits (65), Expect = 3.7
Identities = 36/124 (29%), Positives = 62/124 (50%), Gaps = 13/124 (10%)
Query: 74 AKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEF-LTNDLSRKLNQLTEEKYRLEQ 132
+ AE+E + I K+ +ALKKE A +EE L + R+L + E R E+
Sbjct: 30 SSAEEEAKQIIEEAKKEAEALKKEALLEA----KEEIHKLRAEAERELKERRNELQRQEK 85
Query: 133 TIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRM 192
+ Q++E L RK E L + K+ +LE+ +E + LE+++E L + ++
Sbjct: 86 RLLQKEET----LDRKDESL----EKKEESLEEKEKELAARQQQLEEKEEELEELIEEQQ 137
Query: 193 DKLE 196
+LE
Sbjct: 138 QELE 141
>gnl|CDD|237449 PRK13616, PRK13616, lipoprotein LpqB; Provisional.
Length = 591
Score = 29.6 bits (67), Expect = 3.8
Identities = 14/36 (38%), Positives = 16/36 (44%), Gaps = 4/36 (11%)
Query: 355 PVHSRPQRIKTPLLSVEQPTSPEGAPTPTLGSPMDI 390
P S PQ I T VE+P P PT G D+
Sbjct: 27 PSSSAPQAIGT----VERPAPSNEVPKPTPGMDPDL 58
>gnl|CDD|216603 pfam01610, DDE_Tnp_ISL3, Transposase. Transposase proteins are
necessary for efficient DNA transposition. Contains
transposases for IS204, IS1001, IS1096 and IS1165.
Length = 244
Score = 29.1 bits (66), Expect = 3.8
Identities = 23/93 (24%), Positives = 36/93 (38%), Gaps = 15/93 (16%)
Query: 17 ALMMPPSPVTREQLQKRIESLQQQNRVLKVELETY--KLQLRALQEENKNLRQTSVNIQA 74
L+ P +T +Q Q+ + L N LK Y K +LR + R
Sbjct: 120 LLLKNPENLTEKQKQRLEDLLLDLNPELK---RAYELKEELRDI------YRYKDK---E 167
Query: 75 KAEQE-EEFISNTLMKKIQALKKEKETLAHHYE 106
+A + +E+I I KK +TL H +
Sbjct: 168 EARRLLKEWIEKARRSGIPPFKKLAKTLKRHLD 200
>gnl|CDD|225519 COG2972, COG2972, Predicted signal transduction protein with a
C-terminal ATPase domain [Signal transduction
mechanisms].
Length = 456
Score = 29.3 bits (66), Expect = 3.8
Identities = 30/116 (25%), Positives = 51/116 (43%), Gaps = 16/116 (13%)
Query: 26 TREQLQKRIESLQQQNRVLKVELETYKLQLRALQEEN-----KNLRQTSVNIQAKAE-QE 79
T ++++R+E L ++N L E E + +LRALQ + N +T I+ AE +
Sbjct: 231 TINEMEERLEYLIEENYSL--EQEQLEAELRALQSQINPHFLYNTLET---IRMLAEEDD 285
Query: 80 EEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIE 135
E + + + L+ L + E E L L K L +K R+ +E
Sbjct: 286 PEEAAKVVKALSKLLRYSLSNLDNIVTLEIELL---LIEKY--LEIQKLRIGDRLE 336
>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407). This
family of proteins is found in bacteria. Proteins in
this family are typically between 366 and 597 amino
acids in length. There is a single completely conserved
residue R that may be functionally important.
Length = 297
Score = 29.1 bits (66), Expect = 3.9
Identities = 30/136 (22%), Positives = 52/136 (38%), Gaps = 9/136 (6%)
Query: 83 ISNTLMKKIQALKKEKETLAHHYEQEEEF---LTNDLSRKLNQLTEEKYRLEQTIEQEQE 139
IS L KI + + L E++ + K+ LT E L+ I++ Q
Sbjct: 97 ISEPLELKIFEKEIDTVLLEIQQEEQAAAQAQVAAGFRPKIAALTAEIAALQAEIDEAQA 156
Query: 140 RVVNKLMRKIEKLEAETDA---KQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLE 196
V + + EAE + +EK E + + E L +L + +LE
Sbjct: 157 EV--NAAYQEAQCEAEGTGGTGVAGK-GPVYKEKREKLDAAQARLETLKARLDAAIAQLE 213
Query: 197 AEKRILQIKLDQPVSD 212
A+K L+ V++
Sbjct: 214 AQKAALERNRQAAVAE 229
>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
Length = 281
Score = 29.2 bits (66), Expect = 3.9
Identities = 24/112 (21%), Positives = 44/112 (39%), Gaps = 4/112 (3%)
Query: 78 QEEEFISNTLMKKIQALKKEKETL---AHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTI 134
+ I+ +K + K + L +EEE + R+ E K +E+
Sbjct: 7 KRSNVINKEKLKTHEIQKYRFKVLSIKEKERLREEEEEQVEQLREEAN-DEAKEIIEEAE 65
Query: 135 EQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVN 186
E E V E+L +TD + +E+L+ + + E+E E L+
Sbjct: 66 EDAFEIVEAAEEEAKEELLKKTDEASSIIEKLQMQIEREQEEWEEELERLIE 117
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 29.3 bits (66), Expect = 3.9
Identities = 31/172 (18%), Positives = 79/172 (45%), Gaps = 6/172 (3%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNT 86
R++L ++ L+++ + L+ E + +++ L+E+ + ++ + + +E +
Sbjct: 43 RDELNAKVRELREKAQELREERDEINEEVQELKEKRDEINAKLQELRKEYRELKEKRNEF 102
Query: 87 LMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
+ E+E +Q+ LT + R+L Q +E + + ++ +
Sbjct: 103 NLGGRSIKSLEREIERLEKKQQTSVLTPEEERELVQKIKELRKELEDAKK-----ALEEN 157
Query: 147 RKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAE 198
K+++L+AE D + ++ + EL N ++ E + KL++ D+L E
Sbjct: 158 EKLKELKAEIDELKKKAREIHEKIQELANEAQEYHEEM-IKLFEEADELRKE 208
>gnl|CDD|220646 pfam10240, DUF2464, Protein of unknown function (DUF2464). This is
a family of proteins conserved from worms to humans.
Members have been annotated as FAM125A proteins, but
their function is unknown.
Length = 251
Score = 28.9 bits (65), Expect = 4.1
Identities = 16/52 (30%), Positives = 24/52 (46%), Gaps = 5/52 (9%)
Query: 344 CGQTTVMPIPPPVHSRPQRIKTPL--LSVEQPTSPEGAPTPTLGSPMDISKS 393
+ +P PPPV +P+ I LS++ + P AP L S + KS
Sbjct: 139 WCKKGSVPAPPPV-PKPRSISQGSQSLSLDSASRP--APGLPLLSSLSRGKS 187
>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D. BicD
proteins consist of three coiled-coiled domains and are
involved in dynein-mediated minus end-directed transport
from the Golgi apparatus to the endoplasmic reticulum
(ER). For full functioning they bind with GSK-3beta
pfam05350 to maintain the anchoring of microtubules to
the centromere. It appears that amino-acid residues
437-617 of BicD and the kinase activity of GSK-3 are
necessary for the formation of a complex between BicD
and GSK-3beta in intact cells.
Length = 711
Score = 29.4 bits (66), Expect = 4.2
Identities = 61/296 (20%), Positives = 105/296 (35%), Gaps = 51/296 (17%)
Query: 36 SLQQQNRVLK---VELETYKLQLRALQEENKNLRQT---SVNIQAKAEQEEEFISNTLMK 89
SLQ+Q VL+ VE E K ++R L+EE + L + ++ AE++ E TL
Sbjct: 108 SLQKQVSVLRQSQVEFEGLKHEIRRLEEETELLNSQLEEAARLKEIAEKQLEEALETLKT 167
Query: 90 KIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVN------ 143
+ + ++ L+HH E+ + ++LS L+ L + T N
Sbjct: 168 EREQKNALRKELSHHLNLEDFYSLSNLSISLDGLKFSEDPEASTEPNNDGEDENGHLNGG 227
Query: 144 KLMRKIEKLEAETDAK---------------------QNNLEQLRREKVELENTLE---- 178
+ K + + + LEQ+ REK L L+
Sbjct: 228 PGLAKSLGTPRKGELFPPAPGVSDLFSELNLSEIQKLKQQLEQVEREKASLLANLQELQT 287
Query: 179 ---------QEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDMNNGDTASNLS 229
EQ VN+L + ++ L + + + H+ +
Sbjct: 288 QLEQSEGALSEQHEKVNRLTEHLNALRNLQASKEQQDGLDSEKDRGSHEDGDYYEVDING 347
Query: 230 THI-----HSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKLQR 280
I SEV L+ +L + + E ERY ++ +R E L KL
Sbjct: 348 PEILECKYRVAVSEVGELKAELKALKARYNELQERYEDEKDRLRNEIQNLAEKLLS 403
>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein. Taxilin
contains an extraordinarily long coiled-coil domain in
its C-terminal half and is ubiquitously expressed. It is
a novel binding partner of several syntaxin family
members and is possibly involved in Ca2+-dependent
exocytosis in neuroendocrine cells. Gamma-taxilin,
described as leucine zipper protein Factor Inhibiting
ATF4-mediated Transcription (FIAT), localises to the
nucleus in osteoblasts and dimerises with ATF4 to form
inactive dimers, thus inhibiting ATF4-mediated
transcription.
Length = 309
Score = 28.9 bits (65), Expect = 4.3
Identities = 29/132 (21%), Positives = 61/132 (46%), Gaps = 7/132 (5%)
Query: 26 TREQLQKRIESLQQQNRVLKVELETY-KLQLRALQEENKNLRQTSVNIQAKAEQEEEFIS 84
R +L+ LQ+ N+ LK E + + +E + + T +IQ + E+ +
Sbjct: 72 ARSKLESLCRELQKHNKTLKEENKARSDEEEEKRKELTEKFQSTLKDIQDQMEEHSN-PN 130
Query: 85 NTLMKKIQALKKEKETLAHHYEQ-----EEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQE 139
+ L ++ + L+++ + L YE E+ T DL ++LN E+ + + EQE+
Sbjct: 131 HKLCEENEELREKLKELIEQYELREQHFEKLLKTKDLEQQLNDAKLEQAQEQAEQEQEKH 190
Query: 140 RVVNKLMRKIEK 151
+ +++ K
Sbjct: 191 KREKEILLKEAA 202
>gnl|CDD|216249 pfam01025, GrpE, GrpE.
Length = 165
Score = 28.3 bits (64), Expect = 4.5
Identities = 13/40 (32%), Positives = 24/40 (60%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQ 67
E+L+ E+L+++ L+ E+E K +L L E +N R+
Sbjct: 7 EELEDEEEALEEELEELEEEIEELKDRLLRLLAEFENYRK 46
>gnl|CDD|235551 PRK05667, dnaG, DNA primase; Validated.
Length = 580
Score = 29.0 bits (66), Expect = 4.6
Identities = 25/156 (16%), Positives = 53/156 (33%), Gaps = 8/156 (5%)
Query: 66 RQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAH--HYEQEEEFLTNDLSRKLNQL 123
+ + A+ EQ +K + E+E LA + + E + + L + +
Sbjct: 412 ERLGILDDAQLEQLVPKAQEPQLKAERPRTAERELLALLLQHPELAEEVRDALDEEDFEG 471
Query: 124 TEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQ----NNLEQLRREKVELENTLEQ 179
L + I + + +E L + +L E LE+
Sbjct: 472 LPLFRALLEAILAQPGLTTGSQL--LEHLRDAGLEELAALLESLAVWEEISEEDIAALEK 529
Query: 180 EQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSS 215
E + + KL ++ + E+ I + +L + S
Sbjct: 530 ELKDALEKLRDQLLEERLEELIAKERLLEGHGLSSE 565
>gnl|CDD|183173 PRK11519, PRK11519, tyrosine kinase; Provisional.
Length = 719
Score = 29.4 bits (66), Expect = 4.7
Identities = 25/99 (25%), Positives = 46/99 (46%), Gaps = 10/99 (10%)
Query: 85 NTLMKKIQALKKEKET--LAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIE---QEQE 139
N L + + K+T L+ Y E+ D+ LN +T LEQ IE +E
Sbjct: 212 NNLQNNLTVTENGKDTGVLSLTYTGEDREQIRDI---LNSITRN--YLEQNIERKSEEAS 266
Query: 140 RVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLE 178
+ + L +++ ++ + D +N L R++K ++ LE
Sbjct: 267 KSLAFLAQQLPEVRSRLDVAENKLNAFRQDKDSVDLPLE 305
>gnl|CDD|240334 PTZ00258, PTZ00258, GTP-binding protein; Provisional.
Length = 390
Score = 28.8 bits (65), Expect = 4.9
Identities = 19/73 (26%), Positives = 32/73 (43%), Gaps = 5/73 (6%)
Query: 132 QTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKR 191
+ I E ++ K + +EK E K+ ++ + EKVEL+ L++ E L R
Sbjct: 145 EIISSE---LILKDLEFVEKRLDELTKKRKKKKKKKEEKVELD-VLKKVLEWLEEGKPVR 200
Query: 192 MDKL-EAEKRILQ 203
+ E IL
Sbjct: 201 DGDWTDKEIEILN 213
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 29.3 bits (65), Expect = 5.0
Identities = 12/49 (24%), Positives = 19/49 (38%), Gaps = 2/49 (4%)
Query: 337 KEPRCFACGQTTVMPIPPPVHSRPQRIKTP--LLSVEQPTSPEGAPTPT 383
+EP+ ++ P P P+ + P E P SP+ P P
Sbjct: 592 EEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQ 640
>gnl|CDD|233067 TIGR00641, acid_CoA_mut_N, methylmalonyl-CoA mutase N-terminal
domain. Methylmalonyl-CoA mutase (EC 5.4.99.2)
catalyzes a reversible isomerization between
L-methylmalonyl-CoA and succinyl-CoA. The enzyme uses an
adenosylcobalamin cofactor. It may be a homodimer, as in
mitochondrion, or a heterodimer with partially
homologous beta chain that does not bind the
adenosylcobalamin cofactor, as in Propionibacterium
freudenreichii. The most similar archaeal sequences are
separate chains, such as AF2215 abd AF2219 of
Archaeoglobus fulgidus, that correspond roughly to the
first 500 and last 130 residues, respectively of known
methylmalonyl-CoA mutases. This model describes the
N-terminal domain subfamily. In a neighbor-joining tree,
AF2215 branches with a bacterial isobutyryl-CoA mutase,
which is also the same length. Scoring between the noise
and trusted cutoffs are the non-catalytic, partially
homologous beta chains from certain heterodimeric
examples of 5.4.99.2.
Length = 524
Score = 29.0 bits (65), Expect = 5.1
Identities = 15/49 (30%), Positives = 25/49 (51%), Gaps = 7/49 (14%)
Query: 127 KYRLEQTIEQE-----QERVVNKLMRKIEKLEAETDAK--QNNLEQLRR 168
KY+LE+ E E V + + K++KL AE D + + L+ L +
Sbjct: 433 KYQLEEEDEVEVLKVDNSSVREEQIAKLKKLRAERDQEKVEAALDALTK 481
>gnl|CDD|236729 PRK10636, PRK10636, putative ABC transporter ATP-binding protein;
Provisional.
Length = 638
Score = 29.0 bits (65), Expect = 5.3
Identities = 33/107 (30%), Positives = 49/107 (45%), Gaps = 20/107 (18%)
Query: 47 ELETYKLQLRALQ----EENKNLRQTSVN-IQAKAEQ---EEEFISNT--LMKKIQALKK 96
+LE Y+ L +Q + ++ ++ + N QA+ +Q E E + T L K+I L+K
Sbjct: 511 DLEDYQQWLSDVQKQENQTDEAPKENNANSAQARKDQKRREAELRTQTQPLRKEIARLEK 570
Query: 97 EKETLAHHYEQEEEFLTN----DLSRK------LNQLTEEKYRLEQT 133
E E L Q EE L + D SRK L Q K LE+
Sbjct: 571 EMEKLNAQLAQAEEKLGDSELYDQSRKAELTACLQQQASAKSGLEEC 617
>gnl|CDD|184896 PRK14903, PRK14903, 16S rRNA methyltransferase B; Provisional.
Length = 431
Score = 28.7 bits (64), Expect = 5.4
Identities = 22/95 (23%), Positives = 42/95 (44%), Gaps = 10/95 (10%)
Query: 110 EFLTNDLSRKLNQLTEE-----KYRLEQTIEQEQERVVNKLMRKIEKLEAETDAK----- 159
+ L D+SR+ QL E+ K + + ER+ + +++ +
Sbjct: 264 KILAVDISREKIQLVEKHAKRLKLSSIEIKIADAERLTEYVQDTFDRILVDAPCTSLGTA 323
Query: 160 QNNLEQLRREKVELENTLEQEQEALVNKLWKRMDK 194
+N+ E LRR E L + Q +V++ WK ++K
Sbjct: 324 RNHPEVLRRVNKEDFKKLSEIQLRIVSQAWKLLEK 358
>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
division and chromosome partitioning].
Length = 570
Score = 28.9 bits (65), Expect = 5.5
Identities = 33/176 (18%), Positives = 72/176 (40%), Gaps = 21/176 (11%)
Query: 29 QLQKRIESLQQQNRVLKVELETYKL-----QLRALQEENKNLRQTSVNIQAKAEQEEEFI 83
+ R+E L++Q L +L +L +QE+ ++L + + +A+ E
Sbjct: 252 NIDSRLERLKEQLVENSELLTQLELDEAEEELGLIQEKIESLYDL-LEREVEAKNVVEEN 310
Query: 84 SNTLMKKIQALKKEKETLAHHYE--QEEEFLTNDLSRKLNQLTEEKYRLEQTIEQ----- 136
L ++ K+ E L E +E L + + +E LE +++
Sbjct: 311 LPILPDYLEKAKENNEHLKEEIERVKESYRLAETELGSVRKFEKELKELESVLDEILENI 370
Query: 137 --------EQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEAL 184
E + + ++ + + +E E + Q +L LR++++E LE+ + L
Sbjct: 371 EAQEVAYSELQDNLEEIEKALTDIEDEQEKVQEHLTSLRKDELEARENLERLKSKL 426
>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 1104
Score = 28.8 bits (64), Expect = 5.5
Identities = 33/195 (16%), Positives = 65/195 (33%), Gaps = 22/195 (11%)
Query: 25 VTREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFIS 84
+ +E+ + + + K ++E+ K++ +E I
Sbjct: 295 IKKEEQETLVRQFTVEQTQAKSKVESAKIETDRAREMETLAHDNVKQIVGAQHGI----- 349
Query: 85 NTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNK 144
L K + ++ T++ + + L E + V++
Sbjct: 350 --LSAKREGAVDKRRTISTARAGLDALVK-----GLGGAAPESAEELLELNNAARLTVDE 402
Query: 145 LMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQI 204
E LE A Q N+E R V+ +QE ++ L K +E R+LQ+
Sbjct: 403 YPAAREALE---SAGQRNVEDRTR-AVDEFKAADQE----LSSLSKGSSNIEY--RLLQV 452
Query: 205 KLDQPVSDPSSPHDM 219
+ + SP DM
Sbjct: 453 RENLCQDLGVSPRDM 467
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 28.8 bits (64), Expect = 5.6
Identities = 22/128 (17%), Positives = 57/128 (44%), Gaps = 3/128 (2%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTL 87
++ Q + QQ+ + + + ++R Q+E KNL + + K E++ ++
Sbjct: 223 DKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPK---EDKQVAENQ 279
Query: 88 MKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMR 147
++I+ + E + + ++ DL ++ +E E ++++E V L +
Sbjct: 280 KREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAEDLQK 339
Query: 148 KIEKLEAE 155
++EA+
Sbjct: 340 TKPQVEAQ 347
>gnl|CDD|150420 pfam09744, Jnk-SapK_ap_N, JNK_SAPK-associated protein-1. This is
the N-terminal 200 residues of a set of proteins
conserved from yeasts to humans. Most of the proteins in
this entry have an RhoGEF pfam00621 domain at their
C-terminal end.
Length = 158
Score = 28.2 bits (63), Expect = 5.7
Identities = 39/152 (25%), Positives = 62/152 (40%), Gaps = 20/152 (13%)
Query: 128 YRLEQTIEQEQERVVN------------KLMRKIEKLEAETDAKQN---NLEQLRREKVE 172
Y L +I QE ER+++ ++R +E L++ Q LE LR +
Sbjct: 2 YDLANSIYQEFERLIDNYGEEVVKNLMPLVIRVLENLDSLASENQELEVELELLREDNER 61
Query: 173 LENTLEQEQE---ALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSS--PHDMNNGDTASN 227
L E+E+E KL + D+LE EK+ LQ K++ + N D S
Sbjct: 62 LSTQYEREKELRKQAEQKLLEFEDELEQEKKELQKKIEDLEENVRQLELKAKNLSDQVSR 121
Query: 228 LSTHIHSLRSEVMRLRNQLANTQQEHTEKMER 259
L L+ E L + + + E +ER
Sbjct: 122 LEERETELKKEYNSLHERYTKLLKNYVEYVER 153
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of
the C-terminal region.
Length = 546
Score = 28.8 bits (64), Expect = 5.7
Identities = 50/197 (25%), Positives = 87/197 (44%), Gaps = 37/197 (18%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEE---NKNLRQTSVNIQAKAEQEEEFI 83
R LQ E L + R L+ LE+ + + L EE +LR ++ ++ E +
Sbjct: 278 RIDLQLENEQLHEDLRTLQERLESSQQKAGLLGEELASLGSLRDHTIAELHQSRLESAQM 337
Query: 84 SNTLMKKIQALK-------KEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQ 136
S L + ALK +E+ETL E +++ ++ +L+ E +LE+ +++
Sbjct: 338 SLQLSQLNLALKEGQSQWAQERETLRQSAEADKD--------RIQKLSAELLKLEEWLQE 389
Query: 137 EQ-ERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTL-----EQEQEAL------ 184
E+ +R ++ +L E D + L + RRE EL + L E+EQ
Sbjct: 390 ERSQREKLEV-----ELGIEKDCNRVQLSENRRELSELRSALRVLQKEKEQLQEEKQELL 444
Query: 185 --VNKLWKRMDKLEAEK 199
+ L R+DK EK
Sbjct: 445 DYIRVLELRLDKEADEK 461
>gnl|CDD|218891 pfam06090, Ins_P5_2-kin, Inositol-pentakisphosphate 2-kinase. This
is a family of inositol-pentakisphosphate 2-kinases (EC
2.7.1.158) (also known as inositol
1,3,4,5,6-pentakisphosphate 2-kinase, Ins(1,3,4,5,6)P5
2-kinase) and InsP5 2-kinase). This enzyme
phosphorylates Ins(1,3,4,5,6)P5 to form
Ins(1,2,3,4,5,6)P6 (also known as InsP6 or phytate).
InsP6 is involved in many processes such as mRNA export,
nonhomologous end-joining, endocytosis and ion channel
regulation.
Length = 301
Score = 28.5 bits (64), Expect = 5.9
Identities = 10/30 (33%), Positives = 19/30 (63%)
Query: 234 SLRSEVMRLRNQLANTQQEHTEKMERYVKD 263
LR +++RLR N+Q + T+++ Y+K
Sbjct: 19 YLRGKLLRLRKLRPNSQNDSTQEIYDYIKR 48
>gnl|CDD|151277 pfam10828, DUF2570, Protein of unknown function (DUF2570). This is
a family of proteins with unknown function.
Length = 110
Score = 27.5 bits (61), Expect = 5.9
Identities = 17/50 (34%), Positives = 29/50 (58%), Gaps = 1/50 (2%)
Query: 148 KIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEA 197
KI+ L AE A+ +EQ ++ L + LEQE++A V + K +++E
Sbjct: 26 KIDSLRAENKAQAQTIEQQQKANQRLTDALEQERQA-VEEQQKIANEIER 74
>gnl|CDD|240227 PTZ00009, PTZ00009, heat shock 70 kDa protein; Provisional.
Length = 653
Score = 29.0 bits (65), Expect = 6.0
Identities = 24/80 (30%), Positives = 38/80 (47%), Gaps = 17/80 (21%)
Query: 123 LTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQE 182
+T +K RL + + +R+VN+ EK +AE +A RE+VE +N LE
Sbjct: 505 ITNDKGRLSK---ADIDRMVNE----AEKYKAEDEA--------NRERVEAKNGLENYCY 549
Query: 183 ALVNKLW--KRMDKLEAEKR 200
++ N L K KL +
Sbjct: 550 SMKNTLQDEKVKGKLSDSDK 569
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 28.9 bits (65), Expect = 6.2
Identities = 53/238 (22%), Positives = 93/238 (39%), Gaps = 45/238 (18%)
Query: 77 EQEEEFISNTLMKKIQALKK--------EKETLAHHYEQEEEFLT-------NDLSR-KL 120
E+++ +SN + +I+AL K EK T + E ++E+L N R ++
Sbjct: 410 EEKKNNLSNEFLNRIEALDKESSSLNILEKRTRLCNDETKKEYLPKIYDPFLNGPYRGRI 469
Query: 121 NQLTEEKYRLEQTIEQEQERV-VNK----LMRKIEKLEAETDAKQNNLEQLRREKVELEN 175
+L E +I+ E V +NK L+ + E E N + L E E
Sbjct: 470 KKLFSPSIINETSIKNNIEGVWINKIHGILLNNTDYQEFEQKIDTFNKKSLSTEINESLT 529
Query: 176 TLEQEQEALVNKLWKRMDKL-----------EAEKRILQIKLDQPVSDPSSPHDMNNGDT 224
+ + + L + L E E +I + D ++DP N T
Sbjct: 530 LINKFGAEPKSSLNLKGLSLFSEPEQEKINSEEEIKIFKFLFDAVITDP-------NDQT 582
Query: 225 ASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEKNVREENLRLQRKLQREV 282
S I + +V R +L + E++E ++E+NV ++ RK +R V
Sbjct: 583 IIKKSIGIKEISKKVPRWSYKLID----ELEQLEG--ENEENVPSDHQIRSRKAKRVV 634
>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La. This protein is induced
by heat shock and other stresses in E. coli, B.
subtilis, and other species. The yeast member,
designated PIM1, is located in the mitochondrial matrix,
required for mitochondrial function, and also induced by
heat shock [Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 775
Score = 28.8 bits (65), Expect = 6.2
Identities = 31/155 (20%), Positives = 67/155 (43%), Gaps = 20/155 (12%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENK--NLRQTSVNIQAKAEQEEEF--- 82
++++ L +++ + + L + E + + S+ ++ K E +E
Sbjct: 127 REIKETFRELISLSKLFREQPALLS-ALEDIDEPGRLADFVAASLQLKEKDELQEVLETV 185
Query: 83 -ISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQT-------- 133
I L K ++ LKKE E L + + +T + K+ + E Y EQ
Sbjct: 186 NIEKRLKKALELLKKELELL-----KLQNKITKKVEEKMEKTQREYYLREQLKAIKKELG 240
Query: 134 IEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRR 168
IE++ + + KL K+E+L+ + K+ ++L +
Sbjct: 241 IEKDDKDELEKLKEKLEELKLPEEVKKVIEKELTK 275
>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein. Hsp70 chaperones help to fold
many proteins. Hsp70 assisted folding involves repeated
cycles of substrate binding and release. Hsp70 activity
is ATP dependent. Hsp70 proteins are made up of two
regions: the amino terminus is the ATPase domain and the
carboxyl terminus is the substrate binding region.
Length = 598
Score = 28.7 bits (65), Expect = 6.3
Identities = 15/53 (28%), Positives = 28/53 (52%), Gaps = 5/53 (9%)
Query: 149 IEKL--EAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEK 199
IE++ +AE A + ++ R+E++E +N E+ +L L + DKL
Sbjct: 506 IERMVKDAEEYAAE---DKKRKERIEAKNEAEEYVYSLEKSLKEEGDKLPEAD 555
>gnl|CDD|220716 pfam10359, Fmp27_WPPW, RNA pol II promoter Fmp27 protein domain.
Fmp27_WPPW is a conserved domain of a family of proteins
involved in RNA polymerase II transcription initiation.
It contains characteristic HQR and WPPW sequence motifs.
and is towards the C-terminal in members which contain
Fmp27_SW pfam10305.
Length = 470
Score = 28.8 bits (65), Expect = 6.3
Identities = 20/93 (21%), Positives = 43/93 (46%), Gaps = 1/93 (1%)
Query: 138 QERVVNKLMRK-IEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLE 196
RV +L+ + +++LE + + L+ L E + +E E ++ L +R++ LE
Sbjct: 158 PRRVQRELLEERLKQLEEQIKKLEEKLDDLELNDTEELQSDLEELEEELSVLKERLEFLE 217
Query: 197 AEKRILQIKLDQPVSDPSSPHDMNNGDTASNLS 229
L+ + S+ + + D+ S+LS
Sbjct: 218 KLLEDLERSEESSDRSSSTDTESSIADSLSSLS 250
>gnl|CDD|185657 PTZ00479, PTZ00479, RAP Superfamily; Provisional.
Length = 435
Score = 28.6 bits (64), Expect = 6.4
Identities = 24/96 (25%), Positives = 38/96 (39%), Gaps = 13/96 (13%)
Query: 48 LETYKLQLRALQEENKNLRQTSVN--IQAKAEQEEEFISNTLMKKIQALKK--EKETLAH 103
E+ KLQ R L N+R+ +Q + E K+Q ++K E E L
Sbjct: 349 TESVKLQHRILSNLGWNIRRVVWIDWVQLGDDTEA---------KVQYVRKLRESEPLGD 399
Query: 104 HYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQE 139
++ KL ++ + K + EQ Q QE
Sbjct: 400 TLTDGPSLSPQEIKEKLKKIKQYKIKKEQQKVQSQE 435
>gnl|CDD|130728 TIGR01667, YCCS_YHJK, integral membrane protein, YccS/YhfK family.
This model represents two clades of putative
transmembrane proteins including the E. coli YccS and
YhfK proteins. The YccS hypothetical equivalog
(TIGR01666) is found in beta and gamma proteobacteria,
while the smaller YhfK group is only found in E. coli,
Salmonella and Yersinia. TMHMM on the 19 hits to this
model shows a consensus of 11 transmembrane helices
separated into two clusters, an N-terminal cluster of 6
and a central cluster of 5. This would indicate two
non-membrane domains one on each side of the membrane.
Length = 701
Score = 28.7 bits (64), Expect = 6.4
Identities = 35/173 (20%), Positives = 64/173 (36%), Gaps = 18/173 (10%)
Query: 39 QQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQA---KAEQEEEFISNTLMKKIQALK 95
++L LE + LR + + + + + A + +S TL +Q
Sbjct: 531 LLRKMLHDALEANQRYLRLILSQYPQGKPDDLAYRIARRNAHNTDAALSTTLSNMMQ--- 587
Query: 96 KEKETLAHHYEQEEEFLTND--LSRKLNQLTEEKYRL---EQTIEQEQ--ERVVNKLMRK 148
E +H+ E LT L ++ L + RL E E Q E V + R
Sbjct: 588 -EPAFNSHYLEDGFRLLTLSHTLLSYISALGAHRERLLNPELAAELLQACEIVAKAIQRC 646
Query: 149 IEKLEAETDAKQNNLEQLRREKVELE----NTLEQEQEALVNKLWKRMDKLEA 197
+LE + +A + + ++++ TL +Q L+ L M L +
Sbjct: 647 QARLEYDEEALAGDANIMEAQEMQPHEDAAGTLVYQQLQLIIGLLPTMHTLSS 699
>gnl|CDD|220410 pfam09798, LCD1, DNA damage checkpoint protein. This is a family
of proteins which regulate checkpoint kinases. In
Schizosaccharomyces pombe this protein is called Rad26
and in Saccharomyces cerevisiae it is called LCD1.
Length = 648
Score = 28.6 bits (64), Expect = 6.9
Identities = 21/119 (17%), Positives = 55/119 (46%), Gaps = 7/119 (5%)
Query: 152 LEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVS 211
L + D Q ++ R ++ N L+++ + + KL + + LE E++ L ++ +
Sbjct: 2 LRDKLDMLQQQKQEERNKQKSRVNELKEKHDQELQKLKQELQSLEDERKFLVLEQRGLSA 61
Query: 212 -----DPSSPHDMNNGDTASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERYVKDEK 265
+ S P ++ AS+++ S + + + + +++ ++ R +KD+K
Sbjct: 62 NDLRTELSPPSNLLKTSDASHIA--DESQPNSIKQKKREISPVKELVPLNPNRIIKDDK 118
>gnl|CDD|206687 cd01900, YchF, YchF GTPase. YchF is a member of the Obg family,
which includes four other subfamilies of GTPases: Obg,
DRG, Ygr210, and NOG1. Obg is an essential gene that is
involved in DNA replication in C. crescentus and
Streptomyces griseus and is associated with the
ribosome. Several members of the family, including YchF,
possess the TGS domain related to the RNA-binding
proteins. Experimental data and genomic analysis suggest
that YchF may be part of a nucleoprotein complex and may
function as a GTP-dependent translational factor.
Length = 274
Score = 28.2 bits (64), Expect = 7.1
Identities = 14/62 (22%), Positives = 31/62 (50%), Gaps = 10/62 (16%)
Query: 28 EQLQKRIESLQQQNR----VLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFI 83
E ++KR+E L+++ + K ELE + +++ EE K R + ++E + +
Sbjct: 134 ETIEKRLERLEKKAKSGDKEAKEELELLE-KIKEHLEEGKPARTLELT-----DEEIKIL 187
Query: 84 SN 85
+
Sbjct: 188 KS 189
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil
domain of unknwon function.
Length = 126
Score = 27.2 bits (61), Expect = 7.1
Identities = 27/107 (25%), Positives = 53/107 (49%), Gaps = 3/107 (2%)
Query: 30 LQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMK 89
L+KR E + Q L + E ++ + L++ + L + +Q + ++F+ K
Sbjct: 3 LEKRREMEEVQ-LALDAKREEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAK 61
Query: 90 KIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQ 136
+ +A KK +E E+EEE +L +L +L E +LE+ +E+
Sbjct: 62 RRRAEKKAEEEKKLRKEKEEEI--KELKAELEELKAEIEKLEEKLEE 106
>gnl|CDD|132552 TIGR03513, GldL_gliding, gliding motility-associated protein GldL.
This protein family, GldL, is named for the member from
Flavobacterium johnsoniae, which is required for a type
of rapid gliding motility found in certain members of
the Bacteriodetes. However, members are found also in
several members of the Bacteriodetes that appear not to
be motile [Cellular processes, Chemotaxis and motility].
Length = 202
Score = 27.9 bits (62), Expect = 7.1
Identities = 14/84 (16%), Positives = 29/84 (34%), Gaps = 10/84 (11%)
Query: 83 ISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVV 142
+ TL + ++K+ Y ++ L ++ LN + Y +
Sbjct: 118 AAKTLAPMTDSYAQQKK-----YIEQMSSLAANME-GLNTI----YEAQLKGASSHADAN 167
Query: 143 NKLMRKIEKLEAETDAKQNNLEQL 166
N++ L+ E + NL L
Sbjct: 168 NEIAINSSSLKEEMEKMAANLTSL 191
>gnl|CDD|205039 pfam12718, Tropomyosin_1, Tropomyosin like. This family is a set
of eukaryotic tropomyosins. Within the yeast Tmp1 and
Tmp2, biochemical and sequence analyses indicate that
Tpm2p spans four actin monomers along a filament,
whereas Tpmlp spans five. Despite its shorter length,
Tpm2p can compete with Tpm1p for binding to F-actin.
Over-expression of Tpm2p in vivo alters the axial
budding of haploids to a bipolar pattern, and this can
be partially suppressed by co-over-expression of Tpm1p.
This suggests distinct functions for the two
tropomyosins, and indicates that the ratio between them
is important for correct morphogenesis. The family also
contains higher eukaryote Tmp3 members.
Length = 143
Score = 27.6 bits (62), Expect = 7.1
Identities = 30/98 (30%), Positives = 46/98 (46%), Gaps = 5/98 (5%)
Query: 90 KIQALKKEKETLAHHYEQEEE---FLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLM 146
K+ ALK E E E+ EE L + K ++T + + +Q +E+E E++ +L
Sbjct: 1 KMNALKLEAENAQERAEELEEKLKELEQENLEKEQEITSLQKKNQQ-LEEEVEKLEEQLK 59
Query: 147 RKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEAL 184
EKLE N E L R LE LE+ ++ L
Sbjct: 60 EAKEKLEESEKLATNA-EALTRRIQLLEEELEESEKRL 96
>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding. This family
is found in mammals where it is localised at cell-cell
adherens junctions, and in Sch. pombe and other fungi
where it anchors spindle-pole bodies to spindle
microtubules. It is a coiled-coil structure, and in
pombe, it is required for anchoring the minus end of
spindle microtubules to the centrosome equivalent, the
spindle-pole body. The name ADIP derives from the family
being composed of Afadin- and alpha -Actinin-Binding
Proteins Localised at Cell-Cell Adherens Junctions.
Length = 149
Score = 27.6 bits (62), Expect = 7.2
Identities = 15/41 (36%), Positives = 23/41 (56%)
Query: 28 EQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQT 68
E+LQ IE L+ Q L+ EL + + R L+++ K L Q
Sbjct: 67 ERLQNTIERLKTQLEDLERELALLQAKERQLEKKLKTLEQK 107
Score = 27.3 bits (61), Expect = 8.7
Identities = 23/80 (28%), Positives = 38/80 (47%), Gaps = 9/80 (11%)
Query: 105 YEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLE 164
+++ EF + L L +L E RL+ TIE+ + ++ E LE E Q
Sbjct: 45 RDRDLEFRES-LEETLRKLEAEIERLQNTIERLKTQL--------EDLERELALLQAKER 95
Query: 165 QLRREKVELENTLEQEQEAL 184
QL ++ LE L+ E+E +
Sbjct: 96 QLEKKLKTLEQKLKNEKEEV 115
>gnl|CDD|204784 pfam11932, DUF3450, Protein of unknown function (DUF3450). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria and eukaryotes.
Proteins in this family are about 260 amino acids in
length.
Length = 250
Score = 28.0 bits (63), Expect = 7.4
Identities = 19/80 (23%), Positives = 34/80 (42%), Gaps = 10/80 (12%)
Query: 26 TREQLQKRIE-------SLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQ 78
Q QK+I+ L + R L E+E ++ LQ N +Q ++Q + EQ
Sbjct: 37 AAAQSQKKIDKWADQTQELLAEIRQLLKEIENLRVYNDQLQRLVANQQQEIASLQQQIEQ 96
Query: 79 EEEF---ISNTLMKKIQALK 95
E+ + + + I L+
Sbjct: 97 IEKTRQGLVPLMYRMIDGLE 116
>gnl|CDD|235339 PRK05084, xerS, site-specific tyrosine recombinase XerS; Reviewed.
Length = 357
Score = 28.3 bits (64), Expect = 7.4
Identities = 29/119 (24%), Positives = 49/119 (41%), Gaps = 29/119 (24%)
Query: 45 KVELETYK--LQLRALQEEN---KNLRQTSVN--------------IQAKAEQEEEFISN 85
K ++E + L+ R L + K QT++N +A+ E E +
Sbjct: 76 KKDVEAFILYLRERPLLNGHSTKKGNSQTTINRTLSALKSLFKYLTEEAENEDGEPYFYR 135
Query: 86 TLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNK 144
+MKKI+ KK+KETLA ++ L +E Y I+ E E+ ++
Sbjct: 136 NVMKKIEL-KKKKETLAARAHNLKQKLFLG---------DEDYEFLDFIDNEYEQKLSN 184
>gnl|CDD|239815 cd04320, AspRS_cyto_N, AspRS_cyto_N: N-terminal, anticodon
recognition domain of the type found in Saccharomyces
cerevisiae and human cytoplasmic aspartyl-tRNA
synthetase (AspRS). This domain is a beta-barrel domain
(OB fold) involved in binding the tRNA anticodon
stem-loop. The enzymes in this group are homodimeric
class2b aminoacyl-tRNA synthetases (aaRSs). aaRSs
catalyze the specific attachment of amino acids (AAs)
to their cognate tRNAs during protein biosynthesis.
This 2-step reaction involves i) the activation of the
AA by ATP in the presence of magnesium ions, followed
by ii) the transfer of the activated AA to the
terminal ribose of tRNA. In the case of the class2b
aaRSs, the activated AA is attached to the 3'OH of the
terminal ribose. Eukaryotes contain 2 sets of aaRSs,
both of which are encoded by the nuclear genome. One
set concerns with cytoplasmic protein synthesis,
whereas the other exclusively with mitochondrial
protein synthesis.
Length = 102
Score = 26.8 bits (60), Expect = 7.6
Identities = 11/33 (33%), Positives = 15/33 (45%)
Query: 65 LRQTSVNIQAKAEQEEEFISNTLMKKIQALKKE 97
LRQ IQ E +S ++K +L KE
Sbjct: 23 LRQQGYTIQGVLAASAEGVSKQMVKWAGSLSKE 55
>gnl|CDD|149105 pfam07851, TMPIT, TMPIT-like protein. A number of members of
this family are annotated as being transmembrane
proteins induced by tumour necrosis factor alpha, but
no literature was found to support this.
Length = 330
Score = 28.2 bits (63), Expect = 7.8
Identities = 16/50 (32%), Positives = 31/50 (62%), Gaps = 4/50 (8%)
Query: 28 EQLQKRIESLQQQNRVLKVELE-TYKLQLR---ALQEENKNLRQTSVNIQ 73
++L+K + LQ+ +R+ K +LE KLQ + A+ + K L+Q V+++
Sbjct: 7 KELEKEFQQLQETHRLYKQKLEEVEKLQEQCTSAIARQRKRLKQLIVSLK 56
>gnl|CDD|221489 pfam12252, SidE, Dot/Icm substrate protein. This family of proteins
is found in bacteria. Proteins in this family are
typically between 397 and 1543 amino acids in length.
This family is the SidE protein in the Dot/Icm pathway of
Legionella pneumophila bacteria. There is little
literature describing the family.
Length = 1443
Score = 28.7 bits (64), Expect = 8.0
Identities = 51/253 (20%), Positives = 105/253 (41%), Gaps = 51/253 (20%)
Query: 76 AEQEEEFISNTLMKKIQALKKEKETL-----------AHHYEQEEEFLTNDLSRKLNQLT 124
+E+E IS L ++IQ++++E E+L + + L + S+++ +L
Sbjct: 1061 SEKEMGTISRELREQIQSVRQELESLQRAVVTPVVTDEEKVRERYDALIENTSKRITEL- 1119
Query: 125 EEKYRLEQ------------TIEQEQERVVNKLMR-----------KIEKLEAETDAKQN 161
EK +L ++QE + N+ +R IEKLE +
Sbjct: 1120 -EKAKLANLDVVKKAISNLNNLQQEVTLLRNEKIRMHTGTDKVDFSDIEKLEKQLQVIDT 1178
Query: 162 NL-----EQLRREKVELENTLEQEQEALVNKLWKRMDKLEAEKRILQIKLDQPVSDPSSP 216
L ++ ++ LE + Q L +K+ K D + + ++ + S P
Sbjct: 1179 KLADAYLLEVTKQISALEKEKPKNQSELKSKIAKFFDTTADIEVLRNERIKKHGSS-KDP 1237
Query: 217 HDMNNGDTAS-NLSTHIHSLRSEVMR-LRNQLANTQQEHTEKMERYVKDEKNVREENLRL 274
D+++ D S NL SL S ++ +R+ L + + E E+ + ++N L
Sbjct: 1238 LDLSDLDKLSGNLQGVNQSLVSILITTIRSSLNQMKPKTFEMQEKEI-------QQNFEL 1290
Query: 275 QRKLQREVERREA 287
KL++ +++ +
Sbjct: 1291 LAKLEKTLDKSDT 1303
>gnl|CDD|224381 COG1464, NlpA, ABC-type metal ion transport system, periplasmic
component/surface antigen [Inorganic ion transport and
metabolism].
Length = 268
Score = 28.0 bits (63), Expect = 8.1
Identities = 19/84 (22%), Positives = 35/84 (41%), Gaps = 2/84 (2%)
Query: 187 KLWKRMDKLEAEKRILQIKLDQPVSDPSSPHDM-NNGDTASNLSTHIHSLRSEVMRLRNQ 245
++ + + K +K+ L +K+ +D P++ +GD +N H L +
Sbjct: 42 EILEVVVKPALKKKGLDLKI-VEFTDYVQPNEALADGDIDANAFQHKPYLDQFNKEHGGK 100
Query: 246 LANTQQEHTEKMERYVKDEKNVRE 269
L H E M Y K K++ E
Sbjct: 101 LVAVGNTHIEPMGLYSKKYKSLAE 124
>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein. This model
represents a relatively rare but broadly distributed
uncharacterized protein family, distributed in 1-2
percent of bacterial genomes, all of which have outer
membranes. In many of these genomes, it is part of a
two-gene pair.
Length = 555
Score = 28.5 bits (64), Expect = 8.1
Identities = 17/112 (15%), Positives = 44/112 (39%), Gaps = 23/112 (20%)
Query: 22 PSPVTREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEE 81
P+ E+ +KR+E++++++ ++ + +E + +A ++ +
Sbjct: 188 PNKQDLEEYKKRLEAIKKKD------IKNPLELQKIKEE----FDKLKKEGKADKQKIKS 237
Query: 82 FISNTLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQT 133
L+ +K+ L + ++ NDL R E KY ++
Sbjct: 238 A--------KNDLQNDKKQLKADLAELKKAPQNDLKR-----LENKYAIKSG 276
>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
protein 2. THO and TREX form a eukaryotic complex which
functions in messenger ribonucleoprotein metabolism and
plays a role in preventing the transcription-associated
genetic instability. Tho2, along with four other
subunits forms THO.
Length = 296
Score = 28.1 bits (63), Expect = 8.1
Identities = 17/73 (23%), Positives = 36/73 (49%), Gaps = 5/73 (6%)
Query: 46 VELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHY 105
V E Y ++ L+++ K L +S I K ++++ L I+ L++E + H
Sbjct: 24 VPKELYDKEIERLEKQIKELDSSSSGIDKKKKEKKR-----LKSLIKKLEEELKKHIEHN 78
Query: 106 EQEEEFLTNDLSR 118
E+ ++ L+ + S
Sbjct: 79 EKTKKRLSEEKSS 91
>gnl|CDD|193553 cd05678, M20_dipept_like_1, Uncharacterized M20 Dipeptidases.
Peptidase M20 family, unknown dipeptidase-like subfamily
1 (inferred by homology to be dipeptidases). M20
dipeptidases include a large variety of bacterial
enzymes including cytosolic nonspecific dipeptidase
(CNDP), Xaa-methyl-His dipeptidase (anserinase),and
canosinase. These dipeptidases have been shown to act on
a wide range of dipeptides, but not larger peptides. For
example, anserinase mainly catalyzes the hydrolysis of
N-alpha-acetylhistidine while carnosinase degrades
beta-alanyl-L-histidine.
Length = 466
Score = 28.4 bits (64), Expect = 8.3
Identities = 16/48 (33%), Positives = 22/48 (45%), Gaps = 6/48 (12%)
Query: 55 LRALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLA 102
L AL+ +VNI+ + EEE S L Q +K+ KE L
Sbjct: 135 LDALKAAGIPP---AVNIKVILDSEEEKGSPHLP---QVVKEYKELLK 176
>gnl|CDD|205692 pfam13514, AAA_27, AAA domain. This domain is found in a number of
double-strand DNA break proteins. This domain contains a
P-loop motif.
Length = 1118
Score = 28.2 bits (63), Expect = 8.6
Identities = 28/123 (22%), Positives = 48/123 (39%), Gaps = 20/123 (16%)
Query: 106 EQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVN--KLMRKIEKLEAETDAKQNNL 163
E +E + ++N +E LE I + Q + L++ +++ E E + L
Sbjct: 138 EADELWKPRGRKPEINVALKELKELEAEIREVQLKTRTWKDLVKALDEAEEELANLRKEL 197
Query: 164 EQLRREKVELE------------NTLEQEQEALVN------KLWKRMDKLEAEKRILQIK 205
QL +EK LE LEQ+ AL +R ++ AE R +
Sbjct: 198 RQLEKEKQRLERLRRLLPLLAERKALEQQLAALGEVIDLPPDAVERYEEARAELRAARRN 257
Query: 206 LDQ 208
L+
Sbjct: 258 LEL 260
>gnl|CDD|237551 PRK13902, alaS, alanyl-tRNA synthetase; Provisional.
Length = 900
Score = 28.3 bits (64), Expect = 8.8
Identities = 18/78 (23%), Positives = 30/78 (38%)
Query: 25 VTREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFIS 84
V EQL K +E ++ + K E+E + +L L + V +E +
Sbjct: 754 VPPEQLPKTVERFFEEWKEQKKEIEKLRKELAELLASELLSKAEEVGGVKVVVEEVDADE 813
Query: 85 NTLMKKIQALKKEKETLA 102
+ L K L KE +
Sbjct: 814 DELRKIANELAKEGKVAL 831
>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
Length = 425
Score = 28.1 bits (64), Expect = 8.9
Identities = 20/81 (24%), Positives = 35/81 (43%), Gaps = 20/81 (24%)
Query: 86 TLMKKIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQERVVNKL 145
L +++ L+ E+ N LS+++ Q + E I + V +L
Sbjct: 39 ELQTELEELQAER---------------NALSKEIGQAKRKGEDAEALIAE-----VKEL 78
Query: 146 MRKIEKLEAETDAKQNNLEQL 166
+I+ LEAE D + LE+L
Sbjct: 79 KEEIKALEAELDELEAELEEL 99
>gnl|CDD|236583 PRK09601, PRK09601, GTP-binding protein YchF; Reviewed.
Length = 364
Score = 28.1 bits (64), Expect = 9.0
Identities = 16/62 (25%), Positives = 32/62 (51%), Gaps = 10/62 (16%)
Query: 28 EQLQKRIESLQQQ----NRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEFI 83
E ++KR+E L+++ ++ K ELE + +L EE K R + ++EE+ +
Sbjct: 138 ETVEKRLERLEKKAKGGDKEAKAELELLE-KLLEHLEEGKPARTLELT-----DEEEKLL 191
Query: 84 SN 85
+
Sbjct: 192 KS 193
>gnl|CDD|224705 COG1792, MreC, Cell shape-determining protein [Cell envelope
biogenesis, outer membrane].
Length = 284
Score = 28.1 bits (63), Expect = 9.1
Identities = 12/37 (32%), Positives = 21/37 (56%), Gaps = 3/37 (8%)
Query: 35 ESLQQQNRVLKVELETYK---LQLRALQEENKNLRQT 68
+ L +N LK EL + ++ +L+EENK L++
Sbjct: 69 KDLALENEELKKELAELEQLLEEVESLEEENKRLKEL 105
>gnl|CDD|222160 pfam13476, AAA_23, AAA domain.
Length = 204
Score = 27.5 bits (61), Expect = 9.2
Identities = 24/114 (21%), Positives = 50/114 (43%), Gaps = 7/114 (6%)
Query: 56 RALQEENKNLRQTSVNIQAKAEQEEEFISNTLMKKIQALKKEKETLAHHYEQEEEFLTND 115
R L ++ + S+ + ++ +EFI L + L L + ++ E L
Sbjct: 96 RELTKKKGKKVKKSILEIVEIDELQEFIDELLKSDKEIL----PLLLYLGQERLEELKFK 151
Query: 116 LSRKLNQLTEEKYRLEQTIEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRRE 169
K +L E+ E ++++ ++ KL+ + EK + E + + LE+L E
Sbjct: 152 RKEKKERL--EELEKELEELEDEKDLLEKLLEEKEKKK-ELEELKEELEELLEE 202
>gnl|CDD|220404 pfam09789, DUF2353, Uncharacterized coiled-coil protein (DUF2353).
Members of this family of uncharacterized proteins have
no known function.
Length = 319
Score = 27.9 bits (62), Expect = 9.2
Identities = 40/179 (22%), Positives = 80/179 (44%), Gaps = 18/179 (10%)
Query: 27 REQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAE----QEEEF 82
R+ L + +E +++N+ L +++ + + LQ + K LRQT + + +E
Sbjct: 67 RQDLSQLLEKYREENQQLSTDVQELRQRKAELQGDIKLLRQTIAQQRVEFMGGGVRESFE 126
Query: 83 ISNTLMKKIQALKKEKETLAH-----HYEQEEEFLTND-LSRKLNQLTEEKYRLEQTIEQ 136
N L+ +++ L+++ + L E+EE + D RK ++L E L +
Sbjct: 127 ERNDLVSQLEKLREKCKQLESDLRSLLDEKEEVLMERDEYQRKAHRLNNE---LSYLLGG 183
Query: 137 EQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVELENTLEQEQEALVNKLWKRMDKL 195
+ RVV+ ++ L E + + L E ++ TL + ++ L K K M K
Sbjct: 184 DPRRVVD-----VDALVMENRYLKEKINTLEEESELIKMTLAKYKQMLEAKNKKTMVKA 237
>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA. During
the bacterial cell cycle, the tubulin-like cell-division
protein FtsZ polymerises into a ring structure that
establishes the location of the nascent division site.
EzrA modulates the frequency and position of FtsZ ring
formation.
Length = 559
Score = 28.3 bits (64), Expect = 9.4
Identities = 42/220 (19%), Positives = 82/220 (37%), Gaps = 63/220 (28%)
Query: 114 NDLSRKLNQLTEEKYRLEQT-IEQEQERVVNKLMRKIEKLEAETDAKQNNLEQLRREKVE 172
+L ++TEE Y + IE+E ++ L+ + D LE+L ++ E
Sbjct: 229 EELKAGYREMTEEGYHFDHLDIEKE-----------LQDLKEQIDQNLALLEELDLDEAE 277
Query: 173 LEN------------TLEQEQEA------LVNKLWKRMDKLEAEKRILQIKLDQPVSDPS 214
EN LE+E +A ++KL ++ + + L ++LD+
Sbjct: 278 EENEEIEERIDTLYDILEKEVKAKKFVEKNIDKLTDFLEHAREQNKQLLLELDR----LQ 333
Query: 215 SPHDMNNGD--TASNLSTHIHSLRSEVMRLRNQLANTQQEHTEKMERY------------ 260
+ +N + T L + L ++ +L ++A + ++E E
Sbjct: 334 QSYTLNEDELETVRELEKQLEELEAQYDQLVERIAEKKVAYSELQEELEEIEKQLEEIEK 393
Query: 261 ------------VKDEKNVREENLRLQRKLQ---REVERR 285
KDE RE+ ++KL R +E+
Sbjct: 394 EQEELSESLQGLRKDELEAREKLQEYRQKLHEIKRYLEKS 433
>gnl|CDD|233255 TIGR01061, parC_Gpos, DNA topoisomerase IV, A subunit,
Gram-positive. Operationally, topoisomerase IV is a
type II topoisomerase required for the decatenation of
chromosome segregation. Not every bacterium has both a
topo II and a topo IV. The topo IV families of the
Gram-positive bacteria and the Gram-negative bacteria
appear not to represent a single clade among the type II
topoisomerases, and are represented by separate models
for this reason [DNA metabolism, DNA replication,
recombination, and repair].
Length = 738
Score = 28.2 bits (63), Expect = 9.8
Identities = 26/143 (18%), Positives = 62/143 (43%), Gaps = 6/143 (4%)
Query: 23 SPVTREQLQKRIESLQQQNRVLKVELETYKLQLRALQEENKNLRQTSVNIQAKAEQEEEF 82
S E+ KR+E ++ + + + E KL +R+ ++++ N + Q E
Sbjct: 358 SKYELEKASKRLEIVEGLIKAISIIDEIIKL-IRSSEDKSDAKENLIDNFKFTENQAEAI 416
Query: 83 ISNTLMK----KIQALKKEKETLAHHYEQEEEFLTNDLSRKLNQLTEEKYRLEQTIEQEQ 138
+S L + I LK+E+ L E+ + ++ +R L ++ ++ Q++
Sbjct: 417 VSLRLYRLTNTDIFELKEEQNELEKKIISLEQIIASEKARN-KLLKKQLEEYKKQFAQQR 475
Query: 139 ERVVNKLMRKIEKLEAETDAKQN 161
+ + +I+ E+E ++
Sbjct: 476 RSQIEDFINQIKINESELIENED 498
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.308 0.123 0.326
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 19,350,691
Number of extensions: 1893153
Number of successful extensions: 6266
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4692
Number of HSP's successfully gapped: 1197
Length of query: 393
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 294
Effective length of database: 6,546,556
Effective search space: 1924687464
Effective search space used: 1924687464
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 60 (27.0 bits)