RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy3451
(223 letters)
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 44.0 bits (105), Expect = 3e-05
Identities = 59/172 (34%), Positives = 83/172 (48%), Gaps = 17/172 (9%)
Query: 49 AKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKTRALAKEETRELAKEK 108
AK + +E ++ E AK+E + KE E AKEE +L + E + +E+
Sbjct: 31 AKIKEAEEEAKRILEEAKKEAEAIKKEALLE-AKEEIHKL---------RNEFEKELRER 80
Query: 109 TRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSELAKEETSELAKEKPREL 168
EL K E R L KEE K L K E EL +E K++ E +E+ EL
Sbjct: 81 RNELQKLEKRLLQKEEN---LDRKLELLEKRE-EELEKKEKELEQKQQELEKKEEELEEL 136
Query: 169 AKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTRELAKEEIRELAKEE 220
+E+ +EL E+ L EE +E+ EK E A+ + L K EI E AKEE
Sbjct: 137 IEEQLQEL--ERISGLTAEEAKEILLEKVEEEARHEAAVLIK-EIEEEAKEE 185
Score = 42.8 bits (102), Expect = 7e-05
Identities = 47/135 (34%), Positives = 73/135 (54%), Gaps = 8/135 (5%)
Query: 41 AKEETSELAKEETSELAKEKTSELAKEETSELAKEKT----SELAKEETSELAKEKTRAL 96
AKEE +L + E + +E+ +EL K E L KE+ EL ++ EL K++
Sbjct: 62 AKEEIHKL-RNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELE 120
Query: 97 AKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSELAKEE 156
K++ E +E+ EL +E+++EL E S LT E+ +E+ E+ E A E + L KE
Sbjct: 121 QKQQELEKKEEELEELIEEQLQEL--ERISGLTAEEAKEILLEKVEEEARHEAAVLIKEI 178
Query: 157 TSELAKEKPRELAKE 171
E AKE+ + AKE
Sbjct: 179 EEE-AKEEADKKAKE 192
Score = 40.5 bits (96), Expect = 4e-04
Identities = 51/149 (34%), Positives = 77/149 (51%), Gaps = 13/149 (8%)
Query: 75 EKTSELAKEETSELAKEKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTR 134
E + A+EE + +E AK+E + KE E AKEEI +L + E + +E+
Sbjct: 30 EAKIKEAEEEAKRILEE-----AKKEAEAIKKEALLE-AKEEIHKL-RNEFEKELRERRN 82
Query: 135 ELAKEETRELANEETSELAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAK 194
EL K E R L EE L ++ EL +++ EL K+E K++ E +EE EL +
Sbjct: 83 ELQKLEKRLLQKEEN--LDRK--LELLEKREEELEKKEKELEQKQQELEKKEEELEELIE 138
Query: 195 EKTRELAK--EKTRELAKEEIRELAKEET 221
E+ +EL + T E AKE + E +EE
Sbjct: 139 EQLQELERISGLTAEEAKEILLEKVEEEA 167
Score = 28.6 bits (65), Expect = 3.2
Identities = 33/125 (26%), Positives = 57/125 (45%), Gaps = 15/125 (12%)
Query: 35 ERGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKTR 94
+R + ++ EL K+E K++ E +EE EL +E+ EL E+
Sbjct: 99 DRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQEL----------ERIS 148
Query: 95 ALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSELAK 154
L EE +E+ EK E A+ E L KE +E+ +E A ++ +E+ + A
Sbjct: 149 GLTAEEAKEILLEKVEEEARHEAAVLIKE-----IEEEAKEEADKKAKEILAQAIQRCAA 203
Query: 155 EETSE 159
+ +E
Sbjct: 204 DHVAE 208
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 43.4 bits (103), Expect = 4e-05
Identities = 48/221 (21%), Positives = 84/221 (38%), Gaps = 35/221 (15%)
Query: 32 QPEERGDKSAKEETSELAKE--ETSELAKEKTSELAKEETSELAKEKTSELAKEETSELA 89
E+ ++ + E +L + SE + S L K +E+ +++ + EL
Sbjct: 32 HIEDLKEELSNERLRKLRSLLTKLSEALDKLRSYLPKLNP---LREEKKKVSVKSLEELI 88
Query: 90 KEKTRALAK--EETRELAKEKTRELAKEEIRELAKEE---------TSELTKEKTRELAK 138
K+ L K +E +EL +E + + EI+EL +E +L+ +
Sbjct: 89 KDVEEELEKIEKEIKELEEEISE--LENEIKELEQEIERLEPWGNFDLDLSLLLGFKYVS 146
Query: 139 ------EETRELANEETSELAKEETSELAKEK-------PRELAKEETRELAKEKTRELA 185
E + + S++ E K +EL+ E EL K L
Sbjct: 147 VFVGTVPEDKLEELKLESDVENVEYISTDKGYVYVVVVVLKELSDEVEEELKKLGFERLE 206
Query: 186 KEE----TRELAKEKTRELAKEKTRELAKEEIRELAKEETN 222
EE + + + K EK RE EE++ELAK+
Sbjct: 207 LEEEGTPSELIREIKEELEEIEKERESLLEELKELAKKYLE 247
Score = 39.1 bits (92), Expect = 0.001
Identities = 47/192 (24%), Positives = 80/192 (41%), Gaps = 25/192 (13%)
Query: 29 ETLQPEERGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSEL 88
+ P+ + K++ S + EE + +E+ ++ KE EL +E SEL + E EL
Sbjct: 63 RSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKE-IKELEEEI-SEL-ENEIKEL 119
Query: 89 AKEKTRA---------LAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKE 139
+E R L+ + + ++++ EL + S++ + K
Sbjct: 120 EQEIERLEPWGNFDLDLSLLLGFKYVSVFVGTVPEDKLEEL--KLESDVENVEYISTDKG 177
Query: 140 ET-------RELANEETSELAKEETSELA---KEKPRELAKE-ETRELAKEKTRELAKEE 188
+EL++E EL K L + P EL +E + EK RE EE
Sbjct: 178 YVYVVVVVLKELSDEVEEELKKLGFERLELEEEGTPSELIREIKEELEEIEKERESLLEE 237
Query: 189 TRELAKEKTREL 200
+ELAK+ EL
Sbjct: 238 LKELAKKYLEEL 249
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 42.6 bits (101), Expect = 8e-05
Identities = 61/165 (36%), Positives = 82/165 (49%), Gaps = 17/165 (10%)
Query: 47 ELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKTRALAKEETRELAK 106
E AK+E L KE E AKEE +L E EL KE +EL + + R L +EET L +
Sbjct: 39 EEAKKEAETLKKEALLE-AKEEVHKLRAELEREL-KERRNELQRLERRLLQREET--LDR 94
Query: 107 EKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSELAKEETSELAKEKPR 166
+ KEE L K+E KEK + +EE EL E+ EL E+
Sbjct: 95 KMESLDKKEE--NLEKKEKELSNKEKNLDEKEEELEELIAEQREEL----------ERIS 142
Query: 167 ELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTRELAKE 211
L +EE +E+ E+ E A+ E +L KE E AKE+ + AKE
Sbjct: 143 GLTQEEAKEILLEEVEEEARHEAAKLIKEIEEE-AKEEADKKAKE 186
Score = 38.8 bits (91), Expect = 0.001
Identities = 48/135 (35%), Positives = 71/135 (52%), Gaps = 8/135 (5%)
Query: 41 AKEETSELAKEETSELAKEKTSELAKEETSELAKEKT----SELAKEETSELAKEKTRAL 96
AKEE +L E EL KE+ +EL + E L +E+T E ++ L K++
Sbjct: 56 AKEEVHKLRAELEREL-KERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELS 114
Query: 97 AKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSELAKEE 156
KE+ + +E+ EL E+ EL E S LT+E+ +E+ EE E A E ++L KE
Sbjct: 115 NKEKNLDEKEEELEELIAEQREEL--ERISGLTQEEAKEILLEEVEEEARHEAAKLIKEI 172
Query: 157 TSELAKEKPRELAKE 171
E AKE+ + AKE
Sbjct: 173 EEE-AKEEADKKAKE 186
Score = 38.8 bits (91), Expect = 0.002
Identities = 49/135 (36%), Positives = 65/135 (48%), Gaps = 16/135 (11%)
Query: 97 AKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEET------- 149
AK+E L KE E AKEE+ +L E EL KE+ EL + E R L EET
Sbjct: 41 AKKEAETLKKEALLE-AKEEVHKLRAELEREL-KERRNELQRLERRLLQREETLDRKMES 98
Query: 150 -----SELAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEK 204
L K+E KEK + +EE EL E+ EL E L +E+ +E+ E+
Sbjct: 99 LDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREEL--ERISGLTQEEAKEILLEE 156
Query: 205 TRELAKEEIRELAKE 219
E A+ E +L KE
Sbjct: 157 VEEEARHEAAKLIKE 171
Score = 37.6 bits (88), Expect = 0.003
Identities = 45/124 (36%), Positives = 60/124 (48%), Gaps = 9/124 (7%)
Query: 102 RELAKEKTRELAKEE---IRELAKEETSELTKEKTRELAKEETRELANEETSELAKEETS 158
R+ EK A+E I E AK+E L KE E AKEE +L E EL KE +
Sbjct: 19 RKRIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLE-AKEEVHKLRAELEREL-KERRN 76
Query: 159 ELAKEKPRELAKEET----RELAKEKTRELAKEETRELAKEKTRELAKEKTRELAKEEIR 214
EL + + R L +EET E +K L K+E KEK + +E+ EL E+
Sbjct: 77 ELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQRE 136
Query: 215 ELAK 218
EL +
Sbjct: 137 ELER 140
Score = 30.7 bits (70), Expect = 0.68
Identities = 32/103 (31%), Positives = 53/103 (51%), Gaps = 7/103 (6%)
Query: 27 RPETLQPEERGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETS 86
+ + L +E+ +EE EL E+ EL E+ S L +EE E+ E+ E A+ E +
Sbjct: 109 KEKELSNKEKNLDEKEEELEELIAEQREEL--ERISGLTQEEAKEILLEEVEEEARHEAA 166
Query: 87 ELAKEKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELT 129
+L KE +EE +E A +K +E+ I+ A + +E T
Sbjct: 167 KLIKE-----IEEEAKEEADKKAKEILATAIQRYAGDHVAETT 204
Score = 29.1 bits (66), Expect = 1.9
Identities = 27/55 (49%), Positives = 32/55 (58%), Gaps = 2/55 (3%)
Query: 167 ELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTRELAKEEIRELAKEET 221
E AK+E L KE E AKEE +L E REL KE+ EL + E R L +EET
Sbjct: 39 EEAKKEAETLKKEALLE-AKEEVHKLRAELEREL-KERRNELQRLERRLLQREET 91
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 40.0 bits (93), Expect = 6e-04
Identities = 54/190 (28%), Positives = 82/190 (43%), Gaps = 3/190 (1%)
Query: 34 EERGDKSAKEETSELAKEETSE-LAKEKTSELAKEETSELAKEKTSELAKEETSELAKEK 92
E+ + K+E + AKEE + L + +E++K E KEK + T E AK +
Sbjct: 3 PEKDLEDLKKEAARRAKEEARKRLVAKHGAEISKLEEENREKEKALPKNDDMTIEEAKRR 62
Query: 93 TRALAKEETRELAKEKTR--ELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETS 150
A AK + LAK+K E EE + AK + + K K LAK++
Sbjct: 63 AAAAAKAKAAALAKQKREGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTEEVTEE 122
Query: 151 ELAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTRELAK 210
E A + A K + A + + E+ E +E +E AK K AK K LAK
Sbjct: 123 EKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAK 182
Query: 211 EEIRELAKEE 220
++ E +
Sbjct: 183 QKAAEAGEGT 192
Score = 38.8 bits (90), Expect = 0.001
Identities = 51/180 (28%), Positives = 78/180 (43%), Gaps = 2/180 (1%)
Query: 42 KEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKTRALAKEET 101
K E +E EE AK K + AK + + LAK+K + E A K +A A +
Sbjct: 78 KREGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKA 137
Query: 102 RELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSELAKEETSELA 161
+ A K + EE+ E +E E K K AK + LA ++ +E + E
Sbjct: 138 KAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGT--EEV 195
Query: 162 KEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTRELAKEEIRELAKEET 221
E+ + AK + AK K LAK++ + + E AK K AK + A+ +T
Sbjct: 196 TEEEKAKAKAKAAAAAKAKAAALAKQKASQGNGDSGDEDAKAKAIAAAKAKAAAAARAKT 255
Score = 38.0 bits (88), Expect = 0.003
Identities = 55/193 (28%), Positives = 81/193 (41%), Gaps = 8/193 (4%)
Query: 28 PETLQPEERGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSE 87
E + +E+ + T E AK + AK K + LAK+ K + +E EE
Sbjct: 38 EEENREKEKALPKNDDMTIEEAKRRAAAAAKAKAAALAKQ------KREGTEEVTEEEKA 91
Query: 88 LAKEKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANE 147
AK K A AK + LAK+K + E A + K + A + + E
Sbjct: 92 KAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTE 151
Query: 148 ETSELAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTRE 207
E +E +E E AK K AK + LAK+K E + E E+ + AK K
Sbjct: 152 EVTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGT--EEVTEEEKAKAKAKAAA 209
Query: 208 LAKEEIRELAKEE 220
AK + LAK++
Sbjct: 210 AAKAKAAALAKQK 222
Score = 33.4 bits (76), Expect = 0.082
Identities = 50/183 (27%), Positives = 80/183 (43%), Gaps = 8/183 (4%)
Query: 36 RGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKTRA 95
R +AK + + LAK+ K + +E EE AK K + AK + + LAK+K
Sbjct: 62 RAAAAAKAKAAALAKQ------KREGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREG 115
Query: 96 LAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSELAKE 155
+ E A K + A + + A + E+ E +E +E A + + AK
Sbjct: 116 TEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKA 175
Query: 156 ETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTRELAKEEIRE 215
+ + LAK+K E + E E+ + AK + AK K LAK+K + + E
Sbjct: 176 KAAALAKQKAAEAGEGT--EEVTEEEKAKAKAKAAAAAKAKAAALAKQKASQGNGDSGDE 233
Query: 216 LAK 218
AK
Sbjct: 234 DAK 236
Score = 32.3 bits (73), Expect = 0.20
Identities = 49/178 (27%), Positives = 76/178 (42%), Gaps = 6/178 (3%)
Query: 41 AKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKTRALAKEE 100
AK + + AK + + LAK+K + E A K A + A K + EE
Sbjct: 93 AKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEE 152
Query: 101 TRELAKEKTRELAKEEIRELAKEETSELTKEKTR------ELAKEETRELANEETSELAK 154
E +E +E AK + AK + + L K+K E EE + A + + AK
Sbjct: 153 VTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVTEEEKAKAKAKAAAAAK 212
Query: 155 EETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTRELAKEE 212
+ + LAK+K + + E AK K AK + A+ KT+ +K E +EE
Sbjct: 213 AKAAALAKQKASQGNGDSGDEDAKAKAIAAAKAKAAAAARAKTKGAEGKKEEEPKQEE 270
Score = 26.9 bits (59), Expect = 10.0
Identities = 36/130 (27%), Positives = 59/130 (45%), Gaps = 2/130 (1%)
Query: 36 RGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKTRA 95
+ + EE +E +E E AK K + AK + + LAK+K +E + E E+ +A
Sbjct: 144 KQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGT--EEVTEEEKA 201
Query: 96 LAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSELAKE 155
AK + AK K LAK++ + + E K K AK + A +T +
Sbjct: 202 KAKAKAAAAAKAKAAALAKQKASQGNGDSGDEDAKAKAIAAAKAKAAAAARAKTKGAEGK 261
Query: 156 ETSELAKEKP 165
+ E +E+P
Sbjct: 262 KEEEPKQEEP 271
>gnl|CDD|227657 COG5354, COG5354, Uncharacterized protein, contains Trp-Asp (WD)
repeat [General function prediction only].
Length = 561
Score = 36.8 bits (85), Expect = 0.007
Identities = 35/162 (21%), Positives = 61/162 (37%), Gaps = 17/162 (10%)
Query: 10 PNGRNICMSSASPNSCGRPETLQPEERGDKSAKEETSELAKEETSELA-----KEKTSEL 64
P+G+ + SS+ P E G K K + +E E S+L
Sbjct: 409 PSGQYVTTSSSCPK--------HKVEHGYKIFKIAGALYPEEARGGFKNFAWRPEPPSKL 460
Query: 65 AKEETSELAKEK--TSELAKEETSELAKEKTRALAKEETRELAKEKTRELAKEEIRELAK 122
E + K ++ A R+ A + +EL ++ +++IR L K
Sbjct: 461 TIESAKKYVKPSRHRFVPFEKAVIMEADSANRSSAPRK-KELVEQWPEYSDEDKIRSLLK 519
Query: 123 EETS-ELTKEKTRELAKEETRELANEETSELAKEETSELAKE 163
+ + E KE+ R + E ++ ET E E EL ++
Sbjct: 520 KLRAIEALKERMRSGEELEVIQVNKIETEEEVLSELKELVED 561
Score = 29.5 bits (66), Expect = 1.5
Identities = 23/103 (22%), Positives = 38/103 (36%), Gaps = 4/103 (3%)
Query: 122 KEETSELTKEKTRELAKE--ETRELANEETSELAKEETSELAKEKPRELAKEETRELAKE 179
E S+LT E ++ K + A A K +EL ++ ++
Sbjct: 454 PEPPSKLTIESAKKYVKPSRHRFVPFEKAVIMEADSANRSSAPRK-KELVEQWPEYSDED 512
Query: 180 KTRELAKEETR-ELAKEKTRELAKEKTRELAKEEIRELAKEET 221
K R L K+ E KE+ R + + ++ K E E E
Sbjct: 513 KIRSLLKKLRAIEALKERMRSGEELEVIQVNKIETEEEVLSEL 555
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 35.6 bits (82), Expect = 0.015
Identities = 39/185 (21%), Positives = 79/185 (42%), Gaps = 2/185 (1%)
Query: 34 EERGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAK--EKTSELAKEETSELAKE 91
EE+ K +++ EL +++ +E + K E + E K E+ ++ A + + +
Sbjct: 78 EEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEA 137
Query: 92 KTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSE 151
+A A + + A+ K A ++ AK++ +K AK++ A + +
Sbjct: 138 AAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAA 197
Query: 152 LAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTRELAKE 211
AK++ AK+K AK++ AK + A E K + A++ A
Sbjct: 198 EAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAA 257
Query: 212 EIREL 216
E+ +L
Sbjct: 258 EVDDL 262
Score = 35.6 bits (82), Expect = 0.015
Identities = 37/183 (20%), Positives = 79/183 (43%), Gaps = 2/183 (1%)
Query: 31 LQPEERGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAK--EETSEL 88
Q +++ K A+E+ + +++ EL +++ +E + + E + E K EE ++
Sbjct: 67 QQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQ 126
Query: 89 AKEKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEE 148
A K + + + A K + A+ + A ++ + K+K A ++ A ++
Sbjct: 127 AALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKK 186
Query: 149 TSELAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTREL 208
A + + AK+K AK++ AK+K AK + A E K +
Sbjct: 187 AEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKA 246
Query: 209 AKE 211
A++
Sbjct: 247 AEK 249
Score = 34.4 bits (79), Expect = 0.037
Identities = 34/142 (23%), Positives = 65/142 (45%), Gaps = 2/142 (1%)
Query: 79 ELAKEETSELAKEKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAK 138
E K++ + A+E + A E+ R EK R A+E+ ++ EE ++ K ++ +
Sbjct: 79 EQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQ--AEEAAKQAALKQKQAEE 136
Query: 139 EETRELANEETSELAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTR 198
+ A + A+ + + A +K AK++ A +K AK++ A K
Sbjct: 137 AAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAA 196
Query: 199 ELAKEKTRELAKEEIRELAKEE 220
AK+K AK++ AK++
Sbjct: 197 AEAKKKAEAEAKKKAAAEAKKK 218
>gnl|CDD|223095 COG0016, PheS, Phenylalanyl-tRNA synthetase alpha subunit
[Translation, ribosomal structure and biogenesis].
Length = 335
Score = 35.3 bits (82), Expect = 0.019
Identities = 24/96 (25%), Positives = 38/96 (39%), Gaps = 6/96 (6%)
Query: 48 LAKEETSELAKEKTS--ELAKEETSELAKEKTSELAKEET-SELAKEKTRALAKEETREL 104
LAK+ + ELA + L + K L K+ ++L K+ + EE +E+
Sbjct: 1 EVMIALGRLAKKGIAAIELASDLLKALEELKVKYLGKKGVLTDLLKKLGKLSPLEERKEV 60
Query: 105 AKEKTRELAKEEIRELAKEETSELTKEKT-RELAKE 139
K+E+ + E T EL LA E
Sbjct: 61 GAL--INELKKEVEDAITELTPELEAAGLWERLAFE 94
Score = 32.2 bits (74), Expect = 0.19
Identities = 23/98 (23%), Positives = 37/98 (37%), Gaps = 6/98 (6%)
Query: 120 LAKEETSELTKE--KTRELAKEETRELANEETSELAKEET-SELAKEKPRELAKEETREL 176
L K+ ELA + + L + L K+ ++L K+ + EE +E+
Sbjct: 1 EVMIALGRLAKKGIAAIELASDLLKALEELKVKYLGKKGVLTDLLKKLGKLSPLEERKEV 60
Query: 177 AKEKTRELAKEETRELAKEKTRELAKEKT-RELAKEEI 213
K+E + E T EL LA E+I
Sbjct: 61 GAL--INELKKEVEDAITELTPELEAAGLWERLAFEKI 96
Score = 31.4 bits (72), Expect = 0.28
Identities = 21/89 (23%), Positives = 32/89 (35%), Gaps = 9/89 (10%)
Query: 64 LAKEETSELAKEKTS--ELAKEETSELAKEKTRALAKEETRELAKEKTRELAKEEIR--- 118
LAK+ + ELA + L + K + L K+ +K +L+ E R
Sbjct: 1 EVMIALGRLAKKGIAAIELASDLLKALEELKVKYLGKKGVLTDLLKKLGKLSPLEERKEV 60
Query: 119 ----ELAKEETSELTKEKTRELAKEETRE 143
K+E + E T EL E
Sbjct: 61 GALINELKKEVEDAITELTPELEAAGLWE 89
Score = 31.4 bits (72), Expect = 0.32
Identities = 19/77 (24%), Positives = 30/77 (38%), Gaps = 8/77 (10%)
Query: 151 ELAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETR-------ELAKEKTRELAKE 203
ELA + L + K + L K+ +K +L+ E R K++ + E
Sbjct: 18 ELASDLLKALEELKVKYLGKKGVLTDLLKKLGKLSPLEERKEVGALINELKKEVEDAITE 77
Query: 204 KTRELAKEEI-RELAKE 219
T EL + LA E
Sbjct: 78 LTPELEAAGLWERLAFE 94
Score = 28.7 bits (65), Expect = 2.0
Identities = 23/88 (26%), Positives = 38/88 (43%), Gaps = 2/88 (2%)
Query: 95 ALAKE--ETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSEL 152
LAK+ ELA + + L + +++ L K+ +K +L+ E R+ +EL
Sbjct: 8 RLAKKGIAAIELASDLLKALEELKVKYLGKKGVLTDLLKKLGKLSPLEERKEVGALINEL 67
Query: 153 AKEETSELAKEKPRELAKEETRELAKEK 180
KE + + P A LA EK
Sbjct: 68 KKEVEDAITELTPELEAAGLWERLAFEK 95
>gnl|CDD|237952 PRK15362, PRK15362, pathogenicity island 2 effector protein SseC;
Provisional.
Length = 473
Score = 35.2 bits (81), Expect = 0.021
Identities = 28/120 (23%), Positives = 55/120 (45%), Gaps = 5/120 (4%)
Query: 39 KSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKTRALAK 98
K+A++ A EE E + E +E E K+ + +++K+ +ELA+E ++
Sbjct: 254 KAAEKVLDSGAGEELVERMVGGSEEAIEELAEEFGKQVSEQVSKQVANELAEESAEF-SR 312
Query: 99 EETRELAKEKTRELAKEEIRELAKEETSELTKEKTRE----LAKEETRELANEETSELAK 154
+ + + + KE I + E E K+ +E L KE T+++ E ++ K
Sbjct: 313 NVEKNMTRSLGKAFTKEAIEAMVTEAVEEALKKAVQEGVKFLLKEFTKQVVKEVFKKVIK 372
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 34.8 bits (81), Expect = 0.028
Identities = 34/153 (22%), Positives = 60/153 (39%), Gaps = 4/153 (2%)
Query: 30 TLQPEERGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELA 89
T++ EE + AK E E AK E E A + E AK E + A + + A A
Sbjct: 37 TVEEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAA 96
Query: 90 KEKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTR-ELAKEETRELANEE 148
+ A+ + K ++ AK++ K++ + + + R
Sbjct: 97 RPAEDEAARPAEAAARRPKAKKAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRRRRGR 156
Query: 149 TSELAKEETSELAKEKPRELAKEET---RELAK 178
K++ + ++ PRE+ ET ELA+
Sbjct: 157 RRRRKKKKKQKPTEKIPREVVIPETITVAELAE 189
>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
Provisional.
Length = 1136
Score = 34.0 bits (77), Expect = 0.063
Identities = 50/176 (28%), Positives = 74/176 (42%), Gaps = 4/176 (2%)
Query: 34 EERGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKT 93
EE +++ +E E +E E +E E +E E +E E +E E +E
Sbjct: 960 EENVEENVEENVEENVEENVEENVEENVEENVEENVEENVEENIEENVEENVEENIEENV 1019
Query: 94 RALAKEETRELAK--EKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSE 151
+E E+ + E+ E EEI E A+E E +E E +E E+ EE E
Sbjct: 1020 EEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVEEI--EENIE 1077
Query: 152 LAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTRE 207
EE E E+ E +E E +E E A+E E A+E E +E E
Sbjct: 1078 ENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEENAEEYDDENPEEHNEE 1133
Score = 33.2 bits (75), Expect = 0.11
Identities = 52/184 (28%), Positives = 74/184 (40%), Gaps = 4/184 (2%)
Query: 34 EERGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKT 93
EE ++ A+E E +E E +E E +E E +E E +E E +E
Sbjct: 952 EENVEEDAEENVEENVEENVEENVEENVEENVEENVEENVEENVEENVEENIEENVEENV 1011
Query: 94 RALAKEETRELAKEKTRELAK--EEIRELAKEETSELTKEKTRELAKEETRELANEETSE 151
+E E +E E+ + EE E EE E +E E +E E E E
Sbjct: 1012 EENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVEE 1071
Query: 152 LAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTRELAKE 211
+ EE E E+ E EE E +E E +E E A+E E A+E E +E
Sbjct: 1072 I--EENIEENIEENVEENVEENVEEIEENVEENVEENAEENAEENAEENAEEYDDENPEE 1129
Query: 212 EIRE 215
E
Sbjct: 1130 HNEE 1133
>gnl|CDD|237478 PRK13709, PRK13709, conjugal transfer nickase/helicase TraI;
Provisional.
Length = 1747
Score = 33.6 bits (77), Expect = 0.085
Identities = 24/110 (21%), Positives = 50/110 (45%), Gaps = 5/110 (4%)
Query: 95 ALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSELAK 154
LA+ + E + +T A E +R++A+ + +L KT + ++ + + +
Sbjct: 1643 VLAQRQAEEAIRRETERRADEIVRKMAENK-PDLPDGKTEQAVRDIAGQERDRAAISERE 1701
Query: 155 EETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEK 204
E +P+ E RE+A+E E +++ ++ R+L KEK
Sbjct: 1702 AALPESVLREPQRER-EAVREVARENLLR---ERLQQMERDMVRDLQKEK 1747
Score = 31.7 bits (72), Expect = 0.36
Identities = 25/125 (20%), Positives = 54/125 (43%), Gaps = 7/125 (5%)
Query: 28 PETLQPEE-RGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETS 86
++QP G+ E ++ EE E+ ++ + +E + ++
Sbjct: 1626 DNSVQPGAGNGEPVTAEVLAQRQAEEAIRRETERRADEIVRKMAENKPDLPDGKTEQAVR 1685
Query: 87 ELAK-EKTRALAKEETRELAKEKTRELAKEE--IRELAKEETSELTKEKTRELAKEETRE 143
++A E+ RA E L + RE +E +RE+A+E E+ +++ ++ R+
Sbjct: 1686 DIAGQERDRAAISEREAALPESVLREPQREREAVREVARENLLR---ERLQQMERDMVRD 1742
Query: 144 LANEE 148
L E+
Sbjct: 1743 LQKEK 1747
>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
Length = 977
Score = 33.3 bits (76), Expect = 0.093
Identities = 44/195 (22%), Positives = 76/195 (38%), Gaps = 8/195 (4%)
Query: 28 PETLQPEERGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKE-ETS 86
E Q D + + + + +++T+ E+ S+ E + + E +
Sbjct: 84 MELPQKSTSSDDDHNRASMQRDEAIAAIDNEQQTNSKDGEQLSDFQLEDLVGMIQNAEKN 143
Query: 87 ELAKEKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTR-----ELAKEET 141
L + R A E+ ++ EK K I E+ ET K + E+ +E+
Sbjct: 144 ILLLNQARLQALEDLEKILTEKEALQGKINILEMRLSETDARIKLAAQEKIHVEILEEQL 203
Query: 142 RELANEETSELAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELA 201
+L NE A E + K ++ KEE L + + K E E+A+ + R
Sbjct: 204 EKLRNELLIRGATEGLCVHSLSKELDVLKEENMLLKDDI--QFLKAELIEVAETEERVFK 261
Query: 202 KEKTRELAKEEIREL 216
EK R L +REL
Sbjct: 262 LEKERSLLDASLREL 276
Score = 27.9 bits (62), Expect = 5.2
Identities = 31/105 (29%), Positives = 47/105 (44%), Gaps = 5/105 (4%)
Query: 57 AKEKT-SELAKEETSELAKEKTSELAKEETSELAKEKTRALAKEETRELAKEKTRELAKE 115
A+EK E+ +E+ +L E A E + K + KEE L + + K
Sbjct: 190 AQEKIHVEILEEQLEKLRNELLIRGATEGLCVHSLSKELDVLKEENMLLKDDI--QFLKA 247
Query: 116 EIRELAKEETSELTKEKTRELAKEETRELANEETSELAKEETSEL 160
E+ E+A+ E EK R L REL E +A+E+ S+L
Sbjct: 248 ELIEVAETEERVFKLEKERSLLDASLREL--ESKFIVAQEDVSKL 290
>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
production and conversion].
Length = 660
Score = 32.3 bits (74), Expect = 0.20
Identities = 27/183 (14%), Positives = 49/183 (26%), Gaps = 14/183 (7%)
Query: 51 EETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAK-EKTRALAKEETRELAKEKT 109
E + + + EK K + EK + +E TR+L E+
Sbjct: 64 SEVLDYLRSVKGLEGRLFILPEEVEKLEAELKSLEEVIKPAEKFSSEVEELTRKLE-ERL 122
Query: 110 RELAKEEIRELAKEETSELTKEKTRELAKEETRELAN-------EETSELAKEETSELAK 162
EL +E E E +L+ + E E +
Sbjct: 123 SELDEELEDLEDLLEELEPLAYLDFDLSLLRGLKFLLVRLGLVRREKLEALVGVIEDEVA 182
Query: 163 EKPRELAKEE-----TRELAKEKTRELAKEETRELAKEKTRELAKEKTRELAKEEIRELA 217
+ +K ++ E EL + + + +E I E+
Sbjct: 183 LYGENVEASVVIVVAHGAEDLDKVSKILNELGFELYEVPEFDGGPSELISELEEVIAEIQ 242
Query: 218 KEE 220
E
Sbjct: 243 DEL 245
>gnl|CDD|237796 PRK14712, PRK14712, conjugal transfer nickase/helicase TraI;
Provisional.
Length = 1623
Score = 32.5 bits (73), Expect = 0.21
Identities = 31/106 (29%), Positives = 57/106 (53%), Gaps = 5/106 (4%)
Query: 96 LAKEETRELAKEKTRELAKEEIRELAKEET---SELTKEKTRELAKEETRELANEETSEL 152
LA+ + E + +T A E +R++A+ + T++ RE+A +E R+ A E
Sbjct: 1515 LAQRQAEEAIRRETERRADEIVRKMAENKPDLPDGKTEQAVREIAGQE-RDRAAITEREA 1573
Query: 153 AKEETSELAKEKPRELAKEETRE-LAKEKTRELAKEETRELAKEKT 197
A E+ ++ RE +E RE L +E+ +++ ++ R+L KEKT
Sbjct: 1574 ALPESVLREPQRVREAVREVARENLLQERLQQMERDMVRDLQKEKT 1619
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 32.2 bits (74), Expect = 0.22
Identities = 25/87 (28%), Positives = 42/87 (48%), Gaps = 1/87 (1%)
Query: 119 ELAKEETSELTKEKTRELAKEETRELANEETSELAKEETSELAKEKPRELAKEETRELAK 178
EL +EE LT K ++ E A E+ E K+E + A ++ +EE + K
Sbjct: 393 ELTEEEIEFLTGSKKATKKIKKIVEKA-EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKK 451
Query: 179 EKTRELAKEETRELAKEKTRELAKEKT 205
E+ +E +EE E +E+ + K+ T
Sbjct: 452 EEEKEEEEEEAEEEKEEEEEKKKKQAT 478
Score = 31.8 bits (73), Expect = 0.30
Identities = 24/81 (29%), Positives = 36/81 (44%), Gaps = 1/81 (1%)
Query: 79 ELAKEETSELAKEKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAK 138
EL +EE L K K + EK RE K+E ++ A + +E+ + K
Sbjct: 393 ELTEEEIEFLTGSKKAT-KKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKK 451
Query: 139 EETRELANEETSELAKEETSE 159
EE +E EE E +EE +
Sbjct: 452 EEEKEEEEEEAEEEKEEEEEK 472
Score = 31.0 bits (71), Expect = 0.52
Identities = 25/87 (28%), Positives = 39/87 (44%), Gaps = 1/87 (1%)
Query: 47 ELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKTRALAKEETRELAK 106
EL +EE L K + K + EK E K+E + A + +EE + K
Sbjct: 393 ELTEEEIEFLTGSKKAT-KKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKK 451
Query: 107 EKTRELAKEEIRELAKEETSELTKEKT 133
E+ +E +EE E +EE + K+ T
Sbjct: 452 EEEKEEEEEEAEEEKEEEEEKKKKQAT 478
Score = 29.5 bits (67), Expect = 1.3
Identities = 23/81 (28%), Positives = 37/81 (45%), Gaps = 1/81 (1%)
Query: 63 ELAKEETSELAKEKTSELAKEETSELAKEKTRALAKEETRELAKEKTRELAKEEIRELAK 122
EL +EE L K + K + EK R K+E ++ A ++ +EE + K
Sbjct: 393 ELTEEEIEFLTGSKKAT-KKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKK 451
Query: 123 EETSELTKEKTRELAKEETRE 143
EE E +E+ E +EE +
Sbjct: 452 EEEKEEEEEEAEEEKEEEEEK 472
Score = 29.5 bits (67), Expect = 1.4
Identities = 22/78 (28%), Positives = 41/78 (52%), Gaps = 1/78 (1%)
Query: 135 ELAKEETRELANEETSELAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAK 194
EL +EE L + + ++ E A++K RE K+E ++ A ++ +EE + K
Sbjct: 393 ELTEEEIEFLTGSKKATKKIKKIVEKAEKK-REEEKKEKKKKAFAGKKKEEEEEEEKEKK 451
Query: 195 EKTRELAKEKTRELAKEE 212
E+ +E +E+ E +EE
Sbjct: 452 EEEKEEEEEEAEEEKEEE 469
Score = 29.5 bits (67), Expect = 1.4
Identities = 25/86 (29%), Positives = 41/86 (47%), Gaps = 1/86 (1%)
Query: 96 LAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSELAKE 155
L +EE L K ++I E A+++ E KEK ++ A ++ EE + KE
Sbjct: 394 LTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKK-AFAGKKKEEEEEEEKEKKE 452
Query: 156 ETSELAKEKPRELAKEETRELAKEKT 181
E E +E+ E +EE + K+ T
Sbjct: 453 EEKEEEEEEAEEEKEEEEEKKKKQAT 478
Score = 29.5 bits (67), Expect = 1.7
Identities = 25/91 (27%), Positives = 38/91 (41%), Gaps = 7/91 (7%)
Query: 130 KEKTRELAKEETRELANEETSELAKEETSELAKEKPRELAKEETRELAKEKTRELAKEET 189
+ ELA L EL +EE L K K + EK RE K+E
Sbjct: 378 FKHNPELAARLAAFL------ELTEEEIEFLTGSKKAT-KKIKKIVEKAEKKREEEKKEK 430
Query: 190 RELAKEKTRELAKEKTRELAKEEIRELAKEE 220
++ A ++ +E+ + KEE +E +EE
Sbjct: 431 KKKAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461
Score = 27.6 bits (62), Expect = 6.1
Identities = 23/87 (26%), Positives = 43/87 (49%), Gaps = 2/87 (2%)
Query: 55 ELAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKTRALAKEETRELAKEKTRELAK 114
EL +E+ L + + +K E A+++ E KEK + + +E +E+ E K
Sbjct: 393 ELTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEE--EKEK 450
Query: 115 EEIRELAKEETSELTKEKTRELAKEET 141
+E + +EE +E KE+ E K++
Sbjct: 451 KEEEKEEEEEEAEEEKEEEEEKKKKQA 477
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 32.0 bits (73), Expect = 0.22
Identities = 22/114 (19%), Positives = 42/114 (36%), Gaps = 2/114 (1%)
Query: 73 AKEKTSELAKEETSELAKEKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEK 132
++ + S + L+ E K T K++ + E + K+K
Sbjct: 16 TQKSKLQPISYIYSNVLVLSKEILSTFS-EEENKVATTSTKKDKKEDKNNESKKKSEKKK 74
Query: 133 TRELAKEETRELANEETSELAKEETSELAKEKPRELAKEETRELAKEKTRELAK 186
++ K+E + E + S+ K+KP + E + A K ELA+
Sbjct: 75 KKKKEKKEPKSE-GETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAE 127
Score = 29.3 bits (66), Expect = 1.6
Identities = 16/94 (17%), Positives = 34/94 (36%), Gaps = 3/94 (3%)
Query: 32 QPEERGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAK---EKTSELAKEETSEL 88
+E ++EE + ++K +E K+ + K +K + E
Sbjct: 34 LSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGF 93
Query: 89 AKEKTRALAKEETRELAKEKTRELAKEEIRELAK 122
K K++ + + + A +I ELA+
Sbjct: 94 KTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAE 127
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 32.1 bits (73), Expect = 0.23
Identities = 24/57 (42%), Positives = 29/57 (50%), Gaps = 9/57 (15%)
Query: 160 LAKEKPRELAKEETRELAKEKT-RELAKEETRELAKEKTRELAKEKTRELAKEEIRE 215
L +EK REL K LAKE+ RE EE R +EK A E R AK E+ +
Sbjct: 250 LLEEKRRELEK-----LAKEEAERERQAEEQRRREEEKA---AMEADRAQAKAEVEK 298
Score = 29.8 bits (67), Expect = 1.1
Identities = 23/68 (33%), Positives = 29/68 (42%), Gaps = 12/68 (17%)
Query: 104 LAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSELAKEETSELAKE 163
L +EK REL K LAKEE RE EE R E+ + A ++ E
Sbjct: 250 LLEEKRRELEK-----LAKEEAE-------RERQAEEQRRREEEKAAMEADRAQAKAEVE 297
Query: 164 KPRELAKE 171
K RE +
Sbjct: 298 KRREKLQN 305
Score = 29.1 bits (65), Expect = 2.5
Identities = 28/79 (35%), Positives = 36/79 (45%), Gaps = 16/79 (20%)
Query: 136 LAKEETRELANEETSELAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKE 195
L +E+ REL +LAKEE RE EE R +EK A E R AK
Sbjct: 250 LLEEKRREL-----EKLAKEEAE-------RERQAEEQRRREEEKA---AMEADRAQAKA 294
Query: 196 KTRELAKEKTRELAKEEIR 214
+ E +EK + L K+ R
Sbjct: 295 EV-EKRREKLQNLLKKASR 312
Score = 28.7 bits (64), Expect = 3.3
Identities = 22/50 (44%), Positives = 25/50 (50%), Gaps = 7/50 (14%)
Query: 176 LAKEKTRE---LAKEET-RELAKEKTRELAKEKTRELAKEEIRELAKEET 221
L +EK RE LAKEE RE E+ R +EK A E R AK E
Sbjct: 250 LLEEKRRELEKLAKEEAERERQAEEQRRREEEKA---AMEADRAQAKAEV 296
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 31.8 bits (73), Expect = 0.25
Identities = 34/186 (18%), Positives = 68/186 (36%), Gaps = 11/186 (5%)
Query: 39 KSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKTRALAK 98
K+ K ++ + A+E A+ + +E ++ + A K A+A
Sbjct: 496 KAKKAAATQPIVIKAGARPDNSAVIAAREARKAQARARQAE---KQAAAAADPKKAAVAA 552
Query: 99 EETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSELAKEETS 158
R AK+ ++ A E E + + + R AK+ A ++ + EE
Sbjct: 553 AIARAKAKKAAQQAANAEAEEEVDPKKAAVAAAIARAKAKK-----AAQQAASAEPEEQV 607
Query: 159 ELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTR---ELAKEKTRELAKEEIRE 215
K +A R AK+ ++ E + K +A+ K R+ A+++
Sbjct: 608 AEVDPKKAAVAAAIARAKAKKAEQQANAEPEEPVDPRKAAVAAAIARAKARKAAQQQANA 667
Query: 216 LAKEET 221
+E
Sbjct: 668 EPEEAE 673
Score = 29.5 bits (67), Expect = 1.5
Identities = 33/180 (18%), Positives = 62/180 (34%), Gaps = 17/180 (9%)
Query: 49 AKEETSELAKEK-TSELAKE----ETSELAKEKTSELAKEETSELAKEKTRALAKEETRE 103
AK E + +EK +E AK + L +EK + A+ K+ A A ++
Sbjct: 434 AKAEIRAIEQEKKKAEEAKARFEARQARLEREKAAREAR------HKKAAEARAAKDKDA 487
Query: 104 LAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSELAKEETSELAKE 163
+A R AK+ + + + +A E R+ + E A
Sbjct: 488 VAAALARVKAKKAAATQPIVIKAGARPDNSAVIAAREARKAQARA----RQAEKQAAAAA 543
Query: 164 KPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTRELAKEEIRELAKEETNS 223
P++ A AK K + ++ A AK ++ A++ ++
Sbjct: 544 DPKKAAVAAAIARAKAKKAAQQAANAEAEEEVDPKKAAVAAAIARAKA--KKAAQQAASA 601
Score = 29.5 bits (67), Expect = 1.5
Identities = 30/200 (15%), Positives = 64/200 (32%), Gaps = 11/200 (5%)
Query: 19 SASPNSCGRPETLQPEERGDKSAKEETSELAKEETSELAKEKTSELAKEETSELA----K 74
+A+ +P+ +A+E A+ +E + + K+ A K
Sbjct: 500 AAATQPIVIKAGARPDNSAVIAAREARKAQARARQAE-KQAAAAADPKKAAVAAAIARAK 558
Query: 75 EKTSELAKEETSELAKEKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTR 134
K + + + A AK K A ++ EE K
Sbjct: 559 AKKAAQQAANAEAEEEVDPKKAAVAAAIARAKAK---KAAQQAASAEPEEQVAEVDPKKA 615
Query: 135 ELAKEETRELANEETSELAKEETSELAKEKPR---ELAKEETRELAKEKTRELAKEETRE 191
+A R A + + E + K +A+ + R+ A+++ +E
Sbjct: 616 AVAAAIARAKAKKAEQQANAEPEEPVDPRKAAVAAAIARAKARKAAQQQANAEPEEAEDP 675
Query: 192 LAKEKTRELAKEKTRELAKE 211
+A+ K ++ A++
Sbjct: 676 KKAAVAAAIARAKAKKAAQQ 695
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 31.7 bits (72), Expect = 0.29
Identities = 22/123 (17%), Positives = 43/123 (34%), Gaps = 5/123 (4%)
Query: 90 KEKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELA-KEETRELANEE 148
K L + R + E E+ K T+E + K R+ ++ R E
Sbjct: 302 KAFLWTLRRNRLRMIITPWRAPELHAENAEIKKTRTAEKNEAKARKKEIAQKRRAAEREI 361
Query: 149 TSELAKEETSELAKEKPRELAKEETRELAKEKT----RELAKEETRELAKEKTRELAKEK 204
E +E + +A+ + R A + ++ + + EE++ + E
Sbjct: 362 NREARQERAAAMARARARRAAVKAKKKGLIDASPNEDTPSENEESKGSPPQVEATTTAEP 421
Query: 205 TRE 207
RE
Sbjct: 422 NRE 424
>gnl|CDD|233626 TIGR01897, cas_MJ1666, CRISPR-associated protein, MJ1666 family.
CRISPR is a term for Clustered, Regularly Interspaced
Short Palidromic Repeats. A number of protein families
appear only in association with these repeats and are
designated Cas (CRISPR-Associated) proteins. This model
describes a Cas protein about 400 residues in length,
found mostly in the Archaea but also in Aquifex [Mobile
and extrachromosomal element functions, Other].
Length = 410
Score = 31.6 bits (72), Expect = 0.33
Identities = 22/89 (24%), Positives = 35/89 (39%), Gaps = 3/89 (3%)
Query: 77 TSELAKEET-SELAKEKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRE 135
EL L K +L E + E+ EEI++LA+ E E
Sbjct: 279 KRELYLGRDFDTLLKAYFHSLILEVLKIKGNSPKNEVKLEEIKKLAETFFDEEYISVYIE 338
Query: 136 LAKEETRELAN--EETSELAKEETSELAK 162
+ K E + N ++ ELA + +E+ K
Sbjct: 339 IIKNELDNIKNRSKKAEELALIKATEIYK 367
Score = 27.0 bits (60), Expect = 8.5
Identities = 16/63 (25%), Positives = 26/63 (41%), Gaps = 2/63 (3%)
Query: 158 SELAKEKPRELAKEETRELAKEKTRELAKEETRELA--KEKTRELAKEKTRELAKEEIRE 215
+E+ E+ ++LA+ E E+ K E + +K ELA K E+ K
Sbjct: 313 NEVKLEEIKKLAETFFDEEYISVYIEIIKNELDNIKNRSKKAEELALIKATEIYKVSGEG 372
Query: 216 LAK 218
K
Sbjct: 373 FEK 375
>gnl|CDD|221121 pfam11489, DUF3210, Protein of unknown function (DUF3210). This is
a family of proteins conserved in yeasts. The function
is not known. The Schizosaccharomyces pombe member is
SPBC18E5.07 and the Saccharomyces cerevisiae member is
AIM21.
Length = 671
Score = 31.4 bits (71), Expect = 0.37
Identities = 25/115 (21%), Positives = 38/115 (33%), Gaps = 1/115 (0%)
Query: 19 SASPNSCGRPETLQPEERGDKSAKEETSELAKEETSELA-KEKTSELAKEETSELAKEKT 77
SP+S T+ A E + +SE++ SEL E+ +
Sbjct: 417 EDSPSSLQDTATVSTPSNPPPRASETPEQETSRSSSEVSLDPHQSELKSEKKKARPEVSK 476
Query: 78 SELAKEETSELAKEKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEK 132
+ E A E + EET E K + + K I K+ K K
Sbjct: 477 QRFPSRDVWEDAPESQELVTTEETPEEVKSSSPGVTKPAIPSRPKKGKPTSEKRK 531
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 31.3 bits (70), Expect = 0.46
Identities = 38/191 (19%), Positives = 78/191 (40%)
Query: 29 ETLQPEERGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSEL 88
E + + + KEE + A + ++K ++ AK++ E K+ +
Sbjct: 1360 EAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKK 1419
Query: 89 AKEKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEE 148
A E + +++ + AK+K E K + + EE + + K + ++ E +
Sbjct: 1420 ADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKA 1479
Query: 149 TSELAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTREL 208
+E + A+E ++ + + AK+K E K E + A E + +K E
Sbjct: 1480 EEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEA 1539
Query: 209 AKEEIRELAKE 219
K E ++ A E
Sbjct: 1540 KKAEEKKKADE 1550
Score = 31.3 bits (70), Expect = 0.52
Identities = 40/179 (22%), Positives = 75/179 (41%)
Query: 42 KEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKTRALAKEET 101
+ EL K E + A E K++ E K+ +E + A+E + +
Sbjct: 1277 ARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKK 1336
Query: 102 RELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSELAKEETSELA 161
+ +K E AK E A E + K + E KEE ++ A+ + +++ ++ A
Sbjct: 1337 KAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEA 1396
Query: 162 KEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTRELAKEEIRELAKEE 220
K+K E K+ ++ A E ++ ++K + AK+K E K + + EE
Sbjct: 1397 KKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEE 1455
Score = 30.9 bits (69), Expect = 0.65
Identities = 50/198 (25%), Positives = 80/198 (40%), Gaps = 7/198 (3%)
Query: 27 RPETLQPEERGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETS 86
+ + + K A E ++ E K + ++ A E K+K EL K E
Sbjct: 1498 KADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEEL 1557
Query: 87 ELAKEKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELAN 146
+ A+EK +A EE ++ ++K L K E + A+E E + E K + E
Sbjct: 1558 KKAEEKKKA---EEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKK 1614
Query: 147 EETSELAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAK----EKTRELAK 202
E +++ EE + +EK + ++ K+K EL K E K K E K
Sbjct: 1615 AEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDK 1674
Query: 203 EKTRELAKEEIRELAKEE 220
+K E K E E E
Sbjct: 1675 KKAEEAKKAEEDEKKAAE 1692
Score = 30.1 bits (67), Expect = 1.1
Identities = 38/192 (19%), Positives = 75/192 (39%)
Query: 29 ETLQPEERGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSEL 88
E + E A+ E E A EK E AK++ K+ + +E +
Sbjct: 1340 EAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKK 1399
Query: 89 AKEKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEE 148
A+E + + + AK+K E K+ + +E + +E + ++ E A +
Sbjct: 1400 AEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKA 1459
Query: 149 TSELAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTREL 208
K E ++ A E ++ + + + AK+K E K+ + ++ A E +
Sbjct: 1460 EEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAE 1519
Query: 209 AKEEIRELAKEE 220
++ E K E
Sbjct: 1520 EAKKADEAKKAE 1531
Score = 29.7 bits (66), Expect = 1.7
Identities = 46/193 (23%), Positives = 87/193 (45%), Gaps = 6/193 (3%)
Query: 34 EERGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKT 93
E++ K E ++ A+E E + E K + E K + +++ EE + +EK
Sbjct: 1574 EDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKK 1633
Query: 94 RALAKEETRELAKEKTRELAKEE------IRELAKEETSELTKEKTRELAKEETRELANE 147
+ ++ K+K EL K E E AK+ + K + + A+E+ ++ A
Sbjct: 1634 KVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEA 1693
Query: 148 ETSELAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTRE 207
E + + +E K+K E K+ E+ ++ EE ++ A+E ++ + K E
Sbjct: 1694 LKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDE 1753
Query: 208 LAKEEIRELAKEE 220
K++I L KEE
Sbjct: 1754 EEKKKIAHLKKEE 1766
Score = 29.0 bits (64), Expect = 2.5
Identities = 48/191 (25%), Positives = 87/191 (45%), Gaps = 3/191 (1%)
Query: 27 RPETLQPEERGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETS 86
R E + +K K E ++ A+E + + K +E K++ +L K++ E K E
Sbjct: 1593 RIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEEL 1652
Query: 87 ELAKEKTRALAKEETR--ELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETREL 144
+ A+E+ + A EE + E K+K E K E E E + E+ ++ + + +E
Sbjct: 1653 KKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEA 1712
Query: 145 ANEETSELAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEK 204
++ +E K+ E K K E KE + K + + +EE +++A K E K +
Sbjct: 1713 EEKKKAEELKKA-EEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAE 1771
Query: 205 TRELAKEEIRE 215
KE + E
Sbjct: 1772 EIRKEKEAVIE 1782
Score = 28.6 bits (63), Expect = 3.4
Identities = 39/195 (20%), Positives = 88/195 (45%), Gaps = 4/195 (2%)
Query: 29 ETLQPEERGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSEL 88
E + +E K+ + + ++ AK++ E K+ + K E ++ A E A+ E
Sbjct: 1300 EKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEA 1359
Query: 89 AKEKTRALAKEETRELAKEKTRELAKE----EIRELAKEETSELTKEKTRELAKEETREL 144
+ +A A E+ +E AK+K K+ + + AK++ E K+ ++
Sbjct: 1360 EAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKK 1419
Query: 145 ANEETSELAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEK 204
A+E + +++ ++ AK+K E K + + E+ ++ + + + +K E K+
Sbjct: 1420 ADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKA 1479
Query: 205 TRELAKEEIRELAKE 219
+E ++ A+E
Sbjct: 1480 EEAKKADEAKKKAEE 1494
Score = 28.6 bits (63), Expect = 3.6
Identities = 46/197 (23%), Positives = 78/197 (39%), Gaps = 5/197 (2%)
Query: 29 ETLQPEERGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSEL 88
E + +E K+ + + ++ AK++ E K+ E + A E ++ E
Sbjct: 1468 EAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEA 1527
Query: 89 AKEKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRE-----LAKEETRE 143
K + A E + K+K EL K E + A+E+ +K E L K E +
Sbjct: 1528 KKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAK 1587
Query: 144 LANEETSELAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKE 203
A E E + E K K E K E ++ E+ ++ +E+ + +K K+
Sbjct: 1588 KAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKK 1647
Query: 204 KTRELAKEEIRELAKEE 220
K EL K E K
Sbjct: 1648 KAEELKKAEEENKIKAA 1664
Score = 28.2 bits (62), Expect = 4.7
Identities = 40/170 (23%), Positives = 76/170 (44%), Gaps = 2/170 (1%)
Query: 51 EETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKTRALAKEETRELAKEKTR 110
EE + ++K L K E ++ A+E E + E K K K E ++ E+ +
Sbjct: 1567 EEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELK 1626
Query: 111 ELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSELAKEETSELAKEKPRELAK 170
+ +E+ + ++ K+K EL K E E + +E AK+ + K + + A+
Sbjct: 1627 KAEEEKKKVEQLKKKEAEEKKKAEELKKAE--EENKIKAAEEAKKAEEDKKKAEEAKKAE 1684
Query: 171 EETRELAKEKTRELAKEETRELAKEKTRELAKEKTRELAKEEIRELAKEE 220
E+ ++ A+ +E + + E K+K E K+ EE ++ EE
Sbjct: 1685 EDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEE 1734
Score = 27.8 bits (61), Expect = 6.2
Identities = 39/186 (20%), Positives = 78/186 (41%)
Query: 35 ERGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKTR 94
++ D++ K E + A E +K E K E ++ A+E + ++ E+ R
Sbjct: 1534 KKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEAR 1593
Query: 95 ALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSELAK 154
+ E K+ E AK+ K E + +E+ +++ + + +E ++ +E K
Sbjct: 1594 IEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELK 1653
Query: 155 EETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTRELAKEEIR 214
+ E + E K E + E+ ++ ++E + K +K EL K+E
Sbjct: 1654 KAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAE 1713
Query: 215 ELAKEE 220
E K E
Sbjct: 1714 EKKKAE 1719
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 30.8 bits (70), Expect = 0.49
Identities = 38/164 (23%), Positives = 67/164 (40%), Gaps = 2/164 (1%)
Query: 56 LAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKTRALAKEETRELAKEKTRELAKE 115
LAKE L + + K + + L +E + + R A+ E EL +E+ R + KE
Sbjct: 31 LAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEERLVQKE 90
Query: 116 EIRELAKEETSELTKEKTRELAKEETRELANEETSELAKEETSELAKEKPRELAKEETRE 175
E + E+ L + REL EE + E +A P + K +
Sbjct: 91 EQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQARKLLLKL 150
Query: 176 LAKEKTRELAKEETRELAKEKTRELAKEKTRELAKEEIRELAKE 219
L E E A+ + +E+ A+ K + + + ++ +A E
Sbjct: 151 LDAELEEEKAQRVKK--IEEEADLEAERKAQNILAQAMQRIASE 192
Score = 27.0 bits (60), Expect = 8.4
Identities = 33/128 (25%), Positives = 50/128 (39%)
Query: 92 KTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSE 151
K + LAKE R L + + K E L +E + + R+ A+ E EL EE
Sbjct: 27 KRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEERL 86
Query: 152 LAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTRELAKE 211
+ KEE + EK L + REL EE + + +A + K
Sbjct: 87 VQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQARKL 146
Query: 212 EIRELAKE 219
++ L E
Sbjct: 147 LLKLLDAE 154
>gnl|CDD|224337 COG1419, FlhF, Flagellar GTP-binding protein [Cell motility and
secretion].
Length = 407
Score = 30.8 bits (70), Expect = 0.58
Identities = 23/107 (21%), Positives = 44/107 (41%), Gaps = 4/107 (3%)
Query: 114 KEEIRELAKEETSELTKEKTRELAKEETRELANEETSELAKEETSELAKEKPRELAKEET 173
E+ +LA + SEL K +TRE AK + E + K E + AK+ R
Sbjct: 48 AVEVTKLAAVD-SELRKFQTREAAKVKDAEAQLLKDPAEKKREERKAAKKIERSTPSLIE 106
Query: 174 RELAKEKTRELAKEETRELAKEKTRELAKEKTRELAKEEIRELAKEE 220
R+ + K + EE E+ +++ + + + + + +
Sbjct: 107 RKTQEVKD---SGEEIAEMMRDEKVPIRELEEIPPEFVALYKQEIQS 150
>gnl|CDD|216831 pfam01988, VIT1, VIT family. This family includes the vacuolar
Fe2+/Mn2+ uptake transporter, Ccc1 and the vacuolar iron
transporter VIT1.
Length = 209
Score = 30.0 bits (68), Expect = 0.73
Identities = 23/76 (30%), Positives = 34/76 (44%), Gaps = 2/76 (2%)
Query: 148 ETSELAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTRE 207
SE EE +EL +EK REL + E + A+ + ELA++ REL K+ +
Sbjct: 56 VKSERDLEE-AELEREK-RELENDPEGEREELAEIYRARGLSEELARQVARELMKKDALD 113
Query: 208 LAKEEIRELAKEETNS 223
E L E +
Sbjct: 114 AMMREELGLEPETASP 129
Score = 27.6 bits (62), Expect = 4.8
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 2/77 (2%)
Query: 124 ETSELTKEKTRELAKEETRELANEETSELAKEETSELAKEKPRELAKEETRELAKEKTRE 183
SE E+ EL +E+ REL N+ E + A+ ELA++ REL K+ +
Sbjct: 56 VKSERDLEE-AELEREK-RELENDPEGEREELAEIYRARGLSEELARQVARELMKKDALD 113
Query: 184 LAKEETRELAKEKTREL 200
E L E L
Sbjct: 114 AMMREELGLEPETASPL 130
>gnl|CDD|173885 cd08520, PBP2_NikA_DppA_OppA_like_21, The substrate-binding
component of an uncharacterized ABC-type
nickel/dipeptide/oligopeptide-like import system
contains the type 2 periplasmic binding fold. This CD
represents the substrate-binding domain of an
uncharacterized ATP-binding cassette (ABC) type
nickel/dipeptide/oligopeptide-like transporter. The
oligopeptide-binding protein OppA and the
dipeptide-binding protein DppA show significant sequence
similarity to NikA, the initial nickel receptor. The
DppA binds dipeptides and some tripeptides and is
involved in chemotaxis toward dipeptides, whereas the
OppA binds peptides of a wide range of lengths (2-35
amino acid residues) and plays a role in recycling of
cell wall peptides, which precludes any involvement in
chemotaxis. Most of other periplasmic binding proteins
are comprised of only two globular subdomains
corresponding to domains I and III of the
dipeptide/oligopeptide binding proteins. The structural
topology of these domains is most similar to that of the
type 2 periplasmic binding proteins (PBP2), which are
responsible for the uptake of a variety of substrates
such as phosphate, sulfate, polysaccharides,
lysine/arginine/ornithine, and histidine. The PBP2 bind
their ligand in the cleft between these domains in a
manner resembling a Venus flytrap. After binding their
specific ligand with high affinity, they can interact
with a cognate membrane transport complex comprised of
two integral membrane domains and two cytoplasmically
located ATPase domains. This interaction triggers the
ligand translocation across the cytoplasmic membrane
energized by ATP hydrolysis. Besides transport
proteins, the PBP2 superfamily includes the
ligand-binding domains from ionotropic glutamate
receptors, LysR-type transcriptional regulators, and
unorthodox sensor proteins involved in signal
transduction.
Length = 468
Score = 30.4 bits (69), Expect = 0.76
Identities = 11/33 (33%), Positives = 17/33 (51%)
Query: 115 EEIRELAKEETSELTKEKTRELAKEETRELANE 147
EE+ L +++ E+ EK +EL E A E
Sbjct: 414 EELNALLRQQLQEMDPEKRKELVFEIQELYAEE 446
Score = 29.2 bits (66), Expect = 1.9
Identities = 12/35 (34%), Positives = 19/35 (54%)
Query: 145 ANEETSELAKEETSELAKEKPRELAKEETRELAKE 179
NEE + L +++ E+ EK +EL E A+E
Sbjct: 412 DNEELNALLRQQLQEMDPEKRKELVFEIQELYAEE 446
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 30.2 bits (68), Expect = 0.80
Identities = 35/193 (18%), Positives = 75/193 (38%)
Query: 27 RPETLQPEERGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETS 86
+ +Q +++ ++E + +++ E K++ +E A+++ E + E +
Sbjct: 51 QANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQA 110
Query: 87 ELAKEKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELAN 146
E+ + A+E + A E + E ++ +E + +E + A E ++ A
Sbjct: 111 AKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAE 170
Query: 147 EETSELAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTR 206
+ A+ + AK K + + E AK K A + A A+ K
Sbjct: 171 AKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAAAAEAERKAD 230
Query: 207 ELAKEEIRELAKE 219
E +I LA
Sbjct: 231 EAELGDIFGLASG 243
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 30.0 bits (68), Expect = 1.1
Identities = 20/115 (17%), Positives = 48/115 (41%), Gaps = 13/115 (11%)
Query: 114 KEEIRELAKEETSELTKEKTRELAKEETRELANEETSEL------AKEETSELAKEKPRE 167
++E+R+ AK E ++ +K ++ + +L +E L + S + E
Sbjct: 493 RDEVRDAAKAEMKLISLDKKKKQLLQLCDKLRDEWLPNLGIRIEDKPDGPSVWKLDDKEE 552
Query: 168 LAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTREL---AKEEIRELAKE 219
L +E+ + A ++ + L K + +E+ ++ EK + E + +
Sbjct: 553 LQREKEEKEALKEQKRLRKLKK----QEEKKKKELEKLEKAKIPPAEFFKRQEDK 603
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 30.0 bits (68), Expect = 1.2
Identities = 16/158 (10%), Positives = 52/158 (32%), Gaps = 4/158 (2%)
Query: 32 QPEERGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKE 91
Q K+ + + K A + ++ + + +K+ + A +
Sbjct: 54 QVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALD 113
Query: 92 KTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSE 151
K L + ++ + + ++ +L ++ + ++ +E + + + +
Sbjct: 114 KDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDED----DDEDDDDDDVDDED 169
Query: 152 LAKEETSELAKEKPRELAKEETRELAKEKTRELAKEET 189
K+E EL K + + + + + T
Sbjct: 170 EEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAKLT 207
Score = 29.2 bits (66), Expect = 1.6
Identities = 15/145 (10%), Positives = 47/145 (32%), Gaps = 4/145 (2%)
Query: 29 ETLQPEERGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSEL 88
+ + K K A + ++ + + +K+ + A +K +L + ++
Sbjct: 67 DDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDV 126
Query: 89 AKEKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEE 148
+ ++ +L + + +E +E + + K+E +EL
Sbjct: 127 LNQADDDDDDDDDDDLDDDDIDDDDDDE----DDDEDDDDDDVDDEDEEKKEAKELEKLS 182
Query: 149 TSELAKEETSELAKEKPRELAKEET 173
+ + + + + T
Sbjct: 183 DDDDFVWDEDDSEALRQARKDAKLT 207
Score = 28.8 bits (65), Expect = 2.4
Identities = 21/206 (10%), Positives = 66/206 (32%), Gaps = 1/206 (0%)
Query: 17 MSSASPNSCGRPETLQPEERGDKSAKEETSELAKEETSELAKEKTSELAKEET-SELAKE 75
++++ E + + + + + + KEE E + K + +
Sbjct: 2 TTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSG 61
Query: 76 KTSELAKEETSELAKEKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRE 135
+ S++ K+KT+ AK + +K + + ++ K+ + +
Sbjct: 62 MVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYV 121
Query: 136 LAKEETRELANEETSELAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKE 195
+ + +++ + + + + + +E + + K+E +EL K
Sbjct: 122 KDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKL 181
Query: 196 KTRELAKEKTRELAKEEIRELAKEET 221
+ + + T
Sbjct: 182 SDDDDFVWDEDDSEALRQARKDAKLT 207
>gnl|CDD|131686 TIGR02638, lactal_redase, lactaldehyde reductase. This clade of
genes encoding iron-containing alcohol dehydrogenase
(pfam00465) proteins is generally found in apparent
operons for the catabolism of rhamnose or fucose.
Catabolism of both of these monosaccharides results in
lactaldehyde which is reduced by this enzyme to 1,2
propanediol. This protein is alternatively known by the
name 1,2 propanediol oxidoreductase. This enzyme is
active under anaerobic conditions in E. coli while being
inactivated by reactive oxygen species under aerobic
conditions. Under aerobic conditions the lactaldehyde
product of rhamnose and fucose catabolism is believed to
be oxidized to lactate by a separate enzyme,
lactaldehyde dehydrogenase [Energy metabolism, Sugars].
Length = 379
Score = 29.7 bits (67), Expect = 1.3
Identities = 15/43 (34%), Positives = 24/43 (55%), Gaps = 3/43 (6%)
Query: 179 EKTRELAKEETRELAKE---KTRELAKEKTRELAKEEIRELAK 218
E E E+ RE+AK KT ++ E+ R+ A E ++ L+K
Sbjct: 290 EFNAEFTGEKYREIAKAMGVKTEGMSDEEARDAAVEAVKTLSK 332
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 29.1 bits (66), Expect = 1.6
Identities = 22/61 (36%), Positives = 31/61 (50%), Gaps = 1/61 (1%)
Query: 107 EKTRELAKEEIRELAKEETSELTKEKTRELAK-EETRELANEETSELAKEETSELAKEKP 165
+KTRE +E+I + A+EE E +EK E K E +LA E K E E K+
Sbjct: 262 DKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQAR 321
Query: 166 R 166
+
Sbjct: 322 K 322
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
This model family represents the major capsid protein
component of the heads (capsids) of bacteriophage HK97,
phi-105, P27, and related phage. This model represents
one of several analogous families lacking detectable
sequence similarity. The gene encoding this component is
typically located in an operon encoding the small and
large terminase subunits, the portal protein and the
prohead or maturation protease [Mobile and
extrachromosomal element functions, Prophage functions].
Length = 384
Score = 29.2 bits (66), Expect = 1.6
Identities = 27/105 (25%), Positives = 40/105 (38%), Gaps = 3/105 (2%)
Query: 50 KEETSELAKEKTSELAKEETSELAKEKTS-ELAKEETSELAKEKTR--ALAKEETRELAK 106
+E +ELAKE E+ +E +EK + KEE +L E R L E + A
Sbjct: 5 REALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAA 64
Query: 107 EKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSE 151
+EE E E + + A E R+ + T
Sbjct: 65 SGEGGGGEEEEEEAKAEAAEFRAYLRGGDDALAEERKALSTGTDS 109
Score = 27.7 bits (62), Expect = 4.6
Identities = 22/92 (23%), Positives = 30/92 (32%), Gaps = 3/92 (3%)
Query: 115 EEIRELAKEETSELTKEKTRELAKEETRELANEETSELAKEETSELAKEKPRELAKEETR 174
+E+RE E EL K E E E A + KEE +L E R +
Sbjct: 2 KELREALAELAKELRKLTEDEKLAEAEEEKAEYDA---LKEEIDKLDAEIDRLEELLDEL 58
Query: 175 ELAKEKTRELAKEETRELAKEKTRELAKEKTR 206
E + E E E + + R
Sbjct: 59 EAKPAASGEGGGGEEEEEEAKAEAAEFRAYLR 90
Score = 26.9 bits (60), Expect = 9.4
Identities = 15/83 (18%), Positives = 24/83 (28%)
Query: 74 KEKTSELAKEETSELAKEKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKT 133
+E +ELAKE EK +E+ A ++ + EI L + K
Sbjct: 5 REALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAA 64
Query: 134 RELAKEETRELANEETSELAKEE 156
E +
Sbjct: 65 SGEGGGGEEEEEEAKAEAAEFRA 87
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 29.7 bits (67), Expect = 1.6
Identities = 47/179 (26%), Positives = 74/179 (41%)
Query: 41 AKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKTRALAKEE 100
K E + L +E ++ + E EE E +E L + E + E+ A KEE
Sbjct: 721 LKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEE 780
Query: 101 TRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSELAKEETSEL 160
EL +++ + E E EE REL E R E+ E +EE EL
Sbjct: 781 IEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEEL 840
Query: 161 AKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTRELAKEEIRELAKE 219
++ + E E E+ +E +E E + + E+ +E +EE+REL E
Sbjct: 841 EEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESE 899
Score = 28.9 bits (65), Expect = 2.3
Identities = 47/188 (25%), Positives = 74/188 (39%), Gaps = 2/188 (1%)
Query: 33 PEERGDKSAKEETSELAKE-ETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKE 91
+R + K E EL +E E EK E K +EL + E +
Sbjct: 657 RNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELER 716
Query: 92 KTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSE 151
+ L +E + + + EE+ E +E EL + + R EE E E ++
Sbjct: 717 QLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAK 776
Query: 152 LAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTRELAKE 211
L KEE EL +++ + E E E+ REL + R E+ E +E
Sbjct: 777 L-KEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEE 835
Query: 212 EIRELAKE 219
EI EL ++
Sbjct: 836 EIEELEEK 843
Score = 27.8 bits (62), Expect = 5.7
Identities = 37/172 (21%), Positives = 63/172 (36%), Gaps = 1/172 (0%)
Query: 50 KEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKTRALAKEETRELAKEKT 109
E L +E+ EL E + + + E E E+ L +E + LA+ +
Sbjct: 300 LEGEISLLRERLEELENELEELEERLEELKEKIEALKEEL-EERETLLEELEQLLAELEE 358
Query: 110 RELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSELAKEETSELAKEKPRELA 169
+ EE EE EL + ELA+ E E K E L + R
Sbjct: 359 AKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSE 418
Query: 170 KEETRELAKEKTRELAKEETRELAKEKTRELAKEKTRELAKEEIRELAKEET 221
+ E + ++ +E EL + E+ E ++ ++EL +E
Sbjct: 419 RLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELA 470
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 28.6 bits (64), Expect = 1.8
Identities = 16/63 (25%), Positives = 26/63 (41%), Gaps = 1/63 (1%)
Query: 36 RGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKTRA 95
R S+ +E E +E T E +E E K E+ E K + + E+ + K
Sbjct: 89 RELSSSDDEEEETEEESTDETEQEDPPE-TKTESKEKKKREVPKPKTEKEKPKTEPKKPK 147
Query: 96 LAK 98
+K
Sbjct: 148 PSK 150
>gnl|CDD|183341 PRK11858, aksA, trans-homoaconitate synthase; Reviewed.
Length = 378
Score = 29.0 bits (66), Expect = 2.0
Identities = 18/49 (36%), Positives = 27/49 (55%), Gaps = 3/49 (6%)
Query: 174 RELAKEKTRELAKEETRELAK---EKTRELAKEKTRELAKEEIRELAKE 219
R K K +E E +RE EK +EL++ K R L EE++EL ++
Sbjct: 322 RHALKNKLKEYGIELSREELCELLEKVKELSERKKRSLTDEELKELVED 370
>gnl|CDD|237360 PRK13354, PRK13354, tyrosyl-tRNA synthetase; Provisional.
Length = 410
Score = 29.1 bits (66), Expect = 2.0
Identities = 24/94 (25%), Positives = 44/94 (46%), Gaps = 3/94 (3%)
Query: 112 LAKEEIRELAKEETSELTK-EKTRELAKEETRELANEETSELAKEETSELAKEKPRELAK 170
L+ +EI EL + +E + + LA+E T+ + EE +E A++ L + L
Sbjct: 273 LSPDEIDELEAQLETEPNPRDAKKVLAEEITKFVHGEEAAEEAEKIFKALFSGDVKPLKD 332
Query: 171 EETRELAKEKTR--ELAKEETRELAKEKTRELAK 202
T E++ E +L + E +K + R L +
Sbjct: 333 IPTFEVSAETKNLVDLLVDLGLEPSKREARRLIQ 366
Score = 27.2 bits (61), Expect = 7.6
Identities = 24/94 (25%), Positives = 45/94 (47%), Gaps = 3/94 (3%)
Query: 48 LAKEETSELAKEKTSELAKEETS-ELAKEKTSELAKEETSELAKEKTRALAKEETRELAK 106
L+ +E EL + +E + LA+E T + EE +E A++ +AL + + L
Sbjct: 273 LSPDEIDELEAQLETEPNPRDAKKVLAEEITKFVHGEEAAEEAEKIFKALFSGDVKPLKD 332
Query: 107 EKTRELAKEE--IRELAKEETSELTKEKTRELAK 138
T E++ E + +L + E +K + R L +
Sbjct: 333 IPTFEVSAETKNLVDLLVDLGLEPSKREARRLIQ 366
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 28.9 bits (64), Expect = 2.3
Identities = 23/52 (44%), Positives = 31/52 (59%), Gaps = 5/52 (9%)
Query: 158 SELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTRELA 209
S+LAK+ RE A E+ + A++K RE E RE KEK RE +E+ E A
Sbjct: 575 SKLAKK--REEAVEKAKREAEQKARE---EREREKEKEKEREREREREAERA 621
Score = 28.1 bits (62), Expect = 5.0
Identities = 23/57 (40%), Positives = 34/57 (59%), Gaps = 5/57 (8%)
Query: 167 ELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTRELAKEEIRELAKEETNS 223
+LAK+ RE A EK + A+++ RE E+ RE KEK RE +E E A + ++S
Sbjct: 576 KLAKK--REEAVEKAKREAEQKARE---EREREKEKEKEREREREREAERAAKASSS 627
Score = 27.7 bits (61), Expect = 6.1
Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 3/44 (6%)
Query: 150 SELAK--EETSELAKEKPRELAKEETRELAKEKTRELAKEETRE 191
S+LAK EE E AK + + A+EE RE KEK +E +E RE
Sbjct: 575 SKLAKKREEAVEKAKREAEQKAREE-REREKEKEKERERERERE 617
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 28.8 bits (64), Expect = 2.7
Identities = 47/203 (23%), Positives = 70/203 (34%), Gaps = 11/203 (5%)
Query: 32 QPEERGDKSAKEETSELAKE----ETSELAKEKTSELAKEETSELAKEKTSELAKEETSE 87
+ E G+++ +E E E S E+ E + +E E +E E+ E
Sbjct: 637 EAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGE 696
Query: 88 LAKEKTRALAKEETRELAKEKTRELAKE------EIRELAKEETSELTKEKTRELAKEET 141
E A K ET E E E EI + E E E E E
Sbjct: 697 GEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVE 756
Query: 142 RELANEETSELAKEETSELAKEKPRELAKEETRELAKEKTRELAKE-ETRELAKEKTREL 200
E +ET + E E E+ E E+ ++ E E E A EK
Sbjct: 757 TEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHE 816
Query: 201 AKEKTRELAKEEIRELAKEETNS 223
+ +T+ E E ++E N+
Sbjct: 817 GQSETQADDTEVKDETGEQELNA 839
>gnl|CDD|239472 cd03377, TPP_PFOR_PNO, Thiamine pyrophosphate (TPP family),
PFOR_PNO subfamily, TPP-binding module; composed of
proteins similar to the single subunit pyruvate
ferredoxin oxidoreductase (PFOR) of Desulfovibrio
Africanus, present in bacteria and amitochondriate
eukaryotes. This subfamily also includes proteins
characterized as pyruvate NADP+ oxidoreductase (PNO).
These enzymes are dependent on TPP and a divalent metal
cation as cofactors. PFOR and PNO catalyze the oxidative
decarboxylation of pyruvate to form acetyl-CoA, a
crucial step in many metabolic pathways. Archaea,
anaerobic bacteria and eukaryotes that lack mitochondria
(and therefore pyruvate dehydrogenase) use PFOR to
oxidatively decarboxylate pyruvate, with ferredoxin or
flavodoxin as the electron acceptor. The PFOR from
cyanobacterium Anabaena (NifJ) is required for the
transfer of electrons from pyruvate to flavodoxin, which
reduces nitrogenase. The facultative anaerobic
mitochondrion of the photosynthetic protist Euglena
gracilis oxidizes pyruvate with PNO.
Length = 365
Score = 28.7 bits (65), Expect = 2.8
Identities = 20/65 (30%), Positives = 30/65 (46%), Gaps = 6/65 (9%)
Query: 104 LAKEKTRELAKEEIRELAKEETSELTKEKTRELAK-----EETRELANEETSELAKEETS 158
LA ++ RE A+E +++L ++ E K EE+RE + L E
Sbjct: 78 LAVDQRRERARELVQKLIEKIGDEELKTLLNAWLATEDDIEESRERV-AKLKPLLAAEKD 136
Query: 159 ELAKE 163
ELAKE
Sbjct: 137 ELAKE 141
Score = 27.2 bits (61), Expect = 6.7
Identities = 20/66 (30%), Positives = 32/66 (48%), Gaps = 6/66 (9%)
Query: 95 ALAKEETRELAKEKTRELAKEEIRELAKE-----ETSELTKEKTRELAKEETRELANEET 149
LA ++ RE A+E ++L ++ E K +E E++RE + + L E
Sbjct: 77 RLAVDQRRERARELVQKLIEKIGDEELKTLLNAWLATEDDIEESRERV-AKLKPLLAAEK 135
Query: 150 SELAKE 155
ELAKE
Sbjct: 136 DELAKE 141
Score = 26.8 bits (60), Expect = 9.6
Identities = 20/68 (29%), Positives = 31/68 (45%), Gaps = 4/68 (5%)
Query: 128 LTKEKTRELAKEETRELANEETSELAKEETSELAKEKPRELAKEETRELAKEKTRELAKE 187
L ++ RE A+E ++L + E K + + EE+RE + LA E
Sbjct: 78 LAVDQRRERARELVQKLIEKIGDEELKTLLNAWLATEDD---IEESRERVAKLKPLLAAE 134
Query: 188 ETRELAKE 195
+ ELAKE
Sbjct: 135 KD-ELAKE 141
>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 187 and 201 amino
acids in length. There is a single completely conserved
residue Q that may be functionally important.
Length = 158
Score = 27.5 bits (62), Expect = 3.3
Identities = 16/70 (22%), Positives = 27/70 (38%), Gaps = 6/70 (8%)
Query: 39 KSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKTRALAK 98
KS K+ +ELA + EL + + + KEK SE + + +A
Sbjct: 47 KSIKQHLAELAADLL-ELEAAAPRAELQAKIARYKKEK-----ARYRSEAKELEAKAKEA 100
Query: 99 EETRELAKEK 108
E + A +
Sbjct: 101 EAESDHALHQ 110
>gnl|CDD|173935 cd08176, LPO, Lactadehyde:propanediol oxidoreductase (LPO)
catalyzes the interconversion between L-lactaldehyde and
L-1,2-propanediol in Escherichia coli and other
enterobacteria. Lactadehyde:propanediol oxidoreductase
(LPO) is a member of the group III iron-activated
dehydrogenases which catalyze the interconversion
between L-lactaldehyde and L-1,2-propanediol in
Escherichia coli and other enterobacteria. L-Fucose and
L-rhamnose is used by Escherichia coli through an
inducible pathway mediated by the fucose regulon
comprising four linked oeprons fucO, fucA, fucPIK, and
fucR. The fucA-encoded aldolase catalyzes the formation
of dihydroxyacetone phosphate and L-lactaldehyde. Under
anaerobic conditions, with NADH as a cofactor,
lactaldehyde is converted by a fucO-encoded
Lactadehyde:propanediol oxidoreductase (LPO) to
L-1,2-propanediol, which is excreted as a fermentation
product. In mutant strains, E. coli adapted to grow on
L-1,2-propanediol, FucO catalyzes the oxidation of the
polyol to L-lactaldehyde. FucO is induced regardless of
the respiratory conditions of the culture, remains fully
active in the absence of oxygen. In the presence of
oxygen, this enzyme becomes oxidatively inactivated by a
metal-catalyzed oxidation mechanism. FucO is an
iron-dependent metalloenzyme that is inactivated by
other metals, such as zinc, copper, or cadmium. This
enzyme can also reduces glycol aldehyde with similar
efficiency. Beside L-1,2-propanediol, the enzyme is
also able to oxidize methanol as alternative substrates.
Length = 377
Score = 28.3 bits (64), Expect = 3.3
Identities = 10/44 (22%), Positives = 22/44 (50%), Gaps = 3/44 (6%)
Query: 83 EETSELAKEKTRALAK---EETRELAKEKTRELAKEEIRELAKE 123
E + E+ +A+ +T L+ E+ E A + +R L+++
Sbjct: 287 EYNAPACPERFADIAEAMGVDTEGLSDEEAAEAAIDAVRALSED 330
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 28.6 bits (65), Expect = 3.3
Identities = 43/136 (31%), Positives = 63/136 (46%), Gaps = 9/136 (6%)
Query: 47 ELAKEETSELAKEKTSEL-AKEETSELAKEKTSELAKEETSELAKEKTRALAK----EET 101
E AK+ E KEK +EL A E E E+ +E A+ E K K K +E
Sbjct: 505 EEAKKLIGE-DKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEE 563
Query: 102 RELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSELAKE-ETSEL 160
+ E+ + A++ I+E AK+E E+ KE R+L K + E E K +
Sbjct: 564 EDKLLEEAEKEAQQAIKE-AKKEADEIIKE-LRQLQKGGYASVKAHELIEARKRLNKANE 621
Query: 161 AKEKPRELAKEETREL 176
KEK ++ KE+ EL
Sbjct: 622 KKEKKKKKQKEKQEEL 637
>gnl|CDD|235218 PRK04081, PRK04081, hypothetical protein; Provisional.
Length = 207
Score = 27.8 bits (62), Expect = 3.7
Identities = 16/42 (38%), Positives = 22/42 (52%), Gaps = 10/42 (23%)
Query: 99 EETRELAKEKT---RELAKEEIRELAKEE-------TSELTK 130
ETR + ++ R L++EEI +L KEE TS LT
Sbjct: 70 SETRVVLRDLDGTERVLSQEEIDKLIKEEEAKIDNGTSNLTN 111
>gnl|CDD|130163 TIGR01091, upp, uracil phosphoribosyltransferase. A fairly deep
split in phylogenetic and UPGMA trees separates this
mostly prokaryotic set of uracil
phosphoribosyltransferases from a mostly eukaryotic set
that includes uracil phosphoribosyltransferase, uridine
kinases, and other, uncharacterized proteins [Purines,
pyrimidines, nucleosides, and nucleotides, Salvage of
nucleosides and nucleotides].
Length = 207
Score = 27.6 bits (62), Expect = 3.9
Identities = 20/64 (31%), Positives = 30/64 (46%), Gaps = 6/64 (9%)
Query: 72 LAKEKTSELAKEETSELAKEKTRALAKEETRELAKEKTRELAKEEIR---ELAKEETSEL 128
L K K + L + T ++ R L +E R LA E TR+L EE+ L + E +
Sbjct: 9 LIKHKLTLLRDKNTDT---KEFRELLRELGRLLAYEATRDLELEEVEVETPLGETEGGRI 65
Query: 129 TKEK 132
+K
Sbjct: 66 LGKK 69
>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
Length = 455
Score = 27.9 bits (62), Expect = 4.3
Identities = 31/107 (28%), Positives = 45/107 (42%), Gaps = 7/107 (6%)
Query: 36 RGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKTRA 95
R D + E L + E EKT + E +++ +L KEE S K A
Sbjct: 351 RTDNTCGPEPPLLERLEKDVEEGEKTIVKEARQIEEELEKEVEKLGKEEESLF---KRVA 407
Query: 96 LAK--EETRELAKEKTRELAKEEIREL--AKEETSELTKEKTRELAK 138
L + +E + + +EL+KEE L K E SE+ K R L
Sbjct: 408 LEEGLKELEQDEENFLKELSKEEKELLEKLKMEASEVEKLFGRALPV 454
>gnl|CDD|234271 TIGR03591, polynuc_phos, polyribonucleotide nucleotidyltransferase.
Members of this protein family are polyribonucleotide
nucleotidyltransferase, also called polynucleotide
phosphorylase. Some members have been shown also to have
additional functions as guanosine pentaphosphate
synthetase and as poly(A) polymerase (see model
TIGR02696 for an exception clade, within this family)
[Transcription, Degradation of RNA].
Length = 684
Score = 27.8 bits (63), Expect = 4.6
Identities = 29/103 (28%), Positives = 44/103 (42%), Gaps = 1/103 (0%)
Query: 97 AKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSELAKEE 156
A EE E A ++ E E+ E K + EL E + A + T + + KEE
Sbjct: 209 AIEELAEEAGKEKWEFVAPEVDEELKAKVKELAYEAGLKEAYQITEKQERYAALDAIKEE 268
Query: 157 TSELAKEKPRELAKEETRELAKEKTRELAKEETREL-AKEKTR 198
E E+ E + + + KE ++L K+ RE KE R
Sbjct: 269 VLEALAEEEEEEEEADREKEIKEAFKDLEKKIVRERILKEGKR 311
>gnl|CDD|235206 PRK04031, PRK04031, DNA primase; Provisional.
Length = 408
Score = 27.9 bits (63), Expect = 4.8
Identities = 22/106 (20%), Positives = 46/106 (43%), Gaps = 1/106 (0%)
Query: 43 EETSELAKEETSELAKEKTS-ELAKEETSELAKEKTSELAKEETSELAKEKTRALAKEET 101
+E EL K+E ++ + K E EE + A++ ++ +EE + + +E+
Sbjct: 246 KEVEELTKKEIAKALRNKVPVEQYLEELGKKAQKAAEKVKEEEEKPEKEPAEQPEPEEKE 305
Query: 102 RELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANE 147
+ E +E I+EL + L E + + R+L ++
Sbjct: 306 PAPVPAEKEETVREHIKELKGTLEARLLDENWNVIKEVPVRDLVDK 351
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 28.0 bits (62), Expect = 4.9
Identities = 31/180 (17%), Positives = 76/180 (42%), Gaps = 7/180 (3%)
Query: 40 SAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKTRALAKE 99
S K+ L ++ + + KE S+ ++ ++ KEE + + +A K
Sbjct: 178 SDKKVVEALREDNEKGVNFRRDMTDLKERESQEDAKR-AQQLKEELDKKQIDADKAQQKA 236
Query: 100 ETRELAKEKTRELAKEEIRELAKEETSELTKEKT--RELAKEETRELANEETSELAKEET 157
+ + +K R+ +++ +E T +++A+ + RE+ + +E
Sbjct: 237 DFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEE 296
Query: 158 SELAKEKPRELAKEETRELAK--EKTRELAKEETRELAK--EKTRELAKEKTRELAKEEI 213
+ AK+ K+E++ K E A+++ +A+ +KT+ + + L ++ I
Sbjct: 297 ALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAEDLQKTKPQVEAQPTSLNEDAI 356
>gnl|CDD|237205 PRK12792, flhA, flagellar biosynthesis protein FlhA; Reviewed.
Length = 694
Score = 27.8 bits (62), Expect = 4.9
Identities = 13/35 (37%), Positives = 17/35 (48%), Gaps = 1/35 (2%)
Query: 165 PRELAKEETRELAKEKTREL-AKEETRELAKEKTR 198
PR A E AK K E A+ E ++ KE+ R
Sbjct: 324 PRRRAARAAAEAAKVKREEESAQAEAKDSVKEQLR 358
>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
Length = 535
Score = 27.9 bits (62), Expect = 5.2
Identities = 42/136 (30%), Positives = 70/136 (51%), Gaps = 8/136 (5%)
Query: 86 SELAKEKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELA 145
+E E + AK E++ L KE E AKEE R+ +EE + K + +EL + E+R
Sbjct: 51 AERDAEHIKKTAKRESKALKKELLLE-AKEEARKY-REEIEQEFKSERQELKQIESR--L 106
Query: 146 NEETSELAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKT 205
E + L +++ + +KEK L +E K K + +E+ +L ++K EL E+
Sbjct: 107 TERATSLDRKDENLSSKEK--TLESKEQSLTDKSKHIDEREEQVEKLEEQKKAEL--ERV 162
Query: 206 RELAKEEIRELAKEET 221
L++ E RE+ ET
Sbjct: 163 AALSQAEAREIILAET 178
Score = 27.9 bits (62), Expect = 5.4
Identities = 38/144 (26%), Positives = 67/144 (46%), Gaps = 4/144 (2%)
Query: 78 SELAKEETSELAKEKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTR--E 135
+E E + AK +++AL KE E AKE+ R+ +EEI + K E EL + ++R E
Sbjct: 51 AERDAEHIKKTAKRESKALKKELLLE-AKEEARKY-REEIEQEFKSERQELKQIESRLTE 108
Query: 136 LAKEETRELANEETSELAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKE 195
A R+ N + E E + +K + + + E + E+ ++ E L++
Sbjct: 109 RATSLDRKDENLSSKEKTLESKEQSLTDKSKHIDEREEQVEKLEEQKKAELERVAALSQA 168
Query: 196 KTRELAKEKTRELAKEEIRELAKE 219
+ RE+ +T EI +E
Sbjct: 169 EAREIILAETENKLTHEIATRIRE 192
Score = 27.5 bits (61), Expect = 5.8
Identities = 42/151 (27%), Positives = 77/151 (50%), Gaps = 12/151 (7%)
Query: 35 ERGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAK------EKTSELAKEETSEL 88
+R K+ K+E AKEE + +E E K E EL + E+ + L +++ +
Sbjct: 63 KRESKALKKELLLEAKEEARKYREEIEQEF-KSERQELKQIESRLTERATSLDRKDENLS 121
Query: 89 AKEKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEE 148
+KEKT L +E K K + +E++ +L +++ +EL E+ L++ E RE+ E
Sbjct: 122 SKEKT--LESKEQSLTDKSKHIDEREEQVEKLEEQKKAEL--ERVAALSQAEAREIILAE 177
Query: 149 TSELAKEETSELAKEKPRELAKEETRELAKE 179
T E + +E RE+ K+ + ++AK+
Sbjct: 178 TENKLTHEIATRIREAEREV-KDRSDKMAKD 207
>gnl|CDD|234801 PRK00591, prfA, peptide chain release factor 1; Validated.
Length = 359
Score = 27.7 bits (63), Expect = 5.4
Identities = 19/56 (33%), Positives = 28/56 (50%), Gaps = 2/56 (3%)
Query: 79 ELAKE--ETSELAKEKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEK 132
+L+KE E + + +E E AKE E + E+RE+AKEE EL +
Sbjct: 36 KLSKEYAELEPIVEAYREYKQAQEDLEEAKEMLEEESDPEMREMAKEELKELEERL 91
Score = 27.7 bits (63), Expect = 5.4
Identities = 21/67 (31%), Positives = 35/67 (52%), Gaps = 3/67 (4%)
Query: 107 EKTRELAKE--EIRELAKEETSELTKEKTRELAKEETRELANEETSELAKEETSELAKEK 164
++ R+L+KE E+ + + ++ E AKE E ++ E E+AKEE EL +
Sbjct: 32 KRFRKLSKEYAELEPIVEAYREYKQAQEDLEEAKEMLEEESDPEMREMAKEELKELEERL 91
Query: 165 PRELAKE 171
EL +E
Sbjct: 92 E-ELEEE 97
Score = 27.3 bits (62), Expect = 7.0
Identities = 19/56 (33%), Positives = 30/56 (53%), Gaps = 2/56 (3%)
Query: 166 RELAKE--ETRELAKEKTRELAKEETRELAKEKTRELAKEKTRELAKEEIRELAKE 219
R+L+KE E + + +E E AKE E + + RE+AKEE++EL +
Sbjct: 35 RKLSKEYAELEPIVEAYREYKQAQEDLEEAKEMLEEESDPEMREMAKEELKELEER 90
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 27.7 bits (62), Expect = 5.5
Identities = 45/196 (22%), Positives = 77/196 (39%), Gaps = 6/196 (3%)
Query: 29 ETLQPEERGDKSAKEETSELAKEETSELAK-EKTSELAKEETSELAKEKTSELAKEETSE 87
E L+ E + EE S +LA+ E E +E ++L+KE T A+ E E
Sbjct: 708 EELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELE 767
Query: 88 LAKEKTRALAKEETRELAKEKTR----ELAKEEIRELAKEETSELTKEKTRELAKEETRE 143
E+ E E+ + + + + + +RE E +ELT E A R
Sbjct: 768 ERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLN-EEAANLRERL 826
Query: 144 LANEETSELAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKE 203
+ E + +L ++ E+ E+ EL +E EL + E
Sbjct: 827 ESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLE 886
Query: 204 KTRELAKEEIRELAKE 219
+ L + E+ EL++E
Sbjct: 887 EALALLRSELEELSEE 902
Score = 26.9 bits (60), Expect = 9.8
Identities = 35/183 (19%), Positives = 69/183 (37%), Gaps = 5/183 (2%)
Query: 38 DKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKTRALA 97
+ + E K+ E +L + E E + EE +EL ++ L
Sbjct: 292 ALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEE-LK 350
Query: 98 KEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETSELAKE-E 156
+E A+ + E EE+ +E +L +++ E N E L E
Sbjct: 351 EELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLE 410
Query: 157 TSELAKEKPRELAKEETREL---AKEKTRELAKEETRELAKEKTRELAKEKTRELAKEEI 213
E +E+ ++ +E ++L ++ + +E EL + + E+ E +EE+
Sbjct: 411 RLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREEL 470
Query: 214 REL 216
E
Sbjct: 471 EEA 473
>gnl|CDD|184061 PRK13454, PRK13454, F0F1 ATP synthase subunit B'; Provisional.
Length = 181
Score = 27.1 bits (60), Expect = 5.6
Identities = 26/92 (28%), Positives = 38/92 (41%), Gaps = 2/92 (2%)
Query: 37 GDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKTRAL 96
D +A EE + A E +E A K A+ E + E +E+ E +AK
Sbjct: 69 NDLAAAEELKQKAVE--AEKAYNKALADARAEAQRIVAETRAEIQAELDVAIAKADAEIA 126
Query: 97 AKEETRELAKEKTRELAKEEIRELAKEETSEL 128
AK E + R A E + E+AK+ L
Sbjct: 127 AKAAESEKRIAEIRAGALESVEEVAKDTAEAL 158
>gnl|CDD|189027 cd09857, PIN_EXO1, PIN domain of Exonuclease-1, a
structure-specific, divalent-metal-ion dependent, 5'
nuclease and homologs. Exonuclease-1 (EXO1) is involved
in multiple, eukaryotic DNA metabolic pathways,
including DNA replication processes (5' flap DNA
endonuclease activity and double stranded DNA
5'-exonuclease activity), DNA repair processes (DNA
mismatch repair (MMR) and post-replication repair
(PRR)), recombination, and telomere integrity. EXO1
functions in the MMS2 error-free branch of the PRR
pathway in the maintenance and repair of stalled
replication forks. Studies also suggest that EXO1 plays
both structural and catalytic roles during MMR-mediated
mutation avoidance. EXO1 belongs to the FEN1-EXO1-like
family of structure-specific, 5' nucleases. These
nucleases contain a PIN (PilT N terminus) domain with a
helical arch/clamp region (I domain) of variable length
(approximately 43 residues in EXO1 PIN domains) and a
H3TH (helix-3-turn-helix) domain, an atypical
helix-hairpin-helix-2-like region. Both the H3TH domain
(not included here) and the helical arch/clamp region
are involved in DNA binding. Nucleases within this group
also have a carboxylate-rich active site that is
involved in binding essential divalent metal ion
cofactors (Mg2+/Mn2+). EXO1 nucleases also have
C-terminal Mlh1- and Msh2-binding domains which allow
interaction with MMR and PRR proteins, respectively.
Length = 210
Score = 27.1 bits (61), Expect = 5.7
Identities = 21/64 (32%), Positives = 30/64 (46%), Gaps = 8/64 (12%)
Query: 153 AKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEK---TRELAKEKTRELA 209
+K+ T E+ R +EE E A E RE + E RE + T E+A E + L
Sbjct: 84 SKKGT-----EEERRERREENLEKALELLREGNRSEARECFQRAVDITPEMAHELIKALR 138
Query: 210 KEEI 213
KE +
Sbjct: 139 KEGV 142
>gnl|CDD|145928 pfam03040, CemA, CemA family. Members of this family are probable
integral membrane proteins. Their molecular function is
unknown. CemA proteins are found in the inner envelope
membrane of chloroplasts but not in the thylakoid
membrane. A cyanobacterial member of this family has
been implicated in CO2 transport, but is probably not a
CO2 transporter itself. They are predicted to be
haem-binding however this has not been proven
experimentally.
Length = 230
Score = 27.1 bits (61), Expect = 5.9
Identities = 15/56 (26%), Positives = 25/56 (44%), Gaps = 4/56 (7%)
Query: 98 KEETRELAKEKTRELAKEEIRELAK--EETSELTKEKTRELAKEETRELANEETSE 151
E A EK +E EE+ EL + +E+ + ++ R ++ ELA E
Sbjct: 50 NSIQEERALEKLQEF--EELLELDELIKESPPTSLQELRIEIHKKAIELAKIYNEE 103
>gnl|CDD|236995 PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase;
Provisional.
Length = 693
Score = 27.7 bits (63), Expect = 6.4
Identities = 17/66 (25%), Positives = 28/66 (42%), Gaps = 2/66 (3%)
Query: 159 ELAKE--KPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTRELAKEEIREL 216
ELA E E E E K +ELA+ + +E + ++ + + +E + L
Sbjct: 221 ELAAEAGPKWEWQPPEVDEELKAAVKELAEAKLKEAYQITDKQEREAALDAIKEEVLEAL 280
Query: 217 AKEETN 222
A EE
Sbjct: 281 AAEEEE 286
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 27.4 bits (61), Expect = 6.7
Identities = 22/119 (18%), Positives = 42/119 (35%), Gaps = 3/119 (2%)
Query: 91 EKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETS 150
+ T E+A E+ + + + ++L +E+ + LA+ + A+++
Sbjct: 167 KATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKL 226
Query: 151 ELAKEETSELAKEKPR---ELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTR 206
E + S L E AK A E A+ + E + A EK
Sbjct: 227 EELRANESRLKNEIASAEAAAAKAREAAAAAEAAAARARAAEAKRTGETYKPTAPEKML 285
>gnl|CDD|202278 pfam02545, Maf, Maf-like protein. Maf is a putative inhibitor of
septum formation in eukaryotes, bacteria, and archaea.
Length = 193
Score = 26.9 bits (60), Expect = 7.1
Identities = 20/68 (29%), Positives = 31/68 (45%), Gaps = 5/68 (7%)
Query: 155 EETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKE-----KTRELAKEKTRELA 209
+E S L PRE + E A K +LA + + + R L K K +E A
Sbjct: 31 DEESVLYSLDPREYVVDLACEKALAKVAQLAPDNALIIGADTVVILGGRILGKPKDKEEA 90
Query: 210 KEEIRELA 217
+E +R+L+
Sbjct: 91 REMLRKLS 98
>gnl|CDD|215579 PLN03106, TCP2, Protein TCP2; Provisional.
Length = 447
Score = 27.3 bits (60), Expect = 7.7
Identities = 13/42 (30%), Positives = 23/42 (54%)
Query: 150 SELAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRE 191
S+ A TS+ +K L++ E R+ A+E+ RE +E +
Sbjct: 183 SKSACSSTSDTSKGSGLSLSRSELRDKARERARERTAKEKEK 224
>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 548
Score = 27.1 bits (60), Expect = 7.8
Identities = 34/191 (17%), Positives = 66/191 (34%), Gaps = 8/191 (4%)
Query: 31 LQPEERGDKSAKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAK 90
+ E + ++L + E + KT+E +E LA+ + A + +
Sbjct: 230 AEKETEIAIAEANRDAKLVELEVEQQPAGKTAEQTREVKIILAETEAEVAAWKAETRREA 289
Query: 91 EKTRALAKEETRELAKEKTRELAKEEIRELAKEETSELTKEKTRELAKEETRELANEETS 150
E+ LA E A ++ + A++E++ E E+ + E
Sbjct: 290 EQAEILA-----EQAIQEEKAQAEQEVQHAKALEAREMRVGLIERQKETELEPQERSYFI 344
Query: 151 ELAKEETSELAKEKPRELAKEETRELAKEKTRELAKEETRELAKEKTRELAKEKTRELAK 210
A+ + E AK + A +T A+E E+ +A + E +
Sbjct: 345 NAAQRQAQEEAKAAANIAEAIGAQAEAAVET---ARETEEAERAEQAALVAAAEAAEQEQ 401
Query: 211 EEIRELAKEET 221
EI A+
Sbjct: 402 VEIAVRAEAAK 412
>gnl|CDD|233149 TIGR00845, caca, sodium/calcium exchanger 1. The Ca2+:Cation
Antiporter (CaCA) Family (TC 2.A.19)Proteins of the CaCA
family are found ubiquitously, having been identified in
animals, plants, yeast, archaea and widely divergent
bacteria.All of the characterized animal proteins
catalyze Ca2+:Na+ exchange although some also transport
K+. The NCX1 plasma membrane protein exchanges 3 Na+ for
1 Ca2+. The E. coli ChaA protein catalyzes Ca2+:H+
antiport but may also catalyze Na+:H+ antiport. All
remaining well-characterized members of the family
catalyze Ca2+:H+ exchange.This model is specific for the
eukaryotic sodium ion/calcium ion exchangers of the Caca
family [Transport and binding proteins, Other].
Length = 928
Score = 27.4 bits (61), Expect = 7.8
Identities = 20/74 (27%), Positives = 27/74 (36%), Gaps = 2/74 (2%)
Query: 59 EKTSELAKEETSELAKEKTSELAKEETSELAKEKTRALAKEETRELAKEKTRELAKEEIR 118
EKT + + E K T + E + AL ET +K +EE R
Sbjct: 593 EKTIRVKIVDDEEYEKNDTFFIELGEPRWAKRGIKAALLLNETITDDDQKLTSKEEEERR 652
Query: 119 --ELAKEETSELTK 130
E+ K E TK
Sbjct: 653 IAEMGKPRLGEHTK 666
>gnl|CDD|216599 pfam01603, B56, Protein phosphatase 2A regulatory B subunit (B56
family). Protein phosphatase 2A (PP2A) is a major
intracellular protein phosphatase that regulates
multiple aspects of cell growth and metabolism. The
ability of this widely distributed heterotrimeric enzyme
to act on a diverse array of substrates is largely
controlled by the nature of its regulatory B subunit.
There are multiple families of B subunits (See also
pfam01240), this family is called the B56 family.
Length = 405
Score = 27.2 bits (61), Expect = 7.9
Identities = 11/37 (29%), Positives = 16/37 (43%)
Query: 86 SELAKEKTRALAKEETRELAKEKTRELAKEEIRELAK 122
+L E ++E +E KEK R+ I E A
Sbjct: 369 PDLFDECGNKYKEKEAKEKKKEKKRKKTWARIAEAAA 405
>gnl|CDD|182595 PRK10624, PRK10624, L-1,2-propanediol oxidoreductase; Provisional.
Length = 382
Score = 27.3 bits (61), Expect = 7.9
Identities = 15/58 (25%), Positives = 28/58 (48%), Gaps = 10/58 (17%)
Query: 171 EETRELAK---EKTRELAKEETRELAKEKTRELAK-----EKTRELA--KEEIRELAK 218
E+ R++A+ K ++ EE R A E + L + R++ +E+I LA+
Sbjct: 297 EKYRDIARAMGVKVEGMSLEEARNAAVEAVKALNRDVGIPPHLRDVGVKEEDIPALAQ 354
>gnl|CDD|234070 TIGR02956, TMAO_torS, TMAO reductase sytem sensor TorS. This
protein, TorS, is part of a regulatory system for the
torCAD operon that encodes the pterin molybdenum
cofactor-containing enzyme trimethylamine-N-oxide (TMAO)
reductase (TorA), a cognate chaperone (TorD), and a
penta-haem cytochrome (TorC). TorS works together with
the inducer-binding protein TorT and the response
regulator TorR. TorS contains histidine kinase ATPase
(pfam02518), HAMP (pfam00672), phosphoacceptor
(pfam00512), and phosphotransfer (pfam01627) domains and
a response regulator receiver domain (pfam00072) [Signal
transduction, Two-component systems].
Length = 968
Score = 27.4 bits (61), Expect = 8.0
Identities = 21/80 (26%), Positives = 35/80 (43%), Gaps = 2/80 (2%)
Query: 41 AKEETSELAKEETSELAKEKTSELAKEETSELAKEKTSELAKEETSELAKEKTRALAKEE 100
++ + K + E + + KE +L ++T ELA+ A+ K A A+ E
Sbjct: 397 FRDTAAHNLKLQADERQVAQELQEHKESLEQLVAQRTQELAETNERLNAEVKNHAKARAE 456
Query: 101 TRELAKEKTRELA--KEEIR 118
E + K+ LA EIR
Sbjct: 457 AEEANRAKSAFLATMSHEIR 476
>gnl|CDD|217540 pfam03408, Foamy_virus_ENV, Foamy virus envelope protein.
Expression of the envelope (Env) glycoprotein is
essential for viral particle egress. This feature is
unique to the Spumavirinae, a subclass of the
Retroviridae.
Length = 985
Score = 27.4 bits (61), Expect = 8.3
Identities = 16/50 (32%), Positives = 22/50 (44%), Gaps = 3/50 (6%)
Query: 63 ELAKEETSELAKEKTSELAKEETSELAKEKTRALAKEETRELAKEKTREL 112
E +E EL + K L K +S+L K RAL + + REL
Sbjct: 326 ENTSQENEELLRTKLMNLTKGNSSQL---KERALPPTWSSQGKALLFREL 372
>gnl|CDD|237060 PRK12326, PRK12326, preprotein translocase subunit SecA; Reviewed.
Length = 764
Score = 27.2 bits (61), Expect = 8.9
Identities = 13/37 (35%), Positives = 23/37 (62%)
Query: 149 TSELAKEETSELAKEKPRELAKEETRELAKEKTRELA 185
++ A EE +ELA E+ ELA+E E+ ++ R++
Sbjct: 620 RTDTAWEELAELAPERYAELAEEVDEEVLEQAARQIM 656
>gnl|CDD|193531 cd05654, M20_ArgE_RocB, M20 Peptidase Arginine utilization protein,
RocB. Peptidase M20 family, ArgE RocB (Arginine
utilization protein, RocB; arginine degradation protein,
RocB) subfamily. This group of proteins is possibly
related to acetylornithine deacetylase (ArgE) and may be
involved in the arginine and/or ornithine degradation
pathway. In Bacillus subtilis, RocB is one of the three
genes found in the rocABC operon, which is sigma L
dependent and induced by arginine. The function of
members of this family is as yet unknown, although they
are predicted as deacetylases.
Length = 535
Score = 27.2 bits (61), Expect = 9.2
Identities = 23/83 (27%), Positives = 36/83 (43%), Gaps = 3/83 (3%)
Query: 59 EKTSELAKEETSELAKEKTSELAKEETSELAKEKTRALAKEETRELAKEKTRELAKEEIR 118
+K + A E T A EK L+ + L K + L EE E AK+K E ++ +
Sbjct: 310 KKIATEAFENTINEAYEKYCRLSNKPGDSLPW-KVKVLTYEELYEEAKKKGGEKFEKFLE 368
Query: 119 ELAKEETSELT--KEKTRELAKE 139
EL K+ +E T + +
Sbjct: 369 ELEKKLLDGELDLREATIRIVEA 391
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.291 0.111 0.263
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 10,647,707
Number of extensions: 1047896
Number of successful extensions: 7810
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4395
Number of HSP's successfully gapped: 1563
Length of query: 223
Length of database: 10,937,602
Length adjustment: 93
Effective length of query: 130
Effective length of database: 6,812,680
Effective search space: 885648400
Effective search space used: 885648400
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 17 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 44 (21.7 bits)
S2: 57 (25.8 bits)