RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy157
(347 letters)
>gnl|CDD|219906 pfam08565, CDC37_M, Cdc37 Hsp90 binding domain. Cdc37 is a
molecular chaperone required for the activity of
numerous eukaryotic protein kinases. This domains
corresponds to the Hsp90 chaperone (Heat shocked protein
90) binding domain of Cdc37. It is found between the N
terminal Cdc37 domain pfam03234, which is predominantly
involved in kinase binding, and the C terminal domain of
Cdc37 pfam08564 whose function is unclear.
Length = 173
Score = 116 bits (293), Expect = 1e-31
Identities = 65/176 (36%), Positives = 86/176 (48%), Gaps = 49/176 (27%)
Query: 128 NKKPPLQQGEKSDEEKEQEMKAFVKENEKLIK---------------------------- 159
NKK +K+DEE E +MK FV NEKL
Sbjct: 1 NKKKE---KDKTDEEDEDKMKKFVTTNEKLNSAVDKDDDDELKLSPATEKFGKIPAGDYK 57
Query: 160 ---------------QYANYLVIWCINLAMDEKFDLMDHVAHQCICMQYLLELSKSLNI- 203
Q + LV+ +L ++ K +LM VAHQ I MQY+LEL+KSL +
Sbjct: 58 DSEKFLSEHPHIVSEQTKDALVMEAFDLQLEGKEELMRQVAHQSILMQYILELAKSLKVD 117
Query: 204 -DPRACIGSFFSKI-QIAEPDYKASFDDELKSFKERIRNRAKEKLDEALKEVEEEE 257
R CI FF KI AEP YK +F DE+++F RI+ RA+E + EA++E EEEE
Sbjct: 118 PALRNCIRLFFKKIFAKAEPMYKEAFLDEVQAFFNRIKTRAQELIQEAMEEQEEEE 173
>gnl|CDD|215010 smart01070, CDC37_M, Cdc37 Hsp90 binding domain. Cdc37 is a
molecular chaperone required for the activity of
numerous eukaryotic protein kinases. This domains
corresponds to the Hsp90 chaperone (Heat shocked protein
90) binding domain of Cdc37. It is found between the N
terminal Cdc37 domain which is predominantly involved in
kinase binding, and the C terminal domain of Cdc37 whose
function is unclear.
Length = 155
Score = 112 bits (283), Expect = 2e-30
Identities = 52/157 (33%), Positives = 76/157 (48%), Gaps = 33/157 (21%)
Query: 123 TKTVVNKKPPLQQGEKSDEEKEQEMKAFVKENEKLIKQYA-------------------- 162
T+T +K PP S+EE E + K FV + E K++
Sbjct: 1 TETTSSKNPP---SAPSEEEDEIDDKDFVLKLEPATKKFGMIPAGDYDDSQKFLSEHPHI 57
Query: 163 ------NYLVIWCINLAMDEKFDLMDHVAHQCICMQYLLELSKSLNIDPRACIGSFFSKI 216
+ LV+W +L ++ K LM+ VAHQ I MQY+LEL+KSL +DPR C+ FF KI
Sbjct: 58 VSEETADALVMWAFDLELEGKEALMEQVAHQAILMQYILELAKSLKVDPRNCVRLFFQKI 117
Query: 217 ----QIAEPDYKASFDDELKSFKERIRNRAKEKLDEA 249
A+ + E + K R+R A+E+ +EA
Sbjct: 118 KTPSHPAKEGFLEDVQSEFEHIKTRVRIIAQEQAEEA 154
>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding. Cdc37 is a
molecular chaperone required for the activity of
numerous eukaryotic protein kinases. This domain
corresponds to the N terminal domain which binds
predominantly to protein kinases.and is found N terminal
to the Hsp (Heat shocked protein) 90-binding domain.
Expression of a construct consisting of only the
N-terminal domain of Saccharomyces pombe Cdc37 results
in cellular viability. This indicates that interactions
with the cochaperone Hsp90 may not be essential for
Cdc37 function.
Length = 154
Score = 107 bits (268), Expect = 4e-28
Identities = 61/156 (39%), Positives = 82/156 (52%), Gaps = 33/156 (21%)
Query: 1 MVDYSKWKDIEISDDED-ETHPNIDTPSLFRWR----HQARVERMEE------QRLENEK 49
+DYSKW IE+SDD D E HPN+D S RW+ HQARVERMEE + + N+
Sbjct: 1 PIDYSKWDKIELSDDSDIEVHPNVDKKSFIRWKQRDIHQARVERMEEIKNLKYELIMNDH 60
Query: 50 FQQEKEKTLKTLEET--------------------KKKLVESEQAGANNLDELKKALAEL 89
+ +K LK L E KK+L E+ L+ELKK +L
Sbjct: 61 LNKRIDKLLKGLREEELSPETPTYNEMLAELQDQLKKELEEANGDSEGLLEELKKHRDKL 120
Query: 90 EKEAANIRKKEEDLKKKEKITPWNVDTISKPGFTKT 125
+KE +RKK ++L+K+EK W+VDT GF K+
Sbjct: 121 KKEQKELRKKLDELEKEEKKKIWSVDT--HTGFDKS 154
>gnl|CDD|217443 pfam03234, CDC37_N, Cdc37 N terminal kinase binding. Cdc37 is a
molecular chaperone required for the activity of
numerous eukaryotic protein kinases. This domain
corresponds to the N terminal domain which binds
predominantly to protein kinases and is found N terminal
to the Hsp (Heat shocked protein) 90-binding domain
pfam08565. Expression of a construct consisting of only
the N-terminal domain of Saccharomyces pombe Cdc37
results in cellular viability. This indicates that
interactions with the cochaperone Hsp90 may not be
essential for Cdc37 function.
Length = 172
Score = 102 bits (256), Expect = 3e-26
Identities = 55/174 (31%), Positives = 80/174 (45%), Gaps = 51/174 (29%)
Query: 1 MVDYSKWKDIEISDDED-ETHPNIDTPSLFRWRHQARVERMEEQRLENEKFQQEKEKTLK 59
+DYSKW IE+SDD D E HPN+D PS RW+ QA ER E+++ E E+ ++E+ +
Sbjct: 1 PIDYSKWDKIELSDDSDIEVHPNVDKPSFIRWKQQAIHERREQRKQEIEELEKERIMNDR 60
Query: 60 TLEETKKKLVE------------------------------------------------S 71
LE K L E
Sbjct: 61 LLERVDKLLSELKEESLDSSQAVMKSLNENFTDKENVEPEQPTYNEMVEDLFDQVKDEVD 120
Query: 72 EQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEKITPWNVDTISKPGFTKT 125
E+ GA ++EL+K +L+KE + KK ++L+K+EK W+VDT GF K+
Sbjct: 121 EKNGAALIEELQKHRDKLKKEQKELLKKLDELEKEEKKKIWSVDT--HTGFDKS 172
>gnl|CDD|219905 pfam08564, CDC37_C, Cdc37 C terminal domain. Cdc37 is a protein
required for the activity of numerous eukaryotic protein
kinases. This domains corresponds to the C terminal
domain whose function is unclear. It is found C terminal
to the Hsp90 chaperone (Heat shocked protein 90) binding
domain pfam08565 and the N terminal kinase binding
domain of Cdc37 pfam03234.
Length = 89
Score = 77.0 bits (190), Expect = 9e-18
Identities = 38/78 (48%), Positives = 50/78 (64%)
Query: 263 GPGGLDPVEVFESLPDVLKKCFELQDTAMLQEAIASLSPEEATYHMKRCVDSGLWVPDAK 322
G GLDPVEVFESLP L+K FE +D + + +A + EEA YHM+RC+DSGLWVP+AK
Sbjct: 1 GAEGLDPVEVFESLPPELQKAFETKDLDEVNKVLAKMPVEEAEYHMERCIDSGLWVPNAK 60
Query: 323 KAAGESEDPQENTPDAED 340
+ E + E+
Sbjct: 61 IEGEKEFKELEEEYNEEE 78
>gnl|CDD|215009 smart01069, CDC37_C, Cdc37 C terminal domain. Cdc37 is a protein
required for the activity of numerous eukaryotic protein
kinases. This domains corresponds to the C terminal
domain whose function is unclear. It is found C terminal
to the Hsp90 chaperone (Heat shocked protein 90) binding
domain pfam08565 and the N terminal kinase binding
domain of Cdc37.
Length = 93
Score = 68.6 bits (168), Expect = 1e-14
Identities = 39/83 (46%), Positives = 53/83 (63%), Gaps = 3/83 (3%)
Query: 262 LGPGGL--DPVEVFESLPDVLKKCFELQDTAMLQEAIASLSPEEATYHMKRCVDSGLW-V 318
PGGL DP EVFESLP L+K FE +D ML + +A + EEA YH++RC++SGLW V
Sbjct: 1 NLPGGLSLDPEEVFESLPPELQKAFETKDLDMLNKVLAKMPVEEAEYHLERCIESGLWGV 60
Query: 319 PDAKKAAGESEDPQENTPDAEDP 341
P+A + E ++ QE E+
Sbjct: 61 PNAIEDETEFKELQEQYEVEEEA 83
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 39.4 bits (93), Expect = 0.002
Identities = 24/71 (33%), Positives = 43/71 (60%), Gaps = 3/71 (4%)
Query: 38 ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIR 97
E + + R E EK +E+ L+ LE K+L++ E+ L+ L+K ELEK+ +
Sbjct: 64 EEIHKLRNEFEKELRERRNELQKLE---KRLLQKEENLDRKLELLEKREEELEKKEKELE 120
Query: 98 KKEEDLKKKEK 108
+K+++L+KKE+
Sbjct: 121 QKQQELEKKEE 131
Score = 35.5 bits (83), Expect = 0.037
Identities = 27/126 (21%), Positives = 59/126 (46%), Gaps = 23/126 (18%)
Query: 39 RMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRK 98
+ E + ++ E + KE+ K E +K+L E EL+K L ++ N+ +
Sbjct: 48 KKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRN-------ELQKLEKRLLQKEENLDR 100
Query: 99 KEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVKENEKLI 158
K E L+K+E+ + K+ L+Q ++ E+KE+E++ ++E + +
Sbjct: 101 KLELLEKREEE----------------LEKKEKELEQKQQELEKKEEELEELIEEQLQEL 144
Query: 159 KQYANY 164
++ +
Sbjct: 145 ERISGL 150
Score = 29.7 bits (68), Expect = 2.6
Identities = 23/89 (25%), Positives = 46/89 (51%), Gaps = 16/89 (17%)
Query: 32 RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVE-------------SEQAGANN 78
R +E+ EE+ + EK ++K++ L+ EE ++L+E +E+A
Sbjct: 100 RKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAEEAKEIL 159
Query: 79 LDELKKALAELEKEAANIRKKEEDLKKKE 107
L+++++ E EAA + K+ E+ K+E
Sbjct: 160 LEKVEE---EARHEAAVLIKEIEEEAKEE 185
Score = 29.0 bits (66), Expect = 4.2
Identities = 20/74 (27%), Positives = 38/74 (51%), Gaps = 4/74 (5%)
Query: 36 RVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAAN 95
R R E Q+LE + Q KE+ L E +K E + L++ ++ L + E+E
Sbjct: 78 RERRNELQKLEK-RLLQ-KEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEE 135
Query: 96 IRKKEEDLKKKEKI 109
+ +++ L++ E+I
Sbjct: 136 LIEEQ--LQELERI 147
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 39.4 bits (92), Expect = 0.003
Identities = 30/133 (22%), Positives = 56/133 (42%), Gaps = 13/133 (9%)
Query: 38 ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIR 97
E +E+ E EK ++E E+ ++ L+E ++ + E E ++AL L+KE +I
Sbjct: 532 EHLEKLLKEQEKLKKELEQEMEELKERERNK-KLEL-----EKEAQEALKALKKEVESII 585
Query: 98 KKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPP----LQQGEKSDEEKEQEMKAFVKE 153
++ LK+K+ + +I K K P Q + D+ + + K
Sbjct: 586 RE---LKEKKIHKAKEIKSIEDLVKLKETKQKIPQKPTNFQADKIGDKVRIRYFGQKGKI 642
Query: 154 NEKLIKQYANYLV 166
+ L N V
Sbjct: 643 VQILGGNKWNVTV 655
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 38.2 bits (89), Expect = 0.006
Identities = 14/76 (18%), Positives = 26/76 (34%)
Query: 32 RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK 91
+ Q+ + + +E++KTL L + + N LK +A E
Sbjct: 186 ELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEA 245
Query: 92 EAANIRKKEEDLKKKE 107
AA R+ +
Sbjct: 246 AAAKAREAAAAAEAAA 261
Score = 32.8 bits (75), Expect = 0.26
Identities = 21/88 (23%), Positives = 37/88 (42%), Gaps = 11/88 (12%)
Query: 28 LFRWRHQARVERMEEQ-------RLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLD 80
+ + AR ER++ + E+ + L E + + + Q L+
Sbjct: 151 YYGALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQL----LE 206
Query: 81 ELKKALAELEKEAANIRKKEEDLKKKEK 108
E KK LA+L E + +KK E+L+ E
Sbjct: 207 ERKKTLAQLNSELSADQKKLEELRANES 234
Score = 30.8 bits (70), Expect = 0.92
Identities = 17/68 (25%), Positives = 33/68 (48%)
Query: 34 QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
Q + +E++ E + + + EK LK+LE L A++L +L+K +A+L
Sbjct: 44 QKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIADLNARL 103
Query: 94 ANIRKKEE 101
+ +E
Sbjct: 104 NALEVQER 111
Score = 28.1 bits (63), Expect = 6.6
Identities = 14/58 (24%), Positives = 27/58 (46%)
Query: 51 QQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEK 108
++ ++ K + +KK+ E + A +LK E+ A + + +DLKK K
Sbjct: 37 DKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRK 94
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 37.6 bits (88), Expect = 0.008
Identities = 41/212 (19%), Positives = 85/212 (40%), Gaps = 49/212 (23%)
Query: 53 EKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEKITPW 112
+K+ ++K+LEE K + E + + EL++ ++ELE E + ++ E L+ PW
Sbjct: 76 KKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIERLE------PW 129
Query: 113 ---NVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVKENEKLIKQYAN--YLVI 167
++D GF V G +++ E+ EN + I Y+V+
Sbjct: 130 GNFDLDLSLLLGFKYVSV------FVGTVPEDKLEELKLESDVENVEYISTDKGYVYVVV 183
Query: 168 WCINLAMDEKFDLMDHVAHQCICMQYLLELSKSLNIDPRACIGSFFSKIQIAEPDYKASF 227
+ DE + L +L F ++++ + + +
Sbjct: 184 VVLKELSDEVEEE-------------LKKLG--------------FERLEL---EEEGTP 213
Query: 228 DDELKSFKERIR--NRAKEKLDEALKEVEEEE 257
+ ++ KE + + +E L E LKE+ ++
Sbjct: 214 SELIREIKEELEEIEKERESLLEELKELAKKY 245
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 37.4 bits (87), Expect = 0.011
Identities = 21/75 (28%), Positives = 44/75 (58%)
Query: 34 QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
+ + R+EE+ E ++ +E EK ++ L+ ++L E + L ELK+ + ELE E
Sbjct: 245 EEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEI 304
Query: 94 ANIRKKEEDLKKKEK 108
+ +R++ E+L+ + +
Sbjct: 305 SLLRERLEELENELE 319
Score = 33.9 bits (78), Expect = 0.15
Identities = 30/131 (22%), Positives = 63/131 (48%), Gaps = 3/131 (2%)
Query: 34 QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
+ +E +E++ E ++ +E E + LE+ K+L E ++ L EL+ LAEL++E
Sbjct: 848 EEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEI 907
Query: 94 ANIRKKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVKE 153
+R++ E+L+ K + + + + + E+ E E+E++A
Sbjct: 908 EKLRERLEELEAKLERLE---VELPELEEELEEEYEDTLETELEREIERLEEEIEALGPV 964
Query: 154 NEKLIKQYANY 164
N + I++Y
Sbjct: 965 NLRAIEEYEEV 975
Score = 33.9 bits (78), Expect = 0.15
Identities = 18/67 (26%), Positives = 37/67 (55%), Gaps = 7/67 (10%)
Query: 49 KFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAAN---IRKKEEDLKK 105
K+++ KE+ + LE T++ L E + L+EL+K L +LE++A ++ + +L++
Sbjct: 169 KYKERKEEAERKLERTEENLERLE----DLLEELEKQLEKLERQAEKAERYQELKAELRE 224
Query: 106 KEKITPW 112
E
Sbjct: 225 LELALLL 231
Score = 33.5 bits (77), Expect = 0.16
Identities = 22/74 (29%), Positives = 43/74 (58%)
Query: 33 HQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKE 92
+ R ER+E++ E E+ +E E+ L LEE ++L + + L+EL+ ELE E
Sbjct: 819 LEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDE 878
Query: 93 AANIRKKEEDLKKK 106
+ +++E+L+++
Sbjct: 879 LKELEEEKEELEEE 892
Score = 33.5 bits (77), Expect = 0.17
Identities = 55/253 (21%), Positives = 110/253 (43%), Gaps = 38/253 (15%)
Query: 23 IDTPSLFRWRHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDEL 82
I+ R Q +E +EE+ E E+ E+ L++LE+ +++L + + ++EL
Sbjct: 781 IEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEEL 840
Query: 83 KKALAELEKEAANIRKKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEE 142
++ L ELE+E + K+ E+LK++ + + L++ E+ EE
Sbjct: 841 EEKLDELEEELEELEKELEELKEELE------------ELEAEKEELEDELKELEEEKEE 888
Query: 143 KEQEMKAFVKENEKLIKQYANYLVIWCINLAMDEKFDLMDHVAHQCICMQYLLELSKSL- 201
E+E++ E +L ++ A E+ ++ + + +Y L L
Sbjct: 889 LEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELE 948
Query: 202 --------------NIDPRACIGSFFSKIQIAEPDYKASFDDELKSFKERIRNRAKEKLD 247
++ RA I E + +ELKS +E + AKEKL
Sbjct: 949 REIERLEEEIEALGPVNLRA----------IEEYEEVEERYEELKSQREDLE-EAKEKLL 997
Query: 248 EALKEVEEEERQK 260
E ++E+++E+R++
Sbjct: 998 EVIEELDKEKRER 1010
Score = 32.0 bits (73), Expect = 0.62
Identities = 21/73 (28%), Positives = 39/73 (53%), Gaps = 3/73 (4%)
Query: 40 MEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANN---LDELKKALAELEKEAANI 96
Q L+ E + E L L+E +K+L E E+ + L+EL++ L E EKE +
Sbjct: 213 ERYQELKAELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEEL 272
Query: 97 RKKEEDLKKKEKI 109
+ + E+L+++ +
Sbjct: 273 KSELEELREELEE 285
Score = 30.8 bits (70), Expect = 1.3
Identities = 18/77 (23%), Positives = 47/77 (61%)
Query: 32 RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK 91
+ +E+++ + E E+ +E E+ L+ L+E ++L E ++ L +LK+ + ELE+
Sbjct: 727 ALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEE 786
Query: 92 EAANIRKKEEDLKKKEK 108
+ ++++ E+L+++ +
Sbjct: 787 KRQALQEELEELEEELE 803
Score = 28.9 bits (65), Expect = 4.6
Identities = 22/90 (24%), Positives = 44/90 (48%), Gaps = 5/90 (5%)
Query: 30 RWRHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAEL 89
R + ++EE + E ++E E+ LEE +++L E E L+EL++ L EL
Sbjct: 709 RQLEELE-RQLEELKRELAALEEELEQLQSRLEELEEELEELE----EELEELQERLEEL 763
Query: 90 EKEAANIRKKEEDLKKKEKITPWNVDTISK 119
E+E ++ + LK++ + + +
Sbjct: 764 EEELESLEEALAKLKEEIEELEEKRQALQE 793
>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
Length = 202
Score = 36.2 bits (84), Expect = 0.014
Identities = 15/56 (26%), Positives = 27/56 (48%), Gaps = 3/56 (5%)
Query: 31 WRHQARVERMEEQRLE-NEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKA 85
W++ V ++EE+R E + ++K K K +E +KK + + LKK
Sbjct: 146 WKYADVVAKLEEKRKERAAAYYKKKVKLRKAWKEARKKALAKLP--KAIVAVLKKF 199
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 35.4 bits (82), Expect = 0.033
Identities = 24/76 (31%), Positives = 38/76 (50%), Gaps = 9/76 (11%)
Query: 33 HQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKE 92
H+AR EE+R +KF + K E+ +K+L E EQ EL L E++K+
Sbjct: 307 HEAR--LNEEERELKKKFTE------KIREK-EKRLEELEQNLIEERKELNSKLEEIQKK 357
Query: 93 AANIRKKEEDLKKKEK 108
++ K+ E LK +
Sbjct: 358 LEDLEKRLEKLKSNKS 373
>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
ATPase domains [General function prediction only].
Length = 530
Score = 34.9 bits (81), Expect = 0.051
Identities = 18/78 (23%), Positives = 33/78 (42%), Gaps = 5/78 (6%)
Query: 34 QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
+ + ER+ ++ EK Q+E K + + K ++++A + K L +LE
Sbjct: 240 EQKAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKS-----RIKRLEKLEARL 294
Query: 94 ANIRKKEEDLKKKEKITP 111
A R EE + P
Sbjct: 295 AEERPVEEGKPLAFRFPP 312
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 35.1 bits (80), Expect = 0.060
Identities = 33/125 (26%), Positives = 62/125 (49%), Gaps = 7/125 (5%)
Query: 34 QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
Q + + EE++ E + E+E +K EE KK + ++A +E KKA + +K A
Sbjct: 1637 QLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKA-----EEAKKAEEDEKKAA 1691
Query: 94 ANIRKKEEDLKKKEKITPWNVDTISKPGFTKTV--VNKKPPLQQGEKSDEEKEQEMKAFV 151
++K+ E+ KK E++ + K K NK + ++++E+K++ +A
Sbjct: 1692 EALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKK 1751
Query: 152 KENEK 156
E EK
Sbjct: 1752 DEEEK 1756
Score = 33.6 bits (76), Expect = 0.20
Identities = 33/125 (26%), Positives = 61/125 (48%), Gaps = 5/125 (4%)
Query: 32 RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK 91
R ++ EE R+E E+EK +K E K + + + +E KK + +L+K
Sbjct: 1581 RKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKK 1640
Query: 92 EAANIRKKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFV 151
+ A +KK E+LKK E+ + I K K ++ +K++E++++ +A
Sbjct: 1641 KEAEEKKKAEELKKAEE-----ENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALK 1695
Query: 152 KENEK 156
KE E+
Sbjct: 1696 KEAEE 1700
Score = 33.2 bits (75), Expect = 0.22
Identities = 29/123 (23%), Positives = 52/123 (42%), Gaps = 6/123 (4%)
Query: 34 QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
+A + E + E +++KE+ K + KKK E ++A DE KK E +K+A
Sbjct: 1353 EAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKA-----DEAKKKAEEDKKKA 1407
Query: 94 ANIRKKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVKE 153
++K KK ++ + K K + + +K EE ++ +A K
Sbjct: 1408 DELKKAAAAKKKADEAKK-KAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKA 1466
Query: 154 NEK 156
E
Sbjct: 1467 EEA 1469
Score = 31.3 bits (70), Expect = 0.88
Identities = 35/130 (26%), Positives = 67/130 (51%), Gaps = 9/130 (6%)
Query: 32 RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK 91
+ ++ EE + + E+ ++ +E K E KK+ E+++A +ELKK AE +K
Sbjct: 1662 KAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKA-----EELKKKEAEEKK 1716
Query: 92 EAANIRKKEEDLKKK-EKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAF 150
+A ++K EE+ K K E+ + K K +K + +K +E+K +E++
Sbjct: 1717 KAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIR-- 1774
Query: 151 VKENEKLIKQ 160
KE E +I++
Sbjct: 1775 -KEKEAVIEE 1783
Score = 31.3 bits (70), Expect = 0.98
Identities = 22/72 (30%), Positives = 37/72 (51%), Gaps = 7/72 (9%)
Query: 38 ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIR 97
++ EE + E ++ +E K +E KKK E+++A DE KK E +K+A +
Sbjct: 1451 KKAEEAKKAEEAKKKAEEA--KKADEAKKKAEEAKKA-----DEAKKKAEEAKKKADEAK 1503
Query: 98 KKEEDLKKKEKI 109
K E KK ++
Sbjct: 1504 KAAEAKKKADEA 1515
Score = 30.9 bits (69), Expect = 1.4
Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 8/78 (10%)
Query: 32 RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK 91
R +++ EE++ +E +K + K +E KKK E+++A DE KK E +K
Sbjct: 1278 RKADELKKAEEKKKADE---AKKAEEKKKADEAKKKAEEAKKA-----DEAKKKAEEAKK 1329
Query: 92 EAANIRKKEEDLKKKEKI 109
+A +KK E+ KK +
Sbjct: 1330 KADAAKKKAEEAKKAAEA 1347
Score = 29.7 bits (66), Expect = 2.9
Identities = 24/108 (22%), Positives = 44/108 (40%), Gaps = 4/108 (3%)
Query: 38 ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIR 97
++ EE + E + E E E ++K +A +E KK +K+A +
Sbjct: 1336 KKAEEAKKAAEAAKAEAEAAADEAEAAEEK----AEAAEKKKEEAKKKADAAKKKAEEKK 1391
Query: 98 KKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQ 145
K +E KK E+ + K K ++ +K+DE K++
Sbjct: 1392 KADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKK 1439
Score = 29.7 bits (66), Expect = 3.0
Identities = 26/114 (22%), Positives = 51/114 (44%), Gaps = 3/114 (2%)
Query: 38 ERMEEQRLENEKFQQEKEKTLKTLEETKKK---LVESEQAGANNLDELKKALAELEKEAA 94
++ E + + ++ E+ K E KKK L ++E+ +E KK E +K+A
Sbjct: 1688 KKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAE 1747
Query: 95 NIRKKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMK 148
+K EE+ KK + + K V ++ ++ EK E ++++K
Sbjct: 1748 EAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIK 1801
Score = 28.6 bits (63), Expect = 6.4
Identities = 20/91 (21%), Positives = 38/91 (41%), Gaps = 1/91 (1%)
Query: 17 DETHPNIDTPSLFRWRHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGA 76
DE H + E ++ + + +E + + EE K E+++
Sbjct: 1049 DEDIDGNHEGKAEAKAHVGQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTET 1108
Query: 77 NNLDELKKALAELEKEAANIRKKEEDLKKKE 107
+E +KA E +K+A + RK EE K ++
Sbjct: 1109 GKAEEARKA-EEAKKKAEDARKAEEARKAED 1138
Score = 28.2 bits (62), Expect = 9.7
Identities = 36/119 (30%), Positives = 57/119 (47%), Gaps = 15/119 (12%)
Query: 38 ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIR 97
++ EE + +E ++ +E K EE KKK E+++A DE KK AE K+A +
Sbjct: 1438 KKAEEAKKADEAKKKAEEA--KKAEEAKKKAEEAKKA-----DEAKKK-AEEAKKADEAK 1489
Query: 98 KKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVKENEK 156
KK E+ KKK D K K ++ ++ +K+DE K+ E E +K
Sbjct: 1490 KKAEEAKKK-------ADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKK 1541
>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
Length = 428
Score = 34.7 bits (80), Expect = 0.060
Identities = 26/81 (32%), Positives = 42/81 (51%), Gaps = 14/81 (17%)
Query: 40 MEEQRLENEKFQQ---EKEKTLKTLEETKKK----LVESEQAGANNLDELK------KAL 86
+ EQ+ + +K +Q E++KTL LE + +K L E + D + KA
Sbjct: 200 LYEQQAQQQKLEQARNERKKTLTGLESSLQKDQQQLSELRANESRLRDSIARAEREAKAR 259
Query: 87 AELE-KEAANIRKKEEDLKKK 106
AE E +EAA +R K++ K+K
Sbjct: 260 AEREAREAARVRDKQKQAKRK 280
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 35.0 bits (81), Expect = 0.068
Identities = 26/110 (23%), Positives = 40/110 (36%), Gaps = 3/110 (2%)
Query: 48 EKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKE 107
EK E EK K LE+ K + +LD+ ++AL E E+ KE+ LK K
Sbjct: 1105 EKLNAELEKKEKELEKLKNT--TPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKT 1162
Query: 108 K-ITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVKENEK 156
K K K + ++ K + K ++K
Sbjct: 1163 KGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDK 1212
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 35.0 bits (81), Expect = 0.069
Identities = 21/75 (28%), Positives = 35/75 (46%)
Query: 32 RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK 91
+ R+ +E Q E E +E E L L E +L E + L+ L+ L ELE
Sbjct: 306 ILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEA 365
Query: 92 EAANIRKKEEDLKKK 106
E + + E+L+++
Sbjct: 366 ELEELESRLEELEEQ 380
Score = 32.7 bits (75), Expect = 0.30
Identities = 19/55 (34%), Positives = 32/55 (58%)
Query: 52 QEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKK 106
E+ + ++ LEE ++L E L EL+K L ELE+E +RK+ E+L ++
Sbjct: 673 LERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQ 727
Score = 31.2 bits (71), Expect = 1.0
Identities = 21/68 (30%), Positives = 37/68 (54%), Gaps = 2/68 (2%)
Query: 41 EEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKE 100
E RLE +K Q + L LE ++L + + LDEL + LAELE++ ++++
Sbjct: 296 EISRLEQQK--QILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEEL 353
Query: 101 EDLKKKEK 108
E L+ + +
Sbjct: 354 ESLEAELE 361
Score = 30.8 bits (70), Expect = 1.2
Identities = 22/74 (29%), Positives = 38/74 (51%)
Query: 32 RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK 91
+ ++E +EE+ E EK E K L+ LEE ++L + + + + L+K LA LE
Sbjct: 681 ELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEA 740
Query: 92 EAANIRKKEEDLKK 105
E + ++ L K
Sbjct: 741 EVEQLEERIAQLSK 754
Score = 30.4 bits (69), Expect = 1.5
Identities = 22/69 (31%), Positives = 34/69 (49%)
Query: 38 ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIR 97
E +E Q E E E + L LEE ++L E ++ L+EL+ L ELE +
Sbjct: 319 EELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELE 378
Query: 98 KKEEDLKKK 106
++ E L+ K
Sbjct: 379 EQLETLRSK 387
Score = 30.4 bits (69), Expect = 1.6
Identities = 26/73 (35%), Positives = 44/73 (60%)
Query: 34 QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
+A +E +EE+ E E+ E E ++ LE ++L E +A LDEL+ L L +EA
Sbjct: 760 EAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEA 819
Query: 94 ANIRKKEEDLKKK 106
AN+R++ E L+++
Sbjct: 820 ANLRERLESLERR 832
Score = 28.9 bits (65), Expect = 5.5
Identities = 15/73 (20%), Positives = 33/73 (45%)
Query: 34 QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
+ ++E + E +E E+ ++ LE + L+ + L L+ L EL +E
Sbjct: 844 EEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEEL 903
Query: 94 ANIRKKEEDLKKK 106
+ K +L+++
Sbjct: 904 RELESKRSELRRE 916
Score = 28.5 bits (64), Expect = 7.2
Identities = 19/75 (25%), Positives = 41/75 (54%)
Query: 32 RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK 91
R +A VE++EE+ + K E E ++ LEE ++ E ++EL+ + +L++
Sbjct: 737 RLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKE 796
Query: 92 EAANIRKKEEDLKKK 106
E +R+ ++L+ +
Sbjct: 797 ELKALREALDELRAE 811
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 35.0 bits (81), Expect = 0.070
Identities = 22/78 (28%), Positives = 39/78 (50%), Gaps = 3/78 (3%)
Query: 38 ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANN---LDELKKALAELEKEAA 94
++ E E E+ +QE+EK + LEE ++ L EQ N L EL+ + ELE++
Sbjct: 716 RKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLH 775
Query: 95 NIRKKEEDLKKKEKITPW 112
+ + DL+ + +
Sbjct: 776 KLEEALNDLEARLSHSRI 793
Score = 30.0 bits (68), Expect = 2.0
Identities = 20/75 (26%), Positives = 40/75 (53%), Gaps = 6/75 (8%)
Query: 34 QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
+ +E++ E+ E EK +E E+ L+ L + K L E EQ +K+ + ELE E
Sbjct: 250 EEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQL------RVKEKIGELEAEI 303
Query: 94 ANIRKKEEDLKKKEK 108
A++ + + +++ +
Sbjct: 304 ASLERSIAEKERELE 318
Score = 30.0 bits (68), Expect = 2.0
Identities = 23/115 (20%), Positives = 46/115 (40%), Gaps = 5/115 (4%)
Query: 32 RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK 91
+ E E Q + +Q K K +E K E E+ L+EL+ AL +LE
Sbjct: 828 KEYLEKEIQELQEQRIDLKEQIKSIE-KEIENLNGKKEELEE----ELEELEAALRDLES 882
Query: 92 EAANIRKKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQE 146
+++K+ ++L+ + + ++ + K + + +E E E
Sbjct: 883 RLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIE 937
Score = 29.7 bits (67), Expect = 2.6
Identities = 21/75 (28%), Positives = 40/75 (53%)
Query: 34 QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
QA + ++EE+ E +E E+ L L K+ L + Q +LK+ + +EKE
Sbjct: 797 QAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEI 856
Query: 94 ANIRKKEEDLKKKEK 108
N+ K+E+L+++ +
Sbjct: 857 ENLNGKKEELEEELE 871
Score = 28.5 bits (64), Expect = 6.8
Identities = 29/131 (22%), Positives = 53/131 (40%), Gaps = 20/131 (15%)
Query: 32 RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK 91
+A + +E + + +K + E E L+ LE ++L + L ELK L LE+
Sbjct: 872 ELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEE 931
Query: 92 EAANIRKKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFV 151
E + I ED K +++ P L+ + + E+E++A
Sbjct: 932 ELSEI----EDPKGEDEEIPEEE----------------LSLEDVQAELQRVEEEIRALE 971
Query: 152 KENEKLIKQYA 162
N I++Y
Sbjct: 972 PVNMLAIQEYE 982
Score = 28.5 bits (64), Expect = 7.3
Identities = 28/144 (19%), Positives = 63/144 (43%), Gaps = 21/144 (14%)
Query: 41 EEQRLENEKFQQEKEKTLK------TLEET--------KKKLVESEQAGANNLDELKKAL 86
+E+R + E+ ++E+EK + E K+ L ++A L L++ L
Sbjct: 194 DEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEEL 253
Query: 87 AELEKEAANIRKKEEDLKKK-----EKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDE 141
+L +E + + K+ E++++ +KI + + + + + + E+S
Sbjct: 254 EKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLR--VKEKIGELEAEIASLERSIA 311
Query: 142 EKEQEMKAFVKENEKLIKQYANYL 165
EKE+E++ + KL + L
Sbjct: 312 EKERELEDAEERLAKLEAEIDKLL 335
Score = 28.1 bits (63), Expect = 8.9
Identities = 17/57 (29%), Positives = 32/57 (56%)
Query: 52 QEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEK 108
Q E+ ++ LE ++ + LDELK+ A+LE+E I ++ E+ +KK++
Sbjct: 961 QRVEEEIRALEPVNMLAIQEYEEVLKRLDELKEKRAKLEEERKAILERIEEYEKKKR 1017
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 34.1 bits (79), Expect = 0.096
Identities = 20/89 (22%), Positives = 46/89 (51%), Gaps = 12/89 (13%)
Query: 28 LFRWRHQA----RVERMEEQRLENEKFQQEK--EKTLKTLEETKKKLVESEQAGANNLDE 81
+ + R + + R E QRLE Q+E+ ++ +++L++ ++ L + E+ L
Sbjct: 60 VHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEK----ELSN 115
Query: 82 LKKALAELEKEAANIRKKEEDLKKKEKIT 110
+K L E E+E + ++ ++ E+I+
Sbjct: 116 KEKNLDEKEEELEELIAEQ--REELERIS 142
Score = 33.7 bits (78), Expect = 0.15
Identities = 20/71 (28%), Positives = 39/71 (54%), Gaps = 3/71 (4%)
Query: 38 ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIR 97
E + + R E E+ +E+ L+ LE ++L++ E+ ++ L K LEK+ +
Sbjct: 58 EEVHKLRAELERELKERRNELQRLE---RRLLQREETLDRKMESLDKKEENLEKKEKELS 114
Query: 98 KKEEDLKKKEK 108
KE++L +KE+
Sbjct: 115 NKEKNLDEKEE 125
>gnl|CDD|238288 cd00520, RRF, Ribosome recycling factor (RRF). Ribosome recycling
factor dissociates the posttermination complex, composed
of the ribosome, deacylated tRNA, and mRNA, after
termination of translation. Thus ribosomes are
"recycled" and ready for another round of protein
synthesis. RRF is believed to bind the ribosome at the
A-site in a manner that mimics tRNA, but the specific
mechanisms remain unclear. RRF is essential for
bacterial growth. It is not necessary for cell growth
in archaea or eukaryotes, but is found in mitochondria
or chloroplasts of some eukaryotic species.
Length = 179
Score = 33.4 bits (77), Expect = 0.097
Identities = 23/74 (31%), Positives = 36/74 (48%), Gaps = 6/74 (8%)
Query: 41 EEQRLENEK-----FQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAAN 95
EE+R E K ++ K + K+ + E+ + DE+KKA +L+K
Sbjct: 103 EERRKELVKDAKKIAEEAKVAIRNIRRDANDKIKKLEKEKEISEDEVKKAEEDLQKLTDE 162
Query: 96 -IRKKEEDLKKKEK 108
I+K +E LK KEK
Sbjct: 163 YIKKIDELLKSKEK 176
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 33.9 bits (78), Expect = 0.14
Identities = 28/143 (19%), Positives = 59/143 (41%), Gaps = 6/143 (4%)
Query: 32 RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK 91
A +E E E + +QE + + L+ +++LV+ E+ ++L +LE+
Sbjct: 50 LEAALLEAKELLLRERNQQRQEARREREELQREEERLVQKEEQLDARAEKLDNLENQLEE 109
Query: 92 EAANIRKKEEDLKKKEKITPWNVDTI--SKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKA 149
+ +E +L++ EK +D G T K + +EEK Q +K
Sbjct: 110 REKALSARELELEELEK----QLDNELYRVAGLTPEQARKLLLKLLDAELEEEKAQRVKK 165
Query: 150 FVKENEKLIKQYANYLVIWCINL 172
+E + ++ A ++ +
Sbjct: 166 IEEEADLEAERKAQNILAQAMQR 188
>gnl|CDD|233063 TIGR00631, uvrb, excinuclease ABC, B subunit. All proteins in this
family for wich functions are known are DNA helicases
that function in the nucleotide excision repair and are
endonucleases that make the 3' incision next to DNA
damage. They are part of a pathway requiring UvrA, UvrB,
UvrC, and UvrD homologs. This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 655
Score = 33.8 bits (78), Expect = 0.15
Identities = 26/92 (28%), Positives = 43/92 (46%), Gaps = 13/92 (14%)
Query: 80 DELKKALAELEKEAANIRKKEEDLKKKEKITPWNVDTISKP------GFTKTVVNKKPPL 133
D ++KA+ E E+ RK + ++ ITP TI KP K +
Sbjct: 555 DSMQKAIEETERR----RKIQMAYNEEHGITP---QTIRKPIRDILDIELKEKEDAAKKK 607
Query: 134 QQGEKSDEEKEQEMKAFVKENEKLIKQYANYL 165
++GE + ++E+K +K+ EK +KQ A L
Sbjct: 608 KKGEDLSDLSKKELKKLIKQLEKEMKQAARNL 639
>gnl|CDD|218038 pfam04350, PilO, Pilus assembly protein, PilO. PilO proteins are
involved in the assembly of pilin. However, the precise
function of this family of proteins is not known.
Length = 144
Score = 32.1 bits (74), Expect = 0.17
Identities = 17/42 (40%), Positives = 19/42 (45%), Gaps = 7/42 (16%)
Query: 51 QQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKE 92
Q KE TLK E K + A NL KK L ELE+
Sbjct: 4 AQAKEATLKQEFEEKARQ-------AANLPAYKKQLEELEER 38
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 33.4 bits (76), Expect = 0.17
Identities = 19/116 (16%), Positives = 48/116 (41%), Gaps = 4/116 (3%)
Query: 37 VERMEEQRLENEKFQQE--KEKTLKTLEETKKKLVESEQA--GANNLDELKKALAELEKE 92
VE + E + F+++ K ++ E+ K+ E+ + D+ ++ +
Sbjct: 183 VEALREDNEKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDN 242
Query: 93 AANIRKKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMK 148
A R + +++ K P DT S + N+K +++ + ++ ++E
Sbjct: 243 ADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEAL 298
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 33.5 bits (77), Expect = 0.17
Identities = 23/84 (27%), Positives = 40/84 (47%), Gaps = 9/84 (10%)
Query: 34 QARVERMEEQRLENEKFQQE---------KEKTLKTLEETKKKLVESEQAGANNLDELKK 84
+ +++E+ EK E EK +K LEET ++L E L+ELK+
Sbjct: 391 AEALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKR 450
Query: 85 ALAELEKEAANIRKKEEDLKKKEK 108
+ +LE E R++ D +K++
Sbjct: 451 EIEKLESELERFRREVRDKVRKDR 474
>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 187 and 201 amino
acids in length. There is a single completely conserved
residue Q that may be functionally important.
Length = 158
Score = 32.2 bits (74), Expect = 0.17
Identities = 18/59 (30%), Positives = 29/59 (49%), Gaps = 4/59 (6%)
Query: 50 FQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEK 108
+Q + K + L E L+E E A EL+ +A +KE A R + ++L+ K K
Sbjct: 44 YQAKSIK--QHLAELAADLLELEAAAPR--AELQAKIARYKKEKARYRSEAKELEAKAK 98
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 33.3 bits (76), Expect = 0.18
Identities = 22/75 (29%), Positives = 35/75 (46%), Gaps = 1/75 (1%)
Query: 34 QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
A+ ++++ E K +Q E K E +KK E + A + KA AE +K+A
Sbjct: 110 AAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAE-EEAKAKAAAEAKKKA 168
Query: 94 ANIRKKEEDLKKKEK 108
A +KK E K +
Sbjct: 169 AEAKKKAEAEAKAKA 183
Score = 27.9 bits (62), Expect = 8.4
Identities = 17/66 (25%), Positives = 32/66 (48%), Gaps = 1/66 (1%)
Query: 41 EEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKE 100
E+ R + + + EK K E+ K+ E ++ A + A A+ + EA +K +
Sbjct: 87 EQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQ-AEEAKAKQAAEAKAKAEAEAEKKAK 145
Query: 101 EDLKKK 106
E+ KK+
Sbjct: 146 EEAKKQ 151
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 33.1 bits (76), Expect = 0.18
Identities = 20/73 (27%), Positives = 40/73 (54%), Gaps = 5/73 (6%)
Query: 36 RVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAAN 95
R +R + + EK ++E E+ K ++ KKKL + + L E+ K +A+ EK A
Sbjct: 274 RPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLAR-----LKEIAKEVAQKEKARAR 328
Query: 96 IRKKEEDLKKKEK 108
+++ ++ +K+K
Sbjct: 329 KKEQRKERGEKKK 341
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 33.2 bits (76), Expect = 0.19
Identities = 20/76 (26%), Positives = 39/76 (51%), Gaps = 2/76 (2%)
Query: 33 HQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKE 92
Q R++++E++RL ++ +++ E+ K +K+ E A KA AE ++
Sbjct: 99 EQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQ--AEEAAAKAAAAAKAKAEAEAKRA 156
Query: 93 AANIRKKEEDLKKKEK 108
AA +K + KKK +
Sbjct: 157 AAAAKKAAAEAKKKAE 172
Score = 27.8 bits (62), Expect = 8.8
Identities = 19/75 (25%), Positives = 33/75 (44%), Gaps = 4/75 (5%)
Query: 34 QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
+ R+ E+++ E +Q K K EE K + +A A + K A K+A
Sbjct: 108 KERLAAQEQKKQAEEAAKQAALKQ-KQAEEAAAKAAAAAKAKA---EAEAKRAAAAAKKA 163
Query: 94 ANIRKKEEDLKKKEK 108
A KK+ + + +K
Sbjct: 164 AAEAKKKAEAEAAKK 178
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 32.9 bits (76), Expect = 0.27
Identities = 25/80 (31%), Positives = 37/80 (46%), Gaps = 8/80 (10%)
Query: 37 VERMEEQRL-------ENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLD-ELKKALAE 88
+ +EE E E +E EK + LEE K+KL E E + E ++A+ E
Sbjct: 522 IASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKE 581
Query: 89 LEKEAANIRKKEEDLKKKEK 108
+KEA I K+ L+K
Sbjct: 582 AKKEADEIIKELRQLQKGGY 601
Score = 31.3 bits (72), Expect = 0.86
Identities = 38/170 (22%), Positives = 58/170 (34%), Gaps = 51/170 (30%)
Query: 39 RMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLD-ELKKALAELEKEAANIR 97
+E++ E E +E EK + LEE K+KL E E + E ++A+ E +KEA I
Sbjct: 531 ELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEII 590
Query: 98 K-------------KEEDL-----------------KKKEKITPWNV---DT--ISKPGF 122
K K +L KKK+K + D G
Sbjct: 591 KELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKVGDEVKYLSLGQ 650
Query: 123 TKTVVNKKP---------------PLQQGEKSDEEKEQEMKAFVKENEKL 157
V++ PL EK + K+++ K K
Sbjct: 651 KGEVLSIPDDKEAIVQAGIMKMKVPLSDLEKIQKPKKKKKKKPKTVKPKP 700
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 32.7 bits (75), Expect = 0.28
Identities = 16/77 (20%), Positives = 31/77 (40%)
Query: 32 RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK 91
+ Q +++ ++ Q +K E + K K + + E KK + EK
Sbjct: 281 KLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEK 340
Query: 92 EAANIRKKEEDLKKKEK 108
+ I + EE ++K E
Sbjct: 341 KKKQIERLEERIEKLEV 357
>gnl|CDD|236184 PRK08203, PRK08203, hydroxydechloroatrazine ethylaminohydrolase;
Reviewed.
Length = 451
Score = 32.5 bits (75), Expect = 0.33
Identities = 16/44 (36%), Positives = 19/44 (43%), Gaps = 7/44 (15%)
Query: 265 GGLDPVEVFESLPDVLKKCFELQDT-------AMLQEAIASLSP 301
GGL P V E +L L D AML+ A+A SP
Sbjct: 165 GGLPPDSVVEDEDAILADSQRLIDRYHDPGPGAMLRIALAPCSP 208
>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
Length = 535
Score = 32.5 bits (74), Expect = 0.33
Identities = 21/64 (32%), Positives = 34/64 (53%)
Query: 46 ENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKK 105
+ E +KE L+ EE +K E EQ + ELK+ + L + A ++ +K+E+L
Sbjct: 63 KRESKALKKELLLEAKEEARKYREEIEQEFKSERQELKQIESRLTERATSLDRKDENLSS 122
Query: 106 KEKI 109
KEK
Sbjct: 123 KEKT 126
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 32.7 bits (75), Expect = 0.36
Identities = 28/132 (21%), Positives = 66/132 (50%), Gaps = 4/132 (3%)
Query: 38 ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGA---NNLDELKKALAELEKEAA 94
+ + E E ++ + EK +K E ++ + E E+ ++E+ L EL +E
Sbjct: 165 KNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELE 224
Query: 95 NIRKKEEDLKK-KEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVKE 153
+ K+ ++L++ KE+I + S G + + K L++ + +++ +E++ VKE
Sbjct: 225 KLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKE 284
Query: 154 NEKLIKQYANYL 165
++L ++ Y+
Sbjct: 285 LKELKEKAEEYI 296
Score = 30.4 bits (69), Expect = 1.8
Identities = 21/76 (27%), Positives = 37/76 (48%), Gaps = 4/76 (5%)
Query: 35 ARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQ----AGANNLDELKKALAELE 90
R+E ++++ E EK +E E+ + EE K K E E+ ++L+K L ELE
Sbjct: 338 ERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELE 397
Query: 91 KEAANIRKKEEDLKKK 106
K I ++ + +
Sbjct: 398 KAKEEIEEEISKITAR 413
Score = 29.3 bits (66), Expect = 3.7
Identities = 18/70 (25%), Positives = 37/70 (52%), Gaps = 1/70 (1%)
Query: 42 EQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEE 101
E++ E++++ +E+ L+ E E E+ +E+KK L +L++E K ++
Sbjct: 653 EKKYSEEEYEELREEYLELSRELAGLRAELEEL-EKRREEIKKTLEKLKEELEEREKAKK 711
Query: 102 DLKKKEKITP 111
+L+K EK
Sbjct: 712 ELEKLEKALE 721
Score = 28.9 bits (65), Expect = 5.0
Identities = 27/88 (30%), Positives = 43/88 (48%), Gaps = 10/88 (11%)
Query: 32 RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESE---------QAGANNLDEL 82
R + + +EE+ E E+ ++ E+ K L+E +K+L E E +A L+ L
Sbjct: 318 RLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERL 377
Query: 83 KKALAELEKEAANIRKKEEDLKKKEKIT 110
KK L L E ++ EE K KE+I
Sbjct: 378 KKRLTGLTPEKLE-KELEELEKAKEEIE 404
Score = 28.5 bits (64), Expect = 5.8
Identities = 26/74 (35%), Positives = 46/74 (62%), Gaps = 4/74 (5%)
Query: 38 ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDEL---KKALAELEKEAA 94
ER+EE + E E+ + EK K LK L+E ++ ++ + LDEL +K L+ LE+E
Sbjct: 266 ERIEELKKEIEELE-EKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEIN 324
Query: 95 NIRKKEEDLKKKEK 108
I ++ ++L++KE+
Sbjct: 325 GIEERIKELEEKEE 338
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 31.5 bits (72), Expect = 0.38
Identities = 25/85 (29%), Positives = 44/85 (51%), Gaps = 10/85 (11%)
Query: 34 QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQ------AGANNLD----ELK 83
Q + ++EQ E ++ QE ++ L TLE ++L + A A LD EL+
Sbjct: 79 QEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEENRELR 138
Query: 84 KALAELEKEAANIRKKEEDLKKKEK 108
+ LAEL++E + + E L++ E+
Sbjct: 139 EELAELKQENEALEAENERLQENEQ 163
Score = 28.1 bits (63), Expect = 6.0
Identities = 16/75 (21%), Positives = 36/75 (48%), Gaps = 6/75 (8%)
Query: 38 ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELE------K 91
ER+ E + E + Q+E + + L E +++ E +Q + EL++ EL
Sbjct: 66 ERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSA 125
Query: 92 EAANIRKKEEDLKKK 106
A + ++ +L+++
Sbjct: 126 NAIELDEENRELREE 140
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 32.6 bits (75), Expect = 0.40
Identities = 19/78 (24%), Positives = 40/78 (51%), Gaps = 11/78 (14%)
Query: 34 QARVERMEEQRL---ENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELE 90
QA+++ + +Q+L E++ QQ+ E+TL L++ + + E ++LK+ LA+
Sbjct: 42 QAQLDALNKQKLLEAEDKLVQQDLEQTLALLDKIDR--QKEE------TEQLKQQLAQAP 93
Query: 91 KEAANIRKKEEDLKKKEK 108
+ + + E LK
Sbjct: 94 AKLRQAQAELEALKDDND 111
>gnl|CDD|220297 pfam09581, Spore_III_AF, Stage III sporulation protein AF
(Spore_III_AF). This family represents the stage III
sporulation protein AF (Spore_III_AF) of the bacterial
endospore formation program, which exists in some but
not all members of the Firmicutes (formerly called
low-GC Gram-positives). The C-terminal region of these
proteins is poorly conserved.
Length = 185
Score = 31.5 bits (72), Expect = 0.43
Identities = 22/125 (17%), Positives = 50/125 (40%), Gaps = 11/125 (8%)
Query: 41 EEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKE 100
E +E++K + + + LEE K+L +++ K ++ + + E
Sbjct: 61 IENEIESKKKEIQASQRAYILEEYAKQL-------EKQVEKKLKEEYGVKVKDVEVEIDE 113
Query: 101 EDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVKENEKLIKQ 160
+ I V+ K + +K P+ ++ + KE+E ++ +E EK+
Sbjct: 114 DLESNNFDIK--EVNVTLKEESKEKQKSKVEPVVIDTQTSKPKEEEEES--EEAEKIKNF 169
Query: 161 YANYL 165
A+
Sbjct: 170 LADEY 174
>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein. This model
represents a relatively rare but broadly distributed
uncharacterized protein family, distributed in 1-2
percent of bacterial genomes, all of which have outer
membranes. In many of these genomes, it is part of a
two-gene pair.
Length = 555
Score = 32.0 bits (73), Expect = 0.44
Identities = 21/75 (28%), Positives = 33/75 (44%), Gaps = 6/75 (8%)
Query: 37 VERMEEQRLENEKFQQEKEKTLK------TLEETKKKLVESEQAGANNLDELKKALAELE 90
VE EE + QQ+ +K K LEE KK+L ++ N EL+K E +
Sbjct: 163 VETAEEIEKSLKAMQQKWKKRKKDLPNKQDLEEYKKRLEAIKKKDIKNPLELQKIKEEFD 222
Query: 91 KEAANIRKKEEDLKK 105
K + ++ +K
Sbjct: 223 KLKKEGKADKQKIKS 237
>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
replication, recombination, and repair].
Length = 753
Score = 31.9 bits (73), Expect = 0.52
Identities = 15/70 (21%), Positives = 25/70 (35%), Gaps = 2/70 (2%)
Query: 38 ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIR 97
E +E+ E E +L E K ++V G ++A E E ++
Sbjct: 525 EEVEKLLDEVELLTGANSGGKTSLLELKAQIVVLAHMGLP--VPAEEAKVEAVDEVKFLK 582
Query: 98 KKEEDLKKKE 107
KK+ L
Sbjct: 583 KKKGILDAGA 592
Score = 30.8 bits (70), Expect = 1.4
Identities = 21/76 (27%), Positives = 28/76 (36%), Gaps = 5/76 (6%)
Query: 36 RVERMEEQRLEN-EKFQQEKEKTLKTLEETKKKLVESEQAGANNLD-ELKKALAELEKEA 93
V + E+ LE EK E E K L+E + L E + EA
Sbjct: 515 EVRKELEEELEEVEKLLDEVELLTGANSGGKTSLLELKAQIVVLAHMGLPVPAEEAKVEA 574
Query: 94 ANIRKKEEDLKKKEKI 109
+ K LKKK+ I
Sbjct: 575 VDEVKF---LKKKKGI 587
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 31.6 bits (72), Expect = 0.58
Identities = 22/124 (17%), Positives = 52/124 (41%), Gaps = 11/124 (8%)
Query: 50 FQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDL------ 103
+ E++K + E +KKL + + N L++ + L ELEK A +R+K E L
Sbjct: 273 EKFERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQKGELLYANLQL 332
Query: 104 --KKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVKENEKLIKQY 161
+ + + + + K ++K + + +K +++K ++ + +
Sbjct: 333 IEEGLKSVRLADFYGNEE---IKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSEL 389
Query: 162 ANYL 165
+
Sbjct: 390 KEAI 393
>gnl|CDD|145900 pfam02994, Transposase_22, L1 transposable element.
Length = 370
Score = 31.5 bits (71), Expect = 0.62
Identities = 30/122 (24%), Positives = 51/122 (41%), Gaps = 9/122 (7%)
Query: 42 EQRLENEKFQQEKEKTLKTLEETKKKLVES----EQAGANNLDELKKALAELEKEAANIR 97
R + E +E + LK LEE K VE+ + L ELKK EL++E +I+
Sbjct: 59 SLREDIETKGKEVQNFLKELEECITKQVEAHIENTEKCLKELMELKKKARELKEECRSIK 118
Query: 98 KKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVKENEKL 157
K+ + + + + N+ +++ E+ +E+K K NEK
Sbjct: 119 KEHMETTLDIENQKKRQGVVD-----TSFTNRIQEMEERISGAEDSIEEIKRTGKFNEKR 173
Query: 158 IK 159
K
Sbjct: 174 KK 175
>gnl|CDD|182767 PRK10836, PRK10836, lysine transporter; Provisional.
Length = 489
Score = 31.7 bits (72), Expect = 0.64
Identities = 11/15 (73%), Positives = 12/15 (80%)
Query: 324 AAGESEDPQENTPDA 338
AAGESEDP +N P A
Sbjct: 227 AAGESEDPAKNIPRA 241
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 31.5 bits (72), Expect = 0.67
Identities = 21/51 (41%), Positives = 27/51 (52%), Gaps = 2/51 (3%)
Query: 42 EQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGAN-NLDELKKALAELEK 91
EQ+L+NE Q E + +E K+K VE E+ G L EL L LEK
Sbjct: 255 EQKLKNELALQAIELQREFNKEIKEK-VEEERNGRLAKLAELNSRLKGLEK 304
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 31.2 bits (71), Expect = 0.72
Identities = 24/104 (23%), Positives = 37/104 (35%), Gaps = 4/104 (3%)
Query: 46 ENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKK 105
ENE QQ+++ K ++ KK + L EL E E + N +KK++ KK
Sbjct: 7 ENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKK 66
Query: 106 KEKITPWNVDTISKPGFTKT----VVNKKPPLQQGEKSDEEKEQ 145
K+K D + N E K+
Sbjct: 67 KKKNLGEAYDLAYDLPVVWSSAAFQDNSHIRKLGNWPEQEWKQT 110
>gnl|CDD|222290 pfam13654, AAA_32, AAA domain. This family includes a wide variety
of AAA domains including some that have lost essential
nucleotide binding residues in the P-loop.
Length = 509
Score = 31.3 bits (72), Expect = 0.77
Identities = 24/79 (30%), Positives = 39/79 (49%), Gaps = 15/79 (18%)
Query: 235 KERIRNRAKEKLDEALKEVEEEERQK----RLGPGGLD--PV--------EVFESLPDVL 280
KE I +EK +EA +E+EEE ++K PGG P+ E FE+LP+
Sbjct: 117 KEEIEEEFQEKREEAFEELEEEAKEKGFALVRTPGGFVFAPLKDGEPLTEEEFEALPEEE 176
Query: 281 KKCFELQDTAMLQEAIASL 299
++ E + L+E + +
Sbjct: 177 REELE-EKIDELEEELQEI 194
Score = 28.2 bits (64), Expect = 7.6
Identities = 20/66 (30%), Positives = 34/66 (51%), Gaps = 10/66 (15%)
Query: 41 EEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKE 100
+ + L E+F+ E+ + LEE +L E L E+ + L ELE+EA R+K
Sbjct: 160 DGEPLTEEEFEALPEEEREELEEKIDELEEE-------LQEILRQLRELEREA---REKL 209
Query: 101 EDLKKK 106
+L ++
Sbjct: 210 RELDRE 215
>gnl|CDD|184539 PRK14150, PRK14150, heat shock protein GrpE; Provisional.
Length = 193
Score = 30.7 bits (70), Expect = 0.81
Identities = 24/81 (29%), Positives = 40/81 (49%), Gaps = 13/81 (16%)
Query: 41 EEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGAN-NLDELKKALAELEK-------- 91
EEQ+ E+ +E E ++ EE + +E E A+ + EL+ LAE +
Sbjct: 4 EEQKTPQEQVSEEIEAEVEQQEEADEAELEDELDEADARIAELEAQLAEAQAEERDSVLR 63
Query: 92 ---EAANIRKK-EEDLKKKEK 108
E NIR++ E+D++K K
Sbjct: 64 ARAEVENIRRRAEQDVEKAHK 84
>gnl|CDD|236681 PRK10401, PRK10401, DNA-binding transcriptional regulator GalS;
Provisional.
Length = 346
Score = 30.9 bits (70), Expect = 0.88
Identities = 20/66 (30%), Positives = 29/66 (43%), Gaps = 17/66 (25%)
Query: 139 SDEEKEQEMKAFVKENEKLIKQYANYLVIWCINLAMDEKFDLMDHV-------------A 185
S E E+E A E LI+Q N L++ L+ DE MD + A
Sbjct: 97 SYHEAEKERHAI----EVLIRQRCNALIVHSKALSDDELAQFMDQIPGMVLINRVVPGYA 152
Query: 186 HQCICM 191
H+C+C+
Sbjct: 153 HRCVCL 158
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 31.1 bits (70), Expect = 0.93
Identities = 22/83 (26%), Positives = 37/83 (44%), Gaps = 9/83 (10%)
Query: 33 HQARVERMEEQRLENEKFQQEKEKTLKTLEET---------KKKLVESEQAGANNLDELK 83
Q ER++ Q + + + EK+ L+ ++ +KK E+ +A A
Sbjct: 104 KQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKL 163
Query: 84 KALAELEKEAANIRKKEEDLKKK 106
KA AE +K+A K E+ K K
Sbjct: 164 KAAAEAKKKAEEAAKAAEEAKAK 186
>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein. Hsp70 chaperones help to fold
many proteins. Hsp70 assisted folding involves repeated
cycles of substrate binding and release. Hsp70 activity
is ATP dependent. Hsp70 proteins are made up of two
regions: the amino terminus is the ATPase domain and the
carboxyl terminus is the substrate binding region.
Length = 598
Score = 31.0 bits (71), Expect = 0.96
Identities = 18/69 (26%), Positives = 32/69 (46%), Gaps = 6/69 (8%)
Query: 37 VERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANI 96
+ERM + + E++ E +K K E K + E + +L E L E +K+ +
Sbjct: 506 IERMVK---DAEEYAAE-DKKRKERIEAKNEAEEYVYSLEKSLKEEGDKLPEADKKK--V 559
Query: 97 RKKEEDLKK 105
+ E LK+
Sbjct: 560 EEAIEWLKE 568
>gnl|CDD|234776 PRK00478, scpA, segregation and condensation protein A/unknown
domain fusion protein; Provisional.
Length = 505
Score = 31.1 bits (70), Expect = 0.98
Identities = 16/52 (30%), Positives = 29/52 (55%), Gaps = 1/52 (1%)
Query: 55 EKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKK 106
+ +E+ +++V+ +Q L E K + E E+ +N +K+EE LKKK
Sbjct: 415 PTKIVNSQESPQEMVQRQQEQEQALKESIKQIQE-ERAISNFQKREEYLKKK 465
>gnl|CDD|163064 TIGR02894, DNA_bind_RsfA, transcription factor, RsfA family. In a
subset of endospore-forming members of the Firmcutes,
members of this protein family are found, several to a
genome. Two very strongly conserved sequences regions
are separated by a highly variable linker region. Much
of the linker region was excised from the seed alignment
for this model. A characterized member is the
prespore-specific transcription RsfA from Bacillus
subtilis, previously called YwfN, which is controlled by
sigma factor F and seems to fine-tune expression of some
genes in the sigma-F regulon. A paralog in Bacillus
subtilis is designated YlbO [Regulatory functions, DNA
interactions, Cellular processes, Sporulation and
germination].
Length = 161
Score = 30.1 bits (68), Expect = 1.0
Identities = 24/92 (26%), Positives = 39/92 (42%), Gaps = 13/92 (14%)
Query: 29 FRWRHQARVERMEEQRLENEKFQQEKEKTLK----TLE------ETKKKLVESEQAGANN 78
FRW A V + E+ +E K +Q KE + TL+ + K S+QA
Sbjct: 49 FRW--NAYVRKQYEEAIELAK-KQRKELKREAGSLTLQDVISFLQNLKTTNPSDQALQKE 105
Query: 79 LDELKKALAELEKEAANIRKKEEDLKKKEKIT 110
+ LK L+K + K+ E L+++
Sbjct: 106 NERLKNQNESLQKRNEELEKELEKLRQRLSTI 137
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 30.6 bits (69), Expect = 1.1
Identities = 21/69 (30%), Positives = 35/69 (50%), Gaps = 4/69 (5%)
Query: 38 ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIR 97
E EE + + E+ Q+EKE+ LK LEE + + + L L+ + LE+ +
Sbjct: 135 EDYEELKEKLEELQKEKEELLKELEELEAEY----EEVQERLKRLEVENSRLEEMLKKLP 190
Query: 98 KKEEDLKKK 106
+ DLKK+
Sbjct: 191 GEVYDLKKR 199
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 30.6 bits (69), Expect = 1.1
Identities = 13/55 (23%), Positives = 24/55 (43%)
Query: 38 ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKE 92
ER+ ++ E EK + + L L+E K++L E + + +LE
Sbjct: 80 ERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQYLREYNLFDRNNLQLEDN 134
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 30.5 bits (69), Expect = 1.2
Identities = 11/73 (15%), Positives = 31/73 (42%)
Query: 35 ARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAA 94
A + EE + + ++++ LK + + + + D ++A A+ ++E
Sbjct: 348 ALTKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKL 407
Query: 95 NIRKKEEDLKKKE 107
+ E+ K++
Sbjct: 408 KQEENEKKQKEQA 420
>gnl|CDD|153327 cd07643, I-BAR_IMD_MIM, Inverse (I)-BAR, also known as the
IRSp53/MIM homology Domain (IMD), of Missing In
Metastasis. The IMD domain, also called
Inverse-Bin/Amphiphysin/Rvs (I-BAR) domain, is a
dimerization and lipid-binding module that bends
membranes and induces membrane protrusions. Members of
this subfamily include missing in metastasis (MIM) or
metastasis suppressor 1 (MTSS1), metastasis suppressor
1-like (MTSSL) or ABBA (Actin-Bundling protein with
BAIAP2 homology), and similar proteins. They contain an
N-terminal IMD and a WASP homology 2 (WH2) actin-binding
motif at the C-terminus. MIM was originally identified
as a missing transcript from metastatic bladder and
prostate cancer cells. It is a scaffold protein that
functions in a signaling pathway between the PDGF
receptor, Src kinases, and actin assembly. It may also
function as a cofactor of the Sonic hedgehog (Shh)
transcriptional pathway and may participate in tumor
development and progression via this pathway. ABBA
regulates actin and plasma membrane dynamics to promote
the extension of radial glia, which is important in
neuronal migration, axon guidance and neurogenesis. The
IMD domain of MIM binds and bundles actin filaments,
binds membranes, and interacts with the small GTPase
Rac.
Length = 231
Score = 30.1 bits (68), Expect = 1.3
Identities = 34/135 (25%), Positives = 59/135 (43%), Gaps = 21/135 (15%)
Query: 79 LDELKKALAELEKEAANIRKK-EEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGE 137
++E KK +L+K+ A KK +++KKK + DTI K K LQ
Sbjct: 106 IEEWKKVANQLDKDHAKEYKKARQEIKKK------SSDTIR---LQKKARKGKGDLQPQL 156
Query: 138 KSDEEKEQEMKAFVKENEK------LIKQYANY-LVIWCINLAMDEKFDLMDHVAHQCIC 190
S + + ++E EK LI++ + + + +DE+ ++ V H
Sbjct: 157 DSAMQDVNDKYLLLEETEKKAVRNALIEERGRFCTFVSFLKPVLDEEISMLGEVTH---- 212
Query: 191 MQYLLELSKSLNIDP 205
+Q ++E SL DP
Sbjct: 213 LQTIMEDLASLTADP 227
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 30.5 bits (69), Expect = 1.4
Identities = 40/195 (20%), Positives = 67/195 (34%), Gaps = 28/195 (14%)
Query: 58 LKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEKITPWNVDTI 117
L T E + K+ + D+ ++ + EK+ +KKE+ K E T
Sbjct: 39 LSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKK--KKKEKKEPKSEGETK---LGF 93
Query: 118 SKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVKENEKL-----IKQYANYLVIWCINL 172
P +K K P + E D + + K N + + I N+
Sbjct: 94 KTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEKSNVYIGAHVSASGGVDNSPINAYNI 153
Query: 173 AMDEKFDLM-------DHVAHQCICMQYLLELSKSLNIDPRACI--GSFFSKIQIAEPDY 223
A + F L + + E K N DP+ + GS+ I +A PD
Sbjct: 154 A-GQAFALFLKNQRTWNSPPLSDETIDKFKENCKKYNYDPKFILPHGSYL--INLANPDK 210
Query: 224 ------KASFDDELK 232
+F D+L+
Sbjct: 211 EKREKSYDAFLDDLQ 225
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 30.7 bits (70), Expect = 1.4
Identities = 16/61 (26%), Positives = 32/61 (52%)
Query: 48 EKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKE 107
+K ++ EK K EE KK+ + AG +E ++ + E+E ++ E+ K++E
Sbjct: 410 KKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEE 469
Query: 108 K 108
+
Sbjct: 470 E 470
>gnl|CDD|147523 pfam05384, DegS, Sensor protein DegS. This is small family of
Bacillus DegS proteins. The DegS-DegU two-component
regulatory system of Bacillus subtilis controls various
processes that characterize the transition from the
exponential to the stationary growth phase, including
the induction of extracellular degradative enzymes,
expression of late competence genes and down-regulation
of the sigma D regulon. The family also contains one
sequence from Thermoanaerobacter tengcongensis which is
described as a sensory transduction histidine kinase.
Length = 159
Score = 29.5 bits (67), Expect = 1.4
Identities = 16/55 (29%), Positives = 36/55 (65%)
Query: 55 EKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEKI 109
EKT+ T+EE+K+++ E + + L++ L E+++E + + K+ + L+K+E+
Sbjct: 5 EKTIDTVEESKEEIFEIAEQARQEYERLEQELEEVKEEVSEVIKEVDKLEKQERR 59
>gnl|CDD|216654 pfam01702, TGT, Queuine tRNA-ribosyltransferase. This is a family
of queuine tRNA-ribosyltransferases EC:2.4.2.29, also
known as tRNA-guanine transglycosylase and guanine
insertion enzyme. Queuine tRNA-ribosyltransferase
modifies tRNAs for asparagine, aspartic acid, histidine
and tyrosine with queuine. It catalyzes the exchange of
guanine-34 at the wobble position with
7-aminomethyl-7-deazaguanine, and the addition of a
cyclopentenediol moiety to
7-aminomethyl-7-deazaguanine-34 tRNA; giving a
hypermodified base queuine in the wobble position. The
aligned region contains a zinc binding motif
C-x-C-x2-C-x29-H, and important tRNA and
7-aminomethyl-7deazaguanine binding residues.
Length = 237
Score = 30.0 bits (68), Expect = 1.4
Identities = 25/108 (23%), Positives = 36/108 (33%), Gaps = 18/108 (16%)
Query: 216 IQIAE---PDYKASFDD--ELKSFKERIR---NRAKEKLDEALKEVEEEERQKRLGP--G 265
I+I D D+ + +ER R D L+ + + Q G G
Sbjct: 4 IEIQNALGSDIIMPLDECTPYPAPRERAEKSVERTLRWADRCLEAHKRKG-QALFGIVQG 62
Query: 266 GLDPVEVFESLPDVLKKCFELQDTAMLQEAIASLSPEEATYHMKRCVD 313
GL ES ++ + F+ AI LS E M VD
Sbjct: 63 GLYEDLREESAEELAEDGFDGY-------AIGGLSVGEEKEEMLELVD 103
>gnl|CDD|223520 COG0443, DnaK, Molecular chaperone [Posttranslational modification,
protein turnover, chaperones].
Length = 579
Score = 30.4 bits (69), Expect = 1.5
Identities = 20/100 (20%), Positives = 43/100 (43%), Gaps = 12/100 (12%)
Query: 9 DIEISDDEDETHPNIDTPSLFRWRHQARVE------RMEEQRLENEKFQQEKEKTLKTLE 62
D EI ++ N FR +AR E +E+ E K +E+++ ++
Sbjct: 485 DEEIERMVEDAEANAALDKKFRELVEARNEAESLIYSLEKALKEIVKVSEEEKEKIEEAI 544
Query: 63 ETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEED 102
++ +E E+ +E+K + EL++ + +K+
Sbjct: 545 TDLEEALEGEK------EEIKAKIEELQEVTQKLAEKKYQ 578
>gnl|CDD|232961 TIGR00408, proS_fam_I, prolyl-tRNA synthetase, family I.
Prolyl-tRNA synthetase is a class II tRNA synthetase and
is recognized by pfam model tRNA-synt_2b, which
recognizes tRNA synthetases for Gly, His, Ser, and Pro.
The prolyl-tRNA synthetases are divided into two widely
divergent families. This family includes the archaeal
enzyme, the Pro-specific domain of a human
multifunctional tRNA ligase, and the enzyme from the
spirochete Borrelia burgdorferi. The other family
includes enzymes from Escherichia coli, Bacillus
subtilis, Synechocystis PCC6803, and one of the two
prolyL-tRNA synthetases of Saccharomyces cerevisiae
[Protein synthesis, tRNA aminoacylation].
Length = 472
Score = 30.5 bits (69), Expect = 1.5
Identities = 13/47 (27%), Positives = 25/47 (53%)
Query: 215 KIQIAEPDYKASFDDELKSFKERIRNRAKEKLDEALKEVEEEERQKR 261
K Q++ + + L + +E +RNRA E+ ++ + VE E K+
Sbjct: 365 KYQVSLDQLEERVVELLNNIQENLRNRAWERFEQKIVIVETLEEIKQ 411
>gnl|CDD|148614 pfam07106, TBPIP, Tat binding protein 1(TBP-1)-interacting protein
(TBPIP). This family consists of several eukaryotic
TBP-1 interacting protein (TBPIP) sequences. TBP-1 has
been demonstrated to interact with the human
immunodeficiency virus type 1 (HIV-1) viral protein Tat,
then modulate the essential replication process of HIV.
In addition, TBP-1 has been shown to be a component of
the 26S proteasome, a basic multiprotein complex that
degrades ubiquitinated proteins in an ATP-dependent
fashion. Human TBPIP interacts with human TBP-1 then
modulates the inhibitory action of human TBP-1 on
HIV-Tat-mediated transactivation.
Length = 169
Score = 29.6 bits (67), Expect = 1.5
Identities = 20/65 (30%), Positives = 30/65 (46%), Gaps = 7/65 (10%)
Query: 40 MEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKK 99
ME + L E Q ++ TLE K L +EL++ + EL+KE I +K
Sbjct: 79 MEIEELREEV--QLLKQDCSTLEIELKSLTSDLT-----TEELQEEIQELKKEVREIEEK 131
Query: 100 EEDLK 104
E L+
Sbjct: 132 LESLE 136
>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
Length = 925
Score = 30.4 bits (69), Expect = 1.6
Identities = 27/150 (18%), Positives = 59/150 (39%), Gaps = 13/150 (8%)
Query: 8 KDIEISDDEDETHPNIDTPSLFRWRHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKK 67
+ I I + N + ++ EE+RL+ + ++E+ L EE +
Sbjct: 782 EKILIITEILSDGINNSDINDRPQELIDQIIESEEERLKALR--IQREEMLMRPEELELI 839
Query: 68 LVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEKITPWNVDTISKPGFTKTVV 127
E + E+K L+E+++ + + + K + V+ + P ++
Sbjct: 840 NEEQKNL----KQEIKLELSEIQE---AEEEIQNINENKNE----FVEFKNDPKKLNKLI 888
Query: 128 NKKPPLQQGEKSDEEKEQEMKAFVKENEKL 157
K L + S +E +Q+ K K+ + L
Sbjct: 889 IAKDVLIKLVISSDEIKQDEKTTKKKKKDL 918
>gnl|CDD|149815 pfam08870, DUF1832, Domain of unknown function (DUF1832). This
family of proteins are functionally uncharacterized.
Length = 116
Score = 28.8 bits (65), Expect = 1.6
Identities = 15/69 (21%), Positives = 28/69 (40%), Gaps = 5/69 (7%)
Query: 103 LKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVKENEKLIKQYA 162
LK++ +TPWN+ + + F ++ + P K D E + F+ E + Y
Sbjct: 15 LKRRTGLTPWNI--LCRWAFCLSLESGSIPDGAEIKLDSGLEIDKYTFLGEYGDI---YI 69
Query: 163 NYLVIWCIN 171
L +
Sbjct: 70 ALLKQRYVQ 78
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 30.5 bits (69), Expect = 1.7
Identities = 19/73 (26%), Positives = 39/73 (53%)
Query: 36 RVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAAN 95
R+E ++ + LE E + E K + ++L+E + L+EL++ + ELE+E
Sbjct: 250 RLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEG 309
Query: 96 IRKKEEDLKKKEK 108
+R E+L++ +
Sbjct: 310 LRALLEELEELLE 322
Score = 29.3 bits (66), Expect = 3.3
Identities = 19/76 (25%), Positives = 40/76 (52%), Gaps = 2/76 (2%)
Query: 34 QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
+ ++ + E + + +E+ K LK LEE +++ E E ++ L++ LAELE+E
Sbjct: 191 EGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELE--QEIEALEERLAELEEEK 248
Query: 94 ANIRKKEEDLKKKEKI 109
+ + + L + E +
Sbjct: 249 ERLEELKARLLEIESL 264
Score = 29.3 bits (66), Expect = 3.5
Identities = 21/75 (28%), Positives = 38/75 (50%)
Query: 34 QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
+ E +E++ E+ E E+ + LEE K +L+E E L ++ L ELE+
Sbjct: 224 EQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLL 283
Query: 94 ANIRKKEEDLKKKEK 108
+ +K E L++ E+
Sbjct: 284 EELEEKIERLEELER 298
Score = 28.2 bits (63), Expect = 7.2
Identities = 22/72 (30%), Positives = 40/72 (55%)
Query: 38 ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIR 97
ER+EE E EK + ++ + ++E K++L E A +EL++ ELE+ +
Sbjct: 368 ERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELE 427
Query: 98 KKEEDLKKKEKI 109
+ EE++KK E+
Sbjct: 428 ELEEEIKKLEEQ 439
Score = 28.2 bits (63), Expect = 9.3
Identities = 21/89 (23%), Positives = 39/89 (43%), Gaps = 7/89 (7%)
Query: 27 SLFRWRHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESE-QAGANNLDELKKA 85
+ EE + EK + E+ + E+ + + ++ E + + L ELK+
Sbjct: 515 EELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKEL 574
Query: 86 LAELEK------EAANIRKKEEDLKKKEK 108
L EL E +R++ ++LKKK K
Sbjct: 575 LEELRLLRTRKEELEELRERLKELKKKLK 603
>gnl|CDD|131606 TIGR02555, OrgA_MxiK, type III secretion apparatus protein
OrgA/MxiK. This gene is found in type III secretion
operons and has been shown to be essential for the
invasion phenotype in Salmonella and a component of the
secretion apparatus. The protein is known as OrgA in
Salmonella due to its oxygen-dependent expression
pattern in which low-oxygen levels up-regulate the gene.
In Shigella the ghene is called MxiK and has been shown
to be sessential for the proper assembly of the
secretion needle complex.
Length = 185
Score = 29.8 bits (67), Expect = 1.7
Identities = 18/75 (24%), Positives = 28/75 (37%), Gaps = 11/75 (14%)
Query: 162 ANYLVIWCINLAMDEKFDLMDHVAHQCICMQYLLELSKSLNIDPRAC--IGSFFSKIQIA 219
AN +I L+ DE F L + LL P+ IG + ++A
Sbjct: 33 ANEWIINHYRLSCDEDFALRGSPTAMFVDNWNLL---------PQIAYLIGCQRLRARLA 83
Query: 220 EPDYKASFDDELKSF 234
E + D ++SF
Sbjct: 84 ERGRFINLPDWVQSF 98
>gnl|CDD|224937 COG2026, RelE, Cytotoxic translational repressor of toxin-antitoxin
stability system [Translation, ribosomal structure and
biogenesis / Cell division and chromosome partitioning].
Length = 90
Score = 28.4 bits (64), Expect = 1.7
Identities = 13/47 (27%), Positives = 20/47 (42%)
Query: 222 DYKASFDDELKSFKERIRNRAKEKLDEALKEVEEEERQKRLGPGGLD 268
+ ELK ++IR + K+KL E L+ K+L G
Sbjct: 6 EIHPKALKELKKLDKKIRKKIKDKLKELLENPPPIRHGKKLRKGLSG 52
>gnl|CDD|225968 COG3434, COG3434, Predicted signal transduction protein containing
EAL and modified HD-GYP domains [Signal transduction
mechanisms].
Length = 407
Score = 30.1 bits (68), Expect = 1.8
Identities = 17/73 (23%), Positives = 28/73 (38%), Gaps = 1/73 (1%)
Query: 191 MQYLLELSKSLNIDPRACIGSFFSKIQIAEPDYKASFDDELKSFKERIRNRAKEKLDEAL 250
YLL L + I + D+K D+ + F + K+ L E +
Sbjct: 110 KGYLLALD-DFIFSNVSEWKPLLPLSDIVKIDFKRVTFDKARLFDRDLGYINKKFLAEKV 168
Query: 251 KEVEEEERQKRLG 263
+ EE E+ K+ G
Sbjct: 169 ETEEEFEQAKKAG 181
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
This model family represents the major capsid protein
component of the heads (capsids) of bacteriophage HK97,
phi-105, P27, and related phage. This model represents
one of several analogous families lacking detectable
sequence similarity. The gene encoding this component is
typically located in an operon encoding the small and
large terminase subunits, the portal protein and the
prohead or maturation protease [Mobile and
extrachromosomal element functions, Prophage functions].
Length = 384
Score = 30.0 bits (68), Expect = 2.0
Identities = 13/66 (19%), Positives = 24/66 (36%), Gaps = 4/66 (6%)
Query: 47 NEKFQQEKEKTLKTLEETKKKLVESEQAGANNL----DELKKALAELEKEAANIRKKEED 102
E + ++ K E+ K E E+A + L D+L + LE+ + K
Sbjct: 5 REALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAA 64
Query: 103 LKKKEK 108
+
Sbjct: 65 SGEGGG 70
>gnl|CDD|222499 pfam14022, DUF4238, Protein of unknown function (DUF4238). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria, archaea and
eukaryotes. Proteins in this family are typically
between 274 and 374 amino acids in length.
Length = 268
Score = 29.4 bits (66), Expect = 2.4
Identities = 18/108 (16%), Positives = 48/108 (44%), Gaps = 9/108 (8%)
Query: 77 NNLDELKKALAELEKEAANIRKKEEDLKKKEKITPWNVDTISK---------PGFTKTVV 127
+ DE++K L+++E A+ I K D + +TP + + ++ P F + ++
Sbjct: 52 GDDDEIEKLLSKIEGRASPIIDKIIDGNRSALLTPEDKEKLALFLALQYLRTPAFRERIL 111
Query: 128 NKKPPLQQGEKSDEEKEQEMKAFVKENEKLIKQYANYLVIWCINLAMD 175
+ + + EE ++ +K +K E+ ++ + ++ +
Sbjct: 112 SFLEAIARQVDELEEADEAIKELLKGIEEDTEELPDLSLLLLLKSLRK 159
>gnl|CDD|204739 pfam11768, DUF3312, Protein of unknown function (DUF3312). This is
a eukaryotic family of uncharacterized proteins. This
family shows similarity to WD40 repeat proteins.
Length = 544
Score = 29.7 bits (67), Expect = 2.5
Identities = 12/33 (36%), Positives = 20/33 (60%)
Query: 230 ELKSFKERIRNRAKEKLDEALKEVEEEERQKRL 262
E + + ++N+ EKL + LKE+EE +Q R
Sbjct: 6 ESRGYPWTLKNKRPEKLRDNLKELEELLQQSRC 38
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 29.6 bits (67), Expect = 2.6
Identities = 22/96 (22%), Positives = 42/96 (43%), Gaps = 8/96 (8%)
Query: 36 RVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK---- 91
+E+ ++ EK K L+ L++ ++L + +A N EL + +AE EK
Sbjct: 208 ELEKARQELRSLSVKISEKRKQLEELQQELQELTIAIEALTNKKSELLEEIAEAEKIREE 267
Query: 92 ----EAANIRKKEEDLKKKEKITPWNVDTISKPGFT 123
A I K + + +K+T W ++S +
Sbjct: 268 CRGWSAKEISKLKAKVSLLQKLTGWKFLSLSGSTLS 303
>gnl|CDD|237551 PRK13902, alaS, alanyl-tRNA synthetase; Provisional.
Length = 900
Score = 29.8 bits (68), Expect = 2.6
Identities = 15/66 (22%), Positives = 35/66 (53%), Gaps = 4/66 (6%)
Query: 48 EKFQQEKEKTLKTLEETKKKLVESEQAGANN----LDELKKALAELEKEAANIRKKEEDL 103
E+F +E ++ K +E+ +K+L E + + + +K + E++ + +RK +L
Sbjct: 764 ERFFEEWKEQKKEIEKLRKELAELLASELLSKAEEVGGVKVVVEEVDADEDELRKIANEL 823
Query: 104 KKKEKI 109
K+ K+
Sbjct: 824 AKEGKV 829
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 29.9 bits (68), Expect = 2.6
Identities = 21/131 (16%), Positives = 41/131 (31%), Gaps = 9/131 (6%)
Query: 30 RWRHQARVERMEEQRLENE-KFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAE 88
+ R +AR R+E ++ E + ++ E ++ + +A + A
Sbjct: 452 KARFEARQARLEREKAAREARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAG 511
Query: 89 LEKEAANIRKKEEDLKKKEKITPWNVDTISKPGFTKTVV--------NKKPPLQQGEKSD 140
+ + + E K + + + K V KK Q
Sbjct: 512 ARPDNSAVIAAREARKAQARARQAEKQAAAAADPKKAAVAAAIARAKAKKAAQQAANAEA 571
Query: 141 EEKEQEMKAFV 151
EE+ KA V
Sbjct: 572 EEEVDPKKAAV 582
>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
Length = 1021
Score = 29.7 bits (66), Expect = 2.7
Identities = 17/52 (32%), Positives = 30/52 (57%), Gaps = 1/52 (1%)
Query: 34 QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKA 85
+ R+ER+E +RLE E+ ++E+ + LE + +E E+ D L+KA
Sbjct: 473 RERMERIERERLERERLERERLER-DRLERDRLDRLERERVDRLERDRLEKA 523
>gnl|CDD|235582 PRK05729, valS, valyl-tRNA synthetase; Reviewed.
Length = 874
Score = 29.7 bits (68), Expect = 3.0
Identities = 24/84 (28%), Positives = 34/84 (40%), Gaps = 30/84 (35%)
Query: 82 LKKALAELEKEAANIRKKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDE 141
L+K LA+LEKE + KK +S GF V K P E ++
Sbjct: 816 LEKELAKLEKEIERVEKK-----------------LSNEGF----VAKAPE----EVVEK 850
Query: 142 EKE-----QEMKAFVKENEKLIKQ 160
E+E +E A +KE +K
Sbjct: 851 EREKLAEYEEKLAKLKERLARLKA 874
Score = 29.7 bits (68), Expect = 3.0
Identities = 20/73 (27%), Positives = 34/73 (46%), Gaps = 18/73 (24%)
Query: 41 EEQRLENE--KFQQEKEKTLKTLEETK------KKLVESEQAGANNLDELKKALAELEKE 92
E RLE E K ++E E+ K L +++VE E ++ LAE E++
Sbjct: 812 ELARLEKELAKLEKEIERVEKKLSNEGFVAKAPEEVVEKE----------REKLAEYEEK 861
Query: 93 AANIRKKEEDLKK 105
A ++++ LK
Sbjct: 862 LAKLKERLARLKA 874
>gnl|CDD|239586 cd03509, DesA_FADS-like, Fatty acid desaturase protein family
subgroup, a delta-12 acyl-lipid desaturase-like,
DesA-like, yet uncharacterized subgroup of membrane
fatty acid desaturase proteins found in alpha-, beta-,
and gamma-proteobacteria. Sequences of this domain
family appear to be structurally related to membrane
fatty acid desaturases and alkane hydroxylases. They all
share in common extensive hydrophobic regions that would
be capable of spanning the membrane bilayer at least
twice. Comparison of these sequences also reveals three
regions of conserved histidine cluster motifs that
contain eight histidine residues: HXXXH, HXXHH, and
HXXHH. These histidine residues are reported to be
catalytically essential and proposed to be the ligands
for the iron atoms contained within homologs, stearoyl
CoA desaturase and alkane hydroxylase.
Length = 288
Score = 29.3 bits (66), Expect = 3.0
Identities = 10/30 (33%), Positives = 13/30 (43%), Gaps = 4/30 (13%)
Query: 15 DEDETHPNIDTPSLF----RWRHQARVERM 40
DED T P D S + +W R +R
Sbjct: 87 DEDLTDPGDDPESNYLSPEQWARLPRWQRA 116
>gnl|CDD|179338 PRK01816, PRK01816, hypothetical protein; Provisional.
Length = 143
Score = 28.4 bits (64), Expect = 3.2
Identities = 19/59 (32%), Positives = 28/59 (47%), Gaps = 4/59 (6%)
Query: 240 NRAKEKLDEALKEVEEEERQKRLGPG-GLDPVE---VFESLPDVLKKCFELQDTAMLQE 294
R+ L AL E R+K G L PVE +++L D+LK+ F+ D L +
Sbjct: 84 KRSVTPLPPALLNWFYEVREKLQEAGQALAPVEGKPTYQALADLLKRAFKQLDKTFLDD 142
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil
domain of unknwon function.
Length = 126
Score = 28.0 bits (63), Expect = 3.2
Identities = 18/75 (24%), Positives = 39/75 (52%)
Query: 33 HQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKE 92
A + EE E +Q +E+ K EE ++ L++ ++ N + ++A + E+E
Sbjct: 13 QLALDAKREEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRRRAEKKAEEE 72
Query: 93 AANIRKKEEDLKKKE 107
++KEE++K+ +
Sbjct: 73 KKLRKEKEEEIKELK 87
>gnl|CDD|224644 COG1730, GIM5, Predicted prefoldin, molecular chaperone implicated
in de novo protein folding [Posttranslational
modification, protein turnover, chaperones].
Length = 145
Score = 28.5 bits (64), Expect = 3.3
Identities = 12/56 (21%), Positives = 34/56 (60%)
Query: 52 QEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKE 107
+ ++ ++ L++ ++L ++ + L EL + + +LE+EA +++K+ +KK+
Sbjct: 90 KSADEAIEFLKKRIEELEKAIEKLQQALAELAQRIEQLEQEAQQLQQKQAAAQKKQ 145
>gnl|CDD|135648 PRK05898, dnaE, DNA polymerase III DnaE; Validated.
Length = 971
Score = 29.4 bits (66), Expect = 3.3
Identities = 28/84 (33%), Positives = 39/84 (46%), Gaps = 8/84 (9%)
Query: 27 SLFRWRHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKAL 86
+LFR Q ++ E+R + E+F E T K LE T +V EQ + L K +
Sbjct: 561 ALFRPGPQQNIKTFVERRFKREEFSYWNEATKKILEPTHGIIVYQEQ-----VINLVKTI 615
Query: 87 AELE-KEAANIRKKEEDLKKKEKI 109
A + A N R+ KK EKI
Sbjct: 616 ANFDIATADNFRRAIS--KKDEKI 637
>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724). This
family contains several uncharacterized proteins found
in Arabidopsis thaliana and other plants. This region is
often found associated with Agenet domains and may
contain coiled-coil.
Length = 190
Score = 28.6 bits (64), Expect = 3.4
Identities = 15/68 (22%), Positives = 31/68 (45%)
Query: 38 ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIR 97
+ +E++ E E +QE + + LE +L + K +A L+ EA+ I+
Sbjct: 113 KGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKEIARLKSEASKIK 172
Query: 98 KKEEDLKK 105
++ D +
Sbjct: 173 QELVDAEL 180
>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
production and conversion].
Length = 660
Score = 29.3 bits (66), Expect = 3.4
Identities = 19/69 (27%), Positives = 32/69 (46%), Gaps = 4/69 (5%)
Query: 35 ARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAA 94
VE++E + E+ + EK +EE +KL E + LDE + L +L +E
Sbjct: 85 EEVEKLEAELKSLEEVIKPAEKFSSEVEELTRKLEERL----SELDEELEDLEDLLEELE 140
Query: 95 NIRKKEEDL 103
+ + DL
Sbjct: 141 PLAYLDFDL 149
>gnl|CDD|235255 PRK04204, PRK04204, RNA 3'-terminal-phosphate cyclase; Provisional.
Length = 343
Score = 29.0 bits (66), Expect = 3.4
Identities = 13/34 (38%), Positives = 16/34 (47%), Gaps = 4/34 (11%)
Query: 236 ERIRNRAKEKLDEALK----EVEEEERQKRLGPG 265
ER A E L +L E+ EE + LGPG
Sbjct: 203 ERQAKAAAELLALSLGLIEIEINVEELSRGLGPG 236
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 28.7 bits (65), Expect = 3.5
Identities = 19/78 (24%), Positives = 36/78 (46%), Gaps = 2/78 (2%)
Query: 33 HQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGA--NNLDELKKALAELE 90
R+E+ + E E+ +Q + +E+ KK E+E+ L +L+K L +L+
Sbjct: 64 LNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKKLK 123
Query: 91 KEAANIRKKEEDLKKKEK 108
E K + + +K K
Sbjct: 124 AELEKYEKNDPERIEKLK 141
>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
Length = 455
Score = 29.0 bits (65), Expect = 3.7
Identities = 25/102 (24%), Positives = 48/102 (47%), Gaps = 19/102 (18%)
Query: 59 KTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEKITPWNVDTIS 118
K +EE +K +V+ E ++ ELEKE + K+EE L K+ +
Sbjct: 368 KDVEEGEKTIVK----------EARQIEEELEKEVEKLGKEEESLFKRVAL--------- 408
Query: 119 KPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVKENEKLIKQ 160
+ G + +++ L++ K ++E +++K E EKL +
Sbjct: 409 EEGLKELEQDEENFLKELSKEEKELLEKLKMEASEVEKLFGR 450
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 28.9 bits (65), Expect = 3.7
Identities = 25/73 (34%), Positives = 39/73 (53%)
Query: 34 QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
+ R+E+MEE +K +E E+T LEE K+ E Q DEL++ LE+EA
Sbjct: 11 EERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEEENRRLEEEA 70
Query: 94 ANIRKKEEDLKKK 106
A ++ E L+ +
Sbjct: 71 AASEEERERLEAE 83
>gnl|CDD|226064 COG3534, AbfA, Alpha-L-arabinofuranosidase [Carbohydrate transport
and metabolism].
Length = 501
Score = 29.3 bits (66), Expect = 3.8
Identities = 7/18 (38%), Positives = 11/18 (61%)
Query: 307 HMKRCVDSGLWVPDAKKA 324
H+ R V G++ PD+ A
Sbjct: 28 HLGRAVYEGIYEPDSPIA 45
>gnl|CDD|215557 PLN03069, PLN03069, magnesiumprotoporphyrin-IX chelatase subunit H;
Provisional.
Length = 1220
Score = 29.4 bits (66), Expect = 4.0
Identities = 16/38 (42%), Positives = 23/38 (60%), Gaps = 6/38 (15%)
Query: 244 EKLDEALKEVEEEERQKRLGPGGLDPVEVFESLPDVLK 281
E+L E +KE E ER + LD V VF S+P+V++
Sbjct: 91 EELAEKVKEAVEPERDR------LDAVLVFPSMPEVMR 122
>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
Length = 865
Score = 29.3 bits (67), Expect = 4.2
Identities = 19/65 (29%), Positives = 28/65 (43%), Gaps = 6/65 (9%)
Query: 48 EKFQQEKEKTLKTLEETKKK----LVESEQAGANNLDELKKALAELEKEAANIRKKEEDL 103
++ E+E LK L K L E +A L EL+K L +L+ + A DL
Sbjct: 693 LEYLNEQEALLKELAALLKAKPSELPERVEALLEELKELEKELEQLKAKLAA--AAAGDL 750
Query: 104 KKKEK 108
+ K
Sbjct: 751 LAQAK 755
>gnl|CDD|223903 COG0833, LysP, Amino acid transporters [Amino acid transport and
metabolism].
Length = 541
Score = 29.1 bits (66), Expect = 4.2
Identities = 9/15 (60%), Positives = 13/15 (86%)
Query: 324 AAGESEDPQENTPDA 338
AAGESE+P+++ P A
Sbjct: 256 AAGESENPRKSIPKA 270
>gnl|CDD|235185 PRK03980, PRK03980, flap endonuclease-1; Provisional.
Length = 292
Score = 28.6 bits (65), Expect = 4.3
Identities = 15/40 (37%), Positives = 26/40 (65%), Gaps = 4/40 (10%)
Query: 221 PDYKASFDDELKSFKERIRNRAKEKLDEALKEVEEEERQK 260
P+ KA +E++ +E +R A+EK +EA +E + EE +K
Sbjct: 37 PELKA---EEIEERRE-VREEAEEKYEEAKEEGDLEEARK 72
>gnl|CDD|214937 smart00957, SecA_DEAD, SecA DEAD-like domain. SecA protein binds
to the plasma membrane where it interacts with proOmpA
to support translocation of proOmpA through the
membrane. SecA protein achieves this translocation, in
association with SecY protein, in an ATP dependent
manner. This domain represents the N-terminal
ATP-dependent helicase domain, which is related to the.
Length = 380
Score = 28.9 bits (66), Expect = 4.5
Identities = 16/43 (37%), Positives = 24/43 (55%), Gaps = 6/43 (13%)
Query: 215 KIQIAEPDYKASFDDELKS----FKERIRNRAKEKLDEALKEV 253
+I EP+ +A D+EL++ FKER+ E LD+ L E
Sbjct: 22 QINALEPEMEALSDEELRAKTAEFKERLAE--GESLDDLLPEA 62
>gnl|CDD|215411 PLN02768, PLN02768, AMP deaminase.
Length = 835
Score = 29.1 bits (65), Expect = 4.5
Identities = 10/30 (33%), Positives = 18/30 (60%), Gaps = 4/30 (13%)
Query: 268 DPVEVFESLPDVLKKCFELQDTAMLQEAIA 297
D VE ++ VL++C EL+ + +E +A
Sbjct: 239 DEVEAYK----VLQECLELRKRYVFREEVA 264
>gnl|CDD|237613 PRK14109, PRK14109, bifunctional glutamine-synthetase
adenylyltransferase/deadenyltransferase; Provisional.
Length = 1007
Score = 29.0 bits (66), Expect = 4.6
Identities = 12/24 (50%), Positives = 15/24 (62%)
Query: 248 EALKEVEEEERQKRLGPGGLDPVE 271
E L E +R+ +LGPGGL VE
Sbjct: 360 EDLIPAAERDRELKLGPGGLRDVE 383
>gnl|CDD|236327 PRK08661, PRK08661, prolyl-tRNA synthetase; Provisional.
Length = 477
Score = 28.9 bits (66), Expect = 4.7
Identities = 10/40 (25%), Positives = 19/40 (47%)
Query: 215 KIQIAEPDYKASFDDELKSFKERIRNRAKEKLDEALKEVE 254
K + + + L+ +E + +AKE L+E EV+
Sbjct: 371 KETVPLDELVEKVPELLEEIQENLYEKAKEFLEENTVEVD 410
>gnl|CDD|218417 pfam05075, DUF684, Protein of unknown function (DUF684). This
family contains several uncharacterized proteins from
Caenorhabditis elegans. The GO annotation suggests that
the protein is involved in nematode larval development
and has a positive regulation on growth rate.
Length = 346
Score = 28.6 bits (64), Expect = 4.8
Identities = 29/95 (30%), Positives = 42/95 (44%), Gaps = 2/95 (2%)
Query: 58 LKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEKITPWNVDTI 117
L+ L E KKKL + ++ D+LK + E E N L K + T + D
Sbjct: 1 LEKLGELKKKLEKLSDKMSSEFDDLKAFIVEHEFYN-NEAATASTLMKYMQDTVKHPDKK 59
Query: 118 SKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVK 152
+K F + V K PL+ EK EQE+ +K
Sbjct: 60 TKEIF-REVYQKHSPLELAEKMIILLEQEVTNPLK 93
>gnl|CDD|215314 PLN02576, PLN02576, protoporphyrinogen oxidase.
Length = 496
Score = 28.8 bits (65), Expect = 4.8
Identities = 28/120 (23%), Positives = 44/120 (36%), Gaps = 30/120 (25%)
Query: 213 FSKIQIAEPDYKASFDDELKSFKERIRNRAKEKLDEALKEVEEEERQKRLGP--GGLDPV 270
F K+ E + +K+ +E +N E D L + + + +G GGL
Sbjct: 186 FPKLWNLEKRGGSIIGGAIKAIQEAKKNPKPEPRDPRLPKP----KGQTVGSFRGGL--- 238
Query: 271 EVFESLPDVLKKCFEL-QDTAMLQEAIASLSPEEATYHMKRCVDSGLWV-----PDAKKA 324
++LPD L K L +D L + SLS D G + P+ K
Sbjct: 239 ---QTLPDALAK--RLGKDKVKLNWKVLSLS----------KNDDGGYSLTYDTPEGKVN 283
>gnl|CDD|237259 PRK12904, PRK12904, preprotein translocase subunit SecA; Reviewed.
Length = 830
Score = 28.9 bits (66), Expect = 4.9
Identities = 17/43 (39%), Positives = 23/43 (53%), Gaps = 6/43 (13%)
Query: 215 KIQIAEPDYKASFDDELKS----FKERIRNRAKEKLDEALKEV 253
KI EP+ + D+ELK+ FKER+ E LD+ L E
Sbjct: 26 KINALEPEMEKLSDEELKAKTAEFKERLAK--GETLDDLLPEA 66
>gnl|CDD|225708 COG3167, PilO, Tfp pilus assembly protein PilO [Cell motility and
secretion / Intracellular trafficking and secretion].
Length = 211
Score = 28.2 bits (63), Expect = 5.1
Identities = 13/41 (31%), Positives = 23/41 (56%)
Query: 51 QQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK 91
+ E+ + L+ EE K + + A NL+ L+ LAE+E+
Sbjct: 47 KLEELEELEAEEEELKSTYQQKAIQAANLEALRAQLAEMEE 87
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 28.9 bits (65), Expect = 5.1
Identities = 18/71 (25%), Positives = 30/71 (42%)
Query: 35 ARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAA 94
R+ + E F QE +L E + L + ++ +EL++ L L K+
Sbjct: 788 ERIAVRRPEVREYRAFMQETWLHRDSLREERPNLAIQLRELESSAEELQQELTRLIKDTK 847
Query: 95 NIRKKEEDLKK 105
RKK E +K
Sbjct: 848 LRRKKLEQERK 858
>gnl|CDD|227570 COG5245, DYN1, Dynein, heavy chain [Cytoskeleton].
Length = 3164
Score = 28.8 bits (64), Expect = 5.2
Identities = 9/64 (14%), Positives = 18/64 (28%)
Query: 40 MEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKK 99
+ + L L + + DE+ L L+K I ++
Sbjct: 2620 LFVHEKALNALKACGSLFLWVLARYLLAKLMLSISNMEQTDEIAVLLHNLKKSRKEIEEE 2679
Query: 100 EEDL 103
E +
Sbjct: 2680 ESES 2683
>gnl|CDD|225281 COG2425, COG2425, Uncharacterized protein containing a von
Willebrand factor type A (vWA) domain [General function
prediction only].
Length = 437
Score = 28.5 bits (64), Expect = 5.2
Identities = 18/72 (25%), Positives = 27/72 (37%), Gaps = 4/72 (5%)
Query: 38 ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIR 97
E + ++ E E E L+ +EE + AG K L L K A +
Sbjct: 133 EGLLSEKQEEISLSGEMEGILEEVEELISGRLWDMSAGELK----KGDLELLGKYADFLL 188
Query: 98 KKEEDLKKKEKI 109
K E K E++
Sbjct: 189 KFPELRKILEQL 200
>gnl|CDD|153338 cd07654, F-BAR_FCHSD, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of FCH and double SH3
domains proteins (FCHSD). F-BAR domains are
dimerization modules that bind and bend membranes and
are found in proteins involved in membrane dynamics and
actin reorganization. This subfamily is composed of FCH
and double SH3 domain (FCHSD) proteins, so named as they
contain an N-terminal F-BAR domain and two SH3 domains
at the C-terminus. Vertebrates harbor two subfamily
members, FCHSD1 and FCHSD2, which have been
characterized only in silico. Their biological function
is still unknown. F-BAR domains form banana-shaped
dimers with a positively-charged concave surface that
binds to negatively-charged lipid membranes. They can
induce membrane deformation in the form of long tubules.
Length = 264
Score = 28.3 bits (63), Expect = 5.3
Identities = 20/76 (26%), Positives = 31/76 (40%), Gaps = 1/76 (1%)
Query: 34 QARVERMEEQRLENEK-FQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKE 92
Q R E E K + KE+ LK E ++ Q L + +K E E+
Sbjct: 87 QNRCEAYRRYISEPAKTGRSAKEQQLKKCTEQLQRAQAEVQQTVRELSKSRKTYFEREQV 146
Query: 93 AANIRKKEEDLKKKEK 108
A R+K D++ +E
Sbjct: 147 AHLAREKAADVQAREA 162
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 28.9 bits (65), Expect = 5.3
Identities = 21/131 (16%), Positives = 49/131 (37%), Gaps = 4/131 (3%)
Query: 23 IDTPSLFRWRHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDEL 82
++ + + ++ E + ++ +E E LK ++ ++ L
Sbjct: 942 LEEGPSIEYVKLPELNKLHEVESKLKETSEEYEDLLKKSTILVREGNKANS----ELKNF 997
Query: 83 KKALAELEKEAANIRKKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEE 142
KK LAEL K+ +++ + LK+ T ++ PLQ+ +
Sbjct: 998 KKELAELSKQYGALQESTKQLKELPVEVAELQSASKIISSESTELSILKPLQKLKGLLLL 1057
Query: 143 KEQEMKAFVKE 153
+ +++A K
Sbjct: 1058 ENNQLQARYKA 1068
>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
PFL_4705 family. Members of this protein family are
found occasionally on plasmids such as the Pseudomonas
putida toluene catabolic TOL plasmid pWWO_p085. Usually,
however, they are found on the bacterial main chromosome
in regions flanked by markers of conjugative transfer
and/or transposition [Mobile and extrachromosomal
element functions, Plasmid functions].
Length = 472
Score = 28.5 bits (64), Expect = 5.3
Identities = 15/52 (28%), Positives = 25/52 (48%), Gaps = 14/52 (26%)
Query: 57 TLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEK 108
TL+TL + EL+K LA+L E ++ + E L+K+E+
Sbjct: 60 TLRTL--------------VAEVKELRKRLAKLISENEALKAENERLQKREQ 97
>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
Length = 895
Score = 28.7 bits (64), Expect = 5.4
Identities = 14/45 (31%), Positives = 25/45 (55%)
Query: 66 KKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEKIT 110
K +++ +A +N+D L++ L E NI+K+ D +K IT
Sbjct: 172 KDVIDMLRAEISNIDYLEEKLKSSNLELENIKKQIADDEKSHSIT 216
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 28.8 bits (65), Expect = 5.4
Identities = 15/66 (22%), Positives = 29/66 (43%), Gaps = 3/66 (4%)
Query: 30 RWRHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAEL 89
+ + QA E +++ + E E E+ + LE ++L E + E K+ E+
Sbjct: 165 KAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQEKAAETSQ---ERKQKRKEI 221
Query: 90 EKEAAN 95
+AA
Sbjct: 222 TDQAAK 227
>gnl|CDD|150467 pfam09803, DUF2346, Uncharacterized conserved protein (DUF2346).
Members of this family of proteins have no known
function.
Length = 80
Score = 26.7 bits (59), Expect = 5.5
Identities = 15/46 (32%), Positives = 28/46 (60%), Gaps = 4/46 (8%)
Query: 215 KIQIAEPDYKASFDDELKSFKERIRNRAKEKLDEALKEVEEEERQK 260
K ++ P+ + EL+ FKER+R + + +L L+ +EE E++K
Sbjct: 39 KRELYPPENE-QHRKELEDFKERLREKHEARL---LRAMEEAEQKK 80
>gnl|CDD|216266 pfam01051, Rep_3, Initiator Replication protein. This protein is
an initiator of plasmid replication. RepB possesses
nicking-closing (topoisomerase I) like activity. It is
also able to perform a strand transfer reaction on ssDNA
that contains its target. This family also includes RepA
which is an E.coli protein involved in plasmid
replication. The RepA protein binds to DNA repeats that
flank the repA gene.
Length = 218
Score = 28.0 bits (63), Expect = 5.7
Identities = 15/64 (23%), Positives = 25/64 (39%), Gaps = 2/64 (3%)
Query: 49 KFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEK 108
K + +++ + +L GA LK+AL L K+ I +EE +
Sbjct: 34 KPDDKDFVEYESISDYALELFGVNSEGAY--KRLKEALERLMKKDFEIEDEEEKGGEIIN 91
Query: 109 ITPW 112
I W
Sbjct: 92 IVLW 95
>gnl|CDD|162284 TIGR01282, nifD, nitrogenase molybdenum-iron protein alpha chain.
Nitrogenase consists of alpha (NifD) and beta (NifK)
subunits of the molybdenum-iron protein and an
ATP-binding iron-sulfur protein (NifH). This model
describes a large clade of NifD proteins, but excludes a
lineage that contains putative NifD and NifD homologs
from species with vanadium-dependent nitrogenases
[Central intermediary metabolism, Nitrogen fixation].
Length = 466
Score = 28.5 bits (64), Expect = 5.8
Identities = 21/72 (29%), Positives = 30/72 (41%), Gaps = 7/72 (9%)
Query: 212 FFSKIQIAEP--DYKASFDDELKSFKERIRNRAKEKLDEALKEVEEEERQKR--LGPGGL 267
FF +IAE FDDE+K E + + + +D + + K L GGL
Sbjct: 286 FFGPTKIAESLRKIAEFFDDEIKEKAEEVIAKYQPAVDAVIAKYRPRLEGKTVMLYVGGL 345
Query: 268 DP---VEVFESL 276
P + FE L
Sbjct: 346 RPRHVIGAFEDL 357
>gnl|CDD|165085 PHA02711, PHA02711, Toll/IL-receptor-like protein; Provisional.
Length = 190
Score = 28.0 bits (62), Expect = 6.0
Identities = 17/43 (39%), Positives = 26/43 (60%), Gaps = 2/43 (4%)
Query: 39 RMEEQRLENE--KFQQEKEKTLKTLEETKKKLVESEQAGANNL 79
R++ RL NE KF + E+ T+EE K+KL + + GA+N
Sbjct: 73 RIKYGRLFNEIKKFDNDAEEQFGTIEELKQKLRLNSEEGADNF 115
>gnl|CDD|236995 PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase;
Provisional.
Length = 693
Score = 28.5 bits (65), Expect = 6.0
Identities = 12/39 (30%), Positives = 21/39 (53%), Gaps = 3/39 (7%)
Query: 224 KASFDDELKSFKERIRNRAKEKLDEALKEVEEEERQKRL 262
D+ELK ++ A+ KL EA + +++ER+ L
Sbjct: 234 PPEVDEELK---AAVKELAEAKLKEAYQITDKQEREAAL 269
>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA
binding. This is the middle region of a family of TATA
element modulatory factor 1 proteins conserved in
eukaryotes that contains at its N-terminal section a
number of leucine zippers that could potentially form
coiled coil structures. The whole proteins bind to the
TATA element of some RNA polymerase II promoters and
repress their activity. by competing with the binding of
TATA binding protein. TMFs are evolutionarily conserved
golgins that bind Rab6, a ubiquitous ras-like
GTP-binding Golgi protein, and contribute to Golgi
organisation in animal and plant cells.
Length = 74
Score = 26.5 bits (59), Expect = 6.2
Identities = 15/53 (28%), Positives = 25/53 (47%)
Query: 52 QEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLK 104
++ K T KKL + + ELKK L +LEKE N+ ++ + +
Sbjct: 22 EKLSKKELKHNNTIKKLRAKNKELEKEIAELKKKLEKLEKELENLEERLKRAE 74
>gnl|CDD|221489 pfam12252, SidE, Dot/Icm substrate protein. This family of proteins
is found in bacteria. Proteins in this family are
typically between 397 and 1543 amino acids in length.
This family is the SidE protein in the Dot/Icm pathway of
Legionella pneumophila bacteria. There is little
literature describing the family.
Length = 1443
Score = 28.7 bits (64), Expect = 6.4
Identities = 20/56 (35%), Positives = 32/56 (57%), Gaps = 5/56 (8%)
Query: 54 KEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEKI 109
+E+ +E T K++ E E+A NLD +KKA++ L N ++E L + EKI
Sbjct: 1102 RERYDALIENTSKRITELEKAKLANLDVVKKAISNL-----NNLQQEVTLLRNEKI 1152
>gnl|CDD|234306 TIGR03674, fen_arch, flap structure-specific endonuclease.
Endonuclease that cleaves the 5'-overhanging flap
structure that is generated by displacement synthesis
when DNA polymerase encounters the 5'-end of a
downstream Okazaki fragment. Has 5'-endo-/exonuclease
and 5'-pseudo-Y-endonuclease activities. Cleaves the
junction between single and double-stranded regions of
flap DNA.
Length = 338
Score = 28.4 bits (64), Expect = 6.5
Identities = 15/40 (37%), Positives = 25/40 (62%), Gaps = 4/40 (10%)
Query: 221 PDYKASFDDELKSFKERIRNRAKEKLDEALKEVEEEERQK 260
P+ K + L+ +E IR A+EK +EAL++ + EE +K
Sbjct: 84 PELK---AETLEERRE-IREEAEEKWEEALEKGDLEEARK 119
>gnl|CDD|234659 PRK00137, rplI, 50S ribosomal protein L9; Reviewed.
Length = 147
Score = 27.4 bits (62), Expect = 6.8
Identities = 13/33 (39%), Positives = 18/33 (54%)
Query: 74 AGANNLDELKKALAELEKEAANIRKKEEDLKKK 106
A NL +L+ AELE +AA + E L +K
Sbjct: 39 ATKGNLKQLEARRAELEAKAAEELAEAEALAEK 71
>gnl|CDD|177296 PHA00728, PHA00728, hypothetical protein.
Length = 151
Score = 27.6 bits (61), Expect = 6.9
Identities = 22/48 (45%), Positives = 26/48 (54%), Gaps = 2/48 (4%)
Query: 67 KLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEKITPWNV 114
KL E EQ N +ELKK LAELE N EED + +E P+ V
Sbjct: 3 KLTEVEQLKKEN-EELKKKLAELEALMNN-ESAEEDEELQEIENPYTV 48
>gnl|CDD|218813 pfam05929, Phage_GPO, Phage capsid scaffolding protein (GPO) serine
peptidase. This family consists of several
bacteriophage capsid scaffolding proteins (GPO) and some
related bacterial sequences. GPO is thought to function
in both the assembly of proheads and the cleavage of
GPN. The family is found to function as a serine
peptidase, with a conserved Asp, His and Ser catalytic
triad, as in subtilisin, and as represented in
MEROPS:S73. The family includes GpO from Enterobacteria
phage P2 which cleaves itself and then becomes the
scaffold protein upon which the bacteriophage prohead is
built - a mechanism quite common amongst phages.
Length = 277
Score = 27.9 bits (62), Expect = 7.0
Identities = 15/42 (35%), Positives = 21/42 (50%)
Query: 48 EKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAEL 89
E Q + E T + L E +K + + EQA + DE KA L
Sbjct: 206 EHVQGQLEATEQRLSEAEKAVSDLEQALNSQGDERDKAFNAL 247
>gnl|CDD|235179 PRK03947, PRK03947, prefoldin subunit alpha; Reviewed.
Length = 140
Score = 27.2 bits (61), Expect = 7.1
Identities = 11/50 (22%), Positives = 28/50 (56%)
Query: 59 KTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEK 108
K L+E + L + ++ L++L++AL +L A + ++ + L+++
Sbjct: 90 KDLDEAIEILDKRKEELEKALEKLEEALQKLASRIAQLAQELQQLQQEAA 139
Score = 27.2 bits (61), Expect = 8.4
Identities = 8/47 (17%), Positives = 27/47 (57%)
Query: 55 EKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEE 101
++ ++ L++ K++L ++ + L +L +A+L +E ++++
Sbjct: 93 DEAIEILDKRKEELEKALEKLEEALQKLASRIAQLAQELQQLQQEAA 139
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 28.4 bits (63), Expect = 7.3
Identities = 16/76 (21%), Positives = 38/76 (50%)
Query: 34 QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
QA+ Q E + ++E L L+ ++++ ++ + +E++ + ELEKE
Sbjct: 206 QAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEE 265
Query: 94 ANIRKKEEDLKKKEKI 109
+ + ++ K++EK
Sbjct: 266 EILAQVLKENKEEEKE 281
>gnl|CDD|237403 PRK13505, PRK13505, formate--tetrahydrofolate ligase; Provisional.
Length = 557
Score = 28.2 bits (64), Expect = 7.4
Identities = 12/30 (40%), Positives = 16/30 (53%)
Query: 69 VESEQAGANNLDELKKALAELEKEAANIRK 98
V + N++ LKK A LE+ NIRK
Sbjct: 341 VAKDDLKEENVEALKKGFANLERHIENIRK 370
>gnl|CDD|218769 pfam05833, FbpA, Fibronectin-binding protein A N-terminus (FbpA).
This family consists of the N-terminal region of the
prokaryotic fibronectin-binding protein. Fibronectin
binding is considered to be an important virulence
factor in streptococcal infections. Fibronectin is a
dimeric glycoprotein that is present in a soluble form
in plasma and extracellular fluids; it is also present
in a fibrillar form on cell surfaces. Both the soluble
and cellular forms of fibronectin may be incorporated
into the extracellular tissue matrix. While fibronectin
has critical roles in eukaryotic cellular processes,
such as adhesion, migration and differentiation, it is
also a substrate for the attachment of bacteria. The
binding of pathogenic Streptococcus pyogenes and
Staphylococcus aureus to epithelial cells via
fibronectin facilitates their internalisation and
systemic spread within the host.
Length = 447
Score = 28.0 bits (63), Expect = 7.8
Identities = 16/69 (23%), Positives = 29/69 (42%), Gaps = 14/69 (20%)
Query: 42 EQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEE 101
++ E ++ +Q++ K +E N L++L+K L +LEKE E
Sbjct: 282 SKKAERDRVKQKRSDLEKRVE--------------NELEKLEKKLEKLEKELEEAENAEN 327
Query: 102 DLKKKEKIT 110
E +T
Sbjct: 328 YRLYGELLT 336
>gnl|CDD|216529 pfam01491, Frataxin_Cyay, Frataxin-like domain. This family
contains proteins that have a domain related to the
globular C-terminus of Frataxin the protein that is
mutated in Friedreich's ataxia. This domain is found in
a family of bacterial proteins. The function of this
domain is currently unknown. It has been suggested that
this family is involved in iron transport.
Length = 107
Score = 26.7 bits (60), Expect = 7.9
Identities = 9/23 (39%), Positives = 15/23 (65%)
Query: 113 NVDTISKPGFTKTVVNKKPPLQQ 135
V T++ + V+NK+PPL+Q
Sbjct: 39 GVLTLTFENGGQIVINKQPPLKQ 61
>gnl|CDD|216970 pfam02312, CBF_beta, Core binding factor beta subunit. Core
binding factor (CBF) is a heterodimeric transcription
factor essential for genetic regulation of hematopoiesis
and osteogenesis. The beta subunit enhances DNA-binding
ability of the alpha subunit in vitro, and has been show
to have a structure related to the OB fold.
Length = 170
Score = 27.4 bits (61), Expect = 8.0
Identities = 12/44 (27%), Positives = 20/44 (45%)
Query: 29 FRWRHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESE 72
W R++ M + E+ QQE + LE+ ++L E E
Sbjct: 106 RGWMDLERLDGMGCLEYDEERAQQEDAILREQLEQYNQRLREFE 149
>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
sigma54-dependent transcription [Transcription / Signal
transduction mechanisms].
Length = 225
Score = 27.7 bits (62), Expect = 8.1
Identities = 27/80 (33%), Positives = 45/80 (56%), Gaps = 5/80 (6%)
Query: 34 QARVERMEEQ-----RLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAE 88
QAR E++EE+ + NE +E + ++LE+ K L Q +++LKK LA
Sbjct: 65 QARAEKLEEKAELALQAGNEDLAREALEEKQSLEDLAKALEAELQQAEEQVEKLKKQLAA 124
Query: 89 LEKEAANIRKKEEDLKKKEK 108
LE++ A +R K+E LK ++
Sbjct: 125 LEQKIAELRAKKEALKARKA 144
>gnl|CDD|234172 TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family.
Members of this protein family are homologs of ClpB, an
ATPase associated with chaperone-related functions.
These ClpB homologs, designated ClpV1, are a key
component of the bacterial pathogenicity-associated type
VI secretion system [Protein fate, Protein and peptide
secretion and trafficking, Cellular processes,
Pathogenesis].
Length = 852
Score = 28.0 bits (63), Expect = 8.3
Identities = 21/76 (27%), Positives = 37/76 (48%), Gaps = 9/76 (11%)
Query: 33 HQARVERMEEQR--LENE------KFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKK 84
H R+ + + LE E ++QQEKE ++ + + +L A A++ L+
Sbjct: 448 HDERLAELRAELAALEAELAALEARWQQEKE-LVEAILALRAELEADADAPADDDAALRA 506
Query: 85 ALAELEKEAANIRKKE 100
LAELE A+ + +E
Sbjct: 507 QLAELEAALASAQGEE 522
>gnl|CDD|188994 cd06455, M3A_TOP, Peptidase M3 Thimet oligopeptidase (TOP) also
includes neurolysin. Peptidase M3 Thimet oligopeptidase
(TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase
24.15; soluble metallo-endopeptidase; EC 3.4.24.15)
family also includes neurolysin (endopeptidase 24.16,
microsomal endopeptidase, mitochondrial oligopeptidase
M, neurotensin endopeptidase, soluble angiotensin
II-binding protein, thimet oligopeptidase II) which
hydrolyzes oligopeptides such as neurotensin, bradykinin
and dynorphin A. TOP and neurolysin are neuropeptidases
expressed abundantly in the testis, but also found in
the liver, lung and kidney. They are involved in the
metabolism of neuropeptides under 20 amino acid residues
long and cleave most bioactive peptides at the same
sites, but recognize different positions on some
naturally occurring and synthetic peptides; they cleave
at distinct sites on the 13-residue bioactive peptide
neurotensin, which modulates central dopaminergic and
cholinergic circuits. TOP has been shown to degrade
peptides released by the proteasome, limiting the extent
of antigen presentation by major histocompatibility
complex class I molecules, and has been associated with
amyloid protein precursor processing.
Length = 637
Score = 27.9 bits (63), Expect = 8.4
Identities = 15/52 (28%), Positives = 29/52 (55%), Gaps = 3/52 (5%)
Query: 67 KLVESEQAGANNLDELKKALAEL-EKEAANI--RKKEEDLKKKEKITPWNVD 115
++ +S + + LD+L + L L EKE A + KK+E + ++I W++
Sbjct: 248 RMAKSPETVRSFLDDLAEKLKPLAEKELAELLALKKKECPEAGDRIYAWDLR 299
>gnl|CDD|223630 COG0556, UvrB, Helicase subunit of the DNA excision repair complex
[DNA replication, recombination, and repair].
Length = 663
Score = 27.9 bits (63), Expect = 8.5
Identities = 21/91 (23%), Positives = 39/91 (42%), Gaps = 14/91 (15%)
Query: 80 DELKKALAELEKEAANIRKKEEDLKKKEKITPWNVDTISKP-----GFTKTVVNKKPPLQ 134
D ++KA+ E E+ R+ + ++ ITP TI K K ++
Sbjct: 559 DSMQKAIDETERR----REIQMAYNEEHGITP---QTIKKKIRDILDGEYEEDEYKAKIE 611
Query: 135 QGEKSDEEKEQEMKAFVKENEKLIKQYANYL 165
+ +KE ++ +K+ EK +K+ A L
Sbjct: 612 KKASKMSKKE--LEKLIKKLEKEMKEAAKNL 640
>gnl|CDD|220719 pfam10368, YkyA, Putative cell-wall binding lipoprotein. YkyA is a
family of proteins containing a lipoprotein signal and a
hydrolase domain. It is similar to cell wall binding
proteins and might also be recognisable by a host immune
defence system. It is thus likely to belong to pathways
important for pathogenicity.
Length = 205
Score = 27.7 bits (62), Expect = 8.6
Identities = 19/68 (27%), Positives = 39/68 (57%), Gaps = 2/68 (2%)
Query: 42 EQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEE 101
E+ E EK +E++K L+ LE+ +++L + Q D+ + ++ AN+ K+E+
Sbjct: 26 EKAAEQEKPFEEEQKKLEELEKKEQELYD--QIIQLGKDDNEAIKKLSDQALANVDKREK 83
Query: 102 DLKKKEKI 109
LKK+++
Sbjct: 84 LLKKEKES 91
>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 548
Score = 27.9 bits (62), Expect = 8.8
Identities = 17/136 (12%), Positives = 39/136 (28%), Gaps = 15/136 (11%)
Query: 30 RWRHQARVERMEEQRLEN-----EKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKK 84
+ E+ E + + + E E+ + + V+ A K
Sbjct: 223 AEIAENEAEKETEIAIAEANRDAKLVELEVEQQPAGKTAEQTREVKIILAETEAEVAAWK 282
Query: 85 ALAELEKEAANIRKKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKE 144
A E E A I ++ ++K + + + + + +KE
Sbjct: 283 AETRREAEQAEILAEQAIQEEKAQ----------AEQEVQHAKALEAREMRVGLIERQKE 332
Query: 145 QEMKAFVKENEKLIKQ 160
E++ + Q
Sbjct: 333 TELEPQERSYFINAAQ 348
>gnl|CDD|227608 COG5283, COG5283, Phage-related tail protein [Function unknown].
Length = 1213
Score = 28.0 bits (62), Expect = 8.8
Identities = 18/73 (24%), Positives = 31/73 (42%), Gaps = 7/73 (9%)
Query: 54 KEKTLKTL-EETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEE------DLKKK 106
E + T+ + KKL E+ + A+ + EL ++LA+ + AN +EE
Sbjct: 500 VEDSAGTMADAFGKKLGEALKKLADAIAELFQSLADNRGKIANGSMEEELLTLYKATAGI 559
Query: 107 EKITPWNVDTISK 119
+I V S
Sbjct: 560 WQIYKSGVTEFSD 572
>gnl|CDD|215457 PLN02852, PLN02852, ferredoxin-NADP+ reductase.
Length = 491
Score = 28.1 bits (63), Expect = 9.0
Identities = 11/34 (32%), Positives = 17/34 (50%)
Query: 216 IQIAEPDYKASFDDELKSFKERIRNRAKEKLDEA 249
++I E D S +DE + R + R E L +A
Sbjct: 242 VRIKEADLTLSPEDEEELKASRPKRRVYELLSKA 275
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 28.1 bits (63), Expect = 9.2
Identities = 19/71 (26%), Positives = 40/71 (56%), Gaps = 4/71 (5%)
Query: 34 QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
+ER EEQR + + + E ++ L+ EE +++L E +++L++ +AE E+E
Sbjct: 219 DEEIERYEEQREQARETRDEADEVLEEHEERREELETLEAE----IEDLRETIAETERER 274
Query: 94 ANIRKKEEDLK 104
+ ++ DL+
Sbjct: 275 EELAEEVRDLR 285
>gnl|CDD|232931 TIGR00344, alaS, alanine--tRNA ligase. The model describes
alanine--tRNA ligase. This enzyme catalyzes the reaction
(tRNAala + L-alanine + ATP = L-alanyl-tRNAala +
pyrophosphate + AMP) [Protein synthesis, tRNA
aminoacylation].
Length = 851
Score = 28.1 bits (63), Expect = 9.5
Identities = 14/53 (26%), Positives = 23/53 (43%)
Query: 55 EKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKE 107
++ L E + KL E +EL K + +E ++K+ E LK K
Sbjct: 697 NAAIEYLNEQEDKLKELADILKVTPNELPKKIERFFEEWKALQKELESLKLKI 749
>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
protein, PEP-CTERM locus subfamily. Members of this
protein family belong to the family of polysaccharide
chain length determinant proteins (pfam02706). All are
found in species that encode the PEP-CTERM/exosortase
system predicted to act in protein sorting in a number
of Gram-negative bacteria, and are found near the epsH
homolog that is the putative exosortase gene [Cell
envelope, Biosynthesis and degradation of surface
polysaccharides and lipopolysaccharides].
Length = 498
Score = 27.7 bits (62), Expect = 9.8
Identities = 12/62 (19%), Positives = 27/62 (43%), Gaps = 1/62 (1%)
Query: 43 QRLENEKFQQEKE-KTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEE 101
Q+L+ E + E E +L+ +E ++ + E++ L +L ++ + E
Sbjct: 313 QQLQIELAEAEAEIASLEARVAELTARIERLESLLRTIPEVEAELTQLNRDYEVNKSNYE 372
Query: 102 DL 103
L
Sbjct: 373 QL 374
>gnl|CDD|222698 pfam14346, DUF4398, Domain of unknown function (DUF4398). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria and archaea.
Proteins in this family are typically between 127 and
269 amino acids in length.
Length = 105
Score = 26.6 bits (59), Expect = 9.9
Identities = 13/50 (26%), Positives = 22/50 (44%)
Query: 59 KTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEK 108
+ L + K L E + A L E +A A L + A K + L++ +
Sbjct: 49 EKLAQAKAALDEGKYEEARRLAEQAEADARLAEAKARAAKAQAALEEAKD 98
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.310 0.129 0.364
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 18,094,980
Number of extensions: 1806424
Number of successful extensions: 5880
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5014
Number of HSP's successfully gapped: 908
Length of query: 347
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 249
Effective length of database: 6,590,910
Effective search space: 1641136590
Effective search space used: 1641136590
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 59 (26.6 bits)