RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6112
(374 letters)
>gnl|CDD|239493 cd03399, Band_7_flotillin, Band_7_flotillin: a subgroup of the band
7 domain of flotillin (reggie) like proteins. This
subgroup contains proteins similar to stomatin,
prohibitin, flotillin, HlfK/C and podicin. These two
proteins are lipid raft-associated. Individual proteins
of this band 7 domain family may cluster to form
membrane microdomains which may in turn recruit
multiprotein complexes. Microdomains formed from
flotillin proteins may in addition be dynamic units with
their own regulatory functions. Flotillins have been
implicated in signal transduction, vesicle trafficking,
cytoskeleton rearrangement and, interact with a variety
of proteins. Flotillins may play a role in the
progression of prion disease, in the pathogenesis of
neurodegenerative diseases such as Parkinson's and
Alzheimer's disease and, in cancer invasion and
metastasis.
Length = 128
Score = 104 bits (262), Expect = 2e-27
Identities = 42/98 (42%), Positives = 63/98 (64%)
Query: 82 IVNVMDEEGHEQKHSTKQNHISEDEIQNIALVTLEGHQRAIMGSMSVEDIYKDRKKFSKQ 141
V V E + + SE+EI+ + LEGH RA++G+M+VE+IY+DR KF++Q
Sbjct: 31 QVKVGGTEEAIATAAERFLGKSEEEIEELVKEVLEGHLRAVVGTMTVEEIYEDRDKFAEQ 90
Query: 142 VFQVASSDLVNMGITVVSYTLKDIRDDEGYLKSLGMAR 179
V +V + DL MG+ + S+T+KDI D +GYL +LG A
Sbjct: 91 VQEVVAPDLNKMGLELDSFTIKDITDTDGYLNNLGDAE 128
>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 548
Score = 99.5 bits (248), Expect = 7e-23
Identities = 86/336 (25%), Positives = 147/336 (43%), Gaps = 65/336 (19%)
Query: 103 SEDEIQNIALVTLEGHQRAIMGSMSVEDIYKDRKKFSKQVFQVASSDLVNMGITVVSYTL 162
S ++++ +A TLEG RA++ M+VE++ +DR F++ V +V DL MG+ + S +
Sbjct: 132 SREDLEQLAEDTLEGALRAVLAQMTVEELNEDRLGFAQVVQEVVGDDLSKMGLVLDSLAI 191
Query: 163 KDIRDD-------EGYLKSLGMARTAEVKRDARIGEAEAQKDARIKEAMAEEE------- 208
DI D YL +LG R A+V +DA I E EA+K+ I A A +
Sbjct: 192 NDINDTSKENQDPNNYLDALGRRRIAQVLQDAEIAENEAEKETEIAIAEANRDAKLVELE 251
Query: 209 --------KMAARFVNDTEIAKAQRDFEIKKATYDAEVETKRAEAELAFELQAAKTKQRI 260
+A+ + + KA E E AE A + + A+ +Q +
Sbjct: 252 VEQQPAGKTAEQTREVKIILAETEAEVAAWKAETRREAEQAEILAEQAIQEEKAQAEQEV 311
Query: 261 ------KEEQMKIQVIERNQEIQVQEQE----IQRRKKELESSVNRPAE----------- 299
+ +M++ +IER +E +++ QE I +++ + A
Sbjct: 312 QHAKALEAREMRVGLIERQKETELEPQERSYFINAAQRQAQEEAKAAANIAEAIGAQAEA 371
Query: 300 -------------AEKYRIEKLAQANKKRL------MIEAEAEAEAIRLKGEAEAAAIAA 340
AE+ + A+A ++ A+AEAEA + +AEA AI
Sbjct: 372 AVETARETEEAERAEQAALVAAAEAAEQEQVEIAVRAEAAKAEAEAQAAEIKAEAEAIRE 431
Query: 341 KARAEVEIMTKKAEAWKDYREAA---VIDMILESLP 373
K +AE E AEA + +AA + ++++LP
Sbjct: 432 KGKAEAEAKRALAEAIQVLGDAAAAELFKALVQALP 467
>gnl|CDD|239497 cd03403, Band_7_stomatin_like, Band_7_stomatin_like: A subgroup of
the band 7 domain of flotillin (reggie) like proteins
similar to stomatin and podicin (two lipid
raft-associated integral membrane proteins). Individual
proteins of this band 7 domain family may cluster to
form membrane microdomains which may in turn recruit
multiprotein complexes. Stomatin is widely expressed
and, highly expressed in red blood cells. It localizes
predominantly to the plasma membrane and to
intracellular vesicles of the endocytic pathway, where
it is present in higher order homo-oligomeric complexes
(of between 9 and 12 monomers). Stomatin interacts with
and regulates members of the degenerin/epithelia Na+
channel family in mechanosensory cells of Caenorhabditis
elegans and vertebrate neurons and, is implicated in
trafficking of Glut1 glucose transporters. Prohibitin is
a mitochondrial inner-membrane protein hypothesized to
act as a chaperone for the stabilization of
mitochondrial proteins. Podicin localizes to the plasma
membrane of podocyte foot processes and, is found in
higher order oligomers. Podocin plays a role in
regulating glomerular permeability. Mutations in the
podicin gene give rise to autosomal recessive steroid
resistant nephritic syndrome. This group also contains
proteins similar to three Caenorhabditis elegans
proteins: UNC-1, UNC-24 and, MEC-2. Mutations in the
unc-1 and unc-24 genes result in abnormal motion and
altered patterns of sensitivity to volatile anesthetics.
MEC-2 and UNC-24 proteins interact with MEC-4 which is
part of the degenerin channel complex required for
response to gentle body touch.
Length = 215
Score = 58.3 bits (142), Expect = 6e-10
Identities = 24/105 (22%), Positives = 53/105 (50%), Gaps = 4/105 (3%)
Query: 104 EDEIQNIALVTLEGHQRAIMGSMSVEDIYKDRKKFSKQVFQVASSDLVNMGITVVSYTLK 163
I +A TL R+++G M ++++ +R++ + ++ ++ G+ V +K
Sbjct: 83 RYAISQLAQTTL----RSVIGKMELDELLSEREEINAELVEILDEATDPWGVKVERVEIK 138
Query: 164 DIRDDEGYLKSLGMARTAEVKRDARIGEAEAQKDARIKEAMAEEE 208
DI + +++ AE ++ A+I EAE ++ A I A A ++
Sbjct: 139 DIILPQEIQEAMAKQAEAEREKRAKIIEAEGERQAAILLAEAAKQ 183
Score = 27.9 bits (63), Expect = 7.3
Identities = 23/71 (32%), Positives = 37/71 (52%), Gaps = 12/71 (16%)
Query: 288 KELESSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAE---AAAIAAKARA 344
+E++ ++ + AEAE+ EK A+ +IEAE E +A L EA A AA
Sbjct: 144 QEIQEAMAKQAEAER---EKRAK------IIEAEGERQAAILLAEAAKQAAINPAALQLR 194
Query: 345 EVEIMTKKAEA 355
E+E + + A+
Sbjct: 195 ELETLEEIAKE 205
>gnl|CDD|214581 smart00244, PHB, prohibitin homologues. prohibitin homologues.
Length = 160
Score = 55.7 bits (135), Expect = 2e-09
Identities = 25/157 (15%), Positives = 46/157 (29%), Gaps = 16/157 (10%)
Query: 110 IALVTLEGHQRAIMGSMSVEDIYKDRKKFSKQVFQVASSDLVNMGITVVSYTLKDIRDDE 169
+ + G + D KK A +D V T+ +K D
Sbjct: 13 VVERLGRVLRVLGPGLHFLIPFIDDVKKVDL----RAQTDDVPPQETITKDNVKVSVDAV 68
Query: 170 GYLKSLGMARTAEVKRDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEI 229
Y + L R DA E ++ + + + + D
Sbjct: 69 VYYRVLDPLRAVYRVLDADYAVIEQLAQTTLRSVIGK-----------RTLDELLTDQRE 117
Query: 230 KKATYDAEVETKRAEAELAFELQAAKTKQRIKEEQMK 266
K + E + AEA +++ + K E++K
Sbjct: 118 KISENIREELNEAAEA-WGIKVEDVEIKDIRLPEEIK 153
>gnl|CDD|239024 cd02106, Band_7, The band 7 domain of flotillin (reggie) like
proteins. This group contains proteins similar to
stomatin, prohibitin, flotillin, HlfK/C and podicin.
Many of these band 7 domain-containing proteins are
lipid raft-associated. Individual proteins of this band
7 domain family may cluster to form membrane
microdomains which may in turn recruit multiprotein
complexes. Microdomains formed from flotillin proteins
may in addition be dynamic units with their own
regulatory functions. Flotillins have been implicated
in signal transduction, vesicle trafficking,
cytoskeleton rearrangement and are known to interact
with a variety of proteins. Stomatin interacts with and
regulates members of the degenerin/epithelia Na+ channel
family in mechanosensory cells of Caenorhabditis elegans
and vertebrate neurons and participates in trafficking
of Glut1 glucose transporters. Prohibitin may act as a
chaperone for the stabilization of mitochondrial
proteins. Prokaryotic HflK/C plays a role in the
decision between lysogenic and lytic cycle growth during
lambda phage infection. Flotillins have been implicated
in the progression of prion disease, in the pathogenesis
of neurodegenerative diseases such as Parkinson's and
Alzheimer's disease and, in cancer invasion and
metastasis. Mutations in the podicin gene give rise to
autosomal recessive steroid resistant nephritic
syndrome.
Length = 121
Score = 53.1 bits (128), Expect = 8e-09
Identities = 18/80 (22%), Positives = 42/80 (52%), Gaps = 4/80 (5%)
Query: 99 QNHISEDEIQNIALVTLEGHQRAIMGSMSVEDIYKDRKKFSKQVFQVASSDLVNMGITVV 158
++ E+ ++ +A + R+++G M+++++ +DR + + +V + DL GI VV
Sbjct: 45 RDPEDEEALRQLA----QSALRSVIGKMTLDELLEDRDEIAAEVREALQEDLDKYGIEVV 100
Query: 159 SYTLKDIRDDEGYLKSLGMA 178
+KDI E +++
Sbjct: 101 DVRIKDIDPPEEVQEAMEDR 120
>gnl|CDD|216327 pfam01145, Band_7, SPFH domain / Band 7 family. This family has
been called SPFH, Band 7 or PHB domain. Recent
phylogenetic analysis has shown this domain to be a
slipin or Stomatin-like integral membrane domain
conserved from protozoa to mammals.
Length = 177
Score = 51.6 bits (124), Expect = 6e-08
Identities = 17/99 (17%), Positives = 48/99 (48%), Gaps = 1/99 (1%)
Query: 99 QNHISEDEIQNIALVTLEGHQRAIMGSMSVEDIYKDRKKFSKQVFQVASSDLVNMGITVV 158
N+ E+++Q + + R ++ +++++ +R++ +++V + +L G+ +
Sbjct: 79 ANYTGEEDLQELLRPLVRSALREVIARYTLDELLSNREEIAQEVKEALQEELEKYGLEIE 138
Query: 159 SYTLKDIRDDEGYLKSLGMARTAEVKR-DARIGEAEAQK 196
+ DI +++ + AE + +A I AEA+
Sbjct: 139 DVQITDIDPPPEIAEAIEEKQAAEQEAEEAEIERAEAEA 177
>gnl|CDD|223407 COG0330, HflC, Membrane protease subunits, stomatin/prohibitin
homologs [Posttranslational modification, protein
turnover, chaperones].
Length = 291
Score = 47.1 bits (112), Expect = 6e-06
Identities = 37/149 (24%), Positives = 66/149 (44%), Gaps = 9/149 (6%)
Query: 120 RAIMGSMSVEDIY-KDRKKFSKQVFQVASSDLVNMGITVVSYTLKDIRDDEGYLKSLGMA 178
R+++G M+++++ + R + + ++ ++ GI VV +KDI E ++
Sbjct: 123 RSVIGRMTLDELLTERRAEINAKIREILDEAADPWGIKVVDVEIKDIDPPEEVQAAMEKQ 182
Query: 179 RTAEVKRDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYDAEV 238
AE + A I EAE + A I A E AA +A+A+ + E+ A +A+
Sbjct: 183 MAAERDKRAEILEAEGEAQAAILRAEG--EAEAAII-----LAEAEAEAEV-IARAEADA 234
Query: 239 ETKRAEAELAFELQAAKTKQRIKEEQMKI 267
A A QR EE ++I
Sbjct: 235 AKIIAAALREAPAAPQALAQRYLEELLEI 263
Score = 38.6 bits (90), Expect = 0.004
Identities = 42/144 (29%), Positives = 62/144 (43%), Gaps = 15/144 (10%)
Query: 228 EIKKATYDAEVETKRAE--AELAFELQAAKTKQRIKEEQMKIQVIERNQEIQ-----VQE 280
I + T D + +RAE A++ L A IK ++I+ I+ +E+Q
Sbjct: 125 VIGRMTLDELLTERRAEINAKIREILDEAADPWGIKVVDVEIKDIDPPEEVQAAMEKQMA 184
Query: 281 QEIQRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIR-LKGEAEAAAIA 339
E +R + LE AE E A+ + +I AEAEAEA + EA+AA I
Sbjct: 185 AERDKRAEILE------AEGEAQAAILRAEGEAEAAIILAEAEAEAEVIARAEADAAKII 238
Query: 340 AKARAEVEIMTKKAEAWKDYREAA 363
A A E +A A + E
Sbjct: 239 AAALREAP-AAPQALAQRYLEELL 261
>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 257 to 277 amino acids in length. This domain is
found associated with pfam00004. This domain has a
conserved LER sequence motif.
Length = 276
Score = 40.9 bits (96), Expect = 6e-04
Identities = 30/153 (19%), Positives = 68/153 (44%), Gaps = 2/153 (1%)
Query: 195 QKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYDAEVETKRAEAELAFELQAA 254
Q+ R E A+ ++ A+ ++ +A+ + E ++ T + + ++ A+ EL
Sbjct: 60 QEKTRQAELEAKIKEYEAQ-QAQAKLERARVEAEERRKTLQEQTQQEQQRAQYQDELARK 118
Query: 255 KTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPAEAEKYRIEKLAQANKK 314
+ ++ +++++ + + + + QE V QE RR E E + E + E + +
Sbjct: 119 RYQKELEQQRRQNEELLKMQEESVLRQEAMRRATEEEI-LEMRRETIEEEAELERENIRA 177
Query: 315 RLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVE 347
++ EA A+ R + + KA E E
Sbjct: 178 KIEAEARGRAKEERENEDINREMLKLKANEERE 210
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 40.0 bits (93), Expect = 0.002
Identities = 36/166 (21%), Positives = 72/166 (43%), Gaps = 17/166 (10%)
Query: 189 IGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYDA---EVETKRAEA 245
+ E E+Q+DA+ + + EE + KAQ+ + + D EV K+ EA
Sbjct: 203 LKERESQEDAKRAQQLKEELDKKQI-----DADKAQQKADFAQDNADKQRDEVRQKQQEA 257
Query: 246 ELAFE---LQAAKTKQRIKEEQMKIQVIERNQ-EIQVQEQEIQRRKKELESSVNRPAEAE 301
+ + + K +++ E Q + IE+ Q EI+ ++E + K + + ++A
Sbjct: 258 KNLPKPADTSSPKEDKQVAENQKR--EIEKAQIEIKKNDEEALKAKDHKAFDLKQESKAS 315
Query: 302 KYRIE-KLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEV 346
+ E K +A KKR + + + + EA+ ++ A
Sbjct: 316 EKEAEDKELEAQKKRE--PVAEDLQKTKPQVEAQPTSLNEDAIDSS 359
Score = 35.4 bits (81), Expect = 0.052
Identities = 28/172 (16%), Positives = 70/172 (40%), Gaps = 14/172 (8%)
Query: 197 DARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYDAEVETKRAEAELAFELQAAKT 256
D ++ EA+ E+ N+ + + ++K+ +++ + KRA+ + + K
Sbjct: 179 DKKVVEALRED--------NEKGVNFRRDMTDLKER--ESQEDAKRAQQ---LKEELDKK 225
Query: 257 KQRIKEEQMKIQVIERNQEIQVQE-QEIQRRKKELESSVNRPAEAEKYRIEKLAQANKKR 315
+ + Q K + N + Q E ++ Q+ K L + + E ++ + + ++
Sbjct: 226 QIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEK 285
Query: 316 LMIEAEAEAEAIRLKGEAEAAAIAAKARAEVEIMTKKAEAWKDYREAAVIDM 367
IE + E + +A + +++A + K + RE D+
Sbjct: 286 AQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAEDL 337
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 39.9 bits (94), Expect = 0.002
Identities = 27/135 (20%), Positives = 61/135 (45%), Gaps = 6/135 (4%)
Query: 212 ARFVNDTEIAKAQRDFEIKKATYDAEVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIE 271
+ + + ++ A+ + E ET + EA L AK + +++ ++ E
Sbjct: 19 RKRIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLE-----AKEEVHKLRAELERELKE 73
Query: 272 RNQEIQVQEQEIQRRKKELESSVNRPAEAEKYRIEKLAQ-ANKKRLMIEAEAEAEAIRLK 330
R E+Q E+ + +R++ L+ + + E+ +K + +NK++ + E E E E + +
Sbjct: 74 RRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAE 133
Query: 331 GEAEAAAIAAKARAE 345
E I+ + E
Sbjct: 134 QREELERISGLTQEE 148
Score = 37.2 bits (87), Expect = 0.013
Identities = 41/196 (20%), Positives = 85/196 (43%), Gaps = 40/196 (20%)
Query: 171 YLKSLGMARTAEVKRDARIGEAEAQKDA--RIKEAMAEEEKMAARFVNDTEIAKAQRDFE 228
K + + + A+ EA+K+A KEA+ E ++ E+ K + + E
Sbjct: 18 LRKRIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKE---------EVHKLRAELE 68
Query: 229 IKKATYDAEVETKRAEAELAFELQAAKTKQRIKEEQM---KIQVIERNQEIQVQEQEIQR 285
E++ +R ELQ + + +EE + + ++ + ++ +E+E+
Sbjct: 69 -------RELKERRN------ELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSN 115
Query: 286 RKKEL---ESSVNRPAEAEKYRIEKLA-----QANKKRLMIEAEAEAE---AIRLKGEAE 334
++K L E + ++ +E+++ +A K+ L+ E E EA A +K E
Sbjct: 116 KEKNLDEKEEELEELIAEQREELERISGLTQEEA-KEILLEEVEEEARHEAAKLIKEIEE 174
Query: 335 AAAIAAKARAEVEIMT 350
A A +A+ EI+
Sbjct: 175 EAKEEADKKAK-EILA 189
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 39.4 bits (93), Expect = 0.003
Identities = 42/181 (23%), Positives = 87/181 (48%), Gaps = 36/181 (19%)
Query: 171 YLKSLGMARTAEVKRDAR--IGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFE 228
K + A+ E + +A+ + EA+ + +A KEA+ E A+ +
Sbjct: 24 VRKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLE----------------AKEEIH 67
Query: 229 IKKATYDAEVETKRAEAELAFELQAAKTKQRIKEEQM--KIQVIE-RNQEIQVQEQEIQR 285
+ ++ E+ +R ELQ + + KEE + K++++E R +E++ +E+E+++
Sbjct: 68 KLRNEFEKELRERRN------ELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQ 121
Query: 286 RKKELESSVNRPAEAEKYRIEKLAQAN-------KKRLM--IEAEAEAEAIRLKGEAEAA 336
+++ELE E + ++++L + + K+ L+ +E EA EA L E E
Sbjct: 122 KQQELEKKEEELEELIEEQLQELERISGLTAEEAKEILLEKVEEEARHEAAVLIKEIEEE 181
Query: 337 A 337
A
Sbjct: 182 A 182
Score = 36.3 bits (85), Expect = 0.024
Identities = 25/132 (18%), Positives = 61/132 (46%), Gaps = 10/132 (7%)
Query: 212 ARFVNDTEIAKAQRDFE--IKKATYDAEVETKRAEAELAFELQAAKTKQRIKEEQMKIQV 269
+ + + +I +A+ + + +++A +AE K A E AK + + + ++
Sbjct: 25 RKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLE-------AKEEIHKLRNEFEKEL 77
Query: 270 IERNQEIQVQEQEIQRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEA-EAEAEAIR 328
ER E+Q E+ + ++++ L+ + + E+ +K + +K+ +E E E E +
Sbjct: 78 RERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELI 137
Query: 329 LKGEAEAAAIAA 340
+ E I+
Sbjct: 138 EEQLQELERISG 149
>gnl|CDD|239499 cd03405, Band_7_HflC, Band_7_HflC: The band 7 domain of flotillin
(reggie) like proteins. This group includes proteins
similar to prokaryotic HlfC (High frequency of
lysogenization C). Although many members of the band 7
family are lipid raft associated, prokaryote plasma
membranes lack cholesterol and are unlikely to have
lipid raft domains. Individual proteins of this band 7
domain family may cluster to form membrane microdomains
which may in turn recruit multiprotein complexes.
Escherichia coli HflC is an integral membrane protein
which may localize to the plasma membrane. HflC
associates with another band 7 family member (HflK) to
form an HflKC complex. HflKC interacts with FtsH in a
large complex termed the FtsH holo-enzyme. FtsH is an
AAA ATP-dependent protease which exerts progressive
proteolysis against membrane-embedded and soluble
substrate proteins. HflKC can modulate the activity of
FtsH. HflKC plays a role in the decision between
lysogenic and lytic cycle growth during lambda phage
infection.
Length = 242
Score = 38.7 bits (91), Expect = 0.003
Identities = 20/47 (42%), Positives = 29/47 (61%), Gaps = 6/47 (12%)
Query: 298 AEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEA--AAIAAKA 342
EAE+ R A A+++R +I AEA EA ++GE +A A I A+A
Sbjct: 179 EEAERIR----ADADRERTVILAEAYREAQEIRGEGDAEAARIYAEA 221
Score = 34.8 bits (81), Expect = 0.047
Identities = 20/47 (42%), Positives = 23/47 (48%), Gaps = 5/47 (10%)
Query: 319 EAEAEAEAIRLKGEAEAAAIAAKARAEVEIMTKKAEAWKDYREAAVI 365
E E A R +GE EA I A A E ++ AEA YREA I
Sbjct: 165 ERERIAAEFRAEGEEEAERIRADADRERTVIL--AEA---YREAQEI 206
Score = 29.8 bits (68), Expect = 2.1
Identities = 30/120 (25%), Positives = 56/120 (46%), Gaps = 10/120 (8%)
Query: 120 RAIMGSMSVEDIY-KDRKKFSKQVFQVASSDLVNMGITVVSYTLKDIRDDEGYLKSLGMA 178
RA G ++ ++ +R + +++ + + + +GI VV +K I E +S+
Sbjct: 102 RAEFGKRTLIELVSGERGELMEEIRRAVAEEAKELGIEVVDVRIKRIDLPEEVSESVYRR 161
Query: 179 RTAEVKRDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEI-AKAQRDFEIKKATYDAE 237
AE +R A AE +++A A A+ E+ T I A+A R+ + + DAE
Sbjct: 162 MRAERERIAAEFRAEGEEEAERIRADADRER--------TVILAEAYREAQEIRGEGDAE 213
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 39.0 bits (91), Expect = 0.003
Identities = 49/190 (25%), Positives = 82/190 (43%), Gaps = 9/190 (4%)
Query: 179 RTAEVKRDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKK----ATY 234
R + ++ A+ E + +K + + ++++ A + + E KK A
Sbjct: 66 RQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAK 125
Query: 235 DAEVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSV 294
A ++ K+AE A AAK K + ++ + E + + + +K E+
Sbjct: 126 QAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKK 185
Query: 295 NRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAI-RLKGEAEAAAIAAKARAEVEIMTKKA 353
AEA K A KK+ EA+ +A A + K AEA A AAKA AE + +KA
Sbjct: 186 KAEAEAAA----KAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKA 241
Query: 354 EAWKDYREAA 363
A K +AA
Sbjct: 242 AAAKAAEKAA 251
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 39.0 bits (90), Expect = 0.004
Identities = 51/211 (24%), Positives = 94/211 (44%), Gaps = 27/211 (12%)
Query: 178 ARTAEVKRDARIGEAEAQKDARIKEAM-----AEEEKMAARFVNDTEIAKAQRDFEIKKA 232
AR AE R + EA +DA+ E A + + A + + + A++ E++KA
Sbjct: 1133 ARKAEDAR--KAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKA 1190
Query: 233 TYDAEVE-TKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELE 291
+ E ++AEA E + + R E+ K + +++ +E + +E ++ ++E
Sbjct: 1191 EELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERN 1250
Query: 292 SSVNRPAEA----------------EKYRIEKLAQANKKRLMIE---AEAEAEAIRLKGE 332
+ R E E + ++L +A +K+ E AE + +A K +
Sbjct: 1251 NEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKK 1310
Query: 333 AEAAAIAAKARAEVEIMTKKAEAWKDYREAA 363
AE A A +A+ + E KKA+A K E A
Sbjct: 1311 AEEAKKADEAKKKAEEAKKKADAAKKKAEEA 1341
Score = 39.0 bits (90), Expect = 0.004
Identities = 43/188 (22%), Positives = 81/188 (43%), Gaps = 3/188 (1%)
Query: 178 ARTAEVKR---DARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATY 234
A+ AE K+ +A+ EA+K K+ E +K A E AK + +A
Sbjct: 1295 AKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEA 1354
Query: 235 DAEVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSV 294
A+ E A E + + K++ + K + ++ E + + +E +++ EL+ +
Sbjct: 1355 AADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAA 1414
Query: 295 NRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVEIMTKKAE 354
+A++ + + + +AE +A K +AE A A +A+ + E K E
Sbjct: 1415 AAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADE 1474
Query: 355 AWKDYREA 362
A K EA
Sbjct: 1475 AKKKAEEA 1482
Score = 38.2 bits (88), Expect = 0.008
Identities = 51/190 (26%), Positives = 93/190 (48%), Gaps = 12/190 (6%)
Query: 173 KSLGMARTAEVKRDARIGEAEAQKDARIKEA--MAEEEKMAARFVNDTEIAKAQRDFEIK 230
K+ + AE + A + +A++ + EA AEE K A AK + D E K
Sbjct: 1458 KAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKAD-EAK 1516
Query: 231 KA--TYDAEVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKK 288
KA A+ K EA+ A E + A+ K++ +E K + +++ +E + E+ ++ +
Sbjct: 1517 KAEEAKKADEAKKAEEAKKADEAKKAEEKKK-ADELKKAEELKKAEEKKKAEE--AKKAE 1573
Query: 289 ELESSVNRPAE----AEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARA 344
E ++ R AE AE+ RIE++ + ++ ++AE +A K +AE A + +
Sbjct: 1574 EDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKK 1633
Query: 345 EVEIMTKKAE 354
+VE + KK
Sbjct: 1634 KVEQLKKKEA 1643
Score = 37.8 bits (87), Expect = 0.010
Identities = 43/190 (22%), Positives = 88/190 (46%), Gaps = 10/190 (5%)
Query: 179 RTAEVKRDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYDAEV 238
+ E K+ A + +A++ + EA E + AA E + + ++A A+
Sbjct: 1323 KAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADA 1382
Query: 239 ETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESS--VNR 296
K+AE + +A + K++ +E++ K +++ + + E +++ +E + + +
Sbjct: 1383 AKKKAEEKK----KADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKK 1438
Query: 297 PAEAEKYRIEKLAQANKKRLMIEAEAEAEAIR----LKGEAEAAAIAAKARAEVEIMTKK 352
AE K E +A + + EA+ +AE + K +AE A A +A+ + E KK
Sbjct: 1439 KAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKK 1498
Query: 353 AEAWKDYREA 362
A+ K EA
Sbjct: 1499 ADEAKKAAEA 1508
Score = 37.4 bits (86), Expect = 0.013
Identities = 60/209 (28%), Positives = 93/209 (44%), Gaps = 15/209 (7%)
Query: 163 KDIRDDEGYLKSLGMARTAEVKRDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAK 222
+D R E K+ + E ++ +AEA K A + AEE K A N+ EI K
Sbjct: 1197 EDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRK 1256
Query: 223 AQR----DFEIKKATYDAEVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQV 278
+ F ++A AE K E + A E + A + + EE+ K ++ E
Sbjct: 1257 FEEARMAHFARRQAAIKAEEARKADELKKAEEKKKAD-EAKKAEEKKKADEAKKKAEEAK 1315
Query: 279 QEQEIQRR----KKELESSVNRPAEAEKYRIEKLAQANKKRLMIE-AEAEAEAIRLK-GE 332
+ E +++ KK+ +++ + EA+K A+A E AE +AEA K E
Sbjct: 1316 KADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEE 1375
Query: 333 AEAAAIAAKARAE----VEIMTKKAEAWK 357
A+ A AAK +AE + KKAE K
Sbjct: 1376 AKKKADAAKKKAEEKKKADEAKKKAEEDK 1404
Score = 36.3 bits (83), Expect = 0.032
Identities = 45/180 (25%), Positives = 77/180 (42%), Gaps = 3/180 (1%)
Query: 182 EVKRDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYDAEVETK 241
E+K+ + +AE +K + +EA EE E KA+ + E +
Sbjct: 1550 ELKKAEELKKAEEKK--KAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKM 1607
Query: 242 RAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPAEAE 301
+AE E K ++ K E+ K +V + ++ ++++ + KK E + + AE
Sbjct: 1608 KAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEA 1667
Query: 302 KYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVEIMTKKAEAWKDYRE 361
K E +A + + E E +A LK EAE A A + + + KKAE K E
Sbjct: 1668 KKAEEDKKKAEEAKKAEEDEKKAAE-ALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEE 1726
Score = 35.1 bits (80), Expect = 0.068
Identities = 53/206 (25%), Positives = 87/206 (42%), Gaps = 7/206 (3%)
Query: 163 KDIRDDEGYLKSLGMARTAEVKRDARIGEAEAQKDA----RIKEAMAEEEKMAARFVNDT 218
+D + E K+ + + E+++ +AEA + A + +EA E+ A V
Sbjct: 1173 EDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKA 1232
Query: 219 EIAKAQRDFEIKKATYDAEVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEI-Q 277
E AK + E KKA + E R E A + EE K +++ +E +
Sbjct: 1233 EEAKKDAE-EAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKK 1291
Query: 278 VQEQEIQRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAA 337
E + KK+ + + + EA+K K K+ A+ +AE + EA A A
Sbjct: 1292 ADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEA-AKA 1350
Query: 338 IAAKARAEVEIMTKKAEAWKDYREAA 363
A A E E +KAEA + +E A
Sbjct: 1351 EAEAAADEAEAAEEKAEAAEKKKEEA 1376
Score = 34.0 bits (77), Expect = 0.16
Identities = 42/159 (26%), Positives = 73/159 (45%), Gaps = 18/159 (11%)
Query: 188 RIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYD---------AEV 238
+ E E +K ++K+ AEE+K A E K + E KKA D AE
Sbjct: 1626 KKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEE 1685
Query: 239 ETKRAEAELAFELQAAKTKQRIK----EEQMKIQVIERNQEIQVQEQEIQRRKKELESSV 294
+ K+A L E + AK + +K EE+ K + +++ +E + E +++ E +
Sbjct: 1686 DEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKK- 1744
Query: 295 NRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEA 333
+AE+ + ++ + L E E +AE IR + EA
Sbjct: 1745 ----KAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEA 1779
Score = 32.4 bits (73), Expect = 0.43
Identities = 49/197 (24%), Positives = 82/197 (41%), Gaps = 16/197 (8%)
Query: 179 RTAEVKRDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYDAEV 238
+ E K+ A + A+ + EA EE A E +A++ E KKA +
Sbjct: 1491 KAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAE--EAKKADEAKKAEEKKKA 1548
Query: 239 ETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEI-QVQEQEIQRRKKELESSVN-- 295
+ + EL + K ++ K E+ K + + +E + +E I+ K E
Sbjct: 1549 DELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMK 1608
Query: 296 ----RPAEAEKYRIEKLAQANKKRLMIE------AEAEAEAIRLKGEAEAAAI-AAKARA 344
+ AE K + E+L +A +++ +E AE + +A LK E I AA+
Sbjct: 1609 AEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAK 1668
Query: 345 EVEIMTKKAEAWKDYRE 361
+ E KKAE K E
Sbjct: 1669 KAEEDKKKAEEAKKAEE 1685
Score = 32.0 bits (72), Expect = 0.58
Identities = 56/188 (29%), Positives = 88/188 (46%), Gaps = 14/188 (7%)
Query: 178 ARTAEVKRDARIGEAEAQK---DARIKEAMA---EEEKMAARFVNDTEIAKAQRDFEIKK 231
A+ AE ++ + +AE K +ARI+E M EE+KM A E AK + + E+KK
Sbjct: 1569 AKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAE-ELKK 1627
Query: 232 ATYDAEVETKRAEAELAFELQAAKTKQRIK--EEQMKIQVIERNQEIQVQEQEIQRRKKE 289
A E E K+ E E + K + +K EE+ KI+ E ++ + +++ + KK
Sbjct: 1628 A----EEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKA 1683
Query: 290 LESSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVEIM 349
E +K E K+ E + +AE ++ K E E A +A+ E E
Sbjct: 1684 EEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELK-KAEEENKIKAEEAKKEAEED 1742
Query: 350 TKKAEAWK 357
KKAE K
Sbjct: 1743 KKKAEEAK 1750
Score = 32.0 bits (72), Expect = 0.58
Identities = 50/200 (25%), Positives = 93/200 (46%), Gaps = 5/200 (2%)
Query: 163 KDIRDDEGYLKSLGMARTAEVKRDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAK 222
K +D+ L A A+ K D +AE +K A + AEE K A E AK
Sbjct: 1398 KKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAK 1457
Query: 223 AQRDFEIKKATYDAEVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQE 282
+ + KKA + + + +AE A +A + K++ +E + K ++ E + + E
Sbjct: 1458 KAEEAK-KKAEEAKKADEAKKKAEEA--KKADEAKKKAEEAKKKADEAKKAAEAKKKADE 1514
Query: 283 IQRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKA 342
++ ++ ++ + AE E + ++ +A +K+ E + +AE ++ E + A A KA
Sbjct: 1515 AKKAEEAKKADEAKKAE-EAKKADEAKKAEEKKKADELK-KAEELKKAEEKKKAEEAKKA 1572
Query: 343 RAEVEIMTKKAEAWKDYREA 362
+ + +KAE K EA
Sbjct: 1573 EEDKNMALRKAEEAKKAEEA 1592
Score = 32.0 bits (72), Expect = 0.61
Identities = 49/209 (23%), Positives = 85/209 (40%), Gaps = 14/209 (6%)
Query: 163 KDIRDDEGYLKSLGMARTAEVKRDARIGEAEAQKDARIKEA----MAEEEKMAARFVNDT 218
++ R++E K +R A I EA+K +K+A A+E K A
Sbjct: 1246 EEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKAD 1305
Query: 219 EIAK-AQRDFEIKKATYDAEVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQ 277
E K A+ + +A AE K+A+ + + + E K + E +
Sbjct: 1306 EAKKKAEEAKKADEAKKKAEEAKKKAD-----AAKKKAEEAKKAAEAAKAEAEAAADEAE 1360
Query: 278 VQEQEIQRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAA 337
E++ + +K+ E + + A+K EK K+ E + +A+ ++ A+ A
Sbjct: 1361 AAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKA 1420
Query: 338 IAAKARAE----VEIMTKKAEAWKDYREA 362
AK +AE + KKAE K EA
Sbjct: 1421 DEAKKKAEEKKKADEAKKKAEEAKKADEA 1449
Score = 32.0 bits (72), Expect = 0.63
Identities = 29/124 (23%), Positives = 54/124 (43%), Gaps = 5/124 (4%)
Query: 191 EAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYDAEVETKRAEAELAFE 250
EA+K +K+ AEE+K A E + ++A +AE + K+AE E
Sbjct: 1698 AEEAKKAEELKKKEAEEKKKAEELKKAEEE----NKIKAEEAKKEAEEDKKKAEEAKKDE 1753
Query: 251 LQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPAEAEKYRIEKLAQ 310
+ K KEE+ K + I + +E V E+E+ ++ V++ + + +
Sbjct: 1754 EEKKKIAHLKKEEEKKAEEIRKEKE-AVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIE 1812
Query: 311 ANKK 314
K+
Sbjct: 1813 GGKE 1816
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 37.8 bits (88), Expect = 0.009
Identities = 35/210 (16%), Positives = 85/210 (40%), Gaps = 9/210 (4%)
Query: 161 TLKDIRDDEGYLKSLGMARTAEVKR--------DARIGEAEAQKDARIKEAMAEEEKMAA 212
L+++ LK A E+++ + + E E + + + EE++ +
Sbjct: 710 QLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELES 769
Query: 213 RFVNDTEIAKAQRDFEIKKATYDAEVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIER 272
++ + + E K+ E+E E E A A ++ EQ + ++ +
Sbjct: 770 LEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQE 829
Query: 273 NQEIQVQEQEIQRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAE-AEAEAIRLKG 331
+E++ + +E++ + ELE + + + E+L + ++ +E E E E + +
Sbjct: 830 IEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEEL 889
Query: 332 EAEAAAIAAKARAEVEIMTKKAEAWKDYRE 361
E E + ++ E + K E ++
Sbjct: 890 EEELRELESELAELKEEIEKLRERLEELEA 919
Score = 35.8 bits (83), Expect = 0.038
Identities = 33/157 (21%), Positives = 64/157 (40%), Gaps = 8/157 (5%)
Query: 199 RIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYDAEVETKRAEAELAFELQAAKTKQ 258
R++E E E++ R E +A ++ ++ T E+E AE E A E K
Sbjct: 310 RLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSA 369
Query: 259 RIKEEQMKIQVIERNQ--------EIQVQEQEIQRRKKELESSVNRPAEAEKYRIEKLAQ 310
++E + + + EI+ + +E++R + LE + R +E + E+L +
Sbjct: 370 LLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKE 429
Query: 311 ANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVE 347
+ ++ E E L+ E E+E
Sbjct: 430 LEAELEELQTELEELNEELEELEEQLEELRDRLKELE 466
Score = 33.5 bits (77), Expect = 0.23
Identities = 34/162 (20%), Positives = 68/162 (41%), Gaps = 8/162 (4%)
Query: 172 LKSLGMARTAEVKRDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKK 231
L L A+ ++ + + E + ++E +AE E A N+ E K + + ++
Sbjct: 353 LAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEER 412
Query: 232 ATYDAEVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELE 291
+E E E + + + ++E ++ +E++ Q +E++ R KELE
Sbjct: 413 LERLSERLEDLKEELKELEAELEELQTELEELNEEL------EELEEQLEELRDRLKELE 466
Query: 292 SSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEA 333
+ + E R+EK + + RL EAE A +
Sbjct: 467 RELAE-LQEELQRLEKELSSLEARLD-RLEAEQRASQGVRAV 506
Score = 32.0 bits (73), Expect = 0.64
Identities = 29/157 (18%), Positives = 63/157 (40%), Gaps = 11/157 (7%)
Query: 187 ARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYDAEVETKRAEAE 246
+ E + + + KE + ++ + + + E+K+ + E E
Sbjct: 253 EELEELQEELEEAEKEIEELKSELEE--LREELEELQEELLELKEEIEELEGEISLLRER 310
Query: 247 LAF-ELQAAKTKQRIKEEQMKIQVIERN--------QEIQVQEQEIQRRKKELESSVNRP 297
L E + + ++R++E + KI+ ++ +E++ E++ K+ELE ++
Sbjct: 311 LEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSAL 370
Query: 298 AEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAE 334
E + E L + + AE E LK E E
Sbjct: 371 LEELEELFEALREELAELEAELAEIRNELEELKREIE 407
Score = 32.0 bits (73), Expect = 0.66
Identities = 36/171 (21%), Positives = 71/171 (41%), Gaps = 13/171 (7%)
Query: 184 KRDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYDAEVETKRA 243
+R A E + A ++E E + + + + + E+ + + E +
Sbjct: 206 ERQAEKAERYQELKAELREL---ELALLLAKLKELRKELEELEEELSRLEEELEELQEEL 262
Query: 244 EAELAFELQAAKTK-QRIKEEQMKIQ--VIERNQEIQVQEQEIQRRKKELESSVNRPAEA 300
E E E++ K++ + ++EE ++Q ++E +EI+ E EI ++ LE N E
Sbjct: 263 E-EAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEEL 321
Query: 301 EKY------RIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAE 345
E+ +IE L + ++R + E E L+ E A E
Sbjct: 322 EERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLE 372
Score = 31.2 bits (71), Expect = 1.0
Identities = 31/194 (15%), Positives = 77/194 (39%), Gaps = 12/194 (6%)
Query: 163 KDIRDDEGYLKSLGMARTAEVKRDARIGEAEAQKDARIKEAMAEEEKMAARFVN-DTEIA 221
+ + + ++ L R A + + E + + R+ E E + R + EI
Sbjct: 772 EALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIE 831
Query: 222 KAQRDFEIKKATYDAEVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQ 281
+ + + + +++ E E EL+ + + + E+++ + E E++ E+
Sbjct: 832 ELEE----EIEELEEKLDELEEELE---ELEKELEELKEELEELEAEKEELEDELKELEE 884
Query: 282 EIQRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAK 341
E + ++EL + AE + E++ + ++ +EA+ E + L E +
Sbjct: 885 EKEELEEELRELESELAELK----EEIEKLRERLEELEAKLERLEVELPELEEELEEEYE 940
Query: 342 ARAEVEIMTKKAEA 355
E E+ +
Sbjct: 941 DTLETELEREIERL 954
>gnl|CDD|240439 PTZ00491, PTZ00491, major vault protein; Provisional.
Length = 850
Score = 37.3 bits (87), Expect = 0.012
Identities = 27/110 (24%), Positives = 54/110 (49%), Gaps = 15/110 (13%)
Query: 254 AKTKQRI-KEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPAEAEKYRIEKLAQA- 311
+T+ + K Q+ I++ ++QE + Q E E+ E+ ++ A+A
Sbjct: 643 ERTRDSLQKSVQLAIEITTKSQEAAARHQAELL---EQEAR----GRLERQKMHDKAKAE 695
Query: 312 NKKRLMIEAEAEAEAIRLKGEAEAAAIA------AKARAEVEIMTKKAEA 355
++ ++E +AE+ A+ G++ A A+A +A AEVE +A+A
Sbjct: 696 EQRTKLLELQAESAAVESSGQSRAEALAEAEARLIEAEAEVEQAELRAKA 745
Score = 36.1 bits (84), Expect = 0.027
Identities = 35/125 (28%), Positives = 51/125 (40%), Gaps = 22/125 (17%)
Query: 239 ETKRAEAELAFELQAAKTKQRIKEEQMKIQVIE---RNQEIQVQEQEIQRRKKELESSVN 295
E EA E Q K + K E+ + +++E + ++ Q
Sbjct: 673 ELLEQEARGRLERQ--KMHDKAKAEEQRTKLLELQAESAAVESSGQSRAEAL-------- 722
Query: 296 RPAEAEKYRIE-----KLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVEIMT 350
AEAE IE + A+ K L IEAEAE E +R + E E +A+ E+EI
Sbjct: 723 --AEAEARLIEAEAEVEQAELRAKALRIEAEAELEKLRKRQELELEYE--QAQNELEIAK 778
Query: 351 KKAEA 355
K A
Sbjct: 779 AKELA 783
Score = 36.1 bits (84), Expect = 0.033
Identities = 38/131 (29%), Positives = 63/131 (48%), Gaps = 23/131 (17%)
Query: 237 EVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNR 296
E+ TK EA A+ + + E++ + + +ER Q++ + + ++R K LE
Sbjct: 658 EITTKSQEAA-------ARHQAELLEQEARGR-LER-QKMHDKAKAEEQRTKLLE----- 703
Query: 297 PAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKAR-----AEVEIMTK 351
+AE +E Q+ R AEAEA I + E E A + AKA AE+E + K
Sbjct: 704 -LQAESAAVESSGQS---RAEALAEAEARLIEAEAEVEQAELRAKALRIEAEAELEKLRK 759
Query: 352 KAEAWKDYREA 362
+ E +Y +A
Sbjct: 760 RQELELEYEQA 770
Score = 32.3 bits (74), Expect = 0.44
Identities = 39/217 (17%), Positives = 81/217 (37%), Gaps = 39/217 (17%)
Query: 119 QRAIMGSMSVEDIYKDRKKFSKQVFQVASSDLVNMGITVVSYTLKDIRDDEGYLKSLGMA 178
++A+ GS +D +F + + D+ ++ D R + KS+ +A
Sbjct: 604 RQAVFGSNDETGEVRDSLRFPANNLVITNVDVQSVEPV-------DERTRDSLQKSVQLA 656
Query: 179 RTAEVKRDARIGEAEAQKDARIKEAMA----EEEKMAARFVNDTE-------IAKAQRDF 227
E+ ++ EA A+ A + E A E +KM + + + A++
Sbjct: 657 --IEITTKSQ--EAAARHQAELLEQEARGRLERQKMHDKAKAEEQRTKLLELQAESAAVE 712
Query: 228 EIKKATYDAEVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQ----EQEI 283
++ +A E + E E++ A E + K IE E++ E E+
Sbjct: 713 SSGQSRAEALAEAEARLIEAEAEVEQA-------ELRAKALRIEAEAELEKLRKRQELEL 765
Query: 284 QRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEA 320
+ + + E + + E K + ++EA
Sbjct: 766 EYEQAQNELEIAKAKELADIEATKFER------IVEA 796
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 36.8 bits (86), Expect = 0.016
Identities = 39/191 (20%), Positives = 68/191 (35%), Gaps = 11/191 (5%)
Query: 185 RDARIGEAEAQKDARIKEAMAEEEKMAARFVND-TEIAKAQRDFEIKKATYDAEVETKRA 243
R AR+ +A ++AR K+A V KA++ + A +
Sbjct: 458 RQARLEREKAAREARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARPDNS 517
Query: 244 EAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPAEAEKY 303
A E + A+ + R E+Q + + + +K +++ E
Sbjct: 518 AVIAAREARKAQARARQAEKQAAAAADPKKAAVAAAIARAKAKKAAQQAANAEAEEEVDP 577
Query: 304 RIEKL------AQANKKRLMIEAEAEAEA---IRLKGEAEAAAI-AAKARAEVEIMTKKA 353
+ + A+A K + E + K A AAAI AKA+ + +
Sbjct: 578 KKAAVAAAIARAKAKKAAQQAASAEPEEQVAEVDPKKAAVAAAIARAKAKKAEQQANAEP 637
Query: 354 EAWKDYREAAV 364
E D R+AAV
Sbjct: 638 EEPVDPRKAAV 648
Score = 36.8 bits (86), Expect = 0.019
Identities = 45/208 (21%), Positives = 72/208 (34%), Gaps = 21/208 (10%)
Query: 178 ARTAEVKRDARIGEAEAQKDARIKEAMAEEEKMAARFVNDT----------EIAKAQRDF 227
+ A R A+ +A A AR+K A + E KAQ
Sbjct: 473 HKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARPDNSAVIAAREARKAQARA 532
Query: 228 EIKKATYDAEVETKRAEAELAFE-LQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRR 286
+ A + K+A A +A K Q+ + + +V + + + +
Sbjct: 533 RQAEKQAAAAADPKKAAVAAAIARAKAKKAAQQAANAEAEEEVDPKKAAVAAAIARAKAK 592
Query: 287 KKELESSVNRPAEAEKYRIEKLAQAN--------KKRLMIEAEAEAEAIRLKGEAEAAAI 338
K +++ P E K A KK E + + A AAAI
Sbjct: 593 KAAQQAASAEPEEQVAEVDPKKAAVAAAIARAKAKKAEQQANAEPEEPVDPRKAAVAAAI 652
Query: 339 A-AKAR-AEVEIMTKKAEAWKDYREAAV 364
A AKAR A + + E +D ++AAV
Sbjct: 653 ARAKARKAAQQQANAEPEEAEDPKKAAV 680
Score = 33.4 bits (77), Expect = 0.19
Identities = 46/186 (24%), Positives = 70/186 (37%), Gaps = 18/186 (9%)
Query: 177 MARTAEVKRDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQR-DFEIKKATYD 235
+A K AR +AE Q A A AR AKA++ + A +
Sbjct: 520 IAAREARKAQARARQAEKQAAAAADPKKAAVAAAIAR-------AKAKKAAQQAANAEAE 572
Query: 236 AEVETKRAEAELAF-ELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRK-----KE 289
EV+ K+A A +A K Q+ + + QV E + + I R K ++
Sbjct: 573 EEVDPKKAAVAAAIARAKAKKAAQQAASAEPEEQVAEVDPKKAAVAAAIARAKAKKAEQQ 632
Query: 290 LESSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVEIM 349
+ P + K + K R + +A AE + +AA AA ARA+
Sbjct: 633 ANAEPEEPVDPRKAAVAAAIARAKARKAAQQQANAEPEEAEDPKKAAVAAAIARAK---- 688
Query: 350 TKKAEA 355
KKA
Sbjct: 689 AKKAAQ 694
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 36.5 bits (84), Expect = 0.017
Identities = 39/152 (25%), Positives = 67/152 (44%), Gaps = 8/152 (5%)
Query: 218 TEIAKAQRDFEIKKATYDAEVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQ 277
+ K ++ KK AE + AE Q K + + +E+Q + + E+ +++
Sbjct: 72 SSAKKGEQQ-RKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLE 130
Query: 278 VQEQEIQRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGE---AE 334
++QE Q RK E +AE + + A+A K + EA+ +AE E A+
Sbjct: 131 QKQQEEQARKAAAEQK----KKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAK 186
Query: 335 AAAIAAKARAEVEIMTKKAEAWKDYREAAVID 366
A A AAK +AE E +A + A +
Sbjct: 187 AEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAE 218
Score = 33.8 bits (77), Expect = 0.14
Identities = 54/189 (28%), Positives = 71/189 (37%), Gaps = 28/189 (14%)
Query: 167 DDEGYLKSLGMARTAEVKRDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRD 226
++ LK L R ++ + EAE Q K+ + K AA E AKA+
Sbjct: 98 AEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAA 157
Query: 227 FEIKKATYDAEVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRR 286
E K AE + K EA A E AK +
Sbjct: 158 AEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAA-------------------------AA 192
Query: 287 KKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEV 346
KK+ E+ AE K E A+A KK EA AE +A K +A A A A KA A
Sbjct: 193 KKKAEAEAKAAAEKAKAEAEAKAKAEKKA---EAAAEEKAAAEKKKAAAKAKADKAAAAA 249
Query: 347 EIMTKKAEA 355
+ +KA A
Sbjct: 250 KAAERKAAA 258
Score = 31.8 bits (72), Expect = 0.54
Identities = 38/130 (29%), Positives = 61/130 (46%), Gaps = 2/130 (1%)
Query: 235 DAEVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSV 294
DA + A + +Q+ ++ + E+Q K + + +E++ ++ Q R K+LE
Sbjct: 51 DAVMVDPGAVVQQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKER 110
Query: 295 NRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVEIMTKKAE 354
+ E +K E QA ++ E +A A K +AEAA AKA AE + AE
Sbjct: 111 LKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAK--AKAAAEAAKLKAAAE 168
Query: 355 AWKDYREAAV 364
A K EAA
Sbjct: 169 AKKKAEEAAK 178
>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein. Members of this
protein family belong to a conserved gene four-gene
neighborhood found sporadically in a phylogenetically
broad range of bacteria: Nocardia farcinica,
Symbiobacterium thermophilum, and Streptomyces
avermitilis (Actinobacteria), Geobacillus kaustophilus
(Firmicutes), Azoarcus sp. EbN1 and Ralstonia
solanacearum (Betaproteobacteria). Proteins in this
family average over 1400 amino acids in length
[Hypothetical proteins, Conserved].
Length = 1353
Score = 35.6 bits (82), Expect = 0.054
Identities = 35/182 (19%), Positives = 56/182 (30%), Gaps = 14/182 (7%)
Query: 178 ARTAEVKRDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFE--------- 228
A R A A ++ R E A + + + A + F
Sbjct: 807 RELARAARKAAAAAAAWKQARRELERDAADLDLPTDPDALEAVGLALKRFGDHLHTLEVA 866
Query: 229 ----IKKATYDAEVETKRAEAELAFELQAAK-TKQRIKEEQMKIQVIERNQEIQVQEQEI 283
AT AE + A AE A + R + E+ +++ + + EI
Sbjct: 867 VRELRHAATRAAEQRARAARAESDAREAAEDAAEARAEAEEASLRLRTLEESVGAMVDEI 926
Query: 284 QRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKAR 343
+ R E +++ E LA A + R E + L AEA A
Sbjct: 927 RARLAETRAALASGGRELPRLAEALATAEEARGRAEEKRAEADATLDERAEARDHAIGQL 986
Query: 344 AE 345
E
Sbjct: 987 RE 988
Score = 28.2 bits (63), Expect = 9.2
Identities = 42/215 (19%), Positives = 68/215 (31%), Gaps = 31/215 (14%)
Query: 174 SLGMARTAEVKRDAR-IG---------EAEAQKDARIKEAMAEEEKMAARFVNDTEIAKA 223
LG+ R A K A IG A+ DAR+ E ++A +A
Sbjct: 715 RLGVLRGAWAKPAAEYIGAAARERARLRRIAELDARLAAVDDELAELARELRALGARQRA 774
Query: 224 QRDFEIKKATYDAEVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEI 283
D E+ A D + A A E QA ++ + K + +E+
Sbjct: 775 LAD-ELAGAPSDRSLRAAHRRAAEA-ERQAESAERELARAARKAAAAAA--AWKQARREL 830
Query: 284 QRRKKELESSVNRPA----------------EAEKYRIEKLAQANKKRLMIEAEAEAEAI 327
+R +L+ + A E E A + A AE+
Sbjct: 831 ERDAADLDLPTDPDALEAVGLALKRFGDHLHTLEVAVRELRHAATRAAEQRARAARAESD 890
Query: 328 RLKGEAEAAAIAAKA-RAEVEIMTKKAEAWKDYRE 361
+ +AA A+A A + + T + E
Sbjct: 891 AREAAEDAAEARAEAEEASLRLRTLEESVGAMVDE 925
Score = 28.2 bits (63), Expect = 9.9
Identities = 35/185 (18%), Positives = 62/185 (33%), Gaps = 8/185 (4%)
Query: 162 LKDIRDDEGYLKSLGMARTAEVKRDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIA 221
L+ ++ Y + + R + A+A++ R E+ EEE+ E A
Sbjct: 320 LEALQGSPAYQDAEELERARADAEALQAAAADARQAIREAESRLEEERRRL-----DEEA 374
Query: 222 KAQRDFEIKKATYDAEVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIE-RNQEIQVQE 280
D E + ++ A L+ A +E ++ ++ R Q+
Sbjct: 375 GRLDDAERELRAAREQLARAAERAGLSPAHTAEPDAALAAQELQELGALDARRQDADRVI 434
Query: 281 QEIQRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAA 340
+ + L + AE A+A + L EAE A L EA A
Sbjct: 435 AQRSEQVALLRRRDDVADRAE--ATHAAARARRDELDEEAEQAAARAELADEAVHREGAR 492
Query: 341 KARAE 345
A +
Sbjct: 493 LAWVD 497
>gnl|CDD|239501 cd03407, Band_7_4, A subgroup of the band 7 domain of flotillin
(reggie) like proteins. This subgroup contains proteins
similar to stomatin, prohibitin, flotillin, HlfK/C and
podicin. Many of these band 7 domain-containing
proteins are lipid raft-associated. Individual proteins
of this band 7 domain family may cluster to form
membrane microdomains which may in turn recruit
multiprotein complexes. Microdomains formed from
flotillin proteins may in addition be dynamic units with
their own regulatory functions. Flotillins have been
implicated in signal transduction, vesicle trafficking,
cytoskeleton rearrangement and are known to interact
with a variety of proteins. Stomatin interacts with and
regulates members of the degenerin/epithelia Na+ channel
family in mechanosensory cells of Caenorhabditis elegans
and vertebrate neurons and participates in trafficking
of Glut1 glucose transporters. Prohibitin may act as a
chaperone for the stabilization of mitochondrial
proteins. Prokaryotic HflK/C plays a role in the
decision between lysogenic and lytic cycle growth during
lambda phage infection. Flotillins have been implicated
in the progression of prion disease, in the pathogenesis
of neurodegenerative diseases such as Parkinson's and
Alzheimer's disease and, in cancer invasion and
metastasis. Mutations in the podicin gene give rise to
autosomal recessive steroid resistant nephritic
syndrome.
Length = 262
Score = 34.6 bits (80), Expect = 0.057
Identities = 25/120 (20%), Positives = 54/120 (45%), Gaps = 9/120 (7%)
Query: 120 RAIMGSMSVEDIYKDRKKFSKQVFQVASSDLVNMGITVVSYTLKDIRDDEGYLKSLGMAR 179
RA + ++++++++ + + +K V + + G +V+ + DI D +++
Sbjct: 94 RARIPKLTLDELFEQKDEIAKAVEEELREAMSRYGFEIVATLITDIDPDAEVKRAMNEIN 153
Query: 180 TAEVKRDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYDAEVE 239
A+ +R A + +AEA+K IK A A+ E + E ++A D E
Sbjct: 154 AAQRQRVAAVHKAEAEKIKDIKAAEADAEAK---------RLQGVGAAEQRQAIADGLRE 204
Score = 34.2 bits (79), Expect = 0.069
Identities = 21/64 (32%), Positives = 31/64 (48%)
Query: 306 EKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVEIMTKKAEAWKDYREAAVI 365
A+A K + + AEA+AEA RL+G A A A E + A+A V+
Sbjct: 163 VHKAEAEKIKDIKAAEADAEAKRLQGVGAAEQRQAIADGLRESILSLADAVPGMTAKDVM 222
Query: 366 DMIL 369
D++L
Sbjct: 223 DLLL 226
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 35.0 bits (80), Expect = 0.057
Identities = 44/203 (21%), Positives = 80/203 (39%), Gaps = 23/203 (11%)
Query: 176 GMARTAEVKRDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYD 235
G R AE +R+A E + AR + +E AAR +AKAQ++
Sbjct: 95 GEKRAAETEREAARSELQK---ARQEREAVRQELAAAR----QNLAKAQQELARLTKQAQ 147
Query: 236 ------AEVETKRAEAELAFE-LQAAKTKQRIKEEQMKIQVIE---RNQEIQVQEQEIQR 285
+ +R + E + LQA++ + + Q+K QV++ R+ +I+ + Q +
Sbjct: 148 DLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLAT 207
Query: 286 RKKELESSVNRPAEAEKYRIEKLAQANKKRLMI-----EAEAEAEAIRLKGEAEAAAIAA 340
R ++ A + ++ I + A AE IR + A
Sbjct: 208 RANAAQARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRERERQLQRLETA 267
Query: 341 KARAEVEIMTKKAEAWKDYREAA 363
+AR E E+ +A ++ Y
Sbjct: 268 QARLEQEVAQLEA-YYQAYVRLR 289
Score = 30.4 bits (68), Expect = 1.7
Identities = 37/171 (21%), Positives = 63/171 (36%), Gaps = 11/171 (6%)
Query: 185 RDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYDAE-VETKRA 243
R+ R G + A+ E A E + E++KA + E V + A
Sbjct: 67 RNLRSGVFQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELA 126
Query: 244 EAEL----AFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVN---- 295
A A + A TKQ + + E+ ++++ Q Q +Q +K+L++S
Sbjct: 127 AARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKS 186
Query: 296 --RPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARA 344
+ +IE+ AQ R E R A+ A A + R
Sbjct: 187 QVLDLKLRSAQIEQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRD 237
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 34.8 bits (80), Expect = 0.064
Identities = 38/175 (21%), Positives = 71/175 (40%), Gaps = 5/175 (2%)
Query: 185 RDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYDAEVETKRAE 244
+ + A+ +++ + K EE R + ++ +KA AE + +
Sbjct: 56 QQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAE-QAAKQA 114
Query: 245 AELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPAEAEKYR 304
E + + AK KQ + + E+ + + ++Q + K + + + A K +
Sbjct: 115 EEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKK 174
Query: 305 IE----KLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVEIMTKKAEA 355
E A+A K EA+A+AEA + K AEAAA A A + +A
Sbjct: 175 AEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAAAAEAERKA 229
Score = 32.5 bits (74), Expect = 0.32
Identities = 34/132 (25%), Positives = 60/132 (45%), Gaps = 9/132 (6%)
Query: 241 KRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEI--QRRKKELESSVNRPA 298
+ + A + + + +QR E+ Q + ++ + ++++ + K+ E+ A
Sbjct: 79 EAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEA 138
Query: 299 EAEKYRIEKLA----QANKKRLMIEAEAEAEAIRLKGEAEAAAIA---AKARAEVEIMTK 351
EAEK E+ + K + EA+ +A + K EAEA A A AKA+AE
Sbjct: 139 EAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKA 198
Query: 352 KAEAWKDYREAA 363
+A K EAA
Sbjct: 199 EAAKAKAAAEAA 210
Score = 31.3 bits (71), Expect = 0.75
Identities = 33/129 (25%), Positives = 56/129 (43%), Gaps = 5/129 (3%)
Query: 235 DAEVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSV 294
A + A A+ A +Q K KE++ + ++ ++ +E + Q Q R+KELE
Sbjct: 39 QAVLVDPGAVAQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRA 98
Query: 295 NRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVEIMTKKAE 354
A++ K++ EA+A+ A EA+A A A + E K+AE
Sbjct: 99 AAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAA-----EAKAKAEAEAEKKAKEEAKKQAE 153
Query: 355 AWKDYREAA 363
+ AA
Sbjct: 154 EEAKAKAAA 162
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 35.0 bits (81), Expect = 0.066
Identities = 31/188 (16%), Positives = 70/188 (37%), Gaps = 21/188 (11%)
Query: 162 LKDIRDDEGY--LKSLGMARTAEVKRDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTE 219
LK+ R+ EGY LK + + ++ E + + +E E+++ E
Sbjct: 217 LKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEE 276
Query: 220 IAKAQRDFEIKKATYDAEVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQ 279
+ K +IK + ++ K EL E+ + + KE +++ ER +++ +
Sbjct: 277 LNK-----KIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELE-DAEERLAKLEAE 330
Query: 280 EQEIQRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIA 339
++ +ELE + + ++ + E AE + AE +
Sbjct: 331 IDKLLAEIEELEREIEE-------------ERKRRDKLTEEYAELKEELEDLRAELEEVD 377
Query: 340 AKARAEVE 347
+ +
Sbjct: 378 KEFAETRD 385
Score = 33.5 bits (77), Expect = 0.21
Identities = 36/200 (18%), Positives = 83/200 (41%), Gaps = 15/200 (7%)
Query: 162 LKDIRDDEGYLKSLGMARTAEVKR-DARIGEAEAQKDARIKEAMAEEEKMAA--RFVNDT 218
L+++ +D L+ +E+K +ARI E E +++EA+ + E + R
Sbjct: 739 LEELEEDLSSLEQEIENVKSELKELEARIEELEEDL-HKLEEALNDLEARLSHSRIPEIQ 797
Query: 219 EIAKAQRDF--EIKKATYDAEVETKRAEAELAFELQAAKTKQ--------RIKEEQMKIQ 268
+ I+ + E + R E + + + Q +IK + +I+
Sbjct: 798 AELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIE 857
Query: 269 VIE-RNQEIQVQEQEIQRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAI 327
+ + +E++ + +E++ ++LES + + +L + +K +EA+ E +
Sbjct: 858 NLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRK 917
Query: 328 RLKGEAEAAAIAAKARAEVE 347
RL + +E+E
Sbjct: 918 RLSELKAKLEALEEELSEIE 937
Score = 30.0 bits (68), Expect = 2.7
Identities = 40/208 (19%), Positives = 79/208 (37%), Gaps = 17/208 (8%)
Query: 184 KRDARIGEAEAQKDARIKEAMAEEEKMAARFVND-----TEIAKAQRDFEIKKATY-DAE 237
+ + R+ E E + K+ E+ R EIA +R K+ DAE
Sbjct: 262 ELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAE 321
Query: 238 VETKRAEAEL-AFELQAAKTKQRIKEEQM-KIQVIERNQEIQVQEQEIQRRKKELESSVN 295
+ EAE+ + + ++ I+EE+ + ++ E E++ + ++++ +E++
Sbjct: 322 ERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFA 381
Query: 296 RPAEAEKYRIEKLAQANKK---------RLMIEAEAEAEAIRLKGEAEAAAIAAKARAEV 346
+ K EKL + ++ RL E + +E + A A A E
Sbjct: 382 ETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEE 441
Query: 347 EIMTKKAEAWKDYREAAVIDMILESLPK 374
E K E K + + L +
Sbjct: 442 EKEDKALEIKKQEWKLEQLAADLSKYEQ 469
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 35.0 bits (81), Expect = 0.070
Identities = 43/193 (22%), Positives = 76/193 (39%), Gaps = 10/193 (5%)
Query: 187 ARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYDAEVETKRAE-A 245
+ E A + +++E A+ E++ ++ E + K E+E+ AE
Sbjct: 305 QILRERLANLERQLEELEAQLEELESKLDELAEELAELEE---KLEELKEELESLEAELE 361
Query: 246 ELAFELQAAKTKQRIKEEQMKIQ---VIERNQEIQVQEQEIQRRKKELESS---VNRPAE 299
EL EL+ +++ EEQ++ V + +I EI+R + LE R +
Sbjct: 362 ELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQ 421
Query: 300 AEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVEIMTKKAEAWKDY 359
+ ++KL +A K L E E E + E A E+ +
Sbjct: 422 EIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAE 481
Query: 360 REAAVIDMILESL 372
RE A + L+SL
Sbjct: 482 RELAQLQARLDSL 494
Score = 33.9 bits (78), Expect = 0.17
Identities = 28/178 (15%), Positives = 67/178 (37%), Gaps = 6/178 (3%)
Query: 178 ARTAEVKRDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYDAE 237
+ E++ E + + E + EE + + + + + ++ + + E
Sbjct: 684 EKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVE 743
Query: 238 VETKR---AEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSV 294
+R EL EL+A + + E+ + ++ E EI+ E +I++ K+EL++
Sbjct: 744 QLEERIAQLSKELT-ELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALR 802
Query: 295 NRPAEAEK-YRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVEIMTK 351
E + AN + + E A + E + + ++E +
Sbjct: 803 EALDELRAELTLLNEEAANLRERLESLERRIAATERRLE-DLEEQIEELSEDIESLAA 859
Score = 32.7 bits (75), Expect = 0.32
Identities = 30/182 (16%), Positives = 57/182 (31%), Gaps = 20/182 (10%)
Query: 183 VKRDARIGEAEAQKDA---RIKEAMAEEEKMAARFVNDTE-IAKAQRDFEIKKATYDAEV 238
+ +A+I + + + A + E AE + N E + +R + +
Sbjct: 785 EELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLE 844
Query: 239 ETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPA 298
E +E L A + E+++ ++ E E+ + + ELE
Sbjct: 845 EQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELR 904
Query: 299 EAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVEIMTKKAEAWKD 358
E E R E + + R E A EV I + ++
Sbjct: 905 ELESKRSELRRELEELR----------------EKLAQLELRLEGLEVRIDNLQERLSEE 948
Query: 359 YR 360
Y
Sbjct: 949 YS 950
Score = 32.3 bits (74), Expect = 0.50
Identities = 44/191 (23%), Positives = 87/191 (45%), Gaps = 10/191 (5%)
Query: 181 AEVKRDARIGEAEAQKDARIKEAMAEEEKMAAR-FVNDTEIAKAQRDFEIKKATYDAEVE 239
E++R + E +A+K R KE AE ++ V E + + + E+++ +AE E
Sbjct: 196 NELERQLKSLERQAEKAERYKELKAELRELELALLVLRLEELREELE-ELQEELKEAEEE 254
Query: 240 TKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPAE 299
+ AEL EL+ + R++ +++ ++ E +E+ EI R +++ + R A
Sbjct: 255 LEELTAELQ-ELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLAN 313
Query: 300 AEK------YRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIA-AKARAEVEIMTKK 352
E+ ++E+L + AE E + LK E E+ + AE+E + +
Sbjct: 314 LERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESR 373
Query: 353 AEAWKDYREAA 363
E ++ E
Sbjct: 374 LEELEEQLETL 384
>gnl|CDD|182913 PRK11029, PRK11029, FtsH protease regulator HflC; Provisional.
Length = 334
Score = 34.3 bits (79), Expect = 0.081
Identities = 34/134 (25%), Positives = 55/134 (41%), Gaps = 26/134 (19%)
Query: 231 KATYDAEVETKRAEAELAFELQAAKTKQRIKEEQ----------MKIQVIE-RNQEI--- 276
+ E E A+ A A + + K + + I+V++ R ++I
Sbjct: 157 SGSAGTEDEVATPAADDAIASAAERVEAETKGKVPVINPNSMAALGIEVVDVRIKQINLP 216
Query: 277 -QVQEQEIQRRKKELESSVNR-----PAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLK 330
+V + R + E E+ R EAEK R A A+ + AEAE + ++
Sbjct: 217 TEVSDAIYNRMRAEREAVARRHRSQGQEEAEKLR----ATADYEVTRTLAEAERQGRIMR 272
Query: 331 GE--AEAAAIAAKA 342
GE AEAA + A A
Sbjct: 273 GEGDAEAAKLFADA 286
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 34.7 bits (80), Expect = 0.096
Identities = 30/150 (20%), Positives = 53/150 (35%), Gaps = 11/150 (7%)
Query: 172 LKSLGMARTAEVKRDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFE--- 228
+K + +A E D + ++ +A+ + + A EI K Q DFE
Sbjct: 200 VKRMIVAIAEEHLADGTVPPKSRLNPQDVEHWIADIQALRAIQKVAPEIEKLQEDFEQLL 259
Query: 229 --------IKKATYDAEVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQE 280
+ E + E + + R E+Q+K E NQE+
Sbjct: 260 SLELRLQHLHGELVADEERLAEEQEERQEAKNRLRQQLRTLEDQLKEARDELNQELSAAN 319
Query: 281 QEIQRRKKELESSVNRPAEAEKYRIEKLAQ 310
++ + ELE ++ E IE+L
Sbjct: 320 AKLAADRSELELLEDQKGAFEDADIEQLQA 349
Score = 32.4 bits (74), Expect = 0.44
Identities = 35/160 (21%), Positives = 73/160 (45%), Gaps = 14/160 (8%)
Query: 187 ARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYDAEVETKRAEAE 246
R+ +AE + + + EE++ + E+ + +R AE E + A +
Sbjct: 607 ERLQQAEEALQSAVAKQKQAEEQLVQA---NAELEEQKR----------AEAEARTALKQ 653
Query: 247 LAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPAEAEKYRI- 305
+LQ + +Q+ ++++++ + ER Q+ + Q +++ + K+L EA K
Sbjct: 654 ARLDLQRLQNEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFR 713
Query: 306 EKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAE 345
E + K ++E E + + +L EAA AKAR +
Sbjct: 714 ELRTERLAKWQVVEGELDNQLAQLSAAIEAARTQAKARLK 753
>gnl|CDD|233644 TIGR01932, hflC, HflC protein. HflK and HflC are paralogs encoded
by tandem genes in Proteobacteria, spirochetes, and some
other bacterial lineages. The HflKC complex is anchored
in the membrane and exposed to the periplasm. The
complex is not active as a protease, but rather binds to
and appears to modulate the ATP-dependent protease FtsH.
The overall function of HflKC is not fully described
[Protein fate, Degradation of proteins, peptides, and
glycopeptides, Regulatory functions, Protein
interactions].
Length = 317
Score = 34.0 bits (78), Expect = 0.10
Identities = 31/142 (21%), Positives = 54/142 (38%), Gaps = 27/142 (19%)
Query: 228 EIKKATYDAEVETKRAEAELAFELQAAKTKQRIKEE------QMKIQVIERNQ--EIQVQ 279
+K+ + + ++LA K + +I I +Q +I ++
Sbjct: 135 GLKEIVRSSNDQLDTLVSKLALNRGGKINKIAMTITKGREILAREISQIANSQLKDIGIE 194
Query: 280 EQEIQRRK----KELESSVN------RPAEAEKYRI-------EKLAQANKKRLMIEAEA 322
+++ +K EL S+ R A +R E L +A + I +EA
Sbjct: 195 VVDVRIKKINYSDELSESIYNRMRSEREQIARMHRSQGEEKAEEILGKAEYEVRKILSEA 254
Query: 323 EAEAIRLKGE--AEAAAIAAKA 342
A +KGE AEAA I + A
Sbjct: 255 YRTARIIKGEGDAEAAKIYSDA 276
Score = 33.6 bits (77), Expect = 0.14
Identities = 28/110 (25%), Positives = 52/110 (47%), Gaps = 7/110 (6%)
Query: 129 EDIYKDRKKFSKQVFQVASSDLVNMGITVVSYTLKDIRDDEGYLKSLGMARTAEVKRDAR 188
I K R+ ++++ Q+A+S L ++GI VV +K I + +S+ +E ++ AR
Sbjct: 167 MTITKGREILAREISQIANSQLKDIGIEVVDVRIKKINYSDELSESIYNRMRSEREQIAR 226
Query: 189 IGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYDAEV 238
+ ++ ++ A AE E +++A R I K DAE
Sbjct: 227 MHRSQGEEKAEEILGKAEYEVR-------KILSEAYRTARIIKGEGDAEA 269
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 34.3 bits (79), Expect = 0.11
Identities = 35/106 (33%), Positives = 54/106 (50%), Gaps = 6/106 (5%)
Query: 252 QAAKTKQRIKEEQMKIQ--VIERNQEIQVQEQEIQRRKKELESSVNRPAEAEKYRIEKLA 309
Q A + I EQ ++ + E+ + Q ++ RKK + +N A++ ++E+L
Sbjct: 172 QLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKK-TLAQLNSELSADQKKLEEL- 229
Query: 310 QANKKRLMIE-AEAEAEAIRLKGEAEAAAIAAKARAEVEIMTKKAE 354
+AN+ RL E A AEA A + EA AAA AA ARA + E
Sbjct: 230 RANESRLKNEIASAEAAAAK-AREAAAAAEAAAARARAAEAKRTGE 274
Score = 28.1 bits (63), Expect = 8.5
Identities = 19/106 (17%), Positives = 42/106 (39%), Gaps = 7/106 (6%)
Query: 263 EQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPAEAEKYRIEKLAQANKKRLM-IEAE 321
E++ +++ EI + EL + ++ ++ ++ +L + KK L + +E
Sbjct: 161 ERIDALKATL-KQLAAVRAEIAAEQAELTTLLSEQRA-QQAKLAQLLEERKKTLAQLNSE 218
Query: 322 AEAEAIRLK----GEAEAAAIAAKARAEVEIMTKKAEAWKDYREAA 363
A+ +L+ E+ A A A + A A + A
Sbjct: 219 LSADQKKLEELRANESRLKNEIASAEAAAAKAREAAAAAEAAAARA 264
>gnl|CDD|179302 PRK01558, PRK01558, V-type ATP synthase subunit E; Provisional.
Length = 198
Score = 33.6 bits (77), Expect = 0.11
Identities = 16/47 (34%), Positives = 23/47 (48%)
Query: 309 AQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVEIMTKKAEA 355
A+ +++EA+ EAE I K E EA + AKA E + A
Sbjct: 20 AERLANEIILEAKEEAEEIIAKAEEEAKELKAKAEKEANDYKRHALE 66
Score = 28.2 bits (63), Expect = 6.2
Identities = 18/48 (37%), Positives = 25/48 (52%)
Query: 298 AEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAE 345
EAE+ E + +A ++ I A+AE EA LK +AE A K A
Sbjct: 18 EEAERLANEIILEAKEEAEEIIAKAEEEAKELKAKAEKEANDYKRHAL 65
>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
Length = 428
Score = 33.9 bits (78), Expect = 0.14
Identities = 35/99 (35%), Positives = 53/99 (53%), Gaps = 13/99 (13%)
Query: 250 ELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKK---ELESSVNRPAEAEKYRIE 306
EL A K + K+ Q K + E+ + Q EQ RKK LESS+ + ++ ++
Sbjct: 181 ELAAQKAELEEKQSQQKTLLYEQQAQQQKLEQARNERKKTLTGLESSL----QKDQQQLS 236
Query: 307 KLAQANKKRL---MIEAEAEAEAIRLKGEA-EAAAIAAK 341
+L +AN+ RL + AE EA+A R + EA EAA + K
Sbjct: 237 EL-RANESRLRDSIARAEREAKA-RAEREAREAARVRDK 273
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 33.6 bits (77), Expect = 0.17
Identities = 30/147 (20%), Positives = 55/147 (37%), Gaps = 11/147 (7%)
Query: 181 AEVKRDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYDAEVET 240
+ + E + + EEK + + ++ E + + E+E
Sbjct: 324 LKSLEERLEKLEEKLEKLESELEELAEEKNELA----KLLEERLKELEERLEELEKELE- 378
Query: 241 KRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPAEA 300
L Q + Q +KEE ++ +EIQ + +E+++ +ELE + E
Sbjct: 379 ----KALERLKQLEEAIQELKEELAEL--SAALEEIQEELEELEKELEELERELEELEEE 432
Query: 301 EKYRIEKLAQANKKRLMIEAEAEAEAI 327
K E++ Q K LMI A A
Sbjct: 433 IKKLEEQINQLESKELMIAELAGAGEK 459
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 33.6 bits (78), Expect = 0.20
Identities = 31/142 (21%), Positives = 62/142 (43%), Gaps = 11/142 (7%)
Query: 237 EVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVN- 295
E+E K EAE +A K K+ ++E++ K+Q E ++ E+E Q+ KE + +
Sbjct: 531 ELEQKAEEAEAL-LKEAEKLKEELEEKKEKLQEEEDK-LLEEAEKEAQQAIKEAKKEADE 588
Query: 296 -----RPAEAEKYRIEKLAQA-NKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVEIM 349
R + Y K + ++ + +A + E + K + + + + +
Sbjct: 589 IIKELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKVGDEVKYLSL 648
Query: 350 TKKAE--AWKDYREAAVIDMIL 369
+K E + D +EA V I+
Sbjct: 649 GQKGEVLSIPDDKEAIVQAGIM 670
Score = 31.3 bits (72), Expect = 1.1
Identities = 23/100 (23%), Positives = 49/100 (49%), Gaps = 13/100 (13%)
Query: 263 EQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAEA 322
E+++ ++ ++ +E + +E ++ K+ELE + EKL Q + +L+ EAE
Sbjct: 526 EELERELEQKAEEAEALLKEAEKLKEELEE-----------KKEKL-QEEEDKLLEEAEK 573
Query: 323 EAEAIRLKGEAEAAAIAAKARAEVEIMTKKAEAWKDYREA 362
EA+ + + EA I + R +++ + + EA
Sbjct: 574 EAQQAIKEAKKEADEIIKELR-QLQKGGYASVKAHELIEA 612
>gnl|CDD|224482 COG1566, EmrA, Multidrug resistance efflux pump [Defense
mechanisms].
Length = 352
Score = 32.7 bits (75), Expect = 0.29
Identities = 23/134 (17%), Positives = 45/134 (33%), Gaps = 10/134 (7%)
Query: 224 QRDFEIKKATYDAEVETKRAEAELAF----ELQAAKTKQRIKEEQMKIQVIERNQEIQVQ 279
RD+ +A + A+ QA + ++ +ER E+ Q
Sbjct: 86 PRDYRAALEQAEAALAAAEAQLRNLRAQLASAQALIAQAEAQDLDQAQNELERRAEL-AQ 144
Query: 280 EQEIQRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIA 339
+ R + + + + AEA + + N L E + A A A
Sbjct: 145 RGVVSREELDRARAALQAAEAALAAAQAAQKQNLALLESEVSGAQAQV-----ASAEAAL 199
Query: 340 AKARAEVEIMTKKA 353
+A+ ++E +A
Sbjct: 200 DQAKLDLERTVIRA 213
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 33.0 bits (75), Expect = 0.31
Identities = 23/165 (13%), Positives = 70/165 (42%), Gaps = 2/165 (1%)
Query: 193 EAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYDAEVETKRAEAELAFELQ 252
+ + ++ +E EEE +A + E K ++ E + E E ++E +
Sbjct: 251 QEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRK 310
Query: 253 AA--KTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPAEAEKYRIEKLAQ 310
+ + ++E K++ + ++ +++E E + ++ E++ E + ++++ +
Sbjct: 311 VDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLE 370
Query: 311 ANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVEIMTKKAEA 355
++ L+ + + E+E + + + + K E E +
Sbjct: 371 QLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKLLLELS 415
Score = 28.0 bits (62), Expect = 9.4
Identities = 27/172 (15%), Positives = 63/172 (36%), Gaps = 9/172 (5%)
Query: 183 VKRDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYDAEVETKR 242
+ E +++ E E + EI ++ E +
Sbjct: 230 LDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEEL 289
Query: 243 AEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPAEAEK 302
E + + + +++ + ++ ++++ E+E+++ +KEL+ E EK
Sbjct: 290 KLLAKEEE------ELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEK 343
Query: 303 YRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVEIMTKKAE 354
E K+ E E + E ++ K E + AK + E E ++ A+
Sbjct: 344 ---ELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAK 392
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 33.0 bits (76), Expect = 0.33
Identities = 18/90 (20%), Positives = 35/90 (38%), Gaps = 7/90 (7%)
Query: 236 AEVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELES--- 292
EV T + + E K + + E + +++ E++ Q + +LE
Sbjct: 149 QEVLTLKQQLE---LQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQE 205
Query: 293 -SVNRPAEAEKYRIEKLAQANKKRLMIEAE 321
+ E ++ R E QA K+ + E E
Sbjct: 206 KAAETSQERKQKRKEITDQAAKRLELSEEE 235
>gnl|CDD|206474 pfam14306, PUA_2, PUA-like domain. This PUA like domain is found
at the N-terminus of ATP-sulfurylase enzymes.
Length = 159
Score = 31.3 bits (72), Expect = 0.43
Identities = 13/49 (26%), Positives = 28/49 (57%), Gaps = 10/49 (20%)
Query: 102 ISEDEIQN------IALVTLEGHQRAIMGSMSVEDIYK-DRKKFSKQVF 143
+SE++ +AL EG + ++VE+IY+ D+++ +++VF
Sbjct: 84 VSEEDADGLKEGDRVALRDPEGE---PLAVLTVEEIYEPDKEEEAEKVF 129
>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
region. This family includes the N-terminal regions of
the junctin, junctate and aspartyl beta-hydroxylase
proteins. Junctate is an integral ER/SR membrane calcium
binding protein, which comes from an alternatively
spliced form of the same gene that generates aspartyl
beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
catalyzes the post-translational hydroxylation of
aspartic acid or asparagine residues contained within
epidermal growth factor (EGF) domains of proteins.
Length = 240
Score = 31.8 bits (72), Expect = 0.50
Identities = 27/120 (22%), Positives = 45/120 (37%), Gaps = 4/120 (3%)
Query: 239 ETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPA 298
E AE +L KTKQ+++EE +K Q+ ++I V +QE KE + ++
Sbjct: 80 EEAEPHAEEEGQLAVRKTKQKVEEE-VKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFL 138
Query: 299 EAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVEIMTKKAEAWKD 358
AE + K E E + +E K +A + E +
Sbjct: 139 LAEDSDDRQETLEAGKV---HEETEDSYHVEETASEQYKQDMKEKASEQENEDSKEPVEK 195
>gnl|CDD|239500 cd03406, Band_7_3, A subgroup of the band 7 domain of flotillin
(reggie) like proteins. This subgroup contains proteins
similar to stomatin, prohibitin, flotillin, HlfK/C and
podicin. Many of these band 7 domain-containing
proteins are lipid raft-associated. Individual proteins
of this band 7 domain family may cluster to form
membrane microdomains which may in turn recruit
multiprotein complexes. Microdomains formed from
flotillin proteins may in addition be dynamic units with
their own regulatory functions. Flotillins have been
implicated in signal transduction, vesicle trafficking,
cytoskeleton rearrangement and are known to interact
with a variety of proteins. Stomatin interacts with and
regulates members of the degenerin/epithelia Na+ channel
family in mechanosensory cells of Caenorhabditis elegans
and vertebrate neurons and participates in trafficking
of Glut1 glucose transporters. Prohibitin may act as a
chaperone for the stabilization of mitochondrial
proteins. Prokaryotic HflK/C plays a role in the
decision between lysogenic and lytic cycle growth during
lambda phage infection. Flotillins have been implicated
in the progression of prion disease, in the pathogenesis
of neurodegenerative diseases such as Parkinson's and
Alzheimer's disease and, in cancer invasion and
metastasis. Mutations in the podicin gene give rise to
autosomal recessive steroid resistant nephritic
syndrome.
Length = 280
Score = 31.7 bits (72), Expect = 0.61
Identities = 21/70 (30%), Positives = 35/70 (50%), Gaps = 5/70 (7%)
Query: 255 KTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVN-----RPAEAEKYRIEKLA 309
+ K+ + E + QV + +V E+E ++R E+E A+AE Y +K A
Sbjct: 192 ERKKAVIEAEKVAQVAKILFGQKVMEKETEKRISEIEDEAFLAREKAKADAEYYTAQKEA 251
Query: 310 QANKKRLMIE 319
+ANK +L E
Sbjct: 252 EANKLKLTPE 261
Score = 29.0 bits (65), Expect = 4.3
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 3/56 (5%)
Query: 299 EAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVEIMTKKAE 354
EAEK KL A +K+ ++E EAE E + EAE A AK ++M K+ E
Sbjct: 169 EAEK---TKLLIAIQKQKVVEKEAETERKKAVIEAEKVAQVAKILFGQKVMEKETE 221
Score = 28.2 bits (63), Expect = 7.8
Identities = 29/103 (28%), Positives = 50/103 (48%), Gaps = 16/103 (15%)
Query: 258 QRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPAEAEKYRIEKLAQANK---K 314
+ ++ E+ K+ + + Q++ +E E +R+K +E+ EK+AQ K
Sbjct: 166 ELMEAEKTKLLIAIQKQKVVEKEAETERKKAVIEA-------------EKVAQVAKILFG 212
Query: 315 RLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVEIMTKKAEAWK 357
+ ++E E E ++ EA A AKA AE K+AEA K
Sbjct: 213 QKVMEKETEKRISEIEDEAFLAREKAKADAEYYTAQKEAEANK 255
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 31.7 bits (73), Expect = 0.72
Identities = 28/170 (16%), Positives = 58/170 (34%), Gaps = 3/170 (1%)
Query: 172 LKSLGMARTAEVKRDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKK 231
LK LG+ + + + E EA+K+ +EA E + A +A+ +
Sbjct: 24 LKELGIEVKSH---SSTVEEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAA 80
Query: 232 ATYDAEVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELE 291
+A AEA A + K + + ++ +++ +RK
Sbjct: 81 PAEEAAEAAAAAEAAARPAEDEAARPAEAAARRPKAKKAAKKKKGPKPKKKKPKRKAARG 140
Query: 292 SSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAK 341
+ + + R + + KK+ E + + A +A K
Sbjct: 141 GKRGKGGKGRRRRRGRRRRRKKKKKQKPTEKIPREVVIPETITVAELAEK 190
>gnl|CDD|235040 PRK02463, PRK02463, OxaA-like protein precursor; Provisional.
Length = 307
Score = 31.2 bits (71), Expect = 0.76
Identities = 14/28 (50%), Positives = 19/28 (67%), Gaps = 4/28 (14%)
Query: 199 RIKEAMAEEEKMAARFVNDTEIAKAQRD 226
R+K A +EEKMAA+ TE+ AQR+
Sbjct: 106 RLKNATTQEEKMAAQ----TELMAAQRE 129
>gnl|CDD|225369 COG2812, DnaX, DNA polymerase III, gamma/tau subunits [DNA
replication, recombination, and repair].
Length = 515
Score = 31.2 bits (71), Expect = 0.84
Identities = 31/182 (17%), Positives = 59/182 (32%), Gaps = 24/182 (13%)
Query: 182 EVKRDARIGEAEAQKDARIKEAMAEEEKMAARFVND----TEIAKAQRDFEIKKATYDAE 237
+ + EA KEA+ ++ + ++ RD + K T E
Sbjct: 243 LTDIEKLLSLLEAILKGDAKEALRLINELIEEGKDPEAFLEDLLNFLRDLLLFKQTVPNE 302
Query: 238 VETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRP 297
++ E E + A++ +Q Q + KEL+ S++
Sbjct: 303 LQLNTTEIEERTKELASQIS---------------VLNLQRLYQLLLPGLKELKRSLSPR 347
Query: 298 AEAE--KYRIEKLAQA--NKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVEIMTKKA 353
E R+ + A A L+ A + + LK + A A + ++K
Sbjct: 348 LGLEMTLIRLLEAAPASDAASPLLQRAPRLEQELVLK-ANVSLAAAPAQSPQAPALSKNP 406
Query: 354 EA 355
E
Sbjct: 407 EL 408
>gnl|CDD|235026 PRK02292, PRK02292, V-type ATP synthase subunit E; Provisional.
Length = 188
Score = 30.4 bits (69), Expect = 0.97
Identities = 15/37 (40%), Positives = 21/37 (56%)
Query: 319 EAEAEAEAIRLKGEAEAAAIAAKARAEVEIMTKKAEA 355
EA A A IR + + EA I A+A A+ E + + EA
Sbjct: 13 EARARASEIRAEADEEAEEIIAEAEADAEEILEDREA 49
Score = 29.6 bits (67), Expect = 2.0
Identities = 16/43 (37%), Positives = 21/43 (48%)
Query: 305 IEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVE 347
I A+A + EA+ EAE I + EA+A I AE E
Sbjct: 10 IRDEARARASEIRAEADEEAEEIIAEAEADAEEILEDREAEAE 52
>gnl|CDD|180941 PRK07352, PRK07352, F0F1 ATP synthase subunit B; Validated.
Length = 174
Score = 29.9 bits (68), Expect = 1.3
Identities = 21/77 (27%), Positives = 39/77 (50%), Gaps = 4/77 (5%)
Query: 284 QRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAK-- 341
+ R++ + ++ E + + LA+A +K + +A+ EAE IR +A A AI A+
Sbjct: 49 EERREAILQALKEAEERLRQAAQALAEAQQK--LAQAQQEAERIRADAKARAEAIRAEIE 106
Query: 342 ARAEVEIMTKKAEAWKD 358
+A ++ K A D
Sbjct: 107 KQAIEDMARLKQTAAAD 123
>gnl|CDD|226127 COG3599, DivIVA, Cell division initiation protein [Cell division
and chromosome partitioning].
Length = 212
Score = 29.7 bits (67), Expect = 1.7
Identities = 20/95 (21%), Positives = 48/95 (50%)
Query: 268 QVIERNQEIQVQEQEIQRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAI 327
VI+ +++ + ++++ EL+ + A+AE + + A+ + L AEAEA+ I
Sbjct: 34 DVIDDYEQLLDENEDLEDEIDELKEELKEAADAEDSQAIQQAETEAEELKQAAEAEADDI 93
Query: 328 RLKGEAEAAAIAAKARAEVEIMTKKAEAWKDYREA 362
+ A+A + KA+ + + + A+ ++ +
Sbjct: 94 LKRASAQAQRVFGKAQYKADRYLEDAKIAQELEDL 128
>gnl|CDD|239498 cd03404, Band_7_HflK, Band_7_HflK: The band 7 domain of flotillin
(reggie) like proteins. This group includes proteins
similar to prokaryotic HlfK (High frequency of
lysogenization K). Although many members of the band 7
family are lipid raft associated, prokaryote plasma
membranes lack cholesterol and are unlikely to have
lipid raft domains. Individual proteins of this band 7
domain family may cluster to form membrane microdomains
which may in turn recruit multiprotein complexes.
Escherichia coli HflK is an integral membrane protein
which may localize to the plasma membrane. HflK
associates with another band 7 family member (HflC) to
form an HflKC complex. HflKC interacts with FtsH in a
large complex termed the FtsH holo-enzyme. FtsH is an
AAA ATP-dependent protease which exerts progressive
proteolysis against membrane-embedded and soluble
substrate proteins. HflKC can modulate the activity of
FtsH. HflKC plays a role in the decision between
lysogenic and lytic cycle growth during lambda phage
infection.
Length = 266
Score = 29.8 bits (68), Expect = 1.9
Identities = 21/77 (27%), Positives = 36/77 (46%), Gaps = 11/77 (14%)
Query: 309 AQANKKRLMIEAEAEA---------EAIRLKGEAEA--AAIAAKARAEVEIMTKKAEAWK 357
A+ +++RL+ EAEA A EA R+ EAEA + A+A+ E +K
Sbjct: 188 ARQDRERLINEAEAYANEVVPKARGEAARIIQEAEAYKEEVIAEAQGEAARFESLLAEYK 247
Query: 358 DYREAAVIDMILESLPK 374
+ + LE++ +
Sbjct: 248 KAPDVTRERLYLETMEE 264
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 29.2 bits (66), Expect = 2.1
Identities = 27/89 (30%), Positives = 40/89 (44%), Gaps = 9/89 (10%)
Query: 271 ERNQEIQVQEQEIQRRKKELESSVNRP----AEAEKYRIEKLAQANKKRLMIEAEAEAEA 326
ER +I E +R K+E ++ + EA + E + QA K EAE AE
Sbjct: 37 ERQAKIADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKK-----EAEQIAEE 91
Query: 327 IRLKGEAEAAAIAAKARAEVEIMTKKAEA 355
I+ + E E I A AE+E ++A
Sbjct: 92 IKAEAEEELERIKEAAEAEIEAEKERALE 120
>gnl|CDD|226055 COG3524, KpsE, Capsule polysaccharide export protein [Cell envelope
biogenesis, outer membrane].
Length = 372
Score = 29.8 bits (67), Expect = 2.3
Identities = 26/150 (17%), Positives = 56/150 (37%), Gaps = 8/150 (5%)
Query: 198 ARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYDAEVETKRAEAELAFELQAAKTK 257
AR EE++ + + D+ IK +D + + + + L +L+
Sbjct: 176 ARRDTVRFAEEEVQKAEERVKKASNDLTDYRIKNGVFDPKAQAE-VQMSLVSKLEDELIV 234
Query: 258 QRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPAEAEKYRIEKLAQANKKRLM 317
+ + + +K + N +I + I+ +K+L + + A +RL
Sbjct: 235 IQAQLDTVKSVMNPENPQIPGLKARIESLRKQLLQEKQAISAGGSSQSLSNQAAEFQRLY 294
Query: 318 IEAEAEAEAIRLKGEAEAAAIAAKARAEVE 347
+E +A AAA+ + A +E
Sbjct: 295 LENT-------FAEKAYAAALTSLESARIE 317
>gnl|CDD|181442 PRK08476, PRK08476, F0F1 ATP synthase subunit B'; Validated.
Length = 141
Score = 28.9 bits (65), Expect = 2.3
Identities = 16/48 (33%), Positives = 21/48 (43%), Gaps = 5/48 (10%)
Query: 314 KRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVEIMTKKAEAWKDYRE 361
+ ++ A EA IR + A AK AE +I KKAE Y
Sbjct: 65 ETILKNAREEANKIR-----QKAIAKAKEEAEKKIEAKKAELESKYEA 107
>gnl|CDD|211913 TIGR04190, B12_SAM_Ta0216, B12-binding domain/radical SAM domain
protein, Ta0216 family. Members of this family are
enzymes with an N-terminal B12-binding domain and
central radical SAM domain. Families TIGR03975,
TIGR04013 and TIGR04014 exhibit a similar architecture,
which may be associated with lipid metabolism.
Length = 553
Score = 30.1 bits (68), Expect = 2.3
Identities = 14/66 (21%), Positives = 34/66 (51%), Gaps = 2/66 (3%)
Query: 228 EIKKATYDAEVETKRAEAELAF--ELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQR 285
EI +ATY+A + + + E E + +RIK + I + ++ +++ E++
Sbjct: 464 EIAEATYEAAIRLNKLKYEYGLIDEEAYDEILERIKNARDGIPYLSDEKKSKIKLNELRY 523
Query: 286 RKKELE 291
++ ++E
Sbjct: 524 KEDDVE 529
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 29.9 bits (67), Expect = 2.4
Identities = 36/144 (25%), Positives = 62/144 (43%), Gaps = 3/144 (2%)
Query: 203 AMAEEEKMAARFVNDTEIAKAQRDF-EIKKATYDAEVETKRAEAELAFELQAAKTKQRIK 261
A+A++++ V + E AKA+ KA A + KR E E + A K +
Sbjct: 73 ALAKQKREGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAA 132
Query: 262 EE-QMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEA 320
+ K + + + +E + + + E + + A A K + LA+
Sbjct: 133 AAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGT 192
Query: 321 EAEAEAIRLKGEAEAAAIAAKARA 344
E E + K +A+AAA AAKA+A
Sbjct: 193 EEVTEEEKAKAKAKAAA-AAKAKA 215
Score = 29.6 bits (66), Expect = 3.1
Identities = 47/187 (25%), Positives = 75/187 (40%), Gaps = 6/187 (3%)
Query: 178 ARTAEVKRDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYDAE 237
AR A+ + R+ + ++++E E+EK + + T +R KA A
Sbjct: 15 ARRAKEEARKRLVAKHGAEISKLEEENREKEKALPKNDDMTIEEAKRRAAAAAKAKAAAL 74
Query: 238 VETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRP 297
+ KR E E + AK K + + + Q++E E E + +
Sbjct: 75 AKQKREGTEEVTEEEKAKAKAKAAAA-----AKAKAAALAKQKREGTEEVTEEEKAAAKA 129
Query: 298 AEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEA-AAIAAKARAEVEIMTKKAEAW 356
A + + A A +KR E E E K +A+A AA AAKA+A K AEA
Sbjct: 130 KAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAG 189
Query: 357 KDYREAA 363
+ E
Sbjct: 190 EGTEEVT 196
>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
Length = 343
Score = 29.5 bits (67), Expect = 2.4
Identities = 21/79 (26%), Positives = 37/79 (46%), Gaps = 10/79 (12%)
Query: 284 QRRKKELESSVN------RPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGE-AEAA 336
++ ++L+ SV AEAE+ R++ L A AE A L E
Sbjct: 73 RQGNQDLQDSVANLRASLSAAEAERSRLQALLAELAGA---GAAAEGRAGELAQELDSEK 129
Query: 337 AIAAKARAEVEIMTKKAEA 355
++A+A A+VE++ ++ A
Sbjct: 130 QVSARALAQVELLNQQIAA 148
>gnl|CDD|132364 TIGR03321, alt_F1F0_F0_B, alternate F1F0 ATPase, F0 subunit B. A
small number of taxonomically diverse prokaryotic
species, including Methanosarcina barkeri, have what
appears to be a second ATP synthase, in addition to the
normal F1F0 ATPase in bacteria and A1A0 ATPase in
archaea. These enzymes use ion gradients to synthesize
ATP, CC and in principle may run in either direction.
This model represents the F0 subunit B of this apparent
second ATP synthase.
Length = 246
Score = 29.2 bits (66), Expect = 2.8
Identities = 30/87 (34%), Positives = 46/87 (52%), Gaps = 19/87 (21%)
Query: 244 EAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPAEAEKY 303
E ++A EL A TK+R + EQ + + E+N+E+ Q+R+ L A+ E
Sbjct: 38 EKKIAGELADADTKKR-EAEQERREYEEKNEELD------QQREVLLTK-----AKEE-- 83
Query: 304 RIEKLAQANKKRLMIEAEAEAEAIRLK 330
AQA ++RL+ EA EA+ IR K
Sbjct: 84 -----AQAERQRLLDEAREEADEIREK 105
>gnl|CDD|191367 pfam05761, 5_nucleotid, 5' nucleotidase family. This family of
eukaryotic proteins includes 5' nucleotidase enzymes,
such as purine 5'-nucleotidase EC:3.1.3.5.
Length = 448
Score = 29.5 bits (67), Expect = 3.0
Identities = 9/56 (16%), Positives = 24/56 (42%), Gaps = 1/56 (1%)
Query: 260 IKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPAEAEKYRIEKLAQANKKR 315
+ E + +I++ Q + Q + + L ++ AE R + L++ + +
Sbjct: 323 VPELEREIEIWNTEQYRFEELQRLDILLERLYDHLDVHAELSTLRPD-LSELDAEI 377
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 29.4 bits (66), Expect = 3.2
Identities = 23/161 (14%), Positives = 53/161 (32%), Gaps = 1/161 (0%)
Query: 201 KEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYDAEVETKRAEAELAFELQAAKTKQRI 260
KEA + + + + + Q + K+ + K+ E L E+ + + +
Sbjct: 49 KEANEKRAQYRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQFELALQDEIAKLEALELL 108
Query: 261 KEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEA 320
E+ K + + ++ ++ ++ + E + + K K +E
Sbjct: 109 NLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEERLKFENEKKLEESLEL 168
Query: 321 EAEAEAIRLKGEAEAAAIAAKARAEVEIMTKKAEAWKDYRE 361
E E +L EA + + E + K E
Sbjct: 169 EREKFEEQLH-EANLDLEFKENEEQRESKWAILKKLKRRAE 208
>gnl|CDD|115307 pfam06637, PV-1, PV-1 protein (PLVAP). This family consists of
several PV-1 (PLVAP) proteins which seem to be specific
to mammals. PV-1 is a novel protein component of the
endothelial fenestral and stomatal diaphragms. The
function of this family is unknown.
Length = 442
Score = 29.6 bits (66), Expect = 3.3
Identities = 29/96 (30%), Positives = 47/96 (48%), Gaps = 4/96 (4%)
Query: 279 QEQEIQRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAI 338
+ ++QR+K E E + + A+ K + EK AQA + +L E A +L E +AA
Sbjct: 300 ENADLQRQKLEAEQAA-QAAQEAKAKAEKEAQAREAKLQAEC---ARQTQLALEEKAALR 355
Query: 339 AAKARAEVEIMTKKAEAWKDYREAAVIDMILESLPK 374
A + E+ KK EA + E A+ L++ K
Sbjct: 356 AERDNLAKELEAKKREAEQLRMELAIRISALDTCIK 391
>gnl|CDD|235999 PRK07353, PRK07353, F0F1 ATP synthase subunit B'; Validated.
Length = 140
Score = 28.4 bits (64), Expect = 3.4
Identities = 31/89 (34%), Positives = 43/89 (48%), Gaps = 11/89 (12%)
Query: 271 ERNQEIQVQEQEIQRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLK 330
ER I+ E + R E E +Y ++LA A K+ + AEAEAEA +L
Sbjct: 36 EREDYIRTNRAEAKERLAEAE------KLEAQYE-QQLASARKQAQAVIAEAEAEADKLA 88
Query: 331 GE----AEAAAIAAKARAEVEIMTKKAEA 355
E A+A A A+K +A EI +K A
Sbjct: 89 AEALAEAQAEAQASKEKARREIEQQKQAA 117
>gnl|CDD|237110 PRK12472, PRK12472, hypothetical protein; Provisional.
Length = 508
Score = 29.5 bits (66), Expect = 3.5
Identities = 24/75 (32%), Positives = 36/75 (48%), Gaps = 7/75 (9%)
Query: 296 RPAEAEKYRIE-KLAQANKKRLMI---EAEAEAEAIRLKGEAEAAAIAAK---ARAEVEI 348
R E K R + +L +A+K EA+A AE + K +AA A + A+A+ E
Sbjct: 228 RKLERAKARADAELKRADKALAAAKTDEAKARAEERQQKAAQQAAEAATQLDTAKADAEA 287
Query: 349 MTKKAEAWKDYREAA 363
A A K+ +AA
Sbjct: 288 KRAAAAATKEAAKAA 302
>gnl|CDD|112890 pfam04094, DUF390, Protein of unknown function (DUF390). This is a
family of long proteins currently only found in the rice
genome. They have no known function. However they may be
some kind of transposable element.
Length = 843
Score = 29.4 bits (65), Expect = 3.5
Identities = 45/198 (22%), Positives = 85/198 (42%), Gaps = 16/198 (8%)
Query: 166 RDDEGYLKSLGMARTAEVKRDARIGEAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQR 225
R +EG M E EA+K + A EE+ A + T + +AQ
Sbjct: 422 RVEEGRRSVEAMVEAGRKAHRRHTSELEARKKDLAEIAKEVEEEREAALIATTVLNEAQD 481
Query: 226 DFEIKKATYDAEVETKRAEAELAFELQAAKTKQ--------RIKEEQMKIQVIERNQEIQ 277
D ++ + AE+E K A+ + AA+ ++ R +EE ++ + + +
Sbjct: 482 DLRLQYGSRAAELEKKLDAAQGVLDAAAAREQRAAENEAASRQREEALEARAMALEERAC 541
Query: 278 VQEQEIQRRK-----KELESSVNRPAEAEK---YRIEKLAQANKKRLMIEAEAEAEAIRL 329
+E+++ R+ +E + + A AE+ R+ + A ++R + EAEA A+ +
Sbjct: 542 AKERDLADREAAVAIREATLAAHEAACAEEEFALRLREDALTERERALEEAEAAAQQLAD 601
Query: 330 KGEAEAAAIAAKARAEVE 347
AA +AR +E
Sbjct: 602 SLFLREAAQEEQARRNLE 619
Score = 28.6 bits (63), Expect = 7.0
Identities = 26/79 (32%), Positives = 33/79 (41%)
Query: 285 RRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARA 344
R K +S PA AE R E + RL EA +A R + EAA A
Sbjct: 218 RHSKSGQSEAEDPAAAEARRREADRREAADRLREAEEAAQDAARARQAEEAAREEAARAR 277
Query: 345 EVEIMTKKAEAWKDYREAA 363
+ E ++AEA EAA
Sbjct: 278 QAEEAAREAEAAFRADEAA 296
>gnl|CDD|188887 cd09488, SAM_EPH-R, SAM domain of EPH family of tyrosine kinase
receptors. SAM (sterile alpha motif) domain of EPH
(erythropoietin-producing hepatocyte) family of receptor
tyrosine kinases is a C-terminal signal transduction
module located in the cytoplasmic region of these
receptors. SAM appears to mediate cell-cell initiated
signal transduction via binding proteins to a conserved
tyrosine that is phosphorylated. In some cases the SAM
domain mediates homodimerization/oligomerization and
plays a role in the clustering process necessary for
signaling. EPH kinases are the largest family of
receptor tyrosine kinases. They are classified into two
groups based on their abilities to bind ephrin-A and
ephrin-B ligands. The EPH receptors are involved in
regulation of cell movement, shape, and attachment
during embryonic development; they control cell-cell
interactions in the vascular, nervous, epithelial, and
immune systems, and in many tumors. They are potential
molecular markers for cancer diagnostics and potential
targets for cancer therapy.
Length = 61
Score = 26.8 bits (60), Expect = 3.6
Identities = 8/27 (29%), Positives = 17/27 (62%), Gaps = 1/27 (3%)
Query: 101 HISEDEIQNIALVTLEGHQRAIMGSMS 127
++ +++ + VTL GHQ+ I+ S+
Sbjct: 34 QMTAEDLTRLG-VTLVGHQKKILNSIQ 59
>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407). This
family of proteins is found in bacteria. Proteins in
this family are typically between 366 and 597 amino
acids in length. There is a single completely conserved
residue R that may be functionally important.
Length = 297
Score = 29.1 bits (66), Expect = 3.8
Identities = 20/116 (17%), Positives = 45/116 (38%), Gaps = 18/116 (15%)
Query: 191 EAEAQKDARIKEAMAEE-EKMAARFV-----NDTEIAKAQRDFEIKKATYDAEVETKRAE 244
E + +E A ++AA F EIA Q + + +A +A + + E
Sbjct: 109 EIDTVLLEIQQEEQAAAQAQVAAGFRPKIAALTAEIAALQAEIDEAQAEVNAAYQEAQCE 168
Query: 245 AE------------LAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKK 288
AE + E + + + E +K ++ +++ Q+ ++R ++
Sbjct: 169 AEGTGGTGVAGKGPVYKEKREKLDAAQARLETLKARLDAAIAQLEAQKAALERNRQ 224
>gnl|CDD|180240 PRK05759, PRK05759, F0F1 ATP synthase subunit B; Validated.
Length = 156
Score = 28.2 bits (64), Expect = 4.0
Identities = 20/40 (50%), Positives = 26/40 (65%), Gaps = 5/40 (12%)
Query: 308 LAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVE 347
+ QA K+ I EA+AEA EAEAA I A+A+AE+E
Sbjct: 76 IEQAKKRAAQIIEEAKAEA-----EAEAARIKAQAQAEIE 110
>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
unknown].
Length = 984
Score = 29.4 bits (66), Expect = 4.2
Identities = 22/137 (16%), Positives = 49/137 (35%), Gaps = 9/137 (6%)
Query: 220 IAKAQRDFEIKKATYDAEVETKRAEAELAF-------ELQAAKTKQRIKEEQMKIQVIER 272
+ + Q FE + A +E + + F + K + R+ E + E
Sbjct: 639 LREEQAAFEERVEGLLAVLEAQFIDLSTLFCVQRLRVAAELQKEEARLALEGNIERTKEL 698
Query: 273 NQEIQVQEQEIQRRKKELESSVNRPAEAEKYRIEKLAQANKKRL--MIEAEAEAEAIRLK 330
N E++ + + ++ +L E + Q +R + EA+ E + +
Sbjct: 699 NDELRAELELHRKEILDLFDCGTADTEDAFREAAREEQQLTQRESRLESLEAQLEGVAAE 758
Query: 331 GEAEAAAIAAKARAEVE 347
+A++ + E E
Sbjct: 759 AYELSASLDQRELKEEE 775
>gnl|CDD|217022 pfam02413, Caudo_TAP, Caudovirales tail fibre assembly protein.
This family contains bacterial and phage tail fibre
assembly proteins. E.coli contains several members of
this family although the function of these proteins is
uncertain.
Length = 126
Score = 27.8 bits (62), Expect = 4.4
Identities = 16/66 (24%), Positives = 23/66 (34%), Gaps = 9/66 (13%)
Query: 297 PAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVEIMTKKAEAW 356
P K + A+ K+ L+ +A E I +A IA + AW
Sbjct: 51 PVWVTKEELVAQAEQEKQALLEQAS---EKIAPLQDAVDLGIATDE------EKAQLTAW 101
Query: 357 KDYREA 362
K YR
Sbjct: 102 KKYRVL 107
>gnl|CDD|236035 PRK07508, PRK07508, aminodeoxychorismate synthase; Provisional.
Length = 378
Score = 28.8 bits (65), Expect = 4.4
Identities = 15/47 (31%), Positives = 24/47 (51%), Gaps = 7/47 (14%)
Query: 166 RDDEGYLKSLGMARTAEVKRDARIGEAEAQKDARIKEAMAEEEKMAA 212
D EG++++ M TA R A +DAR++ A+ +EK A
Sbjct: 184 VDGEGWIETHPMKGTA--PRGA-----TPAEDARLRAALLNDEKNQA 223
>gnl|CDD|218434 pfam05103, DivIVA, DivIVA protein. The Bacillus subtilis divIVA1
mutation causes misplacement of the septum during cell
division, resulting in the formation of small, circular,
anucleate mini-cells. Inactivation of divIVA produces a
mini-cell phenotype, whereas overproduction of DivIVA
results in a filamentation phenotype. These proteins
appear to contain coiled-coils.
Length = 131
Score = 28.0 bits (63), Expect = 4.5
Identities = 23/87 (26%), Positives = 42/87 (48%), Gaps = 2/87 (2%)
Query: 263 EQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAEA 322
E + + E +EI+ E++++ K+ E+ A++ E A A K+ +I EA
Sbjct: 35 EALYKENEELKEEIERLEEKLEEYKELEETLQKTLVVAQETAEEVKANAQKEAELIIKEA 94
Query: 323 EAEAIRLKGEAEAAAIAAKARAEVEIM 349
EA+A R+ +A A K E + +
Sbjct: 95 EAKAERI--VNDANEEAKKLATEYDEL 119
>gnl|CDD|235903 PRK06991, PRK06991, ferredoxin; Provisional.
Length = 270
Score = 28.6 bits (64), Expect = 4.7
Identities = 31/119 (26%), Positives = 45/119 (37%), Gaps = 5/119 (4%)
Query: 187 ARIGEAEAQKDAR----IKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYDAEVETKR 242
A+ A A+ DAR +E A E + AAR A A+ A DAE KR
Sbjct: 150 AQADAARARHDARQARLRREREAAEARAAARAAASAAAAAAEASAAAAPAADDAE-AKKR 208
Query: 243 AEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPAEAE 301
A A E K ++ + + +Q Q + R+K L + P +A
Sbjct: 209 AIIAAALERARKKKEELAAQGAGPKNTEGVSAAVQAQIDAAEARRKRLAEQRDAPDDAN 267
>gnl|CDD|234172 TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family.
Members of this protein family are homologs of ClpB, an
ATPase associated with chaperone-related functions.
These ClpB homologs, designated ClpV1, are a key
component of the bacterial pathogenicity-associated type
VI secretion system [Protein fate, Protein and peptide
secretion and trafficking, Cellular processes,
Pathogenesis].
Length = 852
Score = 29.1 bits (66), Expect = 4.7
Identities = 23/107 (21%), Positives = 42/107 (39%), Gaps = 7/107 (6%)
Query: 240 TKRAEAELAFELQAAKTKQRI--KEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRP 297
T A +L + A + + +E + ER E++ + ++ LE+
Sbjct: 417 TPAALEDLRRRIAALELELDALEREAALGADHDERLAELRAELAALEAELAALEARW--- 473
Query: 298 AEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARA 344
+ EK +E + A + L +A+A A+ A AA A A
Sbjct: 474 -QQEKELVEAI-LALRAELEADADAPADDDAALRAQLAELEAALASA 518
>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
Length = 535
Score = 29.1 bits (65), Expect = 4.9
Identities = 36/152 (23%), Positives = 73/152 (48%), Gaps = 14/152 (9%)
Query: 222 KAQRDFEIKKATYDAEVETKRAEAELAFELQAAKTKQRIKEE---------QMKIQVIER 272
KA+RD E K T E + + E L + +A K ++ I++E Q++ ++ ER
Sbjct: 50 KAERDAEHIKKTAKRESKALKKELLLEAKEEARKYREEIEQEFKSERQELKQIESRLTER 109
Query: 273 NQEIQVQEQEIQRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGE 332
+ +++ + ++K LES + K+ E+ + ++L + +AE E + +
Sbjct: 110 ATSLDRKDENLSSKEKTLESKEQSLTDKSKHIDER--EEQVEKLEEQKKAELERVAALSQ 167
Query: 333 AEAAAI---AAKARAEVEIMTKKAEAWKDYRE 361
AEA I + + EI T+ EA ++ ++
Sbjct: 168 AEAREIILAETENKLTHEIATRIREAEREVKD 199
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 28.9 bits (65), Expect = 5.3
Identities = 23/116 (19%), Positives = 45/116 (38%), Gaps = 17/116 (14%)
Query: 200 IKEAMAEEEKMAARFVNDTEIA----KAQRDFEIKKATYDAEVETKRAEAELAFELQAAK 255
I+E + A F + EI K++ E + + + K A+ L +L K
Sbjct: 333 IEEGLKSVRL--ADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELK 390
Query: 256 TKQR-------IKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPAEAEKYR 304
E+ + IE +E ++E ++ +KK+ + + EK+R
Sbjct: 391 EAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRK----KKEWFEKFR 442
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 28.7 bits (64), Expect = 5.3
Identities = 41/175 (23%), Positives = 75/175 (42%), Gaps = 34/175 (19%)
Query: 177 MARTAEVKRDARIG-EAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYD 235
M + + + D+R+ E + ++ + K+ EEE+ AAR ++ + +K+A D
Sbjct: 481 MEKRLKSEADSRVNAEKQLAEEKKRKK---EEEETAARAAAQAAASREECAESLKQAKQD 537
Query: 236 AEVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVN 295
E+E K+ E +L ++KEE+ ++ E+E Q +K ES
Sbjct: 538 LEMEIKKLEHDL-----------KLKEEE-----------CRMLEKEAQELRKYQESEKE 575
Query: 296 RPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVEIMT 350
+ + L K LM+E AE RLK + +A + ++EI
Sbjct: 576 -----TEVLMSALQAMQDKNLMLENSLSAET-RLKLDLFSA--LGDVKRQLEIAH 622
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 28.7 bits (64), Expect = 5.5
Identities = 33/132 (25%), Positives = 47/132 (35%), Gaps = 29/132 (21%)
Query: 230 KKATYDAEVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQ--------EIQVQEQ 281
+K E + AE Q A+ +++ KE++ + + RN+
Sbjct: 275 RKYPLTDEFKEHHAE-------QRAQYREKQKEKKAFLWTLRRNRLRMIITPWRAPELHA 327
Query: 282 EIQRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAK 341
E KK R AE + + K A K+R AE E R EA AA
Sbjct: 328 ENAEIKKT------RTAEKNEAKARKKEIAQKRR-----AAEREINR---EARQERAAAM 373
Query: 342 ARAEVEIMTKKA 353
ARA KA
Sbjct: 374 ARARARRAAVKA 385
>gnl|CDD|115392 pfam06730, FAM92, FAM92 protein. This family of proteins has a
role in embryogenesis. During embryogenesis it is
essential for ectoderm and axial mesoderm development.
It may regulate cell proliferation and apoptosis.
Length = 219
Score = 28.4 bits (63), Expect = 5.6
Identities = 22/81 (27%), Positives = 39/81 (48%), Gaps = 1/81 (1%)
Query: 237 EVETKRAEAELAFELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQ-EIQRRKKELESSVN 295
+ E +R EA++ L+ T ++K + +K + RN+E + Q E R++ + V
Sbjct: 85 QAEVERLEAKVVEPLKQYGTIVKMKRDDLKATLTARNREAKQLRQLERTRQRNPSDRHVI 144
Query: 296 RPAEAEKYRIEKLAQANKKRL 316
AE E R + AQ + L
Sbjct: 145 SQAETELQRAKMDAQRTSRHL 165
>gnl|CDD|200948 pfam00038, Filament, Intermediate filament protein.
Length = 312
Score = 28.3 bits (64), Expect = 6.4
Identities = 28/96 (29%), Positives = 42/96 (43%), Gaps = 5/96 (5%)
Query: 231 KATYDAEVETKRAEAELAF-----ELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQR 285
+A Y+ E R EAE + ELQ A + K ++ E ++IQ E E+Q
Sbjct: 175 RAQYEELAEKNRQEAEEWYKSKLEELQQAAARNGDALRSAKEEITELRRQIQSLEIELQS 234
Query: 286 RKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAE 321
KK+ S + AE E+ +LA +E E
Sbjct: 235 LKKQKASLERQLAELEERYELELADYQDTISELEEE 270
>gnl|CDD|218769 pfam05833, FbpA, Fibronectin-binding protein A N-terminus (FbpA).
This family consists of the N-terminal region of the
prokaryotic fibronectin-binding protein. Fibronectin
binding is considered to be an important virulence
factor in streptococcal infections. Fibronectin is a
dimeric glycoprotein that is present in a soluble form
in plasma and extracellular fluids; it is also present
in a fibrillar form on cell surfaces. Both the soluble
and cellular forms of fibronectin may be incorporated
into the extracellular tissue matrix. While fibronectin
has critical roles in eukaryotic cellular processes,
such as adhesion, migration and differentiation, it is
also a substrate for the attachment of bacteria. The
binding of pathogenic Streptococcus pyogenes and
Staphylococcus aureus to epithelial cells via
fibronectin facilitates their internalisation and
systemic spread within the host.
Length = 447
Score = 28.4 bits (64), Expect = 6.9
Identities = 16/54 (29%), Positives = 26/54 (48%), Gaps = 2/54 (3%)
Query: 279 QEQEIQRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGE 332
++ + R K+ S + + E E ++EK + +K L E AE RL GE
Sbjct: 282 SKKAERDRVKQKRSDLEKRVENELEKLEKKLEKLEKEL--EEAENAENYRLYGE 333
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 28.7 bits (64), Expect = 7.0
Identities = 23/76 (30%), Positives = 37/76 (48%), Gaps = 11/76 (14%)
Query: 283 IQRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKA 342
++ +++ELE AE E+ QA ++R E +A EA R A+A A K
Sbjct: 251 LEEKRRELEKLAKEEAERER-------QAEEQRRREEEKAAMEADR----AQAKAEVEKR 299
Query: 343 RAEVEIMTKKAEAWKD 358
R +++ + KKA D
Sbjct: 300 REKLQNLLKKASRSAD 315
>gnl|CDD|184695 PRK14471, PRK14471, F0F1 ATP synthase subunit B; Provisional.
Length = 164
Score = 27.8 bits (62), Expect = 7.2
Identities = 22/57 (38%), Positives = 32/57 (56%), Gaps = 3/57 (5%)
Query: 191 EAEAQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEIKKATYDAEVETKRAEAEL 247
EA A++DA +KEA +EKM A + E A+ + D I++A E E A AE+
Sbjct: 71 EARAERDAILKEAREIKEKMIA---DAKEEAQVEGDKMIEQAKASIESEKNAAMAEI 124
>gnl|CDD|132829 cd07296, PX_PLD1, The phosphoinositide binding Phox Homology domain
of Phospholipase D1. The PX domain is a
phosphoinositide binding module present in many proteins
with diverse functions such as cell signaling, vesicular
trafficking, protein sorting, and lipid modification,
among others. Phospholipase D (PLD) catalyzes the
hydrolysis of the phosphodiester bond of
phosphatidylcholine to generate membrane-bound
phosphatidic acid and choline. PLDs are implicated in
many cellular functions like signaling, cytoskeletal
reorganization, vesicular transport, stress responses,
and the control of differentiation, proliferation, and
survival. PLD1 contains PX and Pleckstrin Homology (PH)
domains in addition to the catalytic domain. It acts as
an effector of Rheb in the signaling of the mammalian
target of rapamycin (mTOR), a serine/threonine protein
kinase that transduces nutrients and other stimuli to
regulate many cellular processes. PLD1 also regulates
the secretion of the procoagulant von Willebrand factor
(VWF) in endothelial cells. The PX domain is involved in
targeting of proteins to PI-enriched membranes, and may
also be involved in protein-protein interaction. The PX
domain of PLD1 specifically binds to
phosphatidylinositol-3,4,5-trisphosphate [PI(3,4,5)P3],
which enables PLD1 to mediate signals via the ERK1/2
pathway.
Length = 135
Score = 27.2 bits (60), Expect = 7.3
Identities = 13/49 (26%), Positives = 26/49 (53%), Gaps = 1/49 (2%)
Query: 257 KQRIK-EEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPAEAEKYR 304
+Q IK E + + R E + +E++ R+K+LE +++ + YR
Sbjct: 75 RQTIKRGEPRHMPSLPRGAEEEAREEQFSSRRKQLEDYLSKLLKMPMYR 123
>gnl|CDD|216833 pfam01991, vATP-synt_E, ATP synthase (E/31 kDa) subunit. This
family includes the vacuolar ATP synthase E subunit, as
well as the archaebacterial ATP synthase E subunit.
Length = 195
Score = 27.7 bits (62), Expect = 8.1
Identities = 13/29 (44%), Positives = 16/29 (55%)
Query: 319 EAEAEAEAIRLKGEAEAAAIAAKARAEVE 347
EAE +AE IR + E E A+A E E
Sbjct: 5 EAEEKAEEIRAEAEEEFEIEKAEAVEEAE 33
>gnl|CDD|237855 PRK14900, valS, valyl-tRNA synthetase; Provisional.
Length = 1052
Score = 28.4 bits (63), Expect = 8.5
Identities = 35/168 (20%), Positives = 64/168 (38%), Gaps = 8/168 (4%)
Query: 194 AQKDARIKEAMAEEEKMAA----RFVNDTEIAKAQRDFEIKKATYDAEVETKRAEAELAF 249
A + AR+ + + + ++ A + N + + A K E+ KR + E
Sbjct: 841 AAETARVDKEIGKVDQDLAVLERKLQNPSFVQNAPPAVVEKDRARAEELREKRGKLEAHR 900
Query: 250 ELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPAEAEKYRIEKLA 309
+ + + + M+IQ N++ Q+ + E++V AE + + A
Sbjct: 901 AMLSGSEANSARRDTMEIQ----NEQKPTQDGPAAEAQPAQENTVVESAEKAVAAVSEAA 956
Query: 310 QANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVEIMTKKAEAWK 357
Q + E AEA+R AA A + + KKA A K
Sbjct: 957 QQAATAVASGIEKVAEAVRKTVRRSVKKAAATRAAMKKKVAKKAPAKK 1004
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 28.0 bits (63), Expect = 8.6
Identities = 15/61 (24%), Positives = 33/61 (54%), Gaps = 1/61 (1%)
Query: 255 KTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPAEAEKYRIEKLAQANKK 314
KT++ +E+ +K ER +E Q +++E ++ ++E + + P E K EK + +
Sbjct: 263 KTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLE-EKERKKQAR 321
Query: 315 R 315
+
Sbjct: 322 K 322
>gnl|CDD|179673 PRK03910, PRK03910, D-cysteine desulfhydrase; Validated.
Length = 331
Score = 27.9 bits (63), Expect = 8.7
Identities = 13/37 (35%), Positives = 21/37 (56%)
Query: 293 SVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRL 329
+V+R A ++ ++ KLAQA + L + E IRL
Sbjct: 217 TVSRSAAEQEPKVAKLAQATAELLGLPTEIPRADIRL 253
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 28.1 bits (63), Expect = 8.9
Identities = 35/190 (18%), Positives = 71/190 (37%), Gaps = 14/190 (7%)
Query: 178 ARTAEVKRDARIGEAE-AQKDARIKEAMAEEEKMAARFVNDTEIAKAQRDFEI-----KK 231
+ + R+ AE + E + E + + + ++ +
Sbjct: 487 LEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERA 546
Query: 232 ATYDAEVETKRAEAELAFELQAAKTKQRIKEEQMK-------IQVIERNQEIQVQEQEIQ 284
A +AE E KR A A E +A + ++ + E K I+ +ER + + + +
Sbjct: 547 AELEAEAEEKREAAAEA-EEEAEEAREEVAELNSKLAELKERIESLERIRTLLAAIADAE 605
Query: 285 RRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARA 344
+ L AE R E+LA+ +++ +EAE + I E + A +
Sbjct: 606 DEIERLREKREALAELNDERRERLAEKRERKRELEAEFDEARIEEAREDKERAEEYLEQV 665
Query: 345 EVEIMTKKAE 354
E ++ + E
Sbjct: 666 EEKLDELREE 675
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 27.8 bits (62), Expect = 9.0
Identities = 17/66 (25%), Positives = 30/66 (45%), Gaps = 1/66 (1%)
Query: 250 ELQAAKTKQRIKEEQMKIQVIERNQEIQVQEQEIQRRKKELESSVNRPAEAEKYRIEKLA 309
+L+ K K +++ K VI++N+E +EQE E E + E+
Sbjct: 114 DLENTKKKIETYQKENK-DVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQ 172
Query: 310 QANKKR 315
Q NK++
Sbjct: 173 QMNKRK 178
>gnl|CDD|236446 PRK09279, PRK09279, pyruvate phosphate dikinase; Provisional.
Length = 879
Score = 28.2 bits (64), Expect = 9.0
Identities = 24/74 (32%), Positives = 31/74 (41%), Gaps = 17/74 (22%)
Query: 288 KELESSVNRPAEAEKYRIEKLAQANKK------RLMI---E-AEAEAEAIRLKGEAEAAA 337
+EL ++ E K R+E L + N RL I E E +A AI EAA
Sbjct: 637 EELAEALGLSLEELKARVEALHEFNPMLGHRGCRLGITYPEIYEMQARAI-----FEAAV 691
Query: 338 IAAKARAEV--EIM 349
K +V EIM
Sbjct: 692 ELKKEGIDVVPEIM 705
>gnl|CDD|235227 PRK04149, sat, sulfate adenylyltransferase; Reviewed.
Length = 391
Score = 27.9 bits (63), Expect = 9.1
Identities = 9/23 (39%), Positives = 16/23 (69%), Gaps = 1/23 (4%)
Query: 122 IMGSMSVEDIYK-DRKKFSKQVF 143
G + VE+IY D+KK +++V+
Sbjct: 109 PYGVLEVEEIYTYDKKKEAEKVY 131
>gnl|CDD|221283 pfam11875, DUF3395, Domain of unknown function (DUF3395). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 147 to 176 amino acids in length. This domain is
found associated with pfam00226.
Length = 144
Score = 27.3 bits (61), Expect = 9.1
Identities = 11/46 (23%), Positives = 24/46 (52%)
Query: 221 AKAQRDFEIKKATYDAEVETKRAEAELAFELQAAKTKQRIKEEQMK 266
A+ +++ E ++ E+ K+ EAE A L ++++ E+ K
Sbjct: 5 AQKKKEQERQREKNKTEIAKKKQEAESAVSLMGDVVERKLTREEEK 50
>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 557
Score = 27.9 bits (63), Expect = 9.1
Identities = 25/133 (18%), Positives = 59/133 (44%), Gaps = 12/133 (9%)
Query: 248 AFELQAAKTKQRIKE-----EQMKIQVIERNQEIQVQEQEIQR---RKKELESSVNRPAE 299
AF ++ +E +Q + ++ + ++ + + Q + +ELE +P E
Sbjct: 147 AFAGLEELAQEAYQEAYQAWKQARRELEDLQEKERERAQRADLLQFQLEELEELNLQPGE 206
Query: 300 AEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIAAKARAEVEIMTKKAEAWKDY 359
E+ E+ +N ++L EA A + L GE + + + +E + +E D
Sbjct: 207 DEELEEERKRLSNSEKLA-EAIQNALEL-LSGEDDTVSALSLLGRALEALEDLSEY--DG 262
Query: 360 REAAVIDMILESL 372
+ + + +++ E+L
Sbjct: 263 KLSELAELLEEAL 275
>gnl|CDD|227084 COG4741, COG4741, Predicted secreted endonuclease distantly related
to archaeal Holliday junction resolvase [Nucleotide
transport and metabolism].
Length = 175
Score = 27.4 bits (61), Expect = 9.3
Identities = 19/63 (30%), Positives = 31/63 (49%), Gaps = 4/63 (6%)
Query: 266 KIQVIERNQEIQVQEQEIQRRKKELESSVNRPAEAEKYRIEKLAQANKKRLMIEAEAEAE 325
I+ ++ E + +E E +K E E VN A+A K KL + +K+ EA +
Sbjct: 23 YIRSLQGKVESKARELEETLQKAERERLVNE-AQARKEEEWKLKEWIEKK---IEEARED 78
Query: 326 AIR 328
A+R
Sbjct: 79 AVR 81
>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
protein, PEP-CTERM locus subfamily. Members of this
protein family belong to the family of polysaccharide
chain length determinant proteins (pfam02706). All are
found in species that encode the PEP-CTERM/exosortase
system predicted to act in protein sorting in a number
of Gram-negative bacteria, and are found near the epsH
homolog that is the putative exosortase gene [Cell
envelope, Biosynthesis and degradation of surface
polysaccharides and lipopolysaccharides].
Length = 498
Score = 28.1 bits (63), Expect = 9.5
Identities = 21/79 (26%), Positives = 37/79 (46%), Gaps = 10/79 (12%)
Query: 191 EAEAQKDA-----RIKEAMAEEEKMAARFV--NDTEIAKAQRDFEIKKATYDAEVETKRA 243
EAEA+ + A E + R + + E+ + RD+E+ K+ Y+ + T+R
Sbjct: 321 EAEAEIASLEARVAELTARIERLESLLRTIPEVEAELTQLNRDYEVNKSNYEQLL-TRRE 379
Query: 244 EAELA--FELQAAKTKQRI 260
AE++ E+Q RI
Sbjct: 380 SAEVSKQMEVQDKAVSFRI 398
>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
ATPase domains [General function prediction only].
Length = 530
Score = 28.0 bits (63), Expect = 9.6
Identities = 18/103 (17%), Positives = 33/103 (32%), Gaps = 13/103 (12%)
Query: 239 ETKRAEAELAFELQAAKTKQRIKEEQMKIQ--VIERNQEIQVQEQEIQRRKKELESSVNR 296
++AE K ++ + +EQ I+ ++ + + R K LE R
Sbjct: 239 LEQKAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKA-----KSRIKRLEKLEAR 293
Query: 297 PAEAEKYRIEKLAQANKKRLMIEAEAEAEAIRLKGEAEAAAIA 339
AE E+ + K +L E E +
Sbjct: 294 LAE------ERPVEEGKPLAFRFPPPGKRLGKLVLEFENVSKG 330
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.311 0.126 0.326
Gapped
Lambda K H
0.267 0.0686 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 18,791,612
Number of extensions: 1889821
Number of successful extensions: 4296
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3092
Number of HSP's successfully gapped: 663
Length of query: 374
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 276
Effective length of database: 6,590,910
Effective search space: 1819091160
Effective search space used: 1819091160
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 60 (27.0 bits)