RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy7739
(142 letters)
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 37.5 bits (88), Expect = 0.001
Identities = 33/101 (32%), Positives = 55/101 (54%), Gaps = 7/101 (6%)
Query: 42 EAES-KEQALQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVTQGE--IDRMTKQD 98
EAE+ K++AL K E K + +E +++ EL+ + E+R+ Q E +DR +
Sbjct: 50 EAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQ----KLEKRLLQKEENLDRKLELL 105
Query: 99 EKYREEKQNKMMKLRQDMEESLKSREQMRQHEKLQNQELEK 139
EK EE + K +L Q +E K E++ + + Q QELE+
Sbjct: 106 EKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELER 146
Score = 26.3 bits (59), Expect = 7.3
Identities = 26/87 (29%), Positives = 48/87 (55%), Gaps = 12/87 (13%)
Query: 38 RQMLEAESKEQALQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVTQGEIDRMTKQ 97
R++ E +E+ L+++ K LEQ+ Q+EL+KK+E + + EEQ I +T +
Sbjct: 100 RKLELLEKREEELEKK----EKELEQK-QQELEKKEE-ELEELIEEQLQELERISGLTAE 153
Query: 98 D------EKYREEKQNKMMKLRQDMEE 118
+ EK EE +++ L +++EE
Sbjct: 154 EAKEILLEKVEEEARHEAAVLIKEIEE 180
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 36.3 bits (84), Expect = 0.003
Identities = 21/110 (19%), Positives = 51/110 (46%), Gaps = 9/110 (8%)
Query: 30 EQMYSATQRQMLEAESKEQALQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVTQG 89
E++ + + + E E+ L+ Q+ K LEQ+M+E ++++ K +E Q +
Sbjct: 518 EKLSALEKELEQKNEHLEKLLKEQEKLK-KELEQEMEELKERERNKKLELEKEAQEALKA 576
Query: 90 EIDRMTKQDEKYREEKQNKMMKLRQDMEESLKSREQMRQHEKLQNQELEK 139
+ + +E+K +K + +KS E + + ++ + + +K
Sbjct: 577 LKKEVESIIRELKEKKIHK--------AKEIKSIEDLVKLKETKQKIPQK 618
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 32.4 bits (73), Expect = 0.079
Identities = 25/111 (22%), Positives = 61/111 (54%)
Query: 29 QEQMYSATQRQMLEAESKEQALQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVTQ 88
+++M + ++ EA+ K + L++ + E +K + + +E +KKK + ++ +EE ++
Sbjct: 1604 EKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKA 1663
Query: 89 GEIDRMTKQDEKYREEKQNKMMKLRQDMEESLKSREQMRQHEKLQNQELEK 139
E + ++D+K EE + ++ E K E+ ++ E+L+ +E E+
Sbjct: 1664 AEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEE 1714
Score = 29.7 bits (66), Expect = 0.54
Identities = 21/94 (22%), Positives = 47/94 (50%)
Query: 42 EAESKEQALQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVTQGEIDRMTKQDEKY 101
E K + +++ E +KA E + + KKE + + EE + + E + ++ +K
Sbjct: 1665 EEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKA 1724
Query: 102 REEKQNKMMKLRQDMEESLKSREQMRQHEKLQNQ 135
EE + K + +++ EE K E+ ++ E+ + +
Sbjct: 1725 EEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKK 1758
Score = 28.6 bits (63), Expect = 1.3
Identities = 26/117 (22%), Positives = 56/117 (47%), Gaps = 11/117 (9%)
Query: 26 RYYQEQMYSATQRQMLEAESKEQALQRQKVEYR---KALEQQMQEELDKKKELKYRQYQE 82
+ +E+ Q + EAE K++A + +K E KA E+ + E DKKK + ++ +E
Sbjct: 1626 KKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEE 1685
Query: 83 EQRVTQGEIDRMTKQDEKYREEKQNKMMKLRQDMEESLKSREQMRQHEKLQNQELEK 139
+++ + + E+ K +L++ E K E++++ E+ + E+
Sbjct: 1686 DEKKAAEALKK--------EAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEE 1734
Score = 28.2 bits (62), Expect = 1.8
Identities = 24/110 (21%), Positives = 55/110 (50%), Gaps = 3/110 (2%)
Query: 30 EQMYSATQRQMLEAESKEQALQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVTQG 89
E+ A +++ EA+ K A +++ E +KA E + + E DKKK + ++ ++
Sbjct: 1363 EEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKAD- 1421
Query: 90 EIDRMTKQDEKYREEKQNKMMKLRQDMEESLKSREQMRQHEKLQNQELEK 139
+ K +EK + ++ K + + +E+ K E+ ++ E+ + + E
Sbjct: 1422 --EAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEA 1469
Score = 27.0 bits (59), Expect = 4.7
Identities = 23/103 (22%), Positives = 54/103 (52%), Gaps = 2/103 (1%)
Query: 37 QRQMLEAESKEQALQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVTQGEIDRMTK 96
+++ EA KE ++ E +K ++ ++ + KK + + + E+ + E D+
Sbjct: 1687 EKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKA 1746
Query: 97 QDEKYREEKQNKMMKLRQDMEESLKSREQMRQHEKLQNQELEK 139
++ K EE++ K+ L++ EE K+ E ++ E + +EL++
Sbjct: 1747 EEAKKDEEEKKKIAHLKK--EEEKKAEEIRKEKEAVIEEELDE 1787
Score = 26.6 bits (58), Expect = 6.3
Identities = 18/89 (20%), Positives = 39/89 (43%)
Query: 42 EAESKEQALQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVTQGEIDRMTKQDEKY 101
EA+ K A +++ E +KA E E E + + + E + E + K
Sbjct: 1326 EAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKK 1385
Query: 102 REEKQNKMMKLRQDMEESLKSREQMRQHE 130
+ E++ K + ++ EE K +++++
Sbjct: 1386 KAEEKKKADEAKKKAEEDKKKADELKKAA 1414
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 31.7 bits (73), Expect = 0.11
Identities = 19/120 (15%), Positives = 60/120 (50%), Gaps = 11/120 (9%)
Query: 30 EQMYSATQRQMLEAESKEQALQ--RQKVE-YRKALEQQMQEELDKKKELKYRQ---YQEE 83
++ ++ + E E K + + ++ E ++ LE++ ++ +++ +L Q+
Sbjct: 519 NELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQA 578
Query: 84 QRVTQGEIDRMTK-----QDEKYREEKQNKMMKLRQDMEESLKSREQMRQHEKLQNQELE 138
+ + E D + K Q Y K +++++ R+ + ++ + +E+ ++ +K + +EL+
Sbjct: 579 IKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELK 638
Score = 31.0 bits (71), Expect = 0.18
Identities = 20/91 (21%), Positives = 44/91 (48%), Gaps = 1/91 (1%)
Query: 29 QEQMYSATQRQMLE-AESKEQALQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVT 87
E++ + + + E +++ L+ + E ++A+++ +E + KEL+ Q V
Sbjct: 546 AEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVK 605
Query: 88 QGEIDRMTKQDEKYREEKQNKMMKLRQDMEE 118
E+ K+ K E+K+ K K ++ EE
Sbjct: 606 AHELIEARKRLNKANEKKEKKKKKQKEKQEE 636
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 31.0 bits (71), Expect = 0.15
Identities = 27/101 (26%), Positives = 51/101 (50%), Gaps = 7/101 (6%)
Query: 42 EAES-KEQALQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVTQGE--IDRMTKQD 98
EAE+ K++AL K E K + +E +++ EL+ + E+R+ Q E +DR +
Sbjct: 44 EAETLKKEALLEAKEEVHKLRAELERELKERRNELQ----RLERRLLQREETLDRKMESL 99
Query: 99 EKYREEKQNKMMKLRQDMEESLKSREQMRQHEKLQNQELEK 139
+K E + K +L + + E++ + Q +ELE+
Sbjct: 100 DKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELER 140
Score = 27.6 bits (62), Expect = 2.7
Identities = 22/82 (26%), Positives = 40/82 (48%), Gaps = 4/82 (4%)
Query: 37 QRQMLEAESKEQALQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVTQGEIDRMTK 96
R LE E KE+ + Q++E R Q +E LD+K E + +E + E+ K
Sbjct: 63 LRAELERELKERRNELQRLERRL---LQREETLDRKMES-LDKKEENLEKKEKELSNKEK 118
Query: 97 QDEKYREEKQNKMMKLRQDMEE 118
++ EE + + + R+++E
Sbjct: 119 NLDEKEEELEELIAEQREELER 140
Score = 26.8 bits (60), Expect = 4.9
Identities = 18/91 (19%), Positives = 42/91 (46%), Gaps = 10/91 (10%)
Query: 59 KALEQQMQEELDKKKELKYRQYQEEQRVTQGEIDRMTKQDEK----------YREEKQNK 108
K + ++ ++E + K+ + +EE + E++R K+ REE ++
Sbjct: 35 KRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDR 94
Query: 109 MMKLRQDMEESLKSREQMRQHEKLQNQELEK 139
M+ EE+L+ +E+ +++ E E+
Sbjct: 95 KMESLDKKEENLEKKEKELSNKEKNLDEKEE 125
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 31.2 bits (71), Expect = 0.19
Identities = 25/107 (23%), Positives = 47/107 (43%), Gaps = 6/107 (5%)
Query: 33 YSATQRQMLEAESKEQALQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVTQGEID 92
Q ++ EAE + + L + E + LE+ E + ++E+ + Q+E EI
Sbjct: 241 LEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEI--EELQKELYALANEIS 298
Query: 93 RMTKQDEKYREEKQNKMMKLRQDMEESLKSREQMRQHEKLQNQELEK 139
R+ +Q + RE N L + +EE E++ +EL +
Sbjct: 299 RLEQQKQILRERLAN----LERQLEELEAQLEELESKLDELAEELAE 341
>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
Length = 1021
Score = 31.2 bits (70), Expect = 0.19
Identities = 17/89 (19%), Positives = 46/89 (51%), Gaps = 2/89 (2%)
Query: 51 QRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVTQGEIDRMTKQDEKYREEKQNKMM 110
+R ++E A + ++ ++ +KK ++ + +E +R+ + ++R+ + E+ E+ +
Sbjct: 436 ERARIEKENAHRKALEMKILEKKRIERLEREERERLERERMERIER--ERLERERLERER 493
Query: 111 KLRQDMEESLKSREQMRQHEKLQNQELEK 139
R +E R + + ++L+ LEK
Sbjct: 494 LERDRLERDRLDRLERERVDRLERDRLEK 522
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 30.8 bits (70), Expect = 0.21
Identities = 13/102 (12%), Positives = 49/102 (48%), Gaps = 8/102 (7%)
Query: 40 MLEAESKEQALQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVTQGEIDRMTKQDE 99
+ A+ + L ++ E + E++++ L +K+E + +EE + R+ ++
Sbjct: 165 IASAKEELDQLSKKLAELKAEEEEELERALKEKREELLSKLEEE------LLARLESKEA 218
Query: 100 KYREEKQNKMMKLRQDMEESLKS--REQMRQHEKLQNQELEK 139
++ + + + ++++ + + R+++ + + Q+L+
Sbjct: 219 ALEKQLRLEFEREKEELRKKYEEKLRQELERQAEAHEQKLKN 260
Score = 26.1 bits (58), Expect = 8.8
Identities = 22/93 (23%), Positives = 48/93 (51%), Gaps = 8/93 (8%)
Query: 42 EAESKEQALQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVTQGEIDRMTKQD--E 99
+ +R+K E RK E+++++EL+++ E ++ + E + E+ R ++ E
Sbjct: 220 LEKQLRLEFEREKEELRKKYEEKLRQELERQAEAHEQKLKNELALQAIELQREFNKEIKE 279
Query: 100 KYREEKQNKMMKLR------QDMEESLKSREQM 126
K EE+ ++ KL + +E++L SR +
Sbjct: 280 KVEEERNGRLAKLAELNSRLKGLEKALDSRSEA 312
>gnl|CDD|233816 TIGR02302, aProt_lowcomp, TIGR02302 family protein. Members of
this family are long (~850 residue) bacterial proteins
from the alpha Proteobacteria. Each has 2-3 predicted
transmembrane helices near the N-terminus and a long
C-terminal region that includes stretches of
Gln/Gly-rich low complexity sequence, predicted by TMHMM
to be outside the membrane. In Bradyrhizobium japonicum,
two tandem reading frames are together homologous the
single members found in other species; the cutoffs
scores are set low enough that the longer scores above
the trusted cutoff and the shorter above the noise
cutoff for this model.
Length = 851
Score = 30.7 bits (69), Expect = 0.22
Identities = 27/128 (21%), Positives = 50/128 (39%), Gaps = 13/128 (10%)
Query: 25 DRYYQEQMYSATQRQMLEAESKEQALQRQKVEYRKALEQQM----------QEELDKKKE 74
DR +Q+ S Q+ M + + +Q + +EQQM Q+ D+ +
Sbjct: 585 DRDQAKQLLSQLQQMMNNLQMGQPGQGQQMGDQSGDMEQQMNKLGELMRKQQQLRDETFK 644
Query: 75 LKYRQYQEEQRVTQGEI---DRMTKQDEKYREEKQNKMMKLRQDMEESLKSREQMRQHEK 131
L Q ++ R GE D M +QD + + + EE+ + Q + K
Sbjct: 645 LDQDQRRDRMRGQDGEQNFGDDMPQQDGQPNGQPNLHDRLRKLQQEEAKRGLGQSQGGLK 704
Query: 132 LQNQELEK 139
+ +L +
Sbjct: 705 GELGQLGQ 712
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 30.0 bits (68), Expect = 0.40
Identities = 13/65 (20%), Positives = 32/65 (49%), Gaps = 1/65 (1%)
Query: 73 KELKYRQYQEEQRVTQGEIDRMTKQDEKYREEKQNKMMKLRQDMEESLKSRE-QMRQHEK 131
KE+ + EE+R + + ++ EK EE + +++ R+++ L+ + ++ EK
Sbjct: 304 KEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEK 363
Query: 132 LQNQE 136
+
Sbjct: 364 RLEKL 368
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 29.8 bits (67), Expect = 0.45
Identities = 21/75 (28%), Positives = 44/75 (58%)
Query: 50 LQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVTQGEIDRMTKQDEKYREEKQNKM 109
Q+Q+ ++A EQ+ ++E + +EL+ +Q E++R+ Q E +R+ Q++K + E+ K
Sbjct: 67 QQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQ 126
Query: 110 MKLRQDMEESLKSRE 124
L+Q E ++
Sbjct: 127 AALKQKQAEEAAAKA 141
>gnl|CDD|240227 PTZ00009, PTZ00009, heat shock 70 kDa protein; Provisional.
Length = 653
Score = 29.4 bits (66), Expect = 0.65
Identities = 10/28 (35%), Positives = 20/28 (71%), Gaps = 1/28 (3%)
Query: 82 EEQRVTQGEIDRMTKQDEKYREE-KQNK 108
++ R+++ +IDRM + EKY+ E + N+
Sbjct: 508 DKGRLSKADIDRMVNEAEKYKAEDEANR 535
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 29.3 bits (66), Expect = 0.69
Identities = 19/96 (19%), Positives = 47/96 (48%), Gaps = 4/96 (4%)
Query: 42 EAESKEQALQRQKVEYRKALEQQMQEELDKKKELKYR----QYQEEQRVTQGEIDRMTKQ 97
EA SK + +R + + E++ + + +K+ K + +EE + E++ + ++
Sbjct: 392 EALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKRE 451
Query: 98 DEKYREEKQNKMMKLRQDMEESLKSREQMRQHEKLQ 133
EK E + ++R + + + R + R+ E+L+
Sbjct: 452 IEKLESELERFRREVRDKVRKDREIRARDRRIERLE 487
>gnl|CDD|206172 pfam14002, YniB, YniB-like protein. The YniB-like protein family
includes the E. coli YniB protein, which is functionally
uncharacterized. This family of proteins is found in
bacteria. Proteins in this family are approximately 180
amino acids in length. This family of proteins are
integral membrane proteins.
Length = 166
Score = 28.8 bits (65), Expect = 0.81
Identities = 11/38 (28%), Positives = 20/38 (52%), Gaps = 1/38 (2%)
Query: 50 LQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVT 87
+ RQ R+ +E Q+ E K E + R+ Q E+++
Sbjct: 97 MSRQVKFIREGIEDQLILEKAKGVEGRTRE-QLEEKIV 133
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 28.9 bits (65), Expect = 1.0
Identities = 16/89 (17%), Positives = 46/89 (51%), Gaps = 8/89 (8%)
Query: 59 KALEQQMQEELDKKKELKY------RQYQEEQRVTQGEIDRMTKQDEKYREEKQNKMMKL 112
K E+ M++ +K K LKY + + ++ + ++ + E+ E+ ++ +
Sbjct: 273 KTHEKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLK-RKLKSKFERDNEKLDAEVKEK 331
Query: 113 RQDMEESLKSREQM-RQHEKLQNQELEKT 140
+++ ++ K ++Q+ R E+++ E++ T
Sbjct: 332 KKEKKKEEKKKKQIERLEERIEKLEVQAT 360
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 28.7 bits (65), Expect = 1.0
Identities = 18/81 (22%), Positives = 41/81 (50%), Gaps = 10/81 (12%)
Query: 34 SATQRQMLEAESKEQALQRQKVEYRKALEQ---QMQEELDKKKEL---KYRQYQEEQRVT 87
+QR +L+A EQAL + + ++ + Q+Q+ L K+++ + ++ + + ++
Sbjct: 190 RPSQRVLLQA---EQALLNAQNDLQRKSLEGNTQLQDLLQKQRDYLTARIQRLEHQLQLL 246
Query: 88 QGEI-DRMTKQDEKYREEKQN 107
Q I + EK +E Q+
Sbjct: 247 QEAINSKRLTLSEKTVQEAQS 267
>gnl|CDD|178572 PLN02992, PLN02992, coniferyl-alcohol glucosyltransferase.
Length = 481
Score = 28.8 bits (64), Expect = 1.1
Identities = 19/74 (25%), Positives = 34/74 (45%), Gaps = 3/74 (4%)
Query: 62 EQQMQEEL-DKKKELKYRQYQEEQRVTQGEIDRMTKQ--DEKYREEKQNKMMKLRQDMEE 118
EQ M L + + R ++ +++ +I+ + ++ E+ EE + K+ KLR E
Sbjct: 387 EQNMNAALLSDELGIAVRSDDPKEVISRSKIEALVRKVMVEEEGEEMRRKVKKLRDTAEM 446
Query: 119 SLKSREQMRQHEKL 132
SL HE L
Sbjct: 447 SLSIDGGGVAHESL 460
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 28.9 bits (65), Expect = 1.1
Identities = 26/111 (23%), Positives = 55/111 (49%), Gaps = 1/111 (0%)
Query: 30 EQMYSATQRQMLEAESKEQALQRQKVEYRKALEQQMQEELDKKKEL-KYRQYQEEQRVTQ 88
E+ Q ++ EAE + + L+ + E R+ LE+ +E L+ K+E+ + R
Sbjct: 252 EEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERL 311
Query: 89 GEIDRMTKQDEKYREEKQNKMMKLRQDMEESLKSREQMRQHEKLQNQELEK 139
E++ ++ E+ EE + K+ L++++EE E++ Q + E+
Sbjct: 312 EELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEE 362
>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
envelope biogenesis, outer membrane].
Length = 835
Score = 28.5 bits (64), Expect = 1.2
Identities = 15/113 (13%), Positives = 37/113 (32%), Gaps = 11/113 (9%)
Query: 39 QMLEAESKEQALQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQR----VTQGEIDRM 94
L + Q L + + E A Q++ L +E + QEE + + + +
Sbjct: 1 LALAGNNVLQELLQSRRELLTAESAQLEAALQLLQEAVNSKRQEEAEPAAEEAELQAELI 60
Query: 95 TKQDEKYRE-----EKQNKMMKLRQDMEESLKSREQMRQHEKLQNQELEKTPW 142
++ + +Q + +L + + + Q + E+
Sbjct: 61 QQELAINDQLSQALNQQTE--RLNALASDDRQLANLLLQLLQSSRTIREQIAV 111
>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 557
Score = 28.3 bits (64), Expect = 1.3
Identities = 20/110 (18%), Positives = 48/110 (43%), Gaps = 16/110 (14%)
Query: 29 QEQMYSATQRQMLEAESKEQALQRQKVEYR----KALEQQMQEELDKKKELKYR----QY 80
Q + QRQ+L+A + + L ++ + K +++++ +K++E R Q+
Sbjct: 133 QSLLKPELQRQLLDAFAGLEELAQEAYQEAYQAWKQARRELEDLQEKERERAQRADLLQF 192
Query: 81 QEEQRVTQGEIDRMTKQDEKYRE--EKQNKMMKLRQDMEESLKSREQMRQ 128
Q E E++ + Q + E E++ ++ + E + E +
Sbjct: 193 QLE------ELEELNLQPGEDEELEEERKRLSNSEKLAEAIQNALELLSG 236
>gnl|CDD|169719 PRK09212, PRK09212, pyruvate dehydrogenase subunit beta;
Validated.
Length = 327
Score = 28.5 bits (64), Expect = 1.3
Identities = 13/45 (28%), Positives = 27/45 (60%), Gaps = 5/45 (11%)
Query: 54 KVEYRKALEQQMQEELDKKKEL-----KYRQYQEEQRVTQGEIDR 93
++ R+AL MQEE+++ ++ + +YQ +VTQG +++
Sbjct: 3 QLTVREALRDAMQEEMERDPKVFLMGEEVGEYQGAYKVTQGLLEQ 47
>gnl|CDD|216066 pfam00695, vMSA, Major surface antigen from hepadnavirus.
Length = 364
Score = 28.0 bits (62), Expect = 1.7
Identities = 15/74 (20%), Positives = 25/74 (33%)
Query: 10 PKTSKTCSRNMFTIIDRYYQEQMYSATQRQMLEAESKEQALQRQKVEYRKALEQQMQEEL 69
P + + I+D Q + TQ + L ++ + E
Sbjct: 30 PVGAFGKRPTIPHILDHSPQVEGIRTTQSGIPPPAGTSNRLGLRQPTPITPPLRDWHPEE 89
Query: 70 DKKKELKYRQYQEE 83
D+K + QYQEE
Sbjct: 90 DQKNSTAFHQYQEE 103
>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
HlyD family. Type I secretion is an ABC transport
process that exports proteins, without cleavage of any
signal sequence, from the cytosol to extracellular
medium across both inner and outer membranes. The
secretion signal is found in the C-terminus of the
transported protein. This model represents the adaptor
protein between the ATP-binding cassette (ABC) protein
of the inner membrane and the outer membrane protein,
and is called the membrane fusion protein. This model
selects a subfamily closely related to HlyD; it is
defined narrowly and excludes, for example, colicin V
secretion protein CvaA and multidrug efflux proteins
[Protein fate, Protein and peptide secretion and
trafficking].
Length = 423
Score = 28.1 bits (63), Expect = 1.8
Identities = 15/79 (18%), Positives = 32/79 (40%), Gaps = 8/79 (10%)
Query: 29 QEQMYSATQRQMLEAESKEQALQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVTQ 88
E+ + Q ++ E++ + L+RQ E Q E + ++ + EE Q
Sbjct: 201 LERERAEAQGELGRLEAELEVLKRQIDEL--------QLERQQIEQTFREEVLEELTEAQ 252
Query: 89 GEIDRMTKQDEKYREEKQN 107
+ + ++ K R+ Q
Sbjct: 253 ARLAELRERLNKARDRLQR 271
Score = 27.3 bits (61), Expect = 3.2
Identities = 27/115 (23%), Positives = 52/115 (45%), Gaps = 14/115 (12%)
Query: 38 RQMLEAESKEQALQRQKVEYRKALEQQMQEELDKKKELKYRQY---------QEEQRVTQ 88
Q+ + E++ LQ Q R+ LE + EEL+ +++LK + + E+ Q
Sbjct: 151 AQIKQLEAELAGLQAQLQALRQQLE-VISEELEARRKLKEKGLVSRLELLELERERAEAQ 209
Query: 89 GEIDRMTKQDEKYR---EEKQNKMMKLRQD-MEESLKSREQMRQHEKLQNQELEK 139
GE+ R+ + E + +E Q + ++ Q EE L+ + + + L K
Sbjct: 210 GELGRLEAELEVLKRQIDELQLERQQIEQTFREEVLEELTEAQARLAELRERLNK 264
>gnl|CDD|235012 PRK02201, PRK02201, putative inner membrane protein translocase
component YidC; Provisional.
Length = 357
Score = 28.1 bits (63), Expect = 1.8
Identities = 13/37 (35%), Positives = 21/37 (56%), Gaps = 6/37 (16%)
Query: 81 QEEQRVTQGEIDRMTKQDEKYREEKQNKMMKLRQDME 117
QEE + + +ID KY++ K++K MK R+ E
Sbjct: 162 QEELQGKKAKID------AKYKDYKKDKQMKQRKQQE 192
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 27.6 bits (61), Expect = 2.2
Identities = 22/84 (26%), Positives = 45/84 (53%)
Query: 48 QALQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVTQGEIDRMTKQDEKYREEKQN 107
+Q Q+ +K +Q+ ++E +ELK +Q E++R+ Q E +R+ Q+++ + E+
Sbjct: 65 GRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAE 124
Query: 108 KMMKLRQDMEESLKSREQMRQHEK 131
K +L Q +E + Q +K
Sbjct: 125 KQAQLEQKQQEEQARKAAAEQKKK 148
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 27.9 bits (62), Expect = 2.3
Identities = 10/44 (22%), Positives = 30/44 (68%)
Query: 82 EEQRVTQGEIDRMTKQDEKYREEKQNKMMKLRQDMEESLKSREQ 125
E++ +T+ E++R K++EK +E++ K+ +++ + L++++
Sbjct: 9 EKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQA 52
Score = 27.6 bits (61), Expect = 2.8
Identities = 16/74 (21%), Positives = 35/74 (47%)
Query: 34 SATQRQMLEAESKEQALQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVTQGEIDR 93
S T+ + + E+ L+R+K + KA E+++++ +KE K + ++ +
Sbjct: 2 SRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKK 61
Query: 94 MTKQDEKYREEKQN 107
K+ K E +N
Sbjct: 62 SEKKSRKRDVEDEN 75
>gnl|CDD|218769 pfam05833, FbpA, Fibronectin-binding protein A N-terminus (FbpA).
This family consists of the N-terminal region of the
prokaryotic fibronectin-binding protein. Fibronectin
binding is considered to be an important virulence
factor in streptococcal infections. Fibronectin is a
dimeric glycoprotein that is present in a soluble form
in plasma and extracellular fluids; it is also present
in a fibrillar form on cell surfaces. Both the soluble
and cellular forms of fibronectin may be incorporated
into the extracellular tissue matrix. While fibronectin
has critical roles in eukaryotic cellular processes,
such as adhesion, migration and differentiation, it is
also a substrate for the attachment of bacteria. The
binding of pathogenic Streptococcus pyogenes and
Staphylococcus aureus to epithelial cells via
fibronectin facilitates their internalisation and
systemic spread within the host.
Length = 447
Score = 27.6 bits (62), Expect = 2.8
Identities = 13/59 (22%), Positives = 27/59 (45%), Gaps = 2/59 (3%)
Query: 24 IDRYYQEQMYSATQRQMLEAESKEQALQRQKVEYRKALEQQMQEELDKKKELKYRQYQE 82
+D YY ++ + R + E+ ++ + + K LE+ +E + + YR Y E
Sbjct: 277 LDEYYSKK--AERDRVKQKRSDLEKRVENELEKLEKKLEKLEKELEEAENAENYRLYGE 333
>gnl|CDD|226654 COG4191, COG4191, Signal transduction histidine kinase regulating
C4-dicarboxylate transport system [Signal transduction
mechanisms].
Length = 603
Score = 27.7 bits (62), Expect = 2.8
Identities = 13/68 (19%), Positives = 29/68 (42%), Gaps = 9/68 (13%)
Query: 50 LQRQKVEY-RKALEQQMQEELDKKKELKYRQYQEEQRVTQGEIDRMTKQDEKYREEKQNK 108
L+R++ R A Q+ + EL+++ E + Q EI RE+ +
Sbjct: 316 LRRRRRARLRLAELQEARAELERRVEERTADLTRANARLQAEIAE--------REQAEAA 367
Query: 109 MMKLRQDM 116
+ + + ++
Sbjct: 368 LRRAQDEL 375
>gnl|CDD|221084 pfam11336, DUF3138, Protein of unknown function (DUF3138). This
family of proteins with unknown function appear to be
restricted to Proteobacteria.
Length = 514
Score = 27.6 bits (61), Expect = 3.0
Identities = 9/33 (27%), Positives = 17/33 (51%)
Query: 32 MYSATQRQMLEAESKEQALQRQKVEYRKALEQQ 64
++ Q+ +++ ALQ+Q E R AL +
Sbjct: 19 AAASDASQIKALQAQLTALQQQVNELRAALAAK 51
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 27.1 bits (60), Expect = 3.6
Identities = 23/102 (22%), Positives = 44/102 (43%), Gaps = 3/102 (2%)
Query: 39 QMLEAESKEQALQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVTQGEIDRMTKQD 98
S +AL + E +K E+ + E + +KE E Q E +++ ++
Sbjct: 57 SQNVEISNYEALDSELDELKKEEERLLDELEELEKEDD--DLDGELVELQEEKEQLENEE 114
Query: 99 EKYREEKQNKMMKLRQDMEESLKSREQMRQHEKLQNQELEKT 140
+Y E N + +E++L+S E ++ Q +L KT
Sbjct: 115 LQYLRE-YNLFDRNNLQLEDNLQSLELQYEYSLNQLDKLRKT 155
Score = 26.0 bits (57), Expect = 8.9
Identities = 15/80 (18%), Positives = 35/80 (43%), Gaps = 1/80 (1%)
Query: 35 ATQRQMLEAESKEQALQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVTQGEIDRM 94
+ + L E +E + ++ Q+ +E+L+ ++ R+Y R +++
Sbjct: 76 KKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQYLREYNLFDR-NNLQLEDN 134
Query: 95 TKQDEKYREEKQNKMMKLRQ 114
+ E E N++ KLR+
Sbjct: 135 LQSLELQYEYSLNQLDKLRK 154
>gnl|CDD|236365 PRK09041, motB, flagellar motor protein MotB; Validated.
Length = 317
Score = 26.8 bits (60), Expect = 3.8
Identities = 12/46 (26%), Positives = 28/46 (60%), Gaps = 4/46 (8%)
Query: 87 TQGEIDRMTKQDEKYREEK----QNKMMKLRQDMEESLKSREQMRQ 128
GE+++ E + EK Q ++ KL+Q+++++++S ++RQ
Sbjct: 93 QDGEVNKQPNIAELAQGEKERREQERLEKLKQELDQAIESNPKLRQ 138
>gnl|CDD|224482 COG1566, EmrA, Multidrug resistance efflux pump [Defense
mechanisms].
Length = 352
Score = 26.9 bits (60), Expect = 4.0
Identities = 18/75 (24%), Positives = 38/75 (50%), Gaps = 3/75 (4%)
Query: 21 FTIIDRYYQEQMYSATQRQMLEAESKEQALQRQKVEYRKALEQQMQEELDK-KKELK-YR 78
F I R Y+ + A + + AE++ + L+ Q + + Q ++LD+ + EL+
Sbjct: 82 FRIDPRDYRAALEQA-EAALAAAEAQLRNLRAQLASAQALIAQAEAQDLDQAQNELERRA 140
Query: 79 QYQEEQRVTQGEIDR 93
+ + V++ E+DR
Sbjct: 141 ELAQRGVVSREELDR 155
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 27.0 bits (60), Expect = 4.0
Identities = 23/101 (22%), Positives = 48/101 (47%), Gaps = 1/101 (0%)
Query: 38 RQMLEAESKEQALQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVTQGEIDRMTKQ 97
++ LE E +E QR ++ + ++ E L+ KKE + + EE +++
Sbjct: 828 KEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKE-ELEEELEELEAALRDLESRLGD 886
Query: 98 DEKYREEKQNKMMKLRQDMEESLKSREQMRQHEKLQNQELE 138
+K R+E + ++ +L + +EE E+ R+ +LE
Sbjct: 887 LKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLE 927
Score = 27.0 bits (60), Expect = 4.9
Identities = 29/114 (25%), Positives = 52/114 (45%), Gaps = 5/114 (4%)
Query: 30 EQMYSATQRQMLEAESKEQALQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVTQG 89
E+ + ++ E E++ +L+R E K E + EE K E + + E +
Sbjct: 286 EEEQLRVKEKIGELEAEIASLERSIAE--KERELEDAEERLAKLEAEIDKLLAEIEELER 343
Query: 90 EIDRMTKQDEKYREE---KQNKMMKLRQDMEESLKSREQMRQHEKLQNQELEKT 140
EI+ K+ +K EE + ++ LR ++EE K + R K ++LEK
Sbjct: 344 EIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKL 397
Score = 27.0 bits (60), Expect = 5.2
Identities = 17/116 (14%), Positives = 48/116 (41%), Gaps = 7/116 (6%)
Query: 34 SATQRQMLEAESKEQALQRQKVEYRKALEQ------QMQEELDKKKELKYRQYQEEQRVT 87
A +RQ+ E + + L + E K LE+ ++ +++ E + + +E+
Sbjct: 240 EAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGEL 299
Query: 88 QGEIDRMTKQDEKYREEKQNKMMKLRQDMEESLKSREQMRQ-HEKLQNQELEKTPW 142
+ EI + + + E ++ +L + E K ++ + +++ + +
Sbjct: 300 EAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKL 355
>gnl|CDD|184039 PRK13411, PRK13411, molecular chaperone DnaK; Provisional.
Length = 653
Score = 27.0 bits (60), Expect = 4.1
Identities = 16/54 (29%), Positives = 35/54 (64%), Gaps = 3/54 (5%)
Query: 87 TQGEIDRMTKQDEKYREE--KQNKMMKLRQDMEESLKSREQ-MRQHEKLQNQEL 137
+ EI+RM ++ EKY EE ++ ++++L+ + L S E ++++ +L ++EL
Sbjct: 503 SSNEIERMRQEAEKYAEEDRRRKQLIELKNQADSLLYSYESTLKENGELISEEL 556
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 27.1 bits (61), Expect = 4.3
Identities = 14/69 (20%), Positives = 30/69 (43%)
Query: 71 KKKELKYRQYQEEQRVTQGEIDRMTKQDEKYREEKQNKMMKLRQDMEESLKSREQMRQHE 130
K +Y ++ EID ++ EK+ +K+ + KL ++ E++K E ++ +
Sbjct: 145 PKLIKQYEKFGFGILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQ 204
Query: 131 KLQNQELEK 139
E
Sbjct: 205 SKNFDPKEG 213
>gnl|CDD|178450 PLN02859, PLN02859, glutamine-tRNA ligase.
Length = 788
Score = 27.0 bits (60), Expect = 4.4
Identities = 17/39 (43%), Positives = 24/39 (61%), Gaps = 2/39 (5%)
Query: 91 IDRMTKQDEK-YREEKQNKMMKLRQDMEESLKSREQMRQ 128
+D T ++ K YRE+K N + R +EESLK E MR+
Sbjct: 358 VDHQTPEEIKEYREKKMNSPWRDRP-IEESLKLFEDMRR 395
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
Length = 1355
Score = 27.0 bits (59), Expect = 4.6
Identities = 21/83 (25%), Positives = 37/83 (44%), Gaps = 13/83 (15%)
Query: 63 QQMQEELDKKKELKYRQYQEEQRVTQGEIDRMTK----------QDEKYREEKQNKMMKL 112
Q+ EE K +E + QY + EID M + Q ++Y E+ Q+ +
Sbjct: 630 QRAAEE--KAREAQRNQYDSGDQYNDDEIDAMQQDELARQFAQTQQQRYGEQYQHDVPVN 687
Query: 113 RQDMEESLKSREQMRQHEKLQNQ 135
+D + + ++ E RQ + Q Q
Sbjct: 688 AEDADAAAEA-ELARQFAQTQQQ 709
>gnl|CDD|222374 pfam13779, DUF4175, Domain of unknown function (DUF4175).
Length = 820
Score = 26.8 bits (60), Expect = 4.8
Identities = 18/75 (24%), Positives = 32/75 (42%), Gaps = 11/75 (14%)
Query: 48 QALQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVTQGEIDRMTKQDEKY-----R 102
+AL +Y + L +Q Q + + +TQ ++ RM + E+ R
Sbjct: 503 EALD----DYMRQLAEQAQRNPQ--QGDQPPDQGNSMELTQQDLQRMMDRIEELMESGRR 556
Query: 103 EEKQNKMMKLRQDME 117
E Q + +L+Q ME
Sbjct: 557 AEAQQLLEQLQQMME 571
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 26.8 bits (59), Expect = 4.8
Identities = 28/117 (23%), Positives = 50/117 (42%), Gaps = 9/117 (7%)
Query: 29 QEQMYSATQRQMLEAESKEQALQRQKVE--YRKALEQQMQEELDKKKELKYRQYQ----- 81
Q ++ S + + +S + +R K E R E+Q+ EE +KKE + +
Sbjct: 461 QTKLNSMVSAKQKDKQSMQSMEKRLKSEADSRVNAEKQLAEEKKRKKEEEETAARAAAQA 520
Query: 82 EEQRVTQGEIDRMTKQDEKYREEKQNKMMKLRQDMEESLKSREQMRQHEKLQNQELE 138
R E + KQD + +K +KL++ EE ++ ++ K Q E E
Sbjct: 521 AASREECAESLKQAKQDLEMEIKKLEHDLKLKE--EECRMLEKEAQELRKYQESEKE 575
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 26.4 bits (59), Expect = 5.0
Identities = 17/59 (28%), Positives = 35/59 (59%), Gaps = 8/59 (13%)
Query: 52 RQKVE--YRKALEQQMQEELDKKKELKYRQYQEEQRVTQGEIDRMTKQDEKYREEKQNK 108
R++ E KA E++ QEE +KKE K ++ +E ++ +++ ++++ EEK+ K
Sbjct: 265 REEEEEKILKAAEEERQEEAQEKKEEKKKEEREA------KLAKLSPEEQRKLEEKERK 317
>gnl|CDD|129428 TIGR00328, flhB, flagellar biosynthetic protein FlhB. FlhB and its
functionally equivalent orthologs, from among a larger
superfamily of proteins involved in type III protein
export systems, are specifically involved in flagellar
protein export. The seed members are restricted and the
trusted cutoff is set high such that the proteins
gathered by this model play roles specifically related
to flagellar structures. Full-length homologs scoring
below the trusted cutoff are involved in peptide export
but not necessarily in the creation of flagella
[Cellular processes, Chemotaxis and motility].
Length = 347
Score = 26.5 bits (59), Expect = 5.8
Identities = 14/51 (27%), Positives = 23/51 (45%), Gaps = 12/51 (23%)
Query: 93 RMTKQDEKYREEKQNKMMKLRQDMEES--LKSREQMRQHEKLQNQELEKTP 141
+MTKQ+ K E KQ E +K R + Q E + + +++ P
Sbjct: 214 KMTKQEVK-DELKQ---------SEGDPEVKGRIRQMQREAARRRMMQEVP 254
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 26.2 bits (58), Expect = 7.2
Identities = 28/119 (23%), Positives = 44/119 (36%), Gaps = 10/119 (8%)
Query: 18 RNMFTII-----DRYYQEQMYSATQRQMLEAESKEQALQRQKVEY----RKALEQQMQEE 68
N+ T I RY + + A +R + R+ V+Y R ++ Q
Sbjct: 745 DNIATRIQRAIRGRYLRRRYLQALKRIKKIQVIQHGFRLRRLVDYELKWRLFIKLQPLLS 804
Query: 69 LDKKKELKYRQYQEEQRVTQGEIDRMTKQDEKYREEKQNKMMKLRQDMEESLKSREQMR 127
L ++ +YR Y Q I R K E E K L Q SLK++++
Sbjct: 805 LLGSRK-EYRSYLACIIKLQKTIKREKKLRETEEVEFSLKAEVLIQKFGRSLKAKKRFS 862
>gnl|CDD|173877 cd08512, PBP2_NikA_DppA_OppA_like_7, The substrate-binding
component of an uncharacterized ABC-type
nickel/dipeptide/oligopeptide-like import system
contains the type 2 periplasmic binding fold. This CD
represents the substrate-binding domain of an
uncharacterized ATP-binding cassette (ABC) type
nickel/dipeptide/oligopeptide-like transporter. The
oligopeptide-binding protein OppA and the
dipeptide-binding protein DppA show significant sequence
similarity to NikA, the initial nickel receptor. The
DppA binds dipeptides and some tripeptides and is
involved in chemotaxis toward dipeptides, whereas the
OppA binds peptides of a wide range of lengths (2-35
amino acid residues) and plays a role in recycling of
cell wall peptides, which precludes any involvement in
chemotaxis. Most of other periplasmic binding proteins
are comprised of only two globular subdomains
corresponding to domains I and III of the
dipeptide/oligopeptide binding proteins. The structural
topology of these domains is most similar to that of the
type 2 periplasmic binding proteins (PBP2), which are
responsible for the uptake of a variety of substrates
such as phosphate, sulfate, polysaccharides,
lysine/arginine/ornithine, and histidine. The PBP2 bind
their ligand in the cleft between these domains in a
manner resembling a Venus flytrap. After binding their
specific ligand with high affinity, they can interact
with a cognate membrane transport complex comprised of
two integral membrane domains and two cytoplasmically
located ATPase domains. This interaction triggers the
ligand translocation across the cytoplasmic membrane
energized by ATP hydrolysis. Besides transport
proteins, the PBP2 superfamily includes the
ligand-binding domains from ionotropic glutamate
receptors, LysR-type transcriptional regulators, and
unorthodox sensor proteins involved in signal
transduction.
Length = 476
Score = 26.0 bits (58), Expect = 7.4
Identities = 8/24 (33%), Positives = 14/24 (58%), Gaps = 3/24 (12%)
Query: 19 NMFTIIDRYYQEQMYSATQRQMLE 42
+ +I R+ E +AT+R +LE
Sbjct: 200 KLKRVIIRHVPE---AATRRLLLE 220
>gnl|CDD|220535 pfam10037, MRP-S27, Mitochondrial 28S ribosomal protein S27.
Members of this family of small ribosomal proteins
possess one of three conserved blocks of sequence found
in proteins that stimulate the dissociation of guanine
nucleotides from G-proteins, leaving open the
possibility that MRP-S27 might be a functional partner
of GTP-binding ribosomal proteins.
Length = 417
Score = 26.3 bits (58), Expect = 7.4
Identities = 10/68 (14%), Positives = 28/68 (41%)
Query: 30 EQMYSATQRQMLEAESKEQALQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVTQG 89
++ + + + S E+ L + E + EQ+ Q + ++L R+ + ++ +
Sbjct: 348 SKLQALAKVESRSLLSVEERLPTIEKEDLELYEQRQQLWFFENRKLWQRKKKLREQADEE 407
Query: 90 EIDRMTKQ 97
R +
Sbjct: 408 YQQRHATR 415
>gnl|CDD|215618 PLN03184, PLN03184, chloroplast Hsp70; Provisional.
Length = 673
Score = 26.4 bits (58), Expect = 7.5
Identities = 16/55 (29%), Positives = 31/55 (56%), Gaps = 6/55 (10%)
Query: 88 QGEIDRMTKQDEKYREEKQNKMMKLRQDMEESLKSREQMRQHEKLQNQEL-EKTP 141
+ E++RM ++ EK+ +E + K R ++ ++ + Q EK Q +EL +K P
Sbjct: 541 KDEVERMVQEAEKFAKEDKEK----RDAVDTKNQADSVVYQTEK-QLKELGDKVP 590
>gnl|CDD|179580 PRK03449, PRK03449, putative inner membrane protein translocase
component YidC; Provisional.
Length = 304
Score = 26.2 bits (58), Expect = 7.6
Identities = 10/31 (32%), Positives = 19/31 (61%)
Query: 36 TQRQMLEAESKEQALQRQKVEYRKALEQQMQ 66
T R+M E + + +ALQ++ R+ + +MQ
Sbjct: 57 TTRKMQELQPQIKALQKKYGNDRQKMALEMQ 87
>gnl|CDD|237199 PRK12772, PRK12772, bifunctional flagellar biosynthesis protein
FliR/FlhB; Provisional.
Length = 609
Score = 26.2 bits (58), Expect = 8.0
Identities = 14/49 (28%), Positives = 24/49 (48%), Gaps = 5/49 (10%)
Query: 78 RQYQEEQRVTQGEIDRMTKQDEKYREEKQNKMMKLRQDMEESLKSREQM 126
QY ++ R+T+ E+ KQDE + K K++Q E + + M
Sbjct: 470 YQYNKDLRMTKQEVKEEYKQDEGDPQIKA----KIKQKQRE-MAMQRMM 513
>gnl|CDD|215140 PLN02251, PLN02251, pyrophosphate-dependent phosphofructokinase.
Length = 568
Score = 26.3 bits (58), Expect = 8.2
Identities = 20/64 (31%), Positives = 31/64 (48%), Gaps = 8/64 (12%)
Query: 11 KTSKTCSRNMFTIIDRYYQEQMYSATQRQMLEAESKEQALQRQKVEYRKALEQQMQEELD 70
K K S +F + QEQ+ MLE + +Q K+E K L Q ++ EL+
Sbjct: 369 KKLKPQSLQLFDFLPHAIQEQL-------MLERDPHGN-VQVAKIETEKMLIQMVETELE 420
Query: 71 KKKE 74
K+K+
Sbjct: 421 KRKQ 424
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 26.0 bits (57), Expect = 8.6
Identities = 18/58 (31%), Positives = 35/58 (60%), Gaps = 3/58 (5%)
Query: 56 EYRKALEQQMQEELDKKKELKYRQYQEEQRVTQGEIDRMTKQDE--KYREEKQNKMMK 111
E R+ LE+ +EE +++++ + ++ +EE++ E DR + E K RE+ QN + K
Sbjct: 253 EKRRELEKLAKEEAERERQAEEQRRREEEKAAM-EADRAQAKAEVEKRREKLQNLLKK 309
>gnl|CDD|153328 cd07644, I-BAR_IMD_BAIAP2L2, Inverse (I)-BAR, also known as the
IRSp53/MIM homology Domain (IMD), of Brain-specific
Angiogenesis Inhibitor 1-Associated Protein 2-Like 2.
The IMD domain, also called Inverse-Bin/Amphiphysin/Rvs
(I-BAR) domain, is a dimerization and lipid-binding
module that bends membranes and induces membrane
protrusions. This group is composed of uncharacterized
proteins known as BAIAP2L2 (Brain-specific Angiogenesis
Inhibitor 1-Associated Protein 2-Like 2). They contain
an N-terminal IMD, an SH3 domain, and a WASP homology 2
(WH2) actin-binding motif at the C-terminus. The related
proteins, BAIAP2L1 and IRSp53, function as regulators of
membrane dynamics and the actin cytoskeleton. The IMD
domain binds and bundles actin filaments, binds
membranes and produces membrane protrusions, and
interacts with the small GTPase Rac.
Length = 215
Score = 25.7 bits (56), Expect = 9.0
Identities = 19/73 (26%), Positives = 35/73 (47%)
Query: 53 QKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVTQGEIDRMTKQDEKYREEKQNKMMKL 112
Q ++ L+ Q E+ + EL+YR E+ RM +Q ++ E + + +L
Sbjct: 97 QHMDKNTKLDMQFIEDSRRVYELEYRHRAANLEKCMSELWRMERQRDRNVREMKENVNRL 156
Query: 113 RQDMEESLKSREQ 125
RQ M+ LK ++
Sbjct: 157 RQSMQAFLKESQR 169
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 26.1 bits (58), Expect = 9.3
Identities = 17/79 (21%), Positives = 42/79 (53%), Gaps = 2/79 (2%)
Query: 62 EQQMQEELDKKKELKYRQYQEEQRVTQGEIDRMTKQDE-KYREEKQNKMMKLRQDMEESL 120
+++EEL + +E + ++ +EE + + ++ K+ + EK+ ++ KL + E+
Sbjct: 329 FPELKEELKELEE-ELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLS 387
Query: 121 KSREQMRQHEKLQNQELEK 139
+ E++ + K +ELE
Sbjct: 388 EELEELEEELKELKEELES 406
>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 257 to 277 amino acids in length. This domain is
found associated with pfam00004. This domain has a
conserved LER sequence motif.
Length = 276
Score = 25.9 bits (57), Expect = 9.6
Identities = 28/107 (26%), Positives = 56/107 (52%), Gaps = 3/107 (2%)
Query: 34 SATQRQMLEAESKEQALQRQKVEYRKALEQQMQEELDKKKELKYRQYQEEQRVT-QGEID 92
S Q + +AE + + + + + + LE+ E +++K L+ + QE+QR Q E+
Sbjct: 57 SKMQEKTRQAELEAKIKEYEAQQAQAKLERARVEAEERRKTLQEQTQQEQQRAQYQDELA 116
Query: 93 RMTKQDEKYREEKQNKMMKLRQDMEESLKSREQMRQHEKLQNQELEK 139
R Q E ++ +QN+ + Q EES+ +E MR+ + + E+ +
Sbjct: 117 RKRYQKELEQQRRQNEELLKMQ--EESVLRQEAMRRATEEEILEMRR 161
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.310 0.123 0.322
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 6,835,379
Number of extensions: 613331
Number of successful extensions: 2790
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2109
Number of HSP's successfully gapped: 881
Length of query: 142
Length of database: 10,937,602
Length adjustment: 87
Effective length of query: 55
Effective length of database: 7,078,804
Effective search space: 389334220
Effective search space used: 389334220
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 54 (24.7 bits)