RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy12642
(201 letters)
>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
lipoprotein. This protein family occurs in strictly
within a subset of Gram-negative bacterial species with
the proposed PEP-CTERM/exosortase system, analogous to
the LPXTG/sortase system common in Gram-positive
bacteria. This protein occurs in a species if and only
if a transmembrane histidine kinase (TIGR02916) and a
DNA-binding response regulator (TIGR02915) also occur.
The present of tetratricopeptide repeats (TPR) suggests
protein-protein interaction, possibly for the regulation
of PEP-CTERM protein expression, since many PEP-CTERM
proteins in these genomes are preceded by a proposed DNA
binding site for the response regulator.
Length = 899
Score = 47.4 bits (113), Expect = 2e-06
Identities = 43/169 (25%), Positives = 66/169 (39%), Gaps = 5/169 (2%)
Query: 3 PIVYIIGCFHFFR--YEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEY 60
P + + G + E+A ++L I+K + AR+ I ++ EAI L+
Sbjct: 296 PALLLAGASEYQLGNLEQAYQYLNQILKYAPNSHQARRLLASIQLRLGRVDEAIATLSPA 355
Query: 61 LKKFMTDQETWQELCDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLEN 120
L D L + YL+ GD+ KA + + P N + I G +
Sbjct: 356 LGLDPDDPAALSLLGEAYLALGDFEKAAEYLAKATELDPENAAARTQLG-ISKLSQG--D 412
Query: 121 IELAISHYLMAINLNEKNIRALYGLALSCHQVLTSAKCSAAKKKEISKQ 169
AI+ A L+ + RA L LS + K AA KK KQ
Sbjct: 413 PSEAIADLETAAQLDPELGRADLLLILSYLRSGQFDKALAAAKKLEKKQ 461
Score = 37.4 bits (87), Expect = 0.004
Identities = 33/135 (24%), Positives = 57/135 (42%), Gaps = 10/135 (7%)
Query: 16 YEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQELC 75
Y+ A+ L+ ++ D + AR I A A KEL + L + L
Sbjct: 38 YKAAIIQLKNALQKDPNDAEARFLLGKIYLALGDYAAAEKELRKALSLGYPKNQVLPLLA 97
Query: 76 DLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQG----GLENIELAISHYLMA 131
YL +G + + + + L L + A++L +G GL +ELA Y A
Sbjct: 98 RAYLLQGKFQQVLDELPGKTL------LDDEGAAELLALRGLAYLGLGQLELAQKSYEQA 151
Query: 132 INLNEKNIRALYGLA 146
+ ++ +++ A GLA
Sbjct: 152 LAIDPRSLYAKLGLA 166
Score = 37.4 bits (87), Expect = 0.004
Identities = 33/132 (25%), Positives = 49/132 (37%), Gaps = 3/132 (2%)
Query: 15 RYEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQEL 74
EEA+ LE +++ K ++ +A+ L E E W L
Sbjct: 548 NEEEAVAWLEKAAELNPQEIEPALALAQYYLGKGQLKKALAILNEAADAAPDSPEAWLML 607
Query: 75 CDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLENIELAISHYLMAINL 134
L+ GD KAV ++L P + L AD ++N AI+ A+ L
Sbjct: 608 GRAQLAAGDLNKAVSSFKKLLALQPDSALALLLLADA---YAVMKNYAKAITSLKRALEL 664
Query: 135 NEKNIRALYGLA 146
N A GLA
Sbjct: 665 KPDNTEAQIGLA 676
Score = 35.8 bits (83), Expect = 0.012
Identities = 33/130 (25%), Positives = 52/130 (40%), Gaps = 3/130 (2%)
Query: 17 EEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQELCD 76
EA+ LET ++D A I + +A+ + KK + L
Sbjct: 414 SEAIADLETAAQLDPELGRADLLLILSYLRSGQFDKALAAAKKLEKKQPDNASLHNLLGA 473
Query: 77 LYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLENIELAISHYLMAINLNE 136
+YL +GD AKA E+ P A I +G N + AI + + ++
Sbjct: 474 IYLGKGDLAKAREAFEKALSIEPDFFPAAANLARIDIQEG---NPDDAIQRFEKVLTIDP 530
Query: 137 KNIRALYGLA 146
KN+RA+ LA
Sbjct: 531 KNLRAILALA 540
Score = 33.9 bits (78), Expect = 0.043
Identities = 27/110 (24%), Positives = 46/110 (41%), Gaps = 1/110 (0%)
Query: 15 RYEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQEL 74
Y A++ +K ++ A K +L + N EA+K L +LK D L
Sbjct: 718 DYPAAIQAYRKALKRAPSSQNAIKLHRALLASGNT-AEAVKTLEAWLKTHPNDAVLRTAL 776
Query: 75 CDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLENIELA 124
+LYL++ DY KA+ + + P N ++ A + +E A
Sbjct: 777 AELYLAQKDYDKAIKHYQTVVKKAPDNAVVLNNLAWLYLELKDPRALEYA 826
Score = 32.0 bits (73), Expect = 0.19
Identities = 27/134 (20%), Positives = 40/134 (29%), Gaps = 5/134 (3%)
Query: 16 YEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQELC 75
E AL I + N A IL + EA K LKK
Sbjct: 209 IELALAAYRKAIALRPNNIAVLLALATILIEAGEFEEAEKHADALLKKAPNSPLAHYLKA 268
Query: 76 DLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADI-LYTQGGLENIELAISHYLMAINL 134
+ + +Y A +++ P +L A Y G LE ++ L
Sbjct: 269 LVDFQKKNYEDARETLQDALKSAP-EYLPALLLAGASEYQLGNLEQAYQYLNQILKYA-- 325
Query: 135 NEKNIRALYGLALS 148
+ +A LA
Sbjct: 326 -PNSHQARRLLASI 338
Score = 31.6 bits (72), Expect = 0.24
Identities = 25/103 (24%), Positives = 40/103 (38%)
Query: 17 EEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQELCD 76
E A + E + ID + A+ + A+N+ EA + E L + + D
Sbjct: 142 ELAQKSYEQALAIDPRSLYAKLGLAQLALAENRFDEARALIDEVLTADPGNVDALLLKGD 201
Query: 77 LYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLE 119
L LS G+ A+ + P+N + A IL G E
Sbjct: 202 LLLSLGNIELALAAYRKAIALRPNNIAVLLALATILIEAGEFE 244
Score = 31.2 bits (71), Expect = 0.42
Identities = 18/43 (41%), Positives = 22/43 (51%)
Query: 46 AKNKIPEAIKELTEYLKKFMTDQETWQELCDLYLSEGDYAKAV 88
KNK AI +L L+K D E L +YL+ GDYA A
Sbjct: 34 QKNKYKAAIIQLKNALQKDPNDAEARFLLGKIYLALGDYAAAE 76
Score = 30.8 bits (70), Expect = 0.44
Identities = 18/86 (20%), Positives = 31/86 (36%)
Query: 16 YEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQELC 75
+A E E + I+ A I + +AI+ + L + L
Sbjct: 481 LAKAREAFEKALSIEPDFFPAAANLARIDIQEGNPDDAIQRFEKVLTIDPKNLRAILALA 540
Query: 76 DLYLSEGDYAKAVFCMEELFLHHPHN 101
LYL G+ +AV +E+ +P
Sbjct: 541 GLYLRTGNEEEAVAWLEKAAELNPQE 566
Score = 30.1 bits (68), Expect = 0.96
Identities = 24/130 (18%), Positives = 50/130 (38%), Gaps = 3/130 (2%)
Query: 14 FRYEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQE 73
R++EA ++ ++ D N A K +L + I A+ + + +
Sbjct: 173 NRFDEARALIDEVLTADPGNVDALLLKGDLLLSLGNIELALAAYRKAIALRPNNIAVLLA 232
Query: 74 LCDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLENIELAISHYLMAIN 133
L + + G++ +A + L P++ L H A + + + N E A A+
Sbjct: 233 LATILIEAGEFEEAEKHADALLKKAPNSPLAHYLKALVDFQKK---NYEDARETLQDALK 289
Query: 134 LNEKNIRALY 143
+ + AL
Sbjct: 290 SAPEYLPALL 299
Score = 29.7 bits (67), Expect = 1.1
Identities = 33/164 (20%), Positives = 55/164 (33%), Gaps = 36/164 (21%)
Query: 16 YEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQELC 75
Y +A+ L+ +++ NT A+ +L A + A K K+ ++
Sbjct: 651 YAKAITSLKRALELKPDNTEAQIGLAQLLLAAKRTESAKKIAKSLQKQHPKAALGFELEG 710
Query: 76 DLYLSEGDYAKAVFCMEELF-----------LH----------------------HPHNH 102
DLYL + DY A+ + LH HP++
Sbjct: 711 DLYLRQKDYPAAIQAYRKALKRAPSSQNAIKLHRALLASGNTAEAVKTLEAWLKTHPNDA 770
Query: 103 LLHQRYADILYTQGGLENIELAISHYLMAINLNEKNIRALYGLA 146
+L A++ Q + AI HY + N L LA
Sbjct: 771 VLRTALAELYLAQKDYDK---AIKHYQTVVKKAPDNAVVLNNLA 811
Score = 28.5 bits (64), Expect = 2.8
Identities = 34/191 (17%), Positives = 62/191 (32%), Gaps = 12/191 (6%)
Query: 15 RYEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQEL 74
++A++ E ++ ID N A + EA+ L + + + E L
Sbjct: 514 NPDDAIQRFEKVLTIDPKNLRAILALAGLYLRTGNEEEAVAWLEKAAELNPQEIEPALAL 573
Query: 75 CDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLENIELAISHYLMAINL 134
YL +G KA+ + E P + G L A+S + + L
Sbjct: 574 AQYYLGKGQLKKALAILNEAADAAPDSPEAWLMLGRAQLAAGDLNK---AVSSFKKLLAL 630
Query: 135 NEKNIRALYGLALSCHQVLTSAKCSAAKKKEISKQM-----MWVSKHLARQYEEQQGNTE 189
+ AL LA K A + + + ++ Q TE
Sbjct: 631 QPDSALALLLLA----DAYAVMKNYAKAITSLKRALELKPDNTEAQIGLAQLLLAAKRTE 686
Query: 190 TLTELMSALQV 200
+ ++ +LQ
Sbjct: 687 SAKKIAKSLQK 697
Score = 28.5 bits (64), Expect = 3.1
Identities = 24/72 (33%), Positives = 34/72 (47%), Gaps = 3/72 (4%)
Query: 77 LYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLENIELAISHYLMAINLNE 136
L L+E + +A ++E+ P N D+L + G NIELA++ Y AI L
Sbjct: 168 LALAENRFDEARALIDEVLTADPGNVDALLLKGDLLLSLG---NIELALAAYRKAIALRP 224
Query: 137 KNIRALYGLALS 148
NI L LA
Sbjct: 225 NNIAVLLALATI 236
Score = 27.4 bits (61), Expect = 7.4
Identities = 21/118 (17%), Positives = 38/118 (32%), Gaps = 3/118 (2%)
Query: 17 EEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQELCD 76
+A+ + ++ + + A +AI L L+ + E L
Sbjct: 618 NKAVSSFKKLLALQPDSALALLLLADAYAVMKNYAKAITSLKRALELKPDNTEAQIGLAQ 677
Query: 77 LYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLENIELAISHYLMAINL 134
L L+ A + L HP L + D+ Q ++ AI Y A+
Sbjct: 678 LLLAAKRTESAKKIAKSLQKQHPKAALGFELEGDLYLRQ---KDYPAAIQAYRKALKR 732
>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
34 amino acids
[WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found
in a variety of organisms including bacteria,
cyanobacteria, yeast, fungi, plants, and humans in
various subcellular locations; involved in a variety of
functions including protein-protein interactions, but
common features in the interaction partners have not
been defined; involved in chaperone, cell-cycle,
transciption, and protein transport complexes; the
number of TPR motifs varies among proteins (1,3-11,13
15,16,19); 5-6 tandem repeats generate a right-handed
helical structure with an amphipathic channel that is
thought to accomodate an alpha-helix of a target
protein; it has been proposed that TPR proteins
preferably interact with WD-40 repeat proteins, but in
many instances several TPR-proteins seem to aggregate to
multi-protein complexes; examples of TPR-proteins
include, Cdc16p, Cdc23p and Cdc27p components of the
cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal
targeting signals, the Tom70p co-receptor for
mitochondrial targeting signals, Ser/Thr phosphatase 5C
and the p110 subunit of O-GlcNAc transferase; three
copies of the repeat are present here.
Length = 100
Score = 42.4 bits (100), Expect = 1e-05
Identities = 18/100 (18%), Positives = 33/100 (33%), Gaps = 3/100 (3%)
Query: 38 KRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQELCDLYLSEGDYAKAVFCMEELFLH 97
+ EA++ + L+ + + + L Y G Y +A+ E+
Sbjct: 4 LNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALEL 63
Query: 98 HPHNHLLHQRYADILYTQGGLENIELAISHYLMAINLNEK 137
P N + Y G E A+ Y A+ L+
Sbjct: 64 DPDNAKAYYNLGLAYYKLG---KYEEALEAYEKALELDPN 100
Score = 42.4 bits (100), Expect = 1e-05
Identities = 24/83 (28%), Positives = 36/83 (43%), Gaps = 3/83 (3%)
Query: 69 ETWQELCDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLENIELAISHY 128
E L +LY GDY +A+ E+ P N + A Y G E A+ Y
Sbjct: 1 EALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLG---KYEEALEDY 57
Query: 129 LMAINLNEKNIRALYGLALSCHQ 151
A+ L+ N +A Y L L+ ++
Sbjct: 58 EKALELDPDNAKAYYNLGLAYYK 80
Score = 34.7 bits (80), Expect = 0.005
Identities = 19/79 (24%), Positives = 35/79 (44%)
Query: 15 RYEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQEL 74
Y+EALE+ E +++D N A K EA+++ + L+ + + + L
Sbjct: 15 DYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYYNL 74
Query: 75 CDLYLSEGDYAKAVFCMEE 93
Y G Y +A+ E+
Sbjct: 75 GLAYYKLGKYEEALEAYEK 93
Score = 33.9 bits (78), Expect = 0.012
Identities = 14/48 (29%), Positives = 22/48 (45%)
Query: 15 RYEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLK 62
+YEEALE E +++D N A K EA++ + L+
Sbjct: 49 KYEEALEDYEKALELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALE 96
>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat.
Length = 69
Score = 38.1 bits (89), Expect = 2e-04
Identities = 15/68 (22%), Positives = 26/68 (38%), Gaps = 2/68 (2%)
Query: 69 ETWQELCDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLENIELAISHY 128
E + L + GDY +A+ E+ P N + A G ++ E A+
Sbjct: 4 EALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLG--KDYEEALEDL 61
Query: 129 LMAINLNE 136
A+ L+
Sbjct: 62 EKALELDP 69
Score = 27.7 bits (62), Expect = 0.97
Identities = 13/45 (28%), Positives = 20/45 (44%), Gaps = 3/45 (6%)
Query: 103 LLHQRYADILYTQGGLENIELAISHYLMAINLNEKNIRALYGLAL 147
+ + L+ G + + AI Y A+ L+ N A Y LAL
Sbjct: 4 EALKNLGNALFKLG---DYDEAIEAYEKALELDPDNAEAYYNLAL 45
Score = 26.9 bits (60), Expect = 1.8
Identities = 14/49 (28%), Positives = 25/49 (51%), Gaps = 1/49 (2%)
Query: 15 RYEEALEHLETIIKIDETNTAAR-KRKICILKAKNKIPEAIKELTEYLK 62
Y+EA+E E +++D N A + LK EA+++L + L+
Sbjct: 18 DYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDLEKALE 66
Score = 26.1 bits (58), Expect = 3.7
Identities = 17/61 (27%), Positives = 25/61 (40%), Gaps = 1/61 (1%)
Query: 33 NTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQELCDLYLSEG-DYAKAVFCM 91
N A K L EAI+ + L+ + E + L YL G DY +A+ +
Sbjct: 2 NAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDL 61
Query: 92 E 92
E
Sbjct: 62 E 62
>gnl|CDD|234406 TIGR03939, PGA_TPR_OMP, poly-beta-1,6 N-acetyl-D-glucosamine export
porin PgaA. Members of this protein family are the
poly-beta-1,6 N-acetyl-D-glucosamine (PGA) export porin
PgaA of Gram-negative bacteria. There is no counterpart
in the poly-beta-1,6 N-acetyl-D-glucosamine biosynthesis
systems of Gram-positive bacteria such as Staphylococcus
epidermidis. The PGA polysaccharide adhesin is a
critical determinant of biofilm formation. The conserved
C-terminal domain of this outer membrane protein is
preceded by a variable number of TPR repeats.
Length = 800
Score = 37.3 bits (87), Expect = 0.003
Identities = 23/98 (23%), Positives = 41/98 (41%)
Query: 15 RYEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQEL 74
++++ALE +++ D N L K EA+K L EY+ +F TD ++ L
Sbjct: 101 QWDKALELYRKLLQRDPNNPDGLLGLALTLADAGKDAEALKYLKEYVARFPTDAARYEAL 160
Query: 75 CDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADIL 112
+ + D+ A+ ++ P N IL
Sbjct: 161 AYVLRAAEDHLDALQAWQQALTLEPDNPEAALELYRIL 198
>gnl|CDD|222121 pfam13428, TPR_14, Tetratricopeptide repeat.
Length = 44
Score = 32.1 bits (73), Expect = 0.017
Identities = 7/43 (16%), Positives = 13/43 (30%)
Query: 69 ETWQELCDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADI 111
L L+ GD +A+ + P + A +
Sbjct: 2 AALLALARALLALGDLDEALALLRRALALDPDDPEALLLLARL 44
>gnl|CDD|222123 pfam13432, TPR_16, Tetratricopeptide repeat.
Length = 65
Score = 32.3 bits (74), Expect = 0.025
Identities = 13/66 (19%), Positives = 23/66 (34%), Gaps = 3/66 (4%)
Query: 73 ELCDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLENIELAISHYLMAI 132
L L GDY +A+ +E +P + L QG + A + A+
Sbjct: 2 ALARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQG---RLAEAAALLRAAL 58
Query: 133 NLNEKN 138
+ +
Sbjct: 59 AADPDD 64
Score = 28.4 bits (64), Expect = 0.52
Identities = 13/53 (24%), Positives = 18/53 (33%)
Query: 15 RYEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTD 67
Y+EAL LE + A L + ++ EA L L D
Sbjct: 12 DYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAADPDD 64
>gnl|CDD|225094 COG2183, Tex, Transcriptional accessory protein [Transcription].
Length = 780
Score = 33.0 bits (76), Expect = 0.10
Identities = 44/162 (27%), Positives = 65/162 (40%), Gaps = 24/162 (14%)
Query: 23 LETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQELCDLYLSEG 82
L + + E +RK ILK+ I E K LT+ LK+ + E EL DLYL
Sbjct: 57 LRDLEERLEYLRELEERKESILKS---IEEQGK-LTDELKEQIEAAEELTELEDLYL--- 109
Query: 83 DYA-----KAVFCME-------ELFLHHPHNHLLHQRYADILYTQGGLENIELAIS---H 127
Y +A E +L L P L + AD + + G+ + +LA+
Sbjct: 110 PYKEKRRTRATIAREKGLEPLADLILSKPSLDPLLESAADFISIEEGVSSSKLALDGARD 169
Query: 128 YLM-AINLNEKNIRALYGLALSCHQVLTSAKCSAAKKKEISK 168
L I+ + + L L H VLTS + E +K
Sbjct: 170 ILAERISEDAELREKLRDY-LRKHGVLTSKVVKGKEDDEGAK 210
>gnl|CDD|217453 pfam03251, Tymo_45kd_70kd, Tymovirus 45/70Kd protein. Tymoviruses
are single stranded RNA viruses. This family includes a
protein of unknown function that has been named based on
its molecular weight. Tymoviruses such as the ononis
yellow mosaic tymovirus encode only three proteins. Of
these two are overlapping this protein overlaps a larger
ORF that is thought to be the polymerase.
Length = 458
Score = 32.0 bits (73), Expect = 0.17
Identities = 9/24 (37%), Positives = 13/24 (54%), Gaps = 2/24 (8%)
Query: 94 LFLHHP--HNHLLHQRYADILYTQ 115
LHH H + LH R +D+L +
Sbjct: 130 HHLHHSSKHRNRLHARRSDVLPSL 153
>gnl|CDD|224661 COG1747, COG1747, Uncharacterized N-terminal domain of the
transcription elongation factor GreA [Function unknown].
Length = 711
Score = 32.2 bits (73), Expect = 0.19
Identities = 23/83 (27%), Positives = 38/83 (45%), Gaps = 7/83 (8%)
Query: 16 YEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQELC 75
+ EA+ L+ I++ DE + ARK I L+ K + +L EYLK Q + +
Sbjct: 221 WTEAIRILKHILEHDEKDVWARKEIIENLRDKYR---GHSQLEEYLKISNISQ-SGRNFF 276
Query: 76 DLYLSEGDYAKAVFCMEELFLHH 98
+ D+ K + E F+ H
Sbjct: 277 EAL---NDFEKLMHFDEGNFVFH 296
>gnl|CDD|226156 COG3629, DnrI, DNA-binding transcriptional activator of the SARP
family [Signal transduction mechanisms].
Length = 280
Score = 31.6 bits (72), Expect = 0.20
Identities = 12/53 (22%), Positives = 23/53 (43%)
Query: 15 RYEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTD 67
R + +EHLE +I++D + A R + + AI+ + K +
Sbjct: 168 RADAVIEHLERLIELDPYDEPAYLRLMEAYLVNGRQSAAIRAYRQLKKTLAEE 220
>gnl|CDD|223966 COG1035, FrhB, Coenzyme F420-reducing hydrogenase, beta subunit
[Energy production and conversion].
Length = 332
Score = 31.6 bits (72), Expect = 0.21
Identities = 21/63 (33%), Positives = 28/63 (44%), Gaps = 6/63 (9%)
Query: 4 IVYIIG--CFHFFRYEEALEHLETIIKIDETNTAA---RKRKICI-LKAKNKIPEAIKEL 57
IVY+IG C F YE + LE + +D + RK K + LK +KE
Sbjct: 174 IVYVIGLFCMENFSYEGLKKFLEEDLGVDPEDVEKMDIRKGKFVVELKDGEVKEIPLKET 233
Query: 58 TEY 60
EY
Sbjct: 234 EEY 236
>gnl|CDD|200938 pfam00025, Arf, ADP-ribosylation factor family. Pfam combines a
number of different Prosite families together.
Length = 174
Score = 31.0 bits (71), Expect = 0.24
Identities = 16/60 (26%), Positives = 28/60 (46%), Gaps = 6/60 (10%)
Query: 4 IVYIIGCFHFFRYEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIK--ELTEYL 61
+++++ R EEA E L ++ +E A + IL K +P A+ E+ E L
Sbjct: 85 VIFVVDSADRDRIEEAKEELHALLNEEELADAP----LLILANKQDLPGAMSEAEIRELL 140
>gnl|CDD|226559 COG4073, COG4073, Uncharacterized protein conserved in archaea
[Function unknown].
Length = 198
Score = 31.3 bits (71), Expect = 0.25
Identities = 10/47 (21%), Positives = 22/47 (46%)
Query: 21 EHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTD 67
L+ + +I +R R+ I + + + PE ++ EY + F +
Sbjct: 151 RSLDVVSEILGLIAESRTREEFIEELRRRFPEQTEKALEYFRSFYGE 197
>gnl|CDD|225660 COG3118, COG3118, Thioredoxin domain-containing protein
[Posttranslational modification, protein turnover,
chaperones].
Length = 304
Score = 31.2 bits (71), Expect = 0.30
Identities = 12/34 (35%), Positives = 19/34 (55%), Gaps = 2/34 (5%)
Query: 15 RYEEALEHLETIIKIDET--NTAARKRKICILKA 46
R E ALEHL +++ D + ARK + + +A
Sbjct: 251 RNEAALEHLLALLRRDRGFEDGEARKTLLELFEA 284
>gnl|CDD|131219 TIGR02164, torA, trimethylamine-N-oxide reductase TorA. This very
narrowly defined family represents TorA, part of a
family of related molybdoenzymes that include biotin
sulfoxide reductases, dimethyl sulfoxide reductases, and
at least two different subfamilies of
trimethylamine-N-oxide reductases. A single enzyme from
the larger family may have more than one activity. TorA
typically is located in the periplasm, has a Tat
(twin-arginine translocation)-dependent signal sequence,
and is encoded in a torCAD operon.
Length = 822
Score = 31.0 bits (70), Expect = 0.42
Identities = 26/107 (24%), Positives = 47/107 (43%), Gaps = 21/107 (19%)
Query: 7 IIGCFHFFRYE------EALEHLETIIKIDETNTAARKRKICILKA-----KNKIPE--- 52
GC + ++ +A + LET++ ID + TA + +L A +N I
Sbjct: 470 FSGCNPWHHHQDRNRMKQAFQKLETVVTIDVSWTATCRFSDIVLPACTQFERNDIDVYGS 529
Query: 53 -------AIKELTEYLKKFMTDQETWQELCDLYLSEGDYAKAVFCME 92
A+++L + L +D E + ELC + E +Y + + ME
Sbjct: 530 YSNRGIIAMQKLVDPLFDSRSDFEIFTELCRRFGKEKEYTRNMDEME 576
>gnl|CDD|227122 COG4783, COG4783, Putative Zn-dependent protease, contains TPR
repeats [General function prediction only].
Length = 484
Score = 30.4 bits (69), Expect = 0.53
Identities = 25/113 (22%), Positives = 38/113 (33%), Gaps = 18/113 (15%)
Query: 15 RYEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTDQETWQEL 74
+ +EA+E L+ + +D + + L K EAI+ L YL D W L
Sbjct: 355 KAKEAIERLKKALALDPNSPLLQLNLAQALLKGGKPQEAIRILNRYLFNDPEDPNGWDLL 414
Query: 75 CDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLEN-IELAIS 126
Y G+ A+A A+ G LE I +
Sbjct: 415 AQAYAELGNRAEA-----------------LLARAEGYALAGRLEQAIIFLMR 450
>gnl|CDD|233199 TIGR00939, 2a57, Equilibrative Nucleoside Transporter (ENT).
[Transport and binding proteins, Nucleosides, purines
and pyrimidines].
Length = 437
Score = 30.1 bits (68), Expect = 0.79
Identities = 14/74 (18%), Positives = 21/74 (28%), Gaps = 1/74 (1%)
Query: 48 NKIPEAIKELTEYLKKFMTDQETWQELCDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQR 107
N A + EY K E ++ Y + L + N L R
Sbjct: 13 NAFITAPQYFIEYYKYAQNIPEAIPSSSKMWKHFNTYY-TLASQLPSLLFNSLNLFLIFR 71
Query: 108 YADILYTQGGLENI 121
+ GGL +
Sbjct: 72 IPVTVRLLGGLVIL 85
>gnl|CDD|198111 smart01043, BTAD, Bacterial transcriptional activator domain.
Found in the DNRI/REDD/AFSR family of regulators. This
region of AFSR along with the C terminal region is
capable of independently directing actinorhodin
production. This family contains TPR repeats.
Length = 145
Score = 29.2 bits (66), Expect = 0.80
Identities = 15/60 (25%), Positives = 24/60 (40%), Gaps = 3/60 (5%)
Query: 69 ETWQELCDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLENIELAISHY 128
E + L + L+ G + +A+ +E L P LH+ LY G A+ Y
Sbjct: 62 EALEALAEALLALGRHEEALALLERLLALDPLRERLHRLLMRALYRAG---RRAEALRAY 118
>gnl|CDD|199209 cd00463, Ribosomal_L31e, Eukaryotic/archaeal ribosomal protein
L31. Ribosomal protein L31e, which is present in
archaea and eukaryotes, binds the 23S rRNA and is one
of six protein components encircling the polypeptide
exit tunnel. It is a component of the eukaryotic 60S
(large) ribosomal subunit, and the archaeal 50S (large)
ribosomal subunit.
Length = 83
Score = 28.3 bits (64), Expect = 0.82
Identities = 10/33 (30%), Positives = 17/33 (51%)
Query: 39 RKICILKAKNKIPEAIKELTEYLKKFMTDQETW 71
RK + K + P AIKE+ ++ K M ++
Sbjct: 14 RKRKHVPRKKRAPRAIKEIRKFAAKHMKTEDVR 46
>gnl|CDD|191825 pfam07719, TPR_2, Tetratricopeptide repeat. This Pfam entry
includes outlying Tetratricopeptide-like repeats (TPR)
that are not matched by pfam00515.
Length = 34
Score = 26.7 bits (60), Expect = 0.94
Identities = 10/33 (30%), Positives = 14/33 (42%)
Query: 69 ETWQELCDLYLSEGDYAKAVFCMEELFLHHPHN 101
E L Y GDY +A+ E+ P+N
Sbjct: 2 EALYNLGLAYYKLGDYEEALEAYEKALELDPNN 34
>gnl|CDD|220459 pfam09892, DUF2119, Uncharacterized protein conserved in archaea
(DUF2119). This domain, found in various hypothetical
archaeal proteins, has no known function.
Length = 193
Score = 29.3 bits (66), Expect = 1.1
Identities = 15/44 (34%), Positives = 22/44 (50%)
Query: 21 EHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKF 64
E LE + +I E +R R+ I K + PE K+ E +KF
Sbjct: 147 ESLEFVAEILEVGKESRSREEFIEYLKKRYPEQAKKAIEDARKF 190
>gnl|CDD|225605 COG3063, PilF, Tfp pilus assembly protein PilF [Cell motility and
secretion / Intracellular trafficking and secretion].
Length = 250
Score = 29.3 bits (66), Expect = 1.2
Identities = 26/76 (34%), Positives = 34/76 (44%), Gaps = 6/76 (7%)
Query: 78 YLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLENIELAISHYLMAINLNEK 137
YL +GDYA+A +E+ H P +L H A G +LA Y A++L
Sbjct: 45 YLQQGDYAQAKKNLEKALEHDPSYYLAHLVRAHYYQKLG---ENDLADESYRKALSLAPN 101
Query: 138 NIRAL--YGLALSCHQ 151
N L YG L C Q
Sbjct: 102 NGDVLNNYGAFL-CAQ 116
>gnl|CDD|234418 TIGR03964, mycofact_creat, mycofactocin system creatininase family
protein. Members of this protein family are
uncharacterized Actinobacterial proteins, with homology
to creatinine amidohydrolase from Pseudomonas. Members
occur only in the context of the mycofactocin system
[Unknown function, Enzymes of unknown specificity].
Length = 228
Score = 28.9 bits (65), Expect = 1.6
Identities = 10/29 (34%), Positives = 14/29 (48%)
Query: 170 MMWVSKHLARQYEEQQGNTETLTELMSAL 198
++ +S R + GNT L ELM L
Sbjct: 149 LLHLSPDRVRMDRAEAGNTAPLAELMPRL 177
>gnl|CDD|115863 pfam07234, DUF1426, Protein of unknown function (DUF1426). This
family consists of several Banana bunchy top virus
proteins of around 120 residues in length. One member
is annotated a movement protein whereas most other
family members are hypothetical. The function of this
family is unknown.
Length = 117
Score = 28.0 bits (62), Expect = 1.7
Identities = 11/35 (31%), Positives = 20/35 (57%)
Query: 49 KIPEAIKELTEYLKKFMTDQETWQELCDLYLSEGD 83
++P+ IK L YL +++T + W + L + GD
Sbjct: 37 EVPKYIKGLVRYLVEYLTRRRVWMQRTQLTEATGD 71
>gnl|CDD|236644 PRK10049, pgaA, outer membrane protein PgaA; Provisional.
Length = 765
Score = 28.9 bits (65), Expect = 1.9
Identities = 9/57 (15%), Positives = 15/57 (26%)
Query: 56 ELTEYLKKFMTDQETWQELCDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADIL 112
+ L D+ L +Y G + + M + P Y L
Sbjct: 104 VKAKQLVSGAPDKANLLALAYVYKRAGRHWDELRAMTQALPRAPQTQQYPTEYVQAL 160
>gnl|CDD|173359 PTZ00064, PTZ00064, histone acetyltransferase; Provisional.
Length = 552
Score = 28.4 bits (63), Expect = 2.5
Identities = 25/99 (25%), Positives = 38/99 (38%), Gaps = 10/99 (10%)
Query: 10 CFHFFRYE-EALEHLE-TIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKFMTD 67
C FF +E E + HL ++ N RK I + + + E YL K D
Sbjct: 286 CLDFFCFEDELIRHLSRCQLRHPPGNEIYRKDNISVFEIDGALTRGYAENLCYLAKLFLD 345
Query: 68 QETWQELCDLYL--------SEGDYAKAVFCMEELFLHH 98
+T Q + +L EG + F E++ L H
Sbjct: 346 HKTLQYDVEPFLFYIVTEVDEEGCHIVGYFSKEKVSLLH 384
>gnl|CDD|206645 cd00879, Sar1, Sar1 is an essential component of COPII vesicle
coats. Sar1 is an essential component of COPII vesicle
coats involved in export of cargo from the ER. The
GTPase activity of Sar1 functions as a molecular switch
to control protein-protein and protein-lipid
interactions that direct vesicle budding from the ER.
Activation of the GDP to the GTP-bound form of Sar1
involves the membrane-associated guanine nucleotide
exchange factor (GEF) Sec12. Sar1 is unlike all Ras
superfamily GTPases that use either myristoyl or prenyl
groups to direct membrane association and function, in
that Sar1 lacks such modification. Instead, Sar1
contains a unique nine-amino-acid N-terminal extension.
This extension contains an evolutionarily conserved
cluster of bulky hydrophobic amino acids, referred to as
the Sar1-N-terminal activation recruitment (STAR) motif.
The STAR motif mediates the recruitment of Sar1 to ER
membranes and facilitates its interaction with mammalian
Sec12 GEF leading to activation.
Length = 191
Score = 28.0 bits (63), Expect = 2.6
Identities = 14/53 (26%), Positives = 26/53 (49%), Gaps = 4/53 (7%)
Query: 4 IVYIIGCFHFFRYEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKE 56
IV+++ R++E+ E L++++ +E I IL K P A+ E
Sbjct: 90 IVFLVDAADPERFQESKEELDSLLNDEELAN----VPILILGNKIDKPGAVSE 138
>gnl|CDD|221596 pfam12481, DUF3700, Aluminium induced protein. This domain family
is found in eukaryotes, and is approximately 120 amino
acids in length. There are two conserved sequence
motifs: YGL and LRDR. This family is related to GATase
enzyme domains.
Length = 228
Score = 27.9 bits (63), Expect = 2.6
Identities = 14/48 (29%), Positives = 19/48 (39%), Gaps = 7/48 (14%)
Query: 98 HPHNHLLHQR----YADI--LYTQGGLENIELAISHYLMAINLNEKNI 139
H L R DI L+ QG LEN+ Y ++ NE +
Sbjct: 60 HSKQSPLLPRLFAVVDDIFCLF-QGHLENLASLRQQYGLSKTANEAML 106
>gnl|CDD|184851 PRK14849, PRK14849, putative lipoprotein/autotransporter
domain-containing protein; Provisional.
Length = 1806
Score = 28.5 bits (63), Expect = 2.9
Identities = 12/35 (34%), Positives = 17/35 (48%)
Query: 82 GDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQG 116
G Y + LF H H+ L +Y D L++QG
Sbjct: 1486 GSYISNIAAANSLFSHRLHDRLGEPQYIDSLHSQG 1520
>gnl|CDD|238649 cd01328, FSL_SPARC, Follistatin-like SPARC (secreted protein,
acidic, and rich in cysteines) domain;
SPARC/BM-40/osteonectin is a multifunctional
glycoprotein which modulates cellular interaction with
the extracellular matrix by its binding to structural
matrix proteins such as collagen and vitronectin. The
protein it composed of an N-terminal acidic region, a
follistatin (FS) domain and an EF-hand calcium binding
domain. The FS domain consists of an N-terminal beta
hairpin (FOLN/EGF-like domain) and a small hydrophobic
core of alpha/beta structure (Kazal domain) and has
five disulfide bonds and a conserved N-glycosylation
site. The FSL_SPARC domain is a member of the
superfamily of kazal-like proteinase inhibitors and
follistatin-like proteins.
Length = 86
Score = 26.7 bits (59), Expect = 2.9
Identities = 12/44 (27%), Positives = 20/44 (45%), Gaps = 5/44 (11%)
Query: 40 KICILKAKNK-----IPEAIKELTEYLKKFMTDQETWQELCDLY 78
K+C + +N I +E+ + K D ET+ C+LY
Sbjct: 11 KVCEVDDENTPKCVCIDPCPEEVDDRRKVCTNDNETFDSDCELY 54
>gnl|CDD|182415 PRK10370, PRK10370, formate-dependent nitrite reductase complex
subunit NrfG; Provisional.
Length = 198
Score = 27.9 bits (62), Expect = 3.2
Identities = 19/68 (27%), Positives = 30/68 (44%), Gaps = 1/68 (1%)
Query: 51 PEA-IKELTEYLKKFMTDQETWQELCDLYLSEGDYAKAVFCMEELFLHHPHNHLLHQRYA 109
PEA ++ L + ++ + E W L + YL DY A+ + N L+ A
Sbjct: 55 PEAQLQALQDKIRANPQNSEQWALLGEYYLWRNDYDNALLAYRQALQLRGENAELYAALA 114
Query: 110 DILYTQGG 117
+LY Q G
Sbjct: 115 TVLYYQAG 122
>gnl|CDD|165653 PLN00078, PLN00078, photosystem I reaction center subunit N
(PsaN); Provisional.
Length = 122
Score = 27.0 bits (59), Expect = 4.2
Identities = 18/59 (30%), Positives = 28/59 (47%), Gaps = 5/59 (8%)
Query: 16 YEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKE-----LTEYLKKFMTDQE 69
+ A +HL + + N +R+ + L + IPEA E L EYLKK ++E
Sbjct: 15 HRSAAKHLHPAAMLAQRNGISRRCLLTFLTSTAAIPEAGSESRKALLQEYLKKSEENKE 73
>gnl|CDD|184804 PRK14720, PRK14720, transcript cleavage factor/unknown domain
fusion protein; Provisional.
Length = 906
Score = 27.8 bits (62), Expect = 4.4
Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 3/48 (6%)
Query: 16 YEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKK 63
++E + L+ I++ D N AR+ I K K K L +YLK
Sbjct: 239 WDEVIYILKKILEHDNKNNKAREELIRFYKEKYK---DHSLLEDYLKM 283
>gnl|CDD|220821 pfam10602, RPN7, 26S proteasome subunit RPN7. RPN7 (known as the
non ATPase regulatory subunit 6 in higher eukaryotes)
is one of the lid subunits of the 26S proteasome and
has been shown in Saccharomyces cerevisiae to be
required for structural integrity. The 26S proteasome
is is involved in the ATP-dependent degradation of
ubiquitinated proteins.
Length = 174
Score = 27.1 bits (61), Expect = 4.5
Identities = 13/58 (22%), Positives = 22/58 (37%), Gaps = 8/58 (13%)
Query: 44 LKAKNKIPEAIKELTEYLKKFMTD------QETWQELCDLYLSEGDYAKAVFCMEELF 95
L+ KN+ E ++EL LK + + +L + Y GD A+
Sbjct: 8 LRKKNE--EELEELDAELKDAKENLGKEEIRRALLDLAEHYAKIGDLENALKAYSRAR 63
>gnl|CDD|177876 PLN02231, PLN02231, alanine transaminase.
Length = 534
Score = 28.0 bits (62), Expect = 4.5
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 43 ILKAKNKIPEAIKELTEYLKKFM 65
IL ++KIP + LTE+ K FM
Sbjct: 507 ILPQEDKIPAIVSRLTEFHKSFM 529
>gnl|CDD|226862 COG4455, ImpE, Protein of avirulence locus involved in
temperature-dependent protein secretion [General
function prediction only].
Length = 273
Score = 27.5 bits (61), Expect = 4.7
Identities = 12/40 (30%), Positives = 17/40 (42%)
Query: 48 NKIPEAIKELTEYLKKFMTDQETWQELCDLYLSEGDYAKA 87
N + +AI + +K TD L L GD+ KA
Sbjct: 15 NSLQDAIGLARDQVKAKPTDAGGRHFLFQLLCVAGDWEKA 54
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 27.2 bits (61), Expect = 6.1
Identities = 19/84 (22%), Positives = 36/84 (42%), Gaps = 2/84 (2%)
Query: 40 KICILKAKNKIPEAIKELTEYLKKFMTDQETWQEL-CDLYLSEGDYAKAVFCMEELFLHH 98
K +++ + EA++ L E K+ + D+ E+ DL L G +A L +
Sbjct: 10 KNSLIEESGDLEEALEHLEEKEKQ-IVDRLAVMEMRADLLLKLGRKEEAEATYRALLDRN 68
Query: 99 PHNHLLHQRYADILYTQGGLENIE 122
P N+ ++ L + +IE
Sbjct: 69 PENYDYYEGLQKALGLEISSGDIE 92
>gnl|CDD|234164 TIGR03302, OM_YfiO, outer membrane assembly lipoprotein YfiO.
Members of this protein family include YfiO, a
near-essential protein of the outer membrane, part of a
complex involved in protein insertion into the bacterial
outer membrane. Many proteins in this family are
annotated as ComL, based on the involvement of this
protein in natural transformation with exogenous DNA in
Neisseria gonorrhoeae. This protein family shows
sequence similarity to, but is distinct from, the
tol-pal system protein YbgF (TIGR02795) [Protein fate,
Protein and peptide secretion and trafficking].
Length = 235
Score = 26.7 bits (60), Expect = 6.8
Identities = 20/74 (27%), Positives = 28/74 (37%), Gaps = 3/74 (4%)
Query: 78 YLSEGDYAKAVFCMEELFLHHPHNHLLHQRYADILYTQGGLENIELAISHYLMAINL--N 135
L GDY +A+ E L +P + Q D+ Y + AI+ I L N
Sbjct: 43 ALDSGDYTEAIKYFEALESRYPFSPYAEQAQLDLAYAYYKSGDYAEAIAAADRFIRLHPN 102
Query: 136 EKNI-RALYGLALS 148
+ A Y LS
Sbjct: 103 HPDADYAYYLRGLS 116
>gnl|CDD|192936 pfam12095, DUF3571, Protein of unknown function (DUF3571). This
family of proteins is functionally uncharacterized.
This protein is found in bacteria and eukaryotes.
Proteins in this family are typically between 85 to 97
amino acids in length.
Length = 83
Score = 25.7 bits (57), Expect = 7.0
Identities = 8/37 (21%), Positives = 12/37 (32%), Gaps = 4/37 (10%)
Query: 52 EAIKELTEYLKKFMTDQETWQEL----CDLYLSEGDY 84
+ L L K + Q L C+L + G
Sbjct: 38 TRLDSLPADLAKLPSLDAQAQRLLDTACELEIGPGLT 74
>gnl|CDD|235288 PRK04346, PRK04346, tryptophan synthase subunit beta; Validated.
Length = 397
Score = 27.0 bits (61), Expect = 7.7
Identities = 14/40 (35%), Positives = 19/40 (47%), Gaps = 7/40 (17%)
Query: 50 IPE----AIKELTEYLKKFMTDQETWQELCDLYLSEGDYA 85
+PE A++EL E +K D E EL D L +Y
Sbjct: 19 VPETLMPALEELEEAYEKAKNDPEFQAEL-DYLLK--NYV 55
>gnl|CDD|233999 TIGR02757, TIGR02757, TIGR02757 family protein. Members of this
uncharacterized protein family are found sporadically,
so far only among spirochetes, epsilon and delta
proteobacteria, and Bacteroides. The function is unknown
and its gene neighborhoods show little conservation
[Hypothetical proteins, Conserved].
Length = 229
Score = 26.6 bits (59), Expect = 8.5
Identities = 12/41 (29%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 24 ETIIKIDETNTAARKRKICILKAKNKIPEAIKELTEYLKKF 64
+ I+ +D T+ +K+ +LK K+ +A E+TE L++
Sbjct: 171 DLILPLD-THVFRIAKKLKLLKRKSYDLKAAIEITEALREL 210
>gnl|CDD|225008 COG2097, RPL31A, Ribosomal protein L31E [Translation, ribosomal
structure and biogenesis].
Length = 89
Score = 25.4 bits (56), Expect = 8.9
Identities = 6/33 (18%), Positives = 16/33 (48%)
Query: 39 RKICILKAKNKIPEAIKELTEYLKKFMTDQETW 71
R+ + + P A+K + +++ + M +E
Sbjct: 16 RRAKKVPRTKRAPRAVKIIRKFVARHMKAEEVR 48
>gnl|CDD|237080 PRK12370, PRK12370, invasion protein regulator; Provisional.
Length = 553
Score = 26.7 bits (59), Expect = 9.1
Identities = 31/162 (19%), Positives = 61/162 (37%), Gaps = 18/162 (11%)
Query: 2 APIVYIIGCFHFF---RYEEALEHLETIIKIDETNTAARKRKICILKAKNKIPEAIKELT 58
+ + ++ F + EEAL+ + +K+D T AA K+ I I +AI+
Sbjct: 371 SADIKYYYGWNLFMAGQLEEALQTINECLKLDPTRAAAGITKLWITYYHTGIDDAIRLGD 430
Query: 59 EYLKKFMTDQETWQELCDLYLS-EGDYAKAVFCMEELFLHHPHNHLLHQRYADILYT--- 114
E + + D + ++LS +G + A +L + ++LY
Sbjct: 431 ELRSQHLQDNPILLSMQVMFLSLKGKHELA----RKLTKEISTQEITGLIAVNLLYAEYC 486
Query: 115 QGGLENIELAISHYLMAINLNEKN------IRALYGLALSCH 150
Q E I +L + + N + +G A++
Sbjct: 487 QNS-ERALPTIREFLESEQRIDNNPGLLPLVLVAHGEAIAEK 527
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.321 0.133 0.391
Gapped
Lambda K H
0.267 0.0733 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 10,190,980
Number of extensions: 948496
Number of successful extensions: 1390
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1373
Number of HSP's successfully gapped: 131
Length of query: 201
Length of database: 10,937,602
Length adjustment: 92
Effective length of query: 109
Effective length of database: 6,857,034
Effective search space: 747416706
Effective search space used: 747416706
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 56 (25.3 bits)