RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14212
(111 letters)
>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
34 amino acids
[WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-
[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence;
found in a variety of organisms including bacteria,
cyanobacteria, yeast, fungi, plants, and humans in
various subcellular locations; involved in a variety of
functions including protein-protein interactions, but
common features in the interaction partners have not
been defined; involved in chaperone, cell-cycle,
transciption, and protein transport complexes; the
number of TPR motifs varies among proteins (1,3-11,13
15,16,19); 5-6 tandem repeats generate a right-handed
helical structure with an amphipathic channel that is
thought to accomodate an alpha-helix of a target
protein; it has been proposed that TPR proteins
preferably interact with WD-40 repeat proteins, but in
many instances several TPR-proteins seem to aggregate
to multi-protein complexes; examples of TPR-proteins
include, Cdc16p, Cdc23p and Cdc27p components of the
cyclosome/APC, the Pex5p/Pas10p receptor for
peroxisomal targeting signals, the Tom70p co-receptor
for mitochondrial targeting signals, Ser/Thr
phosphatase 5C and the p110 subunit of O-GlcNAc
transferase; three copies of the repeat are present
here.
Length = 100
Score = 30.8 bits (70), Expect = 0.054
Identities = 15/53 (28%), Positives = 23/53 (43%), Gaps = 3/53 (5%)
Query: 9 PHNHLLHQRYADILYTQGGLENIELAISHYLMAINLNEKNIRALYGLALSCHQ 61
P N + A Y G E A+ Y A+ L+ N +A Y L L+ ++
Sbjct: 31 PDNADAYYNLAAAYYKLG---KYEEALEDYEKALELDPDNAKAYYNLGLAYYK 80
Score = 27.0 bits (60), Expect = 1.2
Identities = 10/30 (33%), Positives = 16/30 (53%)
Query: 28 LENIELAISHYLMAINLNEKNIRALYGLAL 57
L + + A+ +Y A+ L+ N A Y LA
Sbjct: 13 LGDYDEALEYYEKALELDPDNADAYYNLAA 42
>gnl|CDD|217453 pfam03251, Tymo_45kd_70kd, Tymovirus 45/70Kd protein. Tymoviruses
are single stranded RNA viruses. This family includes a
protein of unknown function that has been named based on
its molecular weight. Tymoviruses such as the ononis
yellow mosaic tymovirus encode only three proteins. Of
these two are overlapping this protein overlaps a larger
ORF that is thought to be the polymerase.
Length = 458
Score = 28.9 bits (65), Expect = 0.44
Identities = 9/24 (37%), Positives = 13/24 (54%), Gaps = 2/24 (8%)
Query: 4 LFLHHP--HNHLLHQRYADILYTQ 25
LHH H + LH R +D+L +
Sbjct: 130 HHLHHSSKHRNRLHARRSDVLPSL 153
>gnl|CDD|221596 pfam12481, DUF3700, Aluminium induced protein. This domain family
is found in eukaryotes, and is approximately 120 amino
acids in length. There are two conserved sequence
motifs: YGL and LRDR. This family is related to GATase
enzyme domains.
Length = 228
Score = 28.3 bits (64), Expect = 0.62
Identities = 14/48 (29%), Positives = 19/48 (39%), Gaps = 7/48 (14%)
Query: 8 HPHNHLLHQR----YADI--LYTQGGLENIELAISHYLMAINLNEKNI 49
H L R DI L+ QG LEN+ Y ++ NE +
Sbjct: 60 HSKQSPLLPRLFAVVDDIFCLF-QGHLENLASLRQQYGLSKTANEAML 106
>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat.
Length = 69
Score = 26.9 bits (60), Expect = 0.75
Identities = 13/45 (28%), Positives = 20/45 (44%), Gaps = 3/45 (6%)
Query: 13 LLHQRYADILYTQGGLENIELAISHYLMAINLNEKNIRALYGLAL 57
+ + L+ G + + AI Y A+ L+ N A Y LAL
Sbjct: 4 EALKNLGNALFKLG---DYDEAIEAYEKALELDPDNAEAYYNLAL 45
Score = 24.6 bits (54), Expect = 6.5
Identities = 8/38 (21%), Positives = 14/38 (36%), Gaps = 2/38 (5%)
Query: 9 PHNHLLHQRYADILYTQGGLENIELAISHYLMAINLNE 46
P N + A G ++ E A+ A+ L+
Sbjct: 34 PDNAEAYYNLALAYLKLG--KDYEEALEDLEKALELDP 69
>gnl|CDD|234418 TIGR03964, mycofact_creat, mycofactocin system creatininase family
protein. Members of this protein family are
uncharacterized Actinobacterial proteins, with homology
to creatinine amidohydrolase from Pseudomonas. Members
occur only in the context of the mycofactocin system
[Unknown function, Enzymes of unknown specificity].
Length = 228
Score = 27.7 bits (62), Expect = 0.97
Identities = 10/29 (34%), Positives = 14/29 (48%)
Query: 80 MMWVSKHLARQYEEQQGNTETLTELMSAL 108
++ +S R + GNT L ELM L
Sbjct: 149 LLHLSPDRVRMDRAEAGNTAPLAELMPRL 177
>gnl|CDD|239717 cd03748, Ntn_PGA, Penicillin G acylase (PGA) is the key enzyme in
the industrial production of beta-lactam antibiotics.
PGA hydrolyzes the side chain of penicillin G and
related beta-lactam antibiotics releasing 6-amino
penicillanic acid (6-APA), a building block in the
production of semisynthetic penicillins. PGA is widely
distributed among microorganisms, including bacteria,
yeast and filamentous fungi but it's in vivo role
remains unclear.
Length = 488
Score = 26.3 bits (58), Expect = 3.5
Identities = 11/39 (28%), Positives = 18/39 (46%), Gaps = 2/39 (5%)
Query: 66 AKCSAAKKKEISKQMMWVSKHLARQYEE--QQGNTETLT 102
+K A E+ M W + A+ +EE Q + + LT
Sbjct: 138 SKARAWDGYELQSLMAWTKQTKAKNWEEWLDQASKQALT 176
>gnl|CDD|217874 pfam04074, DUF386, Domain of unknown function (DUF386). This
family consists of conserved hypothetical proteins,
typically about 150 amino acids in length, with no
known function.
Length = 153
Score = 25.7 bits (57), Expect = 4.1
Identities = 10/23 (43%), Positives = 13/23 (56%)
Query: 14 LHQRYADILYTQGGLENIELAIS 36
+H+RY DI G E IE A +
Sbjct: 65 VHRRYIDIQILLEGQEKIEYATA 87
>gnl|CDD|200386 TIGR04135, FibroRuminTarg, Cys-rich radical SAM target,
FibroRumin family. Members of this protein family are
cysteine-rich small peptides, about 52 amino acids
long, that are proposed targets for modification by a
radical SAM enzyme. Known occurrences are as tandem
gene pairs Fibrobacter succinogenes subsp. succinogenes
S85 (missed gene calls) and in Ruminococcus albus 8.
Length = 52
Score = 24.7 bits (53), Expect = 4.3
Identities = 8/37 (21%), Positives = 20/37 (54%)
Query: 39 LMAINLNEKNIRALYGLALSCHQVLTSAKCSAAKKKE 75
M N+ ++ + A ++C+ + KC A+K+++
Sbjct: 10 EMFGNIADEFMAAAACACVACNGCKCACKCRASKEED 46
>gnl|CDD|131755 TIGR02708, L_lactate_ox, L-lactate oxidase. Members of this
protein oxidize L-lactate to pyruvate, reducing
molecular oxygen to hydrogen peroxide. The enzyme is
known in Aerococcus viridans, Streptococcus iniae, and
some strains of Streptococcus pyogenes where it appears
to contribute to virulence [Energy metabolism, Other].
Length = 367
Score = 26.0 bits (57), Expect = 4.8
Identities = 8/28 (28%), Positives = 15/28 (53%)
Query: 81 MWVSKHLARQYEEQQGNTETLTELMSAL 108
+WV+ H RQ + ++L E+ A+
Sbjct: 253 IWVTNHGGRQLDGGPAAFDSLQEVAEAV 280
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 25.7 bits (56), Expect = 6.7
Identities = 15/39 (38%), Positives = 20/39 (51%), Gaps = 3/39 (7%)
Query: 72 KKKEISKQMMWVSKHLARQYEEQQGNTETLTELMSALQV 110
K KE +M+ R+Y+E + TE LMSALQ
Sbjct: 550 KLKEEECRMLEKEAQELRKYQESEKETEV---LMSALQA 585
>gnl|CDD|213811 TIGR03427, ABC_peri_uca, ABC transporter periplasmic binding
protein, urea carboxylase region. Members of this
family are ABC transporter periplasmic binding proteins
associated with the urea carboxylase/allophanate
hydrolase pathway, an alternative to urease for urea
degradation. The protein is restricted to bacteria with
the pathway, with its gene close to the urea carboxylase
and allophanate hydrolase genes. The substrate for this
transporter therefore is likely to be urea or a compound
from which urea is easily derived [Transport and binding
proteins, Unknown substrate].
Length = 328
Score = 25.5 bits (56), Expect = 6.8
Identities = 10/25 (40%), Positives = 19/25 (76%), Gaps = 5/25 (20%)
Query: 31 IELAISHYLMA-----INLNEKNIR 50
+EL++SHYL+A + L+EK+++
Sbjct: 112 VELSVSHYLLARALESVGLSEKDVK 136
>gnl|CDD|182380 PRK10328, PRK10328, DNA binding protein, nucleoid-associated;
Provisional.
Length = 134
Score = 24.8 bits (54), Expect = 8.2
Identities = 19/74 (25%), Positives = 31/74 (41%), Gaps = 14/74 (18%)
Query: 39 LMAINLNEKNIRALYGLALS-----CHQVLTSAKCSAAKKKEISKQMMWVSKHLARQYEE 93
+M +LN NIR L +A ++L + +++E +Q R+ E
Sbjct: 3 VMLQSLN--NIRTLRAMAREFSIDVLEEMLEKFRVVTKERREEEEQQ-------QRELAE 53
Query: 94 QQGNTETLTELMSA 107
+Q T ELM A
Sbjct: 54 RQEKINTWLELMKA 67
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.128 0.367
Gapped
Lambda K H
0.267 0.0867 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 5,405,043
Number of extensions: 440219
Number of successful extensions: 430
Number of sequences better than 10.0: 1
Number of HSP's gapped: 429
Number of HSP's successfully gapped: 42
Length of query: 111
Length of database: 10,937,602
Length adjustment: 75
Effective length of query: 36
Effective length of database: 7,611,052
Effective search space: 273997872
Effective search space used: 273997872
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 53 (23.9 bits)