RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy8426
(278 letters)
>gnl|CDD|214614 smart00317, SET, SET (Su(var)3-9, Enhancer-of-zeste, Trithorax)
domain. Putative methyl transferase, based on outlier
plant homologues.
Length = 124
Score = 80.5 bits (199), Expect = 4e-19
Identities = 33/125 (26%), Positives = 49/125 (39%), Gaps = 9/125 (7%)
Query: 68 PCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCR 127
+G + AT K + I VG I +E E+R + Y
Sbjct: 1 NKLEVFKSPGKGWGVRATEDIPKGEFIGEYVGEIITS-EEAEERPKAYDTDGAKAFYLFD 59
Query: 128 KNC-----AQLWLGPAAYINHDCRPNCRFVATERDT---ACVKVLRDIEQGEEITCFYGE 179
+ A+ A +INH C PNC + E + + LRDI+ GEE+T YG
Sbjct: 60 IDSDLCIDARRKGNLARFINHSCEPNCELLFVEVNGDDRIVIFALRDIKPGEELTIDYGS 119
Query: 180 DFFGD 184
D+ +
Sbjct: 120 DYANE 124
>gnl|CDD|216155 pfam00856, SET, SET domain. SET domains are protein lysine
methyltransferase enzymes. SET domains appear to be
protein-protein interaction domains. It has been
demonstrated that SET domains mediate interactions with
a family of proteins that display similarity with
dual-specificity phosphatases (dsPTPases). A subset of
SET domains have been called PR domains. These domains
are divergent in sequence from other SET domains, but
also appear to mediate protein-protein interaction. The
SET domain consists of two regions known as SET-N and
SET-C. SET-C forms an unusual and conserved knot-like
structure of probably functional importance.
Additionally to SET-N and SET-C, an insert region
(SET-I) and flanking regions of high structural
variability form part of the overall structure.
Length = 113
Score = 67.5 bits (165), Expect = 2e-14
Identities = 35/110 (31%), Positives = 44/110 (40%), Gaps = 13/110 (11%)
Query: 82 IVATRKWEKNDNIAYLVGCI----------AELTKEEEKRLLHTGLNDFSVMYSCRKNCA 131
+ ATR K + I VG + KEE + LL S + S A
Sbjct: 4 LFATRDIPKGELIIEYVGELITPEEAEERELLYNKEELRGLLSDLELFLSRLDSEYDIDA 63
Query: 132 QLWLGPAAYINHDCRPNCRFV---ATERDTACVKVLRDIEQGEEITCFYG 178
A +INH C PNC D V+ LRDI+ GEE+T YG
Sbjct: 64 TGLGNVARFINHSCEPNCEVRFVFVNGGDRIVVRALRDIKPGEELTIDYG 113
>gnl|CDD|225491 COG2940, COG2940, Proteins containing SET domain [General function
prediction only].
Length = 480
Score = 42.1 bits (99), Expect = 2e-04
Identities = 31/111 (27%), Positives = 42/111 (37%), Gaps = 13/111 (11%)
Query: 101 IAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERD--- 157
+EE LL + + + +Q A +INH C PNC E +
Sbjct: 369 KEAREREENYDLLGNEFSFGLLEDKDKVRDSQKAGDVARFINHSCTPNCEASPIEVNGIF 428
Query: 158 TACVKVLRDIEQGEEITCFYG---EDFFGDK-------NCLCECDTCEIRM 198
+ +RDI+ GEE+T YG ED K C C D C M
Sbjct: 429 KISIYAIRDIKAGEELTYDYGPSLEDNRELKKLLEKRWGCACGEDRCSHTM 479
>gnl|CDD|131771 TIGR02724, phenyl_P_beta, phenylphosphate carboxylase, beta
subunit. Members of this protein family are the beta
subunit of phenylphosphate carboxylase. Phenol
(methyl-benzene) is converted to phenylphosphate, then
para-carboxylated by this four-subunit enzyme, with the
release of phosphate, to 4-hydroxybenzoate. The enzyme
contains neither biotin nor thiamin pyrophosphate. This
beta subunit is homologous to the alpha subunit and,
more broadly, to UbiD family decarboxylases.
Length = 472
Score = 33.5 bits (76), Expect = 0.10
Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 4/82 (4%)
Query: 57 SYRIQYLPELKPCYRYSLEGNRGARIVAT-RKWEKNDNIAYLVGCIAELTKEEEKRLLHT 115
+YR+Q L + K C L G RG RI+ K K A ++GC + L+H
Sbjct: 156 TYRMQMLDD-KRCGVQILPGKRGERIMKKYAKLGKKMPAAAIIGCDPLIFMA--GTLMHK 212
Query: 116 GLNDFSVMYSCRKNCAQLWLGP 137
G +DF + + R A+ + P
Sbjct: 213 GASDFDITGTVRGQQAEFLMAP 234
>gnl|CDD|221583 pfam12449, DUF3684, Protein of unknown function (DUF3684). This
domain family is found in eukaryotes, and is typically
between 1072 and 1090 amino acids in length.
Length = 1084
Score = 30.4 bits (69), Expect = 1.1
Identities = 11/36 (30%), Positives = 14/36 (38%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRL 112
N KW D I YL ++ E+ KRL
Sbjct: 611 NPTEAGSGKGKWSHVDLIKYLASVRDDIPAEDIKRL 646
>gnl|CDD|237326 PRK13271, treA, trehalase; Provisional.
Length = 569
Score = 30.3 bits (68), Expect = 1.2
Identities = 20/66 (30%), Positives = 31/66 (46%), Gaps = 14/66 (21%)
Query: 11 HVPIQLACHLLNKLNPSFWAFCVQALLAQRFNIGRILDMGTQHHRYSYRIQYLPELKPCY 70
H+P + L++ P F+A V+ LLAQ H + QYLP+++ Y
Sbjct: 197 HIPNGNRSYYLSRSQPPFFALMVE-LLAQ-------------HEGDAALKQYLPQMQKEY 242
Query: 71 RYSLEG 76
Y +EG
Sbjct: 243 AYWMEG 248
>gnl|CDD|185614 PTZ00434, PTZ00434, cytosolic glyceraldehyde 3-phosphate
dehydrogenase; Provisional.
Length = 361
Score = 29.6 bits (66), Expect = 1.8
Identities = 14/58 (24%), Positives = 30/58 (51%), Gaps = 1/58 (1%)
Query: 28 FWAFCVQALLAQRFNIGRILDMGTQHHRYSYRIQYLPELKPCYRYSLEGNRGARIVAT 85
F A C Q L+ ++ ++DM T ++Y+++Y + +Y++E + + V T
Sbjct: 19 FQAICDQGLIGTEIDVVAVVDMSTNAEYFAYQMKY-DTVHGRPKYTVETTKSSPSVKT 75
>gnl|CDD|107086 PHA01816, PHA01816, hypothetical protein.
Length = 160
Score = 28.5 bits (63), Expect = 2.3
Identities = 14/43 (32%), Positives = 25/43 (58%)
Query: 226 LNKRKLHKKLNRLLLASDKNDTNSSDNSSKYSLVQWNNNSKEN 268
L + HKKLN +L ++ND +++ N+ K+ +N +K N
Sbjct: 73 LVRGSRHKKLNYILEIYNRNDDSNNKNAKKHKYALYNLQAKNN 115
>gnl|CDD|181592 PRK08947, fadA, 3-ketoacyl-CoA thiolase; Reviewed.
Length = 387
Score = 28.8 bits (65), Expect = 3.6
Identities = 16/42 (38%), Positives = 19/42 (45%), Gaps = 13/42 (30%)
Query: 16 LACHLLNKL---NPSF---------WAFCVQALLAQRFNIGR 45
L+ HL+ L NP+ W CVQ L Q FNI R
Sbjct: 30 LSAHLMRSLLARNPALDPAEIDDIIWG-CVQQTLEQGFNIAR 70
>gnl|CDD|235869 PRK06830, PRK06830, diphosphate--fructose-6-phosphate
1-phosphotransferase; Provisional.
Length = 443
Score = 28.3 bits (64), Expect = 4.4
Identities = 10/29 (34%), Positives = 14/29 (48%), Gaps = 2/29 (6%)
Query: 3 SIVHNLILHVPIQLACHLLNKLNP--SFW 29
+N +H+PI LA K+NP W
Sbjct: 399 GRWNNRFVHLPIDLAVSKRKKVNPEGDLW 427
>gnl|CDD|234548 TIGR04331, o_ant_LIC12162, putative transferase, LIC12162 family.
This protein family shows C-terminal sequence similarity
to various surface carbohydrate biosynthesis enzymes:
spore coat polysaccharide biosynthesis protein SpsB,
UDP-N-acetyl-D-glucosamine 2-epimerase, lipid A
disaccharide synthetase LpxB, etc. It may occur in
O-antigen biosythesis regions.
Length = 585
Score = 28.1 bits (63), Expect = 6.7
Identities = 12/24 (50%), Positives = 14/24 (58%)
Query: 42 NIGRILDMGTQHHRYSYRIQYLPE 65
N G IL +GT RYSYR+ P
Sbjct: 392 NNGEILFVGTAMPRYSYRLASGPT 415
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.323 0.137 0.435
Gapped
Lambda K H
0.267 0.0717 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,570,660
Number of extensions: 1227294
Number of successful extensions: 870
Number of sequences better than 10.0: 1
Number of HSP's gapped: 865
Number of HSP's successfully gapped: 14
Length of query: 278
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 182
Effective length of database: 6,679,618
Effective search space: 1215690476
Effective search space used: 1215690476
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 58 (26.1 bits)