RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8426
         (278 letters)



>gnl|CDD|214614 smart00317, SET, SET (Su(var)3-9, Enhancer-of-zeste, Trithorax)
           domain.  Putative methyl transferase, based on outlier
           plant homologues.
          Length = 124

 Score = 80.5 bits (199), Expect = 4e-19
 Identities = 33/125 (26%), Positives = 49/125 (39%), Gaps = 9/125 (7%)

Query: 68  PCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCR 127
                     +G  + AT    K + I   VG I    +E E+R      +     Y   
Sbjct: 1   NKLEVFKSPGKGWGVRATEDIPKGEFIGEYVGEIITS-EEAEERPKAYDTDGAKAFYLFD 59

Query: 128 KNC-----AQLWLGPAAYINHDCRPNCRFVATERDT---ACVKVLRDIEQGEEITCFYGE 179
            +      A+     A +INH C PNC  +  E +      +  LRDI+ GEE+T  YG 
Sbjct: 60  IDSDLCIDARRKGNLARFINHSCEPNCELLFVEVNGDDRIVIFALRDIKPGEELTIDYGS 119

Query: 180 DFFGD 184
           D+  +
Sbjct: 120 DYANE 124


>gnl|CDD|216155 pfam00856, SET, SET domain.  SET domains are protein lysine
           methyltransferase enzymes. SET domains appear to be
           protein-protein interaction domains. It has been
           demonstrated that SET domains mediate interactions with
           a family of proteins that display similarity with
           dual-specificity phosphatases (dsPTPases). A subset of
           SET domains have been called PR domains. These domains
           are divergent in sequence from other SET domains, but
           also appear to mediate protein-protein interaction. The
           SET domain consists of two regions known as SET-N and
           SET-C. SET-C forms an unusual and conserved knot-like
           structure of probably functional importance.
           Additionally to SET-N and SET-C, an insert region
           (SET-I) and flanking regions of high structural
           variability form part of the overall structure.
          Length = 113

 Score = 67.5 bits (165), Expect = 2e-14
 Identities = 35/110 (31%), Positives = 44/110 (40%), Gaps = 13/110 (11%)

Query: 82  IVATRKWEKNDNIAYLVGCI----------AELTKEEEKRLLHTGLNDFSVMYSCRKNCA 131
           + ATR   K + I   VG +              KEE + LL       S + S     A
Sbjct: 4   LFATRDIPKGELIIEYVGELITPEEAEERELLYNKEELRGLLSDLELFLSRLDSEYDIDA 63

Query: 132 QLWLGPAAYINHDCRPNCRFV---ATERDTACVKVLRDIEQGEEITCFYG 178
                 A +INH C PNC          D   V+ LRDI+ GEE+T  YG
Sbjct: 64  TGLGNVARFINHSCEPNCEVRFVFVNGGDRIVVRALRDIKPGEELTIDYG 113


>gnl|CDD|225491 COG2940, COG2940, Proteins containing SET domain [General function
           prediction only].
          Length = 480

 Score = 42.1 bits (99), Expect = 2e-04
 Identities = 31/111 (27%), Positives = 42/111 (37%), Gaps = 13/111 (11%)

Query: 101 IAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERD--- 157
                +EE   LL    +   +    +   +Q     A +INH C PNC     E +   
Sbjct: 369 KEAREREENYDLLGNEFSFGLLEDKDKVRDSQKAGDVARFINHSCTPNCEASPIEVNGIF 428

Query: 158 TACVKVLRDIEQGEEITCFYG---EDFFGDK-------NCLCECDTCEIRM 198
              +  +RDI+ GEE+T  YG   ED    K        C C  D C   M
Sbjct: 429 KISIYAIRDIKAGEELTYDYGPSLEDNRELKKLLEKRWGCACGEDRCSHTM 479


>gnl|CDD|131771 TIGR02724, phenyl_P_beta, phenylphosphate carboxylase, beta
           subunit.  Members of this protein family are the beta
           subunit of phenylphosphate carboxylase. Phenol
           (methyl-benzene) is converted to phenylphosphate, then
           para-carboxylated by this four-subunit enzyme, with the
           release of phosphate, to 4-hydroxybenzoate. The enzyme
           contains neither biotin nor thiamin pyrophosphate. This
           beta subunit is homologous to the alpha subunit and,
           more broadly, to UbiD family decarboxylases.
          Length = 472

 Score = 33.5 bits (76), Expect = 0.10
 Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 4/82 (4%)

Query: 57  SYRIQYLPELKPCYRYSLEGNRGARIVAT-RKWEKNDNIAYLVGCIAELTKEEEKRLLHT 115
           +YR+Q L + K C    L G RG RI+    K  K    A ++GC   +       L+H 
Sbjct: 156 TYRMQMLDD-KRCGVQILPGKRGERIMKKYAKLGKKMPAAAIIGCDPLIFMA--GTLMHK 212

Query: 116 GLNDFSVMYSCRKNCAQLWLGP 137
           G +DF +  + R   A+  + P
Sbjct: 213 GASDFDITGTVRGQQAEFLMAP 234


>gnl|CDD|221583 pfam12449, DUF3684, Protein of unknown function (DUF3684).  This
           domain family is found in eukaryotes, and is typically
           between 1072 and 1090 amino acids in length.
          Length = 1084

 Score = 30.4 bits (69), Expect = 1.1
 Identities = 11/36 (30%), Positives = 14/36 (38%)

Query: 77  NRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRL 112
           N         KW   D I YL     ++  E+ KRL
Sbjct: 611 NPTEAGSGKGKWSHVDLIKYLASVRDDIPAEDIKRL 646


>gnl|CDD|237326 PRK13271, treA, trehalase; Provisional.
          Length = 569

 Score = 30.3 bits (68), Expect = 1.2
 Identities = 20/66 (30%), Positives = 31/66 (46%), Gaps = 14/66 (21%)

Query: 11  HVPIQLACHLLNKLNPSFWAFCVQALLAQRFNIGRILDMGTQHHRYSYRIQYLPELKPCY 70
           H+P     + L++  P F+A  V+ LLAQ             H   +   QYLP+++  Y
Sbjct: 197 HIPNGNRSYYLSRSQPPFFALMVE-LLAQ-------------HEGDAALKQYLPQMQKEY 242

Query: 71  RYSLEG 76
            Y +EG
Sbjct: 243 AYWMEG 248


>gnl|CDD|185614 PTZ00434, PTZ00434, cytosolic glyceraldehyde 3-phosphate
          dehydrogenase; Provisional.
          Length = 361

 Score = 29.6 bits (66), Expect = 1.8
 Identities = 14/58 (24%), Positives = 30/58 (51%), Gaps = 1/58 (1%)

Query: 28 FWAFCVQALLAQRFNIGRILDMGTQHHRYSYRIQYLPELKPCYRYSLEGNRGARIVAT 85
          F A C Q L+    ++  ++DM T    ++Y+++Y   +    +Y++E  + +  V T
Sbjct: 19 FQAICDQGLIGTEIDVVAVVDMSTNAEYFAYQMKY-DTVHGRPKYTVETTKSSPSVKT 75


>gnl|CDD|107086 PHA01816, PHA01816, hypothetical protein.
          Length = 160

 Score = 28.5 bits (63), Expect = 2.3
 Identities = 14/43 (32%), Positives = 25/43 (58%)

Query: 226 LNKRKLHKKLNRLLLASDKNDTNSSDNSSKYSLVQWNNNSKEN 268
           L +   HKKLN +L   ++ND +++ N+ K+    +N  +K N
Sbjct: 73  LVRGSRHKKLNYILEIYNRNDDSNNKNAKKHKYALYNLQAKNN 115


>gnl|CDD|181592 PRK08947, fadA, 3-ketoacyl-CoA thiolase; Reviewed.
          Length = 387

 Score = 28.8 bits (65), Expect = 3.6
 Identities = 16/42 (38%), Positives = 19/42 (45%), Gaps = 13/42 (30%)

Query: 16 LACHLLNKL---NPSF---------WAFCVQALLAQRFNIGR 45
          L+ HL+  L   NP+          W  CVQ  L Q FNI R
Sbjct: 30 LSAHLMRSLLARNPALDPAEIDDIIWG-CVQQTLEQGFNIAR 70


>gnl|CDD|235869 PRK06830, PRK06830, diphosphate--fructose-6-phosphate
           1-phosphotransferase; Provisional.
          Length = 443

 Score = 28.3 bits (64), Expect = 4.4
 Identities = 10/29 (34%), Positives = 14/29 (48%), Gaps = 2/29 (6%)

Query: 3   SIVHNLILHVPIQLACHLLNKLNP--SFW 29
              +N  +H+PI LA     K+NP    W
Sbjct: 399 GRWNNRFVHLPIDLAVSKRKKVNPEGDLW 427


>gnl|CDD|234548 TIGR04331, o_ant_LIC12162, putative transferase, LIC12162 family.
           This protein family shows C-terminal sequence similarity
           to various surface carbohydrate biosynthesis enzymes:
           spore coat polysaccharide biosynthesis protein SpsB,
           UDP-N-acetyl-D-glucosamine 2-epimerase, lipid A
           disaccharide synthetase LpxB, etc. It may occur in
           O-antigen biosythesis regions.
          Length = 585

 Score = 28.1 bits (63), Expect = 6.7
 Identities = 12/24 (50%), Positives = 14/24 (58%)

Query: 42  NIGRILDMGTQHHRYSYRIQYLPE 65
           N G IL +GT   RYSYR+   P 
Sbjct: 392 NNGEILFVGTAMPRYSYRLASGPT 415


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.323    0.137    0.435 

Gapped
Lambda     K      H
   0.267   0.0717    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,570,660
Number of extensions: 1227294
Number of successful extensions: 870
Number of sequences better than 10.0: 1
Number of HSP's gapped: 865
Number of HSP's successfully gapped: 14
Length of query: 278
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 182
Effective length of database: 6,679,618
Effective search space: 1215690476
Effective search space used: 1215690476
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 58 (26.1 bits)