RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy2450
         (254 letters)



>gnl|CDD|197727 smart00443, G_patch, glycine rich nucleic binding domain.  A
           predicted glycine rich nucleic binding domain found in
           the splicing factor 45, SON DNA binding protein and
           D-type Retrovirus- polyproteins.
          Length = 47

 Score = 60.3 bits (147), Expect = 1e-12
 Identities = 24/47 (51%), Positives = 28/47 (59%)

Query: 180 IGADNVGNKLLQKMGWTQGQGLGKTNQGRTSIIEAEARISSAGLGTN 226
           I   N+G KLL+KMGW +GQGLGK  QG    I AE +    GLG  
Sbjct: 1   ISTSNIGAKLLRKMGWKEGQGLGKNEQGIVEPISAEIKKDRKGLGAV 47


>gnl|CDD|144978 pfam01585, G-patch, G-patch domain.  This domain is found in a
           number of RNA binding proteins, and is also found in
           proteins that contain RNA binding domains. This suggests
           that this domain may have an RNA binding function. This
           domain has seven highly conserved glycines.
          Length = 45

 Score = 59.1 bits (144), Expect = 4e-12
 Identities = 26/42 (61%), Positives = 28/42 (66%)

Query: 183 DNVGNKLLQKMGWTQGQGLGKTNQGRTSIIEAEARISSAGLG 224
            N+G KLLQKMGW  GQGLGK  QG T  IEA+ R    GLG
Sbjct: 2   SNIGFKLLQKMGWKPGQGLGKNEQGITEPIEAKIRPDRKGLG 43


>gnl|CDD|221692 pfam12656, G-patch_2, DExH-box splicing factor binding site.  Yeast
           Spp2, a G-patch protein and spliceosome component,
           interacts with the ATP-dependent DExH-box splicing
           factor Prp2. As this interaction involves the G-patch
           sequence in Spp2 and is required for the recruitment of
           Prp2 to the spliceosome before the first catalytic step
           of splicing, it is proposed that Spp2 might be an
           accessory factor that confers spliceosome specificity on
           Prp2.
          Length = 79

 Score = 45.8 bits (109), Expect = 6e-07
 Identities = 18/46 (39%), Positives = 24/46 (52%)

Query: 183 DNVGNKLLQKMGWTQGQGLGKTNQGRTSIIEAEARISSAGLGTNAI 228
           +  G  LL+ MGW +GQG+GK N+G     E + R    GLG    
Sbjct: 31  EEFGAALLRGMGWKEGQGIGKNNKGDVKPKEPKRRPGGLGLGAKPA 76


>gnl|CDD|206084 pfam13913, zf-C2HC_2, zinc-finger of a C2HC-type.  This family
           contains a number of divergent C2H2 type zinc fingers.
          Length = 25

 Score = 27.2 bits (61), Expect = 0.86
 Identities = 9/17 (52%), Positives = 10/17 (58%), Gaps = 1/17 (5%)

Query: 88  CLICKRQFNSKDILKKH 104
           C IC R+F   D L KH
Sbjct: 5   CPICGRKFAP-DRLAKH 20


>gnl|CDD|206065 pfam13894, zf-C2H2_4, C2H2-type zinc finger.  This family contains
           a number of divergent C2H2 type zinc fingers.
          Length = 24

 Score = 26.8 bits (59), Expect = 1.1
 Identities = 9/17 (52%), Positives = 13/17 (76%)

Query: 88  CLICKRQFNSKDILKKH 104
           C +C + F+SKD LK+H
Sbjct: 3   CPLCGKSFSSKDALKRH 19


>gnl|CDD|204841 pfam12171, zf-C2H2_jaz, Zinc-finger double-stranded RNA-binding.
           This domain family is found in archaea and eukaryotes,
           and is approximately 30 amino acids in length. The
           mammalian members of this group occur multiple times
           along the protein, joined by flexible linkers, and are
           referred to as JAZ - dsRNA-binding ZF protein -
           zinc-fingers. The JAZ proteins are expressed in all
           tissues tested and localise in the nucleus, particularly
           the nucleolus. JAZ preferentially binds to
           double-stranded (ds) RNA or RNA/DNA hybrids rather than
           DNA. In addition to binding double-stranded RNA, these
           zinc-fingers are required for nucleolar localisation.
          Length = 27

 Score = 26.4 bits (59), Expect = 1.3
 Identities = 9/25 (36%), Positives = 16/25 (64%), Gaps = 1/25 (4%)

Query: 88  CLICKRQFNSKDILKKHTQLSELHK 112
           C+ C + F S++ L+ H + S+ HK
Sbjct: 4   CVACDKYFKSENALENHLK-SKKHK 27


>gnl|CDD|205121 pfam12874, zf-met, Zinc-finger of C2H2 type.  This is a zinc-finger
           domain with the CxxCx(12)Hx(6)H motif, found in multiple
           copies in a wide range of proteins from plants to
           metazoans. Some member proteins, particularly those from
           plants, are annotated as being RNA-binding.
          Length = 25

 Score = 26.3 bits (59), Expect = 1.4
 Identities = 7/19 (36%), Positives = 10/19 (52%)

Query: 88  CLICKRQFNSKDILKKHTQ 106
           C +C   F S+  LK H +
Sbjct: 3   CELCNVTFTSESQLKSHLR 21


>gnl|CDD|185438 PTZ00087, PTZ00087, thrombosponding-related protein; Provisional.
          Length = 340

 Score = 29.5 bits (66), Expect = 1.7
 Identities = 19/92 (20%), Positives = 39/92 (42%), Gaps = 15/92 (16%)

Query: 39  VCFSVVGNNREEKTKKSLVAAYESDE-------EPE--VQEETVSGEEKQQIDWDKLACL 89
           +C+  + N  + +     +   + +        EPE  V +    G+ KQ+ D ++   L
Sbjct: 14  ICYQCISNLFKRQNTNYFLQLLQKNGNYKIVTLEPECTVGKGLQLGQAKQESDVNRKEFL 73

Query: 90  ------ICKRQFNSKDILKKHTQLSELHKSNL 115
                 I   QF ++  L+ ++   E +KS+L
Sbjct: 74  LHNYVVIKNSQFTNEKTLELYSSKEENNKSHL 105


>gnl|CDD|219146 pfam06707, DUF1194, Protein of unknown function (DUF1194).  This
           family consists of several hypothetical Rhizobiales
           specific proteins of around 270 residues in length. The
           function of this family is unknown.
          Length = 206

 Score = 28.7 bits (65), Expect = 2.1
 Identities = 17/50 (34%), Positives = 21/50 (42%), Gaps = 7/50 (14%)

Query: 200 GLGKTNQGRTSIIEAEARISSAGLGTN--AIGMTPAPGET-----YKDCV 242
           G G  NQG   +  A     +AG+  N  AI    AP        Y+DCV
Sbjct: 124 GDGPNNQGFPPVTAARDAAVAAGVTINGLAIMGAEAPTSDDLDAYYRDCV 173


>gnl|CDD|133126 cd06595, GH31_xylosidase_XylS-like, This family represents an
           uncharacterized glycosyl hydrolase family 31 (GH31)
           enzyme found in bacteria and eukaryotes that is related
           to the XylS xylosidase of Sulfolobus solfataricus.
           Alpha-xylosidases catalyze the release of an
           alpha-xylose residue from the non-reducing end of
           alpha-xyloside substrates. Enzymes of the GH31 family
           possess a wide range of different hydrolytic activities
           including alpha-glucosidase (glucoamylase and
           sucrase-isomaltase), alpha-xylosidase,
           6-alpha-glucosyltransferase,
           3-alpha-isomaltosyltransferase and alpha-1,4-glucan
           lyase. All GH31 enzymes cleave a terminal carbohydrate
           moiety from a substrate that varies considerably in
           size, depending on the enzyme, and may be either a
           starch or a glycoprotein.
          Length = 292

 Score = 28.5 bits (64), Expect = 3.2
 Identities = 14/59 (23%), Positives = 19/59 (32%), Gaps = 4/59 (6%)

Query: 107 LSELHKSNLKMWYTCRNLDPNDAVH-RAVQYRDRAKERRLKYGEPEVPPACKLKAKYSK 164
           L +LH   LK      NL P D +     QY + AK   +               K+  
Sbjct: 80  LQDLHDRGLK---VTLNLHPADGIRAHEDQYPEMAKALGVDPATEGPILFDLTNPKFMD 135


>gnl|CDD|185627 PTZ00448, PTZ00448, hypothetical protein; Provisional.
          Length = 373

 Score = 28.5 bits (63), Expect = 3.4
 Identities = 26/119 (21%), Positives = 45/119 (37%), Gaps = 9/119 (7%)

Query: 14  ISKVANNTEQAASESPSQSTQGSADVCFSVVGNNREEKTKKSLVAAYESDEEP------E 67
           +  +  N+ Q   E+        +D+ F+ + N+      K L     S +E        
Sbjct: 237 VKTIDKNSNQDNVEAIQLELSNKSDILFNSITNDPRHIIDKKLEKTEVSIKENANKFVKP 296

Query: 68  VQEETVSGEEKQQIDWDKLACLICKRQFNSKDILKKHTQLSELHKSNLKMWYTCRNLDP 126
           +       E K++I  + L C  C  Q    +  K+H + SE H  N K     R ++P
Sbjct: 297 IVSSLNIEESKREIKSNMLLCRKCNIQLMDHNAFKQHYR-SEWHIFNTKR--NARKMEP 352


>gnl|CDD|184482 PRK14059, PRK14059, hypothetical protein; Provisional.
          Length = 251

 Score = 28.0 bits (63), Expect = 4.2
 Identities = 8/23 (34%), Positives = 13/23 (56%)

Query: 132 RAVQYRDRAKERRLKYGEPEVPP 154
             V+    A+++R   G+ EVPP
Sbjct: 86  GGVRLSAAARQQRQARGQAEVPP 108


>gnl|CDD|236014 PRK07418, PRK07418, acetolactate synthase 3 catalytic subunit;
           Reviewed.
          Length = 616

 Score = 28.1 bits (63), Expect = 5.1
 Identities = 12/18 (66%), Positives = 12/18 (66%)

Query: 218 ISSAGLGTNAIGMTPAPG 235
           ISSAGLGT   GM  A G
Sbjct: 428 ISSAGLGTMGFGMPAAMG 445


>gnl|CDD|223988 COG1060, ThiH, Thiamine biosynthesis enzyme ThiH and related
           uncharacterized enzymes [Coenzyme metabolism / General
           function prediction only].
          Length = 370

 Score = 28.0 bits (63), Expect = 5.7
 Identities = 12/53 (22%), Positives = 19/53 (35%), Gaps = 4/53 (7%)

Query: 200 GLGKTNQGRTSIIEAEARISSAGLGTNAIGMTPAPGETYKDCVKKMMRARYSQ 252
              K+ +    I E   R    G+ T A  +     ET +D +  +   R  Q
Sbjct: 193 PPKKSPEEWLEIHE---RAHRLGIPTTATMLL-GHVETREDRIDHLEHIRDLQ 241


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.310    0.125    0.359 

Gapped
Lambda     K      H
   0.267   0.0809    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,088,937
Number of extensions: 1072339
Number of successful extensions: 728
Number of sequences better than 10.0: 1
Number of HSP's gapped: 728
Number of HSP's successfully gapped: 23
Length of query: 254
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 159
Effective length of database: 6,723,972
Effective search space: 1069111548
Effective search space used: 1069111548
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 58 (26.0 bits)