RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy4272
         (575 letters)



>gnl|CDD|200956 pfam00046, Homeobox, Homeobox domain. 
          Length = 57

 Score = 86.8 bits (216), Expect = 3e-21
 Identities = 30/56 (53%), Positives = 40/56 (71%)

Query: 325 RFRQVYSSHQIVELEKEFRLTNYLPSDRRKVLSKELGLTERQIKIWFQNRRMKLKR 380
           R R  ++  Q+ ELEKEF    Y  ++ R+ L+K+LGLTERQ+K+WFQNRR K KR
Sbjct: 2   RKRTTFTPEQLEELEKEFEKNRYPSAEEREELAKKLGLTERQVKVWFQNRRAKWKR 57


>gnl|CDD|197696 smart00389, HOX, Homeodomain.  DNA-binding factors that are
           involved in the transcriptional regulation of key
           developmental processes.
          Length = 57

 Score = 78.1 bits (193), Expect = 4e-18
 Identities = 28/55 (50%), Positives = 39/55 (70%)

Query: 325 RFRQVYSSHQIVELEKEFRLTNYLPSDRRKVLSKELGLTERQIKIWFQNRRMKLK 379
           R R  ++  Q+ ELEKEF+   Y   + R+ L+K+LGL+ERQ+K+WFQNRR K K
Sbjct: 3   RKRTSFTPEQLEELEKEFQKNPYPSREEREELAKKLGLSERQVKVWFQNRRAKWK 57


>gnl|CDD|238039 cd00086, homeodomain, Homeodomain;  DNA binding domains involved in
           the transcriptional regulation of key eukaryotic
           developmental processes; may bind to DNA as monomers or
           as homo- and/or heterodimers, in a sequence-specific
           manner.
          Length = 59

 Score = 77.7 bits (192), Expect = 5e-18
 Identities = 33/58 (56%), Positives = 41/58 (70%)

Query: 325 RFRQVYSSHQIVELEKEFRLTNYLPSDRRKVLSKELGLTERQIKIWFQNRRMKLKRTH 382
           R R  ++  Q+ ELEKEF    Y   + R+ L+KELGLTERQ+KIWFQNRR KLKR+ 
Sbjct: 2   RKRTRFTPEQLEELEKEFEKNPYPSREEREELAKELGLTERQVKIWFQNRRAKLKRSE 59


>gnl|CDD|227863 COG5576, COG5576, Homeodomain-containing transcription factor
           [Transcription].
          Length = 156

 Score = 49.0 bits (117), Expect = 6e-07
 Identities = 23/75 (30%), Positives = 37/75 (49%), Gaps = 2/75 (2%)

Query: 331 SSHQIVELEKEFRLTNYLPSDRRKVLSKELGLTERQIKIWFQNRRMKLKRTH--PDMVVE 388
           +  Q++ LE+EF +  Y  S  R  LS  L +  + ++IWFQN+R K K+          
Sbjct: 59  TDEQLMVLEREFEINPYPSSITRIKLSLLLNMPPKSVQIWFQNKRAKEKKKRSGKVEQRP 118

Query: 389 YVKESSEARSASQKT 403
             +E+  A+  S  T
Sbjct: 119 GEEEADLAKIGSLST 133


>gnl|CDD|200998 pfam00096, zf-C2H2, Zinc finger, C2H2 type.  The C2H2 zinc finger
           is the classical zinc finger domain. The two conserved
           cysteines and histidines co-ordinate a zinc ion. The
           following pattern describes the zinc finger.
           #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can
           be any amino acid, and numbers in brackets indicate the
           number of residues. The positions marked # are those
           that are important for the stable fold of the zinc
           finger. The final position can be either his or cys. The
           C2H2 zinc finger is composed of two short beta strands
           followed by an alpha helix. The amino terminal part of
           the helix binds the major groove in DNA binding zinc
           fingers. The accepted consensus binding sequence for Sp1
           is usually defined by the asymmetric hexanucleotide core
           GGGCGG but this sequence does not include, among others,
           the GAG (=CTC) repeat that constitutes a high-affinity
           site for Sp1 binding to the wt1 promoter.
          Length = 22

 Score = 30.8 bits (70), Expect = 0.083
 Identities = 10/20 (50%), Positives = 14/20 (70%)

Query: 119 TCLQCGRSYCRPYNLKRHIQ 138
            C  CG+S+ R  NLKRH++
Sbjct: 1   KCPDCGKSFSRKSNLKRHLR 20


>gnl|CDD|206065 pfam13894, zf-C2H2_4, C2H2-type zinc finger.  This family contains
           a number of divergent C2H2 type zinc fingers.
          Length = 24

 Score = 28.8 bits (64), Expect = 0.47
 Identities = 8/24 (33%), Positives = 10/24 (41%)

Query: 147 FPCLYCSYRARHKHDLKKHVTFKH 170
           F C  C      K  LK+H+   H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKHH 24


>gnl|CDD|183362 PRK11891, PRK11891, aspartate carbamoyltransferase; Provisional.
          Length = 429

 Score = 32.1 bits (73), Expect = 0.83
 Identities = 18/85 (21%), Positives = 36/85 (42%), Gaps = 5/85 (5%)

Query: 353 RKVLSKELGLTERQIKIWFQNRRMKLKRTHPDMVVEYVKESSEARSASQK---TNHQSAP 409
           R+  +  +G++ R +  W      +  R  P++V  +V E  E  +  +K   +  +   
Sbjct: 19  REAFANRIGVSRRALDTWLLPDDSQESRGMPEIVERFVSEIVERAAPGEKYTQSVDEQPL 78

Query: 410 ADTSAPNT--HMKSFEEFIREHYER 432
           AD  A      + S ++F R+  E 
Sbjct: 79  ADQFAFEGKPQLLSVDQFSRDSVEA 103


>gnl|CDD|205059 pfam12760, Zn_Tnp_IS1595, Transposase zinc-ribbon domain.  This
           zinc binding domain is found in a range of transposase
           proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It is
           likely a zinc-binding beta ribbon domain that could bind
           the DNA.
          Length = 46

 Score = 28.8 bits (65), Expect = 0.85
 Identities = 11/36 (30%), Positives = 14/36 (38%), Gaps = 2/36 (5%)

Query: 106 RESIRQKRLAQGLTCLQCGRSYCRPYNLKRHIQYEC 141
           R  + Q R   G  C  CG      Y + R   Y+C
Sbjct: 7   RAYLEQLRWPNGFVCPHCG--SEEAYRITRRGLYQC 40


>gnl|CDD|203350 pfam05920, Homeobox_KN, Homeobox KN domain.  This is a homeobox
           transcription factor KN domain conserved from fungi to
           human and plants.
          Length = 40

 Score = 27.8 bits (63), Expect = 1.5
 Identities = 12/29 (41%), Positives = 18/29 (62%), Gaps = 1/29 (3%)

Query: 349 PSDRRKV-LSKELGLTERQIKIWFQNRRM 376
           PS+  K  L+++ GL+ +QI  WF N R 
Sbjct: 11  PSEEEKSELARQTGLSRKQIDNWFINARR 39


>gnl|CDD|197676 smart00355, ZnF_C2H2, zinc finger. 
          Length = 23

 Score = 27.0 bits (60), Expect = 1.9
 Identities = 5/21 (23%), Positives = 11/21 (52%)

Query: 118 LTCLQCGRSYCRPYNLKRHIQ 138
             C +CG+ +     L+ H++
Sbjct: 1   YRCPECGKVFKSKSALREHMR 21


>gnl|CDD|215203 PLN02355, PLN02355, probable galactinol--sucrose
           galactosyltransferase 1.
          Length = 758

 Score = 31.0 bits (70), Expect = 2.0
 Identities = 14/27 (51%), Positives = 15/27 (55%)

Query: 41  IRTSDPDWPRPTASFAATIADVATKTL 67
           IR SD  WPR  AS    IA VA  T+
Sbjct: 453 IRASDDFWPRDPASHTIHIASVAYNTI 479


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 30.5 bits (70), Expect = 2.6
 Identities = 19/96 (19%), Positives = 35/96 (36%), Gaps = 2/96 (2%)

Query: 181 QEELEVTTKELSPGPKAENRHRFREFLNEHRNLERNVSHEEIRTHKKHEQPRKESSYKET 240
           ++E E   KE     K E      EF  E R     +   E R  +K E   ++    E 
Sbjct: 48  KKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEK 107

Query: 241 ENTQFNSETNDLSKLEE--PNKTSSILLAHLQQVQK 274
              +   +  +L + ++    K   +     +Q+Q+
Sbjct: 108 REEELEKKEKELEQKQQELEKKEEELEELIEEQLQE 143


>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein.  This family consists of several
           NOA36 proteins which contain 29 highly conserved
           cysteine residues. The function of this protein is
           unknown.
          Length = 314

 Score = 29.9 bits (67), Expect = 3.9
 Identities = 15/50 (30%), Positives = 22/50 (44%), Gaps = 3/50 (6%)

Query: 113 RLAQGLTCLQCGRSYCRPYNLKRHIQYECGKAPQFPCLYCSYRARHKHDL 162
           RL Q  +CL+C    C  +  ++  +Y+  K    PC  C Y      DL
Sbjct: 179 RLGQ-YSCLRCKVCCCNDHVRRKGFKYDKDKNL--PCPKCGYPTTETKDL 225


>gnl|CDD|151230 pfam10758, DUF2586, Protein of unknown function (DUF2586).  This
           bacterial family of proteins has no known function.
          Length = 363

 Score = 29.6 bits (67), Expect = 4.6
 Identities = 22/54 (40%), Positives = 28/54 (51%), Gaps = 10/54 (18%)

Query: 56  AATIAD----VATKTLPVIQELLSQRDGKPVGKATLPSPLKAKLS------DYE 99
           A TIAD    VAT  L  + EL   +DGKP+  ATL +   A+ S      DY+
Sbjct: 188 AVTIADSPMRVATGALLGLGELPVDKDGKPLTLATLAALDAARYSVPQWYPDYD 241


>gnl|CDD|238892 cd01911, proteasome_alpha, proteasome alpha subunit. The 20S
           proteasome, multisubunit proteolytic complex, is the
           central enzyme of nonlysosomal protein degradation in
           both the cytosol and nucleus. It is composed of 28
           subunits arranged as four homoheptameric rings that
           stack on top of one another forming an elongated
           alpha-beta-beta-alpha cylinder with a central cavity.
           The proteasome alpha and beta subunits are members of
           the N-terminal nucleophile (Ntn)-hydrolase superfamily.
           Their N-terminal threonine residues are exposed as a
           nucleophile in peptide bond hydrolysis. Mammals have 7
           different alpha and 10 different beta proteasome subunit
           genes while archaea have one of each.
          Length = 209

 Score = 28.2 bits (64), Expect = 9.0
 Identities = 9/27 (33%), Positives = 17/27 (62%)

Query: 540 GGHAEDAPTDLSRKLKSDMEEDEAVKE 566
           G  +++A T L ++ K D+  +EA+K 
Sbjct: 160 GKGSQEAKTFLEKRYKKDLTLEEAIKL 186


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.311    0.128    0.364 

Gapped
Lambda     K      H
   0.267   0.0760    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 28,638,888
Number of extensions: 2755562
Number of successful extensions: 2707
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2703
Number of HSP's successfully gapped: 36
Length of query: 575
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 473
Effective length of database: 6,413,494
Effective search space: 3033582662
Effective search space used: 3033582662
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 62 (27.6 bits)