RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy15415
         (128 letters)



>gnl|CDD|189470 pfam00250, Fork_head, Fork head domain. 
          Length = 96

 Score = 60.4 bits (147), Expect = 4e-13
 Identities = 21/48 (43%), Positives = 27/48 (56%), Gaps = 2/48 (4%)

Query: 13 RERFPYYNQNDDRWKNSVRHNLSINPHFRKGVKA--SQGAGHLWNLSD 58
           + FPYY QN   W+NS+RHNLS+N  F K  ++    G G  W L  
Sbjct: 32 MDLFPYYRQNKQGWQNSIRHNLSLNKCFIKVPRSPDKPGKGSYWTLDP 79


>gnl|CDD|214627 smart00339, FH, FORKHEAD.  FORKHEAD, also known as a "winged
          helix".
          Length = 89

 Score = 59.9 bits (146), Expect = 4e-13
 Identities = 19/43 (44%), Positives = 24/43 (55%), Gaps = 2/43 (4%)

Query: 14 ERFPYYNQNDDRWKNSVRHNLSINPHFRK--GVKASQGAGHLW 54
          + FPYY +N   W+NS+RHNLS+N  F K        G G  W
Sbjct: 33 DNFPYYRENRAGWQNSIRHNLSLNDCFVKVPREGDRPGKGSYW 75


>gnl|CDD|238016 cd00059, FH, Forkhead (FH), also known as a "winged helix".  FH
          is named for the Drosophila fork head protein, a
          transcription factor which promotes terminal rather
          than segmental development. This family of
          transcription factor domains, which bind to B-DNA as
          monomers, are also found in the Hepatocyte nuclear
          factor (HNF) proteins, which provide tissue-specific
          gene regulation. The structure contains 2 flexible
          loops or "wings" in the C-terminal region, hence the
          term winged helix.
          Length = 78

 Score = 58.7 bits (143), Expect = 9e-13
 Identities = 18/46 (39%), Positives = 24/46 (52%), Gaps = 2/46 (4%)

Query: 13 RERFPYYNQNDDRWKNSVRHNLSINPHFRKGVKAS--QGAGHLWNL 56
           + FPY+      W+NS+RHNLS+N  F K  +     G G  W L
Sbjct: 32 SDNFPYFRDAPAGWQNSIRHNLSLNKCFVKVPREPDEPGKGSYWTL 77


>gnl|CDD|227358 COG5025, COG5025, Transcription factor of the Forkhead/HNF3 family
           [Transcription].
          Length = 610

 Score = 42.9 bits (101), Expect = 1e-05
 Identities = 26/91 (28%), Positives = 36/91 (39%), Gaps = 3/91 (3%)

Query: 13  RERFPYYNQNDDRWKNSVRHNLSINPHFRKGVKAS--QGAGHLWNLSDMEPVEDASKSNW 70
               PYY      W+NS+RHNLS+N  F K  +++   G G  W +      E  SK N 
Sbjct: 368 SSNLPYYRHKPTAWQNSIRHNLSLNKSFEKVPRSASQPGKGCFWKIDYSYIYEKESKRNP 427

Query: 71  VSIKVTGVKRTGWLAFGYYSERPKYDMLNTS 101
            S K +    +       +     Y    TS
Sbjct: 428 RSPKKSPSAHSVHQKLSLHVNDL-YQSPATS 457



 Score = 42.1 bits (99), Expect = 2e-05
 Identities = 21/53 (39%), Positives = 25/53 (47%), Gaps = 9/53 (16%)

Query: 16  FPYYNQNDDRWKNSVRHNLSINPHF-----RKGVKASQGAGHLW-NLSDMEPV 62
           F YY +   RW+NS+RHNLS+N  F     R G K     GH W      E  
Sbjct: 120 FFYYAKVVSRWQNSIRHNLSLNDAFIKIEGRNGAK---VKGHFWSIGPGHETQ 169


>gnl|CDD|192749 pfam11374, DUF3176, Protein of unknown function (DUF3176).  This
          eukaryotic family of proteins has no known function.
          Length = 112

 Score = 26.1 bits (58), Expect = 3.9
 Identities = 14/49 (28%), Positives = 23/49 (46%), Gaps = 10/49 (20%)

Query: 47 SQGAGHL-WN---------LSDMEPVEDASKSNWVSIKVTGVKRTGWLA 85
          S+  G L W          LSD+E  ++AS+  W ++K+    +   LA
Sbjct: 42 SEALGQLKWLWFKGRRPRPLSDLEYFDEASRGPWGALKLLWRLKGRHLA 90


>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
          Length = 977

 Score = 26.0 bits (57), Expect = 6.7
 Identities = 13/36 (36%), Positives = 20/36 (55%), Gaps = 3/36 (8%)

Query: 14  ER-FPYYNQNDDRWKNSVRHNLSINPHFRKGVKASQ 48
           ER F YY +  + WK  V+ +++I+  F     ASQ
Sbjct: 926 ERAFNYYKRKPEVWKQLVQKDMNID--FSWDSSASQ 959


>gnl|CDD|219610 pfam07857, DUF1632, CEO family (DUF1632).  These sequences are
           found in hypothetical eukaryotic proteins of unknown
           function. The region concerned is approximately 280
           residues long. This family has been termed the CEO
           family for C. elegans ORF.
          Length = 254

 Score = 25.8 bits (57), Expect = 7.0
 Identities = 10/31 (32%), Positives = 14/31 (45%), Gaps = 2/31 (6%)

Query: 77  GVKRTGWLAFGYYSERPKYDMLNTSGVLRTC 107
            V R G   FG  ++ PK   LN  G++   
Sbjct: 98  AVTRFGL--FGIPAQVPKSPFLNYIGIVLVV 126


>gnl|CDD|222627 pfam14254, DUF4348, Domain of unknown function (DUF4348). 
          Length = 273

 Score = 25.8 bits (57), Expect = 8.2
 Identities = 24/96 (25%), Positives = 38/96 (39%), Gaps = 18/96 (18%)

Query: 5   ANLRLFFHRERFPYYNQNDDRWKNSVRHNLSINPHFRKGVKASQGAGHLW--NLSDMEPV 62
           AN +L   R +FP    N  +     + +   +  F K     QG   L   N   M+ V
Sbjct: 69  ANDKLQRSRIKFPLPVYNGKKVSKIEKKDWKHDYFFMK-----QGYYTLLFDNEEQMKLV 123

Query: 63  EDASKSN----WVSIKVTGVKRTGWLAFGYYSERPK 94
           +D + ++     + +K   VK+       YY ER K
Sbjct: 124 KDTTLTSVVVEKIFLKTKTVKQ-------YYFERIK 152


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.321    0.133    0.435 

Gapped
Lambda     K      H
   0.267   0.0768    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 6,249,489
Number of extensions: 501419
Number of successful extensions: 345
Number of sequences better than 10.0: 1
Number of HSP's gapped: 345
Number of HSP's successfully gapped: 16
Length of query: 128
Length of database: 10,937,602
Length adjustment: 86
Effective length of query: 42
Effective length of database: 7,123,158
Effective search space: 299172636
Effective search space used: 299172636
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 53 (24.1 bits)