RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy9499
         (289 letters)



>gnl|CDD|189470 pfam00250, Fork_head, Fork head domain. 
          Length = 96

 Score =  191 bits (488), Expect = 2e-62
 Identities = 68/96 (70%), Positives = 79/96 (82%)

Query: 124 KPPYSYIALITMAILQSPHKKLTLSGICEFIMSRFPYYREKFPAWQNSIRHNLSLNDCFV 183
           KPPYSYIALITMAI QSP K LTLS I ++IM  FPYYR+    WQNSIRHNLSLN CF+
Sbjct: 1   KPPYSYIALITMAIQQSPEKMLTLSEIYQWIMDLFPYYRQNKQGWQNSIRHNLSLNKCFI 60

Query: 184 KIPREPGNPGKGNYWTLDPMAEDMFDNGSFLRRRKR 219
           K+PR P  PGKG+YWTLDP +E+MF+NG +L+RRKR
Sbjct: 61  KVPRSPDKPGKGSYWTLDPESENMFENGKYLKRRKR 96


>gnl|CDD|214627 smart00339, FH, FORKHEAD.  FORKHEAD, also known as a "winged
           helix".
          Length = 89

 Score =  168 bits (429), Expect = 1e-53
 Identities = 65/89 (73%), Positives = 73/89 (82%)

Query: 124 KPPYSYIALITMAILQSPHKKLTLSGICEFIMSRFPYYREKFPAWQNSIRHNLSLNDCFV 183
           KPPYSYIALI MAIL SP K+LTLS I ++I   FPYYRE    WQNSIRHNLSLNDCFV
Sbjct: 1   KPPYSYIALIAMAILSSPDKRLTLSEIYKWIEDNFPYYRENRAGWQNSIRHNLSLNDCFV 60

Query: 184 KIPREPGNPGKGNYWTLDPMAEDMFDNGS 212
           K+PRE   PGKG+YWTLDP AE+MF+NG+
Sbjct: 61  KVPREGDRPGKGSYWTLDPAAENMFENGN 89


>gnl|CDD|238016 cd00059, FH, Forkhead (FH), also known as a "winged helix".  FH is
           named for the Drosophila fork head protein, a
           transcription factor which promotes terminal rather than
           segmental development. This family of transcription
           factor domains, which bind to B-DNA as monomers, are
           also found in the Hepatocyte nuclear factor (HNF)
           proteins, which provide tissue-specific gene regulation.
           The structure contains 2 flexible loops or "wings" in
           the C-terminal region, hence the term winged helix.
          Length = 78

 Score =  167 bits (425), Expect = 5e-53
 Identities = 54/78 (69%), Positives = 61/78 (78%)

Query: 124 KPPYSYIALITMAILQSPHKKLTLSGICEFIMSRFPYYREKFPAWQNSIRHNLSLNDCFV 183
           KPPYSY ALI MAI  SP K+LTLS I ++I   FPY+R+    WQNSIRHNLSLN CFV
Sbjct: 1   KPPYSYSALIAMAIQSSPEKRLTLSEIYKWISDNFPYFRDAPAGWQNSIRHNLSLNKCFV 60

Query: 184 KIPREPGNPGKGNYWTLD 201
           K+PREP  PGKG+YWTLD
Sbjct: 61  KVPREPDEPGKGSYWTLD 78


>gnl|CDD|227358 COG5025, COG5025, Transcription factor of the Forkhead/HNF3 family
           [Transcription].
          Length = 610

 Score =  102 bits (255), Expect = 2e-24
 Identities = 51/107 (47%), Positives = 60/107 (56%), Gaps = 2/107 (1%)

Query: 113 SDKQDGKNSIVKPPYSYIALITMAILQSPHKKLTLSGICEFIMSRFPYYREKFPAWQNSI 172
           SD     N   KP +SY   IT AIL SP  K+TLS I  +I S  PYYR K  AWQNSI
Sbjct: 326 SDLSADVNRTSKPAFSYANSITQAILSSPSGKMTLSEIYSWISSNLPYYRHKPTAWQNSI 385

Query: 173 RHNLSLNDCFVKIPREPGNPGKGNYWTLDPMAEDMFDNGSFLRRRKR 219
           RHNLSLN  F K+PR    PGKG +W +D     +++  S    R  
Sbjct: 386 RHNLSLNKSFEKVPRSASQPGKGCFWKIDY--SYIYEKESKRNPRSP 430



 Score = 78.3 bits (193), Expect = 3e-16
 Identities = 43/85 (50%), Positives = 52/85 (61%)

Query: 124 KPPYSYIALITMAILQSPHKKLTLSGICEFIMSRFPYYREKFPAWQNSIRHNLSLNDCFV 183
            PPYSY     +AIL SP K LTLS I  +I + F YY +    WQNSIRHNLSLND F+
Sbjct: 86  VPPYSYATGRGLAILNSPDKPLTLSKIYTWIHNTFFYYAKVVSRWQNSIRHNLSLNDAFI 145

Query: 184 KIPREPGNPGKGNYWTLDPMAEDMF 208
           KI    G   KG++W++ P  E  F
Sbjct: 146 KIEGRNGAKVKGHFWSIGPGHETQF 170


>gnl|CDD|185638 PTZ00459, PTZ00459, mucin-associated surface protein (MASP);
          Provisional.
          Length = 291

 Score = 31.7 bits (71), Expect = 0.39
 Identities = 19/54 (35%), Positives = 23/54 (42%), Gaps = 2/54 (3%)

Query: 21 PHHTGPRTSFDSPLDLRGAPHPGDMCNLSSPSPSSEDDSDADINVESDDGDEQG 74
          P   G  TS     DL+G          S P P+ EDD D D + E  D D+ G
Sbjct: 43 PESKGLETSSQGTQDLKGGA--AGAKENSPPLPTEEDDEDVDDDSEEGDDDDGG 94


>gnl|CDD|227326 COG4993, Gcd, Glucose dehydrogenase [Carbohydrate transport and
           metabolism].
          Length = 773

 Score = 29.8 bits (67), Expect = 2.3
 Identities = 25/98 (25%), Positives = 34/98 (34%), Gaps = 23/98 (23%)

Query: 189 PGNPGKGNYWTL--DPMAEDMFDNGSFLRRRKRYKRTPPDFLLRDPHAAAMASFLGSDPY 246
           PGN G  N+  +  DP+ +  F N  +L    +     P    RDP    M +    D  
Sbjct: 569 PGNHGMFNWGGVSVDPVRQVTFGNPYYLAFVSKL---VP----RDPV-GPMENGAAGDGT 620

Query: 247 HHGLLPHSLQHGGFSFMSPLPPAV---PLLPPSELVVQ 281
             G++P              P  V   P L P  L  Q
Sbjct: 621 EGGVVP----------NYGEPYGVWMGPFLSPGGLPCQ 648


>gnl|CDD|131813 TIGR02766, crypt_chrom_pln, cryptochrome, plant family.  At least
           five major families of cryptochomes and photolyases
           share FAD cofactor binding, sequence homology, and the
           ability to react to short wavelengths of visible light.
           Photolysases are responsible for light-dependent DNA
           repair by removal of two types of uv-induced DNA
           dimerizations. Cryptochromes have other functions, often
           regulatory and often largely unknown, which may include
           circadian clock entrainment and control of development.
           Members of this subfamily are known so far only in
           plants; they may show some photolyase activity in vitro
           but appear mostly to be regulatory proteins that respond
           to blue light.
          Length = 475

 Score = 29.1 bits (65), Expect = 2.7
 Identities = 9/22 (40%), Positives = 13/22 (59%)

Query: 265 PLPPAVPLLPPSELVVQSISLC 286
           P  P  PLLPP +++   +S C
Sbjct: 155 PYDPESPLLPPKKIISGDVSKC 176


>gnl|CDD|227531 COG5206, GPI8, Glycosylphosphatidylinositol transamidase (GPIT),
           subunit GPI8 [Posttranslational modification, protein
           turnover, chaperones].
          Length = 382

 Score = 28.8 bits (64), Expect = 3.1
 Identities = 34/133 (25%), Positives = 53/133 (39%), Gaps = 22/133 (16%)

Query: 120 NSIVKPPYS--YIALITMAILQSPHKKLTLSGICEFIMSRFPYYREKFPAWQNSIRHNLS 177
           N++    YS   +A+ +  I QS +   + S I   ++ RF Y+  KF   +  I   L+
Sbjct: 193 NALYDKSYSPNVLAVGSSEIGQSSYSHHSDSLIGVAVIDRFTYFFLKFLE-KIDIGSKLT 251

Query: 178 LNDCFVKIPREPGNPGKGNYWTLDPMAEDMFDNGSFLRRRKRYKRTPPDFLLRDPHAAAM 237
           L D    + +EP +   G       + E +FD            R P DFL+ D  A   
Sbjct: 252 LQDLLASLNKEPIHSHVG-------VRELVFD------------RRPSDFLITDFFANVQ 292

Query: 238 ASFLGSDPYHHGL 250
            S L     H+  
Sbjct: 293 NSALELLKSHYHH 305


>gnl|CDD|182461 PRK10438, PRK10438, C-N hydrolase family amidase; Provisional.
          Length = 256

 Score = 28.6 bits (64), Expect = 3.2
 Identities = 9/16 (56%), Positives = 12/16 (75%)

Query: 161 YREKFPAWQNSIRHNL 176
           YREKFPAW+++    L
Sbjct: 240 YREKFPAWRDADEFTL 255


>gnl|CDD|222389 pfam13801, Metal_resist, Heavy-metal resistance.  This is a
           metal-binding protein which is involved in resistance to
           heavy-metal ions. The protein forms a four-helix hooked
           hairpin, consisting of two long alpha helices each
           flanked by a shorter alpha helix. It binds a metal ion
           in a type-2 like centre. It contains two copies of an
           LTXXQ motif.
          Length = 125

 Score = 27.6 bits (62), Expect = 3.9
 Identities = 8/36 (22%), Positives = 10/36 (27%), Gaps = 2/36 (5%)

Query: 234 AAAMASFLGSDPYHHGLLPHSLQHGGFSFMSPLPPA 269
             A A+  G      G      + G       LPP 
Sbjct: 12  VVAGAALRGPGGPGPG--GPRRRPGLGPAALALPPE 45


>gnl|CDD|219406 pfam07420, DUF1509, Protein of unknown function (DUF1509).  This
           family consists of several uncharacterized viral
           proteins from the Marek's disease-like viruses. Members
           of this family are typically around 400 residues in
           length. The function of this family is unknown.
          Length = 377

 Score = 28.5 bits (63), Expect = 4.5
 Identities = 11/63 (17%), Positives = 20/63 (31%), Gaps = 2/63 (3%)

Query: 19  PQPHHTGPRTSFDSPLDLRGAPHPGDMCNLSSPSPSSEDDSDADINVESDDGDEQGMSNV 78
           P P  + P    D   +    P P      +  + +   D  + I+ +S  G    M   
Sbjct: 260 PSPRPSSPPPEIDE--EYNAQPDPWGTNAQTWDNRAGTPDDGSAISEDSAGGSHHTMRRP 317

Query: 79  RRN 81
             +
Sbjct: 318 PHS 320


>gnl|CDD|215742 pfam00136, DNA_pol_B, DNA polymerase family B.  This region of DNA
           polymerase B appears to consist of more than one
           structural domain, possibly including elongation,
           DNA-binding and dNTP binding activities.
          Length = 458

 Score = 28.4 bits (64), Expect = 5.0
 Identities = 9/23 (39%), Positives = 12/23 (52%)

Query: 66  ESDDGDEQGMSNVRRNIAEFHKH 88
               GD +G+  VRR+  EF K 
Sbjct: 275 YDGKGDIKGVDLVRRDWCEFVKE 297


>gnl|CDD|223433 COG0356, AtpB, F0F1-type ATP synthase, subunit a [Energy production
           and conversion].
          Length = 246

 Score = 28.0 bits (63), Expect = 5.4
 Identities = 16/36 (44%), Positives = 19/36 (52%), Gaps = 5/36 (13%)

Query: 258 GGF--SFMSPLPPAVPLLPPSELV---VQSISLCLR 288
           GGF   F  P  P  PL+ P E+V    + ISL LR
Sbjct: 145 GGFFKEFAFPPFPLAPLMLPLEIVSELAKPISLSLR 180


>gnl|CDD|217311 pfam02995, DUF229, Protein of unknown function (DUF229).  Members
           of this family are uncharacterized. They are 500-1200
           amino acids in length and share a long region
           conservation that probably corresponds to several
           domains. The Go annotation for the protein indicates
           that it is involved in nematode larval development and
           has a positive regulation on growth rate.
          Length = 492

 Score = 27.7 bits (62), Expect = 9.4
 Identities = 19/68 (27%), Positives = 26/68 (38%), Gaps = 15/68 (22%)

Query: 196 NYWTLDPMAEDMFDNGSFLRRRKRYKRTPPDFLLRDPHAAAM------------ASFLGS 243
            Y T    AED    G+F   +  +++ P D  LR P   A+               LG 
Sbjct: 212 GYATA--FAEDWAGIGTFNYLKPGFRKQPTDHYLR-PLLLAVEKHLTYSTRFGLNYCLGR 268

Query: 244 DPYHHGLL 251
            P H+ LL
Sbjct: 269 RPTHNYLL 276


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.137    0.434 

Gapped
Lambda     K      H
   0.267   0.0768    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,322,003
Number of extensions: 1472031
Number of successful extensions: 1321
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1317
Number of HSP's successfully gapped: 26
Length of query: 289
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 193
Effective length of database: 6,679,618
Effective search space: 1289166274
Effective search space used: 1289166274
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 58 (26.0 bits)