RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy12666
         (347 letters)



>gnl|CDD|214627 smart00339, FH, FORKHEAD.  FORKHEAD, also known as a "winged
           helix".
          Length = 89

 Score = 62.6 bits (153), Expect = 1e-12
 Identities = 22/45 (48%), Positives = 30/45 (66%), Gaps = 3/45 (6%)

Query: 95  FRPAN-HYQNSIRHNLSLHSKFMRVQNEGT--GKSSWWMLNPDAK 136
           +R     +QNSIRHNLSL+  F++V  EG   GK S+W L+P A+
Sbjct: 38  YRENRAGWQNSIRHNLSLNDCFVKVPREGDRPGKGSYWTLDPAAE 82


>gnl|CDD|238016 cd00059, FH, Forkhead (FH), also known as a "winged helix".  FH is
           named for the Drosophila fork head protein, a
           transcription factor which promotes terminal rather than
           segmental development. This family of transcription
           factor domains, which bind to B-DNA as monomers, are
           also found in the Hepatocyte nuclear factor (HNF)
           proteins, which provide tissue-specific gene regulation.
           The structure contains 2 flexible loops or "wings" in
           the C-terminal region, hence the term winged helix.
          Length = 78

 Score = 57.6 bits (140), Expect = 6e-11
 Identities = 18/33 (54%), Positives = 23/33 (69%), Gaps = 2/33 (6%)

Query: 102 QNSIRHNLSLHSKFMRVQNE--GTGKSSWWMLN 132
           QNSIRHNLSL+  F++V  E    GK S+W L+
Sbjct: 46  QNSIRHNLSLNKCFVKVPREPDEPGKGSYWTLD 78


>gnl|CDD|189470 pfam00250, Fork_head, Fork head domain. 
          Length = 96

 Score = 46.2 bits (110), Expect = 1e-06
 Identities = 22/50 (44%), Positives = 33/50 (66%), Gaps = 6/50 (12%)

Query: 101 YQNSIRHNLSLHSKFMRV--QNEGTGKSSWWMLNPDAKA----GKSARRR 144
           +QNSIRHNLSL+  F++V    +  GK S+W L+P+++     GK  +RR
Sbjct: 45  WQNSIRHNLSLNKCFIKVPRSPDKPGKGSYWTLDPESENMFENGKYLKRR 94


>gnl|CDD|225723 COG3182, PiuB, Uncharacterized iron-regulated membrane protein
           [Function unknown].
          Length = 442

 Score = 32.0 bits (73), Expect = 0.43
 Identities = 15/90 (16%), Positives = 28/90 (31%), Gaps = 7/90 (7%)

Query: 172 TPSPSSSVSEIRTTDSVPHPTLPPVGLIFSPDYRQRASSNASSCGRLSPIPAQESPAWLP 231
            P    SV ++       +P      ++   D         S+   +S         ++ 
Sbjct: 54  APERPLSVDQLVARAEEAYPGAAVWQVLPPDD------PGGSAAITVSEPGGAADAVYVD 107

Query: 232 PYSNPYLGA-EQLAGSLETTMKLSSDNFLP 260
           PY+   +G      G +    +L SD  L 
Sbjct: 108 PYTGEVVGPLPPAGGLMNFMRELHSDLLLG 137


>gnl|CDD|227358 COG5025, COG5025, Transcription factor of the Forkhead/HNF3 family
           [Transcription].
          Length = 610

 Score = 32.1 bits (73), Expect = 0.51
 Identities = 28/92 (30%), Positives = 39/92 (42%), Gaps = 7/92 (7%)

Query: 102 QNSIRHNLSLHSKFMRVQNE--GTGKSSWWMLNPDAKAGKSARRRATSMETSKFEKRRGR 159
           QNSIRHNLSL+  F +V       GK  +W ++      K ++R   S    K       
Sbjct: 382 QNSIRHNLSLNKSFEKVPRSASQPGKGCFWKIDYSYIYEKESKRNPRS---PKKSPSAHS 438

Query: 160 VKKKVEALRNGITPSP--SSSVSEIRTTDSVP 189
           V +K+    N +  SP  S   S     +S P
Sbjct: 439 VHQKLSLHVNDLYQSPATSDIASSSSQVNSQP 470


>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
           non-fungal.  The approx. 70 residue Med15 domain of the
           ARC-Mediator co-activator is a three-helix bundle with
           marked similarity to the KIX domain. The sterol
           regulatory element binding protein (SREBP) family of
           transcription activators use the ARC105 subunit to
           activate target genes in the regulation of cholesterol
           and fatty acid homeostasis. In addition, Med15 is a
           critical transducer of gene activation signals that
           control early metazoan development.
          Length = 768

 Score = 31.9 bits (72), Expect = 0.58
 Identities = 14/69 (20%), Positives = 18/69 (26%), Gaps = 4/69 (5%)

Query: 260 PYPSPP----PPYGAPPPYPCPHHLVNSCNCHNHPAVKQQSISPSYHNSPSPSPPATVMG 315
           P P  P    P   +    P P  L+ S +     +   Q        SP          
Sbjct: 435 PSPGGPGSQPPQSVSGGMIPSPPALMPSPSPQMSQSPASQRTIQQDMVSPGGPLNTPGQS 494

Query: 316 QVLSSLNSS 324
            V S  N  
Sbjct: 495 SVNSPANPQ 503


>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein.  The WWbp domain is
           characterized by several short PY and PT-like motifs of
           the PPPPY form. These appear to bind directly to the WW
           domains of WWP1 and WWP2 and other such diverse proteins
           as dystrophin and YAP (Yes-associated protein). This is
           the WW-domain binding protein WWbp via PY and PY_like
           motifs. The presence of a phosphotyrosine residue in the
           pWBP-1 peptide abolishes WW domain binding which
           suggests a potential regulatory role for tyrosine
           phosphorylation in modulating WW domain-ligand
           interactions. Given the likelihood that WWP1 and WWP2
           function as E3 ubiquitin-protein ligases, it is possible
           that initial substrate-specific recognition occurs via
           WW domain-substrate protein interaction followed by
           ubiquitin transfer and subsequent proteolysis. This
           domain lies just downstream of the GRAM (pfam02893) in
           many members.
          Length = 111

 Score = 29.3 bits (66), Expect = 1.0
 Identities = 19/56 (33%), Positives = 21/56 (37%)

Query: 218 LSPIPAQESPAWLPPYSNPYLGAEQLAGSLETTMKLSSDNFLPYPSPPPPYGAPPP 273
           L P+PA    A  PPY  P        G   T    +S        PPPPY  PP 
Sbjct: 56  LEPLPAYGQYAAPPPYGPPPPYYPAPPGVYPTPPPPNSGYMADPQEPPPPYPGPPQ 111


>gnl|CDD|112890 pfam04094, DUF390, Protein of unknown function (DUF390).  This is a
           family of long proteins currently only found in the rice
           genome. They have no known function. However they may be
           some kind of transposable element.
          Length = 843

 Score = 30.9 bits (69), Expect = 1.1
 Identities = 30/145 (20%), Positives = 51/145 (35%), Gaps = 15/145 (10%)

Query: 204 YRQRASSNASSCGRLSPIPAQESPAWLPPYSNPYLGAEQLAGSLETTMKLSSDNFLPYPS 263
           +R   ++  S   R     AQ +P      +              TT + + D       
Sbjct: 290 FRADEAAATSEAARDEAAGAQLAPDPSGDAAA-------------TTSEAAGDEAAGALL 336

Query: 264 PPPPYGAPPPYPCPHHLVNSCNCHNHPAVKQQSISPSYHNSPSPSP-PATVMGQVLSSLN 322
            P P G     P P    +S      P+ +           PS +P  A  + Q L++ N
Sbjct: 337 GPDPSGDAQDEPAPGGAPDSGTSIGGPS-RAAPSPRRLFPLPSAAPLNAEPLLQALAAAN 395

Query: 323 SSVLDDFNLNIESLQGFDCNVDEVY 347
           ++VLD  +  +E+LQ     +D  +
Sbjct: 396 TTVLDGLSAQMEALQAERAELDAAW 420


>gnl|CDD|215076 PLN00147, PLN00147, light-harvesting complex I chlorophyll-a/b
           binding protein Lhca5; Provisional.
          Length = 252

 Score = 30.2 bits (68), Expect = 1.3
 Identities = 14/30 (46%), Positives = 18/30 (60%)

Query: 207 RASSNASSCGRLSPIPAQESPAWLPPYSNP 236
             SS A+S G ++P+ AQE P WLP    P
Sbjct: 26  SISSVAASSGGINPVVAQERPTWLPGLDPP 55


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 30.7 bits (69), Expect = 1.5
 Identities = 31/140 (22%), Positives = 46/140 (32%), Gaps = 22/140 (15%)

Query: 173  PSPSSSVSEIRTTDSVPHPTLPPVGLIFSPDYRQRASSNASSCGRLSPIPAQESPAWLPP 232
            PSPS + +E         P         +P  R      A   GR +   +         
Sbjct: 2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPG-RVSRPRRARRLGR-AAQASSPPQRPRRR 2686

Query: 233  YSNPYLGAEQLAGSLETTMKLSSDNFLPYPSPPPPYGAPPPYPCPHHLVNSCNCHNHPAV 292
             + P +G+                       PPPP   P P P PH LV++      PA 
Sbjct: 2687 AARPTVGS-----------------LTSLADPPPP--PPTPEPAPHALVSATPLPPGPAA 2727

Query: 293  KQQSISPSYHNSPSPSPPAT 312
             +Q+ SP+   +P+P     
Sbjct: 2728 ARQA-SPALPAAPAPPAVPA 2746


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 30.4 bits (68), Expect = 1.7
 Identities = 34/138 (24%), Positives = 47/138 (34%), Gaps = 8/138 (5%)

Query: 187 SVPHPTLP-PVGLIFSPDYRQRASSNASSCGRLSPIPAQESPAWLPPYSNPYLGAEQLAG 245
            +PH     PV L        +    A S     P+P+Q  P    P S   L  +Q   
Sbjct: 274 PMPHALQQGPVFLQHPSSNPPQPFGLAQSQVPPLPLPSQAQPHSHTPPSQSALQPQQPPR 333

Query: 246 SLETTMKLSSDNFLPYPSPPPPYGAPPPYPCPHHLVN-------SCNCHNHPAVKQQSIS 298
                   S  +  P P+ P P      +  P HL           N    PA+K  S  
Sbjct: 334 EQPLPPAPSMPHIKPPPTTPIPQLPNQSHKHPPHLQGPSPFPQMPSNLPPPPALKPLSSL 393

Query: 299 PSYHNSPSPSPPATVMGQ 316
           P++H   +  PP  +M Q
Sbjct: 394 PTHHPPSAHPPPLQLMPQ 411


>gnl|CDD|219569 pfam07777, MFMR, G-box binding protein MFMR.  This region is found
           to the N-terminus of the pfam00170 transcription factor
           domain. It is between 150 and 200 amino acids in length.
           The N-terminal half is rather rich in proline residues
           and has been termed the PRD (proline rich domain),
           whereas the C-terminal half is more polar and has been
           called the MFMR (multifunctional mosaic region). It has
           been suggested that this family is composed of three
           sub-families called A, B and C, classified according to
           motif composition. It has been suggested that some of
           these motifs may be involved in mediating
           protein-protein interactions. The MFMR region contains a
           nuclear localisation signal in bZIP opaque and GBF-2.
           The MFMR also contains a transregulatory activity in
           TAF-1. The MFMR in CPRF-2 contains cytoplasmic retention
           signals.
          Length = 189

 Score = 29.4 bits (66), Expect = 1.8
 Identities = 23/82 (28%), Positives = 34/82 (41%), Gaps = 8/82 (9%)

Query: 265 PPPYGAPPPYP--CPHHLVNSCNCHNHPAVKQQSISPSYHNSPSPSPPATVMGQVLSSLN 322
            PPYG PPPY    P   V     + HP++   S   S +  PS   P +    + +   
Sbjct: 69  MPPYGTPPPYAAMYPPGGV-----YAHPSMPPGSHPFSPYAMPSAEVPGSTPLSMETDAK 123

Query: 323 SSVLDDFNLNIESLQGFDCNVD 344
           SS   D    I+  +G D ++ 
Sbjct: 124 SSDNKDKGS-IKKSKGSDGSLG 144


>gnl|CDD|218535 pfam05281, Secretogranin_V, Neuroendocrine protein 7B2 precursor
           (Secretogranin V).  The neuroendocrine protein 7B2 has a
           critical role in the proteolytic conversion and
           activation of proPC2, the enzyme responsible for the
           proteolytic conversion of many peptide hormone
           precursors. The 7B2 protein acts as an intracellular
           binding protein for proPC2, facilitates its maturation,
           and is required for its enzymatic activity. Processing
           of many important peptide precursors does not occur in
           7B2 nulls. 7B2 null mice exhibit a unique form of
           Cushing's disease with many atypical symptoms, such as
           hypoglycemia.
          Length = 210

 Score = 29.5 bits (66), Expect = 1.8
 Identities = 13/33 (39%), Positives = 15/33 (45%)

Query: 277 PHHLVNSCNCHNHPAVKQQSISPSYHNSPSPSP 309
             HL    N  N  AVK     P+Y N P+P P
Sbjct: 74  LQHLGPYGNIPNIVAVKTDDNLPAYCNPPNPCP 106


>gnl|CDD|235960 PRK07199, PRK07199, phosphoribosylpyrophosphate synthetase;
           Provisional.
          Length = 301

 Score = 29.5 bits (67), Expect = 2.2
 Identities = 6/14 (42%), Positives = 12/14 (85%)

Query: 179 VSEIRTTDSVPHPT 192
           ++ + +TD+VPHP+
Sbjct: 264 IARVVSTDTVPHPS 277


>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein.  Members of this
           protein family belong to a conserved gene four-gene
           neighborhood found sporadically in a phylogenetically
           broad range of bacteria: Nocardia farcinica,
           Symbiobacterium thermophilum, and Streptomyces
           avermitilis (Actinobacteria), Geobacillus kaustophilus
           (Firmicutes), Azoarcus sp. EbN1 and Ralstonia
           solanacearum (Betaproteobacteria). Proteins in this
           family average over 1400 amino acids in length
           [Hypothetical proteins, Conserved].
          Length = 1353

 Score = 29.8 bits (67), Expect = 2.4
 Identities = 7/41 (17%), Positives = 14/41 (34%), Gaps = 1/41 (2%)

Query: 128 WWMLNPDAKAGKSARRRATSMETSKFEKRRGRVKKKVEALR 168
                P  ++ ++A RRA   E  + E     + +      
Sbjct: 778 ELAGAPSDRSLRAAHRRAAEAER-QAESAERELARAARKAA 817


>gnl|CDD|221195 pfam11737, DUF3300, Protein of unknown function (DUF3300).  This
           hypothetical bacterial gene product has a long
           hydrophobic segment and is thus likely to be a membrane
           protein.
          Length = 230

 Score = 29.2 bits (66), Expect = 2.9
 Identities = 10/20 (50%), Positives = 11/20 (55%)

Query: 260 PYPSPPPPYGAPPPYPCPHH 279
           PYP+ PP Y  PPPY     
Sbjct: 156 PYPAYPPYYWPPPPYGAAAA 175


>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
           (CALCOCO1) like.  Proteins found in this family are
           similar to the coiled-coil transcriptional coactivator
           protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
           This protein binds to a highly conserved N-terminal
           domain of p160 coactivators, such as GRIP1, and thus
           enhances transcriptional activation by a number of
           nuclear receptors. CALCOCO1 has a central coiled-coil
           region with three leucine zipper motifs, which is
           required for its interaction with GRIP1 and may regulate
           the autonomous transcriptional activation activity of
           the C-terminal region.
          Length = 546

 Score = 29.2 bits (65), Expect = 3.9
 Identities = 14/52 (26%), Positives = 21/52 (40%), Gaps = 3/52 (5%)

Query: 222 PAQESPAWLPPYSNPYLGAEQLAGSLETTMKLSSDNFLPYPSPPPPYGAPPP 273
           P  ES + L P +   L  EQ   S     K + D  +  P+P  P+ +   
Sbjct: 492 PLDESSSELSPSA---LDEEQHNESTPKNPKENEDVVIQQPAPIQPFLSGIA 540


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 28.9 bits (65), Expect = 3.9
 Identities = 15/88 (17%), Positives = 26/88 (29%)

Query: 132 NPDAKAGKSARRRATSMETSKFEKRRGRVKKKVEALRNGITPSPSSSVSEIRTTDSVPHP 191
            P  K+G S+  + TS+      KR            +   P P  ++S +         
Sbjct: 226 TPSPKSGSSSPAKPTSILKKSAAKRSEAPSSSKAKKNSRGIPKPRDALSSLVVRKKAAPE 285

Query: 192 TLPPVGLIFSPDYRQRASSNASSCGRLS 219
           +         P      ++  SS   L 
Sbjct: 286 STSQSPSSAEPTSESPQTAGNSSLSSLG 313


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 28.9 bits (65), Expect = 4.3
 Identities = 26/150 (17%), Positives = 39/150 (26%), Gaps = 24/150 (16%)

Query: 160 VKKKVEALRNGITPSPSSSVSEIRTTDSVPHPTLPPVGLIFSPDYRQRASSNASSCGRLS 219
              + +   +  T +  +S        +   P+ PP      P +       +      S
Sbjct: 153 PLDEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPS------S 206

Query: 220 PIPAQESPAWLPPYSNPYLGAEQLAGSLETTMKLSSDNFLPYPSPPPPYGAPPPYPCPHH 279
           P  +   PA  P                       S      PSPPP   APPP P    
Sbjct: 207 PSDSSLPPA--PSSFQSDTPP-------------PSPESPTNPSPPPGPAAPPPPPVQQV 251

Query: 280 LVNSCNCHNHPAVKQQSISPSYHNSPSPSP 309
              S      P     S +P+     +   
Sbjct: 252 PPLS---TAKPTPPSASATPAPIGGITLDD 278


>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 624

 Score = 28.9 bits (64), Expect = 4.4
 Identities = 32/149 (21%), Positives = 46/149 (30%), Gaps = 27/149 (18%)

Query: 164 VEALRNGITPSPSSSVSEIRTTDSVPHPTLPPVGLIFSPDYRQRASSNASSCGRLSPIPA 223
           VE+LR     + + S S      S    T+P  G         + ++ A+     S  PA
Sbjct: 368 VESLRPSGGGASAPSGSAAEGPASGGAATIPTPGTQ-----GPQGTAPAAGMTPSSAAPA 422

Query: 224 QESPAWLPPYSNPYLGAEQLAGSLETTMKLSSDNFLPYPSPPPPYGAPPPYPCPHHLVNS 283
             +P+  P    P+  A                       P PP    PP P P     S
Sbjct: 423 TPAPSAAPSPRVPWDDA----------------------PPAPPRSGIPPRPAPRMPEAS 460

Query: 284 CNCHNHPAVKQQSISPSYHNSPSPSPPAT 312
                  +V   S +P     PS +   T
Sbjct: 461 PVPGAPDSVASASDAPPTLGDPSDTAEHT 489


>gnl|CDD|147580 pfam05474, Semenogelin, Semenogelin.  This family consists of
           several mammalian semenogelin (I and II) proteins.
           Freshly ejaculated human semen has the appearance of a
           loose gel in which the predominant structural protein
           components are the seminal vesicle secreted semenogelins
           (Sg).
          Length = 450

 Score = 28.5 bits (63), Expect = 5.3
 Identities = 25/117 (21%), Positives = 51/117 (43%), Gaps = 11/117 (9%)

Query: 96  RPANHYQNSIRHNLSLHSKFMRVQNEGTGKSSWWMLNPDAKAGKSARRRATSMETSKFEK 155
           R  NH +N ++ ++S  S   + + +  GKS   +  P  + G+ A +   S ++S  E+
Sbjct: 286 RRLNHGENGVQKDVSKGSISRQTEEKIMGKSQKQVTTPSQEQGQKANKI--SYQSSSTEE 343

Query: 156 RRGRVKKKVEALRNGITPSPSSSVSEIRTTDSVPHPTLPPVGLIFSPDYRQRASSNA 212
           RR    +K   ++  ++   +S  +E +  D            +  P   QR+  +A
Sbjct: 344 RRLNHGEKG--IQKDVSKGSTSKKTEEKIHDK-------SQNQVTIPSQDQRSGQDA 391


>gnl|CDD|223287 COG0209, NrdA, Ribonucleotide reductase, alpha subunit [Nucleotide
           transport and metabolism].
          Length = 651

 Score = 28.9 bits (65), Expect = 5.4
 Identities = 13/65 (20%), Positives = 25/65 (38%), Gaps = 9/65 (13%)

Query: 107 HNLSL----HSKFM-RVQNEGTGKSSWWMLNPDAKAGKSARRRATSMETSKFEKRRGRVK 161
            ++S+       FM R +        W + +P           A   E +++E   G  K
Sbjct: 248 FDISVGVWIPDLFMKRAKEGED----WTLFSPRDGYEAPDVYGAEFEEYAEYEADPGIRK 303

Query: 162 KKVEA 166
           K+++A
Sbjct: 304 KRIKA 308


>gnl|CDD|219094 pfam06583, Neogenin_C, Neogenin C-terminus.  This family represents
           the C-terminus of eukaryotic neogenin precursor
           proteins, which contains several potential
           phosphorylation sites. Neogenin is a member of the N-CAM
           family of cell adhesion molecules (and therefore
           contains multiple copies of pfam00047 and pfam00041) and
           is closely related to the DCC tumour suppressor gene
           product - these proteins may play an integral role in
           regulating differentiation programmes and/or cell
           migration events within many adult and embryonic
           tissues.
          Length = 295

 Score = 28.4 bits (63), Expect = 5.5
 Identities = 30/150 (20%), Positives = 43/150 (28%), Gaps = 17/150 (11%)

Query: 170 GITPSPSSSVS-EIRTTDSVPHPTLPPVGLIFSPDYRQRASSNASSCGRLSPIPAQESPA 228
           GI PSP         +   +P  TL    L    D  +              +     P 
Sbjct: 111 GILPSPRCGYLHHQFSLRPMPFSTLTVQRLYQHGDRAESVE----------SVRQTPEPP 160

Query: 229 WLPPYSNPYLGA--EQLAGSLETTMKLSSDNFLPY--PSPPPPYGA-PPPYPCPHHLVNS 283
           +LP   +    A  E  + S+ T     +     +  P+ P       P  P    L   
Sbjct: 161 YLPAAQSESSNAAEEAPSRSIPTAHVRPTHPLKSFAVPALPASMSTIEPKLPSTPLLTQQ 220

Query: 284 CNCHNHPAVKQQSISPSYHNSPSPSPPATV 313
                  +VK  S+        SP  P TV
Sbjct: 221 GPTLPKHSVKTASVGTLGRAR-SPLLPVTV 249


>gnl|CDD|215600 PLN03141, PLN03141, 3-epi-6-deoxocathasterone 23-monooxygenase;
           Provisional.
          Length = 452

 Score = 28.6 bits (64), Expect = 5.5
 Identities = 14/36 (38%), Positives = 18/36 (50%)

Query: 7   LRNGITPSPSSSVSESLDLFPEDSPIHFQLRCLKIT 42
           L+  IT      VSESLD + +D P+  Q    KI 
Sbjct: 118 LKAQITRDMERYVSESLDSWRDDPPVLVQDETKKIA 153


>gnl|CDD|216648 pfam01690, PLRV_ORF5, Potato leaf roll virus readthrough protein.
           This family consists mainly of the potato leaf roll
           virus readthrough protein. This is generated via a
           readthrough of open reading frame 3 a coat protein
           allowing transcription of open reading frame 5 to give
           an extended coat protein with a large c-terminal
           addition or read through domain. The readthrough protein
           is thought to play a role in the circulative aphid
           transmission of potato leaf roll virus. Also in the
           family is open reading frame 6 from beet western yellows
           virus and potato leaf roll virus both luteovirus and an
           unknown protein from cucurbit aphid-borne yellows virus
           a closterovirus.
          Length = 460

 Score = 28.1 bits (63), Expect = 7.0
 Identities = 10/19 (52%), Positives = 11/19 (57%)

Query: 260 PYPSPPPPYGAPPPYPCPH 278
           P P+PPPP   P P P  H
Sbjct: 11  PTPTPPPPAPTPEPTPAKH 29


>gnl|CDD|215379 PLN02708, PLN02708, Probable pectinesterase/pectinesterase
           inhibitor.
          Length = 553

 Score = 28.3 bits (63), Expect = 7.8
 Identities = 15/51 (29%), Positives = 20/51 (39%), Gaps = 2/51 (3%)

Query: 260 PYPSPPPPYGAPPPYPCPHHLVNSCNCHNHPAVKQQSISPSYHNSPSPSPP 310
             PSP PP   P     P  ++ +CN    P     S+S +    P P P 
Sbjct: 29  HTPSPSPP--PPSSPSTPPQILLACNATRFPDTCVSSLSNAGRVPPDPKPI 77


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 28.2 bits (63), Expect = 9.5
 Identities = 21/139 (15%), Positives = 30/139 (21%), Gaps = 15/139 (10%)

Query: 175 PSSSVSEIRTTDSVPHPTLPPVGLIFSPDYRQRASSNASSCGRLSPIPAQESPAWLPPYS 234
           P    +E    +S   PT     L  +   R+ + +         P P    PA  PP  
Sbjct: 72  PPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGP-SSPDPPPPTPPPASPPPSP 130

Query: 235 NPYLGAEQLAGSLETTMKLSSDNFLPYPSPPPPYGAPPPYPCPHHLVNSCNCHNHPAVKQ 294
            P L               +S              A                    A+  
Sbjct: 131 APDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASS--------------RQAALPL 176

Query: 295 QSISPSYHNSPSPSPPATV 313
            S   +     SP      
Sbjct: 177 SSPEETARAPSSPPAEPPP 195


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.314    0.131    0.405 

Gapped
Lambda     K      H
   0.267   0.0647    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,535,879
Number of extensions: 1632458
Number of successful extensions: 1638
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1504
Number of HSP's successfully gapped: 101
Length of query: 347
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 249
Effective length of database: 6,590,910
Effective search space: 1641136590
Effective search space used: 1641136590
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 59 (26.7 bits)