RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy12666
(347 letters)
>gnl|CDD|214627 smart00339, FH, FORKHEAD. FORKHEAD, also known as a "winged
helix".
Length = 89
Score = 62.6 bits (153), Expect = 1e-12
Identities = 22/45 (48%), Positives = 30/45 (66%), Gaps = 3/45 (6%)
Query: 95 FRPAN-HYQNSIRHNLSLHSKFMRVQNEGT--GKSSWWMLNPDAK 136
+R +QNSIRHNLSL+ F++V EG GK S+W L+P A+
Sbjct: 38 YRENRAGWQNSIRHNLSLNDCFVKVPREGDRPGKGSYWTLDPAAE 82
>gnl|CDD|238016 cd00059, FH, Forkhead (FH), also known as a "winged helix". FH is
named for the Drosophila fork head protein, a
transcription factor which promotes terminal rather than
segmental development. This family of transcription
factor domains, which bind to B-DNA as monomers, are
also found in the Hepatocyte nuclear factor (HNF)
proteins, which provide tissue-specific gene regulation.
The structure contains 2 flexible loops or "wings" in
the C-terminal region, hence the term winged helix.
Length = 78
Score = 57.6 bits (140), Expect = 6e-11
Identities = 18/33 (54%), Positives = 23/33 (69%), Gaps = 2/33 (6%)
Query: 102 QNSIRHNLSLHSKFMRVQNE--GTGKSSWWMLN 132
QNSIRHNLSL+ F++V E GK S+W L+
Sbjct: 46 QNSIRHNLSLNKCFVKVPREPDEPGKGSYWTLD 78
>gnl|CDD|189470 pfam00250, Fork_head, Fork head domain.
Length = 96
Score = 46.2 bits (110), Expect = 1e-06
Identities = 22/50 (44%), Positives = 33/50 (66%), Gaps = 6/50 (12%)
Query: 101 YQNSIRHNLSLHSKFMRV--QNEGTGKSSWWMLNPDAKA----GKSARRR 144
+QNSIRHNLSL+ F++V + GK S+W L+P+++ GK +RR
Sbjct: 45 WQNSIRHNLSLNKCFIKVPRSPDKPGKGSYWTLDPESENMFENGKYLKRR 94
>gnl|CDD|225723 COG3182, PiuB, Uncharacterized iron-regulated membrane protein
[Function unknown].
Length = 442
Score = 32.0 bits (73), Expect = 0.43
Identities = 15/90 (16%), Positives = 28/90 (31%), Gaps = 7/90 (7%)
Query: 172 TPSPSSSVSEIRTTDSVPHPTLPPVGLIFSPDYRQRASSNASSCGRLSPIPAQESPAWLP 231
P SV ++ +P ++ D S+ +S ++
Sbjct: 54 APERPLSVDQLVARAEEAYPGAAVWQVLPPDD------PGGSAAITVSEPGGAADAVYVD 107
Query: 232 PYSNPYLGA-EQLAGSLETTMKLSSDNFLP 260
PY+ +G G + +L SD L
Sbjct: 108 PYTGEVVGPLPPAGGLMNFMRELHSDLLLG 137
>gnl|CDD|227358 COG5025, COG5025, Transcription factor of the Forkhead/HNF3 family
[Transcription].
Length = 610
Score = 32.1 bits (73), Expect = 0.51
Identities = 28/92 (30%), Positives = 39/92 (42%), Gaps = 7/92 (7%)
Query: 102 QNSIRHNLSLHSKFMRVQNE--GTGKSSWWMLNPDAKAGKSARRRATSMETSKFEKRRGR 159
QNSIRHNLSL+ F +V GK +W ++ K ++R S K
Sbjct: 382 QNSIRHNLSLNKSFEKVPRSASQPGKGCFWKIDYSYIYEKESKRNPRS---PKKSPSAHS 438
Query: 160 VKKKVEALRNGITPSP--SSSVSEIRTTDSVP 189
V +K+ N + SP S S +S P
Sbjct: 439 VHQKLSLHVNDLYQSPATSDIASSSSQVNSQP 470
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
non-fungal. The approx. 70 residue Med15 domain of the
ARC-Mediator co-activator is a three-helix bundle with
marked similarity to the KIX domain. The sterol
regulatory element binding protein (SREBP) family of
transcription activators use the ARC105 subunit to
activate target genes in the regulation of cholesterol
and fatty acid homeostasis. In addition, Med15 is a
critical transducer of gene activation signals that
control early metazoan development.
Length = 768
Score = 31.9 bits (72), Expect = 0.58
Identities = 14/69 (20%), Positives = 18/69 (26%), Gaps = 4/69 (5%)
Query: 260 PYPSPP----PPYGAPPPYPCPHHLVNSCNCHNHPAVKQQSISPSYHNSPSPSPPATVMG 315
P P P P + P P L+ S + + Q SP
Sbjct: 435 PSPGGPGSQPPQSVSGGMIPSPPALMPSPSPQMSQSPASQRTIQQDMVSPGGPLNTPGQS 494
Query: 316 QVLSSLNSS 324
V S N
Sbjct: 495 SVNSPANPQ 503
>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein. The WWbp domain is
characterized by several short PY and PT-like motifs of
the PPPPY form. These appear to bind directly to the WW
domains of WWP1 and WWP2 and other such diverse proteins
as dystrophin and YAP (Yes-associated protein). This is
the WW-domain binding protein WWbp via PY and PY_like
motifs. The presence of a phosphotyrosine residue in the
pWBP-1 peptide abolishes WW domain binding which
suggests a potential regulatory role for tyrosine
phosphorylation in modulating WW domain-ligand
interactions. Given the likelihood that WWP1 and WWP2
function as E3 ubiquitin-protein ligases, it is possible
that initial substrate-specific recognition occurs via
WW domain-substrate protein interaction followed by
ubiquitin transfer and subsequent proteolysis. This
domain lies just downstream of the GRAM (pfam02893) in
many members.
Length = 111
Score = 29.3 bits (66), Expect = 1.0
Identities = 19/56 (33%), Positives = 21/56 (37%)
Query: 218 LSPIPAQESPAWLPPYSNPYLGAEQLAGSLETTMKLSSDNFLPYPSPPPPYGAPPP 273
L P+PA A PPY P G T +S PPPPY PP
Sbjct: 56 LEPLPAYGQYAAPPPYGPPPPYYPAPPGVYPTPPPPNSGYMADPQEPPPPYPGPPQ 111
>gnl|CDD|112890 pfam04094, DUF390, Protein of unknown function (DUF390). This is a
family of long proteins currently only found in the rice
genome. They have no known function. However they may be
some kind of transposable element.
Length = 843
Score = 30.9 bits (69), Expect = 1.1
Identities = 30/145 (20%), Positives = 51/145 (35%), Gaps = 15/145 (10%)
Query: 204 YRQRASSNASSCGRLSPIPAQESPAWLPPYSNPYLGAEQLAGSLETTMKLSSDNFLPYPS 263
+R ++ S R AQ +P + TT + + D
Sbjct: 290 FRADEAAATSEAARDEAAGAQLAPDPSGDAAA-------------TTSEAAGDEAAGALL 336
Query: 264 PPPPYGAPPPYPCPHHLVNSCNCHNHPAVKQQSISPSYHNSPSPSP-PATVMGQVLSSLN 322
P P G P P +S P+ + PS +P A + Q L++ N
Sbjct: 337 GPDPSGDAQDEPAPGGAPDSGTSIGGPS-RAAPSPRRLFPLPSAAPLNAEPLLQALAAAN 395
Query: 323 SSVLDDFNLNIESLQGFDCNVDEVY 347
++VLD + +E+LQ +D +
Sbjct: 396 TTVLDGLSAQMEALQAERAELDAAW 420
>gnl|CDD|215076 PLN00147, PLN00147, light-harvesting complex I chlorophyll-a/b
binding protein Lhca5; Provisional.
Length = 252
Score = 30.2 bits (68), Expect = 1.3
Identities = 14/30 (46%), Positives = 18/30 (60%)
Query: 207 RASSNASSCGRLSPIPAQESPAWLPPYSNP 236
SS A+S G ++P+ AQE P WLP P
Sbjct: 26 SISSVAASSGGINPVVAQERPTWLPGLDPP 55
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 30.7 bits (69), Expect = 1.5
Identities = 31/140 (22%), Positives = 46/140 (32%), Gaps = 22/140 (15%)
Query: 173 PSPSSSVSEIRTTDSVPHPTLPPVGLIFSPDYRQRASSNASSCGRLSPIPAQESPAWLPP 232
PSPS + +E P +P R A GR + +
Sbjct: 2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPG-RVSRPRRARRLGR-AAQASSPPQRPRRR 2686
Query: 233 YSNPYLGAEQLAGSLETTMKLSSDNFLPYPSPPPPYGAPPPYPCPHHLVNSCNCHNHPAV 292
+ P +G+ PPPP P P P PH LV++ PA
Sbjct: 2687 AARPTVGS-----------------LTSLADPPPP--PPTPEPAPHALVSATPLPPGPAA 2727
Query: 293 KQQSISPSYHNSPSPSPPAT 312
+Q+ SP+ +P+P
Sbjct: 2728 ARQA-SPALPAAPAPPAVPA 2746
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 30.4 bits (68), Expect = 1.7
Identities = 34/138 (24%), Positives = 47/138 (34%), Gaps = 8/138 (5%)
Query: 187 SVPHPTLP-PVGLIFSPDYRQRASSNASSCGRLSPIPAQESPAWLPPYSNPYLGAEQLAG 245
+PH PV L + A S P+P+Q P P S L +Q
Sbjct: 274 PMPHALQQGPVFLQHPSSNPPQPFGLAQSQVPPLPLPSQAQPHSHTPPSQSALQPQQPPR 333
Query: 246 SLETTMKLSSDNFLPYPSPPPPYGAPPPYPCPHHLVN-------SCNCHNHPAVKQQSIS 298
S + P P+ P P + P HL N PA+K S
Sbjct: 334 EQPLPPAPSMPHIKPPPTTPIPQLPNQSHKHPPHLQGPSPFPQMPSNLPPPPALKPLSSL 393
Query: 299 PSYHNSPSPSPPATVMGQ 316
P++H + PP +M Q
Sbjct: 394 PTHHPPSAHPPPLQLMPQ 411
>gnl|CDD|219569 pfam07777, MFMR, G-box binding protein MFMR. This region is found
to the N-terminus of the pfam00170 transcription factor
domain. It is between 150 and 200 amino acids in length.
The N-terminal half is rather rich in proline residues
and has been termed the PRD (proline rich domain),
whereas the C-terminal half is more polar and has been
called the MFMR (multifunctional mosaic region). It has
been suggested that this family is composed of three
sub-families called A, B and C, classified according to
motif composition. It has been suggested that some of
these motifs may be involved in mediating
protein-protein interactions. The MFMR region contains a
nuclear localisation signal in bZIP opaque and GBF-2.
The MFMR also contains a transregulatory activity in
TAF-1. The MFMR in CPRF-2 contains cytoplasmic retention
signals.
Length = 189
Score = 29.4 bits (66), Expect = 1.8
Identities = 23/82 (28%), Positives = 34/82 (41%), Gaps = 8/82 (9%)
Query: 265 PPPYGAPPPYP--CPHHLVNSCNCHNHPAVKQQSISPSYHNSPSPSPPATVMGQVLSSLN 322
PPYG PPPY P V + HP++ S S + PS P + + +
Sbjct: 69 MPPYGTPPPYAAMYPPGGV-----YAHPSMPPGSHPFSPYAMPSAEVPGSTPLSMETDAK 123
Query: 323 SSVLDDFNLNIESLQGFDCNVD 344
SS D I+ +G D ++
Sbjct: 124 SSDNKDKGS-IKKSKGSDGSLG 144
>gnl|CDD|218535 pfam05281, Secretogranin_V, Neuroendocrine protein 7B2 precursor
(Secretogranin V). The neuroendocrine protein 7B2 has a
critical role in the proteolytic conversion and
activation of proPC2, the enzyme responsible for the
proteolytic conversion of many peptide hormone
precursors. The 7B2 protein acts as an intracellular
binding protein for proPC2, facilitates its maturation,
and is required for its enzymatic activity. Processing
of many important peptide precursors does not occur in
7B2 nulls. 7B2 null mice exhibit a unique form of
Cushing's disease with many atypical symptoms, such as
hypoglycemia.
Length = 210
Score = 29.5 bits (66), Expect = 1.8
Identities = 13/33 (39%), Positives = 15/33 (45%)
Query: 277 PHHLVNSCNCHNHPAVKQQSISPSYHNSPSPSP 309
HL N N AVK P+Y N P+P P
Sbjct: 74 LQHLGPYGNIPNIVAVKTDDNLPAYCNPPNPCP 106
>gnl|CDD|235960 PRK07199, PRK07199, phosphoribosylpyrophosphate synthetase;
Provisional.
Length = 301
Score = 29.5 bits (67), Expect = 2.2
Identities = 6/14 (42%), Positives = 12/14 (85%)
Query: 179 VSEIRTTDSVPHPT 192
++ + +TD+VPHP+
Sbjct: 264 IARVVSTDTVPHPS 277
>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein. Members of this
protein family belong to a conserved gene four-gene
neighborhood found sporadically in a phylogenetically
broad range of bacteria: Nocardia farcinica,
Symbiobacterium thermophilum, and Streptomyces
avermitilis (Actinobacteria), Geobacillus kaustophilus
(Firmicutes), Azoarcus sp. EbN1 and Ralstonia
solanacearum (Betaproteobacteria). Proteins in this
family average over 1400 amino acids in length
[Hypothetical proteins, Conserved].
Length = 1353
Score = 29.8 bits (67), Expect = 2.4
Identities = 7/41 (17%), Positives = 14/41 (34%), Gaps = 1/41 (2%)
Query: 128 WWMLNPDAKAGKSARRRATSMETSKFEKRRGRVKKKVEALR 168
P ++ ++A RRA E + E + +
Sbjct: 778 ELAGAPSDRSLRAAHRRAAEAER-QAESAERELARAARKAA 817
>gnl|CDD|221195 pfam11737, DUF3300, Protein of unknown function (DUF3300). This
hypothetical bacterial gene product has a long
hydrophobic segment and is thus likely to be a membrane
protein.
Length = 230
Score = 29.2 bits (66), Expect = 2.9
Identities = 10/20 (50%), Positives = 11/20 (55%)
Query: 260 PYPSPPPPYGAPPPYPCPHH 279
PYP+ PP Y PPPY
Sbjct: 156 PYPAYPPYYWPPPPYGAAAA 175
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of
the C-terminal region.
Length = 546
Score = 29.2 bits (65), Expect = 3.9
Identities = 14/52 (26%), Positives = 21/52 (40%), Gaps = 3/52 (5%)
Query: 222 PAQESPAWLPPYSNPYLGAEQLAGSLETTMKLSSDNFLPYPSPPPPYGAPPP 273
P ES + L P + L EQ S K + D + P+P P+ +
Sbjct: 492 PLDESSSELSPSA---LDEEQHNESTPKNPKENEDVVIQQPAPIQPFLSGIA 540
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 28.9 bits (65), Expect = 3.9
Identities = 15/88 (17%), Positives = 26/88 (29%)
Query: 132 NPDAKAGKSARRRATSMETSKFEKRRGRVKKKVEALRNGITPSPSSSVSEIRTTDSVPHP 191
P K+G S+ + TS+ KR + P P ++S +
Sbjct: 226 TPSPKSGSSSPAKPTSILKKSAAKRSEAPSSSKAKKNSRGIPKPRDALSSLVVRKKAAPE 285
Query: 192 TLPPVGLIFSPDYRQRASSNASSCGRLS 219
+ P ++ SS L
Sbjct: 286 STSQSPSSAEPTSESPQTAGNSSLSSLG 313
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 28.9 bits (65), Expect = 4.3
Identities = 26/150 (17%), Positives = 39/150 (26%), Gaps = 24/150 (16%)
Query: 160 VKKKVEALRNGITPSPSSSVSEIRTTDSVPHPTLPPVGLIFSPDYRQRASSNASSCGRLS 219
+ + + T + +S + P+ PP P + + S
Sbjct: 153 PLDEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPS------S 206
Query: 220 PIPAQESPAWLPPYSNPYLGAEQLAGSLETTMKLSSDNFLPYPSPPPPYGAPPPYPCPHH 279
P + PA P S PSPPP APPP P
Sbjct: 207 PSDSSLPPA--PSSFQSDTPP-------------PSPESPTNPSPPPGPAAPPPPPVQQV 251
Query: 280 LVNSCNCHNHPAVKQQSISPSYHNSPSPSP 309
S P S +P+ +
Sbjct: 252 PPLS---TAKPTPPSASATPAPIGGITLDD 278
>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 624
Score = 28.9 bits (64), Expect = 4.4
Identities = 32/149 (21%), Positives = 46/149 (30%), Gaps = 27/149 (18%)
Query: 164 VEALRNGITPSPSSSVSEIRTTDSVPHPTLPPVGLIFSPDYRQRASSNASSCGRLSPIPA 223
VE+LR + + S S S T+P G + ++ A+ S PA
Sbjct: 368 VESLRPSGGGASAPSGSAAEGPASGGAATIPTPGTQ-----GPQGTAPAAGMTPSSAAPA 422
Query: 224 QESPAWLPPYSNPYLGAEQLAGSLETTMKLSSDNFLPYPSPPPPYGAPPPYPCPHHLVNS 283
+P+ P P+ A P PP PP P P S
Sbjct: 423 TPAPSAAPSPRVPWDDA----------------------PPAPPRSGIPPRPAPRMPEAS 460
Query: 284 CNCHNHPAVKQQSISPSYHNSPSPSPPAT 312
+V S +P PS + T
Sbjct: 461 PVPGAPDSVASASDAPPTLGDPSDTAEHT 489
>gnl|CDD|147580 pfam05474, Semenogelin, Semenogelin. This family consists of
several mammalian semenogelin (I and II) proteins.
Freshly ejaculated human semen has the appearance of a
loose gel in which the predominant structural protein
components are the seminal vesicle secreted semenogelins
(Sg).
Length = 450
Score = 28.5 bits (63), Expect = 5.3
Identities = 25/117 (21%), Positives = 51/117 (43%), Gaps = 11/117 (9%)
Query: 96 RPANHYQNSIRHNLSLHSKFMRVQNEGTGKSSWWMLNPDAKAGKSARRRATSMETSKFEK 155
R NH +N ++ ++S S + + + GKS + P + G+ A + S ++S E+
Sbjct: 286 RRLNHGENGVQKDVSKGSISRQTEEKIMGKSQKQVTTPSQEQGQKANKI--SYQSSSTEE 343
Query: 156 RRGRVKKKVEALRNGITPSPSSSVSEIRTTDSVPHPTLPPVGLIFSPDYRQRASSNA 212
RR +K ++ ++ +S +E + D + P QR+ +A
Sbjct: 344 RRLNHGEKG--IQKDVSKGSTSKKTEEKIHDK-------SQNQVTIPSQDQRSGQDA 391
>gnl|CDD|223287 COG0209, NrdA, Ribonucleotide reductase, alpha subunit [Nucleotide
transport and metabolism].
Length = 651
Score = 28.9 bits (65), Expect = 5.4
Identities = 13/65 (20%), Positives = 25/65 (38%), Gaps = 9/65 (13%)
Query: 107 HNLSL----HSKFM-RVQNEGTGKSSWWMLNPDAKAGKSARRRATSMETSKFEKRRGRVK 161
++S+ FM R + W + +P A E +++E G K
Sbjct: 248 FDISVGVWIPDLFMKRAKEGED----WTLFSPRDGYEAPDVYGAEFEEYAEYEADPGIRK 303
Query: 162 KKVEA 166
K+++A
Sbjct: 304 KRIKA 308
>gnl|CDD|219094 pfam06583, Neogenin_C, Neogenin C-terminus. This family represents
the C-terminus of eukaryotic neogenin precursor
proteins, which contains several potential
phosphorylation sites. Neogenin is a member of the N-CAM
family of cell adhesion molecules (and therefore
contains multiple copies of pfam00047 and pfam00041) and
is closely related to the DCC tumour suppressor gene
product - these proteins may play an integral role in
regulating differentiation programmes and/or cell
migration events within many adult and embryonic
tissues.
Length = 295
Score = 28.4 bits (63), Expect = 5.5
Identities = 30/150 (20%), Positives = 43/150 (28%), Gaps = 17/150 (11%)
Query: 170 GITPSPSSSVS-EIRTTDSVPHPTLPPVGLIFSPDYRQRASSNASSCGRLSPIPAQESPA 228
GI PSP + +P TL L D + + P
Sbjct: 111 GILPSPRCGYLHHQFSLRPMPFSTLTVQRLYQHGDRAESVE----------SVRQTPEPP 160
Query: 229 WLPPYSNPYLGA--EQLAGSLETTMKLSSDNFLPY--PSPPPPYGA-PPPYPCPHHLVNS 283
+LP + A E + S+ T + + P+ P P P L
Sbjct: 161 YLPAAQSESSNAAEEAPSRSIPTAHVRPTHPLKSFAVPALPASMSTIEPKLPSTPLLTQQ 220
Query: 284 CNCHNHPAVKQQSISPSYHNSPSPSPPATV 313
+VK S+ SP P TV
Sbjct: 221 GPTLPKHSVKTASVGTLGRAR-SPLLPVTV 249
>gnl|CDD|215600 PLN03141, PLN03141, 3-epi-6-deoxocathasterone 23-monooxygenase;
Provisional.
Length = 452
Score = 28.6 bits (64), Expect = 5.5
Identities = 14/36 (38%), Positives = 18/36 (50%)
Query: 7 LRNGITPSPSSSVSESLDLFPEDSPIHFQLRCLKIT 42
L+ IT VSESLD + +D P+ Q KI
Sbjct: 118 LKAQITRDMERYVSESLDSWRDDPPVLVQDETKKIA 153
>gnl|CDD|216648 pfam01690, PLRV_ORF5, Potato leaf roll virus readthrough protein.
This family consists mainly of the potato leaf roll
virus readthrough protein. This is generated via a
readthrough of open reading frame 3 a coat protein
allowing transcription of open reading frame 5 to give
an extended coat protein with a large c-terminal
addition or read through domain. The readthrough protein
is thought to play a role in the circulative aphid
transmission of potato leaf roll virus. Also in the
family is open reading frame 6 from beet western yellows
virus and potato leaf roll virus both luteovirus and an
unknown protein from cucurbit aphid-borne yellows virus
a closterovirus.
Length = 460
Score = 28.1 bits (63), Expect = 7.0
Identities = 10/19 (52%), Positives = 11/19 (57%)
Query: 260 PYPSPPPPYGAPPPYPCPH 278
P P+PPPP P P P H
Sbjct: 11 PTPTPPPPAPTPEPTPAKH 29
>gnl|CDD|215379 PLN02708, PLN02708, Probable pectinesterase/pectinesterase
inhibitor.
Length = 553
Score = 28.3 bits (63), Expect = 7.8
Identities = 15/51 (29%), Positives = 20/51 (39%), Gaps = 2/51 (3%)
Query: 260 PYPSPPPPYGAPPPYPCPHHLVNSCNCHNHPAVKQQSISPSYHNSPSPSPP 310
PSP PP P P ++ +CN P S+S + P P P
Sbjct: 29 HTPSPSPP--PPSSPSTPPQILLACNATRFPDTCVSSLSNAGRVPPDPKPI 77
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 28.2 bits (63), Expect = 9.5
Identities = 21/139 (15%), Positives = 30/139 (21%), Gaps = 15/139 (10%)
Query: 175 PSSSVSEIRTTDSVPHPTLPPVGLIFSPDYRQRASSNASSCGRLSPIPAQESPAWLPPYS 234
P +E +S PT L + R+ + + P P PA PP
Sbjct: 72 PPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGP-SSPDPPPPTPPPASPPPSP 130
Query: 235 NPYLGAEQLAGSLETTMKLSSDNFLPYPSPPPPYGAPPPYPCPHHLVNSCNCHNHPAVKQ 294
P L +S A A+
Sbjct: 131 APDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASS--------------RQAALPL 176
Query: 295 QSISPSYHNSPSPSPPATV 313
S + SP
Sbjct: 177 SSPEETARAPSSPPAEPPP 195
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.314 0.131 0.405
Gapped
Lambda K H
0.267 0.0647 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,535,879
Number of extensions: 1632458
Number of successful extensions: 1638
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1504
Number of HSP's successfully gapped: 101
Length of query: 347
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 249
Effective length of database: 6,590,910
Effective search space: 1641136590
Effective search space used: 1641136590
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 59 (26.7 bits)