RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy8702
(194 letters)
>gnl|CDD|222150 pfam13465, zf-H2C2_2, Zinc-finger double domain.
Length = 26
Score = 38.5 bits (90), Expect = 6e-05
Identities = 14/26 (53%), Positives = 17/26 (65%)
Query: 43 NLKTHLLTHTDHKPYECNSCGKVFRR 68
NL+ H+ THT KPY+C CGK F
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
>gnl|CDD|200998 pfam00096, zf-C2H2, Zinc finger, C2H2 type. The C2H2 zinc finger
is the classical zinc finger domain. The two conserved
cysteines and histidines co-ordinate a zinc ion. The
following pattern describes the zinc finger.
#-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can
be any amino acid, and numbers in brackets indicate the
number of residues. The positions marked # are those
that are important for the stable fold of the zinc
finger. The final position can be either his or cys.
The C2H2 zinc finger is composed of two short beta
strands followed by an alpha helix. The amino terminal
part of the helix binds the major groove in DNA binding
zinc fingers. The accepted consensus binding sequence
for Sp1 is usually defined by the asymmetric
hexanucleotide core GGGCGG but this sequence does not
include, among others, the GAG (=CTC) repeat that
constitutes a high-affinity site for Sp1 binding to the
wt1 promoter.
Length = 22
Score = 33.1 bits (76), Expect = 0.005
Identities = 11/22 (50%), Positives = 14/22 (63%)
Query: 58 ECNSCGKVFRRNCDLRRHALTH 79
+C CGK F R +L+RH TH
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
>gnl|CDD|197676 smart00355, ZnF_C2H2, zinc finger.
Length = 23
Score = 31.7 bits (72), Expect = 0.015
Identities = 12/23 (52%), Positives = 13/23 (56%)
Query: 57 YECNSCGKVFRRNCDLRRHALTH 79
Y C CGKVF+ LR H TH
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
>gnl|CDD|206065 pfam13894, zf-C2H2_4, C2H2-type zinc finger. This family
contains a number of divergent C2H2 type zinc fingers.
Length = 24
Score = 31.1 bits (70), Expect = 0.029
Identities = 9/23 (39%), Positives = 12/23 (52%)
Query: 57 YECNSCGKVFRRNCDLRRHALTH 79
++C CGK F L+RH H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
>gnl|CDD|109943 pfam00906, Hepatitis_core, Hepatitis core antigen. The core
antigen of hepatitis viruses possesses a carboxyl
terminus rich in arginine. On this basis it was
predicted that the core antigen would bind DNA. There is
some experimental evidence to support this.
Length = 182
Score = 32.1 bits (73), Expect = 0.11
Identities = 19/43 (44%), Positives = 20/43 (46%), Gaps = 14/43 (32%)
Query: 136 GRRSRSRTPSPPLRSHSPISIFRRHQSPSPIGIYRRHSQSPQP 178
GR R RTPSP RR +S SP RR SQSP
Sbjct: 152 GRSPRRRTPSP-----------RRRRSQSP---RRRRSQSPSS 180
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 31.4 bits (71), Expect = 0.35
Identities = 18/82 (21%), Positives = 30/82 (36%), Gaps = 5/82 (6%)
Query: 103 PPAPSPTGSQSSEDIEIERTPSEERSLEPS-DMLGRRSRSRTPSPPLRSHSPISIFRRHQ 161
PP+PSP ++ P E +P+ + R R+R ++ SP RR
Sbjct: 2628 PPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRA 2687
Query: 162 SPSPIG----IYRRHSQSPQPA 179
+ +G + P P
Sbjct: 2688 ARPTVGSLTSLADPPPPPPTPE 2709
>gnl|CDD|222997 PHA03132, PHA03132, thymidine kinase; Provisional.
Length = 580
Score = 30.1 bits (68), Expect = 0.73
Identities = 20/101 (19%), Positives = 28/101 (27%), Gaps = 5/101 (4%)
Query: 92 ESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPSEERSLEPSDMLGRRSRSRTPSPPLRSH 151
S D+L P I P R E + + PP +
Sbjct: 44 SEATSEDDDDLYPPRETGSGGGVATSTIYTVPRPPRGPEQTLDKPDSLPASRELPPGPTP 103
Query: 152 SPISIFRRHQSPSPIGIYRRHSQSPQPAHSSSIPFRHHSPI 192
P FR SP S SP+ + + +PI
Sbjct: 104 VPPGGFRGASSPRL----GADSTSPRFLYQVN-FPVILAPI 139
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 30.1 bits (68), Expect = 0.77
Identities = 20/99 (20%), Positives = 29/99 (29%), Gaps = 6/99 (6%)
Query: 88 SSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPSEERSLEPSDMLGRRSRSRTPSPP 147
S+ + + + S + S SSE P E L R + P+
Sbjct: 216 SASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPL------PRPAPITLPTRI 269
Query: 148 LRSHSPISIFRRHQSPSPIGIYRRHSQSPQPAHSSSIPF 186
+ R S R S SP P+ S P
Sbjct: 270 WEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPA 308
>gnl|CDD|177288 PHA00616, PHA00616, hypothetical protein.
Length = 44
Score = 27.4 bits (61), Expect = 0.85
Identities = 9/29 (31%), Positives = 15/29 (51%), Gaps = 1/29 (3%)
Query: 57 YECNSCGKVFRRNCDLRRHAL-THAVGDV 84
Y+C CG +FR+ ++ H L H +
Sbjct: 2 YQCLRCGGIFRKKKEVIEHLLSVHKQNKL 30
>gnl|CDD|221734 pfam12722, Hid1, High-temperature-induced dauer-formation protein.
Hid1 (high-temperature-induced dauer-formation protein
1) represents proteins of approximately 800 residues
long and is conserved from fungi to humans. It contains
up to seven potential transmembrane domains separated by
regions of low complexity. Functionally it might be
involved in vesicle secretion or be an inter-cellular
signalling protein or be a novel insulin receptor.
Length = 813
Score = 29.3 bits (66), Expect = 1.4
Identities = 17/89 (19%), Positives = 33/89 (37%), Gaps = 4/89 (4%)
Query: 72 LRRHALTHAVGDVPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPSEERSLEP 131
LR + + ++ +SS E S + + + +P+ S + P E +SL
Sbjct: 562 LRHRKVFDQLRNLILDSSQEEDERSNQSASGSLSDNPSNDNDSRSPSLSEVPEENKSLAI 621
Query: 132 ----SDMLGRRSRSRTPSPPLRSHSPISI 156
S S +PP + P+ +
Sbjct: 622 TDDFDPASRENSTSEAAAPPSVNSVPLQL 650
>gnl|CDD|218276 pfam04814, HNF-1_N, Hepatocyte nuclear factor 1 (HNF-1), N
terminus. This family consists of the N terminus of
homeobox-containing transcription factor HNF-1. This
region contains a dimerisation sequence and an acidic
region that may be involved in transcription activation.
Mutations and the common Ala/Val 98 polymorphism in
HNF-1 cause the type 3 form of maturity-onset diabetes
of the young (MODY3).
Length = 177
Score = 28.3 bits (63), Expect = 1.8
Identities = 22/83 (26%), Positives = 37/83 (44%), Gaps = 15/83 (18%)
Query: 72 LRRHALTHAVGDVPPESSLAESIGSLKRDNL------------PPAPSPTGSQSSEDIEI 119
L + AL A+G++ PE + ES+ +L+ NL + ++SED E
Sbjct: 21 LSKEALIQALGELEPEPEVGESVITLQHLNLDKGESCHALPNGLGEGRLSEDETSEDGED 80
Query: 120 ERTP---SEERSLEPSDMLGRRS 139
+R P E L P + +R+
Sbjct: 81 DRPPPILKELERLSPEEAAHQRA 103
>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor. The
spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
U5) and proteins, catalyzes the excision of introns from
pre-mRNAs in two successive trans-esterification
reactions. Step 2 depends upon integral spliceosome
constituents such as U5 snRNA and Prp8 and
non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
conformational change in the spliceosome that leads to
protection of the 3'ss from targeted RNase H cleavage.
This change, which probably reflects binding of the 3'ss
PyAG in the catalytic centre of the spliceosome,
requires the ordered recruitment of Slu7, Prp18, and
Prp22 to the spliceosome. There is a close functional
relationship between Prp8, Prp18, and Slu7, and Prp18
interacts with Slu7, so that together they recruit Prp22
to the spliceosome. Most members of the family carry a
zinc-finger of the CCHC-type upstream of this domain.
Length = 236
Score = 28.6 bits (64), Expect = 1.8
Identities = 14/36 (38%), Positives = 17/36 (47%), Gaps = 3/36 (8%)
Query: 114 SEDIEIERTPSEERSLEPSDMLGRRSRSRTPSPPLR 149
D E E EE L+ SDM S++RT LR
Sbjct: 46 DTDYEDE---DEELYLDESDMGDSDSKTRTTVRNLR 78
>gnl|CDD|219406 pfam07420, DUF1509, Protein of unknown function (DUF1509). This
family consists of several uncharacterized viral
proteins from the Marek's disease-like viruses. Members
of this family are typically around 400 residues in
length. The function of this family is unknown.
Length = 377
Score = 28.5 bits (63), Expect = 2.7
Identities = 23/87 (26%), Positives = 32/87 (36%), Gaps = 13/87 (14%)
Query: 99 RDNLPPAPSPTGSQSSEDIEIERTPSEERSLEPSDMLGRRSRSRTPSPPLRSHSPISIFR 158
PP P P+G S RTP+ ++ D +R PSPP
Sbjct: 196 AAQRPPQPGPSGQNRSP-----RTPTLS-NVRVLDAPVATNRGEAPSPPRTDTL------ 243
Query: 159 RHQSPSPIGIYRRHSQSPQPAHSSSIP 185
P+ G R +++P P SS P
Sbjct: 244 -DPDPAIAGPSRAVNRTPSPRPSSPPP 269
>gnl|CDD|215194 PLN02340, PLN02340, endoglucanase.
Length = 614
Score = 28.3 bits (63), Expect = 2.8
Identities = 15/58 (25%), Positives = 27/58 (46%), Gaps = 4/58 (6%)
Query: 138 RSRSRTPSPPLRSHSPIS-IFRRHQSPSPIGIYRRHSQSPQPAHSSSIPFRHHSPIKF 194
RS P L ++P+ +F + QS S S +P ++S P + +P++F
Sbjct: 470 RSNYEQTEPTLSGNAPLVGLFAKLQSASETADA---YGSYKPTPNTSSPKQSGAPVEF 524
>gnl|CDD|199820 cd10495, MH2_R-SMAD, C-terminal Mad Homology 2 (MH2) domain in
receptor regulated SMADs. The MH2 domain is located at
the C-terminus of the SMAD (small mothers against
decapentaplegic) family of proteins, which are signal
transducers and transcriptional modulators that mediate
multiple signaling pathways. The MH2 domain is
responsible for type I receptor interaction,
phosphorylation-triggered homo- and
hetero-oligomerization, and transactivation. It is
negatively regulated by the N-terminal MH1 domain.
Receptor regulated SMADs (R-SMADs) include SMAD1, SMAD2,
SMAD3, SMAD5 and SMAD9. SMAD1 plays an essential role in
bone development and postnatal bone formation through
activation by bone morphogenetic protein (BMP) type 1
receptor kinase. SMAD2 regulates multiple cellular
processes, such as cell proliferation, apoptosis and
differentiation, while SMAD3 modulates signals of
activin and TGF-beta. SMAD5 is involved in BMP signal
modulation, possibly playing a role in the pathway
involving inhibition of hematopoietic progenitor cells
by TGF-beta. SMAD9 (also known as SMAD8) can mediate the
differentiation of mesenchymal stem cells into
tendon-like cells by inhibiting the osteogenic pathway.
Length = 182
Score = 27.7 bits (62), Expect = 3.4
Identities = 22/69 (31%), Positives = 30/69 (43%), Gaps = 12/69 (17%)
Query: 35 LLTNV----LTSNLKTHL-----LTHTDHKPY-ECNSCGKVF--RRNCDLRRHALTHAVG 82
LL+NV N + H+ L + + Y EC S +F RNC+LR V
Sbjct: 43 LLSNVNRNATIENTRRHIGRGVHLFYVGGEVYAECLSDSAIFVQSRNCNLRHGFHPATVC 102
Query: 83 DVPPESSLA 91
+PP SL
Sbjct: 103 KIPPGCSLK 111
>gnl|CDD|234570 PRK00009, PRK00009, phosphoenolpyruvate carboxylase; Reviewed.
Length = 911
Score = 27.8 bits (63), Expect = 4.6
Identities = 13/33 (39%), Positives = 16/33 (48%), Gaps = 1/33 (3%)
Query: 73 RRHALTHAVGDVPPESSLAESIGSLKRDNLPPA 105
RR HA G P SLAE++ LK + P
Sbjct: 93 RRRRE-HASGSQPQPGSLAETLRRLKAAGVSPE 124
>gnl|CDD|144655 pfam01141, Gag_p12, Gag polyprotein, inner coat protein p12. The
retroviral p12 is a virion structural protein. p12 is
proline rich. The function carried out by p12 in
assembly and replication is unknown. p12 is associated
with pathogenicity of the virus.
Length = 85
Score = 26.0 bits (57), Expect = 5.4
Identities = 19/81 (23%), Positives = 32/81 (39%), Gaps = 4/81 (4%)
Query: 76 ALTHAVGDVPPESSLAESIGS----LKRDNLPPAPSPTGSQSSEDIEIERTPSEERSLEP 131
ALT ++ PP+ + G L ++ PP P S + + + E + + +P
Sbjct: 2 ALTPSIKPKPPKPQVLPDSGGPLIDLLTEDPPPYRGPGPSPPAGEGDEEEAATTSEAPDP 61
Query: 132 SDMLGRRSRSRTPSPPLRSHS 152
S M R R P + S
Sbjct: 62 SPMASRLRGRREPPVADSTTS 82
>gnl|CDD|235357 PRK05177, minC, septum formation inhibitor; Reviewed.
Length = 239
Score = 26.8 bits (60), Expect = 6.0
Identities = 16/54 (29%), Positives = 24/54 (44%), Gaps = 9/54 (16%)
Query: 101 NLPPAPSPTGSQSSEDIEIERTPSEERSLEPSDMLGRRSRSRTPS----PPLRS 150
+PPA G + + D+EI P EE + P + + PS P+RS
Sbjct: 94 GMPPAL--VGGRPAGDVEI---PEEEPAAPPPAPAAPEAPAAVPSLVIEEPVRS 142
>gnl|CDD|201791 pfam01428, zf-AN1, AN1-like Zinc finger. Zinc finger at the
C-terminus of An1, a ubiquitin-like protein in Xenopus
laevis. The following pattern describes the zinc
finger. C-X2-C-X(9-12)-C-X(1-2)-C-X4-C-X2-H-X5-H-X-C
Where X can be any amino acid, and numbers in brackets
indicate the number of residues.
Length = 43
Score = 24.7 bits (54), Expect = 6.6
Identities = 8/24 (33%), Positives = 10/24 (41%), Gaps = 3/24 (12%)
Query: 56 PYECNSCGKVFRRNCDLRRHALTH 79
P++C CG F C R H
Sbjct: 13 PFKCRFCGGTF---CLKHRLPEDH 33
>gnl|CDD|235290 PRK04351, PRK04351, hypothetical protein; Provisional.
Length = 149
Score = 26.4 bits (59), Expect = 6.7
Identities = 9/18 (50%), Positives = 11/18 (61%), Gaps = 3/18 (16%)
Query: 57 YECNSCGKVFRRNCDLRR 74
YEC SCG+ + R RR
Sbjct: 113 YECQSCGQQYLRK---RR 127
>gnl|CDD|227785 COG5498, ACF2, Predicted glycosyl hydrolase [Cell envelope
biogenesis, outer membrane].
Length = 760
Score = 27.1 bits (60), Expect = 6.9
Identities = 15/48 (31%), Positives = 19/48 (39%), Gaps = 3/48 (6%)
Query: 131 PSDMLGRRSRSRTPSPPLRSH--SPISIFRRHQSPSPIGIYRRHSQSP 176
S + + S PP+ S +P S FR Q P P R S P
Sbjct: 43 HSVSVPKGD-SNVEFPPILSGTINPPSTFRSVQHPVPPPGVRSDSTGP 89
>gnl|CDD|219953 pfam08648, DUF1777, Protein of unknown function (DUF1777). This is
a family of eukaryotic proteins of unknown function.
Some of the proteins in this family are putative nucleic
acid binding proteins.
Length = 158
Score = 26.4 bits (58), Expect = 7.1
Identities = 19/42 (45%), Positives = 21/42 (50%), Gaps = 4/42 (9%)
Query: 124 SEERSLEPSDMLGRRSRSRTPSPPLRSHSPISIFRRHQSPSP 165
ERS RRSRSR+P RS SP RRH+S S
Sbjct: 27 RRERSRSRERDRRRRSRSRSPHRSRRSRSP----RRHRSRSR 64
>gnl|CDD|215346 PLN02643, PLN02643, ADP-glucose phosphorylase.
Length = 336
Score = 26.6 bits (59), Expect = 8.8
Identities = 13/46 (28%), Positives = 18/46 (39%), Gaps = 5/46 (10%)
Query: 145 SPPLRSHSPIS----IFRRHQSPSPIGIYRRHSQSPQPAHSSSIPF 186
LR P++ IF + P + Q+P HSS PF
Sbjct: 1 MAELR-KDPVTNRWVIFSPARGKRPTDFKSKSPQNPNGNHSSGCPF 45
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 26.9 bits (59), Expect = 9.1
Identities = 26/94 (27%), Positives = 33/94 (35%), Gaps = 19/94 (20%)
Query: 104 PAPSPTGSQSSEDIEIERTPSEERSLEPSD-MLGRRSRSRTP--------SPPLRSH--- 151
P P + D E S++ D GRR R R+P SPP R
Sbjct: 230 PTSDPEDEEDELDDVEEVIESDDHFFLDLDGERGRRRRKRSPRTSPKRFRSPPPRKARGR 289
Query: 152 SPISIFRRHQSPSPIGIYRRHSQSPQPAHSSSIP 185
SP + R P +SP P H+S P
Sbjct: 290 SPRRLIRSPPPP-------GRLRSPPPLHASDSP 316
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.313 0.129 0.387
Gapped
Lambda K H
0.267 0.0647 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,585,664
Number of extensions: 849921
Number of successful extensions: 735
Number of sequences better than 10.0: 1
Number of HSP's gapped: 716
Number of HSP's successfully gapped: 86
Length of query: 194
Length of database: 10,937,602
Length adjustment: 92
Effective length of query: 102
Effective length of database: 6,857,034
Effective search space: 699417468
Effective search space used: 699417468
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 56 (25.5 bits)