RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy11336
(194 letters)
>gnl|CDD|233514 TIGR01658, EYA-cons_domain, eyes absent protein conserved domain.
This domain is common to all eyes absent (EYA) homologs.
Metazoan EYA's also contain a variable N-terminal domain
consisting largely of low-complexity sequences.
Length = 274
Score = 168 bits (428), Expect = 2e-52
Identities = 48/135 (35%), Positives = 76/135 (56%), Gaps = 20/135 (14%)
Query: 54 LDRVFIWDLDETIIIFHSLLTGSYATKHG--KCPQTVIQLGYKMEEMIFGVADNHFFFNE 111
+ V++WD+DET+I+ HSLL GSYA K + +++G + EEMI + D HFF+ E
Sbjct: 1 PENVYVWDMDETLILLHSLLNGSYAESFNGSKDHKRGVEIGRRWEEMILEICDTHFFYEE 60
Query: 112 VEDCDQVHIDDISTDDNGQDLNGYNFSADGFRASVLNVPSGPPGLCVASGVRGGVDWMRK 171
+E+C++ +DD+ + D+G+DL+ Y F DGF + L RK
Sbjct: 61 IEECNEPFLDDVRSYDDGKDLSRYEFKTDGFS-------TPTDDLN-----------KRK 102
Query: 172 LAFRYRKIKDIYNNY 186
LA+R+R + +IY
Sbjct: 103 LAYRHRAVAEIYEKG 117
>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 700
Score = 33.3 bits (76), Expect = 0.072
Identities = 9/48 (18%), Positives = 11/48 (22%)
Query: 1 MSPYLEVGGSTSSRGRGRKSSASASNASSAGTPTPIIPNPCPSPGSPA 48
SP E + A A + A P P A
Sbjct: 426 RSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVA 473
>gnl|CDD|218803 pfam05904, DUF863, Plant protein of unknown function (DUF863).
This family consists of a number of hypothetical
proteins from Arabidopsis thaliana and Oryza sativa. The
function of this family is unknown.
Length = 766
Score = 30.4 bits (68), Expect = 0.60
Identities = 15/46 (32%), Positives = 19/46 (41%)
Query: 9 GSTSSRGRGRKSSASASNASSAGTPTPIIPNPCPSPGSPAGDSRIL 54
G RGR+ S + N +P P PCPS P + R L
Sbjct: 705 GRRKGGPRGRRRSVTIINEEVEVSPPPAPAPPCPSLAQPMANDRSL 750
>gnl|CDD|221717 pfam12694, MoCo_carrier, Putative molybdenum carrier. The
structure of proteins in this family contain central
beta strands with flanking alpha helices. The structure
is similar to that of a molybdenum cofactor carrier
protein.
Length = 145
Score = 29.5 bits (67), Expect = 0.66
Identities = 14/40 (35%), Positives = 20/40 (50%), Gaps = 7/40 (17%)
Query: 59 IWDLDETIIIFH------SLLTGSYATKHGKCPQTVIQLG 92
+ D D T+I+ + LT +A KHGK P V+ L
Sbjct: 61 VRDSDGTLILTRGELSGGTALTVEFARKHGK-PLLVVDLD 99
>gnl|CDD|226439 COG3926, zliS, Lysozyme family protein [General function prediction
only].
Length = 252
Score = 29.0 bits (65), Expect = 1.2
Identities = 12/51 (23%), Positives = 19/51 (37%)
Query: 37 IPNPCPSPGSPAGDSRILDRVFIWDLDETIIIFHSLLTGSYATKHGKCPQT 87
+P P P P G + + D D LL+G Y ++ P +
Sbjct: 176 VPQPAPFPNGGQGKAVVDDAPSTAPADLATGATGGLLSGLYDLQNQLSPLS 226
>gnl|CDD|234428 TIGR03979, His_Ser_Rich, His-Xaa-Ser repeat protein HxsA. Members
of this protein share two defining regions. One is a
histidine/serine-rich cluster, typically
H-R-S-H-S-S-H-R-S-H-S-S-H. Members are found always in
the context of a pair of radical SAM proteins, HxsB and
HxsC, and a fourth protein HxsD. The system is predicted
to perform peptide modifications, likely in the
His-Xaa-Ser region, to produce some uncharacterized
natural product.
Length = 186
Score = 28.7 bits (64), Expect = 1.4
Identities = 12/42 (28%), Positives = 15/42 (35%)
Query: 10 STSSRGRGRKSSASASNASSAGTPTPIIPNPCPSPGSPAGDS 51
S SS + + + S T T P P PS G S
Sbjct: 61 SHSSHSSHYSGAGGSYSVPSGDTSTYSYPVPSPSYSPSPGSS 102
>gnl|CDD|226918 COG4544, COG4544, Uncharacterized conserved protein [Function
unknown].
Length = 260
Score = 28.6 bits (64), Expect = 1.8
Identities = 13/39 (33%), Positives = 15/39 (38%), Gaps = 6/39 (15%)
Query: 16 RGRKSSASASNASSAGT------PTPIIPNPCPSPGSPA 48
R R + A A +A P P P P P G PA
Sbjct: 168 RRRAAEAEAFGQPTAAVSRWRISPRPSEPLPVPGVGRPA 206
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of AF4
(Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain. Lilliputian
represents a novel pair-rule gene that acts in
cytoskeleton regulation, segmentation and morphogenesis
in Drosophila.
Length = 1154
Score = 28.7 bits (64), Expect = 2.5
Identities = 15/40 (37%), Positives = 20/40 (50%), Gaps = 9/40 (22%)
Query: 19 KSSASASNASS-------AGTPTPIIPNPCPSPGSPAGDS 51
K+S+ + A S GTP+P+ PN PSP S G
Sbjct: 1028 KNSSKVAQAPSPWVGSKSTGTPSPVSPN--PSPASSVGSQ 1065
>gnl|CDD|236394 PRK09169, PRK09169, hypothetical protein; Validated.
Length = 2316
Score = 28.1 bits (63), Expect = 3.2
Identities = 12/40 (30%), Positives = 16/40 (40%)
Query: 8 GGSTSSRGRGRKSSASASNASSAGTPTPIIPNPCPSPGSP 47
GGS + GR R + A+ P I +PGS
Sbjct: 1149 GGSRLTPGRTRVFELLLAGAAKRYGRLPAIRYADTAPGSA 1188
>gnl|CDD|179460 PRK02693, PRK02693, apocytochrome f; Reviewed.
Length = 312
Score = 28.0 bits (63), Expect = 3.3
Identities = 16/39 (41%), Positives = 21/39 (53%), Gaps = 9/39 (23%)
Query: 7 VGGSTSSRGRGR------KSSASASNASSAGTPTPIIPN 39
VGG+ RGRG+ KS+ + AS+AGT T I
Sbjct: 178 VGGN---RGRGQVYPTGEKSNNNVFTASAAGTITSIETG 213
>gnl|CDD|118685 pfam10157, DUF2365, Uncharacterized conserved protein (DUF2365).
This is a family of conserved proteins found from
nematodes to humans. The function is unknown.
Length = 149
Score = 27.5 bits (61), Expect = 3.3
Identities = 12/37 (32%), Positives = 18/37 (48%)
Query: 18 RKSSASASNASSAGTPTPIIPNPCPSPGSPAGDSRIL 54
R SS+ +S S++ P + PSP P D +L
Sbjct: 18 RLSSSRSSTPSASMGPITLTRGFLPSPNIPPIDPSVL 54
>gnl|CDD|234030 TIGR02834, spo_ytxC, putative sporulation protein YtxC. This
uncharacterized protein is part of a panel of proteins
conserved in all known endospore-forming Firmicutes
(low-GC Gram-positive bacteria), including
Carboxydothermus hydrogenoformans, and nowhere else
[Cellular processes, Sporulation and germination].
Length = 276
Score = 27.6 bits (62), Expect = 4.3
Identities = 7/49 (14%), Positives = 22/49 (44%), Gaps = 3/49 (6%)
Query: 93 YKMEEMIFGVADNHFFFNEVEDCDQVH---IDDISTDDNGQDLNGYNFS 138
+K +E+I + + ++F + E+ +Q+ ++ + Y +
Sbjct: 74 HKEDELILKIIEESYYFTDQEEIEQILAIANSILTGERKDDPSEIYKMN 122
>gnl|CDD|117486 pfam08919, F_actin_bind, F-actin binding. The F-actin binding
domain forms a compact bundle of four antiparallel
alpha-helices, which are arranged in a left-handed
topology. Binding of F-actin to the F-actin binding
domain may result in cytoplasmic retention and
subcellular distribution of the protein, as well as
possible inhibition of protein function.
Length = 179
Score = 27.4 bits (60), Expect = 4.7
Identities = 13/39 (33%), Positives = 16/39 (41%), Gaps = 2/39 (5%)
Query: 23 SASNASSAGTPTPIIPNP--CPSPGSPAGDSRILDRVFI 59
S A GTP +P P PSP A ++ FI
Sbjct: 14 PQSTAKPVGTPPSPVPLPSTSPSPSKMANGTQPSSAAFI 52
>gnl|CDD|140329 PTZ00308, PTZ00308, ethanolamine-phosphate cytidylyltransferase;
Provisional.
Length = 353
Score = 27.4 bits (61), Expect = 5.3
Identities = 12/36 (33%), Positives = 20/36 (55%), Gaps = 6/36 (16%)
Query: 102 VADNHFFFNEVEDCDQ------VHIDDISTDDNGQD 131
V + + + +ED ++ VH DDIS D NG++
Sbjct: 78 VVEGYPYTTRLEDLERLECDFVVHGDDISVDLNGRN 113
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 27.4 bits (61), Expect = 5.8
Identities = 14/51 (27%), Positives = 21/51 (41%)
Query: 2 SPYLEVGGSTSSRGRGRKSSASASNASSAGTPTPIIPNPCPSPGSPAGDSR 52
S SS + S+S+S +SS+ + +P PSP SR
Sbjct: 305 SGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSR 355
Score = 27.1 bits (60), Expect = 7.4
Identities = 14/48 (29%), Positives = 24/48 (50%), Gaps = 5/48 (10%)
Query: 10 STSSRGRGRKSSASASNASS-----AGTPTPIIPNPCPSPGSPAGDSR 52
++SS R+SS+S++++SS A P+ PSP P +
Sbjct: 314 ASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPAD 361
>gnl|CDD|216205 pfam00937, Corona_nucleoca, Coronavirus nucleocapsid protein.
Length = 346
Score = 27.4 bits (61), Expect = 5.8
Identities = 16/40 (40%), Positives = 20/40 (50%), Gaps = 10/40 (25%)
Query: 13 SRGRGRKSSASASNASSAGTPTPIIPNPCPSPGSPAGDSR 52
RGR R SS S+S ++S G PS GS +SR
Sbjct: 152 FRGRSRSSSRSSSRSNSRG----------PSRGSSRNNSR 181
>gnl|CDD|218107 pfam04484, DUF566, Family of unknown function (DUF566). Family of
related proteins that is plant specific.
Length = 313
Score = 27.2 bits (60), Expect = 6.0
Identities = 15/51 (29%), Positives = 21/51 (41%), Gaps = 4/51 (7%)
Query: 7 VGGSTSSR----GRGRKSSASASNASSAGTPTPIIPNPCPSPGSPAGDSRI 53
G SSR RG S+A++ N S A T +P S + + D
Sbjct: 73 SRGRLSSRFVSPSRGSPSAAASLNGSLATASTSGSSSPSRSRRTTSSDLSS 123
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 27.0 bits (59), Expect = 7.6
Identities = 10/41 (24%), Positives = 14/41 (34%)
Query: 8 GGSTSSRGRGRKSSASASNASSAGTPTPIIPNPCPSPGSPA 48
S G S A + ++G I P P +PA
Sbjct: 437 THPHSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPTSTPA 477
>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682). This
domain family is found in eukaryotes, and is typically
between 125 and 136 amino acids in length.
Length = 133
Score = 25.9 bits (57), Expect = 8.8
Identities = 17/42 (40%), Positives = 21/42 (50%), Gaps = 2/42 (4%)
Query: 10 STSSRGRGRKSSASASNASSAGTPTPIIPNPCPSPGSPAGDS 51
ST G G SS S++ A AG PN P+P +P DS
Sbjct: 1 STGGDGTGGVSSGSSAPAPPAGPGPG--PNAPPAPAAPGVDS 40
>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
Validated.
Length = 830
Score = 26.7 bits (59), Expect = 9.4
Identities = 13/43 (30%), Positives = 20/43 (46%), Gaps = 4/43 (9%)
Query: 13 SRGRGRKSSASA-SNASSAGTPTPIIPNP---CPSPGSPAGDS 51
S RG +++A+A A+ A P P P P+P + A
Sbjct: 564 SSDRGARAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRARAATG 606
>gnl|CDD|216368 pfam01213, CAP_N, Adenylate cyclase associated (CAP) N terminal.
Length = 313
Score = 26.7 bits (59), Expect = 9.5
Identities = 11/44 (25%), Positives = 18/44 (40%)
Query: 4 YLEVGGSTSSRGRGRKSSASASNASSAGTPTPIIPNPCPSPGSP 47
+ G + S +G + ++ SS+ P P P P P P
Sbjct: 202 HHTTGLTWSKKGPVAAAKSALPAVSSSAPSAPPPPPPPPPPSVP 245
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.137 0.425
Gapped
Lambda K H
0.267 0.0757 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,992,094
Number of extensions: 919461
Number of successful extensions: 955
Number of sequences better than 10.0: 1
Number of HSP's gapped: 946
Number of HSP's successfully gapped: 26
Length of query: 194
Length of database: 10,937,602
Length adjustment: 92
Effective length of query: 102
Effective length of database: 6,857,034
Effective search space: 699417468
Effective search space used: 699417468
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 56 (25.3 bits)