RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy17674
(302 letters)
>gnl|CDD|216339 pfam01167, Tub, Tub family.
Length = 243
Score = 241 bits (617), Expect = 1e-79
Identities = 85/128 (66%), Positives = 106/128 (82%)
Query: 169 DTNVLGFKGPRKMTVLIPGMTQSYERVSITPLEISETLIECWNSKNMDNIIELHNKTPTW 228
+TNVLGF+GPRKMTV++PG+ S+ERV I PL E+L+ W +KN ++++ LHNK P W
Sbjct: 116 ETNVLGFRGPRKMTVIMPGIPSSHERVPIQPLNDQESLLSRWQNKNKESLLVLHNKAPRW 175
Query: 229 NEDTQSYVLNFHGRVTQASVKNFQIIHDNDPEYIVLQFGRTSEDVFTMDYRYPLCPLQAF 288
N+ QSYVLNFHGRVTQASVKNFQI+H +DP+YIVLQFGR ++D+FTMDYRYPL QAF
Sbjct: 176 NDQLQSYVLNFHGRVTQASVKNFQIVHASDPDYIVLQFGRVAKDMFTMDYRYPLSAFQAF 235
Query: 289 SIALSSFD 296
+I LSSFD
Sbjct: 236 AICLSSFD 243
Score = 66.3 bits (162), Expect = 1e-12
Identities = 23/38 (60%), Positives = 31/38 (81%)
Query: 131 PAAQKVSYKCRITRDRKGVDRGLYPTYFLHLERDYGKK 168
PA + + +CRITRD+ G+D GL+PTY+LHLER+ GKK
Sbjct: 1 PAPRGGTVQCRITRDKSGMDYGLFPTYYLHLERENGKK 38
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 33.6 bits (77), Expect = 0.12
Identities = 13/25 (52%), Positives = 15/25 (60%)
Query: 67 PDRSSQDDEDDDDDDEDDIEQELSD 91
D + D DDDDDED IE +L D
Sbjct: 247 IDSDDEGDGSDDDDDEDAIESDLDD 271
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 32.0 bits (73), Expect = 0.27
Identities = 10/27 (37%), Positives = 12/27 (44%)
Query: 63 GSGYPDRSSQDDEDDDDDDEDDIEQEL 89
S S D+D DDDD +D L
Sbjct: 121 DSDSSSDSDSSDDDSDDDDSEDETAAL 147
Score = 30.8 bits (70), Expect = 0.65
Identities = 14/37 (37%), Positives = 18/37 (48%)
Query: 52 RTQASTDTSDYGSGYPDRSSQDDEDDDDDDEDDIEQE 88
R + D + + D SS D DDD D+DD E E
Sbjct: 107 RKRIEEDARNSDADDSDSSSDSDSSDDDSDDDDSEDE 143
Score = 30.8 bits (70), Expect = 0.74
Identities = 14/36 (38%), Positives = 17/36 (47%), Gaps = 3/36 (8%)
Query: 60 SDYGSGYPDRSSQDDEDDDDDDEDDIEQ---ELSDI 92
+D D S DD+ DDDD ED+ EL I
Sbjct: 119 ADDSDSSSDSDSSDDDSDDDDSEDETAALLRELEKI 154
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 31.1 bits (71), Expect = 0.36
Identities = 8/21 (38%), Positives = 16/21 (76%)
Query: 68 DRSSQDDEDDDDDDEDDIEQE 88
D +++EDD++DD++D +E
Sbjct: 117 DEEDEEEEDDEEDDDEDESEE 137
Score = 30.4 bits (69), Expect = 0.55
Identities = 8/31 (25%), Positives = 18/31 (58%)
Query: 63 GSGYPDRSSQDDEDDDDDDEDDIEQELSDIS 93
D +D+E++DD+++DD ++ + S
Sbjct: 110 EEDESDDDEEDEEEEDDEEDDDEDESEEEES 140
Score = 28.4 bits (64), Expect = 2.7
Identities = 9/23 (39%), Positives = 16/23 (69%)
Query: 73 DDEDDDDDDEDDIEQELSDISLH 95
DD+++D+++EDD E + D S
Sbjct: 115 DDDEEDEEEEDDEEDDDEDESEE 137
Score = 28.1 bits (63), Expect = 3.6
Identities = 7/30 (23%), Positives = 17/30 (56%)
Query: 59 TSDYGSGYPDRSSQDDEDDDDDDEDDIEQE 88
D + ++D+++DDD+++ E+E
Sbjct: 110 EEDESDDDEEDEEEEDDEEDDDEDESEEEE 139
>gnl|CDD|225618 COG3076, COG3076, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 135
Score = 30.6 bits (69), Expect = 0.51
Identities = 11/23 (47%), Positives = 19/23 (82%)
Query: 62 YGSGYPDRSSQDDEDDDDDDEDD 84
+G+ + D +++DD+DD+D DEDD
Sbjct: 108 WGTYFEDPNAEDDDDDEDVDEDD 130
>gnl|CDD|184209 PRK13650, cbiO, cobalt transporter ATP-binding subunit;
Provisional.
Length = 279
Score = 30.9 bits (70), Expect = 0.63
Identities = 14/29 (48%), Positives = 17/29 (58%), Gaps = 3/29 (10%)
Query: 215 MDNIIELHNKTPTWNEDTQSYVLN---FH 240
M NIIE+ N T + ED + Y LN FH
Sbjct: 1 MSNIIEVKNLTFKYKEDQEKYTLNDVSFH 29
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 31.1 bits (70), Expect = 0.72
Identities = 16/41 (39%), Positives = 21/41 (51%)
Query: 68 DRSSQDDEDDDDDDEDDIEQELSDISLHRGSHVISDPDEIN 108
D DDED+DDDD+DD E + + H S D+ N
Sbjct: 59 DDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGN 99
>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein. This family consists of several
NOA36 proteins which contain 29 highly conserved
cysteine residues. The function of this protein is
unknown.
Length = 314
Score = 31.1 bits (70), Expect = 0.74
Identities = 12/61 (19%), Positives = 20/61 (32%)
Query: 50 FTRTQASTDTSDYGSGYPDRSSQDDEDDDDDDEDDIEQELSDISLHRGSHVISDPDEINS 109
+ + +S D+ S DDE +D D+ E D + + DE
Sbjct: 252 YKQNLSSNKYGDFKGVNYGYESDDDEGSSSNDYDEEEDGDDDDNEDNDDTNTNHGDEDCD 311
Query: 110 G 110
Sbjct: 312 N 312
>gnl|CDD|223507 COG0430, RCL1, RNA 3'-terminal phosphate cyclase [RNA processing
and modification].
Length = 341
Score = 29.9 bits (68), Expect = 1.7
Identities = 15/66 (22%), Positives = 25/66 (37%), Gaps = 20/66 (30%)
Query: 90 SDISLHRGSHVISDPDEINSGSLPIEMQGEVVGNLERFVLEPAAQKVSYKCRITRDRKGV 149
S I++ G+ V P P++ R V P +K+ +C + +
Sbjct: 114 SRITVTGGTDVPWAP--------PVD--------YIRRVTLPVLRKMGIECELEVLK--- 154
Query: 150 DRGLYP 155
RG YP
Sbjct: 155 -RGFYP 159
>gnl|CDD|223032 PHA03283, PHA03283, envelope glycoprotein E; Provisional.
Length = 542
Score = 29.5 bits (66), Expect = 2.2
Identities = 15/71 (21%), Positives = 24/71 (33%), Gaps = 3/71 (4%)
Query: 44 YNGPLQFTRTQASTDTSDYGSGYPDRSSQDDEDDDDD--DEDDIEQELSDISLHRGSHVI 101
Y F S ++D +R + D +D D DE+ + G +V
Sbjct: 433 YEVLNPFETVYTSVPSNDPEVLVFERLASDSDDSFDSSSDEELEPPPPPGPAGQLGPYVS 492
Query: 102 SDP-DEINSGS 111
D D + S
Sbjct: 493 LDAEDAFSGRS 503
>gnl|CDD|219293 pfam07093, SGT1, SGT1 protein. This family consists of several
eukaryotic SGT1 proteins. Human SGT1 or hSGT1 is known
to suppress GCR2 and is highly expressed in the muscle
and heart. The function of this family is unknown
although it has been speculated that SGT1 may be
functionally analogous to the Gcr2p protein of
Saccharomyces cerevisiae which is known to be a
regulatory factor of glycolytic gene expression.
Length = 557
Score = 29.7 bits (67), Expect = 2.3
Identities = 15/34 (44%), Positives = 21/34 (61%), Gaps = 1/34 (2%)
Query: 61 DYGSGYPDRSSQDDEDDDDDDEDDIEQELSDISL 94
+ + PD S DEDDD+DD++D E SD +L
Sbjct: 476 EIDNDLPDDSDDADEDDDEDDDED-EDSSSDSTL 508
>gnl|CDD|238761 cd01484, E1-2_like, Ubiquitin activating enzyme (E1), repeat
2-like. E1, a highly conserved small protein present
universally in eukaryotic cells, is part of cascade to
attach ubiquitin (Ub) covalently to substrate proteins.
This cascade consists of activating (E1), conjugating
(E2), and/or ligating (E3) enzymes and then targets them
for degradation by the 26S proteasome. E1 activates
ubiquitin by C-terminal adenylation, and subsequently
forms a highly reactive thioester bond between its
catalytic cysteine and ubiquitin's C-terminus. E1 also
associates with E2 and promotes ubiquitin transfer to
the E2's catalytic cysteine. A set of novel molecules
with a structural similarity to Ub, called Ub-like
proteins (Ubls), have similar conjugation cascades. In
contrast to ubiquitin-E1, which is a single-chain
protein with a weakly conserved two-fold repeat, many of
the Ubls-E1are a heterodimer where each subunit
corresponds to one half of a single-chain E1. This CD
represents the family homologous to the second repeat of
Ub-E1.
Length = 234
Score = 29.1 bits (65), Expect = 2.4
Identities = 10/25 (40%), Positives = 15/25 (60%), Gaps = 2/25 (8%)
Query: 169 DTNVLGFKGPRKMTVLIPGMTQSYE 193
++ GFKG V++PGMT+ E
Sbjct: 121 ESGTEGFKG--NAQVILPGMTECIE 143
>gnl|CDD|221965 pfam13186, DUF4008, Domain of unknown function (DUF4008). This
presumed domain is functionally uncharacterized. This
domain family is found in bacteria and archaea, and is
approximately 70 amino acids in length. The family is
found in association with pfam04055.
Length = 65
Score = 27.0 bits (60), Expect = 2.4
Identities = 6/23 (26%), Positives = 10/23 (43%)
Query: 202 ISETLIECWNSKNMDNIIELHNK 224
I ++L E WNS+ +
Sbjct: 32 IKDSLKEIWNSEKYREFRKAVLD 54
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 29.6 bits (67), Expect = 2.4
Identities = 12/41 (29%), Positives = 21/41 (51%)
Query: 66 YPDRSSQDDEDDDDDDEDDIEQELSDISLHRGSHVISDPDE 106
+ D++D++DD+DD+E+E D+ L D D
Sbjct: 323 EEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDS 363
>gnl|CDD|240400 PTZ00396, PTZ00396, Casein kinase II subunit beta; Provisional.
Length = 251
Score = 29.2 bits (66), Expect = 2.6
Identities = 10/25 (40%), Positives = 15/25 (60%)
Query: 68 DRSSQDDEDDDDDDEDDIEQELSDI 92
D DDED +D+ ++ QE SD+
Sbjct: 69 DSEPPDDEDLEDEQFLEVYQEASDL 93
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 29.2 bits (66), Expect = 2.8
Identities = 13/31 (41%), Positives = 15/31 (48%)
Query: 58 DTSDYGSGYPDRSSQDDEDDDDDDEDDIEQE 88
D S+DDEDDDDDD+DD
Sbjct: 130 DEESDEEDEESSKSEDDEDDDDDDDDDDIAT 160
Score = 28.0 bits (63), Expect = 7.1
Identities = 13/34 (38%), Positives = 18/34 (52%)
Query: 54 QASTDTSDYGSGYPDRSSQDDEDDDDDDEDDIEQ 87
+ SD +S D++DDDDDD+DDI
Sbjct: 127 DDDDEESDEEDEESSKSEDDEDDDDDDDDDDIAT 160
>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region. The
domain is found in the primary vegetative sigma factor.
The function of this domain is unclear and can be
removed without loss of function.
Length = 211
Score = 28.7 bits (65), Expect = 2.9
Identities = 9/16 (56%), Positives = 15/16 (93%)
Query: 73 DDEDDDDDDEDDIEQE 88
DD+DDD+D+++D E+E
Sbjct: 55 DDDDDDEDEDEDDEEE 70
Score = 28.7 bits (65), Expect = 3.1
Identities = 9/23 (39%), Positives = 16/23 (69%)
Query: 68 DRSSQDDEDDDDDDEDDIEQELS 90
D DD+D+D+D++D+ E +L
Sbjct: 52 DLEDDDDDDEDEDEDDEEEADLG 74
Score = 27.2 bits (61), Expect = 8.8
Identities = 7/24 (29%), Positives = 16/24 (66%)
Query: 68 DRSSQDDEDDDDDDEDDIEQELSD 91
++D +DDDDD++D +++ +
Sbjct: 46 SELDEEDLEDDDDDDEDEDEDDEE 69
>gnl|CDD|234846 PRK00844, glgC, glucose-1-phosphate adenylyltransferase;
Provisional.
Length = 407
Score = 29.0 bits (66), Expect = 3.3
Identities = 12/30 (40%), Positives = 20/30 (66%), Gaps = 3/30 (10%)
Query: 250 NFQIIHDNDPEYIVLQFGRTSEDVFTMDYR 279
+ +I D DP+Y+V+ FG ++ V+ MD R
Sbjct: 108 SLNLIEDEDPDYVVV-FG--ADHVYRMDPR 134
>gnl|CDD|223009 PHA03211, PHA03211, serine/threonine kinase US3; Provisional.
Length = 461
Score = 28.7 bits (64), Expect = 3.7
Identities = 10/20 (50%), Positives = 15/20 (75%)
Query: 66 YPDRSSQDDEDDDDDDEDDI 85
YPD ++ ++DDDDD DD+
Sbjct: 84 YPDSGAEAEDDDDDDAPDDV 103
>gnl|CDD|238278 cd00498, Hsp33, Heat shock protein 33 (Hsp33): Cytosolic protein
that acts as a molecular chaperone under oxidative
conditions. In normal (reducing) cytosolic conditions,
four conserved Cys residues are coordinated by a Zn ion.
Under oxidative stress (such as heat shock), the Cys
are reversibly oxidized to disulfide bonds, which causes
the chaperone activity to be turned on. Hsp33 is
homodimeric in its functional form.
Length = 275
Score = 28.7 bits (65), Expect = 3.9
Identities = 15/71 (21%), Positives = 26/71 (36%), Gaps = 10/71 (14%)
Query: 76 DDDDDDEDDIEQELSDISLHRGSHVISDPDEINSGSLPIEMQGEVVGNLERFVLEPAAQK 135
D++D D E + + G P E+ + E +LE Q
Sbjct: 178 GADEEDIDAWE--------KVIKLMPTVSALELLGLSPEELLYRLFHEEEVRILEK--QP 227
Query: 136 VSYKCRITRDR 146
V ++C +R+R
Sbjct: 228 VRFRCDCSRER 238
>gnl|CDD|218003 pfam04281, Tom22, Mitochondrial import receptor subunit Tom22.
The mitochondrial protein translocase family, which is
responsible for movement of nuclear encoded
pre-proteins into mitochondria, is very complex with at
least 19 components. These proteins include several
chaperone proteins, four proteins of the outer membrane
translocase (Tom) import receptor, five proteins of the
Tom channel complex, five proteins of the inner
membrane translocase (Tim) and three "motor" proteins.
This family represents the Tom22 proteins. The N
terminal region of Tom22 has been shown to have
chaperone-like activity, and the C terminal region
faces the intermembrane face.
Length = 136
Score = 27.6 bits (62), Expect = 4.0
Identities = 11/28 (39%), Positives = 19/28 (67%)
Query: 66 YPDRSSQDDEDDDDDDEDDIEQELSDIS 93
P +Q++ DDDD+D+ D + ++SD S
Sbjct: 18 APKNLAQEESDDDDEDDTDTDSDISDDS 45
Score = 26.8 bits (60), Expect = 8.4
Identities = 10/39 (25%), Positives = 12/39 (30%)
Query: 50 FTRTQASTDTSDYGSGYPDRSSQDDEDDDDDDEDDIEQE 88
F A+ D D D D D+ D E E
Sbjct: 12 FQEKPAAPKNLAQEESDDDDEDDTDTDSDISDDSDFENE 50
>gnl|CDD|235394 PRK05297, PRK05297, phosphoribosylformylglycinamidine synthase;
Provisional.
Length = 1290
Score = 29.0 bits (66), Expect = 4.3
Identities = 16/62 (25%), Positives = 25/62 (40%), Gaps = 2/62 (3%)
Query: 78 DDDDEDDIEQELSDISLHRGSHVISDPDEINSGSLPIEMQGEVVGNLERFVLEPAAQKVS 137
D D +E L++ L HVI P+ + I G+ V + R L + S
Sbjct: 928 RAADRDAVEAILAEHGLSDCVHVIGKPNA--GDRIVITRNGKTVFSESRTELRRWWSETS 985
Query: 138 YK 139
Y+
Sbjct: 986 YQ 987
>gnl|CDD|177868 PLN02222, PLN02222, phosphoinositide phospholipase C 2.
Length = 581
Score = 28.8 bits (64), Expect = 4.3
Identities = 10/19 (52%), Positives = 14/19 (73%)
Query: 68 DRSSQDDEDDDDDDEDDIE 86
+ S+ DD+DDDDD ED +
Sbjct: 291 NDSNGDDDDDDDDGEDKSK 309
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 28.9 bits (64), Expect = 4.5
Identities = 15/42 (35%), Positives = 23/42 (54%)
Query: 47 PLQFTRTQASTDTSDYGSGYPDRSSQDDEDDDDDDEDDIEQE 88
+ R A D S + D +D+++DDDD+EDD E+E
Sbjct: 133 KRRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEE 174
>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
This family consists of several Nucleopolyhedrovirus
capsid protein P87 sequences. P87 is expressed late in
infection and concentrated in infected cell nuclei.
Length = 606
Score = 28.7 bits (64), Expect = 4.6
Identities = 13/23 (56%), Positives = 17/23 (73%)
Query: 67 PDRSSQDDEDDDDDDEDDIEQEL 89
P+ SS +DEDD D+DE D E+E
Sbjct: 367 PEYSSDEDEDDSDEDEVDYEKER 389
>gnl|CDD|239451 cd03335, TCP1_alpha, TCP-1 (CTT or eukaryotic type II) chaperonin
family, alpha subunit. Chaperonins are involved in
productive folding of proteins. They share a common
general morphology, a double toroid of 2 stacked rings.
In contrast to bacterial group I chaperonins (GroEL),
each ring of the eukaryotic cytosolic chaperonin (CTT)
consists of eight different, but homologous subunits.
Their common function is to sequester nonnative proteins
inside their central cavity and promote folding by using
energy derived from ATP hydrolysis. The best studied in
vivo substrates of CTT are actin and tubulin.
Length = 527
Score = 28.4 bits (64), Expect = 4.8
Identities = 14/44 (31%), Positives = 21/44 (47%), Gaps = 1/44 (2%)
Query: 93 SLHRGSHVISDPDEINSGSLPIEMQGEVVGNLERFVLEPAAQKV 136
+ H + V D + L + + G+V NLE VLEP K+
Sbjct: 461 AYHAAAQVKPDKKHLKWYGLDL-INGKVRDNLEAGVLEPTVSKI 503
>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A. The
CAF-1 or chromatin assembly factor-1 consists of three
subunits, and this is the first, or A. The A domain is
uniquely required for the progression of S phase in
mouse cells, independent of its ability to promote
histone deposition but dependent on its ability to
interact with HP1 - heterochromatin protein 1-rich
heterochromatin domains next to centromeres that are
crucial for chromosome segregation during mitosis. This
HP1-CAF-1 interaction module functions as a built-in
replication control for heterochromatin, which, like a
control barrier, has an impact on S-phase progression
in addition to DNA-based checkpoints.
Length = 76
Score = 26.4 bits (59), Expect = 5.0
Identities = 9/17 (52%), Positives = 15/17 (88%)
Query: 68 DRSSQDDEDDDDDDEDD 84
D S+D+ED+++DD+DD
Sbjct: 56 DLESEDEEDEEEDDDDD 72
>gnl|CDD|151714 pfam11273, DUF3073, Protein of unknown function (DUF3073). This
family of proteins with unknown function appears to be
restricted to Actinobacteria.
Length = 64
Score = 26.1 bits (58), Expect = 5.1
Identities = 16/37 (43%), Positives = 19/37 (51%), Gaps = 7/37 (18%)
Query: 56 STDTSDYG------SGYPDRSSQDDEDDDDDDEDDIE 86
S DT D SG RSS DD+D +DD DD +
Sbjct: 20 SPDT-DLDALQRELSGGSSRSSADDDDQYEDDYDDYD 55
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 28.5 bits (64), Expect = 5.3
Identities = 7/24 (29%), Positives = 15/24 (62%)
Query: 68 DRSSQDDEDDDDDDEDDIEQELSD 91
S +D+++D D +E+D + + D
Sbjct: 322 SVSEEDEDEDSDSEEEDEDDDEDD 345
>gnl|CDD|215628 PLN03196, PLN03196, MOC1-like protein; Provisional.
Length = 487
Score = 28.5 bits (64), Expect = 5.6
Identities = 10/15 (66%), Positives = 11/15 (73%)
Query: 70 SSQDDEDDDDDDEDD 84
S D+EDDDDDDE
Sbjct: 466 SVSDEEDDDDDDEVL 480
>gnl|CDD|220815 pfam10577, UPF0560, Uncharacterized protein family UPF0560. This
family of proteins has no known function.
Length = 805
Score = 28.3 bits (63), Expect = 5.6
Identities = 17/61 (27%), Positives = 30/61 (49%), Gaps = 2/61 (3%)
Query: 30 LRPLSSSSSREFYCYNGPLQFTRTQASTDT--SDYGSGYPDRSSQDDEDDDDDDEDDIEQ 87
LRPL S+ ++ +R ++STD+ S R + +++ DDDD DD +
Sbjct: 728 LRPLLDEGSKVSGEQAPTVEESRGRSSTDSSREPLSSPRRRRGRRPEDELDDDDGDDAGE 787
Query: 88 E 88
+
Sbjct: 788 D 788
>gnl|CDD|218566 pfam05349, GATA-N, GATA-type transcription activator, N-terminal.
GATA transcription factors mediate cell differentiation
in a diverse range of tissues. Mutation are often
associated with certain congenital human disorders. The
six classical vertebrate GATA proteins, GATA-1 to
GATA-6, are highly homologous and have two tandem zinc
fingers. The classical GATA transcription factors
function transcription activators. In lower metazoans
GATA proteins carry a single canonical zinc finger. This
family represents the N-terminal domain of the family of
GATA transcription activators.
Length = 177
Score = 27.5 bits (61), Expect = 6.4
Identities = 13/43 (30%), Positives = 17/43 (39%), Gaps = 8/43 (18%)
Query: 32 PLSSSSSREFYCYNGPL-------QFTRTQASTDTSDYGSGYP 67
P+S+ R+ Y GPL Q+ S Y S YP
Sbjct: 95 PMSNGGGRDAGAYQGPLLLSGGREQYGAALRPLGGS-YSSPYP 136
>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
Members of this family are bacterial proteins with a
conserved motif [KR]FYDLN, sometimes flanked by a pair
of CXXC motifs, followed by a long region of low
complexity sequence in which roughly half the residues
are Asp and Glu, including multiple runs of five or
more acidic residues. The function of members of this
family is unknown.
Length = 104
Score = 26.5 bits (59), Expect = 6.6
Identities = 10/21 (47%), Positives = 13/21 (61%)
Query: 73 DDEDDDDDDEDDIEQELSDIS 93
D+DDDDDD+DD+ D
Sbjct: 68 LDDDDDDDDDDDLPDLDDDDV 88
Score = 26.5 bits (59), Expect = 8.6
Identities = 13/34 (38%), Positives = 19/34 (55%)
Query: 73 DDEDDDDDDEDDIEQELSDISLHRGSHVISDPDE 106
DD+DDDDDD D++ + D+ + D DE
Sbjct: 71 DDDDDDDDDLPDLDDDDVDLDDDDDDFLEDDDDE 104
>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase
19 kDa subunit. This family contains several
DNA-directed RNA polymerase 19 kDa polypeptides. The
Poxvirus DNA-directed RNA polymerase (EC: 2.7.7.6)
catalyzes DNA-template-directed extension of the 3'-end
of an RNA strand by one nucleotide at a time.
Length = 167
Score = 27.4 bits (61), Expect = 6.9
Identities = 15/38 (39%), Positives = 23/38 (60%), Gaps = 2/38 (5%)
Query: 56 STDTSDYGSGYPDRSSQDDEDDDDDDEDDIEQELSDIS 93
S D DY S D S++ E++++D+ED E SD+S
Sbjct: 4 SDDIIDYESD--DDDSEEYEEEEEDEEDAESLESSDVS 39
>gnl|CDD|239566 cd03484, MutL_Trans_hPMS_2_like, MutL_Trans_hPMS2_like: transducer
domain, having a ribosomal S5 domain 2-like fold, found
in proteins similar to human PSM2 (hPSM2). hPSM2 belongs
to the DNA mismatch repair (MutL/MLH1/PMS2) family.
This transducer domain is homologous to the second
domain of the DNA gyrase B subunit, which is known to be
important in nucleotide hydrolysis and the transduction
of structural signals from ATP-binding site to the DNA
breakage/reunion regions of the enzymes. Included in
this group are proteins similar to yeast PMS1. The yeast
MLH1-PMS1 and the human MLH1-PMS2 heterodimers play a
role in meiosis. hMLH1-hPMS2 also participates in the
repair of all DNA mismatch repair (MMR) substrates.
Cells lacking hPMS2 have a strong mutator phenotype and
display microsatellite instability (MSI). Mutation in
hPMS2 causes predisposition to HPNCC and Turcot
syndrome.
Length = 142
Score = 27.2 bits (61), Expect = 7.0
Identities = 11/34 (32%), Positives = 15/34 (44%), Gaps = 4/34 (11%)
Query: 153 LYPTYFLHLERDYGKKDTNVLGFKGPRKMTVLIP 186
YP + L++ D NV P K TVL+
Sbjct: 95 QYPFFILNISLPTSLYDVNVT----PDKRTVLLH 124
>gnl|CDD|221392 pfam12043, DUF3527, Domain of unknown function (DUF3527). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is about 120
amino acids in length. This domain has a conserved
CDCGGWD sequence motif.
Length = 277
Score = 27.8 bits (62), Expect = 7.9
Identities = 15/60 (25%), Positives = 27/60 (45%), Gaps = 1/60 (1%)
Query: 235 YVLNFHGRVTQASVKNFQI-IHDNDPEYIVLQFGRTSEDVFTMDYRYPLCPLQAFSIALS 293
N TQ S + F++ + E L E ++++++ L LQAFSI ++
Sbjct: 212 ARKNKGDSPTQESSQPFELFVQGGKEEDPALSMTNVREGLYSVEFHASLSALQAFSICVA 271
>gnl|CDD|165163 PHA02811, PHA02811, putative host range protein; Provisional.
Length = 197
Score = 27.3 bits (60), Expect = 8.1
Identities = 11/23 (47%), Positives = 18/23 (78%)
Query: 70 SSQDDEDDDDDDEDDIEQELSDI 92
SS DD+D+D+ D D+ + E++DI
Sbjct: 170 SSDDDDDNDNADNDEEDDEVNDI 192
>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
Length = 390
Score = 27.5 bits (62), Expect = 8.3
Identities = 10/15 (66%), Positives = 11/15 (73%)
Query: 73 DDEDDDDDDEDDIEQ 87
DD+ DDD DEDD E
Sbjct: 369 DDDWDDDWDEDDDEG 383
Score = 27.5 bits (62), Expect = 8.9
Identities = 6/19 (31%), Positives = 12/19 (63%)
Query: 68 DRSSQDDEDDDDDDEDDIE 86
+ D+D D+DD++ +E
Sbjct: 367 EDDDDWDDDWDEDDDEGVE 385
>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal. The family represents
the N-terminal region of RXT2-like proteins. In S.
cerevisiae, RXT2 has been demonstrated to be involved in
conjugation with cellular fusion (mating) and invasive
growth. A high throughput localisation study has
localised RXT2 to the nucleus.
Length = 141
Score = 27.0 bits (60), Expect = 8.4
Identities = 9/34 (26%), Positives = 20/34 (58%)
Query: 67 PDRSSQDDEDDDDDDEDDIEQELSDISLHRGSHV 100
D DD+D+DD+D+++ + E D + ++ +
Sbjct: 49 EDGGDIDDDDEDDEDDEEADAEDDDENPYKLIRL 82
>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
N-terminal. This is the very acidic N-terminal region
of the early transcription elongation factor Spt5. The
Spt5-Spt4 complex regulates early transcription
elongation by RNA polymerase II and has an imputed role
in pre-mRNA processing via its physical association
with mRNA capping enzymes. The actual function of this
N-terminal domain is not known although it is
dispensable for binding to Spt4.
Length = 92
Score = 26.3 bits (58), Expect = 8.6
Identities = 8/19 (42%), Positives = 17/19 (89%)
Query: 73 DDEDDDDDDEDDIEQELSD 91
DDE++++++E+D ++LSD
Sbjct: 9 DDEEEEEEEEEDDLEDLSD 27
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 27.8 bits (62), Expect = 9.2
Identities = 7/28 (25%), Positives = 17/28 (60%)
Query: 61 DYGSGYPDRSSQDDEDDDDDDEDDIEQE 88
D + D +D+E+++++DED+ +
Sbjct: 383 DANTERDDEEEEDEEEEEEEDEDEGPSK 410
>gnl|CDD|220441 pfam09849, DUF2076, Uncharacterized protein conserved in bacteria
(DUF2076). This domain, found in various hypothetical
prokaryotic proteins, has no known function. The domain,
however, is found in various periplasmic ligand-binding
sensor proteins.
Length = 234
Score = 27.3 bits (61), Expect = 9.3
Identities = 11/34 (32%), Positives = 13/34 (38%), Gaps = 2/34 (5%)
Query: 53 TQASTDTSDYGSG--YPDRSSQDDEDDDDDDEDD 84
+ D GS D D DDDD +DD
Sbjct: 197 DDSDAAGGDQGSNGDDDDGGFADSGYDDDDMDDD 230
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 27.3 bits (61), Expect = 9.4
Identities = 6/24 (25%), Positives = 17/24 (70%)
Query: 68 DRSSQDDEDDDDDDEDDIEQELSD 91
S+D++++D+D+E++ + + D
Sbjct: 295 YSESEDEDEEDEDEEEEEDDDEGD 318
>gnl|CDD|234824 PRK00725, glgC, glucose-1-phosphate adenylyltransferase;
Provisional.
Length = 425
Score = 27.5 bits (62), Expect = 9.4
Identities = 13/36 (36%), Positives = 19/36 (52%), Gaps = 7/36 (19%)
Query: 243 VTQASVKNFQIIHDNDPEYIVLQFGRTSEDVFTMDY 278
V Q N II DP+Y+V+ G + ++ MDY
Sbjct: 117 VYQ----NLDIIRRYDPKYVVILAG---DHIYKMDY 145
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.315 0.133 0.387
Gapped
Lambda K H
0.267 0.0696 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,066,881
Number of extensions: 1422329
Number of successful extensions: 3274
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2855
Number of HSP's successfully gapped: 134
Length of query: 302
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 206
Effective length of database: 6,679,618
Effective search space: 1376001308
Effective search space used: 1376001308
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 59 (26.6 bits)