RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy12936
(187 letters)
>gnl|CDD|227577 COG5252, COG5252, Uncharacterized conserved protein, contains
CCCH-type Zn-finger protein [General function prediction
only].
Length = 299
Score = 107 bits (267), Expect = 2e-28
Identities = 63/151 (41%), Positives = 88/151 (58%), Gaps = 9/151 (5%)
Query: 1 ICKYFIEAVENSKYGWFWSCPNG-PTCIYKHALPPGFVLKKDKKKEEKKDQISLEDLIER 59
+CK+FIEA+E+ KYGW W+CPNG C Y H LP G+VL +DK K+ + +ISLE+ IE
Sbjct: 143 VCKFFIEAMESGKYGWGWTCPNGNMRCSYIHKLPDGYVLSRDKIKDSTQVEISLEEFIEL 202
Query: 60 ERAALAASSKFTKVTLDSFLAWKKRKLKEKSEAMTKAEEKKRSDFKA-GRQVGLSGRDMF 118
ER +L K T +T ++F WK + + K +E+K S K GR G G ++F
Sbjct: 203 ERQSL--PEKLTPLTEENFKEWK----DGRRLRILKQKEEKESARKVKGRATGTKGVELF 256
Query: 119 SFNPDLAKDDMDEGDE-AFDSYTRDEEEDPE 148
DL KDD++ G+E + E ED E
Sbjct: 257 ETRRDLFKDDVEAGEEWDYTERCYSESEDEE 287
>gnl|CDD|181730 PRK09256, PRK09256, hypothetical protein; Provisional.
Length = 138
Score = 32.0 bits (74), Expect = 0.063
Identities = 19/62 (30%), Positives = 28/62 (45%), Gaps = 15/62 (24%)
Query: 50 QISLEDLIERERAALAA--SSKFTKVTLDSFLAWKKRKLKEKSEAMTKAEEKKRSDFKAG 107
+ +LE L+ R AL + TK T S K+R+L+ K K+R + KA
Sbjct: 85 EDALERLVALIREALKPPKKRRATKPTRGS----KERRLESK---------KQRGEVKAL 131
Query: 108 RQ 109
R
Sbjct: 132 RG 133
>gnl|CDD|182885 PRK10991, fucI, L-fucose isomerase; Provisional.
Length = 588
Score = 30.7 bits (70), Expect = 0.44
Identities = 18/61 (29%), Positives = 32/61 (52%), Gaps = 6/61 (9%)
Query: 79 LAWKKRKLKE----KSEAMTKAEEKKRSDFKAGRQVGLSGRDMFSFNPDLAKDDMDEGDE 134
LAW K+ KE +E + E+KR+ ++ ++ + RD+ NP LA ++ +E
Sbjct: 229 LAWAKKNCKEGEDENAEQYQRNAEQKRAVWEESVKMAMIIRDLMQGNPKLA--ELGFVEE 286
Query: 135 A 135
A
Sbjct: 287 A 287
>gnl|CDD|214632 smart00356, ZnF_C3H1, zinc finger.
Length = 27
Score = 27.2 bits (61), Expect = 0.59
Identities = 10/30 (33%), Positives = 13/30 (43%), Gaps = 10/30 (33%)
Query: 1 ICKYFIEAVENSKYGWFWSCPNGPTCIYKH 30
+CK+F K G CP G C + H
Sbjct: 6 LCKFF-------KRG---YCPRGDRCKFAH 25
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 30.5 bits (69), Expect = 0.65
Identities = 21/53 (39%), Positives = 30/53 (56%), Gaps = 1/53 (1%)
Query: 38 LKKDKKKEEKKDQISLEDLIERERAALAASSKFTKVTLDSFLAWKKRKLKEKS 90
LKK +K +KK + L+ L ER + L S + K + SFL + KR +KE S
Sbjct: 922 LKKKIRKFKKKYFLVLKILKERTKLFLKVSKETKKWIIKSFL-FLKRIIKELS 973
>gnl|CDD|233835 TIGR02361, dak_ATP, dihydroxyacetone kinase, ATP-dependent. This
family consists of examples of the form of
dihydroxyacetone kinase (also called glycerone kinase)
that uses ATP (2.7.1.29) as the phosphate donor, rather
than a phosphoprotein as in E. coli. This form is
composed of a single chain with separable domains
homologous to the K and L subunits of the E. coli
enzyme, and is found in yeasts and other eukaryotes and
in some bacteria, including Citrobacter freundii. The
member from tomato has been shown to phosphorylate
dihydroxyacetone, 3,4-dihydroxy-2-butanone, and some
other aldoses and ketoses (PMID:11985845).
Length = 574
Score = 29.6 bits (67), Expect = 0.98
Identities = 25/110 (22%), Positives = 38/110 (34%), Gaps = 19/110 (17%)
Query: 27 IYKHALPPGFVLKKDKKKEEKKDQISLEDLIERERAALAASSKFTKVT------------ 74
I+ A K+ KK++I+ E + AL A K+T
Sbjct: 457 IFFSAAAQSL-----KQNASKKEEITPEAWAAALKTALDALYKYTGARPGDRTMIDALQP 511
Query: 75 -LDSFLAWKKRKLKEKSEAMTKAEEKKRSDFKAGRQVGLSGRDMFSFNPD 123
+++ A K A AE K+ KAGR +S + PD
Sbjct: 512 FVEALEAGKSNLNAAVKAAEAGAEATKKMKAKAGRASYVSS-EALEGVPD 560
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 29.6 bits (67), Expect = 1.0
Identities = 20/147 (13%), Positives = 48/147 (32%), Gaps = 6/147 (4%)
Query: 37 VLKKDKKKEEKKDQISLEDLIERERAALAASSKFTKVTLDSFLAWKKRKLKEKSEAMTKA 96
VL + D + D+ +++ A ++ K + +
Sbjct: 55 VLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDK 114
Query: 97 EEKKRSDFKAGRQVGLSGRDMFSFNPDLAKDDMDEG------DEAFDSYTRDEEEDPESN 150
++ D + DL DD+D+ DE D D+E++ +
Sbjct: 115 DDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKE 174
Query: 151 YRELDLDMLLNEAAELDETNISQAKDR 177
+EL+ ++ ++ + + + R
Sbjct: 175 AKELEKLSDDDDFVWDEDDSEALRQAR 201
>gnl|CDD|201014 pfam00116, COX2, Cytochrome C oxidase subunit II, periplasmic
domain.
Length = 120
Score = 27.8 bits (62), Expect = 1.7
Identities = 10/29 (34%), Positives = 16/29 (55%)
Query: 129 MDEGDEAFDSYTRDEEEDPESNYRELDLD 157
D + FDSY E++ E +R L++D
Sbjct: 17 SDFKNIEFDSYMIPEDDLEEGQFRLLEVD 45
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 29.0 bits (64), Expect = 2.0
Identities = 19/62 (30%), Positives = 29/62 (46%)
Query: 42 KKKEEKKDQISLEDLIERERAALAASSKFTKVTLDSFLAWKKRKLKEKSEAMTKAEEKKR 101
KK EE K + E + A A K + + A K + K+K++ KAEE K+
Sbjct: 1464 KKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKK 1523
Query: 102 SD 103
+D
Sbjct: 1524 AD 1525
>gnl|CDD|216491 pfam01421, Reprolysin, Reprolysin (M12B) family zinc
metalloprotease. The members of this family are
enzymes that cleave peptides. These proteases require
zinc for catalysis. Members of this family are also
known as adamalysins. Most members of this family are
snake venom endopeptidases, but there are also some
mammalian proteins such as human ADAM8 and fertilin.
Fertilin and closely related proteins appear to not
have some active site residues and may not be active
enzymes.
Length = 198
Score = 28.0 bits (63), Expect = 2.3
Identities = 9/37 (24%), Positives = 14/37 (37%), Gaps = 8/37 (21%)
Query: 54 EDLIERERAALAASSKFTKVTLDSFLAWKKRKLKEKS 90
D I + A TL FL W++ L ++
Sbjct: 57 GDKITVQGDAN--------DTLHRFLEWRETDLLKRK 85
>gnl|CDD|221324 pfam11933, DUF3451, Domain of unknown function (DUF3451). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 199 to 238 amino acids in length. This domain is
found associated with pfam06512, pfam00520. This domain
has a conserved ADD sequence motif.
Length = 222
Score = 27.8 bits (62), Expect = 2.8
Identities = 7/28 (25%), Positives = 11/28 (39%)
Query: 77 SFLAWKKRKLKEKSEAMTKAEEKKRSDF 104
S + R K K E + E+ S+
Sbjct: 10 SAKERRNRNDKNKKEEHSIGSEEGDSEK 37
>gnl|CDD|220645 pfam10239, DUF2465, Protein of unknown function (DUF2465). FAM98A
and B proteins are found from worms to humans but their
function is unknown. This entry is of a family of
proteins that is rich in glycines.
Length = 318
Score = 27.9 bits (62), Expect = 2.9
Identities = 29/100 (29%), Positives = 40/100 (40%), Gaps = 9/100 (9%)
Query: 29 KHALPPGFVLKKDKKKEEKKDQIS--LEDLIERERAALAASSKFTKVTLDSFLAWKKRKL 86
H P ++ EK + I+ LE E R L K VT+ SF W R
Sbjct: 174 THLGKPLLKKPLTSEQWEKLELINHELEAEYELRRRML---LKRLDVTVQSFG-WSDR-A 228
Query: 87 KEKSEAMTKAEEKKRSDFKAGRQVGLSGRDMFSFNPDLAK 126
K K + M + KR AG V L+ + + DLA+
Sbjct: 229 KTKEDQMAAVYQPKRHSLDAGSTVDLA--HLLAAREDLAR 266
>gnl|CDD|227220 COG4883, COG4883, Uncharacterized protein conserved in archaea
[Function unknown].
Length = 500
Score = 27.6 bits (61), Expect = 4.7
Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 14/41 (34%)
Query: 5 FIEAVENS----------KYGWFWSCPNGPTCIYKHALPPG 35
FIE +E S +YG F+ GPTC+ AL PG
Sbjct: 113 FIEEIEKSEALIGKESIRRYGGFY----GPTCVVDFALVPG 149
>gnl|CDD|214438 MTH00154, COX2, cytochrome c oxidase subunit II; Provisional.
Length = 227
Score = 27.1 bits (61), Expect = 5.3
Identities = 9/22 (40%), Positives = 12/22 (54%)
Query: 136 FDSYTRDEEEDPESNYRELDLD 157
FDSY E + +R LD+D
Sbjct: 118 FDSYMIPTNELENNGFRLLDVD 139
>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
Length = 991
Score = 27.5 bits (61), Expect = 5.5
Identities = 11/27 (40%), Positives = 15/27 (55%)
Query: 17 FWSCPNGPTCIYKHALPPGFVLKKDKK 43
FWS P+ PT PP +VL D++
Sbjct: 723 FWSVPSDPTNDANATQPPYYVLVGDQQ 749
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 26.5 bits (59), Expect = 6.5
Identities = 12/39 (30%), Positives = 21/39 (53%)
Query: 110 VGLSGRDMFSFNPDLAKDDMDEGDEAFDSYTRDEEEDPE 148
V +SG+ + + D + DD ++ +E D DE+E E
Sbjct: 99 VHISGQHLVASEEDESDDDEEDEEEEDDEEDDDEDESEE 137
>gnl|CDD|204356 pfam09959, DUF2193, Uncharacterized protein conserved in archaea
(DUF2193). This domain, found in various hypothetical
archaeal proteins, has no known function.
Length = 499
Score = 27.0 bits (60), Expect = 6.8
Identities = 21/64 (32%), Positives = 27/64 (42%), Gaps = 21/64 (32%)
Query: 5 FIEAVENS----------KYGWFWSCPNGPTCIYKHALPPGF-------VLKKDKKKEEK 47
FI+A+ S +YG F+ GPTC+ AL PG +LKK E
Sbjct: 112 FIDAIGKSEALIGKESIRRYGGFY----GPTCVVDFALIPGSTSNVVNRILKKTDIPVEH 167
Query: 48 KDQI 51
K I
Sbjct: 168 KQAI 171
>gnl|CDD|239618 cd03556, L-fucose_isomerase, L-fucose isomerase (FucIase); FucIase
converts L-fucose, an aldohexose, to its ketose form,
which prepares it for aldol cleavage (similar to the
isomerization of glucose during glycolysis). L-fucose
(or 6-deoxy-L-galactose) is found in blood group
determinants as well as in various oligo- and
polysaccharides, and glycosides in mammals, bacteria and
plants.
Length = 584
Score = 27.1 bits (60), Expect = 7.2
Identities = 15/60 (25%), Positives = 25/60 (41%), Gaps = 4/60 (6%)
Query: 70 FTKVTLDSFLAWKKRKLKE----KSEAMTKAEEKKRSDFKAGRQVGLSGRDMFSFNPDLA 125
+ + LAW K KE + K E+K D++ ++ + RD+ N LA
Sbjct: 216 YDPEEYEKALAWTKENCKEGEDENEKNDQKKREQKDKDWEFVVKMAIIIRDLMYGNEKLA 275
>gnl|CDD|181735 PRK09262, PRK09262, hypothetical protein; Provisional.
Length = 225
Score = 26.4 bits (59), Expect = 7.4
Identities = 14/39 (35%), Positives = 20/39 (51%), Gaps = 4/39 (10%)
Query: 88 EKSEAMTKAEEKKRSDFKAGRQVGLSGRDMFSFNPDLAK 126
+ +EA EE KR AG ++GL D++ LAK
Sbjct: 184 DAAEAREANEESKRERLAAG-ELGL---DIYKMRERLAK 218
>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal. The family represents
the N-terminal region of RXT2-like proteins. In S.
cerevisiae, RXT2 has been demonstrated to be involved in
conjugation with cellular fusion (mating) and invasive
growth. A high throughput localisation study has
localised RXT2 to the nucleus.
Length = 141
Score = 26.2 bits (58), Expect = 7.7
Identities = 15/65 (23%), Positives = 27/65 (41%)
Query: 85 KLKEKSEAMTKAEEKKRSDFKAGRQVGLSGRDMFSFNPDLAKDDMDEGDEAFDSYTRDEE 144
KL +++ KK ++ + LS + + NP +D + D+ + DEE
Sbjct: 7 KLLKRARPSGPPLYKKVVEYNGYERDVLSRKRIKFNNPPRIDEDGGDIDDDDEDDEDDEE 66
Query: 145 EDPES 149
D E
Sbjct: 67 ADAED 71
>gnl|CDD|219240 pfam06957, COPI_C, Coatomer (COPI) alpha subunit C-terminus. This
family represents the C-terminus (approximately 500
residues) of the eukaryotic coatomer alpha subunit.
Coatomer (COPI) is a large cytosolic protein complex
which forms a coat around vesicles budding from the
Golgi apparatus. Such coatomer-coated vesicles have been
proposed to play a role in many distinct steps of
intracellular transport. Note that many family members
also contain the pfam04053 domain.
Length = 421
Score = 26.4 bits (58), Expect = 9.8
Identities = 10/53 (18%), Positives = 20/53 (37%)
Query: 127 DDMDEGDEAFDSYTRDEEEDPESNYRELDLDMLLNEAAELDETNISQAKDRVF 179
D EG++ E E +++ D++L ++ + A VF
Sbjct: 52 DGYLEGEDGLLDDEEGPEGGDEEGGWDVEDDLVLPPELDVPKDQAGNADSDVF 104
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.313 0.131 0.376
Gapped
Lambda K H
0.267 0.0763 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,714,484
Number of extensions: 906905
Number of successful extensions: 1104
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1074
Number of HSP's successfully gapped: 102
Length of query: 187
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 96
Effective length of database: 6,901,388
Effective search space: 662533248
Effective search space used: 662533248
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 56 (25.3 bits)