RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy10513
(239 letters)
>gnl|CDD|153114 cd01056, Euk_Ferritin, eukaryotic ferritins. Eukaryotic Ferritin
(Euk_Ferritin) domain. Ferritins are the primary iron
storage proteins of most living organisms and members of
a broad superfamily of ferritin-like diiron-carboxylate
proteins. The iron-free (apoferritin) ferritin molecule
is a protein shell composed of 24 protein chains
arranged in 432 symmetry. Iron storage involves the
uptake of iron (II) at the protein shell, its oxidation
by molecular oxygen at the dinuclear ferroxidase
centers, and the movement of iron (III) into the cavity
for deposition as ferrihydrite; the protein shell can
hold up to 4500 iron atoms. In vertebrates, two types of
chains (subunits) have been characterized, H or M (fast)
and L (slow), which differ in rates of iron uptake and
mineralization. Fe(II) oxidation in the H/M subunits
take place initially at the ferroxidase center, a
carboxylate-bridged diiron center, located within the
subunit four-helix bundle. In a complementary role,
negatively charged residues on the protein shell inner
surface of the L subunits promote ferrihydrite
nucleation. Most plant ferritins combine both oxidase
and nucleation functions in one chain: they have four
interior glutamate residues as well as seven ferroxidase
center residues.
Length = 161
Score = 178 bits (454), Expect = 4e-57
Identities = 71/145 (48%), Positives = 98/145 (67%)
Query: 71 HEDCIDKINTQINDEIRSAYTYISMASFCTQDNVALFGFAKFFTHSYKEEIEHMEHLIAY 130
HE+C +N QIN E+ ++Y Y+SMA++ +D+VAL GFAKFF EE EH E LI Y
Sbjct: 1 HEECEAALNKQINLELNASYVYLSMAAYFDRDDVALPGFAKFFRKLSDEEREHAEKLIKY 60
Query: 131 LNKRGGQLRLTSIEAPSKQEWNTVEDLLTEALHMEKQLNEKLLKLHACASQHGDANLTDF 190
NKRGG++ L I+ P K EW + + L AL +EK +N+ LL LH AS+H D +L DF
Sbjct: 61 QNKRGGRVVLQDIKKPEKDEWGSGLEALELALDLEKLVNQSLLDLHKLASEHNDPHLADF 120
Query: 191 LEGRYLQEQVDAIKTLADLLTTVRR 215
LE +L+EQV++IK LA +T ++R
Sbjct: 121 LESEFLEEQVESIKKLAGYITNLKR 145
>gnl|CDD|153098 cd00904, Ferritin, Ferritin iron storage proteins. Ferritins are
the primary iron storage proteins of most living
organisms and members of a broad superfamily of
ferritin-like diiron-carboxylate proteins. The iron-free
(apoferritin) ferritin molecule is a protein shell
composed of 24 protein chains arranged in 432 symmetry.
Iron storage involves the uptake of iron (II) at the
protein shell, its oxidation by molecular oxygen at the
dinuclear ferroxidase centers, and the movement of iron
(III) into the cavity for deposition as ferrihydrite;
the protein shell can hold up to 4500 iron atoms. In
vertebrates, two types of chains (subunits) have been
characterized, H or M (fast) and L (slow), which differ
in rates of iron uptake and mineralization. Bacterial
non-heme ferritins are composed only of H chains. Fe(II)
oxidation in the H/M subunits take place initially at
the ferroxidase center, a carboxylate-bridged diiron
center, located within the subunit four-helix bundle. In
a complementary role, negatively charged residues on the
protein shell inner surface of the L subunits promote
ferrihydrite nucleation. Most plant ferritins combine
both oxidase and nucleation functions in one chain: they
have four interior glutamate residues as well as seven
ferroxidase center residues.
Length = 160
Score = 130 bits (328), Expect = 3e-38
Identities = 66/160 (41%), Positives = 93/160 (58%), Gaps = 4/160 (2%)
Query: 71 HEDCIDKINTQINDEIRSAYTYISMASFCTQDNVALFGFAKFFTHSYKEEIEHMEHLIAY 130
E +N Q+N E+ ++YTY+SMA++ +D+VAL G A FF +EE EH E Y
Sbjct: 1 SEKVEAAVNRQLNLELYASYTYLSMATYFDRDDVALKGVAHFFKEQAQEEREHAEKFYKY 60
Query: 131 LNKRGGQLRLTSIEAPSKQEWNTVEDLLTEALHMEKQLNEKLLKLHACASQHGDANLTDF 190
N+RGG++ L IE P EW D + AL +EK +N+ LL LH AS+ D +L DF
Sbjct: 61 QNERGGRVELQDIEKPPSDEWGGTLDAMEAALKLEKFVNQALLDLHELASEEKDPHLCDF 120
Query: 191 LEGRYLQEQVDAIKTLADLLTTVRRTQLYLVDRDLMSGKF 230
LE +L EQV IK + D+LT + R L + SG++
Sbjct: 121 LESHFLDEQVKEIKQVGDILTNLER----LNGQQAGSGEY 156
>gnl|CDD|215791 pfam00210, Ferritin, Ferritin-like domain. This family contains
ferritins and other ferritin-like proteins such as
members of the DPS family and bacterioferritins.
Length = 142
Score = 119 bits (301), Expect = 2e-34
Identities = 49/144 (34%), Positives = 75/144 (52%), Gaps = 6/144 (4%)
Query: 75 IDKINTQINDEIRSAYTYISMASFCTQDNVALFGFAKFFTHSYKEEIEHMEHLIAYLNKR 134
I+ +N Q+ DE+ ++Y Y+ MA + D L G A+FF +EE EH + L +
Sbjct: 1 IEALNEQLADELTASYQYLQMAWYF--DGPGLPGLAEFFDEQAEEEREHADKLAERILAL 58
Query: 135 GGQLRLTSIEAP---SKQEWNTVEDLLTEALHMEKQLNEKLLKLHACASQHGDANLTDFL 191
GG+ LT +E E+ +V + L AL EK++ E L +L A + GD DFL
Sbjct: 59 GGKPVLTPVELLEIEPPDEFGSVLEALEAALEHEKEVTESLRELIELAEEEGDYATADFL 118
Query: 192 EGRYLQEQVDAIKTLADLLTTVRR 215
E +L+EQ + L LL ++R
Sbjct: 119 EW-FLEEQEEHEWFLEALLEKLKR 141
>gnl|CDD|153113 cd01055, Nonheme_Ferritin, nonheme-containing ferritins. Nonheme
Ferritin domain, found in archaea and bacteria, is a
member of a broad superfamily of ferritin-like
diiron-carboxylate proteins. The ferritin protein shell
is composed of 24 protein subunits arranged in 432
symmetry. Each protein subunit, a four-helix bundle with
a fifth short terminal helix, contains a dinuclear
ferroxidase center (H type). Unique to this group of
proteins is a third metal site in the ferroxidase
center. Iron storage involves the uptake of iron (II) at
the protein shell, its oxidation by molecular oxygen at
the ferroxidase centers, and the movement of iron (III)
into the cavity for deposition as ferrihydrite.
Length = 156
Score = 119 bits (300), Expect = 7e-34
Identities = 52/160 (32%), Positives = 87/160 (54%), Gaps = 9/160 (5%)
Query: 71 HEDCIDKINTQINDEIRSAYTYISMASFCTQDNVALFGFAKFFTHSYKEEIEHMEHLIAY 130
E +N QIN E+ S+Y Y++MA++ D+ L GFA FF +EE EH Y
Sbjct: 1 SEKLEKALNEQINLELYSSYLYLAMAAWF--DSKGLDGFANFFRVQAQEEREHAMKFFDY 58
Query: 131 LNKRGGQLRLTSIEAPSKQEWNTVEDLLTEALHMEKQLNEKLLKLHACASQHGDANLTDF 190
LN RGG++ L +IEAP E+ ++ ++ AL E+++ E + L A + D +F
Sbjct: 59 LNDRGGRVELPAIEAP-PSEFESLLEVFEAALEHEQKVTESINNLVDLALEEKDYATFNF 117
Query: 191 LEGRYLQEQVDAIKTLADLLTTVRR-----TQLYLVDRDL 225
L+ +++EQV+ D+L ++ LY++D++L
Sbjct: 118 LQW-FVKEQVEEEALARDILDKLKLAGDDGGGLYMLDKEL 156
>gnl|CDD|224445 COG1528, Ftn, Ferritin-like protein [Inorganic ion transport and
metabolism].
Length = 167
Score = 97.4 bits (243), Expect = 2e-25
Identities = 49/165 (29%), Positives = 89/165 (53%), Gaps = 9/165 (5%)
Query: 72 EDCIDKINTQINDEIRSAYTYISMASFCTQDNVALFGFAKFFTHSYKEEIEHMEHLIAYL 131
E I+ +N Q+N E ++ Y+ MA++C + +L GFAKF +EE+ H L YL
Sbjct: 4 EKMIELLNEQMNLEFYASNLYLQMAAWC--SSESLPGFAKFLRAQAQEELTHAMKLFNYL 61
Query: 132 NKRGGQLRLTSIEAPSKQEWNTVEDLLTEALHMEKQLNEKLLKLHACASQHGDANLTDFL 191
N+RG + L +IEAP +++++++L + L E+++ + +L A + D +FL
Sbjct: 62 NERGARPELKAIEAP-PNKFSSLKELFEKTLEHEQKVTSSINELAEVAREEKDYATFNFL 120
Query: 192 EGRYLQEQVDAIKTLADLLTTVRRTQ-----LYLVDRDLMSGKFS 231
+ ++ EQV+ K +L + LYL+D++L + S
Sbjct: 121 QW-FVAEQVEEEKLFKTILDKLELAGNDGEGLYLIDKELKNRASS 164
>gnl|CDD|182367 PRK10304, PRK10304, ferritin; Provisional.
Length = 165
Score = 58.9 bits (142), Expect = 6e-11
Identities = 44/158 (27%), Positives = 85/158 (53%), Gaps = 9/158 (5%)
Query: 73 DCIDKINTQINDEIRSAYTYISMASFCTQDNVALFGFAKFFTHSYKEEIEHMEHLIAYLN 132
+ I+K+N Q+N E+ S+ Y M+++C+ G A F +EE+ HM+ L YL
Sbjct: 5 EMIEKLNEQMNLELYSSLLYQQMSAWCSYHTFE--GAAAFLRRHAQEEMTHMQRLFDYLT 62
Query: 133 KRGGQLRLTSIEAPSKQEWNTVEDLLTEALHMEKQLNEKLLKLHACASQHGDANLTDFLE 192
G R+ ++E+P E++++++L E E+ + +K+ +L A + D +FL+
Sbjct: 63 DTGNLPRINTVESPFA-EYSSLDELFQETYKHEQLITQKINELAHAAMTNQDYPTFNFLQ 121
Query: 193 GRYLQEQVDA---IKTLADLLTTVRRT--QLYLVDRDL 225
Y+ EQ + K++ D L+ ++ LY +D++L
Sbjct: 122 W-YVSEQHEEEKLFKSIIDKLSLAGKSGEGLYFIDKEL 158
>gnl|CDD|184983 PRK15022, PRK15022, ferritin-like protein; Provisional.
Length = 167
Score = 47.2 bits (112), Expect = 9e-07
Identities = 33/152 (21%), Positives = 73/152 (48%), Gaps = 5/152 (3%)
Query: 75 IDKINTQINDEIRSAYTYISMASFCTQDNVALFGFAKFFTHSYKEEIEHMEHLIAYLNKR 134
+ K+N+Q+N E ++ Y+ ++ +C++ +L G A F + + M + ++
Sbjct: 7 LLKLNSQMNLEFYASNLYLHLSEWCSEQ--SLNGTATFLRAQAQSNVTQMMRMFNFMKSA 64
Query: 135 GGQLRLTSIEAPSKQEWNTVEDLLTEALHMEKQLNEKLLKLHACASQHGDANLTDFLEGR 194
G + +I+ P ++ N++E+L + L +Q + L +L A D + +FL
Sbjct: 65 GATPIVKAIDVPG-EKLNSLEELFQKTLEEYEQRSSTLAQLADEAKALNDDSTLNFLRDL 123
Query: 195 YLQEQVDAI--KTLADLLTTVRRTQLYLVDRD 224
++Q D + +T+ D + + + L V D
Sbjct: 124 EKEQQHDGLLLQTILDEVRSAKLAGLCPVQTD 155
>gnl|CDD|218602 pfam05478, Prominin, Prominin. The prominins are an emerging
family of proteins that among the multispan membrane
proteins display a novel topology. Mouse prominin and
human prominin (mouse)-like 1 (PROML1) are predicted to
contain five membrane spanning domains, with an
N-terminal domain exposed to the extracellular space
followed by four, alternating small cytoplasmic and
large extracellular, loops and a cytoplasmic C-terminal
domain. The exact function of prominin is unknown
although in humans defects in PROM1, the gene coding for
prominin, cause retinal degeneration.
Length = 807
Score = 38.5 bits (90), Expect = 0.003
Identities = 27/130 (20%), Positives = 50/130 (38%), Gaps = 18/130 (13%)
Query: 59 LGKTERQTLQNFHEDCIDKINTQ-INDEIRSAYTYISMASFCTQ-DNVA----------- 105
L R+ L++ +D IN +I+ T +++ SF + D +A
Sbjct: 588 LTPEARRNLEDLASSGLDDINFSNFLKQIQKPLTEVNLNSFAEELDALANNLPNGKLKVA 647
Query: 106 LFGFAKFFTHSYKEEIEHMEHLIAYLNKRGGQLRLTSIEAPSKQEWNTVEDLLTEALHME 165
L A + + ME L+ LN + +++ S N V+D+L E +
Sbjct: 648 LKNEALDLRSIQQNFVPPMESLLEKLNS-----NVKTLDESSNDLQNAVKDVLAEVQAAQ 702
Query: 166 KQLNEKLLKL 175
LN ++
Sbjct: 703 GFLNNNASEI 712
>gnl|CDD|153097 cd00657, Ferritin_like, Ferritin-like superfamily of
diiron-containing four-helix-bundle proteins.
Ferritin-like, diiron-carboxylate proteins participate
in a range of functions including iron regulation,
mono-oxygenation, and reactive radical production. These
proteins are characterized by the fact that they
catalyze dioxygen-dependent oxidation-hydroxylation
reactions within diiron centers; one exception is
manganese catalase, which catalyzes peroxide-dependent
oxidation-reduction within a dimanganese center.
Diiron-carboxylate proteins are further characterized by
the presence of duplicate metal ligands, glutamates and
histidines (ExxH) and two additional glutamates within a
four-helix bundle. Outside of these conserved residues
there is little obvious homology. Members include
bacterioferritin, ferritin, rubrerythrin, aromatic and
alkene monooxygenase hydroxylases (AAMH), ribonucleotide
reductase R2 (RNRR2), acyl-ACP-desaturases
(Acyl_ACP_Desat), manganese (Mn) catalases,
demethoxyubiquinone hydroxylases (DMQH), DNA protecting
proteins (DPS), and ubiquinol oxidases (AOX), and the
aerobic cyclase system, Fe-containing subunit (ACSF).
Length = 130
Score = 35.2 bits (81), Expect = 0.009
Identities = 25/131 (19%), Positives = 39/131 (29%), Gaps = 16/131 (12%)
Query: 76 DKINTQINDEIRSAYTYISMASFCTQDNVALFGFAKFFTHSYKEEIEHMEHLIAYLNKRG 135
+N + E + Y +A+ EE H + L L + G
Sbjct: 1 RLLNDALAGEYAAIIAYGQLAA-----RAPDPDLKDELLEIADEERRHADALAERLRELG 55
Query: 136 GQ------LRLTSIEAPSKQEWNTVEDLLTEALHMEKQLNEKLLKLHACASQHGDANLTD 189
G L + P + + + L AL +E + +L A D L
Sbjct: 56 GTPPLPPAHLLAAYALP--KTSDDPAEALRAALEVEARAIAAYRELIEQA---DDPELRR 110
Query: 190 FLEGRYLQEQV 200
LE EQ
Sbjct: 111 LLERILADEQR 121
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 33.1 bits (76), Expect = 0.13
Identities = 17/84 (20%), Positives = 32/84 (38%)
Query: 2 SSDCKGSSNDPPPKKDKPKCESTRHEPTKENKPCKETKPKSKKSSEPTQSSQHSPDSLGK 61
S+ + P KK K + E + K+ K K+T K K + Q+S L +
Sbjct: 1295 ESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLR 1354
Query: 62 TERQTLQNFHEDCIDKINTQINDE 85
R+ + + D +++
Sbjct: 1355 RPRKKKSDSSSEDDDDSEVDDSED 1378
>gnl|CDD|153102 cd01043, DPS, DPS protein, ferritin-like diiron-binding domain.
DPS (DNA Protecting protein under Starved conditions)
domain is a member of a broad superfamily of
ferritin-like diiron-carboxylate proteins. Some DPS
proteins nonspecifically bind DNA, protecting it from
cleavage caused by reactive oxygen species such as the
hydroxyl radicals produced during oxidation of Fe(II) by
hydrogen peroxide. These proteins assemble into
dodecameric structures, some form DPS-DNA co-crystalline
complexes, and possess iron and H2O2 detoxification
capabilities. Expression of DPS is induced by oxidative
or nutritional stress, including metal ion starvation.
Members of the DPS family are homopolymers formed by 12
four-helix bundle subunits that assemble with 23
symmetry into a hollow shell. The DPS ferroxidase site
is unusual in that it is not located in a four-helix
bundle as in ferritin, but is shared by 2-fold
symmetry-related subunits providing the iron ligands.
Many DPS sequences (e.g., E. coli) display an N-terminal
extension of variable length that contains two or three
positively charged lysine residues that extends into the
solvent and is thought to play an important role in the
stabilization of the complex with DNA. DPS Listeria Flp,
Bacillus anthracis Dlp-1 and Dlp-2, and Helicobacter
pylori HP-NAP which lack the N-terminal extension, do
not bind DNA. DPS proteins from Helicobacter pylori,
Treponema pallidum, and Borrelia burgdorferi are highly
immunogenic.
Length = 139
Score = 31.4 bits (72), Expect = 0.17
Identities = 12/57 (21%), Positives = 24/57 (42%), Gaps = 1/57 (1%)
Query: 138 LRLTSIEAPSKQEWNTVEDLLTEALHMEKQLNEKLLKLHACASQHGDANLTDFLEGR 194
L++I+ + ++++ E L + L E+L + A + GD D L
Sbjct: 69 AELSTIKEEPA-GVLSAKEMVAELLEDYETLIEELREAIELADEAGDPATADLLTEI 124
>gnl|CDD|173626 cd05034, PTKc_Src_like, Catalytic domain of Src kinase-like Protein
Tyrosine Kinases. Protein Tyrosine Kinase (PTK) family;
Src kinase subfamily; catalytic (c) domain. Src
subfamily members include Src, Lck, Hck, Blk, Lyn, Fgr,
Fyn, Yrk, and Yes. The PTKc family is part of a larger
superfamily that includes the catalytic domains of other
kinases such as protein serine/threonine kinases, RIO
kinases, and phosphoinositide 3-kinase (PI3K). PTKs
catalyze the transfer of the gamma-phosphoryl group from
ATP to tyrosine (tyr) residues in protein substrates.
Src (or c-Src) proteins are cytoplasmic (or
non-receptor) tyr kinases which are anchored to the
plasma membrane. They contain an N-terminal SH4 domain
with a myristoylation site, followed by SH3 and SH2
domains, a tyr kinase domain, and a regulatory
C-terminal region containing a conserved tyr. They are
activated by autophosphorylation at the tyr kinase
domain, but are negatively regulated by phosphorylation
at the C-terminal tyr by Csk (C-terminal Src Kinase).
Src proteins are involved in signaling pathways that
regulate cytokine and growth factor responses,
cytoskeleton dynamics, cell proliferation, survival, and
differentiation. They were identified as the first
proto-oncogene products, and they regulate cell
adhesion, invasion, and motility in cancer cells and
tumor vasculature, contributing to cancer progression
and metastasis. Src kinases are overexpressed in a
variety of human cancers, making them attractive targets
for therapy. They are also implicated in acute
inflammatory responses and osteoclast function. Src,
Fyn, Yes, and Yrk are widely expressed, while Blk, Lck,
Hck, Fgr, and Lyn show a limited expression pattern.
Length = 261
Score = 31.9 bits (73), Expect = 0.22
Identities = 26/93 (27%), Positives = 40/93 (43%), Gaps = 27/93 (29%)
Query: 153 TVEDLLTEALHMEKQLNEKLLKLHACASQ------------HGDANLTDFL---EGRYLQ 197
+ E L EA M+K ++KL++L+A S+ G +L DFL EG+ L+
Sbjct: 44 SPEAFLQEAQIMKKLRHDKLVQLYAVCSEEEPIYIVTEYMSKG--SLLDFLKSGEGKKLR 101
Query: 198 EQVDAIKTLADLLTTVRRTQLYL-----VDRDL 225
L D+ + YL + RDL
Sbjct: 102 LP-----QLVDMAAQIAEGMAYLESRNYIHRDL 129
>gnl|CDD|133202 cd05071, PTKc_Src, Catalytic domain of the Protein Tyrosine Kinase,
Src. Protein Tyrosine Kinase (PTK) family; Src kinase;
catalytic (c) domain. The PTKc family is part of a
larger superfamily that includes the catalytic domains
of other kinases such as protein serine/threonine
kinases, RIO kinases, and phosphoinositide 3-kinase
(PI3K). PTKs catalyze the transfer of the
gamma-phosphoryl group from ATP to tyrosine (tyr)
residues in protein substrates. Src (or c-Src) is a
cytoplasmic (or non-receptor) tyr kinase, containing an
N-terminal SH4 domain with a myristoylation site,
followed by SH3 and SH2 domains, a tyr kinase domain,
and a regulatory C-terminal region with a conserved tyr.
It is activated by autophosphorylation at the tyr kinase
domain, and is negatively regulated by phosphorylation
at the C-terminal tyr by Csk (C-terminal Src Kinase).
c-Src is the vertebrate homolog of the oncogenic protein
(v-Src) from Rous sarcoma virus. Together with other Src
subfamily proteins, it is involved in signaling pathways
that regulate cytokine and growth factor responses,
cytoskeleton dynamics, cell proliferation, survival, and
differentiation. Src also play a role in regulating cell
adhesion, invasion, and motility in cancer cells and
tumor vasculature, contributing to cancer progression
and metastasis. Elevated levels of Src kinase activity
have been reported in a variety of human cancers.
Several inhibitors of Src have been developed as
anti-cancer drugs. Src is also implicated in acute
inflammatory responses and osteoclast function.
Length = 262
Score = 31.9 bits (72), Expect = 0.25
Identities = 27/85 (31%), Positives = 39/85 (45%), Gaps = 16/85 (18%)
Query: 155 EDLLTEALHMEKQLNEKLLKLHACASQH---------GDANLTDFLEGRY-----LQEQV 200
E L EA M+K +EKL++L+A S+ +L DFL+G L + V
Sbjct: 46 EAFLQEAQVMKKLRHEKLVQLYAVVSEEPIYIVTEYMSKGSLLDFLKGEMGKYLRLPQLV 105
Query: 201 DAIKTLADLLTTVRRTQLYLVDRDL 225
D +A + V R + V RDL
Sbjct: 106 DMAAQIASGMAYVER--MNYVHRDL 128
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 31.7 bits (72), Expect = 0.28
Identities = 10/38 (26%), Positives = 16/38 (42%)
Query: 12 PPPKKDKPKCESTRHEPTKENKPCKETKPKSKKSSEPT 49
PP + + E T +P ++ KP K+ K K
Sbjct: 68 KPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKP 105
>gnl|CDD|133200 cd05069, PTKc_Yes, Catalytic domain of the Protein Tyrosine Kinase,
Yes. Protein Tyrosine Kinase (PTK) family; Yes kinase;
catalytic (c) domain. The PTKc family is part of a
larger superfamily that includes the catalytic domains
of other kinases such as protein serine/threonine
kinases, RIO kinases, and phosphoinositide 3-kinase
(PI3K). PTKs catalyze the transfer of the
gamma-phosphoryl group from ATP to tyrosine (tyr)
residues in protein substrates. Yes (or c-Yes) is a
member of the Src subfamily of proteins, which are
cytoplasmic (or non-receptor) tyr kinases. Src kinases
contain an N-terminal SH4 domain with a myristoylation
site, followed by SH3 and SH2 domains, a tyr kinase
domain, and a regulatory C-terminal region containing a
conserved tyr. They are activated by autophosphorylation
at the tyr kinase domain, but are negatively regulated
by phosphorylation at the C-terminal tyr by Csk
(C-terminal Src Kinase). Src proteins are involved in
signaling pathways that regulate cytokine and growth
factor responses, cytoskeleton dynamics, cell
proliferation, survival, and differentiation. c-Yes
kinase is the cellular homolog of the oncogenic protein
(v-Yes) encoded by the Yamaguchi 73 and Esh sarcoma
viruses. It displays functional overlap with other Src
subfamily members, particularly Src. It also shows some
unique functions such as binding to occludins,
transmembrane proteins that regulate extracellular
interactions in tight junctions. Yes also associates
with a number of proteins in different cell types that
Src does not interact with, like JAK2 and gp130 in
pre-adipocytes, and Pyk2 in treated pulmonary vein
endothelial cells. Although the biological function of
Yes remains unclear, it appears to have a role in
regulating cell-cell interactions and vesicle
trafficking in polarized cells.
Length = 260
Score = 31.2 bits (70), Expect = 0.44
Identities = 27/85 (31%), Positives = 43/85 (50%), Gaps = 16/85 (18%)
Query: 155 EDLLTEALHMEKQLNEKLLKLHACASQH---------GDANLTDFL---EGRYLQ--EQV 200
E L EA M+K ++KL+ L+A S+ G +L DFL +G+YL+ + V
Sbjct: 46 EAFLQEAQIMKKLRHDKLVPLYAVVSEEPIYIVTEFMGKGSLLDFLKEGDGKYLKLPQLV 105
Query: 201 DAIKTLADLLTTVRRTQLYLVDRDL 225
D +AD + + R + + RDL
Sbjct: 106 DMAAQIADGMAYIER--MNYIHRDL 128
>gnl|CDD|187551 cd05240, UDP_G4E_3_SDR_e, UDP-glucose 4 epimerase (G4E), subgroup
3, extended (e) SDRs. Members of this bacterial
subgroup are identified as possible sugar epimerases,
such as UDP-glucose 4 epimerase. However, while the
NAD(P)-binding motif is fairly well conserved, not all
members retain the canonical active site tetrad of the
extended SDRs. UDP-glucose 4 epimerase (aka
UDP-galactose-4-epimerase), is a homodimeric extended
SDR. It catalyzes the NAD-dependent conversion of
UDP-galactose to UDP-glucose, the final step in Leloir
galactose synthesis. Extended SDRs are distinct from
classical SDRs. In addition to the Rossmann fold
(alpha/beta folding pattern with a central beta-sheet)
core region typical of all SDRs, extended SDRs have a
less conserved C-terminal extension of approximately 100
amino acids. Extended SDRs are a diverse collection of
proteins, and include isomerases, epimerases,
oxidoreductases, and lyases; they typically have a
TGXXGXXG cofactor binding motif. SDRs are a functionally
diverse family of oxidoreductases that have a single
domain with a structurally conserved Rossmann fold, an
NAD(P)(H)-binding region, and a structurally diverse
C-terminal region. Sequence identity between different
SDR enzymes is typically in the 15-30% range; they
catalyze a wide range of activities including the
metabolism of steroids, cofactors, carbohydrates,
lipids, aromatic compounds, and amino acids, and act in
redox sensing. Classical SDRs have an TGXXX[AG]XG
cofactor binding motif and a YXXXK active site motif,
with the Tyr residue of the active site motif serving as
a critical catalytic residue (Tyr-151, human
15-hydroxyprostaglandin dehydrogenase numbering). In
addition to the Tyr and Lys, there is often an upstream
Ser and/or an Asn, contributing to the active site;
while substrate binding is in the C-terminal region,
which determines specificity. The standard reaction
mechanism is a 4-pro-S hydride transfer and proton relay
involving the conserved Tyr and Lys, a water molecule
stabilized by Asn, and nicotinamide. Atypical SDRs
generally lack the catalytic residues characteristic of
the SDRs, and their glycine-rich NAD(P)-binding motif is
often different from the forms normally seen in
classical or extended SDRs. Complex (multidomain) SDRs
such as ketoreductase domains of fatty acid synthase
have a GGXGXXG NAD(P)-binding motif and an altered
active site motif (YXXXN). Fungal type ketoacyl
reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
Length = 306
Score = 30.8 bits (70), Expect = 0.53
Identities = 13/43 (30%), Positives = 16/43 (37%)
Query: 154 VEDLLTEALHMEKQLNEKLLKLHACASQHGDANLTDFLEGRYL 196
VE LL E +LN +L+ DFL R L
Sbjct: 145 VEQLLAEFRRRHPELNVTVLRPATILGPGTRNTTRDFLSPRRL 187
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is
a family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 30.9 bits (70), Expect = 0.57
Identities = 12/58 (20%), Positives = 17/58 (29%)
Query: 6 KGSSNDPPPKKDKPKCESTRHEPTKENKPCKETKPKSKKSSEPTQSSQHSPDSLGKTE 63
+ + P K KP S K KP K + + S S + E
Sbjct: 58 RPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEEEEEPEE 115
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 31.2 bits (70), Expect = 0.61
Identities = 17/58 (29%), Positives = 24/58 (41%), Gaps = 5/58 (8%)
Query: 7 GSSNDPPPKKDKPKCESTRHEPTKENKPCKE-TKPKSKKSSE----PTQSSQHSPDSL 59
+S PP + E HE +KE+ KE KP K E P + +H P +
Sbjct: 515 EASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKI 572
>gnl|CDD|143491 cd06817, PLPDE_III_DSD, Type III Pyridoxal 5-phosphate
(PLP)-Dependent Enzyme Eukaryotic D-Serine Dehydratase.
This subfamily is composed of chicken D-serine
dehydratase (DSD, EC 4.3.1.18) and similar eukaryotic
proteins. Chicken DSD catalyzes the dehydration of
D-serine to aminoacrylate, which is rapidly hydrolyzed
to pyruvate and ammonia. It is a fold type III
PLP-dependent enzyme with similarity to bacterial
alanine racemase (AR), which contains an N-terminal
PLP-binding TIM-barrel domain and a C-terminal
beta-sandwich domain. AR exists as dimers with active
sites that lie at the interface between the TIM barrel
domain of one subunit and the beta-sandwich domain of
the other subunit. Experimental data suggest that
chicken DSD also exists as dimers. Sequence comparison
and biochemical experiments show that chicken DSD is
distinct from the ubiquitous bacterial DSDs coded by
dsdA gene, mammalian L-serine dehydratases (LSD) and
mammalian serine racemase (SerRac), which are fold type
II PLP-dependent enzymes.
Length = 389
Score = 30.4 bits (69), Expect = 0.92
Identities = 25/91 (27%), Positives = 34/91 (37%), Gaps = 21/91 (23%)
Query: 101 QDNVALFGFAKFFTHSYK------------EEIEHMEHLIAYLNKRGGQLRLT-SIEA-P 146
+ V LFGF HSY EEIE + L G +LT S+ A P
Sbjct: 165 SEAVELFGFYSHAGHSYSSRSAEDAKEVLREEIEAVLTAAKKLKSIQGDRKLTLSVGATP 224
Query: 147 SKQEWNTVEDLLTEALHMEKQLNEKLLKLHA 177
+ + + +L LL+LHA
Sbjct: 225 TAHAAEALVLIPAPSLSG-------LLELHA 248
>gnl|CDD|191371 pfam05778, Apo-CIII, Apolipoprotein CIII (Apo-CIII). This family
consists of several mammalian apolipoprotein CIII
(Apo-CIII) sequences. Apolipoprotein C-III is a
79-residue glycoprotein. It is synthesised in the
intestine and liver as part of the very low density
lipoprotein (VLDL) and the high density lipoprotein
(HDL) particles. Owing to its positive correlation with
plasma triglyceride (Tg) levels, Apo-CIII is suggested
to play a role in Tg metabolism and is therefore of
interest regarding atherosclerosis. However, unlike
other apolipoproteins such as Apo-AI, Apo E or CII for
which many naturally occurring mutations are known, the
structure-function relationships of apo C-III remains a
subject of debate. One possibility is that apo C-III
inhibits lipoprotein lipase (LPL) activity, as shown by
in vitro experiments. Another suggestion, is that
elevated levels of Apo-CIII displace other
apolipoproteins at the lipoprotein surface, modifying
their clearance from plasma.
Length = 70
Score = 28.0 bits (62), Expect = 1.1
Identities = 16/48 (33%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 184 DANLTDFLEGRYLQEQVDAIKTLADLLTTVRRTQLYLVDRDLMSGKFS 231
D++L ++G Y+++ A KT D LT+V+ +Q+ R M+ FS
Sbjct: 4 DSSLLGKMQG-YMKQ---ATKTAQDALTSVQESQVAQQARGWMTDSFS 47
>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
Length = 475
Score = 29.9 bits (67), Expect = 1.1
Identities = 12/30 (40%), Positives = 16/30 (53%)
Query: 12 PPPKKDKPKCESTRHEPTKENKPCKETKPK 41
P K P + R E K++KP +E KPK
Sbjct: 39 PATKAAAPAAAAPRAEKPKKDKPRRERKPK 68
>gnl|CDD|233728 TIGR02102, pullulan_Gpos, pullulanase, extracellular, Gram-positive.
Pullulan is an unusual, industrially important
polysaccharide in which short alpha-1,4 chains
(maltotriose) are connected in alpha-1,6 linkages.
Enzymes that cleave alpha-1,6 linkages in pullulan and
release maltotriose are called pullulanases although
pullulan itself may not be the natural substrate. In
contrast, a glycogen debranching enzyme such GlgX,
homologous to this family, can release glucose at
alpha,1-6 linkages from glycogen first subjected to limit
degradation by phosphorylase. Characterized members of
this family include a surface-located pullulanase from
Streptococcus pneumoniae (PMID:11083842) and an
extracellular bifunctional amylase/pullulanase with
C-terminal pullulanase activity (PMID:8798645).
Length = 1111
Score = 29.8 bits (67), Expect = 1.4
Identities = 13/70 (18%), Positives = 25/70 (35%), Gaps = 3/70 (4%)
Query: 3 SDCKGSSNDPPPKKDKPKCESTRHEPTKENKPCKETKPKSKKSSEPTQSSQHSPDSLGKT 62
+ PPP + +P+ PT++ K K ++ + S D G+T
Sbjct: 1005 GGIEAPEKTPPPPEHEPQAP---KPPTQDPDGSKPKDKVDPKDNKDPLTPPGSDDENGET 1061
Query: 63 ERQTLQNFHE 72
+ + E
Sbjct: 1062 PKGNEEKKEE 1071
>gnl|CDD|132155 TIGR03111, glyc2_xrt_Gpos1, putative glycosyltransferase,
exosortase G-associated. Members of this protein family
are probable glycosyltransferases of family 2, whose
genes are near those for the exosortase homolog XrtG
(TIGR03110), which is restricted to Gram-positive
bacteria. Other genes in the conserved gene neighborhood
include a 6-pyruvoyl tetrahydropterin synthase homolog
(TIGR03112) and an uncharacterized intergral membrane
protein (TIGR03766).
Length = 439
Score = 29.7 bits (67), Expect = 1.5
Identities = 14/35 (40%), Positives = 20/35 (57%), Gaps = 5/35 (14%)
Query: 190 FLEGRYLQEQVDAIKTLADLLTTVRR-----TQLY 219
FL GR + QV+++ TL+ + RR TQLY
Sbjct: 199 FLAGRNFESQVNSLFTLSGAFSAFRRETILKTQLY 233
>gnl|CDD|218919 pfam06163, DUF977, Bacterial protein of unknown function (DUF977).
This family consists of several hypothetical bacterial
proteins from Escherichia coli and Salmonella typhi. The
function of this family is unknown.
Length = 127
Score = 28.5 bits (63), Expect = 1.5
Identities = 17/53 (32%), Positives = 24/53 (45%), Gaps = 3/53 (5%)
Query: 110 AKFFTHSYKEEIEHMEHLIAYLNKRGGQLRLTSIEAPSKQEWNTVEDLLTEAL 162
AK FT +E I E +I L + G++ +EA + T E L AL
Sbjct: 2 AKPFTQEEREII--KERIIG-LVREQGRITTNQLEAITGAHRTTAEKYLRIAL 51
>gnl|CDD|114868 pfam06175, MiaE, tRNA-(MS[2]IO[6]A)-hydroxylase (MiaE). This
family consists of several bacterial
tRNA-(MS[2]IO[6]A)-hydroxylase (MiaE) proteins. The
modified nucleoside 2-methylthio-N-6-isopentenyl
adenosine (ms2i6A) is present at position 37 (3' of the
anticodon) of tRNAs that read codons beginning with U
except tRNA(I,V Ser) in Escherichia coli. Salmonella
typhimurium 2-methylthio-cis-ribozeatin (ms2io6A) is
found in tRNA, probably in the corresponding species
that have ms2i6A in E. coli. The miaE gene is absent in
E. coli, a finding consistent with the absence of the
hydroxylated derivative of ms2i6A in this species.
Length = 199
Score = 29.0 bits (65), Expect = 1.7
Identities = 20/86 (23%), Positives = 30/86 (34%), Gaps = 16/86 (18%)
Query: 118 KEEIEHMEHLIAYLNKRGGQLRLTSIEAP----------SKQEWNTVEDLLTEALHMEKQ 167
KEE+ H E ++ L R + I A E + D L ++E +
Sbjct: 69 KEELHHFEQVLEILEAR--NIPYAPISASRYAKGLLAAVRTHEPQRLLDKLIVGAYIEAR 126
Query: 168 LNEKLLKLHACASQHGDANLTDFLEG 193
E+ L + H DA L F
Sbjct: 127 SCERFALL----APHLDAELAKFYVS 148
>gnl|CDD|132982 cd06651, STKc_MEKK3, Catalytic domain of the Protein
Serine/Threonine Kinase, MAP/ERK kinase kinase 3.
Serine/threonine kinases (STKs), MAP/ERK kinase kinase 3
(MEKK3) subfamily, catalytic (c) domain. STKs catalyze
the transfer of the gamma-phosphoryl group from ATP to
serine/threonine residues on protein substrates. The
MEKK3 subfamily is part of a larger superfamily that
includes the catalytic domains of other protein STKs,
protein tyrosine kinases, RIO kinases, aminoglycoside
phosphotransferase, choline kinase, and phosphoinositide
3-kinase. MEKK3 is a mitogen-activated protein kinase
(MAPK) kinase kinase (MAPKKK or MKKK or MAP3K), that
phosphorylates and activates the MAPK kinase MEK5 (or
MKK5), which in turn phosphorylates and activates
extracellular signal-regulated kinase 5 (ERK5). The ERK5
cascade plays roles in promoting cell proliferation,
differentiation, neuronal survival, and neuroprotection.
MEKK3 plays an essential role in embryonic angiogenesis
and early heart development. In addition, MEKK3 is
involved in interleukin-1 receptor and Toll-like
receptor 4 signaling. It is also a specific regulator of
the proinflammatory cytokines IL-6 and GM-CSF in some
immune cells. MEKK3 also regulates calcineurin, which
plays a critical role in T cell activation, apoptosis,
skeletal myocyte differentiation, and cardiac
hypertrophy.
Length = 266
Score = 29.3 bits (65), Expect = 1.7
Identities = 20/80 (25%), Positives = 33/80 (41%), Gaps = 9/80 (11%)
Query: 161 ALHMEKQL-----NEKLLKLHACASQHGDANLTDFLEGRYLQEQVDAIKTLADLLTTVRR 215
AL E QL +E++++ + C + LT F+E D +K L +V R
Sbjct: 50 ALECEIQLLKNLQHERIVQYYGCLRDRAEKTLTIFMEYMPGGSVKDQLKAYGALTESVTR 109
Query: 216 TQLYLVDRDLMSGKFSMHGN 235
R ++ G +H N
Sbjct: 110 K----YTRQILEGMSYLHSN 125
>gnl|CDD|236178 PRK08187, PRK08187, pyruvate kinase; Validated.
Length = 493
Score = 29.6 bits (67), Expect = 1.7
Identities = 19/71 (26%), Positives = 28/71 (39%), Gaps = 5/71 (7%)
Query: 159 TEALHMEKQLNEKLLKLHACASQHGDANLTDFLEGRYLQ--EQVDAIKTLADLLTTVRRT 216
+ L L E LL L A + A L + ++ E + + LA L +RR
Sbjct: 2 DDDLSEPAALLEALLALRADVVEEAQARLAGW--ASRIERPEFLPSAANLAHYL-ALRRH 58
Query: 217 QLYLVDRDLMS 227
L + R LM
Sbjct: 59 DLRPLQRALMP 69
>gnl|CDD|153099 cd00907, Bacterioferritin, Bacterioferritin, ferritin-like
diiron-binding domain. Bacterioferritins, also known as
cytochrome b1, are members of a broad superfamily of
ferritin-like diiron-carboxylate proteins. Similar to
ferritin in architecture, Bfr forms an oligomer of 24
subunits that assembles to form a hollow sphere with 432
symmetry. Up to 12 heme cofactor groups (iron
protoporphyrin IX or coproporphyrin III) are bound
between dimer pairs. The role of the heme is unknown,
although it may be involved in mediating iron-core
reduction and iron release. Each subunit is composed of
a four-helix bundle which carries a diiron ferroxidase
center; it is here that initial oxidation of ferrous
iron by molecular oxygen occurs, facilitating the
detoxification of iron, protection against dioxygen and
radical products, and storage of ferric-hydroxyphosphate
at the core. Some bacterioferritins are composed of two
subunit types, one conferring heme-binding ability
(alpha) and the other (beta) bestowing ferroxidase
activity.
Length = 153
Score = 28.3 bits (64), Expect = 2.3
Identities = 29/112 (25%), Positives = 41/112 (36%), Gaps = 20/112 (17%)
Query: 108 GFAKFFTHSYKEEIEHMEH---LIA---YLNKRGGQLRLTSIEAPSKQEWNTVEDLLTEA 161
G K KE IE M+H LI +L G L + E V ++L
Sbjct: 35 GLEKLAERFRKESIEEMKHADKLIERILFL---EGLPNLQRLGKLRIGE--DVPEMLEND 89
Query: 162 LHMEKQLNEKLLKLHACASQHGDANLTDFLE---------GRYLQEQVDAIK 204
L +E + L + A + GD D LE +L+ Q+D I
Sbjct: 90 LALEYEAIAALNEAIALCEEVGDYVSRDLLEEILEDEEEHIDWLETQLDLID 141
>gnl|CDD|215397 PLN02744, PLN02744, dihydrolipoyllysine-residue acetyltransferase
component of pyruvate dehydrogenase complex.
Length = 539
Score = 29.1 bits (65), Expect = 2.4
Identities = 17/50 (34%), Positives = 23/50 (46%), Gaps = 5/50 (10%)
Query: 4 DCKGSSNDPPPKKDKPKCESTRHEPTKE--NKPCKETKPKSKKSSEPTQS 51
D K SS+ P PK + + P +E KP +PK+ K S P S
Sbjct: 199 DYKPSSSAAPAA---PKAKPSPPPPKEEEVEKPASSPEPKASKPSAPPSS 245
>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
receptor (72 kDa mitochondrial outermembrane protein)
(mitochondrial import receptor for the ADP/ATP carrier)
(translocase of outermembrane tom70). [Transport and
binding proteins, Amino acids, peptides and amines].
Length = 615
Score = 29.2 bits (65), Expect = 2.4
Identities = 17/79 (21%), Positives = 33/79 (41%)
Query: 12 PPPKKDKPKCESTRHEPTKENKPCKETKPKSKKSSEPTQSSQHSPDSLGKTERQTLQNFH 71
++ KPK K+ + E K + KKS+ P + D L + + ++ N
Sbjct: 60 QQQRESKPKISKKERRKRKQAEKETEGKTEEKKSTAPKNAPVEPADELPEIDESSVANLS 119
Query: 72 EDCIDKINTQINDEIRSAY 90
E+ K ++ ++ AY
Sbjct: 120 EEERKKYAAKLKEKGNKAY 138
>gnl|CDD|240323 PTZ00233, PTZ00233, variable surface protein Vir18; Provisional.
Length = 509
Score = 28.8 bits (64), Expect = 2.6
Identities = 16/49 (32%), Positives = 22/49 (44%), Gaps = 7/49 (14%)
Query: 10 NDPPPKKDKPKCESTRHEPTKENKPCK-------ETKPKSKKSSEPTQS 51
N P K P + T EP K K CK +TK +SK P+++
Sbjct: 101 NGAFPAKKPPLIKPTTQEPCKGGKGCKTETPQRVDTKSQSKLRPVPSKA 149
>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
factor. Replicative DNA polymerases are capable of
polymerising tens of thousands of nucleotides without
dissociating from their DNA templates. The high
processivity of these polymerases is dependent upon
accessory proteins that bind to the catalytic subunit of
the polymerase or to the substrate. The Epstein-Barr
virus (EBV) BMRF1 protein is an essential component of
the viral DNA polymerase and is absolutely required for
lytic virus replication. BMRF1 is also a transactivator.
This family is predicted to have a UL42 like structure.
Length = 381
Score = 28.9 bits (65), Expect = 2.6
Identities = 13/54 (24%), Positives = 19/54 (35%), Gaps = 2/54 (3%)
Query: 2 SSDCKGSSNDPPPKKDKPKCESTRHEPTKENKPCKETKPKSKKSSEPTQSSQHS 55
S S +DP + P S H P + PK K+ + Q + S
Sbjct: 319 PSPPDTSDSDPSTETPPPA--SLSHSPPAAFERPLALSPKRKREGDKKQKKKKS 370
>gnl|CDD|133201 cd05070, PTKc_Fyn_Yrk, Catalytic domain of the Protein Tyrosine
Kinases, Fyn and Yrk. Protein Tyrosine Kinase (PTK)
family; Fyn and Yrk kinases; catalytic (c) domain. The
PTKc family is part of a larger superfamily that
includes the catalytic domains of other kinases such as
protein serine/threonine kinases, RIO kinases, and
phosphoinositide 3-kinase (PI3K). PTKs catalyze the
transfer of the gamma-phosphoryl group from ATP to
tyrosine (tyr) residues in protein substrates. Fyn and
Yrk are members of the Src subfamily of proteins, which
are cytoplasmic (or non-receptor) tyr kinases. Src
kinases contain an N-terminal SH4 domain with a
myristoylation site, followed by SH3 and SH2 domains, a
tyr kinase domain, and a regulatory C-terminal region
containing a conserved tyr. They are activated by
autophosphorylation at the tyr kinase domain, but are
negatively regulated by phosphorylation at the
C-terminal tyr by Csk (C-terminal Src Kinase). Src
proteins are involved in signaling pathways that
regulate cytokine and growth factor responses,
cytoskeleton dynamics, cell proliferation, survival, and
differentiation. Fyn, together with Lck, plays a
critical role in T-cell signal transduction by
phosphorylating ITAM (immunoreceptor tyr activation
motif) sequences on T-cell receptors, ultimately leading
to the proliferation and differentiation of T-cells. In
addition, Fyn is involved in the myelination of neurons,
and is implicated in Alzheimer's and Parkinson's
diseases. Yrk has been detected only in chickens. It is
primarily found in neuronal and epithelial cells and in
macrophages. It may play a role in inflammation and in
response to injury.
Length = 260
Score = 28.8 bits (64), Expect = 2.6
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 16/88 (18%)
Query: 155 EDLLTEALHMEKQLNEKLLKLHACASQH---------GDANLTDFL---EGRYLQ--EQV 200
E L EA M+K ++KL++L+A S+ +L DFL EGR L+ V
Sbjct: 46 ESFLEEAQIMKKLRHDKLVQLYAVVSEEPIYIVTEYMSKGSLLDFLKDGEGRALKLPNLV 105
Query: 201 DAIKTLADLLTTVRRTQLYLVDRDLMSG 228
D +A + + R + + RDL S
Sbjct: 106 DMAAQVAAGMAYIER--MNYIHRDLRSA 131
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 28.2 bits (63), Expect = 3.3
Identities = 12/45 (26%), Positives = 19/45 (42%)
Query: 12 PPPKKDKPKCESTRHEPTKENKPCKETKPKSKKSSEPTQSSQHSP 56
P PK+ K K +T +P K K+ K K + + +S
Sbjct: 46 PSPKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKT 90
>gnl|CDD|238449 cd00884, beta_CA_cladeB, Carbonic anhydrases (CA) are
zinc-containing enzymes that catalyze the reversible
hydration of carbon dioxide in a two-step mechanism in
which the nucleophilic attack of a zinc-bound hydroxide
ion on carbon dioxide is followed by the regeneration of
an active site by ionization of the zinc-bound water
molecule and removal of a proton from the active site.
CAs are ubiquitous enzymes involved in fundamental
processes like photosynthesis, respiration, pH
homeostasis and ion transport. There are three
evolutionarily distinct families of CAs (the alpha-,
beta-, and gamma-CAs) which show no significant sequence
identity or structural similarity. Within the beta-CA
family there are four evolutionarily distinct clades (A
through D). The beta-CAs are multimeric enzymes (forming
dimers,tetramers,hexamers and octamers) which are
present in higher plants, algae, fungi, archaea and
prokaryotes.
Length = 190
Score = 27.9 bits (63), Expect = 4.0
Identities = 8/20 (40%), Positives = 10/20 (50%)
Query: 108 GFAKFFTHSYKEEIEHMEHL 127
GF +F + EE E E L
Sbjct: 1 GFRRFRKEYFPEERELFEKL 20
>gnl|CDD|225104 COG2193, Bfr, Bacterioferritin (cytochrome b1) [Inorganic ion
transport and metabolism].
Length = 157
Score = 27.3 bits (61), Expect = 5.2
Identities = 30/109 (27%), Positives = 47/109 (43%), Gaps = 9/109 (8%)
Query: 108 GFAKFFTHSYKEEIEHMEH---LIAYLNKRGGQLRLTSIEAPSKQEWNTVEDLLTEALHM 164
G K H Y E IE M+H LI + G L + + TV+++L L +
Sbjct: 36 GLTKLAAHEYHESIEEMKHADQLIERILFLEGLPNLQ--DLGKLRIGETVKEMLEADLAL 93
Query: 165 EKQLNEKLLKLHACASQHGDANLTDFLEGRYL---QEQVDAIKTLADLL 210
E + + L + A + D D LE L +E +D ++T DL+
Sbjct: 94 EYEARDALKEAIAYCEEVQDYVSRDLLE-EILADEEEHIDWLETQLDLI 141
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences
recognised by sporozoite-neutralising murine monoclonal
antibodies.
Length = 727
Score = 27.7 bits (61), Expect = 6.0
Identities = 17/62 (27%), Positives = 25/62 (40%)
Query: 1 MSSDCKGSSNDPPPKKDKPKCESTRHEPTKENKPCKETKPKSKKSSEPTQSSQHSPDSLG 60
+ D KGS D D + E + +K+ K K+T+P SS T S
Sbjct: 105 QNQDTKGSKTDSEEDDDDSEEEDNKSTSSKDGKGSKKTQPGVSTSSGSTTSGTDLNTKQS 164
Query: 61 KT 62
+T
Sbjct: 165 QT 166
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 27.3 bits (61), Expect = 6.0
Identities = 13/46 (28%), Positives = 20/46 (43%), Gaps = 7/46 (15%)
Query: 12 PPPKKDKPKCESTRHEPTKENKPCKETKPKSKKSSEPTQSSQHSPD 57
P K KPK P + KP K+ + + K+ +P + SP
Sbjct: 94 KPEPKPKPK-------PKPKPKPVKKVEEQPKREVKPVEPRPASPF 132
>gnl|CDD|182447 PRK10422, PRK10422, lipopolysaccharide core biosynthesis protein;
Provisional.
Length = 352
Score = 27.4 bits (61), Expect = 6.1
Identities = 10/32 (31%), Positives = 16/32 (50%), Gaps = 2/32 (6%)
Query: 126 HLIAYLNKRGGQLRLTSIEAPSKQEWNTVEDL 157
+I L RG ++ LTS P K + V ++
Sbjct: 206 AVIDALQARGYEVVLTS--GPDKDDLACVNEI 235
>gnl|CDD|226853 COG4445, MiaE, Hydroxylase for synthesis of
2-methylthio-cis-ribozeatin in tRNA [Nucleotide
transport and metabolism / Translation, ribosomal
structure and biogenesis].
Length = 203
Score = 27.1 bits (60), Expect = 6.4
Identities = 26/127 (20%), Positives = 45/127 (35%), Gaps = 23/127 (18%)
Query: 118 KEEIEHMEHLIAYLNKRGGQLRLTSIEAP----------SKQEWNTVEDLLTEALHMEKQ 167
+EE+ H E ++ L R + I A E + D L ++E +
Sbjct: 69 REELHHFEQVLEILQARN--IPYVPIPASRYAKGLLAAVRTHEPQRLIDKLIVGAYIEAR 126
Query: 168 LNEKLLKLHACASQHGDANLTDFL------EGRYLQEQVDAIKTLADLLTTVRRTQ-LYL 220
E+ L + H D L F E R+ Q+ + D +R +
Sbjct: 127 SCERFAAL----APHLDEELAKFYKGLLRSEARHFQDYLVLADQYFDEEDVSQRVDYFGI 182
Query: 221 VDRDLMS 227
V+ +L+S
Sbjct: 183 VEAELIS 189
>gnl|CDD|218883 pfam06075, DUF936, Plant protein of unknown function (DUF936).
This family consists of several hypothetical proteins
from Arabidopsis thaliana and Oryza sativa. The function
of this family is unknown.
Length = 564
Score = 27.9 bits (62), Expect = 6.5
Identities = 13/61 (21%), Positives = 19/61 (31%)
Query: 2 SSDCKGSSNDPPPKKDKPKCESTRHEPTKENKPCKETKPKSKKSSEPTQSSQHSPDSLGK 61
S + SS P K + + K+T SK P + S+ S K
Sbjct: 245 SPRARSSSAKSPFKSSIQRKATKALSKLSLRASPKDTSKSSKSEVAPPKKSEAKVPSSSK 304
Query: 62 T 62
Sbjct: 305 K 305
>gnl|CDD|225701 COG3159, COG3159, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 218
Score = 27.3 bits (61), Expect = 6.9
Identities = 22/98 (22%), Positives = 33/98 (33%), Gaps = 14/98 (14%)
Query: 118 KEEIEHMEHLIAYLNKRGGQLRLTSIEAPSKQEWNTVEDLLTEALHMEKQLNEKLLKLHA 177
E L+ L +L S E L + ++L E+L L
Sbjct: 17 PEFFIQHAELLEEL-----RLPHPVAGTVSLVERQ-----LARLRNRIRELEEELAAL-- 64
Query: 178 CASQHGDANLTDFLEGRYLQEQVDAIKTLADLLTTVRR 215
++ AN F LQ + ++L DLL V R
Sbjct: 65 --MENARANERLFYRLHALQLDLLDARSLDDLLRRVDR 100
>gnl|CDD|236698 PRK10475, PRK10475, 23S rRNA pseudouridine synthase F; Provisional.
Length = 290
Score = 27.4 bits (61), Expect = 7.0
Identities = 10/49 (20%), Positives = 15/49 (30%)
Query: 8 SSNDPPPKKDKPKCESTRHEPTKENKPCKETKPKSKKSSEPTQSSQHSP 56
SS P K KPK + K K ++ + T +
Sbjct: 240 SSEAKPKAKAKPKTAGIKRPVVKMEKTAEKGGRPASNGKRFTSPGRKKK 288
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid layers.
The innermost capsid (core) is made of VP2. The genomic
RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 27.6 bits (61), Expect = 7.4
Identities = 21/75 (28%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 158 LTEALHMEKQLNEKLLKLHACASQHGDANLTDFLEGRYLQEQVDAIKTLADLLTTVRRTQ 217
L E + E Q+ + L A + T FL G Q DA KT+ + + R
Sbjct: 343 LKELVSTEAQIQKMSQDLQLEALTI--QSETQFLTGINSQAANDAFKTIIAAMLSQRTIS 400
Query: 218 LYLVDRDLMSGKFSM 232
L V + MS M
Sbjct: 401 LDFVTSNYMSLISGM 415
>gnl|CDD|148787 pfam07373, CAMP_factor, CAMP factor (Cfa). This family consists of
several bacterial CAMP factor (Cfa) proteins which seem
to be specific to Streptococcus species. The CAMP
reaction is a synergistic lysis of erythrocytes by the
interaction of an extracellular protein (CAMP factor)
produced by some streptococcal species with the
Staphylococcus aureus sphingomyelinase C (beta-toxin).
Length = 228
Score = 27.0 bits (60), Expect = 7.7
Identities = 10/44 (22%), Positives = 17/44 (38%)
Query: 125 EHLIAYLNKRGGQLRLTSIEAPSKQEWNTVEDLLTEALHMEKQL 168
+ + LN R QL+ + + DLL A ++ L
Sbjct: 18 QQELQKLNARIAQLQDIQKSVKGSDYEDQINDLLKAAFDLKTAL 61
>gnl|CDD|183019 PRK11179, PRK11179, DNA-binding transcriptional regulator AsnC;
Provisional.
Length = 153
Score = 26.6 bits (59), Expect = 8.7
Identities = 11/31 (35%), Positives = 17/31 (54%), Gaps = 2/31 (6%)
Query: 65 QTLQNFHEDCIDKINTQINDEIRSAYTYISM 95
+++ I+KI Q DEI+S T IS+
Sbjct: 115 RSIDALQHVLINKI--QTIDEIQSTETLISL 143
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 26.3 bits (58), Expect = 10.0
Identities = 15/42 (35%), Positives = 20/42 (47%), Gaps = 1/42 (2%)
Query: 8 SSNDPPPKKDKPKCESTRHEP-TKENKPCKETKPKSKKSSEP 48
DPP K + K + R P K K +T+PK K S+P
Sbjct: 110 EQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKP 151
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.312 0.128 0.368
Gapped
Lambda K H
0.267 0.0663 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,386,245
Number of extensions: 1012511
Number of successful extensions: 966
Number of sequences better than 10.0: 1
Number of HSP's gapped: 940
Number of HSP's successfully gapped: 75
Length of query: 239
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 145
Effective length of database: 6,768,326
Effective search space: 981407270
Effective search space used: 981407270
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 57 (25.9 bits)