RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy10513
         (239 letters)



>gnl|CDD|153114 cd01056, Euk_Ferritin, eukaryotic ferritins.  Eukaryotic Ferritin
           (Euk_Ferritin) domain. Ferritins are the primary iron
           storage proteins of most living organisms and members of
           a broad superfamily of ferritin-like diiron-carboxylate
           proteins. The iron-free (apoferritin) ferritin molecule
           is a protein shell composed of 24 protein chains
           arranged in 432 symmetry. Iron storage involves the
           uptake of iron (II) at the protein shell, its oxidation
           by molecular oxygen at the dinuclear ferroxidase
           centers, and the movement of iron (III) into the cavity
           for deposition as ferrihydrite; the protein shell can
           hold up to 4500 iron atoms. In vertebrates, two types of
           chains (subunits) have been characterized, H or M (fast)
           and L (slow), which differ in rates of iron uptake and
           mineralization. Fe(II) oxidation in the H/M subunits
           take place initially at the ferroxidase center, a
           carboxylate-bridged diiron center, located within the
           subunit four-helix bundle. In a complementary role,
           negatively charged residues on the protein shell inner
           surface of the L subunits promote ferrihydrite
           nucleation. Most plant ferritins combine both oxidase
           and nucleation functions in one chain: they have four
           interior glutamate residues as well as seven ferroxidase
           center residues.
          Length = 161

 Score =  178 bits (454), Expect = 4e-57
 Identities = 71/145 (48%), Positives = 98/145 (67%)

Query: 71  HEDCIDKINTQINDEIRSAYTYISMASFCTQDNVALFGFAKFFTHSYKEEIEHMEHLIAY 130
           HE+C   +N QIN E+ ++Y Y+SMA++  +D+VAL GFAKFF     EE EH E LI Y
Sbjct: 1   HEECEAALNKQINLELNASYVYLSMAAYFDRDDVALPGFAKFFRKLSDEEREHAEKLIKY 60

Query: 131 LNKRGGQLRLTSIEAPSKQEWNTVEDLLTEALHMEKQLNEKLLKLHACASQHGDANLTDF 190
            NKRGG++ L  I+ P K EW +  + L  AL +EK +N+ LL LH  AS+H D +L DF
Sbjct: 61  QNKRGGRVVLQDIKKPEKDEWGSGLEALELALDLEKLVNQSLLDLHKLASEHNDPHLADF 120

Query: 191 LEGRYLQEQVDAIKTLADLLTTVRR 215
           LE  +L+EQV++IK LA  +T ++R
Sbjct: 121 LESEFLEEQVESIKKLAGYITNLKR 145


>gnl|CDD|153098 cd00904, Ferritin, Ferritin iron storage proteins.  Ferritins are
           the primary iron storage proteins of most living
           organisms and members of a broad superfamily of
           ferritin-like diiron-carboxylate proteins. The iron-free
           (apoferritin) ferritin molecule is a protein shell
           composed of 24 protein chains arranged in 432 symmetry.
           Iron storage involves the uptake of iron (II) at the
           protein shell, its oxidation by molecular oxygen at the
           dinuclear ferroxidase centers, and the movement of iron
           (III) into the cavity for deposition as ferrihydrite;
           the protein shell can hold up to 4500 iron atoms. In
           vertebrates, two types of chains (subunits) have been
           characterized, H or M (fast) and L (slow), which differ
           in rates of iron uptake and mineralization. Bacterial
           non-heme ferritins are composed only of H chains. Fe(II)
           oxidation in the H/M subunits take place initially at
           the ferroxidase center, a carboxylate-bridged diiron
           center, located within the subunit four-helix bundle. In
           a complementary role, negatively charged residues on the
           protein shell inner surface of the L subunits promote
           ferrihydrite nucleation. Most plant ferritins combine
           both oxidase and nucleation functions in one chain: they
           have four interior glutamate residues as well as seven
           ferroxidase center residues.
          Length = 160

 Score =  130 bits (328), Expect = 3e-38
 Identities = 66/160 (41%), Positives = 93/160 (58%), Gaps = 4/160 (2%)

Query: 71  HEDCIDKINTQINDEIRSAYTYISMASFCTQDNVALFGFAKFFTHSYKEEIEHMEHLIAY 130
            E     +N Q+N E+ ++YTY+SMA++  +D+VAL G A FF    +EE EH E    Y
Sbjct: 1   SEKVEAAVNRQLNLELYASYTYLSMATYFDRDDVALKGVAHFFKEQAQEEREHAEKFYKY 60

Query: 131 LNKRGGQLRLTSIEAPSKQEWNTVEDLLTEALHMEKQLNEKLLKLHACASQHGDANLTDF 190
            N+RGG++ L  IE P   EW    D +  AL +EK +N+ LL LH  AS+  D +L DF
Sbjct: 61  QNERGGRVELQDIEKPPSDEWGGTLDAMEAALKLEKFVNQALLDLHELASEEKDPHLCDF 120

Query: 191 LEGRYLQEQVDAIKTLADLLTTVRRTQLYLVDRDLMSGKF 230
           LE  +L EQV  IK + D+LT + R    L  +   SG++
Sbjct: 121 LESHFLDEQVKEIKQVGDILTNLER----LNGQQAGSGEY 156


>gnl|CDD|215791 pfam00210, Ferritin, Ferritin-like domain.  This family contains
           ferritins and other ferritin-like proteins such as
           members of the DPS family and bacterioferritins.
          Length = 142

 Score =  119 bits (301), Expect = 2e-34
 Identities = 49/144 (34%), Positives = 75/144 (52%), Gaps = 6/144 (4%)

Query: 75  IDKINTQINDEIRSAYTYISMASFCTQDNVALFGFAKFFTHSYKEEIEHMEHLIAYLNKR 134
           I+ +N Q+ DE+ ++Y Y+ MA +   D   L G A+FF    +EE EH + L   +   
Sbjct: 1   IEALNEQLADELTASYQYLQMAWYF--DGPGLPGLAEFFDEQAEEEREHADKLAERILAL 58

Query: 135 GGQLRLTSIEAP---SKQEWNTVEDLLTEALHMEKQLNEKLLKLHACASQHGDANLTDFL 191
           GG+  LT +E        E+ +V + L  AL  EK++ E L +L   A + GD    DFL
Sbjct: 59  GGKPVLTPVELLEIEPPDEFGSVLEALEAALEHEKEVTESLRELIELAEEEGDYATADFL 118

Query: 192 EGRYLQEQVDAIKTLADLLTTVRR 215
           E  +L+EQ +    L  LL  ++R
Sbjct: 119 EW-FLEEQEEHEWFLEALLEKLKR 141


>gnl|CDD|153113 cd01055, Nonheme_Ferritin, nonheme-containing ferritins.  Nonheme
           Ferritin domain, found in archaea and bacteria, is a
           member of a broad superfamily of ferritin-like
           diiron-carboxylate proteins. The ferritin protein shell
           is composed of 24 protein subunits arranged in 432
           symmetry. Each protein subunit, a four-helix bundle with
           a fifth short terminal helix, contains a dinuclear
           ferroxidase center (H type). Unique to this group of
           proteins is a third metal site in the ferroxidase
           center. Iron storage involves the uptake of iron (II) at
           the protein shell, its oxidation by molecular oxygen at
           the ferroxidase centers, and the movement of iron (III)
           into the cavity for deposition as ferrihydrite.
          Length = 156

 Score =  119 bits (300), Expect = 7e-34
 Identities = 52/160 (32%), Positives = 87/160 (54%), Gaps = 9/160 (5%)

Query: 71  HEDCIDKINTQINDEIRSAYTYISMASFCTQDNVALFGFAKFFTHSYKEEIEHMEHLIAY 130
            E     +N QIN E+ S+Y Y++MA++   D+  L GFA FF    +EE EH      Y
Sbjct: 1   SEKLEKALNEQINLELYSSYLYLAMAAWF--DSKGLDGFANFFRVQAQEEREHAMKFFDY 58

Query: 131 LNKRGGQLRLTSIEAPSKQEWNTVEDLLTEALHMEKQLNEKLLKLHACASQHGDANLTDF 190
           LN RGG++ L +IEAP   E+ ++ ++   AL  E+++ E +  L   A +  D    +F
Sbjct: 59  LNDRGGRVELPAIEAP-PSEFESLLEVFEAALEHEQKVTESINNLVDLALEEKDYATFNF 117

Query: 191 LEGRYLQEQVDAIKTLADLLTTVRR-----TQLYLVDRDL 225
           L+  +++EQV+      D+L  ++        LY++D++L
Sbjct: 118 LQW-FVKEQVEEEALARDILDKLKLAGDDGGGLYMLDKEL 156


>gnl|CDD|224445 COG1528, Ftn, Ferritin-like protein [Inorganic ion transport and
           metabolism].
          Length = 167

 Score = 97.4 bits (243), Expect = 2e-25
 Identities = 49/165 (29%), Positives = 89/165 (53%), Gaps = 9/165 (5%)

Query: 72  EDCIDKINTQINDEIRSAYTYISMASFCTQDNVALFGFAKFFTHSYKEEIEHMEHLIAYL 131
           E  I+ +N Q+N E  ++  Y+ MA++C   + +L GFAKF     +EE+ H   L  YL
Sbjct: 4   EKMIELLNEQMNLEFYASNLYLQMAAWC--SSESLPGFAKFLRAQAQEELTHAMKLFNYL 61

Query: 132 NKRGGQLRLTSIEAPSKQEWNTVEDLLTEALHMEKQLNEKLLKLHACASQHGDANLTDFL 191
           N+RG +  L +IEAP   +++++++L  + L  E+++   + +L   A +  D    +FL
Sbjct: 62  NERGARPELKAIEAP-PNKFSSLKELFEKTLEHEQKVTSSINELAEVAREEKDYATFNFL 120

Query: 192 EGRYLQEQVDAIKTLADLLTTVRRTQ-----LYLVDRDLMSGKFS 231
           +  ++ EQV+  K    +L  +         LYL+D++L +   S
Sbjct: 121 QW-FVAEQVEEEKLFKTILDKLELAGNDGEGLYLIDKELKNRASS 164


>gnl|CDD|182367 PRK10304, PRK10304, ferritin; Provisional.
          Length = 165

 Score = 58.9 bits (142), Expect = 6e-11
 Identities = 44/158 (27%), Positives = 85/158 (53%), Gaps = 9/158 (5%)

Query: 73  DCIDKINTQINDEIRSAYTYISMASFCTQDNVALFGFAKFFTHSYKEEIEHMEHLIAYLN 132
           + I+K+N Q+N E+ S+  Y  M+++C+       G A F     +EE+ HM+ L  YL 
Sbjct: 5   EMIEKLNEQMNLELYSSLLYQQMSAWCSYHTFE--GAAAFLRRHAQEEMTHMQRLFDYLT 62

Query: 133 KRGGQLRLTSIEAPSKQEWNTVEDLLTEALHMEKQLNEKLLKLHACASQHGDANLTDFLE 192
             G   R+ ++E+P   E++++++L  E    E+ + +K+ +L   A  + D    +FL+
Sbjct: 63  DTGNLPRINTVESPFA-EYSSLDELFQETYKHEQLITQKINELAHAAMTNQDYPTFNFLQ 121

Query: 193 GRYLQEQVDA---IKTLADLLTTVRRT--QLYLVDRDL 225
             Y+ EQ +     K++ D L+   ++   LY +D++L
Sbjct: 122 W-YVSEQHEEEKLFKSIIDKLSLAGKSGEGLYFIDKEL 158


>gnl|CDD|184983 PRK15022, PRK15022, ferritin-like protein; Provisional.
          Length = 167

 Score = 47.2 bits (112), Expect = 9e-07
 Identities = 33/152 (21%), Positives = 73/152 (48%), Gaps = 5/152 (3%)

Query: 75  IDKINTQINDEIRSAYTYISMASFCTQDNVALFGFAKFFTHSYKEEIEHMEHLIAYLNKR 134
           + K+N+Q+N E  ++  Y+ ++ +C++   +L G A F     +  +  M  +  ++   
Sbjct: 7   LLKLNSQMNLEFYASNLYLHLSEWCSEQ--SLNGTATFLRAQAQSNVTQMMRMFNFMKSA 64

Query: 135 GGQLRLTSIEAPSKQEWNTVEDLLTEALHMEKQLNEKLLKLHACASQHGDANLTDFLEGR 194
           G    + +I+ P  ++ N++E+L  + L   +Q +  L +L   A    D +  +FL   
Sbjct: 65  GATPIVKAIDVPG-EKLNSLEELFQKTLEEYEQRSSTLAQLADEAKALNDDSTLNFLRDL 123

Query: 195 YLQEQVDAI--KTLADLLTTVRRTQLYLVDRD 224
             ++Q D +  +T+ D + + +   L  V  D
Sbjct: 124 EKEQQHDGLLLQTILDEVRSAKLAGLCPVQTD 155


>gnl|CDD|218602 pfam05478, Prominin, Prominin.  The prominins are an emerging
           family of proteins that among the multispan membrane
           proteins display a novel topology. Mouse prominin and
           human prominin (mouse)-like 1 (PROML1) are predicted to
           contain five membrane spanning domains, with an
           N-terminal domain exposed to the extracellular space
           followed by four, alternating small cytoplasmic and
           large extracellular, loops and a cytoplasmic C-terminal
           domain. The exact function of prominin is unknown
           although in humans defects in PROM1, the gene coding for
           prominin, cause retinal degeneration.
          Length = 807

 Score = 38.5 bits (90), Expect = 0.003
 Identities = 27/130 (20%), Positives = 50/130 (38%), Gaps = 18/130 (13%)

Query: 59  LGKTERQTLQNFHEDCIDKINTQ-INDEIRSAYTYISMASFCTQ-DNVA----------- 105
           L    R+ L++     +D IN      +I+   T +++ SF  + D +A           
Sbjct: 588 LTPEARRNLEDLASSGLDDINFSNFLKQIQKPLTEVNLNSFAEELDALANNLPNGKLKVA 647

Query: 106 LFGFAKFFTHSYKEEIEHMEHLIAYLNKRGGQLRLTSIEAPSKQEWNTVEDLLTEALHME 165
           L   A       +  +  ME L+  LN       + +++  S    N V+D+L E    +
Sbjct: 648 LKNEALDLRSIQQNFVPPMESLLEKLNS-----NVKTLDESSNDLQNAVKDVLAEVQAAQ 702

Query: 166 KQLNEKLLKL 175
             LN    ++
Sbjct: 703 GFLNNNASEI 712


>gnl|CDD|153097 cd00657, Ferritin_like, Ferritin-like superfamily of
           diiron-containing four-helix-bundle proteins.
           Ferritin-like, diiron-carboxylate proteins participate
           in a range of functions including iron regulation,
           mono-oxygenation, and reactive radical production. These
           proteins are characterized by the fact that they
           catalyze dioxygen-dependent oxidation-hydroxylation
           reactions within diiron centers; one exception is
           manganese catalase, which catalyzes peroxide-dependent
           oxidation-reduction within a dimanganese center.
           Diiron-carboxylate proteins are further characterized by
           the presence of duplicate metal ligands, glutamates and
           histidines (ExxH) and two additional glutamates within a
           four-helix bundle. Outside of these conserved residues
           there is little obvious homology. Members include
           bacterioferritin, ferritin, rubrerythrin, aromatic and
           alkene monooxygenase hydroxylases (AAMH), ribonucleotide
           reductase R2 (RNRR2), acyl-ACP-desaturases
           (Acyl_ACP_Desat), manganese (Mn) catalases,
           demethoxyubiquinone hydroxylases (DMQH), DNA protecting
           proteins (DPS), and ubiquinol oxidases (AOX), and the
           aerobic cyclase system, Fe-containing subunit (ACSF).
          Length = 130

 Score = 35.2 bits (81), Expect = 0.009
 Identities = 25/131 (19%), Positives = 39/131 (29%), Gaps = 16/131 (12%)

Query: 76  DKINTQINDEIRSAYTYISMASFCTQDNVALFGFAKFFTHSYKEEIEHMEHLIAYLNKRG 135
             +N  +  E  +   Y  +A+                     EE  H + L   L + G
Sbjct: 1   RLLNDALAGEYAAIIAYGQLAA-----RAPDPDLKDELLEIADEERRHADALAERLRELG 55

Query: 136 GQ------LRLTSIEAPSKQEWNTVEDLLTEALHMEKQLNEKLLKLHACASQHGDANLTD 189
           G         L +   P  +  +   + L  AL +E +      +L   A    D  L  
Sbjct: 56  GTPPLPPAHLLAAYALP--KTSDDPAEALRAALEVEARAIAAYRELIEQA---DDPELRR 110

Query: 190 FLEGRYLQEQV 200
            LE     EQ 
Sbjct: 111 LLERILADEQR 121


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 33.1 bits (76), Expect = 0.13
 Identities = 17/84 (20%), Positives = 32/84 (38%)

Query: 2    SSDCKGSSNDPPPKKDKPKCESTRHEPTKENKPCKETKPKSKKSSEPTQSSQHSPDSLGK 61
             S+     + P  KK K + E +     K+ K  K+T  K K  +   Q+S      L +
Sbjct: 1295 ESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLR 1354

Query: 62   TERQTLQNFHEDCIDKINTQINDE 85
              R+   +   +  D      +++
Sbjct: 1355 RPRKKKSDSSSEDDDDSEVDDSED 1378


>gnl|CDD|153102 cd01043, DPS, DPS protein, ferritin-like diiron-binding domain.
           DPS (DNA Protecting protein under Starved conditions)
           domain is a member of a broad superfamily of
           ferritin-like diiron-carboxylate proteins. Some DPS
           proteins nonspecifically bind DNA, protecting it from
           cleavage caused by reactive oxygen species such as the
           hydroxyl radicals produced during oxidation of Fe(II) by
           hydrogen peroxide. These proteins assemble into
           dodecameric structures, some form DPS-DNA co-crystalline
           complexes, and possess iron and H2O2 detoxification
           capabilities. Expression of DPS is induced by oxidative
           or nutritional stress, including metal ion starvation.
           Members of the DPS family are homopolymers formed by 12
           four-helix bundle subunits that assemble with 23
           symmetry into a hollow shell. The DPS ferroxidase site
           is unusual in that it is not located in a four-helix
           bundle as in ferritin, but is shared by 2-fold
           symmetry-related subunits providing the iron ligands.
           Many DPS sequences (e.g., E. coli) display an N-terminal
           extension of variable length that contains two or three
           positively charged lysine residues that extends into the
           solvent and is thought to play an important role in the
           stabilization of the complex with DNA. DPS Listeria Flp,
           Bacillus anthracis Dlp-1 and Dlp-2, and Helicobacter
           pylori HP-NAP which lack the N-terminal extension, do
           not bind DNA. DPS proteins from Helicobacter pylori,
           Treponema pallidum, and Borrelia burgdorferi are highly
           immunogenic.
          Length = 139

 Score = 31.4 bits (72), Expect = 0.17
 Identities = 12/57 (21%), Positives = 24/57 (42%), Gaps = 1/57 (1%)

Query: 138 LRLTSIEAPSKQEWNTVEDLLTEALHMEKQLNEKLLKLHACASQHGDANLTDFLEGR 194
             L++I+        + ++++ E L   + L E+L +    A + GD    D L   
Sbjct: 69  AELSTIKEEPA-GVLSAKEMVAELLEDYETLIEELREAIELADEAGDPATADLLTEI 124


>gnl|CDD|173626 cd05034, PTKc_Src_like, Catalytic domain of Src kinase-like Protein
           Tyrosine Kinases.  Protein Tyrosine Kinase (PTK) family;
           Src kinase subfamily; catalytic (c) domain. Src
           subfamily members include Src, Lck, Hck, Blk, Lyn, Fgr,
           Fyn, Yrk, and Yes. The PTKc family is part of a larger
           superfamily that includes the catalytic domains of other
           kinases such as protein serine/threonine kinases, RIO
           kinases, and phosphoinositide 3-kinase (PI3K). PTKs
           catalyze the transfer of the gamma-phosphoryl group from
           ATP to tyrosine (tyr) residues in protein substrates.
           Src (or c-Src) proteins are cytoplasmic (or
           non-receptor) tyr kinases which are anchored to the
           plasma membrane. They contain an N-terminal SH4 domain
           with a myristoylation site, followed by SH3 and SH2
           domains, a tyr kinase domain, and a regulatory
           C-terminal region containing a conserved tyr. They are
           activated by autophosphorylation at the tyr kinase
           domain, but are negatively regulated by phosphorylation
           at the C-terminal tyr by Csk (C-terminal Src Kinase).
           Src proteins are involved in signaling pathways that
           regulate cytokine and growth factor responses,
           cytoskeleton dynamics, cell proliferation, survival, and
           differentiation. They were identified as the first
           proto-oncogene products, and they regulate cell
           adhesion, invasion, and motility in cancer cells and
           tumor vasculature, contributing to cancer progression
           and metastasis. Src kinases are overexpressed in a
           variety of human cancers, making them attractive targets
           for therapy. They are also implicated in acute
           inflammatory responses and osteoclast function. Src,
           Fyn, Yes, and Yrk are widely expressed, while Blk, Lck,
           Hck, Fgr, and Lyn show a limited expression pattern.
          Length = 261

 Score = 31.9 bits (73), Expect = 0.22
 Identities = 26/93 (27%), Positives = 40/93 (43%), Gaps = 27/93 (29%)

Query: 153 TVEDLLTEALHMEKQLNEKLLKLHACASQ------------HGDANLTDFL---EGRYLQ 197
           + E  L EA  M+K  ++KL++L+A  S+             G  +L DFL   EG+ L+
Sbjct: 44  SPEAFLQEAQIMKKLRHDKLVQLYAVCSEEEPIYIVTEYMSKG--SLLDFLKSGEGKKLR 101

Query: 198 EQVDAIKTLADLLTTVRRTQLYL-----VDRDL 225
                   L D+   +     YL     + RDL
Sbjct: 102 LP-----QLVDMAAQIAEGMAYLESRNYIHRDL 129


>gnl|CDD|133202 cd05071, PTKc_Src, Catalytic domain of the Protein Tyrosine Kinase,
           Src.  Protein Tyrosine Kinase (PTK) family; Src kinase;
           catalytic (c) domain. The PTKc family is part of a
           larger superfamily that includes the catalytic domains
           of other kinases such as protein serine/threonine
           kinases, RIO kinases, and phosphoinositide 3-kinase
           (PI3K). PTKs catalyze the transfer of the
           gamma-phosphoryl group from ATP to tyrosine (tyr)
           residues in protein substrates. Src (or c-Src) is a
           cytoplasmic (or non-receptor) tyr kinase, containing an
           N-terminal SH4 domain with a myristoylation site,
           followed by SH3 and SH2 domains, a tyr kinase domain,
           and a regulatory C-terminal region with a conserved tyr.
           It is activated by autophosphorylation at the tyr kinase
           domain, and is negatively regulated by phosphorylation
           at the C-terminal tyr by Csk (C-terminal Src Kinase).
           c-Src is the vertebrate homolog of the oncogenic protein
           (v-Src) from Rous sarcoma virus. Together with other Src
           subfamily proteins, it is involved in signaling pathways
           that regulate cytokine and growth factor responses,
           cytoskeleton dynamics, cell proliferation, survival, and
           differentiation. Src also play a role in regulating cell
           adhesion, invasion, and motility in cancer cells and
           tumor vasculature, contributing to cancer progression
           and metastasis. Elevated levels of Src kinase activity
           have been reported in a variety of human cancers.
           Several inhibitors of Src have been developed as
           anti-cancer drugs. Src is also implicated in acute
           inflammatory responses and osteoclast function.
          Length = 262

 Score = 31.9 bits (72), Expect = 0.25
 Identities = 27/85 (31%), Positives = 39/85 (45%), Gaps = 16/85 (18%)

Query: 155 EDLLTEALHMEKQLNEKLLKLHACASQH---------GDANLTDFLEGRY-----LQEQV 200
           E  L EA  M+K  +EKL++L+A  S+             +L DFL+G       L + V
Sbjct: 46  EAFLQEAQVMKKLRHEKLVQLYAVVSEEPIYIVTEYMSKGSLLDFLKGEMGKYLRLPQLV 105

Query: 201 DAIKTLADLLTTVRRTQLYLVDRDL 225
           D    +A  +  V R  +  V RDL
Sbjct: 106 DMAAQIASGMAYVER--MNYVHRDL 128


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 31.7 bits (72), Expect = 0.28
 Identities = 10/38 (26%), Positives = 16/38 (42%)

Query: 12  PPPKKDKPKCESTRHEPTKENKPCKETKPKSKKSSEPT 49
            PP + +   E T  +P ++ KP K+ K    K     
Sbjct: 68  KPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKP 105


>gnl|CDD|133200 cd05069, PTKc_Yes, Catalytic domain of the Protein Tyrosine Kinase,
           Yes.  Protein Tyrosine Kinase (PTK) family; Yes kinase;
           catalytic (c) domain. The PTKc family is part of a
           larger superfamily that includes the catalytic domains
           of other kinases such as protein serine/threonine
           kinases, RIO kinases, and phosphoinositide 3-kinase
           (PI3K). PTKs catalyze the transfer of the
           gamma-phosphoryl group from ATP to tyrosine (tyr)
           residues in protein substrates. Yes (or c-Yes) is a
           member of the Src subfamily of proteins, which are
           cytoplasmic (or non-receptor) tyr kinases. Src kinases
           contain an N-terminal SH4 domain with a myristoylation
           site, followed by SH3 and SH2 domains, a tyr kinase
           domain, and a regulatory C-terminal region containing a
           conserved tyr. They are activated by autophosphorylation
           at the tyr kinase domain, but are negatively regulated
           by phosphorylation at the C-terminal tyr by Csk
           (C-terminal Src Kinase). Src proteins are involved in
           signaling pathways that regulate cytokine and growth
           factor responses, cytoskeleton dynamics, cell
           proliferation, survival, and differentiation. c-Yes
           kinase is the cellular homolog of the oncogenic protein
           (v-Yes) encoded by the Yamaguchi 73 and Esh sarcoma
           viruses. It displays functional overlap with other Src
           subfamily members, particularly Src. It also shows some
           unique functions such as binding to occludins,
           transmembrane proteins that regulate extracellular
           interactions in tight junctions. Yes also associates
           with a number of proteins in different cell types that
           Src does not interact with, like JAK2 and gp130 in
           pre-adipocytes, and Pyk2 in treated pulmonary vein
           endothelial cells. Although the biological function of
           Yes remains unclear, it appears to have a role in
           regulating cell-cell interactions and vesicle
           trafficking in polarized cells.
          Length = 260

 Score = 31.2 bits (70), Expect = 0.44
 Identities = 27/85 (31%), Positives = 43/85 (50%), Gaps = 16/85 (18%)

Query: 155 EDLLTEALHMEKQLNEKLLKLHACASQH---------GDANLTDFL---EGRYLQ--EQV 200
           E  L EA  M+K  ++KL+ L+A  S+          G  +L DFL   +G+YL+  + V
Sbjct: 46  EAFLQEAQIMKKLRHDKLVPLYAVVSEEPIYIVTEFMGKGSLLDFLKEGDGKYLKLPQLV 105

Query: 201 DAIKTLADLLTTVRRTQLYLVDRDL 225
           D    +AD +  + R  +  + RDL
Sbjct: 106 DMAAQIADGMAYIER--MNYIHRDL 128


>gnl|CDD|187551 cd05240, UDP_G4E_3_SDR_e, UDP-glucose 4 epimerase (G4E), subgroup
           3, extended (e) SDRs.  Members of this bacterial
           subgroup are identified as possible sugar epimerases,
           such as UDP-glucose 4 epimerase. However, while the
           NAD(P)-binding motif is fairly well conserved, not all
           members retain the canonical active site tetrad of the
           extended SDRs. UDP-glucose 4 epimerase (aka
           UDP-galactose-4-epimerase), is a homodimeric extended
           SDR. It catalyzes the NAD-dependent conversion of
           UDP-galactose to UDP-glucose, the final step in Leloir
           galactose synthesis. Extended SDRs are distinct from
           classical SDRs. In addition to the Rossmann fold
           (alpha/beta folding pattern with a central beta-sheet)
           core region typical of all SDRs, extended SDRs have a
           less conserved C-terminal extension of approximately 100
           amino acids. Extended SDRs are a diverse collection of
           proteins, and include isomerases, epimerases,
           oxidoreductases, and lyases; they typically have a
           TGXXGXXG cofactor binding motif. SDRs are a functionally
           diverse family of oxidoreductases that have a single
           domain with a structurally conserved Rossmann fold, an
           NAD(P)(H)-binding region, and a structurally diverse
           C-terminal region. Sequence identity between different
           SDR enzymes is typically in the 15-30% range; they
           catalyze a wide range of activities including the
           metabolism of steroids, cofactors, carbohydrates,
           lipids, aromatic compounds, and amino acids, and act in
           redox sensing. Classical SDRs have an TGXXX[AG]XG
           cofactor binding motif and a YXXXK active site motif,
           with the Tyr residue of the active site motif serving as
           a critical catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase numbering). In
           addition to the Tyr and Lys, there is often an upstream
           Ser and/or an Asn, contributing to the active site;
           while substrate binding is in the C-terminal region,
           which determines specificity. The standard reaction
           mechanism is a 4-pro-S hydride transfer and proton relay
           involving the conserved Tyr and Lys, a water molecule
           stabilized by Asn, and nicotinamide. Atypical SDRs
           generally lack the catalytic residues characteristic of
           the SDRs, and their glycine-rich NAD(P)-binding motif is
           often different from the forms normally seen in
           classical or extended SDRs. Complex (multidomain) SDRs
           such as ketoreductase domains of fatty acid synthase
           have a GGXGXXG NAD(P)-binding motif and an altered
           active site motif (YXXXN). Fungal type ketoacyl
           reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
          Length = 306

 Score = 30.8 bits (70), Expect = 0.53
 Identities = 13/43 (30%), Positives = 16/43 (37%)

Query: 154 VEDLLTEALHMEKQLNEKLLKLHACASQHGDANLTDFLEGRYL 196
           VE LL E      +LN  +L+              DFL  R L
Sbjct: 145 VEQLLAEFRRRHPELNVTVLRPATILGPGTRNTTRDFLSPRRL 187


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is
           a family of proteins conserved in fungi. The function is
           not known.
          Length = 436

 Score = 30.9 bits (70), Expect = 0.57
 Identities = 12/58 (20%), Positives = 17/58 (29%)

Query: 6   KGSSNDPPPKKDKPKCESTRHEPTKENKPCKETKPKSKKSSEPTQSSQHSPDSLGKTE 63
           +  +   P K  KP   S      K  KP K  +       +    S  S +     E
Sbjct: 58  RPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEEEEEPEE 115


>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
          Length = 943

 Score = 31.2 bits (70), Expect = 0.61
 Identities = 17/58 (29%), Positives = 24/58 (41%), Gaps = 5/58 (8%)

Query: 7   GSSNDPPPKKDKPKCESTRHEPTKENKPCKE-TKPKSKKSSE----PTQSSQHSPDSL 59
            +S  PP      + E   HE +KE+   KE  KP   K  E    P  + +H P  +
Sbjct: 515 EASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKI 572


>gnl|CDD|143491 cd06817, PLPDE_III_DSD, Type III Pyridoxal 5-phosphate
           (PLP)-Dependent Enzyme Eukaryotic D-Serine Dehydratase. 
           This subfamily is composed of chicken D-serine
           dehydratase (DSD, EC 4.3.1.18) and similar eukaryotic
           proteins. Chicken DSD catalyzes the dehydration of
           D-serine to aminoacrylate, which is rapidly hydrolyzed
           to pyruvate and ammonia. It is a fold type III
           PLP-dependent enzyme with similarity to bacterial
           alanine racemase (AR), which contains an N-terminal
           PLP-binding TIM-barrel domain and a C-terminal
           beta-sandwich domain. AR exists as dimers with active
           sites that lie at the interface between the TIM barrel
           domain of one subunit and the beta-sandwich domain of
           the other subunit. Experimental data suggest that
           chicken DSD also exists as dimers. Sequence comparison
           and biochemical experiments show that chicken DSD is
           distinct from the ubiquitous bacterial DSDs coded by
           dsdA gene, mammalian L-serine dehydratases (LSD) and
           mammalian serine racemase (SerRac), which are fold type
           II PLP-dependent enzymes.
          Length = 389

 Score = 30.4 bits (69), Expect = 0.92
 Identities = 25/91 (27%), Positives = 34/91 (37%), Gaps = 21/91 (23%)

Query: 101 QDNVALFGFAKFFTHSYK------------EEIEHMEHLIAYLNKRGGQLRLT-SIEA-P 146
            + V LFGF     HSY             EEIE +      L    G  +LT S+ A P
Sbjct: 165 SEAVELFGFYSHAGHSYSSRSAEDAKEVLREEIEAVLTAAKKLKSIQGDRKLTLSVGATP 224

Query: 147 SKQEWNTVEDLLTEALHMEKQLNEKLLKLHA 177
           +      +  +   +L         LL+LHA
Sbjct: 225 TAHAAEALVLIPAPSLSG-------LLELHA 248


>gnl|CDD|191371 pfam05778, Apo-CIII, Apolipoprotein CIII (Apo-CIII).  This family
           consists of several mammalian apolipoprotein CIII
           (Apo-CIII) sequences. Apolipoprotein C-III is a
           79-residue glycoprotein. It is synthesised in the
           intestine and liver as part of the very low density
           lipoprotein (VLDL) and the high density lipoprotein
           (HDL) particles. Owing to its positive correlation with
           plasma triglyceride (Tg) levels, Apo-CIII is suggested
           to play a role in Tg metabolism and is therefore of
           interest regarding atherosclerosis. However, unlike
           other apolipoproteins such as Apo-AI, Apo E or CII for
           which many naturally occurring mutations are known, the
           structure-function relationships of apo C-III remains a
           subject of debate. One possibility is that apo C-III
           inhibits lipoprotein lipase (LPL) activity, as shown by
           in vitro experiments. Another suggestion, is that
           elevated levels of Apo-CIII displace other
           apolipoproteins at the lipoprotein surface, modifying
           their clearance from plasma.
          Length = 70

 Score = 28.0 bits (62), Expect = 1.1
 Identities = 16/48 (33%), Positives = 28/48 (58%), Gaps = 4/48 (8%)

Query: 184 DANLTDFLEGRYLQEQVDAIKTLADLLTTVRRTQLYLVDRDLMSGKFS 231
           D++L   ++G Y+++   A KT  D LT+V+ +Q+    R  M+  FS
Sbjct: 4   DSSLLGKMQG-YMKQ---ATKTAQDALTSVQESQVAQQARGWMTDSFS 47


>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
          Length = 475

 Score = 29.9 bits (67), Expect = 1.1
 Identities = 12/30 (40%), Positives = 16/30 (53%)

Query: 12 PPPKKDKPKCESTRHEPTKENKPCKETKPK 41
          P  K   P   + R E  K++KP +E KPK
Sbjct: 39 PATKAAAPAAAAPRAEKPKKDKPRRERKPK 68


>gnl|CDD|233728 TIGR02102, pullulan_Gpos, pullulanase, extracellular, Gram-positive. 
            Pullulan is an unusual, industrially important
            polysaccharide in which short alpha-1,4 chains
            (maltotriose) are connected in alpha-1,6 linkages.
            Enzymes that cleave alpha-1,6 linkages in pullulan and
            release maltotriose are called pullulanases although
            pullulan itself may not be the natural substrate. In
            contrast, a glycogen debranching enzyme such GlgX,
            homologous to this family, can release glucose at
            alpha,1-6 linkages from glycogen first subjected to limit
            degradation by phosphorylase. Characterized members of
            this family include a surface-located pullulanase from
            Streptococcus pneumoniae (PMID:11083842) and an
            extracellular bifunctional amylase/pullulanase with
            C-terminal pullulanase activity (PMID:8798645).
          Length = 1111

 Score = 29.8 bits (67), Expect = 1.4
 Identities = 13/70 (18%), Positives = 25/70 (35%), Gaps = 3/70 (4%)

Query: 3    SDCKGSSNDPPPKKDKPKCESTRHEPTKENKPCKETKPKSKKSSEPTQSSQHSPDSLGKT 62
               +     PPP + +P+       PT++    K       K ++   +   S D  G+T
Sbjct: 1005 GGIEAPEKTPPPPEHEPQAP---KPPTQDPDGSKPKDKVDPKDNKDPLTPPGSDDENGET 1061

Query: 63   ERQTLQNFHE 72
             +   +   E
Sbjct: 1062 PKGNEEKKEE 1071


>gnl|CDD|132155 TIGR03111, glyc2_xrt_Gpos1, putative glycosyltransferase,
           exosortase G-associated.  Members of this protein family
           are probable glycosyltransferases of family 2, whose
           genes are near those for the exosortase homolog XrtG
           (TIGR03110), which is restricted to Gram-positive
           bacteria. Other genes in the conserved gene neighborhood
           include a 6-pyruvoyl tetrahydropterin synthase homolog
           (TIGR03112) and an uncharacterized intergral membrane
           protein (TIGR03766).
          Length = 439

 Score = 29.7 bits (67), Expect = 1.5
 Identities = 14/35 (40%), Positives = 20/35 (57%), Gaps = 5/35 (14%)

Query: 190 FLEGRYLQEQVDAIKTLADLLTTVRR-----TQLY 219
           FL GR  + QV+++ TL+   +  RR     TQLY
Sbjct: 199 FLAGRNFESQVNSLFTLSGAFSAFRRETILKTQLY 233


>gnl|CDD|218919 pfam06163, DUF977, Bacterial protein of unknown function (DUF977). 
           This family consists of several hypothetical bacterial
           proteins from Escherichia coli and Salmonella typhi. The
           function of this family is unknown.
          Length = 127

 Score = 28.5 bits (63), Expect = 1.5
 Identities = 17/53 (32%), Positives = 24/53 (45%), Gaps = 3/53 (5%)

Query: 110 AKFFTHSYKEEIEHMEHLIAYLNKRGGQLRLTSIEAPSKQEWNTVEDLLTEAL 162
           AK FT   +E I   E +I  L +  G++    +EA +     T E  L  AL
Sbjct: 2   AKPFTQEEREII--KERIIG-LVREQGRITTNQLEAITGAHRTTAEKYLRIAL 51


>gnl|CDD|114868 pfam06175, MiaE, tRNA-(MS[2]IO[6]A)-hydroxylase (MiaE).  This
           family consists of several bacterial
           tRNA-(MS[2]IO[6]A)-hydroxylase (MiaE) proteins. The
           modified nucleoside 2-methylthio-N-6-isopentenyl
           adenosine (ms2i6A) is present at position 37 (3' of the
           anticodon) of tRNAs that read codons beginning with U
           except tRNA(I,V Ser) in Escherichia coli. Salmonella
           typhimurium 2-methylthio-cis-ribozeatin (ms2io6A) is
           found in tRNA, probably in the corresponding species
           that have ms2i6A in E. coli. The miaE gene is absent in
           E. coli, a finding consistent with the absence of the
           hydroxylated derivative of ms2i6A in this species.
          Length = 199

 Score = 29.0 bits (65), Expect = 1.7
 Identities = 20/86 (23%), Positives = 30/86 (34%), Gaps = 16/86 (18%)

Query: 118 KEEIEHMEHLIAYLNKRGGQLRLTSIEAP----------SKQEWNTVEDLLTEALHMEKQ 167
           KEE+ H E ++  L  R   +    I A              E   + D L    ++E +
Sbjct: 69  KEELHHFEQVLEILEAR--NIPYAPISASRYAKGLLAAVRTHEPQRLLDKLIVGAYIEAR 126

Query: 168 LNEKLLKLHACASQHGDANLTDFLEG 193
             E+   L    + H DA L  F   
Sbjct: 127 SCERFALL----APHLDAELAKFYVS 148


>gnl|CDD|132982 cd06651, STKc_MEKK3, Catalytic domain of the Protein
           Serine/Threonine Kinase, MAP/ERK kinase kinase 3.
           Serine/threonine kinases (STKs), MAP/ERK kinase kinase 3
           (MEKK3) subfamily, catalytic (c) domain. STKs catalyze
           the transfer of the gamma-phosphoryl group from ATP to
           serine/threonine residues on protein substrates. The
           MEKK3 subfamily is part of a larger superfamily that
           includes the catalytic domains of other protein STKs,
           protein tyrosine kinases, RIO kinases, aminoglycoside
           phosphotransferase, choline kinase, and phosphoinositide
           3-kinase. MEKK3 is a mitogen-activated protein kinase
           (MAPK) kinase kinase (MAPKKK or MKKK or MAP3K), that
           phosphorylates and activates the MAPK kinase MEK5 (or
           MKK5), which in turn phosphorylates and activates
           extracellular signal-regulated kinase 5 (ERK5). The ERK5
           cascade plays roles in promoting cell proliferation,
           differentiation, neuronal survival, and neuroprotection.
           MEKK3 plays an essential role in embryonic angiogenesis
           and early heart development. In addition, MEKK3 is
           involved in interleukin-1 receptor and Toll-like
           receptor 4 signaling. It is also a specific regulator of
           the proinflammatory cytokines IL-6 and GM-CSF in some
           immune cells. MEKK3 also regulates calcineurin, which
           plays a critical role in T cell activation, apoptosis,
           skeletal myocyte differentiation, and cardiac
           hypertrophy.
          Length = 266

 Score = 29.3 bits (65), Expect = 1.7
 Identities = 20/80 (25%), Positives = 33/80 (41%), Gaps = 9/80 (11%)

Query: 161 ALHMEKQL-----NEKLLKLHACASQHGDANLTDFLEGRYLQEQVDAIKTLADLLTTVRR 215
           AL  E QL     +E++++ + C     +  LT F+E        D +K    L  +V R
Sbjct: 50  ALECEIQLLKNLQHERIVQYYGCLRDRAEKTLTIFMEYMPGGSVKDQLKAYGALTESVTR 109

Query: 216 TQLYLVDRDLMSGKFSMHGN 235
                  R ++ G   +H N
Sbjct: 110 K----YTRQILEGMSYLHSN 125


>gnl|CDD|236178 PRK08187, PRK08187, pyruvate kinase; Validated.
          Length = 493

 Score = 29.6 bits (67), Expect = 1.7
 Identities = 19/71 (26%), Positives = 28/71 (39%), Gaps = 5/71 (7%)

Query: 159 TEALHMEKQLNEKLLKLHACASQHGDANLTDFLEGRYLQ--EQVDAIKTLADLLTTVRRT 216
            + L     L E LL L A   +   A L  +     ++  E + +   LA  L  +RR 
Sbjct: 2   DDDLSEPAALLEALLALRADVVEEAQARLAGW--ASRIERPEFLPSAANLAHYL-ALRRH 58

Query: 217 QLYLVDRDLMS 227
            L  + R LM 
Sbjct: 59  DLRPLQRALMP 69


>gnl|CDD|153099 cd00907, Bacterioferritin, Bacterioferritin, ferritin-like
           diiron-binding domain.  Bacterioferritins, also known as
           cytochrome b1, are members of a broad superfamily of
           ferritin-like diiron-carboxylate proteins. Similar to
           ferritin in architecture, Bfr forms an oligomer of 24
           subunits that assembles to form a hollow sphere with 432
           symmetry. Up to 12 heme cofactor groups (iron
           protoporphyrin IX or coproporphyrin III) are bound
           between dimer pairs. The role of the heme is unknown,
           although it may be involved in mediating iron-core
           reduction and iron release. Each subunit is composed of
           a four-helix bundle which carries a diiron ferroxidase
           center; it is here that initial oxidation of ferrous
           iron by molecular oxygen occurs, facilitating the
           detoxification of iron, protection against dioxygen and
           radical products, and storage of ferric-hydroxyphosphate
           at the core. Some bacterioferritins are composed of two
           subunit types, one conferring heme-binding ability
           (alpha) and the other (beta) bestowing ferroxidase
           activity.
          Length = 153

 Score = 28.3 bits (64), Expect = 2.3
 Identities = 29/112 (25%), Positives = 41/112 (36%), Gaps = 20/112 (17%)

Query: 108 GFAKFFTHSYKEEIEHMEH---LIA---YLNKRGGQLRLTSIEAPSKQEWNTVEDLLTEA 161
           G  K      KE IE M+H   LI    +L    G   L  +      E   V ++L   
Sbjct: 35  GLEKLAERFRKESIEEMKHADKLIERILFL---EGLPNLQRLGKLRIGE--DVPEMLEND 89

Query: 162 LHMEKQLNEKLLKLHACASQHGDANLTDFLE---------GRYLQEQVDAIK 204
           L +E +    L +  A   + GD    D LE           +L+ Q+D I 
Sbjct: 90  LALEYEAIAALNEAIALCEEVGDYVSRDLLEEILEDEEEHIDWLETQLDLID 141


>gnl|CDD|215397 PLN02744, PLN02744, dihydrolipoyllysine-residue acetyltransferase
           component of pyruvate dehydrogenase complex.
          Length = 539

 Score = 29.1 bits (65), Expect = 2.4
 Identities = 17/50 (34%), Positives = 23/50 (46%), Gaps = 5/50 (10%)

Query: 4   DCKGSSNDPPPKKDKPKCESTRHEPTKE--NKPCKETKPKSKKSSEPTQS 51
           D K SS+  P     PK + +   P +E   KP    +PK+ K S P  S
Sbjct: 199 DYKPSSSAAPAA---PKAKPSPPPPKEEEVEKPASSPEPKASKPSAPPSS 245


>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
           receptor (72 kDa mitochondrial outermembrane protein)
           (mitochondrial import receptor for the ADP/ATP carrier)
           (translocase of outermembrane tom70).  [Transport and
           binding proteins, Amino acids, peptides and amines].
          Length = 615

 Score = 29.2 bits (65), Expect = 2.4
 Identities = 17/79 (21%), Positives = 33/79 (41%)

Query: 12  PPPKKDKPKCESTRHEPTKENKPCKETKPKSKKSSEPTQSSQHSPDSLGKTERQTLQNFH 71
              ++ KPK         K+ +   E K + KKS+ P  +     D L + +  ++ N  
Sbjct: 60  QQQRESKPKISKKERRKRKQAEKETEGKTEEKKSTAPKNAPVEPADELPEIDESSVANLS 119

Query: 72  EDCIDKINTQINDEIRSAY 90
           E+   K   ++ ++   AY
Sbjct: 120 EEERKKYAAKLKEKGNKAY 138


>gnl|CDD|240323 PTZ00233, PTZ00233, variable surface protein Vir18; Provisional.
          Length = 509

 Score = 28.8 bits (64), Expect = 2.6
 Identities = 16/49 (32%), Positives = 22/49 (44%), Gaps = 7/49 (14%)

Query: 10  NDPPPKKDKPKCESTRHEPTKENKPCK-------ETKPKSKKSSEPTQS 51
           N   P K  P  + T  EP K  K CK       +TK +SK    P+++
Sbjct: 101 NGAFPAKKPPLIKPTTQEPCKGGKGCKTETPQRVDTKSQSKLRPVPSKA 149


>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
           factor.  Replicative DNA polymerases are capable of
           polymerising tens of thousands of nucleotides without
           dissociating from their DNA templates. The high
           processivity of these polymerases is dependent upon
           accessory proteins that bind to the catalytic subunit of
           the polymerase or to the substrate. The Epstein-Barr
           virus (EBV) BMRF1 protein is an essential component of
           the viral DNA polymerase and is absolutely required for
           lytic virus replication. BMRF1 is also a transactivator.
           This family is predicted to have a UL42 like structure.
          Length = 381

 Score = 28.9 bits (65), Expect = 2.6
 Identities = 13/54 (24%), Positives = 19/54 (35%), Gaps = 2/54 (3%)

Query: 2   SSDCKGSSNDPPPKKDKPKCESTRHEPTKENKPCKETKPKSKKSSEPTQSSQHS 55
            S    S +DP  +   P   S  H P    +      PK K+  +  Q  + S
Sbjct: 319 PSPPDTSDSDPSTETPPPA--SLSHSPPAAFERPLALSPKRKREGDKKQKKKKS 370


>gnl|CDD|133201 cd05070, PTKc_Fyn_Yrk, Catalytic domain of the Protein Tyrosine
           Kinases, Fyn and Yrk.  Protein Tyrosine Kinase (PTK)
           family; Fyn and Yrk kinases; catalytic (c) domain. The
           PTKc family is part of a larger superfamily that
           includes the catalytic domains of other kinases such as
           protein serine/threonine kinases, RIO kinases, and
           phosphoinositide 3-kinase (PI3K). PTKs catalyze the
           transfer of the gamma-phosphoryl group from ATP to
           tyrosine (tyr) residues in protein substrates. Fyn and
           Yrk are members of the Src subfamily of proteins, which
           are cytoplasmic (or non-receptor) tyr kinases. Src
           kinases contain an N-terminal SH4 domain with a
           myristoylation site, followed by SH3 and SH2 domains, a
           tyr kinase domain, and a regulatory C-terminal region
           containing a conserved tyr. They are activated by
           autophosphorylation at the tyr kinase domain, but are
           negatively regulated by phosphorylation at the
           C-terminal tyr by Csk (C-terminal Src Kinase). Src
           proteins are involved in signaling pathways that
           regulate cytokine and growth factor responses,
           cytoskeleton dynamics, cell proliferation, survival, and
           differentiation. Fyn, together with Lck, plays a
           critical role in T-cell signal transduction by
           phosphorylating ITAM (immunoreceptor tyr activation
           motif) sequences on T-cell receptors, ultimately leading
           to the proliferation and differentiation of T-cells. In
           addition, Fyn is involved in the myelination of neurons,
           and is implicated in Alzheimer's and Parkinson's
           diseases. Yrk has been detected only in chickens. It is
           primarily found in neuronal and epithelial cells and in
           macrophages. It may play a role in inflammation and in
           response to injury.
          Length = 260

 Score = 28.8 bits (64), Expect = 2.6
 Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 16/88 (18%)

Query: 155 EDLLTEALHMEKQLNEKLLKLHACASQH---------GDANLTDFL---EGRYLQ--EQV 200
           E  L EA  M+K  ++KL++L+A  S+             +L DFL   EGR L+    V
Sbjct: 46  ESFLEEAQIMKKLRHDKLVQLYAVVSEEPIYIVTEYMSKGSLLDFLKDGEGRALKLPNLV 105

Query: 201 DAIKTLADLLTTVRRTQLYLVDRDLMSG 228
           D    +A  +  + R  +  + RDL S 
Sbjct: 106 DMAAQVAAGMAYIER--MNYIHRDLRSA 131


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 28.2 bits (63), Expect = 3.3
 Identities = 12/45 (26%), Positives = 19/45 (42%)

Query: 12 PPPKKDKPKCESTRHEPTKENKPCKETKPKSKKSSEPTQSSQHSP 56
          P PK+ K K  +T  +P    K  K+ K K  +  +  +S     
Sbjct: 46 PSPKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKT 90


>gnl|CDD|238449 cd00884, beta_CA_cladeB, Carbonic anhydrases (CA) are
           zinc-containing enzymes that catalyze the reversible
           hydration of carbon dioxide in a two-step mechanism in
           which the nucleophilic attack of a zinc-bound hydroxide
           ion on carbon dioxide is followed by the regeneration of
           an active site by ionization of the zinc-bound water
           molecule and removal of a proton from the active site.
           CAs are ubiquitous enzymes involved in fundamental
           processes like photosynthesis, respiration, pH
           homeostasis and ion transport. There are three
           evolutionarily distinct families of CAs (the alpha-,
           beta-, and gamma-CAs) which show no significant sequence
           identity or structural similarity.  Within the beta-CA
           family there are four evolutionarily distinct clades (A
           through D). The beta-CAs are multimeric enzymes (forming
           dimers,tetramers,hexamers and octamers) which are
           present in higher plants, algae, fungi, archaea and
           prokaryotes.
          Length = 190

 Score = 27.9 bits (63), Expect = 4.0
 Identities = 8/20 (40%), Positives = 10/20 (50%)

Query: 108 GFAKFFTHSYKEEIEHMEHL 127
           GF +F    + EE E  E L
Sbjct: 1   GFRRFRKEYFPEERELFEKL 20


>gnl|CDD|225104 COG2193, Bfr, Bacterioferritin (cytochrome b1) [Inorganic ion
           transport and metabolism].
          Length = 157

 Score = 27.3 bits (61), Expect = 5.2
 Identities = 30/109 (27%), Positives = 47/109 (43%), Gaps = 9/109 (8%)

Query: 108 GFAKFFTHSYKEEIEHMEH---LIAYLNKRGGQLRLTSIEAPSKQEWNTVEDLLTEALHM 164
           G  K   H Y E IE M+H   LI  +    G   L   +    +   TV+++L   L +
Sbjct: 36  GLTKLAAHEYHESIEEMKHADQLIERILFLEGLPNLQ--DLGKLRIGETVKEMLEADLAL 93

Query: 165 EKQLNEKLLKLHACASQHGDANLTDFLEGRYL---QEQVDAIKTLADLL 210
           E +  + L +  A   +  D    D LE   L   +E +D ++T  DL+
Sbjct: 94  EYEARDALKEAIAYCEEVQDYVSRDLLE-EILADEEEHIDWLETQLDLI 141


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
           family consists of several Theileria P67 surface
           antigens. A stage specific surface antigen of Theileria
           parva, p67, is the basis for the development of an
           anti-sporozoite vaccine for the control of East Coast
           fever (ECF) in cattle. The antigen has been shown to
           contain five distinct linear peptide sequences
           recognised by sporozoite-neutralising murine monoclonal
           antibodies.
          Length = 727

 Score = 27.7 bits (61), Expect = 6.0
 Identities = 17/62 (27%), Positives = 25/62 (40%)

Query: 1   MSSDCKGSSNDPPPKKDKPKCESTRHEPTKENKPCKETKPKSKKSSEPTQSSQHSPDSLG 60
            + D KGS  D     D  + E  +   +K+ K  K+T+P    SS  T S         
Sbjct: 105 QNQDTKGSKTDSEEDDDDSEEEDNKSTSSKDGKGSKKTQPGVSTSSGSTTSGTDLNTKQS 164

Query: 61  KT 62
           +T
Sbjct: 165 QT 166


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 27.3 bits (61), Expect = 6.0
 Identities = 13/46 (28%), Positives = 20/46 (43%), Gaps = 7/46 (15%)

Query: 12  PPPKKDKPKCESTRHEPTKENKPCKETKPKSKKSSEPTQSSQHSPD 57
            P  K KPK       P  + KP K+ + + K+  +P +    SP 
Sbjct: 94  KPEPKPKPK-------PKPKPKPVKKVEEQPKREVKPVEPRPASPF 132


>gnl|CDD|182447 PRK10422, PRK10422, lipopolysaccharide core biosynthesis protein;
           Provisional.
          Length = 352

 Score = 27.4 bits (61), Expect = 6.1
 Identities = 10/32 (31%), Positives = 16/32 (50%), Gaps = 2/32 (6%)

Query: 126 HLIAYLNKRGGQLRLTSIEAPSKQEWNTVEDL 157
            +I  L  RG ++ LTS   P K +   V ++
Sbjct: 206 AVIDALQARGYEVVLTS--GPDKDDLACVNEI 235


>gnl|CDD|226853 COG4445, MiaE, Hydroxylase for synthesis of
           2-methylthio-cis-ribozeatin in tRNA [Nucleotide
           transport and metabolism / Translation, ribosomal
           structure and biogenesis].
          Length = 203

 Score = 27.1 bits (60), Expect = 6.4
 Identities = 26/127 (20%), Positives = 45/127 (35%), Gaps = 23/127 (18%)

Query: 118 KEEIEHMEHLIAYLNKRGGQLRLTSIEAP----------SKQEWNTVEDLLTEALHMEKQ 167
           +EE+ H E ++  L  R   +    I A              E   + D L    ++E +
Sbjct: 69  REELHHFEQVLEILQARN--IPYVPIPASRYAKGLLAAVRTHEPQRLIDKLIVGAYIEAR 126

Query: 168 LNEKLLKLHACASQHGDANLTDFL------EGRYLQEQVDAIKTLADLLTTVRRTQ-LYL 220
             E+   L    + H D  L  F       E R+ Q+ +       D     +R     +
Sbjct: 127 SCERFAAL----APHLDEELAKFYKGLLRSEARHFQDYLVLADQYFDEEDVSQRVDYFGI 182

Query: 221 VDRDLMS 227
           V+ +L+S
Sbjct: 183 VEAELIS 189


>gnl|CDD|218883 pfam06075, DUF936, Plant protein of unknown function (DUF936).
           This family consists of several hypothetical proteins
           from Arabidopsis thaliana and Oryza sativa. The function
           of this family is unknown.
          Length = 564

 Score = 27.9 bits (62), Expect = 6.5
 Identities = 13/61 (21%), Positives = 19/61 (31%)

Query: 2   SSDCKGSSNDPPPKKDKPKCESTRHEPTKENKPCKETKPKSKKSSEPTQSSQHSPDSLGK 61
           S   + SS   P K    +  +            K+T   SK    P + S+    S  K
Sbjct: 245 SPRARSSSAKSPFKSSIQRKATKALSKLSLRASPKDTSKSSKSEVAPPKKSEAKVPSSSK 304

Query: 62  T 62
            
Sbjct: 305 K 305


>gnl|CDD|225701 COG3159, COG3159, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 218

 Score = 27.3 bits (61), Expect = 6.9
 Identities = 22/98 (22%), Positives = 33/98 (33%), Gaps = 14/98 (14%)

Query: 118 KEEIEHMEHLIAYLNKRGGQLRLTSIEAPSKQEWNTVEDLLTEALHMEKQLNEKLLKLHA 177
            E       L+  L     +L        S  E       L    +  ++L E+L  L  
Sbjct: 17  PEFFIQHAELLEEL-----RLPHPVAGTVSLVERQ-----LARLRNRIRELEEELAAL-- 64

Query: 178 CASQHGDANLTDFLEGRYLQEQVDAIKTLADLLTTVRR 215
              ++  AN   F     LQ  +   ++L DLL  V R
Sbjct: 65  --MENARANERLFYRLHALQLDLLDARSLDDLLRRVDR 100


>gnl|CDD|236698 PRK10475, PRK10475, 23S rRNA pseudouridine synthase F; Provisional.
          Length = 290

 Score = 27.4 bits (61), Expect = 7.0
 Identities = 10/49 (20%), Positives = 15/49 (30%)

Query: 8   SSNDPPPKKDKPKCESTRHEPTKENKPCKETKPKSKKSSEPTQSSQHSP 56
           SS   P  K KPK    +    K  K  ++    +      T   +   
Sbjct: 240 SSEAKPKAKAKPKTAGIKRPVVKMEKTAEKGGRPASNGKRFTSPGRKKK 288


>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein.  Rotavirus particles
           consist of three concentric proteinaceous capsid layers.
           The innermost capsid (core) is made of VP2. The genomic
           RNA and the two minor proteins VP1 and VP3 are
           encapsidated within this layer. The N-terminus of
           rotavirus VP2 is necessary for the encapsidation of VP1
           and VP3.
          Length = 887

 Score = 27.6 bits (61), Expect = 7.4
 Identities = 21/75 (28%), Positives = 29/75 (38%), Gaps = 2/75 (2%)

Query: 158 LTEALHMEKQLNEKLLKLHACASQHGDANLTDFLEGRYLQEQVDAIKTLADLLTTVRRTQ 217
           L E +  E Q+ +    L   A      + T FL G   Q   DA KT+   + + R   
Sbjct: 343 LKELVSTEAQIQKMSQDLQLEALTI--QSETQFLTGINSQAANDAFKTIIAAMLSQRTIS 400

Query: 218 LYLVDRDLMSGKFSM 232
           L  V  + MS    M
Sbjct: 401 LDFVTSNYMSLISGM 415


>gnl|CDD|148787 pfam07373, CAMP_factor, CAMP factor (Cfa).  This family consists of
           several bacterial CAMP factor (Cfa) proteins which seem
           to be specific to Streptococcus species. The CAMP
           reaction is a synergistic lysis of erythrocytes by the
           interaction of an extracellular protein (CAMP factor)
           produced by some streptococcal species with the
           Staphylococcus aureus sphingomyelinase C (beta-toxin).
          Length = 228

 Score = 27.0 bits (60), Expect = 7.7
 Identities = 10/44 (22%), Positives = 17/44 (38%)

Query: 125 EHLIAYLNKRGGQLRLTSIEAPSKQEWNTVEDLLTEALHMEKQL 168
           +  +  LN R  QL+            + + DLL  A  ++  L
Sbjct: 18  QQELQKLNARIAQLQDIQKSVKGSDYEDQINDLLKAAFDLKTAL 61


>gnl|CDD|183019 PRK11179, PRK11179, DNA-binding transcriptional regulator AsnC;
           Provisional.
          Length = 153

 Score = 26.6 bits (59), Expect = 8.7
 Identities = 11/31 (35%), Positives = 17/31 (54%), Gaps = 2/31 (6%)

Query: 65  QTLQNFHEDCIDKINTQINDEIRSAYTYISM 95
           +++       I+KI  Q  DEI+S  T IS+
Sbjct: 115 RSIDALQHVLINKI--QTIDEIQSTETLISL 143


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 26.3 bits (58), Expect = 10.0
 Identities = 15/42 (35%), Positives = 20/42 (47%), Gaps = 1/42 (2%)

Query: 8   SSNDPPPKKDKPKCESTRHEP-TKENKPCKETKPKSKKSSEP 48
              DPP  K + K +  R  P  K  K   +T+PK  K S+P
Sbjct: 110 EQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKP 151


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.312    0.128    0.368 

Gapped
Lambda     K      H
   0.267   0.0663    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,386,245
Number of extensions: 1012511
Number of successful extensions: 966
Number of sequences better than 10.0: 1
Number of HSP's gapped: 940
Number of HSP's successfully gapped: 75
Length of query: 239
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 145
Effective length of database: 6,768,326
Effective search space: 981407270
Effective search space used: 981407270
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 57 (25.9 bits)