RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy11975
(786 letters)
>gnl|CDD|188630 cd00408, DHDPS-like, Dihydrodipicolinate synthase family.
Dihydrodipicolinate synthase family. A member of the
class I aldolases, which use an active-site lysine which
stabilizes a reaction intermediate via Schiff base
formation, and have TIM beta/alpha barrel fold. The
dihydrodipicolinate synthase family comprises several
pyruvate-dependent class I aldolases that use the same
catalytic step to catalyze different reactions in
different pathways and includes such proteins as
N-acetylneuraminate lyase, MosA protein,
5-keto-4-deoxy-glucarate dehydratase,
trans-o-hydroxybenzylidenepyruvate hydratase-aldolase,
trans-2'-carboxybenzalpyruvate hydratase-aldolase, and
2-keto-3-deoxy- gluconate aldolase. The family is also
referred to as the N-acetylneuraminate lyase (NAL)
family.
Length = 281
Score = 200 bits (511), Expect = 1e-58
Identities = 76/216 (35%), Positives = 112/216 (51%), Gaps = 11/216 (5%)
Query: 523 HTTTRATIDLTQKAAKAGANAALILCPYYFQKKMTEDLIYEHFISVADNSPIPVIIYNNT 582
+TR I+L + A +AGA+ L++ PYY K +++ I HF +VAD S +PVI+YN
Sbjct: 75 ANSTREAIELARHAEEAGADGVLVVPPYYN--KPSQEGIVAHFKAVADASDLPVILYNIP 132
Query: 583 FVTNIDISVDTLVKLAHHENIRGVKDT--DNIKLANMANQTKDLNFSVFAGSAGYLLSGL 640
T +D+S +T+ +LA H NI G+KD+ D +L + +F+V +G LL L
Sbjct: 133 GRTGVDLSPETIARLAEHPNIVGIKDSSGDLDRLTRLIALLGP-DFAVLSGDDDLLLPAL 191
Query: 641 LVGCAGGINALSAVLGGPICELYDLAKAGKWEEAMKLQHRLVKPDVTVRNVLLMKEMGVP 700
+G G I+ + V LY+ A+AG EEA LQ RL+ L KE
Sbjct: 192 ALGADGAISGAANVAPKLAVALYEAARAGDLEEARALQDRLLPLI-----EALFKEGNPA 246
Query: 701 GVRAAMELYGYYGGRSRRPLPAALKPGGAEKIKQVL 736
V+AA+ L G G R PL L K++ +L
Sbjct: 247 PVKAALALLGLDAGPVRLPLV-PLSEEERAKLEALL 281
Score = 37.9 bits (89), Expect = 0.015
Identities = 9/21 (42%), Positives = 12/21 (57%)
Query: 300 VFSALPTPFREDEEIDFEKFK 320
V AL TPF D E+D + +
Sbjct: 1 VIPALVTPFTADGEVDLDALR 21
>gnl|CDD|223406 COG0329, DapA, Dihydrodipicolinate synthase/N-acetylneuraminate
lyase [Amino acid transport and metabolism / Cell
envelope biogenesis, outer membrane].
Length = 299
Score = 168 bits (429), Expect = 6e-47
Identities = 76/222 (34%), Positives = 117/222 (52%), Gaps = 10/222 (4%)
Query: 524 TTTRATIDLTQKAAKAGANAALILCPYYFQKKMTEDLIYEHFISVADNSPIPVIIYNNTF 583
+T I+L + A K GA+ L++ PYY K +++ +Y HF ++A+ +PVI+YN
Sbjct: 83 NSTAEAIELAKHAEKLGADGILVVPPYYN--KPSQEGLYAHFKAIAEAVDLPVILYNIPS 140
Query: 584 VTNIDISVDTLVKLAHHENIRGVKD-TDNI-KLANMANQTKDLNFSVFAGSAGYLLSGLL 641
T +D+S +T+ +LA H NI GVKD + ++ +L + D +F V +G L LL
Sbjct: 141 RTGVDLSPETIARLAEHPNIVGVKDSSGDLDRLEEIIAALGDRDFIVLSGDDELALPALL 200
Query: 642 VGCAGGINALSAVLGGPICELYDLAKAGKWEEAMKLQHRLVKPDVTVRNVLLMKEMGVPG 701
+G G I+ + V ELY AKAG EEA +LQ RL+ +R LL +E
Sbjct: 201 LGADGVISVTANVAPELAVELYRAAKAGDIEEARELQDRLLP---LIR--LLFREGNPIP 255
Query: 702 VRAAMELYGYY-GGRSRRPLPAALKPGGAEKIKQVLTEAGFL 742
++AA+ L G GG R PL + + ++ L G L
Sbjct: 256 IKAALRLLGLIDGGTVRLPLVPLSEEEREKVLEAALAALGLL 297
Score = 40.7 bits (96), Expect = 0.002
Identities = 11/24 (45%), Positives = 13/24 (54%)
Query: 297 LTGVFSALPTPFREDEEIDFEKFK 320
GV AL TPF ED +D E +
Sbjct: 5 FKGVIPALVTPFDEDGSVDEEALR 28
>gnl|CDD|235107 PRK03170, PRK03170, dihydrodipicolinate synthase; Provisional.
Length = 292
Score = 149 bits (379), Expect = 3e-40
Identities = 74/222 (33%), Positives = 119/222 (53%), Gaps = 15/222 (6%)
Query: 526 TRATIDLTQKAAKAGANAALILCPYYFQKKMTEDLIYEHFISVADNSPIPVIIYNNTFV- 584
T I+LT+ A KAGA+ AL++ PYY K T++ +Y+HF ++A+ + +P+I+YN V
Sbjct: 82 TAEAIELTKFAEKAGADGALVVTPYY--NKPTQEGLYQHFKAIAEATDLPIILYN---VP 136
Query: 585 --TNIDISVDTLVKLAHHENIRGVKD-TDNIKLANMANQTKDLNFSVFAGSAGYLLSGLL 641
T +DI +T+ +LA H NI G+K+ T +++ + + +F+V++G L L
Sbjct: 137 GRTGVDILPETVARLAEHPNIVGIKEATGDLERVSELIELVPDDFAVYSGDDALALPFLA 196
Query: 642 VGCAGGINALSAVLGGPICELYDLAKAGKWEEAMKLQHRLVKPDVTVRNVLLMKEMGVPG 701
+G G I+ + V + E+ D A AG + EA ++ RL+ + L E
Sbjct: 197 LGGVGVISVAANVAPKEMAEMCDAALAGDFAEAREIHRRLLP-----LHKALFIEPNPIP 251
Query: 702 VRAAMELYGYYGGRSRRPLPAALKPGGAEKIKQVLTEAGFLV 743
V+AA+ L G G R PL L EK++ L EAG L
Sbjct: 252 VKAALNLLGLIEGELRLPL-VPLSEEEKEKLRAALKEAGLLA 292
Score = 36.6 bits (86), Expect = 0.042
Identities = 10/24 (41%), Positives = 14/24 (58%)
Query: 297 LTGVFSALPTPFREDEEIDFEKFK 320
G +AL TPF+ED +DF +
Sbjct: 2 FGGSITALVTPFKEDGSVDFAALR 25
>gnl|CDD|188637 cd00950, DHDPS, Dihydrodipicolinate synthase (DHDPS).
Dihydrodipicolinate synthase (DHDPS) is a key enzyme in
lysine biosynthesis. It catalyzes the aldol condensation
of L-aspartate-beta- semialdehyde and pyruvate to
dihydropicolinic acid via a Schiff base formation
between pyruvate and a lysine residue. The functional
enzyme is a homotetramer consisting of a dimer of
dimers. DHDPS is member of dihydrodipicolinate synthase
family that comprises several pyruvate-dependent class I
aldolases that use the same catalytic step to catalyze
different reactions in different pathways.
Length = 284
Score = 145 bits (369), Expect = 6e-39
Identities = 68/218 (31%), Positives = 113/218 (51%), Gaps = 21/218 (9%)
Query: 526 TRATIDLTQKAAKAGANAALILCPYYFQKKMTEDLIYEHFISVADNSPIPVIIYNNTFVT 585
T I+LT++A KAGA+AAL++ PYY K +++ +Y HF ++A+ + +PVI+YN T
Sbjct: 81 TAEAIELTKRAEKAGADAALVVTPYY--NKPSQEGLYAHFKAIAEATDLPVILYNVPGRT 138
Query: 586 NIDISVDTLVKLAHHENIRGVKD-TDNIKLANMANQTKDL------NFSVFAGSAGYLLS 638
++I +T+++LA H NI G+K+ T ++ ++ +L +F+V +G L
Sbjct: 139 GVNIEPETVLRLAEHPNIVGIKEATGDL------DRVSELIALCPDDFAVLSGDDALTLP 192
Query: 639 GLLVGCAGGINALSAVLGGPICELYDLAKAGKWEEAMKLQHRLVKPDVTVRNVLLMKEMG 698
L +G G I+ + V + E+ A AG E+A +L +L+ L E
Sbjct: 193 FLALGGVGVISVAANVAPKLMAEMVRAALAGDLEKARELHRKLLP-----LIKALFAEPN 247
Query: 699 VPGVRAAMELYGYYGGRSRRPLPAALKPGGAEKIKQVL 736
V+AA+ L G G R PL L K++ L
Sbjct: 248 PIPVKAALALLGLISGELRLPLV-PLSEELRAKLRAAL 284
Score = 38.6 bits (91), Expect = 0.009
Identities = 9/24 (37%), Positives = 15/24 (62%)
Query: 297 LTGVFSALPTPFREDEEIDFEKFK 320
G +AL TPF++D +DF+ +
Sbjct: 1 FGGSITALVTPFKDDGSVDFDALE 24
>gnl|CDD|201403 pfam00701, DHDPS, Dihydrodipicolinate synthetase family. This
family has a TIM barrel structure.
Length = 289
Score = 123 bits (310), Expect = 5e-31
Identities = 62/216 (28%), Positives = 104/216 (48%), Gaps = 9/216 (4%)
Query: 526 TRATIDLTQKAAKAGANAALILCPYYFQKKMTEDLIYEHFISVADNSPIPVIIYNNTFVT 585
TR I L Q A AGA+ L + PYY K +++ +Y+HF ++A + +PVI+YN T
Sbjct: 82 TREAIHLAQLAEAAGADGVLAVTPYYN--KPSQEGLYQHFKAIAAATDLPVILYNVPSRT 139
Query: 586 NIDISVDTLVKLAHHENIRGVKDT-DNIKLANMANQTKDLNFSVFAGSAGYLLSGLLVGC 644
D++ +T+ +LA N+ GVKD +++ + +F++ +G LS L +G
Sbjct: 140 GQDLTPETIERLAECPNVVGVKDAVGDLERMENIRKRAGPDFTILSGDDETALSYLSLGA 199
Query: 645 AGGINALSAVLGGPICELYDLAKAGKWEEAMKLQHRLVKPDVTVRNVLLMKEMGVPGVRA 704
G I+ S + + ++Y K G + A L +L+ +L E V+
Sbjct: 200 DGVISVTSNIAPKLMRDIYRALKNGDFATAALLNEKLLP-----LMKILFAEPNPIPVKT 254
Query: 705 AMELYGYYGGRSRRPLPAALKPGGAEKIKQVLTEAG 740
A++L G G R PL L E++ +L G
Sbjct: 255 ALQLLGLDVGPCRLPL-TPLSEEEREELDALLKALG 289
>gnl|CDD|129757 TIGR00674, dapA, dihydrodipicolinate synthase. Dihydrodipicolinate
synthase is a homotetrameric enzyme of lysine
biosynthesis. E. coli has several paralogs closely
related to dihydrodipicoline synthase (DapA), as well as
the more distant N-acetylneuraminate lyase. In
Pyrococcus horikoshii, the bidirectional best hit with
E. coli is to an uncharacterized paralog of DapA, not
DapA itself, and it is omitted from the seed. The
putative members from the Chlamydias (pathogens with a
parasitic metabolism) are easily the most divergent
members of the multiple alignment [Amino acid
biosynthesis, Aspartate family].
Length = 285
Score = 106 bits (266), Expect = 3e-25
Identities = 59/221 (26%), Positives = 104/221 (47%), Gaps = 17/221 (7%)
Query: 524 TTTRATIDLTQKAAKAGANAALILCPYYFQKKMTEDLIYEHFISVADNSPIPVIIYNNTF 583
T I LT+ A GA+ L++ PYY K T++ +Y+HF ++A+ +P+I+YN
Sbjct: 77 NATEEAISLTKFAEDVGADGFLVVTPYY--NKPTQEGLYQHFKAIAEEVDLPIILYNVPS 134
Query: 584 VTNIDISVDTLVKLAHHENIRGVKD-TDNIKLANMANQTKDLNFSVFAGSAGYLLSGLLV 642
T + + +T+ +LA NI +K+ T N++ + +F V +G L + +
Sbjct: 135 RTGVSLYPETVKRLAEEPNIVAIKEATGNLERISEIKAIAPDDFVVLSGDDALTLPMMAL 194
Query: 643 GCAGGINALSAVLGGPICELYDLAKAGKWEEAMKLQHRLVKPDVTVRNVLLMKEMGV--- 699
G G I+ + V + E+ + A G + EA ++ +L+ L K + +
Sbjct: 195 GGKGVISVTANVAPKLMKEMVNNALEGDFAEAREIHQKLMP---------LHKALFIETN 245
Query: 700 P-GVRAAMELYGYYGGRSRRPLPAALKPGGAEKIKQVLTEA 739
P V+ A+ L G G R PL L K++ VL +
Sbjct: 246 PIPVKTALALLGLIEGELRLPL-TELSEEHRNKLRDVLKDL 285
Score = 30.4 bits (69), Expect = 3.1
Identities = 11/26 (42%), Positives = 16/26 (61%)
Query: 299 GVFSALPTPFREDEEIDFEKFKFNVS 324
GV +AL TPF+ED +DF + +
Sbjct: 1 GVITALITPFKEDGSVDFAALEKLID 26
>gnl|CDD|188641 cd00954, NAL, N-Acetylneuraminic acid aldolase, also called
N-acetylneuraminate lyase (NAL). N-Acetylneuraminic
acid aldolase, also called N-acetylneuraminate lyase
(NAL), which catalyses the reversible aldol reaction of
N-acetyl-D-mannosamine and pyruvate to give
N-acetyl-D-neuraminic acid (D-sialic acid). It has a
widespread application as biocatalyst for the synthesis
of sialic acid and its derivatives. This enzyme has been
shown to be quite specific for pyruvate as the donor,
but flexible to a variety of D- and, to some extent,
L-hexoses and pentoses as acceptor substrates. NAL is
member of dihydrodipicolinate synthase family that
comprises several pyruvate-dependent class I aldolases.
Length = 288
Score = 87.8 bits (218), Expect = 6e-19
Identities = 49/220 (22%), Positives = 96/220 (43%), Gaps = 16/220 (7%)
Query: 523 HTTTRATIDLTQKAAKAGANAALILCPYYFQKKMTEDLIYEHFISVADNSP-IPVIIYNN 581
+ + +L + A + G +A + P+Y+ K + + I +++ + + +P+IIY+
Sbjct: 79 SLNLKESQELAKHAEELGYDAISAITPFYY--KFSFEEIKDYYREIIAAAASLPMIIYHI 136
Query: 582 TFVTNIDISVDTLVKLAHHENIRGVKDTD-NIKLANMANQTKDLNFSVFAGSAGYLLSGL 640
+T ++++++ ++L N+ GVK T ++ + V G LLS L
Sbjct: 137 PALTGVNLTLEQFLELFEIPNVIGVKFTATDLYDLERIRAASPEDKLVLNGFDEMLLSAL 196
Query: 641 LVGCAGGINALSAVLGGPICELYDLAKAGKWEEAMKLQHRLVKPDVTVRNVLLMKEMG-V 699
+G G I + V G ++++ AG + A +LQH V + ++ + G
Sbjct: 197 ALGADGAIGSTYNVNGKRYRKIFEAFNAGDIDTARELQH------VINDVITVLIKNGLY 250
Query: 700 PGVRAAMELYGYYGGRSRRPLPAALKPGGAEKIKQVLTEA 739
P ++A + L G G R PL EK E
Sbjct: 251 PTLKAILRLMGLDAGPCRLPLRKV-----TEKALAKAKEL 285
Score = 32.7 bits (75), Expect = 0.72
Identities = 10/20 (50%), Positives = 14/20 (70%)
Query: 297 LTGVFSALPTPFREDEEIDF 316
L G+ +AL TPF E+ EI+
Sbjct: 1 LKGLIAALLTPFDENGEINE 20
>gnl|CDD|179749 PRK04147, PRK04147, N-acetylneuraminate lyase; Provisional.
Length = 293
Score = 87.4 bits (217), Expect = 9e-19
Identities = 56/219 (25%), Positives = 99/219 (45%), Gaps = 16/219 (7%)
Query: 526 TRATIDLTQKAAKAGANAALILCPYYFQKKMTEDLIYEHFISVADNSPIPVIIYNNTFVT 585
T +L + A + G +A + P+Y+ + + I +++ + D++ P+I+YN +T
Sbjct: 85 TAEAQELAKYATELGYDAISAVTPFYY--PFSFEEICDYYREIIDSADNPMIVYNIPALT 142
Query: 586 NIDISVDTLVKLAHHENIRGVKDT--DNIKLANMANQTKDLNFSVFAGSAGYLLSGLLVG 643
+++S+D +L + GVK T D +L + D ++ G SGLL G
Sbjct: 143 GVNLSLDQFNELFTLPKVIGVKQTAGDLYQLERIRKAFPDK--LIYNGFDEMFASGLLAG 200
Query: 644 CAGGINALSAVLGGPICELYDLAKAGKWEEAMKLQHRLVKPDVTVRNVLLMKEMGV-PGV 702
G I + V G ++++ AKAG +EA +LQH + L+ + GV PG+
Sbjct: 201 ADGAIGSTYNVNGWRARQIFEAAKAGDIQEAQELQH------ECNDVIDLLIKNGVYPGL 254
Query: 703 RAAMELYGYYGGRSRRPL-PA--ALKPGGAEKIKQVLTE 738
+ + G G R+P P P + L E
Sbjct: 255 KEILHYMGVDAGLCRKPFKPVDEKYLPALKALAAKYLKE 293
Score = 38.1 bits (89), Expect = 0.014
Identities = 15/34 (44%), Positives = 22/34 (64%), Gaps = 4/34 (11%)
Query: 296 DLTGVFSALPTPFREDEEIDFEKFK----FNVSK 325
+L GV++AL TPF ED +ID + + FN+ K
Sbjct: 3 NLKGVYAALLTPFDEDGQIDEQGLRRLVRFNIEK 36
>gnl|CDD|131366 TIGR02313, HpaI-NOT-DapA, 2,4-dihydroxyhept-2-ene-1,7-dioic acid
aldolase. This model represents a subset of the DapA
(dihydrodipicolinate synthase) family which has
apparently evolved a separate function. The product of
DapA, dihydrodipicolinate, results from the
non-enzymatic cyclization and dehydration of
6-amino-2,4-dihydroxyhept-2-ene-1,7-dioic acid, which is
different from the substrate of this reaction only in
the presence of the amino group. In the absence of this
amino group, and running the reaction in the opposite
direction, the reaction corresponds to the HpaI aldolase
component of the 4-hydroxyphenylacetic acid catabolism
pathway (see TIGR02311). At present, this variant of
DapA is found only in Oceanobacillus iheyensis HTE831
and Thermus thermophilus HB27. In both of these cases,
one or more other DapA genes can be found and the one
identified by this model is part of an operon for
4-hydroxyphenylacetic acid catabolism.
Length = 294
Score = 78.5 bits (193), Expect = 9e-16
Identities = 66/227 (29%), Positives = 105/227 (46%), Gaps = 31/227 (13%)
Query: 529 TIDLTQKAAKAGANAALILCPYYFQKKMTEDLIYEHFISVADNSP-IPVIIYNNTFVTNI 587
T++LT+ A +AGA+AA+++ PYY K ++ +Y+HF VAD P P+IIYN
Sbjct: 84 TLELTKFAEEAGADAAMVIVPYY--NKPNQEALYDHFAEVADAVPDFPIIIYNIPGRAAQ 141
Query: 588 DISVDTLVKLAHHENIRGVKDTDNIKLANMANQTKDL------------NFSVFAGSAGY 635
+I+ T+ +L KD NI A +N KD +F +F G
Sbjct: 142 EIAPKTMARL--------RKDCPNIVGAKESN--KDFEHLNHLFLEAGRDFLLFCGIELL 191
Query: 636 LLSGLLVGCAGGINALSAVLGGPICELYDLAKAGKWEEAMKLQHRLVKPDVTVRNVLLMK 695
L L +G AG I A + V + EL + A+AG + A L L++ N + K
Sbjct: 192 CLPMLAIGAAGSIAATANVEPKEVAELCEAAEAGDIKGAQDLHFELLEA-----NDAIFK 246
Query: 696 EMGVPGVRAAMELYGYYGGRSRRPLPAALKPGGAEKIKQVLTEAGFL 742
+ ++AA+ + G R PL E+I+ + + G +
Sbjct: 247 DTNPAPLKAALGMMGLIEKELRPPL-GLPSDALEEEIRDMAEKYGKI 292
>gnl|CDD|99896 cd05835, Dnmt3b_related, The PWWP domain is an essential component
of DNA methyltransferase 3 B (Dnmt3b) which is
responsible for establishing DNA methylation patterns
during embryogenesis and gametogenesis. In
tumorigenesis, DNA methylation by Dnmt3b is known to
play a role in the inactivation of tumor suppressor
genes. In addition, a point mutation in the PWWP domain
of Dnmt3b has been identified in patients with ICF
syndrome (immunodeficiency, centromeric instability, and
facial anomalies), a rare autosomal recessive disorder
characterized by hypomethylation of classical satellite
DNA. The PWWP domain, named for a conserved
Pro-Trp-Trp-Pro motif, is a small domain consisting of
100-150 amino acids. The PWWP domain is found in
numerous proteins that are involved in cell division,
growth and differentiation. Most PWWP-domain proteins
seem to be nuclear, often DNA-binding, proteins that
function as transcription factors regulating a variety
of developmental processes.
Length = 87
Score = 70.9 bits (174), Expect = 5e-15
Identities = 30/76 (39%), Positives = 39/76 (51%), Gaps = 8/76 (10%)
Query: 358 AEGDLVWGSVKGYPSWPGKLIS------PAPTQGRVWVKWFGMSNEPLSEVEPATLKSLS 411
GDLVWG +KG+P WPG+++S P G WV WFG SEV L S
Sbjct: 2 NVGDLVWGKIKGFPWWPGRVVSITVTSKRPPVVGMRWVTWFGSGTF--SEVSVDKLSPFS 59
Query: 412 QGLEAHHRSRKKLQGL 427
+ +A R +K +GL
Sbjct: 60 EFFKAFSRYNRKKKGL 75
>gnl|CDD|188640 cd00953, KDG_aldolase, KDG (2-keto-3-deoxygluconate) aldolases
found in archaea. KDG (2-keto-3-deoxygluconate)
aldolases found in archaea. This subfamily of enzymes is
adapted for high thermostability and shows specificity
for non-phosphorylated substrates. The enzyme catalyses
the reversible aldol cleavage of 2-keto-3-dexoygluconate
to pyruvate and glyceraldehyde, the third step of a
modified non-phosphorylated Entner-Doudoroff pathway of
glucose oxidation. KDG aldolase shows no significant
sequence similarity to microbial
2-keto-3-deoxyphosphogluconate (KDPG) aldolases, and the
enzyme shows no activity with glyceraldehyde 3-phosphate
as substrate. The enzyme is a tetramer and a member of
the DHDPS family of Schiff-base-dependent class I
aldolases.
Length = 279
Score = 65.5 bits (160), Expect = 2e-11
Identities = 47/152 (30%), Positives = 71/152 (46%), Gaps = 11/152 (7%)
Query: 530 IDLTQKAAKAGANAALILCPYYFQKKMTEDLIYEHFISVADNSPIPVIIYNNTFVTNIDI 589
I+L + A G A L PYYF + E+ + ++F ++ SP P IYN T DI
Sbjct: 81 IELARAAKSFGIYAIASLPPYYF-PGIPEEWLIKYFTDIS--SPYPTFIYNYPKATGYDI 137
Query: 590 SVDTLVKL-AHHENIRGVKDTDNIKLANMANQTKDL--NFSVFAGSAGYLLSGLLVGCAG 646
+ ++ +I GVKDT N +++M K L +F V++G + S L G G
Sbjct: 138 NARMAKEIKKAGGDIIGVKDT-NEDISHMLE-YKRLVPDFKVYSGPDSLIFSALRSGLDG 195
Query: 647 GINALSAVLGGPICELYDLAKAGKWEEAMKLQ 678
+ A S L ++ D E+A KLQ
Sbjct: 196 SVAAASNYLPEVFVKIKDHVAI---EDAFKLQ 224
>gnl|CDD|188072 TIGR00683, nanA, N-acetylneuraminate lyase. N-acetylneuraminate
lyase is also known as N-acetylneuraminic acid aldolase,
sialic acid aldolase, or sialate lyase. It is an
intracellular enzyme. The structure of this
homotetrameric enzyme related to dihydrodipicolinate
synthase is known. In Clostridium tertium, the enzyme
appears to be in an operon with a secreted sialidase
that releases sialic acid from host
sialoglycoconjugates. In several E. coli strains,
however, this enzyme is responsible for
N-acetyl-D-neuraminic acid synthesis for capsule
production by condensing N-acetyl-D-mannosamine and
pyruvate [Cell envelope, Biosynthesis and degradation of
surface polysaccharides and lipopolysaccharides, Central
intermediary metabolism, Amino sugars].
Length = 290
Score = 63.4 bits (154), Expect = 9e-11
Identities = 46/220 (20%), Positives = 100/220 (45%), Gaps = 14/220 (6%)
Query: 525 TTRATIDLTQKAAKAGANAALILCPYYFQKKMTEDLIYEHFISVADNS-PIPVIIYNNTF 583
+ ++L + A + G + + P+Y++ E I ++ ++ + + +I+Y+ F
Sbjct: 81 NLKEAVELGKYATELGYDCLSAVTPFYYKFSFPE--IKHYYDTIIAETGGLNMIVYSIPF 138
Query: 584 VTNIDISVDTLVKLAHHENIRGVKDT--DNIKLANMANQTKDLNFSVFAGSAGYLLSGLL 641
+T +++ ++ +L + + GVK T D L + + ++AG +L
Sbjct: 139 LTGVNMGIEQFGELYKNPKVLGVKFTAGDFYLLERLKKAYPNH--LIWAGFDEMMLPAAS 196
Query: 642 VGCAGGINALSAVLGGPICELYDLAKAGKWEEAMKLQHRLVKPDVTVRNVLLMKEMGV-P 700
+G G I + V G ++++L KAGK +EA+++QH VT + + G+
Sbjct: 197 LGVDGAIGSTFNVNGVRARQIFELTKAGKLQEALEIQH------VTNDLIEGILANGLYL 250
Query: 701 GVRAAMELYGYYGGRSRRPLPAALKPGGAEKIKQVLTEAG 740
++ ++L G G R P+ + P K K + +
Sbjct: 251 TIKELLKLEGVDAGYCREPMTSKATPEQKAKAKDLKAKFL 290
>gnl|CDD|178038 PLN02417, PLN02417, dihydrodipicolinate synthase.
Length = 280
Score = 63.1 bits (154), Expect = 1e-10
Identities = 34/83 (40%), Positives = 44/83 (53%), Gaps = 4/83 (4%)
Query: 526 TRATIDLTQKAAKAGANAALILCPYYFQKKMTEDLIYEHFISVADNSPIPVIIYNNTFVT 585
TR I T++ G +AAL + PYY K E LI +HF +V D P IIYN T
Sbjct: 82 TREAIHATEQGFAVGMHAALHINPYY-GKTSQEGLI-KHFETVLDMGP--TIIYNVPGRT 137
Query: 586 NIDISVDTLVKLAHHENIRGVKD 608
DI + + K+A H N GVK+
Sbjct: 138 GQDIPPEVIFKIAQHPNFAGVKE 160
>gnl|CDD|188638 cd00951, KDGDH, 5-dehydro-4-deoxyglucarate dehydratase, also called
5-keto-4-deoxy-glucarate dehydratase (KDGDH).
5-dehydro-4-deoxyglucarate dehydratase, also called
5-keto-4-deoxy-glucarate dehydratase (KDGDH), which is
member of dihydrodipicolinate synthase (DHDPS) family
that comprises several pyruvate-dependent class I
aldolases. The enzyme is involved in glucarate
metabolism, and its mechanism presumbly involves a
Schiff-base intermediate similar to members of DHDPS
family. While in the case of Pseudomonas sp.
5-dehydro-4-deoxy-D-glucarate is degraded by KDGDH to
2,5-dioxopentanoate, in certain species of
Enterobacteriaceae it is degraded instead to pyruvate
and glycerate.
Length = 289
Score = 58.1 bits (141), Expect = 4e-09
Identities = 57/205 (27%), Positives = 84/205 (40%), Gaps = 22/205 (10%)
Query: 526 TRATIDLTQKAAKAGANAALILCPYYFQKKMTE---DLIYEHFISVADNSPIPVIIYNNT 582
T I Q A KAGA+ L+L PY +TE + +Y H +V ++ + VI+YN
Sbjct: 80 TATAIAYAQAAEKAGADGILLLPPY-----LTEAPQEGLYAHVEAVCKSTDLGVIVYNR- 133
Query: 583 FVTNIDISVDTLVKLAHH-ENIRGVKD-TDNIKLANMANQTKDL--NFSVFAGSAG---Y 635
N ++ D+L +LA N+ G KD +I+L M L G +
Sbjct: 134 --ANAVLTADSLARLAERCPNLVGFKDGVGDIEL--MRRIVAKLGDRLLYLGGLPTAEVF 189
Query: 636 LLSGLLVGCAGGINALSAVLGGPICELYDLAKAGKWEEAMKLQHRLVKPDVTVRNVLLMK 695
L+ L +G +A+ + Y +AG +L P V +RN K
Sbjct: 190 ALAYLAMGVPTYSSAVFNFVPEIALAFYAAVRAGDHATVKRLLRDFFLPYVDIRN--RRK 247
Query: 696 EMGVPGVRAAMELYGYYGGRSRRPL 720
V V+A L G G R PL
Sbjct: 248 GYAVSIVKAGARLVGRDAGPVRPPL 272
>gnl|CDD|216154 pfam00855, PWWP, PWWP domain. The PWWP domain is named after a
conserved Pro-Trp-Trp-Pro motif. The domain binds to
Histone-4 methylated at lysine-20, H4K20me, suggesting
that it is methyl-lysine recognition motif. Removal of
two conserved aromatic residues in a hydrophobic cavity
created by this domain within the full-lemgth protein,
Pdp1, abolishes the interaction o f the protein with
H4K20me3. In fission yeast, Set9 is the sole enzyme that
catalyzes all three states of H4K20me, and Set9-mediated
H4K20me is required for efficient recruitment of
checkpoint protein Crb2 to sites of DNA damage. The
methylation of H4K20 is involved in a diverse array of
cellular processes, such as organising higher-order
chromatin, maintaining genome stability, and regulating
cell-cycle progression.
Length = 74
Score = 52.8 bits (127), Expect = 8e-09
Identities = 26/69 (37%), Positives = 36/69 (52%), Gaps = 8/69 (11%)
Query: 360 GDLVWGSVKGYPSWPGKLISPAPT------QGRVWVKWFGMSNEPLSEVEPATLKSLSQG 413
GDLVW +KG+P WP +++S T RV V++FG + V P LK L
Sbjct: 5 GDLVWAKMKGFPWWPARIVSQKETPTSIRKDNRVKVQFFGDHK--TAWVSPKRLKPLDVD 62
Query: 414 LEAHHRSRK 422
+E H+ RK
Sbjct: 63 IEKFHKDRK 71
>gnl|CDD|99897 cd05836, N_Pac_NP60, The PWWP domain is an essential part of the
cytokine-like nuclear factor n-pac protein, or NP60,
which enhances the activity of MAP2K4 and MAP2K6 kinases
to phosphorylate p38-alpha. In a variety of cell lines,
NP60 has been shown to localize to the nucleus. In
addition to the PWWP domain, NP60 also contains an
AT-hook and a C-terminal NAD-binding domain. The PWWP
domain, named for a conserved Pro-Trp-Trp-Pro motif, is
a small domain consisting of 100-150 amino acids. The
PWWP domain is found in numerous proteins that are
involved in cell division, growth and differentiation.
Most PWWP-domain proteins seem to be nuclear, often
DNA-binding proteins, that function as transcription
factors regulating a variety of developmental processes.
Length = 86
Score = 47.1 bits (112), Expect = 1e-06
Identities = 18/51 (35%), Positives = 28/51 (54%), Gaps = 7/51 (13%)
Query: 360 GDLVWGSVKGYPSWPGKLISPAPT----QGRV---WVKWFGMSNEPLSEVE 403
GDLVW +KG+P WPG+++ P +G+ +V +FG N + E
Sbjct: 4 GDLVWAKMKGFPPWPGRIVKPPKDLKKPRGKAKCFFVFFFGSENHAWIKEE 54
>gnl|CDD|99894 cd05162, PWWP, The PWWP domain, named for a conserved
Pro-Trp-Trp-Pro motif, is a small domain consisting of
100-150 amino acids. The PWWP domain is found in
numerous proteins that are involved in cell division,
growth and differentiation. Most PWWP-domain proteins
seem to be nuclear, often DNA-binding, proteins that
function as transcription factors regulating a variety
of developmental processes. The function of the PWWP
domain is still not known precisely; however, based on
the fact that other regions of PWWP-domain proteins are
responsible for nuclear localization and DNA-binding, is
likely that the PWWP domain acts as a site for
protein-protein binding interactions, influencing
chromatin remodeling and thereby regulating
transcriptional processes. Some PWWP-domain proteins
have been linked to cancer or other diseases; some are
known to function as growth factors.
Length = 87
Score = 46.2 bits (110), Expect = 2e-06
Identities = 17/43 (39%), Positives = 23/43 (53%), Gaps = 9/43 (20%)
Query: 360 GDLVWGSVKGYPSWPGKLISPA---------PTQGRVWVKWFG 393
GDLVW +KGYP WP ++ P +G+V V +FG
Sbjct: 4 GDLVWAKMKGYPWWPALVVDPPKDSKKAKKKAKEGKVLVLFFG 46
>gnl|CDD|235141 PRK03620, PRK03620, 5-dehydro-4-deoxyglucarate dehydratase;
Provisional.
Length = 303
Score = 49.8 bits (120), Expect = 2e-06
Identities = 29/87 (33%), Positives = 45/87 (51%), Gaps = 12/87 (13%)
Query: 526 TRATIDLTQKAAKAGANAALILCPYYFQKKMTE---DLIYEHFISVADNSPIPVIIYNNT 582
T I+ Q A +AGA+ L+L PY +TE + + H +V ++ + VI+YN
Sbjct: 87 TAQAIEYAQAAERAGADGILLLPPY-----LTEAPQEGLAAHVEAVCKSTDLGVIVYNR- 140
Query: 583 FVTNIDISVDTLVKLAH-HENIRGVKD 608
N ++ DTL +LA N+ G KD
Sbjct: 141 --DNAVLTADTLARLAERCPNLVGFKD 165
>gnl|CDD|99895 cd05834, HDGF_related, The PWWP domain is an essential part of the
Hepatoma Derived Growth Factor (HDGF) family of
proteins, and is necessary for DNA binding by HDGF. This
family of endogenous nuclear-targeted mitogens includes
HRP (HDGF-related proteins 1, 2, 3, 4, or HPR1, HPR2,
HPR3, HPR4, respectively) and lens epithelium-derived
growth factor, LEDGF. Members of the HDGF family have
been linked to human diseases, and HDGF is a prognostic
factor in several types of cancer. The PWWP domain,
named for a conserved Pro-Trp-Trp-Pro motif, is a small
domain consisting of 100-150 amino acids. The PWWP
domain is found in numerous proteins that are involved
in cell division, growth and differentiation. Most
PWWP-domain proteins seem to be nuclear, often
DNA-binding, proteins that function as transcription
factors regulating a variety of developmental processes.
Length = 83
Score = 44.6 bits (106), Expect = 7e-06
Identities = 16/37 (43%), Positives = 22/37 (59%), Gaps = 3/37 (8%)
Query: 360 GDLVWGSVKGYPSWPGKLISP---APTQGRVWVKWFG 393
GDLV+ VKGYP+WP ++ P P + V +FG
Sbjct: 6 GDLVFAKVKGYPAWPARVDEPEDWKPPGKKYPVYFFG 42
>gnl|CDD|214603 smart00293, PWWP, domain with conserved PWWP motif. conservation
of Pro-Trp-Trp-Pro residues.
Length = 63
Score = 43.5 bits (103), Expect = 8e-06
Identities = 17/44 (38%), Positives = 21/44 (47%), Gaps = 10/44 (22%)
Query: 360 GDLVWGSVKGYPSWPGKLISPAPTQ----------GRVWVKWFG 393
GDLVW +KG+P WP +ISP T V +FG
Sbjct: 4 GDLVWAKMKGFPWWPALVISPKMTPDNIMKRKSDENLYPVLFFG 47
>gnl|CDD|227709 COG5422, ROM1, RhoGEF, Guanine nucleotide exchange factor for
Rho/Rac/Cdc42-like GTPases [Signal transduction
mechanisms].
Length = 1175
Score = 48.0 bits (114), Expect = 2e-05
Identities = 50/254 (19%), Positives = 79/254 (31%), Gaps = 48/254 (18%)
Query: 19 SLSSNLYTSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSNPSGTSSVYTTISSSATNSVYT 78
+L +++S SS T S S ++ +S S S + +
Sbjct: 67 ALGHQIFSSFSSSPKLFQRRNSAGPITHSPSATSSTSSLNSNDGD-----QFSPASDSLS 121
Query: 79 ISSTAPSSVYTSSPPDTTTLSPIEAGGAPSPCHSLEVPPSPLLPDDNHPSPTSTSVHLVS 138
+ ++ S S P D SP++ +PLLP ST
Sbjct: 122 FNPSSTQSRKDSGPGDG---SPVQKRK------------NPLLPSS------STHGTHPP 160
Query: 139 SSNTDWSDVNPDSDGFVTPKPSGVPPSSSTTTFVKRLFSPPTPG--SSKDFVESSSSPPT 196
TD + + + + K PS + + +L SP SS D S P
Sbjct: 161 IVFTDNNGSHAGAPNARSRKEI---PSLGSQSM--QLPSPHFRQKFSSSDTSNGFSYPSI 215
Query: 197 HRSYGRSHHHSHHHRSHSHHHHQSQSKHH---HSKPLSRTMFGPVSRLCLKVTSRTLSIQ 253
+ S H S+ S H K H L + P S ++
Sbjct: 216 RK---NSRHSSNSMPSFPHSSTAVLLKRHSGSSGASLISSNITPSS------SNSEAMST 266
Query: 254 DKEEPNV---LLSR 264
+ P + LLSR
Sbjct: 267 SSKRPYIYPALLSR 280
Score = 32.6 bits (74), Expect = 1.0
Identities = 19/87 (21%), Positives = 32/87 (36%), Gaps = 4/87 (4%)
Query: 5 SPITIYVSQTPSTSS----LSSNLYTSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSNPSG 60
+P + PS S L S + S + ++ S S SN S P
Sbjct: 173 APNARSRKEIPSLGSQSMQLPSPHFRQKFSSSDTSNGFSYPSIRKNSRHSSNSMPSFPHS 232
Query: 61 TSSVYTTISSSATNSVYTISSTAPSSV 87
+++V S ++ + S+ PSS
Sbjct: 233 STAVLLKRHSGSSGASLISSNITPSSS 259
>gnl|CDD|132293 TIGR03249, KdgD, 5-dehydro-4-deoxyglucarate dehydratase.
5-dehydro-4-deoxyglucarate dehydratase not only
catalyzes the dehydration of the substrate (diol to
ketone + water), but causes the decarboxylation of the
intermediate product to yield 2-oxoglutarate
semialdehyde (2,5-dioxopentanoate). The gene for the
enzyme is usually observed in the vicinity of
transporters and dehydratases handling D-galactarate and
D-gluconate as well as aldehyde dehydrogenases which
convert the product to alpha-ketoglutarate.
Length = 296
Score = 42.4 bits (100), Expect = 6e-04
Identities = 27/84 (32%), Positives = 46/84 (54%), Gaps = 6/84 (7%)
Query: 526 TRATIDLTQKAAKAGANAALILCPYYFQKKMTEDLIYEHFISVADNSPIPVIIYNNTFVT 585
T I++ + A KAGA+ L+L PY + ++ +Y H +V +++ + VI+Y
Sbjct: 85 TSDAIEIARLAEKAGADGYLLLPPYLINGE--QEGLYAHVEAVCESTDLGVIVYQR---D 139
Query: 586 NIDISVDTLVKLAHH-ENIRGVKD 608
N ++ DTL +LA N+ G KD
Sbjct: 140 NAVLNADTLERLADRCPNLVGFKD 163
>gnl|CDD|99903 cd06080, MUM1_like, Mutated melanoma-associated antigen 1 (MUM-1)
is a melanoma-associated antigen (MAA). MUM-1 belongs
to the mutated or aberrantly expressed type of MAAs,
along with antigens such as CDK4, beta-catenin,
gp100-in4, p15, and N-acetylglucosaminyltransferase V.
It is highly expressed in several types of human
cancers. The PWWP domain, named for a conserved
Pro-Trp-Trp-Pro motif, is a small domain consisting of
100-150 amino acids. The PWWP domain is found in
numerous proteins that are involved in cell division,
growth and differentiation. Most PWWP-domain proteins
seem to be nuclear, often DNA-binding, proteins that
function as transcription factors regulating a variety
of developmental processes.
Length = 80
Score = 38.2 bits (89), Expect = 0.001
Identities = 14/30 (46%), Positives = 17/30 (56%), Gaps = 2/30 (6%)
Query: 360 GDLVWGSVKGYPSWPG--KLISPAPTQGRV 387
DLVW ++GYP WP K IS + RV
Sbjct: 4 NDLVWAKIQGYPWWPAVIKSISRKKQKARV 33
>gnl|CDD|99898 cd05837, MSH6_like, The PWWP domain is present in MSH6, a mismatch
repair protein homologous to bacterial MutS. The PWWP
domain of histone-lysine N-methyltransferase, also known
as Nuclear SET domain-containing protein 3, is also
included. Mutations in MSH6 have been linked to
increased cancer susceptibility, particularly in
hereditary nonpolyposis colorectal cancer in humans.
The role of the PWWP domain in MSH6 is not clear; MSH6
orthologs found in S. cerevisiae, Caenorhabditis elegans
and Arabidopsis thaliana lack the PWWP domain. Histone
methyltransferases (HMTases) induce the
posttranslational methylation of lysine residues in
histones and play a role in apoptosis. In the HMTase
Whistle, the PWWP domain is necessary for HMTase
activity. The PWWP domain, named for a conserved
Pro-Trp-Trp-Pro motif, is a small domain consisting of
100-150 amino acids. The PWWP domain is found in
numerous proteins that are involved in cell division,
growth and differentiation. Most PWWP-domain proteins
seem to be nuclear, often DNA-binding, proteins that
function as transcription factors regulating a variety
of developmental processes.
Length = 110
Score = 37.4 bits (87), Expect = 0.003
Identities = 13/26 (50%), Positives = 15/26 (57%), Gaps = 1/26 (3%)
Query: 360 GDLVWGSVKGYPSWPGKLISPAPTQG 385
GDLVW V GYP WP ++ P G
Sbjct: 6 GDLVWAKVSGYPWWPC-MVCSDPLLG 30
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 40.2 bits (93), Expect = 0.005
Identities = 44/192 (22%), Positives = 68/192 (35%), Gaps = 9/192 (4%)
Query: 15 PSTSSLSSNLYTSTSSGNVFTSSSI-GQSNFTPSSSGS-NVYTSNPSGTSSVYTTISSSA 72
P+T SL T S TS + G ++ T S T++ + ++ T ++
Sbjct: 450 PTTPSLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSATPNATSPTPAVTTP 509
Query: 73 TNSVYTISSTAPSSVYTS-----SPPDTTTLSPIEAGGAPSPCHSLEVPPSPLLPDDNHP 127
+ T T+ + TS TT SP + S +V + + N P
Sbjct: 510 NATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTNTP 569
Query: 128 SPTSTSVHLVSSSNTDWSDV--NPDSDGFVTPKPSGVPPSSSTTTFVKRLFSPPTPGSSK 185
TS L S+ T +P S P S P S++T+ L S G
Sbjct: 570 VVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGEN 629
Query: 186 DFVESSSSPPTH 197
E+ S P T
Sbjct: 630 ITEETPSVPSTT 641
Score = 37.5 bits (86), Expect = 0.035
Identities = 40/203 (19%), Positives = 70/203 (34%), Gaps = 17/203 (8%)
Query: 11 VSQTPSTSSLSSNLYTST-----SSGNVFTSSSIGQSNFTPSSSGSNVYTSNPSGTSSVY 65
+ P T+ +YT + V T+ S+ ++ P+ S ++ + P+GT+S
Sbjct: 422 FHKAPDTTKSVIFVYTLVHVEPHKTTAVPTTPSLPPASTGPTVSTADPTSGTPTGTTSST 481
Query: 66 TTISSSATNSVYTISSTAPSSVYTSSPPDTTTL-----SPIEAGGAPSPCHSLEVP--PS 118
+S T+ + + A S + P+ T+ S +P+P S
Sbjct: 482 LPEDTSPTSRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATS 541
Query: 119 PLLPDDNHPSPTSTSVHLVSSSN--TDWSDVNPDSDGFVTPKPSGVPPSSSTTTFVKRLF 176
P + P+ TS V S N + S SS T +
Sbjct: 542 PPTGTTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPT---SQQP 598
Query: 177 SPPTPGSSKDFVESSSSPPTHRS 199
P+ S S+S+ P S
Sbjct: 599 GIPSSSHSTPRSNSTSTTPLLTS 621
Score = 37.1 bits (85), Expect = 0.042
Identities = 32/186 (17%), Positives = 61/186 (32%), Gaps = 15/186 (8%)
Query: 5 SPITIYVSQTPSTSSLSSNLYTSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSNPSGTSSV 64
+ TP+ +S ++ TS TS + T +++ T++ +S
Sbjct: 499 ATSPTPAVTTPNATSPTT---QKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNATSP 555
Query: 65 YTTISSSATNSVYTISSTAPSSVYTSSPPDTTTLSPIEAGGAPSPCHSLEVPPSPLLP-- 122
T S N+ T T+ SV TS+ + + G SP PS
Sbjct: 556 QVT-EESPVNNTNTPVVTSAPSVLTSAV------TTGQHGTGSSPTSQQPGIPSSSHSTP 608
Query: 123 ---DDNHPSPTSTSVHLVSSSNTDWSDVNPDSDGFVTPKPSGVPPSSSTTTFVKRLFSPP 179
+ +++ + T+ + P + T P P ++S + +
Sbjct: 609 RSNSTSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPGPGTTSQVSGPGNSSTSR 668
Query: 180 TPGSSK 185
PG
Sbjct: 669 YPGEVH 674
Score = 33.2 bits (75), Expect = 0.58
Identities = 55/246 (22%), Positives = 83/246 (33%), Gaps = 39/246 (15%)
Query: 5 SPITIYVSQTPSTSSLSSNLYT--STSSGNVFTSSSIGQSNFTPSSSGSN-VYTSNPSGT 61
SP + S TP+ +S + + T +TS TS + ++ TP G TS P+GT
Sbjct: 487 SPTSRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGT 546
Query: 62 SSVYTTIS-----SSATNSVYTISSTAPSSVYTSSPPDTTTLSPIEAGGAPSPCHSLEVP 116
+SV S S N+ T T+ SV TS+ + + G SP
Sbjct: 547 TSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAV------TTGQHGTGSSPTSQQPGI 600
Query: 117 PSPLLP----DDNHPSPTSTSVHLV-----------SSSNTDWSDVNPDSDGFVTPKPSG 161
PS + +P TS H S T S ++P T + SG
Sbjct: 601 PSSSHSTPRSNSTSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPGPGTTSQVSG 660
Query: 162 VPPSSSTT----------TFVKRLFSPPTPGSSKDFVESSSSPPTHRSYGRSHHHSHHHR 211
SS++ SP P K V + +S +
Sbjct: 661 PGNSSTSRYPGEVHVTEGMPNPNATSPSAPSGQKTAVPTVTSTGGKANSTTKETSGSTLM 720
Query: 212 SHSHHH 217
+ + H
Sbjct: 721 ASTSPH 726
Score = 30.9 bits (69), Expect = 2.9
Identities = 30/132 (22%), Positives = 49/132 (37%), Gaps = 8/132 (6%)
Query: 3 QASPITIYVSQTPSTSSLSSNLYTSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSN-PSGT 61
+ P T +VS T S + GN TS G+ + T N + + PSG
Sbjct: 635 PSVPSTTHVS-TLSPGPGPGTTSQVSGPGNSSTSRYPGEVHVTEGMPNPNATSPSAPSGQ 693
Query: 62 SSVYTTISSSATNSVYTISSTAPSSVYTSSPPDTTTLSPIEAGGAPSPCHSLEVPPSPLL 121
+ T++S+ + T T+ S++ S+ P T G + ++ P
Sbjct: 694 KTAVPTVTSTGGKANSTTKETSGSTLMASTSPHT------NEGAFRTTPYNATTYLPPST 747
Query: 122 PDDNHPSPTSTS 133
P T TS
Sbjct: 748 SSKLRPRWTFTS 759
>gnl|CDD|236641 PRK10019, PRK10019, nickel/cobalt efflux protein RcnA; Provisional.
Length = 279
Score = 39.4 bits (92), Expect = 0.005
Identities = 10/27 (37%), Positives = 10/27 (37%)
Query: 201 GRSHHHSHHHRSHSHHHHQSQSKHHHS 227
HH H H H HH HHH
Sbjct: 119 NMHHHDHDHDHDHDHEHHHDHGHHHHH 145
Score = 38.6 bits (90), Expect = 0.008
Identities = 10/38 (26%), Positives = 11/38 (28%), Gaps = 4/38 (10%)
Query: 192 SSPPTHRSYGRSHHHSHHHRSHSHHHHQSQSKHHHSKP 229
+ H H H H HH H HHH
Sbjct: 113 ERNWLENMHHHDHDHDHDHDHEHHHDHG----HHHHHE 146
Score = 38.2 bits (89), Expect = 0.010
Identities = 10/35 (28%), Positives = 11/35 (31%)
Query: 197 HRSYGRSHHHSHHHRSHSHHHHQSQSKHHHSKPLS 231
H H H H HHH HHH +
Sbjct: 116 WLENMHHHDHDHDHDHDHEHHHDHGHHHHHEHGAT 150
Score = 32.5 bits (74), Expect = 0.73
Identities = 9/40 (22%), Positives = 18/40 (45%)
Query: 184 SKDFVESSSSPPTHRSYGRSHHHSHHHRSHSHHHHQSQSK 223
++++E+ + H H H H H HH H + ++
Sbjct: 113 ERNWLENMHHHDHDHDHDHDHEHHHDHGHHHHHEHGATAE 152
Score = 29.0 bits (65), Expect = 9.6
Identities = 6/29 (20%), Positives = 10/29 (34%)
Query: 198 RSYGRSHHHSHHHRSHSHHHHQSQSKHHH 226
R++ + + H H H HH
Sbjct: 108 RTWRGERNWLENMHHHDHDHDHDHDHEHH 136
>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
Length = 421
Score = 39.7 bits (92), Expect = 0.006
Identities = 26/120 (21%), Positives = 47/120 (39%), Gaps = 2/120 (1%)
Query: 14 TPSTSSLSSNLYTSTSSGNVFTSSSIGQSNFTPSS-SGSNVYTSNPSGTSSVYTTISSSA 72
T ++++ + + SS +S++I PSS + + T+ S + + S
Sbjct: 298 TVPVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSD 357
Query: 73 TNSVYTISSTAPSSVYTSSPPDTTTLSPIEAGGAPSPCHSLEVPPSPLLPDDNHP-SPTS 131
T++ A V P +TT + + G + + P P N P SPTS
Sbjct: 358 VTLPGTVALPAAEPVNMQPQPMSTTETQQSSTGNITSTANGPTTSLPAAPASNIPVSPTS 417
>gnl|CDD|188639 cd00952, CHBPH_aldolase, Trans-o-hydroxybenzylidenepyruvate
hydratase-aldolase (HBPHA) and
trans-2'-carboxybenzalpyruvate hydratase-aldolase
(CBPHA). Trans-o-hydroxybenzylidenepyruvate
hydratase-aldolase (HBPHA) and
trans-2'-carboxybenzalpyruvate hydratase-aldolase
(CBPHA). HBPHA catalyzes HBP to salicyaldehyde and
pyruvate. This reaction is part of the degradative
pathways for naphthalene and naphthalenesulfonates by
bacteria. CBPHA is homologous to HBPHA and catalyzes the
cleavage of CBP to 2-carboxylbenzaldehyde and pyruvate
during the degradation of phenanthrene. They are member
of the DHDPS family of Schiff-base-dependent class I
aldolases.
Length = 309
Score = 39.0 bits (91), Expect = 0.007
Identities = 47/221 (21%), Positives = 72/221 (32%), Gaps = 24/221 (10%)
Query: 526 TRATIDLTQKAAKAGANAALILCPYYFQKKMTEDLIYEHFISVADNSP-IPVIIYNNTFV 584
TR TI T+ GA+ ++ P + + D + + VA+ P + + IY N
Sbjct: 89 TRDTIARTRALLDLGADGTMLGRPMWL--PLDVDTAVQFYRDVAEAVPEMAIAIYANPEA 146
Query: 585 TNIDISVDTLVKLAHHENIRGVKDTDNI-KLANMANQTKD------LNFSVFAGSAGYLL 637
D +LA + K +I L + K L +A A L
Sbjct: 147 FKFDFPRAAWAELAQIPQVVAAKYLGDIGALLSDLAAVKGRMRLLPLEDDYYA--AARLF 204
Query: 638 SGLLVGCAGGINALSAVLG-GPICELYDLAKAGKWEEAMKLQHRLVKPDVTVRNVLLMKE 696
+ A G P+ L D G W +A L R+ + E
Sbjct: 205 PEEVTAFWSS----GAACGPAPVTALRDAVATGDWTDARALTDRMRWAAEPLFPRGDFSE 260
Query: 697 ---MGVPGVRAAMELYGYY-GGRSRRP---LPAALKPGGAE 730
+ +A + GY G +R P P A G E
Sbjct: 261 FSKYNIALEKARFDAAGYMRAGPARPPYNTAPEAYLEGARE 301
>gnl|CDD|177648 PHA03420, PHA03420, E4 protein; Provisional.
Length = 137
Score = 36.9 bits (85), Expect = 0.011
Identities = 8/38 (21%), Positives = 13/38 (34%), Gaps = 2/38 (5%)
Query: 191 SSSPPTHRSYGRSHHHSHHHRSHSHHHHQSQSKHHHSK 228
HR++ + + H H H HQ H +
Sbjct: 34 QREGGNHRTWDPADYLDRPH--HHPHRHQQDDHHLQDR 69
Score = 35.0 bits (80), Expect = 0.048
Identities = 11/49 (22%), Positives = 14/49 (28%), Gaps = 2/49 (4%)
Query: 189 ESSSSPPTHRSYGRSHHHSHHHRSHSHHHHQSQSKHHHSKPLSRTMFGP 237
+ P + R HHH H H+ HH Q P
Sbjct: 40 HRTWDPADYLD--RPHHHPHRHQQDDHHLQDRQHLPQQHLQRPHHPLSP 86
Score = 30.7 bits (69), Expect = 1.3
Identities = 8/38 (21%), Positives = 10/38 (26%)
Query: 192 SSPPTHRSYGRSHHHSHHHRSHSHHHHQSQSKHHHSKP 229
P H + H R H H + H S
Sbjct: 50 DRPHHHPHRHQQDDHHLQDRQHLPQQHLQRPHHPLSPQ 87
>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
inhibitor.
Length = 670
Score = 38.1 bits (88), Expect = 0.016
Identities = 36/118 (30%), Positives = 52/118 (44%), Gaps = 9/118 (7%)
Query: 53 VYTSNPSGTSSVYTTISSSATNSVYTISSTAPSSVYTSSPPDTTTLSPIEAGGAPSPCHS 112
++ NP T+S T + SA +S T SS +PS+V S T P G+P S
Sbjct: 561 LFAGNPGSTNS---TPTGSAASSNTTFSSDSPSTVVAPS-----TSPPAGHLGSPPATPS 612
Query: 113 LEVPPSPLLPDDNHPSPTSTSVHLVSSSNTDWSD-VNPDSDGFVTPKPSGVPPSSSTT 169
V PS P + SP++T SS ++ +P+S V S V S +T
Sbjct: 613 KIVSPSTSPPASHLGSPSTTPSSPESSIKVASTETASPESSIKVASTESSVSMVSMST 670
Score = 33.5 bits (76), Expect = 0.54
Identities = 31/121 (25%), Positives = 45/121 (37%), Gaps = 16/121 (13%)
Query: 24 LYTSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSNPSGTSSVYTTISSSATNSVYTISSTA 83
L+ T + S+ T SS S PS + S AT S ST+
Sbjct: 561 LFAGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSPSTS 620
Query: 84 PSSVYTSSPPDTTT--LSPIEAGGAP--SPCHSLEVPPSPLLPDDNHPSPTSTSVHLVSS 139
P + + SP T + S I+ SP S++V + T +SV +VS
Sbjct: 621 PPASHLGSPSTTPSSPESSIKVASTETASPESSIKV------------ASTESSVSMVSM 668
Query: 140 S 140
S
Sbjct: 669 S 669
Score = 33.1 bits (75), Expect = 0.69
Identities = 26/84 (30%), Positives = 36/84 (42%), Gaps = 1/84 (1%)
Query: 18 SSLSSNLYTSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSNPSGTSSVYTTISSSATNSVY 77
S +S S +S N SS + PS+S + +P T S + S+S S
Sbjct: 567 GSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSPSTSPPASHL 626
Query: 78 TISSTAPSSVYTSSPPDTT-TLSP 100
ST PSS +S +T T SP
Sbjct: 627 GSPSTTPSSPESSIKVASTETASP 650
Score = 32.4 bits (73), Expect = 0.99
Identities = 24/93 (25%), Positives = 42/93 (45%), Gaps = 2/93 (2%)
Query: 12 SQTPSTSSLSSN-LYTSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSNPSGTSSVYTTISS 70
+ TP+ S+ SSN ++S S V S+ + S + +PS TS + + S
Sbjct: 570 NSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSPS-TSPPASHLGS 628
Query: 71 SATNSVYTISSTAPSSVYTSSPPDTTTLSPIEA 103
+T SS +S T+SP + ++ E+
Sbjct: 629 PSTTPSSPESSIKVASTETASPESSIKVASTES 661
Score = 31.6 bits (71), Expect = 1.8
Identities = 27/102 (26%), Positives = 40/102 (39%), Gaps = 8/102 (7%)
Query: 48 SSGSNVYTSNPSGTSSVYTTISSSATNSVYTISSTAPSSVYTSSPPDTTTLSPIEAGGAP 107
+ GS T S SS T S S + V ST+P + + SPP T S I +
Sbjct: 565 NPGSTNSTPTGSAASSNTTFSSDSPSTVV--APSTSPPAGHLGSPPATP--SKIVSPSTS 620
Query: 108 SPCHSLEVPP----SPLLPDDNHPSPTSTSVHLVSSSNTDWS 145
P L P SP + T++ + ++T+ S
Sbjct: 621 PPASHLGSPSTTPSSPESSIKVASTETASPESSIKVASTESS 662
Score = 31.2 bits (70), Expect = 2.6
Identities = 31/95 (32%), Positives = 38/95 (40%), Gaps = 15/95 (15%)
Query: 115 VPPSPLLPDDNHPSPTSTSVHLVSSSNTDWSDVNPDSDGFVTPKPS------GVPP---- 164
VP P L N S ST +SSNT +S +P + V P S G PP
Sbjct: 555 VPYIPGLFAGNPGSTNSTPTGSAASSNTTFSSDSPST--VVAPSTSPPAGHLGSPPATPS 612
Query: 165 ---SSSTTTFVKRLFSPPTPGSSKDFVESSSSPPT 196
S ST+ L SP T SS + +S T
Sbjct: 613 KIVSPSTSPPASHLGSPSTTPSSPESSIKVASTET 647
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 37.4 bits (87), Expect = 0.021
Identities = 19/135 (14%), Positives = 28/135 (20%), Gaps = 10/135 (7%)
Query: 88 YTSSPPDTTTLSPIEAGGAPSPCHSLEVPPSPLLPDDNHPSPTSTSV-HLVSSSNTDWSD 146
PP + S P + P +S V S S
Sbjct: 148 PNPGPPLDEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSP 207
Query: 147 VNPDSDGFVTPKPSGVPPSSSTTTFVKRLFSPPTPGSSKDFVESSSSPPTHRSYGRSHHH 206
+ + S PP S PT S + PP + S
Sbjct: 208 SDSSLPPAPSSFQSDTPPPS---------PESPTNPSPPPGPAAPPPPPVQQVPPLSTAK 258
Query: 207 SHHHRSHSHHHHQSQ 221
+ +
Sbjct: 259 PTPPSASATPAPIGG 273
>gnl|CDD|236558 PRK09545, znuA, high-affinity zinc transporter periplasmic
component; Reviewed.
Length = 311
Score = 36.9 bits (86), Expect = 0.029
Identities = 8/25 (32%), Positives = 11/25 (44%)
Query: 202 RSHHHSHHHRSHSHHHHQSQSKHHH 226
+ H HH H H H+ + HH
Sbjct: 115 KGAHDDHHDDDHDHAGHEKSDEDHH 139
Score = 35.8 bits (83), Expect = 0.065
Identities = 7/22 (31%), Positives = 10/22 (45%)
Query: 197 HRSYGRSHHHSHHHRSHSHHHH 218
+ H H+ H +S HHH
Sbjct: 119 DDHHDDDHDHAGHEKSDEDHHH 140
Score = 33.4 bits (77), Expect = 0.43
Identities = 6/26 (23%), Positives = 8/26 (30%)
Query: 201 GRSHHHSHHHRSHSHHHHQSQSKHHH 226
G H H+ H + HH
Sbjct: 116 GAHDDHHDDDHDHAGHEKSDEDHHHG 141
>gnl|CDD|237323 PRK13263, ureE, urease accessory protein UreE; Provisional.
Length = 206
Score = 36.3 bits (84), Expect = 0.031
Identities = 13/33 (39%), Positives = 14/33 (42%), Gaps = 3/33 (9%)
Query: 197 HRSYGRSHHHSHHHRSHSH---HHHQSQSKHHH 226
H +G SH HS H H H H H H H
Sbjct: 169 HEHHGHSHSHSDHDHDHDHDHDHVHGPGCGHGH 201
Score = 34.8 bits (80), Expect = 0.11
Identities = 10/26 (38%), Positives = 11/26 (42%)
Query: 201 GRSHHHSHHHRSHSHHHHQSQSKHHH 226
+ H HH SHSH H H H
Sbjct: 164 AQQVFHEHHGHSHSHSDHDHDHDHDH 189
Score = 33.2 bits (76), Expect = 0.36
Identities = 10/27 (37%), Positives = 11/27 (40%), Gaps = 1/27 (3%)
Query: 200 YGRSHHHSHHHRSHSHHHHQSQSKHHH 226
+ H HSH H S H H H H
Sbjct: 168 FHEHHGHSHSH-SDHDHDHDHDHDHVH 193
Score = 29.4 bits (66), Expect = 5.8
Identities = 9/30 (30%), Positives = 10/30 (33%)
Query: 197 HRSYGRSHHHSHHHRSHSHHHHQSQSKHHH 226
S+ S H H H H H HH
Sbjct: 173 GHSHSHSDHDHDHDHDHDHVHGPGCGHGHH 202
>gnl|CDD|227430 COG5099, COG5099, RNA-binding protein of the Puf family,
translational repressor [Translation, ribosomal
structure and biogenesis].
Length = 777
Score = 36.6 bits (85), Expect = 0.054
Identities = 46/185 (24%), Positives = 65/185 (35%), Gaps = 19/185 (10%)
Query: 7 ITIYVSQTPSTSSLSSNLYTSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSNPSGTSSVYT 66
IT S S S +S++SG+ + S+F PS+S N SN + +S+
Sbjct: 70 ITSSSSSRRKPSGSWSVAISSSTSGSQSLLMELPSSSFNPSTSSRN--KSNSALSSTQQG 127
Query: 67 TISSSATNSVYTISSTAPSSVYTSSPPD-----TTTLSPIEAGGAPSPCHSLEVPPSPLL 121
+SS T S T SS S+ P+ TT S P+ S + +
Sbjct: 128 NANSSVTLSSSTASSMFNSNKLPLPNPNHSNSATTNQSGSSFINTPASSSSQPLTNLVV- 186
Query: 122 PDDNHPSPTSTSVHLVSSSNTDWSDVNPDSDGFVT---PKPSGVPPS--SSTTTFVKRLF 176
S +L S S ++P SD PS PP S F
Sbjct: 187 ------SSIKRFPYLTSLSPFFNYLIDPSSDSATASADTSPSFNPPPNLSPNNLFSTSDL 240
Query: 177 SPPTP 181
SP
Sbjct: 241 SPLPD 245
>gnl|CDD|218163 pfam04592, SelP_N, Selenoprotein P, N terminal region. SelP is the
only known eukaryotic selenoprotein that contains
multiple selenocysteine (Sec) residues, and accounts for
more than 50% of the selenium content of rat and human
plasma. It is thought to be glycosylated. SelP may have
antioxidant properties. It can attach to epithelial
cells, and may protect vascular endothelial cells
against peroxynitrite toxicity. The high selenium
content of SelP suggests that it may be involved in
selenium intercellular transport or storage. The
promoter structure of bovine SelP suggest that it may be
involved in countering heavy metal intoxication, and may
also have a developmental function. The N-terminal
region of SelP can exist independently of the C terminal
region. Zebrafish selenoprotein Pb lacks the C terminal
Sec-rich region, and a protein encoded by the rat SelP
gene and lacking this region has also been reported.
N-terminal region contains a conserved SecxxCys motif,
which is similar to the CysxxCys found in thioredoxins.
It is speculated that the N terminal region may adopt a
thioredoxin fold and catalyze redox reactions. The
N-terminal region also contains a His-rich region, which
is thought to mediate heparin binding. Binding to
heparan proteoglycans could account for the membrane
binding properties of SelP. The function of the
bacterial members of this family is uncharcterised.
Length = 238
Score = 35.6 bits (82), Expect = 0.074
Identities = 18/53 (33%), Positives = 21/53 (39%), Gaps = 1/53 (1%)
Query: 189 ESSSSPPTHRSYGRSHHHSHHHRSHSHHHHQSQSKHHHSKPLSRTMFGPVSRL 241
E P H S+ H+HHH H HHH S S +P P S L
Sbjct: 180 EPRQDHPHHHSHHEHQGHAHHHP-HGHHHPGSNSHSESQQPDPDKPTEPPSGL 231
>gnl|CDD|113452 pfam04684, BAF1_ABF1, BAF1 / ABF1 chromatin reorganising factor.
ABF1 is a sequence-specific DNA binding protein involved
in transcription activation, gene silencing and
initiation of DNA replication. ABF1 is known to remodel
chromatin, and it is proposed that it mediates its
effects on transcription and gene expression by
modifying local chromatin architecture. These functions
require a conserved stretch of 20 amino acids in the
C-terminal region of ABF1 (amino acids 639 to 662 S.
cerevisiae). The N-terminal two thirds of the protein
are necessary for DNA binding, and the N-terminus (amino
acids 9 to 91 in S. cerevisiae) is thought to contain a
novel zinc-finger motif which may stabilise the protein
structure.
Length = 508
Score = 36.3 bits (83), Expect = 0.074
Identities = 15/54 (27%), Positives = 21/54 (38%), Gaps = 7/54 (12%)
Query: 178 PPTPGSSKDFVESSSSPPTHRSYGRSHHHSHHHRSHSHHHHQSQSKHHHSKPLS 231
G ++D SP G H H H H+HHHH + H + L+
Sbjct: 80 ANNSGQNED-----GSP--GGGSGDEGGHHHGHNMHAHHHHHHHNGHTNGHGLA 126
Score = 31.2 bits (70), Expect = 2.4
Identities = 14/36 (38%), Positives = 16/36 (44%), Gaps = 3/36 (8%)
Query: 194 PPTHRSYGRSHH---HSHHHRSHSHHHHQSQSKHHH 226
P H+ G HH H HH +HH HQ H H
Sbjct: 281 PNAHQISGADHHQHQHHHHPSIPAHHQHQLPEGHQH 316
>gnl|CDD|219500 pfam07655, Secretin_N_2, Secretin N-terminal domain. This is a
short domain found in bacterial type II/III secretory
system proteins. The architecture of these proteins
suggest that this family may be functionally analogous
to pfam03958.
Length = 95
Score = 33.1 bits (76), Expect = 0.091
Identities = 23/71 (32%), Positives = 37/71 (52%), Gaps = 3/71 (4%)
Query: 8 TIYVSQTPSTSSLSSNLYTSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSNPSGTSSVYTT 67
T V+ S SSN TS +SG+V +SS S+ + +SS +S+ SG SS T
Sbjct: 3 TFPVNYLNMKRSGSSN--TSVTSGSV-SSSGSNSSSSSSNSSNGGSSSSSSSGDSSSGTR 59
Query: 68 ISSSATNSVYT 78
I++ + + +
Sbjct: 60 ITTESESDFWG 70
>gnl|CDD|237322 PRK13261, ureE, urease accessory protein UreE; Provisional.
Length = 159
Score = 34.2 bits (79), Expect = 0.097
Identities = 9/23 (39%), Positives = 10/23 (43%), Gaps = 2/23 (8%)
Query: 195 PTHRSYGRSHHHSHHHRSHSHHH 217
P +YG HH SH H H
Sbjct: 139 PERGAYG--GAFRHHGHSHDHSH 159
Score = 33.0 bits (76), Expect = 0.30
Identities = 5/18 (27%), Positives = 8/18 (44%)
Query: 201 GRSHHHSHHHRSHSHHHH 218
++ + H HSH H
Sbjct: 141 RGAYGGAFRHHGHSHDHS 158
>gnl|CDD|227270 COG4934, COG4934, Predicted protease [Posttranslational modification,
protein turnover, chaperones].
Length = 1174
Score = 35.9 bits (83), Expect = 0.099
Identities = 22/82 (26%), Positives = 30/82 (36%), Gaps = 5/82 (6%)
Query: 13 QTPSTSSLSSNLYTSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSNPSGTSSVYTTISSSA 72
S +S SNL S SS V ++ S + N YT N + IS+S
Sbjct: 1034 NITSVNSALSNLIVSLSSTTVPIIKNVLPSLVYGEYNIINSYTGN--DFGVITIVISNSP 1091
Query: 73 ---TNSVYTISSTAPSSVYTSS 91
S + +S Y SS
Sbjct: 1092 SGSYPSTLYNTDQTQTSSYISS 1113
Score = 30.9 bits (70), Expect = 3.3
Identities = 19/89 (21%), Positives = 26/89 (29%), Gaps = 3/89 (3%)
Query: 6 PITIYVSQTPSTSSLSSNLYTSTSSGNVFTSSSIGQSNF-TPSSSGSNVY--TSNPSGTS 62
P IY T + ++ SG V S+ T +G + N
Sbjct: 927 PPGIYTINTSIPGLDPYSSLINSKSGTVSNLQIYFLSSVPTSGLTGKSSDGGIKNFVIDV 986
Query: 63 SVYTTISSSATNSVYTISSTAPSSVYTSS 91
V T S+ N A S T S
Sbjct: 987 LVNTNGISAINNGTGNYYVIASVSNGTIS 1015
>gnl|CDD|99900 cd05839, BR140_related, The PWWP domain is found in the BR140
family, which includes peregrin and BR140-like proteins
1 and 2. BR140 is the only family to contain the PWWP
domain at the C terminus, with PHD and bromo domains in
the N-terminal region. In myeloid leukemias, BR140 is
disrupted by chromosomal translocations, similar to
translocations of WHSC1 in lymphoid multiple myeloma.
The PWWP domain, named for a conserved Pro-Trp-Trp-Pro
motif, is a small domain consisting of 100-150 amino
acids. The PWWP domain is found in numerous proteins
that are involved in cell division, growth and
differentiation. Most PWWP-domain proteins seem to be
nuclear, often DNA-binding proteins, that function as
transcription factors regulating a variety of
developmental processes.
Length = 111
Score = 33.2 bits (76), Expect = 0.12
Identities = 11/24 (45%), Positives = 14/24 (58%)
Query: 357 LAEGDLVWGSVKGYPSWPGKLISP 380
L LVW +GYPS+P +I P
Sbjct: 1 LEPLTLVWAKCRGYPSYPALIIDP 24
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 35.3 bits (81), Expect = 0.15
Identities = 19/67 (28%), Positives = 24/67 (35%), Gaps = 2/67 (2%)
Query: 165 SSSTTTFVKRLFSPPTPGSSKDFVESSSSPPTHRSYGRSHHHSHHHRSHSHHHHQSQSKH 224
SS T K L G + +SS+S Y HH +H HH +
Sbjct: 220 SSKGLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGKDHHHSHNHQHHSIGINN-- 277
Query: 225 HHSKPLS 231
HHSK
Sbjct: 278 HHSKHAD 284
>gnl|CDD|99902 cd05841, BS69_related, The PWWP domain is part of BS69 protein, a
nuclear protein that specifically binds adenoviral E1A
and Epstein-Barr viral EBNA2 proteins, suppressing their
transactivation functions. BS69 is a multi-domain
protein, containing bromo, PHD, PWWP, and MYND domains.
The specific role of the PWWP domain within BS69 is not
clearly identified, but BS69 functions in chromatin
remodeling, consistent with other PWWP-containing
proteins. The PWWP domain, named for a conserved
Pro-Trp-Trp-Pro motif, is a small domain consisting of
100-150 amino acids. The PWWP domain is found in
numerous proteins that are involved in cell division,
growth and differentiation. Most PWWP-domain proteins
seem to be nuclear, often DNA-binding, proteins that
function as transcription factors regulating a variety
of developmental processes.
Length = 83
Score = 31.6 bits (72), Expect = 0.23
Identities = 14/32 (43%), Positives = 20/32 (62%), Gaps = 2/32 (6%)
Query: 362 LVWGSVKGYPSWPGKLISPAPTQGRVWVKWFG 393
LVW +KG+P WP K++ Q V V++FG
Sbjct: 12 LVWAKLKGFPYWPAKVMRVEDNQ--VDVRFFG 41
>gnl|CDD|113196 pfam04415, DUF515, Protein of unknown function (DUF515). Family of
hypothetical Archaeal proteins.
Length = 416
Score = 34.1 bits (78), Expect = 0.25
Identities = 24/96 (25%), Positives = 43/96 (44%), Gaps = 3/96 (3%)
Query: 7 ITIYVSQT-PSTSSLSSNLYTSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSNPSGTSSVY 65
+ +Y+ Q + + Y SS +V S S QS T SSS ++ S+ + S
Sbjct: 252 VKVYLKQNGTIFYEIVDSGYVILSSISVSESQS--QSTSTSSSSSTSSSESSSTSYSPGD 309
Query: 66 TTISSSATNSVYTISSTAPSSVYTSSPPDTTTLSPI 101
+I +S + + + +S + S +SS + L I
Sbjct: 310 ASIQNSQRSQLQSSTSQSESESASSSYSYSVNLPEI 345
>gnl|CDD|220924 pfam10986, DUF2796, Protein of unknown function (DUF2796). This
bacterial family of proteins has no known function.
Length = 161
Score = 33.0 bits (76), Expect = 0.27
Identities = 15/52 (28%), Positives = 18/52 (34%), Gaps = 6/52 (11%)
Query: 175 LFS-PPTPGSSKDFVESSSSPPTHRSYGRSHHHSHHHRSHSHHHHQSQSKHH 225
LF+ PP G + V SS H H H H HH + H
Sbjct: 60 LFALPPAAGCTLTEVSLESSLFGD-----HDHDHHDHDHHDHHDDHEHAGHS 106
Score = 32.3 bits (74), Expect = 0.48
Identities = 9/32 (28%), Positives = 10/32 (31%)
Query: 196 THRSYGRSHHHSHHHRSHSHHHHQSQSKHHHS 227
+ S H HH H HH HS
Sbjct: 75 SLESSLFGDHDHDHHDHDHHDHHDDHEHAGHS 106
Score = 29.2 bits (66), Expect = 5.8
Identities = 9/26 (34%), Positives = 10/26 (38%)
Query: 203 SHHHSHHHRSHSHHHHQSQSKHHHSK 228
S H H H H HH H H+
Sbjct: 79 SLFGDHDHDHHDHDHHDHHDDHEHAG 104
Score = 28.8 bits (65), Expect = 7.8
Identities = 7/23 (30%), Positives = 7/23 (30%)
Query: 204 HHHSHHHRSHSHHHHQSQSKHHH 226
H H HH H H H
Sbjct: 89 HDHDHHDHHDDHEHAGHSDFHAE 111
>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
glycoprotein G.
Length = 408
Score = 33.9 bits (77), Expect = 0.29
Identities = 28/145 (19%), Positives = 54/145 (37%), Gaps = 15/145 (10%)
Query: 13 QTPSTSSLSSNLYTSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSNPSGTSSVYTTISSSA 72
QT + + S+ + G +S+ Q+ PS G + +P T+S + ++
Sbjct: 205 QTATANQRLSSTEPVGTQGTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQS--TTG 262
Query: 73 TNSVYTISSTAPSS-------VYTSSPPDTTTLSPIEAGGAPSPCHSLEVPPSPLLPDDN 125
+T P + T++PP TT + G P+P + P
Sbjct: 263 DGQEHTQRRKTPPATSNRRSPHSTATPPPTT---KRQETGRPTPRPTATTQSGSSPP--- 316
Query: 126 HPSPTSTSVHLVSSSNTDWSDVNPD 150
H SP + + + D +++P
Sbjct: 317 HSSPPGVQANPTTQNLVDCKELDPP 341
>gnl|CDD|215173 PLN02304, PLN02304, probable pectinesterase.
Length = 379
Score = 34.1 bits (78), Expect = 0.30
Identities = 13/26 (50%), Positives = 14/26 (53%), Gaps = 4/26 (15%)
Query: 203 SHHHSHHHRSHSHHHHQSQSKHHHSK 228
S+ HS HH SH HHHH KH
Sbjct: 32 SYPHSIHHHSHHHHHH----KHPDDG 53
Score = 31.0 bits (70), Expect = 2.4
Identities = 11/49 (22%), Positives = 15/49 (30%)
Query: 175 LFSPPTPGSSKDFVESSSSPPTHRSYGRSHHHSHHHRSHSHHHHQSQSK 223
+ P +E S + + H HHH H HHH
Sbjct: 5 FKTSPAAPLLGHLIELSLISFSSKIATSYPHSIHHHSHHHHHHKHPDDG 53
Score = 30.2 bits (68), Expect = 4.0
Identities = 11/43 (25%), Positives = 15/43 (34%), Gaps = 5/43 (11%)
Query: 196 THRSYGRSHHHSHHHRSHSHHHHQSQSKHHHSKPLSRTMFGPV 238
+ S+ S+ H H H HH HHH K +
Sbjct: 21 SLISFSSKIATSYPHSIHHHSHH-----HHHHKHPDDGKKVSI 58
>gnl|CDD|221931 pfam13136, DUF3984, Protein of unknown function (DUF3984). This
family of proteins is functionally uncharacterized. This
family of proteins is found in eukaryotes. Proteins in
this family are typically between 393 and 442 amino
acids in length.
Length = 301
Score = 33.5 bits (77), Expect = 0.32
Identities = 21/116 (18%), Positives = 36/116 (31%), Gaps = 13/116 (11%)
Query: 116 PPSPLLP--DDNHPSPTSTSVHLVSSSNTDWSDVNPDSDGFVT--PKPSGVPPSSSTTTF 171
P + P DD+ S + H ++S V P + G ++ PS +
Sbjct: 16 PLTSRFPLDDDDEERDYSYAPHPPTTSYLSSKSV-PTTPGILSHSRSPSRSRLHKRKKSS 74
Query: 172 VKRLFSPPTPGSSKDFVESSSSPPTHRSYGRSHHHSHHHRSHSHHHHQSQSKHHHS 227
+ S S +SS+ H+ + H + S S H
Sbjct: 75 RRSPMSDTLLKS-----KSSAHLLHHQ---STRSHRRSKSGTTSPRKPSSSAHRRR 122
Score = 30.1 bits (68), Expect = 4.3
Identities = 21/91 (23%), Positives = 30/91 (32%), Gaps = 13/91 (14%)
Query: 152 DGFVTPKPSGVPPSSSTTTFVKRLFSPPTPGSSKDFVESSSSPPTHRSYGRSHHHSHHHR 211
D + P TT+++ P TPG + S SP R + R
Sbjct: 24 DDDDEERDYSYAPHPPTTSYLSSKSVPTTPG----ILSHSRSPSRSRLHKRKKSSRRSPM 79
Query: 212 SHSH---------HHHQSQSKHHHSKPLSRT 233
S + HHQS H SK + +
Sbjct: 80 SDTLLKSKSSAHLLHHQSTRSHRRSKSGTTS 110
>gnl|CDD|107088 PHA01818, PHA01818, hypothetical protein.
Length = 458
Score = 33.5 bits (76), Expect = 0.40
Identities = 27/94 (28%), Positives = 44/94 (46%), Gaps = 6/94 (6%)
Query: 9 IYVSQTPSTSSLSSNLYTSTSSGNVFTSSSIGQSNFT---PSSSGSNVYTSNPSGTSSVY 65
+++S TP+TSSL + G T S +G SN+T S S ++T S S+
Sbjct: 296 LFMSDTPNTSSLHGVYNAIGTDGRNVTGSVVG-SNWTSPKTSPSHKELWTGAQSFLSTGT 354
Query: 66 T-TISSSATNSVYT-ISSTAPSSVYTSSPPDTTT 97
T +S +N Y + +T ++ T +T T
Sbjct: 355 TENLSDDISNYSYVEVYTTHKTTEKTKGNDNTGT 388
>gnl|CDD|173789 cd04059, Peptidases_S8_Protein_convertases_Kexins_Furin-like,
Peptidase S8 family domain in Protein convertases.
Protein convertases, whose members include furins and
kexins, are members of the peptidase S8 or Subtilase
clan of proteases. They have an Asp/His/Ser catalytic
triad that is not homologous to trypsin. Kexins are
involved in the activation of peptide hormones, growth
factors, and viral proteins. Furin cleaves cell surface
vasoactive peptides and proteins involved in
cardiovascular tissue remodeling in the TGN, at cell
surface, or in endosomes but rarely in the ER. Furin
also plays a key role in blood pressure regulation
though the activation of transforming growth factor
(TGF)-beta. High specificity is seen for cleavage after
dibasic (Lys-Arg or Arg-Arg) or multiple basic residues
in protein convertases. There is also strong sequence
conservation.
Length = 297
Score = 33.3 bits (77), Expect = 0.44
Identities = 20/66 (30%), Positives = 28/66 (42%), Gaps = 9/66 (13%)
Query: 4 ASPITIYVS------QTPSTSSLSSNLYTSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSN 57
S TI VS S S + S++ S SG S + S T G+ TS+
Sbjct: 202 NSIYTISVSAVTANGVRASYSEVGSSVLASAPSGG---SGNPEASIVTTDLGGNCNCTSS 258
Query: 58 PSGTSS 63
+GTS+
Sbjct: 259 HNGTSA 264
Score = 30.2 bits (69), Expect = 4.4
Identities = 19/74 (25%), Positives = 32/74 (43%), Gaps = 10/74 (13%)
Query: 21 SSNLYTSTSSGNVFTSSSIGQSNFTP--SSSGSNVYTSNPSG-----TSSVYTT---ISS 70
+ N +S + S++ + S GS+V S PSG +S+ TT +
Sbjct: 194 NCNCDGYNNSIYTISVSAVTANGVRASYSEVGSSVLASAPSGGSGNPEASIVTTDLGGNC 253
Query: 71 SATNSVYTISSTAP 84
+ T+S S+ AP
Sbjct: 254 NCTSSHNGTSAAAP 267
>gnl|CDD|114268 pfam05537, DUF759, Borrelia burgdorferi protein of unknown function
(DUF759). This family consists of several
uncharacterized proteins from the Lyme disease
spirochete Borrelia burgdorferi.
Length = 439
Score = 33.1 bits (75), Expect = 0.61
Identities = 21/80 (26%), Positives = 35/80 (43%), Gaps = 7/80 (8%)
Query: 608 DTDNIKLANMANQTKDLNFSVFAGSAGYLLSGLLVGCAGGINALSAVLGGPICELYDLAK 667
D ++ + N N + FA A + SG + + A++ +LGG E +DL K
Sbjct: 226 DIRDLDMLNDKEGKNLKNATEFA--AMFKSSGAMSDNEDAVKAVNKILGGDGSEAFDLLK 283
Query: 668 -----AGKWEEAMKLQHRLV 682
K+ EA K+ L+
Sbjct: 284 KIDGFGDKYIEAAKMALELL 303
>gnl|CDD|220401 pfam09786, CytochromB561_N, Cytochrome B561, N terminal. Members
of this family are found in the N terminal region of
cytochrome B561, as well as in various other putative
uncharacterized proteins.
Length = 559
Score = 32.8 bits (75), Expect = 0.69
Identities = 28/98 (28%), Positives = 41/98 (41%), Gaps = 7/98 (7%)
Query: 4 ASPITIYVSQTPSTSSLSSNLYTSTSSGNVFTSSS-IGQSNFTPSSSGSNVYTS---NPS 59
++P+ P SS S + S S F+ SS I QS S+ +S +PS
Sbjct: 126 STPMNTSEPLVPGHSSFSDSPSRSASPSRKFSPSSTIQQSPQLTPSNKPASPSSSYQSPS 185
Query: 60 GTSSVYTTISSSATNSVYTISSTAPSSVYTSSPPDTTT 97
+SS+ ++SS S S A S + D TT
Sbjct: 186 YSSSL-GPVNSSGNRSNLRSSPWALRS--SGDKKDITT 220
Score = 31.7 bits (72), Expect = 1.9
Identities = 25/111 (22%), Positives = 40/111 (36%), Gaps = 5/111 (4%)
Query: 36 SSSIGQSNFTPSSSGSNVYTSNPSGTSSVYTTISSSATNSVYTISSTAPSSVYTSSPPDT 95
S S S S TS P TS SS ++S +S + +S+ +
Sbjct: 108 SQFTVVSQAKKSPPASK--TSTPMNTSEPLVPGHSSFSDSPSRSASPSRKFSPSSTIQQS 165
Query: 96 TTLSPIEAGGAPSPCHSLEVPPSPLLPDDNHP---SPTSTSVHLVSSSNTD 143
L+P +PS + S L P ++ + S+ L SS +
Sbjct: 166 PQLTPSNKPASPSSSYQSPSYSSSLGPVNSSGNRSNLRSSPWALRSSGDKK 216
Score = 30.5 bits (69), Expect = 3.6
Identities = 21/96 (21%), Positives = 36/96 (37%), Gaps = 6/96 (6%)
Query: 14 TPSTSSLSSNLYTSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSNPSGTSSVYTTISSSAT 73
S + S +++ N G S+F+ S S S + + + S S T
Sbjct: 112 VVSQAKKSPPASKTSTPMNTSEPLVPGHSSFSDSPSRSA--SPSRKFSPSSTIQQSPQLT 169
Query: 74 NSVYTISSTAPSSVYTSSPPDTTTLSPIEAGGAPSP 109
S + A S SP +++L P+ + G S
Sbjct: 170 PS----NKPASPSSSYQSPSYSSSLGPVNSSGNRSN 201
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 33.4 bits (76), Expect = 0.70
Identities = 27/158 (17%), Positives = 47/158 (29%), Gaps = 12/158 (7%)
Query: 45 TPSSSGSNVYTSNPSGTSSVYTTISSSATNSVYTISSTA---PSSVYTSSPPDTTTLSPI 101
T + + + P+ T + A+ S S + P+ + L P
Sbjct: 2762 TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPA 2821
Query: 102 EAGGAPSPCHSLEVPPSPLLPDDNHPSPTSTSVHLVSSSNTDWSDVNPDSDGFVTPKPSG 161
+ P P + P +P P P P S + + D P +
Sbjct: 2822 ASPAGPLPPPTSAQPTAPPPPPG--PPPPSLPLGGSVAPGGDVRRRPP-------SRSPA 2872
Query: 162 VPPSSSTTTFVKRLFSPPTPGSSKDFVESSSSPPTHRS 199
P++ V+RL P S++ F P
Sbjct: 2873 AKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQ 2910
Score = 31.1 bits (70), Expect = 3.0
Identities = 19/111 (17%), Positives = 34/111 (30%), Gaps = 6/111 (5%)
Query: 89 TSSPPDTTTLSPIEAGGAPSPCHSLEVPPSPLLP---DDNHPSPTSTSVHLVSSSNTDWS 145
+ PPDT P +P+ PP + P + P+P S + +
Sbjct: 2615 SPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAA 2674
Query: 146 DVNPDSDGFVTPKPSGVPPSSSTTTFVKRLFSPPTPGSSKDFVESSSSPPT 196
+ P+ P+ + T + PP S++P
Sbjct: 2675 QASSPPQR---PRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLP 2722
>gnl|CDD|219406 pfam07420, DUF1509, Protein of unknown function (DUF1509). This
family consists of several uncharacterized viral
proteins from the Marek's disease-like viruses. Members
of this family are typically around 400 residues in
length. The function of this family is unknown.
Length = 377
Score = 32.7 bits (74), Expect = 0.71
Identities = 25/96 (26%), Positives = 34/96 (35%), Gaps = 10/96 (10%)
Query: 89 TSSPPDTTTLSPIEAGGAPSPCHSLEVPPSPLLPD-------DNHPSPTSTSVHLV---S 138
SPP T TL P A PS + P P P + P P T+ +
Sbjct: 234 APSPPRTDTLDPDPAIAGPSRAVNRTPSPRPSSPPPEIDEEYNAQPDPWGTNAQTWDNRA 293
Query: 139 SSNTDWSDVNPDSDGFVTPKPSGVPPSSSTTTFVKR 174
+ D S ++ DS G P S+S +R
Sbjct: 294 GTPDDGSAISEDSAGGSHHTMRRPPHSTSGERRGRR 329
>gnl|CDD|238501 cd01019, ZnuA, Zinc binding protein ZnuA. These proteins have been
shown to function as initial receptors in the ABC uptake
of Zn2+. They belong to the TroA superfamily of
periplasmic metal binding proteins that share a distinct
fold and ligand binding mechanism. They are comprised
of two globular subdomains connected by a single helix
and bind their specific ligands in the cleft between
these domains. A typical TroA protein is comprised of
two globular subdomains connected by a single helix and
can bind the metal ion in the cleft between these
domains. In addition, these proteins sometimes have a
low complexity region containing a metal-binding
histidine-rich motif (repetitive HDH sequence).
Length = 286
Score = 32.3 bits (74), Expect = 0.75
Identities = 6/33 (18%), Positives = 7/33 (21%)
Query: 194 PPTHRSYGRSHHHSHHHRSHSHHHHQSQSKHHH 226
SH H H H + H
Sbjct: 83 AKLIDLKTLEDGASHGDHEHDHEHAHGEHDGHE 115
>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
2'-phosphodiesterase/3'-nucleotidase precursor protein;
Reviewed.
Length = 814
Score = 32.9 bits (75), Expect = 0.85
Identities = 19/100 (19%), Positives = 32/100 (32%), Gaps = 16/100 (16%)
Query: 3 QASPITIYVSQTPSTSSLSSNLYTSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSNPSGTS 62
QA I T +S + T S + + T + + PS +
Sbjct: 27 QAEEIV----TTTPATSTEAEQTTPVESDATEEADN------TETPVAATTAAEAPSSSE 76
Query: 63 SVYTTISSSATNSVYTISSTAPSSVYTSSPPDTTTLSPIE 102
+ S + + T +S A + P T T P+E
Sbjct: 77 T--AETSDPTSEATDTTTSEARTVT----PAATETSKPVE 110
>gnl|CDD|215579 PLN03106, TCP2, Protein TCP2; Provisional.
Length = 447
Score = 32.4 bits (73), Expect = 0.97
Identities = 20/87 (22%), Positives = 26/87 (29%), Gaps = 13/87 (14%)
Query: 139 SSNTDWSDVNPDSDGFVTPKPSGVP----------PSSSTTTFVKRLFSPPTPGSSKDFV 188
++N W P + P SSS T G+S
Sbjct: 257 NNNRQWMASAPSGQKAAAARQWPSPMDYFGSGLLGGSSSRTHHSSGFPGQIQLGNSIPQP 316
Query: 189 ESSSSPPTHRSYGRSHHHSHHHRSHSH 215
S++S P HH HHH H H
Sbjct: 317 ISTNSHPFSSI---GDHHHHHHHQHQH 340
>gnl|CDD|188634 cd00945, Aldolase_Class_I, Class I aldolases. Class I aldolases.
The class I aldolases use an active-site lysine which
stabilizes a reaction intermediates via Schiff base
formation, and have TIM beta/alpha barrel fold. The
members of this family include
2-keto-3-deoxy-6-phosphogluconate (KDPG) and
2-keto-4-hydroxyglutarate (KHG) aldolases,
transaldolase, dihydrodipicolinate synthase sub-family,
Type I 3-dehydroquinate dehydratase, DeoC and DhnA
proteins, and metal-independent
fructose-1,6-bisphosphate aldolase. Although
structurally similar, the class II aldolases use a
different mechanism and are believed to have an
independent evolutionary origin.
Length = 201
Score = 31.5 bits (72), Expect = 1.00
Identities = 17/97 (17%), Positives = 33/97 (34%), Gaps = 23/97 (23%)
Query: 524 TTTRATIDLTQKAAKAGANAALILCPYYFQKKMTEDLIYEHFISVAD--NSPIPVIIYNN 581
TTT + ++A GA+ ++ K+ + + E +V + + +P+ +
Sbjct: 62 TTTEVKVAEVEEAIDLGADEIDVVINIGSLKEGDWEEVLEEIAAVVEAADGGLPLKVILE 121
Query: 582 TFVTNIDISVDTLVKLAHHENIRGVKDTDNIKLANMA 618
T RG+K D I A
Sbjct: 122 T---------------------RGLKTADEIAKAARI 137
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 32.2 bits (73), Expect = 1.00
Identities = 19/76 (25%), Positives = 29/76 (38%), Gaps = 2/76 (2%)
Query: 60 GTSSVYTTISSSATNSVYTISSTAPSSVYTSSPPDTTTLSPIEAGGAPSPCHSLEVP--P 117
G + V+ + T T P+ T+SPP TT +P AP + E P
Sbjct: 198 GPADVFVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTP 257
Query: 118 SPLLPDDNHPSPTSTS 133
+P P P + +
Sbjct: 258 APPTPGGGEAPPANAT 273
>gnl|CDD|115579 pfam06933, SSP160, Special lobe-specific silk protein SSP160. This
family consists of several special lobe-specific silk
protein SSP160 sequences which appear to be specific to
Chironomus (Midge) species.
Length = 758
Score = 32.4 bits (73), Expect = 1.0
Identities = 25/60 (41%), Positives = 33/60 (55%)
Query: 36 SSSIGQSNFTPSSSGSNVYTSNPSGTSSVYTTISSSATNSVYTISSTAPSSVYTSSPPDT 95
S+S S+ S+S SN TSN S TSS TT +S++T+S + SS S S DT
Sbjct: 104 SASGNSSSSANSTSNSNSTTSNNSTTSSNSTTTTSNSTSSSNSTSSGLTSGASVVSLIDT 163
Score = 30.1 bits (67), Expect = 5.8
Identities = 13/46 (28%), Positives = 27/46 (58%)
Query: 44 FTPSSSGSNVYTSNPSGTSSVYTTISSSATNSVYTISSTAPSSVYT 89
F S+ +N +++ S ++S + S++ TNS + +ST+ S+ T
Sbjct: 276 FANGSASANSTSNSNSTSNSTTNSNSTTTTNSTTSTNSTSSSNSST 321
Score = 29.7 bits (66), Expect = 6.7
Identities = 22/102 (21%), Positives = 43/102 (42%), Gaps = 1/102 (0%)
Query: 10 YVSQTPSTSSLSSNLYTSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSNPSGTSSVYTTIS 69
+ + + S +S S++ TS S+ N ++++ + T S+S SN T + TI+
Sbjct: 276 FANGSASANSTSNSNSTSNSTTNSNSTTTTNSTTSTNSTSSSNSSTIAGCIDIAANFTIA 335
Query: 70 SSATNSVYTISST-APSSVYTSSPPDTTTLSPIEAGGAPSPC 110
++ +T AP+ + P +A S C
Sbjct: 336 LQNLQALLLQEATCAPALAANAKKSGVRDFGPCKAAKTASGC 377
Score = 29.4 bits (65), Expect = 9.5
Identities = 24/63 (38%), Positives = 32/63 (50%), Gaps = 1/63 (1%)
Query: 30 SGNVFTSSSIGQSNFTPSSSGSNVYTSNPSGTSSVYTTISSSAT-NSVYTISSTAPSSVY 88
S N+ S S SSS +N +++ S TS+ TT S+S T S T SS + SS
Sbjct: 92 SANIQAIQSGSGSASGNSSSSANSTSNSNSTTSNNSTTSSNSTTTTSNSTSSSNSTSSGL 151
Query: 89 TSS 91
TS
Sbjct: 152 TSG 154
>gnl|CDD|146711 pfam04217, DUF412, Protein of unknown function, DUF412. This
family consists of bacterial uncharacterized proteins.
Length = 143
Score = 31.1 bits (71), Expect = 1.1
Identities = 16/45 (35%), Positives = 23/45 (51%), Gaps = 3/45 (6%)
Query: 706 MELYGYY--GGRSRRPLPAALKPGGAEKIKQVLTEAGFLVPGVRA 748
+ L G Y G R+ PLP +L E I+Q L E G + ++A
Sbjct: 75 LPLQGLYWLGKRANTPLPPSLLSWYRE-IRQKLNEQGIALQPLKA 118
Score = 28.4 bits (64), Expect = 9.2
Identities = 15/38 (39%), Positives = 19/38 (50%), Gaps = 3/38 (7%)
Query: 750 MELYGYY--GGRSRRPLPAALKPGGAEKIKQVLTEAGF 785
+ L G Y G R+ PLP +L E I+Q L E G
Sbjct: 75 LPLQGLYWLGKRANTPLPPSLLSWYRE-IRQKLNEQGI 111
>gnl|CDD|223597 COG0523, COG0523, Putative GTPases (G3E family) [General function
prediction only].
Length = 323
Score = 31.9 bits (73), Expect = 1.3
Identities = 6/22 (27%), Positives = 6/22 (27%)
Query: 201 GRSHHHSHHHRSHSHHHHQSQS 222
S S H H H S
Sbjct: 206 RTSPLFSDHQHDHEHDEGGISS 227
>gnl|CDD|217089 pfam02535, Zip, ZIP Zinc transporter. The ZIP family consists of
zinc transport proteins and many putative metal
transporters. The main contribution to this family is
from the Arabidopsis thaliana ZIP protein family these
proteins are responsible for zinc uptake in the plant.
Also found within this family are C. elegans proteins of
unknown function which are annotated as being similar to
human growth arrest inducible gene product, although
this protein in not found within this family.
Length = 314
Score = 31.9 bits (73), Expect = 1.3
Identities = 8/29 (27%), Positives = 10/29 (34%)
Query: 201 GRSHHHSHHHRSHSHHHHQSQSKHHHSKP 229
+ H H HHHH+ K K
Sbjct: 131 ESNVVHHEHGEGPHHHHHEGHEKGESDKE 159
Score = 30.0 bits (68), Expect = 4.2
Identities = 8/30 (26%), Positives = 10/30 (33%)
Query: 203 SHHHSHHHRSHSHHHHQSQSKHHHSKPLSR 232
HH H H HHH + + R
Sbjct: 134 VVHHEHGEGPHHHHHEGHEKGESDKESGLR 163
Score = 30.0 bits (68), Expect = 5.3
Identities = 9/43 (20%), Positives = 12/43 (27%)
Query: 188 VESSSSPPTHRSYGRSHHHSHHHRSHSHHHHQSQSKHHHSKPL 230
E S + P S + HH HHH +
Sbjct: 116 GEHSHTVPDEESGAVESNVVHHEHGEGPHHHHHEGHEKGESDK 158
Score = 29.6 bits (67), Expect = 6.5
Identities = 9/38 (23%), Positives = 13/38 (34%)
Query: 202 RSHHHSHHHRSHSHHHHQSQSKHHHSKPLSRTMFGPVS 239
S+ H H HHHH + S S ++
Sbjct: 131 ESNVVHHEHGEGPHHHHHEGHEKGESDKESGLRVRDIA 168
>gnl|CDD|165513 PHA03255, PHA03255, BDLF3; Provisional.
Length = 234
Score = 31.4 bits (70), Expect = 1.4
Identities = 29/112 (25%), Positives = 47/112 (41%), Gaps = 14/112 (12%)
Query: 3 QASPITIYVSQTPSTSSLSSNLYTSTSSGNVFTSSSIGQSNFTP---------SSSGSNV 53
Q++ +T + +T+ LS+N T TS+G T + T + + +
Sbjct: 57 QSTTLTTTSAPITTTAILSTNTTTVTSTGTTVTPVPTTSNASTINVTTKVTAQNITATEA 116
Query: 54 YTSNPSGTSSVYTTISSSATNSVY-----TISSTAPSSVYTSSPPDTTTLSP 100
T +G +S TT SSS T++ T + SS TS+ TT P
Sbjct: 117 GTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKGTSNATKTTAELP 168
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 32.3 bits (73), Expect = 1.4
Identities = 36/140 (25%), Positives = 51/140 (36%), Gaps = 27/140 (19%)
Query: 27 STSSGNVFTSSSIGQSNFTPSSSGSNVYTSNP-SGTSSVYTTISSSATNSVYTISSTAPS 85
+TS G V S G S +S + NP S TS+ I+S+ + STAPS
Sbjct: 319 ATSEGQVTISIMTGSSPAETKASTAAWKIRNPLSRTSAPAVRIASATFRGLEKNPSTAPS 378
Query: 86 SVYTSSPPDTTTLSPIEAGGAPSPCHSLEVPPSPLLPDDNHPSPTSTSVHLVSSSNTDWS 145
T + A H + V P+P +P PSP+ T+
Sbjct: 379 ---------TPATPRVRAVLTTQVHHCVVVKPAPAVPTT--PSPSLTTALF--------- 418
Query: 146 DVNPDSDGFVTPKPSGVPPS 165
+P PS +PP
Sbjct: 419 ------PEAPSPSPSALPPG 432
>gnl|CDD|237944 PRK15327, PRK15327, type III secretion system needle complex
protein PrgH; Provisional.
Length = 393
Score = 31.7 bits (72), Expect = 1.4
Identities = 14/54 (25%), Positives = 23/54 (42%), Gaps = 5/54 (9%)
Query: 496 DRPMSVGPRS--VRPSEREWQADLLKPQKHTTTRATIDLTQKAAKAGANAALIL 547
+ P+ VG +RP W ++L+P+K + + K G AAL
Sbjct: 100 NTPIQVGELLILIRPESEPWVPEILQPEKLEASAKK---NEPRFKNGIVAALAG 150
>gnl|CDD|144451 pfam00859, CTF_NFI, CTF/NF-I family transcription modulation
region.
Length = 295
Score = 31.6 bits (71), Expect = 1.5
Identities = 38/185 (20%), Positives = 62/185 (33%), Gaps = 10/185 (5%)
Query: 33 VFTSSSIGQSNFTPSSSGSNVYTSNPSGTSSVYTTISSSAT--NSVYTISSTAPSSVYTS 90
VF+ + + + + TP ++G+ S +SS Y ++ A S+ + SS++ T
Sbjct: 9 VFSVTELVRVSRTPIAAGTGPNFSLADLSSSSYYDLNPGAGLRRSLPSTSSSSSKRPKTV 68
Query: 91 SPPDTTTLSPIEAGGAPSPCHSLEVPPSPLLPDDNHPSPTSTSVHLVSSSNTDWSDVNPD 150
S + SP SP + + SPT S+ D +P
Sbjct: 69 SMEEEMDTSPGGEDFYTSPSSPSSSSANWHEVEGGMSSPTMKKPDKSLFSSPSPQDSSPR 128
Query: 151 SDGFVTPKPSGVPPSSSTTTFVKRLFSPPTPGSSKDFVESSSSPPTHRSYGRSHHHSHHH 210
F + S + SP S F S P SY H + +
Sbjct: 129 LSAFTQHHRPVITGHSGISA------SPHPTPSPLHFPTSPILPQQPSSY--FPHTAIRY 180
Query: 211 RSHSH 215
H H
Sbjct: 181 PPHLH 185
>gnl|CDD|215561 PLN03077, PLN03077, Protein ECB2; Provisional.
Length = 857
Score = 31.7 bits (72), Expect = 1.6
Identities = 17/48 (35%), Positives = 27/48 (56%), Gaps = 4/48 (8%)
Query: 666 AKAGKWEEAMKLQHRL----VKPDVTVRNVLLMKEMGVPGVRAAMELY 709
AKAG ++EA+ L HR+ V+PDV +L G+P + E++
Sbjct: 163 AKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLARGREVH 210
>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
subunit 62; Provisional.
Length = 576
Score = 31.8 bits (72), Expect = 1.7
Identities = 45/192 (23%), Positives = 66/192 (34%), Gaps = 38/192 (19%)
Query: 5 SPITIYVSQTPSTSSLSSNLYTSTSSGNVFTSSSIGQSNFTPSSSG--SNV--------- 53
SP T Y P TS + + +S +S + + S G SNV
Sbjct: 372 SPYTAYEDLKPPTSPIPTPPSSSPASSKSVDAVAKPAEPDVVPSPGSASNVPEVEPAQVE 431
Query: 54 ---------YT--------SNPSGTSSVYTTISSSATNSVYTISSTAPSSVYTS----SP 92
Y ++PS T+ + S S+T+SV + TAP++ T P
Sbjct: 432 AKKTRPLSPYARYEDLKPPTSPSPTAPTGVSPSVSSTSSVPAVPDTAPATAATDAAAPPP 491
Query: 93 PDTTTLSPIEAGGAPSPCHSLEVPPSPLLPDDNHPSPTSTSVHLVSSSNTDWSDVNPDSD 152
+ LSP P S PSP P ++ V V +S + D
Sbjct: 492 ANMRPLSPYAVYDDLKPPTS----PSPAAPVGKVAPSSTNEVVKVGNSAPPTALA--DEQ 545
Query: 153 GFVTPKPSGVPP 164
PKP + P
Sbjct: 546 HHAQPKPRPLSP 557
>gnl|CDD|202669 pfam03495, Binary_toxB, Clostridial binary toxin B/anthrax toxin
PA. The N-terminal region of this family contains a
calcium-binding motif that may be an EF-hand.
Length = 406
Score = 31.3 bits (71), Expect = 1.9
Identities = 21/62 (33%), Positives = 32/62 (51%), Gaps = 4/62 (6%)
Query: 14 TPSTSSLSSNLYTSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSNPSGTSSVYTTISSSAT 73
T S S+ +S YT+T +V S G S F S S S Y+ + S T +V + S+S +
Sbjct: 105 TISKSTSTSKTYTNTVGASV----SAGASLFGFSGSVSANYSHSWSSTVAVDWSTSTSWS 160
Query: 74 NS 75
+
Sbjct: 161 ET 162
>gnl|CDD|219210 pfam06873, SerH, Cell surface immobilisation antigen SerH. This
family consists of several cell surface immobilisation
antigen SerH proteins which seem to be specific to
Tetrahymena thermophila. The SerH locus of Tetrahymena
thermophila is one of several paralogous loci with genes
encoding variants of the major cell surface protein
known as the immobilisation antigen (i-ag).
Length = 407
Score = 31.4 bits (71), Expect = 2.1
Identities = 25/98 (25%), Positives = 43/98 (43%), Gaps = 5/98 (5%)
Query: 14 TPSTSSLSSNLYTSTSSGN-VFTSSSIGQSNFTPSSSGSNVYTSNPSGTSSVYTTISSSA 72
P+T + S + SS +T S+ T S SNV+ +N +G++ V ++ S +
Sbjct: 126 NPTTPAAVSGACQACSSITSGWTDSNCNACATTASPKNSNVF-ANSAGSACVASSASCGS 184
Query: 73 TNSVYTISSTAPSSVYTSSPPDTTTLSPIEAGGAPSPC 110
T+ T + A + P T L ++ A S C
Sbjct: 185 TSRGTTAWTDADCLLCN---PTTPYLVGDKSSCAASSC 219
>gnl|CDD|111090 pfam02158, Neuregulin, Neuregulin family.
Length = 406
Score = 31.3 bits (70), Expect = 2.1
Identities = 52/208 (25%), Positives = 75/208 (36%), Gaps = 19/208 (9%)
Query: 20 LSSNLYTSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSNPSGTSS--VYTTISSSATNSVY 77
+S N+ +S T +S S++T ++ S T PS + S ++ S +NSV
Sbjct: 82 VSKNVISSEHIIERETETSFSTSHYTSTAHHSTTVTQTPSHSWSNGHSESMISEESNSVI 141
Query: 78 TISSTAPSSVYTSSPPDTTTLSPIEAGGAPSPCHSLEVPPSPLLPDDNHPSPTSTSVHLV 137
+SS S + + L I GG P C L PD SP S
Sbjct: 142 MMSSVENSRHSSPAGGPRGRLHGI--GGPPDDCSFLRHARDT--PDSYRDSPHSERYVSA 197
Query: 138 SSSNTDWSDVNPDSDGFVTPKPSGVPPSSSTTTFVKRLFSPPTPGSSKDFVESSS----- 192
++ S V+ F TP P + S P+ S F+E
Sbjct: 198 MTTPARMSPVD-----FHTPISPKSPCLEMSPPESSLAVSMPSVAVSP-FIEEERPLLLV 251
Query: 193 SPPTHRS--YGRSHHHSHHHRSHSHHHH 218
SPP R Y HH+ H+ HH
Sbjct: 252 SPPRLREKKYDHKTPQKTHHKQHNSFHH 279
>gnl|CDD|115650 pfam07010, Endomucin, Endomucin. This family consists of several
mammalian endomucin proteins. Endomucin is an early
endothelial-specific antigen that is also expressed on
putative hematopoietic progenitor cells.
Length = 259
Score = 30.8 bits (69), Expect = 2.2
Identities = 28/126 (22%), Positives = 47/126 (37%), Gaps = 3/126 (2%)
Query: 42 SNFTPSSSGSNVYTSNPSGTSSVYTTISSSATNSVYTISSTAPSSVYTSSPPDTTTLSPI 101
++ +S + TS + + TT + N T T + T S +T S
Sbjct: 66 TSELLKTSLMSTATSLTTPKHELKTTTTGVRKNESSTSKVTVTNV--TLSNAVSTLQSSQ 123
Query: 102 EAGGAPSPCHSLEVPPSPLLPDDNHPSPTSTSVHLVSSSNTDWSDVNPDSDGFVTPKPSG 161
S + E+ P+ +L D P T T+ ++++ T S DG + S
Sbjct: 124 NKTENQSSIRTTEISPTSVLQPDASPKKTGTTSASLTTAET-TSQSQDTEDGKIASTSST 182
Query: 162 VPPSSS 167
P SS
Sbjct: 183 TPSYSS 188
>gnl|CDD|193490 cd04085, delta_endotoxin_C, delta-endotoxin C-terminal domain may
be associated with carbohydrate binding functionality.
Delta-endotoxin C-terminal domain (delta endotoxin
domain III) is part of the activated region of delta
endotoxins, which are insecticidal toxins produced
during sporulation by Bacillus species of bacteria. The
activated endotoxin binds to the gut epithelium and
causes cell lysis leading to death. This activated
region of the delta endotoxin is composed of three
structural domains. The N-terminal helical domain (I) is
involved in membrane insertion and pore formation, while
the second and third domains (II and III) are involved
in receptor binding. Domain III structurally resembles
the carbohydrate binding domain 6 (CBM6) and it is
possible that insect specificity is determined by
protein-protein or protein-carbohydrate interactions
mediated by both domains II and III of the toxin.
Delta-endotoxins are of great interest for development
of new bioinsecticides and in the control of mosquitoes.
Length = 145
Score = 29.9 bits (68), Expect = 2.2
Identities = 15/65 (23%), Positives = 23/65 (35%), Gaps = 3/65 (4%)
Query: 25 YTSTSSGNVFTS---SSIGQSNFTPSSSGSNVYTSNPSGTSSVYTTISSSATNSVYTISS 81
Y S + S S+ NF + S + T T + S +++ TIS
Sbjct: 67 YASNGDFTLSISGGGSTTNTVNFPSTMSSGDNLTYESFQYVEFSTPFTFSESSNTITISI 126
Query: 82 TAPSS 86
SS
Sbjct: 127 QNSSS 131
>gnl|CDD|220851 pfam10697, DUF2502, Protein of unknown function (DUF2502). Members
of this family are all Gammaproteobacteria. The function
is not known.
Length = 82
Score = 28.5 bits (64), Expect = 2.7
Identities = 8/29 (27%), Positives = 11/29 (37%)
Query: 197 HRSYGRSHHHSHHHRSHSHHHHQSQSKHH 225
GR ++ R H H + KHH
Sbjct: 54 PHDDGRHRYYYRRGRKHDDHRGKGPGKHH 82
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 31.3 bits (71), Expect = 2.8
Identities = 38/190 (20%), Positives = 56/190 (29%), Gaps = 32/190 (16%)
Query: 15 PSTSSLSSNLYTSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSNPSGTSSVYTTISSSATN 74
S SS + S + +SS S S G P + T +
Sbjct: 215 ASASSPAPAPGRSAADDAGASSSDSSSSE--SSGCGWGPENECPLPRPAPITLPTRIWEA 272
Query: 75 SVYTISSTAPSSVYTSSPPDTTTLSPIEAGGAPSPCHSLEVPPSPLLPDDNHPSPTSTSV 134
S + S+ P +SS P E +PSP P S PS S
Sbjct: 273 SGWNGPSSRPGPASSSSSPR-------ERSPSPSPSS----PGSGP-----APSSPRASS 316
Query: 135 HLVSSSNTDWSDVNPDSDGFVTPKPSGVPPSSSTTTFVKRLFSPPTPGSSKDFVESSSSP 194
SS + S + S+ + + V P S + P+P + P
Sbjct: 317 SSSSSRESSSSSTSSSSES---SRGAAVSPGPSPS-------RSPSPSRPPP----PADP 362
Query: 195 PTHRSYGRSH 204
+ R R
Sbjct: 363 SSPRKRPRPS 372
Score = 30.5 bits (69), Expect = 4.3
Identities = 26/125 (20%), Positives = 40/125 (32%), Gaps = 6/125 (4%)
Query: 83 APSSVYTSSPPDTTTLSPIEAGGAPSPCHSLEVPPSPLLPDDNHPSPTSTSVHLVSSSNT 142
A +S SS +++ P P + P+ + P+S SSS++
Sbjct: 232 AGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGP-ASSSSS 290
Query: 143 DWSDVNPDSDGFVTPKPSGVPPSSSTTTFVKRLFSPPTPGSSKDFVESSSSPPTHRSYGR 202
+ +P G P+ S+ S SS S SS S G
Sbjct: 291 P-----RERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGP 345
Query: 203 SHHHS 207
S S
Sbjct: 346 SPSRS 350
Score = 30.1 bits (68), Expect = 6.0
Identities = 22/120 (18%), Positives = 31/120 (25%)
Query: 80 SSTAPSSVYTSSPPDTTTLSPIEAGGAPSPCHSLEVPPSPLLPDDNHPSPTSTSVHLVSS 139
S+ + S P A P S S + +S
Sbjct: 176 LSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGAS 235
Query: 140 SNTDWSDVNPDSDGFVTPKPSGVPPSSSTTTFVKRLFSPPTPGSSKDFVESSSSPPTHRS 199
S+ S + + P+ T S SS+ SSSS P RS
Sbjct: 236 SSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERS 295
>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 624
Score = 30.8 bits (69), Expect = 3.1
Identities = 25/104 (24%), Positives = 40/104 (38%), Gaps = 8/104 (7%)
Query: 55 TSNPSGTSSVYTTISSSATNSVYTISST-APSSVYTSSPPDTTTLSPIEAGGAPSPCHSL 113
T P GT+ SSA + S+ +P + +PP +P +G P P +
Sbjct: 402 TQGPQGTAPAAGMTPSSAAPATPAPSAAPSPRVPWDDAPP-----APPRSGIPPRPAPRM 456
Query: 114 -EVPPSPLLPDDNHPSPTSTSVHLVSSSNTDWSDVNPDS-DGFV 155
E P P PD + + S + + P + DGF+
Sbjct: 457 PEASPVPGAPDSVASASDAPPTLGDPSDTAEHTPSGPRTWDGFL 500
>gnl|CDD|236005 PRK07376, PRK07376, NAD(P)H-quinone oxidoreductase subunit F;
Validated.
Length = 673
Score = 30.8 bits (70), Expect = 3.1
Identities = 13/36 (36%), Positives = 16/36 (44%), Gaps = 2/36 (5%)
Query: 199 SYGRSHHHSHHHRSHSHHHHQSQSKHHHSKPLSRTM 234
+ HH HH H HHH S S H P + T+
Sbjct: 472 AADAPGHHEGHHEEHGEHHHHSGSPHE--SPWTMTL 505
>gnl|CDD|178608 PLN03046, PLN03046, D-glycerate 3-kinase; Provisional.
Length = 460
Score = 30.7 bits (69), Expect = 3.3
Identities = 23/138 (16%), Positives = 48/138 (34%), Gaps = 15/138 (10%)
Query: 17 TSSLSSNLYTSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSNPSGTSSVYTTISSSATNSV 76
+S +SSNL + I +++ + + + NP S ++
Sbjct: 8 SSLISSNLQHY--------MNPIHAADYPIPCASAAAFNFNPISNPFNLKRRPYSPSSHK 59
Query: 77 YTISSTAPSSVYTSSPPDTTTLSPIEAGGAPSPCHSLEVPPSPLLPDDNHPSPTSTSVHL 136
+ A S Y SS LS + + C + + ++ D + TS
Sbjct: 60 FNDRVAASCSSYPSS-----KLSSRKTHLSCPGCGCSWIQDNSMVHDYATTAATSKRCSS 114
Query: 137 VSSSNTDWSDVNPDSDGF 154
+ +S+ + V+ D +
Sbjct: 115 LPTSSP--ALVSSVQDLY 130
>gnl|CDD|240410 PTZ00418, PTZ00418, Poly(A) polymerase; Provisional.
Length = 593
Score = 30.9 bits (70), Expect = 3.4
Identities = 17/62 (27%), Positives = 30/62 (48%), Gaps = 4/62 (6%)
Query: 12 SQTPSTSSLSSNLYTSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSNPSGTSSVYTTISSS 71
SQTP + + +S TS G S T +S+ + S+P+ +S+ +SS+
Sbjct: 533 SQTPEEPVKTKANTKTNTSSAT-TSGQSGSSGSTSNSNSN---ESSPTMSSTELLNVSST 588
Query: 72 AT 73
+T
Sbjct: 589 ST 590
>gnl|CDD|183928 PRK13262, ureE, urease accessory protein UreE; Provisional.
Length = 231
Score = 30.0 bits (67), Expect = 3.5
Identities = 10/28 (35%), Positives = 11/28 (39%)
Query: 199 SYGRSHHHSHHHRSHSHHHHQSQSKHHH 226
S+ H H H HSH H H H
Sbjct: 204 SHEEGHSHGDHDHDHSHSHGDHDHDHKH 231
>gnl|CDD|204706 pfam11666, DUF2933, Protein of unknown function (DUF2933). This
bacterial family of proteins has no known function.
Length = 56
Score = 27.6 bits (62), Expect = 3.6
Identities = 3/17 (17%), Positives = 3/17 (17%)
Query: 204 HHHSHHHRSHSHHHHQS 220
H H H
Sbjct: 39 MHGGHGGHGGHDSHDDD 55
>gnl|CDD|193258 pfam12782, Innate_immun, Invertebrate innate immunity transcript
family. The immune response of the purple sea urchin
appears to be more complex than previously believed in
that it uses immune-related gene families homologous to
vertebrate Toll-like and NOD/NALP-like receptor families
as well as C-type lectins and a rudimentary complement
system. In addition, the species also produces this
unusual family of mRNAs, also known as 185/333, which is
strongly upregulated in response to pathogen challenge.
Length = 312
Score = 30.4 bits (68), Expect = 3.7
Identities = 6/31 (19%), Positives = 8/31 (25%)
Query: 194 PPTHRSYGRSHHHSHHHRSHSHHHHQSQSKH 224
P HR +G +H H
Sbjct: 178 PHHHRHHGHNHFGRKPFGGRPFGRRNHTEGH 208
>gnl|CDD|221146 pfam11597, Med13_N, Mediator complex subunit 13 N-terminal.
Mediator is a large complex of up to 33 proteins that is
conserved from plants through fungi to humans - the
number and representation of individual subunits varying
with species. It is arranged into four different
sections, a core, a head, a tail and a kinase-activity
part, and the number of subunits within each of these is
what varies with species. Overall, Mediator regulates
the transcriptional activity of RNA polymerase II but it
would appear that each of the four different sections
has a slightly different function. Med13 is part of the
ancillary kinase module, together with Med12, CDK8 and
CycC, which in yeast is implicated in transcriptional
repression, though most of this activity is likely
attributable to the CDK8 kinase. The large Med12 and
Med13 proteins are required for specific developmental
processes in Drosophila, zebrafish, and Caenorhabditis
elegans but their biochemical functions are not
understood.
Length = 370
Score = 30.6 bits (69), Expect = 3.8
Identities = 43/243 (17%), Positives = 72/243 (29%), Gaps = 38/243 (15%)
Query: 5 SPITIYVSQTPSTSSLSSNLYTSTSSG-----------NVFTSSSIGQSNFTP------- 46
I + P++S N+ +S V TS I Q P
Sbjct: 128 VRIGKWFVNPPTSSERPQNINSSQHLSEIYTFFISAVSGVLTSVLIRQHGAIPLLSRTHL 187
Query: 47 SSSGSNVYTSNPSGTSSVYTTISSSATNSVYTISS-TAPSSVYTSSPPDTTTLSPIEAGG 105
+ + VY +P G + T S + V S+ S+ T P L +
Sbjct: 188 TPAQKQVY-LSPGGLNGTLTGQSYLSDLQVDLTSAGELLVSLVTMVYPGLFVLVDLSD-- 244
Query: 106 APSPCHSLEVPPSPLLPDDNHPSPTSTSVHLVSSSNTDWSDVNPDSDGFVTPKPSGVPPS 165
SP ++PP + P+P+ + LVS+ + + S V P
Sbjct: 245 --SPQMIEDLPPGTDV----WPAPSGSIARLVSTVDGTQLASDDPSPSSVKLTPPTSSRD 298
Query: 166 SSTT---------TFVKRLFSPPTPGSSKDFVE-SSSSPPTHRSYGRSHHHSHHHRSHSH 215
++ PP+ + +VE R G H + +S
Sbjct: 299 HDPDIKQWKANVLKWLSNFGLPPSSVEALSWVEVEVWEGFYSRLSGEHHRQNEEGKSSPP 358
Query: 216 HHH 218
Sbjct: 359 LKR 361
>gnl|CDD|226908 COG4531, ZnuA, ABC-type Zn2+ transport system, periplasmic
component/surface adhesin [Inorganic ion transport and
metabolism].
Length = 318
Score = 30.1 bits (68), Expect = 4.2
Identities = 8/27 (29%), Positives = 11/27 (40%), Gaps = 1/27 (3%)
Query: 200 YGRSHHHSHHHRSHSHHHHQSQSKHHH 226
+ H H H H+ H+ HHH
Sbjct: 120 HDHEEEHDHGH-DHAGADHKGDHDHHH 145
>gnl|CDD|165099 PHA02732, PHA02732, hypothetical protein; Provisional.
Length = 1467
Score = 30.5 bits (68), Expect = 4.2
Identities = 23/93 (24%), Positives = 37/93 (39%), Gaps = 6/93 (6%)
Query: 10 YVSQTPSTSSLSSNLYTSTSSGNVFTSSSIGQSNFTPS--SSGSNVYTSNPSGTSSVYTT 67
Y+S + SL ++ T+ +F+ I S +GS +Y +P T
Sbjct: 1155 YISYKFNALSLG---FSITADAAIFSLFGIPAPQLLSSYIPTGSVLYQ-DPIFTYIPPGI 1210
Query: 68 ISSSATNSVYTISSTAPSSVYTSSPPDTTTLSP 100
I S TN+ ++ S +S P TT P
Sbjct: 1211 IGMSGTNTFTFKAAQLQLSAASSPPAATTPTPP 1243
>gnl|CDD|238500 cd01018, ZntC, Metal binding protein ZntC. These proteins are
predicted to function as initial receptors in ABC
transport of metal ions. They belong to the TroA
superfamily of helical backbone metal receptor proteins
that share a distinct fold and ligand binding mechanism.
They are comprised of two globular subdomains connected
by a long alpha helix and bind their specific ligands in
the cleft between these domains. In addition, many of
these proteins possess a metal-binding histidine-rich
motif (repetitive HDH sequence).
Length = 266
Score = 30.0 bits (68), Expect = 4.8
Identities = 10/14 (71%), Positives = 10/14 (71%)
Query: 205 HHSHHHRSHSHHHH 218
HH HHH H HHHH
Sbjct: 94 HHHHHHGEHEHHHH 107
>gnl|CDD|240270 PTZ00107, PTZ00107, hexokinase; Provisional.
Length = 464
Score = 30.0 bits (68), Expect = 5.1
Identities = 16/35 (45%), Positives = 18/35 (51%)
Query: 389 VKWFGMSNEPLSEVEPATLKSLSQGLEAHHRSRKK 423
V F MS E L E+ L L +GLEAH R R
Sbjct: 15 VNQFTMSKEKLKELVDYFLYELVEGLEAHRRHRNL 49
>gnl|CDD|99901 cd05840, SPBC215_ISWI_like, The PWWP domain is a component of the
S. pombe hypothetical protein SPBC215, as well as ISWI
complex protein 4. The ISWI (imitation switch) proteins
are ATPases responsible for chromatin remodeling in
eukaryotes, and SPBC215 is proposed to also bind
chromatin. The PWWP domain, named for a conserved
Pro-Trp-Trp-Pro motif, is a small domain consisting of
100-150 amino acids. The PWWP domain is found in
numerous proteins that are involved in cell division,
growth and differentiation. Most PWWP-domain proteins
seem to be nuclear, often DNA-binding, proteins that
function as transcription factors regulating a variety
of developmental processes.
Length = 93
Score = 28.1 bits (63), Expect = 5.2
Identities = 9/16 (56%), Positives = 11/16 (68%)
Query: 360 GDLVWGSVKGYPSWPG 375
GD V VKG+P+WP
Sbjct: 4 GDRVLAKVKGFPAWPA 19
>gnl|CDD|219916 pfam08580, KAR9, Yeast cortical protein KAR9. The KAR9 protein in
Saccharomyces cerevisiae is a cytoskeletal protein
required for karyogamy, correct positioning of the
mitotic spindle and for orientation of cytoplasmic
microtubules. KAR9 localises at the shmoo tip in mating
cells and at the tip of the growing bud in anaphase.
Length = 626
Score = 30.2 bits (68), Expect = 5.3
Identities = 41/215 (19%), Positives = 60/215 (27%), Gaps = 34/215 (15%)
Query: 11 VSQTPSTSSLSSNLYTSTSSGNVFTSSSIG-----------QSNFTPSSSGSNVYTSNPS 59
V TP S SN+ + S +G P SN+ S
Sbjct: 405 VIMTPPDSGPGSNVSSRRVGTPGSKSDRVGAVLLRRMNIKPTLASIPDEKPSNISVFEDS 464
Query: 60 GTSSVYTTISSSATNSVYTISSTA---------PSSVYTSSPP-----DTTTLSPIEAGG 105
TS +T+ S +S PP TL
Sbjct: 465 ETSPNSSTLLRDPPPKKCGEESGHLPNNPFFNKLKLTLSSIPPLSPRQSIITLPTPSRP- 523
Query: 106 APSPCHSLEVPPSPLLPDDNHPSPTSTSVHLVSSSNTDW----SDVNPDSDGFVTPKPSG 161
S SL + P P T V ++ + SD+ + G P+
Sbjct: 524 -ASRISSLSLRLGSYSGSIVSPPPYPTLVSRKGAAGLSFNRSVSDIEGERIGRYNLLPTR 582
Query: 162 VPP---SSSTTTFVKRLFSPPTPGSSKDFVESSSS 193
+P + +TT +R S P+P F S
Sbjct: 583 IPALPFKAESTTSSRRSSSLPSPTGVIGFPGSVPR 617
>gnl|CDD|221427 pfam12114, Period_C, Period protein 2/3C-terminal region. This
domain is found in eukaryotes. This domain is typically
between 164 to 200 amino acids in length. This domain
is found associated with pfam08447.
Length = 190
Score = 29.3 bits (66), Expect = 5.4
Identities = 21/63 (33%), Positives = 29/63 (46%), Gaps = 2/63 (3%)
Query: 15 PSTSS--LSSNLYTSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSNPSGTSSVYTTISSSA 72
S+SS L L + SG +S G S SGSN +S S +S + +I S+
Sbjct: 5 HSSSSDLLDLLLQEDSQSGTGSAASGSGSSGSGSLGSGSNGSSSGSSNSSKYFGSIDMSS 64
Query: 73 TNS 75
NS
Sbjct: 65 ENS 67
>gnl|CDD|226406 COG3889, COG3889, Predicted solute binding protein [General
function prediction only].
Length = 872
Score = 30.2 bits (68), Expect = 5.4
Identities = 24/86 (27%), Positives = 38/86 (44%), Gaps = 6/86 (6%)
Query: 4 ASPITIYVSQTPS-TSSLSSNLYTSTSSGNVFTSSSIGQSNF----TPSSSGSNVYTSNP 58
+ T V Q+P+ T + S S +S + T+S + T +SS S T++P
Sbjct: 769 SGTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQTQTTTSSPSPTQTTSP 828
Query: 59 SGTSSVY-TTISSSATNSVYTISSTA 83
+ TS+ TT S S T + I
Sbjct: 829 TQTSTSTTTTTSPSQTTTGGGICGPI 854
>gnl|CDD|223031 PHA03273, PHA03273, envelope glycoprotein C; Provisional.
Length = 486
Score = 30.0 bits (67), Expect = 5.6
Identities = 15/76 (19%), Positives = 31/76 (40%)
Query: 12 SQTPSTSSLSSNLYTSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSNPSGTSSVYTTISSS 71
S + S S+ ST + T+S++ T + + +N + + +S + ++
Sbjct: 31 SSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQPHSHETT 90
Query: 72 ATNSVYTISSTAPSSV 87
T + IS SV
Sbjct: 91 ITCTKSLISVPYYKSV 106
>gnl|CDD|202321 pfam02630, SCO1-SenC, SCO1/SenC. This family is involved in
biogenesis of respiratory and photosynthetic systems.
SCO1 is required for a post-translational step in the
accumulation of subunits COXI and COXII of cytochrome c
oxidase. SenC is required for optimal cytochrome c
oxidase activity and maximal induction of genes encoding
the light-harvesting and reaction centre complexes of R.
capsulatus.
Length = 152
Score = 28.9 bits (65), Expect = 6.0
Identities = 11/51 (21%), Positives = 23/51 (45%), Gaps = 2/51 (3%)
Query: 375 GKLISPAPTQGRVWVKWFGMSNEPLSEVEPATLKSLSQGLEAHHRSRKKLQ 425
GK + +G++ + +FG +N P + P L L+ ++ +Q
Sbjct: 20 GKNFTSENLKGKLSLIYFGFTNCPD--ICPPALDRLTDIIKKLKAENIDVQ 68
>gnl|CDD|173883 cd08518, PBP2_NikA_DppA_OppA_like_19, The substrate-binding
component of an uncharacterized ABC-type
nickel/dipeptide/oligopeptide-like import system
contains the type 2 periplasmic binding fold. This CD
represents the substrate-binding domain of an
uncharacterized ATP-binding cassette (ABC) type
nickel/dipeptide/oligopeptide-like transporter. The
oligopeptide-binding protein OppA and the
dipeptide-binding protein DppA show significant sequence
similarity to NikA, the initial nickel receptor. The
DppA binds dipeptides and some tripeptides and is
involved in chemotaxis toward dipeptides, whereas the
OppA binds peptides of a wide range of lengths (2-35
amino acid residues) and plays a role in recycling of
cell wall peptides, which precludes any involvement in
chemotaxis. Most of other periplasmic binding proteins
are comprised of only two globular subdomains
corresponding to domains I and III of the
dipeptide/oligopeptide binding proteins. The structural
topology of these domains is most similar to that of the
type 2 periplasmic binding proteins (PBP2), which are
responsible for the uptake of a variety of substrates
such as phosphate, sulfate, polysaccharides,
lysine/arginine/ornithine, and histidine. The PBP2 bind
their ligand in the cleft between these domains in a
manner resembling a Venus flytrap. After binding their
specific ligand with high affinity, they can interact
with a cognate membrane transport complex comprised of
two integral membrane domains and two cytoplasmically
located ATPase domains. This interaction triggers the
ligand translocation across the cytoplasmic membrane
energized by ATP hydrolysis. Besides transport
proteins, the PBP2 superfamily includes the
ligand-binding domains from ionotropic glutamate
receptors, LysR-type transcriptional regulators, and
unorthodox sensor proteins involved in signal
transduction.
Length = 464
Score = 29.9 bits (68), Expect = 6.3
Identities = 21/84 (25%), Positives = 32/84 (38%), Gaps = 14/84 (16%)
Query: 705 AMELYGYYGGRSRRPLPAALKPGGAEKIKQVLTEAGFLVP---------GVRAAMELYGY 755
Y G AA+ EK K++L EAG+ G +A LY
Sbjct: 275 GTPAYSPPDGLPWGNPDAAIYDYDPEKAKKILEEAGW-KDGDDGGREKDGQKAEFTLYYP 333
Query: 756 YGGRSRRPLPAALKPGGAEKIKQV 779
G + R+ L A+ A + K++
Sbjct: 334 SGDQVRQDLAVAV----ASQAKKL 353
>gnl|CDD|223044 PHA03325, PHA03325, nuclear-egress-membrane-like protein;
Provisional.
Length = 418
Score = 29.9 bits (67), Expect = 6.5
Identities = 22/126 (17%), Positives = 35/126 (27%), Gaps = 8/126 (6%)
Query: 16 STSSLSSNLYTSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSNPS--GTSSVYTTISSSAT 73
S S L S V G+ + +S + SSSA
Sbjct: 299 SEHSDPEPLPASLPPPPVRRPRVKHPEAGKEEPDGARNAEAKEPAQPATSTSSKGSSSAQ 358
Query: 74 NSVYTISSTAPSSVYTSSPPDTTTLSPIEAGGAPSPCHSLEVPPSPLLPDDNHPSPTSTS 133
N + S SS + + G+P + + P + P P S
Sbjct: 359 NKDSGSTGPGSSLAAASSFLED------DDFGSPPLDLTTSLRHMPSPSVTSAPEPPSIP 412
Query: 134 VHLVSS 139
+ +S
Sbjct: 413 LTYLSD 418
>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
Length = 291
Score = 29.2 bits (65), Expect = 7.3
Identities = 19/72 (26%), Positives = 28/72 (38%), Gaps = 2/72 (2%)
Query: 26 TSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSNPSGTSSVYTTISSSATNSVYTISSTAPS 85
T+T + N T SS P S+ + P + T S+ T ++T PS
Sbjct: 23 TTTQAANATTPSSTKVE--APQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTPNATTPS 80
Query: 86 SVYTSSPPDTTT 97
S +P TT
Sbjct: 81 STKVETPQSPTT 92
>gnl|CDD|221000 pfam11145, DUF2921, Protein of unknown function (DUF2921). This
eukaryotic family of proteins has no known function.
Length = 871
Score = 29.8 bits (67), Expect = 7.6
Identities = 20/108 (18%), Positives = 29/108 (26%), Gaps = 4/108 (3%)
Query: 110 CHSLEVPPSPLLPDDNHPSPTSTSVHLVSSSNTDWSDVNPDSDGFVTPKPSGVPPSSSTT 169
H + P D P+ +S + DW D S S
Sbjct: 3 DHCSSIVPESSELKDEDPTRPLVRSRQLSFFDGDWFQSGGDP--LFPRDGSDAGDSLRLA 60
Query: 170 TF-VKRLFSPPTPGSSKDFVESSSSPPTHRSYGRSHHHSHHHRSHSHH 216
+F V + G VE R+ S ++S H S
Sbjct: 61 SFSVTHIRKTRRAGKIIK-VEGVLVLTISRTSASSSYYSGDHGQSSLE 107
>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23. All proteins
in this family for which functions are known are
components of a multiprotein complex used for targeting
nucleotide excision repair to specific parts of the
genome. In humans, Rad23 complexes with the XPC protein.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 378
Score = 29.5 bits (66), Expect = 7.7
Identities = 15/82 (18%), Positives = 30/82 (36%), Gaps = 1/82 (1%)
Query: 76 VYTISSTAPSSVYTSSPPDTTTLSPIEAGGAP-SPCHSLEVPPSPLLPDDNHPSPTSTSV 134
V +S + + P T T +P P SP + P+ + + + ++T+
Sbjct: 71 VVMVSKPKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATAT 130
Query: 135 HLVSSSNTDWSDVNPDSDGFVT 156
S S + S + + V
Sbjct: 131 APESPSTSVPSSGSDAASTLVV 152
>gnl|CDD|218107 pfam04484, DUF566, Family of unknown function (DUF566). Family of
related proteins that is plant specific.
Length = 313
Score = 29.1 bits (65), Expect = 7.8
Identities = 25/134 (18%), Positives = 44/134 (32%), Gaps = 5/134 (3%)
Query: 67 TISSSATNSVYTISSTAPSSVYTSSPPDTTTLSPIEAGGAPSPCHSLEVPPSPLLPDDNH 126
++SS +T+ + ++ +SS T+ S AP+ P S + +
Sbjct: 4 SVSSGSTSGDASSPRSSSRRRLSSSFLSTSASSRPRRLNAPASP-----PSSSPARNTSS 58
Query: 127 PSPTSTSVHLVSSSNTDWSDVNPDSDGFVTPKPSGVPPSSSTTTFVKRLFSPPTPGSSKD 186
S S SS + S +P + S T SP +
Sbjct: 59 SSSFGLSKQRPSSLSRGRLSSRFVSPSRGSPSAAASLNGSLATASTSGSSSPSRSRRTTS 118
Query: 187 FVESSSSPPTHRSY 200
SS + P+ S+
Sbjct: 119 SDLSSGNGPSVLSF 132
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 29.4 bits (66), Expect = 8.0
Identities = 18/112 (16%), Positives = 28/112 (25%), Gaps = 30/112 (26%)
Query: 80 SSTAPSSVYTSSPPDTTTLSPIEAG----GAPSPCHSLEVPPSPLLPDDNHPSPTSTSVH 135
+ P + P SP+ P + +PP + + P P
Sbjct: 358 ALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRP- 416
Query: 136 LVSSSNTDWSDVNPDSDGFVTPKPSGVPPSSSTTTF------VKRLFSPPTP 181
V P P S+ T K ++PP P
Sbjct: 417 -------------------VAPPVPHTPESAPKLTRAAIPVDEKPKYTPPAP 449
>gnl|CDD|236082 PRK07737, fliD, flagellar capping protein; Validated.
Length = 501
Score = 29.4 bits (66), Expect = 8.1
Identities = 17/56 (30%), Positives = 23/56 (41%), Gaps = 4/56 (7%)
Query: 46 PSSSGSNVYTSNPSGTSSVYTTISSSATNSVYTISST--APSSVYTSSPPDTTTLS 99
S+ TS S S V T SSSA N T+ T A ++ + SS +
Sbjct: 67 QSTYLKKTVTS--SNESVVTATASSSAANGSTTMQVTQLATAATWKSSAKTSFYTG 120
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 29.7 bits (66), Expect = 8.5
Identities = 32/174 (18%), Positives = 50/174 (28%), Gaps = 7/174 (4%)
Query: 41 QSNFTPSSSGSNVYTSNPSGTSSVYTTISSSATNSVYTISSTAPSSVYTSSPPDTTTLSP 100
Q N + S S + P G S+ ++ S + +A + SP
Sbjct: 158 QDNESDSDSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQ 217
Query: 101 IEAGGAPSPCHSLEVPPSPLLPDDNHPSPTSTSVHLVSSSNTDWSDVNPDSDGFVTPKPS 160
P L + +P L PSP +S + P S +
Sbjct: 218 ------PQQPSPLSLISAPSLHPQRLPSPHPPLQPQTASQQSP-QPPAPSSRHPQSSHHG 270
Query: 161 GVPPSSSTTTFVKRLFSPPTPGSSKDFVESSSSPPTHRSYGRSHHHSHHHRSHS 214
PP P+ + F + S P ++ HSH S S
Sbjct: 271 PGPPMPHALQQGPVFLQHPSSNPPQPFGLAQSQVPPLPLPSQAQPHSHTPPSQS 324
>gnl|CDD|166510 PLN02869, PLN02869, fatty aldehyde decarbonylase.
Length = 620
Score = 29.4 bits (66), Expect = 8.6
Identities = 9/14 (64%), Positives = 10/14 (71%)
Query: 204 HHHSHHHRSHSHHH 217
HHH + R HSHHH
Sbjct: 150 HHHYLYSRYHSHHH 163
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 29.7 bits (66), Expect = 8.6
Identities = 19/103 (18%), Positives = 35/103 (33%), Gaps = 21/103 (20%)
Query: 3 QASPITIYVSQTPSTSSLSSNLYTSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSNPSGTS 62
+A+P P + +S + +S+ S +++ G S
Sbjct: 123 KANPNEAGKPGKPKGNQGEGLASSSDGK--------------SKASAKSGSKSASKHGES 168
Query: 63 SV---YTTISSSATNSVYTISSTAPSSVYTSSPPDTTTLSPIE 102
+ T S A+ SV I + + P TL+P+E
Sbjct: 169 NSSDESATDSGKASASVAGIV----GADEEAPPAPKNTLTPLE 207
>gnl|CDD|118101 pfam09565, RE_NgoFVII, NgoFVII restriction endonuclease. This
family includes the NgoFVII (recognises GCSGC but
cleavage site unknown) restriction endonuclease.
Length = 296
Score = 29.0 bits (65), Expect = 8.6
Identities = 18/116 (15%), Positives = 36/116 (31%), Gaps = 6/116 (5%)
Query: 50 GSNVYTSNPSGTSSVYTTISSSATNSVYTISSTAPSSVYTSSPP--DTTTLSPIEAGGAP 107
GS + + T+ Y + + Y + S++ D+ + +P
Sbjct: 100 GSANLSQINAFTTRQYEAVVTLDPAPAYDLYEFLLSNIVCLHVNIEDSIEPQIVIEYNSP 159
Query: 108 --SPCHSLEVPPSPLLPDDNHPSPTSTSVHLV--SSSNTDWSDVNPDSDGFVTPKP 159
+ + S + N S + L SSN +WS +G +
Sbjct: 160 LTTIAGVTGIADSTIQLLMNKGFDYSFDIPLKVPESSNLNWSYGLARKNGQRKARN 215
>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
Length = 619
Score = 29.4 bits (66), Expect = 8.7
Identities = 26/181 (14%), Positives = 58/181 (32%), Gaps = 9/181 (4%)
Query: 20 LSSNLYTSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSNPSGTSSVYTTISSSATNSVYTI 79
LS+ L T + + + + T ++ + SN TS ++ + ++ T
Sbjct: 11 LSTTLVLPTLTSPTAYADDPQKDS-TAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTS 69
Query: 80 SS----TAPSSVYTSSPPDTTTLSPIEAGGAPSPCHSLEVPPSPLLPDDNHPSPTSTSVH 135
+ S++ +S+ + I P ++ + DDN+ T
Sbjct: 70 NQDNNDKKFSTIDSSTSDSNNIIDFIYKN---LPQTNINQLLTKNKYDDNYSLTTLIQNL 126
Query: 136 LVSSSNTDWSDVNPDSDGFVTPKPSGVPPSSSTTTFVKRLFSPPTPGSSKDFVESSSSPP 195
+S+ P + T + SS + + K +++ P
Sbjct: 127 FNLNSDIS-DYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPS 185
Query: 196 T 196
T
Sbjct: 186 T 186
>gnl|CDD|146285 pfam03566, Peptidase_A21, Peptidase family A21.
Length = 628
Score = 29.5 bits (66), Expect = 8.7
Identities = 16/73 (21%), Positives = 25/73 (34%), Gaps = 1/73 (1%)
Query: 26 TSTSSGNVFTSSSIGQSNFTPSSSGSNVYTSNPSGTSSVYTTISSSATN-SVYTISSTAP 84
T SSG G + YT ++ T++ T TI+ TAP
Sbjct: 328 TPNSSGQFGQWRHGGFDASVILPTVPRGYTMEYGDFANPGDTLTFGQTGGDNVTITITAP 387
Query: 85 SSVYTSSPPDTTT 97
+ T T++
Sbjct: 388 TVTVTVLASLTSS 400
>gnl|CDD|219071 pfam06525, SoxE, Sulfocyanin (SoxE). This family consists of
several archaeal sulfocyanin (or blue copper protein)
sequences from a number of Sulfolobus species.
Length = 195
Score = 28.6 bits (64), Expect = 8.8
Identities = 9/36 (25%), Positives = 17/36 (47%)
Query: 54 YTSNPSGTSSVYTTISSSATNSVYTISSTAPSSVYT 89
+N + T + + +SS + S + S+T P T
Sbjct: 17 TFTNITKTYNTTSMLSSPTSTSTTSSSTTLPPGAIT 52
>gnl|CDD|215448 PLN02834, PLN02834, 3-dehydroquinate synthase.
Length = 433
Score = 29.4 bits (66), Expect = 9.0
Identities = 16/78 (20%), Positives = 24/78 (30%), Gaps = 4/78 (5%)
Query: 67 TISSSATNS---VYTISSTAPSS-VYTSSPPDTTTLSPIEAGGAPSPCHSLEVPPSPLLP 122
SSSA NS T+ S +PS + + ++ C S +
Sbjct: 1 KKSSSADNSESNTPTVLSRSPSDAFFDQNSSIESSKEGDLTEVIHEKCPVSGANKSEVTK 60
Query: 123 DDNHPSPTSTSVHLVSSS 140
+ T V L S
Sbjct: 61 TASATVTTVVKVDLGDRS 78
>gnl|CDD|218773 pfam05841, Apc15p, Apc15p protein. The anaphase-promoting complex
(APC) is a conserved multi-subunit ubiquitin ligase
required for the degradation of key cell cycle
regulators Members of this family are components of the
anaphase-promoting complex homologous to Apc15p.
Length = 114
Score = 27.8 bits (62), Expect = 9.9
Identities = 2/36 (5%), Positives = 7/36 (19%)
Query: 190 SSSSPPTHRSYGRSHHHSHHHRSHSHHHHQSQSKHH 225
+ HS++ + +
Sbjct: 8 IRDEQSSRNYRKGRQSHSNNSTDRELLSAPPLAAYG 43
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.313 0.128 0.376
Gapped
Lambda K H
0.267 0.0712 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 38,854,007
Number of extensions: 3735948
Number of successful extensions: 4698
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4141
Number of HSP's successfully gapped: 293
Length of query: 786
Length of database: 10,937,602
Length adjustment: 105
Effective length of query: 681
Effective length of database: 6,280,432
Effective search space: 4276974192
Effective search space used: 4276974192
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 63 (28.1 bits)