RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= 002403
(927 letters)
>gnl|CDD|215860 pfam00328, His_Phos_2, Histidine phosphatase superfamily (branch
2). The histidine phosphatase superfamily is so named
because catalysis centres on a conserved His residue
that is transiently phosphorylated during the catalytic
cycle. Other conserved residues contribute to a
'phosphate pocket' and interact with the phospho group
of substrate before, during and after its transfer to
the His residue. Structure and sequence analyses show
that different families contribute different additional
residues to the 'phosphate pocket' and, more
surprisingly, differ in the position, in sequence and in
three dimensions, of a catalytically essential acidic
residue. The superfamily may be divided into two main
branches.The smaller branch 2 contains predominantly
eukaryotic proteins. The catalytic functions in members
include phytase, glucose-1-phosphatase and multiple
inositol polyphosphate phosphatase. The in vivo roles of
the mammalian acid phosphatases in branch 2 are not
fully understood, although activity against
lysophosphatidic acid and tyrosine-phosphorylated
proteins has been demonstrated.
Length = 327
Score = 81.3 bits (201), Expect = 2e-16
Identities = 55/272 (20%), Positives = 88/272 (32%), Gaps = 78/272 (28%)
Query: 509 GLLRLHSTYRHDLKIYSSDEGRVQMSAAAFAKGLLDLEGQLTPILVSLVSKDSSMLDGLD 568
GLL ++ I SSD R SA AF GL EG D+ +LD
Sbjct: 73 GLLPDGYPP-SEVYIRSSDSNRTLASAQAFLAGLFPPEG----------DIDNDLLDWQP 121
Query: 569 NASIEMEEAKARLNEIIKSGSKMIHSNGSSDCPWMADGVGLPPNASELLPKLVKLTKKVT 628
+ + E K L ++ G CP + + K+V
Sbjct: 122 IPVVTLPEVKKALANLLLL--------GYDSCPAFDE---------------SLVEKRVD 158
Query: 629 EQVRQLAKDEDEDLAETNPYDVIPPYDQAKALGKTNIDVDRIAAGLPCGSEGFLLMYARW 688
++ +L E +A+ G+TN+ W
Sbjct: 159 PELAKLLAVYLEPIAKRL---------SQLLPGETNLT-----------------GLDVW 192
Query: 689 RKLERDLYNERKERFDITQIPDVYDSCKYDLLHNAHLNLEGLDELFKVAQLLADGVIPNE 748
L+ L+ D + PD++ YD LH +L+ L+E + ++
Sbjct: 193 ALLDLCLFE--TNASDNSPFPDLFTG--YDALHLEYLSD--LEEYYGLS----------- 235
Query: 749 YGINPKQKLKIGSKIARRLLGKLLIDLRNTRE 780
GI P+ IG + LL +L L +
Sbjct: 236 -GIGPELAKLIGGPLLNELLKRLTNALVPDQT 266
Score = 45.1 bits (107), Expect = 1e-04
Identities = 14/41 (34%), Positives = 17/41 (41%)
Query: 355 ELRCVIAVMRHGDRTPKQKVKLKVTEEKLLNLMLKYNGGRP 395
EL V V RHGDRTP + + L K+N
Sbjct: 1 ELEFVQVVSRHGDRTPTKSLPKDPEAWPFKLLSEKHNDFLN 41
Score = 41.3 bits (97), Expect = 0.002
Identities = 15/82 (18%), Positives = 29/82 (35%), Gaps = 19/82 (23%)
Query: 849 ANVKTPERHVRTRLYFTSESHIHSLMNVLRYCNLDESLQGEDSLVCHSALERLYKTKELD 908
++ LYF ++ I L++ L L + L G S E
Sbjct: 261 LVPDQTRLDLKLYLYFAHDTTIAPLLSAL---GLFDQLPGLSS-------------GEPP 304
Query: 909 YMSYIVLRMFENTA---VRISV 927
Y + +V ++++ VR+
Sbjct: 305 YAARLVFELWQDGKEFYVRLLY 326
>gnl|CDD|132717 cd07061, HP_HAP_like, Histidine phosphatase domain found in
histidine acid phosphatases and phytases; contains a His
residue which is phosphorylated during the reaction.
Catalytic domain of HAP (histidine acid phosphatases)
and phytases (myo-inositol hexakisphosphate
phosphohydrolases). The conserved catalytic core of this
domain contains a His residue which is phosphorylated in
the reaction. Functions in this subgroup include roles
in metabolism, signaling, or regulation, for example
Escherichia coli glucose-1-phosphatase functions to
scavenge glucose from glucose-1-phosphate and the
signaling molecules inositol 1,3,4,5,6-pentakisphosphate
(InsP5) and inositol hexakisphosphate (InsP6) are in
vivo substrates for eukaryotic multiple inositol
polyphosphate phosphatase 1 (Minpp1). Phytases scavenge
phosphate from extracellular sources and are added to
animal feed while prostatic acid phosphatase (PAP) has
been used for many years as a serum marker for prostate
cancer. Recently PAP has been shown in mouse models to
suppress pain by functioning as an
ecto-5prime-nucleotidase. In vivo it dephosphorylates
extracellular adenosine monophosphate (AMP) generating
adenosine,and leading to the activation of A1-adenosine
receptors in dorsal spinal cord.
Length = 242
Score = 56.2 bits (136), Expect = 1e-08
Identities = 22/65 (33%), Positives = 31/65 (47%)
Query: 509 GLLRLHSTYRHDLKIYSSDEGRVQMSAAAFAKGLLDLEGQLTPILVSLVSKDSSMLDGLD 568
LL LHS R DL I SSD R SA AF GL +G + ++ ++ + + D
Sbjct: 40 ELLLLHSYNRSDLYIRSSDSQRTLQSAQAFLAGLFPPDGWQPIAVHTIPEEEDDVSNLFD 99
Query: 569 NASIE 573
+ E
Sbjct: 100 LCAYE 104
Score = 37.0 bits (86), Expect = 0.027
Identities = 9/16 (56%), Positives = 10/16 (62%)
Query: 355 ELRCVIAVMRHGDRTP 370
EL V + RHGDR P
Sbjct: 1 ELEQVQVLSRHGDRYP 16
Score = 34.7 bits (80), Expect = 0.16
Identities = 14/81 (17%), Positives = 25/81 (30%), Gaps = 10/81 (12%)
Query: 839 ETQYRLDPKYANVKTPERHVRTRLYFTSESHIHSLMNVLRYCNLDESLQGEDSLVCHSAL 898
E RL + R + SH +++ +L L + +
Sbjct: 155 ELLARLTNG---PSGSQTFPLDRKLYLYFSHDTTILPLLTALGLFDFAEPLP-------P 204
Query: 899 ERLYKTKELDYMSYIVLRMFE 919
+ L E DY + +FE
Sbjct: 205 DFLRGFSESDYPPFAARLVFE 225
>gnl|CDD|223267 COG0189, RimK, Glutathione synthase/Ribosomal protein S6
modification enzyme (glutaminyl transferase) [Coenzyme
metabolism / Translation, ribosomal structure and
biogenesis].
Length = 318
Score = 51.6 bits (124), Expect = 8e-07
Identities = 62/296 (20%), Positives = 100/296 (33%), Gaps = 59/296 (19%)
Query: 11 VCVMEKKVFSAPMGQILDRLQAFGEFEVIHFGDKVILEDPIEKWPICDCLI-----AFYS 65
+ +++ S G+I + EVI ++I E+ + D +I F
Sbjct: 33 LAILDDGDLSLRKGEIKALARLVEVGEVIGLHYELIEEEDLSLLDELDVIIMRKDPPFDF 92
Query: 66 SGYPLEKAESYATLRKPFLVNELEPQHLLHDRRKVYEQ--LEKYGIPVPRYALVNREVPY 123
+ L AE ++N+ PQ + R K+Y L K GIPVP +
Sbjct: 93 ATRFLRLAERKGV----PVIND--PQSIRRCRNKLYTTQLLAKAGIPVPPTLITRDPD-- 144
Query: 124 QELDYFIEEEDFVEVHGNRFWKPFVEKPVHGDDHSIMIYYPSSAGGGMKELFRKVGNRSS 183
E F+ E P V KP+ G S G G+ V +
Sbjct: 145 -EAAEFVAE---------HLGFPVVLKPLDG-----------SGGRGVF----LVEDADP 179
Query: 184 EF--HPDVRRVRREGSYIYEEFMPTGGTDVKVYTVGPEYAHAEARKSPVVDGVVMRNPDG 241
E + I +E++P D + VG V + R P
Sbjct: 180 ELLSLLETLTQEGRKLIIVQEYIPKAKRDDRRVLVG--------GGEVVAIYALARIPAS 231
Query: 242 KEVR---------YPVLLTPNEKQMAREVCIAFRQAVCGFDLLRCEGRSYVCDVNG 288
+ R P LT E+++A + A + G D++ + YV +VN
Sbjct: 232 GDFRSNLARGGRAEPCELTEEEEELAVKAAPALGLGLVGVDIIEDKDGLYVTEVNV 287
>gnl|CDD|117020 pfam08443, RimK, RimK-like ATP-grasp domain. This ATP-grasp domain
is found in the ribosomal S6 modification enzyme RimK.
Length = 190
Score = 45.9 bits (109), Expect = 2e-05
Identities = 50/206 (24%), Positives = 77/206 (37%), Gaps = 34/206 (16%)
Query: 96 DRRKVYEQLEKYGIPVPRYALVNREVPYQELDYFIEEEDFVEVHGNRFWKPFVEKPVHGD 155
D+ K ++ L K+GIPVP L + E F+E F P V K V G
Sbjct: 3 DKAKSHQLLAKHGIPVPNTGLAWSPE---------DAEKFIE-QIKGF--PVVVKSVFGS 50
Query: 156 --DHSIMIYYPSSAGGGMKELFRKVGNRSSEFHPDVRRVRREGSYIYEEFMPT-GGTDVK 212
+ S + E F+ + N + +EF+ G D++
Sbjct: 51 QGIGVFLAEDEQSLEQLL-EAFKWLKN----------------QILVQEFIAEAGNRDIR 93
Query: 213 VYTVGPEYAHAEARKSPVVDGVVMRNPDGKEVRYPVLLTPNEKQMAREVCIAFRQAVCGF 272
VG E A R+S +G N V P L+ E+++A + A V G
Sbjct: 94 CLVVGGEVVGAIHRQSN--EGDFRTNLHRGGVAEPYQLSQEEEEIAIKAAQAMGLDVLGV 151
Query: 273 DLLRCEGRSYVCDVNGWSFVKNSYKY 298
D+ R + VC+VN +K +
Sbjct: 152 DIFRSKRGLLVCEVNSSPGLKGIERT 177
>gnl|CDD|233121 TIGR00768, rimK_fam, alpha-L-glutamate ligases, RimK family. This
family, related to bacterial glutathione synthetases,
contains at least two different alpha-L-glutamate
ligases. One is RimK, as in E. coli, which adds
additional Glu residues to the native Glu-Glu C-terminus
of ribosomal protein S6, but not to Lys-Glu mutants.
Most species with a member of this subfamily lack an S6
homolog ending in Glu-Glu, however. Members in
Methanococcus jannaschii act instead as a
tetrahydromethanopterin:alpha-l-glutamate ligase
(MJ0620) and a gamma-F420-2:alpha-l-glutamate ligase
(MJ1001).
Length = 277
Score = 46.2 bits (110), Expect = 4e-05
Identities = 63/294 (21%), Positives = 97/294 (32%), Gaps = 66/294 (22%)
Query: 26 ILDRLQAFG-EFEVIHFGDKVI-LEDP---IEKWPICDCLIAFYSSG-YPLEKAESYATL 79
+ + + G +++V+ + + + + + I G ES
Sbjct: 16 LKEAAEELGIDYKVVTPPAIPLTFNEGPRELAELDVVIVRIVSMFRGLAVARYLESLGV- 74
Query: 80 RKPFLVNELEPQHLLHDRRKV--YEQLEKYGIPVPRYALVNREVPYQELDYFIEEEDFVE 137
++N +L+ K + L K G+P PR L E IEE F
Sbjct: 75 ---PVINS--SDAILNAGDKFLTSQLLAKAGLPQPRTGLAGSPE---EALKLIEEIGF-- 124
Query: 138 VHGNRFWKPFVEKPVHGD--DHSIMIYYPSSAGGGMKELFRKVGNRSSEFHPDVRRVRRE 195
P V KPV G + +A + E F ++ + F
Sbjct: 125 --------PVVLKPVFGSWGRLVSLARDKQAAETLL-EHFEQLNGPQNLF---------- 165
Query: 196 GSYIYEEFMPTGGTDVKVYTVGPEYAHAEARKSPVVDGVVMRNPD----------GKEVR 245
Y+ E GG D++V+ VG E V + R GK
Sbjct: 166 --YVQEYIKKPGGRDIRVFVVGDE-----------VIAAIYRITSGHWRTNLARGGKAEP 212
Query: 246 YPVLLTPNEKQMAREVCIAFRQAVCGFDLLRCEGRSY-VCDVNGWSFVKNSYKY 298
P LT +++A + A V G DLL E R V +VN KNS K
Sbjct: 213 CP--LTEEIEELAIKAAKALGLDVVGIDLLESEDRGLLVNEVNPNPEFKNSVKT 264
>gnl|CDD|132716 cd07040, HP, Histidine phosphatase domain found in a functionally
diverse set of proteins, mostly phosphatases; contains a
His residue which is phosphorylated during the reaction.
Catalytic domain of a functionally diverse set of
proteins, most of which are phosphatases. The conserved
catalytic core of this domain contains a His residue
which is phosphorylated in the reaction. This set of
proteins includes cofactor-dependent and
cofactor-independent phosphoglycerate mutases (dPGM, and
BPGM respectively), fructose-2,6-bisphosphatase
(F26BP)ase, Sts-1, SixA, histidine acid phosphatases,
phytases, and related proteins. Functions include roles
in metabolism, signaling, or regulation, for example
F26BPase affects glycolysis and gluconeogenesis through
controlling the concentration of F26BP; BPGM controls
the concentration of 2,3-BPG (the main allosteric
effector of hemoglobin in human blood cells); human
Sts-1 is a T-cell regulator; Escherichia coli Six A
participates in the ArcB-dependent His-to-Asp
phosphorelay signaling system; phytases scavenge
phosphate from extracellular sources. Deficiency and
mutation in many of the human members result in disease,
for example erythrocyte BPGM deficiency is a disease
associated with a decrease in the concentration of
2,3-BPG. Clinical applications include the use of
prostatic acid phosphatase (PAP) as a serum marker for
prostate cancer. Agricultural applications include the
addition of phytases to animal feed.
Length = 153
Score = 39.7 bits (93), Expect = 0.002
Identities = 10/33 (30%), Positives = 14/33 (42%)
Query: 512 RLHSTYRHDLKIYSSDEGRVQMSAAAFAKGLLD 544
L Y +IYSS R +A +GL +
Sbjct: 40 ALRERYIKFDRIYSSPLKRAIQTAEIILEGLFE 72
>gnl|CDD|222206 pfam13535, ATP-grasp_4, ATP-grasp domain. This family includes a
diverse set of enzymes that possess ATP-dependent
carboxylate-amine ligase activity.
Length = 183
Score = 36.9 bits (86), Expect = 0.020
Identities = 42/224 (18%), Positives = 67/224 (29%), Gaps = 76/224 (33%)
Query: 93 LLHDRRKVYEQLEKYGIPVPRYALVNREVPYQELDYFIEEEDFVEVHGNRFWKPFVEKPV 152
D+ + E L G+PVP + LV+ ++LD EE F P V KP
Sbjct: 1 AFRDKALMRELLRAAGLPVPPFFLVD---DEEDLDAAAEEIGF----------PVVLKPR 47
Query: 153 HGDDHSIMIYYPSSAGGGMKELFRKVGNRS---SEFHPDVRRVRREGSYIYEEFMPTGGT 209
G S G +V + + + V Y+ EE++
Sbjct: 48 DGA---------GSLG------VFRVDSAAELEAALAALAAEVEDTREYLVEEYID---- 88
Query: 210 DVKVYTVGPEYAHAEARKSPVVDGVV------------------MRNPDGKEVRYPVLLT 251
G EY VDG+V +G E+
Sbjct: 89 -------GDEYH---------VDGLVDDGELVFLGVSRYLGPPPPDFSEGVELGSVSPGE 132
Query: 252 PNEKQMAREVC------IAFRQAVCGFDLLRC-EGRSYVCDVNG 288
+ RE+ + R V + +GR + ++N
Sbjct: 133 DPLPEALRELAERVLKALGLRNGVFHLEFFLTPDGRPVLLEINP 176
>gnl|CDD|165256 PHA02946, PHA02946, ankyin-like protein; Provisional.
Length = 446
Score = 35.0 bits (80), Expect = 0.18
Identities = 30/96 (31%), Positives = 47/96 (48%), Gaps = 11/96 (11%)
Query: 678 SEGFLLMYARWRKLERDLYNERKERFDITQIPDVYDSCKYDLLHNAHLNLEGLDELFKVA 737
+E +L +YA++ D++ + + S Y +LH A+ ++GLDE F V
Sbjct: 7 AEYYLSLYAKYNSKNLDVFR--------NMLQAIEPSGNYHILH-AYCGIKGLDERF-VE 56
Query: 738 QLLADGVIPNEYGINPKQKLKIGSKI-ARRLLGKLL 772
+LL G PNE + L I SKI R++ LL
Sbjct: 57 ELLHRGYSPNETDDDGNYPLHIASKINNNRIVAMLL 92
>gnl|CDD|181145 PRK07850, PRK07850, acetyl-CoA acetyltransferase; Provisional.
Length = 387
Score = 33.5 bits (77), Expect = 0.53
Identities = 57/195 (29%), Positives = 81/195 (41%), Gaps = 41/195 (21%)
Query: 527 DEGRVQMSAAAFAKGLLDLEGQLTPILVSLVSKDSSMLDGLDNASIEMEEAKARLNEIIK 586
EGR + +LD EGQ T LV++D GL + ++E A L +++
Sbjct: 181 AEGRFDREISPVQAPVLDEEGQPTGETR-LVTRDQ----GLRDTTME---GLAGLKPVLE 232
Query: 587 SGSKMIH--------SNGSSDCPWM----ADGVGLPPNA---------SE---LLPKLVK 622
G IH S+G++ WM A +GL P A +E L V+
Sbjct: 233 GG---IHTAGTSSQISDGAAAVLWMDEDRARALGLRPRARIVAQALVGAEPYYHLDGPVQ 289
Query: 623 LTKKVTEQVRQLAKDEDEDLAETNPY--DVIPPYDQAKA--LGKTNIDVDRIAAGLPCGS 678
T KV E + K D DL E N V+ + Q + K N++ IA G P GS
Sbjct: 290 ATAKVLE--KAGMKIGDIDLVEINEAFASVVLSWAQVHEPDMDKVNVNGGAIALGHPVGS 347
Query: 679 EGFLLMYARWRKLER 693
G L+ +LER
Sbjct: 348 TGARLITTALHELER 362
>gnl|CDD|216956 pfam02274, Amidinotransf, Amidinotransferase. This family contains
glycine (EC:2.1.4.1) and inosamine (EC:2.1.4.2)
amidinotransferases, enzymes involved in creatine and
streptomycin biosynthesis respectively. This family also
includes arginine deiminases, EC:3.5.3.6. These enzymes
catalyze the reaction: arginine + H2O <=> citrulline +
NH3. Also found in this family is the Streptococcus anti
tumour glycoprotein.
Length = 286
Score = 33.0 bits (76), Expect = 0.53
Identities = 13/51 (25%), Positives = 20/51 (39%), Gaps = 4/51 (7%)
Query: 684 MYARWRKLERDLYNERKERFDITQIPDVYDSCKYDLLHNAHLNLEGLDELF 734
M+A R+ ER L ++ V D L + + LEG D +
Sbjct: 81 MFAPNRRRERRLD-ILILLKHPYKVQLVLD---ISELEDENAFLEGGDMVL 127
>gnl|CDD|235876 PRK06849, PRK06849, hypothetical protein; Provisional.
Length = 389
Score = 32.7 bits (75), Expect = 0.90
Identities = 19/73 (26%), Positives = 29/73 (39%), Gaps = 12/73 (16%)
Query: 83 FLVNELEPQHLLHDRRKVYEQLEKYGIPVPRYALVNREVPYQELDYFIEEEDFVEVHGNR 142
L + E LLH++ + EQ G+ VP+ L+ P I F H
Sbjct: 103 VLHFDFELLLLLHNKWEFAEQARSLGLSVPKTYLITD--PEA-----IRNFMFKTPH--- 152
Query: 143 FWKPFVEKPVHGD 155
P+V KP++
Sbjct: 153 --TPYVLKPIYSR 163
>gnl|CDD|131199 TIGR02144, LysX_arch, Lysine biosynthesis enzyme LysX. The family
of proteins found in this equivalog include the
characterized LysX from Thermus thermophilus which is
part of a well-organized lysine biosynthesis gene
cluster. LysX is believed to carry out an ATP-dependent
acylation of the amino group of alpha-aminoadipate in
the prokaryotic version of the fungal AAA lysine
biosynthesis pathway. No species having a sequence in
this equivalog contains the elements of the more common
diaminopimelate lysine biosythesis pathway, and none has
been shown to be a lysine auxotroph. These sequences
have mainly recieved the name of the related enzyme,
"ribosomal protein S6 modification protein RimK". RimK
has been characterized in E. coli, and acts by
ATP-dependent condensation of S6 with glutamate
residues.
Length = 280
Score = 31.6 bits (72), Expect = 1.6
Identities = 53/275 (19%), Positives = 91/275 (33%), Gaps = 35/275 (12%)
Query: 26 ILDRLQAFG-EFEVIHFGDKVILED--PIEKWPICDCLIAFYSSGYPLEKAESYATLRKP 82
+++ L+ G + I+ + P E + D I S +
Sbjct: 15 LIEELEKLGLPYRKIYVPALPLPFGERPKELEDV-DVAIIRCVSQSRALYSARLLEALGV 73
Query: 83 FLVNELEPQHLLHDRRKVYEQLEKYGIPVPRYALVNREVPYQELDYFIEEEDFVEVHGNR 142
++N D+ Y +L K G+P PR L + E +
Sbjct: 74 PVINSSHVIEACGDKIFTYLKLAKAGVPTPRTYLAFDR---EAALKLAEALGY------- 123
Query: 143 FWKPFVEKPVHGDDHSIMIYYPSSAGGGMKELFRKVGNRSSEF-HPDVRRVRREGSYIYE 201
P V KPV G + G + L R S H +V + + +
Sbjct: 124 ---PVVLKPVIG------------SWGRLVALIRDKDELESLLEHKEVLGGSQHKLFYIQ 168
Query: 202 EFMPTGGTDVKVYTVGPEYAHAEARKSPVVDGVVMRNPDGKEVRYPVLLTPNEKQMAREV 261
E++ G D++V+ +G E A R S + G + P L +++A +
Sbjct: 169 EYINKPGRDIRVFVIGDEAIAAIYRYS---NHWRTNTARGGKA-EPCPLDEEVEELAVKA 224
Query: 262 CIAFRQAVCGFDLLRCEGRSYVCD-VNGWSFVKNS 295
A V D+ + R + + VN KNS
Sbjct: 225 AEAVGGGVVAIDIFESKERGLLVNEVNHVPEFKNS 259
>gnl|CDD|226304 COG3781, COG3781, Predicted membrane protein [Function unknown].
Length = 306
Score = 30.1 bits (68), Expect = 5.2
Identities = 12/34 (35%), Positives = 17/34 (50%), Gaps = 1/34 (2%)
Query: 764 ARRLLGKLLIDLRN-TREEAISVAELKSSQDQVS 796
AR+L G+L+ +RN R+ V E S D
Sbjct: 82 ARKLWGQLVNAIRNLARQIWTLVPETLDSADVRE 115
>gnl|CDD|239172 cd02771, MopB_NDH-1_NuoG2-N7, MopB_NDH-1_NuoG2-N7: The second
domain of the NuoG subunit (with a [4Fe-4S] cluster, N7)
of the NADH-quinone oxidoreductase/NADH dehydrogenase-1
(NDH-1) found in various bacteria. The NDH-1 is the
first energy-transducting complex in the respiratory
chain and functions as a redox pump that uses the redox
energy to translocate H+ ions across the membrane,
resulting in a significant contribution to energy
production. In Escherichia coli NDH-1, the largest
subunit is encoded by the nuoG gene, and is part of the
14 distinct subunits constituting the functional enzyme.
The NuoG subunit is made of two domains: the first
contains three binding sites for FeS clusters (the fer2
domain), the second domain (this CD), is of unknown
function or, as postulated, has lost an ancestral
formate dehydrogenase activity that became redundant
during the evolution of the complex I enzyme. Unique to
this group, compared to the other prokaryotic and
eukaryotic groups in this domain protein family
(NADH-Q-OR-NuoG2), is an N-terminal [4Fe-4S] cluster
(N7/N1c) present in the second domain. Although only
vestigial sequence evidence remains of a molybdopterin
binding site, this protein domain belongs to the
molybdopterin_binding (MopB) superfamily of proteins.
Length = 472
Score = 30.0 bits (68), Expect = 5.9
Identities = 23/135 (17%), Positives = 39/135 (28%), Gaps = 15/135 (11%)
Query: 535 AAAFAKGLLDLEGQLTPILVSLVSKDSSMLDGLDNASIEMEEAKARLNEIIKSGSKMIHS 594
+AA GL E +K ++ G + S+E+ +A A L + +K +
Sbjct: 242 SAAGVSGLAPKEKAARIAARLTGAKKPLIVSGTLSGSLELIKAAANLAKALKRRGEN--- 298
Query: 595 NGSSDCPWMADGVGLPPNASELLP--KLVKLTKKVTEQVRQLAKDEDED---LAETNPYD 649
N+ LL V + +D D + + Y
Sbjct: 299 -----AGLT--LAVEEGNSPGLLLLGGHVTEPGLDLDGALAALEDGSADALIVLGNDLYR 351
Query: 650 VIPPYDQAKALGKTN 664
P AL
Sbjct: 352 SAPERRVEAALDAAE 366
>gnl|CDD|129443 TIGR00343, TIGR00343, pyridoxal 5'-phosphate synthase, synthase
subunit Pdx1. This protein had been believed to be a
singlet oxygen resistance protein. Subsequent work
showed that it is a protein of pyridoxine (vitamin B6)
biosynthesis, and that pyridoxine quenches the highly
toxic singlet form of oxygen produced by light in the
presence of certain chemicals [Biosynthesis of
cofactors, prosthetic groups, and carriers, Pyridoxine].
Length = 287
Score = 29.7 bits (67), Expect = 6.5
Identities = 28/112 (25%), Positives = 50/112 (44%), Gaps = 22/112 (19%)
Query: 576 EAKARLNEIIKSGSKMIHSNGSSDCPWMADGVGLPPNASELLPKLVKLTKKVTEQVRQLA 635
EA R+NE G+ MI + G + G G N E V+ +K+ E++RQ+
Sbjct: 126 EALRRINE----GAAMIRTKGEA-------GTG---NIVEA----VRHMRKINEEIRQIQ 167
Query: 636 KDEDEDLAETNPYDVIPPYD---QAKALGKTNIDVDRIAAGLPCGSEGFLLM 684
+E+ ++ P + + LGK + V+ A G+ ++ L+M
Sbjct: 168 NMLEEEDLAAVAKELRVPVELLLEVLKLGKLPV-VNFAAGGVATPADAALMM 218
>gnl|CDD|151671 pfam11229, DUF3028, Protein of unknown function (DUF3028). This
eukaryotic family of proteins has no known function.
Length = 589
Score = 30.0 bits (67), Expect = 6.5
Identities = 19/68 (27%), Positives = 36/68 (52%), Gaps = 10/68 (14%)
Query: 553 LVSLVSKDSSMLDGLDNASIEMEEAKARLNEIIKSGSKMIHSNGSSDCPWMADGVGLPPN 612
+V+LV + ++ L + +I+ + RLNE+I++ +++I +G +GL N
Sbjct: 68 MVALVGSEGDVMQ-LKSEAIQNSHFQGRLNEVIRTLTQVISVSGV---------IGLQSN 117
Query: 613 ASELLPKL 620
A LL L
Sbjct: 118 AIWLLGHL 125
>gnl|CDD|223292 COG0214, SNZ1, Pyridoxine biosynthesis enzyme [Coenzyme
metabolism].
Length = 296
Score = 29.6 bits (67), Expect = 6.9
Identities = 25/103 (24%), Positives = 45/103 (43%), Gaps = 28/103 (27%)
Query: 576 EAKARLNEIIKSGSKMIHSNGSSDCPWMADGVGLPPNASELLPKLVKLTKKVTEQVRQLA 635
EA R++E G+ MI + G + G G N E V+ +K+ ++R+L
Sbjct: 136 EALRRISE----GAAMIRTKGEA-------GTG---NVVEA----VRHMRKINGEIRRLQ 177
Query: 636 KDEDEDLAE-----TNPYDVIPPYDQAKALGKTNIDVDRIAAG 673
+++L PY+++ + LG+ + V AAG
Sbjct: 178 SMTEDELYVVAKELQAPYELV---KEVAKLGR--LPVVNFAAG 215
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.136 0.402
Gapped
Lambda K H
0.267 0.0747 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 48,071,343
Number of extensions: 4876877
Number of successful extensions: 4637
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4625
Number of HSP's successfully gapped: 37
Length of query: 927
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 821
Effective length of database: 6,236,078
Effective search space: 5119820038
Effective search space used: 5119820038
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (28.4 bits)