RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy10158
(1266 letters)
>gnl|CDD|215860 pfam00328, His_Phos_2, Histidine phosphatase superfamily (branch
2). The histidine phosphatase superfamily is so named
because catalysis centres on a conserved His residue
that is transiently phosphorylated during the catalytic
cycle. Other conserved residues contribute to a
'phosphate pocket' and interact with the phospho group
of substrate before, during and after its transfer to
the His residue. Structure and sequence analyses show
that different families contribute different additional
residues to the 'phosphate pocket' and, more
surprisingly, differ in the position, in sequence and in
three dimensions, of a catalytically essential acidic
residue. The superfamily may be divided into two main
branches.The smaller branch 2 contains predominantly
eukaryotic proteins. The catalytic functions in members
include phytase, glucose-1-phosphatase and multiple
inositol polyphosphate phosphatase. The in vivo roles of
the mammalian acid phosphatases in branch 2 are not
fully understood, although activity against
lysophosphatidic acid and tyrosine-phosphorylated
proteins has been demonstrated.
Length = 327
Score = 147 bits (374), Expect = 6e-39
Identities = 79/392 (20%), Positives = 124/392 (31%), Gaps = 104/392 (26%)
Query: 513 EPSLVLILKWGGELTPAGRIQAEELGRVFRCMYPGGQGNGLGLLRLHSTFRHDLKIYASD 572
+ L WG LTP GR QA ELGR R Y G LL ++ I +SD
Sbjct: 40 LNKVQWPLGWGQ-LTPRGRAQAFELGRYLRQRYVG-------LLPDGYP-PSEVYIRSSD 90
Query: 573 EGRVQMTAAAFAKGLLALEGELTPILVQMVKSANTNGLLDNDSDASKHQNIVKAKLHDLL 632
R +A AF GL EG D D+D Q I L ++
Sbjct: 91 SNRTLASAQAFLAGLFPPEG-------------------DIDNDLLDWQPIPVVTLPEVK 131
Query: 633 QRDRTFTPEDRDKVNPCNATSINIAMDFVKNPVQCCKRIHELIHVLQHIIQKKLEDVKCK 692
+ D C A D + + +L+ V I K+L +
Sbjct: 132 KALANLLLLGYD---SCP------AFDESLVEKRVDPELAKLLAVYLEPIAKRLSQLLPG 182
Query: 693 ESSLYHGESWELMGRRWSKIEKDFCMKNYKYDISKIPDIYDCIKYDLQHNQHTVQFDQAE 752
E++L W+ ++ N D S PD++ YD H + E
Sbjct: 183 ETNL-------TGLDVWALLDLCLFETN-ASDNSPFPDLFTG--YDALHLE---YLSDLE 229
Query: 753 ELYLNAKYMADIVIPQEYGMTMSEKLTISQGICVPLLKKIRADLQRNVEESEENVNRLNP 812
E Y + I + LLK++ L +
Sbjct: 230 EYYGLSGI------------GPELAKLIGGPLLNELLKRLTNALVPDQT----------- 266
Query: 813 QYSHGVSSPGRHVRTRLYFTSESHIHSLLTVLRYGGLTESVHMNDEQWMRAMEYVSMVSE 872
++ LYF ++ I LL+ L GL + +
Sbjct: 267 ---------RLDLKLYLYFAHDTTIAPLLSAL---GLFDQLPGLSSGE------------ 302
Query: 873 LNYMSQVVIMLYEDPTKDPTSDERFHIELHFS 904
Y +++V L++D + F++ L ++
Sbjct: 303 PPYAARLVFELWQD-------GKEFYVRLLYN 327
Score = 50.5 bits (121), Expect = 3e-06
Identities = 13/41 (31%), Positives = 19/41 (46%)
Query: 385 ELRCVVAVIRHGDRTPKQKMKVEVRHPKFFEIFYKYGGQND 425
EL V V RHGDRTP + + + F + K+ +
Sbjct: 1 ELEFVQVVSRHGDRTPTKSLPKDPEAWPFKLLSEKHNDFLN 41
>gnl|CDD|132717 cd07061, HP_HAP_like, Histidine phosphatase domain found in
histidine acid phosphatases and phytases; contains a His
residue which is phosphorylated during the reaction.
Catalytic domain of HAP (histidine acid phosphatases)
and phytases (myo-inositol hexakisphosphate
phosphohydrolases). The conserved catalytic core of this
domain contains a His residue which is phosphorylated in
the reaction. Functions in this subgroup include roles
in metabolism, signaling, or regulation, for example
Escherichia coli glucose-1-phosphatase functions to
scavenge glucose from glucose-1-phosphate and the
signaling molecules inositol 1,3,4,5,6-pentakisphosphate
(InsP5) and inositol hexakisphosphate (InsP6) are in
vivo substrates for eukaryotic multiple inositol
polyphosphate phosphatase 1 (Minpp1). Phytases scavenge
phosphate from extracellular sources and are added to
animal feed while prostatic acid phosphatase (PAP) has
been used for many years as a serum marker for prostate
cancer. Recently PAP has been shown in mouse models to
suppress pain by functioning as an
ecto-5prime-nucleotidase. In vivo it dephosphorylates
extracellular adenosine monophosphate (AMP) generating
adenosine,and leading to the activation of A1-adenosine
receptors in dorsal spinal cord.
Length = 242
Score = 84.0 bits (208), Expect = 8e-18
Identities = 36/90 (40%), Positives = 44/90 (48%), Gaps = 8/90 (8%)
Query: 524 GELTPAGRIQAEELGRVFRCMYPGGQGNGLGLLRLHSTFRHDLKIYASDEGRVQMTAAAF 583
GELTP GR QA ELGR FR Y LL LHS R DL I +SD R +A AF
Sbjct: 17 GELTPFGRQQAFELGRYFRQRYGE-------LLLLHSYNRSDLYIRSSDSQRTLQSAQAF 69
Query: 584 AKGLLALEG-ELTPILVQMVKSANTNGLLD 612
GL +G + + + + + L D
Sbjct: 70 LAGLFPPDGWQPIAVHTIPEEEDDVSNLFD 99
Score = 50.5 bits (121), Expect = 1e-06
Identities = 32/182 (17%), Positives = 58/182 (31%), Gaps = 34/182 (18%)
Query: 724 DISKIPDIYDCIKYDLQHNQHTVQFDQ---AEELYLNAKYMADIVIPQEYGMTMSEKLTI 780
+ + +++D Y+ ++ F EE + YG L
Sbjct: 90 EEDDVSNLFDLCAYETVAKGYSAPFCDLFTEEEWVKLEYLN---DLKFYYGYGPGNPLAR 146
Query: 781 SQGICVPLLKKIRADLQRNVEESEENVNRLNPQYSHGVSSPGRHVRTRLYFTSESHIHSL 840
+QG PLL ++ + RL +G S R + SH ++
Sbjct: 147 AQGS--PLLNEL--------------LARLT----NGPSGSQTFPLDRKLYLYFSHDTTI 186
Query: 841 LTVLRYGGLTESVHMNDEQWMRAMEYVSMVSELNYMSQVVIMLYEDP--TKDPTSDERFH 898
L +L GL + ++R SE +Y +++E D S R
Sbjct: 187 LPLLTALGLFDFAEPLPPDFLRG------FSESDYPPFAARLVFELWRCPGDGESYVRVL 240
Query: 899 IE 900
+
Sbjct: 241 VN 242
Score = 38.1 bits (89), Expect = 0.018
Identities = 9/16 (56%), Positives = 10/16 (62%)
Query: 385 ELRCVVAVIRHGDRTP 400
EL V + RHGDR P
Sbjct: 1 ELEQVQVLSRHGDRYP 16
>gnl|CDD|132716 cd07040, HP, Histidine phosphatase domain found in a functionally
diverse set of proteins, mostly phosphatases; contains a
His residue which is phosphorylated during the reaction.
Catalytic domain of a functionally diverse set of
proteins, most of which are phosphatases. The conserved
catalytic core of this domain contains a His residue
which is phosphorylated in the reaction. This set of
proteins includes cofactor-dependent and
cofactor-independent phosphoglycerate mutases (dPGM, and
BPGM respectively), fructose-2,6-bisphosphatase
(F26BP)ase, Sts-1, SixA, histidine acid phosphatases,
phytases, and related proteins. Functions include roles
in metabolism, signaling, or regulation, for example
F26BPase affects glycolysis and gluconeogenesis through
controlling the concentration of F26BP; BPGM controls
the concentration of 2,3-BPG (the main allosteric
effector of hemoglobin in human blood cells); human
Sts-1 is a T-cell regulator; Escherichia coli Six A
participates in the ArcB-dependent His-to-Asp
phosphorelay signaling system; phytases scavenge
phosphate from extracellular sources. Deficiency and
mutation in many of the human members result in disease,
for example erythrocyte BPGM deficiency is a disease
associated with a decrease in the concentration of
2,3-BPG. Clinical applications include the use of
prostatic acid phosphatase (PAP) as a serum marker for
prostate cancer. Agricultural applications include the
addition of phytases to animal feed.
Length = 153
Score = 56.3 bits (136), Expect = 4e-09
Identities = 20/66 (30%), Positives = 24/66 (36%), Gaps = 17/66 (25%)
Query: 522 WGGELTPAGRIQAEELGRVFRCMYPGGQGNGLGLLRLHSTFRHDLKIYASDEGRVQMTAA 581
G LT GR QA ELG+ R Y +IY+S R TA
Sbjct: 22 GDGPLTEKGRQQARELGKALRERYI-----------------KFDRIYSSPLKRAIQTAE 64
Query: 582 AFAKGL 587
+GL
Sbjct: 65 IILEGL 70
>gnl|CDD|132718 cd07067, HP_PGM_like, Histidine phosphatase domain found in
phosphoglycerate mutases and related proteins, mostly
phosphatases; contains a His residue which is
phosphorylated during the reaction. Subgroup of the
catalytic domain of a functionally diverse set of
proteins, most of which are phosphatases. The conserved
catalytic core of this domain contains a His residue
which is phosphorylated in the reaction. This subgroup
contains cofactor-dependent and cofactor-independent
phosphoglycerate mutases (dPGM, and BPGM respectively),
fructose-2,6-bisphosphatase (F26BP)ase, Sts-1, SixA, and
related proteins. Functions include roles in metabolism,
signaling, or regulation, for example, F26BPase affects
glycolysis and gluconeogenesis through controlling the
concentration of F26BP; BPGM controls the concentration
of 2,3-BPG (the main allosteric effector of hemoglobin
in human blood cells); human Sts-1 is a T-cell
regulator; Escherichia coli Six A participates in the
ArcB-dependent His-to-Asp phosphorelay signaling system.
Deficiency and mutation in many of the human members
result in disease, for example erythrocyte BPGM
deficiency is a disease associated with a decrease in
the concentration of 2,3-BPG.
Length = 153
Score = 41.5 bits (98), Expect = 6e-04
Identities = 21/74 (28%), Positives = 26/74 (35%), Gaps = 17/74 (22%)
Query: 525 ELTPAGRIQAEELGRVFRCMYPGGQGNGLGLLRLHSTFRHDLKIYASDEGRVQMTAAAFA 584
LT GR QA LG+ RL +IY+S R TA
Sbjct: 25 PLTEKGREQARALGK-----------------RLKELGIKFDRIYSSPLKRAIQTAEIIL 67
Query: 585 KGLLALEGELTPIL 598
+ L L E+ P L
Sbjct: 68 EELPGLPVEVDPRL 81
>gnl|CDD|214859 smart00855, PGAM, Phosphoglycerate mutase family. Phosphoglycerate
mutase (PGAM) and bisphosphoglycerate mutase (BPGM) are
structurally related enzymes that catalyse reactions
involving the transfer of phospho groups between the
three carbon atoms of phosphoglycerate... Both enzymes
can catalyse three different reactions with different
specificities, the isomerization of 2-phosphoglycerate
(2-PGA) to 3-phosphoglycerate (3-PGA) with
2,3-diphosphoglycerate (2,3-DPG) as the primer of the
reaction, the synthesis of 2,3-DPG from 1,3-DPG with
3-PGA as a primer and the degradation of 2,3-DPG to
3-PGA (phosphatase activity). In mammals, PGAM is a
dimeric protein with two isoforms, the M (muscle) and B
(brain) forms. In yeast, PGAM is a tetrameric protein.
Length = 158
Score = 37.4 bits (87), Expect = 0.016
Identities = 21/60 (35%), Positives = 24/60 (40%), Gaps = 16/60 (26%)
Query: 525 ELTPAGRIQAEELGRVFRCMYPGGQGNGLGLLRLHSTFRHDLKIYASDEGRVQMTAAAFA 584
LT GR QAE LGR LL R D +Y+S R + TA A A
Sbjct: 25 PLTELGRAQAEALGR---------------LLASLLLPRFD-VVYSSPLKRARQTAEALA 68
>gnl|CDD|233121 TIGR00768, rimK_fam, alpha-L-glutamate ligases, RimK family. This
family, related to bacterial glutathione synthetases,
contains at least two different alpha-L-glutamate
ligases. One is RimK, as in E. coli, which adds
additional Glu residues to the native Glu-Glu C-terminus
of ribosomal protein S6, but not to Lys-Glu mutants.
Most species with a member of this subfamily lack an S6
homolog ending in Glu-Glu, however. Members in
Methanococcus jannaschii act instead as a
tetrahydromethanopterin:alpha-l-glutamate ligase
(MJ0620) and a gamma-F420-2:alpha-l-glutamate ligase
(MJ1001).
Length = 277
Score = 37.3 bits (87), Expect = 0.035
Identities = 59/296 (19%), Positives = 100/296 (33%), Gaps = 57/296 (19%)
Query: 61 KSQSKPMKEILTRLE-EFEFIKM----IVFSEETIQKPVDEWPIVDCLISFHSKGFPLEK 115
+ K +KE L +++ + + F+E + + E +V I +G
Sbjct: 10 RLDEKMLKEAAEELGIDYKVVTPPAIPLTFNEG--PRELAELDVVIVRIVSMFRGL---A 64
Query: 116 AIKYANLRKPFVINNLNMQYDIQDRRKVYALLEKEGIEIPRYAVLDRESPDPVKHELVES 175
+Y VIN+ + + D+ LL K G+ PR + +
Sbjct: 65 VARYLESLGVPVINSSDAILNAGDKFLTSQLLAKAGLPQPRTGL---------------A 109
Query: 176 EDHVEVNGIV--FNKPFVEKPVSAEDHNIYIYYPTSAGGGSQRLFRKIGSRSSVYSPESR 233
E ++ P V KPV S G K + + + E
Sbjct: 110 GSPEEALKLIEEIGFPVVLKPVFG-----------SWGRLVSLARDKQAAETLLEHFEQL 158
Query: 234 VRKSGSFIYEDFMPTDGT-DVKVYTVGPDYAHAEARKSP-------ALDGKVERDSEGKE 285
F ++++ G D++V+ VG + A R + A GK E
Sbjct: 159 NGPQNLFYVQEYIKKPGGRDIRVFVVGDEVIAAIYRITSGHWRTNLARGGKAE------- 211
Query: 286 IRYPVILSNAEKLISRKVCLAFKQTVCGFDLLRANGKSF-VCDVNGFSFVKNSNKY 340
P L+ + ++ K A V G DLL + + V +VN KNS K
Sbjct: 212 ---PCPLTEEIEELAIKAAKALGLDVVGIDLLESEDRGLLVNEVNPNPEFKNSVKT 264
>gnl|CDD|215845 pfam00300, His_Phos_1, Histidine phosphatase superfamily (branch
1). The histidine phosphatase superfamily is so named
because catalysis centres on a conserved His residue
that is transiently phosphorylated during the catalytic
cycle. Other conserved residues contribute to a
'phosphate pocket' and interact with the phospho group
of substrate before, during and after its transfer to
the His residue. Structure and sequence analyses show
that different families contribute different additional
residues to the 'phosphate pocket' and, more
surprisingly, differ in the position, in sequence and in
three dimensions, of a catalytically essential acidic
residue. The superfamily may be divided into two main
branches. The larger branch 1 contains a wide variety of
catalytic functions, the best known being fructose
2,6-bisphosphatase (found in a bifunctional protein with
2-phosphofructokinase) and cofactor-dependent
phosphoglycerate mutase. The latter is an unusual
example of a mutase activity in the superfamily: the
vast majority of members appear to be phosphatases. The
bacterial regulatory protein phosphatase SixA is also in
branch 1 and has a minimal, and possible ancestral-like
structure, lacking the large domain insertions that
contribute to binding of small molecules in branch 1
members.
Length = 154
Score = 35.1 bits (81), Expect = 0.072
Identities = 17/62 (27%), Positives = 22/62 (35%), Gaps = 19/62 (30%)
Query: 526 LTPAGRIQAEELGRVFRCMYPGGQGNGLGLLRLHSTFRHDLKIYASDEGRVQMTAAAFAK 585
LT GR QA LG+ + D +IY+S R TA A+
Sbjct: 25 LTELGREQARALGKRLK------------------GIPFD-RIYSSPLLRAIQTAEILAE 65
Query: 586 GL 587
L
Sbjct: 66 AL 67
>gnl|CDD|117020 pfam08443, RimK, RimK-like ATP-grasp domain. This ATP-grasp domain
is found in the ribosomal S6 modification enzyme RimK.
Length = 190
Score = 35.5 bits (82), Expect = 0.076
Identities = 45/214 (21%), Positives = 71/214 (33%), Gaps = 51/214 (23%)
Query: 139 DRRKVYALLEKEGIEIPRYAVLDRESPDPVKHE-LVESEDHVEVNGIVFNKPFVEKPVSA 197
D+ K + LL K GI +P + + P E +E ++ G P V K V
Sbjct: 3 DKAKSHQLLAKHGIPVPNTGL----AWSPEDAEKFIE-----QIKGF----PVVVKSVF- 48
Query: 198 EDHNIYIYYPTSAGGGSQ-RLFRKIGSRSSVYSPESRVRKSGSFIY-EDFMPTDGT-DVK 254
GSQ S+ + + I ++F+ G D++
Sbjct: 49 ---------------GSQGIGVFLAEDEQSLEQLLEAFKWLKNQILVQEFIAEAGNRDIR 93
Query: 255 VYTVGPDYA--------HAEARKSPALDGKVERDSEGKEIRYPVILSNAEKLISRKVCLA 306
VG + + R + G E P LS E+ I+ K A
Sbjct: 94 CLVVGGEVVGAIHRQSNEGDFRTNLHRGGVAE----------PYQLSQEEEEIAIKAAQA 143
Query: 307 FKQTVCGFDLLRANGKSFVCDVNGFSFVKNSNKY 340
V G D+ R+ VC+VN +K +
Sbjct: 144 MGLDVLGVDIFRSKRGLLVCEVNSSPGLKGIERT 177
>gnl|CDD|223483 COG0406, phoE, Broad specificity phosphatase PhoE and related
phosphatases [General function prediction only].
Length = 208
Score = 35.5 bits (82), Expect = 0.099
Identities = 19/63 (30%), Positives = 21/63 (33%), Gaps = 17/63 (26%)
Query: 525 ELTPAGRIQAEELGRVFRCMYPGGQGNGLGLLRLHSTFRHDLKIYASDEGRVQMTAAAFA 584
LT GR QAE L L F IY+S R Q TA A
Sbjct: 28 PLTEEGRAQAEALAER--------------LAARDIGFDA---IYSSPLKRAQQTAEPLA 70
Query: 585 KGL 587
+ L
Sbjct: 71 EEL 73
>gnl|CDD|165563 PHA03308, PHA03308, transcriptional regulator ICP4; Provisional.
Length = 1463
Score = 36.7 bits (84), Expect = 0.11
Identities = 36/144 (25%), Positives = 57/144 (39%), Gaps = 10/144 (6%)
Query: 892 TSDERFHIELHFSPG----VNCCVQKNLPPGPGFRPHSRNDQKK-NLPRIDQEDTEFYST 946
T D H+ L+ SPG + P P R ++Q+ ++ E
Sbjct: 730 TPDRAAHLLLYGSPGGGDGAEPSAAQESPANPWPRAPPCDEQEPLSVSPYGPEPDRPPDD 789
Query: 947 DAEDNTGSSKSTSDQDSPTSAEGPSVDQSKGKFVLSQPIPITVKDLKRKNSVGDPCPSIV 1006
D E G + +S+ + EG + ++ G L +P V + R N + PCP +
Sbjct: 790 DFETRKGLKRKSSEDHADPIPEGNATKKTCGLQGLPDSLPPAVPETDRDNPLLPPCP--I 847
Query: 1007 APEG---HPYRRPRSPNEEQRSRS 1027
PEG P P+ P E Q +S
Sbjct: 848 TPEGPPCPPREEPQQPQEPQEPQS 871
>gnl|CDD|178268 PLN02662, PLN02662, cinnamyl-alcohol dehydrogenase family protein.
Length = 322
Score = 32.0 bits (73), Expect = 2.2
Identities = 14/27 (51%), Positives = 16/27 (59%), Gaps = 3/27 (11%)
Query: 352 ILRELAPTLHIPWSVPFQLDDPPFVPT 378
IL EL PTL +P DD P+VPT
Sbjct: 263 ILHELYPTLQLPEKCA---DDKPYVPT 286
>gnl|CDD|162743 TIGR02172, Fb_sc_TIGR02172, Fibrobacter succinogenes paralogous
family TIGR02172. This model describes a paralogous
family of five proteins, likely to be enzymes, in the
rumen bacterium Fibrobacter succinogenes S85. Members
show homology to proteins described by pfam01636, a
phosphotransferase enzyme family associated with
resistance to aminoglycoside antibiotics. However,
members of this family score below the current trusted
and noise cutoffs for pfam01636.
Length = 226
Score = 30.9 bits (70), Expect = 3.0
Identities = 23/75 (30%), Positives = 38/75 (50%), Gaps = 9/75 (12%)
Query: 670 RIHELIHVLQHIIQKKLEDVKCKESSLYHGESWELMGRRWSKIEKDFCMKNYKYD----I 725
R+ E+ + + KKL KC S+ +S++ R++ + EKDF K+YK I
Sbjct: 92 RLEEIAKIFAEM-AKKLHSTKCDTSTF---QSYKEKIRKFIE-EKDFVPKDYKEKARAFI 146
Query: 726 SKIPDIYDCIKYDLQ 740
++PD C+ D Q
Sbjct: 147 KEVPDTSTCLHGDFQ 161
>gnl|CDD|227430 COG5099, COG5099, RNA-binding protein of the Puf family,
translational repressor [Translation, ribosomal structure
and biogenesis].
Length = 777
Score = 31.6 bits (72), Expect = 3.6
Identities = 26/142 (18%), Positives = 50/142 (35%), Gaps = 23/142 (16%)
Query: 1095 NLRNMIPSTTNVTALDGFGGVPS-IRPLETL-HNALSLKHLDNFLGKMIRVIPSKTPASS 1152
+ + PS+T L P+ P+ + ++ + + I T +SS
Sbjct: 19 HSKKSPPSSTTSQELMNGNSTPNSFSPIPSKASSSATFTLNLPINNSVNHKI---TSSSS 75
Query: 1153 PPKYPSTPIEHSLSGARLGWSGPPSFVSESGPSS---------------PNALSDYFFKS 1197
+ PS ++S + SG S + E SS + S
Sbjct: 76 SRRKPSGSWSVAISSST---SGSQSLLMELPSSSFNPSTSSRNKSNSALSSTQQGNANSS 132
Query: 1198 GLNTSDSCSSEFSSSRSSVPSS 1219
+S + SS F+S++ +P+
Sbjct: 133 VTLSSSTASSMFNSNKLPLPNP 154
>gnl|CDD|218803 pfam05904, DUF863, Plant protein of unknown function (DUF863).
This family consists of a number of hypothetical
proteins from Arabidopsis thaliana and Oryza sativa. The
function of this family is unknown.
Length = 766
Score = 31.2 bits (70), Expect = 4.1
Identities = 18/60 (30%), Positives = 30/60 (50%), Gaps = 5/60 (8%)
Query: 457 PEIEEKQGKLEQLKGVLEMYGH--FSGINRKVQMKYQPKGRPRGSSSDEEEEDVCKPKEP 514
E+ E ++ L G++ G+ SG+ R+ + K P+GR R + EE +V P P
Sbjct: 677 HEVTED---IQMLGGLMRATGYSWMSGLTRRGRRKGGPRGRRRSVTIINEEVEVSPPPAP 733
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 31.3 bits (71), Expect = 4.3
Identities = 23/82 (28%), Positives = 42/82 (51%), Gaps = 9/82 (10%)
Query: 427 HVKLKKPKQLQEVLDIARMLLTEIENNSADPEIEEKQGKLEQLKGVLEMYGHFSGINRKV 486
+K +K +++Q++L+ + NN +G+L+ L E++ HF+ ++
Sbjct: 67 ELKKQKKQEIQKILEQQNAAIDADMNNKG-------KGRLKYLLQQTEIFAHFAKGDQSA 119
Query: 487 QMKYQPKGRPRGSSSD-EEEED 507
K + KGR R +S EEEED
Sbjct: 120 SAK-KAKGRGRHASKLTEEEED 140
>gnl|CDD|223267 COG0189, RimK, Glutathione synthase/Ribosomal protein S6
modification enzyme (glutaminyl transferase) [Coenzyme
metabolism / Translation, ribosomal structure and
biogenesis].
Length = 318
Score = 30.8 bits (70), Expect = 4.6
Identities = 50/274 (18%), Positives = 85/274 (31%), Gaps = 51/274 (18%)
Query: 71 LTRLEEFEFIKMIVFSEETIQKPVDEWPIVDCLISFHSKGFPLEKAIKYANL---RKPFV 127
L RL E + + + + + D +I K P + A ++ L + V
Sbjct: 51 LARLVEVGEVIGLHYELIEEEDLSLLDEL-DVIIM--RKDPPFDFATRFLRLAERKGVPV 107
Query: 128 INNLNMQYDIQDRRKVYALLEKEGIEIPRYAVLDRESPDPVKHELVESEDHVEVNGIVFN 187
IN+ +++ LL K GI +P + PD E V
Sbjct: 108 INDPQSIRRCRNKLYTTQLLAKAGIPVPPTLIT--RDPDEAA-EFVAE---------HLG 155
Query: 188 KPFVEKPVSAEDHNIYIYYPTSAGGGSQRLFRKIGSRSSVYSPESRVRKSGSFIYEDFMP 247
P V KP+ +GG L I ++++P
Sbjct: 156 FPVVLKPLD------------GSGGRGVFLVEDADPELLSLLETLTQEGRKLIIVQEYIP 203
Query: 248 TDGTDVKVY-----TVGPDYA------HAEARKSPALDGKVERDSEGKEIRYPVILSNAE 296
D + V YA + R + A G+ E P L+ E
Sbjct: 204 KAKRDDRRVLVGGGEVVAIYALARIPASGDFRSNLARGGRAE----------PCELTEEE 253
Query: 297 KLISRKVCLAFKQTVCGFDLLRANGKSFVCDVNG 330
+ ++ K A + G D++ +V +VN
Sbjct: 254 EELAVKAAPALGLGLVGVDIIEDKDGLYVTEVNV 287
>gnl|CDD|219602 pfam07841, DM4_12, DM4/DM12 family. This family contains sequences
derived from hypothetical proteins expressed by two
insect species, D. melanogaster and A. gambiae. The
region in question is approximately 115 amino acid
residues long and contains four highly- conserved
cysteine residues.
Length = 83
Score = 28.4 bits (64), Expect = 4.8
Identities = 10/34 (29%), Positives = 19/34 (55%), Gaps = 2/34 (5%)
Query: 1125 HNALSLKHLDNFLGKMIRVI--PSKTPASSPPKY 1156
L L + LG+++R++ PS++ PP+Y
Sbjct: 30 AAQLPLDEHNGLLGELLRIVFTPSESEDELPPEY 63
>gnl|CDD|218107 pfam04484, DUF566, Family of unknown function (DUF566). Family of
related proteins that is plant specific.
Length = 313
Score = 30.7 bits (69), Expect = 4.9
Identities = 17/76 (22%), Positives = 26/76 (34%), Gaps = 1/76 (1%)
Query: 1145 PSKTPASSPPKYPSTPIEHSLSGARLGWSGPPSFVSESGPSSPNALSDYFFKSGLNTSDS 1204
+ +P SS + S+ + + +R P+ S + N S F S
Sbjct: 13 DASSPRSSSRRRLSSSFLSTSASSRPRRLNAPAS-PPSSSPARNTSSSSSFGLSKQRPSS 71
Query: 1205 CSSEFSSSRSSVPSSE 1220
S SSR PS
Sbjct: 72 LSRGRLSSRFVSPSRG 87
>gnl|CDD|232869 TIGR00197, yjeF_nterm, yjeF N-terminal region. The protein region
corresponding to This model shows no clear homology to
any protein of known function. This model is built on
yeast protein YNL200C and the N-terminal regions of E.
coli yjeF and its orthologs in various species. The
C-terminal region of yjeF and its orthologs shows
similarity to hydroxyethylthiazole kinase (thiM) and
other enzymes involved in thiamine biosynthesis. Yeast
YKL151C and B. subtilis yxkO match the yjeF C-terminal
domain but lack this region [Unknown function, General].
Length = 205
Score = 30.1 bits (68), Expect = 5.6
Identities = 14/80 (17%), Positives = 33/80 (41%), Gaps = 8/80 (10%)
Query: 55 VCAMAKKSQSKPMKEILTRLEEFEFIKMIVFS--EETIQKPVDEWPIVDCLISFHSKGFP 112
V + K+ + + ++ + + +K+ S E + KP D I+D ++ KG
Sbjct: 75 VFLLKKEERIEFTEQ---AVVNLKALKVGGISIDEGNLVKPEDCDVIIDAILGTGVKGPL 131
Query: 113 LE---KAIKYANLRKPFVIN 129
E ++ N +++
Sbjct: 132 REPFKTIVEEINELPAPIVS 151
>gnl|CDD|221745 pfam12737, Mating_C, C-terminal domain of homeodomain 1. Mating in
fungi is controlled by the loci that determine the mating
type of an individual, and only individuals with
differing mating types can mate. Basidiomycete fungi have
evolved a unique mating system, termed tetrapolar or
bifactorial incompatibility, in which mating type is
determined by two unlinked loci; compatibility at both
loci is required for mating to occur. The multi-allelic
tetrapolar mating system is considered to be a novel
innovation that could have only evolved once, and is thus
unique to the mushroom fungi. This domain is C-terminal
to the homeodomain transcription factor region.
Length = 418
Score = 30.5 bits (69), Expect = 5.9
Identities = 28/124 (22%), Positives = 43/124 (34%), Gaps = 21/124 (16%)
Query: 1102 STTNVTALDGFGGVPSIRPLETLHNALSLKHLDNFLGKM---IRVIPSKTPASSPPKYPS 1158
ST V AL S PL+ S + + + + P+SS
Sbjct: 288 STFAVPALTSSSVDQSATPLD---QGFSNFGSNMYSEPLNPTNDSLLYGLPSSSSLYANR 344
Query: 1159 T--PIEHSLSGARLGWSGPPSFVSESGPSSPNALSDYFFKSGLNTSDSCSSEFSSSRSSV 1216
T P S S + L +S + S S +S + L+ + S S ++
Sbjct: 345 TIFPAWASTSVSPLDFSTLFNQPSPSPMASQSILAP-------------AQPTSPSPVAL 391
Query: 1217 PSSE 1220
PSSE
Sbjct: 392 PSSE 395
>gnl|CDD|234304 TIGR03671, cca_archaeal, CCA-adding enzyme.
Length = 408
Score = 30.3 bits (69), Expect = 7.9
Identities = 21/72 (29%), Positives = 25/72 (34%), Gaps = 19/72 (26%)
Query: 140 RRKVYALLEKEGIEIPRYAV-------------LDRESPDPVKHE--LVESEDHV----E 180
R + LLE+EG E+ RY V E P H V DH E
Sbjct: 304 GRSLVKLLEREGFEVLRYGVWADENTCYLLLELESAELPRVKLHVGPPVWVRDHAEKFIE 363
Query: 181 VNGIVFNKPFVE 192
V PF+E
Sbjct: 364 KYEDVDYGPFIE 375
>gnl|CDD|222432 pfam13882, Bravo_FIGEY, C-terminal domain of Fibronectin type III.
This is the very C-terminal region of neural adhesion
molecule L1 proteins that are also known as Bravo or
NrCAM. It lies upstream of the IG and Fn3 domains and has
the highly conserved motif FIGEY. The function is not
known.
Length = 118
Score = 28.4 bits (64), Expect = 8.9
Identities = 10/49 (20%), Positives = 18/49 (36%), Gaps = 1/49 (2%)
Query: 993 KRKNSVGDPCPSIVAPEG-HPYRRPRSPNEEQRSRSYDQQHQRPKGASK 1040
+++++ GDP + EG Y RP R + +S
Sbjct: 40 EKEDAPGDPEDQPMKDEGFGEYSRPEDNKPLPRGSQQSLDGEIGDDSST 88
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.316 0.134 0.395
Gapped
Lambda K H
0.267 0.0657 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 64,369,744
Number of extensions: 6384977
Number of successful extensions: 5410
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5397
Number of HSP's successfully gapped: 36
Length of query: 1266
Length of database: 10,937,602
Length adjustment: 108
Effective length of query: 1158
Effective length of database: 6,147,370
Effective search space: 7118654460
Effective search space used: 7118654460
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 65 (29.0 bits)