RPS-BLAST 2.2.22 [Sep-27-2009]
Database: CddA
21,609 sequences; 6,263,737 total letters
Searching..................................................done
Query= gi|254780780|ref|YP_003065193.1| histidine triad (HIT) protein
[Candidatus Liberibacter asiaticus str. psy62]
(155 letters)
>gnl|CDD|30883 COG0537, Hit, Diadenosine tetraphosphate (Ap4A) hydrolase and other
HIT family hydrolases [Nucleotide transport and
metabolism / Carbohydrate transport and metabolism /
General function prediction only].
Length = 138
Score = 131 bits (330), Expect = 9e-32
Identities = 65/139 (46%), Positives = 83/139 (59%), Gaps = 1/139 (0%)
Query: 11 NQNIFIKIIRNETNACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQI 70
IF KIIR E A +VYED+ +LA +DI P PGH L+IPK + D+ + PE L+++
Sbjct: 1 MMCIFCKIIRGEIPANKVYEDEHVLAFLDIYPAAPGHTLVIPKRHVSDLEDLDPEELAEL 60
Query: 71 AFLIKKIAIACKSAFQADGIQILQFNGHAAGQTVPHLHFHVIPCKNGDNASHTNIHPTQK 130
L +KIA A K AF ADG I NG AAGQ V HLH H+IP GD+ T+
Sbjct: 61 FLLAQKIAKALKEAFGADGYNIGINNGKAAGQEVFHLHIHIIPRYKGDDNFPGPGWGTKV 120
Query: 131 IENFAKLEINAQKIRKELQ 149
N +LE A+KIRK L+
Sbjct: 121 EPN-EELEELAEKIRKALE 138
>gnl|CDD|29590 cd01277, HINT_subgroup, HINT (histidine triad nucleotide-binding
protein) subgroup: Members of this CD belong to the
superfamily of histidine triad hydrolases that act on
alpha-phosphate of ribonucleotides. This subgroup
includes members from all three forms of cellular life.
Although the biochemical function has not been
characterised for many of the members of this subgroup,
the proteins from Yeast have been shown to be involved
in secretion, peroxisome formation and gene expression..
Length = 103
Score = 127 bits (320), Expect = 2e-30
Identities = 50/101 (49%), Positives = 66/101 (65%)
Query: 13 NIFIKIIRNETNACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAF 72
IF KII E + +VYEDD +LA +DI P + GH L+IPK ++ + PE L+++
Sbjct: 2 CIFCKIIAGEIPSYKVYEDDHVLAFLDINPASKGHTLVIPKKHYENLLDLDPEELAELIL 61
Query: 73 LIKKIAIACKSAFQADGIQILQFNGHAAGQTVPHLHFHVIP 113
KK+A A K A +ADG+ ILQ NG AAGQ V H+H HVIP
Sbjct: 62 AAKKVARALKKALKADGLNILQNNGRAAGQVVFHVHVHVIP 102
>gnl|CDD|144720 pfam01230, HIT, HIT domain.
Length = 98
Score = 95.1 bits (237), Expect = 8e-21
Identities = 45/86 (52%), Positives = 61/86 (70%)
Query: 28 VYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQA 87
VYEDD++LA +DI P+ PGHVL+IPK IR++ + PE L + KK+A A F+A
Sbjct: 9 VYEDDLVLAFLDIDPQAPGHVLVIPKKHIRELHDLTPEELGDLMSEAKKVARALGKVFKA 68
Query: 88 DGIQILQFNGHAAGQTVPHLHFHVIP 113
DG +++ NG AAGQ+VPHLH H+IP
Sbjct: 69 DGYRVVINNGAAAGQSVPHLHIHIIP 94
>gnl|CDD|38485 KOG3275, KOG3275, KOG3275, Zinc-binding protein of the histidine
triad (HIT) family [Signal transduction mechanisms].
Length = 127
Score = 84.6 bits (209), Expect = 1e-17
Identities = 42/107 (39%), Positives = 59/107 (55%), Gaps = 6/107 (5%)
Query: 10 DNQNIFIKIIRNETNACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEA---PPEI 66
IF KIIR E A ++EDD LA DI P+ PGH L+IPK I + +A E+
Sbjct: 15 AAPTIFCKIIRKEIPAKIIFEDDRCLAFHDIAPQAPGHFLVIPKKHITQLSKAEDRDDEL 74
Query: 67 LSQIAFLIKKIAIACKSAFQADGIQILQFNGHAAGQTVPHLHFHVIP 113
L + + KK+A K+ DG +++Q NG Q+V H+H HV+
Sbjct: 75 LGHLLPVAKKVA---KALGLEDGYRVVQNNGKDGHQSVYHVHLHVLG 118
>gnl|CDD|29589 cd01276, PKCI_related, Protein Kinase C Interacting protein related
(PKCI): PKCI and related proteins belong to the
ubiquitous HIT family of hydrolases that act on
alpha-phosphates of ribonucleotides. The members of this
subgroup have a conserved HxHxHxx motif (x is a
hydrophobic residue) that is a signature for this
family. No enzymatic activity has been reported however,
for PKCI and its related members..
Length = 104
Score = 82.5 bits (204), Expect = 4e-17
Identities = 40/104 (38%), Positives = 55/104 (52%), Gaps = 5/104 (4%)
Query: 12 QNIFIKIIRNETNACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAP---PEILS 68
IF KIIR E A +VYEDD +LA DI P+ P H+L+IPK I + +A E+L
Sbjct: 1 DCIFCKIIRGEIPAKKVYEDDEVLAFHDINPQAPVHILVIPKKHIASLSDATEEDEELLG 60
Query: 69 QIAFLIKKIAIACKSAFQADGIQILQFNGHAAGQTVPHLHFHVI 112
+ K+A DG +++ G GQ V HLH H++
Sbjct: 61 HLLSAAAKVAK--DLGIAEDGYRLVINCGKDGGQEVFHLHLHLL 102
>gnl|CDD|29588 cd01275, FHIT, FHIT (fragile histidine family): FHIT proteins,
related to the HIT family carry a motif HxHxH/Qxx (x, is
a hydrophobic amino acid), On the basis of sequence,
substrate specificity, structure, evolution and
mechanism, HIT proteins are classified into three
branches: the Hint branch, which consists of adenosine
5' -monophosphoramide hydrolases, the Fhit branch, that
consists of diadenosine polyphosphate hydrolases, and
the GalT branch consisting of specific nucloside
monophosphate transferases. Fhit plays a very important
role in the development of tumours. Infact, Fhit
deletions are among the earliest and most frequent
genetic alterations in the development of tumours..
Length = 126
Score = 73.0 bits (179), Expect = 3e-14
Identities = 30/92 (32%), Positives = 50/92 (54%)
Query: 28 VYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQA 87
Y A++++ P NPGHVL++P + + + PE ++ + L++ A K ++
Sbjct: 17 FYRTKHSFAVVNLYPYNPGHVLVVPYRHVPRLEDLTPEEIADLFKLVQLAMKALKVVYKP 76
Query: 88 DGIQILQFNGHAAGQTVPHLHFHVIPCKNGDN 119
DG I +G A G VPH+H H++P NGD
Sbjct: 77 DGFNIGINDGKAGGGIVPHVHIHIVPRWNGDT 108
>gnl|CDD|29586 cd00468, HIT_like, HIT family: HIT (Histidine triad) proteins,
named for a motif related to the sequence HxHxH/Qxx (x,
a hydrophobic amino acid), are a superfamily of
nucleotide hydrolases and transferases, which act on the
alpha-phosphate of ribonucleotides. On the basis of
sequence, substrate specificity, structure, evolution
and mechanism, HIT proteins are classified in the
literacture into three major branches: the Hint branch,
which consists of adenosine 5' -monophosphoramide
hydrolases, the Fhit branch, that consists of
diadenosine polyphosphate hydrolases, and the GalT
branch consisting of specific nucloside monophosphate
transferases. Further sequence analysis reveals several
new closely related, yet uncharacterized subgroups..
Length = 86
Score = 67.3 bits (164), Expect = 2e-12
Identities = 25/86 (29%), Positives = 47/86 (54%)
Query: 28 VYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQA 87
V +D+ A +++ P PGHVL+ PK + + + +L+ + +++A +
Sbjct: 1 VPDDEHSFAFVNLKPAAPGHVLVCPKRHVETLPDLDEALLADLVITAQRVAAELEKHGNV 60
Query: 88 DGIQILQFNGHAAGQTVPHLHFHVIP 113
+ + +G AAGQ+VPH+H HV+P
Sbjct: 61 PSLTVFVNDGAAAGQSVPHVHLHVLP 86
>gnl|CDD|38589 KOG3379, KOG3379, KOG3379, Diadenosine polyphosphate hydrolase and
related proteins of the histidine triad (HIT) family
[Nucleotide transport and metabolism, General function
prediction only].
Length = 150
Score = 63.4 bits (154), Expect = 3e-11
Identities = 27/98 (27%), Positives = 49/98 (50%)
Query: 29 YEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQAD 88
Y+ A +++ P PGHVL+ P + + + + + ++K+ + + A
Sbjct: 21 YKTKHSFAFVNLKPVVPGHVLVSPLRVVPRLTDLTAAETADLFTTVQKVQRVLEKHYNAT 80
Query: 89 GIQILQFNGHAAGQTVPHLHFHVIPCKNGDNASHTNIH 126
+ I +G AGQTVPH+H H++P K GD + I+
Sbjct: 81 SLTIAIQDGPEAGQTVPHVHVHILPRKAGDFGDNDLIY 118
>gnl|CDD|31282 COG1085, GalT, Galactose-1-phosphate uridylyltransferase [Energy
production and conversion].
Length = 338
Score = 40.0 bits (93), Expect = 3e-04
Identities = 23/105 (21%), Positives = 41/105 (39%), Gaps = 2/105 (1%)
Query: 9 YDNQNIFIKIIRNETNACR--VYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEI 66
+ ++ ++ E V E+D LA + R P VLI PK + + + E
Sbjct: 183 ENGSCMYCDLVEREKGDGERIVVENDHFLAFVPFWARWPFEVLIYPKEHVSFLTDLSDEE 242
Query: 67 LSQIAFLIKKIAIACKSAFQADGIQILQFNGHAAGQTVPHLHFHV 111
L +A ++KK+ + F + F+ + H H H
Sbjct: 243 LKDLAEILKKLLARYDNLFGNSFPYSMGFHQAPFNEVNEHYHLHA 287
>gnl|CDD|29587 cd00608, GalT, Galactose-1-phosphate uridyl transferase (GalT):
This enzyme plays a key role in galactose metabolism by
catalysing the transfer of a uridine 5'-phosphoryl group
from UDP-galactose 1-phosphate. The structure of E.coli
GalT reveals that the enzyme contains two identical
subunits. It also demonstrates that the active site is
formed by amino acid residues from both subunits of the
dimer..
Length = 329
Score = 32.9 bits (75), Expect = 0.034
Identities = 19/91 (20%), Positives = 34/91 (37%), Gaps = 5/91 (5%)
Query: 28 VYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQA 87
V E++ +A++ R P V I+PK + + E +A ++K++ + F
Sbjct: 203 VVENEHFVAVVPFWARWPFEVHILPKRHVSRFTDLTDEEREDLAEILKRLLARYDNLFNC 262
Query: 88 D-----GIQILQFNGHAAGQTVPHLHFHVIP 113
G G H HF + P
Sbjct: 263 SFPYSMGWHQAPTGGKELENWYYHWHFEIPP 293
>gnl|CDD|145739 pfam02744, GalP_UDP_tr_C, Galactose-1-phosphate uridyl transferase,
C-terminal domain. SCOP reports fold duplication with
N-terminal domain. Both involved in Zn and Fe binding.
Length = 168
Score = 30.9 bits (70), Expect = 0.13
Identities = 14/72 (19%), Positives = 26/72 (36%), Gaps = 5/72 (6%)
Query: 45 PGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQ-----ADGIQILQFNGHA 99
P L +PK + + E + +A ++K + + F+ + GI N
Sbjct: 50 PFETLRLPKRHVLSLTELTDKEREDLAAIVKPLTRRYDNLFETSFPYSMGIHQAPLNAEE 109
Query: 100 AGQTVPHLHFHV 111
H HF+
Sbjct: 110 LNHWQFHPHFYP 121
>gnl|CDD|29591 cd01278, aprataxin_related, aprataxin related: Aprataxin, a HINT
family hydrolase is mutated in ataxia oculomotor apraxia
syndrome. All the members of this subgroup have the
conserved HxHxHxx (where x is a hydrophobic residue)
signature motif. Members of this subgroup are
predominantly eukaryotic in origin..
Length = 104
Score = 30.7 bits (69), Expect = 0.18
Identities = 25/86 (29%), Positives = 37/86 (43%), Gaps = 2/86 (2%)
Query: 28 VYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQA 87
VYEDD ++ DI P+ H L+IPK I + E + + + + +
Sbjct: 19 VYEDDRVVVFKDIYPKARHHYLVIPKEHIASLKALTKEDVPLLEHM-ETVGREKLLRSDN 77
Query: 88 DGIQILQFNGHA-AGQTVPHLHFHVI 112
+F HA +V HLH HVI
Sbjct: 78 TDPSEFRFGFHAPPFTSVSHLHLHVI 103
>gnl|CDD|37687 KOG2476, KOG2476, KOG2476, Uncharacterized conserved protein
[Function unknown].
Length = 528
Score = 29.6 bits (66), Expect = 0.34
Identities = 17/75 (22%), Positives = 25/75 (33%), Gaps = 12/75 (16%)
Query: 42 PRNPGHVLIIPKSRIRDIFEAPPEILSQI-AFLIKKIAIACKSAFQADGI--QILQFNGH 98
P + HVLIIP I + E+ +I + K D + + +
Sbjct: 350 PLSSDHVLIIPIEHIPSLVPLSAEVTQEINKY--KAALRKMYKKQGKDAVVFERQSYRSV 407
Query: 99 AAGQTVPHLHFHVIP 113
HL VIP
Sbjct: 408 -------HLQLQVIP 415
>gnl|CDD|173960 cd08550, GlyDH-like, Glycerol_dehydrogenase-like. Families of
proteins related to glycerol dehydrogenases. Glycerol
dehydrogenases (GlyDH) is a key enzyme in the glycerol
dissimilation pathway. In anaerobic conditions, many
microorganisms utilize glycerol as a source of carbon
through coupled oxidative and reductive pathways. One of
the pathways involves the oxidation of glycerol to
dihydroxyacetone with the reduction of NAD+ to NADH
catalyzed by glycerol dehydrogenases. Dihydroxyacetone
is then phosphorylated by dihydroxyacetone kinase and
enters the glycolytic pathway for further degradation.
The activity of GlyDH is zinc-dependent. The zinc ion
plays a role in stabilizing an alkoxide intermediate at
the active site. Some subfamilies have not been
characterized till now.
Length = 349
Score = 29.4 bits (66), Expect = 0.47
Identities = 15/45 (33%), Positives = 21/45 (46%), Gaps = 4/45 (8%)
Query: 23 TNACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEIL 67
+N +Y DD A D RNP VL+ + I ++P E L
Sbjct: 118 SNLSVIYSDDGEFARYDFQKRNPDLVLVDTEV----IAQSPAEYL 158
>gnl|CDD|37931 KOG2720, KOG2720, KOG2720, Predicted hydrolase (HIT family)
[General function prediction only].
Length = 431
Score = 29.2 bits (65), Expect = 0.50
Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 11/81 (13%)
Query: 31 DDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQADGI 90
++ + +++ P GHVLIIP+ + + P+ ++ A L+ A +AD
Sbjct: 165 ENSPVVAINVSPIEYGHVLIIPR-----VLKCLPQRITHKALLL-----AVTMMAEADDP 214
Query: 91 QI-LQFNGHAAGQTVPHLHFH 110
L +N A +V HLHFH
Sbjct: 215 YFRLGYNSLGAFASVNHLHFH 235
>gnl|CDD|35586 KOG0365, KOG0365, KOG0365, Beta subunit of farnesyltransferase
[Posttranslational modification, protein turnover,
chaperones].
Length = 423
Score = 27.3 bits (60), Expect = 2.0
Identities = 11/36 (30%), Positives = 17/36 (47%)
Query: 76 KIAIACKSAFQADGIQILQFNGHAAGQTVPHLHFHV 111
K+ C S + ILQ +AG ++ HL F+
Sbjct: 293 KLVDGCYSFWVGASAPILQAFYSSAGHSLEHLLFNK 328
>gnl|CDD|35782 KOG0562, KOG0562, KOG0562, Predicted hydrolase (HIT family)
[General function prediction only].
Length = 184
Score = 27.3 bits (60), Expect = 2.1
Identities = 25/99 (25%), Positives = 41/99 (41%), Gaps = 6/99 (6%)
Query: 18 IIRNETNACRVYED--DILLAIMDIMPRNPGHVLIIPK-SRIRDIFEAPPEILSQIAFLI 74
I+ + VY + D ++ I D P+ H+L++P+ S I +F E LS +
Sbjct: 7 IVMDIPKPENVYIESPDDVVVIRDKFPKARMHLLVLPRRSSIDSLFSVVQEHLSLLKEDH 66
Query: 75 KKIAIACKSAFQADGIQILQFN-GHAAGQTVPHLHFHVI 112
C + F G A ++ +LH HVI
Sbjct: 67 AVGP--CWVDQLTNEALCNYFRVGFHAVPSMNNLHLHVI 103
>gnl|CDD|36539 KOG1325, KOG1325, KOG1325, Lysophospholipase [Lipid transport and
metabolism].
Length = 571
Score = 26.9 bits (59), Expect = 2.5
Identities = 7/22 (31%), Positives = 8/22 (36%)
Query: 108 HFHVIPCKNGDNASHTNIHPTQ 129
F +P N N N PT
Sbjct: 402 GFPYVPDGNTFNNLGLNTLPTF 423
>gnl|CDD|48021 cd03457, intradiol_dioxygenase_like, Intradiol dioxygenase
supgroup. Intradiol dioxygenases catalyze the critical
ring-cleavage step in the conversion of catecholate
derivatives to citric acid cycle intermediates. They
break the catechol C1-C2 bond and utilize Fe3+, as
opposed to the extradiol-cleaving enzymes which break
the C2-C3 or C1-C6 bond and utilize Fe2+ and Mn+. The
family contains catechol 1,2-dioxygenases and
protocatechuate 3,4-dioxygenases. The specific function
of this subgroup is unknown..
Length = 188
Score = 26.8 bits (59), Expect = 2.6
Identities = 12/33 (36%), Positives = 16/33 (48%), Gaps = 5/33 (15%)
Query: 95 FNGHAAGQTVPHLHFHVIP----CKNGDNASHT 123
F G G+ H+HF V P +G N +HT
Sbjct: 95 FPGWYPGRA-THIHFKVHPDATSATSGGNVAHT 126
>gnl|CDD|177069 CHL00149, odpA, pyruvate dehydrogenase E1 component alpha subunit;
Reviewed.
Length = 341
Score = 26.4 bits (58), Expect = 3.4
Identities = 13/29 (44%), Positives = 17/29 (58%)
Query: 59 IFEAPPEILSQIAFLIKKIAIACKSAFQA 87
IF AP L AF+ + I IA +AFQ+
Sbjct: 116 IFSAPHNFLGGFAFIGEGIPIALGAAFQS 144
>gnl|CDD|30671 COG0323, MutL, DNA mismatch repair enzyme (predicted ATPase) [DNA
replication, recombination, and repair].
Length = 638
Score = 26.5 bits (58), Expect = 3.6
Identities = 7/30 (23%), Positives = 14/30 (46%)
Query: 124 NIHPTQKIENFAKLEINAQKIRKELQNFLK 153
N+HP +K F+ + I + ++ L
Sbjct: 302 NVHPAKKEVRFSDERLVHDLIYEAIKEALA 331
>gnl|CDD|113446 pfam04677, CwfJ_C_1, Protein similar to CwfJ C-terminus 1. This
region is found in the N terminus of Schizosaccharomyces
pombe protein CwfJ. CwfJ is part of the Cdc5p complex
involved in mRNA splicing.
Length = 122
Score = 26.2 bits (58), Expect = 4.1
Identities = 20/73 (27%), Positives = 25/73 (34%), Gaps = 6/73 (8%)
Query: 41 MPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQADGIQILQFNGHAA 100
P GH LIIP I E+ +I +K + D + A
Sbjct: 42 GPLVSGHCLIIPIQHIPSTLSLDEEVWDEIR-NFRKALTLMYKSQGKDAVFF-----EIA 95
Query: 101 GQTVPHLHFHVIP 113
Q PHLH IP
Sbjct: 96 SQRRPHLHIQCIP 108
>gnl|CDD|37190 KOG1979, KOG1979, KOG1979, DNA mismatch repair protein - MLH1
family [Replication, recombination and repair].
Length = 694
Score = 25.3 bits (55), Expect = 6.9
Identities = 8/29 (27%), Positives = 18/29 (62%)
Query: 124 NIHPTQKIENFAKLEINAQKIRKELQNFL 152
N+HPT++ +F E ++I ++++ L
Sbjct: 308 NVHPTKREVHFLNQEEIIERICQQIEERL 336
>gnl|CDD|33774 COG4015, COG4015, Predicted dinucleotide-utilizing enzyme of the
ThiF/HesA family [General function prediction only].
Length = 217
Score = 24.9 bits (54), Expect = 8.7
Identities = 13/54 (24%), Positives = 20/54 (37%)
Query: 18 IIRNETNACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIA 71
I E C E + + P H+L+ IRD P +L ++A
Sbjct: 146 IGEEEVKVCDAEEAKGPAKFLLLDEGGPDHILVGTGKFIRDFEPITPYVLDRVA 199
Database: CddA
Posted date: Feb 4, 2011 9:38 PM
Number of letters in database: 6,263,737
Number of sequences in database: 21,609
Lambda K H
0.323 0.137 0.402
Gapped
Lambda K H
0.267 0.0699 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 21609
Number of Hits to DB: 1,873,535
Number of extensions: 93634
Number of successful extensions: 263
Number of sequences better than 10.0: 1
Number of HSP's gapped: 259
Number of HSP's successfully gapped: 32
Length of query: 155
Length of database: 6,263,737
Length adjustment: 86
Effective length of query: 69
Effective length of database: 4,405,363
Effective search space: 303970047
Effective search space used: 303970047
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 53 (24.3 bits)