RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddA 21,609 sequences; 6,263,737 total letters Searching..................................................done Query= gi|254780780|ref|YP_003065193.1| histidine triad (HIT) protein [Candidatus Liberibacter asiaticus str. psy62] (155 letters) >gnl|CDD|30883 COG0537, Hit, Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases [Nucleotide transport and metabolism / Carbohydrate transport and metabolism / General function prediction only]. Length = 138 Score = 131 bits (330), Expect = 9e-32 Identities = 65/139 (46%), Positives = 83/139 (59%), Gaps = 1/139 (0%) Query: 11 NQNIFIKIIRNETNACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQI 70 IF KIIR E A +VYED+ +LA +DI P PGH L+IPK + D+ + PE L+++ Sbjct: 1 MMCIFCKIIRGEIPANKVYEDEHVLAFLDIYPAAPGHTLVIPKRHVSDLEDLDPEELAEL 60 Query: 71 AFLIKKIAIACKSAFQADGIQILQFNGHAAGQTVPHLHFHVIPCKNGDNASHTNIHPTQK 130 L +KIA A K AF ADG I NG AAGQ V HLH H+IP GD+ T+ Sbjct: 61 FLLAQKIAKALKEAFGADGYNIGINNGKAAGQEVFHLHIHIIPRYKGDDNFPGPGWGTKV 120 Query: 131 IENFAKLEINAQKIRKELQ 149 N +LE A+KIRK L+ Sbjct: 121 EPN-EELEELAEKIRKALE 138 >gnl|CDD|29590 cd01277, HINT_subgroup, HINT (histidine triad nucleotide-binding protein) subgroup: Members of this CD belong to the superfamily of histidine triad hydrolases that act on alpha-phosphate of ribonucleotides. This subgroup includes members from all three forms of cellular life. Although the biochemical function has not been characterised for many of the members of this subgroup, the proteins from Yeast have been shown to be involved in secretion, peroxisome formation and gene expression.. Length = 103 Score = 127 bits (320), Expect = 2e-30 Identities = 50/101 (49%), Positives = 66/101 (65%) Query: 13 NIFIKIIRNETNACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAF 72 IF KII E + +VYEDD +LA +DI P + GH L+IPK ++ + PE L+++ Sbjct: 2 CIFCKIIAGEIPSYKVYEDDHVLAFLDINPASKGHTLVIPKKHYENLLDLDPEELAELIL 61 Query: 73 LIKKIAIACKSAFQADGIQILQFNGHAAGQTVPHLHFHVIP 113 KK+A A K A +ADG+ ILQ NG AAGQ V H+H HVIP Sbjct: 62 AAKKVARALKKALKADGLNILQNNGRAAGQVVFHVHVHVIP 102 >gnl|CDD|144720 pfam01230, HIT, HIT domain. Length = 98 Score = 95.1 bits (237), Expect = 8e-21 Identities = 45/86 (52%), Positives = 61/86 (70%) Query: 28 VYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQA 87 VYEDD++LA +DI P+ PGHVL+IPK IR++ + PE L + KK+A A F+A Sbjct: 9 VYEDDLVLAFLDIDPQAPGHVLVIPKKHIRELHDLTPEELGDLMSEAKKVARALGKVFKA 68 Query: 88 DGIQILQFNGHAAGQTVPHLHFHVIP 113 DG +++ NG AAGQ+VPHLH H+IP Sbjct: 69 DGYRVVINNGAAAGQSVPHLHIHIIP 94 >gnl|CDD|38485 KOG3275, KOG3275, KOG3275, Zinc-binding protein of the histidine triad (HIT) family [Signal transduction mechanisms]. Length = 127 Score = 84.6 bits (209), Expect = 1e-17 Identities = 42/107 (39%), Positives = 59/107 (55%), Gaps = 6/107 (5%) Query: 10 DNQNIFIKIIRNETNACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEA---PPEI 66 IF KIIR E A ++EDD LA DI P+ PGH L+IPK I + +A E+ Sbjct: 15 AAPTIFCKIIRKEIPAKIIFEDDRCLAFHDIAPQAPGHFLVIPKKHITQLSKAEDRDDEL 74 Query: 67 LSQIAFLIKKIAIACKSAFQADGIQILQFNGHAAGQTVPHLHFHVIP 113 L + + KK+A K+ DG +++Q NG Q+V H+H HV+ Sbjct: 75 LGHLLPVAKKVA---KALGLEDGYRVVQNNGKDGHQSVYHVHLHVLG 118 >gnl|CDD|29589 cd01276, PKCI_related, Protein Kinase C Interacting protein related (PKCI): PKCI and related proteins belong to the ubiquitous HIT family of hydrolases that act on alpha-phosphates of ribonucleotides. The members of this subgroup have a conserved HxHxHxx motif (x is a hydrophobic residue) that is a signature for this family. No enzymatic activity has been reported however, for PKCI and its related members.. Length = 104 Score = 82.5 bits (204), Expect = 4e-17 Identities = 40/104 (38%), Positives = 55/104 (52%), Gaps = 5/104 (4%) Query: 12 QNIFIKIIRNETNACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAP---PEILS 68 IF KIIR E A +VYEDD +LA DI P+ P H+L+IPK I + +A E+L Sbjct: 1 DCIFCKIIRGEIPAKKVYEDDEVLAFHDINPQAPVHILVIPKKHIASLSDATEEDEELLG 60 Query: 69 QIAFLIKKIAIACKSAFQADGIQILQFNGHAAGQTVPHLHFHVI 112 + K+A DG +++ G GQ V HLH H++ Sbjct: 61 HLLSAAAKVAK--DLGIAEDGYRLVINCGKDGGQEVFHLHLHLL 102 >gnl|CDD|29588 cd01275, FHIT, FHIT (fragile histidine family): FHIT proteins, related to the HIT family carry a motif HxHxH/Qxx (x, is a hydrophobic amino acid), On the basis of sequence, substrate specificity, structure, evolution and mechanism, HIT proteins are classified into three branches: the Hint branch, which consists of adenosine 5' -monophosphoramide hydrolases, the Fhit branch, that consists of diadenosine polyphosphate hydrolases, and the GalT branch consisting of specific nucloside monophosphate transferases. Fhit plays a very important role in the development of tumours. Infact, Fhit deletions are among the earliest and most frequent genetic alterations in the development of tumours.. Length = 126 Score = 73.0 bits (179), Expect = 3e-14 Identities = 30/92 (32%), Positives = 50/92 (54%) Query: 28 VYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQA 87 Y A++++ P NPGHVL++P + + + PE ++ + L++ A K ++ Sbjct: 17 FYRTKHSFAVVNLYPYNPGHVLVVPYRHVPRLEDLTPEEIADLFKLVQLAMKALKVVYKP 76 Query: 88 DGIQILQFNGHAAGQTVPHLHFHVIPCKNGDN 119 DG I +G A G VPH+H H++P NGD Sbjct: 77 DGFNIGINDGKAGGGIVPHVHIHIVPRWNGDT 108 >gnl|CDD|29586 cd00468, HIT_like, HIT family: HIT (Histidine triad) proteins, named for a motif related to the sequence HxHxH/Qxx (x, a hydrophobic amino acid), are a superfamily of nucleotide hydrolases and transferases, which act on the alpha-phosphate of ribonucleotides. On the basis of sequence, substrate specificity, structure, evolution and mechanism, HIT proteins are classified in the literacture into three major branches: the Hint branch, which consists of adenosine 5' -monophosphoramide hydrolases, the Fhit branch, that consists of diadenosine polyphosphate hydrolases, and the GalT branch consisting of specific nucloside monophosphate transferases. Further sequence analysis reveals several new closely related, yet uncharacterized subgroups.. Length = 86 Score = 67.3 bits (164), Expect = 2e-12 Identities = 25/86 (29%), Positives = 47/86 (54%) Query: 28 VYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQA 87 V +D+ A +++ P PGHVL+ PK + + + +L+ + +++A + Sbjct: 1 VPDDEHSFAFVNLKPAAPGHVLVCPKRHVETLPDLDEALLADLVITAQRVAAELEKHGNV 60 Query: 88 DGIQILQFNGHAAGQTVPHLHFHVIP 113 + + +G AAGQ+VPH+H HV+P Sbjct: 61 PSLTVFVNDGAAAGQSVPHVHLHVLP 86 >gnl|CDD|38589 KOG3379, KOG3379, KOG3379, Diadenosine polyphosphate hydrolase and related proteins of the histidine triad (HIT) family [Nucleotide transport and metabolism, General function prediction only]. Length = 150 Score = 63.4 bits (154), Expect = 3e-11 Identities = 27/98 (27%), Positives = 49/98 (50%) Query: 29 YEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQAD 88 Y+ A +++ P PGHVL+ P + + + + + ++K+ + + A Sbjct: 21 YKTKHSFAFVNLKPVVPGHVLVSPLRVVPRLTDLTAAETADLFTTVQKVQRVLEKHYNAT 80 Query: 89 GIQILQFNGHAAGQTVPHLHFHVIPCKNGDNASHTNIH 126 + I +G AGQTVPH+H H++P K GD + I+ Sbjct: 81 SLTIAIQDGPEAGQTVPHVHVHILPRKAGDFGDNDLIY 118 >gnl|CDD|31282 COG1085, GalT, Galactose-1-phosphate uridylyltransferase [Energy production and conversion]. Length = 338 Score = 40.0 bits (93), Expect = 3e-04 Identities = 23/105 (21%), Positives = 41/105 (39%), Gaps = 2/105 (1%) Query: 9 YDNQNIFIKIIRNETNACR--VYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEI 66 + ++ ++ E V E+D LA + R P VLI PK + + + E Sbjct: 183 ENGSCMYCDLVEREKGDGERIVVENDHFLAFVPFWARWPFEVLIYPKEHVSFLTDLSDEE 242 Query: 67 LSQIAFLIKKIAIACKSAFQADGIQILQFNGHAAGQTVPHLHFHV 111 L +A ++KK+ + F + F+ + H H H Sbjct: 243 LKDLAEILKKLLARYDNLFGNSFPYSMGFHQAPFNEVNEHYHLHA 287 >gnl|CDD|29587 cd00608, GalT, Galactose-1-phosphate uridyl transferase (GalT): This enzyme plays a key role in galactose metabolism by catalysing the transfer of a uridine 5'-phosphoryl group from UDP-galactose 1-phosphate. The structure of E.coli GalT reveals that the enzyme contains two identical subunits. It also demonstrates that the active site is formed by amino acid residues from both subunits of the dimer.. Length = 329 Score = 32.9 bits (75), Expect = 0.034 Identities = 19/91 (20%), Positives = 34/91 (37%), Gaps = 5/91 (5%) Query: 28 VYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQA 87 V E++ +A++ R P V I+PK + + E +A ++K++ + F Sbjct: 203 VVENEHFVAVVPFWARWPFEVHILPKRHVSRFTDLTDEEREDLAEILKRLLARYDNLFNC 262 Query: 88 D-----GIQILQFNGHAAGQTVPHLHFHVIP 113 G G H HF + P Sbjct: 263 SFPYSMGWHQAPTGGKELENWYYHWHFEIPP 293 >gnl|CDD|145739 pfam02744, GalP_UDP_tr_C, Galactose-1-phosphate uridyl transferase, C-terminal domain. SCOP reports fold duplication with N-terminal domain. Both involved in Zn and Fe binding. Length = 168 Score = 30.9 bits (70), Expect = 0.13 Identities = 14/72 (19%), Positives = 26/72 (36%), Gaps = 5/72 (6%) Query: 45 PGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQ-----ADGIQILQFNGHA 99 P L +PK + + E + +A ++K + + F+ + GI N Sbjct: 50 PFETLRLPKRHVLSLTELTDKEREDLAAIVKPLTRRYDNLFETSFPYSMGIHQAPLNAEE 109 Query: 100 AGQTVPHLHFHV 111 H HF+ Sbjct: 110 LNHWQFHPHFYP 121 >gnl|CDD|29591 cd01278, aprataxin_related, aprataxin related: Aprataxin, a HINT family hydrolase is mutated in ataxia oculomotor apraxia syndrome. All the members of this subgroup have the conserved HxHxHxx (where x is a hydrophobic residue) signature motif. Members of this subgroup are predominantly eukaryotic in origin.. Length = 104 Score = 30.7 bits (69), Expect = 0.18 Identities = 25/86 (29%), Positives = 37/86 (43%), Gaps = 2/86 (2%) Query: 28 VYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQA 87 VYEDD ++ DI P+ H L+IPK I + E + + + + + Sbjct: 19 VYEDDRVVVFKDIYPKARHHYLVIPKEHIASLKALTKEDVPLLEHM-ETVGREKLLRSDN 77 Query: 88 DGIQILQFNGHA-AGQTVPHLHFHVI 112 +F HA +V HLH HVI Sbjct: 78 TDPSEFRFGFHAPPFTSVSHLHLHVI 103 >gnl|CDD|37687 KOG2476, KOG2476, KOG2476, Uncharacterized conserved protein [Function unknown]. Length = 528 Score = 29.6 bits (66), Expect = 0.34 Identities = 17/75 (22%), Positives = 25/75 (33%), Gaps = 12/75 (16%) Query: 42 PRNPGHVLIIPKSRIRDIFEAPPEILSQI-AFLIKKIAIACKSAFQADGI--QILQFNGH 98 P + HVLIIP I + E+ +I + K D + + + Sbjct: 350 PLSSDHVLIIPIEHIPSLVPLSAEVTQEINKY--KAALRKMYKKQGKDAVVFERQSYRSV 407 Query: 99 AAGQTVPHLHFHVIP 113 HL VIP Sbjct: 408 -------HLQLQVIP 415 >gnl|CDD|173960 cd08550, GlyDH-like, Glycerol_dehydrogenase-like. Families of proteins related to glycerol dehydrogenases. Glycerol dehydrogenases (GlyDH) is a key enzyme in the glycerol dissimilation pathway. In anaerobic conditions, many microorganisms utilize glycerol as a source of carbon through coupled oxidative and reductive pathways. One of the pathways involves the oxidation of glycerol to dihydroxyacetone with the reduction of NAD+ to NADH catalyzed by glycerol dehydrogenases. Dihydroxyacetone is then phosphorylated by dihydroxyacetone kinase and enters the glycolytic pathway for further degradation. The activity of GlyDH is zinc-dependent. The zinc ion plays a role in stabilizing an alkoxide intermediate at the active site. Some subfamilies have not been characterized till now. Length = 349 Score = 29.4 bits (66), Expect = 0.47 Identities = 15/45 (33%), Positives = 21/45 (46%), Gaps = 4/45 (8%) Query: 23 TNACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEIL 67 +N +Y DD A D RNP VL+ + I ++P E L Sbjct: 118 SNLSVIYSDDGEFARYDFQKRNPDLVLVDTEV----IAQSPAEYL 158 >gnl|CDD|37931 KOG2720, KOG2720, KOG2720, Predicted hydrolase (HIT family) [General function prediction only]. Length = 431 Score = 29.2 bits (65), Expect = 0.50 Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 11/81 (13%) Query: 31 DDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQADGI 90 ++ + +++ P GHVLIIP+ + + P+ ++ A L+ A +AD Sbjct: 165 ENSPVVAINVSPIEYGHVLIIPR-----VLKCLPQRITHKALLL-----AVTMMAEADDP 214 Query: 91 QI-LQFNGHAAGQTVPHLHFH 110 L +N A +V HLHFH Sbjct: 215 YFRLGYNSLGAFASVNHLHFH 235 >gnl|CDD|35586 KOG0365, KOG0365, KOG0365, Beta subunit of farnesyltransferase [Posttranslational modification, protein turnover, chaperones]. Length = 423 Score = 27.3 bits (60), Expect = 2.0 Identities = 11/36 (30%), Positives = 17/36 (47%) Query: 76 KIAIACKSAFQADGIQILQFNGHAAGQTVPHLHFHV 111 K+ C S + ILQ +AG ++ HL F+ Sbjct: 293 KLVDGCYSFWVGASAPILQAFYSSAGHSLEHLLFNK 328 >gnl|CDD|35782 KOG0562, KOG0562, KOG0562, Predicted hydrolase (HIT family) [General function prediction only]. Length = 184 Score = 27.3 bits (60), Expect = 2.1 Identities = 25/99 (25%), Positives = 41/99 (41%), Gaps = 6/99 (6%) Query: 18 IIRNETNACRVYED--DILLAIMDIMPRNPGHVLIIPK-SRIRDIFEAPPEILSQIAFLI 74 I+ + VY + D ++ I D P+ H+L++P+ S I +F E LS + Sbjct: 7 IVMDIPKPENVYIESPDDVVVIRDKFPKARMHLLVLPRRSSIDSLFSVVQEHLSLLKEDH 66 Query: 75 KKIAIACKSAFQADGIQILQFN-GHAAGQTVPHLHFHVI 112 C + F G A ++ +LH HVI Sbjct: 67 AVGP--CWVDQLTNEALCNYFRVGFHAVPSMNNLHLHVI 103 >gnl|CDD|36539 KOG1325, KOG1325, KOG1325, Lysophospholipase [Lipid transport and metabolism]. Length = 571 Score = 26.9 bits (59), Expect = 2.5 Identities = 7/22 (31%), Positives = 8/22 (36%) Query: 108 HFHVIPCKNGDNASHTNIHPTQ 129 F +P N N N PT Sbjct: 402 GFPYVPDGNTFNNLGLNTLPTF 423 >gnl|CDD|48021 cd03457, intradiol_dioxygenase_like, Intradiol dioxygenase supgroup. Intradiol dioxygenases catalyze the critical ring-cleavage step in the conversion of catecholate derivatives to citric acid cycle intermediates. They break the catechol C1-C2 bond and utilize Fe3+, as opposed to the extradiol-cleaving enzymes which break the C2-C3 or C1-C6 bond and utilize Fe2+ and Mn+. The family contains catechol 1,2-dioxygenases and protocatechuate 3,4-dioxygenases. The specific function of this subgroup is unknown.. Length = 188 Score = 26.8 bits (59), Expect = 2.6 Identities = 12/33 (36%), Positives = 16/33 (48%), Gaps = 5/33 (15%) Query: 95 FNGHAAGQTVPHLHFHVIP----CKNGDNASHT 123 F G G+ H+HF V P +G N +HT Sbjct: 95 FPGWYPGRA-THIHFKVHPDATSATSGGNVAHT 126 >gnl|CDD|177069 CHL00149, odpA, pyruvate dehydrogenase E1 component alpha subunit; Reviewed. Length = 341 Score = 26.4 bits (58), Expect = 3.4 Identities = 13/29 (44%), Positives = 17/29 (58%) Query: 59 IFEAPPEILSQIAFLIKKIAIACKSAFQA 87 IF AP L AF+ + I IA +AFQ+ Sbjct: 116 IFSAPHNFLGGFAFIGEGIPIALGAAFQS 144 >gnl|CDD|30671 COG0323, MutL, DNA mismatch repair enzyme (predicted ATPase) [DNA replication, recombination, and repair]. Length = 638 Score = 26.5 bits (58), Expect = 3.6 Identities = 7/30 (23%), Positives = 14/30 (46%) Query: 124 NIHPTQKIENFAKLEINAQKIRKELQNFLK 153 N+HP +K F+ + I + ++ L Sbjct: 302 NVHPAKKEVRFSDERLVHDLIYEAIKEALA 331 >gnl|CDD|113446 pfam04677, CwfJ_C_1, Protein similar to CwfJ C-terminus 1. This region is found in the N terminus of Schizosaccharomyces pombe protein CwfJ. CwfJ is part of the Cdc5p complex involved in mRNA splicing. Length = 122 Score = 26.2 bits (58), Expect = 4.1 Identities = 20/73 (27%), Positives = 25/73 (34%), Gaps = 6/73 (8%) Query: 41 MPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQADGIQILQFNGHAA 100 P GH LIIP I E+ +I +K + D + A Sbjct: 42 GPLVSGHCLIIPIQHIPSTLSLDEEVWDEIR-NFRKALTLMYKSQGKDAVFF-----EIA 95 Query: 101 GQTVPHLHFHVIP 113 Q PHLH IP Sbjct: 96 SQRRPHLHIQCIP 108 >gnl|CDD|37190 KOG1979, KOG1979, KOG1979, DNA mismatch repair protein - MLH1 family [Replication, recombination and repair]. Length = 694 Score = 25.3 bits (55), Expect = 6.9 Identities = 8/29 (27%), Positives = 18/29 (62%) Query: 124 NIHPTQKIENFAKLEINAQKIRKELQNFL 152 N+HPT++ +F E ++I ++++ L Sbjct: 308 NVHPTKREVHFLNQEEIIERICQQIEERL 336 >gnl|CDD|33774 COG4015, COG4015, Predicted dinucleotide-utilizing enzyme of the ThiF/HesA family [General function prediction only]. Length = 217 Score = 24.9 bits (54), Expect = 8.7 Identities = 13/54 (24%), Positives = 20/54 (37%) Query: 18 IIRNETNACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIA 71 I E C E + + P H+L+ IRD P +L ++A Sbjct: 146 IGEEEVKVCDAEEAKGPAKFLLLDEGGPDHILVGTGKFIRDFEPITPYVLDRVA 199 Database: CddA Posted date: Feb 4, 2011 9:38 PM Number of letters in database: 6,263,737 Number of sequences in database: 21,609 Lambda K H 0.323 0.137 0.402 Gapped Lambda K H 0.267 0.0699 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21609 Number of Hits to DB: 1,873,535 Number of extensions: 93634 Number of successful extensions: 263 Number of sequences better than 10.0: 1 Number of HSP's gapped: 259 Number of HSP's successfully gapped: 32 Length of query: 155 Length of database: 6,263,737 Length adjustment: 86 Effective length of query: 69 Effective length of database: 4,405,363 Effective search space: 303970047 Effective search space used: 303970047 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 53 (24.3 bits)