RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddA 21,609 sequences; 6,263,737 total letters Searching..................................................done Query= gi|254780629|ref|YP_003065042.1| putative deoxyribonucleotide triphosphate pyrophosphatase [Candidatus Liberibacter asiaticus str. psy62] (224 letters) >gnl|CDD|145072 pfam01725, Ham1p_like, Ham1 family. This family consists of the HAM1 protein and hypothetical archaeal bacterial and C. elegans proteins. HAM1 controls 6-N-hydroxylaminopurine (HAP) sensitivity and mutagenesis in S. cerevisiae HAM1. The HAM1 protein protects the cell from HAP, either on the level of deoxynucleoside triphosphate or the DNA level by a yet unidentified set of reactions. Length = 181 Score = 204 bits (522), Expect = 2e-53 Identities = 86/208 (41%), Positives = 123/208 (59%), Gaps = 29/208 (13%) Query: 9 IVIASHNVDKIHEMDSLIMPLGIMTTSALELNLIIPEETGNSFEENAMIKSLTAAKNAGM 68 IV A+ N K+ E+ +++ LGI I EETG++FEENA++K+ AAK G Sbjct: 1 IVFATGNKGKLKEIKAILGDLGIEVIDLKHDP-IEVEETGDTFEENALLKAKAAAKALGK 59 Query: 69 PALSDDSGLVIDVLDGKPGIHSARWAESNTGERDFDMAMQKIENALRSKFAHDPAFRSAH 128 P L+DDSGL +D L+G PG++SAR+A++ E+ ++ ++ +E+ RSA+ Sbjct: 60 PVLADDSGLCVDALNGFPGVYSARFADTLNNEKLLEL-LEGVED------------RSAY 106 Query: 129 FISVLSLAWPDGHVENFSGKVSGIIVWPPRGQLGFGYDPIFQPNGYDRTFGEMTEEEKNG 188 F+ V++LA PDG V F G V G IV PRG+ GFGYDPIF P GY +TF EM+EEEKN Sbjct: 107 FVCVIALADPDGKVLVFEGIVEGEIVEEPRGEGGFGYDPIFIPPGYGKTFAEMSEEEKN- 165 Query: 189 GIDSATLFSILSTDLLSHRARAFKCFVD 216 +SHR +A + + Sbjct: 166 --------------KISHRGKALRKLKE 179 >gnl|CDD|29953 cd00515, HAM1, NTPase/HAM1. This family consists of the HAM1 protein and pyrophosphate-releasing xanthosine/ inosine triphosphatase. HAM1 protects the cell against mutagenesis by the base analog 6-N-hydroxylaminopurine (HAP) in E. Coli and S. cerevisiae. A Ham1-related protein from Methanococcus jannaschii is a novel NTPase that has been shown to hydrolyze nonstandard nucleotides such as XTP to XMP and ITP to IMP, but not the standard nucleotides, in the presence of Mg or Mn ions. The enzyme exists as a homodimer. The HAM1 protein may be acting as an NTPase by hydrolyzing the HAP triphosphate.. Length = 183 Score = 198 bits (504), Expect = 1e-51 Identities = 86/208 (41%), Positives = 118/208 (56%), Gaps = 26/208 (12%) Query: 9 IVIASHNVDKIHEMDSLIMPLGIMTTSALELNLIIPEETGNSFEENAMIKSLTAAKNAGM 68 IV A+ N K+ E ++ P GI S ++ I EETG++FEENA++K+ AA+ G+ Sbjct: 1 IVFATGNKGKLKEFKEILAPFGIEVVSLKDI--IDIEETGSTFEENALLKARAAAEALGL 58 Query: 69 PALSDDSGLVIDVLDGKPGIHSARWAESNTGERDFDMAMQKIENALRSKFAHDPAFRSAH 128 P L+DDSGL +D L+G PG++SAR+A GE D +K+ L RSA+ Sbjct: 59 PVLADDSGLCVDALNGFPGVYSARFA----GEHDDAENNEKLLELLE-----GDEDRSAY 109 Query: 129 FISVLSLAWPDGHVENFSGKVSGIIVWPPRGQLGFGYDPIFQPNGYDRTFGEMTEEEKNG 188 F+ V++L PDG F G+V G IV PRG GFGYDPIF P GY +TF EM+ EEKN Sbjct: 110 FVCVIALVDPDGEPLVFEGEVEGKIVTEPRGTGGFGYDPIFIPEGYGKTFAEMSPEEKNA 169 Query: 189 GIDSATLFSILSTDLLSHRARAFKCFVD 216 +SHR +A + + Sbjct: 170 ---------------ISHRGKALRKLKE 182 >gnl|CDD|30476 COG0127, COG0127, Xanthosine triphosphate pyrophosphatase [Nucleotide transport and metabolism]. Length = 194 Score = 195 bits (498), Expect = 7e-51 Identities = 89/209 (42%), Positives = 121/209 (57%), Gaps = 25/209 (11%) Query: 9 IVIASHNVDKIHEMDSLIMPLGIMTTSALELNLII-PEETGNSFEENAMIKSLTAAKNAG 67 IV+A+ N K+ E+ S++ P GI S EL + I EETG +FEENA++K+ AAK G Sbjct: 4 IVLATGNKGKLRELKSILAPGGIEIESLKELGVEIEVEETGLTFEENALLKARAAAKATG 63 Query: 68 MPALSDDSGLVIDVLDGKPGIHSARWAESNTGERDFDMAMQKIENALRSKFAHDPAFRSA 127 +P ++DDSGL +D L+G PG++SAR+A GE D + +K+ L RSA Sbjct: 64 LPVIADDSGLCVDALNGFPGVYSARFA----GEADDTIGNEKLLKLLEGV-----PDRSA 114 Query: 128 HFISVLSLAWPDGHVENFSGKVSGIIVWPPRGQLGFGYDPIFQPNGYDRTFGEMTEEEKN 187 +F+ V+ LA G F G+V G I PRG+ GFGYDPIF P GY +TF E++ EEKN Sbjct: 115 YFVCVIVLARDGGEPIVFEGEVEGEIAREPRGEGGFGYDPIFIPEGYGKTFAELSTEEKN 174 Query: 188 GGIDSATLFSILSTDLLSHRARAFKCFVD 216 +SHRARA K + Sbjct: 175 A---------------ISHRARALKKLKE 188 >gnl|CDD|38432 KOG3222, KOG3222, KOG3222, Inosine triphosphate pyrophosphatase [Nucleotide transport and metabolism]. Length = 195 Score = 80.7 bits (199), Expect = 3e-16 Identities = 63/215 (29%), Positives = 98/215 (45%), Gaps = 33/215 (15%) Query: 4 LIENNIVIASHNVDKIHEMDSLIMPLGIMTTSALELNLIIPEETGNSFEENAMIKSLTAA 63 ++ I + N K+ E+ +++ + + ++L +PE G S EE A+ K AA Sbjct: 2 VMSREINFVTGNAKKLEEVRAILGD-SFPYCTLINIDLDLPEIQG-SPEEIAIEKCKVAA 59 Query: 64 KNAGMPALSDDSGLVIDVLDGKPGIHSARWAESNTGERDFDMAMQKIENALRSKFAHDPA 123 + P L +D+ L + L G PG + +W G + K EN Sbjct: 60 EIVKGPVLVEDTSLCFNALGGLPGPY-IKWFLKKLGPEGLHEMLSKFEN----------- 107 Query: 124 FRSAHFISVLSLAWPDGH--VENFSGKVSGIIVWPPRGQLGFGYDPIFQPNGYDRTFGEM 181 +SA+ + A G V F+G+ G IV PPRG FG+DPIFQP+GY++T+ EM Sbjct: 108 -KSAYALCTFGYADGRGAKPVHTFAGETEGSIV-PPRGPTDFGWDPIFQPDGYEQTYAEM 165 Query: 182 TEEEKNGGIDSATLFSILSTDLLSHRARAFKCFVD 216 ++EKN +SHR RA + Sbjct: 166 PKDEKN---------------AISHRYRALAKLKE 185 >gnl|CDD|73242 cd00985, Maf_Ham1, Maf_Ham1. Maf, a nucleotide binding protein, has been implicated in inhibition of septum formation in eukaryotes, bacteria and archaea. A Ham1-related protein from Methanococcus jannaschii is a novel NTPase that has been shown to hydrolyze nonstandard nucleotides, such as hypoxanthine/xanthine NTP, but not standard nucleotides.. Length = 131 Score = 65.3 bits (159), Expect = 1e-11 Identities = 37/152 (24%), Positives = 71/152 (46%), Gaps = 36/152 (23%) Query: 9 IVIASHNVDKIHEMDSL------IMPLGIMTTSALELNLIIPEETGNSFEENAMIKSLTA 62 +++AS + ++ E+ + ++P I T + E ++ EE A++K+ Sbjct: 1 LILASGSPRRLEELKQIGGIEFEVLPSDIDETG-------LKGEPEDTVEELALLKARAV 53 Query: 63 AKN-AGMPALSDDSGLVIDVLDGKPGIHSARWAESNTGERDFDMAMQKIENALRSKFAHD 121 A+ P ++DD+GLV+D G+PG AR+AE+ ++ + Sbjct: 54 AERLPDAPVIADDTGLVVD---GRPGGKPARFAEAL-------EMLRGLSG--------- 94 Query: 122 PAFRSAHFISVLSLAWPDGHVENFSGKVSGII 153 R+A F++ ++L PDG + F G+ G I Sbjct: 95 ---RTAEFVTAVALVDPDGKIITFEGETEGKI 123 >gnl|CDD|39490 KOG4289, KOG4289, KOG4289, Cadherin EGF LAG seven-pass G-type receptor [Signal transduction mechanisms]. Length = 2531 Score = 31.9 bits (72), Expect = 0.13 Identities = 15/31 (48%), Positives = 19/31 (61%) Query: 53 ENAMIKSLTAAKNAGMPALSDDSGLVIDVLD 83 EN+ A++ G P LS+ SGLVI VLD Sbjct: 440 ENSEYTLRIRAQDGGRPPLSNTSGLVIQVLD 470 >gnl|CDD|145115 pfam01784, NIF3, NIF3 (NGG1p interacting factor 3). This family contains several NIF3 (NGG1p interacting factor 3) protein homologues. NIF3 interacts with the yeast transcriptional coactivator NGG1p which is part of the ADA complex, the exact function of this interaction is unknown. Length = 238 Score = 27.6 bits (62), Expect = 2.7 Identities = 16/49 (32%), Positives = 23/49 (46%), Gaps = 4/49 (8%) Query: 2 RKLIENNI-VIASH-NVDKIHE--MDSLIMPLGIMTTSALELNLIIPEE 46 KLI+NNI + ++H N+D D L LG+ LE + E Sbjct: 82 LKLIKNNISLYSAHTNLDAAPGGVNDWLAELLGLENIEPLEPKTVSEGE 130 >gnl|CDD|32509 COG2362, DppA, D-aminopeptidase [Amino acid transport and metabolism]. Length = 274 Score = 27.6 bits (61), Expect = 2.8 Identities = 19/101 (18%), Positives = 35/101 (34%), Gaps = 9/101 (8%) Query: 40 NLIIPEETGNSFEENAMIKSLTAAKNAGMPA--LSDDSGLVIDVLDGKPGIHSARWAESN 97 + I + NA A G+P +S D + + P I + E++ Sbjct: 124 RVRINGVEVGEYGLNAY-----LAGEYGVPVALVSGDDVAAREARELTPWIETVAVKEAS 178 Query: 98 TGERDFDMAMQKIENALRSKFAHDPAFRSAHFISVLSLAWP 138 + ++ K E A+R A + ++ L A P Sbjct: 179 SRSAAISLSPAKAEKAIREAVRA--AVQRKLSLAPLRPADP 217 >gnl|CDD|37359 KOG2148, KOG2148, KOG2148, Exocyst protein Sec3 [Intracellular trafficking, secretion, and vesicular transport]. Length = 867 Score = 27.3 bits (60), Expect = 3.2 Identities = 16/48 (33%), Positives = 22/48 (45%), Gaps = 2/48 (4%) Query: 178 FGEMTEEEKNG-GIDSATLFSILSTDLLSHRARAFKCFVDN-CLRIDE 223 F MTE + D A+ S+ LL R F F+ N C +I+E Sbjct: 586 FVIMTEHVLSAQKADPASFLSLTLGSLLVRVKRQFDRFIQNQCSQIEE 633 >gnl|CDD|176475 cd08638, DNA_pol_A_theta, DNA polymerase theta is a low-fidelity family A enzyme implicated in translesion synthesis and in somatic hypermutation. DNA polymerase theta is a low-fidelity family A enzyme implicated in translesion synthesis (TLS) and in somatic hypermutation (SHM). DNA-dependent DNA polymerases can be classified in six main groups based upon phylogenetic relationships with E. coli polymerase I (classA), E. coli polymerase II (class B), E.coli polymerase III (class C), euryarchaaeota polymerase II (class D), human polymerase beta (class x), E. coli UmuC/DinB and eukaryotic RAP 30/Xeroderma pigmentosum variant (class Y). Family A polymerase functions primarily to fill DNA gaps that arise during DNA repair, recombination and replication. Pol theta is an exception among family A polymerases and generates processive single base substitutions. Family A polymerase are found primarily in organisms related to prokaryotes and include prokaryotic DNA polymerase I (pol I) ,mitochondrial polymerase delta, and several bacteriphage polymerases including those from odd-numbered phage (T3, T5, and T7). Prokaryotic Pol Is have two functional domains located on the same polypeptide; a 5'-3' polymerase and 5'-3' exonuclease. Pol I uses its 5' nuclease activity to remove the ribonucleotide portion of newly synthesized Okazaki fragments and DNA polymerase activity to fill in the resulting gap. Polymerase theta mostly has amino-terminal helicase domain, a carboxy-terminal polymerase domain and an intervening space region. Length = 373 Score = 26.0 bits (58), Expect = 7.6 Identities = 15/53 (28%), Positives = 19/53 (35%), Gaps = 13/53 (24%) Query: 86 PGIHSARWAESNTGER------------D-FDMAMQKIENALRSKFAHDPAFR 125 P I+S +E ER D +AM I L S + PA R Sbjct: 267 PEINSGNSSERAQAERQAVNTVIQGSAADIMKIAMINIHEKLHSLLPNLPAGR 319 >gnl|CDD|48437 cd01902, Ntn_CGH, Choloylglycine hydrolase (CGH) is a bile salt-modifying enzyme that hydrolyzes non-peptide carbon-nitrogen bonds in choloylglycine and choloyltaurine, both of which are present in bile. CGH is present in a number of probiotic microbial organisms that inhabit the gut. CGH has an N-terminal nucleophilic cysteine, as do other members of the Ntn hydrolase family to which CGH belongs.. Length = 291 Score = 25.6 bits (56), Expect = 9.9 Identities = 13/54 (24%), Positives = 23/54 (42%), Gaps = 4/54 (7%) Query: 70 ALSDDSG--LVIDVLDGKPGIHSARWAESNTGERDFDMAMQKIENALRSKFAHD 121 ++SD +G +I+ +DGK IH + + T +D Q N + Sbjct: 143 SISDATGDSAIIEYIDGKLVIHHGKQYQVMTNSPTYD--QQLALNKYWKQEKDI 194 Database: CddA Posted date: Feb 4, 2011 9:38 PM Number of letters in database: 6,263,737 Number of sequences in database: 21,609 Lambda K H 0.318 0.135 0.400 Gapped Lambda K H 0.267 0.0782 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21609 Number of Hits to DB: 2,729,282 Number of extensions: 141605 Number of successful extensions: 319 Number of sequences better than 10.0: 1 Number of HSP's gapped: 307 Number of HSP's successfully gapped: 22 Length of query: 224 Length of database: 6,263,737 Length adjustment: 90 Effective length of query: 134 Effective length of database: 4,318,927 Effective search space: 578736218 Effective search space used: 578736218 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 55 (24.9 bits)