RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddA 21,609 sequences; 6,263,737 total letters Searching..................................................done Query= gi|254780286|ref|YP_003064699.1| deoxyguanosinetriphosphate triphosphohydrolase-like protein [Candidatus Liberibacter asiaticus str. psy62] (410 letters) >gnl|CDD|30581 COG0232, Dgt, dGTP triphosphohydrolase [Nucleotide transport and metabolism]. Length = 412 Score = 365 bits (937), Expect = e-101 Identities = 179/404 (44%), Positives = 242/404 (59%), Gaps = 18/404 (4%) Query: 12 QKKVAYAADPTQSLGRMYPEKRSLTRSEFQRDRDRMIHTTAFRRLKDKTQVFFHRQRDHY 71 K A DP++S GR++PE S TRSEF+RDRDR+IH+ AFRRL+DKTQVF + D Y Sbjct: 7 DKIRWQARDPSESRGRLHPETESDTRSEFERDRDRIIHSAAFRRLQDKTQVFPLHEGDFY 66 Query: 72 RTRLMHTIEVSQIARSLARALRID-----EDLVEAIALAHDFGHPPFGHVGEDVLQELLS 126 RTRL H++EV+QI RS+AR L +D EDLVE LAHD GHPPFGH GED L E++ Sbjct: 67 RTRLTHSLEVAQIGRSIARELGLDLDLPFEDLVETACLAHDIGHPPFGHAGEDALNEVMR 126 Query: 127 SYGGFDHNIQSFRIVTELECSYADFDGINLTWETLEGLIGHNGPILPQDLDKPRIIP--R 184 YGGF+ N Q+FRI+T+LE YADF G+NLT TLEG++ + GP+L R Sbjct: 127 EYGGFEGNAQTFRILTKLEKRYADFGGLNLTRATLEGVVKYPGPLLYAMKKGGRFGLYEE 186 Query: 185 IFSDYYHIHGLSLANFASLEGQVAAIADDIAYDAHDIDDGVRAGLLTVDMLKEISFLEK- 243 DY + L SLE Q+ +ADDIAY HD++DG+R GL+T++ L E+ L Sbjct: 187 DILDYDWLLDLLEEVTRSLEAQIMDLADDIAYSVHDLEDGLRKGLITLEDLIELPLLALE 246 Query: 244 -HIASLHDLYGHLDDKRL---VHELVRRQITAMVEDVITVSQKRIAHLKPHAIHDIRSA- 298 I L+ L L +EL+RR I + D+ V+++ + I D+R+A Sbjct: 247 DAIVKGRVLFPELVSAALEVAANELIRRSIQQLFADLRVVTKELVGAAAQAFIDDVRAAD 306 Query: 299 ----GYRIIDFSDEMTLVDKEIKSMLVKYVYRHPSIMTCCNQIANVIRNLFSAYMSD-PR 353 + +++FS E + K +K L K+VYRHP + + +I +LF A+M D PR Sbjct: 307 AGTFRHALLEFSSEAAALLKVLKKFLFKHVYRHPRVERIEYKGYRIITDLFEAFMPDAPR 366 Query: 354 KMRGCNQLEYERDMTDSIKARHVGDYLAGMTDSYAIREHHILFG 397 + Y D+ KAR V DY+AGMTDSYA+RE+ LFG Sbjct: 367 LLPPNYLKAYREAADDAAKARVVCDYIAGMTDSYALREYRRLFG 410 >gnl|CDD|31276 COG1078, COG1078, HD superfamily phosphohydrolases [General function prediction only]. Length = 421 Score = 45.4 bits (107), Expect = 3e-05 Identities = 31/96 (32%), Positives = 43/96 (44%), Gaps = 15/96 (15%) Query: 46 RMIHTTAFRRLKDKTQV----FFHRQRDHYRTRLMHTIEVSQIARSLARALRIDED---- 97 +I T F+RL+ Q+ + +H TR H++ V +AR L L + + Sbjct: 22 ELIDTPEFQRLRRIKQLGLAYLVYPGANH--TRFEHSLGVYHLARRLLEHLEKNSEEEID 79 Query: 98 -----LVEAIALAHDFGHPPFGHVGEDVLQELLSSY 128 LV AL HD GH PF H E VL + L Y Sbjct: 80 EEERLLVRLAALLHDIGHGPFSHTFEYVLDKNLGFY 115 >gnl|CDD|145246 pfam01966, HD, HD domain. HD domains are metal dependent phosphohydrolases. Length = 108 Score = 44.5 bits (105), Expect = 6e-05 Identities = 25/72 (34%), Positives = 38/72 (52%), Gaps = 5/72 (6%) Query: 74 RLMHTIEVSQIARSLARALRIDEDLVEAIALAHDFGHPPFGHVGEDVLQELLSSYGGFDH 133 L H++ V+ +AR LA L +D +L+ AL HD G PPFG + ++ E+ H Sbjct: 1 VLEHSLRVALLARELAEELGLDRELLLLAALLHDIGKPPFGFLEKEEDFEIF-----LSH 55 Query: 134 NIQSFRIVTELE 145 + I+ ELE Sbjct: 56 AVVGAEILRELE 67 >gnl|CDD|28958 cd00077, HDc, Metal dependent phosphohydrolases with conserved 'HD' motif. Length = 145 Score = 38.1 bits (87), Expect = 0.004 Identities = 25/77 (32%), Positives = 34/77 (44%), Gaps = 8/77 (10%) Query: 72 RTRLMHTIEVSQIARSLARALRI---DEDLVEAIALAHDFGHPPFGHVGEDVLQELLSSY 128 R H++ V+Q+AR LA L + D +L+ AL HD G P + EL Sbjct: 1 EHRFEHSLRVAQLARRLAEELGLSEEDIELLRLAALLHDIGKPGTPDAITEEESEL---- 56 Query: 129 GGFDHNIQSFRIVTELE 145 DH I I+ EL Sbjct: 57 -EKDHAIVGAEILRELL 72 >gnl|CDD|37892 KOG2681, KOG2681, KOG2681, Metal-dependent phosphohydrolase [Function unknown]. Length = 498 Score = 37.7 bits (87), Expect = 0.007 Identities = 29/112 (25%), Positives = 49/112 (43%), Gaps = 17/112 (15%) Query: 45 DRMIHTTAFRRLKDKTQV----FFHRQRDHYRTRLMHTIEVSQIARSLARALR------- 93 ++I T F+RL+ Q+ + +H +R H++ +A L AL Sbjct: 43 IKLIDTPLFQRLRHVKQLGLRYLVYPGANH--SRFEHSLGTYTLAGILVNALNKNQCPEL 100 Query: 94 -IDEDLVEAIALA---HDFGHPPFGHVGEDVLQELLSSYGGFDHNIQSFRIV 141 I E ++A+ +A HD GH PF H+ E ++ S F H S ++ Sbjct: 101 CITEVDLQAVQVAALLHDIGHGPFSHLFEGEFTPMVRSGPEFYHEDMSIDMI 152 >gnl|CDD|31608 COG1418, COG1418, Predicted HD superfamily hydrolase [General function prediction only]. Length = 222 Score = 36.6 bits (84), Expect = 0.011 Identities = 14/48 (29%), Positives = 23/48 (47%) Query: 64 FHRQRDHYRTRLMHTIEVSQIARSLARALRIDEDLVEAIALAHDFGHP 111 + R + + L H++ V+ +A +A +D DL AL HD G Sbjct: 27 RLKFRTYGQHVLEHSLRVAYLAYRIAEEEGVDPDLALRAALLHDIGKA 74 >gnl|CDD|31899 COG1713, COG1713, Predicted HD superfamily hydrolase involved in NAD metabolism [Coenzyme metabolism]. Length = 187 Score = 30.6 bits (69), Expect = 0.91 Identities = 10/36 (27%), Positives = 16/36 (44%) Query: 74 RLMHTIEVSQIARSLARALRIDEDLVEAIALAHDFG 109 R H + V++ A LA A +D + + HD Sbjct: 18 RFEHCLGVAETAIELAEAYGLDPEKAYLAGILHDIA 53 >gnl|CDD|143346 cd07841, STKc_CDK7, Catalytic domain of the Serine/Threonine Kinase, Cyclin-Dependent protein Kinase 7. Serine/Threonine Kinases (STKs), Cyclin-Dependent protein Kinase 7 (CDK7) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The CDK7 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. CDKs belong to a large family of STKs that are regulated by their cognate cyclins. Together, they are involved in the control of cell-cycle progression, transcription, and neuronal function. CDK7 plays essential roles in the cell cycle and in transcription. It associates with cyclin H and MAT1 and acts as a CDK-Activating Kinase (CAK) by phosphorylating and activating cell cycle CDKs (CDK1/2/4/6). In the brain, it activates CDK5. CDK7 is also a component of the general transcription factor TFIIH, which phosphorylates the C-terminal domain (CTD) of RNA polymerase II when it is bound with unphosphorylated DNA, as present in the pre-initiation complex. Following phosphorylation, the CTD dissociates from the DNA which allows transcription initiation. Length = 298 Score = 28.7 bits (65), Expect = 3.1 Identities = 24/103 (23%), Positives = 41/103 (39%), Gaps = 25/103 (24%) Query: 204 EGQVAAIADDIAYDAHDIDDG-------VRAGLLTVDM-------LKEISFLEK----HI 245 EG A + Y A D + G ++ G L+EI L++ +I Sbjct: 10 EGTYAVV-----YKARDKETGRIVAIKKIKLGERKEAKDGINFTALREIKLLQELKHPNI 64 Query: 246 ASLHDLYGHLDDKRLVHELVRRQITAMVEDVITVSQKRIAHLK 288 L D++GH + LV E + + +++D V A +K Sbjct: 65 IGLLDVFGHKSNINLVFEFMETDLEKVIKDKSIVLTP--ADIK 105 >gnl|CDD|144356 pfam00723, Glyco_hydro_15, Glycosyl hydrolases family 15. Length = 416 Score = 28.6 bits (64), Expect = 3.5 Identities = 14/50 (28%), Positives = 21/50 (42%), Gaps = 2/50 (4%) Query: 98 LVEAIALAHDFGHPPFGHVGEDVLQELLS--SYGGFDHNIQSFRIVTELE 145 L+EAI A FG + E E+L+ GGF ++ + E Sbjct: 189 LLEAIDFAELFGDHGSAQLWESTADEILNKLQGGGFWPLFNTYIQLYGSE 238 >gnl|CDD|176529 cd08587, PI-PLCXDc_like, Catalytic domain of phosphatidylinositol-specific phospholipase C X domain containing and similar proteins. This family corresponds to the catalytic domain present in phosphatidylinositol-specific phospholipase C X domain containing proteins (PI-PLCXD) which are bacterial phosphatidylinositol-specific phospholipase C (PI-PLC, EC 4.6.1.13) sequence homologs mainly found in eukaryota. The typical eukaryotic phosphoinositide-specific phospholipase C (PI-PLC, EC 3.1.4.11) have a multidomain organization that consists of a PLC catalytic core domain, and various regulatory domains. The catalytic core domain is assembled from two highly conserved X- and Y-regions split by a divergent linker sequence. In contrast, eukaryotic PI-PLCXDs and their bacterial homologs contain a single TIM-barrel type catalytic domain, X domain, which is more closely related to that of bacterial PI-PLCs. Although the biological function of eukaryotic PI-PLCXDs still remains unclear, it may be distinct from that of typical eukaryotic PI-PLCs. Length = 288 Score = 27.7 bits (62), Expect = 5.7 Identities = 22/129 (17%), Positives = 42/129 (32%), Gaps = 39/129 (30%) Query: 181 IIPRIFSDYYHIHGLSLANFASLEGQVAAIADDIAYDAHDIDDGVRAGLLTVDMLKEI-S 239 P + Y +HGL Y +D+ L+++ Sbjct: 74 YKPDSENKLYFVHGL--------------------YSGEPVDEV----------LEDVND 103 Query: 240 FLEKH-----IASLHDLYGHLDDKRLVHELVRRQITAMVEDVITVSQKRIAHLKPHAIHD 294 FL++H I + YG D HE + + + D + + + + D Sbjct: 104 FLDEHPKEVVILDFNHFYGMDDKSPEDHEKLVELLEDIFGDKLCPRDSDLLDVT---LAD 160 Query: 295 IRSAGYRII 303 + +G R+I Sbjct: 161 LWESGKRVI 169 >gnl|CDD|100037 cd06061, PurM-like1, AIR synthase (PurM) related protein, subgroup 1 of unknown function. The family of PurM related proteins includes Hydrogen expression/formation protein HypE, AIR synthases, FGAM synthase and Selenophosphate synthetase (SelD). They all contain two conserved domains and seem to dimerize. The N-terminal domain forms the dimer interface and is a putative ATP binding domain.. Length = 298 Score = 27.6 bits (62), Expect = 6.3 Identities = 14/58 (24%), Positives = 26/58 (44%), Gaps = 11/58 (18%) Query: 271 AMVEDVITVSQKRIAHLKPHAIH----DIRSAGYRIIDFSDEMTL----VDKEIKSML 320 + D IT + K L A+H DI ++G R + L ++E+K+++ Sbjct: 46 VVSTDPITGAGKDAGWL---AVHIAANDIATSGARPRWLLVTLLLPPGTDEEELKAIM 100 >gnl|CDD|32470 COG2316, COG2316, Predicted hydrolase (HD superfamily) [General function prediction only]. Length = 212 Score = 27.3 bits (60), Expect = 7.6 Identities = 18/71 (25%), Positives = 28/71 (39%), Gaps = 10/71 (14%) Query: 48 IHTTAFRRLKDKTQVFFHRQRD--------HYRTRLM--HTIEVSQIARSLARALRIDED 97 +H+ F + +T + RD H + + H + V + R LAR DE+ Sbjct: 12 LHSGLFLTTRRRTTIMAAINRDEAYELLKEHVPSESLQKHCLAVEAVMRWLAREWGEDEE 71 Query: 98 LVEAIALAHDF 108 L HDF Sbjct: 72 KWAVTGLLHDF 82 >gnl|CDD|35559 KOG0338, KOG0338, KOG0338, ATP-dependent RNA helicase [RNA processing and modification]. Length = 691 Score = 27.3 bits (60), Expect = 8.3 Identities = 18/87 (20%), Positives = 40/87 (45%), Gaps = 3/87 (3%) Query: 12 QKKVAYAADPTQSLGRMYPEKRSLTRSEFQRDRDRMIHTTAFRRLKDKTQVFFHRQRDHY 71 K V DP + ++ R + + DR+ M+ + R +D+T VF ++ + Sbjct: 381 NKPVRIFVDPNKDTAPKLTQEFIRIRPKREGDREAMLASLITRTFQDRTIVFVRTKKQAH 440 Query: 72 RTRLMH---TIEVSQIARSLARALRID 95 R R++ ++ ++ SL + R++ Sbjct: 441 RLRILLGLLGLKAGELHGSLTQEQRLE 467 >gnl|CDD|143924 pfam00156, Pribosyltran, Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7. Hypoxanthine-guanine-xanthine phosphoribosyltransferase. Hypoxanthine phosphoribosyltransferase EC:2.4.2.8. Ribose-phosphate pyrophosphokinase i EC:2.7.6.1. Amidophosphoribosyltransferase EC:2.4.2.14. Orotate phosphoribosyltransferase EC:2.4.2.10. Uracil phosphoribosyltransferase EC:2.4.2.9. Xanthine-guanine phosphoribosyltransferase EC:2.4.2.22. In Arabidopsis, At the very N-terminus of this domain is the ATP-GTP-binding domain. Length = 123 Score = 27.3 bits (61), Expect = 8.8 Identities = 16/82 (19%), Positives = 26/82 (31%), Gaps = 15/82 (18%) Query: 176 LDKPRIIPRIFSDYYHIHG--------LSLANFASLEGQVAAIADDIAYDAHDIDDGVRA 227 L P ++ R L L+G+ I DD+ +D G Sbjct: 49 LGIPLVLVRKRISLPSSIKSRGGESVTLLSRLPELLKGKRVLIVDDV------LDTGGTL 102 Query: 228 GLLTVDMLKEISFLEKHIASLH 249 V++LK+ +A L Sbjct: 103 -RAAVELLKKAGAKVVGVAVLV 123 >gnl|CDD|32165 COG1982, LdcC, Arginine/lysine/ornithine decarboxylases [Amino acid transport and metabolism]. Length = 557 Score = 27.2 bits (60), Expect = 9.7 Identities = 28/125 (22%), Positives = 42/125 (33%), Gaps = 4/125 (3%) Query: 143 ELECSYADFDGINLTWETLEGLIGHNGPILPQDLDKPRIIPRIFSDYYHIHGLSLANFAS 202 + D LT T E G G I+ + L + IIP DY ++ S Sbjct: 334 DYADDQYFLDPTKLTITTPE--FGIPGAIVAKYLREHGIIPEETGDYSNLLLFSPG-IGK 390 Query: 203 LEGQVAAIAD-DIAYDAHDIDDGVRAGLLTVDMLKEISFLEKHIASLHDLYGHLDDKRLV 261 + V + + YD V L + + +HDLY L+ K L Sbjct: 391 WQTLVDRLLEFKRRYDDDAPLKQVLPSLYDQYPRYRGMGIRQLCQEMHDLYVSLNVKALQ 450 Query: 262 HELVR 266 +R Sbjct: 451 DAALR 455 Database: CddA Posted date: Feb 4, 2011 9:38 PM Number of letters in database: 6,263,737 Number of sequences in database: 21,609 Lambda K H 0.325 0.140 0.415 Gapped Lambda K H 0.267 0.0704 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21609 Number of Hits to DB: 5,186,105 Number of extensions: 286592 Number of successful extensions: 960 Number of sequences better than 10.0: 1 Number of HSP's gapped: 955 Number of HSP's successfully gapped: 21 Length of query: 410 Length of database: 6,263,737 Length adjustment: 96 Effective length of query: 314 Effective length of database: 4,189,273 Effective search space: 1315431722 Effective search space used: 1315431722 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 15 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.6 bits) S2: 59 (26.6 bits)