RPS-BLAST 2.2.22 [Sep-27-2009]
Database: CddA
21,609 sequences; 6,263,737 total letters
Searching..................................................done
Query= gi|254780286|ref|YP_003064699.1| deoxyguanosinetriphosphate
triphosphohydrolase-like protein [Candidatus Liberibacter asiaticus
str. psy62]
(410 letters)
>gnl|CDD|30581 COG0232, Dgt, dGTP triphosphohydrolase [Nucleotide transport and
metabolism].
Length = 412
Score = 365 bits (937), Expect = e-101
Identities = 179/404 (44%), Positives = 242/404 (59%), Gaps = 18/404 (4%)
Query: 12 QKKVAYAADPTQSLGRMYPEKRSLTRSEFQRDRDRMIHTTAFRRLKDKTQVFFHRQRDHY 71
K A DP++S GR++PE S TRSEF+RDRDR+IH+ AFRRL+DKTQVF + D Y
Sbjct: 7 DKIRWQARDPSESRGRLHPETESDTRSEFERDRDRIIHSAAFRRLQDKTQVFPLHEGDFY 66
Query: 72 RTRLMHTIEVSQIARSLARALRID-----EDLVEAIALAHDFGHPPFGHVGEDVLQELLS 126
RTRL H++EV+QI RS+AR L +D EDLVE LAHD GHPPFGH GED L E++
Sbjct: 67 RTRLTHSLEVAQIGRSIARELGLDLDLPFEDLVETACLAHDIGHPPFGHAGEDALNEVMR 126
Query: 127 SYGGFDHNIQSFRIVTELECSYADFDGINLTWETLEGLIGHNGPILPQDLDKPRIIP--R 184
YGGF+ N Q+FRI+T+LE YADF G+NLT TLEG++ + GP+L R
Sbjct: 127 EYGGFEGNAQTFRILTKLEKRYADFGGLNLTRATLEGVVKYPGPLLYAMKKGGRFGLYEE 186
Query: 185 IFSDYYHIHGLSLANFASLEGQVAAIADDIAYDAHDIDDGVRAGLLTVDMLKEISFLEK- 243
DY + L SLE Q+ +ADDIAY HD++DG+R GL+T++ L E+ L
Sbjct: 187 DILDYDWLLDLLEEVTRSLEAQIMDLADDIAYSVHDLEDGLRKGLITLEDLIELPLLALE 246
Query: 244 -HIASLHDLYGHLDDKRL---VHELVRRQITAMVEDVITVSQKRIAHLKPHAIHDIRSA- 298
I L+ L L +EL+RR I + D+ V+++ + I D+R+A
Sbjct: 247 DAIVKGRVLFPELVSAALEVAANELIRRSIQQLFADLRVVTKELVGAAAQAFIDDVRAAD 306
Query: 299 ----GYRIIDFSDEMTLVDKEIKSMLVKYVYRHPSIMTCCNQIANVIRNLFSAYMSD-PR 353
+ +++FS E + K +K L K+VYRHP + + +I +LF A+M D PR
Sbjct: 307 AGTFRHALLEFSSEAAALLKVLKKFLFKHVYRHPRVERIEYKGYRIITDLFEAFMPDAPR 366
Query: 354 KMRGCNQLEYERDMTDSIKARHVGDYLAGMTDSYAIREHHILFG 397
+ Y D+ KAR V DY+AGMTDSYA+RE+ LFG
Sbjct: 367 LLPPNYLKAYREAADDAAKARVVCDYIAGMTDSYALREYRRLFG 410
>gnl|CDD|31276 COG1078, COG1078, HD superfamily phosphohydrolases [General
function prediction only].
Length = 421
Score = 45.4 bits (107), Expect = 3e-05
Identities = 31/96 (32%), Positives = 43/96 (44%), Gaps = 15/96 (15%)
Query: 46 RMIHTTAFRRLKDKTQV----FFHRQRDHYRTRLMHTIEVSQIARSLARALRIDED---- 97
+I T F+RL+ Q+ + +H TR H++ V +AR L L + +
Sbjct: 22 ELIDTPEFQRLRRIKQLGLAYLVYPGANH--TRFEHSLGVYHLARRLLEHLEKNSEEEID 79
Query: 98 -----LVEAIALAHDFGHPPFGHVGEDVLQELLSSY 128
LV AL HD GH PF H E VL + L Y
Sbjct: 80 EEERLLVRLAALLHDIGHGPFSHTFEYVLDKNLGFY 115
>gnl|CDD|145246 pfam01966, HD, HD domain. HD domains are metal dependent
phosphohydrolases.
Length = 108
Score = 44.5 bits (105), Expect = 6e-05
Identities = 25/72 (34%), Positives = 38/72 (52%), Gaps = 5/72 (6%)
Query: 74 RLMHTIEVSQIARSLARALRIDEDLVEAIALAHDFGHPPFGHVGEDVLQELLSSYGGFDH 133
L H++ V+ +AR LA L +D +L+ AL HD G PPFG + ++ E+ H
Sbjct: 1 VLEHSLRVALLARELAEELGLDRELLLLAALLHDIGKPPFGFLEKEEDFEIF-----LSH 55
Query: 134 NIQSFRIVTELE 145
+ I+ ELE
Sbjct: 56 AVVGAEILRELE 67
>gnl|CDD|28958 cd00077, HDc, Metal dependent phosphohydrolases with conserved 'HD'
motif.
Length = 145
Score = 38.1 bits (87), Expect = 0.004
Identities = 25/77 (32%), Positives = 34/77 (44%), Gaps = 8/77 (10%)
Query: 72 RTRLMHTIEVSQIARSLARALRI---DEDLVEAIALAHDFGHPPFGHVGEDVLQELLSSY 128
R H++ V+Q+AR LA L + D +L+ AL HD G P + EL
Sbjct: 1 EHRFEHSLRVAQLARRLAEELGLSEEDIELLRLAALLHDIGKPGTPDAITEEESEL---- 56
Query: 129 GGFDHNIQSFRIVTELE 145
DH I I+ EL
Sbjct: 57 -EKDHAIVGAEILRELL 72
>gnl|CDD|37892 KOG2681, KOG2681, KOG2681, Metal-dependent phosphohydrolase
[Function unknown].
Length = 498
Score = 37.7 bits (87), Expect = 0.007
Identities = 29/112 (25%), Positives = 49/112 (43%), Gaps = 17/112 (15%)
Query: 45 DRMIHTTAFRRLKDKTQV----FFHRQRDHYRTRLMHTIEVSQIARSLARALR------- 93
++I T F+RL+ Q+ + +H +R H++ +A L AL
Sbjct: 43 IKLIDTPLFQRLRHVKQLGLRYLVYPGANH--SRFEHSLGTYTLAGILVNALNKNQCPEL 100
Query: 94 -IDEDLVEAIALA---HDFGHPPFGHVGEDVLQELLSSYGGFDHNIQSFRIV 141
I E ++A+ +A HD GH PF H+ E ++ S F H S ++
Sbjct: 101 CITEVDLQAVQVAALLHDIGHGPFSHLFEGEFTPMVRSGPEFYHEDMSIDMI 152
>gnl|CDD|31608 COG1418, COG1418, Predicted HD superfamily hydrolase [General
function prediction only].
Length = 222
Score = 36.6 bits (84), Expect = 0.011
Identities = 14/48 (29%), Positives = 23/48 (47%)
Query: 64 FHRQRDHYRTRLMHTIEVSQIARSLARALRIDEDLVEAIALAHDFGHP 111
+ R + + L H++ V+ +A +A +D DL AL HD G
Sbjct: 27 RLKFRTYGQHVLEHSLRVAYLAYRIAEEEGVDPDLALRAALLHDIGKA 74
>gnl|CDD|31899 COG1713, COG1713, Predicted HD superfamily hydrolase involved in
NAD metabolism [Coenzyme metabolism].
Length = 187
Score = 30.6 bits (69), Expect = 0.91
Identities = 10/36 (27%), Positives = 16/36 (44%)
Query: 74 RLMHTIEVSQIARSLARALRIDEDLVEAIALAHDFG 109
R H + V++ A LA A +D + + HD
Sbjct: 18 RFEHCLGVAETAIELAEAYGLDPEKAYLAGILHDIA 53
>gnl|CDD|143346 cd07841, STKc_CDK7, Catalytic domain of the Serine/Threonine
Kinase, Cyclin-Dependent protein Kinase 7.
Serine/Threonine Kinases (STKs), Cyclin-Dependent
protein Kinase 7 (CDK7) subfamily, catalytic (c) domain.
STKs catalyze the transfer of the gamma-phosphoryl group
from ATP to serine/threonine residues on protein
substrates. The CDK7 subfamily is part of a larger
superfamily that includes the catalytic domains of other
protein STKs, protein tyrosine kinases, RIO kinases,
aminoglycoside phosphotransferase, choline kinase, and
phosphoinositide 3-kinase. CDKs belong to a large family
of STKs that are regulated by their cognate cyclins.
Together, they are involved in the control of cell-cycle
progression, transcription, and neuronal function. CDK7
plays essential roles in the cell cycle and in
transcription. It associates with cyclin H and MAT1 and
acts as a CDK-Activating Kinase (CAK) by phosphorylating
and activating cell cycle CDKs (CDK1/2/4/6). In the
brain, it activates CDK5. CDK7 is also a component of
the general transcription factor TFIIH, which
phosphorylates the C-terminal domain (CTD) of RNA
polymerase II when it is bound with unphosphorylated
DNA, as present in the pre-initiation complex. Following
phosphorylation, the CTD dissociates from the DNA which
allows transcription initiation.
Length = 298
Score = 28.7 bits (65), Expect = 3.1
Identities = 24/103 (23%), Positives = 41/103 (39%), Gaps = 25/103 (24%)
Query: 204 EGQVAAIADDIAYDAHDIDDG-------VRAGLLTVDM-------LKEISFLEK----HI 245
EG A + Y A D + G ++ G L+EI L++ +I
Sbjct: 10 EGTYAVV-----YKARDKETGRIVAIKKIKLGERKEAKDGINFTALREIKLLQELKHPNI 64
Query: 246 ASLHDLYGHLDDKRLVHELVRRQITAMVEDVITVSQKRIAHLK 288
L D++GH + LV E + + +++D V A +K
Sbjct: 65 IGLLDVFGHKSNINLVFEFMETDLEKVIKDKSIVLTP--ADIK 105
>gnl|CDD|144356 pfam00723, Glyco_hydro_15, Glycosyl hydrolases family 15.
Length = 416
Score = 28.6 bits (64), Expect = 3.5
Identities = 14/50 (28%), Positives = 21/50 (42%), Gaps = 2/50 (4%)
Query: 98 LVEAIALAHDFGHPPFGHVGEDVLQELLS--SYGGFDHNIQSFRIVTELE 145
L+EAI A FG + E E+L+ GGF ++ + E
Sbjct: 189 LLEAIDFAELFGDHGSAQLWESTADEILNKLQGGGFWPLFNTYIQLYGSE 238
>gnl|CDD|176529 cd08587, PI-PLCXDc_like, Catalytic domain of
phosphatidylinositol-specific phospholipase C X domain
containing and similar proteins. This family
corresponds to the catalytic domain present in
phosphatidylinositol-specific phospholipase C X domain
containing proteins (PI-PLCXD) which are bacterial
phosphatidylinositol-specific phospholipase C (PI-PLC,
EC 4.6.1.13) sequence homologs mainly found in
eukaryota. The typical eukaryotic
phosphoinositide-specific phospholipase C (PI-PLC, EC
3.1.4.11) have a multidomain organization that consists
of a PLC catalytic core domain, and various regulatory
domains. The catalytic core domain is assembled from two
highly conserved X- and Y-regions split by a divergent
linker sequence. In contrast, eukaryotic PI-PLCXDs and
their bacterial homologs contain a single TIM-barrel
type catalytic domain, X domain, which is more closely
related to that of bacterial PI-PLCs. Although the
biological function of eukaryotic PI-PLCXDs still
remains unclear, it may be distinct from that of typical
eukaryotic PI-PLCs.
Length = 288
Score = 27.7 bits (62), Expect = 5.7
Identities = 22/129 (17%), Positives = 42/129 (32%), Gaps = 39/129 (30%)
Query: 181 IIPRIFSDYYHIHGLSLANFASLEGQVAAIADDIAYDAHDIDDGVRAGLLTVDMLKEI-S 239
P + Y +HGL Y +D+ L+++
Sbjct: 74 YKPDSENKLYFVHGL--------------------YSGEPVDEV----------LEDVND 103
Query: 240 FLEKH-----IASLHDLYGHLDDKRLVHELVRRQITAMVEDVITVSQKRIAHLKPHAIHD 294
FL++H I + YG D HE + + + D + + + + D
Sbjct: 104 FLDEHPKEVVILDFNHFYGMDDKSPEDHEKLVELLEDIFGDKLCPRDSDLLDVT---LAD 160
Query: 295 IRSAGYRII 303
+ +G R+I
Sbjct: 161 LWESGKRVI 169
>gnl|CDD|100037 cd06061, PurM-like1, AIR synthase (PurM) related protein, subgroup
1 of unknown function. The family of PurM related
proteins includes Hydrogen expression/formation protein
HypE, AIR synthases, FGAM synthase and Selenophosphate
synthetase (SelD). They all contain two conserved
domains and seem to dimerize. The N-terminal domain
forms the dimer interface and is a putative ATP binding
domain..
Length = 298
Score = 27.6 bits (62), Expect = 6.3
Identities = 14/58 (24%), Positives = 26/58 (44%), Gaps = 11/58 (18%)
Query: 271 AMVEDVITVSQKRIAHLKPHAIH----DIRSAGYRIIDFSDEMTL----VDKEIKSML 320
+ D IT + K L A+H DI ++G R + L ++E+K+++
Sbjct: 46 VVSTDPITGAGKDAGWL---AVHIAANDIATSGARPRWLLVTLLLPPGTDEEELKAIM 100
>gnl|CDD|32470 COG2316, COG2316, Predicted hydrolase (HD superfamily) [General
function prediction only].
Length = 212
Score = 27.3 bits (60), Expect = 7.6
Identities = 18/71 (25%), Positives = 28/71 (39%), Gaps = 10/71 (14%)
Query: 48 IHTTAFRRLKDKTQVFFHRQRD--------HYRTRLM--HTIEVSQIARSLARALRIDED 97
+H+ F + +T + RD H + + H + V + R LAR DE+
Sbjct: 12 LHSGLFLTTRRRTTIMAAINRDEAYELLKEHVPSESLQKHCLAVEAVMRWLAREWGEDEE 71
Query: 98 LVEAIALAHDF 108
L HDF
Sbjct: 72 KWAVTGLLHDF 82
>gnl|CDD|35559 KOG0338, KOG0338, KOG0338, ATP-dependent RNA helicase [RNA
processing and modification].
Length = 691
Score = 27.3 bits (60), Expect = 8.3
Identities = 18/87 (20%), Positives = 40/87 (45%), Gaps = 3/87 (3%)
Query: 12 QKKVAYAADPTQSLGRMYPEKRSLTRSEFQRDRDRMIHTTAFRRLKDKTQVFFHRQRDHY 71
K V DP + ++ R + + DR+ M+ + R +D+T VF ++ +
Sbjct: 381 NKPVRIFVDPNKDTAPKLTQEFIRIRPKREGDREAMLASLITRTFQDRTIVFVRTKKQAH 440
Query: 72 RTRLMH---TIEVSQIARSLARALRID 95
R R++ ++ ++ SL + R++
Sbjct: 441 RLRILLGLLGLKAGELHGSLTQEQRLE 467
>gnl|CDD|143924 pfam00156, Pribosyltran, Phosphoribosyl transferase domain. This
family includes a range of diverse phosphoribosyl
transferase enzymes. This family includes: Adenine
phosphoribosyltransferase EC:2.4.2.7.
Hypoxanthine-guanine-xanthine phosphoribosyltransferase.
Hypoxanthine phosphoribosyltransferase EC:2.4.2.8.
Ribose-phosphate pyrophosphokinase i EC:2.7.6.1.
Amidophosphoribosyltransferase EC:2.4.2.14. Orotate
phosphoribosyltransferase EC:2.4.2.10. Uracil
phosphoribosyltransferase EC:2.4.2.9. Xanthine-guanine
phosphoribosyltransferase EC:2.4.2.22. In Arabidopsis,
At the very N-terminus of this domain is the
ATP-GTP-binding domain.
Length = 123
Score = 27.3 bits (61), Expect = 8.8
Identities = 16/82 (19%), Positives = 26/82 (31%), Gaps = 15/82 (18%)
Query: 176 LDKPRIIPRIFSDYYHIHG--------LSLANFASLEGQVAAIADDIAYDAHDIDDGVRA 227
L P ++ R L L+G+ I DD+ +D G
Sbjct: 49 LGIPLVLVRKRISLPSSIKSRGGESVTLLSRLPELLKGKRVLIVDDV------LDTGGTL 102
Query: 228 GLLTVDMLKEISFLEKHIASLH 249
V++LK+ +A L
Sbjct: 103 -RAAVELLKKAGAKVVGVAVLV 123
>gnl|CDD|32165 COG1982, LdcC, Arginine/lysine/ornithine decarboxylases [Amino acid
transport and metabolism].
Length = 557
Score = 27.2 bits (60), Expect = 9.7
Identities = 28/125 (22%), Positives = 42/125 (33%), Gaps = 4/125 (3%)
Query: 143 ELECSYADFDGINLTWETLEGLIGHNGPILPQDLDKPRIIPRIFSDYYHIHGLSLANFAS 202
+ D LT T E G G I+ + L + IIP DY ++ S
Sbjct: 334 DYADDQYFLDPTKLTITTPE--FGIPGAIVAKYLREHGIIPEETGDYSNLLLFSPG-IGK 390
Query: 203 LEGQVAAIAD-DIAYDAHDIDDGVRAGLLTVDMLKEISFLEKHIASLHDLYGHLDDKRLV 261
+ V + + YD V L + + +HDLY L+ K L
Sbjct: 391 WQTLVDRLLEFKRRYDDDAPLKQVLPSLYDQYPRYRGMGIRQLCQEMHDLYVSLNVKALQ 450
Query: 262 HELVR 266
+R
Sbjct: 451 DAALR 455
Database: CddA
Posted date: Feb 4, 2011 9:38 PM
Number of letters in database: 6,263,737
Number of sequences in database: 21,609
Lambda K H
0.325 0.140 0.415
Gapped
Lambda K H
0.267 0.0704 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 21609
Number of Hits to DB: 5,186,105
Number of extensions: 286592
Number of successful extensions: 960
Number of sequences better than 10.0: 1
Number of HSP's gapped: 955
Number of HSP's successfully gapped: 21
Length of query: 410
Length of database: 6,263,737
Length adjustment: 96
Effective length of query: 314
Effective length of database: 4,189,273
Effective search space: 1315431722
Effective search space used: 1315431722
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 59 (26.6 bits)