RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= 021052
(318 letters)
>gnl|CDD|215285 PLN02521, PLN02521, galactokinase.
Length = 497
Score = 475 bits (1224), Expect = e-168
Identities = 192/253 (75%), Positives = 220/253 (86%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECEQFIGTQSGGMDQAISIMAKSG 85
GSGLSSS A VCS+ +A+MAA G+ KKE+AQ TC+CE+ IGTQSGGMDQAISIMA+ G
Sbjct: 161 GSGLSSSAALVCSAAIAIMAALGLNFTKKEVAQFTCKCERHIGTQSGGMDQAISIMAQQG 220
Query: 86 FAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTAIVLAIKL 145
A+LIDFNP+R TDVQLPAGGTFV+A+SLAES KA+TAA+NYNNRVVECRL AIVLA+KL
Sbjct: 221 VAKLIDFNPVRATDVQLPAGGTFVIANSLAESNKAVTAATNYNNRVVECRLAAIVLAVKL 280
Query: 146 GMKPQEAISKVKTLSDVEGLCVAFACKNGSSDPVFAVKEFLRKEPYTALDIEKITEEKLT 205
GM +EAISKVKTLSDVEGLCV+FA +GSSDP AVKE L + PYTA +IE+I E LT
Sbjct: 281 GMSAEEAISKVKTLSDVEGLCVSFAGSHGSSDPAVAVKELLHEGPYTAEEIEEILGESLT 340
Query: 206 SIFANSSSSLDVLNAAKQYKLHQRAAHVYSEAKRVHAFKDTVSSNLSEEDKLKKLGDLMN 265
SIF NS +SL VL AAK +KLHQRA HVYSEAKRVHAF+DTVSS+LSEE+KLKKLGDLMN
Sbjct: 341 SIFKNSPTSLAVLKAAKHFKLHQRAVHVYSEAKRVHAFRDTVSSSLSEEEKLKKLGDLMN 400
Query: 266 DSHHSCSVLYECS 278
+SH+SCSVLYECS
Sbjct: 401 ESHYSCSVLYECS 413
>gnl|CDD|223231 COG0153, GalK, Galactokinase [Carbohydrate transport and
metabolism].
Length = 390
Score = 117 bits (294), Expect = 5e-30
Identities = 66/256 (25%), Positives = 99/256 (38%), Gaps = 67/256 (26%)
Query: 25 LGSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAK 83
+G+GLSSS A + +AL F + + K E+A++ E QF+G G MDQ S K
Sbjct: 126 IGAGLSSSAALEVAVALALQRLFNLPLDKAELAKIAQVAENQFVGVNCGIMDQLASAFGK 185
Query: 84 SGFAELIDFNPIRTTDVQLPAGG-TFVVAHSLAESLKAITAASNYNNRVVECRLTAIVLA 142
A L+D + V P GG + V+ +S K A S YN R EC A L
Sbjct: 186 KDHALLLDCRTLEYEPVPFPVGGVSIVIVNSNV---KRELADSEYNERRAECEEAAEFLG 242
Query: 143 IKLGMKPQEAISKVKTLSDVEGLCVAFACKNGSSDPVFAVKEFLRKEPYTALDIEKITEE 202
+ + K+L DV + + A++ + +P
Sbjct: 243 VSI-----------KSLRDV------------TDEEFAALQAEIEVDP------------ 267
Query: 203 KLTSIFANSSSSLDVLNAAKQYKLHQRAAHVYSEAKRVHAFKDTVSSNLSEEDKLKKLGD 262
K+ +RA HV +E +RV + L + G+
Sbjct: 268 ----------------------KIARRARHVVTENQRV---LEAA--KALRSGDLTEFGE 300
Query: 263 LMNDSHHSCSVLYECS 278
LMN+SH S YE +
Sbjct: 301 LMNESHESLRDDYEVT 316
>gnl|CDD|232841 TIGR00131, gal_kin, galactokinase. Galactokinase is a member of
the GHMP kinases (Galactokinase, Homoserine kinase,
Mevalonate kinase, Phosphomevalonate kinase) and shares
with them an amino-terminal domain probably related to
ATP binding.The galactokinases found by This model are
divided into two sets. Prokaryotic forms are generally
shorter. The eukaryotic forms are longer because of
additional central regions and in some cases are known
to be bifunctional, with regulatory activities that are
independent of galactokinase activity [Energy
metabolism, Sugars].
Length = 386
Score = 104 bits (260), Expect = 2e-25
Identities = 74/261 (28%), Positives = 98/261 (37%), Gaps = 73/261 (27%)
Query: 25 LGSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAK 83
GSGLSSS AF C+ L + + K+I E F+G G MDQA S++ K
Sbjct: 121 TGSGLSSSAAFECAVGAVLQNMGHLPLDSKQILLRIQVAENHFVGVNCGIMDQAASVLGK 180
Query: 84 SGFAELIDFNPIRTTDVQLPAGG-TFVVAHSLAESLKAITAASNYNNRVVECRLTAIVLA 142
A L++ ++ T + P G FV+A++ +K A SNYN R EC A LA
Sbjct: 181 EDHALLVECRSLKATPFKFPQLGIAFVIANTN---VKRTLAPSNYNTRRQECTTAANFLA 237
Query: 143 IKLGMKPQEAISKVKTLSDVEGLCVAFACKNGSSDPVFAVKEFLRKEPYTALDIEKITEE 202
L D Y A I ++T+
Sbjct: 238 ATD----------KGALRDF-------------------------MNEYFARYIARLTK- 261
Query: 203 KLTSIFANSSSSLDVLNAAKQYKLHQRAAHVYSEAKRV----HAFKDTVSSNLSEEDKLK 258
+ L +RA HV SE RV A KD D K
Sbjct: 262 ------------MLPLVE-------ERAKHVVSENLRVLKAVKAMKD--------ND-FK 293
Query: 259 KLGDLMNDSHHSCSVLYECSI 279
+ G LMN+SH SC YEC+
Sbjct: 294 QFGALMNESHASCDDDYECTC 314
>gnl|CDD|235407 PRK05322, PRK05322, galactokinase; Provisional.
Length = 387
Score = 67.2 bits (165), Expect = 2e-12
Identities = 68/290 (23%), Positives = 102/290 (35%), Gaps = 88/290 (30%)
Query: 2 KGETVVIITKFQLFNH-INSLFF-NL--GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIA 57
KG + +H + L + N+ G+GLSSS + + V L F +++ + E+
Sbjct: 96 KGVLKFLQEAGYKIDHGFDILIYGNIPNGAGLSSSASIELLTGVILKDLFNLDLDRLELV 155
Query: 58 QLTCECE-QFIGTQSGGMDQAISIMAKSGFAELIDFNPIRTTDVQLPAGGTFVV------ 110
+L + E +FIG SG MDQ M K A L+D N + V L G +V
Sbjct: 156 KLGQKTENEFIGVNSGIMDQFAIGMGKKDHAILLDCNTLEYEYVPLDLGDYVIVIMNTNK 215
Query: 111 AHSLAESLKAITAASNYNNRVVECRLTAIVLAIKLGMKPQEAISKVKTLSDVEGLCVAFA 170
LA+ S YN R EC ++A+ +++ D++ L
Sbjct: 216 RRELAD--------SKYNERRAEC---------------EKALEELQKKLDIKSLG---- 248
Query: 171 CKNGSSDPVFAVKEFLRKEPYTALDIEKITEEKLTSIFANSSSSLDVLNAAKQYKLHQRA 230
L E+ E ++ K RA
Sbjct: 249 ----------------------ELTEEEFDEYS------------YLIKDETLLK---RA 271
Query: 231 AHVYSEAKRV----HAFKDTVSSNLSEEDKLKKLGDLMNDSHHSCSVLYE 276
H +E +R A K L+K G LMN SH S YE
Sbjct: 272 RHAVTENQRTLKAVKALKA---------GDLEKFGRLMNASHVSLRDDYE 312
>gnl|CDD|235163 PRK03817, PRK03817, galactokinase; Provisional.
Length = 351
Score = 64.2 bits (157), Expect = 1e-11
Identities = 35/111 (31%), Positives = 51/111 (45%), Gaps = 4/111 (3%)
Query: 25 LGSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAK 83
+G+GLSSS + + AL A+ + + K E+A L E E +F+G G MDQ K
Sbjct: 97 IGAGLSSSASLEVAVAYALNEAYNLNLSKLELALLAREAENEFVGVPCGIMDQFAVAFGK 156
Query: 84 SGFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVEC 134
A +D + V P +V + +K A+S YN R EC
Sbjct: 157 KDHAIFLDTMTLEYEYVPFPEDYEILVFDT---GVKRELASSEYNERRQEC 204
Score = 31.1 bits (71), Expect = 0.69
Identities = 19/53 (35%), Positives = 29/53 (54%), Gaps = 5/53 (9%)
Query: 226 LHQRAAHVYSEAKRVHAFKDTVSSNLSEEDKLKKLGDLMNDSHHSCSVLYECS 278
L +RA +V E +RV +D L E D ++ LG+L+ +SH + YE S
Sbjct: 231 LRKRAGYVLRENERVLKVRDA----LKEGD-IETLGELLTESHWDLADNYEVS 278
>gnl|CDD|215839 pfam00288, GHMP_kinases_N, GHMP kinases N terminal domain. This
family includes homoserine kinases, galactokinases and
mevalonate kinases.
Length = 67
Score = 56.8 bits (138), Expect = 7e-11
Identities = 24/57 (42%), Positives = 33/57 (57%), Gaps = 1/57 (1%)
Query: 25 LGSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECEQFIGTQSGGMDQAISIM 81
LG+GL SS A + +AL FG+ + K+E+A+L E E IG S G D A S+
Sbjct: 10 LGAGLGSSAALAVALLLALNELFGLPLSKEELARLALEAEGAIGVNS-GDDVAASVY 65
>gnl|CDD|179937 PRK05101, PRK05101, galactokinase; Provisional.
Length = 382
Score = 55.3 bits (134), Expect = 1e-08
Identities = 65/247 (26%), Positives = 89/247 (36%), Gaps = 70/247 (28%)
Query: 25 LGSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAK 83
G+GLSSS + + + + + EIA E E QF+G G MDQ IS + K
Sbjct: 123 QGAGLSSSASLEVAVGQTFQQLYHLPLSGAEIALNGQEAENQFVGCNCGIMDQLISALGK 182
Query: 84 SGFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTAIVLAI 143
A LID + T V +P G V+ +S ++K S YN R +C A
Sbjct: 183 KDHALLIDCRSLETKAVPMPEGVAVVIINS---NVKRGLVDSEYNTRRQQCETAA----- 234
Query: 144 KLGMKPQEAISKVKTLSDVEGLCVAFACKNGSSDPVFAVKEFLRKEPYTALDIEKITEEK 203
VK L DV F DPV A
Sbjct: 235 --------RFFGVKALRDVT--LEQFNAVAAELDPVVA---------------------- 262
Query: 204 LTSIFANSSSSLDVLNAAKQYKLHQRAAHVYSEAKRVHAFKDTVSSNLSEEDKLKKLGDL 263
+RA HV +E R + S+ L+ D LK++G+L
Sbjct: 263 ------------------------KRARHVITENART---LEAASA-LAAGD-LKRMGEL 293
Query: 264 MNDSHHS 270
M +SH S
Sbjct: 294 MAESHAS 300
>gnl|CDD|179063 PRK00555, PRK00555, galactokinase; Provisional.
Length = 363
Score = 54.9 bits (132), Expect = 1e-08
Identities = 35/120 (29%), Positives = 52/120 (43%), Gaps = 1/120 (0%)
Query: 25 LGSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAK 83
+GSGLSSS A C+ A+ AA G + + E A+L E +++G +G +DQ ++
Sbjct: 102 IGSGLSSSAALECAVLGAVGAATGTRIDRLEQARLAQRAENEYVGAPTGLLDQLAALFGA 161
Query: 84 SGFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTAIVLAI 143
A LIDF + V VV + + A Y R C A L +
Sbjct: 162 PKTALLIDFRDLTVRPVAFDPDAAGVVLLLMDSRARHRHAGGEYAARRASCERAAADLGV 221
Score = 32.1 bits (73), Expect = 0.39
Identities = 18/67 (26%), Positives = 28/67 (41%), Gaps = 5/67 (7%)
Query: 204 LTSIFANSSSSLDVLNAAKQYKLHQRAAHVYSEAKRVHAFKDTVSSNLSEEDKLKKLGDL 263
++S+ A L L A +RA HV +E +RV F ++ + G L
Sbjct: 221 VSSLRAVQDRGLAALGAIADPIDARRARHVLTENQRVLDFAAALADS-----DFTAAGQL 275
Query: 264 MNDSHHS 270
+ SH S
Sbjct: 276 LTASHAS 282
>gnl|CDD|215466 PLN02865, PLN02865, galactokinase.
Length = 423
Score = 51.0 bits (122), Expect = 4e-07
Identities = 66/282 (23%), Positives = 104/282 (36%), Gaps = 72/282 (25%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
SGLSSS A + +AL A + V ++ +L E +++G ++G +DQ+ ++++
Sbjct: 143 SSGLSSSAAVGVAYLLALENANNLTVSPEDNIELDRLIENEYLGLRNGILDQSAILLSRY 202
Query: 85 GFAELIDFNPIRTTDVQLPAGGT-----FVVAHSLAESLKAITAASNYNNRVVECRLTAI 139
G +D + V L F + + + A+T YN RV EC+ A
Sbjct: 203 GCLTFMDCKTLDHKLVSLQFQQPGGEKPFKILLAFSGLRHALTNKPGYNLRVSECQEAAR 262
Query: 140 VLAIKLGMKPQEAISKVKTLSDVEGLCVAFACKNGSSDPVFAVKEFLRKEPYTALDIEKI 199
L G E + L +VE E Y A
Sbjct: 263 FLLEASGNDELEPL-----LCNVE------------------------PEVYEAH----- 288
Query: 200 TEEKLTSIFANSSSSLDVLNAAKQYKLHQRAAHVYSEAKRVHA-FKDTVSSNLSEEDKLK 258
+ KL ++ A +RA H +SE RV + S NL E
Sbjct: 289 -KCKLEAVLA------------------RRAEHYFSENMRVIKGVEAWASGNLEE----- 324
Query: 259 KLGDLMNDSHHSCSVLYECSITSSARVHEILISMVTIARKPG 300
G L++ S S YEC +++EIL + PG
Sbjct: 325 -FGKLISASGLSSIENYECGCEPLIQLYEIL------LKAPG 359
>gnl|CDD|233017 TIGR00549, mevalon_kin, mevalonate kinase. This model represents
mevalonate kinase, the third step in the mevalonate
pathway of isopentanyl pyrophosphate (IPP) biosynthesis.
IPP is a common intermediate for a number of pathways
including cholesterol biosynthesis. This model covers
enzymes from eukaryotes, archaea and bacteria. The
related enzyme from the same pathway, phosphmevalonate
kinase, serves as an outgroup for this clade. Paracoccus
exhibits two genes within the
phosphomevalonate/mevalonate kinase family, one of which
falls between trusted and noise cutoffs of this model.
The degree of divergence is high, but if the trees
created from this model are correct, the proper names of
these genes have been swapped [Central intermediary
metabolism, Other].
Length = 274
Score = 43.4 bits (103), Expect = 6e-05
Identities = 28/88 (31%), Positives = 39/88 (44%), Gaps = 4/88 (4%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECEQFIGTQSGGMDQAISIMAKSG 85
G GL SS A + AL FG E+ K+E+A+L E E+ + G+D A S G
Sbjct: 91 GRGLGSSAAVAVALIRALADYFGSELSKEELAELANEAEKIAHGKPSGIDTATSTS--GG 148
Query: 86 FAELIDFNPIRTTDVQLPAGGTFVVAHS 113
T + L G FV+A +
Sbjct: 149 PVYFEKGEGEFTKLISLD--GYFVIADT 174
>gnl|CDD|224493 COG1577, ERG12, Mevalonate kinase [Lipid metabolism].
Length = 307
Score = 43.5 bits (103), Expect = 8e-05
Identities = 24/88 (27%), Positives = 40/88 (45%), Gaps = 4/88 (4%)
Query: 25 LGSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECEQFIGTQSGGMDQAISIMAKS 84
+G+GL SS A + AL A FGVE+ +E+A+L + E + ++ G+D A
Sbjct: 94 IGAGLGSSAAVSVAVIKALSAYFGVELSPEELAKLANKVELIVQGKASGIDIATITY--G 151
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAH 112
G +++ GT V+
Sbjct: 152 GLVAFKKGFDFEK--LEIELLGTLVIGD 177
>gnl|CDD|179677 PRK03926, PRK03926, mevalonate kinase; Provisional.
Length = 302
Score = 38.1 bits (89), Expect = 0.004
Identities = 18/56 (32%), Positives = 28/56 (50%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECEQFIGTQSGGMDQAISIM 81
GSGL SS A ++ AL G+ + +EIA+L + E + + D +S M
Sbjct: 87 GSGLGSSAAVTVATIGALNRLLGLGLSLEEIAKLGHKVELLVQGAASPTDTYVSTM 142
>gnl|CDD|226917 COG4542, PduX, Protein involved in propanediol utilization, and
related proteins (includes coumermycin biosynthetic
protein), possible kinase [Secondary metabolites
biosynthesis, transport, and catabolism].
Length = 293
Score = 36.7 bits (85), Expect = 0.011
Identities = 14/39 (35%), Positives = 22/39 (56%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE 64
G G++SSTA + ++ A G E+ + EIA+L E
Sbjct: 95 GKGMASSTADLVATARATARFLGRELRESEIAKLCVSIE 133
>gnl|CDD|236995 PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase;
Provisional.
Length = 693
Score = 33.5 bits (78), Expect = 0.14
Identities = 15/25 (60%), Positives = 17/25 (68%), Gaps = 1/25 (4%)
Query: 25 LGSGLSSSTAFVCSSTVALMAAFGV 49
L S SSS A VC S++ALM A GV
Sbjct: 432 LESNGSSSMASVCGSSLALMDA-GV 455
>gnl|CDD|224599 COG1685, COG1685, Archaeal shikimate kinase [Amino acid transport
and metabolism / Coenzyme metabolism].
Length = 278
Score = 32.7 bits (75), Expect = 0.18
Identities = 14/35 (40%), Positives = 21/35 (60%)
Query: 25 LGSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQL 59
+GSGL SS+A + A++ A G E+ EI +L
Sbjct: 81 VGSGLKSSSAASNALVKAVLKALGEEIDDFEILRL 115
>gnl|CDD|206769 cd11364, RNase_PH_PNPase_2, Polyribonucleotide
nucleotidyltransferase, repeat 2. Polyribonucleotide
nucleotidyltransferase (PNPase) is a member of the
RNase_PH family, named after the bacterial Ribonuclease
PH, a 3'-5' exoribonuclease. Structurally, all members
of this family form hexameric rings. In the case of
PNPase the complex is a trimer, since each monomer
contains two tandem copies of the domain. PNPase is
involved in mRNA degradation in a 3'-5' direction and in
quality control of ribosomal RNA precursors, with the
second repeat containing the active site. PNPase is part
of the RNA degradosome complex and binds to the
scaffolding domain of the endoribonuclease RNase E.
Length = 223
Score = 31.7 bits (73), Expect = 0.37
Identities = 16/33 (48%), Positives = 20/33 (60%), Gaps = 2/33 (6%)
Query: 25 LGSGLSSSTAFVCSSTVALMAAFGVEVPKKEIA 57
L S SSS A VC ++ALM A GV + K +A
Sbjct: 110 LESNGSSSMASVCGGSLALMDA-GVPI-KAPVA 140
>gnl|CDD|234271 TIGR03591, polynuc_phos, polyribonucleotide nucleotidyltransferase.
Members of this protein family are polyribonucleotide
nucleotidyltransferase, also called polynucleotide
phosphorylase. Some members have been shown also to have
additional functions as guanosine pentaphosphate
synthetase and as poly(A) polymerase (see model
TIGR02696 for an exception clade, within this family)
[Transcription, Degradation of RNA].
Length = 684
Score = 31.7 bits (73), Expect = 0.55
Identities = 12/20 (60%), Positives = 14/20 (70%), Gaps = 1/20 (5%)
Query: 30 SSSTAFVCSSTVALMAAFGV 49
SSS A VC ++ALM A GV
Sbjct: 433 SSSMASVCGGSLALMDA-GV 451
>gnl|CDD|237379 PRK13412, fkp, bifunctional
fucokinase/L-fucose-1-P-guanylyltransferase;
Provisional.
Length = 974
Score = 31.7 bits (72), Expect = 0.56
Identities = 19/54 (35%), Positives = 27/54 (50%), Gaps = 4/54 (7%)
Query: 26 GSGLSSSTAFVCSSTV--ALMAAFGVEVPKKEIAQLTCECEQFIGTQSGGMDQA 77
GSGL +S+ + ++TV A+ G+ K EI T EQ + T G DQ
Sbjct: 738 GSGLGTSS--ILAATVLGAISDFCGLAWDKNEICNRTLVLEQLLTTGGGWQDQY 789
>gnl|CDD|234906 PRK01123, PRK01123, shikimate kinase; Provisional.
Length = 282
Score = 30.6 bits (70), Expect = 0.91
Identities = 12/35 (34%), Positives = 21/35 (60%)
Query: 25 LGSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQL 59
L SGL SS+A ++ +A + A G ++ +I +L
Sbjct: 86 LASGLKSSSAAANATVLATLDALGEDLDDLDILRL 120
>gnl|CDD|224106 COG1185, Pnp, Polyribonucleotide nucleotidyltransferase
(polynucleotide phosphorylase) [Translation, ribosomal
structure and biogenesis].
Length = 692
Score = 31.0 bits (71), Expect = 0.94
Identities = 16/34 (47%), Positives = 19/34 (55%), Gaps = 4/34 (11%)
Query: 25 LGSGLSSSTAFVCSSTVALMAAFGVEVP-KKEIA 57
L S SSS A VC ++ALM A VP K +A
Sbjct: 430 LESNGSSSMASVCGGSLALMDA---GVPIKAPVA 460
>gnl|CDD|240347 PTZ00290, PTZ00290, galactokinase; Provisional.
Length = 468
Score = 30.6 bits (69), Expect = 1.1
Identities = 22/86 (25%), Positives = 34/86 (39%), Gaps = 29/86 (33%)
Query: 25 LGSGLSSSTAFVCSSTVALMAAFGVEV---------------------PKKEIAQLTCEC 63
+G+G+S+S +F VAL+ A V K+E+ +L +
Sbjct: 145 MGAGMSASASF----GVALLNAINTVVTRRYKGCPTSPGRRYSILPPMSKEELIELAKQA 200
Query: 64 E----QFIGTQSGGMDQAISIMAKSG 85
+F G G MDQ IS A+
Sbjct: 201 RRIETEFCGVNVGIMDQFISAFAEED 226
>gnl|CDD|233637 TIGR01920, Shik_kin_archae, shikimate kinase. This model
represents the shikimate kinase (SK) gene found in
archaea which is only distantly related to homoserine
kinase (thrB) and not atr all to the bacterial SK
enzyme. The SK from M. janaschii has been overexpressed
in E. coli and characterized. SK catalyzes the fifth
step of the biosynthesis of chorismate from
D-erythrose-4-phosphate and phosphoenolpyruvate [Amino
acid biosynthesis, Aromatic amino acid family].
Length = 261
Score = 30.5 bits (69), Expect = 1.2
Identities = 15/35 (42%), Positives = 22/35 (62%)
Query: 25 LGSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQL 59
GSGL SS+A V + A++ A GVE+ +I +L
Sbjct: 75 AGSGLKSSSALVNALVEAVLKAKGVEIDDIDILRL 109
>gnl|CDD|225325 COG2605, COG2605, Predicted kinase related to galactokinase and
mevalonate kinase [General function prediction only].
Length = 333
Score = 30.4 bits (69), Expect = 1.2
Identities = 25/74 (33%), Positives = 33/74 (44%), Gaps = 12/74 (16%)
Query: 26 GSGLSSSTAFVCSSTVALMAAF----GVEVPKKEIAQLTCECEQFIGTQSGG-MDQAISI 80
GSGL SS+AFV VAL+ A G + E+A+ E E+ GG DQ
Sbjct: 101 GSGLGSSSAFV----VALLNALHAWKGESLGPYELAREAYEIEREDLKIVGGKQDQ---Y 153
Query: 81 MAKSGFAELIDFNP 94
A G ++F
Sbjct: 154 AAAFGGFNFMEFRG 167
>gnl|CDD|178678 PLN03132, PLN03132, NADH dehydrogenase (ubiquinone) flavoprotein 1;
Provisional.
Length = 461
Score = 30.2 bits (68), Expect = 1.4
Identities = 12/29 (41%), Positives = 13/29 (44%)
Query: 289 LISMVTIARKPGHTPPPTTPPPIQSKTKF 317
L + T A TP P PPP KT F
Sbjct: 2 LRAFSTQAAATAATPQPPPPPPPPEKTHF 30
>gnl|CDD|223099 COG0020, UppS, Undecaprenyl pyrophosphate synthase [Lipid
metabolism].
Length = 245
Score = 29.9 bits (68), Expect = 1.5
Identities = 18/65 (27%), Positives = 30/65 (46%), Gaps = 2/65 (3%)
Query: 144 KLGMKPQEAISKVKTLS-DVEGLCVAFACKNGS-SDPVFAVKEFLRKEPYTALDIEKITE 201
+L + +EAI K + + + GL + A G + V AV++ L E I E
Sbjct: 114 RLPEEVREAIEKAEEKTKNNTGLTLNIAVNYGGRDEIVDAVRKIAEDVAAGKLSPEDIDE 173
Query: 202 EKLTS 206
E ++S
Sbjct: 174 ELISS 178
>gnl|CDD|215248 PLN02451, PLN02451, homoserine kinase.
Length = 370
Score = 30.1 bits (68), Expect = 1.5
Identities = 15/40 (37%), Positives = 21/40 (52%)
Query: 25 LGSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE 64
LGSGL SS A ++ VA+ FG + K ++ E E
Sbjct: 145 LGSGLGSSAASAAAAAVAVNELFGSPLGKDDLVLAGLESE 184
>gnl|CDD|215363 PLN02677, PLN02677, mevalonate kinase.
Length = 387
Score = 29.8 bits (67), Expect = 1.7
Identities = 20/69 (28%), Positives = 31/69 (44%), Gaps = 14/69 (20%)
Query: 25 LGSGLSSSTAFVCSSTVALMAAFG-------------VEVPKKEIA-QLTCECEQFIGTQ 70
LGSGL SS AF + + AL+AA ++ E+ + E E+ I +
Sbjct: 140 LGSGLGSSAAFCVALSAALLAASDSISVSTGGNGWSSLDETDLELVNKWAFEGEKIIHGK 199
Query: 71 SGGMDQAIS 79
G+D +S
Sbjct: 200 PSGIDNTVS 208
>gnl|CDD|148397 pfam06767, Sif, Sif protein. This family consists of several SifA
and SifB and SseJ proteins which seem to be specific to
the Salmonella species. SifA, SifB and SseJ have been
demonstrated to localise to the Salmonella-containing
vacuole (SCV) and to Salmonella-induced filaments
(Sifs). Trafficking of SseJ and SifB away from the SCV
requires the SPI-2 effector SifA. SseJ trafficking away
from the SCV along Sifs is unnecessary for its virulence
function.
Length = 336
Score = 29.5 bits (66), Expect = 2.3
Identities = 14/40 (35%), Positives = 21/40 (52%)
Query: 273 VLYECSITSSARVHEILISMVTIARKPGHTPPPTTPPPIQ 312
+L+ +I+ S + L +M KP TPPP T PI+
Sbjct: 220 ILHHANISESTQQRAFLETMSMCGLKPLETPPPPTHIPIE 259
>gnl|CDD|100810 PRK01194, PRK01194, V-type ATP synthase subunit E; Provisional.
Length = 185
Score = 28.7 bits (64), Expect = 2.9
Identities = 29/115 (25%), Positives = 41/115 (35%), Gaps = 19/115 (16%)
Query: 195 DIEKITEEKLTSIFANSSSSLDVLNAAKQYKLHQRAAHVYSEAKRVHAFKDTVSSNLSEE 254
DIEK EEK I S ++ L K+ Q Y + R + S
Sbjct: 9 DIEKSREEKKKEINDEYSKRIEKLEKECDSKI-QSIKEYYEKKMR----AEISRLKKSII 63
Query: 255 DKLK---------KLGDLMNDSHHSCSVLYEC--SITSSARVHEILISMVTIARK 298
DK K +++ D + YE +IT S IL M+ +A K
Sbjct: 64 DKANIEARSIKREKRREILKD---YLDIAYEHLMNITKSKEYDSILNKMIEVAIK 115
>gnl|CDD|216392 pfam01255, Prenyltransf, Putative undecaprenyl diphosphate
synthase. Previously known as uncharacterized protein
family UPF0015, a single member of this family has been
identified as an undecaprenyl diphosphate synthase.
Length = 222
Score = 28.7 bits (65), Expect = 3.0
Identities = 13/42 (30%), Positives = 16/42 (38%), Gaps = 1/42 (2%)
Query: 165 LCVAFACKNGSSDPVFAVKEFLRKEPYTALDIEKITEEKLTS 206
L +A G + V AVK R L E I EE +
Sbjct: 116 LNIAL-NYGGRDEIVDAVKRLARDVADGKLSPEDIDEEVIEK 156
>gnl|CDD|216923 pfam02190, LON, ATP-dependent protease La (LON) domain. This
domain has been shown to be part of the PUA superfamily.
Length = 191
Score = 28.5 bits (64), Expect = 3.3
Identities = 17/85 (20%), Positives = 34/85 (40%), Gaps = 6/85 (7%)
Query: 180 FAVKEFLRKEPYTALDIEKITEEKLTSIF----ANSSSSLDVLNAAKQYKLHQRAAHVYS 235
F + E ++EPY ++E + EE+L + A ++ + L
Sbjct: 87 FRILELEQEEPYLVAEVEDLPEEELEELLEALEALVKELIEKIKELLPLLLPLELLLKID 146
Query: 236 EAKRVHAFKDTVSSNL--SEEDKLK 258
+ + D ++S L S E+K +
Sbjct: 147 DIEDPGRLADLIASLLPLSPEEKQE 171
>gnl|CDD|222027 pfam13288, DXPR_C, DXP reductoisomerase C-terminal domain. This is
the C-terminal domain of the
1-deoxy-D-xylulose-5-phosphate reductoisomerase enzyme.
This domain forms a left handed super-helix.
Length = 118
Score = 27.8 bits (63), Expect = 3.5
Identities = 18/60 (30%), Positives = 26/60 (43%), Gaps = 11/60 (18%)
Query: 170 ACKNGSSDPVF-------AVKEFLRKE-PYTALDIEKITEEKLTSIFANSSSSL-DVLNA 220
A + G + P AV FL + + LDI +I E+ L + SL D+L A
Sbjct: 53 ALRAGGTAPAVLNAANEVAVAAFLEGKIGF--LDIARIVEKVLDAHEPIEPPSLEDILEA 110
>gnl|CDD|234624 PRK00089, era, GTPase Era; Reviewed.
Length = 292
Score = 28.9 bits (66), Expect = 3.8
Identities = 12/29 (41%), Positives = 14/29 (48%), Gaps = 2/29 (6%)
Query: 184 EFLRKE-PY-TALDIEKITEEKLTSIFAN 210
L E PY A++IEK E L I A
Sbjct: 201 RLLGDELPYSVAVEIEKFEERGLVRIEAT 229
>gnl|CDD|162819 TIGR02354, thiF_fam2, thiamine biosynthesis protein ThiF, family 2.
Members of the HesA/MoeB/ThiF family of proteins
(pfam00899) include a number of members encoded in the
midst of thiamine biosynthetic operons. This mix of
known and putative ThiF proteins shows a deep split in
phylogenetic trees, with one the E. coli ThiF and the E.
coli MoeB proteins seemingly more closely related than
E. coli ThiF and Campylobacter (for example) ThiF. This
model represents the divergent clade of putative ThiF
proteins such found in Campylobacter [Biosynthesis of
cofactors, prosthetic groups, and carriers, Thiamine].
Length = 200
Score = 28.3 bits (63), Expect = 4.7
Identities = 23/97 (23%), Positives = 41/97 (42%), Gaps = 10/97 (10%)
Query: 180 FAVKEFLRK-EPYTALDI--EKITEEKLTSIFANSSSSLDVLNAAKQYKLHQRAAH-VYS 235
A+KE + + PYT ++ EKITEE + F ++ + + A+ + A Y
Sbjct: 77 EALKENISEINPYTEIEAYDEKITEENIDKFFKDADIVCEAFDNAEAKAMLVNAVLEKYK 136
Query: 236 EAKRVHA--FKDTVSSNLSEEDKLKK----LGDLMND 266
+ + A +N + K+ K GD +D
Sbjct: 137 DKYLIAASGLAGYDDANSIKTRKISKHFYLCGDGKSD 173
>gnl|CDD|223161 COG0083, ThrB, Homoserine kinase [Amino acid transport and
metabolism].
Length = 299
Score = 28.3 bits (64), Expect = 5.3
Identities = 15/40 (37%), Positives = 22/40 (55%)
Query: 25 LGSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE 64
LG GL SS A + ++ A G+ + K+E+ QL E E
Sbjct: 88 LGRGLGSSAASIVAALAAANELAGLPLSKEELLQLALEIE 127
>gnl|CDD|153410 cd07956, Anticodon_Ia_Arg, Anticodon-binding domain of arginyl tRNA
synthetases. This domain is found in arginyl tRNA
synthetases (ArgRS), which belong to the class Ia
aminoacyl tRNA synthetases. It lies C-terminal to the
catalytic core domain, and recognizes and specifically
binds to the tRNA anticodon. ArgRS catalyzes the
transfer of arginine to the 3'-end of its tRNA.
Length = 156
Score = 27.2 bits (61), Expect = 8.7
Identities = 11/43 (25%), Positives = 20/43 (46%), Gaps = 1/43 (2%)
Query: 217 VLNAAKQYKLHQRAAHVYSEAKRVHAFKDTVSSNLSEEDKLKK 259
V NAA+ + H A +++ A F + L E++L+
Sbjct: 87 VKNAAETLEPHTIATYLFDLAHAFSKFYNACPV-LGAEEELRN 128
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.317 0.130 0.364
Gapped
Lambda K H
0.267 0.0775 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,415,142
Number of extensions: 1421243
Number of successful extensions: 1542
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1526
Number of HSP's successfully gapped: 54
Length of query: 318
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 221
Effective length of database: 6,635,264
Effective search space: 1466393344
Effective search space used: 1466393344
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (26.4 bits)