RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= 002206
(953 letters)
>gnl|CDD|235163 PRK03817, PRK03817, galactokinase; Provisional.
Length = 351
Score = 108 bits (273), Expect = 1e-25
Identities = 69/243 (28%), Positives = 105/243 (43%), Gaps = 49/243 (20%)
Query: 501 RAPGRLDVMGGIADYSGSLVLQMPIREACHVALQKISPSKQRLWKHALARHNDKGQGPMP 560
++PGR++++G DY+ VL I + L+ I S++
Sbjct: 4 KSPGRVNLIGEHTDYNDGYVLPFAINL--YTFLE-IEKSEK------------------- 41
Query: 561 VLQIVSYGSELSNRGPTFDMDLSDFMDEGKPMSYEKAKKYFDTNPSQKWAAYVAGTILVL 620
+ SE N TF++D EK WA Y+ G I VL
Sbjct: 42 ----FIFYSENFNEEKTFELD-----------KLEK---------LNSWADYIKGVIWVL 77
Query: 621 MTELGVRFEDSISMLVSSAVPEGKGVSSSASVEVASMSAIAAAHGLNIHPRDLALLCQKV 680
G + VSS +P G G+SSSAS+EVA A+ A+ LN+ +LALL ++
Sbjct: 78 EKR-GYEVG-GVKGKVSSNLPIGAGLSSSASLEVAVAYALNEAYNLNLSKLELALLAREA 135
Query: 681 ENHIVGAPCGVMDQMASACGEANKLLAMVCQPAELLGVVEIPSHIRFWGIDSGIRHSVGG 740
EN VG PCG+MDQ A A G+ + + + E V P D+G++ +
Sbjct: 136 ENEFVGVPCGIMDQFAVAFGKKDHAIFLDTMTLE-YEYVPFPEDYEILVFDTGVKRELAS 194
Query: 741 ADY 743
++Y
Sbjct: 195 SEY 197
Score = 33.8 bits (78), Expect = 0.45
Identities = 23/96 (23%), Positives = 40/96 (41%), Gaps = 24/96 (25%)
Query: 850 HPIYENFRV-KAFKALLTAAASDDQLTSLGELLYQCHYS----YSACGLGSDGTDRLVQL 904
+ + EN RV K AL + + +LGELL + H+ Y + + D V+
Sbjct: 237 YVLRENERVLKVRDAL-----KEGDIETLGELLTESHWDLADNY---EVSCEELDFFVE- 287
Query: 905 VQEIQHSKVSKSKDGTLFGAKITGGGSGGTICVIGR 940
+ + +GA++TG G GG+ +
Sbjct: 288 ----------FALELGAYGARLTGAGFGGSAIALVD 313
>gnl|CDD|223231 COG0153, GalK, Galactokinase [Carbohydrate transport and
metabolism].
Length = 390
Score = 107 bits (268), Expect = 1e-24
Identities = 70/260 (26%), Positives = 108/260 (41%), Gaps = 44/260 (16%)
Query: 479 RQMRERKAA--AGLFNWEEEIFVARAPGRLDVMGGIADYSGSLVLQMPIREACHVALQKI 536
++E+ A A F + E A APGR++++G DY+G VL I +VA+ K
Sbjct: 3 SSLKEKLQALFAEHFGYVEPTVTAFAPGRVNLIGEHTDYNGGFVLPCAINYGTYVAVAKR 62
Query: 537 SPSKQRLWKHALARHNDKGQGPMPVLQIVSYGSELSNRGPTFDMDLSDFMDEGKPMSYEK 596
K RL Y + N G F + +
Sbjct: 63 DDGKVRL-----------------------YSANFGNAGDIFFLL-----LDIAKE---- 90
Query: 597 AKKYFDTNPSQKWAAYVAGTILVLMTELGVRFEDSISMLVSSAVPEGKGVSSSASVEVAS 656
WA YV G I L + G F + +++S +P G G+SSSA++EVA
Sbjct: 91 --------KIDDWANYVKGVIKALQ-KRGYAF-TGLDIVISGNIPIGAGLSSSAALEVAV 140
Query: 657 MSAIAAAHGLNIHPRDLALLCQKVENHIVGAPCGVMDQMASACGEANKLLAMVCQPAELL 716
A+ L + +LA + Q EN VG CG+MDQ+ASA G+ + L + C+ E
Sbjct: 141 ALALQRLFNLPLDKAELAKIAQVAENQFVGVNCGIMDQLASAFGKKDHALLLDCRTLEYE 200
Query: 717 GVVEIPSHIRFWGIDSGIRH 736
V + ++S ++
Sbjct: 201 PVPFPVGGVSIVIVNSNVKR 220
Score = 33.4 bits (77), Expect = 0.48
Identities = 31/94 (32%), Positives = 40/94 (42%), Gaps = 25/94 (26%)
Query: 850 HPIYENFRV-KAFKALLTAAASDDQLTSLGELLYQCHYS----YSACGLGSDGTDRLVQL 904
H + EN RV +A KAL LT GEL+ + H S Y D LV++
Sbjct: 275 HVVTENQRVLEAAKAL-----RSGDLTEFGELMNESHESLRDDYEVTCPE---LDTLVEI 326
Query: 905 VQEIQHSKVSKSKDGTLFGAKITGGGSGGTICVI 938
G +GA++TG G GG CVI
Sbjct: 327 ALA----------AGGAYGARMTGAGFGG--CVI 348
>gnl|CDD|179937 PRK05101, PRK05101, galactokinase; Provisional.
Length = 382
Score = 105 bits (265), Expect = 3e-24
Identities = 70/270 (25%), Positives = 120/270 (44%), Gaps = 49/270 (18%)
Query: 491 FNWEEEIFVARAPGRLDVMGGIADYSGSLVLQMPIREACHVALQKISPSKQRLWKHALAR 550
F + + +APGR++++G DY+ V +P C + Q + + A+
Sbjct: 15 FGYPPTHTI-QAPGRVNLIGEHTDYNDGFV--LP----CAIDYQTVI---------SCAK 58
Query: 551 HNDKGQGPMPVLQIVSYGSELSNRGPTFDMDLSDFMDEGKPMSYEKAKKYFDTNPSQKWA 610
+D+ + ++ N+ F +D P+ +P Q+WA
Sbjct: 59 RDDR--------IVRVIAADYDNQQDEFSLD--------APIVP---------HPEQQWA 93
Query: 611 AYVAGTILVLMTELGVRFEDSISMLVSSAVPEGKGVSSSASVEVASMSAIAAAHGLNIHP 670
YV G + L E F +++S VP+G G+SSSAS+EVA + L +
Sbjct: 94 NYVRGVVKHLQ-ERNPDFG-GADLVISGNVPQGAGLSSSASLEVAVGQTFQQLYHLPLSG 151
Query: 671 RDLALLCQKVENHIVGAPCGVMDQMASACGEANKLLAMVCQPAELLGVVEIPSHIRFWGI 730
++AL Q+ EN VG CG+MDQ+ SA G+ + L + C+ E V +P + I
Sbjct: 152 AEIALNGQEAENQFVGCNCGIMDQLISALGKKDHALLIDCRSLETKAVP-MPEGVAVVII 210
Query: 731 DSGIRHSVGGADYGSVR-----AGAFMGRK 755
+S ++ + ++Y + R A F G K
Sbjct: 211 NSNVKRGLVDSEYNTRRQQCETAARFFGVK 240
Score = 36.4 bits (85), Expect = 0.064
Identities = 31/96 (32%), Positives = 43/96 (44%), Gaps = 28/96 (29%)
Query: 850 HPIYENFRVK-AFKALLTAAASDDQLTSLGELLYQCHYSYSACGLGSD------GTDRLV 902
H I EN R A AL AA D L +GEL+ + H S + D D LV
Sbjct: 267 HVITENARTLEAASAL---AAGD--LKRMGELMAESHAS-----MRDDFEITVPQIDTLV 316
Query: 903 QLVQEIQHSKVSKSKDGTLFGAKITGGGSGGTICVI 938
++V+ + G G ++TGGG GG C++
Sbjct: 317 EIVKAVI------GDQG---GVRMTGGGFGG--CIV 341
>gnl|CDD|179063 PRK00555, PRK00555, galactokinase; Provisional.
Length = 363
Score = 90.7 bits (225), Expect = 2e-19
Identities = 54/154 (35%), Positives = 77/154 (50%), Gaps = 12/154 (7%)
Query: 602 DTNPSQ--KWAAYVAGTILVLMTELGVRFEDSISMLVSSAVPEGKGVSSSASVEVASMSA 659
DT P Q WAAY AG I L G +M ++S V G G+SSSA++E A + A
Sbjct: 62 DTTPGQVTGWAAYAAGVIWALRGA-GHPVPGG-AMSITSDVEIGSGLSSSAALECAVLGA 119
Query: 660 IAAAHGLNIHPRDLALLCQKVENHIVGAPCGVMDQMASACGEANKLLAMVCQPAELLGVV 719
+ AA G I + A L Q+ EN VGAP G++DQ+A+ G A++ + L V
Sbjct: 120 VGAATGTRIDRLEQARLAQRAENEYVGAPTGLLDQLAALFGAPKT--ALLIDFRD-LTVR 176
Query: 720 EIP-----SHIRFWGIDSGIRHSVGGADYGSVRA 748
+ + + +DS RH G +Y + RA
Sbjct: 177 PVAFDPDAAGVVLLLMDSRARHRHAGGEYAARRA 210
Score = 31.4 bits (71), Expect = 2.4
Identities = 28/89 (31%), Positives = 38/89 (42%), Gaps = 14/89 (15%)
Query: 850 HPIYENFRVKAFKALLTAAASDDQLTSLGELLYQCHYSYSACGLGSDGTDRLVQLVQEIQ 909
H + EN RV F AA +D T+ G+LL H S D + I
Sbjct: 249 HVLTENQRVLDF----AAALADSDFTAAGQLLTASHASMR---------DDFEITTERID 295
Query: 910 HSKVSKSKDGTLFGAKITGGGSGGTICVI 938
S + G L GA++TGGG GG + +
Sbjct: 296 LIADSAVRAGAL-GARMTGGGFGGCVIAL 323
>gnl|CDD|232841 TIGR00131, gal_kin, galactokinase. Galactokinase is a member of
the GHMP kinases (Galactokinase, Homoserine kinase,
Mevalonate kinase, Phosphomevalonate kinase) and shares
with them an amino-terminal domain probably related to
ATP binding.The galactokinases found by This model are
divided into two sets. Prokaryotic forms are generally
shorter. The eukaryotic forms are longer because of
additional central regions and in some cases are known
to be bifunctional, with regulatory activities that are
independent of galactokinase activity [Energy
metabolism, Sugars].
Length = 386
Score = 85.3 bits (211), Expect = 2e-17
Identities = 58/232 (25%), Positives = 88/232 (37%), Gaps = 46/232 (19%)
Query: 484 RKAAAGLFNW---EEEIFVARAPGRLDVMGGIADYSGSLVLQMPIREACHVALQKISPSK 540
++ +F + F ARAPGR++++G DY+ VL C + +
Sbjct: 1 CESIQKIFASAFGAKPDFTARAPGRVNLIGEHTDYNDGSVL------PCAIDFGTLCAVA 54
Query: 541 QRLWKHALARHNDKGQGPMPVLQIVS-YGSELSNRGPTFDMDLSDFMDEGKPMSYEKAKK 599
R +DK V Y + N+ +DL
Sbjct: 55 VR---------DDK---------NVRIYLANADNKFAERSLDL------------PLDGS 84
Query: 600 YFDTNPSQKWAAYVAGTILVLMTELGVRFEDSISMLVSSAVPEGKGVSSSASVEVASMSA 659
WA Y G + V E F ++ S VP G G+SSSA+ E A +
Sbjct: 85 EVS-----DWANYFKGVLHVAQ-ERFNSFPLGADIVCSGNVPTGSGLSSSAAFECAVGAV 138
Query: 660 IAAAHGLNIHPRDLALLCQKVENHIVGAPCGVMDQMASACGEANKLLAMVCQ 711
+ L + + + L Q ENH VG CG+MDQ AS G+ + L + C+
Sbjct: 139 LQNMGHLPLDSKQILLRIQVAENHFVGVNCGIMDQAASVLGKEDHALLVECR 190
Score = 33.6 bits (77), Expect = 0.52
Identities = 26/96 (27%), Positives = 36/96 (37%), Gaps = 29/96 (30%)
Query: 850 HPIYENFRV-KAFKALLTAAASDDQLTSLGELLYQCHYSYSACGLGSD------GTDRLV 902
H + EN RV KA KA+ D+ G L+ + H S D D LV
Sbjct: 272 HVVSENLRVLKAVKAM-----KDNDFKQFGALMNESHAS-----CDDDYECTCPEIDELV 321
Query: 903 QLVQEIQHSKVSKSKDGTLFGAKITGGGSGGTICVI 938
+ S G+++TG G GG C +
Sbjct: 322 CSAALVNGS----------GGSRMTGAGFGG--CTV 345
>gnl|CDD|215285 PLN02521, PLN02521, galactokinase.
Length = 497
Score = 82.4 bits (204), Expect = 3e-16
Identities = 107/483 (22%), Positives = 183/483 (37%), Gaps = 103/483 (21%)
Query: 498 FVARAPGRLDVMGGIADYSGSLVLQMPIREACHVALQKISPSKQRLWKHALARHNDKGQG 557
AR+PGR++++G DY G VL M IR+ VA+++ SK+
Sbjct: 49 LFARSPGRVNLIGEHIDYEGYSVLPMAIRQDTIVAIRRAEGSKK---------------- 92
Query: 558 PMPVLQIVSYGSELSNRGPTFDMDLSDFMDEGKPMSYEKAKKYFDTNPSQ-------KWA 610
L+I +N + F +P Q KW
Sbjct: 93 ----LRI-------ANVNDKYTT------------------CTFPADPDQEVDLANHKWG 123
Query: 611 AYVAGTILVLMTELGVRFEDS-----ISMLVSSAVPEGKGVSSSASVEVASMSAIAAAHG 665
Y + L + D + ++V VP G G+SSSA++ ++ AI AA G
Sbjct: 124 NYFICGYKGVFEFLKSKGVDVGPPVGLDVVVDGTVPTGSGLSSSAALVCSAAIAIMAALG 183
Query: 666 LNIHPRDLALLCQKVENHIVGAPCGVMDQMASACGEANKLLAMVCQPAELLGV------- 718
LN +++A K E HI G G MDQ S + A+L+
Sbjct: 184 LNFTKKEVAQFTCKCERHI-GTQSGGMDQAISIMAQQGV--------AKLIDFNPVRATD 234
Query: 719 VEIPSHIRFWGIDSGIRHSVGGADYGSVRAGAFMGRKMIKSTAS-------GMLPQ-SLP 770
V++P+ F I +S+ ++ A + R + A+ GM + ++
Sbjct: 235 VQLPAGGTFV-----IANSLAESNKAVTAATNYNNRVVECRLAAIVLAVKLGMSAEEAIS 289
Query: 771 SSNGLNNIEPEVDGVELLEAEASLDYLCNLSPHRFEALYAKNIPESIVGEEFSKNYGDHN 830
L+++E V + S D + E Y E I+GE + + +
Sbjct: 290 KVKTLSDVEGLC--VSFAGSHGSSDPAVAVKELLHEGPYTAEEIEEILGESLTSIFKNSP 347
Query: 831 DPVTVIDPKRTYFVRAPVCHPIYENFRVKAFKALLTAAASD-DQLTSLGELLYQCHYSYS 889
+ V+ + + + H E RV AF+ ++++ S+ ++L LG+L+ + HYS S
Sbjct: 348 TSLAVLKAAKHFKLHQRAVHVYSEAKRVHAFRDTVSSSLSEEEKLKKLGDLMNESHYSCS 407
Query: 890 AC-GLGSDGTDRLVQLVQEIQHSKVSKSKDGTLFGAKITGGGSGGTICVIGRNSLRSSEQ 948
+ LV++ +D GA++TG G GG C + Q
Sbjct: 408 VLYECSCPELEELVKV-----------CRDNGALGARLTGAGWGG--CAVALVKEAIVPQ 454
Query: 949 VLE 951
+
Sbjct: 455 FIL 457
>gnl|CDD|235407 PRK05322, PRK05322, galactokinase; Provisional.
Length = 387
Score = 79.9 bits (198), Expect = 9e-16
Identities = 53/216 (24%), Positives = 87/216 (40%), Gaps = 53/216 (24%)
Query: 494 EEEIFVARAPGRLDVMGGIADYSGSLVLQMPIREACHVALQKISPSKQRLWKHALARHND 553
E++F + PGR++++G DY+G V P+ L + AR D
Sbjct: 18 AEDVFFS--PGRINLIGEHTDYNGGHVF----------------PAAITLGTYGAARKRD 59
Query: 554 KGQ-----GPMPVLQIVSYGSELSNRGPTFDMDLSDFMDEGKPMSYEKAKKYFDTNPSQK 608
+ L I+ + DL D +S++K
Sbjct: 60 DKKVRLYSANFEDLGIIEF-------------DLDD-------LSFDKEDD--------- 90
Query: 609 WAAYVAGTILVLMTELGVRFEDSISMLVSSAVPEGKGVSSSASVEVASMSAIAAAHGLNI 668
WA Y G + L E G + + +L+ +P G G+SSSAS+E+ + + L++
Sbjct: 91 WANYPKGVLKFLQ-EAGYKIDHGFDILIYGNIPNGAGLSSSASIELLTGVILKDLFNLDL 149
Query: 669 HPRDLALLCQKVENHIVGAPCGVMDQMASACGEANK 704
+L L QK EN +G G+MDQ A G+ +
Sbjct: 150 DRLELVKLGQKTENEFIGVNSGIMDQFAIGMGKKDH 185
Score = 46.0 bits (110), Expect = 7e-05
Identities = 42/161 (26%), Positives = 58/161 (36%), Gaps = 52/161 (32%)
Query: 783 DGVELLEAEASLDYLCNLSPHRFEALYAKNIPESIVGEEFSKNYGDHNDPVTVIDPKRTY 842
+E L+ + + L L+ F+ Y+ I D + KR
Sbjct: 233 KALEELQKKLDIKSLGELTEEEFDE-YSYLI----------------KDETLL---KR-- 270
Query: 843 FVRAPVCHPIYENFRV-KAFKALLTAAASDDQLTSLGELLYQCHYS----YSACGLGSDG 897
R H + EN R KA KAL L G L+ H S Y GL
Sbjct: 271 -AR----HAVTENQRTLKAVKAL-----KAGDLEKFGRLMNASHVSLRDDYEVTGLE--- 317
Query: 898 TDRLVQLVQEIQHSKVSKSKDGTLFGAKITGGGSGGTICVI 938
D LV+ + ++G L GA++TG G GG C I
Sbjct: 318 LDTLVEAAWK---------QEGVL-GARMTGAGFGG--CAI 346
>gnl|CDD|215839 pfam00288, GHMP_kinases_N, GHMP kinases N terminal domain. This
family includes homoserine kinases, galactokinases and
mevalonate kinases.
Length = 67
Score = 60.6 bits (148), Expect = 1e-11
Identities = 23/69 (33%), Positives = 34/69 (49%), Gaps = 2/69 (2%)
Query: 633 SMLVSSAVPEGKGVSSSASVEVASMSAIAAAHGLNIHPRDLALLCQKVENHIVGAPCGVM 692
+ + S +P G G+ SSA++ VA + A+ GL + +LA L + E I G G
Sbjct: 1 DIEIESNIPLGAGLGSSAALAVALLLALNELFGLPLSKEELARLALEAEGAI-GVNSG-D 58
Query: 693 DQMASACGE 701
D AS G
Sbjct: 59 DVAASVYGG 67
>gnl|CDD|240347 PTZ00290, PTZ00290, galactokinase; Provisional.
Length = 468
Score = 66.4 bits (162), Expect = 4e-11
Identities = 68/295 (23%), Positives = 110/295 (37%), Gaps = 75/295 (25%)
Query: 495 EEIFVARAPGRLDVMGGIADYSGSLVLQMPIREACHVALQKISPSKQRLWKHAL--ARHN 552
E + APGR++ +G DY G V + E CH+ + ++ + H L A
Sbjct: 35 EWLLFTFAPGRVNFIGEHVDYMGGYVCPAAVLEGCHILVGRV----KHFCDHKLRFATET 90
Query: 553 DKGQGPMPVLQIVSYGSELSNRGPTFDMDLSDFMDEGKPMSYEKAKKYFDTNPSQKWAAY 612
D+ F +D + K + KA W +
Sbjct: 91 DE----------------------HFVLD---HLGGAK---HNKA-----------WTTF 111
Query: 613 VAGTILVLMTELGVRFEDS----ISMLVSSAVPEGKGVSSSASVEVASMSAIAAAHGLN- 667
V G + + LGV + + M+V +P G G+S+SAS VA ++AI
Sbjct: 112 VRGAATLRLNRLGVAIDAPSLQGVCMVVHGTLPMGAGMSASASFGVALLNAINTVVTRRY 171
Query: 668 ------------IHPR-------DLALLCQKVENHIVGAPCGVMDQMASACGEANKLLAM 708
I P +LA +++E G G+MDQ SA E +K + +
Sbjct: 172 KGCPTSPGRRYSILPPMSKEELIELAKQARRIETEFCGVNVGIMDQFISAFAEEDKFMFL 231
Query: 709 VCQPA--ELLGVVEIPSHIRFWG-IDSGIRHSVGGAD---YGSVRAGAFMGRKMI 757
C+ E + + + IDS I+H + G Y +VR+ +K I
Sbjct: 232 DCKSLTFESHDMTPLLGDGACFLLIDSMIKHDLLGGTAGMYNTVRSDQEGAQKKI 286
>gnl|CDD|222200 pfam13528, Glyco_trans_1_3, Glycosyl transferase family 1.
Length = 317
Score = 56.9 bits (138), Expect = 2e-08
Identities = 41/191 (21%), Positives = 76/191 (39%), Gaps = 19/191 (9%)
Query: 20 YYVTGHGFGHATRVVEVVRNLISAGHDVHVVTG--APDFVFTSEIQSPRLFIRKVLLDCG 77
Y V G G GH +R + L GH+V V+ A + + SP R++ G
Sbjct: 5 YGVQGTGNGHISRARVLAEAL--RGHEVDVLFSGRAEELL------SPEFPYRRLP---G 53
Query: 78 -AVQADALTVDRLASLEKYSETAVAPRKSILKDEVEWLNSIKADLVVSDVVPVACRAAAD 136
+ VD ++ K + + +++D V+ L+ + DLV++D P++ AA
Sbjct: 54 LTFVTENGRVDYGKTILKNLQG-LIRLPRLVRDLVKLLDLFRPDLVITDFEPISAWAAKR 112
Query: 137 AGIRSVCVTNFSWDFIYAEY-VMAAGHHHRSIVWQIAEDYSHCEFLIRLP-GYCPMPAFR 194
G+ V + + F+ Y + + ++ + + L P F
Sbjct: 113 QGVPCVGIDHQY-AFLACNYDIPKGLKFDALFEKYVLRAFAPAQRRLGLHFYPPLKPHFE 171
Query: 195 DVIDVPLVVRR 205
I VP ++R
Sbjct: 172 QPI-VPPILRP 181
>gnl|CDD|215466 PLN02865, PLN02865, galactokinase.
Length = 423
Score = 53.7 bits (129), Expect = 3e-07
Identities = 60/241 (24%), Positives = 95/241 (39%), Gaps = 40/241 (16%)
Query: 477 EKRQMRERKAAAGLFNWEEEIFVARAPGRLDVMGGIADYSGSLVLQMPIREACHVALQKI 536
E ++RER AA N E+ V +P R+ +G D+ G V M I
Sbjct: 11 ELDEIRERVAAMSGRN-SGEVRVVVSPYRICPLGAHIDHQGGTVSAMTI----------- 58
Query: 537 SPSKQRLWKHALARHNDKGQGPMPVLQ--IVSYGSELSNRGPTFDMDLSDFMDEGKPMSY 594
+KG +L + S E+ R F+ ++ +DE + +
Sbjct: 59 ----------------NKG-----ILLGFVPSGDPEVLLRSAQFEGEVRFRVDE---IQH 94
Query: 595 EKAKKYFDTNPSQKWAAYVAGTILVLMTELGVRFEDSISMLVS-SAVPEGKGVSSSASVE 653
A D+ W Y G + L G I+ +S S + G+SSSA+V
Sbjct: 95 PIANVSSDSKEESNWGDYARGAVYALQ-SRGHALSQGITGYISGSEGLDSSGLSSSAAVG 153
Query: 654 VASMSAIAAAHGLNIHPRDLALLCQKVENHIVGAPCGVMDQMASACGEANKLLAMVCQPA 713
VA + A+ A+ L + P D L + +EN +G G++DQ A L M C+
Sbjct: 154 VAYLLALENANNLTVSPEDNIELDRLIENEYLGLRNGILDQSAILLSRYGCLTFMDCKTL 213
Query: 714 E 714
+
Sbjct: 214 D 214
>gnl|CDD|233017 TIGR00549, mevalon_kin, mevalonate kinase. This model represents
mevalonate kinase, the third step in the mevalonate
pathway of isopentanyl pyrophosphate (IPP) biosynthesis.
IPP is a common intermediate for a number of pathways
including cholesterol biosynthesis. This model covers
enzymes from eukaryotes, archaea and bacteria. The
related enzyme from the same pathway, phosphmevalonate
kinase, serves as an outgroup for this clade. Paracoccus
exhibits two genes within the
phosphomevalonate/mevalonate kinase family, one of which
falls between trusted and noise cutoffs of this model.
The degree of divergence is high, but if the trees
created from this model are correct, the proper names of
these genes have been swapped [Central intermediary
metabolism, Other].
Length = 274
Score = 47.3 bits (113), Expect = 2e-05
Identities = 23/84 (27%), Positives = 39/84 (46%), Gaps = 1/84 (1%)
Query: 617 ILVLMTELGVRFEDSISMLVSSAVPEGKGVSSSASVEVASMSAIAAAHGLNIHPRDLALL 676
+ ++ + + + S +P G+G+ SSA+V VA + A+A G + +LA L
Sbjct: 65 VAEALSYFSELNPPPLEITIDSEIPPGRGLGSSAAVAVALIRALADYFGSELSKEELAEL 124
Query: 677 CQKVENHIVGAPCGVMDQMASACG 700
+ E G P G +D S G
Sbjct: 125 ANEAEKIAHGKPSG-IDTATSTSG 147
Score = 41.9 bits (99), Expect = 0.001
Identities = 24/80 (30%), Positives = 34/80 (42%), Gaps = 17/80 (21%)
Query: 864 LLTAAAS----DDQLTSLGELLYQCHYSYSACGLGSDGTDRLVQLVQEIQHSKVSKSKDG 919
LT A D + SLGEL+ A G+ D+LV++ ++
Sbjct: 207 ELTLEAKAALEDGDVESLGELMNINQGLLKALGVSHPKLDQLVEI-----------ARKA 255
Query: 920 TLFGAKITGGGSGGTICVIG 939
GAK+TG G GG C+I
Sbjct: 256 GALGAKLTGAGGGG--CMIA 273
>gnl|CDD|225325 COG2605, COG2605, Predicted kinase related to galactokinase and
mevalonate kinase [General function prediction only].
Length = 333
Score = 47.4 bits (113), Expect = 2e-05
Identities = 22/69 (31%), Positives = 36/69 (52%)
Query: 632 ISMLVSSAVPEGKGVSSSASVEVASMSAIAAAHGLNIHPRDLALLCQKVENHIVGAPCGV 691
I + S P G G+ SS++ VA ++A+ A G ++ P +LA ++E + G
Sbjct: 90 IELHTQSDAPPGSGLGSSSAFVVALLNALHAWKGESLGPYELAREAYEIEREDLKIVGGK 149
Query: 692 MDQMASACG 700
DQ A+A G
Sbjct: 150 QDQYAAAFG 158
>gnl|CDD|224493 COG1577, ERG12, Mevalonate kinase [Lipid metabolism].
Length = 307
Score = 47.3 bits (113), Expect = 2e-05
Identities = 25/81 (30%), Positives = 41/81 (50%)
Query: 611 AYVAGTILVLMTELGVRFEDSISMLVSSAVPEGKGVSSSASVEVASMSAIAAAHGLNIHP 670
Y+ + + L S+ + S +P G G+ SSA+V VA + A++A G+ + P
Sbjct: 63 GYIQAAVRLASELLNQSSLKPFSLEIDSEIPIGAGLGSSAAVSVAVIKALSAYFGVELSP 122
Query: 671 RDLALLCQKVENHIVGAPCGV 691
+LA L KVE + G G+
Sbjct: 123 EELAKLANKVELIVQGKASGI 143
Score = 36.1 bits (84), Expect = 0.064
Identities = 21/73 (28%), Positives = 28/73 (38%), Gaps = 13/73 (17%)
Query: 867 AAASDDQLTSLGELLYQCHYSYSACGLGSDGTDRLVQLVQEIQHSKVSKSKDGTLFGAKI 926
AA LGEL+ A G+ + D LV ++ GAK+
Sbjct: 218 AALQTGDFEELGELMNINQGLLKALGVSTPELDELV-----------EAARSLGALGAKL 266
Query: 927 TGGGSGGTICVIG 939
TG G GG C+I
Sbjct: 267 TGAGGGG--CIIA 277
>gnl|CDD|226917 COG4542, PduX, Protein involved in propanediol utilization, and
related proteins (includes coumermycin biosynthetic
protein), possible kinase [Secondary metabolites
biosynthesis, transport, and catabolism].
Length = 293
Score = 43.6 bits (103), Expect = 3e-04
Identities = 28/95 (29%), Positives = 45/95 (47%), Gaps = 9/95 (9%)
Query: 632 ISMLVSSAVPEGKGVSSSASVEVASMSAIAAAHGLNIHPRDLALLCQKVE--NHIVGAPC 689
I +L+ S++P GKG++SS + VA+ A A G + ++A LC +E + I+
Sbjct: 84 IDLLLQSSIPVGKGMASSTADLVATARATARFLGRELRESEIAKLCVSIEPTDSIIFDKA 143
Query: 690 GVMDQMA----SACGEANKLLAMVCQPAELLGVVE 720
+ DQ GE L +V + G VE
Sbjct: 144 TLFDQREGRVIEFLGEMPPLHILVFEG---KGTVE 175
>gnl|CDD|179677 PRK03926, PRK03926, mevalonate kinase; Provisional.
Length = 302
Score = 43.5 bits (103), Expect = 4e-04
Identities = 24/71 (33%), Positives = 39/71 (54%), Gaps = 1/71 (1%)
Query: 630 DSISMLVSSAVPEGKGVSSSASVEVASMSAIAAAHGLNIHPRDLALLCQKVENHIVGAPC 689
D +++ ++S +P G G+ SSA+V VA++ A+ GL + ++A L KVE + GA
Sbjct: 74 DGVTVSITSQIPVGSGLGSSAAVTVATIGALNRLLGLGLSLEEIAKLGHKVELLVQGAAS 133
Query: 690 GVMDQMASACG 700
D S G
Sbjct: 134 PT-DTYVSTMG 143
>gnl|CDD|99974 cd03801, GT1_YqgM_like, This family is most closely related to the
GT1 family of glycosyltransferases and named after YqgM
in Bacillus licheniformis about which little is known.
Glycosyltransferases catalyze the transfer of sugar
moieties from activated donor molecules to specific
acceptor molecules, forming glycosidic bonds. The
acceptor molecule can be a lipid, a protein, a
heterocyclic compound, or another carbohydrate residue.
This group of glycosyltransferases is most closely
related to the previously defined glycosyltransferase
family 1 (GT1). The members of this family may transfer
UDP, ADP, GDP, or CMP linked sugars. The diverse
enzymatic activities among members of this family
reflect a wide range of biological functions. The
protein structure available for this family has the GTB
topology, one of the two protein topologies observed for
nucleotide-sugar-dependent glycosyltransferases. GTB
proteins have distinct N- and C- terminal domains each
containing a typical Rossmann fold. The two domains have
high structural homology despite minimal sequence
homology. The large cleft that separates the two domains
includes the catalytic center and permits a high degree
of flexibility. The members of this family are found
mainly in certain bacteria and archaea.
Length = 374
Score = 42.4 bits (100), Expect = 8e-04
Identities = 59/336 (17%), Positives = 100/336 (29%), Gaps = 79/336 (23%)
Query: 14 KHLVFAYYVTGHGFGHATRVVEVVRNLISAGHDVHVVTGAPDFVFTSEIQSPRLFIRKVL 73
K L+ G V+E+ R L + GH+V V+T + E + +R
Sbjct: 1 KILLVTPEYPPSVGGAERHVLELARALAARGHEVTVLTPGDGGLPDEEEVGGIVVVRPPP 60
Query: 74 LDCGAVQADALTVDRLASLEKYSETAVAPRKSILKDEVEWLNSIKADLVVS---DVVPVA 130
L L + L + D+V + + A
Sbjct: 61 LLRVRRLLLLLLLALRLRR--------------------LLRRERFDVVHAHDWLALLAA 100
Query: 131 CRAAADAGIRSVCVTNFSWDFIYAEYVMAAGHHHRSIVWQIAEDYSHC--------EFLI 182
AA GI V + + R++ + E L
Sbjct: 101 ALAARLLGIPLVLTVHGLEFGRPGNELGLLLKLARALERRALRRADRIIAVSEATREELR 160
Query: 183 RLPGYCPMPAFRDVIDVPLVVR--RLHKSRKEVRKELGIEDDVKLLILNFGGQPAGWK-- 238
L G + +P V R + + R+ LGI +D +++ F G+ K
Sbjct: 161 ELGG----VPPEKITVIPNGVDTERFRPAPRAARRRLGIPEDEPVIL--FVGRLVPRKGV 214
Query: 239 ---------LKEEYLPSGWKCLVCG-----------ASDSQLPPNFI---KLPKDAYTPD 275
L++EY + ++ G A++ L +P + P
Sbjct: 215 DLLLEALAKLRKEY--PDVRLVIVGDGPLREELEALAAELGLGDRVTFLGFVPDED-LPA 271
Query: 276 FMAASDCM--------LGKIGYGTVSEALAYKLPFV 303
AA+D G + EA+A LP V
Sbjct: 272 LYAAADVFVLPSLYEGFG----LVLLEAMAAGLPVV 303
>gnl|CDD|204502 pfam10509, GalKase_gal_bdg, Galactokinase galactose-binding
signature. This is the highly conserved galactokinase
signature sequence which appears to be present in all
galactokinases irrespective of how many other ATP
binding sites, etc that they carry. The function of this
domain appears to be to bind galactose, and the domain
is normally at the N-terminus of the enzymes,
EC:2.7.1.6. This domain is associated with the families
GHMP_kinases_C, pfam08544 and GHMP_kinases_N, pfam00288.
Length = 52
Score = 35.9 bits (84), Expect = 0.003
Identities = 17/42 (40%), Positives = 24/42 (57%)
Query: 494 EEEIFVARAPGRLDVMGGIADYSGSLVLQMPIREACHVALQK 535
E VA APGR++++G DY+G VL I +VA+ K
Sbjct: 10 VEPEGVASAPGRVNLIGEHTDYNGGFVLPAAINLDTYVAVSK 51
>gnl|CDD|223779 COG0707, MurG, UDP-N-acetylglucosamine:LPS N-acetylglucosamine
transferase [Cell envelope biogenesis, outer membrane].
Length = 357
Score = 38.8 bits (91), Expect = 0.011
Identities = 16/72 (22%), Positives = 30/72 (41%), Gaps = 13/72 (18%)
Query: 274 PDFMAASDCMLGKIGYGTVSEALAYKLPFVFV------RRD-YFNEEPFLRNMLEFYQGG 326
+AA+D ++ + G T++E LA +P + V +N + + G
Sbjct: 247 AALLAAADLVISRAGALTIAELLALGVPAILVPYPPGADGHQEYNAKFLEK------AGA 300
Query: 327 VEMIRRDLLTGH 338
+IR+ LT
Sbjct: 301 ALVIRQSELTPE 312
>gnl|CDD|233637 TIGR01920, Shik_kin_archae, shikimate kinase. This model
represents the shikimate kinase (SK) gene found in
archaea which is only distantly related to homoserine
kinase (thrB) and not atr all to the bacterial SK
enzyme. The SK from M. janaschii has been overexpressed
in E. coli and characterized. SK catalyzes the fifth
step of the biosynthesis of chorismate from
D-erythrose-4-phosphate and phosphoenolpyruvate [Amino
acid biosynthesis, Aromatic amino acid family].
Length = 261
Score = 36.2 bits (84), Expect = 0.061
Identities = 17/59 (28%), Positives = 32/59 (54%), Gaps = 2/59 (3%)
Query: 620 LMTELGVRF--EDSISMLVSSAVPEGKGVSSSASVEVASMSAIAAAHGLNIHPRDLALL 676
++T + +F D + + V S +P G G+ SS+++ A + A+ A G+ I D+ L
Sbjct: 51 ILTAIRSKFGIVDGLEVEVESEIPAGSGLKSSSALVNALVEAVLKAKGVEIDDIDILRL 109
>gnl|CDD|223161 COG0083, ThrB, Homoserine kinase [Amino acid transport and
metabolism].
Length = 299
Score = 36.0 bits (84), Expect = 0.072
Identities = 19/68 (27%), Positives = 35/68 (51%), Gaps = 2/68 (2%)
Query: 616 TILVLMTELGVRFEDSISMLVSSAVPEGKGVSSSASVEVASMSAIAAAHGLNIHPRDLAL 675
L + LG+ E + + + +P G+G+ SSA+ VA+++A GL + +L
Sbjct: 64 AALKFLEALGI--EAGVKIRIEKGIPLGRGLGSSAASIVAALAAANELAGLPLSKEELLQ 121
Query: 676 LCQKVENH 683
L ++E H
Sbjct: 122 LALEIEGH 129
>gnl|CDD|129295 TIGR00191, thrB, homoserine kinase. Homoserine kinase is part of
the threonine biosynthetic pathway.Homoserine kinase is
a member of the GHMP kinases (Galactokinase, Homoserine
kinase, Mevalonate kinase, Phosphomevalonate kinase) and
shares with them an amino-terminal domain probably
related to ATP binding.P.aeruginosa homoserine kinase
seems not to be homologous (see PROSITE:PDOC0054) [Amino
acid biosynthesis, Aspartate family].
Length = 302
Score = 35.8 bits (83), Expect = 0.085
Identities = 27/123 (21%), Positives = 50/123 (40%), Gaps = 13/123 (10%)
Query: 573 NRGPTFD---MDLSDFMD---------EGKPMSYEKAKKYFDTNPSQKWAAYVAGTILVL 620
N GP FD LS ++ E E + + P++ +
Sbjct: 11 NLGPGFDVLGAALSLYLGLTVTDVVAQESDDTEIEAEGEGVEKIPTEPTDNLIYQVAKRF 70
Query: 621 MTELGVRFEDSISMLVSSAVPEGKGVSSSASVEVASMSAIAAAHGLNIHPRDLALLCQKV 680
+ +LG+R + + + +P G+G+ SSA+ VA+++A GL + L ++
Sbjct: 71 LDQLGIRMP-PVKVTLEKNIPLGRGLGSSAAAIVAALAAANELCGLPLSKERLLDYASEL 129
Query: 681 ENH 683
E H
Sbjct: 130 EGH 132
>gnl|CDD|215325 PLN02605, PLN02605, monogalactosyldiacylglycerol synthase.
Length = 382
Score = 35.7 bits (83), Expect = 0.093
Identities = 30/114 (26%), Positives = 53/114 (46%), Gaps = 20/114 (17%)
Query: 208 KSRKEVRKELGIEDDVKLLILNFGGQPAG----------WKLKEEYL--PSGWKCLVCG- 254
+ + E+R+ELG+++D+ ++L GG+ G L ++ L P G ++CG
Sbjct: 191 RPKDELRRELGMDEDLPAVLLMGGGEGMGPLEETARALGDSLYDKNLGKPIGQVVVICGR 250
Query: 255 -----ASDSQLPPNFIKLPKDAYT--PDFMAASDCMLGKIGYGTVSEALAYKLP 301
+ + T ++M A DC++ K G GT++EAL LP
Sbjct: 251 NKKLQSKLESRDWKIPVKVRGFVTNMEEWMGACDCIITKAGPGTIAEALIRGLP 304
>gnl|CDD|224732 COG1819, COG1819, Glycosyl transferases, related to
UDP-glucuronosyltransferase [Carbohydrate transport and
metabolism / Signal transduction mechanisms].
Length = 406
Score = 35.1 bits (81), Expect = 0.15
Identities = 60/330 (18%), Positives = 95/330 (28%), Gaps = 70/330 (21%)
Query: 27 FGHATRVVEVVRNLISAGHDVHVVT--GAPDFVFTSEIQSPRLFIRKVLLDCGAVQADAL 84
+GH + + + L GH+V + +FV + + IR L
Sbjct: 12 YGHVNPCLALGKELRRRGHEVVFASTGKFKEFVEAAGLAFVAYPIRDSEL---------A 62
Query: 85 TVDRLASLEKYSETAVAPRKSILKDEVEWLNSIKADLVVSDVVPVACRAAADAGIRSVCV 144
T D + K + K ++++ +E L ++ DLVV D AA GI V
Sbjct: 63 TEDGKFAGVKSFRRLLQQFKKLIRELLELLRELEPDLVVDDARLSLGLAARLLGI-PVVG 121
Query: 145 TNFSWDFIYAEYVMAAGHHHRSIVWQIAED--------YSHCEFLIRLPGYC-------- 188
N A Y IA +
Sbjct: 122 INV------APYTPLPAAGLPLPPVGIAGKLPIPLYPLPPRLVRPLIFARSWLPKLVVRR 175
Query: 189 ------PMPAFRD-----------VIDV--------PLVVRRLHKSRKEVRKEL--GIED 221
+P R DV P + + E EL I
Sbjct: 176 NLGLELGLPNIRRLFASGPLLEIAYTDVLFPPGDRLPFIGPYIGPLLGEAANELPYWIPA 235
Query: 222 DVKLLILNFGGQPAGWKLKEEYL----PSGWKCLVC-GASDS---QLPPNFIKLPKDAYT 273
D ++ ++ G +L L + +V G + +P N I
Sbjct: 236 DRPIVYVSLGTVGNAVELLAIVLEALADLDVRVIVSLGGARDTLVNVPDNVIVADYVPQL 295
Query: 274 PDFMAASDCMLGKIGYGTVSEALAYKLPFV 303
+D ++ G GT SEAL +P V
Sbjct: 296 ELL-PRADAVIHHGGAGTTSEALYAGVPLV 324
>gnl|CDD|184179 PRK13608, PRK13608, diacylglycerol glucosyltransferase;
Provisional.
Length = 391
Score = 35.2 bits (81), Expect = 0.16
Identities = 21/58 (36%), Positives = 29/58 (50%), Gaps = 7/58 (12%)
Query: 275 DFMAASDCMLGKIGYGTVSEALAYKLPFVFVRRDYFNEEPF--LRNMLEFYQGGVEMI 330
++MA+S M+ K G T+SE LA +P +F N P L N L F + G I
Sbjct: 269 EWMASSQLMITKPGGITISEGLARCIPMIF-----LNPAPGQELENALYFEEKGFGKI 321
>gnl|CDD|217893 pfam04101, Glyco_tran_28_C, Glycosyltransferase family 28
C-terminal domain. The glycosyltransferase family 28
includes monogalactosyldiacylglycerol synthase (EC
2.4.1.46) and UDP-N-acetylglucosamine transferase (EC
2.4.1.-). Structural analysis suggests the C-terminal
domain contains the UDP-GlcNAc binding site.
Length = 167
Score = 34.2 bits (79), Expect = 0.16
Identities = 31/135 (22%), Positives = 52/135 (38%), Gaps = 22/135 (16%)
Query: 258 SQLPPNFIKLPKDAYTPDFMAASDCMLGKIGYGTVSEALAYKLPFVFVRRDYFNEEPFLR 317
S+ N P +++ A+D ++ + G GT++E LA P + V R E
Sbjct: 51 SKFGINVEVFPFIDNMAEYIKAADLVISRAGAGTIAELLALGKPAILVPRPKAAGEHQDN 110
Query: 318 NMLEFY-QGGVEMIRRDLLTGHWKPYLERAISLKPCYEGGINGGEVAAHILQETAIGKNY 376
N LE G ++ + LT L A+ +L + + Y
Sbjct: 111 NALELVKAGAALVLLQKELTPEK---LVEALL----------------KLLLKPL--RLY 149
Query: 377 ASDKLSGARRLRDAI 391
+K + RL+DAI
Sbjct: 150 EMNKAAKGSRLKDAI 164
>gnl|CDD|99982 cd03811, GT1_WabH_like, This family is most closely related to
the GT1 family of glycosyltransferases. WabH in
Klebsiella pneumoniae has been shown to transfer a
GlcNAc residue from UDP-GlcNAc onto the acceptor GalUA
residue in the cellular outer core.
Length = 353
Score = 35.0 bits (81), Expect = 0.16
Identities = 18/77 (23%), Positives = 32/77 (41%), Gaps = 2/77 (2%)
Query: 19 AYYVTGHGFGHATRV-VEVVRNLISAGHDVHVVTGAPDFVFTSEIQSPRLFIRKVLLDCG 77
+ + G G A RV + + L G+DV +V + + + S I +L
Sbjct: 3 LFVIPSLGGGGAERVLLNLANGLDKRGYDVTLVVLRDEGDYLELLPSNVKLIPVRVLKLK 62
Query: 78 AVQADALTVDRLASLEK 94
++ D L + RL L +
Sbjct: 63 SL-RDLLAILRLRRLLR 78
>gnl|CDD|99968 cd03794, GT1_wbuB_like, This family is most closely related to
the GT1 family of glycosyltransferases. wbuB in E. coli
is involved in the biosynthesis of the O26 O-antigen.
It has been proposed to function as an
N-acetyl-L-fucosamine (L-FucNAc) transferase.
Length = 394
Score = 34.9 bits (81), Expect = 0.19
Identities = 14/41 (34%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 16 LVFAYYVTGHGFGHATRVVEVVRNLISAGHDVHVVTGAPDF 56
++ Y+ G G A R E+ L+ GH+V V+TG+P++
Sbjct: 4 ILSQYFPPELG-GGAFRTTELAEELVKRGHEVTVITGSPNY 43
>gnl|CDD|222237 pfam13579, Glyco_trans_4_4, Glycosyl transferase 4-like domain.
Length = 158
Score = 33.2 bits (76), Expect = 0.28
Identities = 11/28 (39%), Positives = 15/28 (53%)
Query: 28 GHATRVVEVVRNLISAGHDVHVVTGAPD 55
G V+E+ R L + GH+V VVT
Sbjct: 2 GAERYVLELARALAARGHEVTVVTPRGP 29
>gnl|CDD|99960 cd03784, GT1_Gtf_like, This family includes the Gtfs, a group of
homologous glycosyltransferases involved in the final
stages of the biosynthesis of antibiotics vancomycin and
related chloroeremomycin. Gtfs transfer sugar moieties
from an activated NDP-sugar donor to the oxidatively
cross-linked heptapeptide core of vancomycin group
antibiotics. The core structure is important for the
bioactivity of the antibiotics.
Length = 401
Score = 34.3 bits (79), Expect = 0.33
Identities = 35/161 (21%), Positives = 52/161 (32%), Gaps = 23/161 (14%)
Query: 28 GHATRVVEVVRNLISAGHDVHVVTGAPDFVFTSEIQSPRLFIRKVLLDCGAV-------Q 80
G +V + L +AGH+V V T P+F + L+ V
Sbjct: 12 GDVQPLVALAWALRAAGHEVRVATP-PEF---------ADLVEAAGLEFVPVGGDPDELL 61
Query: 81 ADALTVDRLASLEKYSETAVAPR-----KSILKDEVEWLNSIKADLVVSDVVPVACRAAA 135
A L L +++L D V DLVV+D + A AA
Sbjct: 62 ASPERNAGLLLLGPGLLLGALRLLRREAEAMLDDLVAAARDWGPDLVVADPLAFAGAVAA 121
Query: 136 DA-GIRSVCVTNFSWDFIYAEYVMAAGHHHRSIVWQIAEDY 175
+A GI +V + A + R AE +
Sbjct: 122 EALGIPAVRLLLGPDTPTSAFPPPLGRANLRLYALLEAELW 162
>gnl|CDD|99971 cd03798, GT1_wlbH_like, This family is most closely related to
the GT1 family of glycosyltransferases. wlbH in
Bordetella parapertussis has been shown to be required
for the biosynthesis of a trisaccharide that, when
attached to the B. pertussis lipopolysaccharide (LPS)
core (band B), generates band A LPS.
Length = 377
Score = 33.9 bits (78), Expect = 0.42
Identities = 15/59 (25%), Positives = 23/59 (38%), Gaps = 1/59 (1%)
Query: 16 LVFA-YYVTGHGFGHATRVVEVVRNLISAGHDVHVVTGAPDFVFTSEIQSPRLFIRKVL 73
LV + Y + G V E+ R L G +V V+ P ++ RL + L
Sbjct: 2 LVISSLYPPPNNGGGGIFVKELARALAKRGVEVTVLAPGPWGPKLLDLLKGRLVGVERL 60
>gnl|CDD|219894 pfam08544, GHMP_kinases_C, GHMP kinases C terminal. This family
includes homoserine kinases, galactokinases and
mevalonate kinases.
Length = 86
Score = 30.9 bits (70), Expect = 0.48
Identities = 22/87 (25%), Positives = 38/87 (43%), Gaps = 9/87 (10%)
Query: 865 LTAAASDDQLTSLGELLYQCHYSYSACGLGSDGTDRLVQLVQEIQHSKVSKSKDGTLFGA 924
L A + L LGELL + S + L +L++E++ GA
Sbjct: 1 LLEALREGDLELLGELLNENALSLEPLLYLGILSPELDELLEELRELGA--------LGA 52
Query: 925 KITGGGSGGTICVIGRNSLRSSEQVLE 951
K++G G G T+ + ++ +E+V E
Sbjct: 53 KLSGSGGGPTVFALFKDE-EDAEEVAE 78
>gnl|CDD|215978 pfam00533, BRCT, BRCA1 C Terminus (BRCT) domain. The BRCT domain
is found predominantly in proteins involved in cell
cycle checkpoint functions responsive to DNA damage. The
BRCT domain of XRCC1 forms a homodimer in the crystal
structure. This suggests that pairs of BRCT domains
associate as homo- or heterodimers. BRCT domains are
often found as tandem-repeat pairs. Structures of the
BRCA1 BRCT domains revealed a basis for a widely
utilised head-to-tail BRCT-BRCT oligomerisation mode.
This conserved tandem BRCT architecture facilitates
formation of the canonical BRCT phospho-peptide
interaction cleft at a groove between the BRCT domains.
Disease associated missense and nonsense mutations in
the BRCA1 BRCT domains disrupt peptide binding by
directly occluding this peptide binding groove, or by
disrupting key conserved BRCT core folding determinants.
Length = 77
Score = 30.4 bits (69), Expect = 0.66
Identities = 14/66 (21%), Positives = 23/66 (34%), Gaps = 10/66 (15%)
Query: 56 FVFTSEIQSPRLFIRKVLLDCGAV-------QADALTVDRLASLEKYSETAVAPRKSILK 108
FV T R + +++ G + + V A K + A+A I+
Sbjct: 11 FVITGLDSLEREELEELIEKLGGKVTDSLSKKTTHVIVGENAGTLKLLK-AIALGIPIVT 69
Query: 109 DEVEWL 114
EWL
Sbjct: 70 --EEWL 73
>gnl|CDD|179875 PRK04758, PRK04758, hypothetical protein; Validated.
Length = 181
Score = 32.1 bits (73), Expect = 0.86
Identities = 29/132 (21%), Positives = 52/132 (39%), Gaps = 23/132 (17%)
Query: 355 GGINGGEVAAHILQETAIGKNYASDKLSGARRLRDAII-------FGYEL------QRVP 401
GG +G + A IL + L +LR A + F +EL +
Sbjct: 37 GGADGADWLARILADAGQVAPAQGSALD---QLRQATVAQLEDRDFAFELLLAEDGAPLA 93
Query: 402 GRDVSIPEWYQTAEDELGLSASRSPPCTPEGDSTV-------KLSTEDFEILHGDCQGLP 454
R ++ +W + GL+A + P + EG+ + + S++DF+ D L
Sbjct: 94 ARADALFDWCRAFLGGFGLAAQQRPALSEEGEEALQDLARLAQASSDDFDAAEEDDTALA 153
Query: 455 DTMSFLKSLVEL 466
+ F++ V L
Sbjct: 154 EIEEFVRVAVLL 165
>gnl|CDD|233284 TIGR01133, murG, undecaprenyldiphospho-muramoylpentapeptide
beta-N-acetylglucosaminyltransferase. RM 8449890 RT The
final step of peptidoglycan subunit assembly in
Escherichia coli occurs in the cytoplasm. RA Bupp K, van
Heijenoort J. RL J Bacteriol 1993 Mar;175(6):1841-3
[Cell envelope, Biosynthesis and degradation of murein
sacculus and peptidoglycan].
Length = 348
Score = 32.2 bits (74), Expect = 1.2
Identities = 37/179 (20%), Positives = 62/179 (34%), Gaps = 41/179 (22%)
Query: 192 AFRDVIDVPLVVRRLHKSRKEVRK------ELGIEDDVKLLILNFGG-QPAG-------- 236
+F D V + R+E+R G+ + K IL GG Q A
Sbjct: 142 SFPGAKDHFEAVLVGNPVRQEIRSLPVPRERFGLREG-KPTILVLGGSQGAKILNELVPK 200
Query: 237 --WKLKEEYLPSGWKCLVCGASDSQLPPN-FIKLPKDAYTPDF-------MAASDCMLGK 286
KL E+ + G +D + N + +L +A AA+D ++ +
Sbjct: 201 ALAKLAEKGIQI---VHQTGKNDLEKVKNVYQELGIEAIVTFIDENMAAAYAAADLVISR 257
Query: 287 IGYGTVSEALAYKLPFVFV------RRDYFNEEPFLRNMLEFYQGGVEMIRRDLLTGHW 339
G TV+E A +P + + Y+N + G +IR+ L
Sbjct: 258 AGASTVAELAAAGVPAILIPYPYAADDQYYNAKFLED------LGAGLVIRQKELLPEK 310
>gnl|CDD|224599 COG1685, COG1685, Archaeal shikimate kinase [Amino acid transport
and metabolism / Coenzyme metabolism].
Length = 278
Score = 31.9 bits (73), Expect = 1.3
Identities = 17/54 (31%), Positives = 25/54 (46%), Gaps = 6/54 (11%)
Query: 623 ELGVRFEDSISMLVSSAVPEGKGVSSSASVEVASMSAIAAAHGLNIHPRDLALL 676
LGV E V S +P G G+ SS++ A + A+ A G I ++ L
Sbjct: 68 PLGVEVE------VESEIPVGSGLKSSSAASNALVKAVLKALGEEIDDFEILRL 115
>gnl|CDD|235513 PRK05579, PRK05579, bifunctional phosphopantothenoylcysteine
decarboxylase/phosphopantothenate synthase; Validated.
Length = 399
Score = 32.0 bits (74), Expect = 1.4
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 30 ATRVVEVVRNLISAGHDVHVV 50
A + +E+VR L AG DV VV
Sbjct: 19 AYKALELVRRLRKAGADVRVV 39
>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 576
Score = 31.6 bits (72), Expect = 2.2
Identities = 21/67 (31%), Positives = 35/67 (52%), Gaps = 5/67 (7%)
Query: 654 VASMSAIAAAHGLNIHPRDLALLCQKVENHIVGAPCGVMDQMASACGEANKLLAMVCQPA 713
V + IA G++I LAL+ +K + + + +DQ+ + CG+A + A
Sbjct: 184 VDRLRYIADQEGISISDAALALVARKGDGSMRDS-LSTLDQVLAFCGDAVGDDDV----A 238
Query: 714 ELLGVVE 720
ELLGVV+
Sbjct: 239 ELLGVVD 245
>gnl|CDD|235098 PRK03007, PRK03007, deoxyguanosinetriphosphate
triphosphohydrolase-like protein; Provisional.
Length = 428
Score = 31.1 bits (71), Expect = 3.1
Identities = 15/50 (30%), Positives = 22/50 (44%), Gaps = 6/50 (12%)
Query: 460 LKSLVELDIIKDSDRTPEKRQMRERKAAAGLFNWEEEIFVARAPGRLDVM 509
LK+L ++ SD RQ R+R+ + + A APG LD
Sbjct: 341 LKTLALQFVM--SDPRHLARQARQRERIHRVADA----LWAGAPGALDPQ 384
>gnl|CDD|217035 pfam02441, Flavoprotein, Flavoprotein. This family contains
diverse flavoprotein enzymes. This family includes
epidermin biosynthesis protein, EpiD, which has been
shown to be a flavoprotein that binds FMN. This enzyme
catalyzes the removal of two reducing equivalents from
the cysteine residue of the C-terminal meso-lanthionine
of epidermin to form a --C==C-- double bond. This family
also includes the B chain of dipicolinate synthase a
small polar molecule that accumulates to high
concentrations in bacterial endospores, and is thought
to play a role in spore heat resistance, or the
maintenance of heat resistance. dipicolinate synthase
catalyzes the formation of dipicolinic acid from
dihydroxydipicolinic acid. This family also includes
phenyl-acrylic acid decarboxylase (EC:4.1.1.-).
Length = 132
Score = 29.4 bits (67), Expect = 3.6
Identities = 24/115 (20%), Positives = 45/115 (39%), Gaps = 31/115 (26%)
Query: 14 KHLVFAYYVTGHGFGHATRVVEVVRNLISAGHDVHVVTGAPDFVFTSEIQSPRLFIRKVL 73
K ++ +TG A + + ++R L G +V V + + ++ + FI
Sbjct: 1 KKILLG--ITGSS--AAIKALRLLRELKKEGAEVRV-------ILS---KAAKKFITPET 46
Query: 74 LDCGAVQADALTVDRLASLEKYSETAVAPRKSILKDEVEWLNSIKADLVVSDVVP 128
L +AL+ + + + Y E + + L S ADL+V V P
Sbjct: 47 L-------EALSGEEVYTDAWYDEDDIIAHIA--------LGSRWADLMV--VAP 84
>gnl|CDD|173924 cd02173, ECT, CTP:phosphoethanolamine cytidylyltransferase (ECT).
CTP:phosphoethanolamine cytidylyltransferase (ECT)
catalyzes the conversion of phosphoethanolamine to
CDP-ethanolamine as part of the CDP-ethanolamine
biosynthesis pathway. ECT expression in hepatocytes is
localized predominantly to areas of the cytoplasm that
are rich in rough endoplasmic reticulum. Several ECTs,
including yeast and human ECT, have large repetitive
sequences located within their N- and C-termini.
Length = 152
Score = 29.5 bits (67), Expect = 4.7
Identities = 10/27 (37%), Positives = 19/27 (70%), Gaps = 1/27 (3%)
Query: 666 LNIHPRDLALL-CQKVENHIVGAPCGV 691
+N+H R L++L C+ V+ ++GAP +
Sbjct: 53 MNLHERVLSVLACRYVDEVVIGAPYVI 79
>gnl|CDD|222070 pfam13358, DDE_3, DDE superfamily endonuclease. This family of
proteins are related to pfam00665 and are probably
endonucleases of the DDE superfamily. Transposase
proteins are necessary for efficient DNA transposition.
This domain is a member of the DDE superfamily, which
contain three carboxylate residues that are believed to
be responsible for coordinating metal ions needed for
catalysis. The catalytic activity of this enzyme
involves DNA cleavage at a specific site followed by a
strand transfer reaction.
Length = 146
Score = 29.2 bits (66), Expect = 4.9
Identities = 9/30 (30%), Positives = 14/30 (46%)
Query: 160 AGHHHRSIVWQIAEDYSHCEFLIRLPGYCP 189
A +H V ++ E + L+ LP Y P
Sbjct: 89 ASYHKSKEVQELVEAEAAGIELLYLPPYSP 118
>gnl|CDD|99961 cd03785, GT1_MurG, MurG is an N-acetylglucosaminyltransferase, the
last enzyme involved in the intracellular phase of
peptidoglycan biosynthesis. It transfers
N-acetyl-D-glucosamine (GlcNAc) from UDP-GlcNAc to the
C4 hydroxyl of a lipid-linked N-acetylmuramoyl
pentapeptide (NAM). The resulting disaccharide is then
transported across the cell membrane, where it is
polymerized into NAG-NAM cell-wall repeat structure.
MurG belongs to the GT-B structural superfamily of
glycoslytransferases, which have characteristic N- and
C-terminal domains, each containing a typical Rossmann
fold. The two domains have high structural homology
despite minimal sequence homology. The large cleft that
separates the two domains includes the catalytic center
and permits a high degree of flexibility.
Length = 350
Score = 30.1 bits (69), Expect = 5.0
Identities = 16/73 (21%), Positives = 28/73 (38%), Gaps = 13/73 (17%)
Query: 274 PDFMAASDCMLGKIGYGTVSEALAYKLPFVFV-----RRD--YFNEEPFLRNMLEFYQGG 326
AA+D ++ + G TV+E A LP + + D N ++ G
Sbjct: 247 AAAYAAADLVISRAGASTVAELAALGLPAILIPLPYAADDHQTANARALVKA------GA 300
Query: 327 VEMIRRDLLTGHW 339
+I ++ LT
Sbjct: 301 AVLIPQEELTPER 313
>gnl|CDD|129913 TIGR00833, actII, Transport protein. The
Resistance-Nodulation-Cell Division (RND) Superfamily-
MmpL sub family (TC 2.A.6.5)Characterized members of the
RND superfamily all probably catalyze substrate efflux
via an H+ antiport mechanism. These proteins are found
ubiquitously in bacteria, archaea and eukaryotes. This
sub-family includes the S. coelicolor ActII3 protein,
which may play a role in drug resistance, and the M.
tuberculosis MmpL7 protein, which catalyzes export of an
outer membrane lipid, phthiocerol dimycocerosate
[Transport and binding proteins, Unknown substrate].
Length = 910
Score = 30.7 bits (69), Expect = 5.1
Identities = 21/90 (23%), Positives = 34/90 (37%), Gaps = 3/90 (3%)
Query: 789 EAEASLDYLCNLSPHRFEALYAKNIPESIVGEEFSKNYGDHNDPVTVIDPKRTYFVRAPV 848
EA A L + E L ++ ++ + SK G D + DP R + P
Sbjct: 502 EAIAGLRLVQAGMRLDGENLGQVSLAVRLMQQAISKLQGSAGDVFDIFDPLRRFVAAIPE 561
Query: 849 CH--PIYENFRVKAFKALLTAAASDDQLTS 876
C P+ R + +A T +S +L
Sbjct: 562 CRANPVCSVAR-EIVQAADTVVSSAAKLAD 590
>gnl|CDD|222130 pfam13439, Glyco_transf_4, Glycosyltransferase Family 4.
Length = 171
Score = 29.7 bits (67), Expect = 5.3
Identities = 10/24 (41%), Positives = 13/24 (54%)
Query: 28 GHATRVVEVVRNLISAGHDVHVVT 51
G V+E+ R L GH+V VV
Sbjct: 13 GVERVVLELARALARRGHEVTVVA 36
>gnl|CDD|237379 PRK13412, fkp, bifunctional
fucokinase/L-fucose-1-P-guanylyltransferase;
Provisional.
Length = 974
Score = 30.2 bits (68), Expect = 6.2
Identities = 46/191 (24%), Positives = 80/191 (41%), Gaps = 31/191 (16%)
Query: 484 RKAAAGLFNWEEEIFVARAPGRLDVMGGIAD------YSGSLVLQMPIREACHVALQK-I 536
RK L + ++I R+P R+D+ GG D YSG V+ + I LQ +
Sbjct: 596 RKQTPKLEVYSDQIVWGRSPVRIDLAGGWTDTPPYCLYSGGNVVNLAIELNGQPPLQVYV 655
Query: 537 SPSKQRLWKHALARHNDKGQGPMPVLQIVSYGSELSNRGPTFDMDLSDFMDEGKPMSYEK 596
P + H + R D G +++V T + +L D+ G P S K
Sbjct: 656 KPCSE---PHIVLRSIDLG-----AMEVVR----------TNE-ELRDYKKVGSPFSIPK 696
Query: 597 AKKYFDTNPSQKWAAYVAGTILVLMTELGVRFEDSISMLVSSAVPEGKGVSSSASVEVAS 656
A + +++A ++ + G E I++L +A+P G G+ +S+ +
Sbjct: 697 AALCL-AGFAPRFSAESYASLEEQLKAFGSGIE--ITLL--AAIPAGSGLGTSSILAATV 751
Query: 657 MSAIAAAHGLN 667
+ AI+ GL
Sbjct: 752 LGAISDFCGLA 762
>gnl|CDD|220466 pfam09909, DUF2138, Uncharacterized protein conserved in bacteria
(DUF2138). This domain, found in various hypothetical
prokaryotic proteins, has no known function.
Length = 552
Score = 30.1 bits (68), Expect = 6.3
Identities = 16/53 (30%), Positives = 22/53 (41%), Gaps = 7/53 (13%)
Query: 859 KAFKALLTAAASDDQLTSLGEL-------LYQCHYSYSACGLGSDGTDRLVQL 904
K + L A SD QL+ + +YQ Y+ L + DRLV L
Sbjct: 144 KLLEPLAFVATSDSQLSKVAISLGSGGVPVYQLRYNGGRSLLFASKGDRLVAL 196
>gnl|CDD|235235 PRK04165, PRK04165, acetyl-CoA decarbonylase/synthase complex
subunit gamma; Provisional.
Length = 450
Score = 29.8 bits (68), Expect = 6.8
Identities = 18/58 (31%), Positives = 27/58 (46%), Gaps = 6/58 (10%)
Query: 211 KEVRKELGIEDDVK--LLILNFGGQPAGWKLKEEYLPSGWKCLVCGASDSQLPPNFIK 266
+ KE GIE+ V LI+ ++E +GW+ LV G +S P F+K
Sbjct: 391 ADFIKESGIEEKVNHRKLIIPGLAAVLSGDIEEL---TGWEVLV-GPRESSGIPAFLK 444
>gnl|CDD|237445 PRK13609, PRK13609, diacylglycerol glucosyltransferase;
Provisional.
Length = 380
Score = 29.7 bits (67), Expect = 7.5
Identities = 17/59 (28%), Positives = 29/59 (49%), Gaps = 3/59 (5%)
Query: 275 DFMAASDCMLGKIGYGTVSEALAYKLPFVFVRRDYFNEEPFLRNMLEFYQGGVEMIRRD 333
+ + CM+ K G T+SEA A +P V + + +E N + F + G ++ RD
Sbjct: 269 ELFRVTSCMITKPGGITLSEAAALGVP-VILYKPVPGQE--KENAMYFERKGAAVVIRD 324
>gnl|CDD|227866 COG5579, COG5579, Uncharacterized conserved protein [Function
unknown].
Length = 143
Score = 28.7 bits (64), Expect = 8.6
Identities = 18/94 (19%), Positives = 32/94 (34%), Gaps = 6/94 (6%)
Query: 307 RDYFNEEPFLRNMLEFYQGGVEMIRRDLLTGHWKPYL-ERAISLKPCYEG---GINGGEV 362
D F+ + F+ Y+ VE +R HW ++ + L I+ E
Sbjct: 3 SDPFDLKRFVYAQAPVYRAVVEELRAGRKRTHWMWFIFPQLAGLGSSPLAVRYAISSLEE 62
Query: 363 AAHILQETAIGKNY--ASDKLSGARRLRDAIIFG 394
A L +G + ++ + IFG
Sbjct: 63 AQAYLAHDLLGPRLHECTGLVNQVQGRSIEEIFG 96
>gnl|CDD|237977 cd00003, PNPsynthase, Pyridoxine 5'-phosphate (PNP) synthase
domain; pyridoxal 5'-phosphate is the active form of
vitamin B6 that acts as an essential, ubiquitous
coenzyme in amino acid metabolism. In bacteria,
formation of pyridoxine 5'-phosphate is a step in the
biosynthesis of vitamin B6. PNP synthase, a
homooctameric enzyme, catalyzes the final step in PNP
biosynthesis, the condensation of 1-amino-acetone
3-phosphate and 1-deoxy-D-xylulose 5-phosphate. PNP
synthase adopts a TIM barrel topology, intersubunit
contacts are mediated by three ''extra'' helices,
generating a tetramer of symmetric dimers with shared
active sites; the open state has been proposed to accept
substrates and to release products, while most of the
catalytic events are likely to occur in the closed
state; a hydrophilic channel running through the center
of the barrel was identified as the essential structural
feature that enables PNP synthase to release water
molecules produced during the reaction from the closed,
solvent-shielded active site.
Length = 234
Score = 29.0 bits (66), Expect = 9.0
Identities = 14/48 (29%), Positives = 23/48 (47%), Gaps = 11/48 (22%)
Query: 652 VEVASMSAIAAAHGLNIHPR---------DLALLCQ--KVENHIVGAP 688
VE A ++ A A G+ +H R D+ LL + + E ++ AP
Sbjct: 24 VEAALLAEKAGADGITVHLREDRRHIQDRDVRLLRELVRTELNLEMAP 71
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.136 0.407
Gapped
Lambda K H
0.267 0.0667 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 49,596,738
Number of extensions: 5036470
Number of successful extensions: 4295
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4272
Number of HSP's successfully gapped: 77
Length of query: 953
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 847
Effective length of database: 6,236,078
Effective search space: 5281958066
Effective search space used: 5281958066
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 64 (28.6 bits)