BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 020779
(321 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9SEE5|GALK1_ARATH Galactokinase OS=Arabidopsis thaliana GN=GAL1 PE=1 SV=2
Length = 496
Score = 389 bits (999), Expect = e-107, Method: Compositional matrix adjust.
Identities = 194/252 (76%), Positives = 224/252 (88%)
Query: 26  GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECEQFIGTQSGGMDQAISIMAKSG 85
           GSGLSSS AFVCS+T+A+MA FG    KKE+AQLTCECE+ IGTQSGGMDQAISIMAK+G
Sbjct: 161 GSGLSSSAAFVCSATIAIMAVFGHNFEKKELAQLTCECERHIGTQSGGMDQAISIMAKTG 220
Query: 86  FAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTAIVLAIKL 145
           FAELIDFNP+R TDV+LP GG+FV+AHSLAES KA+TAA NYNNRVVECRL +I+L +KL
Sbjct: 221 FAELIDFNPVRATDVKLPDGGSFVIAHSLAESQKAVTAAKNYNNRVVECRLASIILGVKL 280
Query: 146 GMKPQEAISKVKTLSDVEGLCVAFACKNGSSDPVFAVKEFLRKEPYTALDIEKITEEKLT 205
           GM+P+EAISKVKTLSDVEGLCV+FA   GSSDP+ AVKE+L++EPYTA +IEKI EEKL
Sbjct: 281 GMEPKEAISKVKTLSDVEGLCVSFAGDRGSSDPLLAVKEYLKEEPYTAEEIEKILEEKLP 340
Query: 206 SIFANSSSSLDVLNAAKQYKLHQRAAHVYSEAKRVHAFKDTVSSNLSEEDKLKKLGDLMN 265
           SI  N  +SL VLNAA  +KLHQRAAHVYSEA+RVH FKDTV+SNLS+E+KLKKLGDLMN
Sbjct: 341 SIVNNDPTSLAVLNAATHFKLHQRAAHVYSEARRVHGFKDTVNSNLSDEEKLKKLGDLMN 400
Query: 266 DSHHSCSVLYEC 277
           +SH+SCSVLYEC
Sbjct: 401 ESHYSCSVLYEC 412
>sp|Q54DN6|GALK_DICDI Galactokinase OS=Dictyostelium discoideum GN=galK PE=3 SV=1
Length = 501
Score = 163 bits (413), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 99/270 (36%), Positives = 163/270 (60%), Gaps = 28/270 (10%)
Query: 25 LGSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECEQFIGTQSGGMDQAISIMAKS 84
+G+G+SSS+A VC ST+A+ + + K+E+AQL+ + E+++G +SGGMDQ+IS +A+
Sbjct: 157 MGAGVSSSSALVCVSTLAISYCNNLILNKEELAQLSIKSERYVGVESGGMDQSISFLAEQ 216
Query: 85 GFAELIDFNP-IRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTAIVLAI 143
A+LI+F+P ++T DVQLP G +FV+ +SL +SLK +T A+NYN RVVECRL A++LA
Sbjct: 217 NTAKLIEFHPSLKTFDVQLPKGVSFVICNSLVDSLKVVTGATNYNLRVVECRLAAVLLAF 276
Query: 144 KLGMKPQEAISKVKTLSDV--EGLCVAFACKNGSSDPVFAVKEFLRKEPYTALDIEKITE 201
G+ + KV+ L DV +G + + + + R+E T LD I+
Sbjct: 277 HCGL----SWEKVRRLRDVQYQGNFDLPQLIQLTEQHLSEKQTYTREEVATILD---ISV 329
Query: 202 EKLTSIFANSSSSLDVLNAAKQYKLHQRAAHVYSEAKRVHAFKDTVSSNLSEEDK----- 256
E+L + S ++ ++ ++L++RA HV++E +RV+ F + + +
Sbjct: 330 EQLVKTYFPSGITVQ----SEHFELYKRARHVFTETQRVYKFSEICKQQSNFNNNNNNNN 385
Query: 257 ---------LKKLGDLMNDSHHSCSVLYEC 277
+++LG LMN+SH SCS L+EC
Sbjct: 386 NNSSNNTNIIQELGKLMNESHESCSKLFEC 415
>sp|Q5XIG6|GALK2_RAT N-acetylgalactosamine kinase OS=Rattus norvegicus GN=Galk2 PE=2 SV=1
Length = 458
Score = 155 bits (391), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 97/250 (38%), Positives = 147/250 (58%), Gaps = 27/250 (10%)
Query: 34 AFVCSSTVALMAAFGVEVPKKEIAQLTCECEQFIGTQSGGMDQAISIMAKSGFAELIDFN 93
A VC + + + G+ + K E+A++ + E++IGT+ GGMDQ+IS +A+ G A+LI+F+
Sbjct: 149 ALVCCAGLVTLTVLGMRLSKVELAEICAKSERYIGTEGGGMDQSISFLAEEGTAKLIEFS 208
Query: 94 PIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTAIVLAIKLGMKPQEAI 153
P+R TDV+LP+G FV+A+S E KA A S++N RV+ECRL A VLA G++
Sbjct: 209 PLRATDVKLPSGAVFVIANSCVEMNKA--ATSHFNVRVMECRLAAKVLAKHKGLQ----W 262
Query: 154 SKVKTLSDVEG-LCVAFACKNGSSDPVFAVKEFLRKEPYTALDIEKI----TEEKLTSIF 208
KV L +V+ L ++ + + ++ L EPY+ +I K EE T I
Sbjct: 263 DKVLRLEEVQSELGISL------EEMLLVTEDALHAEPYSREEICKCLGISLEELRTQIL 316
Query: 209 A-NSSSSLDVLNAAKQYKLHQRAAHVYSEAKRVHAFKDTVSSNLSEEDKLKKLGDLMNDS 267
+ N+ L +KL+QRA HVYSEA RV FK + + ++ ++ LG+LMN S
Sbjct: 317 SPNTQGELT-------FKLYQRAKHVYSEAARVLQFKQVCEA--APDNAVQLLGELMNQS 367
Query: 268 HHSCSVLYEC 277
H SC +YEC
Sbjct: 368 HRSCRDMYEC 377
>sp|Q01415|GALK2_HUMAN N-acetylgalactosamine kinase OS=Homo sapiens GN=GALK2 PE=1 SV=1
Length = 458
Score = 150 bits (380), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 99/250 (39%), Positives = 145/250 (58%), Gaps = 27/250 (10%)
Query: 34 AFVCSSTVALMAAFGVEVPKKEIAQLTCECEQFIGTQSGGMDQAISIMAKSGFAELIDFN 93
A VC + + + G + K E+A++ + E++IGT+ GGMDQ+IS +A+ G A+LI+F+
Sbjct: 149 ALVCCAGLVTLTVLGRNLSKVELAEICAKSERYIGTEGGGMDQSISFLAEEGTAKLIEFS 208
Query: 94 PIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTAIVLAIKLGMKPQEAI 153
P+R TDV+LP+G FV+A+S E KA A S++N RV+ECRL A +LA ++
Sbjct: 209 PLRATDVKLPSGAVFVIANSCVEMNKA--ATSHFNIRVMECRLAAKLLAKYKSLQ----W 262
Query: 154 SKVKTLSDVEGLCVAFACKNGSS--DPVFAVKEFLRKEPYTALDIEKI----TEEKLTSI 207
KV L +V+ K G S + + ++ L EPY +I + EE T I
Sbjct: 263 DKVLRLEEVQA-------KLGISLEEMLLVTEDALHPEPYNPEEICRCLGISLEELRTQI 315
Query: 208 FANSSSSLDVLNAAKQYKLHQRAAHVYSEAKRVHAFKDTVSSNLSEEDKLKKLGDLMNDS 267
S ++ DVL +KL+QRA HVYSEA RV FK + E+ ++ LG+LMN S
Sbjct: 316 L--SPNTQDVL----IFKLYQRAKHVYSEAARVLQFKKICEE--APENMVQLLGELMNQS 367
Query: 268 HHSCSVLYEC 277
H SC +YEC
Sbjct: 368 HMSCRDMYEC 377
>sp|Q5R6J8|GALK2_PONAB N-acetylgalactosamine kinase OS=Pongo abelii GN=GALK2 PE=2 SV=1
Length = 458
Score = 150 bits (380), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 99/250 (39%), Positives = 145/250 (58%), Gaps = 27/250 (10%)
Query: 34 AFVCSSTVALMAAFGVEVPKKEIAQLTCECEQFIGTQSGGMDQAISIMAKSGFAELIDFN 93
A VC + + + G + K E+A++ + E++IGT+ GGMDQ+IS +A+ G A+LI+F+
Sbjct: 149 ALVCCAGLVTLTVLGRNLSKVELAEICAKSERYIGTEGGGMDQSISFLAEEGTAKLIEFS 208
Query: 94 PIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTAIVLAIKLGMKPQEAI 153
P+R TDV+LP+G FV+A+S E KA A S++N RV+ECRL A +LA ++
Sbjct: 209 PLRATDVKLPSGAVFVIANSCVEMNKA--ATSHFNIRVMECRLAAKLLAKYKSLQ----W 262
Query: 154 SKVKTLSDVEGLCVAFACKNGSS--DPVFAVKEFLRKEPYTALDIEKI----TEEKLTSI 207
KV L +V+ K G S + + ++ L EPY +I + EE T I
Sbjct: 263 DKVLRLEEVQA-------KLGISLEEMLLVTEDALHPEPYNPEEICRCLGISLEELRTQI 315
Query: 208 FANSSSSLDVLNAAKQYKLHQRAAHVYSEAKRVHAFKDTVSSNLSEEDKLKKLGDLMNDS 267
S ++ DVL +KL+QRA HVYSEA RV FK + E+ ++ LG+LMN S
Sbjct: 316 L--SPNTQDVL----IFKLYQRAKHVYSEAARVLQFKKICEE--APENMVQLLGELMNQS 367
Query: 268 HHSCSVLYEC 277
H SC +YEC
Sbjct: 368 HMSCRDMYEC 377
>sp|Q68FH4|GALK2_MOUSE N-acetylgalactosamine kinase OS=Mus musculus GN=Galk2 PE=1 SV=1
Length = 458
Score = 150 bits (379), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 88/247 (35%), Positives = 148/247 (59%), Gaps = 21/247 (8%)
Query: 34 AFVCSSTVALMAAFGVEVPKKEIAQLTCECEQFIGTQSGGMDQAISIMAKSGFAELIDFN 93
A VC + + + G+ + K E+A++ + E++IGT+ GGMDQ+IS +A+ G A+LI+F+
Sbjct: 149 ALVCCAGLVTLTVLGLRLSKVELAEICAKSERYIGTEGGGMDQSISFLAEEGTAKLIEFS 208
Query: 94 PIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTAIVLAIKLGMKPQEAI 153
P+R T+V+LP+G FV+A+S E KA A S++N RV+ECRL A VLA G++ + +
Sbjct: 209 PLRATNVKLPSGAVFVIANSCMEMNKA--ATSHFNVRVMECRLAAKVLAKHKGLQ-WDNV 265
Query: 154 SKVKTLSDVEGLCVAFACKNGSSDPVFAVKEFLRKEPYTALDIEK---ITEEKLTSIFAN 210
+++ + G+ + + + ++ L EPY+ +I + I+ E+L +
Sbjct: 266 LRLEEVQSKLGISL--------EEMLLVTEDALHPEPYSREEICRCLGISLERLRTQILT 317
Query: 211 SSSSLDVLNAAKQYKLHQRAAHVYSEAKRVHAFKDTVSSNLSEEDKLKKLGDLMNDSHHS 270
++ ++ +KL+QRA HVYSEA RV FK + ++ ++ LG+LMN SH S
Sbjct: 318 PNTQDEL-----TFKLYQRAKHVYSEAARVLQFKQVCED--APDNAVQLLGELMNQSHRS 370
Query: 271 CSVLYEC 277
C +YEC
Sbjct: 371 CRDMYEC 377
>sp|P04385|GAL1_YEAST Galactokinase OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=GAL1 PE=1 SV=4
Length = 528
Score = 109 bits (273), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 100/291 (34%), Positives = 147/291 (50%), Gaps = 51/291 (17%)
Query: 26 GSGLSSSTAFVCSSTVALMAAF---GVEVPKKEIAQLTCECEQFIGTQSGGMDQAISIMA 82
GSGLSSS AF+C+ +A++ A G + K+ + ++T E ++G +GGMDQA S+
Sbjct: 165 GSGLSSSAAFICAVALAVVKANMGPGYHMSKQNLMRITVVAEHYVGVNNGGMDQAASVCG 224
Query: 83 KSGFAELIDFNP-IRTTDVQLPAGG----TFVVAHSLAESLKAITAASNYNNRVVECRLT 137
+ A ++F P ++ T + P +FV+A++L S K TA +NYN RVVE
Sbjct: 225 EEDHALYVEFKPQLKATPFKFPQLKNHEISFVIANTLVVSNKFETAPTNYNLRVVEVTTA 284
Query: 138 AIVLAIKLG---MKPQEAISKVK-TLSDVEGLCVAFACKNGSSDP--------------- 178
A VLA G + +E S K L D + V +A + S P
Sbjct: 285 ANVLAATYGVVLLSGKEGSSTNKGNLRDF--MNVYYARYHNISTPWNGDIESGIERLTKM 342
Query: 179 VFAVKEFL--RKEPYTALDI--------EKITEEKLTSIFANSSSSLDVLNAAKQYKLHQ 228
+ V+E L +K+ ++ D+ E+ T + LT+ S VL KL+Q
Sbjct: 343 LVLVEESLANKKQGFSVDDVAQSLNCSREEFTRDYLTT----SPVRFQVL------KLYQ 392
Query: 229 RAAHVYSEAKRV-HAFK-DTVSSNLSEEDKLKKLGDLMNDSHHSCSVLYEC 277
RA HVYSE+ RV A K T +S ++ED K+ G LMN+S SC LYEC
Sbjct: 393 RAKHVYSESLRVLKAVKLMTTASFTADEDFFKQFGALMNESQASCDKLYEC 443
>sp|O42821|GAL1_CANPA Galactokinase OS=Candida parapsilosis GN=GAL1 PE=3 SV=1
Length = 504
Score = 107 bits (266), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 96/290 (33%), Positives = 143/290 (49%), Gaps = 30/290 (10%)
Query: 9 ITKFQLFNHINSLFFNLGSGLSSSTAFVCSSTVALMAAFGVE-VPKKEIAQLTCECEQFI 67
+TK + N S G GLSSS AF +ST+A++ A GVE + K ++ ++T E ++
Sbjct: 133 MTKLKGMNITFSGTVPTGGGLSSSAAFCVASTLAVLYANGVEDISKADLTRITVVSEHYL 192
Query: 68 GTQSGGMDQAISIMAKSGFAELIDFNP-IRTTDVQLPAGG-TFVVAHSLAESLKAITAAS 125
G +GGMDQ S+ + G A I F P ++ T + P TFV+ +SL S K TA
Sbjct: 193 GLNNGGMDQCASVYGEQGKALFIQFKPQLKGTPFEFPVKNLTFVITNSLQVSNKYETAPI 252
Query: 126 NYNNRVVECRLTAIVLAIKLGMKPQEAI---SKVKTLS---DVEGLCVAFACKNGSSDPV 179
+YN RVVE + +LA KL ++ +E I S V T S ++G C A+ ++ V
Sbjct: 253 HYNLRVVEMAIAGDLLAKKLNVEGKEGIVKDSNVDTYSLRGVMDGYCGAWDGEDLDVGVV 312
Query: 180 F------AVKEFLRKEPYTALDIEK------ITEEKLTSIFANS-SSSLDVLNAAKQYKL 226
V + L KE +E+ +T E+ S + DVL KL
Sbjct: 313 HLEKMIDVVGKTLTKE--GGYTVEQCCEEMGLTPEEFHSRYLKKIPVKFDVL------KL 364
Query: 227 HQRAAHVYSEAKRVHAFKDTVSSNLSEEDKLKKLGDLMNDSHHSCSVLYE 276
++RA HVY E+ RV +S+ + L+ G LMN+S H +L E
Sbjct: 365 YERALHVYRESLRVLKTLQLLSTVVDASQFLQTFGSLMNESQHDLDILNE 414
>sp|P09608|GAL1_KLULA Galactokinase OS=Kluyveromyces lactis (strain ATCC 8585 / CBS 2359 / DSM 70799 / NBRC 1267 / NRRL Y-1140 / WM37) GN=GAL1 PE=2 SV=2
Length = 503
Score = 101 bits (251), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 91/283 (32%), Positives = 128/283 (45%), Gaps = 44/283 (15%)
Query: 26 GSGLSSSTAFVCSSTVALM---AAFGVEVPKKEIAQLTCECEQFIGTQSGGMDQAISIMA 82
G GLSSS AF+C+ ++A++ G + K E+ + T E +G +GGMDQA SI
Sbjct: 153 GGGLSSSAAFICAVSLAIIYSNVPAGTPILKDELTKTTAVAEHHVGVNNGGMDQAASICG 212
Query: 83 KSGFAELIDFNP-IRTTDVQLPAGG--TFVVAHSLAESLKAITAASNYNNRVVECRLTAI 139
G A ++F P ++ T + P +F++A++L S KA T NYN RVVE + A
Sbjct: 213 IEGHALYVEFKPELKATPFKFPEDLPISFLIANTLVVSNKAETGPVNYNLRVVEVTVAAN 272
Query: 140 VLAIKLGMKPQE----AISKVKTLSDVEGLCVAFACKNGSSDPVFA-----------VKE 184
VLA K G+ Q ++ D +C+ + V+E
Sbjct: 273 VLAQKFGVTLQTEGNLGKGTLRNFMDSYYTKYDKSCRKPWDGEIQTGIERLNKMLQLVEE 332
Query: 185 FLRKEPYTALD----------IEKITEEKLTSIFANSSSSLDVLNAAKQYKLHQRAAHVY 234
L YT LD I + TE LT+ ++ KL QRA HVY
Sbjct: 333 TLDPNGYT-LDHAVELCGCESISQFTELYLTNFPVR----------FQRLKLFQRAKHVY 381
Query: 235 SEAKRVHAFKDTVSSNLSEEDKLKKLGDLMNDSHHSCSVLYEC 277
SEA RV K E + ++ G LMN+S SC LYEC
Sbjct: 382 SEALRV--LKALQLFQKGESNFFEEFGALMNESQESCDKLYEC 422
>sp|P56091|GAL1_CANAX Galactokinase OS=Candida albicans GN=GAL1 PE=3 SV=1
Length = 515
Score = 100 bits (249), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 92/280 (32%), Positives = 135/280 (48%), Gaps = 36/280 (12%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGV-EVPKKEIAQLTCECEQFIGTQSGGMDQAISIMAKS 84
G GLSSS AF +ST+A++ A GV ++ K ++ ++T CE ++G +GGMDQ S+ +
Sbjct: 155 GGGLSSSAAFCVASTLAILHANGVKDITKADLTRITVVCEHYVGVNTGGMDQCASVYGEP 214
Query: 85 GFAELIDFNP-IRTTDVQLPAGG-TFVVAHSLAESLKAITAASNYNNRVVECRLTAIVLA 142
A LI F P + + P TFV+ +SL S K TA +YN RVVE + A VL
Sbjct: 215 DKALLIQFKPKLIGKPFKFPVENLTFVITNSLQVSNKHETAPIHYNLRVVEMAIAADVLV 274
Query: 143 IKLGMK---PQEAISKVKTLSDVEGLCVAFACKNGSSDPVFAVKEFLR------------ 187
KL + PQ++ +L V V CK +D + + +
Sbjct: 275 KKLNLGTLVPQDSNIGTSSLRGVMD-AVFNTCKWDGNDIDVGIDQLKKMIAIVETELNNN 333
Query: 188 KEPYTA---LDIEKITEEKLTSIFANSSS-SLDVLNAAKQYKLHQRAAHVYSEAKRV-HA 242
+E YT L + ++ ++ S + + DVL KL+QRA HVY E+ RV
Sbjct: 334 QEGYTVDQCLTVLDLSLDEFKSKYLQAYPVKFDVL------KLYQRAKHVYQESLRVLET 387
Query: 243 FK----DTVSSNLSEEDK--LKKLGDLMNDSHHSCSVLYE 276
K SSN ++D+ L K G+LMN S L E
Sbjct: 388 LKLLSTTQTSSNSKDDDESFLVKFGELMNQSQSDLDKLNE 427
>sp|P13045|GAL3_YEAST Protein GAL3 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=GAL3 PE=1 SV=2
Length = 520
Score = 99.0 bits (245), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 89/289 (30%), Positives = 137/289 (47%), Gaps = 49/289 (16%)
Query: 26 GSGLSSSTAFVCSSTVALMAAF---GVEVPKKEIAQLTCECEQFIGTQSGGMDQAISIMA 82
G GLSS AF C++ +A + A ++ KK++ ++T E ++G +GGMDQA S+
Sbjct: 159 GGGLSS--AFTCAAALATIRANMGKNFDISKKDLTRITAVAEHYVGVNNGGMDQATSVYG 216
Query: 83 KSGFAELIDFNP-IRTTDVQLPAGG----TFVVAHSLAESLKAITAASNYNNRVVECRLT 137
+ A ++F P ++ T + P +FV+A++L +S K TA +NYN RV+E +
Sbjct: 217 EEDHALYVEFRPKLKATPFKFPQLKNHEISFVIANTLVKSNKFETAPTNYNLRVIEVTVA 276
Query: 138 AIVLAIKLGMK-PQEAISKVKTLSDVEGLCVAFACK--------NGS------------- 175
A LA + + P + ++ A+ + NG
Sbjct: 277 ANALATRYSVALPSHKDNSNSERGNLRDFMDAYYARYENQAQPWNGDIGTGIERLLKMLQ 336
Query: 176 -SDPVFAVKE--FLRKEPYTALDI--EKITEEKLTSIFANSSSSLDVLNAAKQYKLHQRA 230
+ F+ K+ F E TAL+ E+ T + LT+ VL KL+QRA
Sbjct: 337 LVEESFSRKKSGFTVHEASTALNCSREEFTRDYLTTF----PVRFQVL------KLYQRA 386
Query: 231 AHVYSEAKRV-HAFKDTVSSNL-SEEDKLKKLGDLMNDSHHSCSVLYEC 277
HVYSE+ RV A K S+ ++ED G LMN+S SC LYEC
Sbjct: 387 KHVYSESLRVLKALKMMTSATFHTDEDFFTDFGRLMNESQASCDKLYEC 435
>sp|Q9HDU2|GAL1_SCHPO Galactokinase OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=gal1 PE=3 SV=1
Length = 519
Score = 86.7 bits (213), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 79/280 (28%), Positives = 132/280 (47%), Gaps = 40/280 (14%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVE-VPKKEIAQLTCECEQFIGTQSGGMDQAISIMAKS 84
G GLSSS AF +S +A++ A G+ + K+++ +++ E ++G +GGMDQ SI +
Sbjct: 159 GGGLSSSAAFCVASILAILKANGINTITKEDLVKISVVSEHYVGVNTGGMDQCASIYGEQ 218
Query: 85 GFAELIDFNP-IRTTDVQLPA----GGTFVVAHSLAESLKAITAASNYNNRVVECRLTAI 139
A L+ F P + T ++P F+++++L E+ K TA +NYN RVVE + +
Sbjct: 219 NKALLVQFKPKLMATPFKMPVLKPHDMVFLISNTLVEANKQETALTNYNLRVVEMAVASE 278
Query: 140 VLAIKLGMK-PQEAISKVKTLSDVEGLCVAFACKN------GSSD---PVFAVKEFLR-- 187
LA K ++ P+E+ TL G + K+ SD V ++E LR
Sbjct: 279 FLAKKFNLELPKESNLHTGTL---RGFMDEYYEKHLKQPHWDGSDIDMGVQRMQEMLRLT 335
Query: 188 -----KEPYTALDIEKITE------EKLTSIFANSSSSLDVLNAAKQYKLHQRAAHVYSE 236
+E E++ + E+ T +F + ++ K++QR HVYS+
Sbjct: 336 EIMFSEEQKVGFKTEELAKELGLSVEEFTKVFLTK-----IPVKYERMKIYQRTVHVYSD 390
Query: 237 AKRVHAFKDTVSSNLSEEDKLK---KLGDLMNDSHHSCSV 273
A RV + +D K G L+NDS S +
Sbjct: 391 AMRVLQVLKLFHQHKDSDDPQKFMLAFGRLLNDSQRSEDI 430
>sp|P56838|GAL1_THEMA Galactokinase OS=Thermotoga maritima (strain ATCC 43589 / MSB8 / DSM 3109 / JCM 10099) GN=galK PE=3 SV=1
Length = 350
Score = 54.7 bits (130), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 58/112 (51%), Gaps = 5/112 (4%)
Query: 25 LGSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAK 83
+G+GLSSS A ++ A+ FG VPK E+ ++ E E +F+G + G MDQ ++ K
Sbjct: 95 IGAGLSSSAALEVATAYAISEYFGFNVPKLELVKIAREAEVEFVGVRCGIMDQFTAVFGK 154
Query: 84 SGFAELIDFNPIRTTDVQLPAGGTFVVAHSLAES-LKAITAASNYNNRVVEC 134
A +D + V L G + +L +S +K ++S YN R EC
Sbjct: 155 KDHAIFLDTMTLEYEYVPLKLEGYEI---NLVDSNVKHELSSSEYNRRRQEC 203
>sp|B9K9C8|GAL1_THENN Galactokinase OS=Thermotoga neapolitana (strain ATCC 49049 / DSM 4359 / NS-E) GN=galK PE=3 SV=1
Length = 350
Score = 53.9 bits (128), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 45/142 (31%), Positives = 69/142 (48%), Gaps = 11/142 (7%)
Query: 25 LGSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAK 83
+G+GLSSS A ++ A+ FG +PK E+ ++ E E +F+G + G MDQ S K
Sbjct: 95 MGAGLSSSAALEMATAYAISEHFGFHLPKLELVKIAREAEVEFVGVRCGIMDQFTSAFGK 154
Query: 84 SGFAELIDFNPIRTTDVQLPAGGTFVVAHSLAES-LKAITAASNYNNRVVECRLTAIVLA 142
A +D + V L G + +L +S +K ++S YN R EC V
Sbjct: 155 KDHAIFLDTMTLEYEYVPLKLEGYEI---NLVDSNVKHELSSSEYNKRRQECEEVLRV-- 209
Query: 143 IKLGMKPQEAISK--VKTLSDV 162
LG + ++K +K L DV
Sbjct: 210 --LGKRSFREVTKEDLKKLPDV 229
>sp|O85253|GAL1_THENE Galactokinase OS=Thermotoga neapolitana GN=galK PE=3 SV=1
Length = 350
Score = 53.9 bits (128), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 45/142 (31%), Positives = 69/142 (48%), Gaps = 11/142 (7%)
Query: 25 LGSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAK 83
+G+GLSSS A ++ A+ FG +PK E+ ++ E E +F+G + G MDQ S K
Sbjct: 95 MGAGLSSSAALEMATAYAISEHFGFHLPKLELVKIAREAEVEFVGVRCGIMDQFTSAFGK 154
Query: 84 SGFAELIDFNPIRTTDVQLPAGGTFVVAHSLAES-LKAITAASNYNNRVVECRLTAIVLA 142
A +D + V L G + +L +S +K ++S YN R EC V
Sbjct: 155 KDHAIFLDTMTLEYEYVPLKLEGYEI---NLVDSNVKHELSSSEYNKRRQECEEVLRV-- 209
Query: 143 IKLGMKPQEAISK--VKTLSDV 162
LG + ++K +K L DV
Sbjct: 210 --LGKRSFREVTKEDLKKLPDV 229
>sp|B9LFE4|GAL1_CHLSY Galactokinase OS=Chloroflexus aurantiacus (strain ATCC 29364 / DSM 637 / Y-400-fl) GN=galK PE=3 SV=1
Length = 390
Score = 52.4 bits (124), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 66/139 (47%), Gaps = 12/139 (8%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECEQ-FIGTQSGGMDQAISIMAKS 84
GSGLSSS A + + + +E+A + E F+G + G MDQ I+ + ++
Sbjct: 122 GSGLSSSAALEVAVGYTFQLLNNINLLGEELALVAQGAEHTFVGVKCGIMDQLIAALGEA 181
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTAIVLAIK 144
G A LID + V +PA VV S ++ A S YN R C A++
Sbjct: 182 GHALLIDCRDLSYRPVPIPAEARVVVCDS---GVRHRLAGSEYNQRRAGCE-----EAVR 233
Query: 145 LGMKPQEAISKVKTLSDVE 163
L +KP A+ K++ L DV
Sbjct: 234 L-LKP--ALGKIQALRDVR 249
>sp|A9WB97|GAL1_CHLAA Galactokinase OS=Chloroflexus aurantiacus (strain ATCC 29366 / DSM 635 / J-10-fl) GN=galK PE=3 SV=1
Length = 390
Score = 52.4 bits (124), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 66/139 (47%), Gaps = 12/139 (8%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECEQ-FIGTQSGGMDQAISIMAKS 84
GSGLSSS A + + + +E+A + E F+G + G MDQ I+ + ++
Sbjct: 122 GSGLSSSAALEVAVGYTFQLLNNINLLGEELALVAQGAEHTFVGVKCGIMDQLIAALGEA 181
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTAIVLAIK 144
G A LID + V +PA VV S ++ A S YN R C A++
Sbjct: 182 GHALLIDCRDLSYRPVPIPAEARVVVCDS---GVRHRLAGSEYNQRRAGCE-----EAVR 233
Query: 145 LGMKPQEAISKVKTLSDVE 163
L +KP A+ K++ L DV
Sbjct: 234 L-LKP--ALGKIQALRDVR 249
>sp|B7LK02|GAL1_ESCF3 Galactokinase OS=Escherichia fergusonii (strain ATCC 35469 / DSM 13698 / CDC 0568-73) GN=galK PE=3 SV=1
Length = 382
Score = 52.0 bits (123), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + L + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V +P G V+ +S + K S YN R +C A
Sbjct: 184 NHALLIDCRSLGTKAVSMPKGVAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|Q32IG8|GAL1_SHIDS Galactokinase OS=Shigella dysenteriae serotype 1 (strain Sd197) GN=galK PE=3 SV=1
Length = 382
Score = 51.6 bits (122), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + L + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V +P G V+ +S + K S YN R +C A
Sbjct: 184 DHALLIDCRSLGTKAVSMPKGVAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|B7N9Z9|GAL1_ECOLU Galactokinase OS=Escherichia coli O17:K52:H18 (strain UMN026 / ExPEC) GN=galK PE=3 SV=1
Length = 382
Score = 51.6 bits (122), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + L + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V +P G V+ +S + K S YN R +C A
Sbjct: 184 DHALLIDCRSLGTKAVSMPKGVAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|Q0T6Y7|GAL1_SHIF8 Galactokinase OS=Shigella flexneri serotype 5b (strain 8401) GN=galK PE=3 SV=1
Length = 382
Score = 51.6 bits (122), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + L + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V +P G V+ +S + K S YN R +C A
Sbjct: 184 DHALLIDCRSLGTKAVSMPKGVAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|Q3Z453|GAL1_SHISS Galactokinase OS=Shigella sonnei (strain Ss046) GN=galK PE=3 SV=1
Length = 382
Score = 51.6 bits (122), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + L + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V +P G V+ +S + K S YN R +C A
Sbjct: 184 DHALLIDCRSLGTKAVSMPKGVAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|Q83M01|GAL1_SHIFL Galactokinase OS=Shigella flexneri GN=galK PE=3 SV=3
Length = 382
Score = 51.6 bits (122), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + L + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V +P G V+ +S + K S YN R +C A
Sbjct: 184 DHALLIDCRSLGTKAVSMPKGVAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|B1LM48|GAL1_ECOSM Galactokinase OS=Escherichia coli (strain SMS-3-5 / SECEC) GN=galK PE=3 SV=1
Length = 382
Score = 51.6 bits (122), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + L + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V +P G V+ +S + K S YN R +C A
Sbjct: 184 DHALLIDCRSLGTKAVSMPKGVAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|B6I7R1|GAL1_ECOSE Galactokinase OS=Escherichia coli (strain SE11) GN=galK PE=3 SV=1
Length = 382
Score = 51.6 bits (122), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + L + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V +P G V+ +S + K S YN R +C A
Sbjct: 184 DHALLIDCRSLGTKAVSMPKGVAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|P0A6T3|GAL1_ECOLI Galactokinase OS=Escherichia coli (strain K12) GN=galK PE=1 SV=2
Length = 382
Score = 51.6 bits (122), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + L + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V +P G V+ +S + K S YN R +C A
Sbjct: 184 DHALLIDCRSLGTKAVSMPKGVAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|B1IXX9|GAL1_ECOLC Galactokinase OS=Escherichia coli (strain ATCC 8739 / DSM 1576 / Crooks) GN=galK PE=3 SV=1
Length = 382
Score = 51.6 bits (122), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + L + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V +P G V+ +S + K S YN R +C A
Sbjct: 184 DHALLIDCRSLGTKAVSMPKGVAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|C4ZXS8|GAL1_ECOBW Galactokinase OS=Escherichia coli (strain K12 / MC4100 / BW2952) GN=galK PE=3 SV=1
Length = 382
Score = 51.6 bits (122), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + L + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V +P G V+ +S + K S YN R +C A
Sbjct: 184 DHALLIDCRSLGTKAVSMPKGVAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|B7M6C0|GAL1_ECO8A Galactokinase OS=Escherichia coli O8 (strain IAI1) GN=galK PE=3 SV=1
Length = 382
Score = 51.6 bits (122), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + L + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V +P G V+ +S + K S YN R +C A
Sbjct: 184 DHALLIDCRSLGTKAVSMPKGVAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|B5YRF5|GAL1_ECO5E Galactokinase OS=Escherichia coli O157:H7 (strain EC4115 / EHEC) GN=galK PE=3 SV=1
Length = 382
Score = 51.6 bits (122), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + L + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V +P G V+ +S + K S YN R +C A
Sbjct: 184 DHALLIDCRSLGTKAVSMPKGVAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|P0A6T4|GAL1_ECO57 Galactokinase OS=Escherichia coli O157:H7 GN=galK PE=3 SV=2
Length = 382
Score = 51.6 bits (122), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + L + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V +P G V+ +S + K S YN R +C A
Sbjct: 184 DHALLIDCRSLGTKAVSMPKGVAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|B7LAF8|GAL1_ECO55 Galactokinase OS=Escherichia coli (strain 55989 / EAEC) GN=galK PE=3 SV=1
Length = 382
Score = 51.6 bits (122), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + L + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V +P G V+ +S + K S YN R +C A
Sbjct: 184 DHALLIDCRSLGTKAVSMPKGVAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|A7ZJD2|GAL1_ECO24 Galactokinase OS=Escherichia coli O139:H28 (strain E24377A / ETEC) GN=galK PE=3 SV=1
Length = 382
Score = 51.6 bits (122), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + L + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V +P G V+ +S + K S YN R +C A
Sbjct: 184 DHALLIDCRSLGTKAVSMPKGVAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|B7MPP1|GAL1_ECO81 Galactokinase OS=Escherichia coli O81 (strain ED1a) GN=galK PE=3 SV=1
Length = 382
Score = 51.6 bits (122), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + L + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V +P G V+ +S + K S YN R +C A
Sbjct: 184 DHALLIDCRSLGTKAVSMPKGVAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|B7NNH9|GAL1_ECO7I Galactokinase OS=Escherichia coli O7:K1 (strain IAI39 / ExPEC) GN=galK PE=3 SV=1
Length = 382
Score = 51.6 bits (122), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + L + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V +P G V+ +S + K S YN R +C A
Sbjct: 184 DHALLIDCRSLGTKAVSMPKGVAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|Q1REH4|GAL1_ECOUT Galactokinase OS=Escherichia coli (strain UTI89 / UPEC) GN=galK PE=3 SV=1
Length = 382
Score = 51.6 bits (122), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + L + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V +P G V+ +S + K S YN R +C A
Sbjct: 184 DHALLIDCRSLGTKAVSMPKGVAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|Q8FJS1|GAL1_ECOL6 Galactokinase OS=Escherichia coli O6:H1 (strain CFT073 / ATCC 700928 / UPEC) GN=galK PE=3 SV=3
Length = 382
Score = 51.6 bits (122), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + L + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V +P G V+ +S + K S YN R +C A
Sbjct: 184 DHALLIDCRSLGTKAVSMPKGVAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|Q0TJU4|GAL1_ECOL5 Galactokinase OS=Escherichia coli O6:K15:H31 (strain 536 / UPEC) GN=galK PE=3 SV=1
Length = 382
Score = 51.6 bits (122), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + L + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V +P G V+ +S + K S YN R +C A
Sbjct: 184 DHALLIDCRSLGTKAVSMPKGVAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|A1A900|GAL1_ECOK1 Galactokinase OS=Escherichia coli O1:K1 / APEC GN=galK PE=3 SV=1
Length = 382
Score = 51.6 bits (122), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + L + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V +P G V+ +S + K S YN R +C A
Sbjct: 184 DHALLIDCRSLGTKAVSMPKGVAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|A7ZY13|GAL1_ECOHS Galactokinase OS=Escherichia coli O9:H4 (strain HS) GN=galK PE=3
SV=1
Length = 382
Score = 51.6 bits (122), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + L + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V +P G V+ +S + K S YN R +C A
Sbjct: 184 DHALLIDCRSLGTKAVSMPKGVAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|B7MGL4|GAL1_ECO45 Galactokinase OS=Escherichia coli O45:K1 (strain S88 / ExPEC)
GN=galK PE=3 SV=1
Length = 382
Score = 51.6 bits (122), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + L + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V +P G V+ +S + K S YN R +C A
Sbjct: 184 DHALLIDCRSLGTKAVSMPKGVAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|Q324G2|GAL1_SHIBS Galactokinase OS=Shigella boydii serotype 4 (strain Sb227) GN=galK
PE=3 SV=1
Length = 382
Score = 51.6 bits (122), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + L + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V +P G V+ +S + K S YN R +C A
Sbjct: 184 DHALLIDCRSLGTKAVSMPKGVAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|B7ULN0|GAL1_ECO27 Galactokinase OS=Escherichia coli O127:H6 (strain E2348/69 / EPEC)
GN=galK PE=3 SV=1
Length = 382
Score = 51.6 bits (122), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 53/114 (46%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + L + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V +P G V+ +S + K S YN R +C A
Sbjct: 184 DHALLIDCRSLGTKAVSMPKGVAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|A7NI09|GAL1_ROSCS Galactokinase OS=Roseiflexus castenholzii (strain DSM 13941 / HLO8)
GN=galK PE=3 SV=1
Length = 391
Score = 51.2 bits (121), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 64/137 (46%), Gaps = 9/137 (6%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECEQ-FIGTQSGGMDQAISIMAKS 84
G+GLSSS A + A + + +E+A L E F+G Q G MDQ I+++ ++
Sbjct: 121 GAGLSSSAALEVAVGYAFQVLNNLNILGEELALLAQGAENNFVGVQCGIMDQLIAVLGRA 180
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTAIVLAIK 144
A LID + V LP V+ S + AAS YN R EC +A++
Sbjct: 181 DHALLIDCRDLSYRAVPLPPSVAVVICDS---HIPRTLAASAYNQRRQECD-----MAVQ 232
Query: 145 LGMKPQEAISKVKTLSD 161
L + I ++ +S+
Sbjct: 233 LLRRWYPGIRALRDVSE 249
>sp|A4W899|GAL1_ENT38 Galactokinase OS=Enterobacter sp. (strain 638) GN=galK PE=3 SV=1
Length = 382
Score = 51.2 bits (121), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 52/114 (45%), Gaps = 4/114 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + + + + +IA E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGTVFQQLYHLPLDGAQIALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTA 138
A LID + T V LP G V+ +S + K S YN R +C A
Sbjct: 184 EHALLIDCRSLGTKAVPLPKGAAVVIINS---NFKRTLVGSEYNTRREQCETGA 234
>sp|P56599|GAL1_CANMA Galactokinase (Fragment) OS=Candida maltosa GN=GAL1 PE=3 SV=1
Length = 214
Score = 50.8 bits (120), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 39/56 (69%), Gaps = 1/56 (1%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGV-EVPKKEIAQLTCECEQFIGTQSGGMDQAISI 80
G GLSSS AF +ST+A++ A G+ ++ K+++ ++T E ++G +GGMDQ S+
Sbjct: 149 GGGLSSSAAFCVASTLAIIRANGITDLTKQDLTRITVVSEHYVGVNTGGMDQCASV 204
>sp|B1LCQ5|GAL1_THESQ Galactokinase OS=Thermotoga sp. (strain RQ2) GN=galK PE=3 SV=1
Length = 350
Score = 50.8 bits (120), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 56/112 (50%), Gaps = 5/112 (4%)
Query: 25 LGSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAK 83
+G+GLSSS A ++ A+ F VPK E+ ++ E E +F+G + G MDQ S K
Sbjct: 95 MGAGLSSSAALEVATAYAISEYFSFNVPKLELVKIAREAEVEFVGVRCGIMDQFTSAFGK 154
Query: 84 SGFAELIDFNPIRTTDVQLPAGGTFVVAHSLAES-LKAITAASNYNNRVVEC 134
A +D + V L G + +L +S +K ++S YN R EC
Sbjct: 155 KDHAIFLDTMTLEYEYVPLRLEGYEI---NLVDSNVKHELSSSEYNKRRQEC 203
>sp|A5IMZ6|GAL1_THEP1 Galactokinase OS=Thermotoga petrophila (strain RKU-1 / ATCC BAA-488
/ DSM 13995) GN=galK PE=3 SV=1
Length = 350
Score = 50.8 bits (120), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 56/112 (50%), Gaps = 5/112 (4%)
Query: 25 LGSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAK 83
+G+GLSSS A ++ A+ F VPK E+ ++ E E +F+G + G MDQ S K
Sbjct: 95 MGAGLSSSAALEVATAYAISEYFSFNVPKLELVKIAREAEVEFVGVRCGIMDQFTSAFGK 154
Query: 84 SGFAELIDFNPIRTTDVQLPAGGTFVVAHSLAES-LKAITAASNYNNRVVEC 134
A +D + V L G + +L +S +K ++S YN R EC
Sbjct: 155 KDHAIFLDTMTLEYEYVPLRLEGYEI---NLVDSNVKHELSSSEYNKRRQEC 203
>sp|B1JST8|GAL1_YERPY Galactokinase OS=Yersinia pseudotuberculosis serotype O:3 (strain
YPIII) GN=galK PE=3 SV=1
Length = 383
Score = 50.4 bits (119), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 57/120 (47%), Gaps = 4/120 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + AL + + + + E+A E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGQALQSLYQLPLSGVELALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTAIVLAIK 144
A LID + T V +P V+ +S +++ S YN R +C A +K
Sbjct: 184 DHALLIDCRTLETRAVPMPENMAVVIINS---NIQRGLVDSEYNTRRQQCEAAARFFGVK 240
>sp|A4TNR8|GAL1_YERPP Galactokinase OS=Yersinia pestis (strain Pestoides F) GN=galK PE=3
SV=1
Length = 383
Score = 50.4 bits (119), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 57/120 (47%), Gaps = 4/120 (3%)
Query: 26 GSGLSSSTAFVCSSTVALMAAFGVEVPKKEIAQLTCECE-QFIGTQSGGMDQAISIMAKS 84
G+GLSSS + + AL + + + + E+A E E QF+G G MDQ IS + K
Sbjct: 124 GAGLSSSASLEVAVGQALQSLYQLPLSGVELALNGQEAENQFVGCNCGIMDQLISALGKK 183
Query: 85 GFAELIDFNPIRTTDVQLPAGGTFVVAHSLAESLKAITAASNYNNRVVECRLTAIVLAIK 144
A LID + T V +P V+ +S +++ S YN R +C A +K
Sbjct: 184 DHALLIDCRTLETRAVPMPENMAVVIINS---NIQRGLVDSEYNTRRQQCEAAARFFGVK 240
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.323 0.134 0.389
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 105,479,778
Number of Sequences: 539616
Number of extensions: 3922700
Number of successful extensions: 12436
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 104
Number of HSP's successfully gapped in prelim test: 70
Number of HSP's that attempted gapping in prelim test: 12187
Number of HSP's gapped (non-prelim): 250
length of query: 321
length of database: 191,569,459
effective HSP length: 117
effective length of query: 204
effective length of database: 128,434,387
effective search space: 26200614948
effective search space used: 26200614948
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 61 (28.1 bits)
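
Note on the statistics above: the effective lengths, the effective search space, and the bit scores in this report are mutually consistent under the standard Karlin-Altschul relations, S' = (lambda * S - ln K) / ln 2 and E = m * n * 2^(-S'). The sketch below re-derives them from the printed values. It assumes that the second Lambda/K pair (0.267, 0.0410) holds the gapped parameters and that the effective lengths are obtained by subtracting the effective HSP length once from the query and once per database sequence. The function names are illustrative and are not part of BLAST. Because the hits were scored with compositional matrix adjustment, the recomputed E-value is only expected to approximate the reported one.

```python
import math

# Values copied from the report footer.
QUERY_LENGTH = 321
DB_LETTERS = 191_569_459
DB_SEQUENCES = 539_616
EFFECTIVE_HSP_LENGTH = 117
GAPPED_LAMBDA = 0.267   # assumed to be the gapped Lambda (second pair)
GAPPED_K = 0.0410       # assumed to be the gapped K (second pair)


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective database length, search space)."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Approximate E-value for a bit score in a given search space."""
    return search_space * 2.0 ** (-bits)


eff_query, eff_db, space = effective_search_space(
    QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
)
bits_122 = bit_score(122, GAPPED_LAMBDA, GAPPED_K)
e_122 = expect_value(bits_122, space)

print("effective length of query:   ", eff_query)
print("effective length of database:", eff_db)
print("effective search space:      ", space)
print("raw score 122 in bits:       ", round(bits_122, 1))
print("approximate E-value:          %.1e" % e_122)
```

The same conversion applied to raw scores 121, 120 and 119 should reproduce the other bit scores listed for the hits above to one decimal place.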