BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 019144
(345 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q54PM7|NAGK_DICDI N-acetyl-D-glucosamine kinase OS=Dictyostelium discoideum GN=nagk
PE=3 SV=2
Length = 319
Score = 211 bits (538), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 120/330 (36%), Positives = 185/330 (56%), Gaps = 25/330 (7%)
Query: 21 REVILGLDGGTTSTVCICMPVISMSDSLPDPLPVLARAAAGCSNHNSVGEDAARETIE-- 78
+E+ +G+DGG T T + + LAR + CSN++SVGED A+ I
Sbjct: 3 KEIFIGIDGGGTKTSTVAVDSNGQE---------LARHTSPCSNYHSVGEDLAKAAINEG 53
Query: 79 -----KVMADALLKSGSNRSAVRAVCLAVSGVNHPTDQQRILNWLRDIFPGNVRLYVHND 133
+ + + + + V ++CL +SGV+ D+ + +W+ ++ ++ +HND
Sbjct: 54 IKYVIRKVKETITDDDNKEVTVGSICLGMSGVDREKDKLLVKSWVTELLGESINYSIHND 113
Query: 134 ALAALASGTMGKLHGCVLIAGTGTIAYGFTEDGRDARAAGAGPILGDWGSGYGIAAQALT 193
A+ AL+SGT GKL G V+I GTG I+ GF +G R+ G GP+LGD+GSGY I L
Sbjct: 114 AIVALSSGTQGKLFGVVIICGTGCISLGFNREGVSGRSGGWGPLLGDYGSGYQIGYDILR 173
Query: 194 AVIRAYDGRGPDTMLTSNILSTLELSSPDELIGWTYVDP---SWARIAALVPVVVSCAEA 250
V++A D GP T LT +L L+L+ ++LI W Y DP SW + A L P+ A+
Sbjct: 174 HVLKAKDQVGPKTSLTQVLLEKLQLTKEEDLISWAY-DPKTQSWQKFAQLSPLAFEQAQL 232
Query: 251 GDEVANKILQDSVEELALSVKAVVQRLSLSGEDGKHSFPLVMVGGVLEANRRWDIGREVV 310
GDE++N IL D+ L + +V+++L L D + FPLV GG +E R+ + +
Sbjct: 233 GDEISNLILVDAANALYDLINSVIKKLGL---DKEEKFPLVYTGGNIE--RKGILSDLLS 287
Query: 311 KCILRDYPGAVPIRPKVEPAVGAALLAWNS 340
K I+ +YP A + +P++GAALLA NS
Sbjct: 288 KKIMENYPNAEILNTTCDPSMGAALLALNS 317
>sp|Q3SZM9|NAGK_BOVIN N-acetyl-D-glucosamine kinase OS=Bos taurus GN=NAGK PE=2 SV=1
Length = 344
Score = 74.7 bits (182), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 79/286 (27%), Positives = 127/286 (44%), Gaps = 21/286 (7%)
Query: 29 GGTTSTVCICMPVISMSDSLPDPLPVLARAAAGCSNHNSVGEDAARETIEKVMADALLKS 88
GGT S V + L + +LA A +NH +G D E I +++ A K+
Sbjct: 11 GGTRSKVLL----------LSEDGQILAEADGLSTNHWLIGTDKCVERINEMVNRAKRKA 60
Query: 89 GSN-RSAVRAVCLAVSGVNHPTDQQRILNWLRDIFPGNVRLY-VHNDALAALASGTMGKL 146
G + +R + L++SG + + ++ LRD FP Y + DA ++A+ T
Sbjct: 61 GVDPLVPLRGLGLSLSGGDQEDAVRMLMEELRDRFPYLSESYLITTDAAGSIATATPDG- 119
Query: 147 HGCVLIAGTGTIAYGFTEDGRDARAAGAGPILGDWGSGYGIAAQALTAVIRAYDG--RGP 204
G VLI+GTG+ DG ++ G G ++GD GS Y IA QA+ V + D P
Sbjct: 120 -GVVLISGTGSNCRLINPDGSESGCGGWGHMMGDEGSAYWIAHQAVKIVFDSIDNLEAAP 178
Query: 205 DTMLTSNILSTLELSSPDEL--IGWTYVDPSWARIAALVPVVVSCAEAGDEVANKILQDS 262
+ PD L + Y D +R A V A+ GD ++ I + +
Sbjct: 179 HDIGYVKQAMFNYFQVPDRLGILTHLYRDFDKSRFAGFCRKVAEGAQQGDPLSRCIFRKA 238
Query: 263 VEELALSVKAVVQRLSLSGEDGKHSFPLVMVGGVLEANRRWDIGRE 308
E L V AV+ + G+ P++ VG V ++ W++ +E
Sbjct: 239 GEMLGRHVVAVLPEIDPVLFQGEMGLPILCVGSVWKS---WELLKE 281
>sp|Q9UJ70|NAGK_HUMAN N-acetyl-D-glucosamine kinase OS=Homo sapiens GN=NAGK PE=1 SV=4
Length = 344
Score = 74.7 bits (182), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 72/261 (27%), Positives = 119/261 (45%), Gaps = 11/261 (4%)
Query: 54 VLARAAAGCSNHNSVGEDAARETIEKVMADALLKSGSN-RSAVRAVCLAVSGVNHPTDQQ 112
+LA A +NH +G D E I +++ A K+G + +R++ L++SG + +
Sbjct: 26 ILAEADGLSTNHWLIGTDKCVERINEMVNRAKRKAGVDPLVPLRSLGLSLSGGDQEDAGR 85
Query: 113 RILNWLRDIFPGNVRLY-VHNDALAALASGTMGKLHGCVLIAGTGTIAYGFTEDGRDARA 171
++ LRD FP Y + DA ++A+ T G VLI+GTG+ DG ++
Sbjct: 86 ILIEELRDRFPYLSESYLITTDAAGSIATATPDG--GVVLISGTGSNCRLINPDGSESGC 143
Query: 172 AGAGPILGDWGSGYGIAAQALTAVIRAYDG--RGPDTMLTSNILSTLELSSPDEL--IGW 227
G G ++GD GS Y IA QA+ V + D P + PD L +
Sbjct: 144 GGWGHMMGDEGSAYWIAHQAVKIVFDSIDNLEAAPHDIGYVKQAMFHYFQVPDRLGILTH 203
Query: 228 TYVDPSWARIAALVPVVVSCAEAGDEVANKILQDSVEELALSVKAVVQRLSLSGEDGKHS 287
Y D R A + A+ GD ++ I + + E L + AV+ + GK
Sbjct: 204 LYRDFDKCRFAGFCRKIAEGAQQGDPLSRYIFRKAGEMLGRHIVAVLPEIDPVLFQGKIG 263
Query: 288 FPLVMVGGVLEANRRWDIGRE 308
P++ VG V ++ W++ +E
Sbjct: 264 LPILCVGSVWKS---WELLKE 281
>sp|P81799|NAGK_RAT N-acetyl-D-glucosamine kinase OS=Rattus norvegicus GN=Nagk PE=1
SV=4
Length = 343
Score = 70.5 bits (171), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 74/286 (25%), Positives = 128/286 (44%), Gaps = 21/286 (7%)
Query: 29 GGTTSTVCICMPVISMSDSLPDPLPVLARAAAGCSNHNSVGEDAARETIEKVMADALLKS 88
GGT S V + L + +LA A +NH +G E I +++ A K+
Sbjct: 11 GGTRSKVLL----------LSEDGQILAEADGLSTNHWLIGTGTCVERINEMVDRAKRKA 60
Query: 89 GSN-RSAVRAVCLAVSGVNHPTDQQRILNWLRDIFPG-NVRLYVHNDALAALASGTMGKL 146
G + +R++ L++SG + ++ LRD FP + ++ DA ++A+ T
Sbjct: 61 GVDPLVPLRSLGLSLSGGEQEDAVRLLMEELRDRFPYLSESYFITTDAAGSIATATPDG- 119
Query: 147 HGCVLIAGTGTIAYGFTEDGRDARAAGAGPILGDWGSGYGIAAQALTAVIRAYDGRGPDT 206
G VLI+GTG+ DG ++ G G ++GD GS Y IA QA+ V + D
Sbjct: 120 -GIVLISGTGSNCRLINPDGSESGCGGWGHMMGDEGSAYWIAHQAVKIVFDSIDNLEAAP 178
Query: 207 MLTSNILSTL--ELSSPDEL--IGWTYVDPSWARIAALVPVVVSCAEAGDEVANKILQDS 262
++ + PD L + Y D ++ A + A+ GD ++ I + +
Sbjct: 179 HDIGHVKQAMFNYFQVPDRLGILTHLYRDFDKSKFAGFCQKIAEGAQQGDPLSRFIFRKA 238
Query: 263 VEELALSVKAVVQRLSLSGEDGKHSFPLVMVGGVLEANRRWDIGRE 308
E L V AV+ + G+ P++ VG V ++ W++ +E
Sbjct: 239 GEMLGRHVVAVLPEIDPVLFQGELGLPILCVGSVWKS---WELLKE 281
>sp|Q9QZ08|NAGK_MOUSE N-acetyl-D-glucosamine kinase OS=Mus musculus GN=Nagk PE=2 SV=3
Length = 343
Score = 70.1 bits (170), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 75/286 (26%), Positives = 125/286 (43%), Gaps = 21/286 (7%)
Query: 29 GGTTSTVCICMPVISMSDSLPDPLPVLARAAAGCSNHNSVGEDAARETIEKVMADALLKS 88
GGT S V + L + +LA A +NH +G D E I +++ A K+
Sbjct: 11 GGTRSKVLL----------LSEDGQILAEADGLSTNHWLIGTDQCVERINEMVDRAKQKA 60
Query: 89 GSN-RSAVRAVCLAVSGVNHPTDQQRILNWLRDIFPGNVRLY-VHNDALAALASGTMGKL 146
G + +R++ L++SG + ++ LR FP Y + DA ++A+ T
Sbjct: 61 GVDPLVPLRSLGLSLSGGEQEDAVRLLIEELRHRFPNLSENYLITTDAAGSIATATPDG- 119
Query: 147 HGCVLIAGTGTIAYGFTEDGRDARAAGAGPILGDWGSGYGIAAQALTAVIRAYDGRGPDT 206
G VLI+GTG+ DG ++ G G ++GD GS Y IA QA+ V + D
Sbjct: 120 -GIVLISGTGSNCRLINPDGSESGCGGWGHMMGDEGSAYWIAHQAVKIVFDSIDNLEAAP 178
Query: 207 MLTSNILSTL--ELSSPDEL--IGWTYVDPSWARIAALVPVVVSCAEAGDEVANKILQDS 262
++ + PD L + Y D + A + A GD ++ I + +
Sbjct: 179 HDIGHVKQAMFDYFQVPDRLGILTHLYRDFDKCKFAGFCQKIAEGAHQGDPLSRYIFRKA 238
Query: 263 VEELALSVKAVVQRLSLSGEDGKHSFPLVMVGGVLEANRRWDIGRE 308
E L V AV+ + G+ P++ VG V ++ W++ +E
Sbjct: 239 GEMLGRHVVAVLPEIDPVLFQGELGLPILCVGSVWKS---WELLKE 281
>sp|Q9KUA9|GSPK_VIBCH Glucosamine kinase GspK OS=Vibrio cholerae serotype O1 (strain ATCC
39315 / El Tor Inaba N16961) GN=gspK PE=1 SV=2
Length = 294
Score = 37.7 bits (86), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 34/151 (22%), Positives = 70/151 (46%), Gaps = 8/151 (5%)
Query: 128 LYVHNDALAALASGTMGKLHGCVLIAGTGTIAYGFTEDGRDARAAGAGPILGDWGSGYGI 187
+ ++ DA A +G+ G ++IAGTG+ + G+ G + D GSG +
Sbjct: 96 ITLNTDAYGACLGAHLGE-EGAIMIAGTGSCGI-LLKGGKQYVVGGREFPISDQGSGAVM 153
Query: 188 AAQALTAVIRAYDGRGPDTMLTSNILSTLELSSPDELIGW--TYVDPSWARIAALVPVVV 245
+ + V+ A DG P T L +++ D ++ W T + + + + P +
Sbjct: 154 GLRLIQQVLLAQDGIRPHTPLCDVVMNHFN-HDIDSIVAWSKTALPRDYGQFS---PQIF 209
Query: 246 SCAEAGDEVANKILQDSVEELALSVKAVVQR 276
S A GD +A ++L+ + ++ + + A+ +
Sbjct: 210 SHAYCGDPLAIELLKQTAADIEMFLIALHHK 240
>sp|Q06164|MMS22_YEAST Methyl methanesulfonate-sensitivity protein 22 OS=Saccharomyces
cerevisiae (strain ATCC 204508 / S288c) GN=MMS22 PE=1
SV=1
Length = 1454
Score = 34.3 bits (77), Expect = 1.5, Method: Composition-based stats.
Identities = 22/75 (29%), Positives = 36/75 (48%), Gaps = 11/75 (14%)
Query: 242 PVVVSCAEAGDEVANKILQ-----------DSVEELALSVKAVVQRLSLSGEDGKHSFPL 290
P+V+S +EA DE + I + ++V+E + S K V +RLSL G+ P
Sbjct: 8 PIVISDSEATDEEISIIYEPEFNENYLWAEENVQEASRSQKIVTERLSLDSTAGESCTPS 67
Query: 291 VMVGGVLEANRRWDI 305
V+ + RW +
Sbjct: 68 VVTDTQVTTGLRWSL 82
>sp|A5GS43|RLMH_SYNR3 Ribosomal RNA large subunit methyltransferase H OS=Synechococcus
sp. (strain RCC307) GN=rlmH PE=3 SV=1
Length = 147
Score = 32.7 bits (73), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 22/71 (30%), Positives = 34/71 (47%), Gaps = 7/71 (9%)
Query: 162 FTEDGRDARAAGAGPILGDWGSGYGIAAQALTAVIRAYDGRGPDTMLTSNILSTL-ELSS 220
+E+GR + G +LG W A++ L VI DG P +++L +L EL+
Sbjct: 66 LSEEGRQLSSVGLAELLGGW------ASERLALVIGGADGHDPTLKQQADVLLSLSELTF 119
Query: 221 PDELIGWTYVD 231
P EL V+
Sbjct: 120 PHELARLMLVE 130
>sp|A0JC77|GRII_STRGR 2-amino-4,5-dihydroxy-6-one-heptanoic acid-7-phosphate synthase
OS=Streptomyces griseus GN=griI PE=1 SV=1
Length = 274
Score = 31.6 bits (70), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 22/78 (28%), Positives = 36/78 (46%), Gaps = 3/78 (3%)
Query: 229 YVDPSWA-RIAALVPVVVSCAEAGDEVANKILQDSVEELALSVKAVVQRLSLSGEDGKHS 287
+VDP W R + +V + S A D A ++ E L + AV ++L E +H
Sbjct: 68 HVDPEWFTRTSLIVHLSASTVHAPDPNAKYLVSSVEESLRMGADAVSVHVNLGSEGERHQ 127
Query: 288 FPLVMVGGVLEANRRWDI 305
+ + V EA RW++
Sbjct: 128 --IADMAAVAEACDRWNV 143
>sp|B1VTI8|GRII_STRGG 2-amino-4,5-dihydroxy-6-one-heptanoic acid-7-phosphate synthase
OS=Streptomyces griseus subsp. griseus (strain JCM 4626
/ NBRC 13350) GN=griI PE=3 SV=1
Length = 274
Score = 31.6 bits (70), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 22/78 (28%), Positives = 36/78 (46%), Gaps = 3/78 (3%)
Query: 229 YVDPSWA-RIAALVPVVVSCAEAGDEVANKILQDSVEELALSVKAVVQRLSLSGEDGKHS 287
+VDP W R + +V + S A D A ++ E L + AV ++L E +H
Sbjct: 68 HVDPEWFTRTSLIVHLSASTVHAPDPNAKYLVSSVEESLRMGADAVSVHVNLGSEGERHQ 127
Query: 288 FPLVMVGGVLEANRRWDI 305
+ + V EA RW++
Sbjct: 128 --IADMAAVAEACDRWNV 143
>sp|P57163|RPOD_BUCAI RNA polymerase sigma factor RpoD OS=Buchnera aphidicola subsp.
Acyrthosiphon pisum (strain APS) GN=rpoD PE=3 SV=1
Length = 612
Score = 31.6 bits (70), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 30/115 (26%), Positives = 55/115 (47%), Gaps = 17/115 (14%)
Query: 221 PDELIGWTYVDPSWARIAAL-VPVVVSCAEAGDEVANKILQDSVEELALSVKAVVQRL-S 278
P+++I +D I + +PVV +A D + N+I D+ E+ +V+A Q L S
Sbjct: 32 PEDIIDSEQIDDIIQMINDMGIPVVEEAPDADDLILNEINTDTDED---AVEAATQVLSS 88
Query: 279 LSGEDGKHSFPLVMV------------GGVLEANRRWDIGREVVKCILRDYPGAV 321
+ E G+ + P+ M G ++ +R + G V+C + +YP A+
Sbjct: 89 VESELGRTTDPVRMYMREMGTVELLTREGEIDIAKRIEEGINQVQCSVSEYPEAI 143
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.318 0.135 0.406
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 132,981,239
Number of Sequences: 539616
Number of extensions: 5630128
Number of successful extensions: 15823
Number of sequences better than 100.0: 17
Number of HSP's better than 100.0 without gapping: 5
Number of HSP's successfully gapped in prelim test: 12
Number of HSP's that attempted gapping in prelim test: 15808
Number of HSP's gapped (non-prelim): 17
length of query: 345
length of database: 191,569,459
effective HSP length: 118
effective length of query: 227
effective length of database: 127,894,771
effective search space: 29032113017
effective search space used: 29032113017
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 62 (28.5 bits)