Your job contains 1 sequence.
>012026
MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT
QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP
VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL
DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV
GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY
NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS
TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF
QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQVRCPYVSQ
The BLAST search returned 1 gene product which did not match your query constraints. Please see the full BLAST report below for the details.
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 012026
(472 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
Smallest
Sum
High Probability
Sequences producing High-scoring Segment Pairs: Score P(N) N
TAIR|locus:2173209 - symbol:CYL1 "AT5G13690" species:3702... 1548 6.8e-159 1
RGD|1564228 - symbol:Naglu "N-acetylglucosaminidase, alph... 825 4.7e-97 2
UNIPROTKB|P54802 - symbol:NAGLU "Alpha-N-acetylglucosamin... 804 9.9e-95 2
UNIPROTKB|H9L296 - symbol:H9L296 "Uncharacterized protein... 779 1.1e-91 2
UNIPROTKB|F1S1D7 - symbol:NAGLU "Uncharacterized protein"... 817 2.0e-81 1
UNIPROTKB|A6QM01 - symbol:NAGLU "NAGLU protein" species:9... 809 1.4e-80 1
FB|FBgn0014417 - symbol:CG13397 species:7227 "Drosophila ... 634 1.4e-78 2
DICTYBASE|DDB_G0291998 - symbol:naglu "alpha-N-acetylgluc... 697 1.0e-68 1
>TAIR|locus:2173209 [details] [associations]
symbol:CYL1 "AT5G13690" species:3702 "Arabidopsis
thaliana" [GO:0004561 "alpha-N-acetylglucosaminidase activity"
evidence=ISS] [GO:0009507 "chloroplast" evidence=ISM] [GO:0005773
"vacuole" evidence=IDA] Pfam:PF05089 EMBL:CP002688 GO:GO:0005773
EMBL:AB006704 CAZy:GH89 HOGENOM:HOG000214539 KO:K01205 OMA:LFPNSTM
InterPro:IPR007781 InterPro:IPR024732 InterPro:IPR024240
InterPro:IPR024733 PANTHER:PTHR12872 Pfam:PF12972 Pfam:PF12971
UniGene:At.49017 UniGene:At.6477 EMBL:AY080811 EMBL:AY117179
IPI:IPI00516873 RefSeq:NP_196873.1 ProteinModelPortal:Q9FNA3
STRING:Q9FNA3 PRIDE:Q9FNA3 ProMEX:Q9FNA3 EnsemblPlants:AT5G13690.1
GeneID:831214 KEGG:ath:AT5G13690 TAIR:At5g13690 InParanoid:Q9FNA3
PhylomeDB:Q9FNA3 ProtClustDB:CLSN2687036 ArrayExpress:Q9FNA3
Genevestigator:Q9FNA3 Uniprot:Q9FNA3
Length = 806
Score = 1548 (550.0 bits), Expect = 6.8e-159, P = 6.8e-159
Identities = 290/472 (61%), Positives = 360/472 (76%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPL ++WLD QL+LQK+IL R+ + GM PVLP+FSGNVP+AL+ ++P A IT
Sbjct: 231 MGNLHAWGGPLSKNWLDDQLLLQKQILSRMLKFGMTPVLPSFSGNVPSALRKIYPEANIT 290
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L NW +V D RWCCTYLL+ +DPLFIEIG AFI+QQ +EYG ++IYNCDTF+ENTPP
Sbjct: 291 RLDNWNTVDGDSRWCCTYLLNPSDPLFIEIGEAFIKQQTEEYGEITNIYNCDTFNENTPP 350
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEYISSLGAA+Y M G+ +AVWLMQGWLFS D FW+PPQ+KALL+SVP GK++V
Sbjct: 351 TSEPEYISSLGAAVYKAMSKGNKNAVWLMQGWLFSSDSKFWKPPQLKALLHSVPFGKMIV 410
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL+AEVKPIW+ S QFYG PYIWCMLHNF GNIEMYG LDSI+ GPV+AR S+N+TMVG
Sbjct: 411 LDLYAEVKPIWNKSAQFYGTPYIWCMLHNFGGNIEMYGALDSISSGPVDARVSKNSTMVG 470
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
VGM MEGIEQNPVVY+L SEMAF+ EKVDV+ W+ Y+ RRY + I+ AW +LYHTV
Sbjct: 471 VGMCMEGIEQNPVVYELTSEMAFRDEKVDVQKWLKSYARRRYMKENHQIEAAWEILYHTV 530
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ--NY----GKPVSKEAVL-KSETSSY 352
YNCTDG D N D IV PD DPS SV + Q +Y G +K VL + +T+
Sbjct: 531 YNCTDGIADHNTDFIVKLPDWDPSS-SVQDDLKQKDSYMISTGPYETKRRVLFQDKTADL 589
Query: 353 DHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
HLWYST EVI+AL+LF+ +G++LS S TYRYD++DLTRQ L+K AN+++ + A+
Sbjct: 590 PKAHLWYSTKEVIQALKLFLEAGDDLSRSLTYRYDMVDLTRQVLSKLANQVYTEAVTAFV 649
Query: 413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
D + QLS +FLEL++DMD LLA D LLG WLESAK+LA+N ++ KQ
Sbjct: 650 KKDIGSLGQLSEKFLELIKDMDVLLASDDNCLLGTWLESAKKLAKNGDERKQ 701
>RGD|1564228 [details] [associations]
symbol:Naglu "N-acetylglucosaminidase, alpha" species:10116
"Rattus norvegicus" [GO:0007040 "lysosome organization"
evidence=ISO] [GO:0021680 "cerebellar Purkinje cell layer
development" evidence=ISO] [GO:0042474 "middle ear morphogenesis"
evidence=ISO] [GO:0045475 "locomotor rhythm" evidence=ISO]
[GO:0046548 "retinal rod cell development" evidence=ISO]
[GO:0060119 "inner ear receptor cell development" evidence=ISO]
REFSEQ:XM_001081442 Ncbi:XP_001081442
Length = 739
Score = 825 (295.5 bits), Expect = 4.7e-97, Sum P(2) = 4.7e-97
Identities = 157/326 (48%), Positives = 214/326 (65%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +
Sbjct: 202 MGNLHTWDGPLPRSWHLKQLYLQHRILDRMRSFGMTPVLPAFAGHVPKAITRVFPQVNVI 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLGNW + C++LL DPLF IG F+ + KE+G T HIY DTF+E PP
Sbjct: 262 QLGNWGHFNCS--YSCSFLLAPGDPLFPLIGTLFLRELTKEFG-TDHIYGADTFNEMQPP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+KA+L +VP G+L+V
Sbjct: 319 FSDPSYLAAATAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIKAVLEAVPRGRLLV 378
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++S + F+G P+IWCMLHNF GN ++G L+ + GP AR N+TMVG
Sbjct: 379 LDLFAETQPVYSRTASFHGQPFIWCMLHNFGGNHGLFGALEDVNQGPQAARLFPNSTMVG 438
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
G++ EGI QN VVY LM+E+ ++ + V D+ AW++ ++ RRYG S P AW +L +
Sbjct: 439 TGIAPEGIGQNEVVYALMAELGWRKDPVPDLVAWVSSFASRRYGVSQPDAVAAWRLLLRS 498
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPS 323
VYNC+ + + NR +V P + S
Sbjct: 499 VYNCSGEACSGHNRSPLVKRPSLQMS 524
Score = 159 (61.0 bits), Expect = 4.7e-97, Sum P(2) = 4.7e-97
Identities = 34/106 (32%), Positives = 60/106 (56%)
Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
+WY+ S+V A L + + L+AS +RYDL+D+TRQA+ + + + A+ D
Sbjct: 527 VWYNRSDVFEAWRLLLRAAPNLTASPAFRYDLLDVTRQAVQELVSSCYEEARTAFLNQDL 586
Query: 417 HGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
+ + +L+ +D LLA + FLLG WL+ A+++A +E +
Sbjct: 587 DLLLRAGGLLTYKLLPSLDELLASNSHFLLGTWLDQAREVAVSESE 632
>UNIPROTKB|P54802 [details] [associations]
symbol:NAGLU "Alpha-N-acetylglucosaminidase" species:9606
"Homo sapiens" [GO:0004561 "alpha-N-acetylglucosaminidase activity"
evidence=IEA] [GO:0007040 "lysosome organization" evidence=IEA]
[GO:0021680 "cerebellar Purkinje cell layer development"
evidence=IEA] [GO:0042474 "middle ear morphogenesis" evidence=IEA]
[GO:0045475 "locomotor rhythm" evidence=IEA] [GO:0046548 "retinal
rod cell development" evidence=IEA] [GO:0060119 "inner ear receptor
cell development" evidence=IEA] [GO:0007399 "nervous system
development" evidence=TAS] [GO:0005764 "lysosome" evidence=TAS]
[GO:0005975 "carbohydrate metabolic process" evidence=TAS]
[GO:0006027 "glycosaminoglycan catabolic process" evidence=TAS]
[GO:0030203 "glycosaminoglycan metabolic process" evidence=TAS]
[GO:0043202 "lysosomal lumen" evidence=TAS] [GO:0044281 "small
molecule metabolic process" evidence=TAS] Reactome:REACT_111217
Pfam:PF05089 Reactome:REACT_116125 GO:GO:0007399 GO:GO:0044281
GO:GO:0005975 GO:GO:0043202 GO:GO:0006027 EMBL:U43572 EMBL:U43573
EMBL:U40846 EMBL:L78464 EMBL:AC067852 EMBL:BC053991 IPI:IPI00008787
PIR:G02270 RefSeq:NP_000254.2 UniGene:Hs.50727
ProteinModelPortal:P54802 SMR:P54802 STRING:P54802 CAZy:GH89
PhosphoSite:P54802 DMDM:1703303 PaxDb:P54802 PRIDE:P54802
Ensembl:ENST00000225927 GeneID:4669 KEGG:hsa:4669 UCSC:uc002hzv.3
CTD:4669 GeneCards:GC17P040687 H-InvDB:HIX0202517 HGNC:HGNC:7632
HPA:HPA038815 MIM:252920 MIM:609701 neXtProt:NX_P54802
Orphanet:79270 PharmGKB:PA31437 eggNOG:NOG86381
HOGENOM:HOG000214539 HOVERGEN:HBG004225 InParanoid:P54802 KO:K01205
OMA:LFPNSTM OrthoDB:EOG4Q84X0 PhylomeDB:P54802 ChiTaRS:NAGLU
DrugBank:DB00141 GenomeRNAi:4669 NextBio:17990 Bgee:P54802
CleanEx:HS_NAGLU Genevestigator:P54802 GermOnline:ENSG00000108784
GO:GO:0004561 InterPro:IPR007781 InterPro:IPR024732
InterPro:IPR024240 InterPro:IPR024733 PANTHER:PTHR12872
Pfam:PF12972 Pfam:PF12971 Uniprot:P54802
Length = 743
Score = 804 (288.1 bits), Expect = 9.9e-95, Sum P(2) = 9.9e-95
Identities = 148/323 (45%), Positives = 212/323 (65%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L ++ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TMVG
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 440
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ AW+ ++ RRYG S P AW +L +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 500
Query: 299 VYNCT-DGATDKNRDVIVAFPDV 320
VYNC+ + NR +V P +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPSL 523
Score = 158 (60.7 bits), Expect = 9.9e-95, Sum P(2) = 9.9e-95
Identities = 36/106 (33%), Positives = 56/106 (52%)
Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
+WY+ S+V A L + S L+ S +RYDL+DLTRQA+ + + + AY +
Sbjct: 529 IWYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKEL 588
Query: 417 HGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
+ + EL+ +D +LA FLLG WLE A+ A +E +
Sbjct: 589 ASLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAE 634
>UNIPROTKB|H9L296 [details] [associations]
symbol:H9L296 "Uncharacterized protein" species:9031
"Gallus gallus" [GO:0007040 "lysosome organization" evidence=IEA]
[GO:0021680 "cerebellar Purkinje cell layer development"
evidence=IEA] [GO:0042474 "middle ear morphogenesis" evidence=IEA]
[GO:0045475 "locomotor rhythm" evidence=IEA] [GO:0046548 "retinal
rod cell development" evidence=IEA] [GO:0060119 "inner ear receptor
cell development" evidence=IEA] Pfam:PF05089 OMA:LFPNSTM
InterPro:IPR007781 InterPro:IPR024732 InterPro:IPR024733
PANTHER:PTHR12872 Pfam:PF12972 GeneTree:ENSGT00390000005900
EMBL:AADN02054251 EMBL:AADN02054252 Ensembl:ENSGALT00000035813
Uniprot:H9L296
Length = 601
Score = 779 (279.3 bits), Expect = 1.1e-91, Sum P(2) = 1.1e-91
Identities = 156/357 (43%), Positives = 221/357 (61%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP++W QQL LQ +I+ R+ LGM VLPAF+G+VP + VFP T
Sbjct: 87 MGNLHSWAGPLPRAWHLQQLYLQYRIVERMRSLGMITVLPAFAGHVPPGVLRVFPRINAT 146
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LGNW D C YLL +P+F IG F+++ +KE+G T IY+ DT P
Sbjct: 147 RLGNWSHF--DCTLSCAYLLSPEEPMFQVIGTLFLKELIKEFG-TDRIYSADTHPHPRPA 203
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
V P + SL + + D +A WLMQGWLF + P FW+PPQ++A+L +VPLG+++V
Sbjct: 204 V-GPWLLCSL-----CSLPAADPEAQWLMQGWLFQHQPDFWQPPQVQAVLRAVPLGRMIV 257
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE KP++ ++ FYG P+IWCMLHNF GN ++G +++I GP AR N+TMVG
Sbjct: 258 LDLFAESKPVYEWTESFYGQPFIWCMLHNFGGNHGLFGAVEAINRGPFVARRFPNSTMVG 317
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
G+ EGIEQN +VY+LM+E+ ++HE +D+ W+++Y+ RRYG A AW +L +V
Sbjct: 318 TGLVPEGIEQNDMVYELMNELGWRHEPLDLPVWVSRYAQRRYGAPDAAAGAAWQLLLRSV 377
Query: 300 YNCTDGATDKNRDVIVAFPDV--DPSIISVTEGKYQNYGKPVSKEAVLKSETS-SYD 353
YNC+ + NR +V P + D + Y+ + +S A L S + YD
Sbjct: 378 YNCSGACVNHNRSPLVRRPSLHMDTQLWYNASDVYEAWRLLLSAGAALGSSPAFRYD 434
Score = 154 (59.3 bits), Expect = 1.1e-91, Sum P(2) = 1.1e-91
Identities = 36/109 (33%), Positives = 60/109 (55%)
Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
LWY+ S+V A L +++G L +S +RYDL D+TRQA+ + + + I +++Q
Sbjct: 404 LWYNASDVYEAWRLLLSAGAALGSSPAFRYDLADVTRQAVQQLVADYYQRIRDSFQRRAL 463
Query: 417 HGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQ 464
+ L +L+ ++D LL FLLG L+SA A +E + +Q
Sbjct: 464 PELLAAGGVLLYDLLPELDALLGSQRLFLLGRLLQSAHAAATSEREAEQ 512
>UNIPROTKB|F1S1D7 [details] [associations]
symbol:NAGLU "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0060119 "inner ear receptor cell development"
evidence=IEA] [GO:0046548 "retinal rod cell development"
evidence=IEA] [GO:0045475 "locomotor rhythm" evidence=IEA]
[GO:0042474 "middle ear morphogenesis" evidence=IEA] [GO:0021680
"cerebellar Purkinje cell layer development" evidence=IEA]
[GO:0007040 "lysosome organization" evidence=IEA] Pfam:PF05089
CTD:4669 KO:K01205 OMA:LFPNSTM InterPro:IPR007781
InterPro:IPR024732 InterPro:IPR024240 InterPro:IPR024733
PANTHER:PTHR12872 Pfam:PF12972 Pfam:PF12971
GeneTree:ENSGT00390000005900 EMBL:FP016109 RefSeq:XP_003131436.1
UniGene:Ssc.44812 Ensembl:ENSSSCT00000018940 GeneID:100519685
KEGG:ssc:100519685 Uniprot:F1S1D7
Length = 744
Score = 817 (292.7 bits), Expect = 2.0e-81, P = 2.0e-81
Identities = 174/421 (41%), Positives = 248/421 (58%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 205 MGNLHTWSGPLPRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQISVT 264
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+G+W + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 265 QMGSWGHFNCS--YSCSFLLAPEDPLFPIVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 321
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 322 SSEPSYLAAATAAVYQAMITVDPDAVWLLQGWLFQHQPQFWGPAQVGAVLGAVPRGRLLV 381
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+S+ GP AR N+TM G
Sbjct: 382 LDLFAESQPVYVRTASFLGQPFIWCMLHNFGGNHGLFGALESVNQGPAAARLFPNSTMAG 441
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ ++ + V D+ W+ ++ RRYG S + AW +L +
Sbjct: 442 TGMAPEGIGQNEVVYALMAELGWRKDPVADLGTWVTSFAARRYGVSQGDAEAAWRLLLRS 501
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
VYNC+ +G T NR +V PS+ T Y + +LK+ + P
Sbjct: 502 VYNCSGEGCTGHNRSPLVR----RPSLQMATTVWYNQSDVFEAWRLLLKATPTLASSPAF 557
Query: 358 WYSTSEVIR-ALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
Y ++ R A++ ++ E + + +L+ L R A A EL L ++ +D+
Sbjct: 558 RYDLVDITRQAVQELVSLYYEEARTAYLNKELVSLMR-AGGILAYEL-LPALDKVLASDS 615
Query: 417 H 417
H
Sbjct: 616 H 616
Score = 148 (57.2 bits), Expect = 6.8e-07, P = 6.8e-07
Identities = 34/106 (32%), Positives = 58/106 (54%)
Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
+WY+ S+V A L + + L++S +RYDL+D+TRQA+ + + + AY +
Sbjct: 530 VWYNQSDVFEAWRLLLKATPTLASSPAFRYDLVDITRQAVQELVSLYYEEARTAYLNKEL 589
Query: 417 HGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
+ + EL+ +D +LA FLLG WLE A+ +A +E +
Sbjct: 590 VSLMRAGGILAYELLPALDKVLASDSHFLLGSWLEQARGVAVSEAE 635
>UNIPROTKB|A6QM01 [details] [associations]
symbol:NAGLU "NAGLU protein" species:9913 "Bos taurus"
[GO:0060119 "inner ear receptor cell development" evidence=IEA]
[GO:0046548 "retinal rod cell development" evidence=IEA]
[GO:0045475 "locomotor rhythm" evidence=IEA] [GO:0042474 "middle
ear morphogenesis" evidence=IEA] [GO:0021680 "cerebellar Purkinje
cell layer development" evidence=IEA] [GO:0007040 "lysosome
organization" evidence=IEA] Pfam:PF05089 InterPro:IPR017853
SUPFAM:SSF51445 CAZy:GH89 CTD:4669 eggNOG:NOG86381
HOGENOM:HOG000214539 HOVERGEN:HBG004225 KO:K01205 OMA:LFPNSTM
OrthoDB:EOG4Q84X0 InterPro:IPR007781 InterPro:IPR024732
InterPro:IPR024240 InterPro:IPR024733 PANTHER:PTHR12872
Pfam:PF12972 Pfam:PF12971 GeneTree:ENSGT00390000005900
EMBL:DAAA02049190 EMBL:BC148147 IPI:IPI00717554
RefSeq:NP_001095696.1 UniGene:Bt.4204 Ensembl:ENSBTAT00000063695
GeneID:789125 KEGG:bta:789125 InParanoid:A6QM01 NextBio:20929511
Uniprot:A6QM01
Length = 667
Score = 809 (289.8 bits), Expect = 1.4e-80, P = 1.4e-80
Identities = 175/438 (39%), Positives = 251/438 (57%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 204 MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+GNW + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 264 QMGNWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATAAVYQAMTAVDPDAVWLLQGWLFQHQPEFWGPAQVAAVLGAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+S+ GP AR N+TMVG
Sbjct: 381 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPTTARHFPNSTMVG 440
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
GM+ EGI QN VVY LM+E+ +Q + V D+ AW+ ++ RRYG S + AW +L +
Sbjct: 441 TGMAPEGIGQNEVVYALMAELGWQKDPVADLGAWVTSFAARRYGVSHGDAEAAWRLLLRS 500
Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGK---YQNYGKPVSKEAVLKSETS-SYD 353
VYNC+ + N +V P + + +V + ++ + ++ + L S + S
Sbjct: 501 VYNCSGEECRGHNHSPLVRRPSLQ-MVTTVWYNRSDVFEAWRLLLTATSTLASSPAVSET 559
Query: 354 HPHLWYSTSEVIRALELFIASGNELSASNTYRYDLI-DLTRQALAKYANELFLNIIEA-- 410
H + S L L+ GN L +N L+ D + L ++++
Sbjct: 560 EAHFYEQNSRY--QLTLWGPEGNILDYANKQLAGLVADYYAPRWRLFTETLVESLVQGVP 617
Query: 411 YQLNDA-HGVFQLSRRFL 427
+Q + FQL + F+
Sbjct: 618 FQQHQFDRNAFQLEQTFV 635
>FB|FBgn0014417 [details] [associations]
symbol:CG13397 species:7227 "Drosophila melanogaster"
[GO:0004561 "alpha-N-acetylglucosaminidase activity" evidence=ISS]
Pfam:PF05089 EMBL:AE014134 CAZy:GH89 eggNOG:NOG86381 KO:K01205
OMA:LFPNSTM GO:GO:0004561 InterPro:IPR007781 InterPro:IPR024732
InterPro:IPR024240 InterPro:IPR024733 PANTHER:PTHR12872
Pfam:PF12972 Pfam:PF12971 EMBL:AY058738 RefSeq:NP_652045.1
UniGene:Dm.4228 SMR:Q9VLL5 MINT:MINT-996629 STRING:Q9VLL5
EnsemblMetazoa:FBtr0079711 EnsemblMetazoa:FBtr0331991 GeneID:46386
KEGG:dme:Dmel_CG13397 UCSC:CG13397-RA FlyBase:FBgn0014417
GeneTree:ENSGT00390000005900 InParanoid:Q9VLL5 OrthoDB:EOG422810
ChiTaRS:CG13397 GenomeRNAi:46386 NextBio:838826 Uniprot:Q9VLL5
Length = 778
Score = 634 (228.2 bits), Expect = 1.4e-78, Sum P(2) = 1.4e-78
Identities = 118/301 (39%), Positives = 181/301 (60%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GW GPL +W QL+LQ++I+ LGM+ LPAF+G+VP AL+ + P +
Sbjct: 223 MGNIRGWAGPLTPAWRRYQLLLQQEIITAQRNLGMSVALPAFAGHVPRALKRLNPESTFM 282
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W R+CC ++ T+ LF EI F+ + +YG ++HI+ CD F+E PP
Sbjct: 283 EVQRWNQFPD--RYCCGLFVEPTENLFKEIASRFLHNIITKYG-SNHIFFCDPFNELEPP 339
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V PEY+ S AAIY M+ D A+WL+QGW+F +PFW +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTAAAIYESMRGIDPQAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
DL +E P + ++ ++G P+IWCMLHNF G + M+G I G EAR N+++VG
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEEARRLPNSSLVGT 459
Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
G++ EGI QN V+Y E + + +D+ +W +S RYG ++ AW +L ++VY
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWSNTSLDLDSWFTNFSHSRYGVKDERLEQAWLLLKNSVY 519
Query: 301 N 301
+
Sbjct: 520 S 520
Score = 175 (66.7 bits), Expect = 1.4e-78, Sum P(2) = 1.4e-78
Identities = 42/124 (33%), Positives = 65/124 (52%)
Query: 344 VLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT----YRYDLIDLTRQALAKY 399
V+ S P WY+ S V+ A L + + + Y +DL+D+TRQ L
Sbjct: 532 VVTRRPSFNQEPFTWYNASAVLDAWHLLLTFRAIIPLEDNRYEIYEHDLVDITRQFLQIS 591
Query: 400 ANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNE 459
A++L++N+ AY+ LS + L+L +DM+ +LA FLLG WL+ AKQ A N
Sbjct: 592 ADQLYINLRSAYRKRQVSRFEFLSVKLLKLFDDMELILASSRNFLLGNWLQQAKQAAPNT 651
Query: 460 EQEK 463
Q++
Sbjct: 652 GQQR 655
>DICTYBASE|DDB_G0291998 [details] [associations]
symbol:naglu "alpha-N-acetylglucosaminidase"
species:44689 "Dictyostelium discoideum" [GO:0006027
"glycosaminoglycan catabolic process" evidence=IC] [GO:0004561
"alpha-N-acetylglucosaminidase activity" evidence=ISS]
dictyBase:DDB_G0291998 Pfam:PF05089 GenomeReviews:CM000155_GR
EMBL:AAFI02000187 GO:GO:0006027 eggNOG:NOG86381 KO:K01205
OMA:LFPNSTM GO:GO:0004561 InterPro:IPR007781 InterPro:IPR024732
InterPro:IPR024240 InterPro:IPR024733 PANTHER:PTHR12872
Pfam:PF12972 Pfam:PF12971 RefSeq:XP_629757.1
ProteinModelPortal:Q54DW5 STRING:Q54DW5 EnsemblProtists:DDB0238329
GeneID:8628432 KEGG:ddi:DDB_G0291998 ProtClustDB:CLSZ2497091
Uniprot:Q54DW5
Length = 798
Score = 697 (250.4 bits), Expect = 1.0e-68, P = 1.0e-68
Identities = 149/406 (36%), Positives = 225/406 (55%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++GWGGP+ WL++Q LQ KIL R+ + GM PVLP F+G++P A+Q +FP A I+
Sbjct: 255 MGNVNGWGGPITLDWLEKQRDLQIKILERMRQYGMKPVLPGFAGHIPGAIQQLFPQANIS 314
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
L W + T+ L++TDPLF +I FI + + +G T H YN D F+E PP
Sbjct: 315 VLSTWCNFNG------TFYLESTDPLFAKITTMFIGELIDVFG-TDHFYNFDPFNELEPP 367
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ +Y+ ++Y + D AVW++QGW P FW+ Q +A + VP+G ++V
Sbjct: 368 SNDTDYLRQTSQSMYENVLLADPKAVWVLQGWFIVDAPEFWQAKQTEAWFSGVPIGGVLV 427
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
LDL+++V P W+T+ +YG ++WCMLHNF G MYG L I+ P+ AR + MVG
Sbjct: 428 LDLWSDVIPGWTTTNYYYGHYWVWCMLHNFGGRSGMYGRLPWISSNPITAR-GLSPNMVG 486
Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
+G++ E IEQN VVYD+MSEM+++ + ++ W+ QY+ RRYG+ VP I D W L +TV
Sbjct: 487 IGLTPEAIEQNVVVYDMMSEMSWRSVQPNLTEWVTQYTHRRYGKLVPEIVDVWISLVNTV 546
Query: 300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ--------NYGKPVSKEAVLKSETSS 351
+N T N +F + P + Y N V E V+ +ET
Sbjct: 547 FNATAATARANMGAPESFIALRPQLTFGNNSFYNPNILYNAWNVFSMVDDEYVISTETFE 606
Query: 352 YDHPHLW------YSTSEVIRALELFIASGNELSASNTYRYDLIDL 391
+D Y + +E F AS ++ +T +L+D+
Sbjct: 607 FDISEFTMQSLSNYFMDQYFLLIEAFNAS--DVQTLSTISIELLDI 650
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.319 0.135 0.422 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 472 472 0.00099 118 3 11 22 0.39 34
35 0.45 37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 8
No. of states in DFA: 621 (66 KB)
Total size of DFA: 312 KB (2159 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 37.41u 0.12s 37.53t Elapsed: 00:00:02
Total cpu time: 37.41u 0.12s 37.53t Elapsed: 00:00:02
Start: Tue May 21 06:49:45 2013 End: Tue May 21 06:49:47 2013