Your job contains 1 sequence.
>009153
MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT
QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP
VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL
DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAWI
NQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ
NYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTR
QALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK
QLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKY
MIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVF
DH
The BLAST search returned 1 gene product which did not match your query constraints. Please see the full BLAST report below for the details.
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 009153
(542 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
Smallest
Sum
High Probability
Sequences producing High-scoring Segment Pairs: Score P(N) N
TAIR|locus:2173209 - symbol:CYL1 "AT5G13690" species:3702... 941 4.5e-176 2
UNIPROTKB|F1S1D7 - symbol:NAGLU "Uncharacterized protein"... 639 2.4e-91 2
RGD|1564228 - symbol:Naglu "N-acetylglucosaminidase, alph... 909 3.5e-91 1
UNIPROTKB|P54802 - symbol:NAGLU "Alpha-N-acetylglucosamin... 865 1.6e-86 1
UNIPROTKB|H9L296 - symbol:H9L296 "Uncharacterized protein... 820 9.4e-82 1
FB|FBgn0014417 - symbol:CG13397 species:7227 "Drosophila ... 782 1.0e-77 1
UNIPROTKB|A6QM01 - symbol:NAGLU "NAGLU protein" species:9... 641 4.1e-77 2
DICTYBASE|DDB_G0291998 - symbol:naglu "alpha-N-acetylgluc... 715 1.3e-70 1
>TAIR|locus:2173209 [details] [associations]
symbol:CYL1 "AT5G13690" species:3702 "Arabidopsis
thaliana" [GO:0004561 "alpha-N-acetylglucosaminidase activity"
evidence=ISS] [GO:0009507 "chloroplast" evidence=ISM] [GO:0005773
"vacuole" evidence=IDA] Pfam:PF05089 EMBL:CP002688 GO:GO:0005773
EMBL:AB006704 CAZy:GH89 HOGENOM:HOG000214539 KO:K01205 OMA:LFPNSTM
InterPro:IPR007781 InterPro:IPR024732 InterPro:IPR024240
InterPro:IPR024733 PANTHER:PTHR12872 Pfam:PF12972 Pfam:PF12971
UniGene:At.49017 UniGene:At.6477 EMBL:AY080811 EMBL:AY117179
IPI:IPI00516873 RefSeq:NP_196873.1 ProteinModelPortal:Q9FNA3
STRING:Q9FNA3 PRIDE:Q9FNA3 ProMEX:Q9FNA3 EnsemblPlants:AT5G13690.1
GeneID:831214 KEGG:ath:AT5G13690 TAIR:At5g13690 InParanoid:Q9FNA3
PhylomeDB:Q9FNA3 ProtClustDB:CLSN2687036 ArrayExpress:Q9FNA3
Genevestigator:Q9FNA3 Uniprot:Q9FNA3
Length = 806
Score = 941 (336.3 bits), Expect = 4.5e-176, Sum P(2) = 4.5e-176
Identities = 164/241 (68%), Positives = 199/241 (82%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH WGGPL ++WLD QL+LQK+IL R+ + GM PVLP+FSGNVP+AL+ ++P A IT
Sbjct: 231 MGNLHAWGGPLSKNWLDDQLLLQKQILSRMLKFGMTPVLPSFSGNVPSALRKIYPEANIT 290
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+L NW +V D RWCCTYLL+ +DPLFIEIG AFI+QQ +EYG ++IYNCDTF+ENTPP
Sbjct: 291 RLDNWNTVDGDSRWCCTYLLNPSDPLFIEIGEAFIKQQTEEYGEITNIYNCDTFNENTPP 350
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
PEYISSLGAA+Y M G+ +AVWLMQGWLFS D FW+PPQ+KALL+SVP GK++V
Sbjct: 351 TSEPEYISSLGAAVYKAMSKGNKNAVWLMQGWLFSSDSKFWKPPQLKALLHSVPFGKMIV 410
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAW 239
LDL+AEVKPIW+ S QFYG PYIWCMLHNF GNIEMYG LDSI+ GPV+AR S+N+TM
Sbjct: 411 LDLYAEVKPIWNKSAQFYGTPYIWCMLHNFGGNIEMYGALDSISSGPVDARVSKNSTMVG 470
Query: 240 I 240
+
Sbjct: 471 V 471
Score = 791 (283.5 bits), Expect = 4.5e-176, Sum P(2) = 4.5e-176
Identities = 161/338 (47%), Positives = 216/338 (63%)
Query: 204 CMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLY 263
CM N +Y + +AF R + W+ Y+ RRY + I+ AW +LY
Sbjct: 474 CM-EGIEQNPVVYELTSEMAF-----RDEKVDVQKWLKSYARRRYMKENHQIEAAWEILY 527
Query: 264 HTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ--NY----GKPVSKEAVL-KSET 316
HTVYNCTDG D N D IV PD DPS SV + Q +Y G +K VL + +T
Sbjct: 528 HTVYNCTDGIADHNTDFIVKLPDWDPSS-SVQDDLKQKDSYMISTGPYETKRRVLFQDKT 586
Query: 317 SSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIE 376
+ HLWYST EVI+AL+LF+ +G++LS S TYRYD++DLTRQ L+K AN+++ +
Sbjct: 587 ADLPKAHLWYSTKEVIQALKLFLEAGDDLSRSLTYRYDMVDLTRQVLSKLANQVYTEAVT 646
Query: 377 AYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNA 436
A+ D + QLS +FLEL++DMD LLA D LLG WLESAK+LA+N ++ KQYEWNA
Sbjct: 647 AFVKKDIGSLGQLSEKFLELIKDMDVLLASDDNCLLGTWLESAKKLAKNGDERKQYEWNA 706
Query: 437 RTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDW 496
RTQ+TMW+D+ S L DY NK+WSGLL DYY PRA +YF M++SL F+++ W
Sbjct: 707 RTQVTMWYDSNDVNQSKLHDYANKFWSGLLEDYYLPRARLYFNEMLKSLRDKKIFKVEKW 766
Query: 497 RREWIKLTNDWQNGRN-VYPVESNGDALITSQWLYNKY 533
RREWI +++ WQ + VYPV++ GDAL S+ L +KY
Sbjct: 767 RREWIMMSHKWQQSSSEVYPVKAKGDALAISRHLLSKY 804
>UNIPROTKB|F1S1D7 [details] [associations]
symbol:NAGLU "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0060119 "inner ear receptor cell development"
evidence=IEA] [GO:0046548 "retinal rod cell development"
evidence=IEA] [GO:0045475 "locomotor rhythm" evidence=IEA]
[GO:0042474 "middle ear morphogenesis" evidence=IEA] [GO:0021680
"cerebellar Purkinje cell layer development" evidence=IEA]
[GO:0007040 "lysosome organization" evidence=IEA] Pfam:PF05089
CTD:4669 KO:K01205 OMA:LFPNSTM InterPro:IPR007781
InterPro:IPR024732 InterPro:IPR024240 InterPro:IPR024733
PANTHER:PTHR12872 Pfam:PF12972 Pfam:PF12971
GeneTree:ENSGT00390000005900 EMBL:FP016109 RefSeq:XP_003131436.1
UniGene:Ssc.44812 Ensembl:ENSSSCT00000018940 GeneID:100519685
KEGG:ssc:100519685 Uniprot:F1S1D7
Length = 744
Score = 639 (230.0 bits), Expect = 2.4e-91, Sum P(2) = 2.4e-91
Identities = 120/239 (50%), Positives = 159/239 (66%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 205 MGNLHTWSGPLPRSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQISVT 264
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+G+W + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 265 QMGSWGHFNCS--YSCSFLLAPEDPLFPIVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 321
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 322 SSEPSYLAAATAAVYQAMITVDPDAVWLLQGWLFQHQPQFWGPAQVGAVLGAVPRGRLLV 381
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA 238
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+S+ GP AR N+TMA
Sbjct: 382 LDLFAESQPVYVRTASFLGQPFIWCMLHNFGGNHGLFGALESVNQGPAAARLFPNSTMA 440
Score = 291 (107.5 bits), Expect = 2.4e-91, Sum P(2) = 2.4e-91
Identities = 68/211 (32%), Positives = 114/211 (54%)
Query: 324 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 383
+WY+ S+V A L + + L++S +RYDL+D+TRQA+ + + + AY +
Sbjct: 530 VWYNQSDVFEAWRLLLKATPTLASSPAFRYDLVDITRQAVQELVSLYYEEARTAYLNKEL 589
Query: 384 HGVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITM 442
+ + EL+ +D +LA FLLG WLE A+ +A +E + YE N+R Q+T+
Sbjct: 590 VSLMRAGGILAYELLPALDKVLASDSHFLLGSWLEQARGVAVSEAEALFYEQNSRYQLTL 649
Query: 443 WFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIK 502
W E ++L DY NK +GL+ DYY PR ++ + ++ESL G F+ + + +
Sbjct: 650 W----GPEGNIL-DYANKQLAGLVADYYTPRWRLFMEMLVESLVQGIPFQQHQFDQNVFQ 704
Query: 503 LTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
L + G YP + GD + ++ L+ KY
Sbjct: 705 LEQTFVLGTRRYPSQPQGDTVDLAKKLFLKY 735
Score = 111 (44.1 bits), Expect = 2.3e-72, Sum P(2) = 2.3e-72
Identities = 39/148 (26%), Positives = 67/148 (45%)
Query: 239 WINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTEG 297
W+ ++ RRYG S + AW +L +VYNC+ +G T NR +V PS+ T
Sbjct: 475 WVTSFAARRYGVSQGDAEAAWRLLLRSVYNCSGEGCTGHNRSPLVR----RPSLQMATTV 530
Query: 298 KYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIR-ALELFIASGNELSASNTYRYDLI 356
Y + +LK+ + P Y ++ R A++ ++ E + + +L+
Sbjct: 531 WYNQSDVFEAWRLLLKATPTLASSPAFRYDLVDITRQAVQELVSLYYEEARTAYLNKELV 590
Query: 357 DLTRQALAKYANELFLNIIEAYQLNDAH 384
L R A A EL L ++ +D+H
Sbjct: 591 SLMR-AGGILAYEL-LPALDKVLASDSH 616
Score = 51 (23.0 bits), Expect = 3.5e-23, Sum P(2) = 3.5e-23
Identities = 10/29 (34%), Positives = 16/29 (55%)
Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVW 147
P D +++S A Y G+ GD++A W
Sbjct: 468 PVADLGTWVTSFAARRY-GVSQGDAEAAW 495
>RGD|1564228 [details] [associations]
symbol:Naglu "N-acetylglucosaminidase, alpha" species:10116
"Rattus norvegicus" [GO:0007040 "lysosome organization"
evidence=ISO] [GO:0021680 "cerebellar Purkinje cell layer
development" evidence=ISO] [GO:0042474 "middle ear morphogenesis"
evidence=ISO] [GO:0045475 "locomotor rhythm" evidence=ISO]
[GO:0046548 "retinal rod cell development" evidence=ISO]
[GO:0060119 "inner ear receptor cell development" evidence=ISO]
REFSEQ:XM_001081442 Ncbi:XP_001081442
Length = 739
Score = 909 (325.0 bits), Expect = 3.5e-91, P = 3.5e-91
Identities = 201/540 (37%), Positives = 299/540 (55%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP+SW +QL LQ +IL R+ GM PVLPAF+G+VP A+ VFP +
Sbjct: 202 MGNLHTWDGPLPRSWHLKQLYLQHRILDRMRSFGMTPVLPAFAGHVPKAITRVFPQVNVI 261
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
QLGNW + C++LL DPLF IG F+ + KE+G T HIY DTF+E PP
Sbjct: 262 QLGNWGHFNCS--YSCSFLLAPGDPLFPLIGTLFLRELTKEFG-TDHIYGADTFNEMQPP 318
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+KA+L +VP G+L+V
Sbjct: 319 FSDPSYLAAATAAVYEAMVTVDPDAVWLLQGWLFQHQPQFWGPSQIKAVLEAVPRGRLLV 378
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAW 239
LDLFAE +P++S + F+G P+IWCMLHNF GN ++G L+ + GP AR N+TM
Sbjct: 379 LDLFAETQPVYSRTASFHGQPFIWCMLHNFGGNHGLFGALEDVNQGPQAARLFPNSTMVG 438
Query: 240 INQYSVRRYGRS--VPAIQDAWNVLYHTVYNCTDGATD-KNRDVIVAFPDVDPSIISVTE 296
+ G++ V A+ V + + +R V+ PD + +
Sbjct: 439 TG-IAPEGIGQNEVVYALMAELGWRKDPVPDLVAWVSSFASRRYGVSQPDAVAAWRLLLR 497
Query: 297 GKYQNYGKPVS--KEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYD 354
Y G+ S + L S +WY+ S+V A L + + L+AS +RYD
Sbjct: 498 SVYNCSGEACSGHNRSPLVKRPSLQMSTAVWYNRSDVFEAWRLLLRAAPNLTASPAFRYD 557
Query: 355 LIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFLLG 413
L+D+TRQA+ + + + A+ D + + +L+ +D LLA + FLLG
Sbjct: 558 LLDVTRQAVQELVSSCYEEARTAFLNQDLDLLLRAGGLLTYKLLPSLDELLASNSHFLLG 617
Query: 414 PWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPR 473
WL+ A+++A +E + + YE N+R QIT+W E ++L DY NK +GL+ DYY PR
Sbjct: 618 TWLDQAREVAVSESEAQFYEQNSRYQITLW----GPEGNIL-DYANKQLAGLVADYYQPR 672
Query: 474 AAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
++ + SL G F+ + + L + N + YP++ GD + S+ ++ K+
Sbjct: 673 WCLFLGTLAHSLARGIPFQQHQFEKSVFPLEQAFINNKKRYPIQPQGDTVDLSKKIFLKF 732
>UNIPROTKB|P54802 [details] [associations]
symbol:NAGLU "Alpha-N-acetylglucosaminidase" species:9606
"Homo sapiens" [GO:0004561 "alpha-N-acetylglucosaminidase activity"
evidence=IEA] [GO:0007040 "lysosome organization" evidence=IEA]
[GO:0021680 "cerebellar Purkinje cell layer development"
evidence=IEA] [GO:0042474 "middle ear morphogenesis" evidence=IEA]
[GO:0045475 "locomotor rhythm" evidence=IEA] [GO:0046548 "retinal
rod cell development" evidence=IEA] [GO:0060119 "inner ear receptor
cell development" evidence=IEA] [GO:0007399 "nervous system
development" evidence=TAS] [GO:0005764 "lysosome" evidence=TAS]
[GO:0005975 "carbohydrate metabolic process" evidence=TAS]
[GO:0006027 "glycosaminoglycan catabolic process" evidence=TAS]
[GO:0030203 "glycosaminoglycan metabolic process" evidence=TAS]
[GO:0043202 "lysosomal lumen" evidence=TAS] [GO:0044281 "small
molecule metabolic process" evidence=TAS] Reactome:REACT_111217
Pfam:PF05089 Reactome:REACT_116125 GO:GO:0007399 GO:GO:0044281
GO:GO:0005975 GO:GO:0043202 GO:GO:0006027 EMBL:U43572 EMBL:U43573
EMBL:U40846 EMBL:L78464 EMBL:AC067852 EMBL:BC053991 IPI:IPI00008787
PIR:G02270 RefSeq:NP_000254.2 UniGene:Hs.50727
ProteinModelPortal:P54802 SMR:P54802 STRING:P54802 CAZy:GH89
PhosphoSite:P54802 DMDM:1703303 PaxDb:P54802 PRIDE:P54802
Ensembl:ENST00000225927 GeneID:4669 KEGG:hsa:4669 UCSC:uc002hzv.3
CTD:4669 GeneCards:GC17P040687 H-InvDB:HIX0202517 HGNC:HGNC:7632
HPA:HPA038815 MIM:252920 MIM:609701 neXtProt:NX_P54802
Orphanet:79270 PharmGKB:PA31437 eggNOG:NOG86381
HOGENOM:HOG000214539 HOVERGEN:HBG004225 InParanoid:P54802 KO:K01205
OMA:LFPNSTM OrthoDB:EOG4Q84X0 PhylomeDB:P54802 ChiTaRS:NAGLU
DrugBank:DB00141 GenomeRNAi:4669 NextBio:17990 Bgee:P54802
CleanEx:HS_NAGLU Genevestigator:P54802 GermOnline:ENSG00000108784
GO:GO:0004561 InterPro:IPR007781 InterPro:IPR024732
InterPro:IPR024240 InterPro:IPR024733 PANTHER:PTHR12872
Pfam:PF12972 Pfam:PF12971 Uniprot:P54802
Length = 743
Score = 865 (309.6 bits), Expect = 1.6e-86, P = 1.6e-86
Identities = 191/542 (35%), Positives = 295/542 (54%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ ++L ++ GM PVLPAF+G+VP A+ VFP +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++G+W + C++LL DP+F IG F+ + +KE+G T HIY DTF+E PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ A+Y M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAW 239
LDLFAE +P+++ + F G P+IWCMLHNF GN ++G L+++ GP AR N+TM
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 440
Query: 240 INQ----YSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATD-KNRDVIVAFPDVDPSIISV 294
S S+ A + W V + T R V+ PD + +
Sbjct: 441 TGMAPEGISQNEVVYSLMA-ELGWRK--DPVPDLAAWVTSFAARRYGVSHPDAGAAWRLL 497
Query: 295 TEGKYQNYGKPVS--KEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYR 352
Y G+ + L S + +WY+ S+V A L + S L+ S +R
Sbjct: 498 LRSVYNCSGEACRGHNRSPLVRRPSLQMNTSIWYNRSDVFEAWRLLLTSAPSLATSPAFR 557
Query: 353 YDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGLLACHDGFL 411
YDL+DLTRQA+ + + + AY + + + EL+ +D +LA FL
Sbjct: 558 YDLLDLTRQAVQELVSLYYEEARSAYLSKELASLLRAGGVLAYELLPALDEVLASDSRFL 617
Query: 412 LGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 471
LG WLE A+ A +E + YE N+R Q+T+W E ++L DY NK +GL+ +YY
Sbjct: 618 LGSWLEQARAAAVSEAEADFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVANYYT 672
Query: 472 PRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYN 531
PR ++ + +++S+ G F+ + + +L + + YP + GD + ++ ++
Sbjct: 673 PRWRLFLEALVDSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFL 732
Query: 532 KY 533
KY
Sbjct: 733 KY 734
>UNIPROTKB|H9L296 [details] [associations]
symbol:H9L296 "Uncharacterized protein" species:9031
"Gallus gallus" [GO:0007040 "lysosome organization" evidence=IEA]
[GO:0021680 "cerebellar Purkinje cell layer development"
evidence=IEA] [GO:0042474 "middle ear morphogenesis" evidence=IEA]
[GO:0045475 "locomotor rhythm" evidence=IEA] [GO:0046548 "retinal
rod cell development" evidence=IEA] [GO:0060119 "inner ear receptor
cell development" evidence=IEA] Pfam:PF05089 OMA:LFPNSTM
InterPro:IPR007781 InterPro:IPR024732 InterPro:IPR024733
PANTHER:PTHR12872 Pfam:PF12972 GeneTree:ENSGT00390000005900
EMBL:AADN02054251 EMBL:AADN02054252 Ensembl:ENSGALT00000035813
Uniprot:H9L296
Length = 601
Score = 820 (293.7 bits), Expect = 9.4e-82, P = 9.4e-82
Identities = 196/540 (36%), Positives = 292/540 (54%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP++W QQL LQ +I+ R+ LGM VLPAF+G+VP + VFP T
Sbjct: 87 MGNLHSWAGPLPRAWHLQQLYLQYRIVERMRSLGMITVLPAFAGHVPPGVLRVFPRINAT 146
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
+LGNW D C YLL +P+F IG F+++ +KE+G T IY+ DT P
Sbjct: 147 RLGNWSHF--DCTLSCAYLLSPEEPMFQVIGTLFLKELIKEFG-TDRIYSADTHPHPRPA 203
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
V P + SL + + D +A WLMQGWLF + P FW+PPQ++A+L +VPLG+++V
Sbjct: 204 V-GPWLLCSL-----CSLPAADPEAQWLMQGWLFQHQPDFWQPPQVQAVLRAVPLGRMIV 257
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAW 239
LDLFAE KP++ ++ FYG P+IWCMLHNF GN ++G +++I GP AR N+TM
Sbjct: 258 LDLFAESKPVYEWTESFYGQPFIWCMLHNFGGNHGLFGAVEAINRGPFVARRFPNSTMVG 317
Query: 240 -------INQ----YSV-RRYG-RSVPAIQDAWNVLY-HTVYNCTDGATDKNRDVIVAFP 285
I Q Y + G R P W Y Y D A +++
Sbjct: 318 TGLVPEGIEQNDMVYELMNELGWRHEPLDLPVWVSRYAQRRYGAPDAAAGAAWQLLLR-- 375
Query: 286 DVDPSIISVTEGKYQNYGK-PVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNE 344
S+ + + G N+ + P+ + L +T LWY+ S+V A L +++G
Sbjct: 376 ----SVYNCS-GACVNHNRSPLVRRPSLHMDTQ------LWYNASDVYEAWRLLLSAGAA 424
Query: 345 LSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFL-ELVEDMDGL 403
L +S +RYDL D+TRQA+ + + + I +++Q + L +L+ ++D L
Sbjct: 425 LGSSPAFRYDLADVTRQAVQQLVADYYQRIRDSFQRRALPELLAAGGVLLYDLLPELDAL 484
Query: 404 LACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWS 463
L FLLG L+SA A +E + +QYE NAR Q+T+W + ++L DY NK +
Sbjct: 485 LGSQRLFLLGRLLQSAHAAATSEREAEQYERNARNQVTLWGPS----GNIL-DYANKQLA 539
Query: 464 GLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDAL 523
GL+ DYYG R +++ ++ESL +G F + + + ++ + YP GD L
Sbjct: 540 GLVLDYYGVRWSLFVSLLVESLNTGSPFPQEQFNQAVFQVERGFVYNEKRYPATPVGDTL 599
>FB|FBgn0014417 [details] [associations]
symbol:CG13397 species:7227 "Drosophila melanogaster"
[GO:0004561 "alpha-N-acetylglucosaminidase activity" evidence=ISS]
Pfam:PF05089 EMBL:AE014134 CAZy:GH89 eggNOG:NOG86381 KO:K01205
OMA:LFPNSTM GO:GO:0004561 InterPro:IPR007781 InterPro:IPR024732
InterPro:IPR024240 InterPro:IPR024733 PANTHER:PTHR12872
Pfam:PF12972 Pfam:PF12971 EMBL:AY058738 RefSeq:NP_652045.1
UniGene:Dm.4228 SMR:Q9VLL5 MINT:MINT-996629 STRING:Q9VLL5
EnsemblMetazoa:FBtr0079711 EnsemblMetazoa:FBtr0331991 GeneID:46386
KEGG:dme:Dmel_CG13397 UCSC:CG13397-RA FlyBase:FBgn0014417
GeneTree:ENSGT00390000005900 InParanoid:Q9VLL5 OrthoDB:EOG422810
ChiTaRS:CG13397 GenomeRNAi:46386 NextBio:838826 Uniprot:Q9VLL5
Length = 778
Score = 782 (280.3 bits), Expect = 1.0e-77, P = 1.0e-77
Identities = 179/546 (32%), Positives = 292/546 (53%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N+ GW GPL +W QL+LQ++I+ LGM+ LPAF+G+VP AL+ + P +
Sbjct: 223 MGNIRGWAGPLTPAWRRYQLLLQQEIITAQRNLGMSVALPAFAGHVPRALKRLNPESTFM 282
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
++ W R+CC ++ T+ LF EI F+ + +YG ++HI+ CD F+E PP
Sbjct: 283 EVQRWNQFPD--RYCCGLFVEPTENLFKEIASRFLHNIITKYG-SNHIFFCDPFNELEPP 339
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
V PEY+ S AAIY M+ D A+WL+QGW+F +PFW +A L + P G+++VL
Sbjct: 340 VAKPEYMRSTAAAIYESMRGIDPQAIWLLQGWMFVKNPFWTTDMAEAFLTAAPRGRILVL 399
Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAW- 239
DL +E P + ++ ++G P+IWCMLHNF G + M+G I G EAR N+++
Sbjct: 400 DLQSEQFPQYELTRSYFGQPFIWCMLHNFGGTLGMFGSAKLINSGIEEARRLPNSSLVGT 459
Query: 240 -INQYSV-RRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVI-VAFPDVDPSIISVTE 296
I + + Y ++ W+ +T + T+ + V ++ + + +
Sbjct: 460 GITPEGIGQNYVMYSFTLERGWS---NTSLDLDSWFTNFSHSRYGVKDERLEQAWLLLKN 516
Query: 297 GKYQNYG-KPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNT----Y 351
Y G + + + V+ S P WY+ S V+ A L + + + Y
Sbjct: 517 SVYSFRGLQKMRGQYVVTRRPSFNQEPFTWYNASAVLDAWHLLLTFRAIIPLEDNRYEIY 576
Query: 352 RYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFL 411
+DL+D+TRQ L A++L++N+ AY+ LS + L+L +DM+ +LA FL
Sbjct: 577 EHDLVDITRQFLQISADQLYINLRSAYRKRQVSRFEFLSVKLLKLFDDMELILASSRNFL 636
Query: 412 LGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYG 471
LG WL+ AKQ A N Q++ +E+NAR QIT W + Q + DY K WSGL+ DYY
Sbjct: 637 LGNWLQQAKQAAPNTGQQRNFEFNARNQITAWGPDGQ-----ILDYACKQWSGLVSDYYR 691
Query: 472 PRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND----WQNGRNVYPVESNGDALITSQ 527
PR ++ + + +L +G F ++ +K++++ + N +VYPV G+ + SQ
Sbjct: 692 PRWRLFLEDVTVALHAGRPFNGTAFK---LKVSHEIELPFSNKDDVYPVTPVGNTWLISQ 748
Query: 528 WLYNKY 533
++ +
Sbjct: 749 DIFETW 754
>UNIPROTKB|A6QM01 [details] [associations]
symbol:NAGLU "NAGLU protein" species:9913 "Bos taurus"
[GO:0060119 "inner ear receptor cell development" evidence=IEA]
[GO:0046548 "retinal rod cell development" evidence=IEA]
[GO:0045475 "locomotor rhythm" evidence=IEA] [GO:0042474 "middle
ear morphogenesis" evidence=IEA] [GO:0021680 "cerebellar Purkinje
cell layer development" evidence=IEA] [GO:0007040 "lysosome
organization" evidence=IEA] Pfam:PF05089 InterPro:IPR017853
SUPFAM:SSF51445 CAZy:GH89 CTD:4669 eggNOG:NOG86381
HOGENOM:HOG000214539 HOVERGEN:HBG004225 KO:K01205 OMA:LFPNSTM
OrthoDB:EOG4Q84X0 InterPro:IPR007781 InterPro:IPR024732
InterPro:IPR024240 InterPro:IPR024733 PANTHER:PTHR12872
Pfam:PF12972 Pfam:PF12971 GeneTree:ENSGT00390000005900
EMBL:DAAA02049190 EMBL:BC148147 IPI:IPI00717554
RefSeq:NP_001095696.1 UniGene:Bt.4204 Ensembl:ENSBTAT00000063695
GeneID:789125 KEGG:bta:789125 InParanoid:A6QM01 NextBio:20929511
Uniprot:A6QM01
Length = 667
Score = 641 (230.7 bits), Expect = 4.1e-77, Sum P(2) = 4.1e-77
Identities = 120/238 (50%), Positives = 157/238 (65%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M NLH W GPLP SW +QL LQ +IL R+ GM PVLPAF+G+VP AL VFP +T
Sbjct: 204 MGNLHTWSGPLPPSWHLKQLYLQHRILDRMRSFGMIPVLPAFAGHVPKALTRVFPQVNVT 263
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
Q+GNW + C++LL DPLF +G F+ + KE+G T HIY DTF+E PP
Sbjct: 264 QMGNWGHFNCS--YSCSFLLAPEDPLFPLVGSLFLRELTKEFG-TDHIYGADTFNEMQPP 320
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
P Y+++ AA+Y M + D DAVWL+QGWLF + P FW P Q+ A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATAAVYQAMTAVDPDAVWLLQGWLFQHQPEFWGPAQVAAVLGAVPRGRLLV 380
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM 237
LDLFAE +P++ + F G P+IWCMLHNF GN ++G L+S+ GP AR N+TM
Sbjct: 381 LDLFAESQPVYVRTASFQGQPFIWCMLHNFGGNHGLFGALESVNQGPTTARHFPNSTM 438
Score = 154 (59.3 bits), Expect = 4.1e-77, Sum P(2) = 4.1e-77
Identities = 38/115 (33%), Positives = 60/115 (52%)
Query: 419 AKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYF 478
A A +E + YE N+R Q+T+W E ++L DY NK +GL+ DYY PR ++
Sbjct: 551 ASSPAVSETEAHFYEQNSRYQLTLW----GPEGNIL-DYANKQLAGLVADYYAPRWRLFT 605
Query: 479 KYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKY 533
+ ++ESL G F+ + R +L + G YP + GD + + L+ KY
Sbjct: 606 ETLVESLVQGVPFQQHQFDRNAFQLEQTFVLGTRRYPSQPEGDTVDLVKKLFLKY 660
Score = 89 (36.4 bits), Expect = 2.9e-70, Sum P(2) = 2.9e-70
Identities = 37/166 (22%), Positives = 70/166 (42%)
Query: 238 AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCT-DGATDKNRDVIVAFPDVDPSIISVTE 296
AW+ ++ RRYG S + AW +L +VYNC+ + N +V P + + +V
Sbjct: 473 AWVTSFAARRYGVSHGDAEAAWRLLLRSVYNCSGEECRGHNHSPLVRRPSLQ-MVTTVWY 531
Query: 297 GK---YQNYGKPVSKEAVLKSETS-SYDHPHLWYSTSEVIRALELFIASGNELSASNTYR 352
+ ++ + ++ + L S + S H + S L L+ GN L +N
Sbjct: 532 NRSDVFEAWRLLLTATSTLASSPAVSETEAHFYEQNSRY--QLTLWGPEGNILDYANKQL 589
Query: 353 YDLI-DLTRQALAKYANELFLNIIEA--YQLNDA-HGVFQLSRRFL 394
L+ D + L ++++ +Q + FQL + F+
Sbjct: 590 AGLVADYYAPRWRLFTETLVESLVQGVPFQQHQFDRNAFQLEQTFV 635
Score = 51 (23.0 bits), Expect = 4.9e-08, Sum P(2) = 4.9e-08
Identities = 10/32 (31%), Positives = 17/32 (53%)
Query: 116 ENTPPVDSPEYISSLGAAIYSGMQSGDSDAVW 147
+ P D +++S A Y G+ GD++A W
Sbjct: 464 QKDPVADLGAWVTSFAARRY-GVSHGDAEAAW 494
>DICTYBASE|DDB_G0291998 [details] [associations]
symbol:naglu "alpha-N-acetylglucosaminidase"
species:44689 "Dictyostelium discoideum" [GO:0006027
"glycosaminoglycan catabolic process" evidence=IC] [GO:0004561
"alpha-N-acetylglucosaminidase activity" evidence=ISS]
dictyBase:DDB_G0291998 Pfam:PF05089 GenomeReviews:CM000155_GR
EMBL:AAFI02000187 GO:GO:0006027 eggNOG:NOG86381 KO:K01205
OMA:LFPNSTM GO:GO:0004561 InterPro:IPR007781 InterPro:IPR024732
InterPro:IPR024240 InterPro:IPR024733 PANTHER:PTHR12872
Pfam:PF12972 Pfam:PF12971 RefSeq:XP_629757.1
ProteinModelPortal:Q54DW5 STRING:Q54DW5 EnsemblProtists:DDB0238329
GeneID:8628432 KEGG:ddi:DDB_G0291998 ProtClustDB:CLSZ2497091
Uniprot:Q54DW5
Length = 798
Score = 715 (256.8 bits), Expect = 1.3e-70, P = 1.3e-70
Identities = 169/552 (30%), Positives = 282/552 (51%)
Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
M N++GWGGP+ WL++Q LQ KIL R+ + GM PVLP F+G++P A+Q +FP A I+
Sbjct: 255 MGNVNGWGGPITLDWLEKQRDLQIKILERMRQYGMKPVLPGFAGHIPGAIQQLFPQANIS 314
Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
L W + T+ L++TDPLF +I FI + + +G T H YN D F+E PP
Sbjct: 315 VLSTWCNFNG------TFYLESTDPLFAKITTMFIGELIDVFG-TDHFYNFDPFNELEPP 367
Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
+ +Y+ ++Y + D AVW++QGW P FW+ Q +A + VP+G ++V
Sbjct: 368 SNDTDYLRQTSQSMYENVLLADPKAVWVLQGWFIVDAPEFWQAKQTEAWFSGVPIGGVLV 427
Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEAR-TSENTTMA 238
LDL+++V P W+T+ +YG ++WCMLHNF G MYG L I+ P+ AR S N
Sbjct: 428 LDLWSDVIPGWTTTNYYYGHYWVWCMLHNFGGRSGMYGRLPWISSNPITARGLSPNMVGI 487
Query: 239 WINQYSVRRYGRSVPAIQD-AWNVLYHTVYNCTDGATDKNRDVIVA-FPDVDPSIISV-- 294
+ ++ + + + +W + + T + +V DV S+++
Sbjct: 488 GLTPEAIEQNVVVYDMMSEMSWRSVQPNLTEWVTQYTHRRYGKLVPEIVDVWISLVNTVF 547
Query: 295 --TEGKYQ-NYGKPVSKEAVLKSET---SSYDHPHLWYSTSEVIRALELFIASGNELSAS 348
T + N G P S A+ T +S+ +P++ Y+ V ++ + ++
Sbjct: 548 NATAATARANMGAPESFIALRPQLTFGNNSFYNPNILYNAWNVFSMVD-----DEYVIST 602
Query: 349 NTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHD 408
T+ +D+ + T Q+L+ Y + + +IEA+ +D + +S L+++ MD + +
Sbjct: 603 ETFEFDISEFTMQSLSNYFMDQYFLLIEAFNASDVQTLSTISIELLDIINYMDEIASTQS 662
Query: 409 GFLLGPWLESAKQLA---------QNEEQEKQ--YEWNARTQITMWFDNTQEEASLLRDY 457
LG W A+ A QN YE+NAR +T+W + S+L DY
Sbjct: 663 SLQLGLWTYRARLWAYPTNDIPTLQNSSNSNTAPYEFNARNVLTLWGPSN----SVLHDY 718
Query: 458 GNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVE 517
K WSGL+ D+Y PR ++ K +++S+E+ F + + R L W + +YP
Sbjct: 719 AFKLWSGLVSDFYSPRWQLFLKSLVQSVENRKPFNKESFNRMVENLEEQWVVQQTIYPTV 778
Query: 518 SNGDALITSQWL 529
G A TS+++
Sbjct: 779 PVGQAYNTSKYI 790
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.319 0.136 0.432 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 542 542 0.00095 119 3 11 22 0.40 34
35 0.45 37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 8
No. of states in DFA: 634 (67 KB)
Total size of DFA: 369 KB (2178 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 45.13u 0.12s 45.25t Elapsed: 00:00:02
Total cpu time: 45.14u 0.12s 45.26t Elapsed: 00:00:02
Start: Tue May 21 05:21:55 2013 End: Tue May 21 05:21:57 2013