Citrus Sinensis ID: 027682


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220
MDSPFRSVSLILFFITTITLSTLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVAFC
ccccccHHHHHHHHHHHHHHHHHHHHHHHcccHHHHHHHHHHcccccccHHHHHHHHHHHHHHHcccccccEEEEEEEcccccccEEEEEEcccccccccccEEEEEEccHHHHHHHHHHHHHHHHcccEEEccccccccccccccccccccccccEEEEcccccEEEccccccccccccccHHHHHHHHHHHHHccccccHHHHcHHHHHHHHHHHHcc
ccccHHHHHHHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHcccccccHHHHHHHHHHHHHHHcccHcccEEEEEEccccccccEEEEEEcccccccccccEEEEEcccHHHHHHHHHHHHHHHHccEEEEcccccccHccccccccccccccccEEEEccccEEEEEEEcccccccccccHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHHHHHcc
MDSPFRSVSLILFFITTITLSTLAVAQSSTIGVQYISRLLdiqdrerappsVQLAAAYAVLQRLLPSHYSAFQFRIIskkqcggeycfilrnhpssyirgtpeiviSGVTGVEVLAGLHWYLRYWCgshiswdktggvqvasmpklgsfprvqdagvfvkrpiplnyyqnavtssytfaWWDWKRWEKEIDWMALQginlplaftgqETIWQKVFQVAFC
MDSPFRSVSLILFFITTITLSTLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVAFC
MDSPFRSVslilffittitlstlAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVAFC
******SVSLILFFITTITLSTLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVAF*
****FRS*SLILFFITTITLSTLAVAQSSTIGVQYI********************AYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLG**********FVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVAFC
MDSPFRSVSLILFFITTITLSTLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVAFC
**SPFRSVSLILFFITTITLSTLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVAFC
iiiiiiHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHoooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MDSPFRSVSLILFFITTITLSTLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVAFC
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query220 2.2.26 [Sep-21-2011]
P54802 743 Alpha-N-acetylglucosamini yes no 0.495 0.146 0.482 2e-24
>sp|P54802|ANAG_HUMAN Alpha-N-acetylglucosaminidase OS=Homo sapiens GN=NAGLU PE=1 SV=2 Back     alignment and function desciption
 Score =  112 bits (280), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 56/116 (48%), Positives = 70/116 (60%), Gaps = 7/116 (6%)

Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159
           G   + + G TGV   AGLH YLR +CG H++W    G Q+     L + P      +  
Sbjct: 71  GAARVRVRGSTGVAAAAGLHRYLRDFCGCHVAWS---GSQLRLPRPLPAVP----GELTE 123

Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215
             P    YYQN  T SY+F WWDW RWE+EIDWMAL GINL LA++GQE IWQ+V+
Sbjct: 124 ATPNRYRYYQNVCTQSYSFVWWDWARWEREIDWMALNGINLALAWSGQEAIWQRVY 179




Involved in the degradation of heparan sulfate.
Homo sapiens (taxid: 9606)
EC: 3EC: .EC: 2EC: .EC: 1EC: .EC: 5EC: 0

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query220
224121634 812 predicted protein [Populus trichocarpa] 0.968 0.262 0.751 4e-91
297733843 846 unnamed protein product [Vitis vinifera] 0.959 0.249 0.740 6e-90
225457148 813 PREDICTED: alpha-N-acetylglucosaminidase 0.959 0.259 0.740 7e-90
255540793 809 alpha-n-acetylglucosaminidase, putative 0.954 0.259 0.745 9e-89
147860882 562 hypothetical protein VITISV_031934 [Viti 0.959 0.375 0.736 9e-89
449441031 808 PREDICTED: alpha-N-acetylglucosaminidase 0.927 0.252 0.686 6e-82
356519003 812 PREDICTED: alpha-N-acetylglucosaminidase 0.945 0.256 0.635 3e-77
357458271 807 Alpha-N-acetylglucosaminidase [Medicago 0.827 0.225 0.644 1e-69
357458267 832 Alpha-N-acetylglucosaminidase [Medicago 0.827 0.218 0.644 1e-69
357458269 539 Alpha-N-acetylglucosaminidase [Medicago 0.827 0.337 0.644 2e-69
>gi|224121634|ref|XP_002318632.1| predicted protein [Populus trichocarpa] gi|222859305|gb|EEE96852.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  340 bits (871), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 163/217 (75%), Positives = 186/217 (85%), Gaps = 4/217 (1%)

Query: 1   MDSP-FRSVSLILFFITTITLSTLAVAQSSTIGVQYISRLLDIQDRERAPPSVQLAAAYA 59
           MDSP   +VSL+L       L   + AQSSTIGV YIS++L+IQDRERA PSVQ+AAA  
Sbjct: 1   MDSPPVAAVSLVLL---ASFLLFFSFAQSSTIGVGYISKILEIQDRERALPSVQVAAARG 57

Query: 60  VLQRLLPSHYSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLH 119
           VLQRLLPSH S+F+FRI+SK+QCGGE CFI++NHPS   RG P+I+ISGVTGVEVLAGLH
Sbjct: 58  VLQRLLPSHSSSFEFRIVSKEQCGGESCFIIKNHPSFTRRGAPQILISGVTGVEVLAGLH 117

Query: 120 WYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFA 179
           WYL+YWCGSHISWDKTGGVQ+ S+PKLGS PR+QD  + V+RP+P NYYQNAVTSSY+FA
Sbjct: 118 WYLKYWCGSHISWDKTGGVQLNSIPKLGSLPRLQDDSILVQRPVPWNYYQNAVTSSYSFA 177

Query: 180 WWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
           WWDWKRWEKEIDWMALQGINLPLAFTGQE IWQKVFQ
Sbjct: 178 WWDWKRWEKEIDWMALQGINLPLAFTGQEAIWQKVFQ 214




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|297733843|emb|CBI15090.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|225457148|ref|XP_002280399.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|255540793|ref|XP_002511461.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis] gi|223550576|gb|EEF52063.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|147860882|emb|CAN83148.1| hypothetical protein VITISV_031934 [Vitis vinifera] Back     alignment and taxonomy information
>gi|449441031|ref|XP_004138287.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|356519003|ref|XP_003528164.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Glycine max] Back     alignment and taxonomy information
>gi|357458271|ref|XP_003599416.1| Alpha-N-acetylglucosaminidase [Medicago truncatula] gi|355488464|gb|AES69667.1| Alpha-N-acetylglucosaminidase [Medicago truncatula] Back     alignment and taxonomy information
>gi|357458267|ref|XP_003599414.1| Alpha-N-acetylglucosaminidase [Medicago truncatula] gi|355488462|gb|AES69665.1| Alpha-N-acetylglucosaminidase [Medicago truncatula] Back     alignment and taxonomy information
>gi|357458269|ref|XP_003599415.1| Alpha-N-acetylglucosaminidase [Medicago truncatula] gi|355488463|gb|AES69666.1| Alpha-N-acetylglucosaminidase [Medicago truncatula] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query220
TAIR|locus:2173209 806 CYL1 "AT5G13690" [Arabidopsis 0.854 0.233 0.581 2.4e-60
RGD|1564228 739 Naglu "N-acetylglucosaminidase 0.490 0.146 0.508 9.3e-26
UNIPROTKB|P54802 743 NAGLU "Alpha-N-acetylglucosami 0.495 0.146 0.491 4.2e-25
UNIPROTKB|A6QM01 667 NAGLU "NAGLU protein" [Bos tau 0.490 0.161 0.5 6.8e-25
UNIPROTKB|F1S1D7 744 NAGLU "Uncharacterized protein 0.481 0.142 0.495 8.8e-25
DICTYBASE|DDB_G0291998 798 naglu "alpha-N-acetylglucosami 0.777 0.214 0.355 2.7e-24
FB|FBgn0014417 778 CG13397 [Drosophila melanogast 0.686 0.194 0.341 3.7e-16
UNIPROTKB|H9L296 601 H9L296 "Uncharacterized protei 0.25 0.091 0.563 1.4e-15
TAIR|locus:2173209 CYL1 "AT5G13690" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 618 (222.6 bits), Expect = 2.4e-60, P = 2.4e-60
 Identities = 111/191 (58%), Positives = 140/191 (73%)

Query:    28 SSTIGVQY--ISRLLDIQDRERAPPSVQLAAAYAVLQRLLPSHYSAFQFRIISKKQCGGE 85
             S T+   +  I  LLD  D      SVQ +AA  +LQRLLP+H  +F+ RIISK  CGG 
Sbjct:    18 SQTVSKHHPTIDGLLDRLDSLLPTSSVQESAAKGLLQRLLPTHSQSFELRIISKDACGGT 77

Query:    86 YCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPK 145
              CF++ N+      G PEI+I G TGVE+ +GLHWYL+Y C +H+SWDKTGG+QVAS+P+
Sbjct:    78 SCFVIENYDGPGRIG-PEILIKGTTGVEIASGLHWYLKYKCNAHVSWDKTGGIQVASVPQ 136

Query:   146 LGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFT 205
              G  PR+    +F++RP+P NYYQN VTSSY++ WW W+RWE+EIDWMALQGINLPLAFT
Sbjct:   137 PGHLPRIDSKRIFIRRPVPWNYYQNVVTSSYSYVWWGWERWEREIDWMALQGINLPLAFT 196

Query:   206 GQETIWQKVFQ 216
             GQE IWQKVF+
Sbjct:   197 GQEAIWQKVFK 207




GO:0004561 "alpha-N-acetylglucosaminidase activity" evidence=ISS
GO:0009507 "chloroplast" evidence=ISM
GO:0005773 "vacuole" evidence=IDA
RGD|1564228 Naglu "N-acetylglucosaminidase, alpha" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
UNIPROTKB|P54802 NAGLU "Alpha-N-acetylglucosaminidase" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|A6QM01 NAGLU "NAGLU protein" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|F1S1D7 NAGLU "Uncharacterized protein" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms
DICTYBASE|DDB_G0291998 naglu "alpha-N-acetylglucosaminidase" [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms
FB|FBgn0014417 CG13397 [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
UNIPROTKB|H9L296 H9L296 "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
fgenesh4_pg.C_LG_XII000747
hypothetical protein (812 aa)
(Populus trichocarpa)
Predicted Functional Partners:
gw1.XIV.2399.1
hypothetical protein (454 aa)
     0.919
gw1.XIII.847.1
hypothetical protein (437 aa)
      0.914
gw1.28.778.1
hypothetical protein (434 aa)
       0.899
gw1.133.86.1
hypothetical protein (437 aa)
       0.899

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query220
pfam05089 333 pfam05089, NAGLU, Alpha-N-acetylglucosaminidase (N 3e-29
pfam1297185 pfam12971, NAGLU_N, Alpha-N-acetylglucosaminidase 7e-22
>gnl|CDD|218423 pfam05089, NAGLU, Alpha-N-acetylglucosaminidase (NAGLU) tim-barrel domain Back     alignment and domain information
 Score =  110 bits (278), Expect = 3e-29
 Identities = 36/50 (72%), Positives = 40/50 (80%)

Query: 167 YYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQ 216
           YY N  T SYTFAWWDW+RWE+EIDWMAL GINLPLA TGQE +W +V  
Sbjct: 2   YYLNYCTFSYTFAWWDWERWEREIDWMALNGINLPLAITGQEAVWYRVLL 51


Alpha-N-acetylglucosaminidase, a lysosomal enzyme required for the stepwise degradation of heparan sulfate. Mutations on the alpha-N-acetylglucosaminidase (NAGLU) gene can lead to Mucopolysaccharidosis type IIIB (MPS IIIB; or Sanfilippo syndrome type B) characterized by neurological dysfunction but relatively mild somatic manifestations. The structure shows that the enzyme is composed of three domains. This central domain has a tim barrel fold. Length = 333

>gnl|CDD|193444 pfam12971, NAGLU_N, Alpha-N-acetylglucosaminidase (NAGLU) N-terminal domain Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 220
KOG2233 666 consensus Alpha-N-acetylglucosaminidase [Intracell 100.0
PF1297186 NAGLU_N: Alpha-N-acetylglucosaminidase (NAGLU) N-t 99.95
>KOG2233 consensus Alpha-N-acetylglucosaminidase [Intracellular trafficking, secretion, and vesicular transport] Back     alignment and domain information
Probab=100.00  E-value=1.7e-44  Score=342.76  Aligned_cols=109  Identities=58%  Similarity=1.141  Sum_probs=99.9

Q ss_pred             CeEEEEecCHHHHHHHHHHHHHHhcCceeeecCCCCcccccCCCCCCCCCCCCCceeEecccceeeecccccCCcccccc
Q 027682          102 PEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWW  181 (220)
Q Consensus       102 ~kI~I~G~s~vala~Gln~YLK~~c~~~iSW~~~~G~ql~~lp~P~~LP~v~~~~v~~~s~~~~RYy~N~cTfsYS~awW  181 (220)
                      ++|.|+|+|||++|+||||||||.|++|++|++.   ++.   +|.+||+++.+ +.+.+|  ||||||+||+||||+||
T Consensus         5 ~~il~~g~tGv~~~~~lh~ylk~~~~~~v~w~k~---~~s---lp~~lprv~~e-~~~~~p--~~YyqNvcT~SYSfaWW   75 (666)
T KOG2233|consen    5 GRILIKGTTGVEIASGLHWYLKYKCNAHVSWDKQ---VAS---LPQHLPRVDSE-IFIARP--WNYYQNVCTFSYSFAWW   75 (666)
T ss_pred             ceEEEecCchhhhhhcccHHHHHhhcCceeehhe---eee---CCCcCCCCCcc-eeeccc--hHhhcceeeeeeeeeee
Confidence            6899999999999999999999999999999963   543   56699999874 555555  99999999999999999


Q ss_pred             ChhhHhhhHhHHHhcCCCccccccchHHHHHHHHHHhc
Q 027682          182 DWKRWEKEIDWMALQGINLPLAFTGQETIWQKVFQVAF  219 (220)
Q Consensus       182 dW~rWErEIDWMAL~GINlpLA~~GqEaIW~~V~~~~~  219 (220)
                      +|+|||||||||||||||++||++|||+|||+||+.+|
T Consensus        76 ~w~qWeR~iDWmALnGinl~la~~gQEaIWqkVf~~lg  113 (666)
T KOG2233|consen   76 GWEQWEREIDWMALNGINLVLAPLGQEAIWQKVFMGLG  113 (666)
T ss_pred             chHHHHhHhhHHHHcCcceeeccchhHHHHHHHHHHcC
Confidence            99999999999999999999999999999999999876



>PF12971 NAGLU_N: Alpha-N-acetylglucosaminidase (NAGLU) N-terminal domain; InterPro: IPR024240 Alpha-N-acetylglucosaminidase, is a lysosomal enzyme required for the stepwise degradation of heparan sulphate [] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query220
4a4a_A 914 Cpgh89 (E483q, E601q), From Clostridium Perfringens 6e-13
2vc9_A 891 Family 89 Glycoside Hydrolase From Clostridium Perf 6e-13
>pdb|4A4A|A Chain A, Cpgh89 (E483q, E601q), From Clostridium Perfringens, In Complex With Its Substrate Glcnac-Alpha-1,4-Galactose Length = 914 Back     alignment and structure

Iteration: 1

Score = 70.9 bits (172), Expect = 6e-13, Method: Composition-based stats. Identities = 37/116 (31%), Positives = 66/116 (56%), Gaps = 6/116 (5%) Query: 100 GTPEIVISGVTGVEVLAGLHWYLRYWCGSHISWDKTGGVQVASMPKLGSFPRVQDAGVFV 159 G +++I G GV + +G ++YL+ +C ++S++ G + MP+ + P V + V + Sbjct: 233 GDGKVLIKGNNGVSLASGFNYYLKNYC--NVSYNPIMGSNL-KMPE--TMPSVGER-VVI 286 Query: 160 KRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQETIWQKVF 215 P Y N T SYT ++WDW ++E+ +DW A+ G+NL L GQE + ++ Sbjct: 287 DTPYEHRYALNFCTYSYTMSFWDWDQYEEFLDWCAMNGVNLVLDIIGQEEVLRRTL 342
>pdb|2VC9|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens In Complex With 2-Acetamido-1,2-Dideoxynojirmycin Length = 891 Back     alignment and structure

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query220
4a4a_A 914 Alpha-N-acetylglucosaminidase family protein; hydr 2e-37
>4a4a_A Alpha-N-acetylglucosaminidase family protein; hydrolase, 2 hydrolase, family 89 glycoside hydrolase, mucin carbohydrate-active enzyme; HET: NDG GAL; 1.90A {Clostridium perfringens} PDB: 2vcc_A 2vc9_A* 2vcb_A* 2vca_A Length = 914 Back     alignment and structure
 Score =  137 bits (345), Expect = 2e-37
 Identities = 39/144 (27%), Positives = 68/144 (47%), Gaps = 12/144 (8%)

Query: 73  QFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHISW 132
           +F    + Q  G   F + +          +++I G  GV + +G ++YL+ +C    + 
Sbjct: 212 KFIFEIRDQLNGNDVFEVSDSG------DGKVLIKGNNGVSLASGFNYYLKNYCNVSYNP 265

Query: 133 DKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDW 192
               G  +  MP+  + P V    V +  P    Y  N  T SYT ++WDW ++E+ +DW
Sbjct: 266 --IMGSNL-KMPE--TMPSV-GERVVIDTPYEHRYALNFCTYSYTMSFWDWDQYEEFLDW 319

Query: 193 MALQGINLPLAFTGQETIWQKVFQ 216
            A+ G+NL L   GQE + ++   
Sbjct: 320 CAMNGVNLVLDIIGQEEVLRRTLN 343


Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query220
4a4a_A 914 Alpha-N-acetylglucosaminidase family protein; hydr 100.0
2cho_A 716 Glucosaminidase, hexosaminiase; O-GLCNACASE, hydro 92.92
2v5d_A 737 O-GLCNACASE NAGJ; family 32 carbohydrate binding m 86.24
2v5c_A 594 O-GLCNACASE NAGJ; glycosidase, GH84, GH84C, hydrol 85.86
3gh5_A 525 HEX1, beta-hexosaminidase; beta-N-acetylhexosamini 85.52
1jak_A 512 Beta-N-acetylhexosaminidase; glycoside hydrolase, 84.03
>4a4a_A Alpha-N-acetylglucosaminidase family protein; hydrolase, 2 hydrolase, family 89 glycoside hydrolase, mucin carbohydrate-active enzyme; HET: NDG GAL; 1.90A {Clostridium perfringens} PDB: 2vcc_A 2vc9_A* 2vcb_A* 2vca_A Back     alignment and structure
Probab=100.00  E-value=1.7e-59  Score=471.30  Aligned_cols=154  Identities=26%  Similarity=0.533  Sum_probs=139.3

Q ss_pred             hHHHHHHHHHHhhcCCC-CCceEEEEeecCCCCCceeEEEecCCCCCCCCCCeEEEEecCHHHHHHHHHHHHHHhcCcee
Q 027682           52 VQLAAAYAVLQRLLPSH-YSAFQFRIISKKQCGGEYCFILRNHPSSYIRGTPEIVISGVTGVEVLAGLHWYLRYWCGSHI  130 (220)
Q Consensus        52 ~q~~aa~~ll~Rllp~~-~~~F~~~i~~~~~~~g~d~F~i~~~~~~~~~g~~kI~I~G~s~vala~Gln~YLK~~c~~~i  130 (220)
                      .-+.++++||+|+||+| +++|+|++.+.  ++|.|+|+|++.    .+  +||+|+|||+|++|+|||||||||||+||
T Consensus       192 ~~~~~~~~li~R~l~~~~~~~F~~~~~~~--~~~~d~F~~~~~----~~--gki~I~Gns~va~a~Gl~~YLk~~c~~~~  263 (914)
T 4a4a_A          192 KTLNEIKNLVGRVIGREFKDKFIFEIRDQ--LNGNDVFEVSDS----GD--GKVLIKGNNGVSLASGFNYYLKNYCNVSY  263 (914)
T ss_dssp             HHHHHHHHHHHHHHCGGGGGGEEEEECCC--BTTBCEEEEEEC----SS--SCEEEEESSHHHHHHHHHHHHHHHSCCBC
T ss_pred             cchHHHHHHHHhhcCccccceEEEEEecC--CCCCceEEEeeC----CC--CeEEEEeCCHHHHHHHHHHHHHHHhCcee
Confidence            44579999999999997 57899999753  368999999842    23  69999999999999999999999999999


Q ss_pred             e-ecCCCCcccccCCCCCCCCCCCCCceeEecccceeeecccccCCccccccChhhHhhhHhHHHhcCCCccccccchHH
Q 027682          131 S-WDKTGGVQVASMPKLGSFPRVQDAGVFVKRPIPLNYYQNAVTSSYTFAWWDWKRWEKEIDWMALQGINLPLAFTGQET  209 (220)
Q Consensus       131 S-W~~~~G~ql~~lp~P~~LP~v~~~~v~~~s~~~~RYy~N~cTfsYS~awWdW~rWErEIDWMAL~GINlpLA~~GqEa  209 (220)
                      | |+   |+|++   +|.+||.|++ +|++++|++|||||||||||||||||||+|||||||||||||||||||++|||+
T Consensus       264 s~w~---g~~l~---lP~~LP~v~~-~v~~~t~~~~RY~~N~cT~sYT~~~WdW~rWEreIDWMAL~GiNlpLa~~GqEa  336 (914)
T 4a4a_A          264 NPIM---GSNLK---MPETMPSVGE-RVVIDTPYEHRYALNFCTYSYTMSFWDWDQYEEFLDWCAMNGVNLVLDIIGQEE  336 (914)
T ss_dssp             CTTT---CCBCC---CCSSCCCCCS-CEEEECSCSEEEECCTTHHHHTTTTCCHHHHHHHHHHHHHTTCCEEECCTTHHH
T ss_pred             eccC---CCccc---CCcCCCCCCC-ceEeccCCceEEEecccCCcccccccChHHHHHHHHHHHHhCCchhhhhhhhHH
Confidence            9 66   55874   6889999997 599999999999999999999999999999999999999999999999999999


Q ss_pred             HHHHHHHHhcC
Q 027682          210 IWQKVFQVAFC  220 (220)
Q Consensus       210 IW~~V~~~~~~  220 (220)
                      ||+|||+++|.
T Consensus       337 vw~~v~~~lG~  347 (914)
T 4a4a_A          337 VLRRTLNEFGY  347 (914)
T ss_dssp             HHHHHHGGGTC
T ss_pred             HHHHHHHHcCC
Confidence            99999999984



>2cho_A Glucosaminidase, hexosaminiase; O-GLCNACASE, hydrolase, N-acetylglucosamine; 1.85A {Bacteroides thetaiotaomicron} SCOP: a.246.1.1 c.1.8.10 d.92.2.3 PDB: 2chn_A 2vvn_A* 2vvs_A* 2x0h_A* 2xm2_A* 2w4x_A* 2w66_A* 2w67_A* 2wca_A* 2xj7_A* 2xm1_A* 2j47_A* 2jiw_A* 2wzh_A* 2wzi_A* 2j4g_A* Back     alignment and structure
>2v5d_A O-GLCNACASE NAGJ; family 32 carbohydrate binding module, glycosidase, GH84, GH84C, CBM32, hydrolase, coiled coil; 3.30A {Clostridium perfringens} Back     alignment and structure
>2v5c_A O-GLCNACASE NAGJ; glycosidase, GH84, GH84C, hydrolase, coiled coil, family 84 glycoside hydrolase, carbohydrate binding module; 2.10A {Clostridium perfringens} PDB: 2cbj_A* 2cbi_A 2vur_A* 2x0y_A* 2j62_A* 2wb5_A* 2xpk_A* 2yds_A* 2ydr_A* 2ydq_A* Back     alignment and structure
>3gh5_A HEX1, beta-hexosaminidase; beta-N-acetylhexosaminidase, glycosphingolipids, paenibacill GH20, hydrolase, structural genomics, NPPSFA; HET: NAG; 1.60A {Paenibacillus SP} PDB: 3gh4_A* 3gh7_A* 3sur_A* 3sus_A* 3sut_A* 3suu_A* 3suv_A* 3suw_A* Back     alignment and structure
>1jak_A Beta-N-acetylhexosaminidase; glycoside hydrolase, family 20, substrate-assisted catalysis, alpha/beta barrel, isofagomin inhibitor complex; HET: IFG; 1.75A {Streptomyces plicatus} SCOP: c.1.8.6 d.92.2.1 PDB: 1hp4_A* 1hp5_A* 1m01_A* 1m04_A* 1m03_A* Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00