Citrus Sinensis ID: 008173


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440-------450-------460-------470-------480-------490-------500-------510-------520-------530-------540-------550-------560-------570-----
MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVFDH
ccccccccccccHHHHHHHHHHHHHHHHHHHHHccccccccccccccHHHHHHccccEEEEccccccccccccccccccccccccHHHHHHHHHHHHHHHHHcccccccccccccccccccccHHHHHHHHHHHHHHHHccccccEEEEEccccccccccccHHHHHHHHccccccEEEEEcccccccccccccccccccccccccccccccccccccHHHHHccHHHHHHcccccEEEEEEccccccccHHHHHHHHHHccccccccHHHHHHHHHHcccccccHHHHHHHHHHHHccccccccccccccccEEcccccccccccccccccccccccccHHHHHHccccccccccccccHHHHHHHHHHHHHccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccHHHHHHHHHHHHHHHHHHHHHHcccccccHHHHHHHHHHHcccHHHHHHHHHHHHHccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccHHHHHHHHHHHHHHHccccccccccccccHHHHHHHHHHHHccccccccc
ccccccccccccHHHHHHHHHHHHHHHHHHHHcccccEEccccccccHHHHHHccccEEEEccccccccccccccccEEcccccHHHHHHHHHHHHHHHHHHcccccEEEcccccccccccccHHHHHHHHHHHHHHHHHccccHEEEEHHHHHccccccccHHHHHHHHccccccEEEEEccccccccEcccHccccccEEEEEEccccccccccccHHHHHccHHHHHHcccccEEEEEccHHHHcccHHHHHHHHHHccccccccHHHHHHHHHHHHcccccHHHHHHHHHHHHHHcccccccccccccEEEEccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccHHHHHHHHHHHHHHHHHHHHHHccccccccHHHHHHHHHHcccHHHHHHHHHHHHHHHHHcccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccHHHHHHHHHHHHHHHHHccccccccccccHHHHHHHHHHHHccccccccc
msnlhgwggplpqswLDQQLVLQKKILVRIYElgmnpvlpafsgnvpaalqnvfpsakitqlgnwfsvksdprwcctylldatdpLFIEIGRAFIEQQLKEygrtshiyncdtfdentppvdspeyiSSLGAAIYsgmqsgdsDAVWLMQGWlfsydpfwrppqmkallnsvplgkLVVLDLFAevkpiwstskqfygVPYIWCMLHNfagniemygildsiafgpveartsenttmvgvgmsmegieqnPVVYDLMSEMAFQHEKVDVKAWINQYSvrrygrsvpaIQDAWNVLYHTVynctdgatdknrdvivafpdvdpsiisvtegkyqnygkpvskeavlksetssydhphlwysTSEVIRALELFIASgnelsasntyrydlIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGllachdgfllgpWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIEslesgdgfrlkDWRREWIKLTNdwqngrnvypvesngdaLITSQWLYNKYlqgtgvfdh
msnlhgwggplpqSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPdvdpsiisvtegkyqnygkpVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLtndwqngrnvyPVESNGDALITSQWLYNKYLQGTGVFDH
MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVFDH
******WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT*****PEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPV****V******SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK**********QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGV***
MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQG******
MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ**********YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVFDH
*****GWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTG****
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVFDH
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query575 2.2.26 [Sep-21-2011]
P54802743 Alpha-N-acetylglucosamini yes no 0.916 0.709 0.378 1e-111
>sp|P54802|ANAG_HUMAN Alpha-N-acetylglucosaminidase OS=Homo sapiens GN=NAGLU PE=1 SV=2 Back     alignment and function desciption
 Score =  403 bits (1035), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 216/570 (37%), Positives = 327/570 (57%), Gaps = 43/570 (7%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ ++L ++   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G+W        + C++LL   DP+F  IG  F+ + +KE+G T HIY  DTF+E  PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 320

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++   A+Y  M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 440

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  ++ RRYG S P    AW +L  +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 500

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ +     NR  +V  P                          L+  TS      +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 529

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 417
           WY+ S+V  A  L + S   L+ S  +RYDL+DLTRQA+ +  +  +     AY   +  
Sbjct: 530 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 589

Query: 418 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
            + +       EL+  +D +LA    FLLG WLE A+  A +E +   YE N+R Q+T+W
Sbjct: 590 SLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 649

Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
                 E ++L DY NK  +GL+ +YY PR  ++ + +++S+  G  F+   + +   +L
Sbjct: 650 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALVDSVAQGIPFQQHQFDKNVFQL 704

Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKY 566
              +   +  YP +  GD +  ++ ++ KY
Sbjct: 705 EQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 734




Involved in the degradation of heparan sulfate.
Homo sapiens (taxid: 9606)
EC: 3EC: .EC: 2EC: .EC: 1EC: .EC: 5EC: 0

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query575
225457148 813 PREDICTED: alpha-N-acetylglucosaminidase 0.998 0.706 0.811 0.0
255540793 809 alpha-n-acetylglucosaminidase, putative 1.0 0.710 0.810 0.0
224121634 812 predicted protein [Populus trichocarpa] 1.0 0.708 0.793 0.0
356519003 812 PREDICTED: alpha-N-acetylglucosaminidase 0.984 0.697 0.800 0.0
297733843 846 unnamed protein product [Vitis vinifera] 0.998 0.678 0.752 0.0
449441031 808 PREDICTED: alpha-N-acetylglucosaminidase 0.984 0.700 0.767 0.0
326515664 829 predicted protein [Hordeum vulgare subsp 0.986 0.683 0.680 0.0
222629680 1129 hypothetical protein OsJ_16433 [Oryza sa 0.986 0.502 0.682 0.0
326519955 829 predicted protein [Hordeum vulgare subsp 0.986 0.683 0.680 0.0
218195716 829 hypothetical protein OsI_17702 [Oryza sa 0.986 0.683 0.680 0.0
>gi|225457148|ref|XP_002280399.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Vitis vinifera] Back     alignment and taxonomy information
 Score =  995 bits (2572), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 469/578 (81%), Positives = 518/578 (89%), Gaps = 4/578 (0%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLHGWGGPLPQSWLDQQL+LQKKIL R+YELGM PVLPAFSGNVPAAL+ +FPSAKIT
Sbjct: 235 MGNLHGWGGPLPQSWLDQQLLLQKKILARMYELGMTPVLPAFSGNVPAALKYIFPSAKIT 294

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LGNWF+V  +PRWCCTYLLDATDPLFIEIG+AFI+QQLKEYGRT HIYNCDTFDENTPP
Sbjct: 295 RLGNWFTVGGNPRWCCTYLLDATDPLFIEIGKAFIQQQLKEYGRTGHIYNCDTFDENTPP 354

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           VD PEYISSLGAAI+ GMQSGDS+A+WLMQGWLFSYDPFWRPPQMKALL+SVP+G+LVVL
Sbjct: 355 VDDPEYISSLGAAIFRGMQSGDSNAIWLMQGWLFSYDPFWRPPQMKALLHSVPMGRLVVL 414

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DLFAEVKPIW TS+QFYGVPYIWCMLHNFAGNIEMYGILD++A GPVEARTSEN+TMVGV
Sbjct: 415 DLFAEVKPIWITSEQFYGVPYIWCMLHNFAGNIEMYGILDAVASGPVEARTSENSTMVGV 474

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           GMSMEGIEQNPVVYDLMSEMAFQH KVDVK WI  YS RRYG+SVP IQDAWN+LYHTVY
Sbjct: 475 GMSMEGIEQNPVVYDLMSEMAFQHSKVDVKVWIALYSTRRYGKSVPEIQDAWNILYHTVY 534

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTE----GKYQNYGKPVSKEAVLKSETSSYDHPH 356
           NCTDG+ DKNRDVIVAFPD+DPS I   +    G Y  YGK VS+  VLK  T+S++ PH
Sbjct: 535 NCTDGSYDKNRDVIVAFPDIDPSFIPTPKLSMPGGYHRYGKSVSRRTVLKEITNSFEQPH 594

Query: 357 LWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDA 416
           LWYSTSEV  AL LFIASG +L  SNTYRYDL+DLTRQALAKYAN+LFL +IEAYQLND 
Sbjct: 595 LWYSTSEVKDALGLFIASGGQLLGSNTYRYDLVDLTRQALAKYANQLFLEVIEAYQLNDV 654

Query: 417 HGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 476
            G    S++FLELVEDMD LLACHDGFLLGPWLESAKQLAQ+E+QE Q+EWNARTQITMW
Sbjct: 655 RGAACHSQKFLELVEDMDTLLACHDGFLLGPWLESAKQLAQDEQQEIQFEWNARTQITMW 714

Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536
           FDNT++EASLLRDYGNKYWSGLLRDYYGPRAAIYFKY++ESLE+G+ F LKDWRREWIKL
Sbjct: 715 FDNTEDEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYLLESLETGNEFALKDWRREWIKL 774

Query: 537 TNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVFD 574
           TNDWQN RN YPV S+G+A+ TS+ LYNKYLQ   ++D
Sbjct: 775 TNDWQNSRNAYPVRSSGNAIDTSRRLYNKYLQDPEIYD 812




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255540793|ref|XP_002511461.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis] gi|223550576|gb|EEF52063.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|224121634|ref|XP_002318632.1| predicted protein [Populus trichocarpa] gi|222859305|gb|EEE96852.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|356519003|ref|XP_003528164.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Glycine max] Back     alignment and taxonomy information
>gi|297733843|emb|CBI15090.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|449441031|ref|XP_004138287.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|326515664|dbj|BAK07078.1| predicted protein [Hordeum vulgare subsp. vulgare] Back     alignment and taxonomy information
>gi|222629680|gb|EEE61812.1| hypothetical protein OsJ_16433 [Oryza sativa Japonica Group] Back     alignment and taxonomy information
>gi|326519955|dbj|BAK03902.1| predicted protein [Hordeum vulgare subsp. vulgare] Back     alignment and taxonomy information
>gi|218195716|gb|EEC78143.1| hypothetical protein OsI_17702 [Oryza sativa Indica Group] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query575
TAIR|locus:2173209806 CYL1 "AT5G13690" [Arabidopsis 0.982 0.700 0.6 3.6e-192
RGD|1564228739 Naglu "N-acetylglucosaminidase 0.556 0.433 0.481 3.8e-112
UNIPROTKB|P54802743 NAGLU "Alpha-N-acetylglucosami 0.551 0.426 0.458 3.1e-108
UNIPROTKB|H9L296601 H9L296 "Uncharacterized protei 0.598 0.572 0.436 8.2e-106
FB|FBgn0014417778 CG13397 [Drosophila melanogast 0.518 0.383 0.392 3.4e-92
UNIPROTKB|F1S1D7744 NAGLU "Uncharacterized protein 0.709 0.548 0.413 2e-81
UNIPROTKB|A6QM01667 NAGLU "NAGLU protein" [Bos tau 0.732 0.631 0.399 1.4e-80
DICTYBASE|DDB_G0291998798 naglu "alpha-N-acetylglucosami 0.662 0.477 0.366 1e-68
TAIR|locus:2173209 CYL1 "AT5G13690" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1862 (660.5 bits), Expect = 3.6e-192, P = 3.6e-192
 Identities = 345/575 (60%), Positives = 433/575 (75%)

Query:     1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
             M NLH WGGPL ++WLD QL+LQK+IL R+ + GM PVLP+FSGNVP+AL+ ++P A IT
Sbjct:   231 MGNLHAWGGPLSKNWLDDQLLLQKQILSRMLKFGMTPVLPSFSGNVPSALRKIYPEANIT 290

Query:    61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             +L NW +V  D RWCCTYLL+ +DPLFIEIG AFI+QQ +EYG  ++IYNCDTF+ENTPP
Sbjct:   291 RLDNWNTVDGDSRWCCTYLLNPSDPLFIEIGEAFIKQQTEEYGEITNIYNCDTFNENTPP 350

Query:   121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
                PEYISSLGAA+Y  M  G+ +AVWLMQGWLFS D  FW+PPQ+KALL+SVP GK++V
Sbjct:   351 TSEPEYISSLGAAVYKAMSKGNKNAVWLMQGWLFSSDSKFWKPPQLKALLHSVPFGKMIV 410

Query:   180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
             LDL+AEVKPIW+ S QFYG PYIWCMLHNF GNIEMYG LDSI+ GPV+AR S+N+TMVG
Sbjct:   411 LDLYAEVKPIWNKSAQFYGTPYIWCMLHNFGGNIEMYGALDSISSGPVDARVSKNSTMVG 470

Query:   240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTV 299
             VGM MEGIEQNPVVY+L SEMAF+ EKVDV+ W+  Y+ RRY +    I+ AW +LYHTV
Sbjct:   471 VGMCMEGIEQNPVVYELTSEMAFRDEKVDVQKWLKSYARRRYMKENHQIEAAWEILYHTV 530

Query:   300 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQ--NY----GKPVSKEAVL-KSETSSY 352
             YNCTDG  D N D IV  PD DPS  SV +   Q  +Y    G   +K  VL + +T+  
Sbjct:   531 YNCTDGIADHNTDFIVKLPDWDPSS-SVQDDLKQKDSYMISTGPYETKRRVLFQDKTADL 589

Query:   353 DHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQ 412
                HLWYST EVI+AL+LF+ +G++LS S TYRYD++DLTRQ L+K AN+++   + A+ 
Sbjct:   590 PKAHLWYSTKEVIQALKLFLEAGDDLSRSLTYRYDMVDLTRQVLSKLANQVYTEAVTAFV 649

Query:   413 LNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQ 472
               D   + QLS +FLEL++DMD LLA  D  LLG WLESAK+LA+N ++ KQYEWNARTQ
Sbjct:   650 KKDIGSLGQLSEKFLELIKDMDVLLASDDNCLLGTWLESAKKLAKNGDERKQYEWNARTQ 709

Query:   473 ITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRRE 532
             +TMW+D+     S L DY NK+WSGLL DYY PRA +YF  M++SL     F+++ WRRE
Sbjct:   710 VTMWYDSNDVNQSKLHDYANKFWSGLLEDYYLPRARLYFNEMLKSLRDKKIFKVEKWRRE 769

Query:   533 WIKLTNDWQNGRN-VYPVESNGDALITSQWLYNKY 566
             WI +++ WQ   + VYPV++ GDAL  S+ L +KY
Sbjct:   770 WIMMSHKWQQSSSEVYPVKAKGDALAISRHLLSKY 804




GO:0004561 "alpha-N-acetylglucosaminidase activity" evidence=ISS
GO:0009507 "chloroplast" evidence=ISM
GO:0005773 "vacuole" evidence=IDA
RGD|1564228 Naglu "N-acetylglucosaminidase, alpha" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
UNIPROTKB|P54802 NAGLU "Alpha-N-acetylglucosaminidase" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|H9L296 H9L296 "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
FB|FBgn0014417 CG13397 [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
UNIPROTKB|F1S1D7 NAGLU "Uncharacterized protein" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms
UNIPROTKB|A6QM01 NAGLU "NAGLU protein" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
DICTYBASE|DDB_G0291998 naglu "alpha-N-acetylglucosaminidase" [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

Prediction LevelEC numberConfidence of Prediction
3rd Layer3.2.1LOW CONFIDENCE prediction!
3rd Layer3.2.1.50LOW CONFIDENCE prediction!

Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
fgenesh4_pg.C_LG_XII000747
hypothetical protein (812 aa)
(Populus trichocarpa)
Predicted Functional Partners:
gw1.XIV.2399.1
hypothetical protein (454 aa)
     0.919
gw1.XIII.847.1
hypothetical protein (437 aa)
      0.914
gw1.28.778.1
hypothetical protein (434 aa)
       0.899
gw1.133.86.1
hypothetical protein (437 aa)
       0.899

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query575
pfam05089333 pfam05089, NAGLU, Alpha-N-acetylglucosaminidase (N 1e-133
pfam12972269 pfam12972, NAGLU_C, Alpha-N-acetylglucosaminidase 1e-104
>gnl|CDD|218423 pfam05089, NAGLU, Alpha-N-acetylglucosaminidase (NAGLU) tim-barrel domain Back     alignment and domain information
 Score =  390 bits (1005), Expect = e-133
 Identities = 141/266 (53%), Positives = 185/266 (69%), Gaps = 7/266 (2%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL GWGGPLPQSW+D+QL LQKKIL R+ ELGM PVLPAF+G+VP AL+  +P A IT
Sbjct: 75  MGNLEGWGGPLPQSWIDKQLALQKKILDRMRELGMTPVLPAFAGHVPRALKRKYPEANIT 134

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           + G W        +   Y LD TDPLF +I   F+E+Q K YG T HIY+ D F+E  PP
Sbjct: 135 RQGRWGG------FHRPYFLDPTDPLFAKIAARFLEEQKKLYG-TDHIYSADPFNEGGPP 187

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
              P+Y+++ G AIY  MQ  D DAVW++QGWLF    FW+   ++A L++VP G+L+VL
Sbjct: 188 SGDPDYLAAAGQAIYKSMQKADPDAVWVLQGWLFVDSDFWQANPIEAFLSAVPKGRLLVL 247

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DLF+E  P W  ++ FYG P+IWCMLHNF GN  +YG LD +A G   AR +  + +VGV
Sbjct: 248 DLFSERYPQWKRTESFYGKPWIWCMLHNFGGNTGLYGNLDLVASGIEAARANPGSNLVGV 307

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEK 266
           G++ EGIEQNPVVY+L+ E+A++ + 
Sbjct: 308 GITPEGIEQNPVVYELLLELAWREDP 333


Alpha-N-acetylglucosaminidase, a lysosomal enzyme required for the stepwise degradation of heparan sulfate. Mutations on the alpha-N-acetylglucosaminidase (NAGLU) gene can lead to Mucopolysaccharidosis type IIIB (MPS IIIB; or Sanfilippo syndrome type B) characterized by neurological dysfunction but relatively mild somatic manifestations. The structure shows that the enzyme is composed of three domains. This central domain has a tim barrel fold. Length = 333

>gnl|CDD|221877 pfam12972, NAGLU_C, Alpha-N-acetylglucosaminidase (NAGLU) C-terminal domain Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 575
KOG2233666 consensus Alpha-N-acetylglucosaminidase [Intracell 100.0
PF05089333 NAGLU: Alpha-N-acetylglucosaminidase (NAGLU) tim-b 100.0
PF12972267 NAGLU_C: Alpha-N-acetylglucosaminidase (NAGLU) C-t 100.0
PF02449374 Glyco_hydro_42: Beta-galactosidase; InterPro: IPR0 93.77
PF00150281 Cellulase: Cellulase (glycosyl hydrolase family 5) 92.14
PF12891239 Glyco_hydro_44: Glycoside hydrolase family 44; Int 80.86
>KOG2233 consensus Alpha-N-acetylglucosaminidase [Intracellular trafficking, secretion, and vesicular transport] Back     alignment and domain information
Probab=100.00  E-value=2.1e-169  Score=1321.61  Aligned_cols=530  Identities=45%  Similarity=0.861  Sum_probs=514.3

Q ss_pred             CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCeeccCCCCCCCchhhHhhCCCCceeccCCCCCCCCCCcccccccc
Q 008173            1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLL   80 (575)
Q Consensus         1 MgNi~gwgGPLp~~wi~~q~~LQ~kIl~RmreLGM~PVLPgF~G~VP~~~~~~~P~a~i~~~~~W~gf~~~~~~~~~~~L   80 (575)
                      ||||++||||||++|+..|.-||||||+|||+||||||||+|+||||.+++++||++++++++.|+.|.  ++|||.++|
T Consensus       134 MGNl~awgGpLs~aw~~~ql~LqkrIidrm~~lGmTpvLPaFaG~VP~al~rlfPesnf~rl~rWn~f~--s~~~C~l~v  211 (666)
T KOG2233|consen  134 MGNLHAWGGPLSPAWMLNQLLLQKRIIDRMLELGMTPVLPAFAGHVPDALERLFPESNFTRLPRWNNFT--SRYSCMLLV  211 (666)
T ss_pred             hcCccccCCCCCHHHHHHHHHHHHHHHHHHHHcCCCccchhhccccHHHHHHhCchhceeeccccCCCC--cceeeeEEc
Confidence            999999999999999999999999999999999999999999999999999999999999999999997  699999999


Q ss_pred             CCCChHHHHHHHHHHHHHHHHhCCCCcccccCCCCCCCCCCCChhHHHHHHHHHHHHHhccCCCceEEEecccCCCCCCC
Q 008173           81 DATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFW  160 (575)
Q Consensus        81 dP~DplF~~i~~~F~~~q~~~yG~~~h~Y~~DpFnE~~pp~~d~~~L~~~~~~v~~am~~~dP~AvWvmQgW~F~~~~fW  160 (575)
                      .|+||||.+||.+|+++|+|+||.++|+|+||||||+.||+++|+|+++.+.+||+||+++||+||||||||+|.+|+||
T Consensus       212 ~P~dplF~eIgs~Flr~~~kefG~~tniy~~DpFNE~~Pp~sepey~~staaAiyesm~kvdknaVWllQgWlF~~d~FW  291 (666)
T KOG2233|consen  212 SPFDPLFQEIGSTFLRHQIKEFGGVTNIYSADPFNEILPPESEPEYVKSTAAAIYESMKKVDKNAVWLLQGWLFTYDPFW  291 (666)
T ss_pred             cCCcchHHHHHHHHHHHHHHHhCCcccccccCcccccCCCCCChHHHHHHHHHHHHHHhccCcceEEeeecceeecCCCC
Confidence            99999999999999999999999888999999999999999999999999999999999999999999999999999999


Q ss_pred             CchhHHHhhhCCCCCCEEEEecCCCcccccccccCCCCCCceeeeccCCCCccccccchhhhhcChHHhhhCCCCceEEe
Q 008173          161 RPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV  240 (575)
Q Consensus       161 ~~~~~~a~L~~Vp~~~mliLDL~~E~~p~W~~~~~f~G~pwIWc~L~NFGGn~gl~G~l~~i~~~~~~a~~~~~~~m~Gi  240 (575)
                      ++.++||||++||.||||||||++|..|+|++|.+|+|||||||||||||||.+|+|.++.|+.||..|+..+||+|||+
T Consensus       292 ~~~~ikafL~avP~GrllVLDLyaEv~P~~~~Tasf~GqpfIWCmLHNFGGn~~lfGal~~InsG~e~Ar~~~nStlVGt  371 (666)
T KOG2233|consen  292 PPWQIKAFLSAVPRGRLLVLDLYAEVVPAWQMTASFQGQPFIWCMLHNFGGNRELFGALQKINSGPELARMKPNSTLVGT  371 (666)
T ss_pred             ChHHHHHHHhcCCcCcEEEEehhhhhhhHHHhhhhhcCCceeEEEeeccCCcHhhhhhHHHhccCHHHHhcCCCCceeec
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             eeCccccccChhHHHHHHhhcCCCCCCCHHHHHHHHhhcccCCCChHHHHHHHHHHhcccCCCCCCCCCC-CcceeccCC
Q 008173          241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKN-RDVIVAFPD  319 (575)
Q Consensus       241 G~tpEGi~~NpvvYel~~d~aW~~~~id~~~W~~~Ya~rRYG~~~~~~~~AW~iL~~tvY~~~~~~~~~~-~~~~~~~P~  319 (575)
                      ||+||||+||||||+||.|++|+++++|++.|+++|+.+|||+.++.+++||.+|.+|||||+++...++ ...+.+||+
T Consensus       372 Gm~~EgI~QN~VvYsf~~e~~wsde~ldl~~Wlksys~sRY~~~~~~~eaaW~lL~~tvYn~~~~~~~~~~~~~l~rRp~  451 (666)
T KOG2233|consen  372 GMSMEGIDQNYVVYSFMIERGWSDEPLDLNNWLKSYSESRYGVDFKVAEAAWTLLAGTVYNCPGKWATRGFSYFLYRRPS  451 (666)
T ss_pred             ccCccccccchhhHHHHHHhccCCCCCCHHHHHHHHHHhhhccchHHHHHHHHHHhhhhcCCCcccccCCCceeeEeccc
Confidence            9999999999999999999999999999999999999999999999999999999999999998776665 666777776


Q ss_pred             CCCCcccccccccccCCCCccccccccccCCCCCCCCCcCCHHHHHHHHHHHHhccccCCCC-CcccchHHHHHHHHHHH
Q 008173          320 VDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS-NTYRYDLIDLTRQALAK  398 (575)
Q Consensus       320 l~p~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~Yd~~~l~~A~~lll~~~~~l~~~-~~y~yDLvDvtRQvL~n  398 (575)
                      ++                               .++.+||+.+++..||++||.+.+.+..+ ++|||||||+|||+|+.
T Consensus       452 f~-------------------------------~k~~~Wyn~sev~~a~rllL~a~~~l~~e~~~fr~DlvDltRq~lqe  500 (666)
T KOG2233|consen  452 FQ-------------------------------RKTEYWYNVSEVFSAWRLLLTALVHLLGEHPLFRYDLVDLTRQMLQE  500 (666)
T ss_pred             cc-------------------------------CCceeeecHHHHHHHHHHHHhhhhhhccCchhhHhHHHHHHHHHHHH
Confidence            54                               14679999999999999999988777666 99999999999999999


Q ss_pred             HHHHHHHHHHHHHHhcChHHHHHHHHHHHHHHHHHHHHhccCCCCChHHHHHHHHhhCCCHHHHHHHHHHhhhcccccCC
Q 008173          399 YANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD  478 (575)
Q Consensus       399 ~~~~~~~~~~~Ay~~kd~~~~~~~~~~~l~li~dlD~LL~t~~~flLg~Wl~~Ar~~g~~~~ek~~yE~NAR~~IT~WG~  478 (575)
                      .++++|.++.+||..||...+.+.|.++++|+.|||.+|+++.+||||.||++||+.|.+.+|++.||.|||+|||+|||
T Consensus       501 lanq~y~e~~sAflkkd~~sl~~~~~~llelf~dle~~lasd~nfLlg~WleqAk~~A~n~~er~~fe~nar~qlT~Wgp  580 (666)
T KOG2233|consen  501 LANQAYLEARSAFLKKDKQSLGALSEKLLELFQDLESYLASDDNFLLGQWLEQAKSAAPNSEERQVFEVNARDQLTVWGP  580 (666)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccCcchhHHHHHHHHHhhCCChhhhhhccccccceEEEECC
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             CCCcCCCCCcccccccccchhhhhhHHHHHHHHHHHHHHHhcCCCCChHHHHHHH-HHHhhhhhcCCCcCCCCCCCcHHH
Q 008173          479 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW-IKLTNDWQNGRNVYPVESNGDALI  557 (575)
Q Consensus       479 ~~~~~~~~L~DYA~k~W~GL~~dyY~pRW~~f~~~l~~~l~~~~~f~~~~~~~~~-~~~E~~W~~~~~~y~~~p~gd~~~  557 (575)
                      .     |+|+|||||+||||++|||.|||++|++.+.+++..++||+.++|..+| ..+|.+++.++.+||++|.||+++
T Consensus       581 ~-----gqilDYA~kqwaGLvsdYY~pRwrlf~e~il~slhd~~pFn~t~f~~~v~~~~e~~fs~s~~vYPv~p~Gdt~~  655 (666)
T KOG2233|consen  581 T-----GQILDYANKQWAGLVSDYYGPRWRLFLEWILESLHDGKPFNHTEFSISVFRDVERPFSISNIVYPVKPKGDTLA  655 (666)
T ss_pred             C-----cchhhHHHHHHHHhhhhcccHHHHHHHHHHHHHHhcCCCccccchhhHHHHhhhccccccccccccCCCccHHH
Confidence            5     5899999999999999999999999999999999999999999999998 599999999999999999999999


Q ss_pred             HHHHHHHHHhc
Q 008173          558 TSQWLYNKYLQ  568 (575)
Q Consensus       558 ~a~~l~~ky~~  568 (575)
                      +++.|+.||++
T Consensus       656 is~~if~ky~P  666 (666)
T KOG2233|consen  656 ISRHIFSKYFP  666 (666)
T ss_pred             HHHHHHHhcCC
Confidence            99999999963



>PF05089 NAGLU: Alpha-N-acetylglucosaminidase (NAGLU) tim-barrel domain; InterPro: IPR024733 Alpha-N-acetylglucosaminidase is a lysosomal enzyme that is required for the stepwise degradation of heparan sulphate [] Back     alignment and domain information
>PF12972 NAGLU_C: Alpha-N-acetylglucosaminidase (NAGLU) C-terminal domain; InterPro: IPR024732 Alpha-N-acetylglucosaminidase is a lysosomal enzyme required for the stepwise degradation of heparan sulphate [] Back     alignment and domain information
>PF02449 Glyco_hydro_42: Beta-galactosidase; InterPro: IPR013529 O-Glycosyl hydrolases 3 Back     alignment and domain information
>PF00150 Cellulase: Cellulase (glycosyl hydrolase family 5); InterPro: IPR001547 O-Glycosyl hydrolases 3 Back     alignment and domain information
>PF12891 Glyco_hydro_44: Glycoside hydrolase family 44; InterPro: IPR024745 This is a family of putative bacterial glycoside hydrolases Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query575
2vc9_A891 Family 89 Glycoside Hydrolase From Clostridium Perf 2e-67
4a4a_A914 Cpgh89 (E483q, E601q), From Clostridium Perfringens 2e-66
>pdb|2VC9|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens In Complex With 2-Acetamido-1,2-Dideoxynojirmycin Length = 891 Back     alignment and structure

Iteration: 1

Score = 253 bits (646), Expect = 2e-67, Method: Compositional matrix adjust. Identities = 166/564 (29%), Positives = 277/564 (49%), Gaps = 49/564 (8%) Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60 M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + A+ Sbjct: 344 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNQEAQTI 403 Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118 G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT Sbjct: 404 SQGGWCGFDR-PDMLKTYVNEGEADYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 462 Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178 +D+ + I + M D+DAVW++Q W + P L + + Sbjct: 463 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 511 Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMV 238 VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A + + + MV Sbjct: 512 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLA-TEIPKALANSEHMV 569 Query: 239 GVGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298 G+G++ E I NP+ Y+L+ +MA+ ++++ + W Y RRYG++ I +AWN++ T Sbjct: 570 GIGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILEAWNIILDT 629 Query: 299 VYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLW 358 Y K D + SII+ G +G +KS S++ H + Sbjct: 630 AYK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIV 666 Query: 359 YSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHG 418 Y SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY + Sbjct: 667 YDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEK 726 Query: 419 VFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMW 476 +S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T W Sbjct: 727 FKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTW 786 Query: 477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 536 + L+DY N+ WSGL DYY R + + L+ G K +W K+ Sbjct: 787 GSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQAELDGG----AKAPNIDWFKM 842 Query: 537 TNDWQNGRN----VYPVESNGDAL 556 DW N ++ +YP E++ + L Sbjct: 843 EYDWVNKKSDTDKLYPTEASNENL 866
>pdb|4A4A|A Chain A, Cpgh89 (E483q, E601q), From Clostridium Perfringens, In Complex With Its Substrate Glcnac-Alpha-1,4-Galactose Length = 914 Back     alignment and structure

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query575
4a4a_A914 Alpha-N-acetylglucosaminidase family protein; hydr 0.0
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 5e-06
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 6e-05
>4a4a_A Alpha-N-acetylglucosaminidase family protein; hydrolase, 2 hydrolase, family 89 glycoside hydrolase, mucin carbohydrate-active enzyme; HET: NDG GAL; 1.90A {Clostridium perfringens} PDB: 2vcc_A 2vc9_A* 2vcb_A* 2vca_A Length = 914 Back     alignment and structure
 Score =  621 bits (1603), Expect = 0.0
 Identities = 153/569 (26%), Positives = 260/569 (45%), Gaps = 38/569 (6%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +     A+  
Sbjct: 367 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNQEAQTI 426

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W      P    TY+ +     F ++   F E+Q + +G  ++ Y  D F +    
Sbjct: 427 SQGGWCGF-DRPDMLKTYVNEGEADYFQKVADVFYEKQKEVFGDVTNFYGVDPFHQGGNT 485

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
                    +   I + M   D+DAVW++Q W          P    L       + +VL
Sbjct: 486 --GDLDNGKIYEIIQNKMIEHDNDAVWVIQNW-------QGNPSNNKLEGLTKKDQAMVL 536

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGV 240
           DLF+EV P W+   +   +P+IW MLHNF G + M    + +A    +A    +  MVG+
Sbjct: 537 DLFSEVSPDWNRL-EERDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALA-NSEHMVGI 594

Query: 241 GMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVY 300
           G++ + I  NP+ Y+L+ +MA+  ++++ + W   Y  RRYG++   I +AWN++  T Y
Sbjct: 595 GITPQAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILEAWNIILDTAY 654

Query: 301 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 360
              +       + I+      P                           S++ H  + Y 
Sbjct: 655 KKRNDYYQGAAESIINAR---PGF--------------------GIKSASTWGHSKIVYD 691

Query: 361 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 420
            SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   +     
Sbjct: 692 KSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEKFK 751

Query: 421 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN--EEQEKQYEWNARTQITMWFD 478
            +S +FLEL++  + +L+    FL+G W+E A+ + ++  +  +  +E+NAR  +T W  
Sbjct: 752 FVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTWGS 811

Query: 479 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTN 538
               +   L+DY N+ WSGL  DYY  R   +   +   L+ G      DW +      N
Sbjct: 812 RNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQAELDGGAKAPNIDWFKMEYDWVN 871

Query: 539 DWQNGRNVYPVESNGDALIT-SQWLYNKY 566
              +   +YP E++ + L   ++     Y
Sbjct: 872 KKSDTDKLYPTEASNENLGELAKIAMESY 900


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query575
4a4a_A914 Alpha-N-acetylglucosaminidase family protein; hydr 100.0
1kwg_A645 Beta-galactosidase; TIM barrel, glycoside hydrolas 94.64
3n9k_A399 Glucan 1,3-beta-glucosidase; aromatic entranceway/ 94.52
3tty_A675 Beta-GAL, beta-galactosidase; TIM barrel, glycosid 94.12
3qr3_A340 Endoglucanase EG-II; TIM barrel, hydrolase; 2.05A 91.97
1w91_A503 Beta-xylosidase; MAD, seMet, tetramer, hydrolase; 91.34
1h4p_A408 Glucan 1,3-beta-glucosidase I/II; hydrolase, gluca 90.66
3icg_A515 Endoglucanase D; cellulase, xylanase, carbohydrate 89.46
1qnr_A344 Endo-1,4-B-D-mannanase; hydrolase, anomalous scatt 89.23
2osx_A481 Endoglycoceramidase II; (alpha/beta)8 (TIM) barrel 89.14
4ekj_A500 Beta-xylosidase; TIM-barrel fold, hemicellulase, h 88.77
3ndz_A345 Endoglucanase D; cellotriose, xylanase, carbohydra 88.29
1uhv_A500 Beta-xylosidase; family 39 glycoside hydrolase, xy 87.96
1vjz_A341 Endoglucanase; TM1752, structural genomics, JCSG, 87.5
3l55_A353 B-1,4-endoglucanase/cellulase; putative beta-1,4-e 86.61
3nco_A320 Endoglucanase fncel5A; fncel5A, F. nodosum RT17-B1 85.74
2whl_A294 Beta-mannanase, baman5; glycoside hydrolase, hydro 82.69
1h1n_A305 Endo type cellulase ENGI; hydrolase, glycosyl hydr 81.6
3aof_A317 Endoglucanase; glycosyl hydrolase family 5, cellul 81.3
7a3h_A303 Endoglucanase; hydrolase, cellulose degradation, g 80.84
2epl_X627 N-acetyl-beta-D-glucosaminidase; glycoside hydrola 80.46
1rh9_A373 Endo-beta-mannanase; endo-beta-mannase, retaining, 80.28
>4a4a_A Alpha-N-acetylglucosaminidase family protein; hydrolase, 2 hydrolase, family 89 glycoside hydrolase, mucin carbohydrate-active enzyme; HET: NDG GAL; 1.90A {Clostridium perfringens} PDB: 2vcc_A 2vc9_A* 2vcb_A* 2vca_A Back     alignment and structure
Probab=100.00  E-value=1.2e-171  Score=1464.66  Aligned_cols=526  Identities=30%  Similarity=0.602  Sum_probs=494.5

Q ss_pred             CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCeeccCCCCCCCchhhHhhCCCCceeccCCCCCCCCCCcccccccc
Q 008173            1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLL   80 (575)
Q Consensus         1 MgNi~gwgGPLp~~wi~~q~~LQ~kIl~RmreLGM~PVLPgF~G~VP~~~~~~~P~a~i~~~~~W~gf~~~~~~~~~~~L   80 (575)
                      ||||+|||||||++||++|.+||||||+|||||||+||||||+||||++||++||+|+|+++|.|+||.+ +.+|+|++|
T Consensus       367 MgNl~~wgGPLp~~w~~~q~~Lq~kIl~RmrelGM~PVLPaF~G~VP~~~~~~~P~a~i~~~~~W~gf~~-~~~~~~~~l  445 (914)
T 4a4a_A          367 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNQEAQTISQGGWCGFDR-PDMLKTYVN  445 (914)
T ss_dssp             TTSCCSTTCCCCTTHHHHHHHHHHHHHHHHHHHTCEEEEECCSCEECTTHHHHSTTCCEECCCEETTEEC-CEEECSSCC
T ss_pred             hcCccccCCCCCHHHHHHHHHHHHHHHHHHHHcCCeecCCCcCCCCChHHHhhCCCCeeecCCCCCCCCC-chhcccccC
Confidence            9999999999999999999999999999999999999999999999999999999999999999999985 556779999


Q ss_pred             CCCChHHHHHHHHHHHHHHHHhCCCCcccccCCCCCCCCCCCChhHHHHHHHHHHHHHhccCCCceEEEecccCCCCCCC
Q 008173           81 DATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFW  160 (575)
Q Consensus        81 dP~DplF~~i~~~F~~~q~~~yG~~~h~Y~~DpFnE~~pp~~d~~~L~~~~~~v~~am~~~dP~AvWvmQgW~F~~~~fW  160 (575)
                      +|+||||++||++||+||+++||+++|||+||||||++|+.+.  .|++++++||++|+++||+||||||||+|.     
T Consensus       446 ~p~dplF~~i~~~F~~~q~~~yG~~~h~Y~~D~FnE~~~~~~~--~l~~~~~~v~~am~~~dp~AvWv~QgW~~~-----  518 (914)
T 4a4a_A          446 EGEADYFQKVADVFYEKQKEVFGDVTNFYGVDPFHQGGNTGDL--DNGKIYEIIQNKMIEHDNDAVWVIQNWQGN-----  518 (914)
T ss_dssp             TTSCCHHHHHHHHHHHHHHHHHCSCCSEEECCTTTTSCCCTTC--CHHHHHHHHHHHHHHHCTTCEEEEEEBTTB-----
T ss_pred             CCCChHHHHHHHHHHHHHHHHhCCcccccccCccccCCCCCCc--CHHHHHHHHHHHHHHhCCCCEEEEcccCCC-----
Confidence            9999999999999999999999976899999999999876553  389999999999999999999999999974     


Q ss_pred             CchhHHHhhhCCCC-CCEEEEecCCCcccccccccCCCCCCceeeeccCCCCccccccchhhhhcChHHhhhCCCCceEE
Q 008173          161 RPPQMKALLNSVPL-GKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG  239 (575)
Q Consensus       161 ~~~~~~a~L~~Vp~-~~mliLDL~~E~~p~W~~~~~f~G~pwIWc~L~NFGGn~gl~G~l~~i~~~~~~a~~~~~~~m~G  239 (575)
                       +.  ++||++||+ ||||||||+||..|+|+++++ +|||||||||||||||+||+|+++.|+++|.+|+.. +++|+|
T Consensus       519 -~~--~~~L~~vp~~~~mlvLDL~se~~p~w~~~~~-~G~pwiwc~L~NFGGn~gl~G~~~~~~~~~~~a~~~-~~~m~G  593 (914)
T 4a4a_A          519 -PS--NNKLEGLTKKDQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALAN-SEHMVG  593 (914)
T ss_dssp             -SC--HHHHTTCSCGGGEEEEETTTTTSCCCHHHHT-TTCCEEEEECCCSTTCCSSCCCHHHHHHHHHHHHHT-CSCEEE
T ss_pred             -Ch--HHHHhCCCCCCCEEEEEcccccccccchhhh-CCCceEEeccccCCCCCCCcccHHHHhhhHHHHhhc-CCCceE
Confidence             33  479999997 999999999999999999988 899999999999999999999999999999999875 689999


Q ss_pred             eeeCccccccChhHHHHHHhhcCCCCCCCHHHHHHHHhhcccCCCChHHHHHHHHHHhcccCCCCCCCC-CCCcceeccC
Q 008173          240 VGMSMEGIEQNPVVYDLMSEMAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATD-KNRDVIVAFP  318 (575)
Q Consensus       240 iG~tpEGi~~NpvvYel~~d~aW~~~~id~~~W~~~Ya~rRYG~~~~~~~~AW~iL~~tvY~~~~~~~~-~~~~~~~~~P  318 (575)
                      ||+|||||||||||||||+||+|++++||+++||++||+||||+.++++++||++|++|||||+..... .+++++++||
T Consensus       594 ~G~tpEgie~NpvvYeL~~e~aW~~~~id~~~W~~~ya~~RYG~~~~~~~~AW~~L~~tvY~~~~~~~~~~~~s~~~~rP  673 (914)
T 4a4a_A          594 IGITPQAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILEAWNIILDTAYKKRNDYYQGAAESIINARP  673 (914)
T ss_dssp             EEECCSCSCSCHHHHHHHHHGGGCSSCCCHHHHHHHHHHHHHTCCCHHHHHHHHHHHHTTTSCCCSCCCSSCCCGGGSCS
T ss_pred             EeechhhcccCHHHHHHHHhcccCCCCCCHHHHHHHHHHhhcCCCCHHHHHHHHHHHhhhcCCCCccccCCCcCeeeeCC
Confidence            999999999999999999999999999999999999999999999999999999999999999864333 3568888889


Q ss_pred             CCCCCcccccccccccCCCCccccccccccCCCCCCCCCcCCHHHHHHHHHHHHhccccCCCCCcccchHHHHHHHHHHH
Q 008173          319 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAK  398 (575)
Q Consensus       319 ~l~p~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~Yd~~~l~~A~~lll~~~~~l~~~~~y~yDLvDvtRQvL~n  398 (575)
                      +|+..                        ..++|+++.++|||++|++||++||+++++|+++++|||||||||||||+|
T Consensus       674 ~l~~~------------------------~~~~w~~~~~~Yd~~~~~~A~~lll~~~~~l~~s~~y~yDLvDvtRQ~l~n  729 (914)
T 4a4a_A          674 GFGIK------------------------SASTWGHSKIVYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLAN  729 (914)
T ss_dssp             CSSCC------------------------CSSTTCCCCCCSCHHHHHHHHHHHHHHHHHHTTCHHHHHHHHHHHHHHHHH
T ss_pred             CCCcc------------------------CccccCcccccCCHHHHHHHHHHHHHhhhhcCCCchhhHhHHHHHHHHHHH
Confidence            87633                        346789999999999999999999999999999999999999999999999


Q ss_pred             HHHHHHHHHHHHHHhcChHHHHHHHHHHHHHHHHHHHHhccCCCCChHHHHHHHHhhCCCHH--HHHHHHHHhhhccccc
Q 008173          399 YANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE--QEKQYEWNARTQITMW  476 (575)
Q Consensus       399 ~~~~~~~~~~~Ay~~kd~~~~~~~~~~~l~li~dlD~LL~t~~~flLg~Wl~~Ar~~g~~~~--ek~~yE~NAR~~IT~W  476 (575)
                      +++.+|.++++||++||.+.|++++++||+||.+||+||+||++|+||+||++||++|.+++  ||++||+|||+|||+|
T Consensus       730 ~~~~~~~~~~~ay~~~d~~~~~~~~~~~l~ll~~~D~lL~t~~~flLg~Wl~~Ar~~a~~~~~~e~~~yE~NAR~qIT~W  809 (914)
T 4a4a_A          730 SAQEYYEVMCNAYNNGNGEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTW  809 (914)
T ss_dssp             HHHHHHHHHHHHHHTTCHHHHHHHHHHHHHHHHHHHHHHTTCGGGBHHHHHHHHHHSSTTCCHHHHHHHHHHHHHHTTTS
T ss_pred             HHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhCCCCcccHHHHHHHHHHhcCCCCHHHHHHHHHHhHheEeec
Confidence            99999999999999999999999999999999999999999999999999999999998865  8999999999999999


Q ss_pred             CCCCCcCCCCCcccccccccchhhhhhHHHHHHHHHHHHHHHhcCCCCChHHHHHHHHHHhhhhhcCC----CcCCCCCC
Q 008173          477 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGR----NVYPVESN  552 (575)
Q Consensus       477 G~~~~~~~~~L~DYA~k~W~GL~~dyY~pRW~~f~~~l~~~l~~~~~f~~~~~~~~~~~~E~~W~~~~----~~y~~~p~  552 (575)
                      ||+..+++|+|+|||||+|+|||+|||+|||++||++|.++++++++|+..+    |+++|++|++++    +.||++|.
T Consensus       810 G~~~~~~~g~l~DYA~k~W~GL~~dyY~pRW~~f~~~l~~~l~~~~~~~~~~----~~~~e~~W~~~~~~~~~~yp~~~~  885 (914)
T 4a4a_A          810 GSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQAELDGGAKAPNID----WFKMEYDWVNKKSDTDKLYPTEAS  885 (914)
T ss_dssp             SSHHHHTTTSCTTTTCCCCBTHHHHTHHHHHHHHHHHHHHHHTTCSCCCCCC----HHHHHHHHHTCCTTTSCCCCCSCC
T ss_pred             cCCCCCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHhcCCCCChHH----HHHHHHHHhccccccCCcCCCCCC
Confidence            9975555678999999999999999999999999999999999999998765    469999999964    89999999


Q ss_pred             -CcHHHHHHHHHHHHh
Q 008173          553 -GDALITSQWLYNKYL  567 (575)
Q Consensus       553 -gd~~~~a~~l~~ky~  567 (575)
                       ||++++|++||.||.
T Consensus       886 g~d~~~~a~~~~~ky~  901 (914)
T 4a4a_A          886 NENLGELAKIAMESYS  901 (914)
T ss_dssp             CCCHHHHHHHHHHHSC
T ss_pred             CCCHHHHHHHHHHHcc
Confidence             599999999999994



>1kwg_A Beta-galactosidase; TIM barrel, glycoside hydrolase family 42, trimer, hydrolase; 1.60A {Thermus thermophilus} SCOP: b.71.1.1 c.1.8.1 c.23.16.5 PDB: 1kwk_A* Back     alignment and structure
>3n9k_A Glucan 1,3-beta-glucosidase; aromatic entranceway/clamp, exoglucanase, glycoside hydrolas protein-carbohydrate interaction; HET: BGC; 1.70A {Candida albicans} SCOP: c.1.8.3 PDB: 2pc8_A* 2pb1_A* 2pbo_A 3o6a_A 2pf0_A 1cz1_A 1eqc_A* 1eqp_A Back     alignment and structure
>3tty_A Beta-GAL, beta-galactosidase; TIM barrel, glycoside hydrolase, hydrolase; HET: GLA; 2.25A {Bacillus circulans subsp} PDB: 3tts_A* Back     alignment and structure
>3qr3_A Endoglucanase EG-II; TIM barrel, hydrolase; 2.05A {Hypocrea jecorina} Back     alignment and structure
>1w91_A Beta-xylosidase; MAD, seMet, tetramer, hydrolase; 2.2A {Geobacillus stearothermophilus} SCOP: b.71.1.2 c.1.8.3 PDB: 2bs9_A 2bfg_A* Back     alignment and structure
>1h4p_A Glucan 1,3-beta-glucosidase I/II; hydrolase, glucan degradation, hydrolyase, glycosidase; HET: NAG BMA MAN NDG; 1.75A {Saccharomyces cerevisiae} SCOP: c.1.8.3 Back     alignment and structure
>1qnr_A Endo-1,4-B-D-mannanase; hydrolase, anomalous scattering; HET: NAG MAB; 1.4A {Trichoderma reesei} SCOP: c.1.8.3 PDB: 1qno_A* 1qnq_A* 1qnp_A* 1qns_A* Back     alignment and structure
>2osx_A Endoglycoceramidase II; (alpha/beta)8 (TIM) barrel, hydrolase; HET: SIA GAL BGC 16C; 1.10A {Rhodococcus SP} PDB: 2oyk_A* 2osw_A* 2oyl_A* 2oym_A* 2osy_A* Back     alignment and structure
>4ekj_A Beta-xylosidase; TIM-barrel fold, hemicellulase, hydrolase; 2.50A {Caulobacter vibrioides} Back     alignment and structure
>3ndz_A Endoglucanase D; cellotriose, xylanase, carbohydrate binding D glucanase, hydrolase; HET: CT3; 2.08A {Clostridium cellulovorans} PDB: 3ndy_A* Back     alignment and structure
>1uhv_A Beta-xylosidase; family 39 glycoside hydrolase, xylan, xylose, covalent glycosyl-enzyme intermediate; 2.10A {Thermoanaerobacterium saccharolyticum} SCOP: b.71.1.2 c.1.8.3 PDB: 1px8_A Back     alignment and structure
>1vjz_A Endoglucanase; TM1752, structural genomics, JCSG, PSI, prote structure initiative, joint center for structural genomics; 2.05A {Thermotoga maritima} SCOP: c.1.8.3 Back     alignment and structure
>3nco_A Endoglucanase fncel5A; fncel5A, F. nodosum RT17-B1, hydrolase; 1.50A {Fervidobacterium nodosum} PDB: 3rjx_A 3rjy_A* Back     alignment and structure
>2whl_A Beta-mannanase, baman5; glycoside hydrolase, hydrolase; HET: MAN BMA; 1.40A {Bacillus agaradhaerens} PDB: 2whj_A Back     alignment and structure
>1h1n_A Endo type cellulase ENGI; hydrolase, glycosyl hydrolase, family 5, subtype, thermophilic, thermophIle, endoglucanase; 1.12A {Thermoascus aurantiacus} SCOP: c.1.8.3 PDB: 1gzj_A Back     alignment and structure
>3aof_A Endoglucanase; glycosyl hydrolase family 5, cellulase, biofuel, hyperthermo hydrolase; HET: BMA; 1.29A {Thermotoga maritima} PDB: 3amg_A* 3amc_A 3amd_A 3mmu_A 3mmw_A 3azs_A* 3azr_A* 3azt_A* Back     alignment and structure
>7a3h_A Endoglucanase; hydrolase, cellulose degradation, glycoside H family 5, michaelis complex, SKEW-BOAT, distortion; 0.95A {Bacillus agaradhaerens} SCOP: c.1.8.3 PDB: 1h2j_A* 1hf6_A* 1ocq_A* 1w3k_A* 1h11_A* 4a3h_A* 5a3h_A* 6a3h_A* 1w3l_A 8a3h_A* 2v38_A* 1qhz_A 1qi0_A* 1e5j_A* 1qi2_A* 1h5v_A* 1a3h_A 2a3h_A* 3a3h_A* 1lf1_A Back     alignment and structure
>2epl_X N-acetyl-beta-D-glucosaminidase; glycoside hydrolase, family 20, GCNA, hydro; 1.40A {Streptococcus gordonii} PDB: 2epk_X 2epm_X 2epn_A* 2epo_A Back     alignment and structure
>1rh9_A Endo-beta-mannanase; endo-beta-mannase, retaining, glycoside hydrolase family 5; 1.50A {Solanum lycopersicum} SCOP: c.1.8.3 Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query575
d1kwga2393 A4 beta-galactosidase {Thermus thermophilus [TaxId 93.95
d1ceoa_340 Endoglucanase CelC {Clostridium thermocellum [TaxI 93.39
d1qnra_344 Beta-mannanase {Trichoderma reesei [TaxId: 51453]} 91.83
d1vjza_325 Endoglucanase homologue TM1752 {Thermotoga maritim 87.04
d2pb1a1394 Exo-beta-(1,3)-glucanase {Yeast (Candida albicans) 85.24
d1ecea_358 Endocellulase E1 {Acidothermus cellulolyticus [Tax 83.37
>d1kwga2 c.1.8.1 (A:1-393) A4 beta-galactosidase {Thermus thermophilus [TaxId: 274]} Back     information, alignment and structure
class: Alpha and beta proteins (a/b)
fold: TIM beta/alpha-barrel
superfamily: (Trans)glycosidases
family: Amylase, catalytic domain
domain: A4 beta-galactosidase
species: Thermus thermophilus [TaxId: 274]
Probab=93.95  E-value=0.1  Score=48.57  Aligned_cols=94  Identities=15%  Similarity=0.143  Sum_probs=74.3

Q ss_pred             HHHHHHHHHHHcCCeeccCCCCCCCchhhHhhCCCCceeccCCCCCCCCCCccccccccCCCChHHHHHHHHHHHHHHHH
Q 008173           22 LQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKE  101 (575)
Q Consensus        22 LQ~kIl~RmreLGM~PVLPgF~G~VP~~~~~~~P~a~i~~~~~W~gf~~~~~~~~~~~LdP~DplF~~i~~~F~~~q~~~  101 (575)
                      -..++|+.+++.||+||+=-|....|..+.+.+|+.......  +.+   ..+....+.++.+|.+.+..+.|++...+.
T Consensus        52 ~~d~~i~~~~~~Gi~~iv~l~~~~~P~w~~~~~~~~~~~~~~--g~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  126 (393)
T d1kwga2          52 WLDEAIATLAAEGLKVVLGTPTATPPKWLVDRYPEILPVDRE--GRR---RRFGGRRHYCFSSPVYREEARRIVTLLAER  126 (393)
T ss_dssp             HHHHHHHHHHTTTCEEEEECSTTSCCHHHHHHCGGGSCBCTT--SCB---CCSSSSCCCCTTCHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHCCCEEEEEcCCCCCchhhhccCcccccccCC--Ccc---cccccccccCCCCHHHHHHHHHHHHHHHHH
Confidence            446799999999999998888888899999999876444422  112   123346678899999999999999999999


Q ss_pred             hCCCCcccccCCCCCCCCC
Q 008173          102 YGRTSHIYNCDTFDENTPP  120 (575)
Q Consensus       102 yG~~~h~Y~~DpFnE~~pp  120 (575)
                      |+....+...+..||..-.
T Consensus       127 ~~~~~~~~~~~~~ne~~~~  145 (393)
T d1kwga2         127 YGGLEAVAGFQTDNEYGCH  145 (393)
T ss_dssp             HTTCTTEEEEECSSSTTTT
T ss_pred             hcCCceEEEEeeccccccc
Confidence            9865567788888987543



>d1ceoa_ c.1.8.3 (A:) Endoglucanase CelC {Clostridium thermocellum [TaxId: 1515]} Back     information, alignment and structure
>d1qnra_ c.1.8.3 (A:) Beta-mannanase {Trichoderma reesei [TaxId: 51453]} Back     information, alignment and structure
>d1vjza_ c.1.8.3 (A:) Endoglucanase homologue TM1752 {Thermotoga maritima [TaxId: 2336]} Back     information, alignment and structure
>d2pb1a1 c.1.8.3 (A:7-400) Exo-beta-(1,3)-glucanase {Yeast (Candida albicans) [TaxId: 5476]} Back     information, alignment and structure
>d1ecea_ c.1.8.3 (A:) Endocellulase E1 {Acidothermus cellulolyticus [TaxId: 28049]} Back     information, alignment and structure