Citrus Sinensis ID: 009153


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440-------450-------460-------470-------480-------490-------500-------510-------520-------530-------540--
MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVFDH
ccccccccccccHHHHHHHHHHHHHHHHHHHHHccccccccccccccHHHHHHccccEEEEccccccccccccccccccccccccHHHHHHHHHHHHHHHHHcccccccccccccccccccccHHHHHHHHHHHHHHHHccccccEEEEEccccccccccccHHHHHHHHccccccEEEEEcccccccccccccccccccccccccccccccccccccHHHHHccHHHHHcccccHHHHHHHHHHcccccccHHHHHHHHHHHHHHccccccccccccccEEcccccccccccccccccccccccccHHHHHcccccccccccccccHHHHHHHHHHHHHccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccHHHHHHHHHHHHHHHHHHHHHHcccccccHHHHHHHHHHHcccHHHHHHHHHHHHHccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccHHHHHHHHHHHHHHHccccccccccccccHHHHHHHHHHHHccccccccc
ccccccccccccHHHHHHHHHHHHHHHHHHHHcccccEEccccccccHHHHHHccccEEEEccccccccccccccccEEcccccHHHHHHHHHHHHHHHHHHcccccEEEcccccccccccccHHHHHHHHHHHHHHHHHccccHEEEEHHHHHccccccccHHHHHHHHccccccEEEEEccccccccEcccHccccccEEEEEEccccccccccccHHHHcccHHHHHHccccHHHHHHHHHHHHcccccHHHHHHHHHHHHHHcccccccccccccEEEEcccccccHcccccccccccccccccccHccccccccccccccccHHHHHHHHHHHHHHHHHHcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccHHHHHHHHHHHHHHHHHHHHHHccccccccHHHHHHHHHHcccHHHHHHHHHHHHHHHHHcccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccHHHHHHHHHHHHHHHHHccccccccccccHHHHHHHHHHHHccccccccc
msnlhgwggplpqswLDQQLVLQKKILVRIYElgmnpvlpafsgnvpaalqnvfpsakitqlgnwfsvksdprwcctylldatdpLFIEIGRAFIEQQLKEygrtshiyncdtfdentppvdspeyiSSLGAAIYsgmqsgdsDAVWLMQGWlfsydpfwrppqmkallnsvplgkLVVLDLFAevkpiwstskqfygVPYIWCMLHNfagniemygildsiafgpveartsenttMAWINQYSvrrygrsvpaIQDAWNVLYHTVynctdgatdknrdvivafpdvdpsiisvtegkyqnygkpvskeavlksetssydhphlwysTSEVIRALELFIASgnelsasntyrydlIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGllachdgfllgpWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIEslesgdgfrlkDWRREWIKLTNdwqngrnvypvesngdaLITSQWLYNKYlqgtgvfdh
msnlhgwggplpqSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPdvdpsiisvtegkyqnygkpVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLtndwqngrnvyPVESNGDALITSQWLYNKYLQGTGVFDH
MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVFDH
******WGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENT*****PEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPV****V******SYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAK**********QYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGV***
MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQG******
MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQ**********YEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVFDH
*****GWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTG****
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGRNVYPVESNGDALITSQWLYNKYLQGTGVFDH
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query542 2.2.26 [Sep-21-2011]
P54802743 Alpha-N-acetylglucosamini yes no 0.911 0.664 0.349 9e-97
>sp|P54802|ANAG_HUMAN Alpha-N-acetylglucosaminidase OS=Homo sapiens GN=NAGLU PE=1 SV=2 Back     alignment and function desciption
 Score =  354 bits (909), Expect = 9e-97,   Method: Compositional matrix adjust.
 Identities = 199/570 (34%), Positives = 303/570 (53%), Gaps = 76/570 (13%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ ++L ++   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G+W        + C++LL   DP+F  IG  F+ + +KE+G T HIY  DTF+E  PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 320

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++   A+Y  M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM-- 237
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TM  
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 440

Query: 238 --------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHT 265
                                           AW+  ++ RRYG S P    AW +L  +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 500

Query: 266 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 324
           VYNC+ +     NR  +V  P                          L+  TS      +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 529

Query: 325 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAH 384
           WY+ S+V  A  L + S   L+ S  +RYDL+DLTRQA+ +  +  +     AY   +  
Sbjct: 530 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 589

Query: 385 GVFQLSRRFL-ELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMW 443
            + +       EL+  +D +LA    FLLG WLE A+  A +E +   YE N+R Q+T+W
Sbjct: 590 SLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQARAAAVSEAEADFYEQNSRYQLTLW 649

Query: 444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKL 503
                 E ++L DY NK  +GL+ +YY PR  ++ + +++S+  G  F+   + +   +L
Sbjct: 650 ----GPEGNIL-DYANKQLAGLVANYYTPRWRLFLEALVDSVAQGIPFQQHQFDKNVFQL 704

Query: 504 TNDWQNGRNVYPVESNGDALITSQWLYNKY 533
              +   +  YP +  GD +  ++ ++ KY
Sbjct: 705 EQAFVLSKQRYPSQPRGDTVDLAKKIFLKY 734




Involved in the degradation of heparan sulfate.
Homo sapiens (taxid: 9606)
EC: 3EC: .EC: 2EC: .EC: 1EC: .EC: 5EC: 0

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query542
255540793 809 alpha-n-acetylglucosaminidase, putative 1.0 0.669 0.755 0.0
225457148 813 PREDICTED: alpha-N-acetylglucosaminidase 0.998 0.665 0.756 0.0
224121634 812 predicted protein [Populus trichocarpa] 1.0 0.667 0.739 0.0
356519003 812 PREDICTED: alpha-N-acetylglucosaminidase 0.983 0.656 0.746 0.0
297733843 846 unnamed protein product [Vitis vinifera] 0.998 0.639 0.702 0.0
449441031 808 PREDICTED: alpha-N-acetylglucosaminidase 0.983 0.659 0.711 0.0
326515664 829 predicted protein [Hordeum vulgare subsp 0.985 0.644 0.633 0.0
222629680 1129 hypothetical protein OsJ_16433 [Oryza sa 0.985 0.472 0.635 0.0
326519955 829 predicted protein [Hordeum vulgare subsp 0.985 0.644 0.633 0.0
218195716 829 hypothetical protein OsI_17702 [Oryza sa 0.985 0.644 0.635 0.0
>gi|255540793|ref|XP_002511461.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis] gi|223550576|gb|EEF52063.1| alpha-n-acetylglucosaminidase, putative [Ricinus communis] Back     alignment and taxonomy information
 Score =  911 bits (2355), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 435/576 (75%), Positives = 489/576 (84%), Gaps = 34/576 (5%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH WGG LPQSW  QQL+LQKKIL R+YELGMNPVLPAFSGNVPAAL+N+FPSAKI 
Sbjct: 234 MGNLHRWGGSLPQSWFFQQLILQKKILARMYELGMNPVLPAFSGNVPAALRNIFPSAKIA 293

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           +LGNWFSVKSD RWCCTYLLDATDPLFIEIGRAFIEQQL+EYG TSHIYNCDTFDENTPP
Sbjct: 294 RLGNWFSVKSDLRWCCTYLLDATDPLFIEIGRAFIEQQLEEYGSTSHIYNCDTFDENTPP 353

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
           VD P+YIS+LGAA++ GMQSGD+DAVWLMQGWLFSYDPFWRPPQMKALL+SVP+G+LVVL
Sbjct: 354 VDDPKYISALGAAVFKGMQSGDNDAVWLMQGWLFSYDPFWRPPQMKALLHSVPVGRLVVL 413

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--- 237
           DLFAEVKPIW++S QFYGVPYIWCMLHNFAGN+EMYGILDSIA GPVEARTSEN+TM   
Sbjct: 414 DLFAEVKPIWTSSYQFYGVPYIWCMLHNFAGNVEMYGILDSIASGPVEARTSENSTMVGV 473

Query: 238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVY 267
                                         AWIN YS RRYGRSVP+IQDAW++LYHTVY
Sbjct: 474 GMSMEGIEQNPVVYDLMSEMAFQHKKVDVKAWINLYSTRRYGRSVPSIQDAWDILYHTVY 533

Query: 268 NCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYS 327
           NCTDGA DKNRDVIVAFPDV+P   SV++ ++   GKPVS+ AVLK  + SYDHPHLWYS
Sbjct: 534 NCTDGAYDKNRDVIVAFPDVNPFYFSVSQKRHHLNGKPVSRRAVLKENSDSYDHPHLWYS 593

Query: 328 TSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVF 387
           TSEV+ ALELFI SG ELS S+TY YDL+DLTRQALAKY NELFL IIE+YQ ND +GV 
Sbjct: 594 TSEVLHALELFITSGEELSGSSTYSYDLVDLTRQALAKYGNELFLKIIESYQANDGNGVA 653

Query: 388 QLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFDNT 447
             S++FL+LVEDMD LL CH+GFLLGPWLESAKQLAQ++EQEKQ+EWNARTQITMWFDNT
Sbjct: 654 SRSQKFLDLVEDMDTLLGCHEGFLLGPWLESAKQLAQDQEQEKQFEWNARTQITMWFDNT 713

Query: 448 QEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDW 507
           ++EASLL DYGNKYWSGLL+DYYGPRAAIYFKY+I+SLE+G  F LKDWRREWIKLTN+W
Sbjct: 714 EDEASLLHDYGNKYWSGLLQDYYGPRAAIYFKYLIKSLENGKVFPLKDWRREWIKLTNEW 773

Query: 508 QNGRNVYPVESNGDALITSQWLYNKYLQGTG-VFDH 542
           Q  RN +PV+SNG+ALI S+WLY+KYL+     +DH
Sbjct: 774 QRSRNKFPVKSNGNALIISKWLYDKYLRNPDTTYDH 809




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|225457148|ref|XP_002280399.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|224121634|ref|XP_002318632.1| predicted protein [Populus trichocarpa] gi|222859305|gb|EEE96852.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|356519003|ref|XP_003528164.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Glycine max] Back     alignment and taxonomy information
>gi|297733843|emb|CBI15090.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|449441031|ref|XP_004138287.1| PREDICTED: alpha-N-acetylglucosaminidase-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|326515664|dbj|BAK07078.1| predicted protein [Hordeum vulgare subsp. vulgare] Back     alignment and taxonomy information
>gi|222629680|gb|EEE61812.1| hypothetical protein OsJ_16433 [Oryza sativa Japonica Group] Back     alignment and taxonomy information
>gi|326519955|dbj|BAK03902.1| predicted protein [Hordeum vulgare subsp. vulgare] Back     alignment and taxonomy information
>gi|218195716|gb|EEC78143.1| hypothetical protein OsI_17702 [Oryza sativa Indica Group] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query542
TAIR|locus:2173209806 CYL1 "AT5G13690" [Arabidopsis 0.442 0.297 0.680 4.5e-176
UNIPROTKB|F1S1D7744 NAGLU "Uncharacterized protein 0.433 0.315 0.502 2.4e-91
RGD|1564228739 Naglu "N-acetylglucosaminidase 0.966 0.709 0.372 3.5e-91
UNIPROTKB|P54802743 NAGLU "Alpha-N-acetylglucosami 0.963 0.702 0.352 1.6e-86
UNIPROTKB|H9L296601 H9L296 "Uncharacterized protei 0.915 0.825 0.362 9.4e-82
FB|FBgn0014417778 CG13397 [Drosophila melanogast 0.957 0.667 0.327 1e-77
UNIPROTKB|A6QM01667 NAGLU "NAGLU protein" [Bos tau 0.431 0.350 0.504 4.1e-77
DICTYBASE|DDB_G0291998798 naglu "alpha-N-acetylglucosami 0.946 0.642 0.306 1.3e-70
TAIR|locus:2173209 CYL1 "AT5G13690" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 941 (336.3 bits), Expect = 4.5e-176, Sum P(2) = 4.5e-176
 Identities = 164/241 (68%), Positives = 199/241 (82%)

Query:     1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
             M NLH WGGPL ++WLD QL+LQK+IL R+ + GM PVLP+FSGNVP+AL+ ++P A IT
Sbjct:   231 MGNLHAWGGPLSKNWLDDQLLLQKQILSRMLKFGMTPVLPSFSGNVPSALRKIYPEANIT 290

Query:    61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             +L NW +V  D RWCCTYLL+ +DPLFIEIG AFI+QQ +EYG  ++IYNCDTF+ENTPP
Sbjct:   291 RLDNWNTVDGDSRWCCTYLLNPSDPLFIEIGEAFIKQQTEEYGEITNIYNCDTFNENTPP 350

Query:   121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
                PEYISSLGAA+Y  M  G+ +AVWLMQGWLFS D  FW+PPQ+KALL+SVP GK++V
Sbjct:   351 TSEPEYISSLGAAVYKAMSKGNKNAVWLMQGWLFSSDSKFWKPPQLKALLHSVPFGKMIV 410

Query:   180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMAW 239
             LDL+AEVKPIW+ S QFYG PYIWCMLHNF GNIEMYG LDSI+ GPV+AR S+N+TM  
Sbjct:   411 LDLYAEVKPIWNKSAQFYGTPYIWCMLHNFGGNIEMYGALDSISSGPVDARVSKNSTMVG 470

Query:   240 I 240
             +
Sbjct:   471 V 471


GO:0004561 "alpha-N-acetylglucosaminidase activity" evidence=ISS
GO:0009507 "chloroplast" evidence=ISM
GO:0005773 "vacuole" evidence=IDA
UNIPROTKB|F1S1D7 NAGLU "Uncharacterized protein" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms
RGD|1564228 Naglu "N-acetylglucosaminidase, alpha" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
UNIPROTKB|P54802 NAGLU "Alpha-N-acetylglucosaminidase" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|H9L296 H9L296 "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
FB|FBgn0014417 CG13397 [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
UNIPROTKB|A6QM01 NAGLU "NAGLU protein" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
DICTYBASE|DDB_G0291998 naglu "alpha-N-acetylglucosaminidase" [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

Prediction LevelEC numberConfidence of Prediction
3rd Layer3.2.1LOW CONFIDENCE prediction!
3rd Layer3.2.1.50LOW CONFIDENCE prediction!

Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
fgenesh4_pg.C_LG_XII000747
hypothetical protein (812 aa)
(Populus trichocarpa)
Predicted Functional Partners:
gw1.XIV.2399.1
hypothetical protein (454 aa)
     0.919
gw1.XIII.847.1
hypothetical protein (437 aa)
      0.914
gw1.28.778.1
hypothetical protein (434 aa)
       0.899
gw1.133.86.1
hypothetical protein (437 aa)
       0.899

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query542
pfam05089333 pfam05089, NAGLU, Alpha-N-acetylglucosaminidase (N 1e-114
pfam12972269 pfam12972, NAGLU_C, Alpha-N-acetylglucosaminidase 1e-103
>gnl|CDD|218423 pfam05089, NAGLU, Alpha-N-acetylglucosaminidase (NAGLU) tim-barrel domain Back     alignment and domain information
 Score =  340 bits (875), Expect = e-114
 Identities = 124/238 (52%), Positives = 160/238 (67%), Gaps = 7/238 (2%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NL GWGGPLPQSW+D+QL LQKKIL R+ ELGM PVLPAF+G+VP AL+  +P A IT
Sbjct: 75  MGNLEGWGGPLPQSWIDKQLALQKKILDRMRELGMTPVLPAFAGHVPRALKRKYPEANIT 134

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           + G W        +   Y LD TDPLF +I   F+E+Q K YG T HIY+ D F+E  PP
Sbjct: 135 RQGRWGG------FHRPYFLDPTDPLFAKIAARFLEEQKKLYG-TDHIYSADPFNEGGPP 187

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
              P+Y+++ G AIY  MQ  D DAVW++QGWLF    FW+   ++A L++VP G+L+VL
Sbjct: 188 SGDPDYLAAAGQAIYKSMQKADPDAVWVLQGWLFVDSDFWQANPIEAFLSAVPKGRLLVL 247

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMA 238
           DLF+E  P W  ++ FYG P+IWCMLHNF GN  +YG LD +A G   AR +  + + 
Sbjct: 248 DLFSERYPQWKRTESFYGKPWIWCMLHNFGGNTGLYGNLDLVASGIEAARANPGSNLV 305


Alpha-N-acetylglucosaminidase, a lysosomal enzyme required for the stepwise degradation of heparan sulfate. Mutations on the alpha-N-acetylglucosaminidase (NAGLU) gene can lead to Mucopolysaccharidosis type IIIB (MPS IIIB; or Sanfilippo syndrome type B) characterized by neurological dysfunction but relatively mild somatic manifestations. The structure shows that the enzyme is composed of three domains. This central domain has a tim barrel fold. Length = 333

>gnl|CDD|221877 pfam12972, NAGLU_C, Alpha-N-acetylglucosaminidase (NAGLU) C-terminal domain Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 542
KOG2233666 consensus Alpha-N-acetylglucosaminidase [Intracell 100.0
PF05089333 NAGLU: Alpha-N-acetylglucosaminidase (NAGLU) tim-b 100.0
PF12972267 NAGLU_C: Alpha-N-acetylglucosaminidase (NAGLU) C-t 100.0
PF02449374 Glyco_hydro_42: Beta-galactosidase; InterPro: IPR0 93.58
PF00150281 Cellulase: Cellulase (glycosyl hydrolase family 5) 93.12
PF12891239 Glyco_hydro_44: Glycoside hydrolase family 44; Int 80.68
>KOG2233 consensus Alpha-N-acetylglucosaminidase [Intracellular trafficking, secretion, and vesicular transport] Back     alignment and domain information
Probab=100.00  E-value=9.7e-156  Score=1210.54  Aligned_cols=497  Identities=44%  Similarity=0.860  Sum_probs=480.6

Q ss_pred             CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCeeccCCCCCCCchhhHhhCCCCceeccCCCCCCCCCCcccccccc
Q 009153            1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLL   80 (542)
Q Consensus         1 MgNi~gwGGPLp~~wi~~q~~Lq~kIl~RmreLGM~PVLPgF~G~VP~~~~~~~P~a~i~~~~~W~~f~~~~~~~~~~~L   80 (542)
                      ||||++||||||++|+..|..|||||++|||+||||||||+|+||||++++++||++++++++.|+.|+  ++|||.++|
T Consensus       134 MGNl~awgGpLs~aw~~~ql~LqkrIidrm~~lGmTpvLPaFaG~VP~al~rlfPesnf~rl~rWn~f~--s~~~C~l~v  211 (666)
T KOG2233|consen  134 MGNLHAWGGPLSPAWMLNQLLLQKRIIDRMLELGMTPVLPAFAGHVPDALERLFPESNFTRLPRWNNFT--SRYSCMLLV  211 (666)
T ss_pred             hcCccccCCCCCHHHHHHHHHHHHHHHHHHHHcCCCccchhhccccHHHHHHhCchhceeeccccCCCC--cceeeeEEc
Confidence            999999999999999999999999999999999999999999999999999999999999999999997  699999999


Q ss_pred             CCCChHHHHHHHHHHHHHHHHhCCCCcccccCCCCCCCCCCCChHHHHHHHHHHHHHHhccCCCceEEEecccCCCCCCC
Q 009153           81 DATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFW  160 (542)
Q Consensus        81 dP~dplF~~i~~~F~~~q~~~yG~~~h~Y~~DpFnE~~p~~~d~~~L~~~~~~iy~am~~~dP~AvWvmQgW~F~~~~fW  160 (542)
                      .|+||||.+||.+|+++|+|+||.++|+|+||||||+.||+++|+|+++.+.+||+||+++||+||||||||+|.+|+||
T Consensus       212 ~P~dplF~eIgs~Flr~~~kefG~~tniy~~DpFNE~~Pp~sepey~~staaAiyesm~kvdknaVWllQgWlF~~d~FW  291 (666)
T KOG2233|consen  212 SPFDPLFQEIGSTFLRHQIKEFGGVTNIYSADPFNEILPPESEPEYVKSTAAAIYESMKKVDKNAVWLLQGWLFTYDPFW  291 (666)
T ss_pred             cCCcchHHHHHHHHHHHHHHHhCCcccccccCcccccCCCCCChHHHHHHHHHHHHHHhccCcceEEeeecceeecCCCC
Confidence            99999999999999999999999889999999999999999999999999999999999999999999999999999999


Q ss_pred             CchhHHHhHhCCCCCCEEEEEcCCCccccccccCCCCCCCceeecccCCCcccccccchhHhhcChHHHHhCCCCch---
Q 009153          161 RPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM---  237 (542)
Q Consensus       161 ~~~~~~a~L~~Vp~~~mliLDL~sE~~p~w~~~~~f~G~pwIWc~L~NFGGn~gl~G~l~~i~~~~~~a~~~~~~~m---  237 (542)
                      ++.++||||++||.|+||||||++|..|+|++|.+|+|||||||||||||||.+|+|.++.|+.||..|+..+||+|   
T Consensus       292 ~~~~ikafL~avP~GrllVLDLyaEv~P~~~~Tasf~GqpfIWCmLHNFGGn~~lfGal~~InsG~e~Ar~~~nStlVGt  371 (666)
T KOG2233|consen  292 PPWQIKAFLSAVPRGRLLVLDLYAEVVPAWQMTASFQGQPFIWCMLHNFGGNRELFGALQKINSGPELARMKPNSTLVGT  371 (666)
T ss_pred             ChHHHHHHHhcCCcCcEEEEehhhhhhhHHHhhhhhcCCceeEEEeeccCCcHhhhhhHHHhccCHHHHhcCCCCceeec
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999986   


Q ss_pred             ------------------------------hhHHhhhhcccCCCChHHHHHHHHHHhcccCCCCCCCCCC-CcceeeccC
Q 009153          238 ------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKN-RDVIVAFPD  286 (542)
Q Consensus       238 ------------------------------~W~~~Ya~rRYG~~~~~~~~AW~iL~~tvY~~~~~~~~~~-~~~i~~~P~  286 (542)
                                                    .|+++|+.+|||+.++.+++||.+|.+|||||+++...++ .+.+..||+
T Consensus       372 Gm~~EgI~QN~VvYsf~~e~~wsde~ldl~~Wlksys~sRY~~~~~~~eaaW~lL~~tvYn~~~~~~~~~~~~~l~rRp~  451 (666)
T KOG2233|consen  372 GMSMEGIDQNYVVYSFMIERGWSDEPLDLNNWLKSYSESRYGVDFKVAEAAWTLLAGTVYNCPGKWATRGFSYFLYRRPS  451 (666)
T ss_pred             ccCccccccchhhHHHHHHhccCCCCCCHHHHHHHHHHhhhccchHHHHHHHHHHhhhhcCCCcccccCCCceeeEeccc
Confidence                                          9999999999999999999999999999999998776665 566667776


Q ss_pred             CCCccccccccccccCCCCccccccccccCCCCCCCCccCCHHHHHHHHHHHHhccccCCCC-CcccchHHHHHHHHHHH
Q 009153          287 VDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSAS-NTYRYDLIDLTRQALAK  365 (542)
Q Consensus       287 l~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Yd~~~l~~A~~lll~~~~~l~~~-~~y~yDLvDvtRQvL~n  365 (542)
                      ++                               .++.+||+.+++..||++||.+.+.+..+ ++|||||||||||+|+.
T Consensus       452 f~-------------------------------~k~~~Wyn~sev~~a~rllL~a~~~l~~e~~~fr~DlvDltRq~lqe  500 (666)
T KOG2233|consen  452 FQ-------------------------------RKTEYWYNVSEVFSAWRLLLTALVHLLGEHPLFRYDLVDLTRQMLQE  500 (666)
T ss_pred             cc-------------------------------CCceeeecHHHHHHHHHHHHhhhhhhccCchhhHhHHHHHHHHHHHH
Confidence            54                               14679999999999999999988777666 99999999999999999


Q ss_pred             HHHHHHHHHHHHHHhcChHHHHHHHHHHHHHHHHHHHHhccCCCCChHHHHHHHHhhCCCHHHHHHHHHHhhhcccccCC
Q 009153          366 YANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQEKQYEWNARTQITMWFD  445 (542)
Q Consensus       366 ~~~~~~~~~~~Ay~~~d~~~f~~~~~~~l~li~dlD~LL~t~~~flLg~Wl~~Ar~~g~~~~ek~~yE~NAR~qIT~WG~  445 (542)
                      .++++|.++.+||..||...+.+.|.++++|++|||.+|+++.+||||.||++||+.|.+.+|++.||.|||+|||+|||
T Consensus       501 lanq~y~e~~sAflkkd~~sl~~~~~~llelf~dle~~lasd~nfLlg~WleqAk~~A~n~~er~~fe~nar~qlT~Wgp  580 (666)
T KOG2233|consen  501 LANQAYLEARSAFLKKDKQSLGALSEKLLELFQDLESYLASDDNFLLGQWLEQAKSAAPNSEERQVFEVNARDQLTVWGP  580 (666)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccCcchhHHHHHHHHHhhCCChhhhhhccccccceEEEECC
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             CCCcCCCCCcccccccccchhhhhhHHHHHHHHHHHHHHHhcCCCCChHHHHHHH-HHHhhhhhcCCCcCCCCCCCcHHH
Q 009153          446 NTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREW-IKLTNDWQNGRNVYPVESNGDALI  524 (542)
Q Consensus       446 ~~~~~~~~L~DYA~k~WaGL~~dyY~pRW~~f~~~l~~~l~~~~~~~~~~~~~~~-~~~E~~W~~~~~~y~~~p~gd~~~  524 (542)
                      .     |+|+||||||||||++|||.|||++|++.+.++++.|++|+.++|..+| ..+|.+++.++.+||++|.||+++
T Consensus       581 ~-----gqilDYA~kqwaGLvsdYY~pRwrlf~e~il~slhd~~pFn~t~f~~~v~~~~e~~fs~s~~vYPv~p~Gdt~~  655 (666)
T KOG2233|consen  581 T-----GQILDYANKQWAGLVSDYYGPRWRLFLEWILESLHDGKPFNHTEFSISVFRDVERPFSISNIVYPVKPKGDTLA  655 (666)
T ss_pred             C-----cchhhHHHHHHHHhhhhcccHHHHHHHHHHHHHHhcCCCccccchhhHHHHhhhccccccccccccCCCccHHH
Confidence            5     5899999999999999999999999999999999999999999999998 599999999999999999999999


Q ss_pred             HHHHHHHHHhc
Q 009153          525 TSQWLYNKYLQ  535 (542)
Q Consensus       525 ~a~~l~~ky~~  535 (542)
                      +++.|+.||++
T Consensus       656 is~~if~ky~P  666 (666)
T KOG2233|consen  656 ISRHIFSKYFP  666 (666)
T ss_pred             HHHHHHHhcCC
Confidence            99999999973



>PF05089 NAGLU: Alpha-N-acetylglucosaminidase (NAGLU) tim-barrel domain; InterPro: IPR024733 Alpha-N-acetylglucosaminidase is a lysosomal enzyme that is required for the stepwise degradation of heparan sulphate [] Back     alignment and domain information
>PF12972 NAGLU_C: Alpha-N-acetylglucosaminidase (NAGLU) C-terminal domain; InterPro: IPR024732 Alpha-N-acetylglucosaminidase is a lysosomal enzyme required for the stepwise degradation of heparan sulphate [] Back     alignment and domain information
>PF02449 Glyco_hydro_42: Beta-galactosidase; InterPro: IPR013529 O-Glycosyl hydrolases 3 Back     alignment and domain information
>PF00150 Cellulase: Cellulase (glycosyl hydrolase family 5); InterPro: IPR001547 O-Glycosyl hydrolases 3 Back     alignment and domain information
>PF12891 Glyco_hydro_44: Glycoside hydrolase family 44; InterPro: IPR024745 This is a family of putative bacterial glycoside hydrolases Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query542
2vc9_A891 Family 89 Glycoside Hydrolase From Clostridium Perf 1e-54
4a4a_A914 Cpgh89 (E483q, E601q), From Clostridium Perfringens 3e-54
>pdb|2VC9|A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens In Complex With 2-Acetamido-1,2-Dideoxynojirmycin Length = 891 Back     alignment and structure

Iteration: 1

Score = 211 bits (536), Expect = 1e-54, Method: Compositional matrix adjust. Identities = 161/563 (28%), Positives = 257/563 (45%), Gaps = 80/563 (14%) Query: 1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60 M N+ G+GGPLP W +Q+ L +K+ R+ G+NPVL +SG VP + A+ Sbjct: 344 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNQEAQTI 403 Query: 61 QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDE--NT 118 G W P TY+ + F ++ F E+Q + +G ++ Y D F E NT Sbjct: 404 SQGGWCGFDR-PDMLKTYVNEGEADYFQKVADVFYEKQKEVFGDVTNFYGVDPFHEGGNT 462 Query: 119 PPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLV 178 +D+ + I + M D+DAVW++Q W + P L + + Sbjct: 463 GDLDN----GKIYEIIQNKMIEHDNDAVWVIQNWQGN-------PSNNKLEGLTKKDQAM 511 Query: 179 VLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFG-PVEARTSENTT- 236 VLDLF+EV P W+ ++ +P+IW MLHNF G + M + +A P SE+ Sbjct: 512 VLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVG 570 Query: 237 --------------------MAW----IN------QYSVRRYGRSVPAIQDAWNVLYHTV 266 MAW IN Y RRYG++ I +AWN++ T Sbjct: 571 IGITPEAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILEAWNIILDTA 630 Query: 267 YNCTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWY 326 Y K D + SII+ G +G +KS S++ H + Y Sbjct: 631 YK-------KRNDY---YQGAAESIINARPG----FG--------IKS-ASTWGHSKIVY 667 Query: 327 STSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGV 386 SE +A+E+F + +E S+ + YD D+ +Q LA A E + + AY + Sbjct: 668 DKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEKF 727 Query: 387 FQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ--EKQYEWNARTQITMWF 444 +S +FLEL++ + +L+ FL+G W+E A+ + ++ + + +E+NAR +T W Sbjct: 728 KFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTWG 787 Query: 445 DNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLT 504 + L+DY N+ WSGL DYY R + + L+ G K +W K+ Sbjct: 788 SRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQAELDGG----AKAPNIDWFKME 843 Query: 505 NDWQNGRN----VYPVESNGDAL 523 DW N ++ +YP E++ + L Sbjct: 844 YDWVNKKSDTDKLYPTEASNENL 866
>pdb|4A4A|A Chain A, Cpgh89 (E483q, E601q), From Clostridium Perfringens, In Complex With Its Substrate Glcnac-Alpha-1,4-Galactose Length = 914 Back     alignment and structure

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query542
4a4a_A914 Alpha-N-acetylglucosaminidase family protein; hydr 0.0
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 2e-06
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 4e-04
>4a4a_A Alpha-N-acetylglucosaminidase family protein; hydrolase, 2 hydrolase, family 89 glycoside hydrolase, mucin carbohydrate-active enzyme; HET: NDG GAL; 1.90A {Clostridium perfringens} PDB: 2vcc_A 2vc9_A* 2vcb_A* 2vca_A Length = 914 Back     alignment and structure
 Score =  546 bits (1409), Expect = 0.0
 Identities = 143/568 (25%), Positives = 236/568 (41%), Gaps = 69/568 (12%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M N+ G+GGPLP  W +Q+  L +K+  R+   G+NPVL  +SG VP   +     A+  
Sbjct: 367 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNQEAQTI 426

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
             G W      P    TY+ +     F ++   F E+Q + +G  ++ Y  D F +    
Sbjct: 427 SQGGWCGF-DRPDMLKTYVNEGEADYFQKVADVFYEKQKEVFGDVTNFYGVDPFHQGGNT 485

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFWRPPQMKALLNSVPLGKLVVL 180
                    +   I + M   D+DAVW++Q W          P    L       + +VL
Sbjct: 486 --GDLDNGKIYEIIQNKMIEHDNDAVWVIQNW-------QGNPSNNKLEGLTKKDQAMVL 536

Query: 181 DLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENT----- 235
           DLF+EV P W+   +   +P+IW MLHNF G + M    + +A    +A  +        
Sbjct: 537 DLFSEVSPDWNRL-EERDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALANSEHMVGIG 595

Query: 236 -TM--------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVYN 268
            T                            W   Y  RRYG++   I +AWN++  T Y 
Sbjct: 596 ITPQAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILEAWNIILDTAYK 655

Query: 269 CTDGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYST 328
             +       + I+      P                           S++ H  + Y  
Sbjct: 656 KRNDYYQGAAESIINAR---PGF--------------------GIKSASTWGHSKIVYDK 692

Query: 329 SEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAYQLNDAHGVFQ 388
           SE  +A+E+F  + +E   S+ + YD  D+ +Q LA  A E +  +  AY   +      
Sbjct: 693 SEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLANSAQEYYEVMCNAYNNGNGEKFKF 752

Query: 389 LSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQN--EEQEKQYEWNARTQITMWFDN 446
           +S +FLEL++  + +L+    FL+G W+E A+ + ++  +  +  +E+NAR  +T W   
Sbjct: 753 VSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTWGSR 812

Query: 447 TQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTND 506
              +   L+DY N+ WSGL  DYY  R   +   +   L+ G      DW +      N 
Sbjct: 813 NNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQAELDGGAKAPNIDWFKMEYDWVNK 872

Query: 507 WQNGRNVYPVESNGDALIT-SQWLYNKY 533
             +   +YP E++ + L   ++     Y
Sbjct: 873 KSDTDKLYPTEASNENLGELAKIAMESY 900


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query542
4a4a_A914 Alpha-N-acetylglucosaminidase family protein; hydr 100.0
3n9k_A399 Glucan 1,3-beta-glucosidase; aromatic entranceway/ 94.89
1kwg_A645 Beta-galactosidase; TIM barrel, glycoside hydrolas 94.72
3tty_A675 Beta-GAL, beta-galactosidase; TIM barrel, glycosid 94.16
3qr3_A340 Endoglucanase EG-II; TIM barrel, hydrolase; 2.05A 92.28
1w91_A503 Beta-xylosidase; MAD, seMet, tetramer, hydrolase; 91.62
2osx_A481 Endoglycoceramidase II; (alpha/beta)8 (TIM) barrel 91.26
4ekj_A500 Beta-xylosidase; TIM-barrel fold, hemicellulase, h 90.58
1h4p_A408 Glucan 1,3-beta-glucosidase I/II; hydrolase, gluca 90.48
3icg_A515 Endoglucanase D; cellulase, xylanase, carbohydrate 89.73
1qnr_A344 Endo-1,4-B-D-mannanase; hydrolase, anomalous scatt 89.42
3ndz_A345 Endoglucanase D; cellotriose, xylanase, carbohydra 88.64
1uhv_A500 Beta-xylosidase; family 39 glycoside hydrolase, xy 88.51
1vjz_A341 Endoglucanase; TM1752, structural genomics, JCSG, 87.89
3l55_A353 B-1,4-endoglucanase/cellulase; putative beta-1,4-e 87.09
3nco_A320 Endoglucanase fncel5A; fncel5A, F. nodosum RT17-B1 86.49
1ceo_A343 Cellulase CELC; glycosyl hydrolase, family A/5 of 86.09
2whl_A294 Beta-mannanase, baman5; glycoside hydrolase, hydro 83.37
1h1n_A305 Endo type cellulase ENGI; hydrolase, glycosyl hydr 81.57
3aof_A317 Endoglucanase; glycosyl hydrolase family 5, cellul 80.93
7a3h_A303 Endoglucanase; hydrolase, cellulose degradation, g 80.8
1rh9_A373 Endo-beta-mannanase; endo-beta-mannase, retaining, 80.49
>4a4a_A Alpha-N-acetylglucosaminidase family protein; hydrolase, 2 hydrolase, family 89 glycoside hydrolase, mucin carbohydrate-active enzyme; HET: NDG GAL; 1.90A {Clostridium perfringens} PDB: 2vcc_A 2vc9_A* 2vcb_A* 2vca_A Back     alignment and structure
Probab=100.00  E-value=1.7e-157  Score=1338.28  Aligned_cols=493  Identities=30%  Similarity=0.588  Sum_probs=460.5

Q ss_pred             CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCeeccCCCCCCCchhhHhhCCCCceeccCCCCCCCCCCcccccccc
Q 009153            1 MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLL   80 (542)
Q Consensus         1 MgNi~gwGGPLp~~wi~~q~~Lq~kIl~RmreLGM~PVLPgF~G~VP~~~~~~~P~a~i~~~~~W~~f~~~~~~~~~~~L   80 (542)
                      ||||+|||||||++||++|.+||||||+|||||||+||||||+||||++||++||+|+|+++|.|+||.+ +.+|+|++|
T Consensus       367 MgNl~~wgGPLp~~w~~~q~~Lq~kIl~RmrelGM~PVLPaF~G~VP~~~~~~~P~a~i~~~~~W~gf~~-~~~~~~~~l  445 (914)
T 4a4a_A          367 MQNMTGFGGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMVPRDFKEKNQEAQTISQGGWCGFDR-PDMLKTYVN  445 (914)
T ss_dssp             TTSCCSTTCCCCTTHHHHHHHHHHHHHHHHHHHTCEEEEECCSCEECTTHHHHSTTCCEECCCEETTEEC-CEEECSSCC
T ss_pred             hcCccccCCCCCHHHHHHHHHHHHHHHHHHHHcCCeecCCCcCCCCChHHHhhCCCCeeecCCCCCCCCC-chhcccccC
Confidence            9999999999999999999999999999999999999999999999999999999999999999999985 566779999


Q ss_pred             CCCChHHHHHHHHHHHHHHHHhCCCCcccccCCCCCCCCCCCChHHHHHHHHHHHHHHhccCCCceEEEecccCCCCCCC
Q 009153           81 DATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPPVDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDPFW  160 (542)
Q Consensus        81 dP~dplF~~i~~~F~~~q~~~yG~~~h~Y~~DpFnE~~p~~~d~~~L~~~~~~iy~am~~~dP~AvWvmQgW~F~~~~fW  160 (542)
                      +|+||||++||++||+||+++||+++|||+||||||++|+.+.+  |++++++||++|+++||+||||||||+|.     
T Consensus       446 ~p~dplF~~i~~~F~~~q~~~yG~~~h~Y~~D~FnE~~~~~~~~--l~~~~~~v~~am~~~dp~AvWv~QgW~~~-----  518 (914)
T 4a4a_A          446 EGEADYFQKVADVFYEKQKEVFGDVTNFYGVDPFHQGGNTGDLD--NGKIYEIIQNKMIEHDNDAVWVIQNWQGN-----  518 (914)
T ss_dssp             TTSCCHHHHHHHHHHHHHHHHHCSCCSEEECCTTTTSCCCTTCC--HHHHHHHHHHHHHHHCTTCEEEEEEBTTB-----
T ss_pred             CCCChHHHHHHHHHHHHHHHHhCCcccccccCccccCCCCCCcC--HHHHHHHHHHHHHHhCCCCEEEEcccCCC-----
Confidence            99999999999999999999999668999999999998765533  89999999999999999999999999974     


Q ss_pred             CchhHHHhHhCCCC-CCEEEEEcCCCccccccccCCCCCCCceeecccCCCcccccccchhHhhcChHHHHhCCCCch--
Q 009153          161 RPPQMKALLNSVPL-GKLVVLDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTM--  237 (542)
Q Consensus       161 ~~~~~~a~L~~Vp~-~~mliLDL~sE~~p~w~~~~~f~G~pwIWc~L~NFGGn~gl~G~l~~i~~~~~~a~~~~~~~m--  237 (542)
                       +.  +++|++||+ ||||||||+||..|+|+++++ +|||||||||||||||+||+|+++.|+++|.+|+.. +++|  
T Consensus       519 -~~--~~~L~~vp~~~~mlvLDL~se~~p~w~~~~~-~G~pwiwc~L~NFGGn~gl~G~~~~~~~~~~~a~~~-~~~m~G  593 (914)
T 4a4a_A          519 -PS--NNKLEGLTKKDQAMVLDLFSEVSPDWNRLEE-RDLPWIWNMLHNFGGRMGMDAAPEKLATEIPKALAN-SEHMVG  593 (914)
T ss_dssp             -SC--HHHHTTCSCGGGEEEEETTTTTSCCCHHHHT-TTCCEEEEECCCSTTCCSSCCCHHHHHHHHHHHHHT-CSCEEE
T ss_pred             -Ch--HHHHhCCCCCCCEEEEEcccccccccchhhh-CCCceEEeccccCCCCCCCcccHHHHhhhHHHHhhc-CCCceE
Confidence             33  479999997 999999999999999999987 899999999999999999999999999999999875 4555  


Q ss_pred             -------------------------------hhHHhhhhcccCCCChHHHHHHHHHHhcccCCCCCCCC-CCCcceeecc
Q 009153          238 -------------------------------AWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATD-KNRDVIVAFP  285 (542)
Q Consensus       238 -------------------------------~W~~~Ya~rRYG~~~~~~~~AW~iL~~tvY~~~~~~~~-~~~~~i~~~P  285 (542)
                                                     +||++||+||||+.++++++||+||++|||||+..... .+++++++||
T Consensus       594 ~G~tpEgie~NpvvYeL~~e~aW~~~~id~~~W~~~ya~~RYG~~~~~~~~AW~~L~~tvY~~~~~~~~~~~~s~~~~rP  673 (914)
T 4a4a_A          594 IGITPQAINTNPLAYELLFDMAWTRDQINFRTWTEDYIERRYGKTNKEILEAWNIILDTAYKKRNDYYQGAAESIINARP  673 (914)
T ss_dssp             EEECCSCSCSCHHHHHHHHHGGGCSSCCCHHHHHHHHHHHHHTCCCHHHHHHHHHHHHTTTSCCCSCCCSSCCCGGGSCS
T ss_pred             EeechhhcccCHHHHHHHHhcccCCCCCCHHHHHHHHHHhhcCCCCHHHHHHHHHHHhhhcCCCCccccCCCcCeeeeCC
Confidence                                           99999999999999999999999999999999864333 3568888888


Q ss_pred             CCCCccccccccccccCCCCccccccccccCCCCCCCCccCCHHHHHHHHHHHHhccccCCCCCcccchHHHHHHHHHHH
Q 009153          286 DVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHLWYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAK  365 (542)
Q Consensus       286 ~l~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Yd~~~l~~A~~lll~~~~~l~~~~~y~yDLvDvtRQvL~n  365 (542)
                      +|+..                        ..++|+++.++|||++|++||++||+++++|+++++|||||||||||||+|
T Consensus       674 ~l~~~------------------------~~~~w~~~~~~Yd~~~~~~A~~lll~~~~~l~~s~~y~yDLvDvtRQ~l~n  729 (914)
T 4a4a_A          674 GFGIK------------------------SASTWGHSKIVYDKSEFEKAIEIFAKNYDEFKDSDAFLYDFADILKQLLAN  729 (914)
T ss_dssp             CSSCC------------------------CSSTTCCCCCCSCHHHHHHHHHHHHHHHHHHTTCHHHHHHHHHHHHHHHHH
T ss_pred             CCCcc------------------------CccccCcccccCCHHHHHHHHHHHHHhhhhcCCCchhhHhHHHHHHHHHHH
Confidence            87633                        346789999999999999999999999999999999999999999999999


Q ss_pred             HHHHHHHHHHHHHHhcChHHHHHHHHHHHHHHHHHHHHhccCCCCChHHHHHHHHhhCCCHH--HHHHHHHHhhhccccc
Q 009153          366 YANELFLNIIEAYQLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE--QEKQYEWNARTQITMW  443 (542)
Q Consensus       366 ~~~~~~~~~~~Ay~~~d~~~f~~~~~~~l~li~dlD~LL~t~~~flLg~Wl~~Ar~~g~~~~--ek~~yE~NAR~qIT~W  443 (542)
                      +++.+|.++++||+++|.+.|++++++||+||.+||+||+||++|+||+||++||++|.+++  ||++||+|||+|||+|
T Consensus       730 ~~~~~~~~~~~ay~~~d~~~~~~~~~~~l~ll~~~D~lL~t~~~flLg~Wl~~Ar~~a~~~~~~e~~~yE~NAR~qIT~W  809 (914)
T 4a4a_A          730 SAQEYYEVMCNAYNNGNGEKFKFVSGKFLELIKLQERVLSTRPEFLIGNWIEDARTMLKDSDDWTKDLFEFNARALVTTW  809 (914)
T ss_dssp             HHHHHHHHHHHHHHTTCHHHHHHHHHHHHHHHHHHHHHHTTCGGGBHHHHHHHHHHSSTTCCHHHHHHHHHHHHHHTTTS
T ss_pred             HHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhCCCCcccHHHHHHHHHHhcCCCCHHHHHHHHHHhHheEeec
Confidence            99999999999999999999999999999999999999999999999999999999998865  8999999999999999


Q ss_pred             CCCCCcCCCCCcccccccccchhhhhhHHHHHHHHHHHHHHHhcCCCCChHHHHHHHHHHhhhhhcCC----CcCCCCCC
Q 009153          444 FDNTQEEASLLRDYGNKYWSGLLRDYYGPRAAIYFKYMIESLESGDGFRLKDWRREWIKLTNDWQNGR----NVYPVESN  519 (542)
Q Consensus       444 G~~~~~~~~~L~DYA~k~WaGL~~dyY~pRW~~f~~~l~~~l~~~~~~~~~~~~~~~~~~E~~W~~~~----~~y~~~p~  519 (542)
                      ||+..+++|+|+|||||+|+|||+|||+|||++||++|.++|++|++|+..+    |+++|++|++++    +.||++|.
T Consensus       810 G~~~~~~~g~l~DYA~k~W~GL~~dyY~pRW~~f~~~l~~~l~~~~~~~~~~----~~~~e~~W~~~~~~~~~~yp~~~~  885 (914)
T 4a4a_A          810 GSRNNADGGGLKDYSNRQWSGLTEDYYYARWEKWINGLQAELDGGAKAPNID----WFKMEYDWVNKKSDTDKLYPTEAS  885 (914)
T ss_dssp             SSHHHHTTTSCTTTTCCCCBTHHHHTHHHHHHHHHHHHHHHHTTCSCCCCCC----HHHHHHHHHTCCTTTSCCCCCSCC
T ss_pred             cCCCCCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHhcCCCCChHH----HHHHHHHHhccccccCCcCCCCCC
Confidence            9976556689999999999999999999999999999999999999998764    569999999964    89999999


Q ss_pred             -CcHHHHHHHHHHHHh
Q 009153          520 -GDALITSQWLYNKYL  534 (542)
Q Consensus       520 -gd~~~~a~~l~~ky~  534 (542)
                       ||++++|++||+||.
T Consensus       886 g~d~~~~a~~~~~ky~  901 (914)
T 4a4a_A          886 NENLGELAKIAMESYS  901 (914)
T ss_dssp             CCCHHHHHHHHHHHSC
T ss_pred             CCCHHHHHHHHHHHcc
Confidence             599999999999994



>3n9k_A Glucan 1,3-beta-glucosidase; aromatic entranceway/clamp, exoglucanase, glycoside hydrolas protein-carbohydrate interaction; HET: BGC; 1.70A {Candida albicans} SCOP: c.1.8.3 PDB: 2pc8_A* 2pb1_A* 2pbo_A 3o6a_A 2pf0_A 1cz1_A 1eqc_A* 1eqp_A Back     alignment and structure
>1kwg_A Beta-galactosidase; TIM barrel, glycoside hydrolase family 42, trimer, hydrolase; 1.60A {Thermus thermophilus} SCOP: b.71.1.1 c.1.8.1 c.23.16.5 PDB: 1kwk_A* Back     alignment and structure
>3tty_A Beta-GAL, beta-galactosidase; TIM barrel, glycoside hydrolase, hydrolase; HET: GLA; 2.25A {Bacillus circulans subsp} PDB: 3tts_A* Back     alignment and structure
>3qr3_A Endoglucanase EG-II; TIM barrel, hydrolase; 2.05A {Hypocrea jecorina} Back     alignment and structure
>1w91_A Beta-xylosidase; MAD, seMet, tetramer, hydrolase; 2.2A {Geobacillus stearothermophilus} SCOP: b.71.1.2 c.1.8.3 PDB: 2bs9_A 2bfg_A* Back     alignment and structure
>2osx_A Endoglycoceramidase II; (alpha/beta)8 (TIM) barrel, hydrolase; HET: SIA GAL BGC 16C; 1.10A {Rhodococcus SP} PDB: 2oyk_A* 2osw_A* 2oyl_A* 2oym_A* 2osy_A* Back     alignment and structure
>4ekj_A Beta-xylosidase; TIM-barrel fold, hemicellulase, hydrolase; 2.50A {Caulobacter vibrioides} Back     alignment and structure
>1h4p_A Glucan 1,3-beta-glucosidase I/II; hydrolase, glucan degradation, hydrolyase, glycosidase; HET: NAG BMA MAN NDG; 1.75A {Saccharomyces cerevisiae} SCOP: c.1.8.3 Back     alignment and structure
>1qnr_A Endo-1,4-B-D-mannanase; hydrolase, anomalous scattering; HET: NAG MAB; 1.4A {Trichoderma reesei} SCOP: c.1.8.3 PDB: 1qno_A* 1qnq_A* 1qnp_A* 1qns_A* Back     alignment and structure
>3ndz_A Endoglucanase D; cellotriose, xylanase, carbohydrate binding D glucanase, hydrolase; HET: CT3; 2.08A {Clostridium cellulovorans} PDB: 3ndy_A* Back     alignment and structure
>1uhv_A Beta-xylosidase; family 39 glycoside hydrolase, xylan, xylose, covalent glycosyl-enzyme intermediate; 2.10A {Thermoanaerobacterium saccharolyticum} SCOP: b.71.1.2 c.1.8.3 PDB: 1px8_A Back     alignment and structure
>1vjz_A Endoglucanase; TM1752, structural genomics, JCSG, PSI, prote structure initiative, joint center for structural genomics; 2.05A {Thermotoga maritima} SCOP: c.1.8.3 Back     alignment and structure
>3nco_A Endoglucanase fncel5A; fncel5A, F. nodosum RT17-B1, hydrolase; 1.50A {Fervidobacterium nodosum} PDB: 3rjx_A 3rjy_A* Back     alignment and structure
>1ceo_A Cellulase CELC; glycosyl hydrolase, family A/5 of glycosyl hydrolases, cellulose degradation; 1.90A {Clostridium thermocellum} SCOP: c.1.8.3 PDB: 1cen_A 1cec_A Back     alignment and structure
>2whl_A Beta-mannanase, baman5; glycoside hydrolase, hydrolase; HET: MAN BMA; 1.40A {Bacillus agaradhaerens} PDB: 2whj_A Back     alignment and structure
>1h1n_A Endo type cellulase ENGI; hydrolase, glycosyl hydrolase, family 5, subtype, thermophilic, thermophIle, endoglucanase; 1.12A {Thermoascus aurantiacus} SCOP: c.1.8.3 PDB: 1gzj_A Back     alignment and structure
>3aof_A Endoglucanase; glycosyl hydrolase family 5, cellulase, biofuel, hyperthermo hydrolase; HET: BMA; 1.29A {Thermotoga maritima} PDB: 3amg_A* 3amc_A 3amd_A 3mmu_A 3mmw_A 3azs_A* 3azr_A* 3azt_A* Back     alignment and structure
>7a3h_A Endoglucanase; hydrolase, cellulose degradation, glycoside H family 5, michaelis complex, SKEW-BOAT, distortion; 0.95A {Bacillus agaradhaerens} SCOP: c.1.8.3 PDB: 1h2j_A* 1hf6_A* 1ocq_A* 1w3k_A* 1h11_A* 4a3h_A* 5a3h_A* 6a3h_A* 1w3l_A 8a3h_A* 2v38_A* 1qhz_A 1qi0_A* 1e5j_A* 1qi2_A* 1h5v_A* 1a3h_A 2a3h_A* 3a3h_A* 1lf1_A Back     alignment and structure
>1rh9_A Endo-beta-mannanase; endo-beta-mannase, retaining, glycoside hydrolase family 5; 1.50A {Solanum lycopersicum} SCOP: c.1.8.3 Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query542
d1kwga2393 A4 beta-galactosidase {Thermus thermophilus [TaxId 94.18
d1ceoa_340 Endoglucanase CelC {Clostridium thermocellum [TaxI 93.78
d1qnra_344 Beta-mannanase {Trichoderma reesei [TaxId: 51453]} 92.19
d1vjza_325 Endoglucanase homologue TM1752 {Thermotoga maritim 86.72
d2pb1a1394 Exo-beta-(1,3)-glucanase {Yeast (Candida albicans) 85.96
d1ecea_358 Endocellulase E1 {Acidothermus cellulolyticus [Tax 81.74
>d1kwga2 c.1.8.1 (A:1-393) A4 beta-galactosidase {Thermus thermophilus [TaxId: 274]} Back     information, alignment and structure
class: Alpha and beta proteins (a/b)
fold: TIM beta/alpha-barrel
superfamily: (Trans)glycosidases
family: Amylase, catalytic domain
domain: A4 beta-galactosidase
species: Thermus thermophilus [TaxId: 274]
Probab=94.18  E-value=0.092  Score=48.57  Aligned_cols=94  Identities=15%  Similarity=0.143  Sum_probs=73.8

Q ss_pred             HHHHHHHHHHHcCCeeccCCCCCCCchhhHhhCCCCceeccCCCCCCCCCCccccccccCCCChHHHHHHHHHHHHHHHH
Q 009153           22 LQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKITQLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKE  101 (542)
Q Consensus        22 Lq~kIl~RmreLGM~PVLPgF~G~VP~~~~~~~P~a~i~~~~~W~~f~~~~~~~~~~~LdP~dplF~~i~~~F~~~q~~~  101 (542)
                      -..++|+.+++.||+||+=-|....|..+.+.+|+.......  +.+   ..+....+.++.+|.+.+..+.|++...+.
T Consensus        52 ~~d~~i~~~~~~Gi~~iv~l~~~~~P~w~~~~~~~~~~~~~~--g~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  126 (393)
T d1kwga2          52 WLDEAIATLAAEGLKVVLGTPTATPPKWLVDRYPEILPVDRE--GRR---RRFGGRRHYCFSSPVYREEARRIVTLLAER  126 (393)
T ss_dssp             HHHHHHHHHHTTTCEEEEECSTTSCCHHHHHHCGGGSCBCTT--SCB---CCSSSSCCCCTTCHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHCCCEEEEEcCCCCCchhhhccCcccccccCC--Ccc---cccccccccCCCCHHHHHHHHHHHHHHHHH
Confidence            446899999999999998888888899999999876444322  111   123345678889999999999999999999


Q ss_pred             hCCCCcccccCCCCCCCCC
Q 009153          102 YGRTSHIYNCDTFDENTPP  120 (542)
Q Consensus       102 yG~~~h~Y~~DpFnE~~p~  120 (542)
                      |+....+...+..||..-.
T Consensus       127 ~~~~~~~~~~~~~ne~~~~  145 (393)
T d1kwga2         127 YGGLEAVAGFQTDNEYGCH  145 (393)
T ss_dssp             HTTCTTEEEEECSSSTTTT
T ss_pred             hcCCceEEEEeeccccccc
Confidence            9865567788888997543



>d1ceoa_ c.1.8.3 (A:) Endoglucanase CelC {Clostridium thermocellum [TaxId: 1515]} Back     information, alignment and structure
>d1qnra_ c.1.8.3 (A:) Beta-mannanase {Trichoderma reesei [TaxId: 51453]} Back     information, alignment and structure
>d1vjza_ c.1.8.3 (A:) Endoglucanase homologue TM1752 {Thermotoga maritima [TaxId: 2336]} Back     information, alignment and structure
>d2pb1a1 c.1.8.3 (A:7-400) Exo-beta-(1,3)-glucanase {Yeast (Candida albicans) [TaxId: 5476]} Back     information, alignment and structure
>d1ecea_ c.1.8.3 (A:) Endocellulase E1 {Acidothermus cellulolyticus [TaxId: 28049]} Back     information, alignment and structure