Citrus Sinensis ID: 011031


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440-------450-------460-------470-------480-------490-----
MADLRIVEEGLGRTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAAGIVFSALYVLMDVWELRTPFLFLKWIGMNAMLVFVLGAQGILAGFVNGWYYKNPDNTLVNWIQNHLFIHVWNSERLGTLLYVIFAEITFWGVVAGILHRLGIYWKL
cHHHHHHHHcccccccHHHccccccccccccccccccccccccccccHHHHHHHHHHcccccHHHHHHHHHHHHHHHHHHcccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHccccccccHHHHHHHHHHHHHHHHHHHHHccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccEEEEEccccccccccccHHHHHHHHHcccccccccHHHHHHHHHccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHcccccHHHHHHHHHHHHHHHHHHHHHHHccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHcEEEEccccccHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHHHHHHHHHHHHcccEEEc
cccHHHHHHcccccHHHHHHcccccccHHcccHHcccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHEEcccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccHHHccHHHHHHHHHHHHHHHHHHHHHccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccccccEEEcccccccccccccEEEEEHHHHccHHcccccccEEEEcccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHHccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccEHccHHHHHHHHHHHHHHHHHccccccccccccHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHHHHHHHHHHHHccEEEEc
MADLRIVEEglgrtqlveqeqddgkdsenginkekglersevqDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDdaggayaridhspwngctladfvMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIIlqggyshapdalsygvdmkhirWCGILQRIALVYVVVALIETlttkrrpnvleprhlSIFTAYQWQWIGGFIAFVIYIITTYSlyvpnwsfsehsdhgVKKYIVkcgmrghlgpacnavgyvdrelwginhlysdpvwsrleactlsspnsgplredapswcrapfepeglLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTnaipinkqlySFSYVCFTAGAAGIVFSALYVLMDVWELRTPFLFLKWIGMNAMLVFVLGAQGILAGfvngwyyknpdntlVNWIQNHLFIHVWNSERLGTLLYVIFAEITFWGVVAGILHRLGIYWKL
madlriveeglgrtqlveqeqddgkdsenginkekglersevQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIEtlttkrrpnvlepRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAAGIVFSALYVLMDVWELRTPFLFLKWIGMNAMLVFVLGAQGILAGFVNGWYYKNPDNTLVNWIQNHLFIHVWNSERLGTLLYVIFAEITFWGVVAGILHRLGIYWKL
MADLRIVEEGLGRTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGElqlqqllqqKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAAGIVFSALYVLMDVWELRTPFLFLKWIGMNAMLVFVLGAQGILAGFVNGWYYKNPDNTLVNWIQNHLFIHVWNSERLGTLLYVIFAEITFWGVVAGILHRLGIYWKL
*************************************************************RVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL*************SWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAAGIVFSALYVLMDVWELRTPFLFLKWIGMNAMLVFVLGAQGILAGFVNGWYYKNPDNTLVNWIQNHLFIHVWNSERLGTLLYVIFAEITFWGVVAGILHRLGIYW**
**********LGR****************************************************LDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAAGIVFSALYVLMDVWELRTPFLFLKWIGMNAMLVFVLGAQGILAGFVNGWYYKNPDNTLVNWIQNHLFIHVWNSERLGTLLYVIFAEITFWGVVAGILHRLGIYWKL
MADLRIVEEGLGRTQLV**********ENGINKEKGL***********ELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSS**********PSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAAGIVFSALYVLMDVWELRTPFLFLKWIGMNAMLVFVLGAQGILAGFVNGWYYKNPDNTLVNWIQNHLFIHVWNSERLGTLLYVIFAEITFWGVVAGILHRLGIYWKL
*****I*E*GL**T**V******************************************SKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAAGIVFSALYVLMDVWELRTPFLFLKWIGMNAMLVFVLGAQGILAGFVNGWYYKNPDNTLVNWIQNHLFIHVWNSERLGTLLYVIFAEITFWGVVAGILHRLGIYWKL
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooooooooHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHoooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHoooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHiiiiiiiii
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MADLRIVEEGLGRTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAAGIVFSALYVLMDVWELRTPFLFLKWIGMNAMLVFVLGAQGILAGFVNGWYYKNPDNTLVNWIQNHLFIHVWNSERLGTLLYVIFAEITFWGVVAGILHRLGIYWKL
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query495 2.2.26 [Sep-21-2011]
Q3UDW8656 Heparan-alpha-glucosamini yes no 0.828 0.625 0.293 2e-36
Q68CP4663 Heparan-alpha-glucosamini yes no 0.832 0.621 0.284 2e-34
>sp|Q3UDW8|HGNAT_MOUSE Heparan-alpha-glucosaminide N-acetyltransferase OS=Mus musculus GN=Hgsnat PE=1 SV=2 Back     alignment and function desciption
 Score =  154 bits (388), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 144/490 (29%), Positives = 229/490 (46%), Gaps = 80/490 (16%)

Query: 16  LVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVV 75
           +  +E D   +SE G         ++ Q E +         +  + R+  +D FRGL +V
Sbjct: 223 IASRETDRLINSELGSPSRADPLSADYQPETR---------RSSANRLRCVDTFRGLALV 273

Query: 76  LMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK---- 131
           LM+ V+  GG Y    HS WNG T+AD V P+F+FI+G +I L++  + +  G  K    
Sbjct: 274 LMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGTSIFLSMTSILQ-RGCSKLKLL 332

Query: 132 -KIIFRTLKLLFWG-IILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETL 189
            KI++R+  L+  G II+   Y   P      +    +R  G+LQR+ + Y VVA++E  
Sbjct: 333 GKIVWRSFLLICIGVIIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLEFF 386

Query: 190 TTKRRPN--VLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGV 245
             K  P+   LE    S+   T+   QW+       I++  T+ L VP          G 
Sbjct: 387 FWKPVPDSCTLESSCFSLRDITSSWPQWLTILTLESIWLALTFFLPVP----------GC 436

Query: 246 KKYIVKCGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLR 301
               +  G  G LG  P C   A GY+DR L G NHLY  P  + L    ++        
Sbjct: 437 PTGYLGPGGIGDLGKYPHCTGGAAGYIDRLLLGDNHLYQHPSSTVLYHTEVA-------- 488

Query: 302 EDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHS----ARLKHWVSMGFGL 357
                     ++PEG+L TI++I+   +G+  G +L+++K  +     R   W  +   L
Sbjct: 489 ----------YDPEGVLGTINSIVMAFLGVQAGKILVYYKDQTKAILTRFAAWCCI---L 535

Query: 358 LIIAIILHFTNA----IPINKQLYSFSYVCFTAGAAGIVFSALYVLMDVWEL--RTPFLF 411
            +I+I+L   +A    IPINK L+S SYV   +  A  +   LY ++DV  L   TPF +
Sbjct: 536 GLISIVLTKVSANEGFIPINKNLWSISYVTTLSCFAFFILLILYPVVDVKGLWTGTPFFY 595

Query: 412 LKWIGMNAMLVFVLGAQGILAGFVNGWYYKNPDNTLVNWIQNHLFIHVWNSERLGTLLYV 471
               GMN++LV+V G + +   F   W   +  +   + IQN +   +W       + YV
Sbjct: 596 ---PGMNSILVYV-GHEVLENYFPFQWKLADEQSHKEHLIQNIVATALWV-----LIAYV 646

Query: 472 IFAEITFWGV 481
           ++ +  FW +
Sbjct: 647 LYKKKLFWKI 656




Lysosomal acetyltransferase that acetylates the non-reducing terminal alpha-glucosamine residue of intralysosomal heparin or heparan sulfate, converting it into a substrate for luminal alpha-N-acetyl glucosaminidase.
Mus musculus (taxid: 10090)
EC: 2EC: .EC: 3EC: .EC: 1EC: .EC: 7EC: 8
>sp|Q68CP4|HGNAT_HUMAN Heparan-alpha-glucosaminide N-acetyltransferase OS=Homo sapiens GN=HGSNAT PE=1 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query495
359481929489 PREDICTED: heparan-alpha-glucosaminide N 0.981 0.993 0.766 0.0
147817637511 hypothetical protein VITISV_004036 [Viti 0.981 0.951 0.733 0.0
255556868519 conserved hypothetical protein [Ricinus 0.997 0.951 0.749 0.0
224069583496 predicted protein [Populus trichocarpa] 0.981 0.979 0.733 0.0
356503734508 PREDICTED: heparan-alpha-glucosaminide N 0.995 0.970 0.722 0.0
356570776509 PREDICTED: LOW QUALITY PROTEIN: heparan- 0.987 0.960 0.721 0.0
356548323419 PREDICTED: heparan-alpha-glucosaminide N 0.846 1.0 0.806 0.0
449440411488 PREDICTED: heparan-alpha-glucosaminide N 0.961 0.975 0.709 0.0
242067981512 hypothetical protein SORBIDRAFT_05g00697 0.957 0.925 0.677 0.0
357152403498 PREDICTED: heparan-alpha-glucosaminide N 0.977 0.971 0.673 0.0
>gi|359481929|ref|XP_002268831.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Vitis vinifera] gi|297739972|emb|CBI30154.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
 Score =  788 bits (2035), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 378/493 (76%), Positives = 427/493 (86%), Gaps = 7/493 (1%)

Query: 3   DLRIVEEGLGRTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKR 62
           D + VEEGLG        ++D  +  + I K++         EQKGE Q   L++QKSKR
Sbjct: 4   DAKRVEEGLGHVH-----KEDISEKADKIEKDESSATPAQSVEQKGEEQ--PLIKQKSKR 56

Query: 63  VATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKK 122
           VATLDAFRGLT+VLMILVDDAGG+YARIDHSPWNGCTLADFVMPFFLFIVGVA+ALALKK
Sbjct: 57  VATLDAFRGLTIVLMILVDDAGGSYARIDHSPWNGCTLADFVMPFFLFIVGVAVALALKK 116

Query: 123 VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVV 182
           +P+I+ AVKKI  RTLKLLFWGI+LQGGYSHAPD LSYGVDMKHIRW GILQRIA+VY V
Sbjct: 117 IPRISLAVKKISLRTLKLLFWGILLQGGYSHAPDDLSYGVDMKHIRWFGILQRIAVVYFV 176

Query: 183 VALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSD 242
           VALIETLTTKRRP V++  H SI +AY+WQWIGGF+AF+IY+ITTY+LYVP+WSF    D
Sbjct: 177 VALIETLTTKRRPTVIDSGHFSILSAYKWQWIGGFVAFLIYMITTYALYVPDWSFVIDQD 236

Query: 243 HGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLRE 302
           H  K+Y VKCGMRGHLGPACNAVGYVDR++WGINHLYS PVW+RL+ACTLSSPNSGP RE
Sbjct: 237 HEAKRYTVKCGMRGHLGPACNAVGYVDRQVWGINHLYSQPVWTRLKACTLSSPNSGPFRE 296

Query: 303 DAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAI 362
           DAPSWC APFEPEGLLSTISAILSGTIGIHYGHVLIHFKGH+ RLK WVSMG  LLI+AI
Sbjct: 297 DAPSWCYAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHAERLKQWVSMGIVLLIVAI 356

Query: 363 ILHFTNAIPINKQLYSFSYVCFTAGAAGIVFSALYVLMDVWELRTPFLFLKWIGMNAMLV 422
           ILHFT+AIPINKQLYSFSYVCFTAGAAGIV SA Y+++DVW  RTPFLFL+WIGMNAMLV
Sbjct: 357 ILHFTDAIPINKQLYSFSYVCFTAGAAGIVLSAFYLVIDVWGFRTPFLFLEWIGMNAMLV 416

Query: 423 FVLGAQGILAGFVNGWYYKNPDNTLVNWIQNHLFIHVWNSERLGTLLYVIFAEITFWGVV 482
           FV+ AQGI A F+NGWY+++ DN+LV+WIQ H+FI VW+SERLGTLLYVIFAEITFW VV
Sbjct: 417 FVMAAQGIFAAFINGWYFESSDNSLVHWIQRHVFIDVWHSERLGTLLYVIFAEITFWAVV 476

Query: 483 AGILHRLGIYWKL 495
           +GILH+L IYWKL
Sbjct: 477 SGILHKLHIYWKL 489




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|147817637|emb|CAN64496.1| hypothetical protein VITISV_004036 [Vitis vinifera] Back     alignment and taxonomy information
>gi|255556868|ref|XP_002519467.1| conserved hypothetical protein [Ricinus communis] gi|223541330|gb|EEF42881.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|224069583|ref|XP_002326379.1| predicted protein [Populus trichocarpa] gi|222833572|gb|EEE72049.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|356503734|ref|XP_003520659.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Glycine max] Back     alignment and taxonomy information
>gi|356570776|ref|XP_003553560.1| PREDICTED: LOW QUALITY PROTEIN: heparan-alpha-glucosaminide N-acetyltransferase-like [Glycine max] Back     alignment and taxonomy information
>gi|356548323|ref|XP_003542552.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Glycine max] Back     alignment and taxonomy information
>gi|449440411|ref|XP_004137978.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Cucumis sativus] gi|449517341|ref|XP_004165704.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|242067981|ref|XP_002449267.1| hypothetical protein SORBIDRAFT_05g006970 [Sorghum bicolor] gi|241935110|gb|EES08255.1| hypothetical protein SORBIDRAFT_05g006970 [Sorghum bicolor] Back     alignment and taxonomy information
>gi|357152403|ref|XP_003576108.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Brachypodium distachyon] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms


Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
GSVIVG00000614001
SubName- Full=Chromosome chr8 scaffold_106, whole genome shotgun sequence; (486 aa)
(Vitis vinifera)
Predicted Functional Partners:
GSVIVG00028736001
SubName- Full=Chromosome chr13 scaffold_45, whole genome shotgun sequence; (799 aa)
      0.916
GSVIVG00016165001
SubName- Full=Chromosome chr17 scaffold_12, whole genome shotgun sequence; (555 aa)
       0.899

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query495
COG4299371 COG4299, COG4299, Uncharacterized protein conserve 1e-34
>gnl|CDD|226749 COG4299, COG4299, Uncharacterized protein conserved in bacteria [Function unknown] Back     alignment and domain information
 Score =  133 bits (335), Expect = 1e-34
 Identities = 110/445 (24%), Positives = 173/445 (38%), Gaps = 83/445 (18%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGV 114
           Q + R+ +LD FRGLTV+LMILV++AG     Y ++ H+ W G TL D V P+FLF VG 
Sbjct: 3   QPAFRLTSLDVFRGLTVLLMILVNNAGLGDSTYRQLSHAHWGGLTLTDLVFPWFLFCVGA 62

Query: 115 AIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQ 174
           A+  +  K+ K N     +  R  +    G ++    +    ++         R  G+LQ
Sbjct: 63  AMPFSASKMNKANVTTWPLYRRAAERFALGYLMGAFVTVRDWSV---TSHSLTRGMGVLQ 119

Query: 175 RIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPN 234
           RIAL Y+  AL+      R                 WQ +   +    Y +       P 
Sbjct: 120 RIALAYLFAALLVRQLRGR-----------------WQALLAAVLLAGYWLFLMFTPHPA 162

Query: 235 WSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSS 294
                                  LG   N     D      +HLYS              
Sbjct: 163 AP---------------------LGGIGNVGESADPLQILNDHLYS-------------- 187

Query: 295 PNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHS-ARLKHWVSM 353
                      +     F+PEGLLST+   +    G      L    G+  A L      
Sbjct: 188 -----------AD--GGFDPEGLLSTVPTTVLVLAGYLAARPLQQKPGNPRAPLL---LA 231

Query: 354 GFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAAGIVFSALYVLMDVWELRTPFLFLK 413
           G G+++ A+   +    PI+K+L++ SYV +TAG   ++ +A +VL +    +       
Sbjct: 232 GLGVVLTALGYGWAGRFPISKKLWTSSYVLYTAGLGLLLLAACWVLAESPGGKRLLAPFT 291

Query: 414 WIGMNAMLVFVLG---AQGILAGFVNGWYYKNPDNTLVNWIQNHLFIHVWNSERLGTLLY 470
             G+NA+ ++VL       +L  +  G       +  + W   ++F         G+LLY
Sbjct: 292 IPGLNALALYVLSILIKVWLLLDWGVGETAP---SQSIAWSLLNMF-RSSFGPVGGSLLY 347

Query: 471 VIFAEITFWGVVAGILHRLGIYWKL 495
            +   +  W +    + R GI WKL
Sbjct: 348 ALGYVLAVW-LGLAWMARRGIIWKL 371


Length = 371

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 495
KOG4683549 consensus Uncharacterized conserved protein [Funct 100.0
COG4299371 Uncharacterized protein conserved in bacteria [Fun 100.0
PF07786223 DUF1624: Protein of unknown function (DUF1624); In 99.85
COG3503323 Predicted membrane protein [Function unknown] 99.58
PF10129358 OpgC_C: OpgC protein; InterPro: IPR014550 There is 98.89
COG2311394 Predicted membrane protein [Function unknown] 98.72
PF01757340 Acyl_transf_3: Acyltransferase family; InterPro: I 98.46
PRK10835373 hypothetical protein; Provisional 98.43
PF06423136 GWT1: GWT1; InterPro: IPR009447 Glycosylphosphatid 98.34
PRK03854375 opgC glucans biosynthesis protein; Provisional 97.53
PF04235163 DUF418: Protein of unknown function (DUF418); Inte 96.79
COG3594343 NolL Fucose 4-O-acetylase and related acetyltransf 95.96
COG4645410 Uncharacterized protein conserved in bacteria [Fun 95.9
COG3274332 Predicted O-acyltransferase [General function pred 93.78
COG5062429 Uncharacterized membrane protein [Function unknown 91.38
PF15345233 TMEM51: Transmembrane protein 51 89.46
COG1835386 Predicted acyltransferases [Lipid metabolism] 85.79
>KOG4683 consensus Uncharacterized conserved protein [Function unknown] Back     alignment and domain information
Probab=100.00  E-value=6e-98  Score=751.82  Aligned_cols=414  Identities=38%  Similarity=0.677  Sum_probs=370.9

Q ss_pred             hHhhhhhccccchhhhhhhhhcchheeeeEeccCCcccccccCCCCCCchhhHHHHHHHHHHHHHHHhhccccCChhHHH
Q 011031           51 QLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAV  130 (495)
Q Consensus        51 ~~~~~~~~~~~Rl~SLD~~RGlti~lMIlVn~~g~~~~~l~Ha~W~G~t~aDlVfP~FlFivGvSi~ls~~~~~~~~~~~  130 (495)
                      .+..+.+..++|+.|||+|||+|+++||+||+.|+.||.++||+|||+++||+|||+|+||||+|+++|+|+..+|.+..
T Consensus       132 ~~~rsla~~r~RL~SLD~FRGltValMIlVdd~GG~~p~I~HapWnG~~LADfVmPfFLfIvGVsials~K~~s~rf~a~  211 (549)
T KOG4683|consen  132 GEARSLATQRKRLRSLDTFRGLTVALMILVDDGGGGYPWIEHAPWNGLHLADFVMPFFLFIVGVSIALSVKSQSSRFSAT  211 (549)
T ss_pred             hhhhhcCCCchhhhhhhhhcCceEEEEEEEecCCCCchhhhcCCcCCccHHHHHHHHHHHHHHhhhhhhhhhhhhhhhHh
Confidence            35555666678999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             HHHHHHHHHHHHHHHHHhccCCCCCCccccCccCccceechhHHHHHHHHHHHHHHHHHhcCCCCCCCCccchhhhhhhH
Q 011031          131 KKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQ  210 (495)
Q Consensus       131 ~kil~Rsl~Lf~iGl~l~~~~~~~~~~~~~~~~~~~~R~~GVLqrIa~~Y~i~all~l~~~~~~~~~~~~~~~~i~~~~~  210 (495)
                      ||...|+.+|+++|++++++|.|.++++|++.|.+.+|++|||||+|++|+++|++.+++.+..  +++         ..
T Consensus       212 rKa~~R~cklllwgLflqGgf~h~~~nLTygidve~lR~mGILQr~~~ayLVvAi~~~~~~~~~--~~~---------~S  280 (549)
T KOG4683|consen  212 RKAKARICKLLLWGLFLQGGFLHSMSNLTYGIDVEQLRIMGILQRFGVAYLVVAILHTLCCRPI--SPQ---------RS  280 (549)
T ss_pred             HHHHHHHHHHHHHHHHHhhhcccCcccccCCccHHHHHHHHHHHHhhHHHHHHHHHhhhccCCC--ccc---------cc
Confidence            9999999999999999999999999999999999999999999999999999999998776522  111         12


Q ss_pred             HHHHHHHHHHHHHHHHHHhcccCCCCcccc---CCCCcceeeeccCCCCCCCC-CCCHHHHHHHHhcCCCCccCCccccc
Q 011031          211 WQWIGGFIAFVIYIITTYSLYVPNWSFSEH---SDHGVKKYIVKCGMRGHLGP-ACNAVGYVDRELWGINHLYSDPVWSR  286 (495)
Q Consensus       211 ~~~~~~~~ll~~y~~l~~~~~vP~~~~~~~---~~~~~~~~~~~~g~~g~l~~-~~N~a~~iDr~vlG~~HlY~~p~~~~  286 (495)
                      ||-++....+..|-+..++.+||+|.+...   .|+.|+.|.++||.++...+ .||+++|.||+++|++|||++|++||
T Consensus       281 ~~R~V~~~~L~~~~~~~~~~~V~~~~~~~~~~~~~~~~r~~~~~~G~~~~~~~P~CnAvGy~DrqvLGi~HiY~hP~~~r  360 (549)
T KOG4683|consen  281 WQRAVHDVCLFSGELAVLLALVATYLGLTFGLRVPGCPRGYLGPGGKHDYNAHPKCNAVGYADRQVLGIAHIYQHPTAKR  360 (549)
T ss_pred             hhhhhhHHHHHHHHHHHHHHhhhhhhceecccccCCCCcccccCCcccccCCCCCccchhhhHHhhhhhHHHhcCchHHH
Confidence            455666666777777777777777765433   34457777788888888865 59999999999999999999999999


Q ss_pred             cccccCCCCCCCCCCCCCCCcccCCCCCcchhhhhHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHh
Q 011031          287 LEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHF  366 (495)
Q Consensus       287 ~~~c~~~~~~~g~~~~~~~~~~~~~fDPEGlLStipai~~~llG~~aG~~L~~~~~~~~r~~~ll~~G~~llilG~ll~~  366 (495)
                      +|+||.|+|++||++.|+|+||.+|||||||||+|.|++++++|+++|+++.+.+++..|+++|...+.++.++|..+++
T Consensus       361 ~k~cs~n~P~nG~l~~DAPSWCqapFdPEGilssi~avv~~llG~h~Ghiilh~k~~~sRir~wis~~~~l~llg~tL~~  440 (549)
T KOG4683|consen  361 VKDCSINYPNNGPLPPDAPSWCQAPFDPEGILSSILAVVQVLLGAHAGHIILHHKNFQSRIRRWISLAILLGLLGGTLCG  440 (549)
T ss_pred             hhhcccCCCCCCCCCCCCchhhcCCCChHHHHHHHHHHHHHHHHhhcCeEEEEccchHHHHHHHHHHHHHHHHHhhhhhc
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999998


Q ss_pred             CCCccccccCCchhHHHHHHHHHHHHHHHHHHHHHhcCccchhhHHHHhchhHHHHHHHhhhhhHhhhhcceeecCCCcC
Q 011031          367 TNAIPINKQLYSFSYVCFTAGAAGIVFSALYVLMDVWELRTPFLFLKWIGMNAMLVFVLGAQGILAGFVNGWYYKNPDNT  446 (495)
Q Consensus       367 ~~~~PiNK~LWT~SfVL~TaG~a~llLa~~y~liDv~~~~~~~~pf~~~GmNai~iYv~s~~~i~~~~l~~~~~~~~~~~  446 (495)
                      .+.+|+||+|||.||+|+|+|+|+++++.+|++||++.|++.+.||++.|||||++||++  +++..+++ |+|+.++++
T Consensus       441 ~s~~Plnk~L~slsfvCVT~~~A~Li~S~mY~~iDv~EW~~~~~P~~~~GMNAi~~YV~~--~vL~~~~~-W~~R~~~~~  517 (549)
T KOG4683|consen  441 FSAIPLNKNLWSLSFVCVTVSLALLILSLMYYFIDVREWSWSGYPFTECGMNAIVMYVGH--SVLHKMLP-WHWRIGEMN  517 (549)
T ss_pred             ccccchhHhHHHhhhhHHHHHHHHHHHHHHHHHhhHHHhhhccCChhhhccchhHHHHhH--HHHHHhcc-hhhccCCCc
Confidence            778999999999999999999999999999999999999888999999999999999999  89999886 999998887


Q ss_pred             hHHHHHHHHHHhhhcccchhHHHHHHHHHHHHHHHHHHHHHhcCceEeC
Q 011031          447 LVNWIQNHLFIHVWNSERLGTLLYVIFAEITFWGVVAGILHRLGIYWKL  495 (495)
Q Consensus       447 l~~wl~~~~~~~~~~~~~~gsLlyal~~~~~~w~lia~~L~rkkIfiKL  495 (495)
                      +      |++..+|++-           -+++|.+|+.++|+.|||||+
T Consensus       518 ~------H~~l~~~~t~-----------~~L~W~~i~~~~~~~~~Y~~~  549 (549)
T KOG4683|consen  518 T------HFMLLLEATW-----------NTLVWVGIALYLDAQEFYYSV  549 (549)
T ss_pred             e------eEEEeeehhh-----------hhhhhhhhheeeeheeeEecC
Confidence            6      3333444322           246799999999999999986



>COG4299 Uncharacterized protein conserved in bacteria [Function unknown] Back     alignment and domain information
>PF07786 DUF1624: Protein of unknown function (DUF1624); InterPro: IPR012429 These sequences are found in hypothetical proteins of unknown function expressed by bacterial and archaeal species Back     alignment and domain information
>COG3503 Predicted membrane protein [Function unknown] Back     alignment and domain information
>PF10129 OpgC_C: OpgC protein; InterPro: IPR014550 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function Back     alignment and domain information
>COG2311 Predicted membrane protein [Function unknown] Back     alignment and domain information
>PF01757 Acyl_transf_3: Acyltransferase family; InterPro: IPR002656 This entry contains a range of acyltransferase enzymes as well as yet uncharacterised proteins from Caenorhabditis elegans Back     alignment and domain information
>PRK10835 hypothetical protein; Provisional Back     alignment and domain information
>PF06423 GWT1: GWT1; InterPro: IPR009447 Glycosylphosphatidylinositol (GPI) is a conserved post-translational modification to anchor cell surface proteins to plasma membrane in eukaryotes Back     alignment and domain information
>PRK03854 opgC glucans biosynthesis protein; Provisional Back     alignment and domain information
>PF04235 DUF418: Protein of unknown function (DUF418); InterPro: IPR007349 Tihs is a probable integral membrane protein Back     alignment and domain information
>COG3594 NolL Fucose 4-O-acetylase and related acetyltransferases [Carbohydrate transport and metabolism] Back     alignment and domain information
>COG4645 Uncharacterized protein conserved in bacteria [Function unknown] Back     alignment and domain information
>COG3274 Predicted O-acyltransferase [General function prediction only] Back     alignment and domain information
>COG5062 Uncharacterized membrane protein [Function unknown] Back     alignment and domain information
>PF15345 TMEM51: Transmembrane protein 51 Back     alignment and domain information
>COG1835 Predicted acyltransferases [Lipid metabolism] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00