Citrus Sinensis ID: 014321


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------
MADLRIVEEGLGRTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTADGCLGITNSIFIPEMDWDECDACFRAWSTRYPSRICKWMVL
cHHHHHHHHcccccccHHcccccccccccccccccccccccccccccHHHHHHHHHHcccccHHHHHHHHHHHHHHHHHHcccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHccccccccHHHHHHHHHHHHHHHHHHHHHcccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccccEEEEEccccccccccccHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHcccccHHHHHHHHHHHHHHHHHHHHHHHccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccEEEEccc
cccHHHHHcccccEEEcccccHcccccccccccccccccccccccccccccccccccccccHEHHHHHHHHHHHHHHHEEcccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccHHHccHHHHHHHHHHHHHHHHHHHHHcccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccccEEcccccccccccccEEEEEHHHHccHHcccccccEEEEcccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHHHcccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccHHHHHHHHHcc
MADLRIVEEglgrtqlveqeqddgkdsenginkekglersevqDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDdaggayaridhspwngctladfvMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIIlqggyshapdalsygvdmkhirWCGILQRIALVYVVVALIETlttkrrpnvleprhlSIFTAYQWQWIGGFIAFVIYIITTYSlyvpnwsfsehsdhgVKKYIVkcgmrghlgpacnavgyvdrelwginhlysdpvwsrleactlsspnsgplredapswcrapfepeglLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTnaipinkqlYSFSYVCFtadgclgitnsifipemdwdeCDACFRawstrypsrickwmvl
madlriveeglgrtqlveqeqddgkdsenginkekglersevQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIEtlttkrrpnvlepRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTADGCLGITNSIFIPEMDWDECDACFRAWStrypsrickwmvl
MADLRIVEEGLGRTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGElqlqqllqqKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTADGCLGITNSIFIPEMDWDECDACFRAWSTRYPSRICKWMVL
*************************************************************RVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTL*************SWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTADGCLGITNSIFIPEMDWDECDACFRAWSTRYPSRICKWMV*
****R***EGLGRT***************************************************LDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTADGCLGITNSIFIPEMDWDECDACFRAWSTRYPSRICKWMVL
MADLRIVEEGLGRTQLV**********ENGINKEKGL***********ELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSS**********PSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTADGCLGITNSIFIPEMDWDECDACFRAWSTRYPSRICKWMVL
**DLRIVEEGLGRTQLV******************************************SKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTADGCLGITNSIFIPEMDWDECDACFRAWSTRYPSRICKWMVL
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHooooooooooooooooHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHoooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MADLRIVEEGLGRTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTADGCLGITNSIFIPEMDWDECDACFRAWSTRYPSRICKWMVL
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query427 2.2.26 [Sep-21-2011]
Q3UDW8656 Heparan-alpha-glucosamini yes no 0.754 0.490 0.303 2e-31
Q68CP4663 Heparan-alpha-glucosamini yes no 0.749 0.482 0.295 9e-29
>sp|Q3UDW8|HGNAT_MOUSE Heparan-alpha-glucosaminide N-acetyltransferase OS=Mus musculus GN=Hgsnat PE=1 SV=2 Back     alignment and function desciption
 Score =  137 bits (345), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 120/395 (30%), Positives = 185/395 (46%), Gaps = 73/395 (18%)

Query: 16  LVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKRVATLDAFRGLTVV 75
           +  +E D   +SE G         ++ Q E +         +  + R+  +D FRGL +V
Sbjct: 223 IASRETDRLINSELGSPSRADPLSADYQPETR---------RSSANRLRCVDTFRGLALV 273

Query: 76  LMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK---- 131
           LM+ V+  GG Y    HS WNG T+AD V P+F+FI+G +I L++  + +  G  K    
Sbjct: 274 LMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGTSIFLSMTSILQ-RGCSKLKLL 332

Query: 132 -KIIFRTLKLLFWG-IILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETL 189
            KI++R+  L+  G II+   Y   P      +    +R  G+LQR+ + Y VVA++E  
Sbjct: 333 GKIVWRSFLLICIGVIIVNPNYCLGP------LSWDKVRIPGVLQRLGVTYFVVAVLEFF 386

Query: 190 TTKRRPN--VLEPRHLSI--FTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGV 245
             K  P+   LE    S+   T+   QW+       I++  T+ L VP          G 
Sbjct: 387 FWKPVPDSCTLESSCFSLRDITSSWPQWLTILTLESIWLALTFFLPVP----------GC 436

Query: 246 KKYIVKCGMRGHLG--PACN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLR 301
               +  G  G LG  P C   A GY+DR L G NHLY  P  + L    ++        
Sbjct: 437 PTGYLGPGGIGDLGKYPHCTGGAAGYIDRLLLGDNHLYQHPSSTVLYHTEVA-------- 488

Query: 302 EDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHS----ARLKHWVSMGFGL 357
                     ++PEG+L TI++I+   +G+  G +L+++K  +     R   W  +   L
Sbjct: 489 ----------YDPEGVLGTINSIVMAFLGVQAGKILVYYKDQTKAILTRFAAWCCI---L 535

Query: 358 LIIAIILHFTNA----IPINKQLYSFSYV----CF 384
            +I+I+L   +A    IPINK L+S SYV    CF
Sbjct: 536 GLISIVLTKVSANEGFIPINKNLWSISYVTTLSCF 570




Lysosomal acetyltransferase that acetylates the non-reducing terminal alpha-glucosamine residue of intralysosomal heparin or heparan sulfate, converting it into a substrate for luminal alpha-N-acetyl glucosaminidase.
Mus musculus (taxid: 10090)
EC: 2EC: .EC: 3EC: .EC: 1EC: .EC: 7EC: 8
>sp|Q68CP4|HGNAT_HUMAN Heparan-alpha-glucosaminide N-acetyltransferase OS=Homo sapiens GN=HGSNAT PE=1 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query427
359481929489 PREDICTED: heparan-alpha-glucosaminide N 0.950 0.830 0.723 1e-174
147817637511 hypothetical protein VITISV_004036 [Viti 0.950 0.794 0.687 1e-171
255556868519 conserved hypothetical protein [Ricinus 0.936 0.770 0.736 1e-170
224069583496 predicted protein [Populus trichocarpa] 0.948 0.816 0.686 1e-162
356503734508 PREDICTED: heparan-alpha-glucosaminide N 0.967 0.812 0.675 1e-161
356570776509 PREDICTED: LOW QUALITY PROTEIN: heparan- 0.957 0.803 0.666 1e-157
356548323419 PREDICTED: heparan-alpha-glucosaminide N 0.793 0.809 0.736 1e-151
449440411488 PREDICTED: heparan-alpha-glucosaminide N 0.927 0.811 0.660 1e-144
357152403498 PREDICTED: heparan-alpha-glucosaminide N 0.946 0.811 0.611 1e-139
242067981512 hypothetical protein SORBIDRAFT_05g00697 0.922 0.769 0.610 1e-139
>gi|359481929|ref|XP_002268831.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Vitis vinifera] gi|297739972|emb|CBI30154.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
 Score =  617 bits (1592), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 306/423 (72%), Positives = 343/423 (81%), Gaps = 17/423 (4%)

Query: 3   DLRIVEEGLGRTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKSKR 62
           D + VEEGLG        ++D  +  + I K++         EQKGE Q   L++QKSKR
Sbjct: 4   DAKRVEEGLGHVH-----KEDISEKADKIEKDESSATPAQSVEQKGEEQ--PLIKQKSKR 56

Query: 63  VATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKK 122
           VATLDAFRGLT+VLMILVDDAGG+YARIDHSPWNGCTLADFVMPFFLFIVGVA+ALALKK
Sbjct: 57  VATLDAFRGLTIVLMILVDDAGGSYARIDHSPWNGCTLADFVMPFFLFIVGVAVALALKK 116

Query: 123 VPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVV 182
           +P+I+ AVKKI  RTLKLLFWGI+LQGGYSHAPD LSYGVDMKHIRW GILQRIA+VY V
Sbjct: 117 IPRISLAVKKISLRTLKLLFWGILLQGGYSHAPDDLSYGVDMKHIRWFGILQRIAVVYFV 176

Query: 183 VALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSD 242
           VALIETLTTKRRP V++  H SI +AY+WQWIGGF+AF+IY+ITTY+LYVP+WSF    D
Sbjct: 177 VALIETLTTKRRPTVIDSGHFSILSAYKWQWIGGFVAFLIYMITTYALYVPDWSFVIDQD 236

Query: 243 HGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLRE 302
           H  K+Y VKCGMRGHLGPACNAVGYVDR++WGINHLYS PVW+RL+ACTLSSPNSGP RE
Sbjct: 237 HEAKRYTVKCGMRGHLGPACNAVGYVDRQVWGINHLYSQPVWTRLKACTLSSPNSGPFRE 296

Query: 303 DAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAI 362
           DAPSWC APFEPEGLLSTISAILSGTIGIHYGHVLIHFKGH+ RLK WVSMG  LLI+AI
Sbjct: 297 DAPSWCYAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHAERLKQWVSMGIVLLIVAI 356

Query: 363 ILHFTNAIPINKQLYSFSYVCFTADGCLGITNSIFIPEMDWDECDACFRAWSTRYPSRIC 422
           ILHFT+AIPINKQLYSFSYVCFTA G  GI  S F   +D          W  R P    
Sbjct: 357 ILHFTDAIPINKQLYSFSYVCFTA-GAAGIVLSAFYLVID---------VWGFRTPFLFL 406

Query: 423 KWM 425
           +W+
Sbjct: 407 EWI 409




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|147817637|emb|CAN64496.1| hypothetical protein VITISV_004036 [Vitis vinifera] Back     alignment and taxonomy information
>gi|255556868|ref|XP_002519467.1| conserved hypothetical protein [Ricinus communis] gi|223541330|gb|EEF42881.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|224069583|ref|XP_002326379.1| predicted protein [Populus trichocarpa] gi|222833572|gb|EEE72049.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|356503734|ref|XP_003520659.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Glycine max] Back     alignment and taxonomy information
>gi|356570776|ref|XP_003553560.1| PREDICTED: LOW QUALITY PROTEIN: heparan-alpha-glucosaminide N-acetyltransferase-like [Glycine max] Back     alignment and taxonomy information
>gi|356548323|ref|XP_003542552.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Glycine max] Back     alignment and taxonomy information
>gi|449440411|ref|XP_004137978.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Cucumis sativus] gi|449517341|ref|XP_004165704.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|357152403|ref|XP_003576108.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Brachypodium distachyon] Back     alignment and taxonomy information
>gi|242067981|ref|XP_002449267.1| hypothetical protein SORBIDRAFT_05g006970 [Sorghum bicolor] gi|241935110|gb|EES08255.1| hypothetical protein SORBIDRAFT_05g006970 [Sorghum bicolor] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query427
TAIR|locus:2180305472 AT5G27730 "AT5G27730" [Arabido 0.800 0.724 0.579 1.8e-110
TAIR|locus:2160902440 AT5G47900 "AT5G47900" [Arabido 0.878 0.852 0.445 1.8e-85
MGI|MGI:1196297656 Hgsnat "heparan-alpha-glucosam 0.480 0.312 0.352 1e-28
UNIPROTKB|F1NBK1584 HGSNAT "Uncharacterized protei 0.622 0.455 0.291 3.2e-28
UNIPROTKB|Q68CP4663 HGSNAT "Heparan-alpha-glucosam 0.480 0.309 0.333 2.8e-26
UNIPROTKB|F1MF45592 HGSNAT "Uncharacterized protei 0.564 0.407 0.317 2.9e-26
DICTYBASE|DDB_G0286315 675 DDB_G0286315 "transmembrane pr 0.281 0.177 0.418 1.1e-20
DICTYBASE|DDB_G0270192426 DDB_G0270192 "DUF1624 family p 0.262 0.262 0.385 1.7e-19
UNIPROTKB|Q489U3358 CPS_0413 "Putative membrane pr 0.285 0.340 0.384 5.6e-19
TIGR_CMR|CPS_0413358 CPS_0413 "putative membrane pr 0.285 0.340 0.384 5.6e-19
TAIR|locus:2180305 AT5G27730 "AT5G27730" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1091 (389.1 bits), Expect = 1.8e-110, P = 1.8e-110
 Identities = 201/347 (57%), Positives = 252/347 (72%)

Query:    62 RVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALK 121
             R+A+LD FRGLTV LMILVDDAGG +  I H+PWNGC LADFVMPFFLFIVGV+IAL+LK
Sbjct:    36 RLASLDIFRGLTVALMILVDDAGGDWPMIAHAPWNGCNLADFVMPFFLFIVGVSIALSLK 95

Query:   122 KVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYV 181
             ++     A KK+ FRT KLLFWG++LQGG+SHAPD L+YGVD+  +R+CGILQRIAL Y+
Sbjct:    96 RISNKFEACKKVGFRTCKLLFWGLLLQGGFSHAPDELTYGVDVTMMRFCGILQRIALSYL 155

Query:   182 VVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHS 241
             VVAL+E  T       L     SIF +Y W WI      VIY+ T Y  YVP+W F  + 
Sbjct:   156 VVALVEIFTKDSHEENLSTGRFSIFKSYYWHWIVAASVLVIYLATLYGTYVPDWEFVVYD 215

Query:   242 DHGV---KKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSG 298
                V   K   V CG+RG L P CNAVGYVDR++ GINH+Y  P W R +ACT  SP  G
Sbjct:   216 KDSVLYGKILSVSCGVRGKLNPPCNAVGYVDRQVLGINHMYHHPAWRRSKACTDDSPYEG 275

Query:   299 PLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLL 358
              +R+DAPSWCRAPFEPEG+LS+ISAILS  IG+H+GH+++H KGHSARLKHW+S G  LL
Sbjct:   276 AIRQDAPSWCRAPFEPEGILSSISAILSTIIGVHFGHIILHLKGHSARLKHWISTGLVLL 335

Query:   359 IIAIILHFTNAIPINKQLYSFSYVCFTADGCLGITNSIF--IPEMDW 403
              + + LHFT+ +P+NKQLYSFSY+C T+     + +S++  +  ++W
Sbjct:   336 ALGLTLHFTHLMPLNKQLYSFSYICVTSGAAALVFSSLYSLVDILEW 382




GO:0003674 "molecular_function" evidence=ND
GO:0005575 "cellular_component" evidence=ND
GO:0008150 "biological_process" evidence=ND
GO:0016556 "mRNA modification" evidence=RCA
TAIR|locus:2160902 AT5G47900 "AT5G47900" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
MGI|MGI:1196297 Hgsnat "heparan-alpha-glucosaminide N-acetyltransferase" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
UNIPROTKB|F1NBK1 HGSNAT "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
UNIPROTKB|Q68CP4 HGSNAT "Heparan-alpha-glucosaminide N-acetyltransferase" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|F1MF45 HGSNAT "Uncharacterized protein" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
DICTYBASE|DDB_G0286315 DDB_G0286315 "transmembrane protein" [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms
DICTYBASE|DDB_G0270192 DDB_G0270192 "DUF1624 family protein" [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms
UNIPROTKB|Q489U3 CPS_0413 "Putative membrane protein" [Colwellia psychrerythraea 34H (taxid:167879)] Back     alignment and assigned GO terms
TIGR_CMR|CPS_0413 CPS_0413 "putative membrane protein" [Colwellia psychrerythraea 34H (taxid:167879)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query427
COG4299371 COG4299, COG4299, Uncharacterized protein conserve 1e-28
>gnl|CDD|226749 COG4299, COG4299, Uncharacterized protein conserved in bacteria [Function unknown] Back     alignment and domain information
 Score =  115 bits (289), Expect = 1e-28
 Identities = 89/334 (26%), Positives = 128/334 (38%), Gaps = 77/334 (23%)

Query: 58  QKSKRVATLDAFRGLTVVLMILVDDAG---GAYARIDHSPWNGCTLADFVMPFFLFIVGV 114
           Q + R+ +LD FRGLTV+LMILV++AG     Y ++ H+ W G TL D V P+FLF VG 
Sbjct: 3   QPAFRLTSLDVFRGLTVLLMILVNNAGLGDSTYRQLSHAHWGGLTLTDLVFPWFLFCVGA 62

Query: 115 AIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVD-MKHIRWCGIL 173
           A+  +  K+ K N      ++R     F    L G +        + V      R  G+L
Sbjct: 63  AMPFSASKMNKANVTTWP-LYRRAAERFALGYLMGAF---VTVRDWSVTSHSLTRGMGVL 118

Query: 174 QRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP 233
           QRIAL Y+  AL+      R                 WQ +   +    Y +       P
Sbjct: 119 QRIALAYLFAALLVRQLRGR-----------------WQALLAAVLLAGYWLFLMFTPHP 161

Query: 234 NWSFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLS 293
                                   LG   N     D      +HLYS             
Sbjct: 162 AAP---------------------LGGIGNVGESADPLQILNDHLYS------------- 187

Query: 294 SPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHS-ARLKHWVS 352
                       +     F+PEGLLST+   +    G      L    G+  A L     
Sbjct: 188 ------------AD--GGFDPEGLLSTVPTTVLVLAGYLAARPLQQKPGNPRAPLL---L 230

Query: 353 MGFGLLIIAIILHFTNAIPINKQLYSFSYVCFTA 386
            G G+++ A+   +    PI+K+L++ SYV +TA
Sbjct: 231 AGLGVVLTALGYGWAGRFPISKKLWTSSYVLYTA 264


Length = 371

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 427
KOG4683549 consensus Uncharacterized conserved protein [Funct 100.0
COG4299371 Uncharacterized protein conserved in bacteria [Fun 100.0
PF07786223 DUF1624: Protein of unknown function (DUF1624); In 99.87
COG3503323 Predicted membrane protein [Function unknown] 99.49
COG2311394 Predicted membrane protein [Function unknown] 99.31
PRK10835373 hypothetical protein; Provisional 98.7
PF10129358 OpgC_C: OpgC protein; InterPro: IPR014550 There is 98.0
PF01757340 Acyl_transf_3: Acyltransferase family; InterPro: I 97.64
PF06423136 GWT1: GWT1; InterPro: IPR009447 Glycosylphosphatid 97.41
PRK03854375 opgC glucans biosynthesis protein; Provisional 96.95
COG4645410 Uncharacterized protein conserved in bacteria [Fun 95.64
COG3274332 Predicted O-acyltransferase [General function pred 90.02
COG1835386 Predicted acyltransferases [Lipid metabolism] 85.49
PF15345 233 TMEM51: Transmembrane protein 51 84.17
>KOG4683 consensus Uncharacterized conserved protein [Function unknown] Back     alignment and domain information
Probab=100.00  E-value=4.4e-87  Score=664.51  Aligned_cols=357  Identities=40%  Similarity=0.702  Sum_probs=311.9

Q ss_pred             hhhcccccCchhhHHHHHHHHHHHHHHHHhcccccccCccccCCCCchhHHHHHHHHHHHHHHHHHHHhcccCCchHHHH
Q 014321           52 LQQLLQQKSKRVATLDAFRGLTVVLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK  131 (427)
Q Consensus        52 ~~~~~~~~~~Rl~SLD~lRGlti~lMIlVN~~g~~~~~l~Ha~W~G~t~aDlVfP~FlFivGvSi~ls~~r~~~k~~~~~  131 (427)
                      +.+..++.++|+.|||+|||+|+++||+||+.|+.||+++||+|||++++|+|||+|+||||+|+++|+++..+|.+..|
T Consensus       133 ~~rsla~~r~RL~SLD~FRGltValMIlVdd~GG~~p~I~HapWnG~~LADfVmPfFLfIvGVsials~K~~s~rf~a~r  212 (549)
T KOG4683|consen  133 EARSLATQRKRLRSLDTFRGLTVALMILVDDGGGGYPWIEHAPWNGLHLADFVMPFFLFIVGVSIALSVKSQSSRFSATR  212 (549)
T ss_pred             hhhhcCCCchhhhhhhhhcCceEEEEEEEecCCCCchhhhcCCcCCccHHHHHHHHHHHHHHhhhhhhhhhhhhhhhHhH
Confidence            44555666789999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             HHHHHHHHHHHHHHHHhhcCCCCCCccccCccCccceecchHHHHHHHHHHHHHHHHHhccCCCCCCCc-cchhhhhhhH
Q 014321          132 KIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPNVLEP-RHLSIFTAYQ  210 (427)
Q Consensus       132 kilrRsl~LfllGlll~~~f~~~~~~l~~~~~~~~~Ri~GVLQrIal~Yli~all~l~~~~~~~~~~~~-~~~~~~~~~~  210 (427)
                      |.-.|+.+|+++|++++++|.|+++++|++.|.+.+|++|||||+|++|+++|++..+..+....+..- +.+.....+.
T Consensus       213 Ka~~R~cklllwgLflqGgf~h~~~nLTygidve~lR~mGILQr~~~ayLVvAi~~~~~~~~~~~~~S~~R~V~~~~L~~  292 (549)
T KOG4683|consen  213 KAKARICKLLLWGLFLQGGFLHSMSNLTYGIDVEQLRIMGILQRFGVAYLVVAILHTLCCRPISPQRSWQRAVHDVCLFS  292 (549)
T ss_pred             HHHHHHHHHHHHHHHHhhhcccCcccccCCccHHHHHHHHHHHHhhHHHHHHHHHhhhccCCCccccchhhhhhHHHHHH
Confidence            999999999999999999999999999999999999999999999999999999998876643221100 0000000011


Q ss_pred             HHHHHHHHHHHHHHHHHHhcccCCCCccccCCCCcceeeeecCCCCCCC-CCCChhHHHhhhhcCCCCccCCcccccccc
Q 014321          211 WQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLG-PACNAVGYVDRELWGINHLYSDPVWSRLEA  289 (427)
Q Consensus       211 ~q~~~~~~ll~~Y~~l~~~l~vP~~~~~~~~pg~~~~~~~~cg~~g~l~-~~~N~a~~IDr~vLG~~HlY~~p~~~~~~~  289 (427)
                      ....+-..++.-|..++|++.+|+||.+|++|         ||.++... |.||++||.||++||++||||+|++||+|+
T Consensus       293 ~~~~~~~~~V~~~~~~~~~~~~~~~~r~~~~~---------~G~~~~~~~P~CnAvGy~DrqvLGi~HiY~hP~~~r~k~  363 (549)
T KOG4683|consen  293 GELAVLLALVATYLGLTFGLRVPGCPRGYLGP---------GGKHDYNAHPKCNAVGYADRQVLGIAHIYQHPTAKRVKD  363 (549)
T ss_pred             HHHHHHHHhhhhhhceecccccCCCCcccccC---------CcccccCCCCCccchhhhHHhhhhhHHHhcCchHHHhhh
Confidence            11222223334444555666666666666655         55566665 459999999999999999999999999999


Q ss_pred             ccCCCCCCCCCCCCCCCCCCCCCCCCchhhhhHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHhhCCC
Q 014321          290 CTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNA  369 (427)
Q Consensus       290 ~~~~~p~~g~~~~~~~~~~~~~fDPEGlLstipAi~~~llG~~aG~~L~~~~~~~~rl~~~l~~G~~ll~lG~ll~~~~~  369 (427)
                      ||.++|++||++.|+|+||.+|||||||||+|.|++++++|+++|+++.+.+++..|+++|...++++.++|..+++.+.
T Consensus       364 cs~n~P~nG~l~~DAPSWCqapFdPEGilssi~avv~~llG~h~Ghiilh~k~~~sRir~wis~~~~l~llg~tL~~~s~  443 (549)
T KOG4683|consen  364 CSINYPNNGPLPPDAPSWCQAPFDPEGILSSILAVVQVLLGAHAGHIILHHKNFQSRIRRWISLAILLGLLGGTLCGFSA  443 (549)
T ss_pred             cccCCCCCCCCCCCCchhhcCCCChHHHHHHHHHHHHHHHHhhcCeEEEEccchHHHHHHHHHHHHHHHHHhhhhhcccc
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999998879


Q ss_pred             ccccccCCCchhHHHhhhhHHHHHHHhhhhhccCCccccccccccccCCcceEEEecC
Q 014321          370 IPINKQLYSFSYVCFTADGCLGITNSIFIPEMDWDECDACFRAWSTRYPSRICKWMVL  427 (427)
Q Consensus       370 ~PiNK~LWT~SfVL~T~~G~a~llLa~~y~liD~~~~~~~~~v~~~~~~~~~f~v~Gm  427 (427)
                      +|+||||||.||+|+|+ |.|+++|+.+|++||         |+.|+....||+|.||
T Consensus       444 ~Plnk~L~slsfvCVT~-~~A~Li~S~mY~~iD---------v~EW~~~~~P~~~~GM  491 (549)
T KOG4683|consen  444 IPLNKNLWSLSFVCVTV-SLALLILSLMYYFID---------VREWSWSGYPFTECGM  491 (549)
T ss_pred             cchhHhHHHhhhhHHHH-HHHHHHHHHHHHHhh---------HHHhhhccCChhhhcc
Confidence            99999999999999999 999999999999999         9999999999999998



>COG4299 Uncharacterized protein conserved in bacteria [Function unknown] Back     alignment and domain information
>PF07786 DUF1624: Protein of unknown function (DUF1624); InterPro: IPR012429 These sequences are found in hypothetical proteins of unknown function expressed by bacterial and archaeal species Back     alignment and domain information
>COG3503 Predicted membrane protein [Function unknown] Back     alignment and domain information
>COG2311 Predicted membrane protein [Function unknown] Back     alignment and domain information
>PRK10835 hypothetical protein; Provisional Back     alignment and domain information
>PF10129 OpgC_C: OpgC protein; InterPro: IPR014550 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function Back     alignment and domain information
>PF01757 Acyl_transf_3: Acyltransferase family; InterPro: IPR002656 This entry contains a range of acyltransferase enzymes as well as yet uncharacterised proteins from Caenorhabditis elegans Back     alignment and domain information
>PF06423 GWT1: GWT1; InterPro: IPR009447 Glycosylphosphatidylinositol (GPI) is a conserved post-translational modification to anchor cell surface proteins to plasma membrane in eukaryotes Back     alignment and domain information
>PRK03854 opgC glucans biosynthesis protein; Provisional Back     alignment and domain information
>COG4645 Uncharacterized protein conserved in bacteria [Function unknown] Back     alignment and domain information
>COG3274 Predicted O-acyltransferase [General function prediction only] Back     alignment and domain information
>COG1835 Predicted acyltransferases [Lipid metabolism] Back     alignment and domain information
>PF15345 TMEM51: Transmembrane protein 51 Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00