Citrus Sinensis ID: 017230


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-----
MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV
ccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHccccccccccccccccccccHHHHHHHHHHHHHHEEEEEcccccccHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccccccccccEEEccccccccccccHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccHHHHHHHHHHHHHHHHHHHHHc
ccccccccccccHHHccccccccccccccccccHHHHHHHHHHHHHHHHHEEcccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccHHHccHHHHHHHHHHHHHHHHHHHHHccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccccccccccEEcccccccccccccEEEEEHHHHcccHcccccccHccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHHHHHHHHHcccccccccccHHHHHHHHHHHHHHHHHHHHHHHc
mseikaetthhhpliisepdvsdqqekshlktqrLASLDIFRGLAVALMILVDhaggdwpeishapwngcnladfvMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILlqggfshapdeltygvDVRMIRLCGVLQRIALSYLLVSLVEIFTkdvqdkdqsvgrfsIFRLYCWHWLMAACVLVVYLALLYgtyvpdwqftiinkdsadygkvFNVTCgvraklnppcnavgyidrkvlginhmyhhpawrrskactqdspfegplrkdapswchapfepegllSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLhftnaiplnkqlYTLSYVCVTSGAAALVFSAIYALV
mseikaetthhhpliisepdvsdQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTkdvqdkdqsvgRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCgvraklnppcNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV
MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMaacvlvvylallygtyvPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGllssvssilstiigvhfghviihTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV
*********************************RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT*************PSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYAL*
*************************************LDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV
**********HHPLIISEP************TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV
********************************QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooooHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHooooHHHHHHHHHHHHHHHHHHHHHHHi
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooooooooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHooooooooooooooooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooHHHHHHHHHHHHHHHHHHHHii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooHHHHHHHHHHHHHHHHHHHiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHoooooooooooooooooooHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooHHHHHHHHHHHHHHHHHHHiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHoooooooooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHoooooHHHHHHHHHHHHHHHHHHHHHHHHi
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query375 2.2.26 [Sep-21-2011]
Q3UDW8656 Heparan-alpha-glucosamini yes no 0.864 0.493 0.284 2e-31
Q68CP4663 Heparan-alpha-glucosamini yes no 0.821 0.464 0.302 3e-31
>sp|Q3UDW8|HGNAT_MOUSE Heparan-alpha-glucosaminide N-acetyltransferase OS=Mus musculus GN=Hgsnat PE=1 SV=2 Back     alignment and function desciption
 Score =  136 bits (343), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 107/376 (28%), Positives = 175/376 (46%), Gaps = 52/376 (13%)

Query: 17  SEPDVSD-QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
           ++P  +D Q E       RL  +D FRGLA+ LM+ V++ GG +    H+ WNG  +AD 
Sbjct: 242 ADPLSADYQPETRRSSANRLRCVDTFRGLALVLMVFVNYGGGKYWYFKHSSWNGLTVADL 301

Query: 76  VMPFFLFIVGVAIALALKRIPDRA----DAVKKVIFRTLKLLFWGILLQGGFSHAPDELT 131
           V P+F+FI+G +I L++  I  R       + K+++R+  L+  G+++       P+   
Sbjct: 302 VFPWFVFIMGTSIFLSMTSILQRGCSKLKLLGKIVWRSFLLICIGVIIVN-----PNYCL 356

Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFT-KDVQDKDQSVGRFSIFRLY----CW-HW 185
             +    +R+ GVLQR+ ++Y +V+++E F  K V   D      S F L      W  W
Sbjct: 357 GPLSWDKVRIPGVLQRLGVTYFVVAVLEFFFWKPV--PDSCTLESSCFSLRDITSSWPQW 414

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYID 244
           L    +  ++LAL +   VP      +      D GK  + T G          A GYID
Sbjct: 415 LTILTLESIWLALTFFLPVPGCPTGYLGPGGIGDLGKYPHCTGG----------AAGYID 464

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R +LG NH+Y HP+                            ++PEG+L +++SI+   +
Sbjct: 465 RLLLGDNHLYQHPS------------------STVLYHTEVAYDPEGVLGTINSIVMAFL 506

Query: 305 GVHFGHVIIH----TKGHLARLKQWVT-MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
           GV  G ++++    TK  L R   W   +G   ++          IP+NK L+++SYV  
Sbjct: 507 GVQAGKILVYYKDQTKAILTRFAAWCCILGLISIVLTKVSANEGFIPINKNLWSISYVTT 566

Query: 360 TSGAAALVFSAIYALV 375
            S  A  +   +Y +V
Sbjct: 567 LSCFAFFILLILYPVV 582




Lysosomal acetyltransferase that acetylates the non-reducing terminal alpha-glucosamine residue of intralysosomal heparin or heparan sulfate, converting it into a substrate for luminal alpha-N-acetyl glucosaminidase.
Mus musculus (taxid: 10090)
EC: 2EC: .EC: 3EC: .EC: 1EC: .EC: 7EC: 8
>sp|Q68CP4|HGNAT_HUMAN Heparan-alpha-glucosaminide N-acetyltransferase OS=Homo sapiens GN=HGSNAT PE=1 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query375
18421151472 heparan-alpha-glucosaminide N-acetyltran 1.0 0.794 0.721 1e-158
224125166468 predicted protein [Populus trichocarpa] 0.992 0.794 0.702 1e-157
297812935453 hypothetical protein ARALYDRAFT_489556 [ 0.949 0.785 0.721 1e-155
359487632 499 PREDICTED: LOW QUALITY PROTEIN: heparan- 0.970 0.729 0.732 1e-152
242059773 481 hypothetical protein SORBIDRAFT_03g04483 0.949 0.740 0.688 1e-149
449454063 490 PREDICTED: heparan-alpha-glucosaminide N 0.997 0.763 0.706 1e-149
326493552 486 predicted protein [Hordeum vulgare subsp 0.984 0.759 0.664 1e-147
357126662 485 PREDICTED: heparan-alpha-glucosaminide N 0.92 0.711 0.698 1e-144
212723192 484 uncharacterized protein LOC100193372 [Ze 0.949 0.735 0.679 1e-142
296089693 481 unnamed protein product [Vitis vinifera] 0.922 0.719 0.702 1e-142
>gi|18421151|ref|NP_568500.1| heparan-alpha-glucosaminide N-acetyltransferase [Arabidopsis thaliana] gi|14334592|gb|AAK59474.1| unknown protein [Arabidopsis thaliana] gi|26983902|gb|AAN86203.1| unknown protein [Arabidopsis thaliana] gi|332006336|gb|AED93719.1| heparan-alpha-glucosaminide N-acetyltransferase [Arabidopsis thaliana] Back     alignment and taxonomy information
 Score =  565 bits (1457), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 272/377 (72%), Positives = 320/377 (84%), Gaps = 2/377 (0%)

Query: 1   MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQR--LASLDIFRGLAVALMILVDHAGGD 58
           M+EIK E +H   L+  + D S    +  L   R  LASLDIFRGL VALMILVD AGGD
Sbjct: 1   MAEIKVERSHDQHLLEPKEDTSSSYTRRSLAGNRPRLASLDIFRGLTVALMILVDDAGGD 60

Query: 59  WPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGIL 118
           WP I+HAPWNGCNLADFVMPFFLFIVGV+IAL+LKRI ++ +A KKV FRT KLLFWG+L
Sbjct: 61  WPMIAHAPWNGCNLADFVMPFFLFIVGVSIALSLKRISNKFEACKKVGFRTCKLLFWGLL 120

Query: 119 LQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIF 178
           LQGGFSHAPDELTYGVDV M+R CG+LQRIALSYL+V+LVEIFTKD  +++ S GRFSIF
Sbjct: 121 LQGGFSHAPDELTYGVDVTMMRFCGILQRIALSYLVVALVEIFTKDSHEENLSTGRFSIF 180

Query: 179 RLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCN 238
           + Y WHW++AA VLV+YLA LYGTYVPDW+F + +KDS  YGK+ +V+CGVR KLNPPCN
Sbjct: 181 KSYYWHWIVAASVLVIYLATLYGTYVPDWEFVVYDKDSVLYGKILSVSCGVRGKLNPPCN 240

Query: 239 AVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSS 298
           AVGY+DR+VLGINHMYHHPAWRRSKACT DSP+EG +R+DAPSWC APFEPEG+LSS+S+
Sbjct: 241 AVGYVDRQVLGINHMYHHPAWRRSKACTDDSPYEGAIRQDAPSWCRAPFEPEGILSSISA 300

Query: 299 ILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVC 358
           ILSTIIGVHFGH+I+H KGH ARLK W++ G  LL  GLTLHFT+ +PLNKQLY+ SY+C
Sbjct: 301 ILSTIIGVHFGHIILHLKGHSARLKHWISTGLVLLALGLTLHFTHLMPLNKQLYSFSYIC 360

Query: 359 VTSGAAALVFSAIYALV 375
           VTSGAAALVFS++Y+LV
Sbjct: 361 VTSGAAALVFSSLYSLV 377




Source: Arabidopsis thaliana

Species: Arabidopsis thaliana

Genus: Arabidopsis

Family: Brassicaceae

Order: Brassicales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224125166|ref|XP_002319516.1| predicted protein [Populus trichocarpa] gi|222857892|gb|EEE95439.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|297812935|ref|XP_002874351.1| hypothetical protein ARALYDRAFT_489556 [Arabidopsis lyrata subsp. lyrata] gi|297320188|gb|EFH50610.1| hypothetical protein ARALYDRAFT_489556 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|359487632|ref|XP_003633626.1| PREDICTED: LOW QUALITY PROTEIN: heparan-alpha-glucosaminide N-acetyltransferase-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|242059773|ref|XP_002459032.1| hypothetical protein SORBIDRAFT_03g044830 [Sorghum bicolor] gi|241931007|gb|EES04152.1| hypothetical protein SORBIDRAFT_03g044830 [Sorghum bicolor] Back     alignment and taxonomy information
>gi|449454063|ref|XP_004144775.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Cucumis sativus] gi|449490878|ref|XP_004158735.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|326493552|dbj|BAJ85237.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326511587|dbj|BAJ91938.1| predicted protein [Hordeum vulgare subsp. vulgare] Back     alignment and taxonomy information
>gi|357126662|ref|XP_003565006.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Brachypodium distachyon] Back     alignment and taxonomy information
>gi|212723192|ref|NP_001131974.1| uncharacterized protein LOC100193372 [Zea mays] gi|194693076|gb|ACF80622.1| unknown [Zea mays] gi|413951397|gb|AFW84046.1| hypothetical protein ZEAMMB73_047978 [Zea mays] Back     alignment and taxonomy information
>gi|296089693|emb|CBI39512.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query375
TAIR|locus:2180305472 AT5G27730 "AT5G27730" [Arabido 1.0 0.794 0.636 2.1e-132
TAIR|locus:2160902440 AT5G47900 "AT5G47900" [Arabido 0.904 0.770 0.459 1.4e-78
MGI|MGI:1196297656 Hgsnat "heparan-alpha-glucosam 0.605 0.346 0.301 8.9e-26
UNIPROTKB|Q68CP4663 HGSNAT "Heparan-alpha-glucosam 0.56 0.316 0.307 1.6e-25
UNIPROTKB|F1NBK1584 HGSNAT "Uncharacterized protei 0.357 0.229 0.382 1.2e-23
UNIPROTKB|F1MF45592 HGSNAT "Uncharacterized protei 0.562 0.356 0.309 2e-22
DICTYBASE|DDB_G0286315 675 DDB_G0286315 "transmembrane pr 0.344 0.191 0.389 2.2e-21
DICTYBASE|DDB_G0270192426 DDB_G0270192 "DUF1624 family p 0.309 0.272 0.398 1.5e-18
UNIPROTKB|Q489U3358 CPS_0413 "Putative membrane pr 0.304 0.318 0.341 3.9e-15
TIGR_CMR|CPS_0413358 CPS_0413 "putative membrane pr 0.304 0.318 0.341 3.9e-15
TAIR|locus:2180305 AT5G27730 "AT5G27730" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1298 (462.0 bits), Expect = 2.1e-132, P = 2.1e-132
 Identities = 240/377 (63%), Positives = 282/377 (74%)

Query:     1 MSEIKAETTHHHPLIISEPDVSDQQEKSHL--KTQRLASLDIFRGLAVALMILVDHAGGD 58
             M+EIK E +H   L+  + D S    +  L     RLASLDIFRGL VALMILVD AGGD
Sbjct:     1 MAEIKVERSHDQHLLEPKEDTSSSYTRRSLAGNRPRLASLDIFRGLTVALMILVDDAGGD 60

Query:    59 WPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGIL 118
             WP I+HAPWNGCNLADFVMPFFLFIVGV+IAL+LKRI ++ +A KKV FRT KLLFWG+L
Sbjct:    61 WPMIAHAPWNGCNLADFVMPFFLFIVGVSIALSLKRISNKFEACKKVGFRTCKLLFWGLL 120

Query:   119 LQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIF 178
             LQGGFSHAPDELTYGVDV M+R CG+LQRIALSYL+V+LVEIFTKD  +++ S GRFSIF
Sbjct:   121 LQGGFSHAPDELTYGVDVTMMRFCGILQRIALSYLVVALVEIFTKDSHEENLSTGRFSIF 180

Query:   179 RLYCWHWLMXXXXXXXXXXXXXXXXXPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCN 238
             + Y WHW++                 PDW+F + +KDS  YGK+ +V+CGVR KLNPPCN
Sbjct:   181 KSYYWHWIVAASVLVIYLATLYGTYVPDWEFVVYDKDSVLYGKILSVSCGVRGKLNPPCN 240

Query:   239 AVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGXXXXXXX 298
             AVGY+DR+VLGINHMYHHPAWRRSKACT DSP+EG +R+DAPSWC APFEPEG       
Sbjct:   241 AVGYVDRQVLGINHMYHHPAWRRSKACTDDSPYEGAIRQDAPSWCRAPFEPEGILSSISA 300

Query:   299 XXXXXXXXXXXXXXXXTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVC 358
                              KGH ARLK W++ G  LL  GLTLHFT+ +PLNKQLY+ SY+C
Sbjct:   301 ILSTIIGVHFGHIILHLKGHSARLKHWISTGLVLLALGLTLHFTHLMPLNKQLYSFSYIC 360

Query:   359 VTSGAAALVFSAIYALV 375
             VTSGAAALVFS++Y+LV
Sbjct:   361 VTSGAAALVFSSLYSLV 377




GO:0003674 "molecular_function" evidence=ND
GO:0005575 "cellular_component" evidence=ND
GO:0008150 "biological_process" evidence=ND
GO:0016556 "mRNA modification" evidence=RCA
TAIR|locus:2160902 AT5G47900 "AT5G47900" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
MGI|MGI:1196297 Hgsnat "heparan-alpha-glucosaminide N-acetyltransferase" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
UNIPROTKB|Q68CP4 HGSNAT "Heparan-alpha-glucosaminide N-acetyltransferase" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|F1NBK1 HGSNAT "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
UNIPROTKB|F1MF45 HGSNAT "Uncharacterized protein" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
DICTYBASE|DDB_G0286315 DDB_G0286315 "transmembrane protein" [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms
DICTYBASE|DDB_G0270192 DDB_G0270192 "DUF1624 family protein" [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms
UNIPROTKB|Q489U3 CPS_0413 "Putative membrane protein" [Colwellia psychrerythraea 34H (taxid:167879)] Back     alignment and assigned GO terms
TIGR_CMR|CPS_0413 CPS_0413 "putative membrane protein" [Colwellia psychrerythraea 34H (taxid:167879)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
AT5G27730
unknown protein; unknown protein; FUNCTIONS IN- molecular_function unknown; INVOLVED IN- biological_process unknown; LOCATED IN- cellular_component unknown; EXPRESSED IN- 22 plant structures; EXPRESSED DURING- 14 growth stages; BEST Arabidopsis thaliana protein match is- unknown protein (TAIR-AT5G47900.1); Has 651 Blast hits to 616 proteins in 129 species- Archae - 0; Bacteria - 235; Metazoa - 100; Fungi - 0; Plants - 60; Viruses - 0; Other Eukaryotes - 256 (source- NCBI BLink). (472 aa)
(Arabidopsis thaliana)
Predicted Functional Partners:
AT5G13690
alpha-N-acetylglucosaminidase family / NAGLU family; alpha-N-acetylglucosaminidase family / NAG [...] (806 aa)
     0.927

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query375
COG4299371 COG4299, COG4299, Uncharacterized protein conserve 6e-35
pfam07786222 pfam07786, DUF1624, Protein of unknown function (D 6e-04
pfam01757326 pfam01757, Acyl_transf_3, Acyltransferase family 0.002
pfam10129358 pfam10129, OpgC_C, OpgC protein 0.002
>gnl|CDD|226749 COG4299, COG4299, Uncharacterized protein conserved in bacteria [Function unknown] Back     alignment and domain information
 Score =  131 bits (331), Expect = 6e-35
 Identities = 96/346 (27%), Positives = 138/346 (39%), Gaps = 78/346 (22%)

Query: 34  RLASLDIFRGLAVALMILVDHAG-GD--WPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           RL SLD+FRGL V LMILV++AG GD  + ++SHA W G  L D V P+FLF VG A+  
Sbjct: 7   RLTSLDVFRGLTVLLMILVNNAGLGDSTYRQLSHAHWGGLTLTDLVFPWFLFCVGAAMPF 66

Query: 91  ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD-VRMIRLCGVLQRIA 149
           +  +  ++A+     ++R     F    L G F        + V    + R  GVLQRIA
Sbjct: 67  SASK-MNKANVTTWPLYRRAAERFALGYLMGAF---VTVRDWSVTSHSLTRGMGVLQRIA 122

Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
           L+YL  +L+           Q  GR        W  L+AA +L  Y   L  T  P    
Sbjct: 123 LAYLFAALLVR---------QLRGR--------WQALLAAVLLAGYWLFLMFTPHPAAPL 165

Query: 210 TIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
             I                         N     D   +  +H+Y               
Sbjct: 166 GGIG------------------------NVGESADPLQILNDHLY--------------- 186

Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMG 329
                   D        F+PEGLLS+V + +  + G      +    G+         +G
Sbjct: 187 ------SAD------GGFDPEGLLSTVPTTVLVLAGYLAARPLQQKPGNPRAPLLLAGLG 234

Query: 330 FALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV 375
             L   G    +    P++K+L+T SYV  T+G   L+ +A + L 
Sbjct: 235 VVLTALGYG--WAGRFPISKKLWTSSYVLYTAGLGLLLLAACWVLA 278


Length = 371

>gnl|CDD|219576 pfam07786, DUF1624, Protein of unknown function (DUF1624) Back     alignment and domain information
>gnl|CDD|216682 pfam01757, Acyl_transf_3, Acyltransferase family Back     alignment and domain information
>gnl|CDD|220589 pfam10129, OpgC_C, OpgC protein Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 375
KOG4683549 consensus Uncharacterized conserved protein [Funct 100.0
COG4299371 Uncharacterized protein conserved in bacteria [Fun 100.0
PF07786223 DUF1624: Protein of unknown function (DUF1624); In 99.88
COG3503323 Predicted membrane protein [Function unknown] 99.71
COG2311394 Predicted membrane protein [Function unknown] 99.44
PRK10835373 hypothetical protein; Provisional 98.94
PF10129358 OpgC_C: OpgC protein; InterPro: IPR014550 There is 98.59
COG4645410 Uncharacterized protein conserved in bacteria [Fun 97.73
PF01757340 Acyl_transf_3: Acyltransferase family; InterPro: I 97.71
PF06423136 GWT1: GWT1; InterPro: IPR009447 Glycosylphosphatid 97.44
PRK03854375 opgC glucans biosynthesis protein; Provisional 97.2
COG3274332 Predicted O-acyltransferase [General function pred 95.74
COG1835386 Predicted acyltransferases [Lipid metabolism] 92.74
COG3594343 NolL Fucose 4-O-acetylase and related acetyltransf 86.92
>KOG4683 consensus Uncharacterized conserved protein [Function unknown] Back     alignment and domain information
Probab=100.00  E-value=1.8e-76  Score=574.77  Aligned_cols=364  Identities=42%  Similarity=0.736  Sum_probs=309.4

Q ss_pred             CcccccccccCCCcccCCCCcchhhhc--cccccchhHHHHHHHHHHHHHHHHHhccCCCccccccCCCcchhhHHHHHH
Q 017230            1 MSEIKAETTHHHPLIISEPDVSDQQEK--SHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMP   78 (375)
Q Consensus         1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~Ri~sLD~lRGlai~~MIlvn~~~~~~~~l~ha~W~G~t~~DlvfP   78 (375)
                      |.|||.|..|.|-+--...|.|.-+|+  .+++++|+.|||++||+++++||+||+.|+.||..+|++|||++++|+|||
T Consensus       108 ~~~ik~~~~~d~~~~E~k~~~ss~~~~rsla~~r~RL~SLD~FRGltValMIlVdd~GG~~p~I~HapWnG~~LADfVmP  187 (549)
T KOG4683|consen  108 ALKIKSCAWRDYRYDEAKAAASSIGEARSLATQRKRLRSLDTFRGLTVALMILVDDGGGGYPWIEHAPWNGLHLADFVMP  187 (549)
T ss_pred             HHHHhhccchhhhhccchhhhhhhhhhhhcCCCchhhhhhhhhcCceEEEEEEEecCCCCchhhhcCCcCCccHHHHHHH
Confidence            458899988887765444444444443  445567999999999999999999999999999999999999999999999


Q ss_pred             HHHHHHHHHHHHHhccCCchhHHHHHHHHHHHHHHHHHHHHhhccCCCCCcccccccccceeehhhHHHHHHHHHHHHHH
Q 017230           79 FFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV  158 (375)
Q Consensus        79 ~FlFl~G~si~ls~~r~~~~~~~~~~i~~R~l~L~~lGl~l~~~~~~~~~~~~~~~~~~~~r~~GVLqrIgl~Yll~all  158 (375)
                      +|+|++|+|+++|+++...|.+..||...|..+|++.|++++++|.+..+++|++.|.+++|++|||||+|++|++.|++
T Consensus       188 fFLfIvGVsials~K~~s~rf~a~rKa~~R~cklllwgLflqGgf~h~~~nLTygidve~lR~mGILQr~~~ayLVvAi~  267 (549)
T KOG4683|consen  188 FFLFIVGVSIALSVKSQSSRFSATRKAKARICKLLLWGLFLQGGFLHSMSNLTYGIDVEQLRIMGILQRFGVAYLVVAIL  267 (549)
T ss_pred             HHHHHHHhhhhhhhhhhhhhhhHhHHHHHHHHHHHHHHHHHhhhcccCcccccCCccHHHHHHHHHHHHhhHHHHHHHHH
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             HHHhcccccccccccchhhhhhhHHHHHHHHHHHHHHHHHHHhcccCCCcccccCCCCCCcCccccccccccccCC-CCC
Q 017230          159 EIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLN-PPC  237 (375)
Q Consensus       159 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~y~~l~~~l~vp~~~~~~~~~~~~~~~~~~~~~~~~~g~~~-~~~  237 (375)
                      ..+..+... .+          ..||-+++...+..|-+..++-+||++.+-..+...+.+++.|.++||.+|..+ |.|
T Consensus       268 ~~~~~~~~~-~~----------~S~~R~V~~~~L~~~~~~~~~~~V~~~~~~~~~~~~~~~~r~~~~~~G~~~~~~~P~C  336 (549)
T KOG4683|consen  268 HTLCCRPIS-PQ----------RSWQRAVHDVCLFSGELAVLLALVATYLGLTFGLRVPGCPRGYLGPGGKHDYNAHPKC  336 (549)
T ss_pred             hhhccCCCc-cc----------cchhhhhhHHHHHHHHHHHHHHhhhhhhceecccccCCCCcccccCCcccccCCCCCc
Confidence            877765211 11          113334444444444444444444444433333333444445555555555554 459


Q ss_pred             ChhhhhhhhhcCccccccCcccccccccCCCCCCCCCCCCCCCCCCCCCCCCCCcccchHHHHHHHHHHHHHHHHhcccc
Q 017230          238 NAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG  317 (375)
Q Consensus       238 n~a~~iDr~vlG~~HlY~~p~~~~~~~~~~~~p~~g~~~~~~~~~~~~~~DPEGlLstlpai~~~llG~~aG~~l~~~~~  317 (375)
                      |+++|.||+++|.+|||++|+|||+|+|++++|++|+++.|+|+||+.||||||+||++.|++++++|.++|+++.+.++
T Consensus       337 nAvGy~DrqvLGi~HiY~hP~~~r~k~cs~n~P~nG~l~~DAPSWCqapFdPEGilssi~avv~~llG~h~Ghiilh~k~  416 (549)
T KOG4683|consen  337 NAVGYADRQVLGIAHIYQHPTAKRVKDCSINYPNNGPLPPDAPSWCQAPFDPEGILSSILAVVQVLLGAHAGHIILHHKN  416 (549)
T ss_pred             cchhhhHHhhhhhHHHhcCchHHHhhhcccCCCCCCCCCCCCchhhcCCCChHHHHHHHHHHHHHHHHhhcCeEEEEccc
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             hHHHHHHHHHHHHHHHHHHHHhhccCCCCCcccCCChhHHHHHHHHHHHHHHHHHHhC
Q 017230          318 HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV  375 (375)
Q Consensus       318 ~~~r~~~ll~~G~~ll~~G~ll~~~~~~PinK~LWT~SfvL~T~G~a~llLa~~y~li  375 (375)
                      +..|+++|...+.++.++|..+++.+.+|+||+|||.||+.+|+|.|+++++..|++|
T Consensus       417 ~~sRir~wis~~~~l~llg~tL~~~s~~Plnk~L~slsfvCVT~~~A~Li~S~mY~~i  474 (549)
T KOG4683|consen  417 FQSRIRRWISLAILLGLLGGTLCGFSAIPLNKNLWSLSFVCVTVSLALLILSLMYYFI  474 (549)
T ss_pred             hHHHHHHHHHHHHHHHHHhhhhhcccccchhHhHHHhhhhHHHHHHHHHHHHHHHHHh
Confidence            9999999999999999999999987789999999999999999999999999999875



>COG4299 Uncharacterized protein conserved in bacteria [Function unknown] Back     alignment and domain information
>PF07786 DUF1624: Protein of unknown function (DUF1624); InterPro: IPR012429 These sequences are found in hypothetical proteins of unknown function expressed by bacterial and archaeal species Back     alignment and domain information
>COG3503 Predicted membrane protein [Function unknown] Back     alignment and domain information
>COG2311 Predicted membrane protein [Function unknown] Back     alignment and domain information
>PRK10835 hypothetical protein; Provisional Back     alignment and domain information
>PF10129 OpgC_C: OpgC protein; InterPro: IPR014550 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function Back     alignment and domain information
>COG4645 Uncharacterized protein conserved in bacteria [Function unknown] Back     alignment and domain information
>PF01757 Acyl_transf_3: Acyltransferase family; InterPro: IPR002656 This entry contains a range of acyltransferase enzymes as well as yet uncharacterised proteins from Caenorhabditis elegans Back     alignment and domain information
>PF06423 GWT1: GWT1; InterPro: IPR009447 Glycosylphosphatidylinositol (GPI) is a conserved post-translational modification to anchor cell surface proteins to plasma membrane in eukaryotes Back     alignment and domain information
>PRK03854 opgC glucans biosynthesis protein; Provisional Back     alignment and domain information
>COG3274 Predicted O-acyltransferase [General function prediction only] Back     alignment and domain information
>COG1835 Predicted acyltransferases [Lipid metabolism] Back     alignment and domain information
>COG3594 NolL Fucose 4-O-acetylase and related acetyltransferases [Carbohydrate transport and metabolism] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00