Citrus Sinensis ID: 012127


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440-------450-------460-------470
MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLPYWIKKHAFLGVWRSRKVSTILYVIFVEILFWGLVTGILHRFGIYWKL
ccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHccccccccccccccccccccHHHHHHHHHHHHHHHHHHccccccccHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccEEccccccccccccEEEccccccccccccHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHcccccHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccHHHHHHHHHHHHHHHHHHHcccccccccccHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHHHHHHcccEEEc
ccccccccccccHHHccccccccccccccccccHHHHHHHHHHHHHHHHHEEcccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccHHHccHHHHHHHHHHHHHHHHHHHHHccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccHEHEHHHHHcccHcccccccHccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHHHHHHHHHccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccEcccHHHHHHHHHHHHHHHHHcccccccccccccHHHHHHHHHHHHcccccHHHHHHHHHHHHHHHHHHHHHHHHHccEEEEc
mseikaetthhhpliisepdvsdqqekshlktqrLASLDIFRGLAVALMILVDhaggdwpeishapwngcnladfvMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILlqggfshapdeltygvDVRMIRLCGVLQRIALSYLLVSLVEIFTkdvqdkdqsvgrfsIFRLYCWHWLMAACVLVVYLALLYgtyvpdwqftiinkdsadygkvFNVTCgvraklnppcnavgyidrkvlginhmyhhpawrrskactqdspfegplrkdapswchapfepegllSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLhftnaiplnkqlYTLSYVCVTSGAAALVFSAIYALVDIwnlkypflplaWIGMNAMLVYVMAAEGIFAgfingwyygdphntlpyWIKKHAFLGVWRSRKVSTILYVIFVEILFWGLVTGILHRFGIYWKL
mseikaetthhhpliisepdvsdQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTkdvqdkdqsvgRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCgvraklnppcNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLPYWIKKHAFLGVWRSRKVSTILYVIFVEILFWGLVTGILHRFGIYWKL
MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMaacvlvvylallygtyvPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGllssvssilstiigvhfghviihTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLPYWIKKHAFLGVWRSRKVSTILYVIFVEILFWGLVTGILHRFGIYWKL
*********************************RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT*************PSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLPYWIKKHAFLGVWRSRKVSTILYVIFVEILFWGLVTGILHRFGIYW**
*************************************LDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLPYWIKKHAFLGVWRSRKVSTILYVIFVEILFWGLVTGILHRFGIYWKL
**********HHPLIISEP************TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLPYWIKKHAFLGVWRSRKVSTILYVIFVEILFWGLVTGILHRFGIYWKL
*******************************TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLPYWIKKHAFLGVWRSRKVSTILYVIFVEILFWGLVTGILHRFGIYWKL
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooooHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHoooooooooooooooooooHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHoooooooooooHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHoooooooooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHoooooHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiii
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLPYWIKKHAFLGVWRSRKVSTILYVIFVEILFWGLVTGILHRFGIYWKL
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query470 2.2.26 [Sep-21-2011]
Q3UDW8656 Heparan-alpha-glucosamini yes no 0.842 0.603 0.283 5e-36
Q68CP4663 Heparan-alpha-glucosamini yes no 0.810 0.574 0.293 8e-35
>sp|Q3UDW8|HGNAT_MOUSE Heparan-alpha-glucosaminide N-acetyltransferase OS=Mus musculus GN=Hgsnat PE=1 SV=2 Back     alignment and function desciption
 Score =  152 bits (384), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 129/455 (28%), Positives = 211/455 (46%), Gaps = 59/455 (12%)

Query: 17  SEPDVSD-QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
           ++P  +D Q E       RL  +D FRGLA+ LM+ V++ GG +    H+ WNG  +AD 
Sbjct: 242 ADPLSADYQPETRRSSANRLRCVDTFRGLALVLMVFVNYGGGKYWYFKHSSWNGLTVADL 301

Query: 76  VMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELT 131
           V P+F+FI+G +I L++  I  R  +    + K+++R+  L+  G+++       P+   
Sbjct: 302 VFPWFVFIMGTSIFLSMTSILQRGCSKLKLLGKIVWRSFLLICIGVIIVN-----PNYCL 356

Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFT-KDVQDKDQSVGRFSIFRLY----CW-HW 185
             +    +R+ GVLQR+ ++Y +V+++E F  K V   D      S F L      W  W
Sbjct: 357 GPLSWDKVRIPGVLQRLGVTYFVVAVLEFFFWKPV--PDSCTLESSCFSLRDITSSWPQW 414

Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYID 244
           L    +  ++LAL +   VP      +      D GK  + T G          A GYID
Sbjct: 415 LTILTLESIWLALTFFLPVPGCPTGYLGPGGIGDLGKYPHCTGG----------AAGYID 464

Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
           R +LG NH+Y HP+                            ++PEG+L +++SI+   +
Sbjct: 465 RLLLGDNHLYQHPS------------------STVLYHTEVAYDPEGVLGTINSIVMAFL 506

Query: 305 GVHFGHVIIH----TKGHLARLKQWVT-MGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
           GV  G ++++    TK  L R   W   +G   ++          IP+NK L+++SYV  
Sbjct: 507 GVQAGKILVYYKDQTKAILTRFAAWCCILGLISIVLTKVSANEGFIPINKNLWSISYVTT 566

Query: 360 TSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPH 419
            S  A  +   +Y +VD+  L +   P  + GMN++LVYV   E +   F   W   D  
Sbjct: 567 LSCFAFFILLILYPVVDVKGL-WTGTPFFYPGMNSILVYV-GHEVLENYFPFQWKLADEQ 624

Query: 420 NTLPYWIKKHAFLGVWRSRKVSTILYVIFVEILFW 454
           +   + I+      +W       I YV++ + LFW
Sbjct: 625 SHKEHLIQNIVATALW-----VLIAYVLYKKKLFW 654




Lysosomal acetyltransferase that acetylates the non-reducing terminal alpha-glucosamine residue of intralysosomal heparin or heparan sulfate, converting it into a substrate for luminal alpha-N-acetyl glucosaminidase.
Mus musculus (taxid: 10090)
EC: 2EC: .EC: 3EC: .EC: 1EC: .EC: 7EC: 8
>sp|Q68CP4|HGNAT_HUMAN Heparan-alpha-glucosaminide N-acetyltransferase OS=Homo sapiens GN=HGSNAT PE=1 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query470
224125166468 predicted protein [Populus trichocarpa] 0.993 0.997 0.717 0.0
18421151472 heparan-alpha-glucosaminide N-acetyltran 1.0 0.995 0.716 0.0
449454063490 PREDICTED: heparan-alpha-glucosaminide N 0.997 0.957 0.725 0.0
297812935453 hypothetical protein ARALYDRAFT_489556 [ 0.959 0.995 0.711 0.0
359487632499 PREDICTED: LOW QUALITY PROTEIN: heparan- 0.976 0.919 0.722 0.0
242059773481 hypothetical protein SORBIDRAFT_03g04483 0.959 0.937 0.682 0.0
356572978464 PREDICTED: heparan-alpha-glucosaminide N 0.985 0.997 0.723 0.0
356504028465 PREDICTED: heparan-alpha-glucosaminide N 0.985 0.995 0.722 0.0
326493552486 predicted protein [Hordeum vulgare subsp 0.987 0.954 0.653 0.0
296089693481 unnamed protein product [Vitis vinifera] 0.938 0.916 0.698 0.0
>gi|224125166|ref|XP_002319516.1| predicted protein [Populus trichocarpa] gi|222857892|gb|EEE95439.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  714 bits (1843), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 338/471 (71%), Positives = 397/471 (84%), Gaps = 4/471 (0%)

Query: 1   MSEIKAETTHHHPLIISE-PDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDW 59
           M+EIKA+    H L I+E  D+S Q+    +   R+ASLDI+RGL VALMILVD AGG+W
Sbjct: 1   MAEIKADIALDHRLTIAEVTDISAQKPDPKI---RVASLDIYRGLTVALMILVDDAGGEW 57

Query: 60  PEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILL 119
           P+I HAPWNGCNLADFVMPFFLFIVG+AI LA KRI  R  AV++VI RTLKLLFWGI+L
Sbjct: 58  PKIGHAPWNGCNLADFVMPFFLFIVGMAIPLAFKRITSRHHAVRRVIVRTLKLLFWGIML 117

Query: 120 QGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFR 179
           QGGFSHAPD+LTYGVD++ IR CG+LQRIA +YL+V+L+EIFTK  Q ++   G  SI++
Sbjct: 118 QGGFSHAPDKLTYGVDMKKIRWCGILQRIAFAYLVVALMEIFTKKKQTRELPPGWLSIYK 177

Query: 180 LYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNA 239
           LY   WLM AC+LV+YLA++YGTYVP WQFT+ ++DSADYGKVF V C VR KL+PPCNA
Sbjct: 178 LYSSQWLMGACILVIYLAVIYGTYVPHWQFTVNDRDSADYGKVFTVECAVRGKLDPPCNA 237

Query: 240 VGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSI 299
           VG+IDR++LGINHMY HPAW+RS+ACT++SP+EGP R  APSWC APFEPEG+LSS+S++
Sbjct: 238 VGFIDREILGINHMYQHPAWKRSEACTENSPYEGPFRTSAPSWCKAPFEPEGILSSISAV 297

Query: 300 LSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCV 359
           LSTIIGVHFGHV+++ +GH ARLK W+ MGFALLI GL LHFT+AIPLNKQLYT SYVCV
Sbjct: 298 LSTIIGVHFGHVLVYMRGHAARLKHWIVMGFALLILGLVLHFTHAIPLNKQLYTFSYVCV 357

Query: 360 TSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPH 419
           TSGAAALVFS+IYALVDIW  K  F PLAWIGMNAMLVYVMAAEGIFAGFINGWYY DPH
Sbjct: 358 TSGAAALVFSSIYALVDIWGWKCIFQPLAWIGMNAMLVYVMAAEGIFAGFINGWYYNDPH 417

Query: 420 NTLPYWIKKHAFLGVWRSRKVSTILYVIFVEILFWGLVTGILHRFGIYWKL 470
           NTL YWI+KH F+GVW S++V  +LYVIF EILFWG+V GI HR GIYWKL
Sbjct: 418 NTLIYWIQKHIFIGVWHSQRVGILLYVIFAEILFWGMVAGIFHRLGIYWKL 468




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|18421151|ref|NP_568500.1| heparan-alpha-glucosaminide N-acetyltransferase [Arabidopsis thaliana] gi|14334592|gb|AAK59474.1| unknown protein [Arabidopsis thaliana] gi|26983902|gb|AAN86203.1| unknown protein [Arabidopsis thaliana] gi|332006336|gb|AED93719.1| heparan-alpha-glucosaminide N-acetyltransferase [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|449454063|ref|XP_004144775.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Cucumis sativus] gi|449490878|ref|XP_004158735.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|297812935|ref|XP_002874351.1| hypothetical protein ARALYDRAFT_489556 [Arabidopsis lyrata subsp. lyrata] gi|297320188|gb|EFH50610.1| hypothetical protein ARALYDRAFT_489556 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|359487632|ref|XP_003633626.1| PREDICTED: LOW QUALITY PROTEIN: heparan-alpha-glucosaminide N-acetyltransferase-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|242059773|ref|XP_002459032.1| hypothetical protein SORBIDRAFT_03g044830 [Sorghum bicolor] gi|241931007|gb|EES04152.1| hypothetical protein SORBIDRAFT_03g044830 [Sorghum bicolor] Back     alignment and taxonomy information
>gi|356572978|ref|XP_003554642.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Glycine max] Back     alignment and taxonomy information
>gi|356504028|ref|XP_003520801.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Glycine max] Back     alignment and taxonomy information
>gi|326493552|dbj|BAJ85237.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326511587|dbj|BAJ91938.1| predicted protein [Hordeum vulgare subsp. vulgare] Back     alignment and taxonomy information
>gi|296089693|emb|CBI39512.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query470
TAIR|locus:2180305472 AT5G27730 "AT5G27730" [Arabido 1.0 0.995 0.648 2.4e-172
TAIR|locus:2160902440 AT5G47900 "AT5G47900" [Arabido 0.829 0.886 0.443 2.3e-87
MGI|MGI:1196297656 Hgsnat "heparan-alpha-glucosam 0.482 0.346 0.301 4.1e-28
UNIPROTKB|F1MF45592 HGSNAT "Uncharacterized protei 0.448 0.356 0.309 2.4e-27
UNIPROTKB|Q68CP4663 HGSNAT "Heparan-alpha-glucosam 0.446 0.316 0.307 9.5e-27
UNIPROTKB|F1NBK1584 HGSNAT "Uncharacterized protei 0.457 0.368 0.316 1.6e-26
DICTYBASE|DDB_G0286315675 DDB_G0286315 "transmembrane pr 0.274 0.191 0.389 4e-23
DICTYBASE|DDB_G0270192426 DDB_G0270192 "DUF1624 family p 0.246 0.272 0.398 3.3e-22
UNIPROTKB|Q489U3358 CPS_0413 "Putative membrane pr 0.242 0.318 0.341 2.2e-18
TIGR_CMR|CPS_0413358 CPS_0413 "putative membrane pr 0.242 0.318 0.341 2.2e-18
TAIR|locus:2180305 AT5G27730 "AT5G27730" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1675 (594.7 bits), Expect = 2.4e-172, P = 2.4e-172
 Identities = 306/472 (64%), Positives = 356/472 (75%)

Query:     1 MSEIKAETTHHHPLIISEPDVSDQQEKSHL--KTQRLASLDIFRGLAVALMILVDHAGGD 58
             M+EIK E +H   L+  + D S    +  L     RLASLDIFRGL VALMILVD AGGD
Sbjct:     1 MAEIKVERSHDQHLLEPKEDTSSSYTRRSLAGNRPRLASLDIFRGLTVALMILVDDAGGD 60

Query:    59 WPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGIL 118
             WP I+HAPWNGCNLADFVMPFFLFIVGV+IAL+LKRI ++ +A KKV FRT KLLFWG+L
Sbjct:    61 WPMIAHAPWNGCNLADFVMPFFLFIVGVSIALSLKRISNKFEACKKVGFRTCKLLFWGLL 120

Query:   119 LQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIF 178
             LQGGFSHAPDELTYGVDV M+R CG+LQRIALSYL+V+LVEIFTKD  +++ S GRFSIF
Sbjct:   121 LQGGFSHAPDELTYGVDVTMMRFCGILQRIALSYLVVALVEIFTKDSHEENLSTGRFSIF 180

Query:   179 RLYCWHWLMXXXXXXXXXXXXXXXXXPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCN 238
             + Y WHW++                 PDW+F + +KDS  YGK+ +V+CGVR KLNPPCN
Sbjct:   181 KSYYWHWIVAASVLVIYLATLYGTYVPDWEFVVYDKDSVLYGKILSVSCGVRGKLNPPCN 240

Query:   239 AVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGXXXXXXX 298
             AVGY+DR+VLGINHMYHHPAWRRSKACT DSP+EG +R+DAPSWC APFEPEG       
Sbjct:   241 AVGYVDRQVLGINHMYHHPAWRRSKACTDDSPYEGAIRQDAPSWCRAPFEPEGILSSISA 300

Query:   299 XXXXXXXXXXXXXXXXTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVC 358
                              KGH ARLK W++ G  LL  GLTLHFT+ +PLNKQLY+ SY+C
Sbjct:   301 ILSTIIGVHFGHIILHLKGHSARLKHWISTGLVLLALGLTLHFTHLMPLNKQLYSFSYIC 360

Query:   359 VTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDP 418
             VTSGAAALVFS++Y+LVDI   K+ FLPL WIGMNAMLVYVM AEGI A F NGWYY  P
Sbjct:   361 VTSGAAALVFSSLYSLVDILEWKHMFLPLKWIGMNAMLVYVMGAEGILAAFFNGWYYRHP 420

Query:   419 HNTLPYWIKKHAFLGVWRSRKVSTILYVIFVEILFWGLVTGILHRFGIYWKL 470
             HNTL  WI++H F+ VW SR+V  ++YVIF EILFWGLVTG+ HRF IYWKL
Sbjct:   421 HNTLINWIREHVFIRVWHSRRVGVLMYVIFAEILFWGLVTGVFHRFKIYWKL 472




GO:0003674 "molecular_function" evidence=ND
GO:0005575 "cellular_component" evidence=ND
GO:0008150 "biological_process" evidence=ND
GO:0016556 "mRNA modification" evidence=RCA
TAIR|locus:2160902 AT5G47900 "AT5G47900" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
MGI|MGI:1196297 Hgsnat "heparan-alpha-glucosaminide N-acetyltransferase" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
UNIPROTKB|F1MF45 HGSNAT "Uncharacterized protein" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|Q68CP4 HGSNAT "Heparan-alpha-glucosaminide N-acetyltransferase" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|F1NBK1 HGSNAT "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
DICTYBASE|DDB_G0286315 DDB_G0286315 "transmembrane protein" [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms
DICTYBASE|DDB_G0270192 DDB_G0270192 "DUF1624 family protein" [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms
UNIPROTKB|Q489U3 CPS_0413 "Putative membrane protein" [Colwellia psychrerythraea 34H (taxid:167879)] Back     alignment and assigned GO terms
TIGR_CMR|CPS_0413 CPS_0413 "putative membrane protein" [Colwellia psychrerythraea 34H (taxid:167879)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
AT5G27730
unknown protein; unknown protein; FUNCTIONS IN- molecular_function unknown; INVOLVED IN- biological_process unknown; LOCATED IN- cellular_component unknown; EXPRESSED IN- 22 plant structures; EXPRESSED DURING- 14 growth stages; BEST Arabidopsis thaliana protein match is- unknown protein (TAIR-AT5G47900.1); Has 651 Blast hits to 616 proteins in 129 species- Archae - 0; Bacteria - 235; Metazoa - 100; Fungi - 0; Plants - 60; Viruses - 0; Other Eukaryotes - 256 (source- NCBI BLink). (472 aa)
(Arabidopsis thaliana)
Predicted Functional Partners:
AT5G13690
alpha-N-acetylglucosaminidase family / NAGLU family; alpha-N-acetylglucosaminidase family / NAG [...] (806 aa)
     0.927

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query470
COG4299371 COG4299, COG4299, Uncharacterized protein conserve 4e-36
pfam07786222 pfam07786, DUF1624, Protein of unknown function (D 0.004
>gnl|CDD|226749 COG4299, COG4299, Uncharacterized protein conserved in bacteria [Function unknown] Back     alignment and domain information
 Score =  136 bits (345), Expect = 4e-36
 Identities = 118/442 (26%), Positives = 175/442 (39%), Gaps = 82/442 (18%)

Query: 34  RLASLDIFRGLAVALMILVDHAG-GD--WPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
           RL SLD+FRGL V LMILV++AG GD  + ++SHA W G  L D V P+FLF VG A+  
Sbjct: 7   RLTSLDVFRGLTVLLMILVNNAGLGDSTYRQLSHAHWGGLTLTDLVFPWFLFCVGAAMPF 66

Query: 91  ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD-VRMIRLCGVLQRIA 149
           +  +  ++A+     ++R     F    L G F        + V    + R  GVLQRIA
Sbjct: 67  SASK-MNKANVTTWPLYRRAAERFALGYLMGAF---VTVRDWSVTSHSLTRGMGVLQRIA 122

Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
           L+YL  +L+           Q  GR        W  L+AA +L  Y   L  T  P    
Sbjct: 123 LAYLFAALLVR---------QLRGR--------WQALLAAVLLAGYWLFLMFTPHPAAPL 165

Query: 210 TIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
             I                         N     D   +  +H+Y               
Sbjct: 166 GGIG------------------------NVGESADPLQILNDHLY--------------- 186

Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMG 329
                   D        F+PEGLLS+V + +  + G      +    G+         +G
Sbjct: 187 ------SAD------GGFDPEGLLSTVPTTVLVLAGYLAARPLQQKPGNPRAPLLLAGLG 234

Query: 330 FALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAW 389
             L   G    +    P++K+L+T SYV  T+G   L+ +A + L +    K    P   
Sbjct: 235 VVLTALGYG--WAGRFPISKKLWTSSYVLYTAGLGLLLLAACWVLAESPGGKRLLAPFTI 292

Query: 390 IGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLPY-WIKKHAFLGVWRSRKVSTILYVIF 448
            G+NA+ +YV+ +  I    +  W  G+   +    W   + F          ++LY + 
Sbjct: 293 PGLNALALYVL-SILIKVWLLLDWGVGETAPSQSIAWSLLNMFRS-SFGPVGGSLLYAL- 349

Query: 449 VEILFWGLVTGILHRFGIYWKL 470
             +L   L    + R GI WKL
Sbjct: 350 GYVLAVWLGLAWMARRGIIWKL 371


Length = 371

>gnl|CDD|219576 pfam07786, DUF1624, Protein of unknown function (DUF1624) Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 470
KOG4683549 consensus Uncharacterized conserved protein [Funct 100.0
COG4299371 Uncharacterized protein conserved in bacteria [Fun 100.0
COG2311394 Predicted membrane protein [Function unknown] 99.89
PF07786223 DUF1624: Protein of unknown function (DUF1624); In 99.8
PRK10835373 hypothetical protein; Provisional 99.77
COG3503323 Predicted membrane protein [Function unknown] 99.58
PF10129358 OpgC_C: OpgC protein; InterPro: IPR014550 There is 99.08
PF01757340 Acyl_transf_3: Acyltransferase family; InterPro: I 98.51
PF04235163 DUF418: Protein of unknown function (DUF418); Inte 98.33
COG4645410 Uncharacterized protein conserved in bacteria [Fun 98.17
PRK03854375 opgC glucans biosynthesis protein; Provisional 98.11
PF06423136 GWT1: GWT1; InterPro: IPR009447 Glycosylphosphatid 97.72
COG3274332 Predicted O-acyltransferase [General function pred 96.94
COG5062429 Uncharacterized membrane protein [Function unknown 94.44
COG3594343 NolL Fucose 4-O-acetylase and related acetyltransf 92.23
COG1835386 Predicted acyltransferases [Lipid metabolism] 88.48
>KOG4683 consensus Uncharacterized conserved protein [Function unknown] Back     alignment and domain information
Probab=100.00  E-value=6e-72  Score=548.49  Aligned_cols=439  Identities=38%  Similarity=0.671  Sum_probs=375.6

Q ss_pred             CcccccccccCCCccccCCCcch--hhhccccccchhHHHHHHHHHHHHHHHHHhcCCCCccccccCcccchhhHHHHHH
Q 012127            1 MSEIKAETTHHHPLIISEPDVSD--QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMP   78 (470)
Q Consensus         1 ~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~Ri~slD~lRGlai~~Milvn~~~~~~~~l~h~~w~G~~~~DlvfP   78 (470)
                      |.|||.|..|.|-+--...|.|.  ..++.++.++|+.|+|++||+++++||+||+.|+.||+.+|++|||++++|.|+|
T Consensus       108 ~~~ik~~~~~d~~~~E~k~~~ss~~~~rsla~~r~RL~SLD~FRGltValMIlVdd~GG~~p~I~HapWnG~~LADfVmP  187 (549)
T KOG4683|consen  108 ALKIKSCAWRDYRYDEAKAAASSIGEARSLATQRKRLRSLDTFRGLTVALMILVDDGGGGYPWIEHAPWNGLHLADFVMP  187 (549)
T ss_pred             HHHHhhccchhhhhccchhhhhhhhhhhhcCCCchhhhhhhhhcCceEEEEEEEecCCCCchhhhcCCcCCccHHHHHHH
Confidence            35889998887776433333332  2344556678999999999999999999999999999999999999999999999


Q ss_pred             HHHHHHHHHHHHHhccCCchhHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccchhhhchHHHHHHHHHHHHHHH
Q 012127           79 FFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV  158 (470)
Q Consensus        79 ~Flfl~G~s~~l~~~r~~~~~~~~~~~~~R~~~L~~lG~~~~~~~~~~~~~~~~~~~~~~~r~~gVLq~I~~~y~v~all  158 (470)
                      +|+|++|+|+++++++...|.+..+|.--|..+|++.|++++..|.+.++++++|.|.+++|++|||||+|++|++.+++
T Consensus       188 fFLfIvGVsials~K~~s~rf~a~rKa~~R~cklllwgLflqGgf~h~~~nLTygidve~lR~mGILQr~~~ayLVvAi~  267 (549)
T KOG4683|consen  188 FFLFIVGVSIALSVKSQSSRFSATRKAKARICKLLLWGLFLQGGFLHSMSNLTYGIDVEQLRIMGILQRFGVAYLVVAIL  267 (549)
T ss_pred             HHHHHHHhhhhhhhhhhhhhhhHhHHHHHHHHHHHHHHHHHhhhcccCcccccCCccHHHHHHHHHHHHhhHHHHHHHHH
Confidence            99999999999999998889999999999999999999999999999999999999999999999999999999999988


Q ss_pred             HHHcccCCCcccccchhhhhhhHHHHHHHHHHHHHHHHHHHHhccCCCccccccCCCCCCcCcccceeccccCCCC-CCC
Q 012127          159 EIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLN-PPC  237 (470)
Q Consensus       159 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lL~~~~~l~~~~~vp~~~~~~~~~~~~~~g~~~~v~~~~~g~l~-~~~  237 (470)
                      .....++  .+.+++         +|-++..+.+..+.+..++..||+|.....+...+.|||-+.+.||.++... |.|
T Consensus       268 ~~~~~~~--~~~~~S---------~~R~V~~~~L~~~~~~~~~~~V~~~~~~~~~~~~~~~~r~~~~~~G~~~~~~~P~C  336 (549)
T KOG4683|consen  268 HTLCCRP--ISPQRS---------WQRAVHDVCLFSGELAVLLALVATYLGLTFGLRVPGCPRGYLGPGGKHDYNAHPKC  336 (549)
T ss_pred             hhhccCC--Cccccc---------hhhhhhHHHHHHHHHHHHHHhhhhhhceecccccCCCCcccccCCcccccCCCCCc
Confidence            7776552  222221         3344555666666666666777877665555555567777777777777665 469


Q ss_pred             chhHHHHHHhhcccccccCCcccccccccCCCCCCCCCCCCCCCCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHhcccc
Q 012127          238 NAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG  317 (470)
Q Consensus       238 n~~~~id~~llg~~h~y~~~~~~~~~~~~~~~~~~g~~r~~~~~~~~~~~dpeGll~tl~~i~~~LlG~~aG~~l~~~~~  317 (470)
                      |+++|.||.++|++|+|++|+|++.++|+.+.|.+|++|.++++||..+|||||+++++.++..+++|+.+|+++.+.++
T Consensus       337 nAvGy~DrqvLGi~HiY~hP~~~r~k~cs~n~P~nG~l~~DAPSWCqapFdPEGilssi~avv~~llG~h~Ghiilh~k~  416 (549)
T KOG4683|consen  337 NAVGYADRQVLGIAHIYQHPTAKRVKDCSINYPNNGPLPPDAPSWCQAPFDPEGILSSILAVVQVLLGAHAGHIILHHKN  416 (549)
T ss_pred             cchhhhHHhhhhhHHHhcCchHHHhhhcccCCCCCCCCCCCCchhhcCCCChHHHHHHHHHHHHHHHHhhcCeEEEEccc
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999998877888


Q ss_pred             hHHHHHHHHHHHHHHHHHHHHHhhhcCccccCCCChHHHHHHHHHHHHHHHHHHHHHHhhcCcccchhHhhHhcHHHHHH
Q 012127          318 HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFLPLAWIGMNAMLV  397 (470)
Q Consensus       318 ~~~~~~~l~~~G~~ll~lGl~l~~~~~~pi~K~lwT~S~vl~t~G~a~l~La~~y~l~d~~~~~~~~~pf~~~G~naL~~  397 (470)
                      +..|+++|...++.+.++|..++.....|.||++|+.||+++|+|.+.++++.+|+++|++.|+.-..||+..|||++.+
T Consensus       417 ~~sRir~wis~~~~l~llg~tL~~~s~~Plnk~L~slsfvCVT~~~A~Li~S~mY~~iDv~EW~~~~~P~~~~GMNAi~~  496 (549)
T KOG4683|consen  417 FQSRIRRWISLAILLGLLGGTLCGFSAIPLNKNLWSLSFVCVTVSLALLILSLMYYFIDVREWSWSGYPFTECGMNAIVM  496 (549)
T ss_pred             hHHHHHHHHHHHHHHHHHhhhhhcccccchhHhHHHhhhhHHHHHHHHHHHHHHHHHhhHHHhhhccCChhhhccchhHH
Confidence            89999999999999888898887655579999999999999999999999999999999999887889999999999999


Q ss_pred             HHHhhhhHHHHhhhhcccCCCCCChHHHHHHHhhhhhhccCchhHHHHHHHHHHHHHHHHHHHHHhccceEeC
Q 012127          398 YVMAAEGIFAGFINGWYYGDPHNTLPYWIKKHAFLGVWRSRKVSTILYVIFVEILFWGLVTGILHRFGIYWKL  470 (470)
Q Consensus       398 Yl~~~~~ii~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~l~~~~~~~~~~w~~i~~~l~rkki~ikl  470 (470)
                      |+.+  +++..+++ |+|+.++++.      |.+.-.+.+-           -+++|.+++.+++|.++|+|+
T Consensus       497 YV~~--~vL~~~~~-W~~R~~~~~~------H~~l~~~~t~-----------~~L~W~~i~~~~~~~~~Y~~~  549 (549)
T KOG4683|consen  497 YVGH--SVLHKMLP-WHWRIGEMNT------HFMLLLEATW-----------NTLVWVGIALYLDAQEFYYSV  549 (549)
T ss_pred             HHhH--HHHHHhcc-hhhccCCCce------eEEEeeehhh-----------hhhhhhhhheeeeheeeEecC
Confidence            9999  99998887 9999988765      2222233221           234699999999999999986



>COG4299 Uncharacterized protein conserved in bacteria [Function unknown] Back     alignment and domain information
>COG2311 Predicted membrane protein [Function unknown] Back     alignment and domain information
>PF07786 DUF1624: Protein of unknown function (DUF1624); InterPro: IPR012429 These sequences are found in hypothetical proteins of unknown function expressed by bacterial and archaeal species Back     alignment and domain information
>PRK10835 hypothetical protein; Provisional Back     alignment and domain information
>COG3503 Predicted membrane protein [Function unknown] Back     alignment and domain information
>PF10129 OpgC_C: OpgC protein; InterPro: IPR014550 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function Back     alignment and domain information
>PF01757 Acyl_transf_3: Acyltransferase family; InterPro: IPR002656 This entry contains a range of acyltransferase enzymes as well as yet uncharacterised proteins from Caenorhabditis elegans Back     alignment and domain information
>PF04235 DUF418: Protein of unknown function (DUF418); InterPro: IPR007349 Tihs is a probable integral membrane protein Back     alignment and domain information
>COG4645 Uncharacterized protein conserved in bacteria [Function unknown] Back     alignment and domain information
>PRK03854 opgC glucans biosynthesis protein; Provisional Back     alignment and domain information
>PF06423 GWT1: GWT1; InterPro: IPR009447 Glycosylphosphatidylinositol (GPI) is a conserved post-translational modification to anchor cell surface proteins to plasma membrane in eukaryotes Back     alignment and domain information
>COG3274 Predicted O-acyltransferase [General function prediction only] Back     alignment and domain information
>COG5062 Uncharacterized membrane protein [Function unknown] Back     alignment and domain information
>COG3594 NolL Fucose 4-O-acetylase and related acetyltransferases [Carbohydrate transport and metabolism] Back     alignment and domain information
>COG1835 Predicted acyltransferases [Lipid metabolism] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00