Citrus Sinensis ID: 018810


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350
MKRKVVYQQQQKFNYKWKRKVFAAILLGFCFGSLVLMQCQYTRIMSLRPRFVQKPKIAFLFIARNRLPLEMVWDKFFKGEESRFSIYVHSRPGFLFSKGTTRSIYFLDRQVNDSIQVDWGGASMIEAERILLRHALADPFNDRFVFLSDSFADTKEGRYNPKMAPVIPVHNWRKGSQWAVLTRKHAEIVVNDTTVFPMFQQHCKRKSLPEFWREHSFPADPSKEHNCIPDEHYVQTLLAQEGLEGELTRRSLTYSSWDLSSSKDHERRGWHPATYKYADATPLLIQSIKEIDNIYYETEHRREWCSDKGKPSSCFLFARKFTRPAALRLLTMSVLGANSNEAIRESKHAK
ccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccEEEEEEcccccccHHHHHHHHccccccEEEEEEccccccccccccccccccccEEccccccccccccHHHHHHHHHHHHHccccccEEEEEEcccccccccccccccccccccccccccccEEEEEcccEEEEEccccccHHHHHccccccccccccccccccccccccccccccHHHHHHHHHccccccccccEEEEEccccccccccccccccccccccccccHHHHHHHHcccccccccccccccccccccccccccccccccHHHHHHHHHHcccccccHHHHHHHcccc
ccccEEEcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccEEEEEEEccccccHHHHHHHHHccccccEEEEEEccccccccccccccccEcccccccccEEEcccccHHHHHHHHHHHHHccccccEEEEEEccccccccccccccccccccHHHHccccEEEEEEHHHHHHHHHcHcHHHHHHHHcccccccccccccccccHHccccccccccccccEEEEEccccccccccEEEEEEcccccccccccccccccccccccccHHHHHHHHccccEEEEccccccEcccccccccHEHHHHHccHHHHHHHHHHHccccccccHHHHccccc
MKRKVVYQQQQKFNYKWKRKVFAAILLGFCFGSLVLMQCQYTrimslrprfvqkpkIAFLFIARNRLPLEMVWDkffkgeesrfsiyvhsrpgflfskgttrSIYFLDRQvndsiqvdwggaSMIEAERILLRHAladpfndrfvflsdsfadtkegrynpkmapvipvhnwrkgsqwAVLTRKHAEIVVndttvfpmfqqhckrkslpefwrehsfpadpskehncipdehYVQTLLAQeglegeltrrsltysswdlssskdherrgwhpatykyadatpLLIQSIKEIDNIYyetehrrewcsdkgkpsscflfaRKFTRPAALRLLTMSVLGANSNEAIRESKHAK
mkrkvvyqqqqkfnykwkRKVFAAILLGFCFGSLVLMQCQYTRIMSLRPRFVQKPKIAFLFIARNRLPLEMVWDKFFKGEESRFSIYVHSRPGFLFSKGTTRSIYFLDRQVNDSIQVDWGGASMIEAERILLRHALADPFNDRFVFLSDSfadtkegrynpkmapvipvhnwRKGSQWAVLTRKHAEIVVNDTTVFPMFQQHCKRKSLPEFWREHSFPADPSKEHNCIPDEHYVQTLLAQEGLEGELTRRSLTysswdlssskdherrgwhpatykyadatplLIQSIKEIDNIYYETEHrrewcsdkgkpSSCFLFARKFTRPAALRLLTmsvlgansneaireskhak
MKRKVVYQQQQKFNYKWKRKVFAAILLGFCFGSLVLMQCQYTRIMSLRPRFVQKPKIAFLFIARNRLPLEMVWDKFFKGEESRFSIYVHSRPGFLFSKGTTRSIYFLDRQVNDSIQVDWGGASMIEAERILLRHALADPFNDRFVFLSDSFADTKEGRYNPKMAPVIPVHNWRKGSQWAVLTRKHAEIVVNDTTVFPMFQQHCKRKSLPEFWREHSFPADPSKEHNCIPDEHYVQTLLAQEGLEGELTRRsltysswdlsssKDHERRGWHPATYKYADATPLLIQSIKEIDNIYYETEHRREWCSDKGKPSSCFLFARKFTRPAALRLLTMSVLGANSNEAIRESKHAK
*****VYQQQQKFNYKWKRKVFAAILLGFCFGSLVLMQCQYTRIMSLRPRFVQKPKIAFLFIARNRLPLEMVWDKFFKGEESRFSIYVHSRPGFLFSKGTTRSIYFLDRQVNDSIQVDWGGASMIEAERILLRHALADPFNDRFVFLSDSFADTKEGRYNPKMAPVIPVHNWRKGSQWAVLTRKHAEIVVNDTTVFPMFQQHCKRKSLPEFWREH**********NCIPDEHYVQTLLAQEGLEGELTRRSLTYSSW**********RGWHPATYKYADATPLLIQSIKEIDNIYYETEHRREWCSDKGKPSSCFLFARKFTRPAALRLLTMSVLG**************
*************NYKWKRKVFAAILLGFCFGSLVLMQCQY***************IAFLFIARNRLPLEMVWDKFFKGEESRFSIYVHSRPGFLFSKGTTRSIYFLDRQVND****DWGGASMIEAERILLRHALADPFNDRFVFLSDSFADTKEGRYNPKMAPVIPVHNWRKGSQWAVLTRKHAEIVVNDTTVFPMFQQHCKRKSLPEFWREHSFPADPSKEHNCIPDEHYVQTLLAQEGLEGELTRRSLTYSSWDLSSSK****RGWHPATYKYADATPLLIQSIKEIDNIYYETEHRREWCSDKGKPSSCFLFARKFTRPAALRLLTMSVLGANSN**********
MKRKVVYQQQQKFNYKWKRKVFAAILLGFCFGSLVLMQCQYTRIMSLRPRFVQKPKIAFLFIARNRLPLEMVWDKFFKGEESRFSIYVHSRPGFLFSKGTTRSIYFLDRQVNDSIQVDWGGASMIEAERILLRHALADPFNDRFVFLSDSFADTKEGRYNPKMAPVIPVHNWRKGSQWAVLTRKHAEIVVNDTTVFPMFQQHCKRKSLPEFWREHSFPADPSKEHNCIPDEHYVQTLLAQEGLEGELTRRSLTYSSWD*********RGWHPATYKYADATPLLIQSIKEIDNIYYETEHRREWCSDKGKPSSCFLFARKFTRPAALRLLTMSVLGANSNE*********
**RKVVYQQQQKFNYKWKRKVFAAILLGFCFGSLVLMQCQYTRIMSLRPRFVQKPKIAFLFIARNRLPLEMVWDKFFKGEESRFSIYVHSRPGFLFSKGTTRSIYFLDRQVNDSIQVDWGGASMIEAERILLRHALADPFNDRFVFLSDSFADTKEGRYNPKMAPVIPVHNWRKGSQWAVLTRKHAEIVVNDTTVFPMFQQHCKRKSLPEFWREHSFPADPSKEHNCIPDEHYVQTLLAQEGLEGELTRRSLTYSSWDLS********GWHPATYKYADATPLLIQSIKEIDNIYYETEHRREWCSDKGKPSSCFLFARKFTRPAALRLLTMSVLGANS***********
iiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MKRKVVYQQQQKFNYKWKRKVFAAILLGFCFGSLVLMQCQYTRIMSLRPRFVQKPKIAFLFIARNRLPLEMVWDKFFKGEESRFSIYVHSRPGFLFSKGTTRSIYFLDRQVNDSIQVDWGGASMIEAERILLRHALADPFNDRFVFLSDSFADTKEGRYNPKMAPVIPVHNWRKGSQWAVLTRKHAEIVVNDTTVFPMFQQHCKRKSLPEFWREHSFPADPSKEHNCIPDEHYVQTLLAQEGLEGELTRRSLTYSSWDLSSSKDHERRGWHPATYKYADATPLLIQSIKEIDNIYYETEHRREWCSDKGKPSSCFLFARKFTRPAALRLLTMSVLGANSNEAIRESKHAK
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query350
225452380367 PREDICTED: uncharacterized protein LOC10 0.957 0.912 0.730 1e-154
449450078366 PREDICTED: uncharacterized protein LOC10 0.954 0.912 0.713 1e-148
356511696364 PREDICTED: uncharacterized protein LOC10 0.942 0.906 0.685 1e-146
297811585369 hypothetical protein ARALYDRAFT_909422 [ 0.948 0.899 0.684 1e-144
224060092358 predicted protein [Populus trichocarpa] 0.825 0.807 0.786 1e-144
356573337365 PREDICTED: uncharacterized protein LOC10 0.945 0.906 0.677 1e-143
42567851377 Core-2/I-branching beta-1,6-N-acetylgluc 0.948 0.880 0.663 1e-142
51968788377 putative protein [Arabidopsis thaliana] 0.948 0.880 0.660 1e-141
296087625324 unnamed protein product [Vitis vinifera] 0.834 0.901 0.753 1e-139
357520047363 BC10 protein [Medicago truncatula] gi|35 0.9 0.867 0.693 1e-137
>gi|225452380|ref|XP_002275670.1| PREDICTED: uncharacterized protein LOC100259507 [Vitis vinifera] Back     alignment and taxonomy information
 Score =  550 bits (1418), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 265/363 (73%), Positives = 299/363 (82%), Gaps = 28/363 (7%)

Query: 8   QQQQKFNYKWKRKVFAAILLGFCFGSLVLMQCQYTRI---MSLRPRFVQKPKIAFLFIAR 64
           +++QK  YKWKR +FA +LLGFCFGSLVL+Q QY+RI    S+   F+Q+PKIAFLFIAR
Sbjct: 2   KRKQKSQYKWKRNLFAMLLLGFCFGSLVLLQTQYSRIRMFASMPSPFLQRPKIAFLFIAR 61

Query: 65  NRLPLEMVWDKFFKGE-ESRFSIYVHSRPGFLFSKGTTRSIYFLDRQVNDSIQVDWGGAS 123
           NRLPL++VWD FF+ E E++FSI+VHSRPGFL +K TTRS+YFL+RQ+NDSIQVDWG AS
Sbjct: 62  NRLPLDVVWDAFFRDEKENKFSIFVHSRPGFLLNKATTRSVYFLNRQLNDSIQVDWGEAS 121

Query: 124 MIEAERILLRHALADPFNDRFVFLSDS------------------------FADTKEGRY 159
           MI+AERILLR AL DP N+RFVFLSDS                        FADTKEGRY
Sbjct: 122 MIQAERILLRSALLDPLNERFVFLSDSCIPLYNFSYIYDYIMSTSTSFVDSFADTKEGRY 181

Query: 160 NPKMAPVIPVHNWRKGSQWAVLTRKHAEIVVNDTTVFPMFQQHCKRKSLPEFWREHSFPA 219
           NPKM PVIPVHNWRKGSQW VLTRKHA+IVV D TVFPMFQQHCKRKSLPEFWR+H  PA
Sbjct: 182 NPKMDPVIPVHNWRKGSQWVVLTRKHAQIVVEDDTVFPMFQQHCKRKSLPEFWRDHPLPA 241

Query: 220 DPSKEHNCIPDEHYVQTLLAQEGLEGELTRRSLTYSSWDLSSSKDHERRGWHPATYKYAD 279
           D SKEHNCIPDEHYVQTLLAQEG E E+TRRSLT+SSWDLSSSKD ER+GWHP TYK++D
Sbjct: 242 DASKEHNCIPDEHYVQTLLAQEGFEEEITRRSLTHSSWDLSSSKDRERKGWHPLTYKFSD 301

Query: 280 ATPLLIQSIKEIDNIYYETEHRREWCSDKGKPSSCFLFARKFTRPAALRLLTMSVLGANS 339
           ATP+LIQSIK+IDNIYYETE+RREWC+ KGKP+ CFLFARKFTRPAALRLL MS LGA  
Sbjct: 302 ATPMLIQSIKDIDNIYYETEYRREWCTSKGKPAPCFLFARKFTRPAALRLLNMSALGAFH 361

Query: 340 NEA 342
            E 
Sbjct: 362 GET 364




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|449450078|ref|XP_004142791.1| PREDICTED: uncharacterized protein LOC101222566 [Cucumis sativus] gi|449483780|ref|XP_004156689.1| PREDICTED: uncharacterized LOC101222566 [Cucumis sativus] Back     alignment and taxonomy information
>gi|356511696|ref|XP_003524559.1| PREDICTED: uncharacterized protein LOC100791308 [Glycine max] Back     alignment and taxonomy information
>gi|297811585|ref|XP_002873676.1| hypothetical protein ARALYDRAFT_909422 [Arabidopsis lyrata subsp. lyrata] gi|297319513|gb|EFH49935.1| hypothetical protein ARALYDRAFT_909422 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|224060092|ref|XP_002300039.1| predicted protein [Populus trichocarpa] gi|222847297|gb|EEE84844.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|356573337|ref|XP_003554818.1| PREDICTED: uncharacterized protein LOC100776366 [Glycine max] Back     alignment and taxonomy information
>gi|42567851|ref|NP_196959.2| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein [Arabidopsis thaliana] gi|209863160|gb|ACI88738.1| At5g14550 [Arabidopsis thaliana] gi|332004663|gb|AED92046.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|51968788|dbj|BAD43086.1| putative protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|296087625|emb|CBI34881.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|357520047|ref|XP_003630312.1| BC10 protein [Medicago truncatula] gi|355524334|gb|AET04788.1| BC10 protein [Medicago truncatula] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query350
TAIR|locus:2222637377 AT5G14550 [Arabidopsis thalian 0.534 0.496 0.737 3.4e-128
TAIR|locus:2008890383 AT1G11940 [Arabidopsis thalian 0.548 0.501 0.476 9.5e-80
TAIR|locus:504956294378 AT1G62305 [Arabidopsis thalian 0.522 0.484 0.513 3.6e-76
TAIR|locus:2012748412 AT1G10280 [Arabidopsis thalian 0.265 0.225 0.458 2.7e-34
TAIR|locus:2181442411 AT5G16170 [Arabidopsis thalian 0.262 0.223 0.484 1.2e-32
TAIR|locus:2202324408 AT1G68390 [Arabidopsis thalian 0.277 0.237 0.44 1.9e-32
TAIR|locus:2027769418 AT1G73810 [Arabidopsis thalian 0.268 0.224 0.473 2.5e-32
TAIR|locus:2165605388 AT5G57270 [Arabidopsis thalian 0.291 0.262 0.485 1.1e-28
TAIR|locus:2094781383 AT3G21310 [Arabidopsis thalian 0.291 0.266 0.418 1.4e-28
TAIR|locus:2181945386 AT5G11730 [Arabidopsis thalian 0.277 0.251 0.39 6.2e-28
TAIR|locus:2222637 AT5G14550 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 768 (275.4 bits), Expect = 3.4e-128, Sum P(2) = 3.4e-128
 Identities = 138/187 (73%), Positives = 155/187 (82%)

Query:   149 DSFADTKEGRYNPKMAPVIPVHNWRKGSQWAVLTRKHAEIVVNDTTVFPMFQQHCKRKSL 208
             DSFADTK+ RYNP+M P+IPV NWRKGSQW VL RKHAEIVVNDT+VFPMFQQHC+RKSL
Sbjct:   182 DSFADTKDSRYNPRMNPIIPVRNWRKGSQWVVLNRKHAEIVVNDTSVFPMFQQHCRRKSL 241

Query:   209 PEFWREHSFPADPSKEHNCIPDEHYVQTLLAQEGLEGELTRRXXXXXXXXXXXXKDHERR 268
             PEFWR+   PA+  KEHNCIPDEHYVQTLL+Q+G++ ELTRR            K +ERR
Sbjct:   242 PEFWRDRPVPAEGWKEHNCIPDEHYVQTLLSQKGVDSELTRRSLTHSAWDLSSSKSNERR 301

Query:   269 GWHPATYKYADATPLLIQSIKEIDNIYYETEHRREWCSDKGKPSSCFLFARKFTRPAALR 328
             GWHP TYK++DATP LIQSIK IDNI YETE+RREWCS KGKPS CFLFARKFTRPAALR
Sbjct:   302 GWHPMTYKFSDATPDLIQSIKGIDNINYETEYRREWCSSKGKPSPCFLFARKFTRPAALR 361

Query:   329 LLTMSVL 335
             LL  ++L
Sbjct:   362 LLRETIL 368


GO:0005739 "mitochondrion" evidence=ISM
GO:0016020 "membrane" evidence=IEA
GO:0007623 "circadian rhythm" evidence=RCA
GO:0016757 "transferase activity, transferring glycosyl groups" evidence=ISM
TAIR|locus:2008890 AT1G11940 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:504956294 AT1G62305 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2012748 AT1G10280 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2181442 AT5G16170 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2202324 AT1G68390 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2027769 AT1G73810 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2165605 AT5G57270 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2094781 AT3G21310 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2181945 AT5G11730 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query350
pfam02485242 pfam02485, Branch, Core-2/I-Branching enzyme 6e-59
>gnl|CDD|217062 pfam02485, Branch, Core-2/I-Branching enzyme Back     alignment and domain information
 Score =  190 bits (484), Expect = 6e-59
 Identities = 89/271 (32%), Positives = 121/271 (44%), Gaps = 67/271 (24%)

Query: 57  IAFLFIA-RNRLPLEMVWDKFFKGEESRFSIYVHSR-PGFLFSKGTTR-SIYFLDRQVND 113
           IAF+F+  +  LP   +W +FF   ++ + IYV ++ P     +     S +F  R +  
Sbjct: 1   IAFMFLVYKGDLPFLPLWLRFFYHPQNLYCIYVDAKSPSEFKERLRALASCFFNVRVIPK 60

Query: 114 SIQVDWGGASMIEAERILLRHALA-DPFNDRFVFLS------------------------ 148
           S  V WGG SM++AER LL + L  DP  D FV LS                        
Sbjct: 61  SEPVTWGGHSMVDAERRLLANLLLLDPSWDYFVLLSESDIPLKTFDELYRYLSSLNGNNS 120

Query: 149 --DSFADTK---EGRYNPKMAPVIPVHN----WRKGSQWAVLTRKHAEIVVNDTTVFPMF 199
             +SF D      GRYNP+M P I + +    WRKGSQW VL+R  AE VV+D   +P+F
Sbjct: 121 FVESFDDPGWKGRGRYNPRMLPEIKLSDLPLKWRKGSQWFVLSRAFAEYVVSDNLYYPLF 180

Query: 200 QQHCKRKSLPEFWREHSFPADPSKEHNCIPDEHYVQTLLAQEG-LEGELTRRSLTYSSWD 258
           + +C                     + C PDEHY  TLL   G        R+LTY  W 
Sbjct: 181 KYYC---------------------NTCYPDEHYFPTLLGMPGHFPNTCVNRTLTYVDWS 219

Query: 259 LSSSKDHERRGWHPATYKYADATPLLIQSIK 289
                   R G HP TY+  D  P  ++ I+
Sbjct: 220 --------RGGCHPKTYRPCDIGPEDLKRIR 242


This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme . I-branching enzyme is responsible for the production of the blood group I-antigen during embryonic development. Core-2 branching enzyme forms crucial side-chain branches in O-glycans. Length = 242

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 350
PLN03183421 acetylglucosaminyltransferase family protein; Prov 100.0
PF02485244 Branch: Core-2/I-Branching enzyme; InterPro: IPR00 100.0
KOG0799439 consensus Branching enzyme [Carbohydrate transport 99.76
>PLN03183 acetylglucosaminyltransferase family protein; Provisional Back     alignment and domain information
Probab=100.00  E-value=1.6e-41  Score=340.38  Aligned_cols=213  Identities=22%  Similarity=0.262  Sum_probs=164.3

Q ss_pred             CCCCcEEEEEEeC-CC-CchHHHHHHHhccCCCceeEEEEeCCCCcccC-------------CCCCcceeeccccCCcee
Q 018810           52 VQKPKIAFLFIAR-NR-LPLEMVWDKFFKGEESRFSIYVHSRPGFLFSK-------------GTTRSIYFLDRQVNDSIQ  116 (350)
Q Consensus        52 ~~~~KiAfLiLAh-~~-~~l~~Lw~~ff~l~~~~~sIYIHvD~k~~~~~-------------~~~~s~vf~~r~i~~Rv~  116 (350)
                      ..+|||||||++| ++ .++++|+.+   ++++++.+|||+|++++..+             ....++|++   +.++..
T Consensus        75 ~~~~r~AYLI~~h~~d~~~l~RLL~a---LYhprN~y~IHlDkKS~~~er~~l~~~v~~~~~~~~~~NV~v---l~k~~~  148 (421)
T PLN03183         75 DKLPRFAYLVSGSKGDLEKLWRTLRA---LYHPRNQYVVHLDLESPAEERLELASRVENDPMFSKVGNVYM---ITKANL  148 (421)
T ss_pred             CCCCeEEEEEEecCCcHHHHHHHHHH---hcCCCceEEEEecCCCChHHHHHHHHHhhccchhhccCcEEE---Eeccee
Confidence            3489999999999 56 678999999   98999999999999975321             012345655   356788


Q ss_pred             eecCcccHHHHHHHHHHHHhc-CCCCCEEEEeccc----------------------ccCCC-C------CCCCCC----
Q 018810          117 VDWGGASMIEAERILLRHALA-DPFNDRFVFLSDS----------------------FADTK-E------GRYNPK----  162 (350)
Q Consensus       117 V~WGg~SlVeAel~LLr~AL~-d~~~~~fvLLSGs----------------------F~e~~-~------~RY~~~----  162 (350)
                      |.|||+|||+||+.+++.+|+ ..+++|||+|||+                      |++.. .      .|+.+.    
T Consensus       149 V~WGG~S~V~AtL~~m~~LL~~~~~WDyfinLSGsDyPLkTqdelI~~F~~~nr~~NFI~~~s~~~wk~~~r~~~~i~~p  228 (421)
T PLN03183        149 VTYRGPTMVANTLHACAILLKRSKDWDWFINLSASDYPLVTQDDLIHTFSTLDRNLNFIEHTSQLGWKEEKRAMPLIIDP  228 (421)
T ss_pred             eccCChHHHHHHHHHHHHHHhhCCCCCEEEEccCCcccccCHHHHHHHHHhCCCCceeeecccccccchhhhcceEEecC
Confidence            999999999999999999997 6789999999993                      22210 0      111110    


Q ss_pred             ---C-----------CCCCCc-cccccccceeeecHHHHHHhhcccccchH-HHHhhhccCCccccccCCCCCCCCCCCC
Q 018810          163 ---M-----------APVIPV-HNWRKGSQWAVLTRKHAEIVVNDTTVFPM-FQQHCKRKSLPEFWREHSFPADPSKEHN  226 (350)
Q Consensus       163 ---m-----------~p~i~~-~~~~kGSQW~sLtR~~Ae~Vv~d~~~~~~-F~~~c~~~~~~~~~~~~~~~~~~~~~~~  226 (350)
                         +           .+.+|. .++++|||||+|||++|+||+...+..+. ...|++                    .+
T Consensus       229 gl~~~~ks~~~~~~~~R~~P~~~~lf~GS~W~sLSR~fvey~l~~~dnlpr~ll~y~~--------------------~t  288 (421)
T PLN03183        229 GLYSTNKSDIYWVTPRRSLPTAFKLFTGSAWMVLSRSFVEYCIWGWDNLPRTLLMYYT--------------------NF  288 (421)
T ss_pred             ceeecccchhhhhhhhccCCccccccCCCceEEecHHHHHHHHhcccchHHHHHHHHh--------------------cC
Confidence               0           012232 47999999999999999999987655443 334444                    47


Q ss_pred             ccCchhHHHHHHhccC-CCCccccCceeEEeccCCCCCCCCCCCCCccccccCCCCHHHHHHHhhccccccccccccccc
Q 018810          227 CIPDEHYVQTLLAQEG-LEGELTRRSLTYSSWDLSSSKDHERRGWHPATYKYADATPLLIQSIKEIDNIYYETEHRREWC  305 (350)
Q Consensus       227 ~~pDE~ffqTLL~ns~-~~~~i~~~~LrYidW~~~~~~~~~~~g~hP~~~~~~D~t~~li~~i~~~d~~~~~~~~~~~~c  305 (350)
                      ++|||+||||+|+|++ |.+.+.|++||||+|+.       .+..||++|+.+|+     ++|.                
T Consensus       289 ~~pdE~fFqTVl~NS~~f~~t~vn~nLRyI~W~~-------~~~~~P~~l~~~D~-----~~l~----------------  340 (421)
T PLN03183        289 VSSPEGYFHTVICNVPEFAKTAVNHDLHYISWDN-------PPKQHPHTLSLNDT-----EKMI----------------  340 (421)
T ss_pred             CCCchHHHHHHHhhcccccccccCCceeEEecCC-------CCCCCCcccCHHHH-----HHHH----------------
Confidence            7999999999999997 99999999999999983       22349999999999     6666                


Q ss_pred             cCCCCCCCCceEEecCChh
Q 018810          306 SDKGKPSSCFLFARKFTRP  324 (350)
Q Consensus       306 ~~~g~~~s~~LFARKF~~~  324 (350)
                            +|+.+|||||+.+
T Consensus       341 ------~S~~lFARKFd~d  353 (421)
T PLN03183        341 ------ASGAAFARKFRRD  353 (421)
T ss_pred             ------hCCCccccCCCCC
Confidence                  4888999999976



>PF02485 Branch: Core-2/I-Branching enzyme; InterPro: IPR003406 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases Back     alignment and domain information
>KOG0799 consensus Branching enzyme [Carbohydrate transport and metabolism] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query350
2gak_A391 Beta-1,6-N-acetylglucosaminyltransferase; glycopro 4e-17
>2gak_A Beta-1,6-N-acetylglucosaminyltransferase; glycoprotein, CIS-peptide, dimer; HET: NAG; 2.00A {Mus musculus} PDB: 2gam_A* 3otk_A* Length = 391 Back     alignment and structure
 Score = 81.0 bits (199), Expect = 4e-17
 Identities = 36/315 (11%), Positives = 77/315 (24%), Gaps = 77/315 (24%)

Query: 53  QKPKIAFLFIARNRLPLEMVWDKFFKGEESRFSIYVH----SRPGFLFSKGTTRSIYFLD 108
               IA+  +  +++  EM+          +    +H    +   FL +     S +   
Sbjct: 82  VGFPIAYSIVVHHKI--EMLDRLLRAIYMPQNFYCIHVDRKAEESFLAAVQGIASCFDNV 139

Query: 109 RQVNDSIQVDWGGASMIEAERILLRHALADPFN-DRFVFLSDS----------------- 150
              +    V +   + ++A+   ++       N    + L                    
Sbjct: 140 FVASQLESVVYASWTRVKADLNCMKDLYRMNANWKYLINLCGMDFPIKTNLEIVRKLKCS 199

Query: 151 ----------------------FADTKEGRYNPKMAPVIPVHN--WRKGSQWAVLTRKHA 186
                                 +A       N  +    P        GS + V+TR++ 
Sbjct: 200 TGENNLETEKMPPNKEERWKKRYAVVDGKLTNTGIVKAPPPLKTPLFSGSAYFVVTREYV 259

Query: 187 EIVVNDTTVFPMFQQHCKRKSLPEFWREHSFPADPSKEHNCIPDEHYVQTLLAQEGLEGE 246
             V+ +  +  + +                       +    PDE    T+     + G 
Sbjct: 260 GYVLENENIQKLMEWA---------------------QDTYSPDEFLWATIQRIPEVPGS 298

Query: 247 LTRRSLTYSSWDLSSSKDHERRGWHPATYKYADATPLLIQSIKEIDNIYYETEHRREWCS 306
               +    S     +       W       ++  P    S   + ++         W  
Sbjct: 299 FPSSNKYDLSD---MNAIARFVKWQYFEGDVSNGAPYPPCSGVHVRSVCVFGAGDLSWML 355

Query: 307 DKGKPSSCFLFARKF 321
                    LFA KF
Sbjct: 356 -----RQHHLFANKF 365


Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query350
2gak_A391 Beta-1,6-N-acetylglucosaminyltransferase; glycopro 100.0
>2gak_A Beta-1,6-N-acetylglucosaminyltransferase; glycoprotein, CIS-peptide, dimer; HET: NAG; 2.00A {Mus musculus} PDB: 2gam_A* 3otk_A* Back     alignment and structure
Probab=100.00  E-value=2.7e-42  Score=343.14  Aligned_cols=229  Identities=14%  Similarity=0.113  Sum_probs=169.7

Q ss_pred             CCCcEEEEEEeCCC-CchHHHHHHHhccCCCceeEEEEeCCCCccc--C-----CCCCcceeeccccCCceeeecCcccH
Q 018810           53 QKPKIAFLFIARNR-LPLEMVWDKFFKGEESRFSIYVHSRPGFLFS--K-----GTTRSIYFLDRQVNDSIQVDWGGASM  124 (350)
Q Consensus        53 ~~~KiAfLiLAh~~-~~l~~Lw~~ff~l~~~~~sIYIHvD~k~~~~--~-----~~~~s~vf~~r~i~~Rv~V~WGg~Sl  124 (350)
                      .++||||||++|++ .++++|++.   +.++.+.+|||+|+++...  .     ......+++   +.+|+.|.|||+||
T Consensus        82 ~~~kiAflil~h~d~~~l~rll~~---ly~p~n~y~IHvD~ks~~~~~~~~~~~~~~f~NV~v---~~~~~~v~WGg~S~  155 (391)
T 2gak_A           82 VGFPIAYSIVVHHKIEMLDRLLRA---IYMPQNFYCIHVDRKAEESFLAAVQGIASCFDNVFV---ASQLESVVYASWTR  155 (391)
T ss_dssp             HTSCEEEEEEECSCHHHHHHHHHH---HCCTTSEEEEEECTTSCHHHHHHHHHHHHTCTTEEE---CSSCCCCCTTSHHH
T ss_pred             cCCCEEEEEEecCCHHHHHHHHHH---HhCCCCeEEEEEeCCCCHHHHHHHHHHHhcCCCEEE---eccCcccccCCchH
Confidence            46899999999998 456887777   7778999999999997421  0     111234444   24799999999999


Q ss_pred             HHHHHHHHHHHhc-CCCCCEEEEecccc---------------------cCCC------CCC--CCCC----------CC
Q 018810          125 IEAERILLRHALA-DPFNDRFVFLSDSF---------------------ADTK------EGR--YNPK----------MA  164 (350)
Q Consensus       125 VeAel~LLr~AL~-d~~~~~fvLLSGsF---------------------~e~~------~~R--Y~~~----------m~  164 (350)
                      |+||++||+.||+ +++|+|||||||++                     ++..      ..|  |+..          +.
T Consensus       156 v~A~l~ll~~aL~~~~~w~yfilLSgsD~PLkt~~~i~~~l~~~~~~nFIe~~~~~~~~~~R~~~~~~~~~~~l~~~~~~  235 (391)
T 2gak_A          156 VKADLNCMKDLYRMNANWKYLINLCGMDFPIKTNLEIVRKLKCSTGENNLETEKMPPNKEERWKKRYAVVDGKLTNTGIV  235 (391)
T ss_dssp             HHHHHHHHHHHHHHCSCCCEEEEEETTCEESSCHHHHHHHHHHTTTCCBCCBEECCGGGSHHHHEEEEEETTEEEEEEEE
T ss_pred             HHHHHHHHHHHHhcCCCCCEEEEecCCCccccCHHHHHHHHHhcCCCceeeccCCCcccccceEeeeeccccceeeeccc
Confidence            9999999999998 55899999999943                     3210      122  1110          00


Q ss_pred             -CCCC-ccccccccceeeecHHHHHHhhcccccchHHHHhhhccCCccccccCCCCCCCCCCCCccCchhHHHHHHhccC
Q 018810          165 -PVIP-VHNWRKGSQWAVLTRKHAEIVVNDTTVFPMFQQHCKRKSLPEFWREHSFPADPSKEHNCIPDEHYVQTLLAQEG  242 (350)
Q Consensus       165 -p~i~-~~~~~kGSQW~sLtR~~Ae~Vv~d~~~~~~F~~~c~~~~~~~~~~~~~~~~~~~~~~~~~pDE~ffqTLL~ns~  242 (350)
                       ..+| ..+|++|||||+|||++|++|++|.. .+.|.++|+                    ++|+|||+|||||+.+++
T Consensus       236 k~~~P~~~~~~~GSqW~~LtR~~v~~vl~d~~-~~~~~~~~k--------------------~t~~pDE~ffqTll~~~~  294 (391)
T 2gak_A          236 KAPPPLKTPLFSGSAYFVVTREYVGYVLENEN-IQKLMEWAQ--------------------DTYSPDEFLWATIQRIPE  294 (391)
T ss_dssp             CCCCSCSSCCEECCSCCEEEHHHHHHHHHCHH-HHHHHHHHT--------------------TSSSGGGTHHHHHTTSTT
T ss_pred             ccCCccccceecccceeEecHHHHHHHHhCCc-HHHHHHHHc--------------------CCcCCchhHHHHHhhccC
Confidence             1123 25899999999999999999999854 678888998                    589999999999999876


Q ss_pred             CCCc----------cccCceeEEeccCCCCCCCCCCCCCcc----------ccccCCCCHHHHHHHhhcccccccccccc
Q 018810          243 LEGE----------LTRRSLTYSSWDLSSSKDHERRGWHPA----------TYKYADATPLLIQSIKEIDNIYYETEHRR  302 (350)
Q Consensus       243 ~~~~----------i~~~~LrYidW~~~~~~~~~~~g~hP~----------~~~~~D~t~~li~~i~~~d~~~~~~~~~~  302 (350)
                      +.+.          ..++++|||+|+..... ..+|++||.          +|+.+|+     +.|+             
T Consensus       295 ~pg~~~~~~~~~~~~~~~~lr~i~W~~~~g~-~~~G~p~p~c~~~~~~~~C~~~~~Dl-----~~l~-------------  355 (391)
T 2gak_A          295 VPGSFPSSNKYDLSDMNAIARFVKWQYFEGD-VSNGAPYPPCSGVHVRSVCVFGAGDL-----SWML-------------  355 (391)
T ss_dssp             STTCCCSSGGGCBCTTTSSSEEECBTTTCSC-GGGTCSSCCCSSEEETTEEECCGGGH-----HHHT-------------
T ss_pred             CCCccccccccccccccCceEEEEecccCCc-cccCCCCCCCccceecceeeecHHHH-----HHHH-------------
Confidence            4332          24789999999731100 002334666          8998998     6666             


Q ss_pred             ccccCCCCCCCCceEEecCChhhHHHHHhhhhcc
Q 018810          303 EWCSDKGKPSSCFLFARKFTRPAALRLLTMSVLG  336 (350)
Q Consensus       303 ~~c~~~g~~~s~~LFARKF~~~~~~~Ll~~~~~~  336 (350)
                               .+++||||||+++++..+|+..+..
T Consensus       356 ---------~s~~lFARKF~~~~d~~~l~~l~~~  380 (391)
T 2gak_A          356 ---------RQHHLFANKFDMDVDPFAIQCLDEH  380 (391)
T ss_dssp             ---------TSCCSEEECCCTTTCHHHHHHHHHH
T ss_pred             ---------hCCCceEEccCCccCHHHHHHHHHH
Confidence                     3789999999999888898876643




Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00