Citrus Sinensis ID: 020207


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------33
MKTARAWKLGGIRDFQMMAGYRQQRPHLKRPIWIIAFVTFFIIFLTGVYVYPSGSGSLASCYFFSGRGCTMLEQLPATPTRELTDAENIAQVVFKEILNKPPVHSKNPKVAFMFLTPGNLPFENLWEKFFHGHEDRFSVYVHASSEKPMHVSRYFVGRDIRSEKVAWGRISMLDAEKRLLAHALLDPDNQQFVLLSDSCIPLHNFDYVYNYLMYTNVSYVDCFEDPGPHGNGRYSEHMLPEVEKKDFRKGSQWFTLKRQHAMIIMADSLYYTKFKHYCKPNMEGRNCYADEHYLPTLFHVGTSYSVQLFRLKTTQVLPFLCYELFVLFR
cccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHcEEEEcccccccccccccccccccccccccccccccccHHHHHHHHHHHHHccccccccccccEEEEEEcccccccHHHHHHHHccccccEEEEEEcccccccccccccccccccccccccccccHHHHHHHHHHHHHccccccEEEEEcccccccccHHHHHHHHHccccccEEEcccccccccccccccccccccccccEEEccEEEEEHHHHHHHHcccHHHHHHHHccccccccccccccccHHHHHHccccccccccEEEEccccccEEEEcccEEcc
ccccEEEccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccccccHHHHHHHHHHcHHcccccccccccEEEEEEEccccccHHHHHHHHHccccccEEEEEEccccccccccccEccccccccEEEcccccHHHHHHHHHHHHHccccccEEEEEEcccccccccHHHHHHHHcccccHHHccccccccccccccccccccccHHHHccccEEEEEEHHHHHHHHHcHHHHHHHHHHccccccccccccccccccEEEEEEcccccccEEEEEEEEccEHHHHHHHHcc
MKTARAWKLGGIRDFQMMAGYrqqrphlkrpiWIIAFVTFFIIFLTGvyvypsgsgslascyffsgrgctmleqlpatptreltdaENIAQVVFKEilnkppvhsknpkvafmfltpgnlpfenlWEKFfhghedrfsvyvhassekpmhvsryfvgrdirsEKVAWGRISMLDAEKRLLAHAlldpdnqqfvllsdsciplhnfdyvYNYLMYTNVSyvdcfedpgphgngrysehmlpevekkdfrkgsqWFTLKRQHAMIIMADSLYYTKFkhyckpnmegrncyadehylptlfhvGTSYSVQLFRLKTTQVLPFLCYELFVLFR
mktarawklggirdfqMMAGYRQQRPHLKRPIWIIAFVTFFIIFLTGVYVYPSGSGSLASCYFFSGRGCTMLEQLPATPTRELTDAENIAQVVFKEILNKPPVHSKNPKVAFMFLTPGNLPFENLWEKFFHGHEDRFSVYvhassekpmhvsryfvgrdIRSEKVAWGRISMLDAEKRLLAHALLDPDNQQFVLLSDSCIPLHNFDYVYNYLMYTNVSYVDCFEDPGPHGNGRYSEHMLPEVEKKDFRKGSQWFTLKRQHAMIIMADSLYYTKFKHYCKPNMEGRNCYADEHYLPTLFHVGTSYSVQLFRLKTTQVLPFLCYELFVLFR
MKTARAWKLGGIRDFQMMAGYRQQRPHLKRPIWIIAFVTFFIIFLTGVYVYPSGSGSLASCYFFSGRGCTMLEQLPATPTRELTDAENIAQVVFKEILNKPPVHSKNPKVAFMFLTPGNLPFENLWEKFFHGHEDRFSVYVHASSEKPMHVSRYFVGRDIRSEKVAWGRISMLDAEKRLLAHALLDPDNQQFVLLSDSCIPLHNFDYVYNYLMYTNVSYVDCFEDPGPHGNGRYSEHMLPEVEKKDFRKGSQWFTLKRQHAMIIMADSLYYTKFKHYCKPNMEGRNCYADEHYLPTLFHVGTSYSVQLFRLKTTQVLPFLCYELFVLFR
*****AWKLGGIRDFQMMAGYRQQRPHLKRPIWIIAFVTFFIIFLTGVYVYPSGSGSLASCYFFSGRGCTMLEQLPATPTRELTDAENIAQVVFKEILNKPPVHSKNPKVAFMFLTPGNLPFENLWEKFFHGHEDRFSVYVHASSEKPMHVSRYFVGRDIRSEKVAWGRISMLDAEKRLLAHALLDPDNQQFVLLSDSCIPLHNFDYVYNYLMYTNVSYVDCFED*********************FRKGSQWFTLKRQHAMIIMADSLYYTKFKHYCKPNMEGRNCYADEHYLPTLFHVGTSYSVQLFRLKTTQVLPFLCYELFVLF*
********LGGIRD*****************IWIIAFVTFFIIFLTGVYVYPSGSGS****YFFSGRGCTMLEQLPATPTRELTDAENIAQVV****************VAFMFLTPGNLPFENLWEKFFHGHEDRFSVYVHASSEKPMHVSRYFVGRDIRSEKVAWGRISMLDAEKRLLAHALLDPDNQQFVLLSDSCIPLHNFDYVYNYLMYTNVSYVDCFEDPGPHGNGRYSEHMLPEVEKKDFRKGSQWFTLKRQHAMIIMADSLYYTKFKHYCKPNMEGRNCYADEHYLPTLFHVGTSYSVQLFRLKTTQVLPFLCYELFVLFR
MKTARAWKLGGIRDFQMMAGYRQQRPHLKRPIWIIAFVTFFIIFLTGVYVYPSGSGSLASCYFFSGRGCTMLEQLPATPTRELTDAENIAQVVFKEILNKPPVHSKNPKVAFMFLTPGNLPFENLWEKFFHGHEDRFSVYVHASSEKPMHVSRYFVGRDIRSEKVAWGRISMLDAEKRLLAHALLDPDNQQFVLLSDSCIPLHNFDYVYNYLMYTNVSYVDCFEDPGPHGNGRYSEHMLPEVEKKDFRKGSQWFTLKRQHAMIIMADSLYYTKFKHYCKPNMEGRNCYADEHYLPTLFHVGTSYSVQLFRLKTTQVLPFLCYELFVLFR
****R*WKLGGIRDFQMMAGYRQQRPHLKRPIWIIAFVTFFIIFLTGVYVYPSGSGSLASCYFFSGRGCTMLEQLPATPTRELTDAENIAQVVFKEILNKPPVHSKNPKVAFMFLTPGNLPFENLWEKFFHGHEDRFSVYVHASSEKPMHVSRYFVGRDIRSEKVAWGRISMLDAEKRLLAHALLDPDNQQFVLLSDSCIPLHNFDYVYNYLMYTNVSYVDCFEDPGPHGNGRYSEHMLPEVEKKDFRKGSQWFTLKRQHAMIIMADSLYYTKFKHYCKPNMEGRNCYADEHYLPTLFHVGTSYSVQLFRLKTTQVLPFLCYELFVLFR
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MKTARAWKLGGIRDFQMMAGYRQQRPHLKRPIWIIAFVTFFIIFLTGVYVYPSGSGSLASCYFFSGRGCTMLEQLPATPTRELTDAENIAQVVFKEILNKPPVHSKNPKVAFMFLTPGNLPFENLWEKFFHGHEDRFSVYVHASSEKPMHVSRYFVGRDIRSEKVAWGRISMLDAEKRLLAHALLDPDNQQFVLLSDSCIPLHNFDYVYNYLMYTNVSYVDCFEDPGPHGNGRYSEHMLPEVEKKDFRKGSQWFTLKRQHAMIIMADSLYYTKFKHYCKPNMEGRNCYADEHYLPTLFHVGTSYSVQLFRLKTTQVLPFLCYELFVLFR
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query329
225429805391 PREDICTED: uncharacterized protein LOC10 0.899 0.757 0.746 1e-135
255574113379 conserved hypothetical protein [Ricinus 0.854 0.741 0.716 1e-118
224090529392 predicted protein [Populus trichocarpa] 0.899 0.755 0.661 1e-118
225446265382 PREDICTED: uncharacterized protein LOC10 0.863 0.743 0.697 1e-118
356550390393 PREDICTED: uncharacterized protein LOC10 0.902 0.755 0.641 1e-117
356562117393 PREDICTED: uncharacterized protein LOC10 0.902 0.755 0.644 1e-117
296090292376 unnamed protein product [Vitis vinifera] 0.851 0.744 0.700 1e-116
357479561396 hypothetical protein MTR_4g127550 [Medic 0.905 0.752 0.634 1e-115
388520347396 unknown [Medicago truncatula] 0.905 0.752 0.631 1e-114
449461176393 PREDICTED: uncharacterized protein LOC10 0.896 0.750 0.627 1e-112
>gi|225429805|ref|XP_002282865.1| PREDICTED: uncharacterized protein LOC100257774 [Vitis vinifera] gi|296081776|emb|CBI20781.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
 Score =  487 bits (1254), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 224/300 (74%), Positives = 259/300 (86%), Gaps = 4/300 (1%)

Query: 1   MKTARAWKLGGIRDFQMMAGYRQQRPHLKRPIWIIAFVTFFIIFLTGVYVYPSGSGSLAS 60
           MKTARAW+LG I+D  +M+G RQ RPHLKRPIWII  V+   I L G Y+YP  S   A+
Sbjct: 1   MKTARAWQLG-IKDMLIMSGSRQ-RPHLKRPIWIIVLVSLVSICLIGAYIYPPRSS--AA 56

Query: 61  CYFFSGRGCTMLEQLPATPTRELTDAENIAQVVFKEILNKPPVHSKNPKVAFMFLTPGNL 120
           CY FS R C++ EQ    P+RELTD E  AQVV +EIL  PPVHSKNPK+AFMFLTPG+L
Sbjct: 57  CYIFSSRDCSIYEQPIHVPSRELTDDETAAQVVIREILKTPPVHSKNPKIAFMFLTPGSL 116

Query: 121 PFENLWEKFFHGHEDRFSVYVHASSEKPMHVSRYFVGRDIRSEKVAWGRISMLDAEKRLL 180
           PFE LW++FFHGHEDRF+VYVHASSEKP HVSRYF+GRDIRSEKV WG+ISM+DAEKRLL
Sbjct: 117 PFEKLWDRFFHGHEDRFTVYVHASSEKPAHVSRYFIGRDIRSEKVIWGKISMVDAEKRLL 176

Query: 181 AHALLDPDNQQFVLLSDSCIPLHNFDYVYNYLMYTNVSYVDCFEDPGPHGNGRYSEHMLP 240
            HAL DP+NQ FVLLSDSC+PLHNFDYVYNYLM+TN+SY+DCFEDPGPHGNGRYS+HM+P
Sbjct: 177 GHALEDPNNQHFVLLSDSCVPLHNFDYVYNYLMFTNISYIDCFEDPGPHGNGRYSDHMMP 236

Query: 241 EVEKKDFRKGSQWFTLKRQHAMIIMADSLYYTKFKHYCKPNMEGRNCYADEHYLPTLFHV 300
           EVE KDFRKG+QWF++KRQHA+I+MADSLYYTKFK YC+P M+GRNCYADEHYLPTLF++
Sbjct: 237 EVEVKDFRKGAQWFSMKRQHAIIVMADSLYYTKFKLYCRPGMDGRNCYADEHYLPTLFNM 296




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255574113|ref|XP_002527972.1| conserved hypothetical protein [Ricinus communis] gi|223532598|gb|EEF34384.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|224090529|ref|XP_002309015.1| predicted protein [Populus trichocarpa] gi|222854991|gb|EEE92538.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|225446265|ref|XP_002267173.1| PREDICTED: uncharacterized protein LOC100262683 [Vitis vinifera] Back     alignment and taxonomy information
>gi|356550390|ref|XP_003543570.1| PREDICTED: uncharacterized protein LOC100817202 [Glycine max] Back     alignment and taxonomy information
>gi|356562117|ref|XP_003549320.1| PREDICTED: uncharacterized protein LOC100784132 [Glycine max] Back     alignment and taxonomy information
>gi|296090292|emb|CBI40111.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|357479561|ref|XP_003610066.1| hypothetical protein MTR_4g127550 [Medicago truncatula] gi|355511121|gb|AES92263.1| hypothetical protein MTR_4g127550 [Medicago truncatula] Back     alignment and taxonomy information
>gi|388520347|gb|AFK48235.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|449461176|ref|XP_004148318.1| PREDICTED: uncharacterized protein LOC101219592 [Cucumis sativus] gi|449507419|ref|XP_004163027.1| PREDICTED: uncharacterized protein LOC101231398 [Cucumis sativus] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query329
TAIR|locus:2128181376 AT4G31350 [Arabidopsis thalian 0.838 0.734 0.665 4.4e-107
TAIR|locus:2059040394 AT2G19160 [Arabidopsis thalian 0.899 0.751 0.576 2.4e-99
TAIR|locus:2126525401 AT4G30060 [Arabidopsis thalian 0.902 0.740 0.578 4.1e-97
TAIR|locus:2117542389 AT4G25870 [Arabidopsis thalian 0.823 0.696 0.563 7.5e-89
TAIR|locus:2165605388 AT5G57270 [Arabidopsis thalian 0.841 0.713 0.570 1.2e-88
TAIR|locus:2012748412 AT1G10280 [Arabidopsis thalian 0.689 0.550 0.458 1.1e-52
TAIR|locus:2181945386 AT5G11730 [Arabidopsis thalian 0.781 0.665 0.435 3.1e-51
TAIR|locus:2197354 651 AT1G10880 [Arabidopsis thalian 0.653 0.330 0.471 6.5e-51
TAIR|locus:2181442411 AT5G16170 [Arabidopsis thalian 0.568 0.454 0.525 1.7e-50
TAIR|locus:2094781383 AT3G21310 [Arabidopsis thalian 0.665 0.571 0.458 2.8e-50
TAIR|locus:2128181 AT4G31350 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1059 (377.8 bits), Expect = 4.4e-107, P = 4.4e-107
 Identities = 185/278 (66%), Positives = 228/278 (82%)

Query:    23 QQRPHLKRPIWIIAFVTFFIIFLTGVYVYPSGSGSLASCYFFSGRGCTMLEQLPATPTRE 82
             +QRP  K P WII  V    + +   ++YP  + S+A CY FSG GC + +Q    PTRE
Sbjct:     5 RQRPPFKGPRWIITLVVLVTVVVITAFIYPPRN-SVA-CYMFSGPGCPLYQQFLFVPTRE 62

Query:    83 LTDAENIAQVVFKEILNKPPVHSKNPKVAFMFLTPGNLPFENLWEKFFHGHEDRFSVYVH 142
             LTD+E  AQVV  EI+N P   + NPK+AFMFLTPG LPFE LWE FF GHE++FSVYVH
Sbjct:    63 LTDSEAAAQVVMNEIMNLPQSKTANPKLAFMFLTPGTLPFEPLWEMFFRGHENKFSVYVH 122

Query:   143 ASSEKPMHVSRYFVGRDIRSEKVAWGRISMLDAEKRLLAHALLDPDNQQFVLLSDSCIPL 202
             AS + P+H S YFVGRDI S KVAWG+ISM+DAE+RLLAHAL+DPDNQ F+LLSDSC+PL
Sbjct:   123 ASKKSPVHTSSYFVGRDIHSHKVAWGQISMVDAERRLLAHALVDPDNQHFILLSDSCVPL 182

Query:   203 HNFDYVYNYLMYTNVSYVDCFEDPGPHGNGRYSEHMLPEVEKKDFRKGSQWFTLKRQHAM 262
              +F+Y+YN+L++ N+S++DCFEDPGPHG+GRYS+HMLPEVEKKDFRKGSQWF++KR+HA+
Sbjct:   183 FDFNYIYNHLIFANLSFIDCFEDPGPHGSGRYSQHMLPEVEKKDFRKGSQWFSMKRRHAI 242

Query:   263 IIMADSLYYTKFKHYCKPNMEGRNCYADEHYLPTLFHV 300
             ++MADSLYYTKFK YC+PNMEGRNCYADEHY PTLF++
Sbjct:   243 VVMADSLYYTKFKLYCRPNMEGRNCYADEHYFPTLFNM 280




GO:0005739 "mitochondrion" evidence=ISM
GO:0016020 "membrane" evidence=IEA
GO:0016757 "transferase activity, transferring glycosyl groups" evidence=ISM
TAIR|locus:2059040 AT2G19160 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2126525 AT4G30060 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2117542 AT4G25870 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2165605 AT5G57270 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2012748 AT1G10280 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2181945 AT5G11730 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2197354 AT1G10880 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2181442 AT5G16170 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2094781 AT3G21310 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
GSVIVG00032346001
SubName- Full=Chromosome chr4 scaffold_6, whole genome shotgun sequence; (391 aa)
(Vitis vinifera)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query329
pfam02485242 pfam02485, Branch, Core-2/I-Branching enzyme 4e-74
>gnl|CDD|217062 pfam02485, Branch, Core-2/I-Branching enzyme Back     alignment and domain information
 Score =  228 bits (583), Expect = 4e-74
 Identities = 96/205 (46%), Positives = 123/205 (60%), Gaps = 21/205 (10%)

Query: 110 VAFMFLT-PGNLPFENLWEKFFHGHEDRFSVYVHASSEKPMH------VSRYFVGRDI-R 161
           +AFMFL   G+LPF  LW +FF+  ++ + +YV A S            S +F  R I +
Sbjct: 1   IAFMFLVYKGDLPFLPLWLRFFYHPQNLYCIYVDAKSPSEFKERLRALASCFFNVRVIPK 60

Query: 162 SEKVAWGRISMLDAEKRLLAHALL-DPDNQQFVLLSDSCIPLHNFDYVYNYLMYTN--VS 218
           SE V WG  SM+DAE+RLLA+ LL DP    FVLLS+S IPL  FD +Y YL   N   S
Sbjct: 61  SEPVTWGGHSMVDAERRLLANLLLLDPSWDYFVLLSESDIPLKTFDELYRYLSSLNGNNS 120

Query: 219 YVDCFEDPGPHGNGRYSEHMLPEVEKKD----FRKGSQWFTLKRQHAMIIMADSLYYTKF 274
           +V+ F+DPG  G GRY+  MLPE++  D    +RKGSQWF L R  A  +++D+LYY  F
Sbjct: 121 FVESFDDPGWKGRGRYNPRMLPEIKLSDLPLKWRKGSQWFVLSRAFAEYVVSDNLYYPLF 180

Query: 275 KHYCKPNMEGRNCYADEHYLPTLFH 299
           K+YC        CY DEHY PTL  
Sbjct: 181 KYYCN------TCYPDEHYFPTLLG 199


This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme . I-branching enzyme is responsible for the production of the blood group I-antigen during embryonic development. Core-2 branching enzyme forms crucial side-chain branches in O-glycans. Length = 242

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 329
PF02485244 Branch: Core-2/I-Branching enzyme; InterPro: IPR00 100.0
PLN03183421 acetylglucosaminyltransferase family protein; Prov 100.0
KOG0799439 consensus Branching enzyme [Carbohydrate transport 99.83
>PF02485 Branch: Core-2/I-Branching enzyme; InterPro: IPR003406 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases Back     alignment and domain information
Probab=100.00  E-value=1.9e-40  Score=305.33  Aligned_cols=203  Identities=25%  Similarity=0.458  Sum_probs=128.0

Q ss_pred             EEEEEEcCC-CCChhHHHHHHHhcCCCCceEEEEeCCCCCC------CC--ccceeeeccC-CcccccCcchHHHHHHHH
Q 020207          110 VAFMFLTPG-NLPFENLWEKFFHGHEDRFSVYVHASSEKPM------HV--SRYFVGRDIR-SEKVAWGRISMLDAEKRL  179 (329)
Q Consensus       110 IAFLfLt~g-~l~l~~Lwe~ff~~~~~~~sIYVHvD~ks~~------~~--s~vF~~r~Ip-s~~V~WG~~SlVeAelrL  179 (329)
                      |||||++|+ +.++...|.+++.  ++...+|||+|+|++.      ..  ........++ +++|.||++|||+||++|
T Consensus         1 iAylil~h~~~~~~~~~l~~~l~--~~~~~f~iHiD~k~~~~~~~~~~~~~~~~~nv~~v~~r~~v~WG~~S~v~A~l~l   78 (244)
T PF02485_consen    1 IAYLILAHKNDPEQLERLLRLLY--HPDNDFYIHIDKKSPDYFYEEIKKLISCFPNVHFVPKRVDVRWGGFSLVEATLNL   78 (244)
T ss_dssp             EEEEEEESS--HHHHHHHHHHH----TTSEEEEEE-TTS-HHHHHHHHHHHCT-TTEEE-SS-----TTSHHHHHHHHHH
T ss_pred             CEEEEEecCCCHHHHHHHHHHhc--CCCCEEEEEEcCCCChHHHHHHHHhcccCCceeecccccccccCCccHHHHHHHH
Confidence            799999977 7665555555543  5677889999999641      10  1111122344 589999999999999999


Q ss_pred             HHHHhc-CCCCCEEEEecCCcccCCChHHHHHHHhcC--CcccEEeecCCCCCCCCCCccc----cCccccccccccccc
Q 020207          180 LAHALL-DPDNQQFVLLSDSCIPLHNFDYVYNYLMYT--NVSYVDCFEDPGPHGNGRYSEH----MLPEVEKKDFRKGSQ  252 (329)
Q Consensus       180 L~~AL~-d~~n~yFvLLSgsdiPL~s~~~I~~fL~~~--~~sFIe~~~~p~~~~~~RY~~~----mlp~i~~~~~rkGSQ  252 (329)
                      |++|++ +++|+|||||||+|+||+++++|++||+..  +.+|+++...++....+||...    +.+.++..++++|||
T Consensus        79 l~~al~~~~~~~y~~llSg~D~Pl~s~~~i~~~l~~~~~~~~f~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~GSq  158 (244)
T PF02485_consen   79 LREALKRDGDWDYFILLSGQDYPLKSNEEIHEFLESNNGDNNFIESFSDEDPRESGRYNPRIYDPFRPFFRKRTLYKGSQ  158 (244)
T ss_dssp             HHHHHHH-S---EEEEEETTEEESS-HHHHHHHHHHTTT--B---BEE--GGGG-HHHHEEEETTEEEEEEEE--EEE-S
T ss_pred             HHHHHhcCCCCcEEEEcccccccccchHHHHHHHHhcCCCCcceecccccccchhhcceeeeeeecccccccccccccce
Confidence            999999 899999999999999999999999999985  5788999887654333566543    334444458999999


Q ss_pred             eeeecHHHHHHHHhChHHHHHHHhhccCCCCCCcccccchhHhhhhccCCCCCceeEEeeecCccccceee
Q 020207          253 WFTLKRQHAMIIMADSLYYTKFKHYCKPNMEGRNCYADEHYLPTLFHVGTSYSVQLFRLKTTQVLPFLCYE  323 (329)
Q Consensus       253 WfsLtR~~A~~Vv~d~~~~~~Fk~~c~~~~~~~tc~pDEhyfpTLL~~~~~~~~~~y~l~~~~~~~~~~~~  323 (329)
                      ||+|||++|++|+.|....+.|+.+|+     .+++|||+||||||.+.+....+.    ...+++|+-|+
T Consensus       159 W~~Ltr~~v~~il~~~~~~~~~~~~~~-----~~~~pDE~ffqTll~n~~~~~~~~----~~~~~r~i~W~  220 (244)
T PF02485_consen  159 WFSLTRDFVEYILDDPNYRPKLKKYFR-----FSLCPDESFFQTLLNNSGHFKDTI----VNRNLRYIDWS  220 (244)
T ss_dssp             --EEEHHHHHHHHH-HHHHHHHHHHT------TSSSGGGTHHHHH--SSGGG-B-T----TTSSSEEE-BT
T ss_pred             eeEeeHHHHHHhhhhHHHHHHHHHhhc-----CccCcchhhHHHhhcccchhcccc----cCCCEEEEECC
Confidence            999999999999999999999999985     479999999999998772221111    23444666665



These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. This is the glycosyltransferase family 14 GT14 from CAZY, a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme (2.4.1.150 from EC) and core-2 branching enzyme (2.4.1.102 from EC). I-branching enzyme, an integral membrane protein, converts linear into branched poly-N-acetyllactosaminoglycans in the glycosylation pathway, and is responsible for the production of the blood group I-antigen during embryonic development []. Core-2 branching enzyme, also an integral membrane protein, forms crucial side-chain branches in O-glycans in the glycosylation pathway [].; GO: 0008375 acetylglucosaminyltransferase activity, 0016020 membrane; PDB: 3OTK_D 2GAM_A 2GAK_B.

>PLN03183 acetylglucosaminyltransferase family protein; Provisional Back     alignment and domain information
>KOG0799 consensus Branching enzyme [Carbohydrate transport and metabolism] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query329
2gak_A391 Beta-1,6-N-acetylglucosaminyltransferase; glycopro 2e-20
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 2e-07
>2gak_A Beta-1,6-N-acetylglucosaminyltransferase; glycoprotein, CIS-peptide, dimer; HET: NAG; 2.00A {Mus musculus} PDB: 2gam_A* 3otk_A* Length = 391 Back     alignment and structure
 Score = 89.9 bits (222), Expect = 2e-20
 Identities = 25/249 (10%), Positives = 70/249 (28%), Gaps = 30/249 (12%)

Query: 74  QLPATPTRELTDAENIAQVVFKEILNKPPVHSKNPKVAFMFLTPGNLP-FENLWEKFFHG 132
           +        +T          K I+           +A+  +    +   + L    +  
Sbjct: 50  RWTPHDYINMTRDCASFIRTRKYIVEPLTKEEVGFPIAYSIVVHHKIEMLDRLLRAIYMP 109

Query: 133 HEDRFSVYVHASSEKPMHVSRYFVGRDI-------RSEKVAWGRISMLDAEKRLLAHAL- 184
             + + ++V   +E+    +   +           + E V +   + + A+   +     
Sbjct: 110 Q-NFYCIHVDRKAEESFLAAVQGIASCFDNVFVASQLESVVYASWTRVKADLNCMKDLYR 168

Query: 185 LDPDNQQFVLLSDSCIPLHNFDYVYNYL-MYTNVSYVDCFEDPGPHGNGRYSEH------ 237
           ++ + +  + L     P+     +   L   T  + ++  + P          +      
Sbjct: 169 MNANWKYLINLCGMDFPIKTNLEIVRKLKCSTGENNLETEKMPPNKEERWKKRYAVVDGK 228

Query: 238 -------MLPEVEKKDFRKGSQWFTLKRQHAMIIMADSLYYTKFKHYCKPNMEGRNCYAD 290
                    P   K     GS +F + R++   ++ +     K   + +          D
Sbjct: 229 LTNTGIVKAPPPLKTPLFSGSAYFVVTREYVGYVLEN-ENIQKLMEWAQ-----DTYSPD 282

Query: 291 EHYLPTLFH 299
           E    T+  
Sbjct: 283 EFLWATIQR 291


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query329
2gak_A391 Beta-1,6-N-acetylglucosaminyltransferase; glycopro 100.0
>2gak_A Beta-1,6-N-acetylglucosaminyltransferase; glycoprotein, CIS-peptide, dimer; HET: NAG; 2.00A {Mus musculus} PDB: 2gam_A* 3otk_A* Back     alignment and structure
Probab=100.00  E-value=3.2e-41  Score=332.21  Aligned_cols=187  Identities=12%  Similarity=0.169  Sum_probs=153.2

Q ss_pred             CCcEEEEEEcCCCCChhHHHHHHHhc-CCCCceEEEEeCCCCCCC-------CccceeeeccC--CcccccCcchHHHHH
Q 020207          107 NPKVAFMFLTPGNLPFENLWEKFFHG-HEDRFSVYVHASSEKPMH-------VSRYFVGRDIR--SEKVAWGRISMLDAE  176 (329)
Q Consensus       107 ~~KIAFLfLt~g~l~l~~Lwe~ff~~-~~~~~sIYVHvD~ks~~~-------~s~vF~~r~Ip--s~~V~WG~~SlVeAe  176 (329)
                      .+||||||++||++   +.|+++|++ .++.+.+|||+|++++..       ..+.|.+..|.  +..|.||++|||+||
T Consensus        83 ~~kiAflil~h~d~---~~l~rll~~ly~p~n~y~IHvD~ks~~~~~~~~~~~~~~f~NV~v~~~~~~v~WGg~S~v~A~  159 (391)
T 2gak_A           83 GFPIAYSIVVHHKI---EMLDRLLRAIYMPQNFYCIHVDRKAEESFLAAVQGIASCFDNVFVASQLESVVYASWTRVKAD  159 (391)
T ss_dssp             TSCEEEEEEECSCH---HHHHHHHHHHCCTTSEEEEEECTTSCHHHHHHHHHHHHTCTTEEECSSCCCCCTTSHHHHHHH
T ss_pred             CCCEEEEEEecCCH---HHHHHHHHHHhCCCCeEEEEEeCCCCHHHHHHHHHHHhcCCCEEEeccCcccccCCchHHHHH
Confidence            47999999999976   577888886 467889999999998631       23346666653  689999999999999


Q ss_pred             HHHHHHHhcC-CCCCEEEEecCCcccCCChHHHHHHHhc-CCcccEEeecCCCCCCCCCCcc----------------cc
Q 020207          177 KRLLAHALLD-PDNQQFVLLSDSCIPLHNFDYVYNYLMY-TNVSYVDCFEDPGPHGNGRYSE----------------HM  238 (329)
Q Consensus       177 lrLL~~AL~d-~~n~yFvLLSgsdiPL~s~~~I~~fL~~-~~~sFIe~~~~p~~~~~~RY~~----------------~m  238 (329)
                      ++||++||++ .+|+|||||||+|+||+++++|++||+. .++|||++.++++. ..+|+..                ++
T Consensus       160 l~ll~~aL~~~~~w~yfilLSgsD~PLkt~~~i~~~l~~~~~~nFIe~~~~~~~-~~~R~~~~~~~~~~~l~~~~~~k~~  238 (391)
T 2gak_A          160 LNCMKDLYRMNANWKYLINLCGMDFPIKTNLEIVRKLKCSTGENNLETEKMPPN-KEERWKKRYAVVDGKLTNTGIVKAP  238 (391)
T ss_dssp             HHHHHHHHHHCSCCCEEEEEETTCEESSCHHHHHHHHHHTTTCCBCCBEECCGG-GSHHHHEEEEEETTEEEEEEEECCC
T ss_pred             HHHHHHHHhcCCCCCEEEEecCCCccccCHHHHHHHHHhcCCCceeeccCCCcc-cccceEeeeeccccceeeecccccC
Confidence            9999999984 5999999999999999999999999997 89999999987542 1223211                13


Q ss_pred             CccccccccccccceeeecHHHHHHHHhChHHHHHHHhhccCCCCCCcccccchhHhhhhccCCCCC
Q 020207          239 LPEVEKKDFRKGSQWFTLKRQHAMIIMADSLYYTKFKHYCKPNMEGRNCYADEHYLPTLFHVGTSYS  305 (329)
Q Consensus       239 lp~i~~~~~rkGSQWfsLtR~~A~~Vv~d~~~~~~Fk~~c~~~~~~~tc~pDEhyfpTLL~~~~~~~  305 (329)
                      +|.  ..+|++|||||+|||++|++|++|.. ..+|.++|+     ++|+|||||||||+.+.+.+|
T Consensus       239 ~P~--~~~~~~GSqW~~LtR~~v~~vl~d~~-~~~~~~~~k-----~t~~pDE~ffqTll~~~~~pg  297 (391)
T 2gak_A          239 PPL--KTPLFSGSAYFVVTREYVGYVLENEN-IQKLMEWAQ-----DTYSPDEFLWATIQRIPEVPG  297 (391)
T ss_dssp             CSC--SSCCEECCSCCEEEHHHHHHHHHCHH-HHHHHHHHT-----TSSSGGGTHHHHHTTSTTSTT
T ss_pred             Ccc--ccceecccceeEecHHHHHHHHhCCc-HHHHHHHHc-----CCcCCchhHHHHHhhccCCCC
Confidence            332  56899999999999999999999965 678888998     579999999999998765333




Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00