Citrus Sinensis ID: 018175


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360
MRINRLPSSGHTTPSPPASPRRSPRYRHGRGKPGGRFTPVGPPGRANLAHRLAWLFLSVLLRRQGIFLFAPLIYISGMLLYMGTVSFDVVPVIKHRPAPGSVYRSPQVYAKLKPEMDADNSSSDAISTIWKASHKGVEWKPCVKRSSGELPESNGYISVEANGGLNQQRISICNAVAVAGYLNATLVIPNFHYHSIWRDPSKFRDIYDEDYFTSVLQNDVRVVNKIPEYIMERFDHNMSNVYNFRVKAWSPIQFYKDEVLPKLLEERLIRISPFANRLSFDAPPAVQRLRCLANYEALRFSSPILTMGETLVARMKERSVNHGGKYISVHLRFEEVSFHSAINVVAELLFIVITKLIISI
ccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccccccHHHHHHccccccccccccccccccccccccccccccccccccccccccccEEEEEEccccHHHHHHHHHHHHHHHHHccEEEcccccccccccccccccccccHHHHHHHccccEEEEccccHHHHHHccccccccccccccccccHHHHHHHcHHHHHHccEEEEccccccccccccHHHHHHHHHHccccccccHHHHHHHHHHHHHHHHHcccccccEEEEEcHHHHHHHcccccccHHHHHHHHHHHHHcc
ccccccccccccccccccccccccccEccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccccccccccHHHHHHHHHHHcccccccccHHHHHcccccccccccccccccccccccccEEEEEEcccHHHHHHHHHHHHHHHHHHHHHEEcccccccccccccccHHHHccHHHHHHHHHHHcEHHHcccHHHHHHccccccccccccccccccHHHHHHHHHHHHHHccEEEEcccHHHHHccccHHHHHHEEHHHHHHHHccHHHHHHHHHHHHHHHHHcccccccEEEEEEEEHHEEEccccccccHHHHHHHHHHHHcc
mrinrlpssghttpsppasprrspryrhgrgkpggrftpvgppgraNLAHRLAWLFLSVLLRRQGIFLFAPLIYISGMLLYMGTvsfdvvpvikhrpapgsvyrspqvyaklkpemdadnsssdAISTIWKAshkgvewkpcvkrssgelpesngyisveangglnqqRISICNAVAVAGYLNAtlvipnfhyhsiwrdpskfrdiydedyftsVLQNDVRVVNKIPEYIMERfdhnmsnvynfrvkawspiqfykdeVLPKLLEERLIRIspfanrlsfdappaVQRLRCLANYEalrfsspilTMGETLVARMKERsvnhggkyISVHLRFEEVSFHSAINVVAELLFIVITKLIISI
mrinrlpssghttpsppasprrspryRHGRGKPGGRFTPVGPPGRANLAHRLAWLFLSVLLRRQGIFLFAPLIYISGMLLYMGTVSFDVVPVIKhrpapgsvyrspQVYAKLKPEMDADNSSSDAISTIWKashkgvewkpCVKRSSGELPESNGYISVEANGGLNQQRISICNAVAVAGYLNATLVIPNFHYHSIWRDPSKFRDIYDEDYFtsvlqndvrvVNKIPEYIMERFDHNMSNVYNFRVKAWSPIQFYKDEVLPKLLEERLIRISpfanrlsfdapPAVQRLRCLANYEalrfsspilTMGETLVARMKERSVNHGGKYISVHLRFEEVSFHSAINVVAELLFIVITKLIISI
MRINRLPSSGHTTpsppasprrsprYRHGRGKPGGRFTPVGPPGRANLAHRLAWLFLSVLLRRQGIFLFAPLIYISGMLLYMGTVSFDVVPVIKHRPAPGSVYRSPQVYAKLKPEMDADNSSSDAISTIWKASHKGVEWKPCVKRSSGELPESNGYISVEANGGLNQQRISICNAVAVAGYLNATLVIPNFHYHSIWRDPSKFRDIYDEDYFTSVLQNDVRVVNKIPEYIMERFDHNMSNVYNFRVKAWSPIQFYKDEVLPKLLEERLIRISPFANRLSFDAPPAVQRLRCLANYEALRFSSPILTMGETLVARMKERSVNHGGKYISVHLRFEEVSFHSAINVVAELLFIVITKLIISI
*********************************************ANLAHRLAWLFLSVLLRRQGIFLFAPLIYISGMLLYMGTVSFDVVPVIKHRPAP**VY***********************STIWKASHKGVEWKPCV***********GYISVEANGGLNQQRISICNAVAVAGYLNATLVIPNFHYHSIWRDPSKFRDIYDEDYFTSVLQNDVRVVNKIPEYIMERFDHNMSNVYNFRVKAWSPIQFYKDEVLPKLLEERLIRISPFANRLSFDAPPAVQRLRCLANYEALRFSSPILTMGETLVARMKERSVNHGGKYISVHLRFEEVSFHSAINVVAELLFIVITKLII**
***************************************************LAWLFLSVLLRRQGIFLFAPLIYISGMLLYMGTVSFDVVPVIKHRPAPGSVYRSP*V********************IWKASHKGVEWKPCV***********GYISVEANGGLNQQRISICNAVAVAGYLNATLVIPNFHYHSIWRDPSKFRDIYDEDYFTSVLQNDVRVVNKIPEYIMERFDHNMSNVYNFRVKAWSPIQFYKDEVLPKLLEERLIRISPFANRLSFDAPPAVQRLRCLANYEALRFSSPILTMGETLVARMKER****GGKYISVHLRFEEVSFHSAINVVAELLFIVITKLIISI
***************************HGRGKPGGRFTPVGPPGRANLAHRLAWLFLSVLLRRQGIFLFAPLIYISGMLLYMGTVSFDVVPVIKHRPAPGSVYRSPQVYAKLKP*********DAISTIWKASHKGVEWKPCVKRSSGELPESNGYISVEANGGLNQQRISICNAVAVAGYLNATLVIPNFHYHSIWRDPSKFRDIYDEDYFTSVLQNDVRVVNKIPEYIMERFDHNMSNVYNFRVKAWSPIQFYKDEVLPKLLEERLIRISPFANRLSFDAPPAVQRLRCLANYEALRFSSPILTMGETLVARMKERSVNHGGKYISVHLRFEEVSFHSAINVVAELLFIVITKLIISI
*********************************************ANLAHRLAWLFLSVLLRRQGIFLFAPLIYISGMLLYMGTVSFDVVPVIKHRPAPGSVYRSPQVYAKLKPEMDADNSSSDAISTIWKASHKGVEWKPCVKRSSGELPESNGYISVEANGGLNQQRISICNAVAVAGYLNATLVIPNFHYHSIWRDPSKFRDIYDEDYFTSVLQNDVRVVNKIPEYIMERFDHNMSNVYNFRVKAWSPIQFYKDEVLPKLLEERLIRISPFANRLSFDAPPAVQRLRCLANYEALRFSSPILTMGETLVARMKERSVNHGGKYISVHLRFEEVSFHSAINVVAELLFIVITKLIISI
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHi
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MRINRLPSSGHTTPSPPASPRRSPRYRHGRGKPGGRFTPVGPPGRANLAHRLAWLFLSVLLRRQGIFLFAPLIYISGMLLYMGTVSFDVVPVIKHRPAPGSVYRSPQVYAKLKPEMDADNSSSDAISTIWKASHKGVEWKPCVKRSSGELPESNGYISVEANGGLNQQRISICNAVAVAGYLNATLVIPNFHYHSIWRDPSKFRDIYDEDYFTSVLQNDVRVVNKIPEYIMERFDHNMSNVYNFRVKAWSPIQFYKDEVLPKLLEERLIRISPFANRLSFDAPPAVQRLRCLANYEALRFSSPILTMGETLVARMKERSVNHGGKYISVHLRFEEVSFHSAINVVAELLFIVITKLIISI
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query360 2.2.26 [Sep-21-2011]
Q8W486 519 Uncharacterized protein A no no 0.561 0.389 0.375 2e-34
>sp|Q8W486|Y1491_ARATH Uncharacterized protein At1g04910 OS=Arabidopsis thaliana GN=At1g04910 PE=1 SV=1 Back     alignment and function desciption
 Score =  147 bits (370), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 80/213 (37%), Positives = 123/213 (57%), Gaps = 11/213 (5%)

Query: 126 ISTIWKASHKGVEWKPCVKRSSGELP---ESNGYISVEANGGLNQQRISICNAVAVAGYL 182
           +  +W+++  G  W+P     S   P   E+NGY+ V  NGGLNQQR +ICNAV  A  +
Sbjct: 59  VEELWESAKSG-GWRPSSAPRSDWPPPTKETNGYLRVRCNGGLNQQRSAICNAVLAARIM 117

Query: 183 NATLVIPNFHYHSIWRDPSKFRDIYDEDYFTSVLQNDVRVVNKIPEYIMERFDHNMSNVY 242
           NATLV+P    +S W D S F+ IYD ++F   L+ DV++V KIP+ + +         +
Sbjct: 118 NATLVLPELDANSFWHDDSGFQGIYDVEHFIETLKYDVKIVGKIPD-VHKNGKTKKIKAF 176

Query: 243 NFRVKAWSPIQFYKDEVLPKLLEERLIRISPFANRLSFDAP-PAVQRLRCLANYEALRFS 301
             R    +PI++Y    L  + E   I ++PF++RL+ +   P  QRLRC  NY ALRF 
Sbjct: 177 QIRPPRDAPIEWYLTTALKAMREHSAIYLTPFSHRLAEEIDNPEYQRLRCRVNYHALRFK 236

Query: 302 SPILTMGETLVARMKERSVNHGGKYISVHLRFE 334
             I+ + E++V +++ +     G ++S+HLRFE
Sbjct: 237 PHIMKLSESIVDKLRSQ-----GHFMSIHLRFE 264





Arabidopsis thaliana (taxid: 3702)

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query360
15226282 567 O-fucosyltransferase family protein [Ara 0.936 0.594 0.798 1e-160
297814382 567 hypothetical protein ARALYDRAFT_484066 [ 0.936 0.594 0.795 1e-159
224111498 569 predicted protein [Populus trichocarpa] 0.938 0.594 0.786 1e-154
224099495 569 predicted protein [Populus trichocarpa] 0.938 0.594 0.789 1e-153
225424244 561 PREDICTED: DUF246 domain-containing prot 0.852 0.547 0.822 1e-148
147801555 575 hypothetical protein VITISV_036881 [Viti 0.852 0.533 0.822 1e-148
449465763 563 PREDICTED: DUF246 domain-containing prot 0.877 0.561 0.796 1e-147
255570136 570 conserved hypothetical protein [Ricinus 0.936 0.591 0.785 1e-145
356521550 564 PREDICTED: DUF246 domain-containing prot 0.880 0.562 0.768 1e-144
356575943 564 PREDICTED: DUF246 domain-containing prot 0.880 0.562 0.746 1e-140
>gi|15226282|ref|NP_178257.1| O-fucosyltransferase family protein [Arabidopsis thaliana] gi|3785977|gb|AAC67324.1| similar to axi 1 protein from Nicotiana tabacum [Arabidopsis thaliana] gi|20260420|gb|AAM13108.1| similar to axi 1 protein [Arabidopsis thaliana] gi|22136280|gb|AAM91218.1| similar to axi 1 protein [Arabidopsis thaliana] gi|110742513|dbj|BAE99174.1| similar to axi 1 protein from Nicotiana tabacum [Arabidopsis thaliana] gi|330250365|gb|AEC05459.1| O-fucosyltransferase family protein [Arabidopsis thaliana] Back     alignment and taxonomy information
 Score =  570 bits (1468), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 274/343 (79%), Positives = 302/343 (88%), Gaps = 6/343 (1%)

Query: 4   NRLPSSGHTTPSPPASPRRSPRYRHGRGKP--GGRFTPVGPPGRANLAHRLAWLFLSVLL 61
           NRLP SGHTTPSPPASPRRSPRYRHGR K   G RF  V P     LAHRL+W+ LSVLL
Sbjct: 5   NRLPGSGHTTPSPPASPRRSPRYRHGRSKAAAGSRFPTVQP--SRTLAHRLSWILLSVLL 62

Query: 62  RRQGIFLFAPLIYISGMLLYMGTVSFDVVPVIKHRPAPGSVYRSPQVYAKLKPEMDADNS 121
           RRQGIFLFAPLIYIS MLLYMGTVSFDVVP+I+ RP PGSVY+SPQVYAKL+PEMDADNS
Sbjct: 63  RRQGIFLFAPLIYISCMLLYMGTVSFDVVPIIQRRPPPGSVYKSPQVYAKLRPEMDADNS 122

Query: 122 SSDAISTIWKASHKGVEWKPCVKRSSGELPESNGYISVEANGGLNQQRISICNAVAVAGY 181
           ++DAI+TIWK S+KG EWKP V +S+G+LPESNGYI VEANGGLNQQR SICNAVAVAGY
Sbjct: 123 TADAITTIWKHSYKGGEWKPYVNKSTGDLPESNGYIYVEANGGLNQQRTSICNAVAVAGY 182

Query: 182 LNATLVIPNFHYHSIWRDPSKFRDIYDEDYFTSVLQNDVRVVNKIPEYIMERFDHNMSNV 241
           LNATLVIPNFHYHSIWRDPSKF DIYDE++F S L NDVRVV+ IPEY+MERFDHNM+NV
Sbjct: 183 LNATLVIPNFHYHSIWRDPSKFGDIYDEEFFVSTLSNDVRVVDTIPEYLMERFDHNMTNV 242

Query: 242 YNFRVKAWSPIQFYKDEVLPKLLEERLIRISPFANRLSFDAPPAVQRLRCLANYEALRFS 301
           YNFRVKAWSPIQ+Y+D +LPKLLEE++IRISPFANRLSFDAP AVQRLRCLANYEAL+FS
Sbjct: 243 YNFRVKAWSPIQYYRDSILPKLLEEKIIRISPFANRLSFDAPQAVQRLRCLANYEALKFS 302

Query: 302 SPILTMGETLVARMKERSVNHGGKYISVHLRFEE--VSFHSAI 342
             ILT+GETLV RMKE+S NHG KY+SVHLRFEE  V+F   I
Sbjct: 303 KTILTLGETLVKRMKEQSANHGAKYVSVHLRFEEDMVAFSCCI 345




Source: Arabidopsis thaliana

Species: Arabidopsis thaliana

Genus: Arabidopsis

Family: Brassicaceae

Order: Brassicales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|297814382|ref|XP_002875074.1| hypothetical protein ARALYDRAFT_484066 [Arabidopsis lyrata subsp. lyrata] gi|297320912|gb|EFH51333.1| hypothetical protein ARALYDRAFT_484066 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|224111498|ref|XP_002315879.1| predicted protein [Populus trichocarpa] gi|222864919|gb|EEF02050.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224099495|ref|XP_002311505.1| predicted protein [Populus trichocarpa] gi|222851325|gb|EEE88872.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|225424244|ref|XP_002284457.1| PREDICTED: DUF246 domain-containing protein At1g04910 [Vitis vinifera] gi|297737694|emb|CBI26895.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|147801555|emb|CAN77011.1| hypothetical protein VITISV_036881 [Vitis vinifera] Back     alignment and taxonomy information
>gi|449465763|ref|XP_004150597.1| PREDICTED: DUF246 domain-containing protein At1g04910-like [Cucumis sativus] gi|449514634|ref|XP_004164435.1| PREDICTED: DUF246 domain-containing protein At1g04910-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|255570136|ref|XP_002526030.1| conserved hypothetical protein [Ricinus communis] gi|223534677|gb|EEF36370.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|356521550|ref|XP_003529417.1| PREDICTED: DUF246 domain-containing protein At1g04910-like [Glycine max] Back     alignment and taxonomy information
>gi|356575943|ref|XP_003556095.1| PREDICTED: DUF246 domain-containing protein At1g04910-like [Glycine max] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query360
TAIR|locus:2049527 567 AT2G01480 [Arabidopsis thalian 0.936 0.594 0.766 1e-139
TAIR|locus:2006852 562 AT1G14970 [Arabidopsis thalian 0.922 0.590 0.671 5.2e-120
TAIR|locus:2014666 568 AT1G35510 [Arabidopsis thalian 0.838 0.531 0.507 1.6e-79
TAIR|locus:504956200 589 AT1G38131 [Arabidopsis thalian 0.813 0.497 0.496 2.9e-71
TAIR|locus:1005716710 470 AT1G38065 "AT1G38065" [Arabido 0.525 0.402 0.592 8.4e-58
TAIR|locus:2079281 557 AT3G26370 [Arabidopsis thalian 0.538 0.348 0.439 9.3e-43
TAIR|locus:2168297 504 AT5G65470 [Arabidopsis thalian 0.594 0.424 0.404 2.5e-39
TAIR|locus:2029984 611 AT1G29200 [Arabidopsis thalian 0.6 0.353 0.408 1.1e-38
TAIR|locus:2121909 519 AT4G24530 [Arabidopsis thalian 0.575 0.398 0.381 2.8e-38
TAIR|locus:2027129 652 AT1G62330 [Arabidopsis thalian 0.536 0.296 0.404 1.1e-36
TAIR|locus:2049527 AT2G01480 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1367 (486.3 bits), Expect = 1.0e-139, P = 1.0e-139
 Identities = 263/343 (76%), Positives = 291/343 (84%)

Query:     4 NRLPSSGHTTXXXXXXXXXXXXYRHGRGKP--GGRFTPVGPPGRANLAHRLAWLFLSVLL 61
             NRLP SGHTT            YRHGR K   G RF P   P R  LAHRL+W+ LSVLL
Sbjct:     5 NRLPGSGHTTPSPPASPRRSPRYRHGRSKAAAGSRF-PTVQPSRT-LAHRLSWILLSVLL 62

Query:    62 RRQGIFLFAPLIYISGMLLYMGTVSFDVVPVIKHRPAPGSVYRSPQVYAKLKPEMDADNS 121
             RRQGIFLFAPLIYIS MLLYMGTVSFDVVP+I+ RP PGSVY+SPQVYAKL+PEMDADNS
Sbjct:    63 RRQGIFLFAPLIYISCMLLYMGTVSFDVVPIIQRRPPPGSVYKSPQVYAKLRPEMDADNS 122

Query:   122 SSDAISTIWKASHKGVEWKPCVKRSSGELPESNGYISVEANGGLNQQRISICNAVAVAGY 181
             ++DAI+TIWK S+KG EWKP V +S+G+LPESNGYI VEANGGLNQQR SICNAVAVAGY
Sbjct:   123 TADAITTIWKHSYKGGEWKPYVNKSTGDLPESNGYIYVEANGGLNQQRTSICNAVAVAGY 182

Query:   182 LNATLVIPNFHYHSIWRDPSKFRDIYDEDYFTSVLQNDVRVVNKIPEYIMERFDHNMSNV 241
             LNATLVIPNFHYHSIWRDPSKF DIYDE++F S L NDVRVV+ IPEY+MERFDHNM+NV
Sbjct:   183 LNATLVIPNFHYHSIWRDPSKFGDIYDEEFFVSTLSNDVRVVDTIPEYLMERFDHNMTNV 242

Query:   242 YNFRVKAWSPIQFYKDEVLPKLLEERLIRISPFANRLSFDAPPAVQRLRCLANYEALRFS 301
             YNFRVKAWSPIQ+Y+D +LPKLLEE++IRISPFANRLSFDAP AVQRLRCLANYEAL+FS
Sbjct:   243 YNFRVKAWSPIQYYRDSILPKLLEEKIIRISPFANRLSFDAPQAVQRLRCLANYEALKFS 302

Query:   302 SPILTMGETLVARMKERSVNHGGKYISVHLRFEE--VSFHSAI 342
               ILT+GETLV RMKE+S NHG KY+SVHLRFEE  V+F   I
Sbjct:   303 KTILTLGETLVKRMKEQSANHGAKYVSVHLRFEEDMVAFSCCI 345




GO:0005794 "Golgi apparatus" evidence=ISM
GO:0016757 "transferase activity, transferring glycosyl groups" evidence=ISM
TAIR|locus:2006852 AT1G14970 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2014666 AT1G35510 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:504956200 AT1G38131 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:1005716710 AT1G38065 "AT1G38065" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2079281 AT3G26370 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2168297 AT5G65470 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2029984 AT1G29200 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2121909 AT4G24530 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2027129 AT1G62330 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
AT2G01480
unknown protein; unknown protein; LOCATED IN- chloroplast; EXPRESSED IN- 23 plant structures; EXPRESSED DURING- 15 growth stages; CONTAINS InterPro DOMAIN/s- Protein of unknown function DUF246, plant (InterPro-IPR004348); BEST Arabidopsis thaliana protein match is- unknown protein (TAIR-AT1G14970.1); Has 431 Blast hits to 423 proteins in 21 species- Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 431; Viruses - 0; Other Eukaryotes - 0 (source- NCBI BLink). (567 aa)
(Arabidopsis thaliana)
Predicted Functional Partners:
AGP4
AGP4 (ARABINOGALACTAN PROTEIN 4); Encodes arabinogalactan-protein (AGP4). ; Proteoglycan that s [...] (135 aa)
       0.435

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query360
cd11299290 cd11299, O-FucT_plant, GDP-fucose protein O-fucosy 4e-84
pfam10250 315 pfam10250, O-FucT, GDP-fucose protein O-fucosyltra 3e-55
>gnl|CDD|211385 cd11299, O-FucT_plant, GDP-fucose protein O-fucosyltransferase, plant specific subfamily Back     alignment and domain information
 Score =  256 bits (657), Expect = 4e-84
 Identities = 87/181 (48%), Positives = 119/181 (65%), Gaps = 9/181 (4%)

Query: 156 YISVEANGGLNQQRISICNAVAVAGYLNATLVIPNFHYHSIWRDPSKFRDIYDEDYFTSV 215
           Y+ V ANGGLNQQR  IC+AVAVA  LNATLV+P    +S+W D SKF DIYD D+F   
Sbjct: 1   YLLVSANGGLNQQRSQICDAVAVARLLNATLVLPELDKNSVWGDSSKFGDIYDVDHFIKS 60

Query: 216 LQNDVRVVNKIPEYIMERFDHNMSNVYNFRVKAWSPIQFYKDEVLPKLLEERLIRISPFA 275
           L++DVRVV K+PE +  +       +   RV + S   +Y +EVLP L +  +IR++PF 
Sbjct: 61  LKDDVRVVKKLPEELASKKPE----ITVKRVPSRSSPSYYLEEVLPLLKKHGVIRLAPFD 116

Query: 276 NRLSFDA-PPAVQRLRCLANYEALRFSSPILTMGETLVARMKERSVNHGGKYISVHLRFE 334
           +RL+ D  PP +QRLRC  N+ ALRF   I  +G+ LV R++E     GG ++++HLRFE
Sbjct: 117 SRLANDLLPPEIQRLRCRVNFHALRFVPEIEELGDKLVDRLRE----AGGPFLALHLRFE 172

Query: 335 E 335
           +
Sbjct: 173 K 173


Some members of this plant-specific family of O-fucosyltransferases have been annotated as auxin-independent growth promotors. The function of the protein seems unclear. O-fucosyltransferase-like proteins are GDP-fucose dependent enzymes with similarities to the family 1 glycosyltransferases (GT1). They are soluble ER proteins that may be proteolytically cleaved from a membrane-associated preprotein, and are involved in the O-fucosylation of protein substrates, the core fucosylation of growth factor receptors, and other processes. Length = 290

>gnl|CDD|220652 pfam10250, O-FucT, GDP-fucose protein O-fucosyltransferase Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 360
PF10250 351 O-FucT: GDP-fucose protein O-fucosyltransferase; I 99.96
KOG3849 386 consensus GDP-fucose protein O-fucosyltransferase 97.47
>PF10250 O-FucT: GDP-fucose protein O-fucosyltransferase; InterPro: IPR019378 This is a family of conserved proteins representing the enzyme responsible for adding O-fucose to EGF (epidermal growth factor-like) repeats Back     alignment and domain information
Probab=99.96  E-value=1.3e-30  Score=249.10  Aligned_cols=170  Identities=35%  Similarity=0.528  Sum_probs=120.4

Q ss_pred             ecCChhHHHHHHHHHHHHHHHhcCcEeecccccccccCCCCC-----CCccCCHHHHHHhccCCceEeccCchhHHHhhc
Q 018175          161 ANGGLNQQRISICNAVAVAGYLNATLVIPNFHYHSIWRDPSK-----FRDIYDEDYFTSVLQNDVRVVNKIPEYIMERFD  235 (360)
Q Consensus       161 ~nGGLNQqR~~IcDAVaVAriLNATLVlP~l~~~S~WkD~S~-----F~DIFDvdhFI~sL~~dVrIVk~LP~~l~~~~~  235 (360)
                      +.||+||||.++++||++|++||+|||||.+.....|+|.++     |+++||+++|++.++++|.+.+.+|........
T Consensus         6 ~~GGfnNQr~~~~~a~~~A~~LnRTLVLPp~~~~~~~~~~~~~~~ipf~~~fD~~~l~~~~~~vi~~~ef~~~~~~~~~~   85 (351)
T PF10250_consen    6 CMGGFNNQRMGFENAVVFAKALNRTLVLPPFIKHYHWKDQSKQRHIPFSDFFDVEHLRKFLRPVITMEEFLPKHWDEVFR   85 (351)
T ss_dssp             -SSSHHHHHHHHHHHHHHHHHHT-EEE--EEEEESSSS----EEEEEHHHHB-HHHHTTTS--EE-HHHHHHHHS-GGG-
T ss_pred             CCCCHHHHHHHHHHHHHHHHHhCCEEEcCCccccccccccccccccChhhhccHHHHHHHhhCceehheeccchhccccc
Confidence            789999999999999999999999999999999999999988     999999999999999999999999876644221


Q ss_pred             ccC-------------------------Ccceeccccc-CCChhhHHhhchhhhhcc------ceEEEcCCCccccCC-C
Q 018175          236 HNM-------------------------SNVYNFRVKA-WSPIQFYKDEVLPKLLEE------RLIRISPFANRLSFD-A  282 (360)
Q Consensus       236 ~~~-------------------------~~i~~~~v~~-wss~~yY~e~VlP~L~k~------~VI~l~~f~~RLa~~-l  282 (360)
                      ...                         .....+.... |+.+.+|.++++|.+.++      +++.|.++..++.++ .
T Consensus        86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~  165 (351)
T PF10250_consen   86 LQYCWSPWESGSWDDNMKDGNPFGPFWDKFPIKFDPCEFWSSPSLYLEDVLPELREWNENSEHPVIAFTGFESRLPDNYL  165 (351)
T ss_dssp             EEEESS-B--------TTSSTTHHHHHHHTT---SEEE-E-TTSTTTT-STTHHHHHHHHTT-SEEEESS-SS-SS--GG
T ss_pred             hhhcccccccccchhhccccccccccccccceeeccccccCCchhhHHHhhhHHhhhccccccccceeccccccchhccc
Confidence            100                         0112223333 888999999999999887      999999999999876 7


Q ss_pred             CchhhhhhchhcccccccchhHHHHHHHHHHHHHhcccCCCCceEEEecccchhhHHHhHH
Q 018175          283 PPAVQRLRCLANYEALRFSSPILTMGETLVARMKERSVNHGGKYISVHLRFEEVSFHSAIN  343 (360)
Q Consensus       283 P~~iQrLRCrvnf~ALrF~~~I~~Lg~~LV~RmR~~s~~~~g~yIALHLRfE~DMla~a~~  343 (360)
                      +.++|+        +|+|+++|+++|++++++|+.    .+++|||+|||+|+|| ..+|.
T Consensus       166 ~~~~~r--------~l~~~~~i~~~a~~~i~~~~~----~~~~yiavHlR~~~D~-~~~C~  213 (351)
T PF10250_consen  166 DRDLQR--------YLRFSPEIRELADKFIKRLLA----GGGPYIAVHLRRGKDW-FSACE  213 (351)
T ss_dssp             GGGGGG--------G--B-HHHHHHHHHHHHHH--------SSEEEEEE--SHHH-HHHHC
T ss_pred             CccceE--------EEecCHHHHHHHHHHHHHhhc----ccCceEEEeecccCch-Hhhcc
Confidence            888888        999999999999999999992    3689999999999999 46665



Six highly conserved cysteines are present as well as a DXD-like motif (ERD), conserved in mammals, Drosophila, and Caenorhabditis elegans. Both features are characteristic of several glycosyltransferase families. The enzyme is a membrane-bound protein released by proteolysis and, as for most glycosyltransferases, is strongly activated by manganese []. ; PDB: 3ZY6_A 3ZY3_A 3ZY5_A 3ZY2_A 3ZY4_A.

>KOG3849 consensus GDP-fucose protein O-fucosyltransferase [Posttranslational modification, protein turnover, chaperones] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query360
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 4e-05
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 6e-04
3zy2_A 362 Putative GDP-fucose protein O-fucosyltransferase; 9e-04
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 44.5 bits (104), Expect = 4e-05
 Identities = 28/222 (12%), Positives = 60/222 (27%), Gaps = 75/222 (33%)

Query: 142 CVKRSSGELPESNGYISVEANGGLNQQRISICNAVAVAGYLNATLVIPNFHYHSIWR--- 198
            +     +LP        E     N +R+SI          +      N+ + +  +   
Sbjct: 310 YLDCRPQDLPR-------EVLTT-NPRRLSIIAES----IRDGLATWDNWKHVNCDKLTT 357

Query: 199 ---------DPSKFRDIYDEDYFTSVLQNDVRVVNKIPEYIMERFDHNMSNVYNFRVKAW 249
                    +P+++R ++D     SV          IP  ++                 W
Sbjct: 358 IIESSLNVLEPAEYRKMFDR---LSVFPPSA----HIPTILLSLI--------------W 396

Query: 250 SPIQFYKDE-VLPKLLEERLIRISPFANRLSF-----------DAPPAVQRLRCLANYEA 297
             +       V+ KL +  L+   P  + +S            +   A+ R   + +Y  
Sbjct: 397 FDVIKSDVMVVVNKLHKYSLVEKQPKESTISIPSIYLELKVKLENEYALHR-SIVDHYNI 455

Query: 298 LRFSSPILTMGETLVARMKERSVNHGGKYISVHLRFEEVSFH 339
            +                 +    +  +Y   H+       H
Sbjct: 456 PKTFDS------------DDLIPPYLDQYFYSHI-----GHH 480


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>3zy2_A Putative GDP-fucose protein O-fucosyltransferase; glycosyltransferase, GT-B, catalytic mechanism,; HET: GDP; 1.54A {Caenorhabditis elegans} PDB: 3zy3_A* 3zy4_A* 3zy5_A* 3zy6_A* Length = 362 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query360
4ap5_A 408 GDP-fucose protein O-fucosyltransferase 2; GT-B, G 98.36
3zy2_A 362 Putative GDP-fucose protein O-fucosyltransferase; 98.02
2hhc_A 330 NODZ, nodulation fucosyltransferase NODZ; glycosyl 85.0
2de0_X 526 Alpha-(1,6)-fucosyltransferase; FUT8, glycosyltran 82.67
>4ap5_A GDP-fucose protein O-fucosyltransferase 2; GT-B, GT68; HET: NAG; 3.00A {Homo sapiens} PDB: 4ap6_A* Back     alignment and structure
Probab=98.36  E-value=1.5e-06  Score=85.23  Aligned_cols=70  Identities=20%  Similarity=0.409  Sum_probs=54.5

Q ss_pred             CCCCceEEEEec-C-ChhHHHHHHHHHHHHHHHhcCc----EeecccccccccCCCC------CCCccCCHHHHHHhccC
Q 018175          151 PESNGYISVEAN-G-GLNQQRISICNAVAVAGYLNAT----LVIPNFHYHSIWRDPS------KFRDIYDEDYFTSVLQN  218 (360)
Q Consensus       151 ~~snGYL~V~~n-G-GLNQqR~~IcDAVaVAriLNAT----LVlP~l~~~S~WkD~S------~F~DIFDvdhFI~sL~~  218 (360)
                      ...++||+-..| | |.|+||...-.|.+.|+.||.|    ||||-...---|+-.+      .|++.||++.+    +.
T Consensus        19 ~~~~rYl~y~~~~g~Gfn~qrd~f~~~a~~a~~Lnrt~~~~LVLPPw~~~~h~~~~~~~~~~ipf~~fFDv~~L----~~   94 (408)
T 4ap5_A           19 ASRRRYLLYDVNPPEGFNLRRDVYIRIASLLKTLLKTEEWVLVLPPWGRLYHWQSPDIHQVRIPWSEFFDLPSL----NK   94 (408)
T ss_dssp             -CCCEEEEECCCTTCCHHHHHHHHHHHHHHHHHHHTTSCEEEEECCBCCSCGGGSTTSCCCSBCGGGTBCHHHH----HT
T ss_pred             ccccceEEecCCCCCcccHHHHHHHHHHHHHHHHhCcCCcEEEeCCCcccccccCCcccceeccHHHhcCHHHH----Hh
Confidence            356899999875 4 8999999999999999999999    9999665433455322      49999999875    45


Q ss_pred             CceEec
Q 018175          219 DVRVVN  224 (360)
Q Consensus       219 dVrIVk  224 (360)
                      .|+||.
T Consensus        95 ~vpVI~  100 (408)
T 4ap5_A           95 NIPVIE  100 (408)
T ss_dssp             TSCEEE
T ss_pred             hCCeeE
Confidence            666763



>3zy2_A Putative GDP-fucose protein O-fucosyltransferase; glycosyltransferase, GT-B, catalytic mechanism,; HET: GDP; 1.54A {Caenorhabditis elegans} PDB: 3zy3_A* 3zy4_A* 3zy5_A* 3zy6_A* Back     alignment and structure
>2hhc_A NODZ, nodulation fucosyltransferase NODZ; glycosyltransferase; 1.54A {Bradyrhizobium SP} PDB: 2hlh_A 2ocx_A* 3siw_A* 3six_A* Back     alignment and structure
>2de0_X Alpha-(1,6)-fucosyltransferase; FUT8, glycosyltransferase, N-glycan, COR SH3 domain; 2.61A {Homo sapiens} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00