Citrus Sinensis ID: 040374


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------14
MAMALKHAFFILALTCLIMANIANATSRNDHLNNNMKLGYNLATRLEASGSLTECWNALMELKSCSNEIIIFFLNGQADIGPDCCRAIDIITRNCWPTMLTSLGFTAEEGNILRGYCDASSAPSPGGPAVIYQPQTDNI
cccHHHHHHHHHHHHHHHHHHHHHcccccccccccccccccHHHHccccccHHHHHHHHHccccHHHHHHHHHHHcccccccHHHHHHHHHHHccccHHHccccccccHHHHHHHcccccccccccccccccccccccc
cHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccHHHHccccccHHHHHHHHHHHHHHHHHHHHHHHcccccccHHHHHHHHHHccccHHHHHHHccccHHHHHHHHccccccccccccccccccccccccc
MAMALKHAFFILALTCLIMANIAnatsrndhlnnnmKLGYNLATRLEASGSLTECWNALMELKSCSNEIIIFFLngqadigpdccraidiitrncwptmltslgftaeegnilrgycdassapspggpaviyqpqtdni
MAMALKHAFFILALTCLIMANIANATSRNDHLNNNMKLGYNLATRLEASGSLTECWNALMELKSCSNEIIIFFLNGQADIGPDCCRAIDIITRNCWPTMLTSLGFTAEEGNILRGYCDAssapspggpaviyqpqtdni
MAMALKHAFFILALTCLIMANIANATSRNDHLNNNMKLGYNLATRLEASGSLTECWNALMELKSCSNEIIIFFLNGQADIGPDCCRAIDIITRNCWPTMLTSLGFTAEEGNILRGYCDASSAPSPGGPAVIYQPQTDNI
****LKHAFFILALTCLIMANIANATSRNDHLNNNMKLGYNLATRLEASGSLTECWNALMELKSCSNEIIIFFLNGQADIGPDCCRAIDIITRNCWPTMLTSLGFTAEEGNILRGYCD*********************
******H*FFILALTCLIMANIA**************************GSLTECWNALMELKSCSNEIIIFFLNGQADIGPDCCRAIDIITRNCWPTMLTSLGFTAEEGNILRGY***********************
MAMALKHAFFILALTCLIMANIANATSRNDHLNNNMKLGYNLATRLEASGSLTECWNALMELKSCSNEIIIFFLNGQADIGPDCCRAIDIITRNCWPTMLTSLGFTAEEGNILRGYCDASSAPSPGGPAVIYQPQTDNI
MAMALKHAFFILALTCLIMANIANATS**************LA****ASGSLTECWNALMELKSCSNEIIIFFLNGQADIGPDCCRAIDIITRNCWPTMLTSLGFTAEEGNILRGYCDA********************
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MAMALKHAFFILALTCLIMANIANATSRNDHLNNNMKLGYNLATRLEASGSLTECWNALMELKSCSNEIIIFFLNGQADIGPDCCRAIDIITRNCWPTMLTSLGFTAEEGNILRGYCDASSAPSPGGPAVIYQPQTDNI
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query139 2.2.26 [Sep-21-2011]
Q9T039127 Egg cell-secreted protein yes no 0.848 0.929 0.519 7e-29
Q9SJ24125 Egg cell-secreted protein no no 0.791 0.88 0.478 3e-26
Q9SJ23125 Egg cell-secreted protein no no 0.877 0.976 0.439 8e-26
Q9SRD8158 Egg cell-secreted protein no no 0.618 0.544 0.511 4e-22
Q9FGG1155 Egg cell-secreted protein no no 0.676 0.606 0.401 9e-17
>sp|Q9T039|EC14_ARATH Egg cell-secreted protein 1.4 OS=Arabidopsis thaliana GN=EC1.4 PE=2 SV=1 Back     alignment and function desciption
 Score =  125 bits (314), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 67/129 (51%), Positives = 83/129 (64%), Gaps = 11/129 (8%)

Query: 3   MALKHAFFILALTCLIMANIANATSRNDHLNNNMKLGYNLATRLEASGSLTECWNALMEL 62
           MA    F    +T LI+  + N T     L    +   N+A RL+ SG L ECWNAL EL
Sbjct: 1   MASNTTFLFSTVTLLII--LLNTTVSGRDLP--AESSTNIAARLQ-SGGLMECWNALYEL 55

Query: 63  KSCSNEIIIFFLNGQADIGPDCCRAIDIITRNCWPTMLTSLGFTAEEGNILRGYC----- 117
           KSC+NEI++FFLNG+  +G  CC ++DIIT NCWP MLTSLGFT EE N+LRG+C     
Sbjct: 56  KSCTNEIVLFFLNGETKLGVSCCESVDIITTNCWPAMLTSLGFTPEEANVLRGFCQNPNS 115

Query: 118 -DASSAPSP 125
            D+S APSP
Sbjct: 116 GDSSPAPSP 124




Involved in the regulation of gamete interactions during the double fertilization and to prevent multiple-pollen tube attraction; mediates the redistribution of the gamete fusogen HAP2/GCS1 to the cell surface after secretion upon sperm arrival.
Arabidopsis thaliana (taxid: 3702)
>sp|Q9SJ24|EC12_ARATH Egg cell-secreted protein 1.2 OS=Arabidopsis thaliana GN=EC1.2 PE=2 SV=1 Back     alignment and function description
>sp|Q9SJ23|EC13_ARATH Egg cell-secreted protein 1.3 OS=Arabidopsis thaliana GN=EC1.3 PE=2 SV=1 Back     alignment and function description
>sp|Q9SRD8|EC11_ARATH Egg cell-secreted protein 1.1 OS=Arabidopsis thaliana GN=EC1.1 PE=2 SV=1 Back     alignment and function description
>sp|Q9FGG1|EC15_ARATH Egg cell-secreted protein 1.5 OS=Arabidopsis thaliana GN=EC1.5 PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query139
359482498128 PREDICTED: uncharacterized protein LOC10 0.892 0.968 0.610 3e-37
296082551135 unnamed protein product [Vitis vinifera] 0.798 0.822 0.669 1e-36
224060465146 predicted protein [Populus trichocarpa] 0.769 0.732 0.632 1e-33
255578404136 conserved hypothetical protein [Ricinus 0.827 0.845 0.580 6e-30
15235039127 uncharacterized protein [Arabidopsis tha 0.848 0.929 0.519 5e-27
116830619128 unknown [Arabidopsis thaliana] 0.848 0.921 0.519 6e-27
297802032127 hypothetical protein ARALYDRAFT_490711 [ 0.553 0.606 0.666 4e-25
297825031125 hypothetical protein ARALYDRAFT_900605 [ 0.834 0.928 0.465 6e-25
255578398135 conserved hypothetical protein [Ricinus 0.546 0.562 0.705 2e-24
116830473126 unknown [Arabidopsis thaliana] 0.791 0.873 0.478 2e-24
>gi|359482498|ref|XP_002274398.2| PREDICTED: uncharacterized protein LOC100260034 [Vitis vinifera] gi|147772490|emb|CAN65099.1| hypothetical protein VITISV_039724 [Vitis vinifera] gi|297743029|emb|CBI35896.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
 Score =  159 bits (402), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 80/131 (61%), Positives = 95/131 (72%), Gaps = 7/131 (5%)

Query: 3   MALKHAFFILALTCLIMANIANATSRNDHLNNNMKLGYNLATRLEASGSLTECWNALMEL 62
           MALK  F ++ALT L+    ANA +  D   N    G +L TRLE SG L ECWNALME+
Sbjct: 1   MALKTLFLLVALTWLM----ANAAATRDLPTNP---GLDLTTRLETSGGLVECWNALMEI 53

Query: 63  KSCSNEIIIFFLNGQADIGPDCCRAIDIITRNCWPTMLTSLGFTAEEGNILRGYCDASSA 122
           + C+NEII+FFLNGQ  +GP+CC+AI IITRNCWP MLTSLGFTAEEGNIL+GYC+ASS 
Sbjct: 54  RQCTNEIILFFLNGQTVLGPECCQAISIITRNCWPAMLTSLGFTAEEGNILQGYCNASSG 113

Query: 123 PSPGGPAVIYQ 133
           P       +YQ
Sbjct: 114 PPTPASPPLYQ 124




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|296082551|emb|CBI21556.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|224060465|ref|XP_002300213.1| predicted protein [Populus trichocarpa] gi|222847471|gb|EEE85018.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|255578404|ref|XP_002530067.1| conserved hypothetical protein [Ricinus communis] gi|223530420|gb|EEF32307.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|15235039|ref|NP_195644.1| uncharacterized protein [Arabidopsis thaliana] gi|75213705|sp|Q9T039.1|EC14_ARATH RecName: Full=Egg cell-secreted protein 1.4; Flags: Precursor gi|4914446|emb|CAB43649.1| hypothetical protein [Arabidopsis thaliana] gi|7270918|emb|CAB80597.1| hypothetical protein [Arabidopsis thaliana] gi|91805627|gb|ABE65542.1| hypothetical protein At4g39340 [Arabidopsis thaliana] gi|332661658|gb|AEE87058.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|116830619|gb|ABK28267.1| unknown [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|297802032|ref|XP_002868900.1| hypothetical protein ARALYDRAFT_490711 [Arabidopsis lyrata subsp. lyrata] gi|297314736|gb|EFH45159.1| hypothetical protein ARALYDRAFT_490711 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|297825031|ref|XP_002880398.1| hypothetical protein ARALYDRAFT_900605 [Arabidopsis lyrata subsp. lyrata] gi|297326237|gb|EFH56657.1| hypothetical protein ARALYDRAFT_900605 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|255578398|ref|XP_002530064.1| conserved hypothetical protein [Ricinus communis] gi|223530417|gb|EEF32304.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|116830473|gb|ABK28194.1| unknown [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query139
TAIR|locus:2136293127 EC1.4 "AT4G39340" [Arabidopsis 0.848 0.929 0.519 4.4e-29
TAIR|locus:2052536125 EC1.2 "AT2G21740" [Arabidopsis 0.827 0.92 0.472 1.3e-27
TAIR|locus:2052556125 EC1.3 "AT2G21750" [Arabidopsis 0.877 0.976 0.439 2.5e-26
TAIR|locus:2030136158 EC1.1 "AT1G76750" [Arabidopsis 0.892 0.784 0.406 7e-22
TAIR|locus:2176080155 EC1.5 "AT5G64720" [Arabidopsis 0.633 0.567 0.437 6.7e-17
TAIR|locus:2164758103 AT5G48210 "AT5G48210" [Arabido 0.352 0.475 0.36 8.3e-07
TAIR|locus:504955577119 AT3G48675 "AT3G48675" [Arabido 0.503 0.588 0.32 3.4e-06
TAIR|locus:4010713622120 AT2G14378 [Arabidopsis thalian 0.330 0.383 0.340 7e-06
TAIR|locus:504955408120 AT4G35165 "AT4G35165" [Arabido 0.330 0.383 0.361 9e-06
TAIR|locus:504954909207 AT5G54062 "AT5G54062" [Arabido 0.345 0.231 0.346 1.1e-05
TAIR|locus:2136293 EC1.4 "AT4G39340" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 323 (118.8 bits), Expect = 4.4e-29, P = 4.4e-29
 Identities = 67/129 (51%), Positives = 82/129 (63%)

Query:     3 MALKHAFFILALTCLIMANIANATSRNDHLNNNMKLGYNLATRLEASGSLTECWNALMEL 62
             MA    F    +T LI+  + N T     L        N+A RL+ SG L ECWNAL EL
Sbjct:     1 MASNTTFLFSTVTLLII--LLNTTVSGRDLPAESST--NIAARLQ-SGGLMECWNALYEL 55

Query:    63 KSCSNEIIIFFLNGQADIGPDCCRAIDIITRNCWPTMLTSLGFTAEEGNILRGYC----- 117
             KSC+NEI++FFLNG+  +G  CC ++DIIT NCWP MLTSLGFT EE N+LRG+C     
Sbjct:    56 KSCTNEIVLFFLNGETKLGVSCCESVDIITTNCWPAMLTSLGFTPEEANVLRGFCQNPNS 115

Query:   118 -DASSAPSP 125
              D+S APSP
Sbjct:   116 GDSSPAPSP 124




GO:0003674 "molecular_function" evidence=ND
GO:0005576 "extracellular region" evidence=ISM;IDA
GO:0009567 "double fertilization forming a zygote and endosperm" evidence=IGI
GO:0031982 "vesicle" evidence=IDA
GO:0080155 "regulation of double fertilization forming a zygote and endosperm" evidence=IMP
GO:2000008 "regulation of protein localization to cell surface" evidence=IMP
TAIR|locus:2052536 EC1.2 "AT2G21740" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2052556 EC1.3 "AT2G21750" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2030136 EC1.1 "AT1G76750" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2176080 EC1.5 "AT5G64720" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2164758 AT5G48210 "AT5G48210" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:504955577 AT3G48675 "AT3G48675" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:4010713622 AT2G14378 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:504955408 AT4G35165 "AT4G35165" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:504954909 AT5G54062 "AT5G54062" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q9T039EC14_ARATHNo assigned EC number0.51930.84890.9291yesno

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query139
pfam0561767 pfam05617, Prolamin_like, Prolamin-like 3e-16
>gnl|CDD|218659 pfam05617, Prolamin_like, Prolamin-like Back     alignment and domain information
 Score = 67.5 bits (165), Expect = 3e-16
 Identities = 20/67 (29%), Positives = 32/67 (47%), Gaps = 2/67 (2%)

Query: 54  ECWNALME-LKSCSNEIIIFFLNGQADIGPDCCRAIDIITRNCWPTMLTSLGFTAE-EGN 111
           +CW+A    +  C +EI       + ++GPDCC+AI  I  +CWP +             
Sbjct: 1   KCWSACAVKIPGCVDEIFASIFGNKGNLGPDCCQAILKIGDDCWPALFKMFPSLPFFPPK 60

Query: 112 ILRGYCD 118
           +L+ YC 
Sbjct: 61  LLKNYCS 67


Both DUF784 and DUF1278 members are found to be expressed in the plant embryo sac and are regulated by the Myb98 transcription factor. Computational analysis has revealed that they are homologous to the plant prolamin superfamily (Protease inhibitor-seed storage-LTP family, pfam00234). In contrast to the typical prolamin members that have eight conserved Cys residues forming four pairs of disulfide bonds, both DUF784 and DUF1278 domains only contain six conserved Cys residues that may form three pairs of disulfide bonds. These two domains may have potential functions in lipid transfer or protection during plant embryo sac development and reproduction. This family has been merged with the DUF1278 family. Length = 67

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 139
PLN00213118 predicted protein; Provisional 100.0
PF0561770 Prolamin_like: Prolamin-like; InterPro: IPR008502 99.78
PF0717295 GRP: Glycine rich protein family; InterPro: IPR010 94.28
PLN00214115 putative protein; Provisional 91.67
>PLN00213 predicted protein; Provisional Back     alignment and domain information
Probab=100.00  E-value=1.4e-39  Score=243.47  Aligned_cols=113  Identities=26%  Similarity=0.508  Sum_probs=98.5

Q ss_pred             hhHHHHHHHHHHHHHHHhhhccccccCCCC--CCccccCCchhhhccCCCCchHHHHhhhcCcchHHHHHHHHHhCccc-
Q 040374            3 MALKHAFFILALTCLIMANIANATSRNDHL--NNNMKLGYNLATRLEASGSLTECWNALMELKSCSNEIIIFFLNGQAD-   79 (139)
Q Consensus         3 Ma~k~~~~~l~v~~~~~~~~~~a~~~~~~p--~p~p~~~~~l~~~l~~~~~~~~Cwssl~~v~gC~~EI~~~fl~G~~~-   79 (139)
                      |++|||++||+|+||+++|  ||+ ++|++  +|+|||++.    ++|.+|+.||||||++++||+.||.+++++||++ 
T Consensus         1 m~iknV~~ll~v~cIvvsV--na~-lpqf~~~fp~~~p~~~----~pg~pd~~kCwSSl~~vpGCv~EI~~si~~gkf~~   73 (118)
T PLN00213          1 MSIKNVFLLLAVLCIIVSV--NAQ-LPQFPAQLPFLFPFQL----IPGLPDITKCFSSVMDIPGCIAEISQSIFTGKFGN   73 (118)
T ss_pred             CchHHHHHHHHHHHHHhee--ccC-CCCCCCCCCCCCCccC----CCCCccHHHHHHHHcCCcchHHHHHHHHHhchhcc
Confidence            8999999999999999999  888 88886  566666655    5556699999999999999999999999999997 


Q ss_pred             cChhchHHHHhhhcCcccccccCCCCCcchhhhHhhhcccC-CCCCCCC
Q 040374           80 IGPDCCRAIDIITRNCWPTMLTSLGFTAEEGNILRGYCDAS-SAPSPGG  127 (139)
Q Consensus        80 IGp~CC~AI~~I~~~CWP~mfps~pF~p~e~~~LKg~C~~~-~~~sP~~  127 (139)
                      ||++|||||++.+ +|||+| |.+||+|   ++||++|++. .+++|+|
T Consensus        74 Ig~aCCKAf~~~d-nCwP~~-P~~P~fP---p~LK~~Cs~i~~~~~~~~  117 (118)
T PLN00213         74 LGPACCKAFLDAD-NCIPKI-PFIPFFP---PMLKEQCSRVAGATPPIP  117 (118)
T ss_pred             cchHHHHHHHhhh-ccccCC-cCCCccc---hHHHHHHhcccCCCCCCC
Confidence            9999999999965 999995 9999888   7999999994 4445443



>PF05617 Prolamin_like: Prolamin-like; InterPro: IPR008502 This entry consists of several proteins of unknown function found exclusively in Arabidopsis thaliana Back     alignment and domain information
>PF07172 GRP: Glycine rich protein family; InterPro: IPR010800 This family consists of glycine rich proteins Back     alignment and domain information
>PLN00214 putative protein; Provisional Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00