Citrus Sinensis ID: 029409


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190----
MEAAAFYGGSRGIPCIHPPSSSSKRNAFSSLLVQSMATQKPLPSAAKTVGSRKNSVVFPLGEPGPRNAPLATSPPVKLLTRVEQLKLLSKAEKAGLLSAAEKFGLSLSSIEKLGLLSKAEEFGVLSAATDPATPGALLTLSLGLLLLGPSCVYLVPEDYPWEVALQGVVALVSVVGGSAAFAASNLVSNLQKSS
cccccccccccccccccccccccccccccccEEEHHccccccccccccccccccEEEEccccccccccccccccccHHHHHHHHHHHHHHHHHHcHHHHHHHHcccHHHHHHHccccHHHHHccccccccccccHHHHHHHHHHHHHcccEEEEcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccc
ccHHEEcccccccccccccccccccccccccEHHHHccccccccccccccccccEEEEcccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHccccHHHHHHHcHHHHHHHHcHHccccccccHHHHHHHHHHHHHHHcEEEEEcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccc
meaaafyggsrgipcihppsssskrNAFSSLLVQSmatqkplpsaaktvgsrknsvvfplgepgprnaplatsppvklLTRVEQLKLLSKAEKAGLLSAAEKFGLSLSSIEKLGllskaeefgvlsaatdpatpgALLTLSLGLlllgpscvylvpedypwEVALQGVVALVSVVGGSAAFAASNLVSNLQKSS
MEAAafyggsrgipcihppssSSKRNAFSSLLVQSMATQKPlpsaaktvgsrKNSVVFplgepgprnaplatsppvkLLTRVEQLKLLSKAEKAGLLSAAEKFGLSLSSIEKLGLLSKAEEFGVLSAATDPATPGALLTLSLGLLLLGPSCVYLVPEDYPWEVALQGVVALVSVVGGSAAFAasnlvsnlqkss
MEAAAFYGGSRGIPCIHPPSSSSKRNAFSSLLVQSMATQKPLPSAAKTVGSRKNSVVFPLGEPGPRNAPLATSPPVKLLTRVEQLKLLSKAEKAGLLSAAEKFGLSLSSIEKLGLLSKAEEFGVLSAATDpatpgalltlslgllllgpsCVYLVPEDYPWEvalqgvvalvsvvggsaafaasNLVSNLQKSS
*****************************************************************************LLTRVEQLKLLSKAEKAGLLSAAEKFGLSLSSIEKLGLLSKAEEFGVLSAATDPATPGALLTLSLGLLLLGPSCVYLVPEDYPWEVALQGVVALVSVVGGSAAFAASN*********
*********SR*****************************************KNSVVFPL*****************LLTRVEQLKLLSKAEKAGLLSAAEKFGLSLSSIEKLGLLSKAEEFGVLSAATDPATPGALLTLSLGLLLLGPSCVYLVPEDYPWEVALQGVVALVSVVGGSAAFAASNLVSNLQ***
********GSRGIPCIHPPSSSSKRNAFSSLLVQSMATQ************RKNSVVFPLGEPGPRNAPLATSPPVKLLTRVEQLKLLSKAEKAGLLSAAEKFGLSLSSIEKLGLLSKAEEFGVLSAATDPATPGALLTLSLGLLLLGPSCVYLVPEDYPWEVALQGVVALVSVVGGSAAFAASNLVSNLQKSS
***********GIP***P************************************S****LGEP********TSPPVKLLTRVEQLKLLSKAEKAGLLSAAEKFGLSLSSIEKLGLLSKAEEFGVLSAATDPATPGALLTLSLGLLLLGPSCVYLVPEDYPWEVALQGVVALVSVVGGSAAFAASNLVSNLQ***
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHoooooooooHHHHHHHHHHHHHHHHHHHHHiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHoooooooooooHHHHHHHHHHHHHHHHHHHHHiiiiiii
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MEAAAFYGGSRGIPCIHPPSSSSKRNAFSSLLVQSMATQKPLPSAAKTVGSRKNSVVFPLGEPGPRNAPLATSPPVKLLTRVEQLKLLSKAEKAGLLSAAEKFGLSLSSIEKLGLLSKAEEFGVLSAATDPATPGALLTLSLGLLLLGPSCVYLVPEDYPWEVALQGVVALVSVVGGSAAFAASNLVSNLQKSS
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query194
255547173188 conserved hypothetical protein [Ricinus 0.963 0.994 0.813 6e-69
359491758206 PREDICTED: uncharacterized protein LOC10 0.850 0.800 0.843 3e-64
297734067160 unnamed protein product [Vitis vinifera] 0.809 0.981 0.854 4e-61
449439329197 PREDICTED: uncharacterized protein LOC10 0.989 0.974 0.738 2e-60
224130146194 predicted protein [Populus trichocarpa] 0.994 0.994 0.764 2e-58
357457451185 hypothetical protein MTR_3g026020 [Medic 0.860 0.902 0.689 7e-52
351726766214 uncharacterized protein LOC100500269 [Gl 0.984 0.892 0.678 7e-51
15241507158 uncharacterized protein [Arabidopsis tha 0.773 0.949 0.664 1e-41
17978981158 AT5g08050/F13G24_250 [Arabidopsis thalia 0.773 0.949 0.664 2e-41
116780843183 unknown [Picea sitchensis] 0.752 0.797 0.677 3e-40
>gi|255547173|ref|XP_002514644.1| conserved hypothetical protein [Ricinus communis] gi|223546248|gb|EEF47750.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
 Score =  265 bits (678), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 157/193 (81%), Positives = 167/193 (86%), Gaps = 6/193 (3%)

Query: 1   MEAAAFYGGSRGIPCIHPPSSSSKRNAFSSLLVQSMATQKPLPSAAKTVGSRKNSVVFPL 60
           MEAAA   GSRGI C    S + KR    +LL +SMA QKP+PSAAKTV SRKNS VFPL
Sbjct: 1   MEAAAI-SGSRGI-CFLYSSPNGKR----TLLNRSMAAQKPVPSAAKTVSSRKNSTVFPL 54

Query: 61  GEPGPRNAPLATSPPVKLLTRVEQLKLLSKAEKAGLLSAAEKFGLSLSSIEKLGLLSKAE 120
           GE GPRN+PL TSPP+KLLTRVE+LKLLSKAEKAGLLSAAEKFGLSLS+IEKLGLLSKAE
Sbjct: 55  GEKGPRNSPLVTSPPIKLLTRVEELKLLSKAEKAGLLSAAEKFGLSLSTIEKLGLLSKAE 114

Query: 121 EFGVLSAATDPATPGALLTLSLGLLLLGPSCVYLVPEDYPWEVALQGVVALVSVVGGSAA 180
           E GVLSAATDP TPGAL +LSLGLLLLGPSCVYLVPEDYPWE+ LQ  VALVSVVGGSAA
Sbjct: 115 ELGVLSAATDPGTPGALFSLSLGLLLLGPSCVYLVPEDYPWEIVLQVAVALVSVVGGSAA 174

Query: 181 FAASNLVSNLQKS 193
           FAASNLVSNLQKS
Sbjct: 175 FAASNLVSNLQKS 187




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|359491758|ref|XP_002265727.2| PREDICTED: uncharacterized protein LOC100243309 [Vitis vinifera] Back     alignment and taxonomy information
>gi|297734067|emb|CBI15314.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|449439329|ref|XP_004137438.1| PREDICTED: uncharacterized protein LOC101204037 [Cucumis sativus] gi|449486926|ref|XP_004157444.1| PREDICTED: uncharacterized protein LOC101227416 [Cucumis sativus] Back     alignment and taxonomy information
>gi|224130146|ref|XP_002328665.1| predicted protein [Populus trichocarpa] gi|118487258|gb|ABK95457.1| unknown [Populus trichocarpa] gi|118488248|gb|ABK95943.1| unknown [Populus trichocarpa] gi|222838841|gb|EEE77192.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|357457451|ref|XP_003599006.1| hypothetical protein MTR_3g026020 [Medicago truncatula] gi|355488054|gb|AES69257.1| hypothetical protein MTR_3g026020 [Medicago truncatula] Back     alignment and taxonomy information
>gi|351726766|ref|NP_001235602.1| uncharacterized protein LOC100500269 [Glycine max] gi|255629883|gb|ACU15292.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|15241507|ref|NP_196422.1| uncharacterized protein [Arabidopsis thaliana] gi|6562319|emb|CAB62617.1| putative protein [Arabidopsis thaliana] gi|332003855|gb|AED91238.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|17978981|gb|AAL47451.1| AT5g08050/F13G24_250 [Arabidopsis thaliana] gi|33589718|gb|AAQ22625.1| At5g08050/F13G24_250 [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|116780843|gb|ABK21842.1| unknown [Picea sitchensis] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query194
TAIR|locus:2142848158 AT5G08050 "AT5G08050" [Arabido 0.768 0.943 0.487 3.5e-27
TAIR|locus:2027257198 AT1G74730 "AT1G74730" [Arabido 0.427 0.419 0.376 3.6e-09
TAIR|locus:2142848 AT5G08050 "AT5G08050" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 305 (112.4 bits), Expect = 3.5e-27, P = 3.5e-27
 Identities = 81/166 (48%), Positives = 93/166 (56%)

Query:    29 SSLLVQSMATQKPLPSAA-KTVGSRKNSVVFPLGEPGPRNAPLATSPPVKLLTRVEQLKL 87
             SSL V SMA +KP PSAA +T+ S+K++               AT P VKLLTRVEQLKL
Sbjct:     8 SSLTVHSMANKKPSPSAATRTITSKKST---------------AT-PQVKLLTRVEQLKL 51

Query:    88 LSKAEKAGLLSAAEKFGLSLSSIEKLGLLSKAEEFGVLSAATDXXXXXXXXXXXXXXXXX 147
             L+KAEKAGLLS AEK G SLS+IE+LGLL+KAEEFGVLSAAT+                 
Sbjct:    52 LTKAEKAGLLSLAEKSGFSLSTIERLGLLTKAEEFGVLSAATNPETPGTLFTLSLGLLLL 111

Query:   148 XXXCVYLVPEDYPWEXXXXXXXXXXXXXXXXXXXXXXNLVSNLQKS 193
                  Y+VPEDY WE                        VSNLQKS
Sbjct:   112 GPVFAYVVPEDYTWEVVIQVLVALLSVLGGSAAFAASGFVSNLQKS 157




GO:0003674 "molecular_function" evidence=ND
GO:0009507 "chloroplast" evidence=ISM;IDA
GO:0009535 "chloroplast thylakoid membrane" evidence=IDA
GO:0009579 "thylakoid" evidence=IDA
GO:0009534 "chloroplast thylakoid" evidence=IDA
GO:0080167 "response to karrikin" evidence=IEP
GO:0009658 "chloroplast organization" evidence=IMP
GO:0045893 "positive regulation of transcription, DNA-dependent" evidence=IMP
GO:0006098 "pentose-phosphate shunt" evidence=RCA
GO:0009773 "photosynthetic electron transport in photosystem I" evidence=RCA
GO:0015995 "chlorophyll biosynthetic process" evidence=RCA
GO:0019684 "photosynthesis, light reaction" evidence=RCA
TAIR|locus:2027257 AT1G74730 "AT1G74730" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query194
pfam06549116 pfam06549, DUF1118, Protein of unknown function (D 2e-39
>gnl|CDD|191553 pfam06549, DUF1118, Protein of unknown function (DUF1118) Back     alignment and domain information
 Score =  130 bits (329), Expect = 2e-39
 Identities = 68/115 (59%), Positives = 87/115 (75%), Gaps = 2/115 (1%)

Query: 81  RVEQLKLLSKAEKAGLLSAAEKFGLSLSSIEKLGLLSKAEEFGVLSAATDPA--TPGALL 138
            +E+LK+LSK EKAGLLS AE+ GL+LSSIEKLGLLSKAE+ G+LS A + A  +PGAL 
Sbjct: 1   NLEKLKVLSKVEKAGLLSKAEELGLTLSSIEKLGLLSKAEDLGLLSLAENAAGTSPGALA 60

Query: 139 TLSLGLLLLGPSCVYLVPEDYPWEVALQGVVALVSVVGGSAAFAASNLVSNLQKS 193
           +++L LL+   + V LVP+D  W VA+Q VVAL  VVGGSA F  S ++S LQ+S
Sbjct: 61  SVALPLLVAAIAAVVLVPDDSTWLVAVQAVVALALVVGGSALFVGSVVLSGLQES 115


This family consists of several hypothetical plant proteins of unknown function. Length = 116

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 194
PF06549116 DUF1118: Protein of unknown function (DUF1118); In 100.0
PF06549116 DUF1118: Protein of unknown function (DUF1118); In 99.48
>PF06549 DUF1118: Protein of unknown function (DUF1118); InterPro: IPR009500 This family consists of several hypothetical plant proteins of unknown function Back     alignment and domain information
Probab=100.00  E-value=8.2e-48  Score=302.79  Aligned_cols=114  Identities=75%  Similarity=1.032  Sum_probs=112.7

Q ss_pred             HHHHhhhHHHHHHhchhhHHHHcCCchhhhhhhcchhhhhhcccccccCCCC--ChhhHHHHHHHHHhhcceeEEEecCC
Q 029409           81 RVEQLKLLSKAEKAGLLSAAEKFGLSLSSIEKLGLLSKAEEFGVLSAATDPA--TPGALLTLSLGLLLLGPSCVYLVPED  158 (194)
Q Consensus        81 ~vEk~~LLskaEkaGLLS~aE~~GlSLs~iE~LGlLS~aE~lg~Ls~a~~~~--tPgaL~slaL~LlvAg~a~v~lvPdd  158 (194)
                      |+||+|+|||+||+||||+||++|+|||+|||+|||||||++|+||+++|+.  +||+|++++||++++||++||+||||
T Consensus         1 ~~Ek~KlLs~~EkaGLLS~AE~~GlsLS~iEkLgLlSkAE~LGlLs~a~~~~~~~P~~L~slaL~ll~ag~~~v~~vPdd   80 (116)
T PF06549_consen    1 RVEKLKLLSKAEKAGLLSKAEKAGLSLSSIEKLGLLSKAEELGLLSLAEDPASSSPGALASLALPLLVAGPAAVYLVPDD   80 (116)
T ss_pred             ChHHHHHHHHHHHhhhHHHHHHcCCcHHHHHHhccccchHHhhhhHhccccccCChHHHHHHHHHHHHhhhheEEEecCC
Confidence            6899999999999999999999999999999999999999999999999987  99999999999999999999999999


Q ss_pred             ChHHHHHHHHHHHHHhhhhhhhhhHHHHhhhhccCC
Q 029409          159 YPWEVALQGVVALVSVVGGSAAFAASNLVSNLQKSS  194 (194)
Q Consensus       159 s~~~Va~Q~vvA~~~vvg~~a~f~gS~~ls~LQ~s~  194 (194)
                      |+|+|++|+|+|++|++|++++|+||+++++|||+|
T Consensus        81 s~~~va~Q~vvA~~~~vg~~a~f~gS~~l~~LQ~s~  116 (116)
T PF06549_consen   81 STWLVALQAVVALVCVVGGSAAFAGSSLLSKLQESD  116 (116)
T ss_pred             cHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhcCC
Confidence            999999999999999999999999999999999998



>PF06549 DUF1118: Protein of unknown function (DUF1118); InterPro: IPR009500 This family consists of several hypothetical plant proteins of unknown function Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00