Citrus Sinensis ID: 028863


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200--
MVAAGESHASIRLLPLLKETLSHLFWFNPWPPRNLCHQLPKLSALERSVTVDSSTIHVQCLNSVVFPLGEPGPRNAPLATSPPVKLLTRVEQLKLLSKAEKAGLLSAAEKFGLSLSSIEKLGLLSKAEEFGVLSAATDPATPGALLTLSLGLLLLGPSCVYLVPEDYPWEVALQGVVALVSVVGGSAAFAASNLVSNLQKSS
cccccccccccHHHHcHHHHHHHHcccccccccccccccccHHHHHHHccccccccEEEEcccEEEccccccccccccccccccHHHHHHHHHHHHHHHHHHcHHHHHHHHcccHHHHHHHccccHHHHHccccccccccccHHHHHHHHHHHHHcccEEEEcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccc
cccccccHHHHHHHHHHHHHHHHHHHccccccHHHHccccccccccccccccccccEEEcccEEEEccccccccccccccccccHHHHHHHHHHHHHHHHHHcHHHHHHHccccHHHHHHHcHHHHHHHHcHHEcccccccHHHHHHHHHHHHHHHHHEEEEcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccc
mvaageshasIRLLPLLKETLShlfwfnpwpprnlchqlpklsalersvtvdsstihvqclnsvvfplgepgprnaplatsppvklLTRVEQLKLLSKAEKAGLLSAAEKFGLSLSSIEKLGllskaeefgvlsaatdpatpgALLTLSLGLlllgpscvylvpedypwEVALQGVVALVSVVGGSAAFAASNLVSNLQKSS
mvaageshasirLLPLLKETLSHLFWFNPWPPRNLCHQLPKLSALERSVTVDSSTIHVQCLNSVVFPLGEPGPrnaplatsppvkLLTRVEQLKLLSKAEKAGLLSAAEKFGLSLSSIEKLGLLSKAEEFGVLSAATDPATPGALLTLSLGLLLLGPSCVYLVPEDYPWEVALQGVVALVSVVGGSAAFAasnlvsnlqkss
MVAAGESHASIRLLPLLKETLSHLFWFNPWPPRNLCHQLPKLSALERSVTVDSSTIHVQCLNSVVFPLGEPGPRNAPLATSPPVKLLTRVEQLKLLSKAEKAGLLSAAEKFGLSLSSIEKLGLLSKAEEFGVLSAATDpatpgalltlslgllllgpsCVYLVPEDYPWEvalqgvvalvsvvggsaafaasNLVSNLQKSS
**********IRLLPLLKETLSHLFWFNPWPPRNLCHQLPKLSALERSVTVDSSTIHVQCLNSVVFPLG**************VKLLTRVEQLKLLSKAEKAGLLSAAEKFGLSLSSIEKLGLLSKAEEFGVLSAATDPATPGALLTLSLGLLLLGPSCVYLVPEDYPWEVALQGVVALVSVVGGSAAFAASN*********
***********RLLPLLKETLSHLFWFNPWPPRNLCHQLPK*********************************************LTRVEQLKLLSKAEKAGLLSAAEKFGLSLSSIEKLGLLSKAEEFGVLSAATDPATPGALLTLSLGLLLLGPSCVYLVPEDYPWEVALQGVVALVSVVGGSAAFAASNLVSNLQ***
********ASIRLLPLLKETLSHLFWFNPWPPRNLCHQLPKLSALERSVTVDSSTIHVQCLNSVVFPLGEPGPRNAPLATSPPVKLLTRVEQLKLLSKAEKAGLLSAAEKFGLSLSSIEKLGLLSKAEEFGVLSAATDPATPGALLTLSLGLLLLGPSCVYLVPEDYPWEVALQGVVALVSVVGGSAAFAASNLVSNLQKSS
********ASIRLLPLLKETLSHLFWFNPWPPRNLCHQLPKLSALER**TVDSSTIHVQCLNSVVFPLGEP********TSPPVKLLTRVEQLKLLSKAEKAGLLSAAEKFGLSLSSIEKLGLLSKAEEFGVLSAATDPATPGALLTLSLGLLLLGPSCVYLVPEDYPWEVALQGVVALVSVVGGSAAFAASNLVSNLQ***
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHoooooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHoooooooooHHHHHHHHHHHHHHHHHHHHHiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHoooooooooooHHHHHHHHHHHHHHHHHHHHHiiiiiii
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MVAAGESHASIRLLPLLKETLSHLFWFNPWPPRNLCHQLPKLSALERSVTVDSSTIHVQCLNSVVFPLGEPGPRNAPLATSPPVKLLTRVEQLKLLSKAEKAGLLSAAEKFGLSLSSIEKLGLLSKAEEFGVLSAATDPATPGALLTLSLGLLLLGPSCVYLVPEDYPWEVALQGVVALVSVVGGSAAFAASNLVSNLQKSS
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query202
255547173188 conserved hypothetical protein [Ricinus 0.693 0.744 0.892 6e-56
359491758206 PREDICTED: uncharacterized protein LOC10 0.821 0.805 0.724 1e-54
297734067160 unnamed protein product [Vitis vinifera] 0.683 0.862 0.876 3e-54
449439329197 PREDICTED: uncharacterized protein LOC10 0.787 0.807 0.768 2e-51
224130146194 predicted protein [Populus trichocarpa] 0.698 0.726 0.843 1e-45
357457451185 hypothetical protein MTR_3g026020 [Medic 0.678 0.740 0.748 4e-45
351726766214 uncharacterized protein LOC100500269 [Gl 0.683 0.644 0.769 1e-40
116780843183 unknown [Picea sitchensis] 0.673 0.743 0.704 6e-39
15241507158 uncharacterized protein [Arabidopsis tha 0.599 0.765 0.785 7e-37
17978981158 AT5g08050/F13G24_250 [Arabidopsis thalia 0.599 0.765 0.785 8e-37
>gi|255547173|ref|XP_002514644.1| conserved hypothetical protein [Ricinus communis] gi|223546248|gb|EEF47750.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
 Score =  222 bits (566), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 125/140 (89%), Positives = 131/140 (93%)

Query: 62  NSVVFPLGEPGPRNAPLATSPPVKLLTRVEQLKLLSKAEKAGLLSAAEKFGLSLSSIEKL 121
           NS VFPLGE GPRN+PL TSPP+KLLTRVE+LKLLSKAEKAGLLSAAEKFGLSLS+IEKL
Sbjct: 48  NSTVFPLGEKGPRNSPLVTSPPIKLLTRVEELKLLSKAEKAGLLSAAEKFGLSLSTIEKL 107

Query: 122 GLLSKAEEFGVLSAATDPATPGALLTLSLGLLLLGPSCVYLVPEDYPWEVALQGVVALVS 181
           GLLSKAEE GVLSAATDP TPGAL +LSLGLLLLGPSCVYLVPEDYPWE+ LQ  VALVS
Sbjct: 108 GLLSKAEELGVLSAATDPGTPGALFSLSLGLLLLGPSCVYLVPEDYPWEIVLQVAVALVS 167

Query: 182 VVGGSAAFAASNLVSNLQKS 201
           VVGGSAAFAASNLVSNLQKS
Sbjct: 168 VVGGSAAFAASNLVSNLQKS 187




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|359491758|ref|XP_002265727.2| PREDICTED: uncharacterized protein LOC100243309 [Vitis vinifera] Back     alignment and taxonomy information
>gi|297734067|emb|CBI15314.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|449439329|ref|XP_004137438.1| PREDICTED: uncharacterized protein LOC101204037 [Cucumis sativus] gi|449486926|ref|XP_004157444.1| PREDICTED: uncharacterized protein LOC101227416 [Cucumis sativus] Back     alignment and taxonomy information
>gi|224130146|ref|XP_002328665.1| predicted protein [Populus trichocarpa] gi|118487258|gb|ABK95457.1| unknown [Populus trichocarpa] gi|118488248|gb|ABK95943.1| unknown [Populus trichocarpa] gi|222838841|gb|EEE77192.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|357457451|ref|XP_003599006.1| hypothetical protein MTR_3g026020 [Medicago truncatula] gi|355488054|gb|AES69257.1| hypothetical protein MTR_3g026020 [Medicago truncatula] Back     alignment and taxonomy information
>gi|351726766|ref|NP_001235602.1| uncharacterized protein LOC100500269 [Glycine max] gi|255629883|gb|ACU15292.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|116780843|gb|ABK21842.1| unknown [Picea sitchensis] Back     alignment and taxonomy information
>gi|15241507|ref|NP_196422.1| uncharacterized protein [Arabidopsis thaliana] gi|6562319|emb|CAB62617.1| putative protein [Arabidopsis thaliana] gi|332003855|gb|AED91238.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|17978981|gb|AAL47451.1| AT5g08050/F13G24_250 [Arabidopsis thaliana] gi|33589718|gb|AAQ22625.1| At5g08050/F13G24_250 [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query202
TAIR|locus:2142848158 AT5G08050 "AT5G08050" [Arabido 0.599 0.765 0.520 8.7e-24
TAIR|locus:2027257198 AT1G74730 "AT1G74730" [Arabido 0.410 0.419 0.376 3.6e-09
TAIR|locus:2142848 AT5G08050 "AT5G08050" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 273 (101.2 bits), Expect = 8.7e-24, P = 8.7e-24
 Identities = 63/121 (52%), Positives = 70/121 (57%)

Query:    81 SPPVKLLTRVEQLKLLSKAEKAGLLSAAEKFGLSLSSIEKLGLLSKAEEFGVLSAATDXX 140
             +P VKLLTRVEQLKLL+KAEKAGLLS AEK G SLS+IE+LGLL+KAEEFGVLSAAT+  
Sbjct:    37 TPQVKLLTRVEQLKLLTKAEKAGLLSLAEKSGFSLSTIERLGLLTKAEEFGVLSAATNPE 96

Query:   141 XXXXXXXXXXXXXXXXXXCVYLVPEDYPWEXXXXXXXXXXXXXXXXXXXXXXNLVSNLQK 200
                                 Y+VPEDY WE                        VSNLQK
Sbjct:    97 TPGTLFTLSLGLLLLGPVFAYVVPEDYTWEVVIQVLVALLSVLGGSAAFAASGFVSNLQK 156

Query:   201 S 201
             S
Sbjct:   157 S 157




GO:0003674 "molecular_function" evidence=ND
GO:0009507 "chloroplast" evidence=ISM;IDA
GO:0009535 "chloroplast thylakoid membrane" evidence=IDA
GO:0009579 "thylakoid" evidence=IDA
GO:0009534 "chloroplast thylakoid" evidence=IDA
GO:0080167 "response to karrikin" evidence=IEP
GO:0009658 "chloroplast organization" evidence=IMP
GO:0045893 "positive regulation of transcription, DNA-dependent" evidence=IMP
GO:0006098 "pentose-phosphate shunt" evidence=RCA
GO:0009773 "photosynthetic electron transport in photosystem I" evidence=RCA
GO:0015995 "chlorophyll biosynthetic process" evidence=RCA
GO:0019684 "photosynthesis, light reaction" evidence=RCA
TAIR|locus:2027257 AT1G74730 "AT1G74730" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
GSVIVG00000042001
SubName- Full=Chromosome chr17 scaffold_101, whole genome shotgun sequence; (206 aa)
(Vitis vinifera)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query202
pfam06549116 pfam06549, DUF1118, Protein of unknown function (D 1e-40
>gnl|CDD|191553 pfam06549, DUF1118, Protein of unknown function (DUF1118) Back     alignment and domain information
 Score =  133 bits (337), Expect = 1e-40
 Identities = 68/115 (59%), Positives = 87/115 (75%), Gaps = 2/115 (1%)

Query: 89  RVEQLKLLSKAEKAGLLSAAEKFGLSLSSIEKLGLLSKAEEFGVLSAATDPA--TPGALL 146
            +E+LK+LSK EKAGLLS AE+ GL+LSSIEKLGLLSKAE+ G+LS A + A  +PGAL 
Sbjct: 1   NLEKLKVLSKVEKAGLLSKAEELGLTLSSIEKLGLLSKAEDLGLLSLAENAAGTSPGALA 60

Query: 147 TLSLGLLLLGPSCVYLVPEDYPWEVALQGVVALVSVVGGSAAFAASNLVSNLQKS 201
           +++L LL+   + V LVP+D  W VA+Q VVAL  VVGGSA F  S ++S LQ+S
Sbjct: 61  SVALPLLVAAIAAVVLVPDDSTWLVAVQAVVALALVVGGSALFVGSVVLSGLQES 115


This family consists of several hypothetical plant proteins of unknown function. Length = 116

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 202
PF06549116 DUF1118: Protein of unknown function (DUF1118); In 100.0
PF06549116 DUF1118: Protein of unknown function (DUF1118); In 99.62
>PF06549 DUF1118: Protein of unknown function (DUF1118); InterPro: IPR009500 This family consists of several hypothetical plant proteins of unknown function Back     alignment and domain information
Probab=100.00  E-value=1.8e-46  Score=296.23  Aligned_cols=114  Identities=75%  Similarity=1.032  Sum_probs=112.8

Q ss_pred             hHHHhhhhHHHHhhccchHHHHcCCchhhhhhhcchhhhhhhcccccccCCC--ChhhHHHHHHHHHhhccceEEEecCC
Q 028863           89 RVEQLKLLSKAEKAGLLSAAEKFGLSLSSIEKLGLLSKAEEFGVLSAATDPA--TPGALLTLSLGLLLLGPSCVYLVPED  166 (202)
Q Consensus        89 kvEk~~LLSkaEkaGLLS~aEklGlSLSkiEkLGLLS~AE~lglLSla~~~~--sP~~L~~laL~llvagpaaVylVPDD  166 (202)
                      |+||+|+|||+||+||||+||++|+|||+|||+|||||||++|+||+++|+.  +|++|+++++++++++|++||+||||
T Consensus         1 ~~Ek~KlLs~~EkaGLLS~AE~~GlsLS~iEkLgLlSkAE~LGlLs~a~~~~~~~P~~L~slaL~ll~ag~~~v~~vPdd   80 (116)
T PF06549_consen    1 RVEKLKLLSKAEKAGLLSKAEKAGLSLSSIEKLGLLSKAEELGLLSLAEDPASSSPGALASLALPLLVAGPAAVYLVPDD   80 (116)
T ss_pred             ChHHHHHHHHHHHhhhHHHHHHcCCcHHHHHHhccccchHHhhhhHhccccccCChHHHHHHHHHHHHhhhheEEEecCC
Confidence            6899999999999999999999999999999999999999999999999987  99999999999999999999999999


Q ss_pred             ChhHHHHHHHHHHHHhhhhhhhhhHHHHhhhhccCC
Q 028863          167 YPWEVALQGVVALVSVVGGSAAFAASNLVSNLQKSS  202 (202)
Q Consensus       167 s~~~ValQ~vvA~~~vvG~~a~f~gS~~ls~LQ~s~  202 (202)
                      |+|+|++|+|+|++|++|++++|+||+++++|||||
T Consensus        81 s~~~va~Q~vvA~~~~vg~~a~f~gS~~l~~LQ~s~  116 (116)
T PF06549_consen   81 STWLVALQAVVALVCVVGGSAAFAGSSLLSKLQESD  116 (116)
T ss_pred             cHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhcCC
Confidence            999999999999999999999999999999999998



>PF06549 DUF1118: Protein of unknown function (DUF1118); InterPro: IPR009500 This family consists of several hypothetical plant proteins of unknown function Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00