Citrus Sinensis ID: 031338


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-
MIEALMARAVGIVSPTILFQFGLSPQTIASLGRRGISFSSSSAGFRSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS
cHHHHHHHHHcccccccccccccccHHHHHHccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHcHHHHHHHHHHHHHHHHHHHHHHHHHHccccccHHHHHHHHHHHHHHHHHHHHcccccccHHHHHHHHHHHHHHHHHHHHHcccc
cHHHHHHHHcccccccEEEEcccccccEEEccccccccccccccccEEEEEEcccccHHHHHHHHHHHHHHHHHHHcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccHHHHHHHHHHHHHHHHHHHHHHHccc
MIEALMARAVgivsptilfqfglspqtiaslgrrgisfssssagFRSIRAVQenggprrlVDIVRLVPelsrnyfrspsrralFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRfyysrpkvtfpLALLNNFKMGFTYGLFIDAFKLAS
MIEALMARAVGIVSPTILFQFGLSPQTIASLGRRGISFSSSSAGFRSIRavqenggprrlvDIVRLVPELsrnyfrspsrrALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS
MIEALMARAVGIVSPTILFQFGLSPQTIASLGrrgisfssssagfrsiraVQENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS
****LMARAVGIVSPTILFQFGLSPQTIASLGRRGISF******FRSIRAVQ*NGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFK***
***********IVSPTILFQFGLS************************************VDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS
MIEALMARAVGIVSPTILFQFGLSPQTIASLGRRGISFSSSSAGFRSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS
MIEALMARAVGIVSPTILFQFGLSPQTIASLGRRGISFSSSSAGFRSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHoooooooooHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHoooooHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MIEALMARAVGIVSPTILFQFGLSPQTIASLGRRGISFSSSSAGFRSIRAVQENGGPRRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query161 2.2.26 [Sep-21-2011]
P72983109 Ycf20-like protein OS=Syn N/A no 0.639 0.944 0.371 5e-10
Q1XDS2108 Uncharacterized protein y N/A no 0.590 0.879 0.422 6e-10
P51214108 Uncharacterized protein y N/A no 0.552 0.824 0.445 9e-09
O80813197 Ycf20-like protein OS=Ara no no 0.453 0.370 0.35 0.0002
Q9MUL5116 Uncharacterized protein y N/A no 0.434 0.603 0.354 0.0003
>sp|P72983|YC20L_SYNY3 Ycf20-like protein OS=Synechocystis sp. (strain PCC 6803 / Kazusa) GN=sll1509 PE=3 SV=1 Back     alignment and function desciption
 Score = 63.5 bits (153), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/105 (37%), Positives = 56/105 (53%), Gaps = 2/105 (1%)

Query: 59  RLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCV 118
           RL  IV +  +    +FR+P RR     +S L GF+V   ++ + G     DV+ A   +
Sbjct: 5   RLNTIVEVRGQQLSQFFRNPWRRISLSLLSFLFGFFVGTAVATTAGQNSQWDVVCAAFIL 64

Query: 119 LLTEYVTRFYYSRPKVTFPL--ALLNNFKMGFTYGLFIDAFKLAS 161
           L  E V R++Y R      L   +LN FKMG +Y LF++AFKL S
Sbjct: 65  LFCELVNRWFYRRGVKMGDLQAEVLNIFKMGVSYSLFLEAFKLGS 109





Synechocystis sp. (strain PCC 6803 / Kazusa) (taxid: 1111708)
>sp|Q1XDS2|YCF20_PORYE Uncharacterized protein ycf20 OS=Porphyra yezoensis GN=ycf20 PE=3 SV=1 Back     alignment and function description
>sp|P51214|YCF20_PORPU Uncharacterized protein ycf20 OS=Porphyra purpurea GN=ycf20 PE=3 SV=1 Back     alignment and function description
>sp|O80813|YC20L_ARATH Ycf20-like protein OS=Arabidopsis thaliana GN=At1g65420 PE=2 SV=2 Back     alignment and function description
>sp|Q9MUL5|YCF20_MESVI Uncharacterized protein ycf20 OS=Mesostigma viride GN=ycf20 PE=3 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query161
225469306155 PREDICTED: ycf20-like protein isoform 2 0.962 1.0 0.782 2e-60
255539575155 conserved hypothetical protein [Ricinus 0.962 1.0 0.769 7e-60
356566265151 PREDICTED: uncharacterized protein ycf20 0.863 0.920 0.859 1e-59
224122468155 predicted protein [Populus trichocarpa] 0.962 1.0 0.762 3e-59
449451787156 PREDICTED: uncharacterized protein ycf20 0.956 0.987 0.759 4e-58
212274997155 hypothetical protein [Zea mays] gi|19468 0.913 0.948 0.718 3e-53
115459106155 Os04g0488600 [Oryza sativa Japonica Grou 0.751 0.780 0.876 7e-53
125548804190 hypothetical protein OsI_16403 [Oryza sa 0.751 0.636 0.876 1e-52
297795157159 hypothetical protein ARALYDRAFT_917392 [ 0.720 0.729 0.872 3e-52
15239166158 uncharacterized protein [Arabidopsis tha 0.720 0.734 0.872 6e-52
>gi|225469306|ref|XP_002269823.1| PREDICTED: ycf20-like protein isoform 2 [Vitis vinifera] gi|225469308|ref|XP_002269782.1| PREDICTED: ycf20-like protein isoform 1 [Vitis vinifera] gi|297741124|emb|CBI31855.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
 Score =  236 bits (602), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 122/156 (78%), Positives = 136/156 (87%), Gaps = 1/156 (0%)

Query: 6   MARAVGIVSPTILFQFGLSPQTIASLGRRGISFSSSSAGFRSIRAVQENGGPRRLVDIVR 65
           MA ++ + S T L +FGL+  T  S G+  +  S SS  F S++AVQEN GPRRLVDI+R
Sbjct: 1   MAHSLSLPS-TCLLKFGLATGTRVSHGKCSMLASRSSPRFFSVQAVQENEGPRRLVDIIR 59

Query: 66  LVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLLTEYVT 125
           +VP+LSRNYF+SPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVL+TEYVT
Sbjct: 60  IVPQLSRNYFQSPSRRALFGGISLLGGFYVAQTISLSFGALGVNDVIAAVLCVLITEYVT 119

Query: 126 RFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
           RFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS
Sbjct: 120 RFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 155




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255539575|ref|XP_002510852.1| conserved hypothetical protein [Ricinus communis] gi|223549967|gb|EEF51454.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|356566265|ref|XP_003551354.1| PREDICTED: uncharacterized protein ycf20-like [Glycine max] Back     alignment and taxonomy information
>gi|224122468|ref|XP_002318844.1| predicted protein [Populus trichocarpa] gi|222859517|gb|EEE97064.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|449451787|ref|XP_004143642.1| PREDICTED: uncharacterized protein ycf20-like [Cucumis sativus] gi|449530953|ref|XP_004172456.1| PREDICTED: uncharacterized protein ycf20-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|212274997|ref|NP_001130729.1| hypothetical protein [Zea mays] gi|194689966|gb|ACF79067.1| unknown [Zea mays] gi|195626582|gb|ACG35121.1| hypothetical protein [Zea mays] gi|223950487|gb|ACN29327.1| unknown [Zea mays] gi|414586642|tpg|DAA37213.1| TPA: hypothetical protein ZEAMMB73_365649 [Zea mays] gi|414586643|tpg|DAA37214.1| TPA: hypothetical protein ZEAMMB73_365649 [Zea mays] gi|414586644|tpg|DAA37215.1| TPA: hypothetical protein ZEAMMB73_365649 [Zea mays] gi|414586645|tpg|DAA37216.1| TPA: hypothetical protein ZEAMMB73_365649 [Zea mays] gi|414586646|tpg|DAA37217.1| TPA: hypothetical protein ZEAMMB73_365649 [Zea mays] Back     alignment and taxonomy information
>gi|115459106|ref|NP_001053153.1| Os04g0488600 [Oryza sativa Japonica Group] gi|38344242|emb|CAD41335.2| OJ991113_30.19 [Oryza sativa Japonica Group] gi|113564724|dbj|BAF15067.1| Os04g0488600 [Oryza sativa Japonica Group] gi|215707021|dbj|BAG93481.1| unnamed protein product [Oryza sativa Japonica Group] gi|215737413|dbj|BAG96543.1| unnamed protein product [Oryza sativa Japonica Group] Back     alignment and taxonomy information
>gi|125548804|gb|EAY94626.1| hypothetical protein OsI_16403 [Oryza sativa Indica Group] gi|125590821|gb|EAZ31171.1| hypothetical protein OsJ_15270 [Oryza sativa Japonica Group] Back     alignment and taxonomy information
>gi|297795157|ref|XP_002865463.1| hypothetical protein ARALYDRAFT_917392 [Arabidopsis lyrata subsp. lyrata] gi|297311298|gb|EFH41722.1| hypothetical protein ARALYDRAFT_917392 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|15239166|ref|NP_199119.1| uncharacterized protein [Arabidopsis thaliana] gi|9757831|dbj|BAB08268.1| unnamed protein product [Arabidopsis thaliana] gi|88900426|gb|ABD57525.1| At5g43050 [Arabidopsis thaliana] gi|332007521|gb|AED94904.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query161
TAIR|locus:2167811158 NPQ6 "AT5G43050" [Arabidopsis 0.689 0.702 0.867 2e-47
TAIR|locus:2103645230 AT3G56830 "AT3G56830" [Arabido 0.434 0.304 0.386 1.4e-05
TAIR|locus:2206305197 NPQ7 "AT1G65420" [Arabidopsis 0.453 0.370 0.35 8e-05
TAIR|locus:2167811 NPQ6 "AT5G43050" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 496 (179.7 bits), Expect = 2.0e-47, P = 2.0e-47
 Identities = 98/113 (86%), Positives = 104/113 (92%)

Query:    51 VQENGGP--RRLVDIVRLVPELSRNYFRSPSRRALFGGISLLGGFYVAQTISLSFGALGV 108
             +QE  GP  RRL+DI+R VPE+SRNYF+ PSRR LFGGISLLGGFYVAQTISLSFGALGV
Sbjct:    46 LQETEGPTPRRLIDIIRSVPEISRNYFKKPSRRTLFGGISLLGGFYVAQTISLSFGALGV 105

Query:   109 NDVIAAVLCVLLTEYVTRFYYSRPKVTFPLALLNNFKMGFTYGLFIDAFKLAS 161
             NDVIAAVLCVLLTEYVTRFYYSR  VTFP+ALLNNFKMGFTYGLFIDAFKLAS
Sbjct:   106 NDVIAAVLCVLLTEYVTRFYYSRTTVTFPIALLNNFKMGFTYGLFIDAFKLAS 158




GO:0003674 "molecular_function" evidence=ND
GO:0009507 "chloroplast" evidence=ISM;IDA
GO:0010196 "nonphotochemical quenching" evidence=IMP
TAIR|locus:2103645 AT3G56830 "AT3G56830" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2206305 NPQ7 "AT1G65420" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query161
pfam0448360 pfam04483, DUF565, Protein of unknown function (DU 3e-19
>gnl|CDD|218106 pfam04483, DUF565, Protein of unknown function (DUF565) Back     alignment and domain information
 Score = 76.1 bits (188), Expect = 3e-19
 Identities = 28/60 (46%), Positives = 38/60 (63%), Gaps = 3/60 (5%)

Query: 105 ALGVNDVIAAVLCVLLTEYVTRFYYSRPKVTF---PLALLNNFKMGFTYGLFIDAFKLAS 161
             G  D++AA + V +TE +++ +YSRP        L LLN FK+G  YGLF+DAFKL S
Sbjct: 1   QTGDWDILAAAILVAITEIISKLFYSRPSKKRRSLILELLNLFKIGLIYGLFVDAFKLGS 60


Predicted transmembrane protein found in plants, chloroplasts and cyanobacteria. This family is also known as YCF20. Length = 60

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 161
PF0448360 DUF565: Protein of unknown function (DUF565); Inte 99.92
>PF04483 DUF565: Protein of unknown function (DUF565); InterPro: IPR007572 This family represents Ycf20, it is found in cyanobacteria and is also encoded in plant and algal chloroplasts; its function is unknown Back     alignment and domain information
Probab=99.92  E-value=2.1e-25  Score=153.52  Aligned_cols=57  Identities=53%  Similarity=0.869  Sum_probs=53.6

Q ss_pred             ccccchHHHHHHHHHHHHHHHHHhccCCCC---CchhhhhhHHHHHHHHHHHHHHhccCC
Q 031338          105 ALGVNDVIAAVLCVLLTEYVTRFYYSRPKV---TFPLALLNNFKMGFTYGLFIDAFKLAS  161 (161)
Q Consensus       105 q~g~wD~l~A~liVl~~Evi~rl~Y~r~~~---~l~~~lLN~fKIGllYGLflEAFKLGS  161 (161)
                      |+|+|||++|+++|+++|++++++|+++++   +++++++|+||||++||||+|||||||
T Consensus         1 Q~g~wD~i~a~~iv~~~E~i~~l~Y~~~~~~~~~~~~~~lN~~KiGl~YgLfleAFKLGS   60 (60)
T PF04483_consen    1 QTGDWDVIAAAIIVLFIEVISRLRYSKPKKKRKSLLVELLNNFKIGLLYGLFLEAFKLGS   60 (60)
T ss_pred             CCchhHHHHHHHHHHHHHHHHHHhhccccccccchHHHHHHHHHHHHHHHHHHHHHhcCC
Confidence            789999999999999999999999997643   478999999999999999999999998



As the family is exclusively found in phototrophic organisms it may therefore play a role in photosynthesis.


Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00