Citrus Sinensis ID: 030966


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------17
MAYAPVRYKTLGILYYTNPLPKATSEETSSGTDQYVVDKRDGATAAEDVPAVEKNVYNESVATAVPKEESPVDGLTNELLDNLKIKFDSEDKYSLVLYGTGALLALWLTTVVVGAIDSIPLFPKLMEVVGLGYTLWFSWRYLLFKKNRDELATKIEELKQQVLGSNDD
ccccccHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHcccccHHHHHHHHHHHHHHHHHHHHHHHHcccccccHHHHHHHHHEEEHHHHHHHcccccHHHHHHHHHHHHHHHcccccc
cccccHHHHEccEEEEcccccccccccccccccEEEEccccccEEcccccccccccccccccccccccccHHHHHHHHHHHHHHHHHcccccHHHHHHHHHHHHHHHHHHHHHHHHHcccccHHHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHcccccc
mayapvryKTLGILYytnplpkatseetssgtdqyvvdkrdgataaedvpaveknvynesvatavpkeespvdgltnELLDNLkikfdsedkysLVLYGTGALLALWLTTVVVGaidsiplfpKLMEVVGLGYTLWFSWRYLLFKKNRDELATKIEELKQQVLGSNDD
mayapvryKTLGILYYTNPlpkatseetssgtdqyVVDKRDGAtaaedvpaveknvynesvatavpkeespvdglTNELLDNLKIKFDSEDKYSLVLYGTGALLALWLTTVVVGAIDSIPLFPKLMEVVGLGYTLWFSWRYLLFKKNRDELATKIEelkqqvlgsndd
MAYAPVRYKTLGILYYTNPLPKATSEETSSGTDQYVVDKRDGATAAEDVPAVEKNVYNESVATAVPKEESPVDGLTNELLDNLKIKFDSEDKYSLVLYgtgallalwlttvvvgaIDSIPLFPKLMEVVGLGYTLWFSWRYLLFKKNRDELATKIEELKQQVLGSNDD
***APVRYKTLGILYYTN********************************************************LTNELLDNLKIKFDSEDKYSLVLYGTGALLALWLTTVVVGAIDSIPLFPKLMEVVGLGYTLWFSWRYLLFKKNRDEL*****************
****P*RYKTLGILYYTNPLP*****************************************************LTNELLDNLKIKFDSEDKYSLVLYGTGALLALWLTTVVVGAIDSIPLFPKLMEVVGLGYTLWFSWRYLLFKKNRDELATKIEELKQ*VLG****
MAYAPVRYKTLGILYYTNPLPK**********DQYVVDKRDGATAAEDVPAVEKNVYNESVATAVPKEESPVDGLTNELLDNLKIKFDSEDKYSLVLYGTGALLALWLTTVVVGAIDSIPLFPKLMEVVGLGYTLWFSWRYLLFKKNRDELATKIEELKQ********
***APVRYKTLGILYYTNPLPK**********DQY****RDGATAAE**********************SPVDGLTNELLDNLKIKFDSEDKYSLVLYGTGALLALWLTTVVVGAIDSIPLFPKLMEVVGLGYTLWFSWRYLLFKKNRDELATKIEELKQQVLG****
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHoooooooooooHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiii
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MAYAPVRYKTLGILYYTNPLPKATSEETSSGTDQYVVDKRDGATAAEDVPAVEKNVYNESVATAVPKEESPVDGLTNELLDNLKIKFDSEDKYSLVLYGTGALLALWLTTVVVGAIDSIPLFPKLMEVVGLGYTLWFSWRxxxxxxxxxxxxxxxxxxxxxxxxxxxx
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query168 2.2.26 [Sep-21-2011]
O04616164 Uncharacterized protein A no no 0.595 0.609 0.553 6e-26
Q8LCA1174 Thylakoid membrane phosph no no 0.458 0.442 0.376 2e-11
Q119Z5881 Glutamate--tRNA ligase OS yes no 0.690 0.131 0.311 8e-10
>sp|O04616|Y4115_ARATH Uncharacterized protein At4g01150, chloroplastic OS=Arabidopsis thaliana GN=At4g01150 PE=1 SV=1 Back     alignment and function desciption
 Score =  116 bits (291), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 57/103 (55%), Positives = 82/103 (79%), Gaps = 3/103 (2%)

Query: 64  AVPKEESPVDGLTNELLDNLKIKFDS-EDKYSLVLYGTGALLALWLTTVVVGAIDSIPLF 122
           A  +E S +D  TNEL+ +LK K+D  E+K ++++YG GA++A+WL+++VVGAI+S+PL 
Sbjct: 63  ASSEETSSID--TNELITDLKEKWDGLENKSTVLIYGGGAIVAVWLSSIVVGAINSVPLL 120

Query: 123 PKLMEVVGLGYTLWFSWRYLLFKKNRDELATKIEELKQQVLGS 165
           PK+ME+VGLGYT WF +RYLLFK +R ELA  IE LK+++ GS
Sbjct: 121 PKVMELVGLGYTGWFVYRYLLFKSSRKELAEDIESLKKKIAGS 163





Arabidopsis thaliana (taxid: 3702)
>sp|Q8LCA1|TMP14_ARATH Thylakoid membrane phosphoprotein 14 kDa, chloroplastic OS=Arabidopsis thaliana GN=TMP14 PE=1 SV=2 Back     alignment and function description
>sp|Q119Z5|SYE_TRIEI Glutamate--tRNA ligase OS=Trichodesmium erythraeum (strain IMS101) GN=gltX PE=3 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query168
225469778204 PREDICTED: uncharacterized protein LOC10 0.982 0.808 0.630 3e-54
359495653204 PREDICTED: uncharacterized protein LOC10 0.982 0.808 0.630 7e-54
388506910199 unknown [Lotus japonicus] 0.916 0.773 0.558 2e-39
351723571197 uncharacterized protein LOC100499731 [Gl 0.886 0.756 0.558 2e-38
351727288200 uncharacterized protein LOC100305966 [Gl 0.976 0.82 0.514 1e-36
357485585182 Glutamyl-tRNA synthetase [Medicago trunc 0.922 0.851 0.518 1e-35
4467116153 hypothetical protein [Arabidopsis thalia 0.869 0.954 0.496 2e-32
297798056194 threonine endopeptidase [Arabidopsis lyr 0.851 0.737 0.493 3e-32
449508380210 PREDICTED: uncharacterized protein At4g0 0.958 0.766 0.383 3e-32
18420175193 uncharacterized protein [Arabidopsis tha 0.851 0.740 0.493 6e-32
>gi|225469778|ref|XP_002274430.1| PREDICTED: uncharacterized protein LOC100261101 [Vitis vinifera] gi|297735928|emb|CBI18704.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
 Score =  216 bits (549), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 106/168 (63%), Positives = 129/168 (76%), Gaps = 3/168 (1%)

Query: 4   APVRYKTLGILYYTNPLPKAT-SEETSSGTDQYVVDKRDGATAAEDVPAVEKNVYNESVA 62
           +P+   + G LY+ NPL +A+ SE +SSG DQY+ ++RD     ED+PA E+NVYNE + 
Sbjct: 37  SPISRTSPGSLYFNNPLLRASISEGSSSGADQYIGEERDSVLVMEDIPATEENVYNEVIP 96

Query: 63  TAVPKEESPVDGLT--NELLDNLKIKFDSEDKYSLVLYGTGALLALWLTTVVVGAIDSIP 120
           T  P E+S V+  T   E LDNL IKFDSED YS+ LYGTGAL ALW  + +VGAIDSIP
Sbjct: 97  TEAPIEDSQVEEQTVAFEFLDNLNIKFDSEDPYSIFLYGTGALTALWFASAIVGAIDSIP 156

Query: 121 LFPKLMEVVGLGYTLWFSWRYLLFKKNRDELATKIEELKQQVLGSNDD 168
           +FPKLME+VGLGYTLWFS RYL+FK+NRDELA KIEELKQQVLGS D+
Sbjct: 157 IFPKLMEIVGLGYTLWFSARYLIFKQNRDELAAKIEELKQQVLGSEDE 204




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|359495653|ref|XP_003635047.1| PREDICTED: uncharacterized protein LOC100853187 isoform 1 [Vitis vinifera] gi|359495655|ref|XP_003635048.1| PREDICTED: uncharacterized protein LOC100853187 isoform 2 [Vitis vinifera] gi|297735931|emb|CBI18707.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|388506910|gb|AFK41521.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|351723571|ref|NP_001235236.1| uncharacterized protein LOC100499731 [Glycine max] gi|255626121|gb|ACU13405.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|351727288|ref|NP_001235620.1| uncharacterized protein LOC100305966 [Glycine max] gi|255627137|gb|ACU13913.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|357485585|ref|XP_003613080.1| Glutamyl-tRNA synthetase [Medicago truncatula] gi|355514415|gb|AES96038.1| Glutamyl-tRNA synthetase [Medicago truncatula] gi|388495348|gb|AFK35740.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|4467116|emb|CAB37550.1| hypothetical protein [Arabidopsis thaliana] gi|7270793|emb|CAB80475.1| hypothetical protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|297798056|ref|XP_002866912.1| threonine endopeptidase [Arabidopsis lyrata subsp. lyrata] gi|297312748|gb|EFH43171.1| threonine endopeptidase [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|449508380|ref|XP_004163298.1| PREDICTED: uncharacterized protein At4g01150, chloroplastic-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|18420175|ref|NP_568035.1| uncharacterized protein [Arabidopsis thaliana] gi|21554198|gb|AAM63277.1| unknown [Arabidopsis thaliana] gi|107738205|gb|ABF83661.1| At4g38100 [Arabidopsis thaliana] gi|332661478|gb|AEE86878.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query168
TAIR|locus:2121065193 AT4G38100 [Arabidopsis thalian 0.803 0.699 0.442 4e-26
TAIR|locus:2125018164 AT4G01150 "AT4G01150" [Arabido 0.625 0.640 0.449 5.8e-18
TAIR|locus:2044335174 PSI-P "photosystem I P subunit 0.684 0.660 0.260 1.8e-09
TAIR|locus:2037435156 AT1G52220 "AT1G52220" [Arabido 0.625 0.673 0.261 2.9e-07
TAIR|locus:2121065 AT4G38100 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 295 (108.9 bits), Expect = 4.0e-26, P = 4.0e-26
 Identities = 65/147 (44%), Positives = 92/147 (62%)

Query:    22 KATSEETSSGTDQYVVDKRDGATAAEDVPAVEKNVYNESVATAVPKEESPVDGLTNELLD 81
             +  +EETS+G +++ V+ RDG   A    A EKN  +E    A   E+     L  E L+
Sbjct:    59 ETVTEETSTGVNEFGVEDRDGVVVA----AEEKNSNSE----APQAEDEETQAL--EFLN 108

Query:    82 NLKIKFDSEDKYSLVLYXXXXXXXXXXXXXXXXXIDSIPLFPKLMEVVGLGYTLWFSWRY 141
             ++K+  DS+  YS++LY                 +++IPLFPKLMEVVGLGYTLWF+ RY
Sbjct:   109 DIKL--DSDKTYSILLYGSGAIVALYLTSAIVSSLEAIPLFPKLMEVVGLGYTLWFTTRY 166

Query:   142 LLFKKNRDELATKIEELKQQVLGSNDD 168
             LLFK+NR+EL TK+ E+K+QVLGS+ +
Sbjct:   167 LLFKRNREELKTKVSEIKKQVLGSDSE 193




GO:0009507 "chloroplast" evidence=ISM
GO:0009535 "chloroplast thylakoid membrane" evidence=IDA
TAIR|locus:2125018 AT4G01150 "AT4G01150" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2044335 PSI-P "photosystem I P subunit" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2037435 AT1G52220 "AT1G52220" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
GSVIVG00011469001
SubName- Full=Chromosome undetermined scaffold_310, whole genome shotgun sequence; (204 aa)
(Vitis vinifera)
Predicted Functional Partners:
GSVIVG00027603001
SubName- Full=Chromosome chr19 scaffold_4, whole genome shotgun sequence; (254 aa)
       0.505
GSVIVG00002625001
SubName- Full=Chromosome undetermined scaffold_133, whole genome shotgun sequence; (149 aa)
       0.505
GSVIVG00007397001
SubName- Full=Chromosome chr16 scaffold_189, whole genome shotgun sequence; (251 aa)
       0.504
GSVIVG00004124001
SubName- Full=Chromosome undetermined scaffold_608, whole genome shotgun sequence; (162 aa)
       0.502
GSVIVG00035652001
SubName- Full=Chromosome undetermined scaffold_79, whole genome shotgun sequence; (199 aa)
       0.501
GSVIVG00020301001
SubName- Full=Chromosome chr5 scaffold_2, whole genome shotgun sequence; (293 aa)
       0.501
cemA
RecName- Full=Chloroplast envelope membrane protein;; May be involved in proton extrusion. Indi [...] (226 aa)
       0.500
GSVIVG00017320001
SubName- Full=Chromosome chr6 scaffold_15, whole genome shotgun sequence; (94 aa)
       0.500
GSVIVG00009465001
SubName- Full=Chromosome undetermined scaffold_235, whole genome shotgun sequence; (234 aa)
       0.500
GSVIVG00034898001
SubName- Full=Chromosome chr4 scaffold_73, whole genome shotgun sequence; (285 aa)
       0.499

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query168
pfam1415990 pfam14159, DUF4308, Domain of unknown function (DU 5e-33
PLN02777167 PLN02777, PLN02777, photosystem I P subunit (PSI-P 3e-17
>gnl|CDD|222574 pfam14159, DUF4308, Domain of unknown function (DUF4308) Back     alignment and domain information
 Score =  112 bits (282), Expect = 5e-33
 Identities = 45/81 (55%), Positives = 59/81 (72%), Gaps = 4/81 (4%)

Query: 88  DSEDKYSLVLYGTGA----LLALWLTTVVVGAIDSIPLFPKLMEVVGLGYTLWFSWRYLL 143
             EDKY   L G GA    ++ALWL+  V+ AIDSIPL P L+E+VGLGY+ WF +RYLL
Sbjct: 10  KFEDKYKRPLLGVGAIIAVIVALWLSAAVLDAIDSIPLLPGLLELVGLGYSGWFVYRYLL 69

Query: 144 FKKNRDELATKIEELKQQVLG 164
           F ++R EL  KI+ELK+++LG
Sbjct: 70  FSEDRQELLAKIQELKKEILG 90


This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is approximately 90 amino acids in length. The domain is found in several amino-acyl tRNA synthetase enzymes as well as in isolation in single domain proteins. Length = 90

>gnl|CDD|178376 PLN02777, PLN02777, photosystem I P subunit (PSI-P) Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 168
PLN02777167 photosystem I P subunit (PSI-P) 100.0
PF1415990 CAAD: CAAD domains of cyanobacterial aminoacyl-tRN 99.96
>PLN02777 photosystem I P subunit (PSI-P) Back     alignment and domain information
Probab=100.00  E-value=2.1e-35  Score=239.76  Aligned_cols=92  Identities=35%  Similarity=0.737  Sum_probs=88.9

Q ss_pred             hhHHHHHHhhhhhc-cccchhhHHHHHHHHHHHHHHHHHHHHhhcCCCccchheehhheeeehhhhhhhcccccHHHHHH
Q 030966           75 LTNELLDNLKIKFD-SEDKYSLVLYGTGALLALWLTTVVVGAIDSIPLFPKLMEVVGLGYTLWFSWRYLLFKKNRDELAT  153 (168)
Q Consensus        75 ~~~e~l~~l~~kwd-~edK~~v~~~g~gaiVal~v~~aVl~AId~IPLlp~llELVGlgYs~WFvyRyLL~~~~R~eL~~  153 (168)
                      +..|+++.++++|| .|||++++++++++||++|++.+||+|||+|||+|++||||||||++||+||||+|+++|+||++
T Consensus        75 ~~~ei~k~~~e~Wd~~EdK~av~~l~~aaiVal~v~~~VL~AId~lPLlP~lLELVGigYs~WF~yRyLLfke~ReeL~~  154 (167)
T PLN02777         75 ELPEIVKTVQEAWDKVEDKYAVSSLAFAGVVALWGSAGMISAIDRLPLVPGVLELVGIGYTGWFAYKNLVFKPDREALIE  154 (167)
T ss_pred             cHHHHHHHHHHHHhhhcchhHHHHHHHHHHHHHHHHHHHHHHHhccccccchHHHhhhhhhhhhhhhHhcCcccHHHHHH
Confidence            35699999999999 69999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             HHHHHhhhhcCCC
Q 030966          154 KIEELKQQVLGSN  166 (168)
Q Consensus       154 ki~~lk~~I~G~~  166 (168)
                      +|+++|++|+|++
T Consensus       155 ki~~lk~~IlG~s  167 (167)
T PLN02777        155 KIKDTYKEIIGSS  167 (167)
T ss_pred             HHHHHHHHhhCCC
Confidence            9999999999963



>PF14159 CAAD: CAAD domains of cyanobacterial aminoacyl-tRNA synthetase Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00