Citrus Sinensis ID: 029001


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200
MELCTAIQTQSAISNHHHPFLTATTAPALARTKSALSLKQTTVSRSGYGLRSRILYYTNPLPKATSEETSSGTDQYVVDKRDGATAAEDVPAVEKNVYNESVATAVPKEESPVDGLTNELLDNLKIKFDSEDKYSLVLYGTGALLALWLTTVVFPKLMEVVGLGYTLWFSWRYLLFKKNRDELATKIEELKQQVLGSNDD
cccHHHHHHHHHHccccccccccccHHHHHHccccccccccccccccccEEEEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHcccccHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHHcccccHHHHHHHHHHHHHHHcccccc
ccEEEcccccHHHHcccccccccccccccccccccccccccccccccHHHHHEEEEccccccccccccccccccEEEEEcccccEEcccccccHcccccccccccccccccccHHHHHHHHHHHHHHcccccccEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHcccccc
melctaiqtqsaisnhhhpfltattapaLARTKSALSlkqttvsrsgyglrsrilyytnplpkatseetssgtdqyvvdkrdgataaedvpaveknvynesvatavpkeespvdgltnELLDNLkikfdsedkysLVLYGTGALLALWLTTVVFPKLMEVVGLGYTLWFSWRYLLFKKNRDELATKIEELKQQVLGSNDD
MELCTAIQTqsaisnhhhpFLTATTAPALARTksalslkqttvsrsgyglrsrILYYTnplpkatseetssgtdqyVVDKRDGAtaaedvpaveknvynesvatavpkeespvdglTNELLDNLKIKFDSEDKYSLVLYGTGALLALWLTTVVFPKLMEVVGLGYTLWFSWRYLLFKKNRDELATKIEelkqqvlgsndd
MELCTAIQTQSAISNHHHPFltattapalartKSALSLKQTTVSRSGYGLRSRILYYTNPLPKATSEETSSGTDQYVVDKRDGATAAEDVPAVEKNVYNESVATAVPKEESPVDGLTNELLDNLKIKFDSEDKYSLVlygtgallalwlttVVFPKLMEVVGLGYTLWFSWRYLLFKKNRDELATKIEELKQQVLGSNDD
******************PFLTATT****************TVSRSGYGLRSRILYYTN********************************************************LTNELLDNLKIKFDSEDKYSLVLYGTGALLALWLTTVVFPKLMEVVGLGYTLWFSWRYLLFKKNRDEL*****************
*********Q****NH******************************GYGLRSRILY************************************************************TNELLDNLKIKFDSEDKYSLVLYGTGALLALWLTTVVFPKLMEVVGLGYTLWFSWRYLLFKKNRDELATKIEELKQ*VLG****
********TQSAISNHHHPFLTATTAPALARTKSALSLKQTTVSRSGYGLRSRILYYTNPLPK**********DQYVVDKRDGATAAEDVPAVEKNVYNESVATAVPKEESPVDGLTNELLDNLKIKFDSEDKYSLVLYGTGALLALWLTTVVFPKLMEVVGLGYTLWFSWRYLLFKKNRDELATKIEELKQ********
*ELCTAIQTQSAISNHHHPFLTATT********SALSLKQTTVSRSGYGLRSRILYYTNPLPK**********DQYVVDKRDGATAAED**********************PVDGLTNELLDNLKIKFDSEDKYSLVLYGTGALLALWLTTVVFPKLMEVVGLGYTLWFSWRYLLFKKNRDELATKIEELKQQVL*****
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHoooooHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiii
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MELCTAIQTQSAISNHHHPFLTATTAPALARTKSALSLKQTTVSRSGYGLRSRILYYTNPLPKATSEETSSGTDQYVVDKRDGATAAEDVPAVEKNVYNESVATAVPKEESPVDGLTNELLDNLKIKFDSEDKYSLVLYGTGALLALWLTTVVFPKLMEVVGLGYTLWFSWRxxxxxxxxxxxxxxxxxxxxxxxxxxxx
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query200 2.2.26 [Sep-21-2011]
O04616164 Uncharacterized protein A no no 0.44 0.536 0.49 2e-19
Q8LCA1174 Thylakoid membrane phosph no no 0.34 0.390 0.311 5e-06
Q119Z5881 Glutamate--tRNA ligase OS yes no 0.475 0.107 0.272 2e-05
>sp|O04616|Y4115_ARATH Uncharacterized protein At4g01150, chloroplastic OS=Arabidopsis thaliana GN=At4g01150 PE=1 SV=1 Back     alignment and function desciption
 Score = 95.5 bits (236), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 49/100 (49%), Positives = 72/100 (72%), Gaps = 12/100 (12%)

Query: 108 KEESPVDGLTNELLDNLKIKFDS-EDKYSLVLYGTGALLALWLTTVV---------FPKL 157
           +E S +D  TNEL+ +LK K+D  E+K ++++YG GA++A+WL+++V          PK+
Sbjct: 66  EETSSID--TNELITDLKEKWDGLENKSTVLIYGGGAIVAVWLSSIVVGAINSVPLLPKV 123

Query: 158 MEVVGLGYTLWFSWRYLLFKKNRDELATKIEELKQQVLGS 197
           ME+VGLGYT WF +RYLLFK +R ELA  IE LK+++ GS
Sbjct: 124 MELVGLGYTGWFVYRYLLFKSSRKELAEDIESLKKKIAGS 163





Arabidopsis thaliana (taxid: 3702)
>sp|Q8LCA1|TMP14_ARATH Thylakoid membrane phosphoprotein 14 kDa, chloroplastic OS=Arabidopsis thaliana GN=TMP14 PE=1 SV=2 Back     alignment and function description
>sp|Q119Z5|SYE_TRIEI Glutamate--tRNA ligase OS=Trichodesmium erythraeum (strain IMS101) GN=gltX PE=3 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query200
225469778204 PREDICTED: uncharacterized protein LOC10 0.96 0.941 0.537 2e-51
359495653204 PREDICTED: uncharacterized protein LOC10 0.96 0.941 0.537 4e-51
388506910199 unknown [Lotus japonicus] 0.835 0.839 0.482 2e-34
351723571197 uncharacterized protein LOC100499731 [Gl 0.79 0.802 0.485 2e-32
297798056194 threonine endopeptidase [Arabidopsis lyr 0.905 0.932 0.408 4e-30
351727288200 uncharacterized protein LOC100305966 [Gl 0.715 0.715 0.502 6e-30
18420175193 uncharacterized protein [Arabidopsis tha 0.9 0.932 0.417 2e-29
357485585182 Glutamyl-tRNA synthetase [Medicago trunc 0.745 0.818 0.477 2e-29
4467116153 hypothetical protein [Arabidopsis thalia 0.63 0.823 0.469 8e-26
449508380210 PREDICTED: uncharacterized protein At4g0 0.715 0.680 0.343 2e-23
>gi|225469778|ref|XP_002274430.1| PREDICTED: uncharacterized protein LOC100261101 [Vitis vinifera] gi|297735928|emb|CBI18704.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
 Score =  207 bits (527), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 114/212 (53%), Positives = 138/212 (65%), Gaps = 20/212 (9%)

Query: 1   MELCTAIQTQSAISNHHHPFLTATTAPALARTKSALSLKQTTVSRSGYGLRSRILYYTNP 60
           MELCT      A SN HH  L    A  L     ++  KQ+ +SR+  G     LY+ NP
Sbjct: 1   MELCTT----RAFSNLHHRTLFNPLANRLRWKTISIPFKQSPISRTSPG----SLYFNNP 52

Query: 61  LPKAT-SEETSSGTDQYVVDKRDGATAAEDVPAVEKNVYNESVATAVPKEESPVDGLT-- 117
           L +A+ SE +SSG DQY+ ++RD     ED+PA E+NVYNE + T  P E+S V+  T  
Sbjct: 53  LLRASISEGSSSGADQYIGEERDSVLVMEDIPATEENVYNEVIPTEAPIEDSQVEEQTVA 112

Query: 118 NELLDNLKIKFDSEDKYSLVLYGTGALLALWLTTVV---------FPKLMEVVGLGYTLW 168
            E LDNL IKFDSED YS+ LYGTGAL ALW  + +         FPKLME+VGLGYTLW
Sbjct: 113 FEFLDNLNIKFDSEDPYSIFLYGTGALTALWFASAIVGAIDSIPIFPKLMEIVGLGYTLW 172

Query: 169 FSWRYLLFKKNRDELATKIEELKQQVLGSNDD 200
           FS RYL+FK+NRDELA KIEELKQQVLGS D+
Sbjct: 173 FSARYLIFKQNRDELAAKIEELKQQVLGSEDE 204




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|359495653|ref|XP_003635047.1| PREDICTED: uncharacterized protein LOC100853187 isoform 1 [Vitis vinifera] gi|359495655|ref|XP_003635048.1| PREDICTED: uncharacterized protein LOC100853187 isoform 2 [Vitis vinifera] gi|297735931|emb|CBI18707.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|388506910|gb|AFK41521.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|351723571|ref|NP_001235236.1| uncharacterized protein LOC100499731 [Glycine max] gi|255626121|gb|ACU13405.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|297798056|ref|XP_002866912.1| threonine endopeptidase [Arabidopsis lyrata subsp. lyrata] gi|297312748|gb|EFH43171.1| threonine endopeptidase [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|351727288|ref|NP_001235620.1| uncharacterized protein LOC100305966 [Glycine max] gi|255627137|gb|ACU13913.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|18420175|ref|NP_568035.1| uncharacterized protein [Arabidopsis thaliana] gi|21554198|gb|AAM63277.1| unknown [Arabidopsis thaliana] gi|107738205|gb|ABF83661.1| At4g38100 [Arabidopsis thaliana] gi|332661478|gb|AEE86878.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|357485585|ref|XP_003613080.1| Glutamyl-tRNA synthetase [Medicago truncatula] gi|355514415|gb|AES96038.1| Glutamyl-tRNA synthetase [Medicago truncatula] gi|388495348|gb|AFK35740.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|4467116|emb|CAB37550.1| hypothetical protein [Arabidopsis thaliana] gi|7270793|emb|CAB80475.1| hypothetical protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|449508380|ref|XP_004163298.1| PREDICTED: uncharacterized protein At4g01150, chloroplastic-like [Cucumis sativus] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query200
TAIR|locus:2121065193 AT4G38100 [Arabidopsis thalian 0.915 0.948 0.361 1e-22
TAIR|locus:2125018164 AT4G01150 "AT4G01150" [Arabido 0.48 0.585 0.412 7.1e-13
TAIR|locus:2044335174 PSI-P "photosystem I P subunit 0.22 0.252 0.409 5.8e-05
TAIR|locus:2037435156 AT1G52220 "AT1G52220" [Arabido 0.19 0.243 0.421 0.00081
TAIR|locus:2121065 AT4G38100 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 263 (97.6 bits), Expect = 1.0e-22, P = 1.0e-22
 Identities = 76/210 (36%), Positives = 111/210 (52%)

Query:     1 MELCTAIQTQSAISNHHHPFXXXXXXXXXXXXKSALSLKQTTVSRSGYGLRSRILYYTNP 60
             MELCT  ++ + I++    F            + +L L++   S     L+SR L  +  
Sbjct:     1 MELCT--RSTTIITHLPASFNGHGYLAGKSVDRISLPLQRNVASLV---LQSRTLRCSRK 55

Query:    61 LPKAT-SEETSSGTDQYVVDKRDGATAAEDVPAVEKNVYNESVATAVPKEESPVDGLTNE 119
              P  T +EETS+G +++ V+ RDG   A    A EKN  +E    A   E+     L  E
Sbjct:    56 FPGETVTEETSTGVNEFGVEDRDGVVVA----AEEKNSNSE----APQAEDEETQAL--E 105

Query:   120 LLDNLKIKFDSEDKYSLVXXXXXXXXXXXXXXVV---------FPKLMEVVGLGYTLWFS 170
              L+++K+  DS+  YS++               +         FPKLMEVVGLGYTLWF+
Sbjct:   106 FLNDIKL--DSDKTYSILLYGSGAIVALYLTSAIVSSLEAIPLFPKLMEVVGLGYTLWFT 163

Query:   171 WRYLLFKKNRDELATKIEELKQQVLGSNDD 200
              RYLLFK+NR+EL TK+ E+K+QVLGS+ +
Sbjct:   164 TRYLLFKRNREELKTKVSEIKKQVLGSDSE 193




GO:0009507 "chloroplast" evidence=ISM
GO:0009535 "chloroplast thylakoid membrane" evidence=IDA
TAIR|locus:2125018 AT4G01150 "AT4G01150" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2044335 PSI-P "photosystem I P subunit" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2037435 AT1G52220 "AT1G52220" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query200
pfam1415990 pfam14159, DUF4308, Domain of unknown function (DU 2e-24
PLN02777167 PLN02777, PLN02777, photosystem I P subunit (PSI-P 7e-10
>gnl|CDD|222574 pfam14159, DUF4308, Domain of unknown function (DUF4308) Back     alignment and domain information
 Score = 91.5 bits (228), Expect = 2e-24
 Identities = 38/81 (46%), Positives = 51/81 (62%), Gaps = 13/81 (16%)

Query: 129 DSEDKYSLVLYGTGA----LLALWLTTVV---------FPKLMEVVGLGYTLWFSWRYLL 175
             EDKY   L G GA    ++ALWL+  V          P L+E+VGLGY+ WF +RYLL
Sbjct: 10  KFEDKYKRPLLGVGAIIAVIVALWLSAAVLDAIDSIPLLPGLLELVGLGYSGWFVYRYLL 69

Query: 176 FKKNRDELATKIEELKQQVLG 196
           F ++R EL  KI+ELK+++LG
Sbjct: 70  FSEDRQELLAKIQELKKEILG 90


This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is approximately 90 amino acids in length. The domain is found in several amino-acyl tRNA synthetase enzymes as well as in isolation in single domain proteins. Length = 90

>gnl|CDD|178376 PLN02777, PLN02777, photosystem I P subunit (PSI-P) Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 200
PLN02777167 photosystem I P subunit (PSI-P) 99.96
PF1415990 CAAD: CAAD domains of cyanobacterial aminoacyl-tRN 99.91
>PLN02777 photosystem I P subunit (PSI-P) Back     alignment and domain information
Probab=99.96  E-value=1.7e-30  Score=216.33  Aligned_cols=82  Identities=33%  Similarity=0.728  Sum_probs=79.3

Q ss_pred             hHHHHHHhhhhhc-cccchhhhhHHHHHHHHHHhhh---------cccchhhhhhhhhhhhhhHhhhhccccchHHHHHH
Q 029001          117 TNELLDNLKIKFD-SEDKYSLVLYGTGALLALWLTT---------VVFPKLMEVVGLGYTLWFSWRYLLFKKNRDELATK  186 (200)
Q Consensus       117 ~~Evl~~Lk~kwD-seDK~avv~~G~gAlVAlWlss---------PLLp~lLELVGLgYT~WFvyRyLLfke~RqEL~~k  186 (200)
                      .+|+++.+|++|| .|||++++++++||+|++|++.         ||+|++||||||||++||+||||+|+++|+||+++
T Consensus        76 ~~ei~k~~~e~Wd~~EdK~av~~l~~aaiVal~v~~~VL~AId~lPLlP~lLELVGigYs~WF~yRyLLfke~ReeL~~k  155 (167)
T PLN02777         76 LPEIVKTVQEAWDKVEDKYAVSSLAFAGVVALWGSAGMISAIDRLPLVPGVLELVGIGYTGWFAYKNLVFKPDREALIEK  155 (167)
T ss_pred             HHHHHHHHHHHHhhhcchhHHHHHHHHHHHHHHHHHHHHHHHhccccccchHHHhhhhhhhhhhhhHhcCcccHHHHHHH
Confidence            5699999999999 7999999999999999999987         99999999999999999999999999999999999


Q ss_pred             HHHHhhhhcCCC
Q 029001          187 IEELKQQVLGSN  198 (200)
Q Consensus       187 I~~lk~~I~G~~  198 (200)
                      |+++|++|+|++
T Consensus       156 i~~lk~~IlG~s  167 (167)
T PLN02777        156 IKDTYKEIIGSS  167 (167)
T ss_pred             HHHHHHHhhCCC
Confidence            999999999963



>PF14159 CAAD: CAAD domains of cyanobacterial aminoacyl-tRNA synthetase Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00