Citrus Sinensis ID: 034359


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------
MAASVMASSLSLKPAPFTVEKSAARGLPSLAKTSFKIVAKGGKIKTDKPYGVNGGMDLREGLDASGRKAKYKFLNQQPLNSGELFVYIFGERCLPIC
ccccccccccccccccccEEccccccccccccccEEEEEcccEEEEccccccccccccccccccccccEEEEEEEccccccccEEEEEEcccccccc
ccHHHHHHHHccccccccccccccccccccccccEEEEEcccEEEcccccccccccEccccccccccccEEEEEEccccccccEEEEEEcccccccc
MAASVMAsslslkpapftveksaarglpslakTSFKIVAkggkiktdkpygvnggmdlreglDASGRKAKYKFlnqqplnsgelFVYIFgerclpic
maasvmasslslkpapftVEKSaarglpslaktsfkivakggkiktdkpygvnggMDLREGLDASGRKAKYKflnqqplnsgelFVYIFGERCLPIC
maasvmasslslKPAPFTVEKSAARGLPSLAKTSFKIVAKGGKIKTDKPYGVNGGMDLREGLDASGRKAKYKFLNQQPLNSGELFVYIFGERCLPIC
*********************************SFKIVAKGGKIKT**PYGVNG**************AKYKFLNQQPLNSGELFVYIFGERCLPI*
*************************************************YGVNGGMDLREGLDASGRKAKYKFLNQQPLNSGELFVYIFGERCLPIC
**********SLKPAPFTVEKSAARGLPSLAKTSFKIVAKGGKIKTDKPYGVNGGMDLREGLDASGRKAKYKFLNQQPLNSGELFVYIFGERCLPIC
****************************SLAKTSFKIVAKGGKIKTDKPYGVNGGMDLREGLDASGRKAKYKFLNQQPLNSGELFVYIFGERCLPIC
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MAASVMASSLSLKPAPFTVEKSAARGLPSLAKTSFKIVAKGGKIKTDKPYGVNGGMDLREGLDASGRKAKYKFLNQQPLNSGELFVYIFGERCLPIC
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query97 2.2.26 [Sep-21-2011]
P27202140 Photosystem II 10 kDa pol yes no 0.824 0.571 0.666 1e-22
P49108141 Photosystem II 10 kDa pol N/A no 0.824 0.567 0.647 5e-22
Q40519136 Photosystem II 10 kDa pol N/A no 0.783 0.558 0.563 7e-18
P06183138 Photosystem II 10 kDa pol N/A no 0.804 0.565 0.574 4e-17
Q40163138 Photosystem II 10 kDa pol N/A no 0.804 0.565 0.563 4e-16
P10690140 Photosystem II 10 kDa pol N/A no 0.814 0.564 0.534 2e-13
Q40070138 Photosystem II 10 kDa pol N/A no 0.762 0.536 0.468 2e-06
>sp|P27202|PSBR_ARATH Photosystem II 10 kDa polypeptide, chloroplastic OS=Arabidopsis thaliana GN=PSBR PE=1 SV=1 Back     alignment and function desciption
 Score =  105 bits (261), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 58/87 (66%), Positives = 70/87 (80%), Gaps = 7/87 (8%)

Query: 1  MAASVMASSLSLKPAPFTVEKSAARGLPSL--AKTSFKIVAKG-GKIKTDKPYGVNGGMD 57
          MAASVM SS++LKPA FTVEK+AARGLPSL  A+ SFKIVA G  KIKTDKP+G+NG MD
Sbjct: 1  MAASVMLSSVTLKPAGFTVEKTAARGLPSLTRARPSFKIVASGVKKIKTDKPFGINGSMD 60

Query: 58 LREGLDASGRKAK----YKFLNQQPLN 80
          LR+G+DASGRK K    YK++++   N
Sbjct: 61 LRDGVDASGRKGKGYGVYKYVDKYGAN 87




Associated with the oxygen-evolving complex of photosystem II.
Arabidopsis thaliana (taxid: 3702)
>sp|P49108|PSBR_BRACM Photosystem II 10 kDa polypeptide, chloroplastic OS=Brassica campestris GN=PSBR PE=2 SV=1 Back     alignment and function description
>sp|Q40519|PSBR_TOBAC Photosystem II 10 kDa polypeptide, chloroplastic OS=Nicotiana tabacum GN=PSBR PE=2 SV=1 Back     alignment and function description
>sp|P06183|PSBR_SOLTU Photosystem II 10 kDa polypeptide, chloroplastic OS=Solanum tuberosum GN=PSBR PE=2 SV=1 Back     alignment and function description
>sp|Q40163|PSBR_SOLLC Photosystem II 10 kDa polypeptide, chloroplastic OS=Solanum lycopersicum GN=PSBR PE=2 SV=1 Back     alignment and function description
>sp|P10690|PSBR_SPIOL Photosystem II 10 kDa polypeptide, chloroplastic OS=Spinacia oleracea GN=PSBR PE=2 SV=1 Back     alignment and function description
>sp|Q40070|PSBR_HORVU Photosystem II 10 kDa polypeptide, chloroplastic OS=Hordeum vulgare GN=PSBR PE=3 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query97
351722140139 uncharacterized protein LOC100499745 [Gl 0.814 0.568 0.655 3e-21
15219268140 photosystem II subunit R [Arabidopsis th 0.824 0.571 0.666 4e-21
351722402138 uncharacterized protein LOC100305752 [Gl 0.814 0.572 0.639 5e-21
297839767133 hypothetical protein ARALYDRAFT_477073 [ 0.824 0.601 0.666 6e-21
118489740136 unknown [Populus trichocarpa x Populus d 0.814 0.580 0.642 2e-20
1346871141 RecName: Full=Photosystem II 10 kDa poly 0.824 0.567 0.647 2e-20
119720780140 photosystem II 10kDa polytpeptide [Brass 0.824 0.571 0.655 3e-20
193872616146 ST244 [Eutrema halophilum] 0.824 0.547 0.647 4e-20
22720441977 AT1G79040 [Arabidopsis thaliana] 0.731 0.922 0.743 5e-20
869889182 polypeptide precursor of photosystem II 0.773 0.914 0.646 6e-20
>gi|351722140|ref|NP_001235442.1| uncharacterized protein LOC100499745 [Glycine max] gi|255626243|gb|ACU13466.1| unknown [Glycine max] Back     alignment and taxonomy information
 Score =  105 bits (262), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 57/87 (65%), Positives = 72/87 (82%), Gaps = 8/87 (9%)

Query: 1  MAASVMASSLSLKPAPFTVEKSAARGLPSLAK--TSFKIVAKGGK-IKTDKPYGVNGGMD 57
          MA+SVMAS ++LKPAPFTVEKS+ RGLPSL++  +SF++VA GGK IKTDKPYG+NGGM 
Sbjct: 1  MASSVMAS-VALKPAPFTVEKSSVRGLPSLSRNSSSFRVVASGGKKIKTDKPYGINGGMA 59

Query: 58 LREGLDASGRKAK----YKFLNQQPLN 80
          LREG+DASGRK K    Y+F+++   N
Sbjct: 60 LREGVDASGRKGKGKGVYQFVDKYGAN 86




Source: Glycine max

Species: Glycine max

Genus: Glycine

Family: Fabaceae

Order: Fabales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|15219268|ref|NP_178025.1| photosystem II subunit R [Arabidopsis thaliana] gi|131398|sp|P27202.1|PSBR_ARATH RecName: Full=Photosystem II 10 kDa polypeptide, chloroplastic; Flags: Precursor gi|16447|emb|CAA39441.1| photosystem II 10 kDa polypeptide [Arabidopsis thaliana] gi|3152571|gb|AAC17052.1| Match to photosystem II 10kDa polypeptide gb|X55970. ESTs gb|Z17693, gb|N37616, gb|T41858, gb|T88021, gb|R37531, gb|T04679, gb|N37520, gb|N64965, gb|Z17592 and gb|N65338, gb|N37466 and gb|T45400 come from this gene [Arabidopsis thaliana] gi|17979203|gb|AAL49840.1| putative photosystem II polypeptide protein [Arabidopsis thaliana] gi|20465469|gb|AAM20194.1| putative photosystem II polypeptide [Arabidopsis thaliana] gi|21536748|gb|AAM61080.1| photosystem II polypeptide, putative [Arabidopsis thaliana] gi|110740828|dbj|BAE98511.1| PSII-R protein [Arabidopsis thaliana] gi|332198075|gb|AEE36196.1| photosystem II subunit R [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|351722402|ref|NP_001236219.1| uncharacterized protein LOC100305752 [Glycine max] gi|255626523|gb|ACU13606.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|297839767|ref|XP_002887765.1| hypothetical protein ARALYDRAFT_477073 [Arabidopsis lyrata subsp. lyrata] gi|297333606|gb|EFH64024.1| hypothetical protein ARALYDRAFT_477073 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|118489740|gb|ABK96671.1| unknown [Populus trichocarpa x Populus deltoides] Back     alignment and taxonomy information
>gi|1346871|sp|P49108.1|PSBR_BRACM RecName: Full=Photosystem II 10 kDa polypeptide, chloroplastic; Flags: Precursor gi|967968|gb|AAA74957.1| photosystem II 10kDa polypeptide [Brassica rapa subsp. campestris] Back     alignment and taxonomy information
>gi|119720780|gb|ABL97960.1| photosystem II 10kDa polytpeptide [Brassica rapa] Back     alignment and taxonomy information
>gi|193872616|gb|ACF23036.1| ST244 [Eutrema halophilum] Back     alignment and taxonomy information
>gi|227204419|dbj|BAH57061.1| AT1G79040 [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|8698891|gb|AAF78511.1|AF195209_1 polypeptide precursor of photosystem II [Pyrus pyrifolia] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query97
TAIR|locus:2207320140 PSBR "photosystem II subunit R 0.701 0.485 0.653 1.4e-18
TAIR|locus:2207320 PSBR "photosystem II subunit R" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 224 (83.9 bits), Expect = 1.4e-18, P = 1.4e-18
 Identities = 49/75 (65%), Positives = 59/75 (78%)

Query:    13 KPAPFTVEKSAARGLPSL--AKTSFKIVAKG-GKIKTDKPYGVNGGMDLREGLDASGRKA 69
             KPA FTVEK+AARGLPSL  A+ SFKIVA G  KIKTDKP+G+NG MDLR+G+DASGRK 
Sbjct:    13 KPAGFTVEKTAARGLPSLTRARPSFKIVASGVKKIKTDKPFGINGSMDLRDGVDASGRKG 72

Query:    70 K----YKFLNQQPLN 80
             K    YK++++   N
Sbjct:    73 KGYGVYKYVDKYGAN 87


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.318   0.139   0.412    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0       97        85   0.00091  102 3  11 22  0.48    29
                                                     29  0.46    30


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  474 (50 KB)
  Total size of DFA:  86 KB (2066 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  9.88u 0.10s 9.98t   Elapsed:  00:00:01
  Total cpu time:  9.88u 0.10s 9.98t   Elapsed:  00:00:01
  Start:  Fri May 10 22:08:29 2013   End:  Fri May 10 22:08:30 2013


GO:0009507 "chloroplast" evidence=ISM;IDA
GO:0009523 "photosystem II" evidence=IEA;ISS
GO:0009654 "oxygen evolving complex" evidence=IEA
GO:0015979 "photosynthesis" evidence=IEA;ISS
GO:0042651 "thylakoid membrane" evidence=IEA
GO:0009535 "chloroplast thylakoid membrane" evidence=IDA
GO:0010270 "photosystem II oxygen evolving complex assembly" evidence=IEP
GO:0009579 "thylakoid" evidence=IDA
GO:0009534 "chloroplast thylakoid" evidence=IDA
GO:0005515 "protein binding" evidence=IPI
GO:0009637 "response to blue light" evidence=RCA
GO:0009644 "response to high light intensity" evidence=RCA
GO:0009744 "response to sucrose stimulus" evidence=RCA
GO:0010114 "response to red light" evidence=RCA
GO:0010155 "regulation of proton transport" evidence=RCA
GO:0010218 "response to far red light" evidence=RCA
GO:0019344 "cysteine biosynthetic process" evidence=RCA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query97
PLN00053117 PLN00053, PLN00053, photosystem II subunit R; Prov 1e-16
pfam0472599 pfam04725, PsbR, Photosystem II 10 kDa polypeptide 8e-11
>gnl|CDD|215041 PLN00053, PLN00053, photosystem II subunit R; Provisional Back     alignment and domain information
 Score = 68.6 bits (168), Expect = 1e-16
 Identities = 34/60 (56%), Positives = 47/60 (78%), Gaps = 7/60 (11%)

Query: 24 ARGLPSLAKT--SFKIVAKGG-KIKTDKPYGVNGGMDLREGLDASGRKAK----YKFLNQ 76
           RGLP L++T  SFK+ A GG KIKTD+PYG +GGM+L++G+DASGRK K    Y+F+++
Sbjct: 1  VRGLPPLSRTARSFKVTASGGKKIKTDQPYGPSGGMNLKDGVDASGRKGKGKGVYQFVDK 60


Length = 117

>gnl|CDD|147071 pfam04725, PsbR, Photosystem II 10 kDa polypeptide PsbR Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 97
PLN00053117 photosystem II subunit R; Provisional 100.0
PF0472599 PsbR: Photosystem II 10 kDa polypeptide PsbR; Inte 99.91
PLN00083101 photosystem II subunit R; Provisional 98.75
>PLN00053 photosystem II subunit R; Provisional Back     alignment and domain information
Probab=100.00  E-value=7.4e-34  Score=204.53  Aligned_cols=62  Identities=55%  Similarity=0.853  Sum_probs=59.9

Q ss_pred             ccCCCCCcc--ceeEEEEcCe-eeeeecccccCCCcccCCCcCcCCCcccceeeEecccccCcee
Q 034359           24 ARGLPSLAK--TSFKIVAKGG-KIKTDKPYGVNGGMDLREGLDASGRKAKYKFLNQQPLNSGELF   85 (97)
Q Consensus        24 vrgLpslaR--ssf~VvAsg~-KIkt~kP~G~~ggm~~k~gvDAsGRk~KGkGVYqf~dKYGan~   85 (97)
                      +||||+++|  ++|+|+||++ ||||++|||++|+|++|+||||||||+||||||||+||||||.
T Consensus         1 ~~glp~l~r~~ss~~v~as~~kkikt~~p~G~~G~m~~k~gvDasGRk~kGkGVYqFvdKYGANV   65 (117)
T PLN00053          1 VRGLPPLSRTARSFKVTASGGKKIKTDQPYGPSGGMNLKDGVDASGRKGKGKGVYQFVDKYGANV   65 (117)
T ss_pred             CCccCcccccccceEEEecCCceeeecCCcccCCCcccccccCCCCccCCCcceEEehhhcCccc
Confidence            589999999  6899999988 9999999999999999999999999999999999999999996



>PF04725 PsbR: Photosystem II 10 kDa polypeptide PsbR; InterPro: IPR006814 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae Back     alignment and domain information
>PLN00083 photosystem II subunit R; Provisional Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00