Citrus Sinensis ID: 032153


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140------
MAAAAFASSSPVVGLGSSSLSSPKRTALGSSFVKSPVAARNPLRVAGACGGKVTCFERDWLRTDLNVIGFGLIGWLAPSSIPAINGKSLTGLFFESIGTELAHFPTPPPLTSQFWLWLVTWHFGLFLCLTFGQIGFKGRTEDYFSK
cccccccccccEEccccccccccccccccccccccccccccHHHHcccccccEEEEcccHHcccccHHHHHHHHccccccccccccccHHHHHHHHHHHHHcccccccccccHHHHHHHHHHHHHHHHHHHHHccccccccccccc
ccHHHHHccccEcccccccccccccccccccccccccccccHHHHHcccccEEEEcccHHHHccccEEEEEHHHccccccccccccccHHHHHHHHHHHHHcccccccccccHHHHHHHHHHHHHHHHHHHcHccccccccccccc
maaaafassspvvglgssslsspkrtalgssfvkspvaarnplrvagacggkvtcferDWLRTDLNVIGFGligwlapssipaingksltgLFFESIgtelahfptpppltsqFWLWLVTWHFGLFLCltfgqigfkgrtedyfsk
maaaafassspvvglgssslsspkRTALGssfvkspvaarnplrvagacggkVTCFERDWLRTDLNVIGFGLIGWLAPSSIPAINGKSLTGLFFESIGTELAHFPTPPPLTSQFWLWLVTWHFGLFLCLTFGQIGFKGRTEDYFSK
MaaaafassspvvglgssslsspKRTALGSSFVKSPVAARNPLRVAGACGGKVTCFERDWLRTDLNVIGFGLIGWLAPSSIPAINGKSLTGLFFESIGTELAHFPTPPPLTSQFWLWLVTWHFGLFLCLTFGQIGFKGRTEDYFSK
*************************************AARNPLRVAGACGGKVTCFERDWLRTDLNVIGFGLIGWLAPSSIPAINGKSLTGLFFESIGTELAHFPTPPPLTSQFWLWLVTWHFGLFLCLTFGQIGFKGR*******
*************GLGS*********************************GKVTCFERDWLRTDLNVIGFGLIGWLAPSSIPAINGKSLTGLFFESIGTELAHFPTPPPLTSQFWLWLVTWHFGLFLCLTFGQIGFKGR*******
***************************LGSSFVKSPVAARNPLRVAGACGGKVTCFERDWLRTDLNVIGFGLIGWLAPSSIPAINGKSLTGLFFESIGTELAHFPTPPPLTSQFWLWLVTWHFGLFLCLTFGQIGFKGRTEDYFSK
***********************************P*AARNPLRVAGACGGKVTCFERDWLRTDLNVIGFGLIGWLAPSSIPAINGKSLTGLFFESIGTELAHFPTPPPLTSQFWLWLVTWHFGLFLCLTFGQIGFKGRTE*****
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHoooooooooooo
SSSSSSSSSSSSSSSSSSSSSooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MAAAAFASSSPVVGLGSSSLSSPKRTALGSSFVKSPVAARNPLRVAGACGGKVTCFERDWLRTDLNVIGFGLIGWLAPSSIPAINGKSLTGLFFESIGTELAHFPTPPPLTSQFWLWLVTWHFGLFLCLTFGQIGFKGRTEDYFSK
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query146
255565725143 conserved hypothetical protein [Ricinus 0.972 0.993 0.830 6e-62
224104879149 predicted protein [Populus trichocarpa] 0.972 0.953 0.810 2e-57
18390888140 photosystem I subunit O [Arabidopsis tha 0.952 0.992 0.781 2e-57
22531014140 expressed protein [Arabidopsis thaliana] 0.952 0.992 0.774 7e-57
312281545142 unnamed protein product [Thellungiella h 0.965 0.992 0.777 2e-56
297849148140 hypothetical protein ARALYDRAFT_470899 [ 0.952 0.992 0.767 2e-56
224131784151 predicted protein [Populus trichocarpa] 0.986 0.953 0.8 2e-55
147858299144 hypothetical protein VITISV_019056 [Viti 0.945 0.958 0.797 1e-54
225464874148 PREDICTED: uncharacterized protein LOC10 0.972 0.959 0.775 2e-54
449443468146 PREDICTED: uncharacterized protein LOC10 0.979 0.979 0.787 3e-54
>gi|255565725|ref|XP_002523852.1| conserved hypothetical protein [Ricinus communis] gi|223536940|gb|EEF38578.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
 Score =  241 bits (615), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 118/142 (83%), Positives = 127/142 (89%)

Query: 3   AAAFASSSPVVGLGSSSLSSPKRTALGSSFVKSPVAARNPLRVAGACGGKVTCFERDWLR 62
           AA FA+SS V+GL SSSLSSPK+  LGS FVKSPV ARNPLR  GA GGK  CFER+WLR
Sbjct: 2   AATFATSSTVIGLCSSSLSSPKKICLGSGFVKSPVPARNPLRQTGANGGKFVCFERNWLR 61

Query: 63  TDLNVIGFGLIGWLAPSSIPAINGKSLTGLFFESIGTELAHFPTPPPLTSQFWLWLVTWH 122
            DLNVIGFGLIGWLAPS+IPAI+GKSLTGLFFESIG+ELAHFP+PP LTSQFWLWL+TWH
Sbjct: 62  KDLNVIGFGLIGWLAPSTIPAIDGKSLTGLFFESIGSELAHFPSPPALTSQFWLWLITWH 121

Query: 123 FGLFLCLTFGQIGFKGRTEDYF 144
            GLFLCLTFGQIGFKGRTEDYF
Sbjct: 122 LGLFLCLTFGQIGFKGRTEDYF 143




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224104879|ref|XP_002313603.1| predicted protein [Populus trichocarpa] gi|222850011|gb|EEE87558.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|18390888|ref|NP_563815.1| photosystem I subunit O [Arabidopsis thaliana] gi|15293053|gb|AAK93637.1| unknown protein [Arabidopsis thaliana] gi|20259137|gb|AAM14284.1| unknown protein [Arabidopsis thaliana] gi|21592969|gb|AAM64918.1| putative 16kDa membrane protein [Arabidopsis thaliana] gi|21702253|emb|CAD37939.1| photosystem I subunit O [Arabidopsis thaliana] gi|332190162|gb|AEE28283.1| photosystem I subunit O [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|22531014|gb|AAM97011.1| expressed protein [Arabidopsis thaliana] gi|23197966|gb|AAN15510.1| expressed protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|312281545|dbj|BAJ33638.1| unnamed protein product [Thellungiella halophila] Back     alignment and taxonomy information
>gi|297849148|ref|XP_002892455.1| hypothetical protein ARALYDRAFT_470899 [Arabidopsis lyrata subsp. lyrata] gi|297338297|gb|EFH68714.1| hypothetical protein ARALYDRAFT_470899 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|224131784|ref|XP_002328107.1| predicted protein [Populus trichocarpa] gi|222837622|gb|EEE75987.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|147858299|emb|CAN83522.1| hypothetical protein VITISV_019056 [Vitis vinifera] Back     alignment and taxonomy information
>gi|225464874|ref|XP_002273037.1| PREDICTED: uncharacterized protein LOC100244962 isoform 1 [Vitis vinifera] gi|359479489|ref|XP_003632280.1| PREDICTED: uncharacterized protein LOC100244962 isoform 2 [Vitis vinifera] gi|296084873|emb|CBI28282.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|449443468|ref|XP_004139499.1| PREDICTED: uncharacterized protein LOC101209111 [Cucumis sativus] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query146
TAIR|locus:2201836140 PSAO "AT1G08380" [Arabidopsis 0.787 0.821 0.869 2.6e-54
TAIR|locus:2201836 PSAO "AT1G08380" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 561 (202.5 bits), Expect = 2.6e-54, P = 2.6e-54
 Identities = 100/115 (86%), Positives = 109/115 (94%)

Query:    30 SSFVKSPVAARNPLRVAGACGGKVTCFERDWLRTDLNVIGFGLIGWLAPSSIPAINGKSL 89
             SSF+K  ++A+NPLR+AGA GG+VTCFER+WLR DLNV+GFGLIGWLAPSSIPAINGKSL
Sbjct:    26 SSFLKPTLSAKNPLRLAGASGGRVTCFERNWLRRDLNVVGFGLIGWLAPSSIPAINGKSL 85

Query:    90 TGLFFESIGTELAHFPTPPPLTSQFWLWLVTWHFGLFLCLTFGQIGFKGRTEDYF 144
             TGLFF+SIGTELAHFPTPP LTSQFWLWLVTWH GLFLCLTFGQIGFKGRTEDYF
Sbjct:    86 TGLFFDSIGTELAHFPTPPALTSQFWLWLVTWHLGLFLCLTFGQIGFKGRTEDYF 140


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.326   0.142   0.481    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      146       124   0.00091  102 3  11 22  0.47    30
                                                     29  0.39    33


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  602 (64 KB)
  Total size of DFA:  146 KB (2088 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  13.47u 0.12s 13.59t   Elapsed:  00:00:01
  Total cpu time:  13.47u 0.12s 13.59t   Elapsed:  00:00:01
  Start:  Fri May 10 18:03:14 2013   End:  Fri May 10 18:03:15 2013


GO:0003674 "molecular_function" evidence=ND
GO:0005739 "mitochondrion" evidence=ISM
GO:0009522 "photosystem I" evidence=IDA
GO:0009579 "thylakoid" evidence=IDA
GO:0009768 "photosynthesis, light harvesting in photosystem I" evidence=IC
GO:0009507 "chloroplast" evidence=IDA
GO:0006098 "pentose-phosphate shunt" evidence=RCA
GO:0006364 "rRNA processing" evidence=RCA
GO:0009657 "plastid organization" evidence=RCA
GO:0010207 "photosystem II assembly" evidence=RCA
GO:0019344 "cysteine biosynthetic process" evidence=RCA
GO:0019684 "photosynthesis, light reaction" evidence=RCA
GO:0030003 "cellular cation homeostasis" evidence=RCA
GO:0035304 "regulation of protein dephosphorylation" evidence=RCA
GO:0070838 "divalent metal ion transport" evidence=RCA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
grail3.0001007701
photosystem I reaction center protein PsaO (150 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query146
PLN00046141 PLN00046, PLN00046, photosystem I reaction center 4e-72
TIGR0305982 TIGR03059, psaOeuk, photosystem I protein PsaO 1e-34
>gnl|CDD|215040 PLN00046, PLN00046, photosystem I reaction center subunit O; Provisional Back     alignment and domain information
 Score =  212 bits (541), Expect = 4e-72
 Identities = 102/143 (71%), Positives = 113/143 (79%), Gaps = 3/143 (2%)

Query: 2   AAAAFASSSPVVGLGSSSLSSPKRTALGSSFVKSPVAARNPLRVAGACGGKVTCFERDWL 61
            AA  A++S V GLGSSSLS+P+R +  S FVK PV  R    +A A G K T F+RDWL
Sbjct: 1   MAATAATASTVSGLGSSSLSAPRRLS--SGFVKGPVTVRRRAVLARASGSKKT-FDRDWL 57

Query: 62  RTDLNVIGFGLIGWLAPSSIPAINGKSLTGLFFESIGTELAHFPTPPPLTSQFWLWLVTW 121
           R DLNVIGFGLIGWLAPSSIPAI G SLTGLFF+SIGTELAHFPT P L S+FWLWL+TW
Sbjct: 58  RKDLNVIGFGLIGWLAPSSIPAIGGNSLTGLFFDSIGTELAHFPTGPALDSKFWLWLITW 117

Query: 122 HFGLFLCLTFGQIGFKGRTEDYF 144
           H GLFLCLTFGQIGFKGR + YF
Sbjct: 118 HLGLFLCLTFGQIGFKGRKQGYF 140


Length = 141

>gnl|CDD|132103 TIGR03059, psaOeuk, photosystem I protein PsaO Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 146
PLN00046141 photosystem I reaction center subunit O; Provision 100.0
TIGR0305982 psaOeuk photosystem I protein PsaO. Members of thi 100.0
>PLN00046 photosystem I reaction center subunit O; Provisional Back     alignment and domain information
Probab=100.00  E-value=1.5e-79  Score=475.75  Aligned_cols=141  Identities=72%  Similarity=1.233  Sum_probs=136.2

Q ss_pred             ccccccCCcceeccccCCCCCCCccccccCcccCcccccchhhhhccCCCceeeeccchhccccccceeeeeeccccCCc
Q 032153            2 AAAAFASSSPVVGLGSSSLSSPKRTALGSSFVKSPVAARNPLRVAGACGGKVTCFERDWLRTDLNVIGFGLIGWLAPSSI   81 (146)
Q Consensus         2 ~aat~~~~stv~gl~~ssls~~~~~~l~s~f~~~~V~arnpl~~a~a~~gr~tcF~rdWlr~d~~V~~~gliGW~~PSsi   81 (146)
                      |||+|++++||+||+++++++  +.+++|+|+|++|.+|||++++.+++++++ |||||||+|++||++|||||++||+|
T Consensus         1 maa~~~a~stV~gL~~~sl~~--~~~~ss~f~~~~~~~~~~~~~~~a~~~~~t-F~rDWLr~d~~V~~~gl~GW~~PS~i   77 (141)
T PLN00046          1 MAATAATASTVSGLGSSSLSA--PRRLSSGFVKGPVTVRRRAVLARASGSKKT-FDRDWLRKDLNVIGFGLIGWLAPSSI   77 (141)
T ss_pred             ChHHHHhHHHhhhccccCccc--ccccccccccCcccccchhhhhhhcccccc-cchhhhhcccceeeeeeeeeeccccc
Confidence            566778888999999999986  678999999999999999999999999998 99999999999999999999999999


Q ss_pred             cccCCCchhhhhHhhhhhhhcCCCCCCCCCchHHHHHHHHHHHHHHHHHhhccccccccccccC
Q 032153           82 PAINGKSLTGLFFESIGTELAHFPTPPPLTSQFWLWLVTWHFGLFLCLTFGQIGFKGRTEDYFS  145 (146)
Q Consensus        82 pa~~g~SL~glF~~SIG~eLAhFPtgPal~~~FWL~lvtWHlGLF~~ltfGQIG~kgR~e~Yf~  145 (146)
                      |+||||||+|+|++|||+|||||||||||||||||||||||+|||+|||||||||||||||||+
T Consensus        78 pa~~g~sL~glF~~sIg~~LahfPt~Pal~~~fWL~litWHlGLF~~ltfGqIG~kgR~egYF~  141 (141)
T PLN00046         78 PAIGGNSLTGLFFDSIGTELAHFPTGPALDSKFWLWLITWHLGLFLCLTFGQIGFKGRKQGYFN  141 (141)
T ss_pred             cccCCchhHHHHHHHHHHHHhcCCCCCCCCcchHHHHHHHHHHHHHHHHhhhcccccccccccC
Confidence            9999999999999999999999999999999999999999999999999999999999999996



>TIGR03059 psaOeuk photosystem I protein PsaO Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00