Citrus sinensis ID: 026814
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
| GI | Length | Definition | Q cover | H cover | Identity | E-value |
|---|---|---|---|---|---|---|
| Query | 232 | | | | | |
| 255579087 | 275 | conserved hypothetical protein [Ricinus | 0.836 | 0.705 | 0.741 | 6e-83 |
| 224077518 | 225 | predicted protein [Populus trichocarpa] | 0.706 | 0.728 | 0.842 | 7e-78 |
| 449447484 | 294 | PREDICTED: thylakoid membrane protein sl | 0.728 | 0.574 | 0.816 | 9e-77 |
| 449481443 | 294 | PREDICTED: thylakoid membrane protein sl | 0.728 | 0.574 | 0.816 | 1e-76 |
| 297801752 | 287 | hypothetical protein ARALYDRAFT_494103 [ | 0.741 | 0.599 | 0.791 | 1e-74 |
| 30693285 | 286 | protein acclimation of photosynthesis to | 0.801 | 0.650 | 0.712 | 1e-73 |
| 334188069 | 431 | protein acclimation of photosynthesis to | 0.918 | 0.494 | 0.652 | 3e-73 |
| 25082754 | 286 | Unknown protein [Arabidopsis thaliana] | 0.801 | 0.650 | 0.707 | 4e-73 |
| 26450956 | 286 | unknown protein [Arabidopsis thaliana] | 0.801 | 0.650 | 0.707 | 6e-73 |
| 224127618 | 218 | predicted protein [Populus trichocarpa] | 0.616 | 0.655 | 0.881 | 2e-70 |
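The hit table above is easy to load into records for sorting or filtering. The following is a minimal sketch, assuming the rows are pipe-delimited in the column order GI, Length, Definition, Q cover, H cover, Identity, E-value; the `Hit` class and `parse_hit_row` function are hypothetical names introduced for illustration and are not part of the original report.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    """One row of the BLAST hit table (hypothetical container)."""
    gi: str
    length: int
    definition: str
    q_cover: float
    h_cover: float
    identity: float
    e_value: float


def parse_hit_row(row: str) -> Hit:
    """Parse one pipe-delimited hit row into a Hit record."""
    # Drop the outer pipes, split on the column separator, and keep the
    # first seven cells; any trailing empty cells are ignored.
    cells = [c.strip() for c in row.strip().strip("|").split("|")]
    gi, length, definition, q_cov, h_cov, ident, e_val = cells[:7]
    return Hit(gi, int(length), definition, float(q_cov),
               float(h_cov), float(ident), float(e_val))


# Three rows copied from the table above (the first keeps a trailing empty cell).
rows = [
    "| 255579087 | 275 | conserved hypothetical protein [Ricinus | 0.836 | 0.705 | 0.741 | 6e-83 | |",
    "| 224077518 | 225 | predicted protein [Populus trichocarpa] | 0.706 | 0.728 | 0.842 | 7e-78 |",
    "| 224127618 | 218 | predicted protein [Populus trichocarpa] | 0.616 | 0.655 | 0.881 | 2e-70 |",
]
hits = [parse_hit_row(r) for r in rows]

# Example query: which of these hits has the highest identity fraction?
best_by_identity = max(hits, key=lambda h: h.identity)
print(best_by_identity.gi, best_by_identity.identity)
```

Sorting by `e_value` instead would reproduce the order in which the report lists the hits.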
>gi|255579087|ref|XP_002530392.1| conserved hypothetical protein [Ricinus communis] gi|223530078|gb|EEF31998.1| conserved hypothetical protein [Ricinus communis]
Score = 312 bits (800), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 161/217 (74%), Positives = 182/217 (83%), Gaps = 23/217 (10%)
Query: 5 TLTRATNLFSQHQDDRYLGLSNHCLCFLSVHPASPPPLLAS---------KRHR-LKLAF 54
+++AT +F+ H +DR+ FLSV+P SPP LL S KRHR +KL F
Sbjct: 8 VVSKAT-VFNSH-NDRF---------FLSVYPHSPPLLLQSPRLDSRLCAKRHRNVKLVF 56
Query: 55 VAKAADSTQPSSATTSADKTLVPDDEFTLAKVSFGVIGLGLGISLLSYGFGAYFSIFPGS 114
VAKAADSTQPS+A+T+ K +V D+EF+LAKVSFGVIGLG+GISLLSYGFGAYF+I PGS
Sbjct: 57 VAKAADSTQPSTASTA--KAIVSDEEFSLAKVSFGVIGLGVGISLLSYGFGAYFNILPGS 114
Query: 115 EWSALMLTYGFPLAVIGMALKYAELKPVPCLTYSDAQSLRETCATPILKQVRNDVIRFRY 174
EWSA+MLTYGFPLA+IGMALKYAELKPVPCLTYSDAQ LRET ATPILKQVRNDVIR+RY
Sbjct: 115 EWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYSDAQMLRETSATPILKQVRNDVIRYRY 174
Query: 175 GDEQHLDEALKRIFQYGLGGGIPRRSAPVLQMIREEV 211
GDEQHLDEALKRIFQYG GGGIPRRSAP+LQMIREEV
Sbjct: 175 GDEQHLDEALKRIFQYGQGGGIPRRSAPILQMIREEV 211
Source: Ricinus communis; Species: Ricinus communis; Genus: Ricinus; Family: Euphorbiaceae; Order: Malpighiales; Class: (not given); Phylum: Streptophyta; Superkingdom: Eukaryota
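The summary line of the alignment above (`Identities = 161/217 (74%), ...`) can be checked against its own counts. The sketch below parses that line and recomputes each percentage. It assumes the percentages are truncated rather than rounded, which is what this output suggests (182/217 is about 83.9% but is reported as 83%); that is an observation from this report, not a documented guarantee. The function names are hypothetical.

```python
import re

# Matches e.g. "Identities = 161/217 (74%)"
STAT_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_alignment_stats(line: str) -> dict:
    """Return {name: (count, aligned_length, reported_percent)}."""
    stats = {}
    for name, num, den, pct in STAT_RE.findall(line):
        stats[name] = (int(num), int(den), int(pct))
    return stats


def floor_percent(count: int, total: int) -> int:
    """Integer percentage, truncated toward zero (assumed convention)."""
    return count * 100 // total


line = "Identities = 161/217 (74%), Positives = 182/217 (83%), Gaps = 23/217 (10%)"
stats = parse_alignment_stats(line)

for name, (count, total, reported) in stats.items():
    recomputed = floor_percent(count, total)
    print(f"{name}: {count}/{total} -> {recomputed}% (reported {reported}%)")
```

Under the same truncation assumption, 161/217 gives 0.741, which matches the Identity column for this hit in the summary table.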
>gi|224077518|ref|XP_002305283.1| predicted protein [Populus trichocarpa] gi|222848247|gb|EEE85794.1| predicted protein [Populus trichocarpa]
>gi|449447484|ref|XP_004141498.1| PREDICTED: thylakoid membrane protein slr0575-like [Cucumis sativus]
>gi|449481443|ref|XP_004156184.1| PREDICTED: thylakoid membrane protein slr0575-like [Cucumis sativus]
>gi|297801752|ref|XP_002868760.1| hypothetical protein ARALYDRAFT_494103 [Arabidopsis lyrata subsp. lyrata] gi|297314596|gb|EFH45019.1| hypothetical protein ARALYDRAFT_494103 [Arabidopsis lyrata subsp. lyrata]
>gi|30693285|ref|NP_198682.3| protein acclimation of photosynthesis to environment [Arabidopsis thaliana] gi|87116662|gb|ABD19695.1| At5g38660 [Arabidopsis thaliana] gi|332006962|gb|AED94345.1| protein acclimation of photosynthesis to environment [Arabidopsis thaliana]
>gi|334188069|ref|NP_001190435.1| protein acclimation of photosynthesis to environment [Arabidopsis thaliana] gi|332006963|gb|AED94346.1| protein acclimation of photosynthesis to environment [Arabidopsis thaliana]
>gi|25082754|gb|AAN71998.1| Unknown protein [Arabidopsis thaliana]
>gi|26450956|dbj|BAC42585.1| unknown protein [Arabidopsis thaliana]
>gi|224127618|ref|XP_002329322.1| predicted protein [Populus trichocarpa] gi|222870776|gb|EEF07907.1| predicted protein [Populus trichocarpa]
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
| ID | Length | Definition | Q cover | H cover | Identity | E-value |
|---|---|---|---|---|---|---|
| Query | 232 | | | | | |
| TAIR\|locus:2159888 | 286 | APE1 "AT5G38660" [Arabidopsis | 0.741 | 0.601 | 0.676 | 3.2e-58 |
TAIR|locus:2159888 APE1 "AT5G38660" [Arabidopsis thaliana (taxid:3702)]
Score = 598 (215.6 bits), Expect = 3.2e-58, P = 3.2e-58
Identities = 117/173 (67%), Positives = 135/173 (78%)
Query: 46 KRHRLKLAFVAKAADSTQPSSATTSADKTLVPDDEFTLAKXXXXXXXXXXXXXXXXXXXX 105
KR LKL V +AADST S + S D+TL+PDDEFTLAK
Sbjct: 54 KREVLKLDVVGRAADSTSSSPSVASGDRTLIPDDEFTLAKISFGVIGLGLGVSLLSYGFG 113
Query: 106 AYFSIFPGSEWSALMLTYGFPLAVIGMALKYAELKPVPCLTYSDAQSLRETCATPILKQV 165
AYF+I PG+EWSA+MLTYGFPL++IGMALKYAELKPVPCL+YSDA LRE+CATPIL QV
Sbjct: 114 AYFTILPGTEWSAIMLTYGFPLSIIGMALKYAELKPVPCLSYSDAVKLRESCATPILTQV 173
Query: 166 RNDVIRFRYGDEQHLDEALKRIFQYGLGGGIPRRSAPVLQMIREEVCLSNFRF 218
RNDV R+RYGDEQHL+EALKRIFQYGLGGGIPRRSAP+LQ+IREEV L++ R+
Sbjct: 174 RNDVTRYRYGDEQHLEEALKRIFQYGLGGGIPRRSAPILQLIREEV-LTDGRY 225
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.322 0.135 0.410 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 232 212 0.00083 112 3 11 22 0.46 32
31 0.49 35
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 1
No. of states in DFA: 604 (64 KB)
Total size of DFA: 174 KB (2101 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 18.47u 0.12s 18.59t Elapsed: 00:00:01
Total cpu time: 18.47u 0.12s 18.59t Elapsed: 00:00:01
Start: Sat May 11 06:37:09 2013 End: Sat May 11 06:37:10 2013
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by EzyPred Server
Original result from EzyPred Server
Failed to connect to the EzyPred server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Failed to connect to the STRING server
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
| ID | Length | Definition | E-value |
|---|---|---|---|
| Query | 232 | | |
| pfam11016 | 146 | pfam11016, DUF2854, Protein of unknown function (D | 7e-44 |
>gnl|CDD|192697 pfam11016, DUF2854, Protein of unknown function (DUF2854)
Score = 144 bits (364), Expect = 7e-44
Identities = 47/113 (41%), Positives = 65/113 (57%), Gaps = 12/113 (10%)
Query: 102 YGFGAYFSIFPGSEWSALMLTYGFPLAVIGMALKYAELKPVP--CLTYSDAQSLRETCAT 159
GF AYF+ G+ S YG P+ + G+ALK +ELKPVP C ++A +LRE AT
Sbjct: 1 VGFVAYFT--DGANLSLPGFFYGIPILLGGLALKSSELKPVPVRCTPSAEALALREKQAT 58
Query: 160 PILKQVRNDVIRFRYGDEQHLDEALKRIFQYGLGGGIPRR-SAPVLQMIREEV 211
PIL Q+R DV R+RYG + HL+ +L+R+ G+ P L I E +
Sbjct: 59 PILNQLRKDVTRYRYGQKAHLESSLERL-------GLWDDDEPPQLLGIEELL 104
This family of proteins has no known function. Length = 146
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
| ID | Length | Definition | Probability |
|---|---|---|---|
| Query | 232 | | |
| PF11016 | 147 | DUF2854: Protein of unknown function (DUF2854); In | 100.0 |
>PF11016 DUF2854: Protein of unknown function (DUF2854); InterPro: IPR021275 This family of proteins has no known function
Probab=100.00 E-value=1.3e-57 Score=377.71 Aligned_cols=118 Identities=47% Similarity=0.688 Sum_probs=111.7
Q ss_pred hhhhhhhccCCCCcchhhhhhhhhhHHHHHhhhhccccCCCC--CCCchHHHHHHhhcCchHHHHhhhcccceecchhhh
Q 026814 102 YGFGAYFSIFPGSEWSALMLTYGFPLAVIGMALKYAELKPVP--CLTYSDAQSLRETCATPILKQVRNDVIRFRYGDEQH 179 (232)
Q Consensus 102 vGf~AYf~~~p~anLSl~gffYGiPIlLgGLALK~AELkPvp--~~T~~~~~alRe~qAT~~q~Qvr~DVTRyRYGqeaH 179 (232)
|||+|||++++ |||++||||||||+||||||||||||||| |.|+++++++||+|||+||+|||+||||||||||||
T Consensus 1 ~Gf~aY~~~~a--~lsl~~ffYGiPilLgGlALK~aEL~Pvp~~~~~~~~~~~lRe~qat~~~~qlr~DVTR~RYGqeaH 78 (147)
T PF11016_consen 1 IGFVAYFTDNA--NLSLPGFFYGIPILLGGLALKYAELKPVPFSCTTSPEALALREQQATPTQNQLRKDVTRYRYGQEAH 78 (147)
T ss_pred CceeEEecCCC--ceeeehHHhhhHHHHHHHHHHHhcCCCCCcccCCHHHHHHHHHhcCCHHHHHHHhhhhhhhccHHHH
Confidence 69999999755 59999999999999999999999999999 889999999999999999999999999999999999
Q ss_pred HHHHHHHHHhcCCCCCCCCCCCcchhhheeeeecCceeEEEeeCccc
Q 026814 180 LDEALKRIFQYGLGGGIPRRSAPVLQMIREEVCLSNFRFKHECFGCL 226 (232)
Q Consensus 180 Ld~ALerLf~~gL~~gi~d~~~PqL~~IrE~~~eg~Y~Lvle~~~~~ 226 (232)
||+|||+| ||++ +|+++|+|++|||+++||+|+|+|++..+.
T Consensus 79 Le~aL~~L---~L~~--~~~~~P~L~~irE~~~~g~Y~LvL~F~s~~ 120 (147)
T PF11016_consen 79 LEEALERL---GLSW--DDDEPPQLQGIREEVIDGAYGLVLEFESPA 120 (147)
T ss_pred HHHHHHHh---cCCC--CcccChhhhheeeeeeCCceEEEEEEecCC
Confidence 99999999 7754 899999999999999999999999987543
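The HHsearch hit summary above is a single line of `key=value` tokens, so it can be turned into a dictionary with very little code. This is a minimal sketch, assuming every token has the form `key=value` and that a trailing `%` marks a percentage; `parse_hhsearch_summary` is a hypothetical helper, not part of HHsearch itself.

```python
def parse_hhsearch_summary(line: str) -> dict:
    """Parse a line of whitespace-separated key=value tokens into floats."""
    fields = {}
    for token in line.split():
        key, _, value = token.partition("=")
        if value.endswith("%"):
            # Store percentages as fractions, e.g. "47%" -> 0.47
            fields[key] = float(value[:-1]) / 100.0
        else:
            fields[key] = float(value)
    return fields


summary = ("Probab=100.00 E-value=1.3e-57 Score=377.71 Aligned_cols=118 "
           "Identities=47% Similarity=0.688 Sum_probs=111.7")
fields = parse_hhsearch_summary(summary)
print(fields["Probab"], fields["E-value"], fields["Identities"])
```

A caller could then compare `fields["Probab"]` against the 80.00 probability cutoff that the structure-template sections of this report apply.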
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00