Citrus Sinensis ID: 029778
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 188 | ||||||
| 449521800 | 187 | PREDICTED: uncharacterized LOC101212505 | 0.994 | 1.0 | 0.723 | 3e-76 | |
| 449464600 | 187 | PREDICTED: uncharacterized protein LOC10 | 0.994 | 1.0 | 0.723 | 4e-76 | |
| 255566971 | 187 | conserved hypothetical protein [Ricinus | 0.994 | 1.0 | 0.797 | 2e-74 | |
| 357478013 | 856 | hypothetical protein MTR_4g114100 [Medic | 0.984 | 0.216 | 0.671 | 1e-69 | |
| 224122574 | 188 | predicted protein [Populus trichocarpa] | 0.941 | 0.941 | 0.785 | 4e-68 | |
| 225445452 | 187 | PREDICTED: uncharacterized protein LOC10 | 0.994 | 1.0 | 0.734 | 1e-67 | |
| 297808079 | 188 | hypothetical protein ARALYDRAFT_910052 [ | 0.941 | 0.941 | 0.723 | 4e-64 | |
| 15241270 | 188 | uncharacterized protein [Arabidopsis tha | 0.941 | 0.941 | 0.717 | 2e-63 | |
| 356563030 | 194 | PREDICTED: uncharacterized protein LOC10 | 0.936 | 0.907 | 0.664 | 4e-56 | |
| 356548597 | 194 | PREDICTED: uncharacterized protein LOC10 | 0.936 | 0.907 | 0.653 | 9e-55 |
| >gi|449521800|ref|XP_004167917.1| PREDICTED: uncharacterized LOC101212505 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
Score = 289 bits (740), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 136/188 (72%), Positives = 157/188 (83%), Gaps = 1/188 (0%)
Query: 1 MSGGSTPVAGGYMRQRHSQGYASSGDDLEDDACSRPQPFSPSIPRAKTWVEILENVLWIA 60
MSGG TPVAGGYMRQRHSQGYASSGDD+EDDACSR Q S P+++TW+E++ENVLW+
Sbjct: 1 MSGG-TPVAGGYMRQRHSQGYASSGDDIEDDACSRFQTMSFHTPKSRTWIEVVENVLWLV 59
Query: 61 SALFIIYYGDRHSNFIYLLWHDERIRRLPLYLGMVLVGLNVAFFFYTSMLVWNVRRFDEK 120
SA FI+YYGDRHSN IYLLWHD R+RRLPLYLGMV +GLN+ F YTSM W++RRFDEK
Sbjct: 60 SATFIVYYGDRHSNMIYLLWHDGRVRRLPLYLGMVGIGLNIVIFIYTSMSAWSIRRFDEK 119
Query: 121 WESFCISALPYVTLLGVVSFFLLSFALWPIWSFLTLPLLFTLFMASMVIFPHSMIGTLRP 180
WE +S LP VTLLG++SF LLSFALWPIW FLTLPLLFT+FMA MV+FP+ MIG RP
Sbjct: 120 WELASLSVLPIVTLLGLISFCLLSFALWPIWGFLTLPLLFTMFMACMVVFPNIMIGKFRP 179
Query: 181 QNDAFRID 188
Q+D R D
Sbjct: 180 QSDFLRTD 187
|
Source: Cucumis sativus Species: Cucumis sativus Genus: Cucumis Family: Cucurbitaceae Order: Cucurbitales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|449464600|ref|XP_004150017.1| PREDICTED: uncharacterized protein LOC101212505 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|255566971|ref|XP_002524468.1| conserved hypothetical protein [Ricinus communis] gi|223536256|gb|EEF37908.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|357478013|ref|XP_003609292.1| hypothetical protein MTR_4g114100 [Medicago truncatula] gi|355510347|gb|AES91489.1| hypothetical protein MTR_4g114100 [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|224122574|ref|XP_002330515.1| predicted protein [Populus trichocarpa] gi|222872449|gb|EEF09580.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|225445452|ref|XP_002285091.1| PREDICTED: uncharacterized protein LOC100261198 [Vitis vinifera] gi|297738931|emb|CBI28176.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|297808079|ref|XP_002871923.1| hypothetical protein ARALYDRAFT_910052 [Arabidopsis lyrata subsp. lyrata] gi|297317760|gb|EFH48182.1| hypothetical protein ARALYDRAFT_910052 [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|15241270|ref|NP_197512.1| uncharacterized protein [Arabidopsis thaliana] gi|28466915|gb|AAO44066.1| At5g20120 [Arabidopsis thaliana] gi|110743897|dbj|BAE99783.1| hypothetical protein [Arabidopsis thaliana] gi|332005414|gb|AED92797.1| uncharacterized protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|356563030|ref|XP_003549769.1| PREDICTED: uncharacterized protein LOC100818154 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|356548597|ref|XP_003542687.1| PREDICTED: uncharacterized protein LOC100792201 [Glycine max] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 188 | ||||||
| TAIR|locus:2149219 | 188 | AT5G20120 "AT5G20120" [Arabido | 0.994 | 0.994 | 0.597 | 2.5e-58 |
| TAIR|locus:2149219 AT5G20120 "AT5G20120" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 599 (215.9 bits), Expect = 2.5e-58, P = 2.5e-58
Identities = 113/189 (59%), Positives = 132/189 (69%)
Query: 1 MSGGSTPVAGG-YMRQRHSQGYASSGDDLEDDACSRPQPFSPSIPRAKTWVEILENVLWI 59
MSGG TPV GG Y+RQRHSQGYAS GDDLEDDACSRPQPFS PR+KTWVEILENVLWI
Sbjct: 1 MSGG-TPVGGGGYIRQRHSQGYASGGDDLEDDACSRPQPFSLENPRSKTWVEILENVLWI 59
Query: 60 ASALFIIYYGDRHSNFIYLLWHDERIRRLPLYLGMVLVGLNVAFFFYTSMLVWNVRRFDE 119
ASA+FI+Y+GDRHSN I++L D RI+R+PLY GM+ + +N+ Y SML W++RRFDE
Sbjct: 60 ASAVFIVYFGDRHSNMIHILLRDARIKRMPLYFGMLGIAVNIIIIIYESMLSWSMRRFDE 119
Query: 120 KWESFCISALPYXXXXXXXXXXXXXXXXWPIWSXXXXXXXXXXXMASMVIFPHSMIGTLR 179
KWE + ISALP+ WPIW MA +V+FPH MI R
Sbjct: 120 KWELWSISALPFITLLGIISFGLLSFALWPIWGFLTLPLLFTLFMACLVVFPHLMIIKFR 179
Query: 180 PQNDAFRID 188
PQND RID
Sbjct: 180 PQNDELRID 188
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.326 0.140 0.471 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 188 161 0.00074 107 3 11 22 0.44 31
30 0.44 34
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 1
No. of states in DFA: 606 (64 KB)
Total size of DFA: 179 KB (2102 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:01
No. of threads or processors used: 24
Search cpu time: 15.09u 0.09s 15.18t Elapsed: 00:00:01
Total cpu time: 15.09u 0.09s 15.18t Elapsed: 00:00:02
Start: Thu May 9 17:29:19 2013 End: Thu May 9 17:29:21 2013
|
|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| eugene3.01450022 | hypothetical protein (189 aa) | |||||||
(Populus trichocarpa) | ||||||||
| Sorry, there are no predicted associations at the current settings. |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 188 | |||
| PF10856 | 118 | DUF2678: Protein of unknown function (DUF2678); In | 84.97 | |
| PF13937 | 81 | DUF4212: Domain of unknown function (DUF4212) | 83.42 | |
| PF03348 | 429 | Serinc: Serine incorporator (Serinc); InterPro: IP | 83.3 |
| >PF10856 DUF2678: Protein of unknown function (DUF2678); InterPro: IPR022564 This family of proteins has no known function | Back alignment and domain information |
|---|
Probab=84.97 E-value=1.9 Score=34.50 Aligned_cols=61 Identities=10% Similarity=0.111 Sum_probs=43.9
Q ss_pred cCCCcchhHHHHHHHHHHHHHHHHHHHHhhhhhhccccchhhhccCchhHHHHHHHHHHHHHHhhhhhh
Q 029778 82 DERIRRLPLYLGMVLVGLNVAFFFYTSMLVWNVRRFDEKWESFCISALPYVTLLGVVSFFLLSFALWPI 150 (188)
Q Consensus 82 d~~i~r~~f~lgll~l~i~~~i~~Y~~v~l~~ik~~~~~wE~~~P~~iPiaT~~gi~s~i~f~iALWPV 150 (188)
+++..-.+|.+.+++++++..+ +.+|+|+-| -|-+.-..|...+++-++..+|-|.=.|.|
T Consensus 56 ~p~p~~iffavcI~l~~~s~~l------LI~WYR~gd--l~Pkfr~li~~~~~~ivllci~anlYf~~~ 116 (118)
T PF10856_consen 56 PPKPLHIFFAVCILLICISAIL------LIFWYRQGD--LDPKFRYLIYYNCFSIVLLCICANLYFHDV 116 (118)
T ss_pred CCCceEEehHHHHHHHHHHHHh------heeehhcCC--CChhHHHHHHHHHHHHHHHHHhheeEEeec
Confidence 3666678887777777666544 345677543 444556789999999999999998877654
|
|
| >PF13937 DUF4212: Domain of unknown function (DUF4212) | Back alignment and domain information |
|---|
| >PF03348 Serinc: Serine incorporator (Serinc); InterPro: IPR005016 This is a family of proteins which display differential expression in various tumour and cell lines | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00