Citrus Sinensis ID: 030500
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 176 | ||||||
| 255541316 | 223 | conserved hypothetical protein [Ricinus | 0.909 | 0.717 | 0.7 | 4e-61 | |
| 449454636 | 222 | PREDICTED: uncharacterized protein LOC10 | 0.892 | 0.707 | 0.652 | 9e-56 | |
| 224063866 | 219 | predicted protein [Populus trichocarpa] | 0.914 | 0.735 | 0.630 | 4e-54 | |
| 449498986 | 263 | PREDICTED: uncharacterized protein LOC10 | 0.761 | 0.509 | 0.725 | 5e-54 | |
| 356524043 | 219 | PREDICTED: uncharacterized protein LOC10 | 0.892 | 0.716 | 0.615 | 5e-52 | |
| 357469573 | 220 | hypothetical protein MTR_4g023550 [Medic | 0.903 | 0.722 | 0.618 | 7e-52 | |
| 297817400 | 216 | hypothetical protein ARALYDRAFT_486553 [ | 0.897 | 0.731 | 0.642 | 8e-52 | |
| 15232964 | 214 | uncharacterized protein [Arabidopsis tha | 0.880 | 0.724 | 0.650 | 5e-51 | |
| 296089151 | 321 | unnamed protein product [Vitis vinifera] | 0.897 | 0.492 | 0.640 | 1e-50 | |
| 225453911 | 218 | PREDICTED: uncharacterized protein LOC10 | 0.897 | 0.724 | 0.640 | 4e-50 |
| >gi|255541316|ref|XP_002511722.1| conserved hypothetical protein [Ricinus communis] gi|223548902|gb|EEF50391.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
Score = 239 bits (609), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 119/170 (70%), Positives = 140/170 (82%), Gaps = 10/170 (5%)
Query: 1 MALMA--SSNGYCYYNYKKKYNKRYNCSTVPRITLSQSQ-------EPRDQTSRREIILR 51
M MA +++ YC Y YK + +K +N PR+ LSQ Q +P DQ +RREIILR
Sbjct: 1 MVSMAAPTTSAYCGYYYKTR-SKNFNGLIRPRVILSQQQNDNDNNNKPSDQIARREIILR 59
Query: 52 SSEVAVIGAIFNLSGKKPEYLGVQKNQQALALCPATKNCISTAENISDLTHYAPPWNYNR 111
SSE+AV+GAIFNLSGKKPEYLGVQKNQQALALCPATKNCIST+ENISDLTHY PPWNYN
Sbjct: 60 SSEIAVVGAIFNLSGKKPEYLGVQKNQQALALCPATKNCISTSENISDLTHYTPPWNYNG 119
Query: 112 GRKKPVSREVAMEELLQVIKSTKPDRFTPRLVEKKDDYVHVEYESPVLGV 161
RKKPV++EVAM+ELLQVI+STKPD+FTPR+ EK DDYVHVEY+SP+LG+
Sbjct: 120 NRKKPVNKEVAMKELLQVIQSTKPDKFTPRVAEKSDDYVHVEYQSPILGL 169
|
Source: Ricinus communis Species: Ricinus communis Genus: Ricinus Family: Euphorbiaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|449454636|ref|XP_004145060.1| PREDICTED: uncharacterized protein LOC101221074 [Cucumis sativus] gi|449472979|ref|XP_004153749.1| PREDICTED: uncharacterized protein LOC101211056 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|224063866|ref|XP_002301293.1| predicted protein [Populus trichocarpa] gi|222843019|gb|EEE80566.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|449498986|ref|XP_004160689.1| PREDICTED: uncharacterized protein LOC101227119 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|356524043|ref|XP_003530642.1| PREDICTED: uncharacterized protein LOC100806732 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|357469573|ref|XP_003605071.1| hypothetical protein MTR_4g023550 [Medicago truncatula] gi|355506126|gb|AES87268.1| hypothetical protein MTR_4g023550 [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|297817400|ref|XP_002876583.1| hypothetical protein ARALYDRAFT_486553 [Arabidopsis lyrata subsp. lyrata] gi|297322421|gb|EFH52842.1| hypothetical protein ARALYDRAFT_486553 [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|15232964|ref|NP_191640.1| uncharacterized protein [Arabidopsis thaliana] gi|7329691|emb|CAB82685.1| putative protein [Arabidopsis thaliana] gi|110741004|dbj|BAE98596.1| hypothetical protein [Arabidopsis thaliana] gi|117168213|gb|ABK32189.1| At3g60810 [Arabidopsis thaliana] gi|332646589|gb|AEE80110.1| uncharacterized protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|296089151|emb|CBI38854.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|225453911|ref|XP_002279226.1| PREDICTED: uncharacterized protein LOC100245470 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 176 | ||||||
| TAIR|locus:2101826 | 214 | AT3G60810 "AT3G60810" [Arabido | 0.715 | 0.588 | 0.75 | 1.8e-48 |
| TAIR|locus:2101826 AT3G60810 "AT3G60810" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 506 (183.2 bits), Expect = 1.8e-48, P = 1.8e-48
Identities = 96/128 (75%), Positives = 112/128 (87%)
Query: 34 SQSQEPRDQTSRREIILRSSEVAVIGAIFNLSGKKPEYLGVQKNQQALALCPATKNCIST 93
S + P+ RR+IILRSSE+A+IGAIF LSGKKP+YLGVQKN++ LALCPAT NCIST
Sbjct: 35 SYDKHPK-LIGRRDIILRSSELAMIGAIFQLSGKKPDYLGVQKNER-LALCPATNNCIST 92
Query: 94 AENISDLTHYAPPWNYNRGRKKPVSREVAMEELLQVIKSTKPDRFTPRLVEKKDDYVHVE 153
+ENISD HYAPPWNYN GRK PV+R+VAM+ELL VIKS KPD+FTPR+VEKKDDYVHVE
Sbjct: 93 SENISDRVHYAPPWNYNGGRKTPVNRQVAMKELLNVIKSVKPDKFTPRIVEKKDDYVHVE 152
Query: 154 YESPVLGV 161
YESP+LG+
Sbjct: 153 YESPILGL 160
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.317 0.131 0.378 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 176 159 0.00091 106 3 11 22 0.37 32
30 0.43 34
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 1
No. of states in DFA: 584 (62 KB)
Total size of DFA: 138 KB (2086 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 14.97u 0.13s 15.10t Elapsed: 00:00:01
Total cpu time: 14.98u 0.13s 15.11t Elapsed: 00:00:01
Start: Sat May 11 03:28:37 2013 End: Sat May 11 03:28:38 2013
|
|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| estExt_fgenesh4_pg.C_LG_II1341 | hypothetical protein (219 aa) | ||||||||||
(Populus trichocarpa) | |||||||||||
| eugene3.00170453 | • | 0.407 |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 176 | |||
| pfam07386 | 118 | pfam07386, DUF1499, Protein of unknown function (D | 1e-08 | |
| COG4446 | 141 | COG4446, COG4446, Uncharacterized protein conserve | 1e-07 |
| >gnl|CDD|219394 pfam07386, DUF1499, Protein of unknown function (DUF1499) | Back alignment and domain information |
|---|
Score = 50.3 bits (121), Expect = 1e-08
Identities = 22/81 (27%), Positives = 31/81 (38%), Gaps = 10/81 (12%)
Query: 81 LALCPATKNCISTAENIS-DLTHYAPPWNYNRGRKKPVSREVAMEELLQVIKSTKPDRFT 139
L+ CP + NC+S+A S D P S E A+ LL V ++
Sbjct: 3 LSTCPDSPNCVSSAAAASRDAYPDIAPLTL------DGSPEEALAALLAVAEALGRTIVV 56
Query: 140 PRLVEKKDDYVHVEYESPVLG 160
E D Y+ SP+ G
Sbjct: 57 T---EPTDGYIEATATSPLFG 74
|
This family consists of several hypothetical bacterial and plant proteins of around 125 residues in length. The function of this family is unknown. Length = 118 |
| >gnl|CDD|226854 COG4446, COG4446, Uncharacterized protein conserved in bacteria [Function unknown] | Back alignment and domain information |
|---|
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 176 | |||
| COG4446 | 141 | Uncharacterized protein conserved in bacteria [Fun | 99.97 | |
| PF07386 | 118 | DUF1499: Protein of unknown function (DUF1499); In | 99.87 |
| >COG4446 Uncharacterized protein conserved in bacteria [Function unknown] | Back alignment and domain information |
|---|
Probab=99.97 E-value=1.7e-32 Score=215.63 Aligned_cols=94 Identities=32% Similarity=0.542 Sum_probs=89.4
Q ss_pred HhcCCCCCCcccccCcccccCCCCCCCeeeeccccCCCCcccCCeeccCCCCCCCCHHHHHHHHHHHHHhcCCCCCCcEE
Q 030500 63 NLSGKKPEYLGVQKNQQALALCPATKNCISTAENISDLTHYAPPWNYNRGRKKPVSREVAMEELLQVIKSTKPDRFTPRL 142 (176)
Q Consensus 63 ~~sg~~P~~LGV~dG~~~La~CP~sPNCVSSqa~~~D~~h~I~Pw~y~g~~~~~~s~e~A~~~L~~vl~~~~~~~f~~~I 142 (176)
.++|+ |.+|||.+|+ |+|||++|||||||+ .|..|.|+|+.|.++ ++.|.++|++++.++|| ++|
T Consensus 11 ~f~~s-~~~lGV~sgr--lapCpn~PNCVssQ~--adt~h~iaPl~f~~~------~~~a~e~l~~il~~lP~----t~i 75 (141)
T COG4446 11 AFSGS-PCNLGVDSGR--LAPCPNSPNCVSSQD--ADTKHAIAPLNFILD------PGVAIEQLERILLSLPG----TVI 75 (141)
T ss_pred hhccC-ccccCcccCc--ccCCCCCCCeeeccc--ccchhcccccccccC------HHHHHHHHHHHHhhCCC----ceE
Confidence 57777 9999999998 999999999999998 799999999999986 88999999999999998 999
Q ss_pred EEecCCEEEEEEEcCCCCcccceEEEeee
Q 030500 143 VEKKDDYVHVEYESPVLGVGLAHHTFVQL 171 (176)
Q Consensus 143 Ve~~~~YL~aef~S~ifGFVDDVEF~~d~ 171 (176)
|+++++||||||+|++||||||||||+++
T Consensus 76 ve~~~nYl~ae~~Srlf~FVDDlEfyl~~ 104 (141)
T COG4446 76 VEKNDNYLRAECTSRLFGFVDDLEFYLPQ 104 (141)
T ss_pred eecCchHHHHHHHHHHhhcccceEEecCC
Confidence 99999999999999999999999999975
|
|
| >PF07386 DUF1499: Protein of unknown function (DUF1499); InterPro: IPR010865 This family consists of several hypothetical bacterial and plant proteins of around 125 residues in length | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 176 | |||
| 1vt4_I | 1221 | APAF-1 related killer DARK; drosophila apoptosome, | 7e-05 |
| >1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 | Back alignment and structure |
|---|
Score = 41.4 bits (96), Expect = 7e-05
Identities = 25/152 (16%), Positives = 50/152 (32%), Gaps = 38/152 (25%)
Query: 10 YCYYN----YKKKYNKRYNCSTV---PRITLSQS------QEP-RDQTSRR--EIILRSS 53
Y Y + ++ + ++C V P+ LS+ + R +L
Sbjct: 16 YQYKDILSVFEDAFVDNFDCKDVQDMPKSILSKEEIDHIIMSKDAVSGTLRLFWTLLSKQ 75
Query: 54 EVAVIGAIFNLSGKKPEY------LGVQKNQQALALCPATKNCISTAENISDLTHYAPPW 107
E V F + Y + ++ Q ++ T+ I + + +
Sbjct: 76 EEMV--QKFVEEVLRINYKFLMSPIKTEQRQPSMM----TRMYIEQRDRL---------Y 120
Query: 108 NYNRG-RKKPVSREVAMEELLQVIKSTKPDRF 138
N N+ K VSR +L Q + +P +
Sbjct: 121 NDNQVFAKYNVSRLQPYLKLRQALLELRPAKN 152
|
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00