Citrus Sinensis ID: 023468
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 282 | ||||||
| 224128117 | 271 | predicted protein [Populus trichocarpa] | 0.950 | 0.988 | 0.806 | 1e-122 | |
| 356498525 | 287 | PREDICTED: uncharacterized protein C9orf | 0.975 | 0.958 | 0.781 | 1e-119 | |
| 358248108 | 288 | uncharacterized protein LOC100812323 [Gl | 0.975 | 0.954 | 0.774 | 1e-118 | |
| 297744059 | 298 | unnamed protein product [Vitis vinifera] | 0.960 | 0.909 | 0.775 | 1e-110 | |
| 225437728 | 270 | PREDICTED: uncharacterized protein LOC10 | 0.950 | 0.992 | 0.780 | 1e-110 | |
| 255556659 | 293 | Protein C9orf78, putative [Ricinus commu | 0.957 | 0.921 | 0.767 | 1e-109 | |
| 449463519 | 293 | PREDICTED: uncharacterized protein LOC10 | 0.964 | 0.928 | 0.708 | 1e-109 | |
| 297848438 | 277 | hypothetical protein ARALYDRAFT_887370 [ | 0.939 | 0.956 | 0.742 | 1e-107 | |
| 449481099 | 305 | PREDICTED: uncharacterized LOC101215146 | 0.964 | 0.891 | 0.67 | 1e-105 | |
| 217071714 | 291 | unknown [Medicago truncatula] | 0.992 | 0.962 | 0.715 | 1e-105 |
| >gi|224128117|ref|XP_002320248.1| predicted protein [Populus trichocarpa] gi|222861021|gb|EEE98563.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
Score = 445 bits (1144), Expect = e-122, Method: Compositional matrix adjust.
Identities = 221/274 (80%), Positives = 244/274 (89%), Gaps = 6/274 (2%)
Query: 10 EKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAA 69
+KK+NFRKR++EE+E + DDE+ERRLALEE+KFLQKQRERKSGIPA+ + Q+A
Sbjct: 3 QKKRNFRKRTFEEDEHSKAS-DDDEQERRLALEEVKFLQKQRERKSGIPALATTSQTATT 61
Query: 70 AGGGGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVN 129
K++EK +GDGEK+ELVLQDTFAQETAVMVEDPNML+YVEQELAKKRGKNID
Sbjct: 62 VAA----KLTEKADGDGEKEELVLQDTFAQETAVMVEDPNMLQYVEQELAKKRGKNIDAT 117
Query: 130 DRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK 189
D+VE +LK AEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKL+NIEETEAAK
Sbjct: 118 DQVETELKRAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLRNIEETEAAK 177
Query: 190 KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGSQDDG-AGSRP 248
KLLQEKRLMGR KS+FSIPSSYSADYFQRGRDYAEKLRR+HPELYKDR QDD AGS+P
Sbjct: 178 KLLQEKRLMGRPKSEFSIPSSYSADYFQRGRDYAEKLRRDHPELYKDRSLQDDAVAGSKP 237
Query: 249 TDNSTDAAGSRQAATDQFMLERFRKRERHRVMRR 282
DNSTDAAG RQAATD+FMLERFRKRERHRVMRR
Sbjct: 238 ADNSTDAAGRRQAATDEFMLERFRKRERHRVMRR 271
|
Source: Populus trichocarpa Species: Populus trichocarpa Genus: Populus Family: Salicaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|356498525|ref|XP_003518101.1| PREDICTED: uncharacterized protein C9orf78 homolog [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|358248108|ref|NP_001239815.1| uncharacterized protein LOC100812323 [Glycine max] gi|255645199|gb|ACU23097.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|297744059|emb|CBI37029.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|225437728|ref|XP_002280535.1| PREDICTED: uncharacterized protein LOC100250416 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|255556659|ref|XP_002519363.1| Protein C9orf78, putative [Ricinus communis] gi|223541430|gb|EEF42980.1| Protein C9orf78, putative [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|449463519|ref|XP_004149481.1| PREDICTED: uncharacterized protein LOC101215146 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|297848438|ref|XP_002892100.1| hypothetical protein ARALYDRAFT_887370 [Arabidopsis lyrata subsp. lyrata] gi|297337942|gb|EFH68359.1| hypothetical protein ARALYDRAFT_887370 [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|449481099|ref|XP_004156081.1| PREDICTED: uncharacterized LOC101215146 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|217071714|gb|ACJ84217.1| unknown [Medicago truncatula] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 282 | ||||||
| TAIR|locus:2204888 | 279 | AT1G02330 "AT1G02330" [Arabido | 0.936 | 0.946 | 0.709 | 1.2e-95 | |
| UNIPROTKB|F1P4D4 | 293 | C17H9orf78 "Uncharacterized pr | 0.705 | 0.679 | 0.349 | 1e-22 | |
| UNIPROTKB|F1RQE0 | 289 | C9orf78 "Uncharacterized prote | 0.702 | 0.685 | 0.342 | 4.9e-21 | |
| UNIPROTKB|Q1JQC8 | 265 | C11H9ORF78 "Uncharacterized pr | 0.702 | 0.747 | 0.338 | 1e-20 | |
| UNIPROTKB|Q9NZ63 | 289 | C9orf78 "Uncharacterized prote | 0.702 | 0.685 | 0.338 | 1e-20 | |
| MGI|MGI:2385132 | 289 | BC005624 "cDNA sequence BC0056 | 0.702 | 0.685 | 0.338 | 1e-20 | |
| ZFIN|ZDB-GENE-041010-15 | 289 | zgc:103692 "zgc:103692" [Danio | 0.709 | 0.692 | 0.311 | 3.5e-20 |
| TAIR|locus:2204888 AT1G02330 "AT1G02330" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 951 (339.8 bits), Expect = 1.2e-95, P = 1.2e-95
Identities = 198/279 (70%), Positives = 222/279 (79%)
Query: 14 NFRKRSYEEEEETT--NK--LSXXXXXXXXXXXXIKFLQKQRERKSGIPAIPSALQSXXX 69
NFRKRS+EEEEE NK +S +KFLQK RERK GIPA+ S QS
Sbjct: 6 NFRKRSFEEEEEDNDVNKAAISEEEEKRRLALEEVKFLQKLRERKLGIPALSSTAQSSIG 65
Query: 70 XXXXXLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVN 129
K EK E +GEK+ELVLQDTFAQETAV++EDPNM+KY+EQELAKKRG+NID
Sbjct: 66 K-----VKPVEKTETEGEKEELVLQDTFAQETAVLIEDPNMVKYIEQELAKKRGRNIDDA 120
Query: 130 DRVENDLKHAEDELYKIPEHLKVKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK 189
+ VEN+LK EDELYKIP+HLKVKKR+SEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK
Sbjct: 121 EEVENELKRVEDELYKIPDHLKVKKRSSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAK 180
Query: 190 KLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYKDRGS-QDDGAGSRP 248
KLLQE+RLMGR KS+FSIPSSYSADYFQRG+DYAEKLRREHPELYKDRG Q DG ++P
Sbjct: 181 KLLQERRLMGRPKSEFSIPSSYSADYFQRGKDYAEKLRREHPELYKDRGGPQADGEAAKP 240
Query: 249 -----TDNSTDAAGSRQAATDQFMLERFRKRERHRVMRR 282
T+N+ D+ SRQAATDQ MLERFRKRER+RVMRR
Sbjct: 241 STSSSTNNNADSGKSRQAATDQIMLERFRKRERNRVMRR 279
|
|
| UNIPROTKB|F1P4D4 C17H9orf78 "Uncharacterized protein" [Gallus gallus (taxid:9031)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|F1RQE0 C9orf78 "Uncharacterized protein" [Sus scrofa (taxid:9823)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q1JQC8 C11H9ORF78 "Uncharacterized protein" [Bos taurus (taxid:9913)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q9NZ63 C9orf78 "Uncharacterized protein C9orf78" [Homo sapiens (taxid:9606)] | Back alignment and assigned GO terms |
|---|
| MGI|MGI:2385132 BC005624 "cDNA sequence BC005624" [Mus musculus (taxid:10090)] | Back alignment and assigned GO terms |
|---|
| ZFIN|ZDB-GENE-041010-15 zgc:103692 "zgc:103692" [Danio rerio (taxid:7955)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| estExt_fgenesh4_pg.C_LG_XIV0499 | hypothetical protein (271 aa) | |||||||
(Populus trichocarpa) | ||||||||
| Sorry, there are no predicted associations at the current settings. |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 282 | |||
| pfam07052 | 102 | pfam07052, Hep_59, Hepatocellular carcinoma-associ | 7e-30 |
| >gnl|CDD|219280 pfam07052, Hep_59, Hepatocellular carcinoma-associated antigen 59 | Back alignment and domain information |
|---|
Score = 108 bits (271), Expect = 7e-30
Identities = 56/103 (54%), Positives = 69/103 (66%), Gaps = 5/103 (4%)
Query: 95 DTFAQETAVMVEDPNMLKYVEQELAKKRGKNID----VNDRVENDLKHAEDELYKIPEHL 150
D F +ET ED +M+KY+E+ELAK++G+ D +D+ N+LK AEDELY +PEHL
Sbjct: 1 DRFTKETGQRDEDAHMMKYIEEELAKRKGQAADADAEDSDKAANNLKRAEDELYALPEHL 60
Query: 151 KVKKRNSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQ 193
KVKK N E S Q GI EV L IE KLKNIE TE AK+ L
Sbjct: 61 KVKK-NEEMLSNQMLGGIPEVDLGIEAKLKNIERTEKAKRRLL 102
|
This family represents a conserved region approximately 100 residues long within mammalian hepatocellular carcinoma-associated antigen 59 and similar proteins. Family members are found in a variety of eukaryotes, mainly as hypothetical proteins. Length = 102 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 282 | |||
| KOG3345 | 275 | consensus Uncharacterized conserved protein [Funct | 100.0 | |
| PF07052 | 104 | Hep_59: Hepatocellular carcinoma-associated antige | 100.0 |
| >KOG3345 consensus Uncharacterized conserved protein [Function unknown] | Back alignment and domain information |
|---|
Probab=100.00 E-value=1.6e-72 Score=504.88 Aligned_cols=257 Identities=42% Similarity=0.598 Sum_probs=212.5
Q ss_pred ccccccccccccCCcchhhccCCCCchHHHHhhcHHHHHHHHHhhccCCCcccccccccccccc--------CCCCcccc
Q 023468 7 QKKEKKKNFRKRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQRERKSGIPAIPSALQSAAAA--------GGGGLTKV 78 (282)
Q Consensus 7 ~~~kkkK~~RkR~~~~d~e~~~~~~e~eee~~~~leeikelQKlR~R~~GI~a~~la~g~~~~~--------g~g~~~~~ 78 (282)
.|+|.+..+++|...++++.. ++++++.+++|+||++||+||.|++||+...|++|.+.++ +||++.+.
T Consensus 4 ~frk~~~k~~~rk~l~~~~~~---~eEeeerrl~leeikelQrlReRknGvs~v~lavgeeala~~d~~qm~~Ggmv~~k 80 (275)
T KOG3345|consen 4 IFRKVKFKIQKRKNLEDDDVA---AEEEEERRLALEEIKELQRLRERKNGVSEVELAVGEEALALEDPFQMKTGGMVDMK 80 (275)
T ss_pred chhhhHHHHHHhhcchhhhhh---hhhHHHHHhhHHHHHHHHHHHhccCCcchhhhhhhhhhhhccCchhcccCcceehH
Confidence 455666667777766555422 5567788999999999999999999999999999974432 66666554
Q ss_pred ccccCCCccc-hhhhhhcchhhhccccCcchhHHHHHHHHHHHhhCCCCCCCccccccccchhhhhhcCCcccccc--cc
Q 023468 79 SEKNEGDGEK-DELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNIDVNDRVENDLKHAEDELYKIPEHLKVK--KR 155 (282)
Q Consensus 79 ~~k~~~~~e~-~~~~l~~~Ft~eT~~~DeD~hMmkYIE~eLaKRk~~~~~~~~~~~~~~~~~~~~Ly~~pe~l~v~--kr 155 (282)
..+..+..+. ..-+|+.+|+++|+.+|+|++||+|||++|+||+|..+.+.+.+.+.+++++||||++|+||+++ ++
T Consensus 81 K~k~~~mkeas~eqdlgtqFs~ET~~rdEdeem~KYIeqeLqkrkg~~~~e~eeqk~k~~naED~Ly~lpdhlk~ssskk 160 (275)
T KOG3345|consen 81 KLKERGMKEASEEQDLGTQFSAETNRRDEDEEMMKYIEQELQKRKGTSIVEHEEQKNKPKNAEDCLYELPDHLKVSSSKK 160 (275)
T ss_pred HhhhhhhhHHHHhhhhhHHHHHHHhcccccHHHHHHHHHHHHHhhccchhhhhhhhcccCCHHHHHHhhhhhhccccccc
Confidence 4443332222 23369999999999999999999999999999999888888888899999999999999999998 88
Q ss_pred cccccccccccCCccccCChhHHhhcHHHHHHHHHHHHHHHhccCCCCCCCCCCccchhhhhcchhhHHHHHhhCccccc
Q 023468 156 NSEESSTQWTTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLMGRAKSDFSIPSSYSADYFQRGRDYAEKLRREHPELYK 235 (282)
Q Consensus 156 ~~e~~s~~mlsgI~EVDLGieaKlkNIe~TE~AKr~L~e~~~~~~~~~~~~~p~n~a~n~~qh~r~~~~~~~~~~~~~~~ 235 (282)
++||+|+|||+|||||||||++||+||++||+|||+|+++++++++++++|||+|||||||||||||++.+.+.
T Consensus 161 s~EmlS~qml~GIpEVdLgI~~kikNI~eTE~aKrlLqe~~~~~~~g~~~~ipt~~a~ny~qr~rf~~e~~~~~------ 234 (275)
T KOG3345|consen 161 SEEMLSNQMLSGIPEVDLGIDAKIKNIEETEDAKRLLQEQQNKKKDGETSFIPTNMAVNYVQRNRFYHEELNRD------ 234 (275)
T ss_pred hHHHHHHHHHcCCcccccchhhhhhhHHhhHHHHHHHHHHHhccCCCCCCCCccchhhhHHHhhhhhHHHhccc------
Confidence 99999999999999999999999999999999999999999999999999999999999999999999998652
Q ss_pred ccCCCCCCCCCCCCCCCC---CCCCCCCCcchHHHHHHHHHHHHh
Q 023468 236 DRGSQDDGAGSRPTDNST---DAAGSRQAATDQFMLERFRKRERH 277 (282)
Q Consensus 236 ~~~~~~~~~~~~~~~~~~---~~~~~~~~ATDd~~~erFrK~~r~ 277 (282)
+..++ ..+.++.+ ..++.+++|||||||||||||+|+
T Consensus 235 ----r~~d~-e~~ep~~s~~~~~np~~~~atddyhyekfrk~~rr 274 (275)
T KOG3345|consen 235 ----RQADT-EKAEPSTSPSTNNNPDRQAATDDYHYEKFRKRERR 274 (275)
T ss_pred ----hhccc-cccCCCCCccccCChhhhhhhhHHHHHHHHHHhcc
Confidence 11111 22222222 244669999999999999999994
|
|
| >PF07052 Hep_59: Hepatocellular carcinoma-associated antigen 59; InterPro: IPR010756 This family represents a conserved region approximately 100 residues long within mammalian hepatocellular carcinoma-associated antigen 59 and similar proteins | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 282 | |||
| 1vt4_I | 1221 | APAF-1 related killer DARK; drosophila apoptosome, | 4e-05 | |
| 1vt4_I | 1221 | APAF-1 related killer DARK; drosophila apoptosome, | 5e-04 |
| >1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 | Back alignment and structure |
|---|
Score = 44.1 bits (103), Expect = 4e-05
Identities = 33/258 (12%), Positives = 66/258 (25%), Gaps = 81/258 (31%)
Query: 17 KRSYEEEEETTNKLSDDEEERRLALEEIKFLQKQ-----RERKSGIPAIPSALQSAAAAG 71
++ + + L+ DE + L K+L + RE + P
Sbjct: 285 TTTHISLDHHSMTLTPDEVKSLLL----KYLDCRPQDLPREVLTTNP------------- 327
Query: 72 GGGLTKVSEKNEGDGEKDELVLQDTFAQETAVMVEDPNMLKYVEQELAKKRGKNID--VN 129
L A + + + K I+ +N
Sbjct: 328 -------------------RRL-SIIA---ESIRDGLATWDNWKHVNCDKLTTIIESSLN 364
Query: 130 DRVENDLKHAEDELY------KIPEH-LKV------------------KKRNSEESSTQW 164
+ + D L IP L + K E+ +
Sbjct: 365 VLEPAEYRKMFDRLSVFPPSAHIPTILLSLIWFDVIKSDVMVVVNKLHKYSLVEKQPKES 424
Query: 165 TTGIAEVQLPIEYKLKNIEETEAAKKLLQEKRLM-GRAKSDFSIPS--SYSADYFQ---R 218
T I + L E K+K E + ++ + D P Y + +
Sbjct: 425 TISIPSIYL--ELKVKLENEYALHRSIVDHYNIPKTFDSDDLIPPYLDQYFYSHIGHHLK 482
Query: 219 GRDYAEKLRREHPELYKD 236
++ E++ ++ D
Sbjct: 483 NIEHPERM-TLFRMVFLD 499
|
| >1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 | Back alignment and structure |
|---|
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00