Citrus Sinensis ID: 024104
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 272 | ||||||
| 118485488 | 268 | unknown [Populus trichocarpa] | 0.963 | 0.977 | 0.775 | 1e-117 | |
| 297743774 | 273 | unnamed protein product [Vitis vinifera] | 1.0 | 0.996 | 0.728 | 1e-112 | |
| 359478124 | 279 | PREDICTED: UPF0454 protein C12orf49 homo | 1.0 | 0.974 | 0.728 | 1e-112 | |
| 224112759 | 245 | predicted protein [Populus trichocarpa] | 0.900 | 1.0 | 0.779 | 1e-111 | |
| 356538903 | 281 | PREDICTED: uncharacterized protein LOC10 | 0.948 | 0.918 | 0.744 | 1e-111 | |
| 297817236 | 272 | hypothetical protein ARALYDRAFT_486406 [ | 1.0 | 1.0 | 0.698 | 1e-106 | |
| 388496984 | 272 | unknown [Lotus japonicus] | 0.933 | 0.933 | 0.721 | 1e-106 | |
| 255585501 | 251 | conserved hypothetical protein [Ricinus | 0.889 | 0.964 | 0.739 | 1e-104 | |
| 30694985 | 273 | uncharacterized protein [Arabidopsis tha | 0.948 | 0.945 | 0.709 | 1e-103 | |
| 242084782 | 276 | hypothetical protein SORBIDRAFT_08g00331 | 0.930 | 0.916 | 0.644 | 2e-95 |
| >gi|118485488|gb|ABK94599.1| unknown [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
Score = 427 bits (1097), Expect = e-117, Method: Compositional matrix adjust.
Identities = 204/263 (77%), Positives = 234/263 (88%), Gaps = 1/263 (0%)
Query: 11 LFFQLLTL-QFLSIISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCC 69
L QLL L QF SIISAIRKDIGF+++ C+ TVQGRYLLSDDNGYVCDA+S+DP SRCC
Sbjct: 6 LSLQLLILFQFSSIISAIRKDIGFEQSRSCRNTVQGRYLLSDDNGYVCDALSVDPPSRCC 65
Query: 70 PEKGEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDF 129
PE G +FSCQGCNL+SQCCNSYE+CVSCC++P+RT + QVL VKIAKP+TA +Y SVFDF
Sbjct: 66 PENGLRFSCQGCNLVSQCCNSYEFCVSCCLHPSRTQEAQVLKVKIAKPSTAVSYTSVFDF 125
Query: 130 CAGRCRHNSESVVHENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDS 189
CAGRCRHNSESVVHENAY SDFHHCFS+PSN+SGA TQLE RL GI+V+IG+QGESCDS
Sbjct: 126 CAGRCRHNSESVVHENAYRSDFHHCFSLPSNSSGANYTQLEARLGGINVVIGKQGESCDS 185
Query: 190 VCKSSGQSCVLNKLILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLY 249
VCKS+GQSCVLNKL++LNQC+++QKYMSCK GCLAS+G DQPAEV +DAP +LNP ACLY
Sbjct: 186 VCKSNGQSCVLNKLLVLNQCDVMQKYMSCKGGCLASIGTDQPAEVVEDAPRHLNPGACLY 245
Query: 250 TRIQSMLSCDGSHRHTRRLCPCA 272
TR QS+LSCDGS HTRRLCPCA
Sbjct: 246 TRTQSLLSCDGSLWHTRRLCPCA 268
|
Source: Populus trichocarpa Species: Populus trichocarpa Genus: Populus Family: Salicaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|297743774|emb|CBI36657.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|359478124|ref|XP_002264803.2| PREDICTED: UPF0454 protein C12orf49 homolog [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|224112759|ref|XP_002332721.1| predicted protein [Populus trichocarpa] gi|222832884|gb|EEE71361.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|356538903|ref|XP_003537940.1| PREDICTED: uncharacterized protein LOC100782588 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|297817236|ref|XP_002876501.1| hypothetical protein ARALYDRAFT_486406 [Arabidopsis lyrata subsp. lyrata] gi|297322339|gb|EFH52760.1| hypothetical protein ARALYDRAFT_486406 [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|388496984|gb|AFK36558.1| unknown [Lotus japonicus] | Back alignment and taxonomy information |
|---|
| >gi|255585501|ref|XP_002533442.1| conserved hypothetical protein [Ricinus communis] gi|223526704|gb|EEF28938.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|30694985|ref|NP_191498.2| uncharacterized protein [Arabidopsis thaliana] gi|42572729|ref|NP_974460.1| uncharacterized protein [Arabidopsis thaliana] gi|332646394|gb|AEE79915.1| uncharacterized protein [Arabidopsis thaliana] gi|332646395|gb|AEE79916.1| uncharacterized protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|242084782|ref|XP_002442816.1| hypothetical protein SORBIDRAFT_08g003310 [Sorghum bicolor] gi|241943509|gb|EES16654.1| hypothetical protein SORBIDRAFT_08g003310 [Sorghum bicolor] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 272 | ||||||
| TAIR|locus:2081192 | 273 | AT3G59390 "AT3G59390" [Arabido | 0.915 | 0.912 | 0.730 | 1.1e-105 | |
| MGI|MGI:1924042 | 205 | 2410131K14Rik "RIKEN cDNA 2410 | 0.422 | 0.560 | 0.365 | 2.2e-18 | |
| UNIPROTKB|Q17QN8 | 205 | Q17QN8 "UPF0454 protein C12orf | 0.422 | 0.560 | 0.373 | 5.8e-18 | |
| UNIPROTKB|Q9H741 | 205 | C12orf49 "UPF0454 protein C12o | 0.422 | 0.560 | 0.341 | 8.6e-17 | |
| UNIPROTKB|Q5ZJE4 | 205 | RCJMB04_18o22 "UPF0454 protein | 0.422 | 0.560 | 0.349 | 1.8e-16 | |
| ZFIN|ZDB-GENE-050522-367 | 199 | zgc:110063 "zgc:110063" [Danio | 0.415 | 0.567 | 0.352 | 6e-16 |
| TAIR|locus:2081192 AT3G59390 "AT3G59390" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 1046 (373.3 bits), Expect = 1.1e-105, P = 1.1e-105
Identities = 182/249 (73%), Positives = 218/249 (87%)
Query: 24 ISAIRKDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCPEKGEKFSCQGCNL 83
+SAIRKDIGF E C+TTVQGRYL+SDD G VCDA+S++ ++RCCP KGE+FSC GCN+
Sbjct: 25 VSAIRKDIGFLEERSCRTTVQGRYLISDDEGNVCDALSLESRTRCCPWKGERFSCHGCNI 84
Query: 84 LSQCCNSYEYCVSCCVNPARTLKEQVLNVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVH 143
LSQCCNSYE+CVSCC+NP++TL E+V+ VK+AKP T+GTY SVFDFCAGRCRHNSESVVH
Sbjct: 85 LSQCCNSYEFCVSCCLNPSQTLLEKVVKVKVAKPATSGTYKSVFDFCAGRCRHNSESVVH 144
Query: 144 ENAYLSDFHHCFSMPSNASGAGVTQLEGRLSGISVIIGRQGESCDSVCKSSGQSCVLNKL 203
ENAY S+FHHCFS+ SNASGA +TQ+E RL GI VI+G QG+SCD+VCKS GQ CV+NKL
Sbjct: 145 ENAYHSEFHHCFSLTSNASGANLTQVETRLLGIDVIVGSQGDSCDAVCKSRGQLCVMNKL 204
Query: 204 ILLNQCEIIQKYMSCKRGCLASMGADQPAEVADDAPSNLNPTACLYTRIQSMLSCDGSHR 263
LLNQC+++++YM+CK CLAS GADQPAEV +DAP +L P ACLYTR QS+LSCDGSH+
Sbjct: 205 SLLNQCDVMKRYMNCKGSCLASAGADQPAEVVEDAPRDLYPGACLYTRTQSLLSCDGSHQ 264
Query: 264 HTRRLCPCA 272
HTRRLCPCA
Sbjct: 265 HTRRLCPCA 273
|
|
| MGI|MGI:1924042 2410131K14Rik "RIKEN cDNA 2410131K14 gene" [Mus musculus (taxid:10090)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q17QN8 Q17QN8 "UPF0454 protein C12orf49 homolog" [Bos taurus (taxid:9913)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q9H741 C12orf49 "UPF0454 protein C12orf49" [Homo sapiens (taxid:9606)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q5ZJE4 RCJMB04_18o22 "UPF0454 protein C12orf49 homolog" [Gallus gallus (taxid:9031)] | Back alignment and assigned GO terms |
|---|
| ZFIN|ZDB-GENE-050522-367 zgc:110063 "zgc:110063" [Danio rerio (taxid:7955)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Fail to connect to STRING server
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 272 | |||
| pfam10218 | 132 | pfam10218, DUF2054, Uncharacterized conserved prot | 1e-44 |
| >gnl|CDD|118742 pfam10218, DUF2054, Uncharacterized conserved protein (DUF2054) | Back alignment and domain information |
|---|
Score = 146 bits (371), Expect = 1e-44
Identities = 63/133 (47%), Positives = 82/133 (61%), Gaps = 7/133 (5%)
Query: 29 KDIGFQETPFCKTTVQGRYLLSDDNGYVCDAVSMDPQSRCCP---EKGEKFSCQGCNLLS 85
KD G C+ +VQGR L++DD GYVC+ + + + CCP ++FSC+ CNL S
Sbjct: 1 KDRGLDPESSCRNSVQGRSLIADDRGYVCERLDL-LVNGCCPVEVPGTKRFSCRTCNLSS 59
Query: 86 QCCNSYEYCVSCCVNPA-RTLKEQVL-NVKIAKPTTAGTYGSVFDFCAGRCRHNSESVVH 143
QCC+ YEYCVSCC+NP+ R L E+VL +A+P T F+FC +CR NS SV H
Sbjct: 60 QCCSVYEYCVSCCLNPSKRPLLERVLAKAAVARPAVYATVTDHFEFCLAKCRTNSHSVQH 119
Query: 144 ENAYLS-DFHHCF 155
EN Y D HCF
Sbjct: 120 ENKYRDPDAKHCF 132
|
This entry contains 14 conserved cysteines, three of which are CC-dimers. The region is of approximately 200 residues in length but its function is unknown. Length = 132 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 272 | |||
| KOG3136 | 196 | consensus Uncharacterized conserved protein [Funct | 100.0 | |
| PF10218 | 131 | DUF2054: Uncharacterized conserved protein (DUF205 | 100.0 | |
| PF15024 | 559 | Glyco_transf_18: Glycosyltransferase family 18 | 99.97 |
| >KOG3136 consensus Uncharacterized conserved protein [Function unknown] | Back alignment and domain information |
|---|
Probab=100.00 E-value=6.5e-56 Score=382.14 Aligned_cols=167 Identities=51% Similarity=0.962 Sum_probs=152.6
Q ss_pred CCccccchhhHHHHHHHHHHHHH-H----------------HHhhcccCCCCC---c---cccccccCCeEEecCCCccc
Q 024104 1 MPKSRLNLLFLFFQLLTLQFLSI-I----------------SAIRKDIGFQET---P---FCKTTVQGRYLLSDDNGYVC 57 (272)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~-~----------------~~~~~d~~~~~~---~---~CrntvQg~~li~Dd~G~vC 57 (272)
|.|.+..++++..+|+..+.++- + +.||+|++|+++ + +||||||||+||+||+||||
T Consensus 1 lrkr~v~~l~~vLlLiygf~~~f~~e~~a~~~~~~~v~~~~~pi~w~~qfl~~~~sR~~~~CRnsvQGr~lisDe~G~VC 80 (196)
T KOG3136|consen 1 LRKRGVSLLFLVLLLIYGFLFPFRVEERAVRDRLLQVHDHNSPIRWDIQFLEENSSRPSNSCRNSVQGRYLISDEEGNVC 80 (196)
T ss_pred CcceehhHHHHHHHHHHHccChhhhhhhhhhhhhhhhhccCCCcchhhhhccccCCCcchhhhhhhcceeEeeccccchH
Confidence 67888888888777777776665 6 899999999988 3 79999999999999999999
Q ss_pred cCcccCCCCCCCCCC---CceEeccCCCCCCCCcccCccccccccCCCcch-HHHHhhhhccCCcccccccchhhhhhhh
Q 024104 58 DAVSMDPQSRCCPEK---GEKFSCQGCNLLSQCCNSYEYCVSCCVNPARTL-KEQVLNVKIAKPTTAGTYGSVFDFCAGR 133 (272)
Q Consensus 58 ~r~~l~~~~GCC~~~---~~~~sC~~C~~~~~CC~~YE~CVSCCl~P~k~~-~e~~l~~~~~~~~~~~~~~d~Fe~C~~k 133 (272)
.|++++ .|||||.. +++|+|+|||..++||++|||||||||+|.|++ ||++|+++++++.++++++||||+|++|
T Consensus 81 ~r~~~l-~ngCCn~~~p~~~~ysC~gCn~~s~CC~~YEyCVSCCL~P~kQ~LLErvl~a~va~~~~f~tv~dhFelCl~k 159 (196)
T KOG3136|consen 81 DRLLEL-RNGCCNWKGPSTKRYSCHGCNILSQCCNSYEYCVSCCLNPSKQTLLERVLKAKVAKPATFGTVKDHFELCLGK 159 (196)
T ss_pred HHHHHH-hcCCccccCCCcceeecCCCccccchhhHHHHHHHHHcCCchhHHHHHHHhcccccchhhhhHHHHHHHHHHH
Confidence 999998 79999985 679999999998999999999999999999864 5999999999999999999999999999
Q ss_pred ccccchhhhhhhhcccc-cCcccCCCCCCCCCCccc
Q 024104 134 CRHNSESVVHENAYLSD-FHHCFSMPSNASGAGVTQ 168 (272)
Q Consensus 134 CRT~s~sv~heN~Y~~~-~~~C~~~~~~~~g~~~~~ 168 (272)
|||+|+||+|||+|+|+ .|||||.+.|.||++.+|
T Consensus 160 CRtnS~SV~HEN~Yrdp~akhCyglt~~eS~~~l~p 195 (196)
T KOG3136|consen 160 CRTNSESVVHENAYRDPEAKHCYGLTSNESGANLTP 195 (196)
T ss_pred hcCCchhheecccccChHHHhhcccccccCccccCC
Confidence 99999999999999997 699999999999988544
|
|
| >PF10218 DUF2054: Uncharacterized conserved protein (DUF2054); InterPro: IPR019352 This family of proteins includes those belonging to UPF0454 | Back alignment and domain information |
|---|
| >PF15024 Glyco_transf_18: Glycosyltransferase family 18 | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00