Citrus sinensis ID: 022355
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
| GI | Length | Definition | Q cover | H cover | Identity | E-value |
|---|---|---|---|---|---|---|
| Query | 298 | | | | | |
| 224074229 | 311 | predicted protein [Populus trichocarpa] | 1.0 | 0.958 | 0.761 | 1e-123 |
| 225438936 | 309 | PREDICTED: uncharacterized protein ycf36 | 0.993 | 0.957 | 0.749 | 1e-119 |
| 357453667 | 312 | hypothetical protein MTR_2g089840 [Medic | 1.0 | 0.955 | 0.737 | 1e-117 |
| 356496955 | 311 | PREDICTED: uncharacterized protein ycf36 | 0.996 | 0.954 | 0.733 | 1e-113 |
| 356541681 | 311 | PREDICTED: uncharacterized protein ycf36 | 0.996 | 0.954 | 0.730 | 1e-112 |
| 15240715 | 327 | uncharacterized protein [Arabidopsis tha | 0.959 | 0.874 | 0.701 | 1e-107 |
| 297794245 | 326 | hypothetical protein ARALYDRAFT_496863 [ | 0.942 | 0.861 | 0.712 | 1e-107 |
| 449463963 | 312 | PREDICTED: uncharacterized protein ycf36 | 0.996 | 0.951 | 0.661 | 1e-103 |
| 217073882 | 258 | unknown [Medicago truncatula] | 0.768 | 0.887 | 0.740 | 4e-93 |
| 449527867 | 237 | PREDICTED: uncharacterized protein ycf36 | 0.748 | 0.940 | 0.738 | 7e-88 |
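The Q cover, H cover, and Identity columns can be reproduced from the alignment statistics reported for each hit. The sketch below assumes (this is inferred from the numbers, not stated by the report) that both coverage values count aligned residue pairs, meaning alignment columns minus gap columns, divided by the query or hit length, and that Identity is identical columns over all alignment columns. The function name `alignment_summary` is hypothetical.

```python
def alignment_summary(identities, columns, gaps, query_len, hit_len):
    """Recompute the table columns from BLAST alignment statistics.

    Assumption (inferred, not documented by the report):
    coverage = (alignment columns - gap columns) / sequence length.
    """
    # Columns in which both sequences contribute a residue.
    aligned_pairs = columns - gaps
    return {
        "q_cover": round(aligned_pairs / query_len, 3),
        "h_cover": round(aligned_pairs / hit_len, 3),
        "identity": round(identities / columns, 3),
    }


# Top hit (GI 224074229): Identities = 236/310, Gaps = 12/310,
# query length 298, hit length 311.
top_hit = alignment_summary(236, 310, 12, 298, 311)
print(top_hit)  # {'q_cover': 1.0, 'h_cover': 0.958, 'identity': 0.761}
```

Under this assumption the output agrees with the first table row (1.0, 0.958, 0.761). The same arithmetic should be rechecked against other rows before relying on it.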
| >gi|224074229|ref|XP_002304310.1| predicted protein [Populus trichocarpa] gi|222841742|gb|EEE79289.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
Score = 447 bits (1151), Expect = e-123, Method: Compositional matrix adjust.
Identities = 236/310 (76%), Positives = 262/310 (84%), Gaps = 12/310 (3%)
Query: 1 MIRLNAYCSTLPSAAQVKLGSSYGSFIIKNYKARKSSWGVSVRALKDETNGGTSSSAGRS 60
M++LN CS +PS Q LG+S+ S+II+ ++A+K VSV+ALKD+TN GTSS GRS
Sbjct: 1 MLKLNVCCSLIPSPRQATLGASHRSWIIRYHRAQKLLPVVSVKALKDDTNEGTSSFRGRS 60
Query: 61 WEPGLEIEVPSEQRPVNEYSSLKDGVLYSWGELGQGPFILRLGGLWLVAFMVLGVPTAAA 120
WEPGLEIEVP EQRPVNEYSSLK+G LYSWGELG GPF+LRLGGLWLV F VLGVP AAA
Sbjct: 61 WEPGLEIEVPFEQRPVNEYSSLKEGPLYSWGELGPGPFLLRLGGLWLVTFTVLGVPIAAA 120
Query: 121 SFDPSREPLRFVLAAGTGTLFLVSLIVLRIYLGWSYVGDRLLSAVIPYEESGWYDGQMWV 180
+F+PSREPLRFVLAAGTGTLFLVSLI+LRIYLGWSYVGDRLLSAVIPYEESGWYDGQMWV
Sbjct: 121 TFNPSREPLRFVLAAGTGTLFLVSLIILRIYLGWSYVGDRLLSAVIPYEESGWYDGQMWV 180
Query: 181 KPPE------------VKPVIKMLKQTLVGTGALLVTATLLFIFATPVEQFFQSTMTTKE 228
KP E VKPVIKMLKQTLVGTGALLVTA +LFIFATPVE FFQ+T TKE
Sbjct: 181 KPTEVLARDRLLGSYKVKPVIKMLKQTLVGTGALLVTAVMLFIFATPVEDFFQTTFATKE 240
Query: 229 NPAIVPASKTKKNFNIRKEELLQLPAEVMSDDDLAAAAAEAADGRPVYCRDRYYRALAGG 288
NP+I PAS +N+RKEELL+LP EV++DDDLAAAAAEAA GRPVYCRDRYYRALAGG
Sbjct: 241 NPSIDPASGKNTKYNVRKEELLRLPVEVIADDDLAAAAAEAAGGRPVYCRDRYYRALAGG 300
Query: 289 QYCKWEDLVK 298
QYCKWEDL+
Sbjct: 301 QYCKWEDLLN 310
Source: Populus trichocarpa; Species: Populus trichocarpa; Genus: Populus; Family: Salicaceae; Order: Malpighiales; Class: (none listed); Phylum: Streptophyta; Superkingdom: Eukaryota
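The "Identities" and "Gaps" totals in an alignment header are column counts, so they can be checked block by block against the aligned rows. The following is a minimal sketch, assuming the Query and Sbjct rows of one block have been copied without their coordinates; `count_columns` is a hypothetical helper name, and the example rows are the block covering query residues 181 to 228 of the Populus trichocarpa alignment.

```python
def count_columns(query_row, sbjct_row):
    """Count identical and gapped columns in one aligned block."""
    if len(query_row) != len(sbjct_row):
        raise ValueError("aligned rows must have equal length")
    pairs = list(zip(query_row, sbjct_row))
    # A column is identical when both rows carry the same residue (not a gap).
    identical = sum(q == s and q != "-" for q, s in pairs)
    # A column is gapped when either row has a '-' placeholder.
    gaps = sum(q == "-" or s == "-" for q, s in pairs)
    return identical, gaps, len(pairs)


query_row = "KPPE------------VKPVIKMLKQTLVGTGALLVTATLLFIFATPVEQFFQSTMTTKE"
sbjct_row = "KPTEVLARDRLLGSYKVKPVIKMLKQTLVGTGALLVTAVMLFIFATPVEDFFQTTFATKE"

block = count_columns(query_row, sbjct_row)
print(block)  # (41, 12, 60)
```

This single block accounts for all 12 gap columns reported for the alignment (Gaps = 12/310). Summing the per-block identical counts over every block should give the header's 236.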
| >gi|225438936|ref|XP_002284127.1| PREDICTED: uncharacterized protein ycf36 [Vitis vinifera] gi|147834799|emb|CAN75014.1| hypothetical protein VITISV_039949 [Vitis vinifera] gi|296087349|emb|CBI33723.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|357453667|ref|XP_003597114.1| hypothetical protein MTR_2g089840 [Medicago truncatula] gi|357482685|ref|XP_003611629.1| hypothetical protein MTR_5g016100 [Medicago truncatula] gi|355486162|gb|AES67365.1| hypothetical protein MTR_2g089840 [Medicago truncatula] gi|355512964|gb|AES94587.1| hypothetical protein MTR_5g016100 [Medicago truncatula] gi|388506430|gb|AFK41281.1| unknown [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|356496955|ref|XP_003517330.1| PREDICTED: uncharacterized protein ycf36-like [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|356541681|ref|XP_003539302.1| PREDICTED: uncharacterized protein ycf36-like [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|15240715|ref|NP_201538.1| uncharacterized protein [Arabidopsis thaliana] gi|13430430|gb|AAK25837.1|AF360127_1 unknown protein [Arabidopsis thaliana] gi|9758436|dbj|BAB09022.1| unnamed protein product [Arabidopsis thaliana] gi|15293189|gb|AAK93705.1| unknown protein [Arabidopsis thaliana] gi|332010950|gb|AED98333.1| uncharacterized protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|297794245|ref|XP_002865007.1| hypothetical protein ARALYDRAFT_496863 [Arabidopsis lyrata subsp. lyrata] gi|297310842|gb|EFH41266.1| hypothetical protein ARALYDRAFT_496863 [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|449463963|ref|XP_004149699.1| PREDICTED: uncharacterized protein ycf36-like [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|217073882|gb|ACJ85301.1| unknown [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|449527867|ref|XP_004170930.1| PREDICTED: uncharacterized protein ycf36-like, partial [Cucumis sativus] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
| ID | Length | Definition | Q cover | H cover | Identity | E-value |
|---|---|---|---|---|---|---|
| Query | 298 | | | | | |
| TAIR|locus:2158197 | 327 | CGLD27 "AT5G67370" [Arabidopsi | 0.959 | 0.874 | 0.623 | 1.4e-94 |
| TAIR|locus:2142999 | 265 | AT5G11840 "AT5G11840" [Arabido | 0.406 | 0.456 | 0.492 | 3.2e-26 |
| TAIR|locus:2158197 CGLD27 "AT5G67370" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 941 (336.3 bits), Expect = 1.4e-94, P = 1.4e-94
Identities = 192/308 (62%), Positives = 220/308 (71%)
Query: 13 SAAQVKLGSSYGSFIIKNY--------KARKSSWGVSVRALKDETN--GGTSSSAGRSWE 62
S + KLGS Y S I Y K ++ VSV+A++D+ N GG+ S +G+SW+
Sbjct: 20 SNSSSKLGSYYDSSSIIKYGGISDVVGKKQELFLSVSVKAVEDKGNNGGGSMSFSGQSWD 79
Query: 63 PGLEIEVPSEQRPVNEYSSLKDGVLYSWGELGQGPFILRLGGLWLVAFMVLGVPTAAASF 122
P EIEVPS+QRPVNEYSSLK+G+LYSWGELG F +RLGGLWLV F VLGVP AAASF
Sbjct: 80 PSSEIEVPSDQRPVNEYSSLKEGMLYSWGELGPSEFFIRLGGLWLVTFTVLGVPVAAASF 139
Query: 123 DPSREPLRFVLAAGTGTLFLVSLIVLRIYLGWSYVGDRLLSAVIPYEESGWYDGQMWVKP 182
+PSREPLRF+LAAGTGTLFLVSLIVLRIYLGWSYVGDRLLSAVIPYEESGWYDGQMWVKP
Sbjct: 140 NPSREPLRFILAAGTGTLFLVSLIVLRIYLGWSYVGDRLLSAVIPYEESGWYDGQMWVKP 199
Query: 183 PEV------------KPVIKMLKQXXXXXXXXXXXXXXXFIFATPVEQFFQSTMTTKENP 230
PEV KPVIKMLKQ F+FATPVE FF++T+ + EN
Sbjct: 200 PEVLARDRLLGSYKVKPVIKMLKQTLIGTGALLVSAFVLFVFATPVEDFFKTTLGSTENQ 259
Query: 231 AIVPASKTKKNFNIRKEELLQLPAEVMSXXXXXXXXXXXXXGRPVYCRDRYYRALAGGQY 290
V S+T FNIRKE+LL+LP +V++ GRPVYCRDRYYRALAGGQY
Sbjct: 260 PEVSISRTSNKFNIRKEQLLRLPVDVVTDDDLAAAAAEAADGRPVYCRDRYYRALAGGQY 319
Query: 291 CKWEDLVK 298
CKWEDLVK
Sbjct: 320 CKWEDLVK 327
| TAIR|locus:2142999 AT5G11840 "AT5G11840" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Failed to connect to the Ezypred server.
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
grail3.0089000701: hypothetical protein (311 aa), Populus trichocarpa

| Protein | Evidence | Score |
|---|---|---|
| gw1.70.142.1 | • • | 0.560 |
| gw1.XIII.355.1 | • • | 0.545 |
| eugene3.00150423 | • • | 0.543 |
| gw1.V.3086.1 | • • | 0.523 |
| eugene3.00090150 | • • | 0.519 |
| gw1.IV.4064.1 | • • | 0.516 |
| gw1.123.192.1 | • • | 0.515 |
| estExt_Genewise1_v1.C_LG_II4195 | • • | 0.515 |
| gw1.X.329.1 | • • | 0.514 |
| gw1.XVIII.1253.1 | • • | 0.513 |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
| ID | Length | Definition | E-value |
|---|---|---|---|
| Query | 298 | | |
| pfam06799 | 145 | pfam06799, DUF1230, Protein of unknown function (D | 3e-61 |
| >gnl|CDD|191610 pfam06799, DUF1230, Protein of unknown function (DUF1230) | Back alignment and domain information |
|---|
Score = 190 bits (485), Expect = 3e-61
Identities = 71/144 (49%), Positives = 90/144 (62%), Gaps = 13/144 (9%)
Query: 69 VPSEQRPVNEYSSLKDGVLYSWGELGQGPFILRLGGLWLVAFMVLGVPTAAASFDPSREP 128
VP EQRP+NEY LKD +SW L + + +L +WL++F V G P A+ SF + P
Sbjct: 3 VPPEQRPLNEYEELKDSWFFSWPTLEKKGYYRKLLKIWLISFPVFG-PIASGSFPLRKSP 61
Query: 129 LRFVLAAGTGTLFLVSLIVLRIYLGWSYVGDRLLSAVIPYEESGWYDGQMWVKPPE---- 184
LR +L+A G L + LI+LR+YLGWSYV RLLS + YEESGWYDGQ+WVKPPE
Sbjct: 62 LRLILSAALGALLIPLLILLRLYLGWSYVRKRLLSETVEYEESGWYDGQVWVKPPEWLAR 121
Query: 185 --------VKPVIKMLKQTLVGTG 200
VKP++ LKQTL
Sbjct: 122 DRLIASYQVKPILNRLKQTLAILA 145
This family consists of several hypothetical plant and photosynthetic bacterial proteins of around 160 residues in length. The function of this family is unknown, although its species distribution suggests that the protein may play a part in photosynthesis. Length = 145
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
| ID | Length | Definition | Probability |
|---|---|---|---|
| Query | 298 | | |
| PF06799 | 144 | DUF1230: Protein of unknown function (DUF1230); In | 100.0 |
| >PF06799 DUF1230: Protein of unknown function (DUF1230); InterPro: IPR009631 This family represents Ycf36, which is found in plants, encoded in the genomes of algal chloroplasts and in cyanobacteria | Back alignment and domain information |
|---|
Probab=100.00 E-value=1.2e-67 Score=450.90 Aligned_cols=132 Identities=55% Similarity=1.046 Sum_probs=129.3
Q ss_pred CCCCCCCCchhHHHhhhcCCCccccCCCchhHHHHHHHHHHHHHHHhhhhcccceeCCCCchHHHHHHHHHHHHHHHHHH
Q 022355 67 IEVPSEQRPVNEYSSLKDGVLYSWGELGQGPFILRLGGLWLVAFMVLGVPTAAASFDPSREPLRFVLAAGTGTLFLVSLI 146 (298)
Q Consensus 67 CPVP~EQrPvNEY~eLk~SwfFsW~tl~~~~y~~rL~~~w~~~~~iv~~PIAa~Sf~p~~~pl~fiL~a~~ga~~l~~Lv 146 (298)
||||.||||+|||+|||+|||||||+++..+|.+||+++|++++++ ++|||++||+|+++|+||+++|++||+++++|+
T Consensus 1 CPVP~eQqP~nEy~~L~~S~~FsW~~~~~~~y~~~l~~~w~~~~~~-~~pia~~S~~~~~~~~~~~l~~~~ga~~~~~l~ 79 (144)
T PF06799_consen 1 CPVPPEQQPLNEYQELKESWFFSWPTLELKSYLKRLLWIWLISFLV-FGPIAAGSFPPEKDPLEFILSGAVGALLLLLLV 79 (144)
T ss_pred CccCccccCHHHHHHHhhCcCccCccCChHHHHHHHHHHHHHHHHH-HHhhheeecCccccHHHHHHHHHHHHHHHHHHH
Confidence 9999999999999999999999999999999999999999999995 779999999999999999999999999999999
Q ss_pred HHHHHhChHHHHhhhccCcccccccCccCCccccCCCC------------chHHHHHHHHHHHHH
Q 022355 147 VLRIYLGWSYVGDRLLSAVIPYEESGWYDGQMWVKPPE------------VKPVIKMLKQTLVGT 199 (298)
Q Consensus 147 lLRLYLGWsYV~dRLlSatVeYEESGWYDGQvWvKP~E------------VkPIL~RLk~Tl~~l 199 (298)
++||||||+||+|||+|+|||||||||||||+|+||+| |||||+|||+|+.++
T Consensus 80 llRlyLGW~YV~~RL~s~tV~YEESGWYDGQ~W~Kp~e~l~rDrLi~~yqVkPiL~RL~~tl~~l 144 (144)
T PF06799_consen 80 LLRLYLGWSYVGDRLLSATVEYEESGWYDGQVWVKPPEVLARDRLIGSYQVKPILSRLKQTLSIL 144 (144)
T ss_pred HHHHHhChHHHHhhhccCcccccccCccCCccccCCHHHHHHHHHHhhhhhhHHHHHHHHHHhcC
Confidence 99999999999999999999999999999999999999 999999999999763
As the family is found exclusively in phototrophic organisms, it may play a role in photosynthesis.
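The HHsearch hit summary above is a single line of `key=value` fields (Probab, E-value, Score, Aligned_cols, Identities, Similarity, Sum_probs). A minimal sketch of splitting such a line into a dictionary is shown below; `parse_hhsearch_summary` is a hypothetical helper, and the pattern assumes keys contain only letters, digits, underscores, and hyphens, and that values contain no spaces.

```python
import re


def parse_hhsearch_summary(line):
    """Split an HHsearch hit summary line into a dict of key -> value strings."""
    # [\w-]+ keeps the hyphen in "E-value"; \S+ takes the value up to the next space.
    return dict(re.findall(r"([\w-]+)=(\S+)", line))


summary = parse_hhsearch_summary(
    "Probab=100.00 E-value=1.2e-67 Score=450.90 Aligned_cols=132 "
    "Identities=55% Similarity=1.046 Sum_probs=129.3"
)

probability = float(summary["Probab"])            # 100.0
e_value = float(summary["E-value"])               # 1.2e-67
aligned_cols = int(summary["Aligned_cols"])       # 132
identity_pct = float(summary["Identities"].rstrip("%"))  # 55.0
print(probability, e_value, aligned_cols, identity_pct)
```

Values are left as strings in the dictionary so that fields with units, such as `Identities=55%`, can be converted explicitly by the caller.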
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00
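The structure searches above report hits only below an E-value of 0.005 (BLAST, RPS-BLAST) or above a probability of 80.00 (HHsearch). The sketch below shows one way such cutoffs could be applied when post-processing results. The helper names are hypothetical, and the use of strict comparisons is an assumption, since the report does not say how ties are handled. It also handles the BLAST shorthand `e-123`, which appears in the alignment headers and is not accepted by `float()` directly.

```python
E_VALUE_CUTOFF = 0.005      # from "No hit with e-value below 0.005"
PROBABILITY_CUTOFF = 80.0   # from "No hit with probability above 80.00"


def parse_evalue(text):
    """Parse an E-value string, including the shorthand 'e-123' (meaning 1e-123)."""
    text = text.strip()
    if text.lower().startswith("e"):
        text = "1" + text
    return float(text)


def passes_evalue(text):
    """True when the E-value is below the cutoff (strictness is an assumption)."""
    return parse_evalue(text) < E_VALUE_CUTOFF


def passes_probability(probability):
    """True when the HHsearch probability is above the cutoff."""
    return probability > PROBABILITY_CUTOFF


print(passes_evalue("e-123"), passes_evalue("0.01"), passes_probability(100.0))
```

For example, the domain hits reported earlier (E-value 3e-61 for pfam06799, probability 100.0 for PF06799) would pass these cutoffs, although in this report the cutoffs are stated only for the structure template and structure domain searches.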