Citrus Sinensis ID: 023802
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 277 | ||||||
| 356568226 | 316 | PREDICTED: uncharacterized protein LOC10 | 0.909 | 0.797 | 0.518 | 1e-66 | |
| 297735538 | 344 | unnamed protein product [Vitis vinifera] | 0.971 | 0.781 | 0.486 | 8e-66 | |
| 225439699 | 339 | PREDICTED: uncharacterized protein LOC10 | 0.971 | 0.793 | 0.486 | 9e-66 | |
| 388520637 | 310 | unknown [Medicago truncatula] | 0.891 | 0.796 | 0.513 | 1e-65 | |
| 357506683 | 308 | hypothetical protein MTR_7g073340 [Medic | 0.891 | 0.801 | 0.513 | 1e-65 | |
| 147828305 | 385 | hypothetical protein VITISV_015389 [Viti | 0.935 | 0.672 | 0.493 | 2e-65 | |
| 224087865 | 331 | predicted protein [Populus trichocarpa] | 0.974 | 0.815 | 0.489 | 3e-64 | |
| 224139370 | 228 | predicted protein [Populus trichocarpa] | 0.783 | 0.951 | 0.557 | 3e-62 | |
| 18404784 | 335 | Transmembrane proteins 14C [Arabidopsis | 0.963 | 0.797 | 0.501 | 1e-61 | |
| 21554522 | 334 | putative non-green plastid inner envelop | 0.963 | 0.799 | 0.501 | 1e-61 |
| >gi|356568226|ref|XP_003552314.1| PREDICTED: uncharacterized protein LOC100808049 [Glycine max] | Back alignment and taxonomy information |
|---|
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 139/268 (51%), Positives = 186/268 (69%), Gaps = 16/268 (5%)
Query: 16 NPRLSFGSEALYGSPCMRV-QSRAQRVSIAPRAKVAA---KGIGVGFQLPERRRSSFYRS 71
NP+LS +L P R S + PRA AA G+ F RR+
Sbjct: 13 NPKLSHSHSSL---PFSRFPHSLNFHPLLKPRAVNAAVPPNGLAACFLTLHRRKPL---- 65
Query: 72 TVVFSASHEESSS-----EESSEIKASAKESEEAWKQTLAAFREQAIKMQSVSQEAYEIY 126
TV F+ASH++S E+ ++ A ++ES+EAWKQ L FREQA K Q VSQEAYE+Y
Sbjct: 66 TVAFTASHQDSEHREIEVEKERDVHAGSEESQEAWKQVLETFREQAEKFQGVSQEAYEVY 125
Query: 127 SKRATAILKETAEQLKIEAEKARKDLTVVAKELSEDGKKYLTEATENNPEVKEIVETFTL 186
SK+A IL + EQLK+ A+K + +L+V AKE++++GK+YL+ A +++PEVKEIVETFT
Sbjct: 126 SKKAAVILTDATEQLKVLADKTKNELSVAAKEITDEGKEYLSAAADSSPEVKEIVETFTS 185
Query: 187 PTEDVKEFSTLRDFYLGIPYGLLLTLGGFISFMVTGSIPAIRFGVILGGTLLALSIASLR 246
P ED+++ S +RDFY+G+PYGL+L+LGGF+SFMVTG+ AIRFGVILGG LLALSI SL+
Sbjct: 186 PPEDIQKLSGVRDFYVGVPYGLMLSLGGFLSFMVTGNTAAIRFGVILGGVLLALSILSLK 245
Query: 247 SHKKGKTSPVALKGQAGQRVVYFTIEIT 274
S+KKG+TSP+ALKGQA + F EI+
Sbjct: 246 SYKKGRTSPLALKGQAAIASILFLREIS 273
|
Source: Glycine max Species: Glycine max Genus: Glycine Family: Fabaceae Order: Fabales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|297735538|emb|CBI18032.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|225439699|ref|XP_002271868.1| PREDICTED: uncharacterized protein LOC100243025 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|388520637|gb|AFK48380.1| unknown [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|357506683|ref|XP_003623630.1| hypothetical protein MTR_7g073340 [Medicago truncatula] gi|355498645|gb|AES79848.1| hypothetical protein MTR_7g073340 [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|147828305|emb|CAN66482.1| hypothetical protein VITISV_015389 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|224087865|ref|XP_002308255.1| predicted protein [Populus trichocarpa] gi|222854231|gb|EEE91778.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|224139370|ref|XP_002323079.1| predicted protein [Populus trichocarpa] gi|222867709|gb|EEF04840.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|18404784|ref|NP_565892.1| Transmembrane proteins 14C [Arabidopsis thaliana] gi|14334694|gb|AAK59525.1| putative non-green plastid inner envelope membrane protein [Arabidopsis thaliana] gi|20197395|gb|AAC67363.2| putative non-green plastid inner envelope membrane protein [Arabidopsis thaliana] gi|22136954|gb|AAM91706.1| putative non-green plastid inner envelope membrane protein [Arabidopsis thaliana] gi|330254455|gb|AEC09549.1| Transmembrane proteins 14C [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|21554522|gb|AAM63598.1| putative non-green plastid inner envelope membrane protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 277 | ||||||
| TAIR|locus:2064206 | 335 | AT2G38550 "AT2G38550" [Arabido | 0.963 | 0.797 | 0.462 | 1.3e-52 | |
| TAIR|locus:2082578 | 226 | AT3G57280 "AT3G57280" [Arabido | 0.516 | 0.632 | 0.324 | 7.8e-10 | |
| TAIR|locus:2033177 | 119 | AT1G50740 "AT1G50740" [Arabido | 0.220 | 0.512 | 0.393 | 3.9e-06 | |
| TAIR|locus:2085780 | 119 | AT3G20510 [Arabidopsis thalian | 0.223 | 0.521 | 0.403 | 3.9e-06 |
| TAIR|locus:2064206 AT2G38550 "AT2G38550" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 545 (196.9 bits), Expect = 1.3e-52, P = 1.3e-52
Identities = 131/283 (46%), Positives = 171/283 (60%)
Query: 1 MSVSIELLSTKNPNPNPRLSFGSEA---LYGSPCMRVQSRAQRVSIAPRAKVAAKGIGVG 57
MS+ +EL+S +NPN S L P + SR R APRA V+ GI G
Sbjct: 2 MSIPMELMSIRNPNSTLLYRAHSRPPVKLCAPPRSLLPSR--RHFSAPRAVVSYPGIRFG 59
Query: 58 FQLPERRRSSFYRSTVVFXXXXXXXXXXXXXXXXXXXX------XXXXXWKQTLAAFREQ 111
F PE + RS V F WKQTL +F+EQ
Sbjct: 60 FTSPEVLLN---RSVVAFAASHEDSGESGVEVGKEKSDIDVEDDTSKEAWKQTLESFKEQ 116
Query: 112 AIKMQSVSQEAYEIYSKRATAILKETAEQLKIEAEKARKDLTVVAKELSEDGKKYLTEAT 171
KMQSVS EAY + S++A +LKET+EQL+I+AEKA+++L AK +SE+G++Y+ +A
Sbjct: 117 VSKMQSVSSEAYSVNSQKAMTVLKETSEQLRIQAEKAKEELGTKAKVVSEEGREYILKAA 176
Query: 172 ENNP-EVKEIVETFTLPTEDVKEFSTLRDFYLGIPYGLLLTLGGFISFMVTGSIPAIRFG 230
E +P +VKEIVE F TED+K S DF++GIPYGLLL +GGFI+FMV+GSIPAIRFG
Sbjct: 177 EESPSDVKEIVEAFA-STEDLKNVSRANDFHVGIPYGLLLLVGGFINFMVSGSIPAIRFG 235
Query: 231 VILGGTLLALSIASLRSHKKGKTSPVALKGQAGQRVVYFTIEI 273
VILGG L ALS+ASL+SH+KG++S LKGQ + F E+
Sbjct: 236 VILGGALFALSLASLKSHRKGESSTKFLKGQMAIVAIIFLREL 278
|
|
| TAIR|locus:2082578 AT3G57280 "AT3G57280" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2033177 AT1G50740 "AT1G50740" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2085780 AT3G20510 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Fail to connect to STRING server
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 277 | |||
| pfam03647 | 96 | pfam03647, Tmemb_14, Transmembrane proteins 14C | 4e-10 |
| >gnl|CDD|202717 pfam03647, Tmemb_14, Transmembrane proteins 14C | Back alignment and domain information |
|---|
Score = 55.3 bits (134), Expect = 4e-10
Identities = 17/63 (26%), Positives = 30/63 (47%), Gaps = 2/63 (3%)
Query: 200 FYLGIPYGLLLTLGGFISFMVTGSIPAIRFGVILGGTLLALSIASLRSHKKGKTSPVALK 259
+ GI Y L+ +GG + + GS+P++ G+I G L + ++ G +AL
Sbjct: 1 DHPGIAYAALVAIGGIMGYAKAGSVPSLIAGLIFGALLGYGAYLLSQNQPYG--LELALL 58
Query: 260 GQA 262
A
Sbjct: 59 ASA 61
|
This family of short membrane proteins are as yet uncharacterized. Length = 96 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 277 | |||
| KOG4267 | 110 | consensus Predicted membrane protein [Function unk | 99.87 | |
| PF03647 | 96 | Tmemb_14: Transmembrane proteins 14C; InterPro: IP | 99.61 | |
| COG5548 | 105 | Small integral membrane protein [Function unknown] | 98.7 | |
| PF06570 | 206 | DUF1129: Protein of unknown function (DUF1129); In | 93.44 | |
| PRK05771 | 646 | V-type ATP synthase subunit I; Validated | 88.43 | |
| PRK06569 | 155 | F0F1 ATP synthase subunit B'; Validated | 86.29 | |
| PRK10884 | 206 | SH3 domain-containing protein; Provisional | 83.12 | |
| PF06738 | 193 | DUF1212: Protein of unknown function (DUF1212); In | 82.17 | |
| PF05478 | 806 | Prominin: Prominin; InterPro: IPR008795 The promin | 81.18 | |
| PF04156 | 191 | IncA: IncA protein; InterPro: IPR007285 Chlamydia | 80.46 |
| >KOG4267 consensus Predicted membrane protein [Function unknown] | Back alignment and domain information |
|---|
Probab=99.87 E-value=1.4e-22 Score=166.21 Aligned_cols=77 Identities=31% Similarity=0.455 Sum_probs=73.7
Q ss_pred cccceechhhHHHHHHhhHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHHhhCCC-CchhHHH-HHHHHHHHHHHHHhh
Q 023802 197 LRDFYLGIPYGLLLTLGGFISFMVTGSIPAIRFGVILGGTLLALSIASLRSHKKGK-TSPVALK-GQAGQRVVYFTIEIT 274 (277)
Q Consensus 197 mHDFclgipYg~LVa~GGiiGY~kaGSi~SL~aGl~fG~lLl~aGygSLk~~k~gk-~s~l~L~-~SavLAavm~~R~~~ 274 (277)
|||||+||||++||++||||||+|+||++||++|++||++ +||+||+.|+|++ ++++++. ++++|+++|++|.++
T Consensus 1 m~~~~f~~~y~~Lv~~GGliGY~kkgSi~SL~aGl~~G~l---~g~~s~~l~~~~~~~~~~~l~~~s~~L~gvmg~R~~~ 77 (110)
T KOG4267|consen 1 MHDDCFGIPYAALVTVGGLIGYLKKGSIPSLAAGLLFGAL---AGYGSYLLSRDKKGGSLVALGGTSAALLGVMGQRFYR 77 (110)
T ss_pred CCchhhhhhHHHHHHhcceeeeeecCCcchHHHHHHHHHH---HHHHHHHhhcCCCcCchhHHHHHHHHHHHHHhhhhhc
Confidence 7999999999999999999999999999999999999988 9999999999999 9999999 699999999999876
Q ss_pred hc
Q 023802 275 KK 276 (277)
Q Consensus 275 ~~ 276 (277)
.+
T Consensus 78 s~ 79 (110)
T KOG4267|consen 78 SR 79 (110)
T ss_pred cC
Confidence 53
|
|
| >PF03647 Tmemb_14: Transmembrane proteins 14C; InterPro: IPR005349 This family of short membrane proteins is as yet uncharacterised | Back alignment and domain information |
|---|
| >COG5548 Small integral membrane protein [Function unknown] | Back alignment and domain information |
|---|
| >PF06570 DUF1129: Protein of unknown function (DUF1129); InterPro: IPR009214 There are currently no experimental data for members of this group or their homologues | Back alignment and domain information |
|---|
| >PRK05771 V-type ATP synthase subunit I; Validated | Back alignment and domain information |
|---|
| >PRK06569 F0F1 ATP synthase subunit B'; Validated | Back alignment and domain information |
|---|
| >PRK10884 SH3 domain-containing protein; Provisional | Back alignment and domain information |
|---|
| >PF06738 DUF1212: Protein of unknown function (DUF1212); InterPro: IPR010619 This entry represents a predicted domain found within a number of hypothetical proteins of unknown function found in eukaryotes, bacteria and archaea | Back alignment and domain information |
|---|
| >PF05478 Prominin: Prominin; InterPro: IPR008795 The prominins are an emerging family of proteins that, among the multispan membrane proteins, display a novel topology | Back alignment and domain information |
|---|
| >PF04156 IncA: IncA protein; InterPro: IPR007285 Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 277 | |||
| 2loo_A | 108 | Transmembrane protein 14A; helical bundle; NMR {Ho | 8e-04 |
| >2loo_A Transmembrane protein 14A; helical bundle; NMR {Homo sapiens} PDB: 2lop_A Length = 108 | Back alignment and structure |
|---|
Score = 37.4 bits (86), Expect = 8e-04
Identities = 12/58 (20%), Positives = 23/58 (39%), Gaps = 3/58 (5%)
Query: 201 YLGIPYGLLLTLGGFISFMVTGSIPAIRFGVILGGTLLALSIASLRSHKKGKTSPVAL 258
+G Y L+T G + G +P++ G+ +G + R + V+L
Sbjct: 12 LIGFGYAALVTFGSIFGYKRRGGVPSLIAGLFVGCLAG---YGAYRVSNDKRDVKVSL 66
|
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 277 | |||
| 2loo_A | 108 | Transmembrane protein 14A; helical bundle; NMR {Ho | 99.43 | |
| 2los_A | 121 | Transmembrane protein 14C; paramagnetic relaxation | 99.41 |
| >2loo_A Transmembrane protein 14A; helical bundle; NMR {Homo sapiens} PDB: 2lop_A | Back alignment and structure |
|---|
Probab=99.43 E-value=2.6e-17 Score=132.74 Aligned_cols=76 Identities=16% Similarity=0.221 Sum_probs=70.3
Q ss_pred ccccceechhhHHHHHHhhHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhHHHHHHHHHHHHHHHHhh
Q 023802 196 TLRDFYLGIPYGLLLTLGGFISFMVTGSIPAIRFGVILGGTLLALSIASLRSHKKGKTSPVALKGQAGQRVVYFTIEIT 274 (277)
Q Consensus 196 ~mHDFclgipYg~LVa~GGiiGY~kaGSi~SL~aGl~fG~lLl~aGygSLk~~k~gk~s~l~L~~SavLAavm~~R~~~ 274 (277)
++-.+|++||||+++++||++||+|+||++||++|+++|++ +++++|+.|++++...+.+..+++++.+|+.|.++
T Consensus 7 ~~~~~~~~~~yg~Ll~~GGi~GY~k~gS~~SL~aG~~~G~l---l~~~~~~~~~~~~~~~l~l~~S~~L~~~m~~R~~~ 82 (108)
T 2loo_A 7 KVGMDLIGFGYAALVTFGSIFGYKRRGGVPSLIAGLFVGCL---AGYGAYRVSNDKRDVKVSLFTAFFLATIMGVRFKR 82 (108)
Confidence 34568999999999999999999999999999999999998 69999999999988888899999999999999875
|
| >2los_A Transmembrane protein 14C; paramagnetic relaxation enhancement, helic; NMR {Homo sapiens} | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00