Citrus Sinensis ID: 032597
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 137 | ||||||
| TAIR|locus:2158621 | 154 | FRO1 "FROSTBITE1" [Arabidopsis | 0.912 | 0.811 | 0.713 | 1.4e-46 | |
| ZFIN|ZDB-GENE-050522-421 | 168 | ndufs4 "NADH dehydrogenase (ub | 0.890 | 0.726 | 0.380 | 1.7e-20 | |
| UNIPROTKB|Q8QGH0 | 116 | NDUFS4 "NADH dehydrogenase" [G | 0.678 | 0.801 | 0.453 | 9.2e-20 | |
| RGD|1594380 | 175 | Ndufs4 "NADH dehydrogenase (ub | 0.700 | 0.548 | 0.450 | 9.2e-20 | |
| UNIPROTKB|J9PB72 | 175 | NDUFS4 "Uncharacterized protei | 0.700 | 0.548 | 0.441 | 3.1e-19 | |
| UNIPROTKB|O43181 | 175 | NDUFS4 "NADH dehydrogenase [ub | 0.700 | 0.548 | 0.441 | 3.1e-19 | |
| UNIPROTKB|Q0MQH0 | 175 | NDUFS4 "NADH dehydrogenase [ub | 0.700 | 0.548 | 0.441 | 3.1e-19 | |
| UNIPROTKB|P0CB95 | 175 | NDUFS4 "NADH dehydrogenase [ub | 0.700 | 0.548 | 0.450 | 4e-19 | |
| UNIPROTKB|P0CB96 | 175 | NDUFS4 "NADH dehydrogenase [ub | 0.700 | 0.548 | 0.450 | 4e-19 | |
| UNIPROTKB|Q02375 | 175 | NDUFS4 "NADH dehydrogenase [ub | 0.700 | 0.548 | 0.441 | 5.1e-19 |
| TAIR|locus:2158621 FRO1 "FROSTBITE1" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 488 (176.8 bits), Expect = 1.4e-46, P = 1.4e-46
Identities = 92/129 (71%), Positives = 109/129 (84%)
Query: 13 RTVR--GTVCPFSRAFSADALVE--VKPGEIGMVSGIPEEHLRRRVVIYTPARTATQQGS 68
RT+R T+ +R F+ DA+VE K GEIG VSGIPEEHL R+V+IY+PARTATQ GS
Sbjct: 9 RTIRIAATLRRVARPFATDAVVESDYKRGEIGKVSGIPEEHLSRKVIIYSPARTATQSGS 68
Query: 69 GKLGRWKINFMSTQKWENPLMGWTSTGDPYANVGDAGLSFDSKEAAREFAERHGWEYVVR 128
GKLG+WKINF+ST KWENPLMGWTSTGDPYANVGD+ L+FDS+EAA+ FAERHGW+Y V+
Sbjct: 69 GKLGKWKINFVSTLKWENPLMGWTSTGDPYANVGDSALAFDSEEAAKSFAERHGWDYKVK 128
Query: 129 KPHRPLLKV 137
KP+ PLLKV
Sbjct: 129 KPNTPLLKV 137
|
|
| ZFIN|ZDB-GENE-050522-421 ndufs4 "NADH dehydrogenase (ubiquinone) Fe-S protein 4, (NADH-coenzyme Q reductase)" [Danio rerio (taxid:7955)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q8QGH0 NDUFS4 "NADH dehydrogenase" [Gallus gallus (taxid:9031)] | Back alignment and assigned GO terms |
|---|
| RGD|1594380 Ndufs4 "NADH dehydrogenase (ubiquinone) Fe-S protein 4" [Rattus norvegicus (taxid:10116)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|J9PB72 NDUFS4 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|O43181 NDUFS4 "NADH dehydrogenase [ubiquinone] iron-sulfur protein 4, mitochondrial" [Homo sapiens (taxid:9606)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q0MQH0 NDUFS4 "NADH dehydrogenase [ubiquinone] iron-sulfur protein 4, mitochondrial" [Gorilla gorilla gorilla (taxid:9595)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|P0CB95 NDUFS4 "NADH dehydrogenase [ubiquinone] iron-sulfur protein 4, mitochondrial" [Pongo abelii (taxid:9601)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|P0CB96 NDUFS4 "NADH dehydrogenase [ubiquinone] iron-sulfur protein 4, mitochondrial" [Pongo pygmaeus (taxid:9600)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q02375 NDUFS4 "NADH dehydrogenase [ubiquinone] iron-sulfur protein 4, mitochondrial" [Bos taurus (taxid:9913)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| GSVIVG00010578001 | SubName- Full=Putative uncharacterized protein (Chromosome chr7 scaffold_270, whole genome shotgun sequence); (154 aa) | ||||||||||
(Vitis vinifera) | |||||||||||
| GSVIVG00028496001 | • | • | • | • | 0.676 | ||||||
| GSVIVG00022315001 | • | • | • | 0.676 | |||||||
| GSVIVG00023838001 | • | • | • | 0.667 | |||||||
| GSVIVG00036386001 | • | • | • | • | 0.644 | ||||||
| nad9 | • | • | • | 0.640 | |||||||
| GSVIVG00014042001 | • | • | • | 0.577 | |||||||
| GSVIVG00032166001 | • | • | • | 0.548 | |||||||
| GSVIVG00025731001 | • | • | • | 0.490 | |||||||
| GSVIVG00030686001 | • | • | 0.460 | ||||||||
| GSVIVG00037763001 | • | • | • | 0.437 |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 137 | |||
| pfam04800 | 101 | pfam04800, ETC_C1_NDUFA4, ETC complex I subunit co | 7e-36 |
| >gnl|CDD|218273 pfam04800, ETC_C1_NDUFA4, ETC complex I subunit conserved region | Back alignment and domain information |
|---|
Score = 118 bits (298), Expect = 7e-36
Identities = 44/86 (51%), Positives = 55/86 (63%), Gaps = 2/86 (2%)
Query: 52 RRVVIYTPARTATQQGSGKLGRWKINFMSTQKWENPLMGWTSTGDPYANVGDAGLSFDSK 111
R IY PAR A Q G + +W + F + +WENPLMGWTSTGDP +N + L+F +K
Sbjct: 1 RTARIYRPARNAMQSGRARTKKWTLEFDRSARWENPLMGWTSTGDPLSNQME--LTFPTK 58
Query: 112 EAAREFAERHGWEYVVRKPHRPLLKV 137
EAA FAER GWEY V +P+ P K
Sbjct: 59 EAAIAFAERQGWEYDVEEPNAPKAKP 84
|
Family of pankaryotic NADH-ubiquinone oxidoreductase subunits (EC:1.6.5.3) (EC:1.6.99.3) from complex I of the electron transport chain initially identified in Neurospora crassa as a 21 kDa protein. Length = 101 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 137 | |||
| KOG3389 | 178 | consensus NADH:ubiquinone oxidoreductase, NDUFS4/1 | 100.0 | |
| PF04800 | 101 | ETC_C1_NDUFA4: ETC complex I subunit conserved reg | 100.0 | |
| PF09954 | 62 | DUF2188: Uncharacterized protein conserved in bact | 86.41 | |
| PF08727 | 57 | P3A: Poliovirus 3A protein like; InterPro: IPR0148 | 80.6 | |
| PHA02552 | 151 | 4 head completion protein; Provisional | 80.32 |
| >KOG3389 consensus NADH:ubiquinone oxidoreductase, NDUFS4/18 kDa subunit [Energy production and conversion] | Back alignment and domain information |
|---|
Probab=100.00 E-value=1.4e-48 Score=304.08 Aligned_cols=115 Identities=60% Similarity=1.063 Sum_probs=108.5
Q ss_pred ccccccccc-ccc--ccccccccccCCCcccc-cceEEEecCCCCCCCCCCCCCCCcEEEccCCCCccCCCcCccCCCCc
Q 032597 22 FSRAFSADA-LVE--VKPGEIGMVSGIPEEHL-RRRVVIYTPARTATQQGSGKLGRWKINFMSTQKWENPLMGWTSTGDP 97 (137)
Q Consensus 22 ~~r~fs~d~-~~~--~~~~e~~~vSG~P~e~~-~R~vrIY~Pak~amQSG~~~~~~W~LeFe~~~rw~nPLMGWtsS~D~ 97 (137)
+.|.|+.|+ .|+ .+-+|||-|+|+|+||+ .|+||||.|+|++||||.+|+++|+|||+...+||||||||+|++||
T Consensus 42 la~~~~~Dak~ve~d~kld~i~~v~GvPeeH~~sRkvrIf~PAR~~tQSg~gntkkWkiefd~r~rWENPLMGWtsTaDP 121 (178)
T KOG3389|consen 42 LARPFATDAKVVESDYKLDEIGKVSGVPEEHLDSRKVRIFSPARTATQSGSGNTKKWKIEFDSRLRWENPLMGWTSTADP 121 (178)
T ss_pred ccccccccceeEeehhhhcccccccCCChHHhcceeEEEecchhhhhhcccCCccceEEEecchhhccCccccccccCCc
Confidence 578999998 444 56678999999999999 69999999999999999999999999999999999999999999999
Q ss_pred cCccCCceeeeCCHHHHHHHHHHcCCcEEEeCCCCCCCCC
Q 032597 98 YANVGDAGLSFDSKEAAREFAERHGWEYVVRKPHRPLLKV 137 (137)
Q Consensus 98 ~sqv~~~~L~F~SkE~AIayaek~Gw~Y~V~~P~~~~~~~ 137 (137)
++|| |+.|.|+|+|||++|||||||+|.|++|+++++||
T Consensus 122 lsNv-gm~L~F~tkEdA~sFaEkngW~ydveep~~pk~K~ 160 (178)
T KOG3389|consen 122 LSNV-GMALAFDTKEDAKSFAEKNGWDYDVEEPNTPKLKV 160 (178)
T ss_pred cccc-ceeeeeccHHHHHHHHHHcCCcccccCCCCCcccc
Confidence 9999 58999999999999999999999999999999875
|
|
| >PF04800 ETC_C1_NDUFA4: ETC complex I subunit conserved region; InterPro: IPR006885 This entry represents prokaryotic NADH-ubiquinone oxidoreductase subunits (1 | Back alignment and domain information |
|---|
| >PF09954 DUF2188: Uncharacterized protein conserved in bacteria (DUF2188); InterPro: IPR018691 This family has no known function | Back alignment and domain information |
|---|
| >PF08727 P3A: Poliovirus 3A protein like; InterPro: IPR014838 The 3A protein is found in positive-strand RNA viruses | Back alignment and domain information |
|---|
| >PHA02552 4 head completion protein; Provisional | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() | |
| Query | 137 | ||||
| 2jya_A | 106 | Nmr Solution Structure Of Protein Atu1810 From Agro | 5e-08 | ||
| 2lju_A | 108 | Solution Structure Of Putative Oxidoreductase From | 1e-07 |
| >pdb|2JYA|A Chain A, Nmr Solution Structure Of Protein Atu1810 From Agrobacterium Tumefaciens. Northeast Structural Genomics Consortium Target Atr23, Ontario Centre For Structural Proteomics Target Atc1776 Length = 106 | Back alignment and structure |
|
| >pdb|2LJU|A Chain A, Solution Structure Of Putative Oxidoreductase From Ehrlichia Chaffeensis, Seattle Structural Genomics Center For Infectious Disease (Ssgcid) Length = 108 | Back alignment and structure |
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 137 | |||
| 2lju_A | 108 | Putative oxidoreductase; structural genomics, seat | 4e-32 | |
| 2jya_A | 106 | AGR_C_3324P, uncharacterized protein ATU1810; prot | 8e-30 |
| >2lju_A Putative oxidoreductase; structural genomics, seattle structural GENO center for infectious disease, ssgcid; NMR {Ehrlichia chaffeensis} Length = 108 | Back alignment and structure |
|---|
Score = 109 bits (273), Expect = 4e-32
Identities = 31/97 (31%), Positives = 46/97 (47%), Gaps = 5/97 (5%)
Query: 43 SGIPEEHLR-RRVVIYTPARTATQQGSGKLGRWKINFM-STQKWENPLMGWTSTGDPYAN 100
G +E + R IY PA++ Q G KL WK+ F S ++ PLM WT + D
Sbjct: 2 PGSMQEQVSNVRARIYKPAKSTMQSGHSKLKAWKLEFEPSCTQYTEPLMNWTGSHDTKQQ 61
Query: 101 VGDAGLSFDSKEAAREFAERHGWEYVVRKPHRPLLKV 137
V LSF ++E A +A H +Y V + + +
Sbjct: 62 VC---LSFTTRELAIAYAVAHKIDYTVLQDNPRTIVP 95
|
| >2jya_A AGR_C_3324P, uncharacterized protein ATU1810; protein with unknown function ATU1810, ontario centre for ST proteomics, OCSP; NMR {Agrobacterium tumefaciens str} Length = 106 | Back alignment and structure |
|---|
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 137 | |||
| 2lju_A | 108 | Putative oxidoreductase; structural genomics, seat | 100.0 | |
| 2jya_A | 106 | AGR_C_3324P, uncharacterized protein ATU1810; prot | 100.0 | |
| 4b0z_A | 229 | RPN12, 26S proteasome regulatory subunit RPN12; pr | 87.43 | |
| 3t5v_A | 316 | Nuclear mRNA export protein SAC3; PCI, mRNA nuclea | 82.47 | |
| 4b4t_T | 274 | 26S proteasome regulatory subunit RPN12; hydrolase | 81.63 |
| >2lju_A Putative oxidoreductase; structural genomics, seattle structural GENO center for infectious disease, ssgcid; NMR {Ehrlichia chaffeensis} | Back alignment and structure |
|---|
Probab=100.00 E-value=3.1e-46 Score=274.81 Aligned_cols=92 Identities=32% Similarity=0.566 Sum_probs=84.4
Q ss_pred ccccCCCcccccceEEEecCCCCCCCCCCCCCCCcEEEccCC-CCccCCCcCccCCCCccCccCCceeeeCCHHHHHHHH
Q 032597 40 GMVSGIPEEHLRRRVVIYTPARTATQQGSGKLGRWKINFMST-QKWENPLMGWTSTGDPYANVGDAGLSFDSKEAAREFA 118 (137)
Q Consensus 40 ~~vSG~P~e~~~R~vrIY~Pak~amQSG~~~~~~W~LeFe~~-~rw~nPLMGWtsS~D~~sqv~~~~L~F~SkE~AIaya 118 (137)
|...|.|+. ++||||+|+|+|||||++++++|+|||++. ++|+|||||||||+||++|| +|+|+|+|+||+||
T Consensus 3 ~~~~~~~~~---~~arIy~Pak~amQSG~~~t~~W~lefe~~~~r~~nPLMGWtsS~D~~~qv---~L~F~skE~AiayA 76 (108)
T 2lju_A 3 GSMQEQVSN---VRARIYKPAKSTMQSGHSKLKAWKLEFEPSCTQYTEPLMNWTGSHDTKQQV---CLSFTTRELAIAYA 76 (108)
T ss_dssp --CCCCCCC---CEEEEECCCCCCSSSSCCSCCCEEEEECCCSSCCCCCCCCCSSSCCCCCCS---CEEESSHHHHHHHH
T ss_pred cccCCCCCC---CEEEEeCCCCCccccCCCCCCceEEEEecCCCCccCCCccccCCCCccccc---eEecCCHHHHHHHH
Confidence 345666665 899999999999999999999999999996 69999999999999999999 99999999999999
Q ss_pred HHcCCcEEEeCCCCCCCCC
Q 032597 119 ERHGWEYVVRKPHRPLLKV 137 (137)
Q Consensus 119 ek~Gw~Y~V~~P~~~~~~~ 137 (137)
|||||+|+|++|+.+++++
T Consensus 77 ek~G~~y~V~ep~~~~~r~ 95 (108)
T 2lju_A 77 VAHKIDYTVLQDNPRTIVP 95 (108)
T ss_dssp HHTTCEEEEECSSCCCCCC
T ss_pred HHcCCEEEEecCCcccCCc
Confidence 9999999999999988764
|
| >2jya_A AGR_C_3324P, uncharacterized protein ATU1810; protein with unknown function ATU1810, ontario centre for ST proteomics, OCSP; NMR {Agrobacterium tumefaciens str} | Back alignment and structure |
|---|
| >4b0z_A RPN12, 26S proteasome regulatory subunit RPN12; protein binding, proteasome ubitquitin; HET: SGM GOL; 1.58A {Schizosaccharomyces pombe} | Back alignment and structure |
|---|
| >3t5v_A Nuclear mRNA export protein SAC3; PCI, mRNA nuclear export, mRNA, nuclear, transcription; 2.90A {Saccharomyces cerevisiae} | Back alignment and structure |
|---|
| >4b4t_T 26S proteasome regulatory subunit RPN12; hydrolase, AAA-atpases, protein degradation, ubiquitin-prote pathway; 7.40A {Saccharomyces cerevisiae} | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 137 | |||
| d1k92a1 | 188 | Argininosuccinate synthetase, N-terminal domain {E | 88.87 |
| >d1k92a1 c.26.2.1 (A:1-188) Argininosuccinate synthetase, N-terminal domain {Escherichia coli [TaxId: 562]} | Back information, alignment and structure |
|---|
class: Alpha and beta proteins (a/b) fold: Adenine nucleotide alpha hydrolase-like superfamily: Adenine nucleotide alpha hydrolases-like family: N-type ATP pyrophosphatases domain: Argininosuccinate synthetase, N-terminal domain species: Escherichia coli [TaxId: 562]
Probab=88.87 E-value=0.059 Score=35.66 Aligned_cols=24 Identities=17% Similarity=0.347 Sum_probs=20.3
Q ss_pred eeeCCHHHHHHHHHHcCCcEEEeC
Q 032597 106 LSFDSKEAAREFAERHGWEYVVRK 129 (137)
Q Consensus 106 L~F~SkE~AIayaek~Gw~Y~V~~ 129 (137)
+.|.||+|=++||++||++|....
T Consensus 163 ~~~~sk~ei~~ya~~~gi~~~~~~ 186 (188)
T d1k92a1 163 DELGGRHEMSEFMIACGFDYKMSV 186 (188)
T ss_dssp HHSSSHHHHHHHHHHTTCCCCCCC
T ss_pred cccCCHHHHHHHHHHcCCCCCCCC
Confidence 446799999999999999997643
|