Citrus Sinensis ID: 035376


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60------
MSWLIFEGLLPLGIIAAMLTIAGNAQYQIHKAAHGRPKHVGNDMWDVAMERRDKKLVEQLSGASSN
ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccHHHHHHHHHHHHHHHHHHccccc
ccHHHHHHHHHHHHHHHHHHHHHcHHHHHHHHHccccccccccHHHHHHHHHHHHHHHHHcccccc
mswlifegLLPLGIIAAMLTIAGNAQYQIHkaahgrpkhvgndmwDVAMERRDKKLVEQLSGASSN
MSWLIFEGLLPLGIIAAMLTIAGNAQYQIHKaahgrpkhVGNDMWDVAMERRDKKLveqlsgassn
MSWLIFEGLLPLGIIAAMLTIAGNAQYQIHKAAHGRPKHVGNDMWDVAMERRDKKLVEQLSGASSN
**WLIFEGLLPLGIIAAMLTIAGNAQYQIHKAAHGRPKHVGNDMWDV*******************
**WLIFEGLLPLGIIAAMLTIAGNAQYQIHKAAHGRPKHVGNDMWDVAMER***************
MSWLIFEGLLPLGIIAAMLTIAGNAQYQIHKAAHGRPKHVGNDMWDVAMERRDKKL**********
*SWLIFEGLLPLGIIAAMLTIAGNAQYQIHKAAHGRPKHVGNDMWDVAMERRDKKLVEQLS*****
oooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSoooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSoooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MSWLIFEGLLPLGIIAAMLTIAGNAQYQIHKAAHGRPKHVGNDMWDVAMERRDKKLVEQLSGASSN
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query66 2.2.26 [Sep-21-2011]
Q9C9Z565 NADH dehydrogenase [ubiqu yes no 0.984 1.0 0.661 4e-19
>sp|Q9C9Z5|NDUA1_ARATH NADH dehydrogenase [ubiquinone] 1 alpha subcomplex subunit 1 OS=Arabidopsis thaliana GN=At3g08610 PE=3 SV=1 Back     alignment and function desciption
 Score = 93.2 bits (230), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 43/65 (66%), Positives = 51/65 (78%)

Query: 1  MSWLIFEGLLPLGIIAAMLTIAGNAQYQIHKAAHGRPKHVGNDMWDVAMERRDKKLVEQL 60
          MS +  E +LPLGII  ML I GN+QY IHKA HGRPKH+G+D WDVAMERRDKK+VE+ 
Sbjct: 1  MSLVWLEAMLPLGIIGGMLCIMGNSQYYIHKAYHGRPKHIGHDEWDVAMERRDKKVVEKA 60

Query: 61 SGASS 65
          +  SS
Sbjct: 61 AAPSS 65


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.320    0.133    0.405 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 24,793,707
Number of Sequences: 539616
Number of extensions: 760748
Number of successful extensions: 1486
Number of sequences better than 100.0: 1
Number of HSP's better than 100.0 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 1485
Number of HSP's gapped (non-prelim): 1
length of query: 66
length of database: 191,569,459
effective HSP length: 38
effective length of query: 28
effective length of database: 171,064,051
effective search space: 4789793428
effective search space used: 4789793428
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 55 (25.8 bits)


Accessory subunit of the mitochondrial membrane respiratory chain NADH dehydrogenase (Complex I), that is believed not to be involved in catalysis. Complex I functions in the transfer of electrons from NADH to the respiratory chain. The immediate electron acceptor for the enzyme is believed to be ubiquinone.
Arabidopsis thaliana (taxid: 3702)

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query66
TAIR|locus:207781365 AT3G08610 "AT3G08610" [Arabido 0.984 1.0 0.661 6.5e-19
ASPGD|ASPL000002807486 AN11487 [Emericella nidulans ( 0.924 0.709 0.338 0.0001
UNIPROTKB|G4MSW586 MGG_04614 "Uncharacterized pro 0.909 0.697 0.380 0.00045
TAIR|locus:2077813 AT3G08610 "AT3G08610" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 227 (85.0 bits), Expect = 6.5e-19, P = 6.5e-19
 Identities = 43/65 (66%), Positives = 51/65 (78%)

Query:     1 MSWLIFEGLLPLGIIAAMLTIAGNAQYQIHKAAHGRPKHVGNDMWDVAMERRDKKLVEQL 60
             MS +  E +LPLGII  ML I GN+QY IHKA HGRPKH+G+D WDVAMERRDKK+VE+ 
Sbjct:     1 MSLVWLEAMLPLGIIGGMLCIMGNSQYYIHKAYHGRPKHIGHDEWDVAMERRDKKVVEKA 60

Query:    61 SGASS 65
             +  SS
Sbjct:    61 AAPSS 65




GO:0003674 "molecular_function" evidence=ND
GO:0008150 "biological_process" evidence=ND
GO:0005739 "mitochondrion" evidence=IDA
GO:0005747 "mitochondrial respiratory chain complex I" evidence=IDA
GO:0006511 "ubiquitin-dependent protein catabolic process" evidence=RCA
GO:0009853 "photorespiration" evidence=RCA
GO:0051788 "response to misfolded protein" evidence=RCA
GO:0080129 "proteasome core complex assembly" evidence=RCA
ASPGD|ASPL0000028074 AN11487 [Emericella nidulans (taxid:162425)] Back     alignment and assigned GO terms
UNIPROTKB|G4MSW5 MGG_04614 "Uncharacterized protein" [Magnaporthe oryzae 70-15 (taxid:242507)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q9C9Z5NDUA1_ARATHNo assigned EC number0.66150.98481.0yesno

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
GSVIVG00001539001
SubName- Full=Chromosome chr11 scaffold_118, whole genome shotgun sequence; (66 aa)
(Vitis vinifera)
Predicted Functional Partners:
GSVIVG00035362001
SubName- Full=Chromosome undetermined scaffold_77, whole genome shotgun sequence; (70 aa)
      0.693
GSVIVG00028496001
SubName- Full=Chromosome chr7 scaffold_44, whole genome shotgun sequence; (399 aa)
       0.485
GSVIVG00008632001
SubName- Full=Chromosome undetermined scaffold_203, whole genome shotgun sequence; (65 aa)
       0.417

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 66
cd00922136 Cyt_c_Oxidase_IV Cytochrome c oxidase subunit IV. 92.66
PF02936142 COX4: Cytochrome c oxidase subunit IV; InterPro: I 84.91
>cd00922 Cyt_c_Oxidase_IV Cytochrome c oxidase subunit IV Back     alignment and domain information
Probab=92.66  E-value=0.58  Score=30.80  Aligned_cols=50  Identities=20%  Similarity=0.146  Sum_probs=42.4

Q ss_pred             HHHHHHHHHHHHHHHhHHHHHHHHHHhCCCCccccCcHHHHHHHHhhhhh
Q 035376            7 EGLLPLGIIAAMLTIAGNAQYQIHKAAHGRPKHVGNDMWDVAMERRDKKL   56 (66)
Q Consensus         7 E~Lpp~gIi~~~~~v~G~~~~~i~~~~~Gk~~R~~~D~wd~~mm~RD~RL   56 (66)
                      |-..-++....++++.++....++.+.+|-+++.-.+.|+....||-+..
T Consensus        73 ewk~v~~~~~~~i~~s~~~~~~~r~~~~~~~P~T~t~Ewqea~~er~~~~  122 (136)
T cd00922          73 EWKTVFGGVLAFIGITGVIFGLQRAFVYGPKPHTFTEEWQEAQLERMLDM  122 (136)
T ss_pred             cHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCcCHHHHHHHHHHHHHh
Confidence            55667888888889999999999999888888888889999999987764



Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes. It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome. Found only in eukaryotes, subunit IV is the largest of the nuclear-encoded subunits. It binds ATP at the matrix side, leading to an allosteric inhibition of enzyme activity at high intramitochondrial ATP/ADP ratios. In mammals, subunit IV has a lung-specific isoform and a ubiquitously expressed isoform.

>PF02936 COX4: Cytochrome c oxidase subunit IV; InterPro: IPR004203 Cytochrome c oxidase, a 13 sub-unit complex (1 Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query66
1v54_D147 Cytochrome C oxidase subunit IV isoform 1; oxidore 88.58
2y69_D169 Cytochrome C oxidase subunit 4 isoform 1; electron 86.89
>1v54_D Cytochrome C oxidase subunit IV isoform 1; oxidoreductase; HET: FME TPO HEA TGL PGV CHD CDL PEK PSC DMU; 1.80A {Bos taurus} SCOP: f.23.1.1 PDB: 1oco_D* 1occ_D* 1ocz_D* 1ocr_D* 1v55_D* 2dyr_D* 2dys_D* 2eij_D* 2eik_D* 2eil_D* 2eim_D* 2ein_D* 2occ_D* 2ybb_O* 2zxw_D* 3abk_D* 3abl_D* 3abm_D* 3ag1_D* 3ag2_D* ... Back     alignment and structure
Probab=88.58  E-value=1.8  Score=28.19  Aligned_cols=47  Identities=11%  Similarity=0.049  Sum_probs=38.1

Q ss_pred             HHHHHHHHHHHHHhHHHHHHHHHHhCCCCccccCcHHHHHHHHhhhh
Q 035376            9 LLPLGIIAAMLTIAGNAQYQIHKAAHGRPKHVGNDMWDVAMERRDKK   55 (66)
Q Consensus         9 Lpp~gIi~~~~~v~G~~~~~i~~~~~Gk~~R~~~D~wd~~mm~RD~R   55 (66)
                      -+-+|.+..++++++.....++.+.++.+++--.+.|+....+|=..
T Consensus        79 K~v~g~v~~~i~~s~~~f~~~r~~v~~p~P~T~~~Ewqeaq~erm~~  125 (147)
T 1v54_D           79 KTVVGAAMFFIGFTALLLIWEKHYVYGPIPHTFEEEWVAKQTKRMLD  125 (147)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHTCCCCCGGGSHHHHHHHHHHHHH
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHccCCCCCCCCHHHHHHHHHHHHH
Confidence            44577777888888888888999988888888899999888877444



>2y69_D Cytochrome C oxidase subunit 4 isoform 1; electron transport, complex IV, proton pumps, membrane prote; HET: TPO HEA CHD PEK PGV DMU; 1.95A {Bos taurus} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query66
d1v54d_144 Mitochondrial cytochrome c oxidase subunit IV {Cow 89.32
>d1v54d_ f.23.1.1 (D:) Mitochondrial cytochrome c oxidase subunit IV {Cow (Bos taurus) [TaxId: 9913]} Back     information, alignment and structure
class: Membrane and cell surface proteins and peptides
fold: Single transmembrane helix
superfamily: Mitochondrial cytochrome c oxidase subunit IV
family: Mitochondrial cytochrome c oxidase subunit IV
domain: Mitochondrial cytochrome c oxidase subunit IV
species: Cow (Bos taurus) [TaxId: 9913]
Probab=89.32  E-value=0.72  Score=28.51  Aligned_cols=47  Identities=11%  Similarity=0.049  Sum_probs=38.4

Q ss_pred             HHHHHHHHHHHHHhHHHHHHHHHHhCCCCccccCcHHHHHHHHhhhh
Q 035376            9 LLPLGIIAAMLTIAGNAQYQIHKAAHGRPKHVGNDMWDVAMERRDKK   55 (66)
Q Consensus         9 Lpp~gIi~~~~~v~G~~~~~i~~~~~Gk~~R~~~D~wd~~mm~RD~R   55 (66)
                      -.-+|.+..++++++.....++.+.++.+++--.+.|+..+.+|=..
T Consensus        76 K~v~g~~~~~i~~s~~i~~~~r~~v~p~~P~T~~~Ewqeaq~erm~~  122 (144)
T d1v54d_          76 KTVVGAAMFFIGFTALLLIWEKHYVYGPIPHTFEEEWVAKQTKRMLD  122 (144)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHTCCCCCGGGSHHHHHHHHHHHHH
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCHHHHHHHHHHHHH
Confidence            34567777788888888889999999988888899999998887443