Citrus Sinensis ID: 035185


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70-
MGGGMEANKNRFIEEWGAARENLEHNFRWTRRNLALVGLFGIAVPVFIYKGIVKEFHMQDEDAGRPYRKFL
cccccccccHHHHHHHHHHHHHHHHHcEEccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccc
ccccccccccHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccHHcc
mgggmeankNRFIEEWGAARENLEHNFRWTRRNLALVGLFGIAVPVFIYKGIVKefhmqdedagrpyrkfl
MGGGMEANKNRFIEEWGAARENLEHNFRWTRRNLALVGLFGIAVPVFIYKGIVKEfhmqdedagrpyrkfl
MGGGMEANKNRFIEEWGAARENLEHNFRWTRRNLALVGLFGIAVPVFIYKGIVKEFHMQDEDAGRPYRKFL
***********FIEEWGAARENLEHNFRWTRRNLALVGLFGIAVPVFIYKGIVKEFH**************
***********FIEEWGAARENLEHNFRWTRRNLALVGLFGIAVPVFIYKGIVKEFHM******R******
MGGGMEANKNRFIEEWGAARENLEHNFRWTRRNLALVGLFGIAVPVFIYKGIVKEFHMQDEDAGRPYRKFL
*****EANKNRFIEEWGAARENLEHNFRWTRRNLALVGLFGIAVPVFIYKGIVKEFHMQDEDAG*******
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHoooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MGGGMEANKNRFIEEWGAARENLEHNFRWTRRNLALVGLFGIAVPVFIYKGIVKEFHMQDEDAGRPYRKFL
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query71
1840268271 uncharacterized protein [Arabidopsis tha 1.0 1.0 0.816 2e-29
35652088171 PREDICTED: uncharacterized protein LOC10 1.0 1.0 0.845 2e-29
22409053971 predicted protein [Populus trichocarpa] 1.0 1.0 0.830 3e-29
22544782071 PREDICTED: uncharacterized protein LOC10 1.0 1.0 0.830 4e-29
38851843371 unknown [Lotus japonicus] 1.0 1.0 0.845 4e-29
2153682971 unknown [Arabidopsis thaliana] 1.0 1.0 0.816 5e-29
38849719272 unknown [Medicago truncatula] 0.985 0.972 0.828 2e-28
44952386271 PREDICTED: uncharacterized protein LOC10 1.0 1.0 0.816 7e-28
19469664471 unknown [Zea mays] gi|195605062|gb|ACG24 1.0 1.0 0.816 1e-27
24205869971 hypothetical protein SORBIDRAFT_03g03472 1.0 1.0 0.802 2e-27
>gi|18402682|ref|NP_565726.1| uncharacterized protein [Arabidopsis thaliana] gi|297822895|ref|XP_002879330.1| hypothetical protein ARALYDRAFT_902184 [Arabidopsis lyrata subsp. lyrata] gi|4582445|gb|AAD24829.1| expressed protein [Arabidopsis thaliana] gi|20198160|gb|AAM15434.1| expressed protein [Arabidopsis thaliana] gi|26451972|dbj|BAC43077.1| unknown protein [Arabidopsis thaliana] gi|28416745|gb|AAO42903.1| At2g31490 [Arabidopsis thaliana] gi|227204167|dbj|BAH56935.1| AT2G31490 [Arabidopsis thaliana] gi|297325169|gb|EFH55589.1| hypothetical protein ARALYDRAFT_902184 [Arabidopsis lyrata subsp. lyrata] gi|330253460|gb|AEC08554.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
 Score =  133 bits (334), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 58/71 (81%), Positives = 68/71 (95%)

Query: 1  MGGGMEANKNRFIEEWGAARENLEHNFRWTRRNLALVGLFGIAVPVFIYKGIVKEFHMQD 60
          MGGGME NKN+FIE+WG+ARENLEHNFRWTRRN AL+G+FGIA+P+ +YKGIVK+FHMQD
Sbjct: 1  MGGGMETNKNKFIEDWGSARENLEHNFRWTRRNFALIGIFGIALPIIVYKGIVKDFHMQD 60

Query: 61 EDAGRPYRKFL 71
          EDAGRP+RKFL
Sbjct: 61 EDAGRPHRKFL 71




Source: Arabidopsis thaliana

Species: Arabidopsis thaliana

Genus: Arabidopsis

Family: Brassicaceae

Order: Brassicales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|356520881|ref|XP_003529088.1| PREDICTED: uncharacterized protein LOC100306283 isoform 1 [Glycine max] Back     alignment and taxonomy information
>gi|224090539|ref|XP_002309020.1| predicted protein [Populus trichocarpa] gi|118488449|gb|ABK96039.1| unknown [Populus trichocarpa] gi|222854996|gb|EEE92543.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|225447820|ref|XP_002269616.1| PREDICTED: uncharacterized protein LOC100258994 isoform 1 [Vitis vinifera] gi|359485910|ref|XP_003633354.1| PREDICTED: uncharacterized protein LOC100258994 isoform 2 [Vitis vinifera] gi|296081504|emb|CBI20027.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|388518433|gb|AFK47278.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|21536829|gb|AAM61161.1| unknown [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|388497192|gb|AFK36662.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|449523862|ref|XP_004168942.1| PREDICTED: uncharacterized protein LOC101229266 [Cucumis sativus] Back     alignment and taxonomy information
>gi|194696644|gb|ACF82406.1| unknown [Zea mays] gi|195605062|gb|ACG24361.1| hypothetical protein [Zea mays] gi|195627010|gb|ACG35335.1| hypothetical protein [Zea mays] gi|195636166|gb|ACG37551.1| hypothetical protein [Zea mays] gi|195640360|gb|ACG39648.1| hypothetical protein [Zea mays] gi|238012190|gb|ACR37130.1| unknown [Zea mays] gi|413952056|gb|AFW84705.1| hypothetical protein ZEAMMB73_570227 [Zea mays] gi|413952438|gb|AFW85087.1| hypothetical protein ZEAMMB73_581400 [Zea mays] Back     alignment and taxonomy information
>gi|242058699|ref|XP_002458495.1| hypothetical protein SORBIDRAFT_03g034720 [Sorghum bicolor] gi|241930470|gb|EES03615.1| hypothetical protein SORBIDRAFT_03g034720 [Sorghum bicolor] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query71
TAIR|locus:206130571 AT2G31490 "AT2G31490" [Arabido 1.0 1.0 0.816 2.6e-31
TAIR|locus:2061305 AT2G31490 "AT2G31490" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 344 (126.2 bits), Expect = 2.6e-31, P = 2.6e-31
 Identities = 58/71 (81%), Positives = 68/71 (95%)

Query:     1 MGGGMEANKNRFIEEWGAARENLEHNFRWTRRNLALVGLFGIAVPVFIYKGIVKEFHMQD 60
             MGGGME NKN+FIE+WG+ARENLEHNFRWTRRN AL+G+FGIA+P+ +YKGIVK+FHMQD
Sbjct:     1 MGGGMETNKNKFIEDWGSARENLEHNFRWTRRNFALIGIFGIALPIIVYKGIVKDFHMQD 60

Query:    61 EDAGRPYRKFL 71
             EDAGRP+RKFL
Sbjct:    61 EDAGRPHRKFL 71


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.325   0.143   0.456    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0       71        71   0.00091  102 3  11 22  0.45    28
                                                     29  0.42    29


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  517 (55 KB)
  Total size of DFA:  104 KB (2073 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  7.63u 0.13s 7.76t   Elapsed:  00:00:00
  Total cpu time:  7.63u 0.13s 7.76t   Elapsed:  00:00:00
  Start:  Fri May 10 06:21:56 2013   End:  Fri May 10 06:21:56 2013


GO:0003674 "molecular_function" evidence=ND
GO:0005739 "mitochondrion" evidence=ISM;IDA
GO:0009853 "photorespiration" evidence=RCA;TAS
GO:0031966 "mitochondrial membrane" evidence=IDA
GO:0045271 "respiratory chain complex I" evidence=IDA
GO:0005747 "mitochondrial respiratory chain complex I" evidence=IDA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
scaffold_401685.1
annotation not avaliable (71 aa)
(Arabidopsis lyrata)
Predicted Functional Partners:
scaffold_100875.1
annotation not avaliable (142 aa)
      0.495
fgenesh2_kg.1__2880__AT1G27300.1
annotation not avaliable (193 aa)
       0.488
fgenesh1_pm.C_scaffold_4001959
annotation not avaliable (173 aa)
      0.487
fgenesh2_kg.1__3843__AT1G47420.1
annotation not avaliable (260 aa)
      0.434

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query71
PLN0275571 PLN02755, PLN02755, complex I subunit 7e-43
>gnl|CDD|178356 PLN02755, PLN02755, complex I subunit Back     alignment and domain information
 Score =  132 bits (334), Expect = 7e-43
 Identities = 61/71 (85%), Positives = 67/71 (94%)

Query: 1  MGGGMEANKNRFIEEWGAARENLEHNFRWTRRNLALVGLFGIAVPVFIYKGIVKEFHMQD 60
          MGGGME NKN+FIEEWGAARENLE NFRWTRRNLA+VG+FGIAVP+ +YKGIV+EFHMQD
Sbjct: 1  MGGGMEVNKNKFIEEWGAARENLEFNFRWTRRNLAVVGIFGIAVPILVYKGIVREFHMQD 60

Query: 61 EDAGRPYRKFL 71
          EDAGRP RKFL
Sbjct: 61 EDAGRPERKFL 71


Length = 71

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 71
PLN0275571 complex I subunit 100.0
PF07225125 NDUF_B4: NADH-ubiquinone oxidoreductase B15 subuni 99.82
>PLN02755 complex I subunit Back     alignment and domain information
Probab=100.00  E-value=2.7e-35  Score=179.49  Aligned_cols=71  Identities=86%  Similarity=1.483  Sum_probs=70.2

Q ss_pred             CCCCCCCccCHHHHHHHHHhhccccceeecchhHHHHHHHHhHHHHHhHHHHhhhhhhhhhhcCCcccCCC
Q 035185            1 MGGGMEANKNRFIEEWGAARENLEHNFRWTRRNLALVGLFGIAVPVFIYKGIVKEFHMQDEDAGRPYRKFL   71 (71)
Q Consensus         1 MaG~~~l~~dPalerw~~~ren~~~~FR~Tprt~~~~~~~~~~vP~~~yy~~~~~~~~~d~~~G~~~r~f~   71 (71)
                      |||++++.+|..||.|++||||+|+||||||||++++++||++||+++||++++||++||.++|||+|||+
T Consensus         1 m~gg~~~~~Nk~iE~~~~~reNrekyFRWT~Rt~~i~~ifgv~VP~liy~giv~eF~~~d~~~grp~~kf~   71 (71)
T PLN02755          1 MGGGMEVNKNKFIEEWGAARENLEFNFRWTRRNLAVVGIFGIAVPILVYKGIVREFHMQDEDAGRPERKFL   71 (71)
T ss_pred             CCCCcccCccHHHHHHHHHHHHHHHheecccchhhhhhhhhhhhhHHhhhhhhhhhcccchhccCCccccC
Confidence            89999999999999999999999999999999999999999999999999999999999999999999996



>PF07225 NDUF_B4: NADH-ubiquinone oxidoreductase B15 subunit (NDUFB4); InterPro: IPR009866 NADH:ubiquinone oxidoreductase (complex I) (1 Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00