Citrus Sinensis ID: 030938


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------17
MMTPQKFVTTHLTQKSSLRAMASTIPWSSLSRAPTLLRVRNNARPVTCKAHAPKSAQALHASRRNLLFFSLTALPVLTARESASSAEDIPLFGLRRKLKQAEEEAVDIVREVEKKIETVEERIEASEKEVETTAGFGGLAQAGAVAGAELVGLLVATSVVNGILGPEPQ
cccccHHHHHHccccHHHHHHcccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHcHHHHHHccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHHccccccccc
ccccccEEEcccccHcHHHHcccccccHHHccccHHHHHHccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHccHHHHccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHEEHEHHHHHHHccccccc
MMTPQKFVTTHLTQKSSLRAMAstipwsslsraptllrvrnnarpvtckahapkSAQALHASRRNLLFFsltalpvltaresassaediplFGLRRKLKQAEEEAVDIVREVEKKIETVEERIEASEKEVETTAGFGGLAQAGAVAGAELVGLLVATSVvngilgpepq
mmtpqkfvtthltqksslrAMAStipwsslsraptlLRVRNNARPVTCKAHAPKSAQALHASRRNLLFFSLTALPVLTAresassaediplFGLRRKLKQAEEEAVDIVREVEKKIETVEERIEASEKEVETTAGFGGLAQAGAVAGAELVGLLVATSVvngilgpepq
MMTPQKFVTTHLTQKSSLRAMASTIPWSSLSRAPTLLRVRNNARPVTCKAHAPKSAQALHASRRNLLFFSLTALPVLTARESASSAEDIPLFGLRRKLKQAEEEAVDIVREVEKKIETVEERIEASEKEVETTagfgglaqagavagaelvgllvaTSVVNGILGPEPQ
**********************************TLLRVR*****VTC**********LHASRRNLLFFSLTALPVLTA*********IPLFGLR**********VDIVRE********************TTAGFGGLAQAGAVAGAELVGLLVATSVVNGIL*****
*****KF*TTHLTQKS*************************************************LLFFSLTALPVLT********EDIPLFGLRRKL*******************************VETTAGFGGLAQAGAVAGAELVGLLVATSVVNGILGP***
***************SSLRAMASTIPWSSLSRAPTLLRVRNNARPVT************HASRRNLLFFSLTALPVLTARESASSAEDIPLFGLRRKLKQAEEEAVDIVREVEKKIETVEERIEASEKEVETTAGFGGLAQAGAVAGAELVGLLVATSVVNGILGPEPQ
*****KFVT**LTQK***********WSS*SRAPTLLR**N**RPVTCKAH***S***LHASRRNLLFFSLTALPVLTARESASSAEDIPLFGLRRKLKQAEEEAVDIVREVEKKIETVEERIEASEKEVETTAGFGGLAQAGAVAGAELVGLLVATSVVNGILGPE**
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MMTPQKFVTTHLTQKSSLRAMASTIPWSSLSRAPTLLRVRNNARPVTCKAHAPKSAQALHASRRNLLFFSLTALPVLTARESASSAEDIPxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxVETTAGFGGLAQAGAVAGAELVGLLVATSVVNGILGPEPQ
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query169
21554083167 unknown [Arabidopsis thaliana] 0.704 0.712 0.577 1e-28
224094564167 predicted protein [Populus trichocarpa] 0.834 0.844 0.520 4e-28
297825275167 hypothetical protein ARALYDRAFT_481233 [ 0.692 0.700 0.565 6e-28
18400342167 synechocystis YCF37-like protein [Arabid 0.639 0.646 0.6 1e-27
356496840169 PREDICTED: uncharacterized protein LOC10 0.846 0.846 0.542 2e-27
357483133161 hypothetical protein MTR_5g018590 [Medic 0.822 0.863 0.503 2e-27
356537796162 PREDICTED: uncharacterized protein LOC10 0.834 0.870 0.521 3e-25
255562751157 conserved hypothetical protein [Ricinus 0.763 0.821 0.578 3e-23
225467027152 PREDICTED: uncharacterized protein LOC10 0.715 0.796 0.532 3e-23
449460624167 PREDICTED: uncharacterized protein LOC10 0.727 0.736 0.544 4e-22
>gi|21554083|gb|AAM63164.1| unknown [Arabidopsis thaliana] Back     alignment and taxonomy information
 Score =  130 bits (328), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 78/135 (57%), Positives = 92/135 (68%), Gaps = 16/135 (11%)

Query: 50  AHAPKSAQA----LHASRRNLLFFSLTALPVLTARESASSAEDIPLFGLRRKLKQAEEEA 105
           AH+ K A A      ASRRNLLF SLTA  V+T  +SAS AE+IPLFG+R+KLK+AEEEA
Sbjct: 32  AHSNKPALASFSLTGASRRNLLF-SLTAATVVTGLQSASMAENIPLFGIRKKLKKAEEEA 90

Query: 106 VDIVRE-----------VEKKIETVEERIEASEKEVETTAGFGGLAQAGAVAGAELVGLL 154
           V+IV+E            EK +E  E  +E +EKE+ T   F GL QAGAV  AE VG+L
Sbjct: 91  VEIVKEGFETAEKGVDAAEKGLEAAERGVETAEKEIVTAVSFNGLTQAGAVVAAEFVGVL 150

Query: 155 VATSVVNGILGPEPQ 169
           VATSVVNGILGPE Q
Sbjct: 151 VATSVVNGILGPEAQ 165




Source: Arabidopsis thaliana

Species: Arabidopsis thaliana

Genus: Arabidopsis

Family: Brassicaceae

Order: Brassicales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224094564|ref|XP_002310181.1| predicted protein [Populus trichocarpa] gi|222853084|gb|EEE90631.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|297825275|ref|XP_002880520.1| hypothetical protein ARALYDRAFT_481233 [Arabidopsis lyrata subsp. lyrata] gi|297326359|gb|EFH56779.1| hypothetical protein ARALYDRAFT_481233 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|18400342|ref|NP_565556.1| synechocystis YCF37-like protein [Arabidopsis thaliana] gi|3152615|gb|AAC17094.1| expressed protein [Arabidopsis thaliana] gi|16648702|gb|AAL25543.1| At2g23670/F26B6.32 [Arabidopsis thaliana] gi|20197003|gb|AAM14867.1| expressed protein [Arabidopsis thaliana] gi|20334912|gb|AAM16212.1| At2g23670/F26B6.32 [Arabidopsis thaliana] gi|206748360|gb|ACI22363.1| YCF37-like protein [Arabidopsis thaliana] gi|330252384|gb|AEC07478.1| synechocystis YCF37-like protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|356496840|ref|XP_003517273.1| PREDICTED: uncharacterized protein LOC100780194 [Glycine max] Back     alignment and taxonomy information
>gi|357483133|ref|XP_003611853.1| hypothetical protein MTR_5g018590 [Medicago truncatula] gi|355513188|gb|AES94811.1| hypothetical protein MTR_5g018590 [Medicago truncatula] Back     alignment and taxonomy information
>gi|356537796|ref|XP_003537411.1| PREDICTED: uncharacterized protein LOC100788441 [Glycine max] Back     alignment and taxonomy information
>gi|255562751|ref|XP_002522381.1| conserved hypothetical protein [Ricinus communis] gi|223538459|gb|EEF40065.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|225467027|ref|XP_002265860.1| PREDICTED: uncharacterized protein LOC100262366 [Vitis vinifera] gi|225468769|ref|XP_002275177.1| PREDICTED: uncharacterized protein LOC100262616 [Vitis vinifera] gi|147791855|emb|CAN61831.1| hypothetical protein VITISV_027633 [Vitis vinifera] Back     alignment and taxonomy information
>gi|449460624|ref|XP_004148045.1| PREDICTED: uncharacterized protein LOC101208092 [Cucumis sativus] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query169
TAIR|locus:2046832167 YCF37 "AT2G23670" [Arabidopsis 0.875 0.886 0.427 1.3e-20
TAIR|locus:2046832 YCF37 "AT2G23670" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 243 (90.6 bits), Expect = 1.3e-20, P = 1.3e-20
 Identities = 71/166 (42%), Positives = 87/166 (52%)

Query:    21 MASTIPWSSLSRAPTLLRVR--NNARPVTCKAHAPKSAQALH----ASRRNLLFFSLTAL 74
             MAST+  S LS     L+ R    A      AH+ K A +      ASRRNLLF SLTA 
Sbjct:     1 MASTVSLSLLSGGSGALQARCARTAHVSPRAAHSNKPALSSFSLTGASRRNLLF-SLTAA 59

Query:    75 PVLTARESASSAEDIPLFGLRRKLKQAEEEAVDIVRE-----------VEKKIETVEERI 123
              V+T  + AS AE+IPLFG+R+KLK+AEEEAV+IV+E            EK +E  E  +
Sbjct:    60 TVVTGLQPASMAENIPLFGIRKKLKKAEEEAVEIVKEGFETAEKGVDAAEKGLEAAERGV 119

Query:   124 EASEKEVETTXXXXXXXXXXXXXXXXXXXXXXXTSVVNGILGPEPQ 169
             E +EKE+ T                        TSVVNGILGPE Q
Sbjct:   120 ETAEKEIVTAVSFNGLTQAGAVVAAEFVGVLVATSVVNGILGPEAQ 165


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.313   0.124   0.335    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      169       146   0.00079  103 3  10 23  0.48    31
                                                     30  0.49    33


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  580 (62 KB)
  Total size of DFA:  153 KB (2093 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  28.47u 0.21s 28.68t   Elapsed:  00:00:01
  Total cpu time:  28.47u 0.21s 28.68t   Elapsed:  00:00:01
  Start:  Fri May 10 04:51:48 2013   End:  Fri May 10 04:51:49 2013


GO:0003674 "molecular_function" evidence=ND
GO:0008150 "biological_process" evidence=ND
GO:0009507 "chloroplast" evidence=ISM;IDA
GO:0009543 "chloroplast thylakoid lumen" evidence=ISS
GO:0009535 "chloroplast thylakoid membrane" evidence=IDA
GO:0009579 "thylakoid" evidence=IDA
GO:0009534 "chloroplast thylakoid" evidence=IDA
GO:0006098 "pentose-phosphate shunt" evidence=RCA
GO:0010027 "thylakoid membrane organization" evidence=RCA
GO:0019288 "isopentenyl diphosphate biosynthetic process, mevalonate-independent pathway" evidence=RCA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
eugene3.00071002
hypothetical protein (168 aa)
(Populus trichocarpa)
Predicted Functional Partners:
gw1.VI.698.1
hypothetical protein (139 aa)
      0.447

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 169
PF03179105 V-ATPase_G: Vacuolar (H+)-ATPase G subunit; InterP 93.39
TIGR01147113 V_ATP_synt_G vacuolar ATP synthase, subunit G. Thi 88.25
PRK10132108 hypothetical protein; Provisional 84.72
PLN00042260 photosystem II oxygen-evolving enhancer protein 2; 83.62
KOG1772108 consensus Vacuolar H+-ATPase V1 sector, subunit G 82.29
PF0595794 DUF883: Bacterial protein of unknown function (DUF 82.09
>PF03179 V-ATPase_G: Vacuolar (H+)-ATPase G subunit; InterPro: IPR005124 This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit Back     alignment and domain information
Probab=93.39  E-value=0.26  Score=35.54  Aligned_cols=37  Identities=35%  Similarity=0.535  Sum_probs=26.7

Q ss_pred             hHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHh
Q 030938           96 RKLKQAEEEAVDIVREVE-KKIETVEERIEASEKEVET  132 (169)
Q Consensus        96 KklkkAEEeA~EiVkE~E-~~veaae~g~e~Aekei~~  132 (169)
                      +.|.+||++|.+||.++. ........+-++|+++|+.
T Consensus         7 q~Ll~AE~eA~~iV~~Ar~~r~~~lk~Ak~eA~~ei~~   44 (105)
T PF03179_consen    7 QQLLEAEKEAQEIVEEARKEREQRLKQAKEEAEKEIEE   44 (105)
T ss_dssp             STHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            468999999999999977 3344455566666666554



V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins in several organelles. They are also found in the plasma membranes of some specialised cells. V-ATPases consist of peripheral (V1) and membrane integral (V0) heteromultimeric complexes. The G subunit is part of the V1 subunit, but is also thought to be strongly attached to the V0 complex. It may be involved in the coupling of ATP degradation to H+ translocation.; GO: 0016820 hydrolase activity, acting on acid anhydrides, catalyzing transmembrane movement of substances, 0015992 proton transport, 0016471 vacuolar proton-transporting V-type ATPase complex; PDB: 2KWY_A 2K88_A.

>TIGR01147 V_ATP_synt_G vacuolar ATP synthase, subunit G Back     alignment and domain information
>PRK10132 hypothetical protein; Provisional Back     alignment and domain information
>PLN00042 photosystem II oxygen-evolving enhancer protein 2; Provisional Back     alignment and domain information
>KOG1772 consensus Vacuolar H+-ATPase V1 sector, subunit G [Energy production and conversion] Back     alignment and domain information
>PF05957 DUF883: Bacterial protein of unknown function (DUF883); InterPro: IPR010279 This family consists of several bacterial proteins of unknown function that include the Escherichia coli genes for ElaB, YgaM and YqjD Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query169
4efa_G119 V-type proton ATPase subunit G; heterotrimer, peri 91.95
2k6i_A56 Uncharacterized protein MJ0223; H subunit, A1AO AT 91.74
>4efa_G V-type proton ATPase subunit G; heterotrimer, peripheral stalk, vacuolar ATPase, hydrolase; 2.82A {Saccharomyces cerevisiae} PDB: 4dl0_K 2k88_A 2kwy_A Back     alignment and structure
Probab=91.95  E-value=0.38  Score=36.07  Aligned_cols=39  Identities=26%  Similarity=0.413  Sum_probs=29.2

Q ss_pred             hhhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHh
Q 030938           93 GLRRKLKQAEEEAVDIVREVE-KKIETVEERIEASEKEVET  132 (169)
Q Consensus        93 GIRKklkkAEEeA~EiVkE~E-~~veaae~g~e~Aekei~~  132 (169)
                      ||..= =+||.+|.+||.|+- ........+.++|++||+.
T Consensus        11 GIQqL-L~AEk~A~~iV~~ARk~k~~rLKqAK~EA~~EIe~   50 (119)
T 4efa_G           11 GIATL-LQAEKEAHEIVSKARKYRQDKLKQAKTDAAKEIDS   50 (119)
T ss_dssp             HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            66554 589999999999977 4455566677788777764



>2k6i_A Uncharacterized protein MJ0223; H subunit, A1AO ATP synthase, V1VO ATPase, F1FO ATP synthase, structural protein; NMR {Methanocaldococcus jannaschii} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00