Citrus Sinensis ID: 034231


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100
MLKVVCRKKERDRDHKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEPSEKRLDVLSSSRYILNLYLENLLEQS
cEEEEEEEcccccccccccccEEEEEEcccccccccEEcccccccccEEEEcccEEEEEEEEEEEEEEccEEEccccEEEEEEccHHHHHHHHHHHHHcc
ccEEEEEEcccccccccccccEEEEEEccccccccccEcccccccccEEEEcccEEEEEEEEEEEEEEcccEcccccccEccccHHHHHHHHHHHHHccc
mlkvvcrkkerdrdhknnihpykvveitpppkclgircfppnlqcgesvtieGQAYTISAVTHRYQlrkgkyepsekrldvLSSSRYILNLYLENLLEQS
mlkvvcrkkerdrdhknnihpykvveitpppkCLGIRCFPPNLQCGESVTIEGQAYTISAVTHryqlrkgkyepsekrldvlsssrYILNLYLENLLEQS
MLKVVCRKKERDRDHKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEPSEKRLDVLSSSRyilnlylenlleQS
******************IHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKY*****RLDVLSSSRYILNLYLENL****
**********************KVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEPSEKRLDVLSSSRYILNLYLENLL***
MLKVVCRKKERDRDHKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEPSEKRLDVLSSSRYILNLYLENLLEQS
MLKVVCRKKERDRDHKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEPSEKRLDVLSSSRYILNLYLENLLEQ*
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MLKVVCRKKERDRDHKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEPSEKRLDVLSSSRYILNLYLENLLEQS
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query100
225452332110 PREDICTED: uncharacterized protein LOC10 0.95 0.863 0.838 6e-42
242076274120 hypothetical protein SORBIDRAFT_06g02053 0.99 0.825 0.761 2e-40
259490454118 uncharacterized protein LOC100303883 [Ze 0.99 0.838 0.757 5e-40
326518965173 predicted protein [Hordeum vulgare subsp 0.99 0.572 0.754 2e-39
326508454157 predicted protein [Hordeum vulgare subsp 0.99 0.630 0.754 2e-39
356558849115 PREDICTED: uncharacterized protein LOC10 0.98 0.852 0.81 2e-39
351723319115 uncharacterized protein LOC100305736 [Gl 0.98 0.852 0.8 5e-39
194698056 261 unknown [Zea mays] 0.88 0.337 0.818 3e-38
22327475123 uncharacterized protein [Arabidopsis tha 0.97 0.788 0.737 1e-36
145334677114 uncharacterized protein [Arabidopsis tha 0.97 0.850 0.737 2e-36
>gi|225452332|ref|XP_002273651.1| PREDICTED: uncharacterized protein LOC100262706 [Vitis vinifera] gi|296087605|emb|CBI34861.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
 Score =  174 bits (441), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 83/99 (83%), Positives = 91/99 (91%), Gaps = 4/99 (4%)

Query: 2   LKVVCRKKERDRDHKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAV 61
           L++ CRKK+R+RDH    HPYKV+EITPPPK LG+RCFP NLQCGESVTIEGQAYTISAV
Sbjct: 16  LQIFCRKKDRERDH----HPYKVIEITPPPKNLGVRCFPSNLQCGESVTIEGQAYTISAV 71

Query: 62  THRYQLRKGKYEPSEKRLDVLSSSRYILNLYLENLLEQS 100
           THRYQLRKGKYEPSEKRLDVLS+ RYILNLYLENLL+QS
Sbjct: 72  THRYQLRKGKYEPSEKRLDVLSTGRYILNLYLENLLQQS 110




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|242076274|ref|XP_002448073.1| hypothetical protein SORBIDRAFT_06g020530 [Sorghum bicolor] gi|241939256|gb|EES12401.1| hypothetical protein SORBIDRAFT_06g020530 [Sorghum bicolor] Back     alignment and taxonomy information
>gi|259490454|ref|NP_001158949.1| uncharacterized protein LOC100303883 [Zea mays] gi|195610298|gb|ACG26979.1| hypothetical protein [Zea mays] gi|413918697|gb|AFW58629.1| hypothetical protein ZEAMMB73_224163 [Zea mays] Back     alignment and taxonomy information
>gi|326518965|dbj|BAJ92643.1| predicted protein [Hordeum vulgare subsp. vulgare] Back     alignment and taxonomy information
>gi|326508454|dbj|BAJ99494.1| predicted protein [Hordeum vulgare subsp. vulgare] Back     alignment and taxonomy information
>gi|356558849|ref|XP_003547715.1| PREDICTED: uncharacterized protein LOC100812628 [Glycine max] Back     alignment and taxonomy information
>gi|351723319|ref|NP_001235995.1| uncharacterized protein LOC100305736 [Glycine max] gi|255626475|gb|ACU13582.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|194698056|gb|ACF83112.1| unknown [Zea mays] Back     alignment and taxonomy information
>gi|22327475|ref|NP_198866.2| uncharacterized protein [Arabidopsis thaliana] gi|17644175|gb|AAL38785.1| unknown protein [Arabidopsis thaliana] gi|21436195|gb|AAM51385.1| unknown protein [Arabidopsis thaliana] gi|332007171|gb|AED94554.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|145334677|ref|NP_001078684.1| uncharacterized protein [Arabidopsis thaliana] gi|332007172|gb|AED94555.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query100
TAIR|locus:2168723123 AT5G40500 "AT5G40500" [Arabido 0.83 0.674 0.752 4.2e-31
TAIR|locus:2168723 AT5G40500 "AT5G40500" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 342 (125.4 bits), Expect = 4.2e-31, P = 4.2e-31
 Identities = 64/85 (75%), Positives = 74/85 (87%)

Query:     2 LKVVCRKKERDRDHKNNIHPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAV 61
             L++ CRKKE+ RD  +  +PYKV+EITPPPK LGIRC P NLQCGE+V IEGQ YTISAV
Sbjct:    27 LQINCRKKEKGRDQSH--YPYKVIEITPPPKSLGIRCLPHNLQCGENVMIEGQTYTISAV 84

Query:    62 THRYQLRKGKYEPSEKRLDVLSSSR 86
             THRYQLRKGKYEPSE+RLDVLS++R
Sbjct:    85 THRYQLRKGKYEPSERRLDVLSAAR 109


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.318   0.135   0.408    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      100        88   0.00091  102 3  11 22  0.48    29
                                                     29  0.49    30


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  532 (57 KB)
  Total size of DFA:  110 KB (2075 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  11.20u 0.15s 11.35t   Elapsed:  00:00:00
  Total cpu time:  11.20u 0.15s 11.35t   Elapsed:  00:00:00
  Start:  Fri May 10 18:14:27 2013   End:  Fri May 10 18:14:27 2013


GO:0003674 "molecular_function" evidence=ND
GO:0005739 "mitochondrion" evidence=ISM
GO:0008150 "biological_process" evidence=ND
GO:0006661 "phosphatidylinositol biosynthetic process" evidence=RCA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
GSVIVG00020661001
SubName- Full=Chromosome chr14 scaffold_21, whole genome shotgun sequence; (108 aa)
(Vitis vinifera)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 100
PF1385695 Gifsy-2: ATP-binding sugar transporter from pro-ph 92.55
PF0820758 EFP_N: Elongation factor P (EF-P) KOW-like domain; 85.59
PRK10377120 PTS system glucitol/sorbitol-specific transporter 83.64
PF05954292 Phage_GPD: Phage late control gene D protein (GPD) 82.06
TIGR00849121 gutA PTS system, glucitol/sorbitol-specific IIA co 81.96
>PF13856 Gifsy-2: ATP-binding sugar transporter from pro-phage; PDB: 2PP6_A Back     alignment and domain information
Probab=92.55  E-value=0.32  Score=32.53  Aligned_cols=32  Identities=31%  Similarity=0.659  Sum_probs=21.1

Q ss_pred             EcCCCCCCCCeEEEcCeEEEEEEEEEEEEEecceEEe
Q 034231           38 CFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEP   74 (100)
Q Consensus        38 ~Lp~~pqpG~~le~eg~sY~Vl~rrHrYqLr~GrY~l   74 (100)
                      .+++ |++|+.|+++|++|+|..    ++...|.|++
T Consensus        62 d~~~-P~~gd~v~~dG~~y~V~~----~~~~~G~~~I   93 (95)
T PF13856_consen   62 DYPK-PRRGDRVVIDGESYTVTR----FQEEDGMYVI   93 (95)
T ss_dssp             S------TT-EEEETTEEEEEEE----EEEETTEEEE
T ss_pred             CCCC-CCCCCEEEECCeEEEEeE----EecCCCEEEE
Confidence            3555 999999999999999974    5566687765



>PF08207 EFP_N: Elongation factor P (EF-P) KOW-like domain; InterPro: IPR013185 This entry represents the N-terminal domain of homologues of elongation factor P, which probably are translation initiation factors Back     alignment and domain information
>PRK10377 PTS system glucitol/sorbitol-specific transporter subunit IIA; Provisional Back     alignment and domain information
>PF05954 Phage_GPD: Phage late control gene D protein (GPD); PDB: 2P5Z_X 3D37_A 1WRU_A 3CDD_E Back     alignment and domain information
>TIGR00849 gutA PTS system, glucitol/sorbitol-specific IIA component Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query100
2pp6_A102 Gifsy-2 prophage ATP-binding sugar transporter-LI 88.57
2f9h_A129 PTS system, IIA component; alpha-beta structure, b 80.62
>2pp6_A Gifsy-2 prophage ATP-binding sugar transporter-LI protein; beta barrel, 4 helix bundle, structural genomics, PSI-2; 2.70A {Salmonella typhimurium LT2} SCOP: b.106.1.2 Back     alignment and structure
Probab=88.57  E-value=1.1  Score=31.06  Aligned_cols=51  Identities=16%  Similarity=0.142  Sum_probs=35.0

Q ss_pred             cceEEEEeCCCCceeeeEEcCCCCCCCCeEEEcCeEEEEEEEEEEEEEecceEEe
Q 034231           20 HPYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEP   74 (100)
Q Consensus        20 lp~EV~~~t~p~r~LG~~~Lp~~pqpG~~le~eg~sY~Vl~rrHrYqLr~GrY~l   74 (100)
                      .|-|.--++++.++|=.+.=.--|+-|+-|..+|+.|+|.    ||++..|+.++
T Consensus        41 ~~~emg~lsG~~rsLvvFSsgYrP~r~D~Vv~~Gk~y~Vt----r~~~~ngk~~i   91 (102)
T 2pp6_A           41 FLAELGPVEGNGKNVVVFSGNVIPRRGDRVVLRGSEFTVT----RIRRFNGKPQL   91 (102)
T ss_dssp             GTC--------CEEEEECCSSCCCCTTCEEEETTEEEEEE----EEEEETTEEEE
T ss_pred             hHHHhCCccCCceEEEEecCCcccCCCCEEEEcCcEEEEE----EEEEECCcEEE
Confidence            4445555688888887774445599999999999999995    68888999875



>2f9h_A PTS system, IIA component; alpha-beta structure, beta-barrel, dimer, structural genomic protein structure initiative; 1.57A {Enterococcus faecalis} SCOP: b.161.1.1 Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query100
d2pp6a193 Gifsy-2 prophage protein STM1035 {Salmonella typhi 85.03
d1ueba163 Elongation factor P N-terminal domain {Thermus the 83.3
>d2pp6a1 b.106.1.2 (A:1-93) Gifsy-2 prophage protein STM1035 {Salmonella typhimurium [TaxId: 90371]} Back     information, alignment and structure
class: All beta proteins
fold: Phage tail proteins
superfamily: Phage tail proteins
family: gpFII-like
domain: Gifsy-2 prophage protein STM1035
species: Salmonella typhimurium [TaxId: 90371]
Probab=85.03  E-value=1.1  Score=28.98  Aligned_cols=51  Identities=16%  Similarity=0.152  Sum_probs=33.8

Q ss_pred             ceEEEEeCCCCceeeeEEcCCCCCCCCeEEEcCeEEEEEEEEEEEEEecceEEec
Q 034231           21 PYKVVEITPPPKCLGIRCFPPNLQCGESVTIEGQAYTISAVTHRYQLRKGKYEPS   75 (100)
Q Consensus        21 p~EV~~~t~p~r~LG~~~Lp~~pqpG~~le~eg~sY~Vl~rrHrYqLr~GrY~l~   75 (100)
                      |-|.--++++.++|=...=.--|.-|+-|..+|+.|+|.    ||+.-+|++++.
T Consensus        39 ~~em~al~G~~rsLvvFSs~ykPrr~D~Vv~~G~~~~VT----r~~~~ngk~~i~   89 (93)
T d2pp6a1          39 LAELGPVEGNGKNVVVFSGNVIPRRGDRVVLRGSEFTVT----RIRRFNGKPQLT   89 (93)
T ss_dssp             TC--------CEEEEECCSSCCCCTTCEEEETTEEEEEE----EEEEETTEEEEE
T ss_pred             hhhcCCccCCceEEEEEcCCcccCCCCeEEecCcEEEEE----EEEEECCcEEEE
Confidence            445555688888887753333399999999999999995    678889998863



>d1ueba1 b.34.5.2 (A:1-63) Elongation factor P N-terminal domain {Thermus thermophilus HB8 [TaxId: 300852]} Back     information, alignment and structure