Citrus Sinensis ID: 035538


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-
MASSSPSISLSRPPIITSILNPKPSRHFHRCTVSFSTIYCSHTELAAKAIHFHNPCRGIAQKRSKIWRSNATSGEVLPTESTPLETSQEMVSSTGDESLSSVISVLLFAAFIALSILTIGVVYIGVTDFLQKREREKFEKEEAATKKKKKVKAKPRAGPRGFGQKIDDDDD
ccccccccccccccccccccccccccccccccccccccccccccccccHHHccccccccccccccccccccccccccccccccccccHHHHcccccccHHHHHHHHHHHHHHHHHHHHHEEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccccccc
cccccccccccccccEEEEEcccccccccccccccccHHHccccccccEEEEccccccHHccccccccccccccccccccccHHHHHHHHHHcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccEEccccccccccEEccccc
massspsislsrppiitsilnpkpsrhfhrctvsfstIYCSHTELAAKAIhfhnpcrgiaQKRSKIwrsnatsgevlptestpletsQEMVSSTGDESLSSVISVLLFAAFIALSILTIGVVYIGVTDFLQKREREKFEKEEAATKKKKkvkakpragprgfgqkiddddd
massspsislsrppIITSilnpkpsrhFHRCTVSFSTIYCSHTELAAKAIHFHNPCRGIAQKRSKIWRSNAtsgevlptestpletSQEMVSSTGDESLSSVISVLLFAAFIALSILTIGVVYIGVTDFLQKRerekfekeeaatkkkkkvkakpragprgfgqkiddddd
MAssspsislsrppiitsiLNPKPSRHFHRCTVSFSTIYCSHTELAAKAIHFHNPCRGIAQKRSKIWRSNATSGEVLPTESTPLETSQEMVSSTGDESLSSVisvllfaafialsilTIGVVYIGVTDFLQkrerekfekeeaatkkkkkvkakpraGPRGFGQKIDDDDD
***************ITSILN**PSRHFHRCTVSFSTIYCSHTELAAKAIHFHNPCRGIAQKRSKIW*********************************SVISVLLFAAFIALSILTIGVVYIGVTDFLQ****************************************
************************************************************************************************ESLSSVISVLLFAAFIALSILTIGVVYIGVTDF******************************************
**********SRPPIITSILNPKPSRHFHRCTVSFSTIYCSHTELAAKAIHFHNPCRGIAQKRSKIWRSNATS*************************LSSVISVLLFAAFIALSILTIGVVYIGVTDFLQKRER************************RGFGQKIDDDDD
*******ISLSRPPIITSILNPKPSRHFHRCTVSFSTIYCSHTELAAKAIHFHNPCRGIAQKRSKIWRSNATS******************SSTGDESLSSVISVLLFAAFIALSILTIGVVYIGVTDFLQKREREKFEKEE*****************************
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MASSSPSISLSRPPIITSILNPKPSRHFHRCTVSFSTIYCSHTELAAKAIHFHNPCRGIAQKRSKIWRSNATSGEVLPTESTPLETSQEMVSSTGDESLSSVISVLLFAAFIALSILTIGVVYIGVTDFLQKREREKFEKEEAATKKKKKVKAKPRAGPRGFGQKIDDDDD
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query171
225434875175 PREDICTED: uncharacterized protein LOC10 0.988 0.965 0.508 3e-29
224054641202 predicted protein [Populus trichocarpa] 0.970 0.821 0.436 1e-28
158828252167 unknown [Boechera divaricarpa] 0.883 0.904 0.421 2e-21
297822645167 hypothetical protein ARALYDRAFT_901879 [ 0.877 0.898 0.412 4e-21
297790301167 hypothetical protein ARALYDRAFT_920649 [ 0.883 0.904 0.421 5e-21
356550293165 PREDICTED: uncharacterized protein LOC10 0.695 0.721 0.460 1e-19
449450758176 PREDICTED: uncharacterized protein LOC10 0.660 0.642 0.537 3e-19
15227050169 uncharacterized protein [Arabidopsis tha 0.824 0.834 0.404 1e-18
255559024168 conserved hypothetical protein [Ricinus 0.871 0.886 0.4 2e-18
357454233166 hypothetical protein MTR_2g097680 [Medic 0.736 0.759 0.465 2e-14
>gi|225434875|ref|XP_002283174.1| PREDICTED: uncharacterized protein LOC100248460 [Vitis vinifera] gi|297746039|emb|CBI16095.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
 Score =  132 bits (333), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 90/177 (50%), Positives = 117/177 (66%), Gaps = 8/177 (4%)

Query: 1   MASSSPSISLSRPPIITSILNPKP--SRHFHRCTVSFSTIYCSHTELA-AKAIHFHNPCR 57
           MAS+S   SLS  P   SI N KP  S H H  ++ F  +    T     K I+FH+P  
Sbjct: 1   MASASVCGSLSPLPF-PSIFNSKPPNSLHIHNSSLPFIPLLALRTTTPRKKTINFHHPPH 59

Query: 58  GIAQKRSKIWRSNATSGEVLPTESTPLETSQEMVSSTGDESLSSVISVLLFAAFIALSIL 117
            I QK +++WR +A S + LP+E TPLE SQ++VS T DE +S+VIS LLF AF+AL+IL
Sbjct: 60  AIPQKSTQLWRISAVSEDALPSEVTPLENSQQIVS-TSDEGVSTVISALLFVAFVALTIL 118

Query: 118 TIGVVYIGVTDFLQKR-EREKFEKEEAATKKKKKVKAKPRAGPRGFGQKIDDD--DD 171
           TIGV+YI V+DFLQKR + +  ++E A  K  KK K + RAGPRGFGQKID++  DD
Sbjct: 119 TIGVIYIAVSDFLQKREKEKFEKEESAKKKNGKKRKVRARAGPRGFGQKIDEESGDD 175




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224054641|ref|XP_002298342.1| predicted protein [Populus trichocarpa] gi|222845600|gb|EEE83147.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|158828252|gb|ABW81129.1| unknown [Boechera divaricarpa] Back     alignment and taxonomy information
>gi|297822645|ref|XP_002879205.1| hypothetical protein ARALYDRAFT_901879 [Arabidopsis lyrata subsp. lyrata] gi|297325044|gb|EFH55464.1| hypothetical protein ARALYDRAFT_901879 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|297790301|ref|XP_002863050.1| hypothetical protein ARALYDRAFT_920649 [Arabidopsis lyrata subsp. lyrata] gi|158828186|gb|ABW81065.1| unknown [Arabidopsis lyrata subsp. lyrata] gi|297308855|gb|EFH39309.1| hypothetical protein ARALYDRAFT_920649 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|356550293|ref|XP_003543522.1| PREDICTED: uncharacterized protein LOC100792546 [Glycine max] Back     alignment and taxonomy information
>gi|449450758|ref|XP_004143129.1| PREDICTED: uncharacterized protein LOC101205908 [Cucumis sativus] gi|449496669|ref|XP_004160194.1| PREDICTED: uncharacterized protein LOC101223634 [Cucumis sativus] Back     alignment and taxonomy information
>gi|15227050|ref|NP_180481.1| uncharacterized protein [Arabidopsis thaliana] gi|3980414|gb|AAC95217.1| unknown protein [Arabidopsis thaliana] gi|26451351|dbj|BAC42776.1| unknown protein [Arabidopsis thaliana] gi|28973607|gb|AAO64128.1| unknown protein [Arabidopsis thaliana] gi|330253125|gb|AEC08219.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|255559024|ref|XP_002520535.1| conserved hypothetical protein [Ricinus communis] gi|223540377|gb|EEF41948.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|357454233|ref|XP_003597397.1| hypothetical protein MTR_2g097680 [Medicago truncatula] gi|355486445|gb|AES67648.1| hypothetical protein MTR_2g097680 [Medicago truncatula] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query171
TAIR|locus:2043192169 AT2G29180 "AT2G29180" [Arabido 0.631 0.639 0.316 1.1e-09
TAIR|locus:2043192 AT2G29180 "AT2G29180" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 140 (54.3 bits), Expect = 1.1e-09, P = 1.1e-09
 Identities = 37/117 (31%), Positives = 57/117 (48%)

Query:    61 QKRSKIWRSNAT----SGEVLPTESTPLETSQEMVSSTGDESLSSVXXXXXXXXXXXXXX 116
             Q+++++WR +AT    S E++ ++S+   +S  +VSS G + ++ +              
Sbjct:    49 QEKTQLWRVSATPEEISQEIVSSDSS--SSSGAVVSSGGQDGVALIIQVLLFVAFLALTV 106

Query:   117 XTIGVVYIGVTDFL---QXXXXXXXXXXXXXXXXXXXXXXXXXXGPRGFGQKIDDDD 170
              TIGVVYIGVT+FL   +                          GPRGFGQKI+DDD
Sbjct:   107 LTIGVVYIGVTEFLGKREREKFEKDEAAKKSKKGGKKKAMRARAGPRGFGQKIEDDD 163


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.316   0.131   0.392    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      171       113   0.00091  102 3  11 22  0.49    30
                                                     29  0.44    32


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  579 (62 KB)
  Total size of DFA:  125 KB (2080 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  12.56u 0.18s 12.74t   Elapsed:  00:00:01
  Total cpu time:  12.56u 0.18s 12.74t   Elapsed:  00:00:01
  Start:  Thu May  9 20:31:53 2013   End:  Thu May  9 20:31:54 2013


GO:0003674 "molecular_function" evidence=ND
GO:0008150 "biological_process" evidence=ND
GO:0009507 "chloroplast" evidence=ISM
GO:0009535 "chloroplast thylakoid membrane" evidence=IDA
GO:0009965 "leaf morphogenesis" evidence=RCA
GO:0030154 "cell differentiation" evidence=RCA
GO:0045893 "positive regulation of transcription, DNA-dependent" evidence=RCA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 171
PF08229196 SHR3_chaperone: ER membrane protein SH3 ; InterPro 94.79
PF1488059 COX14: Cytochrome oxidase c assembly 89.7
smart00786196 SHR3_chaperone ER membrane protein SH3. This famil 89.49
PF11770158 GAPT: GRB2-binding adapter (GAPT); InterPro: IPR02 86.65
PF1104449 TMEMspv1-c74-12: Plectrovirus spv1-c74 ORF 12 tran 83.95
PF1498151 FAM165: FAM165 family 82.32
>PF08229 SHR3_chaperone: ER membrane protein SH3 ; InterPro: IPR013248 This family of proteins are membrane localised chaperones that are required for correct plasma membrane localisation of amino acid permeases (AAPs) [] Back     alignment and domain information
Probab=94.79  E-value=0.041  Score=46.15  Aligned_cols=39  Identities=28%  Similarity=0.472  Sum_probs=31.3

Q ss_pred             HHHHHHHHHHHHhhheeeeeehhhhhhhHHHHHHHHHHHHhh
Q 035538          105 VLLFAAFIALSILTIGVVYIGVTDFLQKREREKFEKEEAATK  146 (171)
Q Consensus       105 ~LlfvAfvgLsilTiGViYL~VtdFlqKRe~EkfeKeEaak~  146 (171)
                      +++.++++|..+|=+|--|   .+..+.++.++|++||++++
T Consensus       138 ~Ii~~~LvGVLvLQaG~~Y---Ae~~~~~~~~~~~~~e~~~~  176 (196)
T PF08229_consen  138 TIIALVLVGVLVLQAGQWY---AERKDAKELEEFEKEEAEAA  176 (196)
T ss_pred             HHHHHHHHHHHHHHhhHHH---HhhhhHHHHHHHHHHHHHHH
Confidence            4667788899999999998   78888889998888874333



Shr3 prevents AAPs proteins from aggregating and assists in their correct folding. In the absence of Shr3, AAPs are retained in the ER.

>PF14880 COX14: Cytochrome oxidase c assembly Back     alignment and domain information
>smart00786 SHR3_chaperone ER membrane protein SH3 Back     alignment and domain information
>PF11770 GAPT: GRB2-binding adapter (GAPT); InterPro: IPR021082 This entry represents a family of transmembrane proteins which bind the growth factor receptor-bound protein 2 (GRB2) in B cells [] Back     alignment and domain information
>PF11044 TMEMspv1-c74-12: Plectrovirus spv1-c74 ORF 12 transmembrane protein; InterPro: IPR022743 This is a group of proteins expressed by Plectroviruses Back     alignment and domain information
>PF14981 FAM165: FAM165 family Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query171
2knc_B79 Integrin beta-3; transmembrane signaling, protein 92.06
>2knc_B Integrin beta-3; transmembrane signaling, protein structure, cell A cleavage on PAIR of basic residues, disease mutation, disul bond, glycoprotein; NMR {Homo sapiens} Back     alignment and structure
Probab=92.06  E-value=0.13  Score=36.33  Aligned_cols=41  Identities=29%  Similarity=0.374  Sum_probs=27.0

Q ss_pred             HHHHHHHHHHHHHHHHHhhheeeeeehhhhhhhHHHHHHHHHHH
Q 035538          100 SSVISVLLFAAFIALSILTIGVVYIGVTDFLQKREREKFEKEEA  143 (171)
Q Consensus       100 a~vis~LlfvAfvgLsilTiGViYL~VtdFlqKRe~EkfeKeEa  143 (171)
                      +.+..++..+..+||.+|-+   +--++.+-++||-+|||||-.
T Consensus        12 ~Iv~gvi~gilliGllllli---wk~~~~i~DrrE~~kFEkE~~   52 (79)
T 2knc_B           12 VVLLSVMGAILLIGLAALLI---WKLLITIHDRKEFAKFEEERA   52 (79)
T ss_dssp             HHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHH
T ss_pred             hhHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHH
Confidence            34445555556666665543   444577889999999998833




Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00