Citrus Sinensis ID: 048574


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60-----
MGTIKINRITSPFASADSEPSAYDVLQEYDFPVGILPKGVTRYALNAETGELIFGTYEQQLQLHH
cccEEEEEEccccccccccccHHHHHHHccccccccccccEEEEEcccccEEEEEcccEEEEEcc
cccEEEEEcccccccccccccHHHHHHHccccccccccccEEEEccccccEEEEEEEEEEEEEcc
mgtikinritspfasadsepsaydvlqeydfpvgilpkgvtRYALNAETGELIFGTYEQQLQLHH
mgtikinritspfasadsepSAYDVLQEYDFPVGILPKGVTRYALNAETGELIFGTYEQQLQLHH
MGTIKINRITSPFASADSEPSAYDVLQEYDFPVGILPKGVTRYALNAETGELIFGTYEQQLQLHH
*********************AYDVLQEYDFPVGILPKGVTRYALNAETGELIFGTYE*******
**TIK***IT**********SAYDVLQEYDFPVGILPKGVTRYALNAETGELIFGTYEQQL****
MGTIKINRITSPFASADSEPSAYDVLQEYDFPVGILPKGVTRYALNAETGELIFGTYEQQLQLHH
**TIKINRITSPFASADSEPSAYDVLQEYDFPVGILPKGVTRYALNAETGELIFGTYEQQLQLHH
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MGTIKINRITSPFASADSEPSAYDVLQEYDFPVGILPKGVTRYALNAETGELIFGTYEQQLQLHH
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query65
296089725 269 unnamed protein product [Vitis vinifera] 0.676 0.163 0.522 4e-07
225450709 169 PREDICTED: uncharacterized protein LOC10 0.646 0.248 0.547 2e-06
255564212161 conserved hypothetical protein [Ricinus 0.630 0.254 0.536 2e-06
225450691 170 PREDICTED: uncharacterized protein LOC10 0.661 0.252 0.511 3e-06
225450706 166 PREDICTED: uncharacterized protein At5g0 0.569 0.222 0.621 8e-06
125552016 183 hypothetical protein OsI_19648 [Oryza sa 0.523 0.185 0.676 9e-06
115463431 182 Os05g0362500 [Oryza sativa Japonica Grou 0.523 0.186 0.676 9e-06
147776440 261 hypothetical protein VITISV_002758 [Viti 0.569 0.141 0.621 1e-05
225450695158 PREDICTED: uncharacterized protein At5g0 0.661 0.272 0.511 1e-05
222631296 192 hypothetical protein OsJ_18240 [Oryza sa 0.523 0.177 0.676 1e-05
>gi|296089725|emb|CBI39544.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
 Score = 58.5 bits (140), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 23/44 (52%), Positives = 34/44 (77%)

Query: 10  TSPFASADSEPSAYDVLQEYDFPVGILPKGVTRYALNAETGELI 53
           + P  SA+  P+AY+VLQ+Y+FPVG+LP+GV  Y LN+ TG+ +
Sbjct: 129 SPPVTSANENPTAYEVLQDYNFPVGLLPEGVLGYELNSTTGKFV 172




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|225450709|ref|XP_002283330.1| PREDICTED: uncharacterized protein LOC100247685 [Vitis vinifera] gi|147776442|emb|CAN74023.1| hypothetical protein VITISV_002760 [Vitis vinifera] Back     alignment and taxonomy information
>gi|255564212|ref|XP_002523103.1| conserved hypothetical protein [Ricinus communis] gi|223537665|gb|EEF39288.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|225450691|ref|XP_002283241.1| PREDICTED: uncharacterized protein LOC100263080 [Vitis vinifera] gi|296089728|emb|CBI39547.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|225450706|ref|XP_002283314.1| PREDICTED: uncharacterized protein At5g01610-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|125552016|gb|EAY97725.1| hypothetical protein OsI_19648 [Oryza sativa Indica Group] Back     alignment and taxonomy information
>gi|115463431|ref|NP_001055315.1| Os05g0362500 [Oryza sativa Japonica Group] gi|54287657|gb|AAV31401.1| unknown protein [Oryza sativa Japonica Group] gi|113578866|dbj|BAF17229.1| Os05g0362500 [Oryza sativa Japonica Group] gi|215706967|dbj|BAG93427.1| unnamed protein product [Oryza sativa Japonica Group] Back     alignment and taxonomy information
>gi|147776440|emb|CAN74021.1| hypothetical protein VITISV_002758 [Vitis vinifera] Back     alignment and taxonomy information
>gi|225450695|ref|XP_002283277.1| PREDICTED: uncharacterized protein At5g01610 [Vitis vinifera] gi|147780197|emb|CAN68006.1| hypothetical protein VITISV_026357 [Vitis vinifera] Back     alignment and taxonomy information
>gi|222631296|gb|EEE63428.1| hypothetical protein OsJ_18240 [Oryza sativa Japonica Group] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query65
TAIR|locus:505006096166 AT1G02816 "AT1G02816" [Arabido 0.661 0.259 0.543 2.3e-07
TAIR|locus:2133264167 AT4G02370 "AT4G02370" [Arabido 0.569 0.221 0.578 6.1e-07
TAIR|locus:2133254154 AT4G02360 "AT4G02360" [Arabido 0.630 0.266 0.512 2.1e-06
TAIR|locus:2171297 195 AT5G16380 "AT5G16380" [Arabido 0.723 0.241 0.404 3.3e-05
TAIR|locus:505006095149 AT1G02813 [Arabidopsis thalian 0.630 0.275 0.487 0.00013
TAIR|locus:505006096 AT1G02816 "AT1G02816" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 118 (46.6 bits), Expect = 2.3e-07, P = 2.3e-07
 Identities = 25/46 (54%), Positives = 31/46 (67%)

Query:     9 ITSPFASA---DSEPSAYDVLQEYDFPVGILPKGVTRYALNAETGE 51
             I SP  +A   D  P+AY +LQ Y+FPVGILPKGV  Y L+  TG+
Sbjct:    16 IPSPLIAAANDDDIPTAYTLLQSYNFPVGILPKGVVSYDLDKSTGQ 61




GO:0003674 "molecular_function" evidence=ND
GO:0005576 "extracellular region" evidence=ISM
GO:0008150 "biological_process" evidence=ND
GO:0005773 "vacuole" evidence=IDA
GO:0005774 "vacuolar membrane" evidence=IDA
TAIR|locus:2133264 AT4G02370 "AT4G02370" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2133254 AT4G02360 "AT4G02360" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2171297 AT5G16380 "AT5G16380" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:505006095 AT1G02813 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query65
pfam04398108 pfam04398, DUF538, Protein of unknown function, DU 5e-13
>gnl|CDD|218064 pfam04398, DUF538, Protein of unknown function, DUF538 Back     alignment and domain information
 Score = 58.0 bits (141), Expect = 5e-13
 Identities = 18/32 (56%), Positives = 25/32 (78%)

Query: 21 SAYDVLQEYDFPVGILPKGVTRYALNAETGEL 52
          +AY++L+EY  PVG+LPKGVT Y L+  TG+ 
Sbjct: 1  TAYELLEEYGLPVGLLPKGVTEYTLDETTGKF 32


This family consists of several plant proteins of unknown function. Length = 108

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 65
PF04398110 DUF538: Protein of unknown function, DUF538; Inter 99.58
>PF04398 DUF538: Protein of unknown function, DUF538; InterPro: IPR007493 This family consists of several plant proteins of unknown function Back     alignment and domain information
Probab=99.58  E-value=1.2e-15  Score=99.60  Aligned_cols=38  Identities=45%  Similarity=0.809  Sum_probs=28.8

Q ss_pred             CHHHHHhhCCCCCcCCCCCcceeEeeCCCCcEEEEech
Q 048574           21 SAYDVLQEYDFPVGILPKGVTRYALNAETGELIFGTYE   58 (65)
Q Consensus        21 t~yelL~~ygLP~GLLP~gV~~Y~Ld~~tG~F~V~~y~   58 (65)
                      ||||+|++||||+||||++|++|+||++||.|+|++-.
T Consensus         1 tayelL~~~glP~GLLP~~v~~y~l~~~tG~f~v~l~~   38 (110)
T PF04398_consen    1 TAYELLEEYGLPRGLLPLGVTEYGLNRDTGFFWVKLKS   38 (110)
T ss_dssp             --HHHHHHHS-TT-TTTSSS-EEEE-TTT-SEEEE-SS
T ss_pred             CHHHhHHHcCCCCCcCCCCceEEEEecCCcEEEEEecC
Confidence            68999999999999999999999999999999999854



; PDB: 1YDU_A.


Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query65
1ydu_A170 AT5G01610; DUF538, structural genomics, protein st 8e-13
>1ydu_A AT5G01610; DUF538, structural genomics, protein structure initiative, center for eukaryotic structural genomics, CESG unknown function; NMR {Arabidopsis thaliana} SCOP: b.162.1.1 Length = 170 Back     alignment and structure
 Score = 58.3 bits (141), Expect = 8e-13
 Identities = 13/39 (33%), Positives = 20/39 (51%)

Query: 14 ASADSEPSAYDVLQEYDFPVGILPKGVTRYALNAETGEL 52
               +    ++L+EYD P+GI P   T Y  + ET +L
Sbjct: 46 IKGKMQKPLPELLKEYDLPIGIFPGDATNYEFDEETKKL 84


Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query65
1ydu_A170 AT5G01610; DUF538, structural genomics, protein st 99.52
>1ydu_A AT5G01610; DUF538, structural genomics, protein structure initiative, center for eukaryotic structural genomics, CESG unknown function; NMR {Arabidopsis thaliana} SCOP: b.162.1.1 Back     alignment and structure
Probab=99.52  E-value=6.6e-15  Score=102.27  Aligned_cols=42  Identities=31%  Similarity=0.566  Sum_probs=39.4

Q ss_pred             CCCCCCHHHHHhhCCCCCcCCCCCcceeEeeCCCCcEEEEec
Q 048574           16 ADSEPSAYDVLQEYDFPVGILPKGVTRYALNAETGELIFGTY   57 (65)
Q Consensus        16 ~~~~~t~yelL~~ygLP~GLLP~gV~~Y~Ld~~tG~F~V~~y   57 (65)
                      +.+++++||+|++||||+||||++|++|.+|++||.|+|++-
T Consensus        48 g~~~~ta~elL~e~gLP~GLLP~~V~~Y~l~~~tG~f~V~l~   89 (170)
T 1ydu_A           48 GKMQKPLPELLKEYDLPIGIFPGDATNYEFDEETKKLTVLIP   89 (170)
T ss_dssp             TTTTSSCHHHHHHHSCTTCTTTSSSCEEEECTTTCSEEEECS
T ss_pred             ccccccHHHHHHHcCCCCCcCCCCCeEEEEECCCcEEEEEeC
Confidence            477889999999999999999999999999999999999873




Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 65
d1ydua1169 b.162.1.1 (A:2-170) Hypothetical protein At5g01610 1e-11
>d1ydua1 b.162.1.1 (A:2-170) Hypothetical protein At5g01610 {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Length = 169 Back     information, alignment and structure

class: All beta proteins
fold: At5g01610-like
superfamily: At5g01610-like
family: At5g01610-like
domain: Hypothetical protein At5g01610
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
 Score = 53.6 bits (129), Expect = 1e-11
 Identities = 13/34 (38%), Positives = 20/34 (58%)

Query: 19 EPSAYDVLQEYDFPVGILPKGVTRYALNAETGEL 52
          +    ++L+EYD P+GI P   T Y  + ET +L
Sbjct: 50 QKPLPELLKEYDLPIGIFPGDATNYEFDEETKKL 83


Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query65
d1ydua1169 Hypothetical protein At5g01610 {Thale cress (Arabi 99.49
>d1ydua1 b.162.1.1 (A:2-170) Hypothetical protein At5g01610 {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Back     information, alignment and structure
class: All beta proteins
fold: At5g01610-like
superfamily: At5g01610-like
family: At5g01610-like
domain: Hypothetical protein At5g01610
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
Probab=99.49  E-value=7.2e-15  Score=100.59  Aligned_cols=42  Identities=31%  Similarity=0.566  Sum_probs=38.9

Q ss_pred             CCCCCCHHHHHhhCCCCCcCCCCCcceeEeeCCCCcEEEEec
Q 048574           16 ADSEPSAYDVLQEYDFPVGILPKGVTRYALNAETGELIFGTY   57 (65)
Q Consensus        16 ~~~~~t~yelL~~ygLP~GLLP~gV~~Y~Ld~~tG~F~V~~y   57 (65)
                      ..+..+++|+|++||||+||||++|++|++|++||+|+|++-
T Consensus        47 gk~~k~l~ELL~eygLP~GLLP~~V~~Y~l~~~TG~f~V~l~   88 (169)
T d1ydua1          47 GKMQKPLPELLKEYDLPIGIFPGDATNYEFDEETKKLTVLIP   88 (169)
T ss_dssp             TTTTSSCHHHHHHHSCTTCTTTSSSCEEEECTTTCSEEEECS
T ss_pred             CCCCCCHHHHHHhcCCCCccCCCCceEEEEECCCcEEEEEeC
Confidence            356789999999999999999999999999999999999873