Citrus Sinensis ID: 034494


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90---
MDDSSASYIHLVHHLIEECLVFNMSKEECMEALSKHANIKPVITSTVWKELEKENKEFFEAYARKNAERATVMETRERIEKMLANSTTIRDTK
ccccHHHHHHHHHHHHHHHHHHHccHHHHHHHHHHHccccHHHHHHHHHHHHHHcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccc
ccccHHHHHHHHHHHHHHHHHHHccHHHHHHHHHHHcccccHHHHHHHHHHHHHcHHHHHHHHHHHcccHHHHHHHHHHHHHHHHcccccccc
MDDSSASYIHLVHHLIEECLVFNMSKEECMEALSkhanikpviTSTVWKELEKENKEFFEAYARKNAERATVMETRERIEKMLAnsttirdtk
MDDSSASYIHLVHHLIEECLVFNMSKEECMEALskhanikpvITSTVWKELEKENKEFFEayarknaeratvmeTRERIEkmlansttirdtk
MDDSSASYIHLVHHLIEECLVFNMSKEECMEALSKHANIKPVITSTVWkelekenkeffeAYARKNAERATVMETRERIEKMLANSTTIRDTK
*******YIHLVHHLIEECLVFNMSKEECMEALSKHANIKPVITSTVWKELEKENKEFFEAYA******************************
*****AS**HLVHHLIEECLVFNMSKEECMEALSKHANIKPVITSTVWKELEKENKEFFEAYARKNAERA***********************
MDDSSASYIHLVHHLIEECLVFNMSKEECMEALSKHANIKPVITSTVWKELEKENKEFFEAYARKNAERATVMETRERIEKMLAN********
****SASYIHLVHHLIEECLVFNMSKEECMEALSKHANIKPVITSTVWKELEKENKEFFEAYARKNAER*TVMETRERIEKMLAN********
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooo
iiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MDDSSASYIHLVHHLIEECLVFNMSKEECMEALSKHANIKPVITSTVWKELEKENKEFFEAYARKNAERATVMETRERIEKMLANSTTIRDTK
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query93
21707562688 unknown [Medicago truncatula] gi|3884953 0.935 0.988 0.770 5e-32
38851388192 unknown [Lotus japonicus] 0.935 0.945 0.75 1e-31
35657235593 PREDICTED: uncharacterized protein LOC10 0.967 0.967 0.739 2e-31
22543985494 PREDICTED: uncharacterized protein LOC10 0.978 0.968 0.706 9e-31
35650514293 PREDICTED: uncharacterized protein LOC10 0.935 0.935 0.730 3e-30
35654815896 PREDICTED: uncharacterized protein LOC10 0.881 0.854 0.743 4e-29
22408857382 predicted protein [Populus trichocarpa] 0.752 0.853 0.842 7e-29
29780619391 hypothetical protein ARALYDRAFT_487036 [ 0.924 0.945 0.709 3e-28
225448833110 PREDICTED: uncharacterized protein LOC10 0.946 0.8 0.681 4e-28
4256759691 uncharacterized protein [Arabidopsis tha 0.924 0.945 0.709 9e-28
>gi|217075626|gb|ACJ86173.1| unknown [Medicago truncatula] gi|388495396|gb|AFK35764.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
 Score =  141 bits (356), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 67/87 (77%), Positives = 77/87 (88%)

Query: 1  MDDSSASYIHLVHHLIEECLVFNMSKEECMEALSKHANIKPVITSTVWKELEKENKEFFE 60
          M+DSSA+YIHLVH LIEEC++FNMSKEECM+ALSKHANIKPVITSTVWKELEKENKEFFE
Sbjct: 1  MEDSSAAYIHLVHRLIEECILFNMSKEECMDALSKHANIKPVITSTVWKELEKENKEFFE 60

Query: 61 AYARKNAERATVMETRERIEKMLANST 87
          AY R  AERA+  E R+RI  M+ +S+
Sbjct: 61 AYLRNRAERASETEMRQRIRNMVLDSS 87




Source: Medicago truncatula

Species: Medicago truncatula

Genus: Medicago

Family: Fabaceae

Order: Fabales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|388513881|gb|AFK45002.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|356572355|ref|XP_003554334.1| PREDICTED: uncharacterized protein LOC100785268 [Glycine max] Back     alignment and taxonomy information
>gi|225439854|ref|XP_002278074.1| PREDICTED: uncharacterized protein LOC100256772 [Vitis vinifera] gi|297741538|emb|CBI32670.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|356505142|ref|XP_003521351.1| PREDICTED: uncharacterized protein LOC100808933 [Glycine max] Back     alignment and taxonomy information
>gi|356548158|ref|XP_003542470.1| PREDICTED: uncharacterized protein LOC100811683 [Glycine max] Back     alignment and taxonomy information
>gi|224088573|ref|XP_002308479.1| predicted protein [Populus trichocarpa] gi|222854455|gb|EEE92002.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|297806193|ref|XP_002870980.1| hypothetical protein ARALYDRAFT_487036 [Arabidopsis lyrata subsp. lyrata] gi|297316817|gb|EFH47239.1| hypothetical protein ARALYDRAFT_487036 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|225448833|ref|XP_002282451.1| PREDICTED: uncharacterized protein LOC100244913 [Vitis vinifera] gi|296086934|emb|CBI33167.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|42567596|ref|NP_568103.3| uncharacterized protein [Arabidopsis thaliana] gi|29028782|gb|AAO64770.1| At5g02580 [Arabidopsis thaliana] gi|110743100|dbj|BAE99442.1| hypothetical protein [Arabidopsis thaliana] gi|332003110|gb|AED90493.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query93
TAIR|locus:218183891 AT5G02580 "AT5G02580" [Arabido 0.924 0.945 0.581 5.5e-22
TAIR|locus:210067695 AT3G55240 "AT3G55240" [Arabido 0.967 0.947 0.510 7.2e-20
TAIR|locus:208699288 AT3G28990 "AT3G28990" [Arabido 0.870 0.920 0.481 9.8e-16
TAIR|locus:4010713425125 AT1G10657 "AT1G10657" [Arabido 0.451 0.336 0.476 5.3e-08
TAIR|locus:2146728 305 AT5G04090 "AT5G04090" [Arabido 0.903 0.275 0.285 9.5e-05
TAIR|locus:2076299 324 AT3G10250 "AT3G10250" [Arabido 0.903 0.259 0.273 0.00037
TAIR|locus:2181838 AT5G02580 "AT5G02580" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 256 (95.2 bits), Expect = 5.5e-22, P = 5.5e-22
 Identities = 50/86 (58%), Positives = 58/86 (67%)

Query:     2 DDSSASYIHLVHHLIEECLVFNMSKEECMEALSKHANIKPVITSTVWXXXXXXXXXXXXA 61
             + S ASYIHLVHHLIEEC+VFNM KEECM+AL KHANIKP+ITSTVW            A
Sbjct:     5 NSSQASYIHLVHHLIEECIVFNMGKEECMDALFKHANIKPIITSTVWKELAKENKEFFEA 64

Query:    62 YARKNAERATVMETRERIEKMLANST 87
             Y R+  E  T  ET  RI  +L+ +T
Sbjct:    65 YERRREEIPTEKETARRIRDLLSRTT 90




GO:0003674 "molecular_function" evidence=ND
GO:0005575 "cellular_component" evidence=ND
GO:0008150 "biological_process" evidence=ND
GO:0006635 "fatty acid beta-oxidation" evidence=RCA
GO:0009062 "fatty acid catabolic process" evidence=RCA
TAIR|locus:2100676 AT3G55240 "AT3G55240" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2086992 AT3G28990 "AT3G28990" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:4010713425 AT1G10657 "AT1G10657" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2146728 AT5G04090 "AT5G04090" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2076299 AT3G10250 "AT3G10250" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
GSVIVG00023118001
SubName- Full=Chromosome chr8 scaffold_29, whole genome shotgun sequence; (94 aa)
(Vitis vinifera)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query93
pfam0971354 pfam09713, A_thal_3526, Plant protein 1589 of unkn 1e-25
TIGR0158957 TIGR01589, A_thal_3526, uncharacterized plant-spec 5e-21
>gnl|CDD|192339 pfam09713, A_thal_3526, Plant protein 1589 of unknown function (A_thal_3526) Back     alignment and domain information
 Score = 89.7 bits (223), Expect = 1e-25
 Identities = 35/51 (68%), Positives = 39/51 (76%)

Query: 12 VHHLIEECLVFNMSKEECMEALSKHANIKPVITSTVWKELEKENKEFFEAY 62
          V +LIE C+   MSKEEC+ AL KHANI P  TSTVWK+LEKEN EFF AY
Sbjct: 1  VQNLIERCIQLYMSKEECVNALQKHANISPKFTSTVWKQLEKENPEFFRAY 51


This plant-specific family of proteins is defined by an uncharacterized region 57 residues in length. It is found toward the N terminus of most proteins that contain it. Examples include at least several proteins from Arabidopsis thaliana and Oryza sativa. The function of the proteins are unknown. Length = 54

>gnl|CDD|130650 TIGR01589, A_thal_3526, uncharacterized plant-specific domain TIGR01589 Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 93
TIGR0158957 A_thal_3526 uncharacterized plant-specific domain 100.0
PF0971354 A_thal_3526: Plant protein 1589 of unknown functio 100.0
PF1356577 HTH_32: Homeodomain-like domain 83.25
PF08986119 DUF1889: Domain of unknown function (DUF1889); Int 82.17
smart0041190 BHL bacterial (prokaryotic) histone like domain. 81.01
>TIGR01589 A_thal_3526 uncharacterized plant-specific domain TIGR01589 Back     alignment and domain information
Probab=100.00  E-value=3e-37  Score=196.30  Aligned_cols=57  Identities=46%  Similarity=0.673  Sum_probs=55.7

Q ss_pred             HHHHHHHHHHHHHhhCCHHHHHHHHHhhcCCChhhHHHHHHHHHHhcHHHHHHHHhc
Q 034494            9 IHLVHHLIEECLVFNMSKEECMEALSKHANIKPVITSTVWKELEKENKEFFEAYARK   65 (93)
Q Consensus         9 I~~VQ~LIErCLqlyMsk~Evv~~L~~~a~I~P~fT~~VW~~Le~eNpeFF~aY~~~   65 (93)
                      |++|||||||||++|||++|||++|++||||+|+||++||++||+||||||+|||.+
T Consensus         1 i~~Vq~lIE~Cl~~yMsk~E~v~~L~~~a~I~P~~T~~VW~~LekeN~eFF~aY~~~   57 (57)
T TIGR01589         1 IDLVQNRIETCIQGYMSKEETVSFLFENAGISPKFTRFVWYLLEKENADFFRCYKTH   57 (57)
T ss_pred             CHHHHHHHHHHHHHHCCHHHHHHHHHHHcCCCchhHHHHHHHHHHHHHHHHHHHhcC
Confidence            689999999999999999999999999999999999999999999999999999975



This model represents an uncharacterized plant-specific domain 57 residues in length. It is found toward the N-terminus of most proteins that contain it. Examples include at least 10 proteins from Arabidopsis thaliana and at least one from Oryza sativa.

>PF09713 A_thal_3526: Plant protein 1589 of unknown function (A_thal_3526); InterPro: IPR006476 This plant-specific family of proteins are defined by an uncharacterised region 57 residues in length Back     alignment and domain information
>PF13565 HTH_32: Homeodomain-like domain Back     alignment and domain information
>PF08986 DUF1889: Domain of unknown function (DUF1889); InterPro: IPR015079 This family consist of hypothetical bacterial proteins Back     alignment and domain information
>smart00411 BHL bacterial (prokaryotic) histone like domain Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query93
2di0_A71 Activating signal cointegrator 1 complex subunit 2 85.1
2es9_A115 Putative cytoplasmic protein; structural genomics, 83.8
>2di0_A Activating signal cointegrator 1 complex subunit 2; ASCC2, CUE domain, structural genomics, NPPSFA; NMR {Homo sapiens} SCOP: a.5.2.4 Back     alignment and structure
Probab=85.10  E-value=1.8  Score=27.70  Aligned_cols=38  Identities=24%  Similarity=0.492  Sum_probs=31.0

Q ss_pred             HHHHHHHHHH--------HHHHHHhh-CCHHHHHHHHHhhcCCChhhH
Q 034494            6 ASYIHLVHHL--------IEECLVFN-MSKEECMEALSKHANIKPVIT   44 (93)
Q Consensus         6 ~~~I~~VQ~L--------IErCLqly-Msk~Evv~~L~~~a~I~P~fT   44 (93)
                      .+.|..|+.|        |.+||..| -|-|.||.+|.+. ++.|...
T Consensus        14 ~s~I~qV~DLfPdLG~gfi~~~L~~y~~nvE~vin~LLE~-~LPp~L~   60 (71)
T 2di0_A           14 DSLISQVKDLLPDLGEGFILACLEYYHYDPEQVINNILEE-RLAPTLS   60 (71)
T ss_dssp             HHHHHHHHHHCCSSCHHHHHHHHHHTTTCHHHHHHHHHTT-CCCTTTT
T ss_pred             HHHHHHHHHHcccCCHHHHHHHHHHhCCCHHHHHHHHHcc-CCCHHHH
Confidence            3567777765        89999999 8999999999985 7877653



>2es9_A Putative cytoplasmic protein; structural genomics, PSI, protein structure initiative, NORT structural genomics consortium, NESG; 2.00A {Salmonella typhimurium} SCOP: a.247.1.1 PDB: 2jn8_A Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query93
d2es9a197 Hypothetical protein YoaC {Salmonella typhimurium 84.91
d1b8za_90 HU protein {Thermotoga maritima [TaxId: 2336]} 82.88
d1mula_90 HU protein {Escherichia coli [TaxId: 562]} 81.47
d1exea_99 Transcription factor 1, TF1 {Bacteriophage SPO1 [T 80.88
d2di0a163 Activating signal cointegrator 1 complex subunit 2 80.54
>d2es9a1 a.247.1.1 (A:11-107) Hypothetical protein YoaC {Salmonella typhimurium [TaxId: 90371]} Back     information, alignment and structure
class: All alpha proteins
fold: YoaC-like
superfamily: YoaC-like
family: YoaC-like
domain: Hypothetical protein YoaC
species: Salmonella typhimurium [TaxId: 90371]
Probab=84.91  E-value=0.26  Score=31.81  Aligned_cols=37  Identities=27%  Similarity=0.529  Sum_probs=26.9

Q ss_pred             HHHHHHhhcCCChhhHHHH--HHHHHH-------hcHHHHHHHHhc
Q 034494           29 CMEALSKHANIKPVITSTV--WKELEK-------ENKEFFEAYARK   65 (93)
Q Consensus        29 vv~~L~~~a~I~P~fT~~V--W~~Le~-------eNpeFF~aY~~~   65 (93)
                      .|.+=-.+-+-+|+||.-|  |.+--+       .|||||..|.+-
T Consensus        46 dv~aRg~qegWn~gFT~K~~gWA~k~~sGerivIKnPEyFstYM~e   91 (97)
T d2es9a1          46 VVVARGEQEGWNPEFTKKVAGWAEKVASGNRILIKNPEYFSTYMQE   91 (97)
T ss_dssp             HHHHHHHHTTCCHHHHHHHHHHHHHHHTTCCCCCSSGGGSCHHHHH
T ss_pred             HHHHhcccccCChhHHHHHHHHHHHhccCCeeeecChHHHHHHHHH
Confidence            3444455779999999987  543222       799999999764



>d1b8za_ a.55.1.1 (A:) HU protein {Thermotoga maritima [TaxId: 2336]} Back     information, alignment and structure
>d1mula_ a.55.1.1 (A:) HU protein {Escherichia coli [TaxId: 562]} Back     information, alignment and structure
>d1exea_ a.55.1.1 (A:) Transcription factor 1, TF1 {Bacteriophage SPO1 [TaxId: 10685]} Back     information, alignment and structure
>d2di0a1 a.5.2.4 (A:8-70) Activating signal cointegrator 1 complex subunit 2 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure