Citrus Sinensis ID: 035343


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60-------
MMLVCSDTGRKVDDPEVLEAIRLIVINNLLQYHPESSSQLAMGVTFGVDPPKQQVFAFWSYFNFSDY
cEEEEcccccccccHHHHHHHHHHHHHHHHHHcccccHHHHHcccccccccccEEEEEEEEEEEEEc
cEEEEcccccccccHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHcccccHHHHEEHEHEEEcccc
mmlvcsdtgrkvddpEVLEAIRLIVINNLlqyhpesssqlamgvtfgvdppkqqVFAFWsyfnfsdy
mmlvcsdtgrkvddpeVLEAIRLIVINNLLQYHPESSSQLAMGVTFGVDPPKQQVFAFWSYFNFSDY
MMLVCSDTGRKVDDPEVLEAIRLIVINNLLQYHPESSSQLAMGVTFGVDPPKQQVFAFWSYFNFSDY
**************PEVLEAIRLIVINNLLQYHPESSSQLAMGVTFGVDPPKQQVFAFWSYFNF***
***V****GRKVDDPEVLEAIRLIVINNLLQYHPESSSQLAMGVTFGVDPPKQQVFAFWSYFNFSDY
********GRKVDDPEVLEAIRLIVINNLLQYHPESSSQLAMGVTFGVDPPKQQVFAFWSYFNFSDY
*MLVCSDTGRKVDDPEVLEAIRLIVINNLLQYHPESSSQLAMGVTFGVDPPKQQVFAFWSYFNFSD*
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MMLVCSDTGRKVDDPEVLEAIRLIVINNLLQYHPESSSQLAMGVTFGVDPPKQQVFAFWSYFNFSDY
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query67 2.2.26 [Sep-21-2011]
Q0J709283 Unknown protein DS12 from yes no 0.716 0.169 0.645 2e-14
P83643283 Unknown protein DS12 from N/A no 0.716 0.169 0.645 2e-14
>sp|Q0J709|UP12_ORYSJ Unknown protein DS12 from 2D-PAGE of leaf, chloroplastic OS=Oryza sativa subsp. japonica GN=Os08g0242700 PE=2 SV=2 Back     alignment and function desciption
 Score = 77.4 bits (189), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 31/48 (64%), Positives = 42/48 (87%)

Query: 8   TGRKVDDPEVLEAIRLIVINNLLQYHPESSSQLAMGVTFGVDPPKQQV 55
           TGRK+ +PE+LEA+RL +INN++QYHPE+SSQLA+G TFG +PP + V
Sbjct: 140 TGRKIGEPELLEAVRLTIINNMIQYHPEASSQLALGATFGPEPPTELV 187





Oryza sativa subsp. japonica (taxid: 39947)
>sp|P83643|UP12_ORYSI Unknown protein DS12 from 2D-PAGE of leaf, chloroplastic OS=Oryza sativa subsp. indica GN=OsI_027381 PE=1 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query67
255564051 282 amino acid binding protein, putative [Ri 0.746 0.177 0.86 1e-18
359483143 291 PREDICTED: unknown protein DS12 from 2D- 0.746 0.171 0.8 2e-17
449439743 292 PREDICTED: unknown protein DS12 from 2D- 0.746 0.171 0.82 3e-17
449533134 224 PREDICTED: unknown protein DS12 from 2D- 0.746 0.223 0.82 4e-17
224110036 279 predicted protein [Populus trichocarpa] 0.716 0.172 0.833 5e-17
147800453 453 hypothetical protein VITISV_038929 [Viti 0.746 0.110 0.8 6e-17
118488105150 unknown [Populus trichocarpa] 0.985 0.44 0.651 8e-17
388516485 293 unknown [Medicago truncatula] 0.910 0.208 0.677 2e-16
357467939 293 hypothetical protein MTR_4g007140 [Medic 0.910 0.208 0.677 2e-16
224097482 203 predicted protein [Populus trichocarpa] 0.716 0.236 0.833 3e-16
>gi|255564051|ref|XP_002523024.1| amino acid binding protein, putative [Ricinus communis] gi|223537746|gb|EEF39366.1| amino acid binding protein, putative [Ricinus communis] Back     alignment and taxonomy information
 Score = 97.1 bits (240), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 43/50 (86%), Positives = 48/50 (96%)

Query: 6   SDTGRKVDDPEVLEAIRLIVINNLLQYHPESSSQLAMGVTFGVDPPKQQV 55
           +DTGRKV+DPE+LEAIRL +INNLLQYHPESSSQLAMGV FGV+PPKQQV
Sbjct: 137 ADTGRKVEDPELLEAIRLTIINNLLQYHPESSSQLAMGVAFGVEPPKQQV 186




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|359483143|ref|XP_003632910.1| PREDICTED: unknown protein DS12 from 2D-PAGE of leaf, chloroplastic-like [Vitis vinifera] gi|298204773|emb|CBI25271.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|449439743|ref|XP_004137645.1| PREDICTED: unknown protein DS12 from 2D-PAGE of leaf, chloroplastic-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|449533134|ref|XP_004173532.1| PREDICTED: unknown protein DS12 from 2D-PAGE of leaf, chloroplastic-like, partial [Cucumis sativus] Back     alignment and taxonomy information
>gi|224110036|ref|XP_002315393.1| predicted protein [Populus trichocarpa] gi|222864433|gb|EEF01564.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|147800453|emb|CAN70848.1| hypothetical protein VITISV_038929 [Vitis vinifera] Back     alignment and taxonomy information
>gi|118488105|gb|ABK95872.1| unknown [Populus trichocarpa] Back     alignment and taxonomy information
>gi|388516485|gb|AFK46304.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|357467939|ref|XP_003604254.1| hypothetical protein MTR_4g007140 [Medicago truncatula] gi|355505309|gb|AES86451.1| hypothetical protein MTR_4g007140 [Medicago truncatula] Back     alignment and taxonomy information
>gi|224097482|ref|XP_002310954.1| predicted protein [Populus trichocarpa] gi|222850774|gb|EEE88321.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query67
TAIR|locus:2015616290 ACR11 "ACT domain repeats 11" 0.746 0.172 0.72 3.3e-15
TAIR|locus:2175458301 ACR12 "ACT domain repeats 12" 0.731 0.162 0.653 1.1e-13
TAIR|locus:2015616 ACR11 "ACT domain repeats 11" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 192 (72.6 bits), Expect = 3.3e-15, P = 3.3e-15
 Identities = 36/50 (72%), Positives = 44/50 (88%)

Query:     6 SDTGRKVDDPEVLEAIRLIVINNLLQYHPESSSQLAMGVTFGVDPPKQQV 55
             +D+GRKV+DPE+LEAIRL VINNLL++HPESSSQLAMG  FGV PP + +
Sbjct:   145 ADSGRKVEDPELLEAIRLTVINNLLEFHPESSSQLAMGAAFGVLPPTEPI 194




GO:0009507 "chloroplast" evidence=ISM;IDA
GO:0009941 "chloroplast envelope" evidence=IDA
GO:0009535 "chloroplast thylakoid membrane" evidence=IDA
GO:0009570 "chloroplast stroma" evidence=IDA
GO:0009409 "response to cold" evidence=IEP;RCA
GO:0010319 "stromule" evidence=IDA
GO:0009416 "response to light stimulus" evidence=IEP
GO:0009744 "response to sucrose stimulus" evidence=IEP
GO:0006833 "water transport" evidence=RCA
GO:0009651 "response to salt stress" evidence=RCA
GO:0009750 "response to fructose stimulus" evidence=RCA
GO:0019684 "photosynthesis, light reaction" evidence=RCA
GO:0030003 "cellular cation homeostasis" evidence=RCA
GO:0042742 "defense response to bacterium" evidence=RCA
GO:0070838 "divalent metal ion transport" evidence=RCA
TAIR|locus:2175458 ACR12 "ACT domain repeats 12" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 67
cd0489572 ACT_ACR_1 ACT domain-containing protein which is c 96.88
cd0489775 ACT_ACR_3 ACT domain-containing protein which is c 94.91
cd0492574 ACT_ACR_2 ACT domain-containing protein which is c 94.64
cd0489675 ACT_ACR-like_3 ACT domain-containing protein which 93.71
PF1152376 DUF3223: Protein of unknown function (DUF3223); In 93.36
PF1133383 DUF3135: Protein of unknown function (DUF3135); In 90.01
cd0490073 ACT_UUR-like_1 ACT domain family, ACT_UUR-like_1, 83.95
PRK05007884 PII uridylyl-transferase; Provisional 80.3
cd0492776 ACT_ACR-like_2 Second ACT domain, of a novel type 80.06
>cd04895 ACT_ACR_1 ACT domain-containing protein which is composed almost entirely of four ACT domain repeats (the "ACR" protein) Back     alignment and domain information
Probab=96.88  E-value=0.00071  Score=41.70  Aligned_cols=26  Identities=15%  Similarity=0.189  Sum_probs=23.4

Q ss_pred             CceeecCCCCccCCHHHHHHHHHHHHH
Q 035343            1 MMLVCSDTGRKVDDPEVLEAIRLIVIN   27 (67)
Q Consensus         1 ~~It~a~Tg~KV~~~e~LE~IRlTIiN   27 (67)
                      |||+|. .|+||.||+++++||.++..
T Consensus        45 FyV~d~-~g~kl~d~~~~~~l~~~L~~   70 (72)
T cd04895          45 FHVTDQ-LGNKLTDDSLIAYIEKSLGT   70 (72)
T ss_pred             EEEECC-CCCCCCCHHHHHHHHHHhcc
Confidence            799988 89999999999999998753



This CD includes the N-terminal ACT domain, of a novel type of ACT domain-containing protein which is composed almost entirely of four ACT domain repeats (the "ACR" protein). ACR proteins, found only in Arabidopsis and Oryza, as yet, are proposed to function as novel regulatory or sensor proteins in plants. Nine ACR gene products have been described (ACR1-8 in Arabidopsis and OsARC1-9 in Oryza) and are represented in this CD. Members of this CD belong to the superfamily of ACT regulatory domains.

>cd04897 ACT_ACR_3 ACT domain-containing protein which is composed almost entirely of four ACT domain repeats (the "ACR" protein) Back     alignment and domain information
>cd04925 ACT_ACR_2 ACT domain-containing protein which is composed almost entirely of four ACT domain repeats (the "ACR" protein) Back     alignment and domain information
>cd04896 ACT_ACR-like_3 ACT domain-containing protein which is composed almost entirely of four ACT domain repeats (the "ACR" protein) Back     alignment and domain information
>PF11523 DUF3223: Protein of unknown function (DUF3223); InterPro: IPR021602 This family of proteins has no known function Back     alignment and domain information
>PF11333 DUF3135: Protein of unknown function (DUF3135); InterPro: IPR021482 This family of proteins with unkown function appears to be restricted to Proteobacteria Back     alignment and domain information
>cd04900 ACT_UUR-like_1 ACT domain family, ACT_UUR-like_1, includes the first of two C-terminal ACT domains of the bacterial signal-transducing uridylyltransferase /uridylyl-removing (UUR) enzyme, GlnD and related domains Back     alignment and domain information
>PRK05007 PII uridylyl-transferase; Provisional Back     alignment and domain information
>cd04927 ACT_ACR-like_2 Second ACT domain, of a novel type of ACT domain-containing protein which is composed almost entirely of four ACT domain repeats (the "ACR" protein) Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query67
2k0m_A104 Uncharacterized protein; structural genomics, unkn 89.16
4gud_A211 Imidazole glycerol phosphate synthase subunit His; 81.24
2kl4_A118 BH2032 protein; NB7804A, structural genomics, PSI- 80.81
2oc6_A124 YDHG protein; secretion chaperone-like fold, struc 80.33
>2k0m_A Uncharacterized protein; structural genomics, unknown function, PSI-2, protein structure initiative; NMR {Rhodospirillum rubrum atcc 11170} Back     alignment and structure
Probab=89.16  E-value=0.19  Score=32.16  Aligned_cols=33  Identities=24%  Similarity=0.409  Sum_probs=24.4

Q ss_pred             cCCCCccCCHHHHHHHHHHHHHHHHhhCccchhhhhccc
Q 035343            6 SDTGRKVDDPEVLEAIRLIVINNLLQYHPESSSQLAMGV   44 (67)
Q Consensus         6 a~Tg~KV~~~e~LE~IRlTIiNNll~yHPEsse~la~G~   44 (67)
                      ...|++|.+.|.      +++-.||+||||..+.+..|.
T Consensus        31 y~~g~~l~~~d~------~~l~~lL~~HP~~~~KiG~Gi   63 (104)
T 2k0m_A           31 YRPGDIVSTVDG------AFLVEALKRHPDATSKIGPGV   63 (104)
T ss_dssp             SCTTEECCHHHH------HHHHHHHHTSTTHHHHHTTCE
T ss_pred             CCCCCccCHHHH------HHHHHHHHhCCcHHHhcCCCc
Confidence            345677766532      568889999999999987664



>4gud_A Imidazole glycerol phosphate synthase subunit His; structural genomics, niaid, national institute of allergy AN infectious diseases; HET: MSE 1PE; 1.91A {Vibrio cholerae} Back     alignment and structure
>2kl4_A BH2032 protein; NB7804A, structural genomics, PSI-2, protein structure initiative, joint center for structural genomics, JCSG; NMR {Bacillus halodurans} Back     alignment and structure
>2oc6_A YDHG protein; secretion chaperone-like fold, structural genomics, joint CE structural genomics, JCSG, protein structure initiative; 1.75A {Bacillus subtilis} SCOP: d.198.4.1 Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query67
d1ka9h_195 GAT subunit, HisH, (or domain) of imidazoleglycero 89.23
d1k9vf_200 GAT subunit, HisH, (or domain) of imidazoleglycero 88.77
d1gpma2205 GMP synthetase {Escherichia coli [TaxId: 562]} 86.59
d1wl8a1188 GMP synthase subunit A, GuaAA {Archaeon Pyrococcus 86.56
d1qdlb_195 Anthranilate synthase GAT subunit, TrpG {Archaeon 85.93
d1i7qb_192 Anthranilate synthase GAT subunit, TrpG {Serratia 81.27
>d1ka9h_ c.23.16.1 (H:) GAT subunit, HisH, (or domain) of imidazoleglycerolphosphate synthase HisF {Thermus thermophilus [TaxId: 274]} Back     information, alignment and structure
class: Alpha and beta proteins (a/b)
fold: Flavodoxin-like
superfamily: Class I glutamine amidotransferase-like
family: Class I glutamine amidotransferases (GAT)
domain: GAT subunit, HisH, (or domain) of imidazoleglycerolphosphate synthase HisF
species: Thermus thermophilus [TaxId: 274]
Probab=89.23  E-value=0.036  Score=32.50  Aligned_cols=8  Identities=63%  Similarity=1.041  Sum_probs=6.6

Q ss_pred             HhhCccch
Q 035343           30 LQYHPESS   37 (67)
Q Consensus        30 l~yHPEss   37 (67)
                      +|+|||.|
T Consensus       173 vQFHPEkS  180 (195)
T d1ka9h_         173 PQFHPEKS  180 (195)
T ss_dssp             ESSCTTSS
T ss_pred             EeCCcccc
Confidence            69999955



>d1k9vf_ c.23.16.1 (F:) GAT subunit, HisH, (or domain) of imidazoleglycerolphosphate synthase HisF {Thermotoga maritima [TaxId: 2336]} Back     information, alignment and structure
>d1gpma2 c.23.16.1 (A:3-207) GMP synthetase {Escherichia coli [TaxId: 562]} Back     information, alignment and structure
>d1wl8a1 c.23.16.1 (A:1-188) GMP synthase subunit A, GuaAA {Archaeon Pyrococcus horikoshii [TaxId: 53953]} Back     information, alignment and structure
>d1qdlb_ c.23.16.1 (B:) Anthranilate synthase GAT subunit, TrpG {Archaeon Sulfolobus solfataricus [TaxId: 2287]} Back     information, alignment and structure
>d1i7qb_ c.23.16.1 (B:) Anthranilate synthase GAT subunit, TrpG {Serratia marcescens [TaxId: 615]} Back     information, alignment and structure