Citrus Sinensis ID: 032021


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------15
MDPFSGSGSWNMMPSIPSHNSSPAASNQDNLFLSPQHQQQFYQQPTQFPQQQFQQQGRTPQQQQQPQQQQQQNQHHQSLASNFHLLHLMENLADAIENGTRDQQSDALVNELNNHFEKCQQLLSSISESLDTKAMTVEGQRRKLEERN
ccccccccccEEcccccccccccccccccccccccccccccccccccccccHHHHHcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccHHHHHHHHHHHHHHHHHHHHHHHHccccccccHHHHHHHHHHHc
cccccccccccccccccccccccccccccHEEEcccccccccccHHHHccccHHHccccccHccHHHHccccccccHHHHHHHHHHHHHHHHHHHHHcccccccHHHHHHHHHHHHHHHHHHHHHHcccccccccEHHHHHHHHHHcc
mdpfsgsgswnmmpsipshnsspaasnqdnlflspqhqqqfyqqptqfpqqqfqqqgrtpqqqqqpqqqQQQNQHHQSLASNFHLLHLMENLADAIENGTRDQQSDALVNELNNHFEKCQQLLSSISESLDTKAMTVEGQRRKLEERN
MDPFSGSGSWNMMPSIPSHNSSPAASNQDNLFLSPQHQQQFYQQPTQFPQQQFQQQGRTPQQQQQPQQQQQQNQHHQSLASNFHLLHLMENLADAIENGTRDQQSDALVNELNNHFEKCQQLLSSisesldtkamtvegqrrkleern
MDPFSGSGSWNMMPSIPSHNSSPAASNQDNLFLSpqhqqqfyqqptqfpqqqfqqqgrtpqqqqqpqqqqqqnqhhqSLASNFHLLHLMENLADAIENGTRDQQSDALVNELNNHFEKCQQLLSSISESLDTKAMTVEGQRRKLEERN
**********************************************************************************FHLLHLMENLAD******************************************************
*********WN**********************************************************************NFHLLHLMENLADAIENGTRDQQSDALVNELNNHFEKCQQLLSSI**********************
********SWNMMPSIPS********NQDNLFLSPQHQ***************************************SLASNFHLLHLMENLADAIENGTRDQQSDALVNELNNHFEKCQQLLSSISESLDTK***************
****************************DNLFLS*******************************************SLASNFHLLHLMENLADAIENGTRDQQSDALVNELNNHFEKCQQLLSSISESLDTKAMTV***********
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MDPFSGSGSWNMMPSIPSHNSSPAASNQDNLFLSPQHQQQFYQQPTQFPQQQFQQQGRTPQQQQQPQQQQQQNQHHQSLASNFHLLHLMENLADAIENGTRDQQSDALVNELNNHFEKCQQLLSSISESLDTKAMTVEGQRRKLEERN
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query148 2.2.26 [Sep-21-2011]
Q8RWA2244 Mediator of RNA polymeras yes no 0.493 0.299 0.726 1e-24
>sp|Q8RWA2|MED9_ARATH Mediator of RNA polymerase II transcription subunit 9 OS=Arabidopsis thaliana GN=MED9 PE=1 SV=1 Back     alignment and function desciption
 Score =  111 bits (278), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 53/73 (72%), Positives = 63/73 (86%)

Query: 74  QHHQSLASNFHLLHLMENLADAIENGTRDQQSDALVNELNNHFEKCQQLLSSISESLDTK 133
           Q HQSLAS+FHL  ++E LAD IENGTRDQ SDALVNELN+HF+KCQQLL+SIS SL +K
Sbjct: 145 QQHQSLASHFHLYPMVEKLADVIENGTRDQNSDALVNELNSHFDKCQQLLNSISGSLGSK 204

Query: 134 AMTVEGQRRKLEE 146
            MTV+GQ+R +EE
Sbjct: 205 TMTVDGQKRNVEE 217




Component of the Mediator complex, a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors.
Arabidopsis thaliana (taxid: 3702)

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query148
224135025190 predicted protein [Populus trichocarpa] 0.966 0.752 0.622 1e-34
225442291166 PREDICTED: uncharacterized protein LOC10 0.932 0.831 0.646 1e-28
449458229133 PREDICTED: mediator of RNA polymerase II 0.466 0.518 0.826 4e-25
356522394195 PREDICTED: uncharacterized protein LOC10 0.466 0.353 0.782 5e-24
12321581234 hypothetical protein [Arabidopsis thalia 0.493 0.311 0.726 7e-23
22330255244 uncharacterized protein [Arabidopsis tha 0.493 0.299 0.726 8e-23
351723835177 uncharacterized protein LOC100305737 [Gl 0.959 0.802 0.574 1e-22
357513635196 hypothetical protein MTR_8g017180 [Medic 0.466 0.352 0.739 1e-21
242074772174 hypothetical protein SORBIDRAFT_06g03295 0.554 0.471 0.609 8e-20
115436772152 Os01g0501000 [Oryza sativa Japonica Grou 0.486 0.473 0.666 1e-19
>gi|224135025|ref|XP_002327548.1| predicted protein [Populus trichocarpa] gi|222836102|gb|EEE74523.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  150 bits (380), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 99/159 (62%), Positives = 112/159 (70%), Gaps = 16/159 (10%)

Query: 4   FSGSGSWNMMPSIPSHNSSPAASNQDNLFLSPQHQQQFYQQPTQFPQQQFQQQGRTPQQQ 63
           FSG GSW ++PS+P+H+ SPA SNQD  +LSPQ QQ  + Q  Q  Q   QQQ    QQQ
Sbjct: 5   FSGGGSWTVIPSVPTHSGSPAHSNQDQFYLSPQQQQPQFTQFQQQQQFNQQQQQFQQQQQ 64

Query: 64  QQPQQQQQQ----------------NQHHQSLASNFHLLHLMENLADAIENGTRDQQSDA 107
            Q QQ QQQ                N HHQSLAS+FHLL L ENLADAIENGTRDQ SDA
Sbjct: 65  YQQQQSQQQRFIQQQQQQQPQVQQQNHHHQSLASHFHLLQLAENLADAIENGTRDQHSDA 124

Query: 108 LVNELNNHFEKCQQLLSSISESLDTKAMTVEGQRRKLEE 146
           LVNELN HF+KCQQLL+SIS S++ KAMTVEGQ+RKLEE
Sbjct: 125 LVNELNTHFDKCQQLLNSISSSINAKAMTVEGQKRKLEE 163




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|225442291|ref|XP_002280177.1| PREDICTED: uncharacterized protein LOC100260024 [Vitis vinifera] gi|147856497|emb|CAN78638.1| hypothetical protein VITISV_031737 [Vitis vinifera] gi|297743090|emb|CBI35957.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|449458229|ref|XP_004146850.1| PREDICTED: mediator of RNA polymerase II transcription subunit 9-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|356522394|ref|XP_003529831.1| PREDICTED: uncharacterized protein LOC100803884 [Glycine max] Back     alignment and taxonomy information
>gi|12321581|gb|AAG50844.1|AC073944_11 hypothetical protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|22330255|ref|NP_175905.2| uncharacterized protein [Arabidopsis thaliana] gi|75158655|sp|Q8RWA2.1|MED9_ARATH RecName: Full=Mediator of RNA polymerase II transcription subunit 9 gi|20260684|gb|AAM13240.1| unknown protein [Arabidopsis thaliana] gi|30023712|gb|AAP13389.1| At1g55080 [Arabidopsis thaliana] gi|110742509|dbj|BAE99172.1| hypothetical protein [Arabidopsis thaliana] gi|332195062|gb|AEE33183.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|351723835|ref|NP_001236013.1| uncharacterized protein LOC100305737 [Glycine max] gi|255626477|gb|ACU13583.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|357513635|ref|XP_003627106.1| hypothetical protein MTR_8g017180 [Medicago truncatula] gi|355521128|gb|AET01582.1| hypothetical protein MTR_8g017180 [Medicago truncatula] gi|388506434|gb|AFK41283.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|242074772|ref|XP_002447322.1| hypothetical protein SORBIDRAFT_06g032950 [Sorghum bicolor] gi|241938505|gb|EES11650.1| hypothetical protein SORBIDRAFT_06g032950 [Sorghum bicolor] Back     alignment and taxonomy information
>gi|115436772|ref|NP_001043132.1| Os01g0501000 [Oryza sativa Japonica Group] gi|56201801|dbj|BAD73251.1| unknown protein [Oryza sativa Japonica Group] gi|113532663|dbj|BAF05046.1| Os01g0501000 [Oryza sativa Japonica Group] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query148
TAIR|locus:2205662244 MED9 "AT1G55080" [Arabidopsis 0.466 0.282 0.724 1.4e-27
TAIR|locus:201376889 AT1G29580 "AT1G29580" [Arabido 0.398 0.662 0.419 8.1e-05
TAIR|locus:2205662 MED9 "AT1G55080" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 257 (95.5 bits), Expect = 1.4e-27, Sum P(2) = 1.4e-27
 Identities = 50/69 (72%), Positives = 60/69 (86%)

Query:    78 SLASNFHLLHLMENLADAIENGTRDQQSDALVNELNNHFEKCQQLLSSISESLDTKAMTV 137
             SLAS+FHL  ++E LAD IENGTRDQ SDALVNELN+HF+KCQQLL+SIS SL +K MTV
Sbjct:   149 SLASHFHLYPMVEKLADVIENGTRDQNSDALVNELNSHFDKCQQLLNSISGSLGSKTMTV 208

Query:   138 EGQRRKLEE 146
             +GQ+R +EE
Sbjct:   209 DGQKRNVEE 217


GO:0003674 "molecular_function" evidence=ND
GO:0005634 "nucleus" evidence=ISM
GO:0008150 "biological_process" evidence=ND
GO:0016592 "mediator complex" evidence=IDA
TAIR|locus:2013768 AT1G29580 "AT1G29580" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q8RWA2MED9_ARATHNo assigned EC number0.72600.49320.2991yesno

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query148
pfam13779820 pfam13779, DUF4175, Domain of unknown function (DU 9e-05
pfam13779820 pfam13779, DUF4175, Domain of unknown function (DU 2e-04
pfam13779 820 pfam13779, DUF4175, Domain of unknown function (DU 9e-04
PRK10263 1355 PRK10263, PRK10263, DNA translocase FtsK; Provisio 0.001
pfam13779820 pfam13779, DUF4175, Domain of unknown function (DU 0.002
>gnl|CDD|222374 pfam13779, DUF4175, Domain of unknown function (DUF4175) Back     alignment and domain information
 Score = 40.7 bits (96), Expect = 9e-05
 Identities = 18/45 (40%), Positives = 21/45 (46%)

Query: 36  QHQQQFYQQPTQFPQQQFQQQGRTPQQQQQPQQQQQQNQHHQSLA 80
           + +QQ  Q      Q Q  QQG+  Q QQQ QQ Q       SLA
Sbjct: 619 RGEQQGQQGQGGQGQGQPGQQGQQGQGQQQGQQGQGGQGGQGSLA 663


Length = 820

>gnl|CDD|222374 pfam13779, DUF4175, Domain of unknown function (DUF4175) Back     alignment and domain information
>gnl|CDD|222374 pfam13779, DUF4175, Domain of unknown function (DUF4175) Back     alignment and domain information
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional Back     alignment and domain information
>gnl|CDD|222374 pfam13779, DUF4175, Domain of unknown function (DUF4175) Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 148
PF0754483 Med9: RNA polymerase II transcription mediator com 92.56
PF0719751 DUF1409: Protein of unknown function (DUF1409); In 89.44
>PF07544 Med9: RNA polymerase II transcription mediator complex subunit 9; InterPro: IPR011425 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes Back     alignment and domain information
Probab=92.56  E-value=0.24  Score=34.75  Aligned_cols=59  Identities=19%  Similarity=0.305  Sum_probs=44.0

Q ss_pred             HHHHHHHHHHHHhc-CCCC-cchHHHHHHHHHhHHHHHHHHHHhhhhcccccceehhhhhhhhhc
Q 032021           85 LLHLMENLADAIEN-GTRD-QQSDALVNELNNHFEKCQQLLSSISESLDTKAMTVEGQRRKLEER  147 (148)
Q Consensus        85 L~~lve~Ladaie~-GtRD-Q~sDaLv~EL~s~FekcQQLLnSis~sl~tk~~TVegQk~kLEE~  147 (148)
                      ++|.+-++-..+++ |..| ...+.-+.+|.-+|.+|..++.++.+    -..||+-|..++++.
T Consensus         4 ~lP~i~~~l~~~~~d~~~~~kd~~~~~~~lk~Klq~ar~~i~~lpg----i~~s~eeq~~~i~~L   64 (83)
T PF07544_consen    4 FLPLIFDILHQISKDPPLSSKDLDTATGSLKHKLQKARAAIRELPG----IDRSVEEQEEEIEEL   64 (83)
T ss_pred             ccchHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHHHHHhCCC----ccCCHHHHHHHHHHH
Confidence            56777777777776 6665 34567788899999999999999776    355677777776654



Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. This entry represents subunit Med9 of the Mediator complex. Subunit Med9 is part of the middle module of the Mediator complex []; this associates with the core polymerase subunits to form the RNA polymerase II holoenzyme. Med9 alternatively known as the chromosome segregation protein, CSE2 (P33308 from SWISSPROT) is required, along with CSE1 (P33307 from SWISSPROT) for accurate mitotic chromosome segregation in Saccharomyces cerevisiae (Baker's yeast) [].; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex

>PF07197 DUF1409: Protein of unknown function (DUF1409); InterPro: IPR010811 This represents a short conserved region (approximately 50 residues long), sometimes repeated, within a number of hypothetical Oryza sativa proteins of unknown function Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query148
4b8c_D 727 Glucose-repressible alcohol dehydrogenase transcr 92.89
>4b8c_D Glucose-repressible alcohol dehydrogenase transcr effector; hydrolase-cell cycle complex; 3.41A {Saccharomyces cerevisiae S288C} Back     alignment and structure
Probab=92.89  E-value=0.031  Score=48.63  Aligned_cols=12  Identities=25%  Similarity=0.504  Sum_probs=0.0

Q ss_pred             CcceeccCCCCC
Q 032021            8 GSWNMMPSIPSH   19 (148)
Q Consensus         8 GSwtMiPs~~s~   19 (148)
                      .+..|+|..+.+
T Consensus         9 ~~~~~~~~~~~~   20 (727)
T 4b8c_D            9 NSIGMVPTVGTP   20 (727)
T ss_dssp             ------------
T ss_pred             cccCCCcccCCc
Confidence            455677744433




Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00