Citrus Sinensis ID: 045019


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------25
MSTSARAQASHDPNPNLHKPEPKESVNLFSTVNVPPDPLLDSGFFIDAAPPATSGSNTTTTNDQTSKKRGTIREHKSENLATTNGTESALTPETASTRRLTATEAWLPPGWEIEDRVRTSGATAGTVDKYYFHVASGRRFRSKKEVLYFLETGTKRKRRKENSNADMDSSGSAAGSTKQKKPNIKAKTSALNFDYFNSPENVEWVLTDPSEGSWTPFIGKVEVPESVRQDWAAAFTDLTTSNNGSKIC
cccHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEEEcccccccccEEEEEEEcccccccccHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccEEEEEEccccccccccccccccccHHHHHHHHHHHHHHccccccccc
ccccHHHHccccccccccccccccccccccccccccccHcccccEEcccccccccccccccccccccccccccccccccEEccccccccccccHHHHccccccccccccccEEEEEEEccccccccEEEEEEccccccEcccHHHHHHHHHcccccccccccccccccccccccccEEcccccccccccccccccccccccEEEEEEcccccccccccccccccHccHccHHHHHHHHHHcccccccc
mstsaraqashdpnpnlhkpepkesvnlfstvnvppdplldsgffidaappatsgsnttttndqtskkrgtirehksenlattngtesaltpetasTRRLtateawlppgweiedrvrtsgatagtvdKYYFHVasgrrfrskKEVLYFLETGTkrkrrkensnadmdssgsaagstkqkkpnikaktsalnfdyfnspenvewvltdpsegswtpfigkvevpesvRQDWAAAFTdlttsnngskic
mstsaraqashdpnpnlhkpepKESVNLFSTVNVPPDPLLDSGFFIDAAppatsgsnttttndqtskkrgtirehksenlattngtesaltpetastrrltateawlppgweiedrvrtsgatagtvdkyyfhvasgrrfrskkevLYFLetgtkrkrrkensnadmdssgsaagstkqkkpnIKAKTSALNFDYFNSPENVEWVLTDPSEGSWTPFIGKVEVPESVRQDWAAAftdlttsnngskic
MSTSARAQASHDPNPNLHKPEPKESVNLFSTVNVPPDPLLDSGFFIDAAPPATSGSNTTTTNDQTSKKRGTIREHKSENLATTNGTESALTPETASTRRLTATEAWLPPGWEIEDRVRTSGATAGTVDKYYFHVASGRRFRSKKEVLYFLETGTKRKRRKEnsnadmdssgsaagsTKQKKPNIKAKTSALNFDYFNSPENVEWVLTDPSEGSWTPFIGKVEVPESVRQDWAAAFTDLTTSNNGSKIC
****************************************************************************************************TATEAWLPPGWEIEDRVRTSGATAGTVDKYYFHVASGRRFRSKKEVLYFLE**************************************ALNFDYFNSPENVEWVLTDPSEGSWTPFIGKVEVPESVRQDWAAAFTD***********
*********************************************************************************************************WLPPGWEIEDRVRT**ATAGTVDKYYFHVASGRRFRSKKEVLYFLETGT*******************************************************S*GSWTPFIGKVEVPESVRQDWAAAFTDLTTS*******
**************PNLHKPEPKESVNLFSTVNVPPDPLLDSGFFIDAAPPA*******************IREHKSENLATTNGTESALTPETASTRRLTATEAWLPPGWEIEDRVRTSGATAGTVDKYYFHVASGRRFRSKKEVLYFLETGTKR***************************IKAKTSALNFDYFNSPENVEWVLTDPSEGSWTPFIGKVEVPESVRQDWAAAFTDLTTSNNGSKIC
**************PNLHKPEPKESVNLFSTVNVPPDPLLDSGFFIDAA******************************************************EAWLPPGWEIEDRVRTSGATAGTVDKYYFHVASGRRFRSKKEVLYFLETG*********************************KTSALNFDYFNSPENVEWVLTDPSEGSWTPFIGKVEVPESVRQDWAAAFTDLTTS*******
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MSTSARAQASHDPNPNLHKPEPKESVNLFSTVNVPPDPLLDSGFFIDAAPPATSGSNTTTTNDQTSKKRGTIREHKSENLATTNGTESALTPETASTRRLTATEAWLPPGWEIEDRVRTSGATAGTVDKYYFHVASGRRFRSKKEVLYFLETGTKRKRRKENSNADMDSSGSAAGSTKQKKPNIKAKTSALNFDYFNSPENVEWVLTDPSEGSWTPFIGKVEVPESVRQDWAAAFTDLTTSNNGSKIC
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query248 2.2.26 [Sep-21-2011]
Q9LTJ1225 Methyl-CpG-binding domain yes no 0.858 0.946 0.395 1e-35
Q9SNC0182 Methyl-CpG-binding domain no no 0.665 0.906 0.444 2e-31
Q9UBB5 411 Methyl-CpG-binding domain yes no 0.173 0.104 0.5 1e-05
Q9Z2E1 414 Methyl-CpG-binding domain yes no 0.173 0.103 0.5 1e-05
>sp|Q9LTJ1|MBD6_ARATH Methyl-CpG-binding domain-containing protein 6 OS=Arabidopsis thaliana GN=MBD6 PE=1 SV=1 Back     alignment and function desciption
 Score =  150 bits (378), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 87/220 (39%), Positives = 122/220 (55%), Gaps = 7/220 (3%)

Query: 33  NVPPDPLLDSGFFIDAAPPATSGSNTTTTNDQTSKKRGTIREHKSENLATTNGTESA-LT 91
           + PPDPLL SG FI +A   T  S+      Q         E  S  +   NGT+     
Sbjct: 8   DFPPDPLLASGAFISSAGDGTLDSSAKRRPIQGGIGISGSGE--SVRIGMANGTDQVNHQ 65

Query: 92  PETASTRRLTATEAWLPPGWEIEDRVRTSGATAGTVDKYYFHVASGRRFRSKKEVLYFLE 151
            E+ S +R    + WLPPGW +ED++RTSGATAG+VDKYY+   +GR+FRS+ EVLY+LE
Sbjct: 66  TESKSRKRAAPGDNWLPPGWRVEDKIRTSGATAGSVDKYYYEPNTGRKFRSRTEVLYYLE 125

Query: 152 TGTKRK--RRKENSNADMDS-SGSAAGSTKQKKPNIKAKTSALNFDYFNSPENVEWVLTD 208
            GT ++  ++ EN+  + D   G  +    +           L+FD+ N P+ V W + +
Sbjct: 126 HGTSKRGTKKAENTYFNPDHFEGQGSNRVTRTATVPPPPPPPLDFDFKNPPDKVSWSMAN 185

Query: 209 PSEGSWTPFIGKVEVPESVRQDWAAAFTDLTTSNNGSKIC 248
             E  W P IG V+V +SVR+DW+ AFT   TS N SK+ 
Sbjct: 186 AGEEGWIPNIGDVKVQDSVRRDWSTAFT-FITSRNPSKVS 224




Transcriptional regulator that binds CpG, CpNpN and CpNpG (N is A, T, or C) islands in promoters regardless the DNA methylation status. Plays probably a role in gene silencing. May associate with histone deacetylase proteins (HDAC). Required for nucleolar dominance that consist in the silencing of rRNA genes inherited from one progenitor in genetic hybrids. Recruited to rRNA genes in a DRM2-dependent manner.
Arabidopsis thaliana (taxid: 3702)
>sp|Q9SNC0|MBD5_ARATH Methyl-CpG-binding domain-containing protein 5 OS=Arabidopsis thaliana GN=MBD5 PE=1 SV=1 Back     alignment and function description
>sp|Q9UBB5|MBD2_HUMAN Methyl-CpG-binding domain protein 2 OS=Homo sapiens GN=MBD2 PE=1 SV=1 Back     alignment and function description
>sp|Q9Z2E1|MBD2_MOUSE Methyl-CpG-binding domain protein 2 OS=Mus musculus GN=Mbd2 PE=2 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query248
297793491233 hypothetical protein ARALYDRAFT_919168 [ 0.814 0.866 0.403 3e-36
224106479214 methyl binding domain protein [Populus t 0.645 0.747 0.509 1e-35
224059490214 methyl binding domain protein [Populus t 0.806 0.934 0.411 4e-35
15238389225 methyl-CPG-binding domain 6 protein [Ara 0.858 0.946 0.395 7e-34
255584216234 DNA binding protein, putative [Ricinus c 0.745 0.790 0.446 3e-33
297815816181 hypothetical protein ARALYDRAFT_485028 [ 0.665 0.911 0.458 4e-30
15232591182 methyl-CPG-binding domain protein 5 [Ara 0.665 0.906 0.444 1e-29
21554707182 unknown [Arabidopsis thaliana] 0.665 0.906 0.444 1e-29
359478635297 PREDICTED: methyl-CpG-binding domain-con 0.479 0.400 0.503 5e-28
297746071135 unnamed protein product [Vitis vinifera] 0.479 0.881 0.503 1e-26
>gi|297793491|ref|XP_002864630.1| hypothetical protein ARALYDRAFT_919168 [Arabidopsis lyrata subsp. lyrata] gi|297310465|gb|EFH40889.1| hypothetical protein ARALYDRAFT_919168 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
 Score =  157 bits (398), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 92/228 (40%), Positives = 125/228 (54%), Gaps = 26/228 (11%)

Query: 33  NVPPDPLLDSGFFIDAAPPATSGSNTTTTNDQTSKKR------GTIREHKSENLATTNGT 86
           + PPDPLL SG FI +A   T         D +SK+R      G     +S  +   NGT
Sbjct: 8   DFPPDPLLASGAFISSAGDGTL--------DSSSKRRPIQGGIGVSGSGESVRIGMANGT 59

Query: 87  E--------SALTPETASTRRLTATEAWLPPGWEIEDRVRTSGATAGTVDKYYFHVASGR 138
           +        S+   E+ S +R    + WLPPGW +ED++RTSGATAG+VDKYY+   +GR
Sbjct: 60  DQVNPSPDSSSHQAESKSRKRAAPGDNWLPPGWRVEDKIRTSGATAGSVDKYYYEPNTGR 119

Query: 139 RFRSKKEVLYFLETGT-KRKRRKENSNADMDSS---GSAAGSTKQKKPNIKAKTSALNFD 194
           +FRS+ EVLY+LE GT KR   K+  N D  S    G  +   ++K          L+FD
Sbjct: 120 KFRSRTEVLYYLEHGTSKRGGTKKVENTDFLSDHFEGQGSNRARRKAKEPPPPPPPLDFD 179

Query: 195 YFNSPENVEWVLTDPSEGSWTPFIGKVEVPESVRQDWAAAFTDLTTSN 242
           + N PE V W +    E +W P IG V+V +SVR+DW+ AFT +T  N
Sbjct: 180 FKNPPEKVSWSMAKAGEEAWIPIIGDVKVQDSVRRDWSTAFTFITNRN 227




Source: Arabidopsis lyrata subsp. lyrata

Species: Arabidopsis lyrata

Genus: Arabidopsis

Family: Brassicaceae

Order: Brassicales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224106479|ref|XP_002314180.1| methyl binding domain protein [Populus trichocarpa] gi|222850588|gb|EEE88135.1| methyl binding domain protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224059490|ref|XP_002299872.1| methyl binding domain protein [Populus trichocarpa] gi|118483055|gb|ABK93437.1| unknown [Populus trichocarpa] gi|222847130|gb|EEE84677.1| methyl binding domain protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|15238389|ref|NP_200746.1| methyl-CPG-binding domain 6 protein [Arabidopsis thaliana] gi|75180469|sp|Q9LTJ1.1|MBD6_ARATH RecName: Full=Methyl-CpG-binding domain-containing protein 6; Short=AtMBD6; Short=MBD06; AltName: Full=Methyl-CpG-binding protein MBD6 gi|8885544|dbj|BAA97474.1| unnamed protein product [Arabidopsis thaliana] gi|28416643|gb|AAO42852.1| At5g59380 [Arabidopsis thaliana] gi|110743343|dbj|BAE99559.1| hypothetical protein [Arabidopsis thaliana] gi|332009795|gb|AED97178.1| methyl-CPG-binding domain 6 protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|255584216|ref|XP_002532846.1| DNA binding protein, putative [Ricinus communis] gi|223527383|gb|EEF29524.1| DNA binding protein, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|297815816|ref|XP_002875791.1| hypothetical protein ARALYDRAFT_485028 [Arabidopsis lyrata subsp. lyrata] gi|297321629|gb|EFH52050.1| hypothetical protein ARALYDRAFT_485028 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|15232591|ref|NP_190242.1| methyl-CPG-binding domain protein 5 [Arabidopsis thaliana] gi|75206905|sp|Q9SNC0.1|MBD5_ARATH RecName: Full=Methyl-CpG-binding domain-containing protein 5; Short=AtMBD5; Short=MBD05; AltName: Full=Methyl-CpG-binding protein MBD5 gi|6523061|emb|CAB62328.1| putative protein [Arabidopsis thaliana] gi|17380750|gb|AAL36205.1| unknown protein [Arabidopsis thaliana] gi|23296870|gb|AAN13191.1| unknown protein [Arabidopsis thaliana] gi|332644653|gb|AEE78174.1| methyl-CPG-binding domain protein 5 [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|21554707|gb|AAM63666.1| unknown [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|359478635|ref|XP_003632148.1| PREDICTED: methyl-CpG-binding domain-containing protein 6-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|297746071|emb|CBI16127.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query248
TAIR|locus:2148308225 MBD6 "methyl-CPG-binding domai 0.846 0.933 0.387 4e-35
TAIR|locus:2075135182 MBD5 "methyl-CPG-binding domai 0.641 0.873 0.440 3.8e-30
UNIPROTKB|F1RPM5 360 MBD2 "Uncharacterized protein" 0.217 0.15 0.438 0.00023
UNIPROTKB|D4A986 380 D4A986 "Uncharacterized protei 0.217 0.142 0.438 0.00025
UNIPROTKB|Q9UBB5 411 MBD2 "Methyl-CpG-binding domai 0.217 0.131 0.438 0.00028
MGI|MGI:1333813 414 Mbd2 "methyl-CpG binding domai 0.217 0.130 0.438 0.00029
TAIR|locus:2148308 MBD6 "methyl-CPG-binding domain 6" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 380 (138.8 bits), Expect = 4.0e-35, P = 4.0e-35
 Identities = 84/217 (38%), Positives = 119/217 (54%)

Query:    35 PPDPLLDSGFFIDAAPPATSGSNTTTTNDQTSKKRGTIREHKSENLATTNGTESAL-TPE 93
             PPDPLL SG FI +A   T  S+      Q     G     +S  +   NGT+      E
Sbjct:    10 PPDPLLASGAFISSAGDGTLDSSAKRRPIQGGI--GISGSGESVRIGMANGTDQVNHQTE 67

Query:    94 TASTRRLTATEAWLPPGWEIEDRVRTSGATAGTVDKYYFHVASGRRFRSKKEVLYFLETG 153
             + S +R    + WLPPGW +ED++RTSGATAG+VDKYY+   +GR+FRS+ EVLY+LE G
Sbjct:    68 SKSRKRAAPGDNWLPPGWRVEDKIRTSGATAGSVDKYYYEPNTGRKFRSRTEVLYYLEHG 127

Query:   154 TKRKRRKEXXXXXXXXXXXXXXXTKQ--KKPNIKAKTSA-LNFDYFNSPENVEWVLTDPS 210
             T ++  K+               + +  +   +       L+FD+ N P+ V W + +  
Sbjct:   128 TSKRGTKKAENTYFNPDHFEGQGSNRVTRTATVPPPPPPPLDFDFKNPPDKVSWSMANAG 187

Query:   211 EGSWTPFIGKVEVPESVRQDWAAAFTDLTTSNNGSKI 247
             E  W P IG V+V +SVR+DW+ AFT +T S N SK+
Sbjct:   188 EEGWIPNIGDVKVQDSVRRDWSTAFTFIT-SRNPSKV 223




GO:0003677 "DNA binding" evidence=IEA;ISS
GO:0005634 "nucleus" evidence=ISM;IEA;IDA
GO:0008150 "biological_process" evidence=ND
GO:0008327 "methyl-CpG binding" evidence=ISS;IDA
GO:0051747 "cytosine C-5 DNA demethylase activity" evidence=IDA
GO:0005720 "nuclear heterochromatin" evidence=IDA
GO:0010370 "perinucleolar chromocenter" evidence=IDA
GO:0019899 "enzyme binding" evidence=IPI
GO:0005515 "protein binding" evidence=IPI
GO:0043621 "protein self-association" evidence=IDA
GO:0005731 "nucleolus organizer region" evidence=IDA
TAIR|locus:2075135 MBD5 "methyl-CPG-binding domain protein 5" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|F1RPM5 MBD2 "Uncharacterized protein" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms
UNIPROTKB|D4A986 D4A986 "Uncharacterized protein" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
UNIPROTKB|Q9UBB5 MBD2 "Methyl-CpG-binding domain protein 2" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
MGI|MGI:1333813 Mbd2 "methyl-CpG binding domain protein 2" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q9LTJ1MBD6_ARATHNo assigned EC number0.39540.85880.9466yesno

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
scaffold_802568.1
annotation not avaliable (233 aa)
(Arabidopsis lyrata)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query248
cd0139677 cd01396, MeCP2_MBD, MeCP2, MBD1, MBD2, MBD3, and M 6e-11
pfam0142975 pfam01429, MBD, Methyl-CpG binding domain 2e-10
cd0012262 cd00122, MBD, MeCP2, MBD1, MBD2, MBD3, MBD4, CLLD8 4e-09
smart0039177 smart00391, MBD, Methyl-CpG binding domain 8e-06
>gnl|CDD|238690 cd01396, MeCP2_MBD, MeCP2, MBD1, MBD2, MBD3, and MBD4 are members of a protein family that share the methyl-CpG-binding domain (MBD) Back     alignment and domain information
 Score = 56.6 bits (137), Expect = 6e-11
 Identities = 22/51 (43%), Positives = 29/51 (56%), Gaps = 2/51 (3%)

Query: 105 AWLPPGWEIEDRVRTSGATAGTVDKYYFHVASGRRFRSKKEVLYFLETGTK 155
             LPPGW+ E   R SG +AG  D YY    +G++FRSK E+  +LE    
Sbjct: 5   PRLPPGWKRELVPRKSG-SAGKFDVYYIS-PTGKKFRSKVELARYLEKNGP 53


The MBD, consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin. MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1. Length = 77

>gnl|CDD|216497 pfam01429, MBD, Methyl-CpG binding domain Back     alignment and domain information
>gnl|CDD|238069 cd00122, MBD, MeCP2, MBD1, MBD2, MBD3, MBD4, CLLD8-like, and BAZ2A-like proteins constitute a family of proteins that share the methyl-CpG-binding domain (MBD) Back     alignment and domain information
>gnl|CDD|128673 smart00391, MBD, Methyl-CpG binding domain Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 248
cd0139677 MeCP2_MBD MeCP2, MBD1, MBD2, MBD3, and MBD4 are me 99.74
PF0142977 MBD: Methyl-CpG binding domain; InterPro: IPR00173 99.74
cd0012262 MBD MeCP2, MBD1, MBD2, MBD3, MBD4, CLLD8-like, and 99.69
smart0039177 MBD Methyl-CpG binding domain. Methyl-CpG binding 99.68
KOG4161272 consensus Methyl-CpG binding transcription regulat 99.53
cd0139773 HAT_MBD Methyl-CpG binding domains (MBD) present i 99.48
cd0139560 HMT_MBD Methyl-CpG binding domains (MBD) present i 98.5
PF0039731 WW: WW domain; InterPro: IPR001202 Synonym(s): Rsp 90.51
>cd01396 MeCP2_MBD MeCP2, MBD1, MBD2, MBD3, and MBD4 are members of a protein family that share the methyl-CpG-binding domain (MBD) Back     alignment and domain information
Probab=99.74  E-value=3e-18  Score=130.02  Aligned_cols=60  Identities=37%  Similarity=0.654  Sum_probs=55.0

Q ss_pred             CCCCCCCCceEEEEEccCCCCCCceeEEEeeCCCCCeeecHHHHHHHHHhCCcccccccCCc
Q 045019          103 TEAWLPPGWEIEDRVRTSGATAGTVDKYYFHVASGRRFRSKKEVLYFLETGTKRKRRKENSN  164 (248)
Q Consensus       103 ~~~~LP~GW~rEv~~RksG~SaGk~DvYYisP~sGkKFRSK~EV~rYL~sg~~~~~~~en~~  164 (248)
                      .++.||+||+||+++|++| +++++|+|||+| +|++|||+.||++||+.......+.+||+
T Consensus         3 ~~~~lp~GW~r~~~~R~~g-s~~k~DvyY~sP-~Gkk~RS~~ev~~yL~~~~~~~~~~~~Fd   62 (77)
T cd01396           3 EDPRLPPGWKRELVPRKSG-SAGKFDVYYISP-TGKKFRSKVELARYLEKNGPTSLDLSDFD   62 (77)
T ss_pred             CCCCCCCCCEEEEEEecCC-CCCcceEEEECC-CCCEEECHHHHHHHHHhCCCCCCcHhHcc
Confidence            4667999999999999999 889999999999 89999999999999999876667778888



The MBD, consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin. MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1.

>PF01429 MBD: Methyl-CpG binding domain; InterPro: IPR001739 Methylation at CpG dinucleotide, the most common DNA modification in eukaryotes, has been correlated with gene silencing associated with various phenomena such as genomic imprinting, transposon and chromosome X inactivation, differentiation, and cancer Back     alignment and domain information
>cd00122 MBD MeCP2, MBD1, MBD2, MBD3, MBD4, CLLD8-like, and BAZ2A-like proteins constitute a family of proteins that share the methyl-CpG-binding domain (MBD) Back     alignment and domain information
>smart00391 MBD Methyl-CpG binding domain Back     alignment and domain information
>KOG4161 consensus Methyl-CpG binding transcription regulators [Transcription; Chromatin structure and dynamics] Back     alignment and domain information
>cd01397 HAT_MBD Methyl-CpG binding domains (MBD) present in putative chromatin remodelling factor such as BAZ2A; BAZ2A contains a MBD, DDT, PHD-type zinc finger and Bromo domain suggesting that BAZ2A might be associated with histone acetyltransferase (HAT) activity Back     alignment and domain information
>cd01395 HMT_MBD Methyl-CpG binding domains (MBD) present in putative histone methyltransferases (HMT) such as CLLD8 and SETDB1 proteins; CLLD8 contains a MBD, a PreSET and a bifurcated SET domain, suggesting that CLLD8 might be associated with methylation-mediated transcriptional repression Back     alignment and domain information
>PF00397 WW: WW domain; InterPro: IPR001202 Synonym(s): Rsp5 or WWP domain The WW domain is a short conserved region in a number of unrelated proteins, which folds as a stable, triple stranded beta-sheet Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query248
2ky8_A72 Solution Structure And Dynamic Analysis Of Chicken 2e-05
>pdb|2KY8|A Chain A, Solution Structure And Dynamic Analysis Of Chicken Mbd2 Methyl Binding Domain Bound To A Target Methylated Dna Sequence Length = 72 Back     alignment and structure

Iteration: 1

Score = 46.2 bits (108), Expect = 2e-05, Method: Composition-based stats. Identities = 22/44 (50%), Positives = 31/44 (70%), Gaps = 1/44 (2%) Query: 107 LPPGWEIEDRVRTSGATAGTVDKYYFHVASGRRFRSKKEVLYFL 150 LPPGW+ E+ +R SG +AG D YYF SG++FRSK ++ +L Sbjct: 13 LPPGWKKEEVIRKSGLSAGKSDVYYFS-PSGKKFRSKPQLARYL 55

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query248
2ky8_A72 Methyl-CPG-binding domain protein 2; DNA binding d 9e-13
1d9n_A75 Methyl-CPG-binding protein MBD1; PCM1, methylation 5e-11
3c2i_A97 Methyl-CPG-binding protein 2; water mediated recog 1e-09
1ub1_A133 MECP2, attachment region binding protein; chicken 5e-08
>2ky8_A Methyl-CPG-binding domain protein 2; DNA binding domain, transcription-DNA complex; HET: DNA 5CM TED; NMR {Gallus gallus} Length = 72 Back     alignment and structure
 Score = 61.1 bits (148), Expect = 9e-13
 Identities = 23/59 (38%), Positives = 34/59 (57%), Gaps = 1/59 (1%)

Query: 95  ASTRRLTATEAWLPPGWEIEDRVRTSGATAGTVDKYYFHVASGRRFRSKKEVLYFLETG 153
            S ++       LPPGW+ E+ +R SG +AG  D YYF   SG++FRSK ++  +L   
Sbjct: 1   GSDKQGRTDCPALPPGWKKEEVIRKSGLSAGKSDVYYFS-PSGKKFRSKPQLARYLGNA 58


>1d9n_A Methyl-CPG-binding protein MBD1; PCM1, methylation, DNA binding domain, gene regulation; NMR {Homo sapiens} SCOP: d.10.1.3 PDB: 1ig4_A* Length = 75 Back     alignment and structure
>3c2i_A Methyl-CPG-binding protein 2; water mediated recognition; HET: DNA 5CM; 2.50A {Homo sapiens} PDB: 1qk9_A Length = 97 Back     alignment and structure
>1ub1_A MECP2, attachment region binding protein; chicken methyl-CPG-binding protein 2 (cmecp2), MAR-binding protein (ARBP), spectroscopy; NMR {Gallus gallus} SCOP: d.10.1.3 Length = 133 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query248
3c2i_A97 Methyl-CPG-binding protein 2; water mediated recog 99.8
3vxv_A69 Methyl-CPG-binding domain protein 4; methyl CPG bi 99.79
1ub1_A133 MECP2, attachment region binding protein; chicken 99.78
2ky8_A72 Methyl-CPG-binding domain protein 2; DNA binding d 99.78
1d9n_A75 Methyl-CPG-binding protein MBD1; PCM1, methylation 99.77
1ymz_A43 CC45; artificial protein, computational design, un 83.76
2jv4_A54 Peptidyl-prolyl CIS/trans isomerase; ppiase domain 83.03
2ysh_A40 GAS-7, growth-arrest-specific protein 7; WW domain 82.57
1wr3_A36 Ubiquitin-protein ligase NEDD4-2; all-beta; NMR {M 82.54
2ysg_A40 Syntaxin-binding protein 4; synip, STXBP4, WW doma 82.44
2law_A38 Yorkie homolog; YAP, SMAD1, CDK, signal transducti 82.36
2ysf_A40 E3 ubiquitin-protein ligase itchy homolog; AIP4, N 81.67
1wr4_A36 Ubiquitin-protein ligase NEDD4-2; all-beta; NMR {M 81.65
1wr7_A41 NEDD4-2; all-beta, ligase; NMR {Mus musculus} 81.35
1e0m_A37 Wwprototype; SH3 prototype, protein design, de nov 80.04
>3c2i_A Methyl-CPG-binding protein 2; water mediated recognition; HET: DNA 5CM; 2.50A {Homo sapiens} PDB: 1qk9_A Back     alignment and structure
Probab=99.80  E-value=8.2e-20  Score=143.92  Aligned_cols=63  Identities=29%  Similarity=0.512  Sum_probs=57.4

Q ss_pred             ccCCCCCCCCceEEEEEccCCCCCCceeEEEeeCCCCCeeecHHHHHHHHHhCCcccccccCCc
Q 045019          101 TATEAWLPPGWEIEDRVRTSGATAGTVDKYYFHVASGRRFRSKKEVLYFLETGTKRKRRKENSN  164 (248)
Q Consensus       101 ~~~~~~LP~GW~rEv~~RksG~SaGk~DvYYisP~sGkKFRSK~EV~rYL~sg~~~~~~~en~~  164 (248)
                      ...+++||+||++|+++|++|.++|++|+|||+| +|++|||+.||++||+.......+.++|+
T Consensus        18 ~~~~~~lP~GW~re~~~R~~G~s~gk~DvYY~sP-~GkkfRSk~ev~ryL~~~g~~~~~~~~Fd   80 (97)
T 3c2i_A           18 MYDDPTLPEGWTRKLKQRKSGRSAGKYDVYLINP-QGKAFRSKVELIMYFEKVGDTSLDPNDFD   80 (97)
T ss_dssp             CCCCTTSCTTCEEEEEECCSSTTTTCEEEEEECT-TSCEECSHHHHHHHHHHHTCCSSCTTTCC
T ss_pred             ccCCCCCCCCCEEEEEEecCCCCCCcceEEEECC-CCCEEECHHHHHHHHHHCCCCCCCHhhcc
Confidence            4678899999999999999999999999999999 89999999999999997665556778887



>3vxv_A Methyl-CPG-binding domain protein 4; methyl CPG binding domain, protein-DNA complex, versatIle BA recognition, hydrolase-DNA complex; HET: DNA 5CM; 2.00A {Mus musculus} PDB: 3vxx_A* 3vyb_A* 3vyq_A* Back     alignment and structure
>1ub1_A MECP2, attachment region binding protein; chicken methyl-CPG-binding protein 2 (cmecp2), MAR-binding protein (ARBP), spectroscopy; NMR {Gallus gallus} SCOP: d.10.1.3 Back     alignment and structure
>2ky8_A Methyl-CPG-binding domain protein 2; DNA binding domain, transcription-DNA complex; HET: DNA 5CM TED; NMR {Gallus gallus} Back     alignment and structure
>1d9n_A Methyl-CPG-binding protein MBD1; PCM1, methylation, DNA binding domain, gene regulation; NMR {Homo sapiens} SCOP: d.10.1.3 PDB: 1ig4_A* Back     alignment and structure
>1ymz_A CC45; artificial protein, computational design, unknown function; NMR {Synthetic} SCOP: k.22.1.1 Back     alignment and structure
>2jv4_A Peptidyl-prolyl CIS/trans isomerase; ppiase domain, WW domain group IV, rotamase; NMR {Emericella nidulans} Back     alignment and structure
>2ysh_A GAS-7, growth-arrest-specific protein 7; WW domain, structural genomics, NPPSFA, national project on protein structural and functional analyses; NMR {Homo sapiens} SCOP: k.22.1.1 Back     alignment and structure
>1wr3_A Ubiquitin-protein ligase NEDD4-2; all-beta; NMR {Mus musculus} Back     alignment and structure
>2ysg_A Syntaxin-binding protein 4; synip, STXBP4, WW domain, structural genomics, NPPSFA, national project on protein structural and functional analyses; NMR {Homo sapiens} SCOP: k.22.1.1 Back     alignment and structure
>2law_A Yorkie homolog; YAP, SMAD1, CDK, signal transduction, signaling protein-TRAN complex; NMR {Homo sapiens} Back     alignment and structure
>2ysf_A E3 ubiquitin-protein ligase itchy homolog; AIP4, NAPP1, WW domain, structural genomics, NPPSFA; NMR {Homo sapiens} SCOP: k.22.1.1 Back     alignment and structure
>1wr4_A Ubiquitin-protein ligase NEDD4-2; all-beta; NMR {Mus musculus} PDB: 2lb2_A* Back     alignment and structure
>1wr7_A NEDD4-2; all-beta, ligase; NMR {Mus musculus} Back     alignment and structure
>1e0m_A Wwprototype; SH3 prototype, protein design, de novo protein; NMR {} SCOP: k.22.1.1 Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 248
d1ig4a_75 d.10.1.3 (A:) Methylation-dependent transcriptiona 2e-11
d1qk9a_92 d.10.1.3 (A:) Methyl-CpG-binding protein 2, MECP2 1e-10
>d1ig4a_ d.10.1.3 (A:) Methylation-dependent transcriptional repressor MBD1/PCM1 {Human (Homo sapiens) [TaxId: 9606]} Length = 75 Back     information, alignment and structure

class: Alpha and beta proteins (a+b)
fold: DNA-binding domain
superfamily: DNA-binding domain
family: Methyl-CpG-binding domain, MBD
domain: Methylation-dependent transcriptional repressor MBD1/PCM1
species: Human (Homo sapiens) [TaxId: 9606]
 Score = 56.0 bits (135), Expect = 2e-11
 Identities = 20/48 (41%), Positives = 25/48 (52%), Gaps = 1/48 (2%)

Query: 106 WLPPGWEIEDRVRTSGATAGTVDKYYFHVASGRRFRSKKEVLYFLETG 153
            L PGW+  +  R SGAT G  D YY    +G R RSK E+  +L   
Sbjct: 10  ALGPGWKRREVFRKSGATCGRSDTYYQS-PTGDRIRSKVELTRYLGPA 56


>d1qk9a_ d.10.1.3 (A:) Methyl-CpG-binding protein 2, MECP2 {Human (Homo sapiens) [TaxId: 9606]} Length = 92 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query248
d1ig4a_75 Methylation-dependent transcriptional repressor MB 99.77
d1qk9a_92 Methyl-CpG-binding protein 2, MECP2 {Human (Homo s 99.76
d1pina134 Mitotic rotamase PIN1 {Human (Homo sapiens) [TaxId 87.1
>d1ig4a_ d.10.1.3 (A:) Methylation-dependent transcriptional repressor MBD1/PCM1 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
class: Alpha and beta proteins (a+b)
fold: DNA-binding domain
superfamily: DNA-binding domain
family: Methyl-CpG-binding domain, MBD
domain: Methylation-dependent transcriptional repressor MBD1/PCM1
species: Human (Homo sapiens) [TaxId: 9606]
Probab=99.77  E-value=1.7e-19  Score=134.36  Aligned_cols=59  Identities=34%  Similarity=0.475  Sum_probs=53.1

Q ss_pred             ccCCCCCCCCceEEEEEccCCCCCCceeEEEeeCCCCCeeecHHHHHHHHHhCCcccccccCCc
Q 045019          101 TATEAWLPPGWEIEDRVRTSGATAGTVDKYYFHVASGRRFRSKKEVLYFLETGTKRKRRKENSN  164 (248)
Q Consensus       101 ~~~~~~LP~GW~rEv~~RksG~SaGk~DvYYisP~sGkKFRSK~EV~rYL~sg~~~~~~~en~~  164 (248)
                      .++|+.||+||+||+++|++|.++|+.|+|||+| +|++|||+.||.+||..    .++.++|+
T Consensus         5 ~~~~p~LP~GW~re~~~Rk~g~~~gk~DvyY~sP-~Gkk~RS~~ev~~yL~~----~l~~~~Fd   63 (75)
T d1ig4a_           5 WLDCPALGPGWKRREVFRKSGATCGRSDTYYQSP-TGDRIRSKVELTRYLGP----ACDLTLFD   63 (75)
T ss_dssp             EEECTTTCTTCEEEEECCCSSSSTTCCEEEEECS-SSCEECSHHHHHHHHCS----SSCCTTBC
T ss_pred             cccCCCCCCCcEEEEEEEcCCCCCCceeEEEECC-CCCEEeCHHHHHHHhcc----CCCcccee
Confidence            3567899999999999999999999999999999 89999999999999954    36667887



>d1qk9a_ d.10.1.3 (A:) Methyl-CpG-binding protein 2, MECP2 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1pina1 b.72.1.1 (A:6-39) Mitotic rotamase PIN1 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure