Query psy8081
Match_columns 463
No_of_seqs 306 out of 1557
Neff 6.5
Searched_HMMs 46136
Date Fri Aug 16 17:55:51 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8081.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8081hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1141|consensus 100.0 7E-104 2E-108 831.3 13.0 248 9-276 620-869 (1262)
2 KOG1082|consensus 100.0 1.8E-52 3.9E-57 431.2 17.3 288 71-462 55-353 (364)
3 KOG4442|consensus 100.0 5.6E-41 1.2E-45 354.4 10.7 164 178-462 94-259 (729)
4 KOG1080|consensus 99.9 7.4E-28 1.6E-32 269.3 9.0 133 204-463 868-1005(1005)
5 KOG1079|consensus 99.9 1.3E-27 2.8E-32 251.9 9.6 140 178-441 559-712 (739)
6 smart00317 SET SET (Su(var)3-9 99.9 3.2E-23 6.8E-28 176.8 12.8 55 379-437 62-116 (116)
7 KOG1141|consensus 99.9 3.9E-23 8.4E-28 220.5 8.7 420 7-463 821-1262(1262)
8 smart00468 PreSET N-terminal t 99.9 8.1E-22 1.8E-26 167.0 7.2 95 79-186 2-98 (98)
9 PF05033 Pre-SET: Pre-SET moti 99.9 8.4E-22 1.8E-26 168.0 7.1 101 80-194 1-103 (103)
10 KOG1083|consensus 99.8 4.2E-21 9.1E-26 209.8 2.1 132 190-441 1165-1297(1306)
11 KOG1085|consensus 99.7 4.7E-17 1E-21 157.8 8.1 128 197-440 251-379 (392)
12 COG2940 Proteins containing SE 99.6 1.8E-16 4E-21 169.5 5.6 166 178-462 308-479 (480)
13 cd01395 HMT_MBD Methyl-CpG bin 99.5 1.9E-14 4.1E-19 110.4 3.3 39 6-59 22-60 (60)
14 PF00856 SET: SET domain; Int 99.4 8.2E-13 1.8E-17 117.3 8.2 55 380-438 108-162 (162)
15 smart00391 MBD Methyl-CpG bind 99.0 4.3E-10 9.4E-15 91.2 3.7 49 6-69 26-75 (77)
16 cd01397 HAT_MBD Methyl-CpG bin 98.8 1.9E-09 4.2E-14 85.9 3.4 47 5-66 22-69 (73)
17 KOG1081|consensus 98.8 1.8E-09 3.9E-14 114.8 3.1 75 381-462 362-436 (463)
18 cd00122 MBD MeCP2, MBD1, MBD2, 98.7 1.5E-08 3.1E-13 78.9 3.4 40 5-59 22-62 (62)
19 PF01429 MBD: Methyl-CpG bindi 98.5 5E-08 1.1E-12 79.2 2.9 46 6-66 29-76 (77)
20 cd01396 MeCP2_MBD MeCP2, MBD1, 98.0 4.4E-06 9.6E-11 67.8 3.0 45 5-64 23-68 (77)
21 KOG2589|consensus 97.8 2E-05 4.3E-10 79.9 4.6 57 389-455 195-252 (453)
22 KOG2461|consensus 97.5 8.9E-05 1.9E-09 77.7 4.4 57 378-442 86-147 (396)
23 smart00508 PostSET Cysteine-ri 96.4 0.0016 3.4E-08 41.8 1.3 15 448-462 2-16 (26)
24 KOG2084|consensus 92.2 0.16 3.5E-06 53.5 4.6 55 393-455 208-271 (482)
25 smart00570 AWS associated with 92.0 0.078 1.7E-06 39.7 1.3 22 178-199 28-49 (51)
26 KOG1337|consensus 77.9 1.8 4E-05 46.7 3.2 40 393-439 239-278 (472)
27 KOG3813|consensus 64.6 3.2 6.9E-05 44.7 1.2 20 125-145 307-326 (640)
28 PF03638 TCR: Tesmin/TSO1-like 51.2 8.7 0.00019 27.6 1.2 37 125-195 3-40 (42)
29 PF11403 Yeast_MT: Yeast metal 49.7 10 0.00023 25.8 1.3 19 125-143 21-39 (40)
30 PF08666 SAF: SAF domain; Int 32.0 27 0.0006 26.1 1.4 15 420-434 3-17 (63)
31 KOG1171|consensus 27.4 23 0.00049 37.6 0.3 37 123-193 215-252 (406)
32 smart00317 SET SET (Su(var)3-9 22.2 1.4E+02 0.0031 24.1 4.3 18 213-230 97-114 (116)
No 1
>KOG1141|consensus
Probab=100.00 E-value=7.4e-104 Score=831.26 Aligned_cols=248 Identities=33% Similarity=0.513 Sum_probs=238.7
Q ss_pred EEEEecCcCCCCCCHHHHHHHHHhccc-cceeecccccccccCCcccccccccchhhcccccccccceeeecccCCCCCC
Q psy8081 9 CIMYTAPCGRTLRTSDQLVLYLFITKA-KWTIDMFEYDHFVSSKWTIDMFEYDHFVDCLREFVIENANITIKDMSNGREN 87 (463)
Q Consensus 9 ~v~y~~pCg~~lr~~~ev~~yl~~t~~-~l~~~~~~~~~~~~~~~~~d~f~f~~~v~~~~~~~~~~~~~~~~DiS~G~E~ 87 (463)
.|+||+|||++||+|.||.|||++|+| +|++ ++|+|++||.+.|.|++.++++++.||++|+|.
T Consensus 620 hv~yktpcg~~lr~~~el~ryL~et~c~flf~---------------~~f~~~~yV~~~r~~~p~kp~~~~~Di~~g~e~ 684 (1262)
T KOG1141|consen 620 HVEYKTPCGMPLRMRIELYRYLVETRCKFLFV---------------IGFDRAFYVVRHRAPNPLKPGNRCTDIPCGREH 684 (1262)
T ss_pred eeeccCCCccchHHHHHHHHHHHHhcCcEEEE---------------eecccchheeecccCCCcCCcceeccccCCccc
Confidence 699999999999999999999999999 6788 999999999999999999999999999999999
Q ss_pred CCeEEEcCCCCCCCCCceEcccccCCCCccccCCCCCccccCCCCCCCCCCCcccccccccCccccCCCCCCCCcccccc
Q psy8081 88 VPISCVNYIDTDVPKTVDYMTERKPKEGVTINTNKEFLVCCDCTDDCRDRNNCACWQLTIKGSRDLWNVSEPKDFVGYQN 167 (463)
Q Consensus 88 vPI~~vN~iD~~~pp~f~Y~~~~~p~~gv~l~~~~~f~~gCdC~d~C~d~~~C~C~~~t~~g~~~~~~~~~~~~~~gy~~ 167 (463)
|||+++|+||..+||.+.|.+++||+.++..+..++|++||||.+||+|.++|+|.|+|++..... +..+..++.||+|
T Consensus 685 vpis~~neids~~lpq~ay~K~~ip~~~nl~n~~~~fl~scdc~~gcid~~kcachQltvk~~~t~-p~~~v~~t~gyky 763 (1262)
T KOG1141|consen 685 VPISEKNEIDSHRLPQAAYKKHMIPTNNNLSNRRKDFLQSCDCPTGCIDSMKCACHQLTVKKKTTG-PNQNVASTNGYKY 763 (1262)
T ss_pred cccceeecccCcCCccchhheeeccCCCcccccChhhhhcCCCCcchhhhhhhhHHHHHHHhhccC-CCcccccCcchhh
Confidence 999999999999999999999999999999999999999999999999999999999999887777 7778899999999
Q ss_pred ccCCcCCccceeecCCCCCCC-CCCCCceeccCCcccEEEEEeCCcceEEEeCCCCCCCCeEEEEeeEEeChhhhhhhcc
Q psy8081 168 RRLPEHVVSGIFECNDLCKCK-HTCHNRVVQFPMLQKLQLFKTEMKGWGLRCLNDIPQGTFICIYAGHLLTDSDANEEGK 246 (463)
Q Consensus 168 ~rL~~~~~tgIyECn~~C~C~-~~C~NRvvQ~g~~~rLqVFkT~~kGWGVR~l~dI~kGtFVc~Y~Gellt~~~a~~~~~ 246 (463)
|||++.+|||+|||+.+|+|. ++|+||+||||+|+|||+|||.+||||+||++||.+|+|||+|+|.+|+++.+++.+.
T Consensus 764 KRl~e~~ptg~yEc~k~ckc~~~~C~nrmvqhg~qvRlq~fkt~~kGWg~rclddi~~g~fVciy~g~~l~~~~sdks~~ 843 (1262)
T KOG1141|consen 764 KRLIEIRPTGPYECLKACKCCGPDCLNRMVQHGYQVRLQRFKTIHKGWGRRCLDDITGGNFVCIYPGGALLHQISDKSEY 843 (1262)
T ss_pred HHHHHhcCCCHHHHHHhhccCcHHHHHHHhhcCceeEeeeccccccccceEeeeecCCceEEEEecchhhhhhhchhhhh
Confidence 999999999999999999999 9999999999999999999999999999999999999999999999999999999999
Q ss_pred ccCCcchhchhhHHHHHHhhhcccCCCCcc
Q psy8081 247 NYGDEYLAELDFIETVERYKEAYESDVPEE 276 (463)
Q Consensus 247 ~~gdeYl~~ld~ie~ve~~k~~~e~~~~~~ 276 (463)
.++++||+.||+ +.++++|+++...+
T Consensus 844 ~~~~~~~~~id~----~~f~~~~dt~~~~t 869 (1262)
T KOG1141|consen 844 IHVTRSLLTIDC----FSFDARIDTATYIT 869 (1262)
T ss_pred cccchhhhcccc----cchhccccccceee
Confidence 999999999998 57888888877653
No 2
>KOG1082|consensus
Probab=100.00 E-value=1.8e-52 Score=431.20 Aligned_cols=288 Identities=31% Similarity=0.560 Sum_probs=212.2
Q ss_pred cccceeeecccCCCCCCCCeEEEcCCCCCCCCCceEcccccCCCCccccCCCCCccccCCCCCCCCCCC--ccccccccc
Q psy8081 71 IENANITIKDMSNGRENVPISCVNYIDTDVPKTVDYMTERKPKEGVTINTNKEFLVCCDCTDDCRDRNN--CACWQLTIK 148 (463)
Q Consensus 71 ~~~~~~~~~DiS~G~E~vPI~~vN~iD~~~pp~f~Y~~~~~p~~gv~l~~~~~f~~gCdC~d~C~d~~~--C~C~~~t~~ 148 (463)
+........||+.|.|.+||+++|+||...++.|.|+++.+..+| .........+|.|.+.|..... |.|.+.+..
T Consensus 55 ~~~~~~~~~d~~~~~e~~~v~~~n~id~~~~~~f~y~~~~~~~~~--~~~~~~~~~~c~C~~~~~~~~~~~C~C~~~n~~ 132 (364)
T KOG1082|consen 55 KLEAKSELEDIALGSENLPVPLVNRIDEDAPLYFQYIATEIVDPG--ELSDCENSTGCRCCSSCSSVLPLTCLCERHNGG 132 (364)
T ss_pred ccccccccccccCccccCceeeeeeccCCccccceeccccccCcc--ccccCccccCCCccCCCCCCCCccccChHhhCC
Confidence 345566789999999999999999999887799999999888776 2233467889999998876433 999987643
Q ss_pred CccccCCCCCCCCccccccccCCcCCccceeecCCCCCCCCCCCCceeccCCcccEEEEEeCCcceEEEeCCCCCCCCeE
Q psy8081 149 GSRDLWNVSEPKDFVGYQNRRLPEHVVSGIFECNDLCKCKHTCHNRVVQFPMLQKLQLFKTEMKGWGLRCLNDIPQGTFI 228 (463)
Q Consensus 149 g~~~~~~~~~~~~~~gy~~~rL~~~~~tgIyECn~~C~C~~~C~NRvvQ~g~~~rLqVFkT~~kGWGVR~l~dI~kGtFV 228 (463)
...+..++.. ...+ .....||||+..|+|...|.|||+|+|++.+|+||+|+.||||||+++.|++|+||
T Consensus 133 ~~~~~~~~~~--~~~~--------~~~~~i~EC~~~C~C~~~C~nRv~q~g~~~~leIfrt~~kGwgvRs~~~I~~G~fv 202 (364)
T KOG1082|consen 133 LVAYTCDGDC--GTLG--------KFKEPVFECSVACGCHPDCANRVVQKGLQFHLEVFRTPEKGWGVRTLDPIPAGEFV 202 (364)
T ss_pred ccccccCCcc--cccc--------ccCccccccccCCCCCCcCcchhhccccccceEEEecCCceeeecccccccCCCee
Confidence 3333311100 0000 11234899999999999999999999999999999999999999999999999999
Q ss_pred EEEeeEEeChhhhhhhccccCCcchhchhhHHHHHHhhhcccCCCCcccccccccccccCCCCCCCCCCCCCCCchhhhc
Q psy8081 229 CIYAGHLLTDSDANEEGKNYGDEYLAELDFIETVERYKEAYESDVPEEDMVEDDEAENENSDEESPNSNSNEDNSQDKAI 308 (463)
Q Consensus 229 c~Y~Gellt~~~a~~~~~~~gdeYl~~ld~ie~ve~~k~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 308 (463)
|+|+|||++.++++.+...+ +|+.+..-.... ...+|..+.. .... .
T Consensus 203 cEyaGe~~t~~e~~~~~~~~--~~~~~~~~~~~~--~~~~~~~~~~---------------------------~~~~--~ 249 (364)
T KOG1082|consen 203 CEYAGEVLTSEEAQRRTHLR--EYLDDDCDAYSI--ADREWVDESP---------------------------VGNT--F 249 (364)
T ss_pred EEEeeEecChHHhhhccccc--cccccccccchh--hhcccccccc---------------------------cccc--c
Confidence 99999999999887653221 122110000000 0000000000 0000 0
Q ss_pred ccCCcccccCCCCCchhHHHHHHHhhhhhhhccccccccchhhHHHHHhhhhhhhhhhcchhhhccCCCceEEEeCcccC
Q psy8081 309 LNSDDETENSSNADSDHIRSRLRKRKRKQKADKKEGKRKTSSLLMTLQANQKKKTKRLRSLREYFGEDENVYIMDARTSG 388 (463)
Q Consensus 309 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~IDA~~~G 388 (463)
| .. .+.......++|||+..|
T Consensus 250 ~--------------------------------------------------------~~---~~~~~~~~~~~ida~~~G 270 (364)
T KOG1082|consen 250 V--------------------------------------------------------AP---SLPGGPGRELLIDAKPHG 270 (364)
T ss_pred c--------------------------------------------------------cc---ccccCCCcceEEchhhcc
Confidence 0 00 011222467999999999
Q ss_pred CccccccCCCCCCeeEEEEEEcCCCCCcCEEEEEEccCCCCCCeEEEecCCCCC------C---CCCCCeeeeeCCCCCC
Q psy8081 389 NIGRYLNHSCTPNVFVQNVFVDTHDPRFPWVSFFALKFIEAGSELTWDYAYDIG------S---VPDKVVYCYCGSSECR 459 (463)
Q Consensus 389 NvgRFiNHSC~PNl~vq~Vfvdt~d~~fP~VafFA~r~I~aGeELTwDYgy~~~------s---~~~k~~~C~CGs~~Cr 459 (463)
|++|||||||.||+++|.|+.++.++.+|+|+|||+++|+||||||||||+... . .+.....|.||+.+||
T Consensus 271 Nv~RfinHSC~PN~~~~~v~~~~~~~~~~~i~ffa~~~I~p~~ELT~dYg~~~~~~~~~~~~~~~~~~~~~c~c~~~~cr 350 (364)
T KOG1082|consen 271 NVARFINHSCSPNLLYQAVFQDEFVLLYLRIGFFALRDISPGEELTLDYGKAYKLLVQDGANIYTPVMKKNCNCGLEKCR 350 (364)
T ss_pred cccccccCCCCccceeeeeeecCCccchheeeeeeccccCCCcccchhhcccccccccccccccccccchhhcCCCHHhC
Confidence 999999999999999999999999999999999999999999999999997632 1 2346789999999999
Q ss_pred ccc
Q psy8081 460 QRL 462 (463)
Q Consensus 460 g~l 462 (463)
++|
T Consensus 351 ~~~ 353 (364)
T KOG1082|consen 351 GLL 353 (364)
T ss_pred ccc
Confidence 987
No 3
>KOG4442|consensus
Probab=100.00 E-value=5.6e-41 Score=354.37 Aligned_cols=164 Identities=37% Similarity=0.665 Sum_probs=139.5
Q ss_pred eeecCC-CCC-CCCCCCCceeccCCcccEEEEEeCCcceEEEeCCCCCCCCeEEEEeeEEeChhhhhhhccccCCcchhc
Q psy8081 178 IFECND-LCK-CKHTCHNRVVQFPMLQKLQLFKTEMKGWGLRCLNDIPQGTFICIYAGHLLTDSDANEEGKNYGDEYLAE 255 (463)
Q Consensus 178 IyECn~-~C~-C~~~C~NRvvQ~g~~~rLqVFkT~~kGWGVR~l~dI~kGtFVc~Y~Gellt~~~a~~~~~~~gdeYl~~ 255 (463)
..||++ .|. |+..|+|+-.|+..-.+++||+|++|||||||..|||+|+||.||.||||+.++.++|...|..
T Consensus 94 ~iECs~~~C~~cg~~C~NQRFQkkqyA~vevF~Te~KG~GLRA~~dI~~g~FI~EY~GEVI~~~Ef~kR~~~Y~~----- 168 (729)
T KOG4442|consen 94 SIECSDRECPRCGVYCKNQRFQKKQYAKVEVFLTEKKGCGLRAEEDIPKGQFILEYIGEVIEEKEFEKRVKRYAK----- 168 (729)
T ss_pred hcccCCccCCCccccccchhhhhhccCceeEEEecCcccceeeccccCCCcEEeeeccccccHHHHHHHHHHHHh-----
Confidence 369999 999 9999999999999999999999999999999999999999999999999999988877543310
Q ss_pred hhhHHHHHHhhhcccCCCCcccccccccccccCCCCCCCCCCCCCCCchhhhcccCCcccccCCCCCchhHHHHHHHhhh
Q psy8081 256 LDFIETVERYKEAYESDVPEEDMVEDDEAENENSDEESPNSNSNEDNSQDKAILNSDDETENSSNADSDHIRSRLRKRKR 335 (463)
Q Consensus 256 ld~ie~ve~~k~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 335 (463)
+- +
T Consensus 169 ----------------d~----------------------------~--------------------------------- 171 (729)
T KOG4442|consen 169 ----------------DG----------------------------I--------------------------------- 171 (729)
T ss_pred ----------------cC----------------------------C---------------------------------
Confidence 00 0
Q ss_pred hhhhccccccccchhhHHHHHhhhhhhhhhhcchhhhccCCCceEEEeCcccCCccccccCCCCCCeeEEEEEEcCCCCC
Q psy8081 336 KQKADKKEGKRKTSSLLMTLQANQKKKTKRLRSLREYFGEDENVYIMDARTSGNIGRYLNHSCTPNVFVQNVFVDTHDPR 415 (463)
Q Consensus 336 ~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~IDA~~~GNvgRFiNHSC~PNl~vq~Vfvdt~d~~ 415 (463)
-..||....+.++|||++.||++|||||||+||+.+|.|-|.+.
T Consensus 172 ---------------------------------kh~Yfm~L~~~e~IDAT~KGnlaRFiNHSC~PNa~~~KWtV~~~--- 215 (729)
T KOG4442|consen 172 ---------------------------------KHYYFMALQGGEYIDATKKGNLARFINHSCDPNAEVQKWTVPDE--- 215 (729)
T ss_pred ---------------------------------ceEEEEEecCCceecccccCcHHHhhcCCCCCCceeeeeeeCCe---
Confidence 00123333567999999999999999999999999999999884
Q ss_pred cCEEEEEEccCCCCCCeEEEecCCCCCCCCCCCeeeeeCCCCCCccc
Q psy8081 416 FPWVSFFALKFIEAGSELTWDYAYDIGSVPDKVVYCYCGSSECRQRL 462 (463)
Q Consensus 416 fP~VafFA~r~I~aGeELTwDYgy~~~s~~~k~~~C~CGs~~Crg~l 462 (463)
-+|+|||.|.|++||||||||+++. .......|+||+.+|||||
T Consensus 216 -lRvGiFakk~I~~GEEITFDYqf~r--YGr~AQ~CyCgeanC~G~I 259 (729)
T KOG4442|consen 216 -LRVGIFAKKVIKPGEEITFDYQFDR--YGRDAQPCYCGEANCRGWI 259 (729)
T ss_pred -eEEEEeEecccCCCceeeEeccccc--ccccccccccCCccccccc
Confidence 3599999999999999999999863 2224568999999999997
No 4
>KOG1080|consensus
Probab=99.94 E-value=7.4e-28 Score=269.33 Aligned_cols=133 Identities=33% Similarity=0.721 Sum_probs=113.4
Q ss_pred EEEEEeCCcceEEEeCCCCCCCCeEEEEeeEEeChhhhhhhcccc-----CCcchhchhhHHHHHHhhhcccCCCCcccc
Q psy8081 204 LQLFKTEMKGWGLRCLNDIPQGTFICIYAGHLLTDSDANEEGKNY-----GDEYLAELDFIETVERYKEAYESDVPEEDM 278 (463)
Q Consensus 204 LqVFkT~~kGWGVR~l~dI~kGtFVc~Y~Gellt~~~a~~~~~~~-----gdeYl~~ld~ie~ve~~k~~~e~~~~~~~~ 278 (463)
|..-+....||||+|++.|.+|+||.||.||++...-|+.|+..| |+.|||.+
T Consensus 868 ~~F~~s~iH~wglfa~~~i~~~dmViEY~Ge~vR~~iad~RE~~Y~~~gi~~sYlfri---------------------- 925 (1005)
T KOG1080|consen 868 VKFGRSGIHGWGLFAMENIAAGDMVIEYRGELVRSSIADLREARYERMGIGDSYLFRI---------------------- 925 (1005)
T ss_pred hccccccccccceeeccCccccceEEEeeceehhhhHHHHHHHHHhccCcccceeeec----------------------
Confidence 444466789999999999999999999999999888777776443 44555433
Q ss_pred cccccccccCCCCCCCCCCCCCCCchhhhcccCCcccccCCCCCchhHHHHHHHhhhhhhhccccccccchhhHHHHHhh
Q psy8081 279 VEDDEAENENSDEESPNSNSNEDNSQDKAILNSDDETENSSNADSDHIRSRLRKRKRKQKADKKEGKRKTSSLLMTLQAN 358 (463)
Q Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~ 358 (463)
T Consensus 926 -------------------------------------------------------------------------------- 925 (1005)
T KOG1080|consen 926 -------------------------------------------------------------------------------- 925 (1005)
T ss_pred --------------------------------------------------------------------------------
Confidence
Q ss_pred hhhhhhhhcchhhhccCCCceEEEeCcccCCccccccCCCCCCeeEEEEEEcCCCCCcCEEEEEEccCCCCCCeEEEecC
Q psy8081 359 QKKKTKRLRSLREYFGEDENVYIMDARTSGNIGRYLNHSCTPNVFVQNVFVDTHDPRFPWVSFFALKFIEAGSELTWDYA 438 (463)
Q Consensus 359 ~~~~~~~~~~~r~~~~~~~~~y~IDA~~~GNvgRFiNHSC~PNl~vq~Vfvdt~d~~fP~VafFA~r~I~aGeELTwDYg 438 (463)
+..+||||++.||++|||||||.|||.+..+-|+++- +|.+||.|+|.+|||||+||.
T Consensus 926 ------------------d~~~ViDAtk~gniAr~InHsC~PNCyakvi~V~g~~----~IvIyakr~I~~~EElTYDYk 983 (1005)
T KOG1080|consen 926 ------------------DDEVVVDATKKGNIARFINHSCNPNCYAKVITVEGDK----RIVIYSKRDIAAGEELTYDYK 983 (1005)
T ss_pred ------------------ccceEEeccccCchhheeecccCCCceeeEEEecCee----EEEEEEecccccCceeeeecc
Confidence 2357999999999999999999999999999999975 599999999999999999999
Q ss_pred CCCCCCCCCCeeeeeCCCCCCcccC
Q psy8081 439 YDIGSVPDKVVYCYCGSSECRQRLL 463 (463)
Q Consensus 439 y~~~s~~~k~~~C~CGs~~Crg~ll 463 (463)
.... ...++|+|||++|||.|.
T Consensus 984 F~~e---~~kipClCgap~Crg~~n 1005 (1005)
T KOG1080|consen 984 FPTE---DDKIPCLCGAPNCRGFLN 1005 (1005)
T ss_pred cccc---ccccccccCCCccccccC
Confidence 8653 238999999999999873
No 5
>KOG1079|consensus
Probab=99.94 E-value=1.3e-27 Score=251.93 Aligned_cols=140 Identities=32% Similarity=0.639 Sum_probs=122.4
Q ss_pred eeecCC-CCCC----------CCCCCCceeccCCcccEEEEEeCCcceEEEeCCCCCCCCeEEEEeeEEeChhhhhhhcc
Q psy8081 178 IFECND-LCKC----------KHTCHNRVVQFPMLQKLQLFKTEMKGWGLRCLNDIPQGTFICIYAGHLLTDSDANEEGK 246 (463)
Q Consensus 178 IyECn~-~C~C----------~~~C~NRvvQ~g~~~rLqVFkT~~kGWGVR~l~dI~kGtFVc~Y~Gellt~~~a~~~~~ 246 (463)
..||.+ .|.+ .-.|.|--+|+|.+.|+-|-.+.--|||+|.++.+.|++||.+|+||||+.+||++||.
T Consensus 559 ~rECdPd~Cl~cg~~~~~d~~~~~C~N~~l~~~~qkr~llapSdVaGwGlFlKe~v~KnefisEY~GE~IS~dEADrRGk 638 (739)
T KOG1079|consen 559 VRECDPDVCLMCGNVDHFDSSKISCKNTNLQRGEQKRVLLAPSDVAGWGLFLKESVSKNEFISEYTGEIISHDEADRRGK 638 (739)
T ss_pred ccccCchHHhccCcccccccCccccccchhhhhhhcceeechhhccccceeeccccCCCceeeeecceeccchhhhhccc
Confidence 578874 6655 14899999999999999999999999999999999999999999999999999999998
Q ss_pred ccCC---cchhchhhHHHHHHhhhcccCCCCcccccccccccccCCCCCCCCCCCCCCCchhhhcccCCcccccCCCCCc
Q psy8081 247 NYGD---EYLAELDFIETVERYKEAYESDVPEEDMVEDDEAENENSDEESPNSNSNEDNSQDKAILNSDDETENSSNADS 323 (463)
Q Consensus 247 ~~gd---eYl~~ld~ie~ve~~k~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 323 (463)
.|.. +|||+|
T Consensus 639 iYDr~~cSflFnl------------------------------------------------------------------- 651 (739)
T KOG1079|consen 639 IYDRYMCSFLFNL------------------------------------------------------------------- 651 (739)
T ss_pred ccccccceeeeec-------------------------------------------------------------------
Confidence 7742 333332
Q ss_pred hhHHHHHHHhhhhhhhccccccccchhhHHHHHhhhhhhhhhhcchhhhccCCCceEEEeCcccCCccccccCCCCCCee
Q psy8081 324 DHIRSRLRKRKRKQKADKKEGKRKTSSLLMTLQANQKKKTKRLRSLREYFGEDENVYIMDARTSGNIGRYLNHSCTPNVF 403 (463)
Q Consensus 324 ~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~IDA~~~GNvgRFiNHSC~PNl~ 403 (463)
...|+|||++.||.+||.|||=.|||+
T Consensus 652 -----------------------------------------------------n~dyviDs~rkGnk~rFANHS~nPNCY 678 (739)
T KOG1079|consen 652 -----------------------------------------------------NNDYVIDSTRKGNKIRFANHSFNPNCY 678 (739)
T ss_pred -----------------------------------------------------cccceEeeeeecchhhhccCCCCCCcE
Confidence 235999999999999999999999999
Q ss_pred EEEEEEcCCCCCcCEEEEEEccCCCCCCeEEEecCCCC
Q psy8081 404 VQNVFVDTHDPRFPWVSFFALKFIEAGSELTWDYAYDI 441 (463)
Q Consensus 404 vq~Vfvdt~d~~fP~VafFA~r~I~aGeELTwDYgy~~ 441 (463)
+..++|.+.. +|.|||+|.|.|||||||||+|.-
T Consensus 679 Akvm~V~Gdh----RIGifAkRaIeagEELffDYrYs~ 712 (739)
T KOG1079|consen 679 AKVMMVAGDH----RIGIFAKRAIEAGEELFFDYRYSP 712 (739)
T ss_pred EEEEEecCCc----ceeeeehhhcccCceeeeeeccCc
Confidence 9888887743 499999999999999999999964
No 6
>smart00317 SET SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues
Probab=99.90 E-value=3.2e-23 Score=176.82 Aligned_cols=55 Identities=45% Similarity=0.708 Sum_probs=51.4
Q ss_pred eEEEeCcccCCccccccCCCCCCeeEEEEEEcCCCCCcCEEEEEEccCCCCCCeEEEec
Q psy8081 379 VYIMDARTSGNIGRYLNHSCTPNVFVQNVFVDTHDPRFPWVSFFALKFIEAGSELTWDY 437 (463)
Q Consensus 379 ~y~IDA~~~GNvgRFiNHSC~PNl~vq~Vfvdt~d~~fP~VafFA~r~I~aGeELTwDY 437 (463)
.+.|||+..||++|||||||.||+.++.++.++.. ++.|+|.|+|++|||||+||
T Consensus 62 ~~~id~~~~~~~~~~iNHsc~pN~~~~~~~~~~~~----~~~~~a~r~I~~GeEi~i~Y 116 (116)
T smart00317 62 DLCIDARRKGNIARFINHSCEPNCELLFVEVNGDS----RIVIFALRDIKPGEELTIDY 116 (116)
T ss_pred CEEEeCCccCcHHHeeCCCCCCCEEEEEEEECCCc----EEEEEECCCcCCCCEEeecC
Confidence 58999999999999999999999999999887654 69999999999999999999
No 7
>KOG1141|consensus
Probab=99.88 E-value=3.9e-23 Score=220.47 Aligned_cols=420 Identities=35% Similarity=0.606 Sum_probs=313.6
Q ss_pred eeEEEEecCcCCCCCCHHHHHHHHHhccccceeecccccccccCCcccccccccchhhcccccccccceeeecccCCCCC
Q psy8081 7 KKCIMYTAPCGRTLRTSDQLVLYLFITKAKWTIDMFEYDHFVSSKWTIDMFEYDHFVDCLREFVIENANITIKDMSNGRE 86 (463)
Q Consensus 7 ~~~v~y~~pCg~~lr~~~ev~~yl~~t~~~l~~~~~~~~~~~~~~~~~d~f~f~~~v~~~~~~~~~~~~~~~~DiS~G~E 86 (463)
++.++|.-|||.-|+.++++.-|..-|+++|++ |+|+|+..+...+.+.+....+-..|.+.|.+
T Consensus 821 ~g~fVciy~g~~l~~~~sdks~~~~~~~~~~~i---------------d~~~f~~~~dt~~~~tvD~~g~d~~d~~~g~s 885 (1262)
T KOG1141|consen 821 GGNFVCIYPGGALLHQISDKSEYIHVTRSLLTI---------------DCFSFDARIDTATYITVDDKGLDVADFSLGTS 885 (1262)
T ss_pred CceEEEEecchhhhhhhchhhhhcccchhhhcc---------------cccchhccccccceeeccccccchhhhhcccc
Confidence 558999999999999999999999999999999 99999999999999999999999999999999
Q ss_pred CCCeEEEcCCCCCCCCCceEcccccCCCC-ccc-cCCCCCccccCCCCCCCCCCCcccccccccCccccCCCCCCCCc--
Q psy8081 87 NVPISCVNYIDTDVPKTVDYMTERKPKEG-VTI-NTNKEFLVCCDCTDDCRDRNNCACWQLTIKGSRDLWNVSEPKDF-- 162 (463)
Q Consensus 87 ~vPI~~vN~iD~~~pp~f~Y~~~~~p~~g-v~l-~~~~~f~~gCdC~d~C~d~~~C~C~~~t~~g~~~~~~~~~~~~~-- 162 (463)
.+|||.+|.+|+..||..+|.+.+....+ +.+ ..+..|..||.|...|.|..+|.|.++.....+-+ |+......
T Consensus 886 g~~~p~~~~~d~~~~~~c~d~~~~~~~~~~~~~s~~~~~~~~~~s~d~hp~d~~~~~~~~~~~~~~~~c-pp~~s~d~~~ 964 (1262)
T KOG1141|consen 886 GIPIPLVNSVDNDEPPSCEDSKRRFQYNDQVDISSVSRDFCSGCSCDGHPSDASKCECQQLSIEAMKRC-PPNLSFDGHD 964 (1262)
T ss_pred CCCCccccccccCCCccccccceeecccccchhhhhccccccccccCCCCcccCcccCCCCChhhhcCC-CCccccCchh
Confidence 99999999999999999999988765433 333 35678999999999999999999999987665555 33321111
Q ss_pred cccc--cccCCcCCccceeecCCCCCCCCCCCCceeccCCccc--------EEEEEeCCcceEEEeCCCCCCCCeEEEEe
Q psy8081 163 VGYQ--NRRLPEHVVSGIFECNDLCKCKHTCHNRVVQFPMLQK--------LQLFKTEMKGWGLRCLNDIPQGTFICIYA 232 (463)
Q Consensus 163 ~gy~--~~rL~~~~~tgIyECn~~C~C~~~C~NRvvQ~g~~~r--------LqVFkT~~kGWGVR~l~dI~kGtFVc~Y~ 232 (463)
.-|. ++. ...+--+.+||+..|.|...|.||+||.|.+.+ ||||+|...|||+|...||+.-+|||+|.
T Consensus 965 ~~~eS~~~~-ns~~~~~f~e~~~hss~~~~e~~~~v~~~~~~~me~~s~~~l~i~~~~~~~~~~~edtD~~~~~~~~~~~ 1043 (1262)
T KOG1141|consen 965 ELYESSEKQ-NSFLKLFFFECNDHSSCHRKEYNRVVQNNIKYPMEVSSFNDLQIFKTAQSGWGVREDTDIPQSTFICTYV 1043 (1262)
T ss_pred hhhhhhhhc-chhhhccceeccccchhcccccchhhhcCCccceeeeecccccccccccccccccccccCCCCccccccc
Confidence 1111 110 011224678999999999889999999998765 67899999999999999999999999999
Q ss_pred eEEeChhhhhhhccccCCcchhchhhHHHHHHhhhc--ccCCCCcccccccccccccCCCCCCCCCCCCCCCchhhhccc
Q psy8081 233 GHLLTDSDANEEGKNYGDEYLAELDFIETVERYKEA--YESDVPEEDMVEDDEAENENSDEESPNSNSNEDNSQDKAILN 310 (463)
Q Consensus 233 Gellt~~~a~~~~~~~gdeYl~~ld~ie~ve~~k~~--~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (463)
|...++..|++-.....|.|..++|..+.++-.+.. .+.++. ..+.++.....+++++..+.+.+
T Consensus 1044 ~~ppt~~l~~~~r~aqad~~sn~~D~~~~~~l~es~~~~~T~~r--------~~t~~~~~~~~~d~dd~q~I~k~----- 1110 (1262)
T KOG1141|consen 1044 GAPPTDDLADELRNAQADQYSNDLDLKDTVELEESREDHETDFR--------GDTSDYDDEEGSDGDDGQDIMKM----- 1110 (1262)
T ss_pred CCCCchhhHHHHhhhhhccccCccchhhhhhhhhcccccccccC--------CCCCCCcccccccCccHHHHHHH-----
Confidence 999999998887777789999999998877544321 111110 01222222222222222222211
Q ss_pred CCcccccCCCCCchhHHHHHHHhhhhhhhcccc-----ccccchhhHHHHHhhhhhhhhhh-cchhhhccCCCceEEEeC
Q psy8081 311 SDDETENSSNADSDHIRSRLRKRKRKQKADKKE-----GKRKTSSLLMTLQANQKKKTKRL-RSLREYFGEDENVYIMDA 384 (463)
Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~-~~~r~~~~~~~~~y~IDA 384 (463)
.+.....++....++.+++++.+++.... +....|...|+.+. .+..... ..+..||+ .+.+|+|||
T Consensus 1111 ----ve~qd~~~~~~~T~~~~RQ~~~~s~k~~~~~s~~~~~~ts~~~~~~dk--ges~~~~~~~~~~y~~-~~~~yvIDA 1183 (1262)
T KOG1141|consen 1111 ----VERQDSSESGEETKRLTRQKRKQSKKSGKGGSVEKDDTTSRDSMEKDK--GESKDEPVFNWDKYFE-PFPLYVIDA 1183 (1262)
T ss_pred ----hhcccccccccccchhhhhhhhhhhhcccCccccccccCccchhhhcc--CccCcccccchhhccC-CCceEEEec
Confidence 01111222223333333333333332211 11123333444331 1111111 23334454 488999999
Q ss_pred cccCCccccccCCCCCCeeEEEEEEcCCCCCcCEEEEEEccCCCCCCeEEEecCCCCCCCCCCCeeeeeCCCCCCcccC
Q psy8081 385 RTSGNIGRYLNHSCTPNVFVQNVFVDTHDPRFPWVSFFALKFIEAGSELTWDYAYDIGSVPDKVVYCYCGSSECRQRLL 463 (463)
Q Consensus 385 ~~~GNvgRFiNHSC~PNl~vq~Vfvdt~d~~fP~VafFA~r~I~aGeELTwDYgy~~~s~~~k~~~C~CGs~~Crg~ll 463 (463)
+++||+||||||||+|||+||||||||||+|||||||||.|+|+|||||||||+|+.|++++|++.|+||+.+||||||
T Consensus 1184 k~eGNlGRfLNHSC~PNl~VQnVfvdTHdlrfPwVAFFt~kyVkAgtELTWDY~Ye~g~v~~keL~C~CGa~~CrgrLL 1262 (1262)
T KOG1141|consen 1184 KQEGNLGRFLNHSCDPNLHVQNVFVDTHDLRFPWVAFFTRKYVKAGTELTWDYQYEQGQVATKELTCHCGAENCRGRLL 1262 (1262)
T ss_pred ccccchhhhhccCCCccceeeeeeeeccccCCchhhhhhhhhhccCceeeeeccccccccccceEEEecChhhhhcccC
Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999999998
No 8
>smart00468 PreSET N-terminal to some SET domains. A Cys-rich putative Zn2+-binding domain that occurs N-terminal to some SET domains. Function is unknown. Unpublished.
Probab=99.85 E-value=8.1e-22 Score=166.96 Aligned_cols=95 Identities=37% Similarity=0.667 Sum_probs=82.5
Q ss_pred cccCCCCCCCCeEEEcCCCCCCC-CCceEcccccCCCCccccCCCCCccccCCCCCCCCCCCcccccccccCccccCCCC
Q psy8081 79 KDMSNGRENVPISCVNYIDTDVP-KTVDYMTERKPKEGVTINTNKEFLVCCDCTDDCRDRNNCACWQLTIKGSRDLWNVS 157 (463)
Q Consensus 79 ~DiS~G~E~vPI~~vN~iD~~~p-p~f~Y~~~~~p~~gv~l~~~~~f~~gCdC~d~C~d~~~C~C~~~t~~g~~~~~~~~ 157 (463)
.|||+|+|++||++||++|.+.| +.|+|++++++.+|+.+.....++.||+|+++|.+..+|+|++++. .
T Consensus 2 ~Dis~G~E~~pI~~vN~vD~~~~p~~F~Yi~~~~~~~gv~~~~~~~~~~gC~C~~~C~~~~~C~C~~~~~--~------- 72 (98)
T smart00468 2 LDISNGKENVPVPLVNEVDEDPPPPDFEYISEYIYGQGVPIDRSPSPLVGCSCSGDCSSSNKCECARKNG--G------- 72 (98)
T ss_pred ccccCCccCCCcceEecCCCCCCCCCcEECcceEcCCCcccccCCCCCCCCcCCCCCCCCCcCCcHhhcC--C-------
Confidence 69999999999999999999865 8999999999999998777889999999999999988899998762 2
Q ss_pred CCCCcccc-ccccCCcCCccceeecCCCCC
Q psy8081 158 EPKDFVGY-QNRRLPEHVVSGIFECNDLCK 186 (463)
Q Consensus 158 ~~~~~~gy-~~~rL~~~~~tgIyECn~~C~ 186 (463)
..+| .++|+.....++|||||++|+
T Consensus 73 ----~~~Y~~~~~~~~~~~~~IyECn~~C~ 98 (98)
T smart00468 73 ----EFAYELNGGLRLKRKPLIYECNSRCS 98 (98)
T ss_pred ----ccCcccCCCEEeCCCCEEEcCCCCCC
Confidence 2245 567777777899999999996
No 9
>PF05033 Pre-SET: Pre-SET motif; InterPro: IPR007728 This region is found in a number of histone lysine methyltransferases (HMTase), N-terminal to the SET domain; it is generally described as the pre-SET domain. Histone lysine methylation is part of the histone code that regulated chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception [], the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain [, ]. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities []. The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils and stabilising the SET domain. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site [] when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site []. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity []. ; GO: 0008270 zinc ion binding, 0018024 histone-lysine N-methyltransferase activity, 0034968 histone lysine methylation, 0005634 nucleus; PDB: 3K5K_A 2O8J_D 3RJW_B 1ML9_A 1PEG_B 1MVH_A 1MVX_A 3BO5_A 2RFI_B 3MO5_B ....
Probab=99.85 E-value=8.4e-22 Score=168.00 Aligned_cols=101 Identities=44% Similarity=0.817 Sum_probs=70.4
Q ss_pred ccCCCCCCCCeEEEcCCCCCCC-CCceEcccccCCCCccccCCCCCccccCCCCCCCCCCCcccccccccCccccCCCCC
Q psy8081 80 DMSNGRENVPISCVNYIDTDVP-KTVDYMTERKPKEGVTINTNKEFLVCCDCTDDCRDRNNCACWQLTIKGSRDLWNVSE 158 (463)
Q Consensus 80 DiS~G~E~vPI~~vN~iD~~~p-p~f~Y~~~~~p~~gv~l~~~~~f~~gCdC~d~C~d~~~C~C~~~t~~g~~~~~~~~~ 158 (463)
|||+|+|.+||+++|++|+++| +.|+|+.++++.+++. .....+..||+|.++|.+..+|+|++++....
T Consensus 1 Dis~g~e~~pI~~~N~vd~~~~p~~F~Yi~~~~~~~~~~-~~~~~~~~~C~C~~~C~~~~~C~C~~~~~~~~-------- 71 (103)
T PF05033_consen 1 DISRGKENVPIPVVNDVDDEPPPPNFEYIPENIYGEGVP-DIDPEFLQGCDCSGDCSNPSNCECLQRNGGIF-------- 71 (103)
T ss_dssp -TTCTSSSS-EEEEESSSS--SSTSSEE-SS-EESTTSS--TBGGGTS----SSSSTCTTTSHHHCCTSSS---------
T ss_pred CCCCCccCCCEEEEeCCCCCCCCCCeEEeeeEEcCCCcc-ccccccCccCccCCCCCCCCCCcCccccCccc--------
Confidence 8999999999999999999966 8999999999999987 67788999999999998899999998762212
Q ss_pred CCCccccc-cccCCcCCccceeecCCCCCCCCCCCCc
Q psy8081 159 PKDFVGYQ-NRRLPEHVVSGIFECNDLCKCKHTCHNR 194 (463)
Q Consensus 159 ~~~~~gy~-~~rL~~~~~tgIyECn~~C~C~~~C~NR 194 (463)
.|. .++|......+|||||+.|+|+..|.||
T Consensus 72 -----~Y~~~g~l~~~~~~~i~EC~~~C~C~~~C~NR 103 (103)
T PF05033_consen 72 -----AYDSNGRLRIPDKPPIFECNDNCGCSPSCRNR 103 (103)
T ss_dssp -----SB-TTSSBSSSSTSEEE---TTSSS-TTSTT-
T ss_pred -----cccCCCcCccCCCCeEEeCCCCCCCCCCCCCC
Confidence 232 2345445577899999999999999998
No 10
>KOG1083|consensus
Probab=99.81 E-value=4.2e-21 Score=209.77 Aligned_cols=132 Identities=31% Similarity=0.543 Sum_probs=105.2
Q ss_pred CCCCceec-cCCcccEEEEEeCCcceEEEeCCCCCCCCeEEEEeeEEeChhhhhhhccccCCcchhchhhHHHHHHhhhc
Q psy8081 190 TCHNRVVQ-FPMLQKLQLFKTEMKGWGLRCLNDIPQGTFICIYAGHLLTDSDANEEGKNYGDEYLAELDFIETVERYKEA 268 (463)
Q Consensus 190 ~C~NRvvQ-~g~~~rLqVFkT~~kGWGVR~l~dI~kGtFVc~Y~Gellt~~~a~~~~~~~gdeYl~~ld~ie~ve~~k~~ 268 (463)
.|+|+-+| |+.-.+|+||+.+.+||||++...|..|+|||||+|+|++.+++..+ +-.-|+.+
T Consensus 1165 ~c~nqrm~r~e~cp~L~v~~gp~~G~~v~tk~PikagtfI~EYvGeVit~ke~e~~---mmtl~~~d------------- 1228 (1306)
T KOG1083|consen 1165 SCSNQRMQRHEECPPLEVFRGPKKGWGVRTKEPIKAGTFIMEYVGEVITEKEFEPR---MMTLYHND------------- 1228 (1306)
T ss_pred hhhhHHhhhhccCCCcceeccCCCCccccccccccccchHHHHHHHHHHHHhhccc---ccccCCCC-------------
Confidence 37888887 56777899999999999999999999999999999999998765443 00011100
Q ss_pred ccCCCCcccccccccccccCCCCCCCCCCCCCCCchhhhcccCCcccccCCCCCchhHHHHHHHhhhhhhhccccccccc
Q psy8081 269 YESDVPEEDMVEDDEAENENSDEESPNSNSNEDNSQDKAILNSDDETENSSNADSDHIRSRLRKRKRKQKADKKEGKRKT 348 (463)
Q Consensus 269 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~ 348 (463)
++ +|-+
T Consensus 1229 ------------------------------~~-------~~cL------------------------------------- 1234 (1306)
T KOG1083|consen 1229 ------------------------------DD-------HYCL------------------------------------- 1234 (1306)
T ss_pred ------------------------------Cc-------cccc-------------------------------------
Confidence 00 0100
Q ss_pred hhhHHHHHhhhhhhhhhhcchhhhccCCCceEEEeCcccCCccccccCCCCCCeeEEEEEEcCCCCCcCEEEEEEccCCC
Q psy8081 349 SSLLMTLQANQKKKTKRLRSLREYFGEDENVYIMDARTSGNIGRYLNHSCTPNVFVQNVFVDTHDPRFPWVSFFALKFIE 428 (463)
Q Consensus 349 ~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~IDA~~~GNvgRFiNHSC~PNl~vq~Vfvdt~d~~fP~VafFA~r~I~ 428 (463)
......+||+.+.||..||+||||.|||..|.+-|.+. -+|.+||+|+|.
T Consensus 1235 --------------------------~I~p~l~id~~R~~n~~RfinhscKPNc~~qkwSVNG~----~Rv~L~A~rDi~ 1284 (1306)
T KOG1083|consen 1235 --------------------------VIDPGLFIDIPRMGNGARFINHSCKPNCEMQKWSVNGE----YRVGLFALRDLP 1284 (1306)
T ss_pred --------------------------ccCccccCChhhccccccccccccCCCCccccccccce----eeeeeeecCCCC
Confidence 01345789999999999999999999999999999885 469999999999
Q ss_pred CCCeEEEecCCCC
Q psy8081 429 AGSELTWDYAYDI 441 (463)
Q Consensus 429 aGeELTwDYgy~~ 441 (463)
+|||||+||+...
T Consensus 1285 kGEELtYDYN~ks 1297 (1306)
T KOG1083|consen 1285 KGEELTYDYNFKS 1297 (1306)
T ss_pred CCceEEEeccccc
Confidence 9999999998644
No 11
>KOG1085|consensus
Probab=99.69 E-value=4.7e-17 Score=157.79 Aligned_cols=128 Identities=28% Similarity=0.437 Sum_probs=102.2
Q ss_pred ccCCcccEEEEEeCCcceEEEeCCCCCCCCeEEEEeeEEeChhhhhhhccccCCcchhchhhHHHHHHhhhcccCCCCcc
Q psy8081 197 QFPMLQKLQLFKTEMKGWGLRCLNDIPQGTFICIYAGHLLTDSDANEEGKNYGDEYLAELDFIETVERYKEAYESDVPEE 276 (463)
Q Consensus 197 Q~g~~~rLqVFkT~~kGWGVR~l~dI~kGtFVc~Y~Gellt~~~a~~~~~~~gdeYl~~ld~ie~ve~~k~~~e~~~~~~ 276 (463)
-.|..-.|++---..||.||++...+.+|+||.+|.|.||.-.+|..++..|.. | +
T Consensus 251 l~g~~egl~~~~~dgKGRGv~a~~~F~rgdFVVEY~Gdliei~eAk~rE~~Ya~------D------------------e 306 (392)
T KOG1085|consen 251 LKGTNEGLLEVYKDGKGRGVRAKVNFERGDFVVEYRGDLIEISEAKVREEQYAN------D------------------E 306 (392)
T ss_pred HhccccceeEEeeccccceeEeecccccCceEEEEecceeeechHHHHHHHhcc------C------------------c
Confidence 345556777777778999999999999999999999999988877766433210 0 0
Q ss_pred cccccccccccCCCCCCCCCCCCCCCchhhhcccCCcccccCCCCCchhHHHHHHHhhhhhhhccccccccchhhHHHHH
Q psy8081 277 DMVEDDEAENENSDEESPNSNSNEDNSQDKAILNSDDETENSSNADSDHIRSRLRKRKRKQKADKKEGKRKTSSLLMTLQ 356 (463)
Q Consensus 277 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~ 356 (463)
+
T Consensus 307 ~------------------------------------------------------------------------------- 307 (392)
T KOG1085|consen 307 E------------------------------------------------------------------------------- 307 (392)
T ss_pred c-------------------------------------------------------------------------------
Confidence 0
Q ss_pred hhhhhhhhhhcchhhhccCCCceEEEeCcccC-CccccccCCCCCCeeEEEEEEcCCCCCcCEEEEEEccCCCCCCeEEE
Q psy8081 357 ANQKKKTKRLRSLREYFGEDENVYIMDARTSG-NIGRYLNHSCTPNVFVQNVFVDTHDPRFPWVSFFALKFIEAGSELTW 435 (463)
Q Consensus 357 ~~~~~~~~~~~~~r~~~~~~~~~y~IDA~~~G-NvgRFiNHSC~PNl~vq~Vfvdt~d~~fP~VafFA~r~I~aGeELTw 435 (463)
.-..+.||......|+|||+++- -+||.||||=.+||....|.+|+. ||+.+.|.|+|.+||||++
T Consensus 308 ---------~GcYMYyF~h~sk~yCiDAT~et~~lGRLINHS~~gNl~TKvv~Idg~----pHLiLvA~rdIa~GEELlY 374 (392)
T KOG1085|consen 308 ---------IGCYMYYFEHNSKKYCIDATKETPWLGRLINHSVRGNLKTKVVEIDGS----PHLILVARRDIAQGEELLY 374 (392)
T ss_pred ---------cceEEEeeeccCeeeeeecccccccchhhhcccccCcceeeEEEecCC----ceEEEEeccccccchhhhh
Confidence 01233456555668999999975 579999999999999999999984 8999999999999999999
Q ss_pred ecCCC
Q psy8081 436 DYAYD 440 (463)
Q Consensus 436 DYgy~ 440 (463)
|||..
T Consensus 375 DYGDR 379 (392)
T KOG1085|consen 375 DYGDR 379 (392)
T ss_pred hcccc
Confidence 99974
No 12
>COG2940 Proteins containing SET domain [General function prediction only]
Probab=99.63 E-value=1.8e-16 Score=169.49 Aligned_cols=166 Identities=27% Similarity=0.444 Sum_probs=126.7
Q ss_pred eeecCCCCCCCCCCCCceeccCCcccEEEEEeCCcceEEEeCCCCCCCCeEEEEeeEEeChhhhhhhccccCCcchhchh
Q psy8081 178 IFECNDLCKCKHTCHNRVVQFPMLQKLQLFKTEMKGWGLRCLNDIPQGTFICIYAGHLLTDSDANEEGKNYGDEYLAELD 257 (463)
Q Consensus 178 IyECn~~C~C~~~C~NRvvQ~g~~~rLqVFkT~~kGWGVR~l~dI~kGtFVc~Y~Gellt~~~a~~~~~~~gdeYl~~ld 257 (463)
+.+++..+.....+.|...+........+..++.+||||++++.|++|+||.+|.|+++...++..+...+.. +
T Consensus 308 ~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~fa~~~i~~~e~i~~~~~~~~~~~~~~~~~~~~~~-----~- 381 (480)
T COG2940 308 SDFSKSNVSKLKELLNSNGCKKRREPNVVQESEIKGYGVFALESIKKGEFIIEYHGEIIRRKEAREREENYDL-----L- 381 (480)
T ss_pred cccccccCccccchhhhcccccccchhhhhhhcccccceeehhhccchHHHHHhcCcccchHHHHhhhccccc-----c-
Confidence 3455555555567888888888888888999999999999999999999999999999888765543221100 0
Q ss_pred hHHHHHHhhhcccCCCCcccccccccccccCCCCCCCCCCCCCCCchhhhcccCCcccccCCCCCchhHHHHHHHhhhhh
Q psy8081 258 FIETVERYKEAYESDVPEEDMVEDDEAENENSDEESPNSNSNEDNSQDKAILNSDDETENSSNADSDHIRSRLRKRKRKQ 337 (463)
Q Consensus 258 ~ie~ve~~k~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 337 (463)
...+
T Consensus 382 -----------------------------------------~~~~----------------------------------- 385 (480)
T COG2940 382 -----------------------------------------GNEF----------------------------------- 385 (480)
T ss_pred -----------------------------------------cccc-----------------------------------
Confidence 0000
Q ss_pred hhccccccccchhhHHHHHhhhhhhhhhhcchhhhccCCCceEEEeCcccCCccccccCCCCCCeeEEEEEEcCCCCCcC
Q psy8081 338 KADKKEGKRKTSSLLMTLQANQKKKTKRLRSLREYFGEDENVYIMDARTSGNIGRYLNHSCTPNVFVQNVFVDTHDPRFP 417 (463)
Q Consensus 338 s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~y~IDA~~~GNvgRFiNHSC~PNl~vq~Vfvdt~d~~fP 417 (463)
.+ ..+. ...+++|+...||++||+||||.||+......+.+ .-
T Consensus 386 ------------------------------~~-~~~~--~~~~~~d~~~~g~~~r~~nHS~~pN~~~~~~~~~g----~~ 428 (480)
T COG2940 386 ------------------------------SF-GLLE--DKDKVRDSQKAGDVARFINHSCTPNCEASPIEVNG----IF 428 (480)
T ss_pred ------------------------------ch-hhcc--ccchhhhhhhcccccceeecCCCCCcceecccccc----cc
Confidence 00 0111 12678999999999999999999999998888875 22
Q ss_pred EEEEEEccCCCCCCeEEEecCCCCCCCC------CCCeeeeeCCCCCCccc
Q psy8081 418 WVSFFALKFIEAGSELTWDYAYDIGSVP------DKVVYCYCGSSECRQRL 462 (463)
Q Consensus 418 ~VafFA~r~I~aGeELTwDYgy~~~s~~------~k~~~C~CGs~~Crg~l 462 (463)
.++++|.|+|++|+|||+||+...+.-. .....|.||+..|++.|
T Consensus 429 ~~~~~~~rDI~~geEl~~dy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (480)
T COG2940 429 KISIYAIRDIKAGEELTYDYGPSLEDNRELKKLLEKRWGCACGEDRCSHTM 479 (480)
T ss_pred eeeecccccchhhhhhccccccccccchhhhhhhhhhhccccCCCccCCCC
Confidence 5999999999999999999998664422 24789999999999987
No 13
>cd01395 HMT_MBD Methyl-CpG binding domains (MBD) present in putative histone methyltransferases (HMT) such as CLLD8 and SETDB1 proteins; CLLD8 contains a MBD, a PreSET and a bifurcated SET domain, suggesting that CLLD8 might be associated with methylation-mediated transcriptional repression. SETDB1 and other proteins in this group have a similar domain architecture. SETDB1 is a novel KAP-1-associated histone H3, lysine 9-specific methyltransferase that contributes to HP1-mediated silencing of euchromatic genes by KRAB zinc-finger proteins.
Probab=99.48 E-value=1.9e-14 Score=110.36 Aligned_cols=39 Identities=41% Similarity=0.696 Sum_probs=36.7
Q ss_pred ceeEEEEecCcCCCCCCHHHHHHHHHhccccceeecccccccccCCcccccccc
Q psy8081 6 NKKCIMYTAPCGRTLRTSDQLVLYLFITKAKWTIDMFEYDHFVSSKWTIDMFEY 59 (463)
Q Consensus 6 ~~~~v~y~~pCg~~lr~~~ev~~yl~~t~~~l~~~~~~~~~~~~~~~~~d~f~f 59 (463)
.|+.|+|+|||||+||||+||++||++|.++|++ |+|||
T Consensus 22 ~k~~V~Y~aPCGr~Lr~~~EV~~YL~~t~~~L~~---------------d~FsF 60 (60)
T cd01395 22 VKKHVIYKAPCGRSLRNMSEVHRYLRETCSFLTV---------------DNFSF 60 (60)
T ss_pred cccceEEECCcchhhhcHHHHHHHHHhcccccee---------------ecccC
Confidence 5789999999999999999999999999668999 99998
No 14
>PF00856 SET: SET domain; InterPro: IPR001214 The SET domain appears generally as one part of a larger multidomain protein, and recently there were described three structures of very different proteins with distinct domain compositions: Neurospora crassa DIM-5, a member of the Su(var) family of HKMTs which methylate histone H3 on lysine 9,human SET7 (also called SET9), which methylates H3 on lysine 4 and garden pea Rubisco LSMT, an enzyme that does not modify histones, but instead methylates lysine 14 in the flexible tail of the large subunit of the enzyme Rubisco. The SET domain itself turned out to be an uncommon structure. Although in all three studies, electron density maps revealed the location of the AdoMet or AdoHcy cofactor, the SET domain bears no similarity at all to the canonical/AdoMet-dependent methyltransferase fold. Strictly conserved in the C-terminal motif of the SET domain tyrosine could be involved in abstracting a proton from the protonated amino group of the substrate lysine, promoting its nucleophilic attack on the sulphonium methyl group of the AdoMet cofactor. In contrast to the AdoMet-dependent protein methyltranferases of the classical type, which tend to bind their polypeptide substrates on top of the cofactor, it is noted from the Rubisco LSMT structure that the AdoMet seems to bind in a separate cleft, suggesting how a polypeptide substrate could be subjected to multiple rounds of methylation without having to be released from the enzyme. In contrast, SET7/9 is able to add only a single methyl group to its substrate. It has been demonstrated that association of SET domain and myotubularin-related proteins modulates growth control []. The SET domain-containing Drosophila melanogaster (Fruit fly) protein, enhancer of zeste, has a function in segment determination and the mammalian homologue may be involved in the regulation of gene transcription and chromatin structure. Histone lysine methylation is part of the histone code that regulated chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception [], the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain [, ]. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities []. The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site []. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity [], []. ; GO: 0005515 protein binding; PDB: 3TG5_A 3S7F_A 3RIB_B 3TG4_A 3S7J_A 3S7D_A 3S7B_A 3H6L_A 3SMT_A 3K5K_A ....
Probab=99.39 E-value=8.2e-13 Score=117.35 Aligned_cols=55 Identities=27% Similarity=0.254 Sum_probs=44.6
Q ss_pred EEEeCcccCCccccccCCCCCCeeEEEEEEcCCCCCcCEEEEEEccCCCCCCeEEEecC
Q psy8081 380 YIMDARTSGNIGRYLNHSCTPNVFVQNVFVDTHDPRFPWVSFFALKFIEAGSELTWDYA 438 (463)
Q Consensus 380 y~IDA~~~GNvgRFiNHSC~PNl~vq~Vfvdt~d~~fP~VafFA~r~I~aGeELTwDYg 438 (463)
...++.....++.||||||.||+.++..+.... ..+.|.|.|+|++|||||++||
T Consensus 108 ~~~~~~~l~p~~d~~NHsc~pn~~~~~~~~~~~----~~~~~~a~r~I~~GeEi~isYG 162 (162)
T PF00856_consen 108 DDRDGIALYPFADMLNHSCDPNCEVSFDFDGDG----GCLVVRATRDIKKGEEIFISYG 162 (162)
T ss_dssp EEEEEEEEETGGGGSEEESSTSEEEEEEEETTT----TEEEEEESS-B-TTSBEEEEST
T ss_pred ccccccccCcHhHheccccccccceeeEeeccc----ceEEEEECCccCCCCEEEEEEC
Confidence 345666778899999999999999887765333 3599999999999999999998
No 15
>smart00391 MBD Methyl-CpG binding domain. Methyl-CpG binding domain, also known as the TAM (TTF-IIP5, ARBP, MeCP1) domain
Probab=98.96 E-value=4.3e-10 Score=91.19 Aligned_cols=49 Identities=27% Similarity=0.569 Sum_probs=43.9
Q ss_pred ceeEEEEecCcCCCCCCHHHHHHHHHhccc-cceeecccccccccCCcccccccccchhhccccc
Q psy8081 6 NKKCIMYTAPCGRTLRTSDQLVLYLFITKA-KWTIDMFEYDHFVSSKWTIDMFEYDHFVDCLREF 69 (463)
Q Consensus 6 ~~~~v~y~~pCg~~lr~~~ev~~yl~~t~~-~l~~~~~~~~~~~~~~~~~d~f~f~~~v~~~~~~ 69 (463)
++.+|+|.+|||++||++.||++||.++.+ .+.+ |+|+|++.+.+...+
T Consensus 26 ~~~dV~Y~sP~GkklRs~~ev~~YL~~~~~~~~~~---------------~~F~F~~~~~~~~~~ 75 (77)
T smart00391 26 GKFDVYYISPCGKKLRSKSELARYLHKNGDLSLDL---------------ECFDFNATVPVGPKF 75 (77)
T ss_pred CcccEEEECCCCCeeeCHHHHHHHHHhCCCccccc---------------ccccCcCCccccccc
Confidence 578999999999999999999999999987 5777 999999999876543
No 16
>cd01397 HAT_MBD Methyl-CpG binding domains (MBD) present in putative chromatin remodelling factor such as BAZ2A; BAZ2A contains a MBD, DDT, PHD-type zinc finger and Bromo domain suggesting that BAZ2A might be associated with histone acetyltransferase (HAT) activity. The Drosophila melanogaster toutatis protein, a putative subunit of the chromatin-remodeling complex, and other such proteins in this group share a similar domain architecture with BAZ2A, as does the Caenorhabditis elegans flectin homolog.
Probab=98.84 E-value=1.9e-09 Score=85.94 Aligned_cols=47 Identities=21% Similarity=0.357 Sum_probs=41.0
Q ss_pred cceeEEEEecCcCCCCCCHHHHHHHHHh-ccccceeecccccccccCCcccccccccchhhcc
Q psy8081 5 NNKKCIMYTAPCGRTLRTSDQLVLYLFI-TKAKWTIDMFEYDHFVSSKWTIDMFEYDHFVDCL 66 (463)
Q Consensus 5 ~~~~~v~y~~pCg~~lr~~~ev~~yl~~-t~~~l~~~~~~~~~~~~~~~~~d~f~f~~~v~~~ 66 (463)
+.|..|.|.||||+.||++.||++||.. +.+.|++ |+|+|++.+-+-
T Consensus 22 ~~~~dV~Y~aPcGKklRs~~ev~~yL~~~~~~~Lt~---------------dnFsF~~~~~vg 69 (73)
T cd01397 22 RIQGEVAYYAPCGKKLRQYPEVIKYLSKNGISLLSR---------------ENFSFSARAPVG 69 (73)
T ss_pred CccceEEEECCCCcccccHHHHHHHHHhCCccCccH---------------hHccccCCcccc
Confidence 5667899999999999999999999996 5568888 999999887654
No 17
>KOG1081|consensus
Probab=98.81 E-value=1.8e-09 Score=114.85 Aligned_cols=75 Identities=41% Similarity=0.669 Sum_probs=64.6
Q ss_pred EEeCcccCCccccccCCCCCCeeEEEEEEcCCCCCcCEEEEEEccCCCCCCeEEEecCCCCCCCCCCCeeeeeCCCCCCc
Q psy8081 381 IMDARTSGNIGRYLNHSCTPNVFVQNVFVDTHDPRFPWVSFFALKFIEAGSELTWDYAYDIGSVPDKVVYCYCGSSECRQ 460 (463)
Q Consensus 381 ~IDA~~~GNvgRFiNHSC~PNl~vq~Vfvdt~d~~fP~VafFA~r~I~aGeELTwDYgy~~~s~~~k~~~C~CGs~~Crg 460 (463)
+|||...||..||+||||+||+.-+..-+-. -+.+.+||.+.|++|+|||++|+.. ..+....|.||+.+|.+
T Consensus 362 ~id~~~~~n~sr~~nh~~~~~v~~~k~~~~~----~t~~~~~a~~~i~~g~e~t~~~n~~---~~~~~~~~~~~~e~~~~ 434 (463)
T KOG1081|consen 362 IIDAGPKGNYSRFLNHSCQPNVETEKWQVIG----DTRVGLFAPRQIEAGEELTFNYNGN---CEGNEKRCCCGSENCTE 434 (463)
T ss_pred ccccccccchhhhhcccCCCceeechhheec----ccccccccccccccchhhhheeecc---ccCCcceEeeccccccc
Confidence 9999999999999999999999887654433 2359999999999999999999853 45677899999999988
Q ss_pred cc
Q psy8081 461 RL 462 (463)
Q Consensus 461 ~l 462 (463)
.+
T Consensus 435 ~~ 436 (463)
T KOG1081|consen 435 TK 436 (463)
T ss_pred CC
Confidence 65
No 18
>cd00122 MBD MeCP2, MBD1, MBD2, MBD3, MBD4, CLLD8-like, and BAZ2A-like proteins constitute a family of proteins that share the methyl-CpG-binding domain (MBD). The MBD consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin. MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1. The MBDs present in putative chromatin remodelling subunit, BAZ2A, and putative histone methyltransferase, CLLD8, represent two phylogenetically distinct groups within the MBD protein family.
Probab=98.67 E-value=1.5e-08 Score=78.90 Aligned_cols=40 Identities=28% Similarity=0.598 Sum_probs=36.0
Q ss_pred cceeEEEEecCcCCCCCCHHHHHHHHHhcc-ccceeecccccccccCCcccccccc
Q psy8081 5 NNKKCIMYTAPCGRTLRTSDQLVLYLFITK-AKWTIDMFEYDHFVSSKWTIDMFEY 59 (463)
Q Consensus 5 ~~~~~v~y~~pCg~~lr~~~ev~~yl~~t~-~~l~~~~~~~~~~~~~~~~~d~f~f 59 (463)
.+|.+|.|.+|||+.||++.||++||..+. +.|.+ |+|+|
T Consensus 22 ~~k~dv~Y~sP~Gk~~Rs~~ev~~yL~~~~~~~l~~---------------~~F~F 62 (62)
T cd00122 22 AGKGDVYYYSPCGKKLRSKPEVARYLEKTGPSSLDL---------------ENFSF 62 (62)
T ss_pred CCcceEEEECCCCceecCHHHHHHHHHhCCCCCCcH---------------HHCCC
Confidence 468899999999999999999999999994 47888 99988
No 19
>PF01429 MBD: Methyl-CpG binding domain; InterPro: IPR001739 Methylation at CpG dinucleotide, the most common DNA modification in eukaryotes, has been correlated with gene silencing associated with various phenomena such as genomic imprinting, transposon and chromosome X inactivation, differentiation, and cancer. Effects of DNA methylation are mediated through proteins which bind to symmetrically methylated CpGs. Such proteins contain a specific domain of ~70 residues, the methyl-CpG-binding domain (MBD), which is linked to additional domains associated with chromatin, such as the bromodomain, the AT hook motif,the SET domain, or the PHD finger. MBD-containing proteins appear to act as structural proteins, which recruit a variety of histone deacetylase (HDAC) complexes and chromatin remodelling factors, leading to chromatin compaction and, consequently, to transcriptional repression. The MBD of MeCP2, MBD1, MBD2, MBD4 and BAZ2 mediates binding to DNA, in case of MeCP2, MBD1 and MBD2 preferentially to methylated CpG. In case of human MBD3 and SETDB1 the MBD has been shown to mediate protein-protein interactions [, ]. The MBD folds into an alpha/beta sandwich structure comprising a layer of twisted beta sheet, backed by another layer formed by the alpha1 helix and a hairpin loop at the C terminus. These layers are both amphipathic, with the alpha1 helix and the beta sheet lying parallel and the hydrophobic faces tightly packed against each other. The beta sheet is composed of two long inner strands (beta2 and beta3) sandwiched by two shorter outer strands (beta1 and beta4) [].; GO: 0003677 DNA binding, 0005634 nucleus; PDB: 2KY8_A 1UB1_A 1D9N_A 1IG4_A 1QK9_A 3C2I_A.
Probab=98.53 E-value=5e-08 Score=79.18 Aligned_cols=46 Identities=26% Similarity=0.516 Sum_probs=38.6
Q ss_pred ceeEEEEecCcCCCCCCHHHHHHHHHhccc--cceeecccccccccCCcccccccccchhhcc
Q psy8081 6 NKKCIMYTAPCGRTLRTSDQLVLYLFITKA--KWTIDMFEYDHFVSSKWTIDMFEYDHFVDCL 66 (463)
Q Consensus 6 ~~~~v~y~~pCg~~lr~~~ev~~yl~~t~~--~l~~~~~~~~~~~~~~~~~d~f~f~~~v~~~ 66 (463)
++.+|.|.+|||+.+|++.||.+||..+.. .|.+ ++|+|++.+..+
T Consensus 29 ~~~dv~Y~sP~Gk~~RS~~eV~~yL~~~~~~~~l~~---------------~~F~F~~~~~~~ 76 (77)
T PF01429_consen 29 GKKDVYYYSPCGKRFRSKKEVVRYLKENPSEHDLKP---------------ENFSFSKRLIML 76 (77)
T ss_dssp TSEEEEEEETTSEEESSHHHHHHHHTTSS---SS-C---------------TTBBTTTTB---
T ss_pred CceEEEEECCCCCEEeCHHHHHHHHHhCCCcccCCH---------------hHCCCCCCcccC
Confidence 578999999999999999999999999986 7788 999999987654
No 20
>cd01396 MeCP2_MBD MeCP2, MBD1, MBD2, MBD3, and MBD4 are members of a protein family that share the methyl-CpG-binding domain (MBD). The MBD, consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin. MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1.
Probab=97.99 E-value=4.4e-06 Score=67.81 Aligned_cols=45 Identities=20% Similarity=0.352 Sum_probs=38.7
Q ss_pred cceeEEEEecCcCCCCCCHHHHHHHHHhc-cccceeecccccccccCCcccccccccchhh
Q psy8081 5 NNKKCIMYTAPCGRTLRTSDQLVLYLFIT-KAKWTIDMFEYDHFVSSKWTIDMFEYDHFVD 64 (463)
Q Consensus 5 ~~~~~v~y~~pCg~~lr~~~ev~~yl~~t-~~~l~~~~~~~~~~~~~~~~~d~f~f~~~v~ 64 (463)
.+|.+|.|.+|||+.+|++.||++||..+ ...|.+ ++|+|.+-..
T Consensus 23 ~~k~DvyY~sP~Gkk~RS~~ev~~yL~~~~~~~~~~---------------~~FdF~~~k~ 68 (77)
T cd01396 23 AGKFDVYYISPTGKKFRSKVELARYLEKNGPTSLDL---------------SDFDFTVPKK 68 (77)
T ss_pred CCcceEEEECCCCCEEECHHHHHHHHHhCCCCCCcH---------------hHcccCCCcc
Confidence 35779999999999999999999999987 556777 9999998643
No 21
>KOG2589|consensus
Probab=97.80 E-value=2e-05 Score=79.94 Aligned_cols=57 Identities=32% Similarity=0.510 Sum_probs=44.8
Q ss_pred CccccccCCCCCCeeEEEEEEcCC-CCCcCEEEEEEccCCCCCCeEEEecCCCCCCCCCCCeeeeeCC
Q psy8081 389 NIGRYLNHSCTPNVFVQNVFVDTH-DPRFPWVSFFALKFIEAGSELTWDYAYDIGSVPDKVVYCYCGS 455 (463)
Q Consensus 389 NvgRFiNHSC~PNl~vq~Vfvdt~-d~~fP~VafFA~r~I~aGeELTwDYgy~~~s~~~k~~~C~CGs 455 (463)
.-|+||||-|.|||. ||-+. | ...+-++|||++|||+|--||.++ ...+...|.|-+
T Consensus 195 GPaafINHDCrpnCk----Fvs~g~~----tacvkvlRDIePGeEITcFYgs~f--FG~~N~~CeC~T 252 (453)
T KOG2589|consen 195 GPAAFINHDCRPNCK----FVSTGRD----TACVKVLRDIEPGEEITCFYGSGF--FGENNEECECVT 252 (453)
T ss_pred ccHHhhcCCCCCCce----eecCCCc----eeeeehhhcCCCCceeEEeecccc--cCCCCceeEEee
Confidence 358999999999996 66654 3 377889999999999999999876 334556777743
No 22
>KOG2461|consensus
Probab=97.50 E-value=8.9e-05 Score=77.66 Aligned_cols=57 Identities=19% Similarity=0.280 Sum_probs=44.5
Q ss_pred ceEEEeCcc--cCCccccccCCCC---CCeeEEEEEEcCCCCCcCEEEEEEccCCCCCCeEEEecCCCCC
Q psy8081 378 NVYIMDART--SGNIGRYLNHSCT---PNVFVQNVFVDTHDPRFPWVSFFALKFIEAGSELTWDYAYDIG 442 (463)
Q Consensus 378 ~~y~IDA~~--~GNvgRFiNHSC~---PNl~vq~Vfvdt~d~~fP~VafFA~r~I~aGeELTwDYgy~~~ 442 (463)
.-++||++. ..|+-||+|=.++ -||++ |..+.+ |-|.|.|+|++||||-+.|+.+++
T Consensus 86 ~~~~iDg~d~~~sNWmRYV~~Ar~~eeQNL~A---~Q~~~~-----Ifyrt~r~I~p~eELlVWY~~e~~ 147 (396)
T KOG2461|consen 86 GYEYIDGTDEEHSNWMRYVNSARSEEEQNLLA---FQIGEN-----IFYRTIRDIRPNEELLVWYGSEYA 147 (396)
T ss_pred ceEEeccCChhhcceeeeecccCChhhhhHHH---HhccCc-----eEEEecccCCCCCeEEEEeccchH
Confidence 458899987 6899999997774 35542 233332 999999999999999999997653
No 23
>smart00508 PostSET Cysteine-rich motif following a subset of SET domains.
Probab=96.40 E-value=0.0016 Score=41.77 Aligned_cols=15 Identities=40% Similarity=0.926 Sum_probs=13.7
Q ss_pred CeeeeeCCCCCCccc
Q psy8081 448 VVYCYCGSSECRQRL 462 (463)
Q Consensus 448 ~~~C~CGs~~Crg~l 462 (463)
.+.|+|||.+|||+|
T Consensus 2 ~~~C~CGs~~CRG~l 16 (26)
T smart00508 2 KQPCLCGAPNCRGFL 16 (26)
T ss_pred CeeeeCCCcccccee
Confidence 468999999999997
No 24
>KOG2084|consensus
Probab=92.24 E-value=0.16 Score=53.54 Aligned_cols=55 Identities=35% Similarity=0.561 Sum_probs=40.8
Q ss_pred cccCCCCCCeeEEEEEEcCCCCCcCEEEEEEccCCCCCC-eEEEecCCCCCCCC--------CCCeeeeeCC
Q psy8081 393 YLNHSCTPNVFVQNVFVDTHDPRFPWVSFFALKFIEAGS-ELTWDYAYDIGSVP--------DKVVYCYCGS 455 (463)
Q Consensus 393 FiNHSC~PNl~vq~Vfvdt~d~~fP~VafFA~r~I~aGe-ELTwDYgy~~~s~~--------~k~~~C~CGs 455 (463)
++||||.||+. +..+.. .+++.+...+.+++ ||+..|-+..++.. .+.+.|.|+.
T Consensus 208 ~~~hsC~pn~~---~~~~~~-----~~~~~~~~~~~~~~~~l~~~y~~~~~~~~~r~~~l~~~~~f~c~c~r 271 (482)
T KOG2084|consen 208 LFNHSCFPNIS---VIFDGR-----GLALLVPAGIDAGEEELTISYTDPLLSTASRQKQLRQSKLFSCQCPR 271 (482)
T ss_pred hcccCCCCCeE---EEECCc-----eeEEEeecccCCCCCEEEEeecccccCHHHHHHHHhhccceeeecCC
Confidence 89999999997 344443 38889999999998 99999998765421 1246777754
No 25
>smart00570 AWS associated with SET domains. subdomain of PRESET
Probab=91.98 E-value=0.078 Score=39.66 Aligned_cols=22 Identities=32% Similarity=0.738 Sum_probs=20.2
Q ss_pred eeecCCCCCCCCCCCCceeccC
Q psy8081 178 IFECNDLCKCKHTCHNRVVQFP 199 (463)
Q Consensus 178 IyECn~~C~C~~~C~NRvvQ~g 199 (463)
.+||+..|+|+..|+||..|+.
T Consensus 28 ~~EC~~~C~~G~~C~NqrFqk~ 49 (51)
T smart00570 28 LIECSSDCPCGSYCSNQRFQKR 49 (51)
T ss_pred hhhcCCCCCCCcCccCcccccC
Confidence 4899999999999999999974
No 26
>KOG1337|consensus
Probab=77.95 E-value=1.8 Score=46.65 Aligned_cols=40 Identities=28% Similarity=0.266 Sum_probs=34.2
Q ss_pred cccCCCCCCeeEEEEEEcCCCCCcCEEEEEEccCCCCCCeEEEecCC
Q psy8081 393 YLNHSCTPNVFVQNVFVDTHDPRFPWVSFFALKFIEAGSELTWDYAY 439 (463)
Q Consensus 393 FiNHSC~PNl~vq~Vfvdt~d~~fP~VafFA~r~I~aGeELTwDYgy 439 (463)
+.||++.+ ...+..+.|. ++-+.+.++|.+|+|+.+.||-
T Consensus 239 ~~NH~~~~----~~~~~~~~d~---~~~l~~~~~v~~geevfi~YG~ 278 (472)
T KOG1337|consen 239 LLNHSPEV----IKAGYNQEDE---AVELVAERDVSAGEEVFINYGP 278 (472)
T ss_pred hhccCchh----ccccccCCCC---cEEEEEeeeecCCCeEEEecCC
Confidence 67999998 4456777765 6999999999999999999994
No 27
>KOG3813|consensus
Probab=64.60 E-value=3.2 Score=44.71 Aligned_cols=20 Identities=40% Similarity=0.933 Sum_probs=16.5
Q ss_pred ccccCCCCCCCCCCCcccccc
Q psy8081 125 LVCCDCTDDCRDRNNCACWQL 145 (463)
Q Consensus 125 ~~gCdC~d~C~d~~~C~C~~~ 145 (463)
-+||+|..-| |+..|+|-|.
T Consensus 307 eCGCsCr~~C-dPETCaCSqa 326 (640)
T KOG3813|consen 307 ECGCSCRGVC-DPETCACSQA 326 (640)
T ss_pred hhCCccccee-Chhhcchhcc
Confidence 5899999666 5899999873
No 28
>PF03638 TCR: Tesmin/TSO1-like CXC domain, cysteine-rich domain; InterPro: IPR005172 This entry includes proteins that have two copies of a cysteine rich motif as follows: C-X-C-X4-C-X3-YC-X-C-X6-C-X3-C-X-C-X2-C. The family includes Tesmin Q9Y4I5 from SWISSPROT [] and TSO1 Q9LE32 from SWISSPROT []. This group of proteins is called a CXC domain in [].
Probab=51.22 E-value=8.7 Score=27.62 Aligned_cols=37 Identities=32% Similarity=0.882 Sum_probs=27.0
Q ss_pred ccccCCCC-CCCCCCCcccccccccCccccCCCCCCCCccccccccCCcCCccceeecCCCCCCCCCCCCce
Q psy8081 125 LVCCDCTD-DCRDRNNCACWQLTIKGSRDLWNVSEPKDFVGYQNRRLPEHVVSGIFECNDLCKCKHTCHNRV 195 (463)
Q Consensus 125 ~~gCdC~d-~C~d~~~C~C~~~t~~g~~~~~~~~~~~~~~gy~~~rL~~~~~tgIyECn~~C~C~~~C~NRv 195 (463)
..||.|.. .|. ...|.|++.. ..|++.|+| ..|.|+.
T Consensus 3 ~~gC~Ckks~Cl-k~YC~Cf~~g--------------------------------~~C~~~C~C-~~C~N~~ 40 (42)
T PF03638_consen 3 KKGCNCKKSKCL-KLYCECFQAG--------------------------------RFCTPNCKC-QNCKNTE 40 (42)
T ss_pred CCCCcccCcChh-hhhCHHHHCc--------------------------------CcCCCCccc-CCCCCcC
Confidence 35899964 577 5679998631 469999999 5888864
No 29
>PF11403 Yeast_MT: Yeast metallothionein; InterPro: IPR022710 Metallothioneins are characterised by an abundance of cysteine residues and a lack of generic secondary structure motifs. This protein functions in primary metal storage, transport and detoxification []. For the first 40 residues in the protein the polypeptide wraps around the metal by forming two large parallel loops separated by a deep cleft containing the metal cluster []. ; PDB: 1AQS_A 1AQR_A 1RJU_V 1FMY_A 1AOO_A 1AQQ_A.
Probab=49.73 E-value=10 Score=25.82 Aligned_cols=19 Identities=26% Similarity=0.961 Sum_probs=11.8
Q ss_pred ccccCCCCCCCCCCCcccc
Q psy8081 125 LVCCDCTDDCRDRNNCACW 143 (463)
Q Consensus 125 ~~gCdC~d~C~d~~~C~C~ 143 (463)
..+|.|..||....+|+|.
T Consensus 21 qkscscptgcnsddkcpcg 39 (40)
T PF11403_consen 21 QKSCSCPTGCNSDDKCPCG 39 (40)
T ss_dssp TTS-SS-TTTTSSTT--TT
T ss_pred hhcCCCCCCCCCCCcCCCC
Confidence 4578999999888889884
No 30
>PF08666 SAF: SAF domain; InterPro: IPR013974 This entry includes a range of different proteins, such as antifreeze proteins, flagellar FlgA proteins, and CpaB pilus proteins. ; PDB: 1C89_A 3NLA_A 3RDN_A 1C8A_A 3FRN_A 1WVO_A 3K3S_H 3G8R_B 1XUU_A 1XUZ_A ....
Probab=32.03 E-value=27 Score=26.08 Aligned_cols=15 Identities=33% Similarity=0.357 Sum_probs=10.9
Q ss_pred EEEEccCCCCCCeEE
Q psy8081 420 SFFALKFIEAGSELT 434 (463)
Q Consensus 420 afFA~r~I~aGeELT 434 (463)
.+.|.|+|++|+.||
T Consensus 3 vvVA~~di~~G~~i~ 17 (63)
T PF08666_consen 3 VVVAARDIPAGTVIT 17 (63)
T ss_dssp EEEESSTB-TT-BEC
T ss_pred EEEEeCccCCCCEEc
Confidence 357999999999985
No 31
>KOG1171|consensus
Probab=27.38 E-value=23 Score=37.59 Aligned_cols=37 Identities=32% Similarity=0.981 Sum_probs=28.8
Q ss_pred CCccccCCCC-CCCCCCCcccccccccCccccCCCCCCCCccccccccCCcCCccceeecCCCCCCCCCCCC
Q psy8081 123 EFLVCCDCTD-DCRDRNNCACWQLTIKGSRDLWNVSEPKDFVGYQNRRLPEHVVSGIFECNDLCKCKHTCHN 193 (463)
Q Consensus 123 ~f~~gCdC~d-~C~d~~~C~C~~~t~~g~~~~~~~~~~~~~~gy~~~rL~~~~~tgIyECn~~C~C~~~C~N 193 (463)
.-..||+|.. +|. +..|.|+|.. .=|+..|+|- .|.|
T Consensus 215 ~hkkGC~CkkSgCl-KkYCECyQa~--------------------------------vlCS~nCkC~-~CkN 252 (406)
T KOG1171|consen 215 RHKKGCNCKKSGCL-KKYCECYQAG--------------------------------VLCSSNCKCQ-GCKN 252 (406)
T ss_pred hhcCCCCCccccch-HHHHHHHhcC--------------------------------CCccccccCc-CCcc
Confidence 3567999986 687 5689999842 3489999997 7888
No 32
>smart00317 SET SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues
Probab=22.23 E-value=1.4e+02 Score=24.15 Aligned_cols=18 Identities=28% Similarity=0.281 Sum_probs=14.9
Q ss_pred ceEEEeCCCCCCCCeEEE
Q psy8081 213 GWGLRCLNDIPQGTFICI 230 (463)
Q Consensus 213 GWGVR~l~dI~kGtFVc~ 230 (463)
-..++|+.||++|+=|++
T Consensus 97 ~~~~~a~r~I~~GeEi~i 114 (116)
T smart00317 97 RIVIFALRDIKPGEELTI 114 (116)
T ss_pred EEEEEECCCcCCCCEEee
Confidence 368899999999987654
Done!