Query psy14657
Match_columns 605
No_of_seqs 606 out of 2827
Neff 7.2
Searched_HMMs 46136
Date Fri Aug 16 17:19:24 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14657.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14657hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1141|consensus 100.0 2.1E-50 4.5E-55 435.4 8.1 478 30-602 634-1261(1262)
2 KOG1082|consensus 100.0 5.8E-42 1.2E-46 365.3 16.3 257 346-604 68-355 (364)
3 KOG4442|consensus 100.0 2.1E-39 4.5E-44 350.3 12.7 156 428-603 94-260 (729)
4 KOG1080|consensus 99.9 1.8E-27 3.9E-32 273.9 9.7 129 455-602 868-1004(1005)
5 KOG1079|consensus 99.9 6.6E-25 1.4E-29 236.7 9.9 137 427-574 558-712 (739)
6 smart00317 SET SET (Su(var)3-9 99.9 3.9E-21 8.5E-26 170.6 13.2 107 454-570 2-116 (116)
7 KOG1083|consensus 99.8 1.3E-20 2.9E-25 210.5 2.3 123 441-574 1166-1297(1306)
8 PF05033 Pre-SET: Pre-SET moti 99.8 9.7E-20 2.1E-24 160.1 5.7 97 84-181 6-103 (103)
9 smart00468 PreSET N-terminal t 99.8 2.5E-19 5.5E-24 156.1 7.2 88 85-173 9-98 (98)
10 KOG1082|consensus 99.8 7.3E-19 1.6E-23 187.9 8.9 137 85-232 70-209 (364)
11 PF05033 Pre-SET: Pre-SET moti 99.7 5.9E-17 1.3E-21 142.5 7.1 98 345-444 4-103 (103)
12 KOG1085|consensus 99.7 1.5E-16 3.2E-21 156.9 9.3 124 446-576 250-382 (392)
13 smart00468 PreSET N-terminal t 99.6 1.8E-15 3.9E-20 131.8 6.9 90 346-436 7-98 (98)
14 COG2940 Proteins containing SE 99.6 6.6E-16 1.4E-20 171.3 2.2 167 430-603 310-480 (480)
15 KOG1141|consensus 99.5 3E-15 6.6E-20 164.0 1.7 156 346-506 681-855 (1262)
16 PF00856 SET: SET domain; Int 99.4 3.7E-13 7.9E-18 125.1 9.1 106 464-571 1-162 (162)
17 PF00385 Chromo: Chromo (CHRro 99.3 1.6E-12 3.5E-17 100.8 4.9 51 224-275 1-55 (55)
18 cd00024 CHROMO Chromatin organ 99.2 3.6E-11 7.8E-16 93.0 4.6 52 222-274 1-54 (55)
19 KOG1911|consensus 99.0 2.2E-10 4.8E-15 117.9 4.0 67 213-280 38-104 (270)
20 KOG2748|consensus 99.0 1.2E-10 2.5E-15 118.6 1.0 57 218-276 5-61 (369)
21 smart00298 CHROMO Chromatin or 98.8 1.9E-09 4.1E-14 83.2 3.2 54 223-276 1-54 (55)
22 KOG2589|consensus 98.7 7.6E-09 1.6E-13 106.1 4.4 103 462-575 136-241 (453)
23 KOG1081|consensus 98.7 7.3E-09 1.6E-13 113.6 2.4 140 429-603 290-437 (463)
24 KOG2461|consensus 98.2 1.3E-06 2.8E-11 93.9 5.6 116 450-575 26-147 (396)
25 KOG4442|consensus 96.6 0.0017 3.7E-08 72.7 4.0 57 166-232 95-153 (729)
26 smart00508 PostSET Cysteine-ri 96.4 0.0016 3.4E-08 42.1 1.3 16 588-603 2-17 (26)
27 KOG0384|consensus 95.4 0.0084 1.8E-07 71.2 2.4 51 224-276 286-337 (1373)
28 cd00034 ChSh Chromo Shadow Dom 94.0 0.04 8.6E-07 42.5 2.3 48 226-277 2-51 (54)
29 PF01393 Chromo_shadow: Chromo 93.5 0.067 1.5E-06 41.9 2.8 50 225-277 4-54 (58)
30 KOG2084|consensus 93.0 0.12 2.6E-06 57.0 5.1 62 528-595 208-271 (482)
31 smart00300 ChSh Chromo Shadow 93.0 0.048 1E-06 43.1 1.4 49 225-277 8-57 (61)
32 smart00570 AWS associated with 92.8 0.062 1.3E-06 40.8 1.6 24 427-450 27-50 (51)
33 PF11717 Tudor-knot: RNA bindi 91.9 0.29 6.3E-06 37.8 4.5 41 221-262 13-53 (55)
34 PF14061 Mtf2_C: Polycomb-like 89.7 0.33 7.2E-06 36.2 2.7 22 229-250 28-49 (50)
35 KOG0384|consensus 88.2 0.2 4.4E-06 60.0 1.2 39 238-276 224-266 (1373)
36 smart00570 AWS associated with 87.8 0.27 5.8E-06 37.4 1.2 23 164-186 27-49 (51)
37 KOG1337|consensus 73.2 2.7 6E-05 46.9 3.1 42 528-574 239-280 (472)
38 KOG1079|consensus 63.4 4.3 9.4E-05 46.2 2.1 39 164-203 558-607 (739)
39 KOG1338|consensus 44.1 15 0.00033 39.5 2.2 40 525-572 218-260 (466)
40 cd01395 HMT_MBD Methyl-CpG bin 43.4 7.2 0.00016 30.8 -0.2 32 20-57 27-60 (60)
41 PF08666 SAF: SAF domain; Int 43.1 15 0.00033 28.3 1.7 16 553-568 3-18 (63)
42 KOG2155|consensus 31.7 24 0.00053 38.5 1.4 56 513-574 197-253 (631)
43 KOG0383|consensus 29.6 21 0.00045 41.7 0.5 55 223-277 208-263 (696)
44 PF00856 SET: SET domain; Int 25.0 38 0.00083 30.3 1.4 16 553-568 3-18 (162)
No 1
>KOG1141|consensus
Probab=100.00 E-value=2.1e-50 Score=435.42 Aligned_cols=478 Identities=25% Similarity=0.426 Sum_probs=320.7
Q ss_pred hhHHHHHHhhhcccCCCcchhHHHHHHHHHHHhhhhHHHHHHHHHHHHHH-HhhcCCCCccEEEeCCCCCCCCCCcEEcc
Q psy14657 30 HRTVSRFLFNRKLLSSAKDFAYAHMLLLTHFLHGKRKQQLANIREAEERY-NAACETAARLTLENNFDLESPPMDFTYIP 108 (605)
Q Consensus 30 ~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~l~~~~~~~-n~~~~~~~~I~~~N~vD~~~~p~~F~Yi~ 108 (605)
-.++++||++ ++|+|++..+|+++.|++..|..+..+--.-.-+| +. ....||+.+|++|+..|| .|.|..
T Consensus 634 ~~el~ryL~e-----t~c~flf~~~f~~~~yV~~~r~~~p~kp~~~~~Di~~g--~e~vpis~~neids~~lp-q~ay~K 705 (1262)
T KOG1141|consen 634 RIELYRYLVE-----TRCKFLFVIGFDRAFYVVRHRAPNPLKPGNRCTDIPCG--REHVPISEKNEIDSHRLP-QAAYKK 705 (1262)
T ss_pred HHHHHHHHHH-----hcCcEEEEeecccchheeecccCCCcCCcceeccccCC--ccccccceeecccCcCCc-cchhhe
Confidence 3688999999 99999999999999999887776544222222233 22 245599999999988877 899999
Q ss_pred CCCCCCCccccC--CCcccccCCCCCcCCCcccccccc--------CCCccc--cccccceeeccCcccccCCCCcCCC-
Q psy14657 109 SSVPRDGVVVTD--DPVIWCECRGNCVSNRDACCSDLN--------DADFAY--SRRTKRLKLEKGTPIYECNKKCACD- 175 (605)
Q Consensus 109 ~~~~~~~~~~~~--~~~~gC~C~~~C~~~~~~cC~~~~--------~~~~~Y--~~~~~~l~~~~~~~IyECn~~C~C~- 175 (605)
+.|++.+..+.. +++++|+|..+|.+..+|+|.++. ++..+- +++++||.....+++|||+.+|+|.
T Consensus 706 ~~ip~~~nl~n~~~~fl~scdc~~gcid~~kcachQltvk~~~t~p~~~v~~t~gykyKRl~e~~ptg~yEc~k~ckc~~ 785 (1262)
T KOG1141|consen 706 HMIPTNNNLSNRRKDFLQSCDCPTGCIDSMKCACHQLTVKKKTTGPNQNVASTNGYKYKRLIEIRPTGPYECLKACKCCG 785 (1262)
T ss_pred eeccCCCcccccChhhhhcCCCCcchhhhhhhhHHHHHHHhhccCCCcccccCcchhhHHHHHhcCCCHHHHHHhhccCc
Confidence 999999988643 678999999999998899997663 111111 2235788888889999999999998
Q ss_pred cCCccceeeeCceeeeeEEEeeccCCceeEEEeecccccccccCCCceeEEeeeccceeec-CCcEEEEEeccCCCCCCC
Q psy14657 176 ETCLNRVVQKGNLVQDCHTTITSRDGSVVLLRVADINLTKFSSDTMTEYEVESVLDSLELT-SDMTVYLVKWKNYDPEYN 254 (605)
Q Consensus 176 ~~C~NRVvQ~G~~~~L~vf~~~~~kG~~~~~~~~~I~i~~~~~~~~~~y~vE~Ild~~~~~-~~~~~YlVkW~gy~~~~~ 254 (605)
+.|.||+||||.+|+||+|+ +.-+||| ++-...+..+.|++=|--...... ..+ .||-.-.
T Consensus 786 ~~C~nrmvqhg~qvRlq~fk-t~~kGWg---------~rclddi~~g~fVciy~g~~l~~~~sdk-------s~~~~~~- 847 (1262)
T KOG1141|consen 786 PDCLNRMVQHGYQVRLQRFK-TIHKGWG---------RRCLDDITGGNFVCIYPGGALLHQISDK-------SEYIHVT- 847 (1262)
T ss_pred HHHHHHHhhcCceeEeeecc-ccccccc---------eEeeeecCCceEEEEecchhhhhhhchh-------hhhcccc-
Confidence 56999999999999999999 7778999 555556666666665543221000 000 0110000
Q ss_pred cccccccccchHHHHHHHHHhCCCCchhhHHHHhhccCCChhHHHHHHHHHHhcccchhhhhHHHHHHHHhhhhHHHHhh
Q psy14657 255 TWEPIENLGNCAKKLAEFLKAGPDQERTDFEKMKSFLSQHTEEEVESVLAKLRNKKDFAYAHMLLLTHFLHGKRKQQLAN 334 (605)
Q Consensus 255 TWEp~~nl~~~~~li~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~l~~kR~~ql~~ 334 (605)
+...+| +| ..|.+. +....+.... .....
T Consensus 848 --~~~~~i-d~----~~f~~~------------------------------~dt~~~~tvD--------------~~g~d 876 (1262)
T KOG1141|consen 848 --RSLLTI-DC----FSFDAR------------------------------IDTATYITVD--------------DKGLD 876 (1262)
T ss_pred --hhhhcc-cc----cchhcc------------------------------ccccceeecc--------------ccccc
Confidence 000111 11 011111 1111110000 00001
Q ss_pred HHHHHHHHHhccCCCcceEEEcCCCCCCCCC------CcEEeeecccCCCccccCCCCcccccCCCCcCCCccc-ccccC
Q psy14657 335 IREAEERYNAACETAARLTLENNFDLESPPM------DFTYIPSSVPRDGVVVTDDPVIWCECRGNCVSNRDAC-CSDLN 407 (605)
Q Consensus 335 l~~we~~~n~~~~~~~pi~~~N~vd~~~~p~------~F~yi~~~~~~~~~~~~~~~~~gC~C~~~C~~~~~~C-c~~~~ 407 (605)
++.++ .+....|||.+|.+|.+.||. .|.|..+.-.. + .......||+|.+.|.+... | |....
T Consensus 877 ~~d~~-----~g~sg~~~p~~~~~d~~~~~~c~d~~~~~~~~~~~~~s-~--~~~~~~~~~s~d~hp~d~~~-~~~~~~~ 947 (1262)
T KOG1141|consen 877 VADFS-----LGTSGIPIPLVNSVDNDEPPSCEDSKRRFQYNDQVDIS-S--VSRDFCSGCSCDGHPSDASK-CECQQLS 947 (1262)
T ss_pred hhhhh-----ccccCCCCccccccccCCCccccccceeecccccchhh-h--hccccccccccCCCCcccCc-ccCCCCC
Confidence 11111 134568999999999888875 34443321111 1 23455679999988987766 5 43322
Q ss_pred C-------CCccee-------cccCcccccCCCceEecCCCCCCCCCCcCcccccCCCcCEE--------EEEeCCCCce
Q psy14657 408 D-------ADFAYS-------RRTKRLKLEKGTPIYECNKKCACDETCLNRVVQKGITLPLT--------IFKTKNNRGW 465 (605)
Q Consensus 408 ~-------~~~~y~-------~~~~~l~~~~~~~i~EC~~~C~C~~~C~Nrv~q~g~~~~l~--------i~~t~~~kG~ 465 (605)
. ..+..+ ...... .......+||+..|.|...|.|+++|.+.+.+++ |+++ ...||
T Consensus 948 ~~~~~~cpp~~s~d~~~~~~eS~~~~n-s~~~~~f~e~~~hss~~~~e~~~~v~~~~~~~me~~s~~~l~i~~~-~~~~~ 1025 (1262)
T KOG1141|consen 948 IEAMKRCPPNLSFDGHDELYESSEKQN-SFLKLFFFECNDHSSCHRKEYNRVVQNNIKYPMEVSSFNDLQIFKT-AQSGW 1025 (1262)
T ss_pred hhhhcCCCCccccCchhhhhhhhhhcc-hhhhccceeccccchhcccccchhhhcCCccceeeeeccccccccc-ccccc
Confidence 1 011111 000000 0011246899999999999999999998877665 5555 56799
Q ss_pred eEEcCCccCCCcEEEEeeeeeecHHHHHhh---cCCceeeeccCCC----------------------------------
Q psy14657 466 GVRTPDKIKAGTFVCEYVGEILTHENALQR---TNQTYCFNLDFNQ---------------------------------- 508 (605)
Q Consensus 466 Gv~A~~~I~~G~~I~ey~Gevi~~~e~~~r---~~~~y~f~l~~~~---------------------------------- 508 (605)
|+++..||+.-+||++|+|...+..-+.+. ..+.|.-++|...
T Consensus 1026 ~~~edtD~~~~~~~~~~~~~ppt~~l~~~~r~aqad~~sn~~D~~~~~~l~es~~~~~T~~r~~t~~~~~~~~~d~dd~q 1105 (1262)
T KOG1141|consen 1026 GVREDTDIPQSTFICTYVGAPPTDDLADELRNAQADQYSNDLDLKDTVELEESREDHETDFRGDTSDYDDEEGSDGDDGQ 1105 (1262)
T ss_pred cccccccCCCCcccccccCCCCchhhHHHHhhhhhccccCccchhhhhhhhhcccccccccCCCCCCCcccccccCccHH
Confidence 999999999999999999988766544432 0111111111000
Q ss_pred ----------------C---------------------------------------------------CCCceEEEeeee
Q psy14657 509 ----------------D---------------------------------------------------SNSVAFVLDAAR 521 (605)
Q Consensus 509 ----------------~---------------------------------------------------~~~~~~~IDa~~ 521 (605)
+ ....-|+|||+.
T Consensus 1106 ~I~k~ve~qd~~~~~~~T~~~~RQ~~~~s~k~~~~~s~~~~~~ts~~~~~~dkges~~~~~~~~~~y~~~~~~yvIDAk~ 1185 (1262)
T KOG1141|consen 1106 DIMKMVERQDSSESGEETKRLTRQKRKQSKKSGKGGSVEKDDTTSRDSMEKDKGESKDEPVFNWDKYFEPFPLYVIDAKQ 1185 (1262)
T ss_pred HHHHHhhcccccccccccchhhhhhhhhhhhcccCccccccccCccchhhhccCccCcccccchhhccCCCceEEEeccc
Confidence 0 012578999999
Q ss_pred ccCcccccccCCCCCeeEE--EEecCCCCeeEEEEEEcCCCCCCCeEEEecCCCcccccchhhhcCCCCeeeecCCCCCc
Q psy14657 522 YGNVSHFINHSCDPNLEVS--RINNLNPDLHHVALFAKRDINKNEELSFCYLDLTKAKFTSSKRKKLVRNECRCGSSNCL 599 (605)
Q Consensus 522 ~GN~~RfiNHSC~PN~~~~--~~~~~d~~~~~i~~fA~rdI~~GEELT~dY~~~~~~~~~~~~~~~~~~~~C~Cgs~~Cr 599 (605)
.||++||+||||+||+.++ +++.+|..+|.++|||.+-|+||+||||||++..+.. ......|+||+.+||
T Consensus 1186 eGNlGRfLNHSC~PNl~VQnVfvdTHdlrfPwVAFFt~kyVkAgtELTWDY~Ye~g~v-------~~keL~C~CGa~~Cr 1258 (1262)
T KOG1141|consen 1186 EGNLGRFLNHSCDPNLHVQNVFVDTHDLRFPWVAFFTRKYVKAGTELTWDYQYEQGQV-------ATKELTCHCGAENCR 1258 (1262)
T ss_pred ccchhhhhccCCCccceeeeeeeeccccCCchhhhhhhhhhccCceeeeecccccccc-------ccceEEEecChhhhh
Confidence 9999999999999999999 8999999999999999999999999999999976521 224578999999999
Q ss_pred ccc
Q psy14657 600 GYY 602 (605)
Q Consensus 600 g~l 602 (605)
|.|
T Consensus 1259 grL 1261 (1262)
T KOG1141|consen 1259 GRL 1261 (1262)
T ss_pred ccc
Confidence 987
No 2
>KOG1082|consensus
Probab=100.00 E-value=5.8e-42 Score=365.31 Aligned_cols=257 Identities=33% Similarity=0.589 Sum_probs=206.5
Q ss_pred cCCCcceEEEcCCCCCCCCCCcEEeeecccCCCccccCCCCcccccCCCCcCCCcc--cccccCCCCcceecccCc-ccc
Q psy14657 346 CETAARLTLENNFDLESPPMDFTYIPSSVPRDGVVVTDDPVIWCECRGNCVSNRDA--CCSDLNDADFAYSRRTKR-LKL 422 (605)
Q Consensus 346 ~~~~~pi~~~N~vd~~~~p~~F~yi~~~~~~~~~~~~~~~~~gC~C~~~C~~~~~~--Cc~~~~~~~~~y~~~~~~-l~~ 422 (605)
+.+..||+++|+||.+.+ ..|.|++..++..+.........+|.|...|...... +|...++..++|...... ...
T Consensus 68 ~~e~~~v~~~n~id~~~~-~~f~y~~~~~~~~~~~~~~~~~~~c~C~~~~~~~~~~~C~C~~~n~~~~~~~~~~~~~~~~ 146 (364)
T KOG1082|consen 68 GSENLPVPLVNRIDEDAP-LYFQYIATEIVDPGELSDCENSTGCRCCSSCSSVLPLTCLCERHNGGLVAYTCDGDCGTLG 146 (364)
T ss_pred ccccCceeeeeeccCCcc-ccceeccccccCccccccCccccCCCccCCCCCCCCccccChHhhCCccccccCCcccccc
Confidence 344569999999998877 8999999999887644445678899999888775431 377778888888654211 456
Q ss_pred cCCCceEecCCCCCCCCCCcCcccccCCCcCEEEEEeCCCCceeEEcCCccCCCcEEEEeeeeeecHHHHHhh-cCCcee
Q psy14657 423 EKGTPIYECNKKCACDETCLNRVVQKGITLPLTIFKTKNNRGWGVRTPDKIKAGTFVCEYVGEILTHENALQR-TNQTYC 501 (605)
Q Consensus 423 ~~~~~i~EC~~~C~C~~~C~Nrv~q~g~~~~l~i~~t~~~kG~Gv~A~~~I~~G~~I~ey~Gevi~~~e~~~r-~~~~y~ 501 (605)
..+.++|||+..|+|+++|.|||+|+|.+.+|+||+|.+ +||||||++.|++|+||+||+||+++.++++.+ ....|.
T Consensus 147 ~~~~~i~EC~~~C~C~~~C~nRv~q~g~~~~leIfrt~~-kGwgvRs~~~I~~G~fvcEyaGe~~t~~e~~~~~~~~~~~ 225 (364)
T KOG1082|consen 147 KFKEPVFECSVACGCHPDCANRVVQKGLQFHLEVFRTPE-KGWGVRTLDPIPAGEFVCEYAGEVLTSEEAQRRTHLREYL 225 (364)
T ss_pred ccCccccccccCCCCCCcCcchhhccccccceEEEecCC-ceeeecccccccCCCeeEEEeeEecChHHhhhcccccccc
Confidence 677899999999999999999999999999999999954 999999999999999999999999999999977 222333
Q ss_pred e---------eccCCC--------------CCCCceEEEeeeeccCcccccccCCCCCeeEE--EEecCCCCeeEEEEEE
Q psy14657 502 F---------NLDFNQ--------------DSNSVAFVLDAARYGNVSHFINHSCDPNLEVS--RINNLNPDLHHVALFA 556 (605)
Q Consensus 502 f---------~l~~~~--------------~~~~~~~~IDa~~~GN~~RfiNHSC~PN~~~~--~~~~~d~~~~~i~~fA 556 (605)
+ ..++.. ......+.|||...||++|||||||.||+.++ +.++.++..++++|||
T Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ida~~~GNv~RfinHSC~PN~~~~~v~~~~~~~~~~~i~ffa 305 (364)
T KOG1082|consen 226 DDDCDAYSIADREWVDESPVGNTFVAPSLPGGPGRELLIDAKPHGNVARFINHSCSPNLLYQAVFQDEFVLLYLRIGFFA 305 (364)
T ss_pred ccccccchhhhccccccccccccccccccccCCCcceEEchhhcccccccccCCCCccceeeeeeecCCccchheeeeee
Confidence 3 222221 12357899999999999999999999999999 6667788899999999
Q ss_pred cCCCCCCCeEEEecCCCcccccchhh--hcCCCCeeeecCCCCCcccccC
Q psy14657 557 KRDINKNEELSFCYLDLTKAKFTSSK--RKKLVRNECRCGSSNCLGYYYL 604 (605)
Q Consensus 557 ~rdI~~GEELT~dY~~~~~~~~~~~~--~~~~~~~~C~Cgs~~Crg~l~~ 604 (605)
+++|.||||||+||+..+........ ........|.||+.+||+.++.
T Consensus 306 ~~~I~p~~ELT~dYg~~~~~~~~~~~~~~~~~~~~~c~c~~~~cr~~~~~ 355 (364)
T KOG1082|consen 306 LRDISPGEELTLDYGKAYKLLVQDGANIYTPVMKKNCNCGLEKCRGLLGS 355 (364)
T ss_pred ccccCCCcccchhhcccccccccccccccccccchhhcCCCHHhCcccCC
Confidence 99999999999999987531111111 1133667999999999999863
No 3
>KOG4442|consensus
Probab=100.00 E-value=2.1e-39 Score=350.28 Aligned_cols=156 Identities=35% Similarity=0.631 Sum_probs=139.2
Q ss_pred eEecCC-CCC-CCCCCcCcccccCCCcCEEEEEeCCCCceeEEcCCccCCCcEEEEeeeeeecHHHHHhh--------cC
Q psy14657 428 IYECNK-KCA-CDETCLNRVVQKGITLPLTIFKTKNNRGWGVRTPDKIKAGTFVCEYVGEILTHENALQR--------TN 497 (605)
Q Consensus 428 i~EC~~-~C~-C~~~C~Nrv~q~g~~~~l~i~~t~~~kG~Gv~A~~~I~~G~~I~ey~Gevi~~~e~~~r--------~~ 497 (605)
..||++ .|. |+..|.|+.+|+....+++||.| .++||||+|.++|++|+||+||.||||+..++..| ..
T Consensus 94 ~iECs~~~C~~cg~~C~NQRFQkkqyA~vevF~T-e~KG~GLRA~~dI~~g~FI~EY~GEVI~~~Ef~kR~~~Y~~d~~k 172 (729)
T KOG4442|consen 94 SIECSDRECPRCGVYCKNQRFQKKQYAKVEVFLT-EKKGCGLRAEEDIPKGQFILEYIGEVIEEKEFEKRVKRYAKDGIK 172 (729)
T ss_pred hcccCCccCCCccccccchhhhhhccCceeEEEe-cCcccceeeccccCCCcEEeeeccccccHHHHHHHHHHHHhcCCc
Confidence 469977 998 99999999999999999999999 69999999999999999999999999999999988 33
Q ss_pred CceeeeccCCCCCCCceEEEeeeeccCcccccccCCCCCeeEE-EEecCCCCeeEEEEEEcCCCCCCCeEEEecCCCccc
Q psy14657 498 QTYCFNLDFNQDSNSVAFVLDAARYGNVSHFINHSCDPNLEVS-RINNLNPDLHHVALFAKRDINKNEELSFCYLDLTKA 576 (605)
Q Consensus 498 ~~y~f~l~~~~~~~~~~~~IDa~~~GN~~RfiNHSC~PN~~~~-~~~~~d~~~~~i~~fA~rdI~~GEELT~dY~~~~~~ 576 (605)
..|.|.|.. .++|||+.+||.||||||||+|||.+. |... +..||+|||.|+|++||||||||+.....
T Consensus 173 h~Yfm~L~~-------~e~IDAT~KGnlaRFiNHSC~PNa~~~KWtV~---~~lRvGiFakk~I~~GEEITFDYqf~rYG 242 (729)
T KOG4442|consen 173 HYYFMALQG-------GEYIDATKKGNLARFINHSCDPNAEVQKWTVP---DELRVGIFAKKVIKPGEEITFDYQFDRYG 242 (729)
T ss_pred eEEEEEecC-------CceecccccCcHHHhhcCCCCCCceeeeeeeC---CeeEEEEeEecccCCCceeeEeccccccc
Confidence 455555543 469999999999999999999999999 9874 68999999999999999999999986421
Q ss_pred ccchhhhcCCCCeeeecCCCCCccccc
Q psy14657 577 KFTSSKRKKLVRNECRCGSSNCLGYYY 603 (605)
Q Consensus 577 ~~~~~~~~~~~~~~C~Cgs~~Crg~l~ 603 (605)
....+|.||+++|||||.
T Consensus 243 ---------r~AQ~CyCgeanC~G~IG 260 (729)
T KOG4442|consen 243 ---------RDAQPCYCGEANCRGWIG 260 (729)
T ss_pred ---------ccccccccCCcccccccC
Confidence 145689999999999984
No 4
>KOG1080|consensus
Probab=99.94 E-value=1.8e-27 Score=273.87 Aligned_cols=129 Identities=36% Similarity=0.658 Sum_probs=111.7
Q ss_pred EEEEeCCCCceeEEcCCccCCCcEEEEeeeeeecHHHHHhh--------cCCceeeeccCCCCCCCceEEEeeeeccCcc
Q psy14657 455 TIFKTKNNRGWGVRTPDKIKAGTFVCEYVGEILTHENALQR--------TNQTYCFNLDFNQDSNSVAFVLDAARYGNVS 526 (605)
Q Consensus 455 ~i~~t~~~kG~Gv~A~~~I~~G~~I~ey~Gevi~~~e~~~r--------~~~~y~f~l~~~~~~~~~~~~IDa~~~GN~~ 526 (605)
.-|..+..+||||||++.|.+|++|+||+||+|...-|+.| .++.|+|.+|. .++|||++.||+|
T Consensus 868 ~~F~~s~iH~wglfa~~~i~~~dmViEY~Ge~vR~~iad~RE~~Y~~~gi~~sYlfrid~-------~~ViDAtk~gniA 940 (1005)
T KOG1080|consen 868 VKFGRSGIHGWGLFAMENIAAGDMVIEYRGELVRSSIADLREARYERMGIGDSYLFRIDD-------EVVVDATKKGNIA 940 (1005)
T ss_pred hccccccccccceeeccCccccceEEEeeceehhhhHHHHHHHHHhccCcccceeeeccc-------ceEEeccccCchh
Confidence 34455589999999999999999999999999988877766 57899999985 4599999999999
Q ss_pred cccccCCCCCeeEEEEecCCCCeeEEEEEEcCCCCCCCeEEEecCCCcccccchhhhcCCCCeeeecCCCCCcccc
Q psy14657 527 HFINHSCDPNLEVSRINNLNPDLHHVALFAKRDINKNEELSFCYLDLTKAKFTSSKRKKLVRNECRCGSSNCLGYY 602 (605)
Q Consensus 527 RfiNHSC~PN~~~~~~~~~d~~~~~i~~fA~rdI~~GEELT~dY~~~~~~~~~~~~~~~~~~~~C~Cgs~~Crg~l 602 (605)
|||||||+|||....+.. + +..+|+|||.|+|.+|||||+||.+...+ ....|+|||++|||+|
T Consensus 941 r~InHsC~PNCyakvi~V-~-g~~~IvIyakr~I~~~EElTYDYkF~~e~----------~kipClCgap~Crg~~ 1004 (1005)
T KOG1080|consen 941 RFINHSCNPNCYAKVITV-E-GDKRIVIYSKRDIAAGEELTYDYKFPTED----------DKIPCLCGAPNCRGFL 1004 (1005)
T ss_pred heeecccCCCceeeEEEe-c-CeeEEEEEEecccccCceeeeeccccccc----------cccccccCCCcccccc
Confidence 999999999999984332 2 56699999999999999999999875431 3679999999999987
No 5
>KOG1079|consensus
Probab=99.91 E-value=6.6e-25 Score=236.69 Aligned_cols=137 Identities=32% Similarity=0.576 Sum_probs=119.6
Q ss_pred ceEec-CCCCCC-C---------CCCcCcccccCCCcCEEEEEeCCCCceeEEcCCccCCCcEEEEeeeeeecHHHHHhh
Q psy14657 427 PIYEC-NKKCAC-D---------ETCLNRVVQKGITLPLTIFKTKNNRGWGVRTPDKIKAGTFVCEYVGEILTHENALQR 495 (605)
Q Consensus 427 ~i~EC-~~~C~C-~---------~~C~Nrv~q~g~~~~l~i~~t~~~kG~Gv~A~~~I~~G~~I~ey~Gevi~~~e~~~r 495 (605)
+..|| |..|.+ + .+|.|--+|++.+.++.+..+ ...|||+|+.+...+++||.||+||+|+.+||++|
T Consensus 558 A~rECdPd~Cl~cg~~~~~d~~~~~C~N~~l~~~~qkr~llapS-dVaGwGlFlKe~v~KnefisEY~GE~IS~dEADrR 636 (739)
T KOG1079|consen 558 AVRECDPDVCLMCGNVDHFDSSKISCKNTNLQRGEQKRVLLAPS-DVAGWGLFLKESVSKNEFISEYTGEIISHDEADRR 636 (739)
T ss_pred hccccCchHHhccCcccccccCccccccchhhhhhhcceeechh-hccccceeeccccCCCceeeeecceeccchhhhhc
Confidence 45699 478865 2 369999999999999998777 88999999999999999999999999999999999
Q ss_pred ------cCCceeeeccCCCCCCCceEEEeeeeccCcccccccCCCCCeeEE-EEecCCCCeeEEEEEEcCCCCCCCeEEE
Q psy14657 496 ------TNQTYCFNLDFNQDSNSVAFVLDAARYGNVSHFINHSCDPNLEVS-RINNLNPDLHHVALFAKRDINKNEELSF 568 (605)
Q Consensus 496 ------~~~~y~f~l~~~~~~~~~~~~IDa~~~GN~~RfiNHSC~PN~~~~-~~~~~d~~~~~i~~fA~rdI~~GEELT~ 568 (605)
..-+|+|+|.. .|+|||++.||.+||+|||-+|||... .+.. +..+|+|||.|.|.+||||||
T Consensus 637 GkiYDr~~cSflFnln~-------dyviDs~rkGnk~rFANHS~nPNCYAkvm~V~---GdhRIGifAkRaIeagEELff 706 (739)
T KOG1079|consen 637 GKIYDRYMCSFLFNLNN-------DYVIDSTRKGNKIRFANHSFNPNCYAKVMMVA---GDHRIGIFAKRAIEAGEELFF 706 (739)
T ss_pred ccccccccceeeeeccc-------cceEeeeeecchhhhccCCCCCCcEEEEEEec---CCcceeeeehhhcccCceeee
Confidence 44567777643 489999999999999999999999988 4433 567999999999999999999
Q ss_pred ecCCCc
Q psy14657 569 CYLDLT 574 (605)
Q Consensus 569 dY~~~~ 574 (605)
||.++-
T Consensus 707 DYrYs~ 712 (739)
T KOG1079|consen 707 DYRYSP 712 (739)
T ss_pred eeccCc
Confidence 999864
No 6
>smart00317 SET SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues
Probab=99.86 E-value=3.9e-21 Score=170.57 Aligned_cols=107 Identities=44% Similarity=0.799 Sum_probs=90.1
Q ss_pred EEEEEeCCCCceeEEcCCccCCCcEEEEeeeeeecHHHHHhh------cC--CceeeeccCCCCCCCceEEEeeeeccCc
Q psy14657 454 LTIFKTKNNRGWGVRTPDKIKAGTFVCEYVGEILTHENALQR------TN--QTYCFNLDFNQDSNSVAFVLDAARYGNV 525 (605)
Q Consensus 454 l~i~~t~~~kG~Gv~A~~~I~~G~~I~ey~Gevi~~~e~~~r------~~--~~y~f~l~~~~~~~~~~~~IDa~~~GN~ 525 (605)
++++++ +++|+||+|+++|++|++|++|.|.++...++... .+ ..|+|.... .++||+...||+
T Consensus 2 ~~~~~~-~~~G~gl~a~~~i~~g~~i~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~id~~~~~~~ 73 (116)
T smart00317 2 LEVFKS-PGKGWGVRATEDIPKGEFIGEYVGEIITSEEAEERSKAYDTDGADSFYLFEIDS-------DLCIDARRKGNI 73 (116)
T ss_pred cEEEec-CCCcEEEEECCccCCCCEEEEEEeEEECHHHHHHHHHHHHhcCCCCEEEEECCC-------CEEEeCCccCcH
Confidence 566777 79999999999999999999999999988776654 22 366666532 579999999999
Q ss_pred ccccccCCCCCeeEEEEecCCCCeeEEEEEEcCCCCCCCeEEEec
Q psy14657 526 SHFINHSCDPNLEVSRINNLNPDLHHVALFAKRDINKNEELSFCY 570 (605)
Q Consensus 526 ~RfiNHSC~PN~~~~~~~~~d~~~~~i~~fA~rdI~~GEELT~dY 570 (605)
+|||||||.||+.+.+.... +..++.++|+|||++|||||+||
T Consensus 74 ~~~iNHsc~pN~~~~~~~~~--~~~~~~~~a~r~I~~GeEi~i~Y 116 (116)
T smart00317 74 ARFINHSCEPNCELLFVEVN--GDSRIVIFALRDIKPGEELTIDY 116 (116)
T ss_pred HHeeCCCCCCCEEEEEEEEC--CCcEEEEEECCCcCCCCEEeecC
Confidence 99999999999999954432 34489999999999999999999
No 7
>KOG1083|consensus
Probab=99.79 E-value=1.3e-20 Score=210.49 Aligned_cols=123 Identities=43% Similarity=0.761 Sum_probs=106.1
Q ss_pred CcCccccc-CCCcCEEEEEeCCCCceeEEcCCccCCCcEEEEeeeeeecHHHHHhh-------cCCceeeeccCCCCCCC
Q psy14657 441 CLNRVVQK-GITLPLTIFKTKNNRGWGVRTPDKIKAGTFVCEYVGEILTHENALQR-------TNQTYCFNLDFNQDSNS 512 (605)
Q Consensus 441 C~Nrv~q~-g~~~~l~i~~t~~~kG~Gv~A~~~I~~G~~I~ey~Gevi~~~e~~~r-------~~~~y~f~l~~~~~~~~ 512 (605)
|.|+.+|+ +.-.+|++|+. +.+||||+|.++|++|+||+||+|+|++.++++.+ ....|...++
T Consensus 1166 c~nqrm~r~e~cp~L~v~~g-p~~G~~v~tk~PikagtfI~EYvGeVit~ke~e~~mmtl~~~d~~~~cL~I~------- 1237 (1306)
T KOG1083|consen 1166 CSNQRMQRHEECPPLEVFRG-PKKGWGVRTKEPIKAGTFIMEYVGEVITEKEFEPRMMTLYHNDDDHYCLVID------- 1237 (1306)
T ss_pred hhhHHhhhhccCCCcceecc-CCCCccccccccccccchHHHHHHHHHHHHhhcccccccCCCCCcccccccC-------
Confidence 78888776 55677999999 89999999999999999999999999999988876 2234444443
Q ss_pred ceEEEeeeeccCcccccccCCCCCeeEE-EEecCCCCeeEEEEEEcCCCCCCCeEEEecCCCc
Q psy14657 513 VAFVLDAARYGNVSHFINHSCDPNLEVS-RINNLNPDLHHVALFAKRDINKNEELSFCYLDLT 574 (605)
Q Consensus 513 ~~~~IDa~~~GN~~RfiNHSC~PN~~~~-~~~~~d~~~~~i~~fA~rdI~~GEELT~dY~~~~ 574 (605)
...+||+.++||.+||+||||.|||.++ |..+ +..|+.+||+|||.+|||||+||+...
T Consensus 1238 p~l~id~~R~~n~~RfinhscKPNc~~qkwSVN---G~~Rv~L~A~rDi~kGEELtYDYN~ks 1297 (1306)
T KOG1083|consen 1238 PGLFIDIPRMGNGARFINHSCKPNCEMQKWSVN---GEYRVGLFALRDLPKGEELTYDYNFKS 1297 (1306)
T ss_pred ccccCChhhccccccccccccCCCCcccccccc---ceeeeeeeecCCCCCCceEEEeccccc
Confidence 3569999999999999999999999999 7654 889999999999999999999998743
No 8
>PF05033 Pre-SET: Pre-SET motif; InterPro: IPR007728 This region is found in a number of histone lysine methyltransferases (HMTase), N-terminal to the SET domain; it is generally described as the pre-SET domain. Histone lysine methylation is part of the histone code that regulated chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception [], the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain [, ]. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities []. The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils and stabilising the SET domain. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site [] when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site []. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity []. ; GO: 0008270 zinc ion binding, 0018024 histone-lysine N-methyltransferase activity, 0034968 histone lysine methylation, 0005634 nucleus; PDB: 3K5K_A 2O8J_D 3RJW_B 1ML9_A 1PEG_B 1MVH_A 1MVX_A 3BO5_A 2RFI_B 3MO5_B ....
Probab=99.79 E-value=9.7e-20 Score=160.15 Aligned_cols=97 Identities=33% Similarity=0.776 Sum_probs=69.4
Q ss_pred CCCCccEEEeCCCCCCCCCCcEEccCCCCCCCcc-ccCCCcccccCCCCCcCCCccccccccCCCccccccccceeeccC
Q psy14657 84 ETAARLTLENNFDLESPPMDFTYIPSSVPRDGVV-VTDDPVIWCECRGNCVSNRDACCSDLNDADFAYSRRTKRLKLEKG 162 (605)
Q Consensus 84 ~~~~~I~~~N~vD~~~~p~~F~Yi~~~~~~~~~~-~~~~~~~gC~C~~~C~~~~~~cC~~~~~~~~~Y~~~~~~l~~~~~ 162 (605)
++..||+|+|+||++.||++|+||+++++++++. ++.....||+|.+.|.+...|.|.+.++..++|+. .++|.+...
T Consensus 6 ~e~~pI~~~N~vd~~~~p~~F~Yi~~~~~~~~~~~~~~~~~~~C~C~~~C~~~~~C~C~~~~~~~~~Y~~-~g~l~~~~~ 84 (103)
T PF05033_consen 6 KENVPIPVVNDVDDEPPPPNFEYIPENIYGEGVPDIDPEFLQGCDCSGDCSNPSNCECLQRNGGIFAYDS-NGRLRIPDK 84 (103)
T ss_dssp SSSS-EEEEESSSS--SSTSSEE-SS-EESTTSS-TBGGGTS----SSSSTCTTTSHHHCCTSSS-SB-T-TSSBSSSST
T ss_pred ccCCCEEEEeCCCCCCCCCCeEEeeeEEcCCCccccccccCccCccCCCCCCCCCCcCccccCccccccC-CCcCccCCC
Confidence 3566999999999999988999999999999887 44566789999999966667779999888899997 477777788
Q ss_pred cccccCCCCcCCCcCCccc
Q psy14657 163 TPIYECNKKCACDETCLNR 181 (605)
Q Consensus 163 ~~IyECn~~C~C~~~C~NR 181 (605)
.+|||||+.|+|+++|+||
T Consensus 85 ~~i~EC~~~C~C~~~C~NR 103 (103)
T PF05033_consen 85 PPIFECNDNCGCSPSCRNR 103 (103)
T ss_dssp SEEE---TTSSS-TTSTT-
T ss_pred CeEEeCCCCCCCCCCCCCC
Confidence 9999999999999999998
No 9
>smart00468 PreSET N-terminal to some SET domains. A Cys-rich putative Zn2+-binding domain that occurs N-terminal to some SET domains. Function is unknown. Unpublished.
Probab=99.78 E-value=2.5e-19 Score=156.10 Aligned_cols=88 Identities=34% Similarity=0.740 Sum_probs=78.4
Q ss_pred CCCccEEEeCCCCCCCCCCcEEccCCCCCCCcc--ccCCCcccccCCCCCcCCCccccccccCCCccccccccceeeccC
Q psy14657 85 TAARLTLENNFDLESPPMDFTYIPSSVPRDGVV--VTDDPVIWCECRGNCVSNRDACCSDLNDADFAYSRRTKRLKLEKG 162 (605)
Q Consensus 85 ~~~~I~~~N~vD~~~~p~~F~Yi~~~~~~~~~~--~~~~~~~gC~C~~~C~~~~~~cC~~~~~~~~~Y~~~~~~l~~~~~ 162 (605)
+..||+++|+||++.||.+|+||+++++++|+. .+..++.||+|.++|++...|+|.+++++.+||+. .+++.+..+
T Consensus 9 E~~pI~~vN~vD~~~~p~~F~Yi~~~~~~~gv~~~~~~~~~~gC~C~~~C~~~~~C~C~~~~~~~~~Y~~-~~~~~~~~~ 87 (98)
T smart00468 9 ENVPVPLVNEVDEDPPPPDFEYISEYIYGQGVPIDRSPSPLVGCSCSGDCSSSNKCECARKNGGEFAYEL-NGGLRLKRK 87 (98)
T ss_pred cCCCcceEecCCCCCCCCCcEECcceEcCCCcccccCCCCCCCCcCCCCCCCCCcCCcHhhcCCccCccc-CCCEEeCCC
Confidence 456999999999999988999999999999986 35678899999999999766889999999999963 478889999
Q ss_pred cccccCCCCcC
Q psy14657 163 TPIYECNKKCA 173 (605)
Q Consensus 163 ~~IyECn~~C~ 173 (605)
+||||||++|+
T Consensus 88 ~~IyECn~~C~ 98 (98)
T smart00468 88 PLIYECNSRCS 98 (98)
T ss_pred CEEEcCCCCCC
Confidence 99999999995
No 10
>KOG1082|consensus
Probab=99.76 E-value=7.3e-19 Score=187.92 Aligned_cols=137 Identities=23% Similarity=0.380 Sum_probs=116.1
Q ss_pred CCCccEEEeCCCCCCCCCCcEEccCCCCCCCccccCCCcccccCCCCCcCCCc--cccccccCCCccccccccc-eeecc
Q psy14657 85 TAARLTLENNFDLESPPMDFTYIPSSVPRDGVVVTDDPVIWCECRGNCVSNRD--ACCSDLNDADFAYSRRTKR-LKLEK 161 (605)
Q Consensus 85 ~~~~I~~~N~vD~~~~p~~F~Yi~~~~~~~~~~~~~~~~~gC~C~~~C~~~~~--~cC~~~~~~~~~Y~~~~~~-l~~~~ 161 (605)
+.-||+++|+||.+.+ .+|.|++..++.++...+.....+|.|.+.|..... |.|...+++.++|+..... .....
T Consensus 70 e~~~v~~~n~id~~~~-~~f~y~~~~~~~~~~~~~~~~~~~c~C~~~~~~~~~~~C~C~~~n~~~~~~~~~~~~~~~~~~ 148 (364)
T KOG1082|consen 70 ENLPVPLVNRIDEDAP-LYFQYIATEIVDPGELSDCENSTGCRCCSSCSSVLPLTCLCERHNGGLVAYTCDGDCGTLGKF 148 (364)
T ss_pred ccCceeeeeeccCCcc-ccceeccccccCccccccCccccCCCccCCCCCCCCccccChHhhCCccccccCCcccccccc
Confidence 4449999999998777 799999999999875555577899999988876433 6799999999999854322 56677
Q ss_pred CcccccCCCCcCCCcCCccceeeeCceeeeeEEEeeccCCceeEEEeecccccccccCCCceeEEeeeccc
Q psy14657 162 GTPIYECNKKCACDETCLNRVVQKGNLVQDCHTTITSRDGSVVLLRVADINLTKFSSDTMTEYEVESVLDS 232 (605)
Q Consensus 162 ~~~IyECn~~C~C~~~C~NRVvQ~G~~~~L~vf~~~~~kG~~~~~~~~~I~i~~~~~~~~~~y~vE~Ild~ 232 (605)
+.+||||++.|+|+++|.|||||+|++++|+||++.. +||| +++...++.++|++|++-..
T Consensus 149 ~~~i~EC~~~C~C~~~C~nRv~q~g~~~~leIfrt~~-kGwg---------vRs~~~I~~G~fvcEyaGe~ 209 (364)
T KOG1082|consen 149 KEPVFECSVACGCHPDCANRVVQKGLQFHLEVFRTPE-KGWG---------VRTLDPIPAGEFVCEYAGEV 209 (364)
T ss_pred CccccccccCCCCCCcCcchhhccccccceEEEecCC-ceee---------ecccccccCCCeeEEEeeEe
Confidence 8899999999999999999999999999999999555 8999 78888888999999988665
No 11
>PF05033 Pre-SET: Pre-SET motif; InterPro: IPR007728 This region is found in a number of histone lysine methyltransferases (HMTase), N-terminal to the SET domain; it is generally described as the pre-SET domain. Histone lysine methylation is part of the histone code that regulated chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception [], the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain [, ]. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities []. The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils and stabilising the SET domain. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site [] when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site []. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity []. ; GO: 0008270 zinc ion binding, 0018024 histone-lysine N-methyltransferase activity, 0034968 histone lysine methylation, 0005634 nucleus; PDB: 3K5K_A 2O8J_D 3RJW_B 1ML9_A 1PEG_B 1MVH_A 1MVX_A 3BO5_A 2RFI_B 3MO5_B ....
Probab=99.68 E-value=5.9e-17 Score=142.45 Aligned_cols=98 Identities=34% Similarity=0.806 Sum_probs=69.0
Q ss_pred ccCCCcceEEEcCCCCCCCCCCcEEeeecccCCCcc-ccCCCCcccccCCCCcCCCccc-ccccCCCCcceecccCcccc
Q psy14657 345 ACETAARLTLENNFDLESPPMDFTYIPSSVPRDGVV-VTDDPVIWCECRGNCVSNRDAC-CSDLNDADFAYSRRTKRLKL 422 (605)
Q Consensus 345 ~~~~~~pi~~~N~vd~~~~p~~F~yi~~~~~~~~~~-~~~~~~~gC~C~~~C~~~~~~C-c~~~~~~~~~y~~~~~~l~~ 422 (605)
.+++..||+++|++|++.+|++|+||+++++++++. .+.....||+|.++|..... | |.+..+..++|+.. |+|..
T Consensus 4 ~g~e~~pI~~~N~vd~~~~p~~F~Yi~~~~~~~~~~~~~~~~~~~C~C~~~C~~~~~-C~C~~~~~~~~~Y~~~-g~l~~ 81 (103)
T PF05033_consen 4 RGKENVPIPVVNDVDDEPPPPNFEYIPENIYGEGVPDIDPEFLQGCDCSGDCSNPSN-CECLQRNGGIFAYDSN-GRLRI 81 (103)
T ss_dssp CTSSSS-EEEEESSSS--SSTSSEE-SS-EESTTSS-TBGGGTS----SSSSTCTTT-SHHHCCTSSS-SB-TT-SSBSS
T ss_pred CCccCCCEEEEeCCCCCCCCCCeEEeeeEEcCCCccccccccCccCccCCCCCCCCC-CcCccccCccccccCC-CcCcc
Confidence 356779999999999999999999999999999876 44556789999988955554 6 77777777899974 88888
Q ss_pred cCCCceEecCCCCCCCCCCcCc
Q psy14657 423 EKGTPIYECNKKCACDETCLNR 444 (605)
Q Consensus 423 ~~~~~i~EC~~~C~C~~~C~Nr 444 (605)
....+||||++.|+|+.+|+||
T Consensus 82 ~~~~~i~EC~~~C~C~~~C~NR 103 (103)
T PF05033_consen 82 PDKPPIFECNDNCGCSPSCRNR 103 (103)
T ss_dssp SSTSEEE---TTSSS-TTSTT-
T ss_pred CCCCeEEeCCCCCCCCCCCCCC
Confidence 8899999999999999999997
No 12
>KOG1085|consensus
Probab=99.67 E-value=1.5e-16 Score=156.87 Aligned_cols=124 Identities=31% Similarity=0.478 Sum_probs=101.8
Q ss_pred cccCCCcCEEEEEeCCCCceeEEcCCccCCCcEEEEeeeeeecHHHHHhh--------cCCceeeeccCCCCCCCceEEE
Q psy14657 446 VQKGITLPLTIFKTKNNRGWGVRTPDKIKAGTFVCEYVGEILTHENALQR--------TNQTYCFNLDFNQDSNSVAFVL 517 (605)
Q Consensus 446 ~q~g~~~~l~i~~t~~~kG~Gv~A~~~I~~G~~I~ey~Gevi~~~e~~~r--------~~~~y~f~l~~~~~~~~~~~~I 517 (605)
+..|....|.+... .+||.||+|...+.+|+||.||.|.+|.-.+|..| ....|+|-..++ ...|+|
T Consensus 250 vl~g~~egl~~~~~-dgKGRGv~a~~~F~rgdFVVEY~Gdliei~eAk~rE~~Ya~De~~GcYMYyF~h~----sk~yCi 324 (392)
T KOG1085|consen 250 VLKGTNEGLLEVYK-DGKGRGVRAKVNFERGDFVVEYRGDLIEISEAKVREEQYANDEEIGCYMYYFEHN----SKKYCI 324 (392)
T ss_pred HHhccccceeEEee-ccccceeEeecccccCceEEEEecceeeechHHHHHHHhccCcccceEEEeeecc----Ceeeee
Confidence 44566777777777 68999999999999999999999999988888776 334566655443 347999
Q ss_pred eeeec-cCcccccccCCCCCeeEEEEecCCCCeeEEEEEEcCCCCCCCeEEEecCCCccc
Q psy14657 518 DAARY-GNVSHFINHSCDPNLEVSRINNLNPDLHHVALFAKRDINKNEELSFCYLDLTKA 576 (605)
Q Consensus 518 Da~~~-GN~~RfiNHSC~PN~~~~~~~~~d~~~~~i~~fA~rdI~~GEELT~dY~~~~~~ 576 (605)
||+.- +-++|.||||=.+|+....+.. | +.|++.++|.|||.+||||++|||+...+
T Consensus 325 DAT~et~~lGRLINHS~~gNl~TKvv~I-d-g~pHLiLvA~rdIa~GEELlYDYGDRSke 382 (392)
T KOG1085|consen 325 DATKETPWLGRLINHSVRGNLKTKVVEI-D-GSPHLILVARRDIAQGEELLYDYGDRSKE 382 (392)
T ss_pred ecccccccchhhhcccccCcceeeEEEe-c-CCceEEEEeccccccchhhhhhccccchh
Confidence 99874 5589999999999999985442 2 78999999999999999999999997643
No 13
>smart00468 PreSET N-terminal to some SET domains. A Cys-rich putative Zn2+-binding domain that occurs N-terminal to some SET domains. Function is unknown. Unpublished.
Probab=99.59 E-value=1.8e-15 Score=131.81 Aligned_cols=90 Identities=33% Similarity=0.721 Sum_probs=77.7
Q ss_pred cCCCcceEEEcCCCCCCCCCCcEEeeecccCCCcc--ccCCCCcccccCCCCcCCCcccccccCCCCcceecccCccccc
Q psy14657 346 CETAARLTLENNFDLESPPMDFTYIPSSVPRDGVV--VTDDPVIWCECRGNCVSNRDACCSDLNDADFAYSRRTKRLKLE 423 (605)
Q Consensus 346 ~~~~~pi~~~N~vd~~~~p~~F~yi~~~~~~~~~~--~~~~~~~gC~C~~~C~~~~~~Cc~~~~~~~~~y~~~~~~l~~~ 423 (605)
+++..||+++|+||++.||++|+||++++++.++. .+.++..||+|.++|.+....+|.+.+++.|+|+. .+++.+.
T Consensus 7 G~E~~pI~~vN~vD~~~~p~~F~Yi~~~~~~~gv~~~~~~~~~~gC~C~~~C~~~~~C~C~~~~~~~~~Y~~-~~~~~~~ 85 (98)
T smart00468 7 GKENVPVPLVNEVDEDPPPPDFEYISEYIYGQGVPIDRSPSPLVGCSCSGDCSSSNKCECARKNGGEFAYEL-NGGLRLK 85 (98)
T ss_pred CccCCCcceEecCCCCCCCCCcEECcceEcCCCcccccCCCCCCCCcCCCCCCCCCcCCcHhhcCCccCccc-CCCEEeC
Confidence 56779999999999999999999999999999975 35677899999999999874148888889999963 3778888
Q ss_pred CCCceEecCCCCC
Q psy14657 424 KGTPIYECNKKCA 436 (605)
Q Consensus 424 ~~~~i~EC~~~C~ 436 (605)
.+.+|||||+.|+
T Consensus 86 ~~~~IyECn~~C~ 98 (98)
T smart00468 86 RKPLIYECNSRCS 98 (98)
T ss_pred CCCEEEcCCCCCC
Confidence 9999999999985
No 14
>COG2940 Proteins containing SET domain [General function prediction only]
Probab=99.56 E-value=6.6e-16 Score=171.25 Aligned_cols=167 Identities=28% Similarity=0.425 Sum_probs=121.3
Q ss_pred ecCCCCCCCCCCcCcccccCCCcCEEEEEeCCCCceeEEcCCccCCCcEEEEeeeeeecHHHHHhhcCCc----eeeecc
Q psy14657 430 ECNKKCACDETCLNRVVQKGITLPLTIFKTKNNRGWGVRTPDKIKAGTFVCEYVGEILTHENALQRTNQT----YCFNLD 505 (605)
Q Consensus 430 EC~~~C~C~~~C~Nrv~q~g~~~~l~i~~t~~~kG~Gv~A~~~I~~G~~I~ey~Gevi~~~e~~~r~~~~----y~f~l~ 505 (605)
++.........+.|.........+..+... ..+|||+||+..|++|++|.+|.|+++...++..+.... +.+.+.
T Consensus 310 ~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~g~fa~~~i~~~e~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 388 (480)
T COG2940 310 FSKSNVSKLKELLNSNGCKKRREPNVVQES-EIKGYGVFALESIKKGEFIIEYHGEIIRRKEAREREENYDLLGNEFSFG 388 (480)
T ss_pred cccccCccccchhhhcccccccchhhhhhh-cccccceeehhhccchHHHHHhcCcccchHHHHhhhccccccccccchh
Confidence 344444444446555555555555555555 889999999999999999999999999988887772222 111111
Q ss_pred CCCCCCCceEEEeeeeccCcccccccCCCCCeeEEEEecCCCCeeEEEEEEcCCCCCCCeEEEecCCCcccccchhhhcC
Q psy14657 506 FNQDSNSVAFVLDAARYGNVSHFINHSCDPNLEVSRINNLNPDLHHVALFAKRDINKNEELSFCYLDLTKAKFTSSKRKK 585 (605)
Q Consensus 506 ~~~~~~~~~~~IDa~~~GN~~RfiNHSC~PN~~~~~~~~~d~~~~~i~~fA~rdI~~GEELT~dY~~~~~~~~~~~~~~~ 585 (605)
.. .....++|+...|+++|||||||.||+....+... +..++.++|+|||.+|||||+||+...+....+. ...
T Consensus 389 ~~---~~~~~~~d~~~~g~~~r~~nHS~~pN~~~~~~~~~--g~~~~~~~~~rDI~~geEl~~dy~~~~~~~~~~~-~~~ 462 (480)
T COG2940 389 LL---EDKDKVRDSQKAGDVARFINHSCTPNCEASPIEVN--GIFKISIYAIRDIKAGEELTYDYGPSLEDNRELK-KLL 462 (480)
T ss_pred hc---cccchhhhhhhcccccceeecCCCCCcceeccccc--ccceeeecccccchhhhhhccccccccccchhhh-hhh
Confidence 11 11267999999999999999999999999855432 3679999999999999999999999876432101 112
Q ss_pred CCCeeeecCCCCCccccc
Q psy14657 586 LVRNECRCGSSNCLGYYY 603 (605)
Q Consensus 586 ~~~~~C~Cgs~~Crg~l~ 603 (605)
.....|.||+..|+++|.
T Consensus 463 ~~~~~~~~~~~~~~~~~~ 480 (480)
T COG2940 463 EKRWGCACGEDRCSHTMS 480 (480)
T ss_pred hhhhccccCCCccCCCCC
Confidence 355789999999999873
No 15
>KOG1141|consensus
Probab=99.51 E-value=3e-15 Score=164.05 Aligned_cols=156 Identities=31% Similarity=0.586 Sum_probs=121.2
Q ss_pred cCCCcceEEEcCCCCCCCCCCcEEeeecccCCCccc--cCCCCcccccCCCCcCCCccc-ccccCC--------CCc---
Q psy14657 346 CETAARLTLENNFDLESPPMDFTYIPSSVPRDGVVV--TDDPVIWCECRGNCVSNRDAC-CSDLND--------ADF--- 411 (605)
Q Consensus 346 ~~~~~pi~~~N~vd~~~~p~~F~yi~~~~~~~~~~~--~~~~~~gC~C~~~C~~~~~~C-c~~~~~--------~~~--- 411 (605)
+++.+||...|++|.-.+ +.|.|-...|++.+..+ ..+...+|+|..+|.+..+ | |.++.. +..
T Consensus 681 g~e~vpis~~neids~~l-pq~ay~K~~ip~~~nl~n~~~~fl~scdc~~gcid~~k-cachQltvk~~~t~p~~~v~~t 758 (1262)
T KOG1141|consen 681 GREHVPISEKNEIDSHRL-PQAAYKKHMIPTNNNLSNRRKDFLQSCDCPTGCIDSMK-CACHQLTVKKKTTGPNQNVAST 758 (1262)
T ss_pred CccccccceeecccCcCC-ccchhheeeccCCCcccccChhhhhcCCCCcchhhhhh-hhHHHHHHHhhccCCCcccccC
Confidence 466689999999997654 48999999999988654 3456889999999988776 5 443210 000
Q ss_pred -ceecccCcccccCCCceEecCCCCCCCCC-CcCcccccCCCcCEEEEEeCCCCceeEEcCCccCCCcEEEEeeeeeecH
Q psy14657 412 -AYSRRTKRLKLEKGTPIYECNKKCACDET-CLNRVVQKGITLPLTIFKTKNNRGWGVRTPDKIKAGTFVCEYVGEILTH 489 (605)
Q Consensus 412 -~y~~~~~~l~~~~~~~i~EC~~~C~C~~~-C~Nrv~q~g~~~~l~i~~t~~~kG~Gv~A~~~I~~G~~I~ey~Gevi~~ 489 (605)
.| .++||.-...+.+|||+..|+|.+. |.||++|+|.+.+|++|+| ..+|||++...+|.+|.|||.|.|.++++
T Consensus 759 ~gy--kyKRl~e~~ptg~yEc~k~ckc~~~~C~nrmvqhg~qvRlq~fkt-~~kGWg~rclddi~~g~fVciy~g~~l~~ 835 (1262)
T KOG1141|consen 759 NGY--KYKRLIEIRPTGPYECLKACKCCGPDCLNRMVQHGYQVRLQRFKT-IHKGWGRRCLDDITGGNFVCIYPGGALLH 835 (1262)
T ss_pred cch--hhHHHHHhcCCCHHHHHHhhccCcHHHHHHHhhcCceeEeeeccc-cccccceEeeeecCCceEEEEecchhhhh
Confidence 11 1245544445668999999999877 9999999999999999999 78999999999999999999999999887
Q ss_pred HHHHhh---cCCceeeeccC
Q psy14657 490 ENALQR---TNQTYCFNLDF 506 (605)
Q Consensus 490 ~e~~~r---~~~~y~f~l~~ 506 (605)
.-++.. .+..|+..+|.
T Consensus 836 ~~sdks~~~~~~~~~~~id~ 855 (1262)
T KOG1141|consen 836 QISDKSEYIHVTRSLLTIDC 855 (1262)
T ss_pred hhchhhhhcccchhhhcccc
Confidence 766554 44556655553
No 16
>PF00856 SET: SET domain; InterPro: IPR001214 The SET domain appears generally as one part of a larger multidomain protein, and recently there were described three structures of very different proteins with distinct domain compositions: Neurospora crassa DIM-5, a member of the Su(var) family of HKMTs which methylate histone H3 on lysine 9,human SET7 (also called SET9), which methylates H3 on lysine 4 and garden pea Rubisco LSMT, an enzyme that does not modify histones, but instead methylates lysine 14 in the flexible tail of the large subunit of the enzyme Rubisco. The SET domain itself turned out to be an uncommon structure. Although in all three studies, electron density maps revealed the location of the AdoMet or AdoHcy cofactor, the SET domain bears no similarity at all to the canonical/AdoMet-dependent methyltransferase fold. Strictly conserved in the C-terminal motif of the SET domain tyrosine could be involved in abstracting a proton from the protonated amino group of the substrate lysine, promoting its nucleophilic attack on the sulphonium methyl group of the AdoMet cofactor. In contrast to the AdoMet-dependent protein methyltranferases of the classical type, which tend to bind their polypeptide substrates on top of the cofactor, it is noted from the Rubisco LSMT structure that the AdoMet seems to bind in a separate cleft, suggesting how a polypeptide substrate could be subjected to multiple rounds of methylation without having to be released from the enzyme. In contrast, SET7/9 is able to add only a single methyl group to its substrate. It has been demonstrated that association of SET domain and myotubularin-related proteins modulates growth control []. The SET domain-containing Drosophila melanogaster (Fruit fly) protein, enhancer of zeste, has a function in segment determination and the mammalian homologue may be involved in the regulation of gene transcription and chromatin structure. Histone lysine methylation is part of the histone code that regulated chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception [], the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain [, ]. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities []. The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site []. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity [], []. ; GO: 0005515 protein binding; PDB: 3TG5_A 3S7F_A 3RIB_B 3TG4_A 3S7J_A 3S7D_A 3S7B_A 3H6L_A 3SMT_A 3K5K_A ....
Probab=99.43 E-value=3.7e-13 Score=125.14 Aligned_cols=106 Identities=28% Similarity=0.303 Sum_probs=71.0
Q ss_pred ceeEEcCCccCCCcEEEEeeeeeecHHHHHhh--------cCCc-------e----------------------------
Q psy14657 464 GWGVRTPDKIKAGTFVCEYVGEILTHENALQR--------TNQT-------Y---------------------------- 500 (605)
Q Consensus 464 G~Gv~A~~~I~~G~~I~ey~Gevi~~~e~~~r--------~~~~-------y---------------------------- 500 (605)
|+||+|+++|++|++|++..+.+++...+... .... .
T Consensus 1 GrGl~At~dI~~Ge~I~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (162)
T PF00856_consen 1 GRGLFATRDIKAGEVILIPRPAILTPDEVSPQPELLRLQLSKALEEQSRSDFSIQKKQKAEKSERSPQLESLHSISLRSE 80 (162)
T ss_dssp SEEEEESS-B-TTEEEEEESEEEEEHHHHHCHHHHSHHTTCSSSCSHHTTHHHHHHHHHHHHHHHHHHHHHHHHHCHTTT
T ss_pred CEEEEECccCCCCCEEEEECcceEEehhhhhcccchhhhhhhhhcccccccccccccccccccccccccccccccccccc
Confidence 89999999999999999999999987766431 0000 0
Q ss_pred -e----ee--c------cCCCCCCCceEEEeeeeccCcccccccCCCCCeeEEEEecCCCCeeEEEEEEcCCCCCCCeEE
Q psy14657 501 -C----FN--L------DFNQDSNSVAFVLDAARYGNVSHFINHSCDPNLEVSRINNLNPDLHHVALFAKRDINKNEELS 567 (605)
Q Consensus 501 -~----f~--l------~~~~~~~~~~~~IDa~~~GN~~RfiNHSC~PN~~~~~~~~~d~~~~~i~~fA~rdI~~GEELT 567 (605)
. +. . .............++.....++.|+||||.|||.+.+... .....+.|.|.|||++|||||
T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~p~~d~~NHsc~pn~~~~~~~~--~~~~~~~~~a~r~I~~GeEi~ 158 (162)
T PF00856_consen 81 LQFSQAFQWSWFISWTRSDFSSRSFSEDDRDGIALYPFADMLNHSCDPNCEVSFDFD--GDGGCLVVRATRDIKKGEEIF 158 (162)
T ss_dssp CCTCCHHHHHHHHHHHHHEEEEEEETTEEEEEEEEETGGGGSEEESSTSEEEEEEEE--TTTTEEEEEESS-B-TTSBEE
T ss_pred ccccccccchhhccccceeeeccccccccccccccCcHhHheccccccccceeeEee--cccceEEEEECCccCCCCEEE
Confidence 0 00 0 0000001113355677788899999999999999996521 234599999999999999999
Q ss_pred EecC
Q psy14657 568 FCYL 571 (605)
Q Consensus 568 ~dY~ 571 (605)
++||
T Consensus 159 isYG 162 (162)
T PF00856_consen 159 ISYG 162 (162)
T ss_dssp EEST
T ss_pred EEEC
Confidence 9997
No 17
>PF00385 Chromo: Chromo (CHRromatin Organisation MOdifier) domain; InterPro: IPR023780 The CHROMO (CHRromatin Organization MOdifier) domain [, , , ] is a conserved region of around 60 amino acids, originally identified in Drosophila modifiers of variegation. These are proteins that alter the structure of chromatin to the condensed morphology of heterochromatin, a cytologically visible condition where gene expression is repressed. In one of these proteins, Polycomb, the chromo domain has been shown to be important for chromatin targeting. Proteins that contain a chromo domain appear to fall into 3 classes. The first class includes proteins having an N-terminal chromo domain followed by a region termed the chromo shadow domain, with weak but significant sequence similarity to the N-terminal chromo domain,[], eg. Drosophila and human heterochromatin protein Su(var)205 (HP1). The second class includes proteins with a single chromo domain, eg. Drosophila protein Polycomb (Pc); mammalian modifier 3; human Mi-2 autoantigen and several yeast and Caenorhabditis elegans hypothetical proteins. In the third class paired tandem chromo domains are found, eg. in mammalian DNA-binding/helicase proteins CHD-1 to CHD-4 and yeast protein CHD1. Functional dissections of chromo domain proteins suggests a mechanistic role for chromo domains in targeting chromo domain proteins to specific regions of the nucleus. The mechanism of targeting may involve protein-protein and/or protein/nucleic acid interactions. Hence, several line of evidence show that the HP1 chromo domain is a methyl-specific histone binding module, whereas the chromo domain of two protein components of the drosophila dosage compensation complex, MSL3 and MOF, contain chromo domains that bind to RNA in vitro []. The high resolution structures of HP1-family protein chromo and chromo shadow domain reveal a conserved chromo domain fold motif consisting of three beta strands packed against an alpha helix. The chromo domain fold belongs to the OB (oligonucleotide/oligosaccharide binding)-fold class found in a variety of prokaryotic and eukaryotic nucleic acid binding protein [].; PDB: 2H1E_B 3MWY_W 2DY8_A 1KNE_A 1KNA_A 1Q3L_A 2EE1_A 1AP0_A 1GUW_A 1X3P_A ....
Probab=99.32 E-value=1.6e-12 Score=100.81 Aligned_cols=51 Identities=51% Similarity=0.864 Sum_probs=45.7
Q ss_pred eEEeeeccceeecCCcE---EEEEeccCCCCCCCcccccccccch-HHHHHHHHHh
Q psy14657 224 YEVESVLDSLELTSDMT---VYLVKWKNYDPEYNTWEPIENLGNC-AKKLAEFLKA 275 (605)
Q Consensus 224 y~vE~Ild~~~~~~~~~---~YlVkW~gy~~~~~TWEp~~nl~~~-~~li~~f~~~ 275 (605)
|+||+||++ +..+++. +|||||+||+.+++||||+++|.++ +++|++|.++
T Consensus 1 ~~Ve~Il~~-r~~~~~~~~~~ylVkW~g~~~~~~tWe~~~~l~~~~~~li~~f~~r 55 (55)
T PF00385_consen 1 YEVERILDH-RVVKGGNKVYEYLVKWKGYPYSENTWEPEENLKNCFPELIEEFEKR 55 (55)
T ss_dssp EEEEEEEEE-EEETTEESEEEEEEEETTSSGGGEEEEEGGGCSSHCHHHHHHHHHH
T ss_pred CEEEEEEEE-EEeCCCcccEEEEEEECCCCCCCCeEeeHHHHhHhhHHHHHHHhCC
Confidence 789999999 5555555 9999999999999999999999887 9999999875
No 18
>cd00024 CHROMO Chromatin organization modifier (chromo) domain is a conserved region of around 50 amino acids found in a variety of chromosomal proteins, which appear to play a role in the functional organization of the eukaryotic nucleus. Experimental evidence implicates the chromo domain in the binding activity of these proteins to methylated histone tails and maybe RNA. May occur as single instance, in a tandem arrangement or followd by a related "chromo shadow" domain.
Probab=99.15 E-value=3.6e-11 Score=92.96 Aligned_cols=52 Identities=48% Similarity=0.832 Sum_probs=46.8
Q ss_pred ceeEEeeeccceeecC--CcEEEEEeccCCCCCCCcccccccccchHHHHHHHHH
Q psy14657 222 TEYEVESVLDSLELTS--DMTVYLVKWKNYDPEYNTWEPIENLGNCAKKLAEFLK 274 (605)
Q Consensus 222 ~~y~vE~Ild~~~~~~--~~~~YlVkW~gy~~~~~TWEp~~nl~~~~~li~~f~~ 274 (605)
++|+||+||++ +... +..+|+|||+||+..++||||+++|..++.+|++|.+
T Consensus 1 ~e~~ve~Il~~-r~~~~~~~~~y~VkW~g~~~~~~tWe~~~~l~~~~~~i~~~~~ 54 (55)
T cd00024 1 EEYEVEKILDH-RKKKDGGEYEYLVKWKGYSYSEDTWEPEENLEDCKELIDEFKK 54 (55)
T ss_pred CCceEeeeeee-eecCCCCcEEEEEEECCCCCccCccccHHHhCchHHHHHHHHh
Confidence 46899999999 5555 8899999999999999999999999888899999975
No 19
>KOG1911|consensus
Probab=98.99 E-value=2.2e-10 Score=117.90 Aligned_cols=67 Identities=33% Similarity=0.552 Sum_probs=57.6
Q ss_pred ccccccCCCceeEEeeeccceeecCCcEEEEEeccCCCCCCCcccccccccchHHHHHHHHHhCCCCc
Q psy14657 213 LTKFSSDTMTEYEVESVLDSLELTSDMTVYLVKWKNYDPEYNTWEPIENLGNCAKKLAEFLKAGPDQE 280 (605)
Q Consensus 213 i~~~~~~~~~~y~vE~Ild~~~~~~~~~~YlVkW~gy~~~~~TWEp~~nl~~~~~li~~f~~~~~~~~ 280 (605)
.......+.++|+||.|+++ +..+|+.+|||+|+||++.+|||||+.++.+|+++|++|.+++....
T Consensus 38 ~~~~~~~~~~~~vvEki~~~-r~~~g~~eYlvkW~Gy~~~~ntWEPee~~~~C~~li~~~~~~~~~~k 104 (270)
T KOG1911|consen 38 VEASEEEEEEEYVVEKILKR-RKKNGKIEYLVKWKGYPDPDNTWEPEEHNLDCPELIDEFEKSQKKLK 104 (270)
T ss_pred cccccccccchhhhhhhhhc-cccCCCceeeeecCCCCCccccCCchhhccccHHHHHHHHHHhcccC
Confidence 34445666778999999999 88888899999999999999999999966699999999999975433
No 20
>KOG2748|consensus
Probab=98.98 E-value=1.2e-10 Score=118.59 Aligned_cols=57 Identities=33% Similarity=0.610 Sum_probs=52.5
Q ss_pred cCCCceeEEeeeccceeecCCcEEEEEeccCCCCCCCcccccccccchHHHHHHHHHhC
Q psy14657 218 SDTMTEYEVESVLDSLELTSDMTVYLVKWKNYDPEYNTWEPIENLGNCAKKLAEFLKAG 276 (605)
Q Consensus 218 ~~~~~~y~vE~Ild~~~~~~~~~~YlVkW~gy~~~~~TWEp~~nl~~~~~li~~f~~~~ 276 (605)
+.++..|.+|.||.+ |.++|..+|||||+||+..+|||||++|| -.+.||..|.++.
T Consensus 5 avGe~VfAaEsIlkk-RirKGrvEYlVKWkGWs~kyNTWEPEENI-LDpRLi~AFe~rE 61 (369)
T KOG2748|consen 5 AVGERVFAAESILKK-RIRKGRVEYLVKWKGWSQKYNTWEPEENI-LDPRLIAAFEQRE 61 (369)
T ss_pred cchhhHHHHHHHHHH-HhhccceEEEEEecccccccCccCccccc-cCHHHHHHHHhhh
Confidence 445677999999999 99999999999999999999999999999 6799999999885
No 21
>smart00298 CHROMO Chromatin organization modifier domain.
Probab=98.84 E-value=1.9e-09 Score=83.18 Aligned_cols=54 Identities=50% Similarity=0.777 Sum_probs=47.3
Q ss_pred eeEEeeeccceeecCCcEEEEEeccCCCCCCCcccccccccchHHHHHHHHHhC
Q psy14657 223 EYEVESVLDSLELTSDMTVYLVKWKNYDPEYNTWEPIENLGNCAKKLAEFLKAG 276 (605)
Q Consensus 223 ~y~vE~Ild~~~~~~~~~~YlVkW~gy~~~~~TWEp~~nl~~~~~li~~f~~~~ 276 (605)
.|.||+|++++...++..+|||||+|++..++||+|.++|..++++|.+|..+.
T Consensus 1 ~~~v~~Il~~r~~~~~~~~ylVkW~g~~~~~~tW~~~~~l~~~~~~v~~~~~~~ 54 (55)
T smart00298 1 EYEVEKILDHRWKKKGELEYLVKWKGYSYSEDTWEPEENLLNCSKKLDNYKKKE 54 (55)
T ss_pred CcchheeeeeeecCCCcEEEEEEECCCCCccCceeeHHHHHHHHHHHHHHHHhh
Confidence 367999999932778889999999999999999999999966999999998763
No 22
>KOG2589|consensus
Probab=98.73 E-value=7.6e-09 Score=106.10 Aligned_cols=103 Identities=20% Similarity=0.250 Sum_probs=73.8
Q ss_pred CCceeEEcCCccCCCcEEEEeeeeeecHHHHHhh---cCCceeeeccCCCCCCCceEEEeeeeccCcccccccCCCCCee
Q psy14657 462 NRGWGVRTPDKIKAGTFVCEYVGEILTHENALQR---TNQTYCFNLDFNQDSNSVAFVLDAARYGNVSHFINHSCDPNLE 538 (605)
Q Consensus 462 ~kG~Gv~A~~~I~~G~~I~ey~Gevi~~~e~~~r---~~~~y~f~l~~~~~~~~~~~~IDa~~~GN~~RfiNHSC~PN~~ 538 (605)
..|--|.|++.+.+|+=|--.+|-|+...+++++ .+..-.|.+-+..-..-..+++ ..|+||||.|.|||.
T Consensus 136 ~~gAkivst~~w~~ndkIe~LvGcIaeLse~eE~~ll~~g~nDFSvmyStRk~caqLwL------GPaafINHDCrpnCk 209 (453)
T KOG2589|consen 136 QNGAKIVSTKSWSRNDKIELLVGCIAELSEAEERSLLRGGGNDFSVMYSTRKRCAQLWL------GPAAFINHDCRPNCK 209 (453)
T ss_pred CCCceEEeeccccCCccHHHhhhhhhhcChhhhHHHHhccCCceeeeeecccchhhhee------ccHHhhcCCCCCCce
Confidence 4677788899999999999999999877666665 1122222222221111112223 257899999999999
Q ss_pred EEEEecCCCCeeEEEEEEcCCCCCCCeEEEecCCCcc
Q psy14657 539 VSRINNLNPDLHHVALFAKRDINKNEELSFCYLDLTK 575 (605)
Q Consensus 539 ~~~~~~~d~~~~~i~~fA~rdI~~GEELT~dY~~~~~ 575 (605)
+.-. +..++.+.++|||+||||||--|+..+.
T Consensus 210 Fvs~-----g~~tacvkvlRDIePGeEITcFYgs~fF 241 (453)
T KOG2589|consen 210 FVST-----GRDTACVKVLRDIEPGEEITCFYGSGFF 241 (453)
T ss_pred eecC-----CCceeeeehhhcCCCCceeEEeeccccc
Confidence 9622 4458999999999999999999999875
No 23
>KOG1081|consensus
Probab=98.67 E-value=7.3e-09 Score=113.63 Aligned_cols=140 Identities=31% Similarity=0.528 Sum_probs=102.5
Q ss_pred Eec-CCCCCCCCCCcCcccccCCCcCEEEEEeCCCCceeEEcCCccCCCcEEEEeeeeeecHHHHHhh------cCCcee
Q psy14657 429 YEC-NKKCACDETCLNRVVQKGITLPLTIFKTKNNRGWGVRTPDKIKAGTFVCEYVGEILTHENALQR------TNQTYC 501 (605)
Q Consensus 429 ~EC-~~~C~C~~~C~Nrv~q~g~~~~l~i~~t~~~kG~Gv~A~~~I~~G~~I~ey~Gevi~~~e~~~r------~~~~y~ 501 (605)
++| +..|.+...|.|+-.-...... +.. +|..+|.+| +|+++...+...+ ....-.
T Consensus 290 ~~~~p~~~~~~~~~~~~~~sk~~~~e-------~~~----~~~~~~~k~------vg~~i~~~e~~~~~~~~~~~~~~~~ 352 (463)
T KOG1081|consen 290 YEVHPKVCSAEERCHNQQFSKESYPE-------PQK----TAKADIRKG------VGEVIDDKECKARLQRVKESDLVDF 352 (463)
T ss_pred hhhcccccccccccccchhhhhcccc-------cch----hhHHhhhcc------cCcccchhhheeehhhhhccchhhh
Confidence 444 7889999889888764443333 111 788888888 8999988777655 111111
Q ss_pred eeccCCCCCCCceEEEeeeeccCcccccccCCCCCeeEE-EEecCCCCeeEEEEEEcCCCCCCCeEEEecCCCcccccch
Q psy14657 502 FNLDFNQDSNSVAFVLDAARYGNVSHFINHSCDPNLEVS-RINNLNPDLHHVALFAKRDINKNEELSFCYLDLTKAKFTS 580 (605)
Q Consensus 502 f~l~~~~~~~~~~~~IDa~~~GN~~RfiNHSC~PN~~~~-~~~~~d~~~~~i~~fA~rdI~~GEELT~dY~~~~~~~~~~ 580 (605)
+..... .. . .||+..+||.+||+||||.||+.-. |... +..++.++|.+.|++|+|||++|.....
T Consensus 353 ~~~~~e---~~-~-~id~~~~~n~sr~~nh~~~~~v~~~k~~~~---~~t~~~~~a~~~i~~g~e~t~~~n~~~~----- 419 (463)
T KOG1081|consen 353 YMVFIQ---KD-R-IIDAGPKGNYSRFLNHSCQPNVETEKWQVI---GDTRVGLFAPRQIEAGEELTFNYNGNCE----- 419 (463)
T ss_pred hhhhhh---cc-c-ccccccccchhhhhcccCCCceeechhhee---cccccccccccccccchhhhheeecccc-----
Confidence 111111 11 2 8999999999999999999999988 6543 5679999999999999999999987532
Q ss_pred hhhcCCCCeeeecCCCCCccccc
Q psy14657 581 SKRKKLVRNECRCGSSNCLGYYY 603 (605)
Q Consensus 581 ~~~~~~~~~~C~Cgs~~Crg~l~ 603 (605)
...+.|.||+.+|.+++-
T Consensus 420 -----~~~~~~~~~~e~~~~~~~ 437 (463)
T KOG1081|consen 420 -----GNEKRCCCGSENCTETKG 437 (463)
T ss_pred -----CCcceEeecccccccCCc
Confidence 134689999999998763
No 24
>KOG2461|consensus
Probab=98.22 E-value=1.3e-06 Score=93.87 Aligned_cols=116 Identities=23% Similarity=0.254 Sum_probs=87.7
Q ss_pred CCcCEEEEEeC-CCCceeEEcCCccCCCcEEEEeeeeeecHHHHHhhcCCceeeeccCCCCCCCceEEEeee--eccCcc
Q psy14657 450 ITLPLTIFKTK-NNRGWGVRTPDKIKAGTFVCEYVGEILTHENALQRTNQTYCFNLDFNQDSNSVAFVLDAA--RYGNVS 526 (605)
Q Consensus 450 ~~~~l~i~~t~-~~kG~Gv~A~~~I~~G~~I~ey~Gevi~~~e~~~r~~~~y~f~l~~~~~~~~~~~~IDa~--~~GN~~ 526 (605)
....|.|..++ +..|.||.+...|++|+--+-|.|+++ ..+..+..+..|++.+... +...++||++ ...|+.
T Consensus 26 LP~~l~i~~Ssv~~~~lgV~s~~~i~~G~~FGP~~G~~~-~~~~~~~~n~~y~W~I~~~---d~~~~~iDg~d~~~sNWm 101 (396)
T KOG2461|consen 26 LPPELRIKPSSVPVTGLGVWSNASILPGTSFGPFEGEII-ASIDSKSANNRYMWEIFSS---DNGYEYIDGTDEEHSNWM 101 (396)
T ss_pred CCCceEeeccccCCccccccccccccCcccccCccCccc-cccccccccCcceEEEEeC---CCceEEeccCChhhccee
Confidence 44556666654 667899999999999999999999982 2222233456677766542 2457899987 478999
Q ss_pred cccccCCC---CCeeEEEEecCCCCeeEEEEEEcCCCCCCCeEEEecCCCcc
Q psy14657 527 HFINHSCD---PNLEVSRINNLNPDLHHVALFAKRDINKNEELSFCYLDLTK 575 (605)
Q Consensus 527 RfiNHSC~---PN~~~~~~~~~d~~~~~i~~fA~rdI~~GEELT~dY~~~~~ 575 (605)
||+|-.++ -|+...-+ ...|.++|+|+|.+||||.++|+.++.
T Consensus 102 RYV~~Ar~~eeQNL~A~Q~------~~~Ifyrt~r~I~p~eELlVWY~~e~~ 147 (396)
T KOG2461|consen 102 RYVNSARSEEEQNLLAFQI------GENIFYRTIRDIRPNEELLVWYGSEYA 147 (396)
T ss_pred eeecccCChhhhhHHHHhc------cCceEEEecccCCCCCeEEEEeccchH
Confidence 99999884 47766411 237889999999999999999999874
No 25
>KOG4442|consensus
Probab=96.60 E-value=0.0017 Score=72.68 Aligned_cols=57 Identities=21% Similarity=0.331 Sum_probs=49.6
Q ss_pred ccCCC-CcC-CCcCCccceeeeCceeeeeEEEeeccCCceeEEEeecccccccccCCCceeEEeeeccc
Q psy14657 166 YECNK-KCA-CDETCLNRVVQKGNLVQDCHTTITSRDGSVVLLRVADINLTKFSSDTMTEYEVESVLDS 232 (605)
Q Consensus 166 yECn~-~C~-C~~~C~NRVvQ~G~~~~L~vf~~~~~kG~~~~~~~~~I~i~~~~~~~~~~y~vE~Ild~ 232 (605)
.||++ .|. |+..|.|+--|+----+++||+ +.++||| ++....++.+.|+.|||-..
T Consensus 95 iECs~~~C~~cg~~C~NQRFQkkqyA~vevF~-Te~KG~G---------LRA~~dI~~g~FI~EY~GEV 153 (729)
T KOG4442|consen 95 IECSDRECPRCGVYCKNQRFQKKQYAKVEVFL-TEKKGCG---------LRAEEDIPKGQFILEYIGEV 153 (729)
T ss_pred cccCCccCCCccccccchhhhhhccCceeEEE-ecCcccc---------eeeccccCCCcEEeeecccc
Confidence 49999 999 9999999999999999999999 5589999 66677777888888887654
No 26
>smart00508 PostSET Cysteine-rich motif following a subset of SET domains.
Probab=96.41 E-value=0.0016 Score=42.13 Aligned_cols=16 Identities=38% Similarity=1.095 Sum_probs=14.2
Q ss_pred CeeeecCCCCCccccc
Q psy14657 588 RNECRCGSSNCLGYYY 603 (605)
Q Consensus 588 ~~~C~Cgs~~Crg~l~ 603 (605)
.+.|+|||++|||+|.
T Consensus 2 ~~~C~CGs~~CRG~l~ 17 (26)
T smart00508 2 KQPCLCGAPNCRGFLG 17 (26)
T ss_pred CeeeeCCCccccceec
Confidence 4689999999999994
No 27
>KOG0384|consensus
Probab=95.39 E-value=0.0084 Score=71.17 Aligned_cols=51 Identities=27% Similarity=0.561 Sum_probs=43.2
Q ss_pred eEEeeeccceeecCCcEEEEEeccCCCCCCCcccccccccc-hHHHHHHHHHhC
Q psy14657 224 YEVESVLDSLELTSDMTVYLVKWKNYDPEYNTWEPIENLGN-CAKKLAEFLKAG 276 (605)
Q Consensus 224 y~vE~Ild~~~~~~~~~~YlVkW~gy~~~~~TWEp~~nl~~-~~~li~~f~~~~ 276 (605)
-.||+|+++ ..... .+|||||+|-|-++.|||+++.+.. .+..+++|..+.
T Consensus 286 ~~VdRIia~-~~~~d-~eYLvKW~~LpY~e~TWE~~~~I~~~~~~~~~~~~~Re 337 (1373)
T KOG0384|consen 286 VIVDRIIAE-QTSKD-PEYLVKWRGLPYEECTWEDAEDIAKKAQEEIEEFQSRE 337 (1373)
T ss_pred hhhhhhhhc-ccCCC-ceeEEEecCCCcccccccchhhhhhhHHHHHHHHhhhh
Confidence 389999999 54444 8999999999999999999999965 567788888875
No 28
>cd00034 ChSh Chromo Shadow Domain, found in association with N-terminal chromo (CHRromatin Organization MOdifier) domain; Chromo domains mediate the interaction of the heterochromatin with other heterochromatin proteins, thereby affecting chromatin structure (e.g. Drosophila and human heterochromatin protein (HP1) and mammalian modifier 1 and modifier 2)
Probab=94.00 E-value=0.04 Score=42.51 Aligned_cols=48 Identities=19% Similarity=0.376 Sum_probs=36.8
Q ss_pred EeeeccceeecC-CcEEEEEeccCCCCCCCcccccccc-cchHHHHHHHHHhCC
Q psy14657 226 VESVLDSLELTS-DMTVYLVKWKNYDPEYNTWEPIENL-GNCAKKLAEFLKAGP 277 (605)
Q Consensus 226 vE~Ild~~~~~~-~~~~YlVkW~gy~~~~~TWEp~~nl-~~~~~li~~f~~~~~ 277 (605)
++.|+.. ...+ |...||++|+| ++.++.|...+ ..||.+|-+|++++.
T Consensus 2 ~~~I~ga-t~~~~g~l~fl~kwk~---~~~~lVp~~~~~~k~P~~vI~FYE~~l 51 (54)
T cd00034 2 VKPISGA-SKSDDGELTFLAKWKD---GQASLVPNKELNVKCPLLVISFYEEHL 51 (54)
T ss_pred ceEEEEE-EEcCCCeEEEEEEEeC---CeEEEEEHHHHHhhCcHHHHHHHHHhc
Confidence 4666665 4444 89999999999 44568887777 369999999998864
No 29
>PF01393 Chromo_shadow: Chromo shadow domain Web page maintained by Rein Aasland; InterPro: IPR008251 Chromo shadow domain is distantly related to chromo domain. It is always found in association with a chromo domain. The CHROMO (CHRromatin Organization MOdifier) domain [, , , ] is a conserved region of around 60 amino acids, originally identified in Drosophila modifiers of variegation. These are proteins that alter the structure of chromatin to the condensed morphology of heterochromatin, a cytologically visible condition where gene expression is repressed. In one of these proteins, Polycomb, the chromo domain has been shown to be important for chromatin targeting. Proteins that contain a chromo domain appear to fall into 3 classes. The first class includes proteins having an N-terminal chromo domain followed by a region termed the chromo shadow domain [], eg. Drosophila and human heterochromatin protein Su(var)205 (HP1); and mammalian modifier 1 and modifier 2. The second class includes proteins with a single chromo domain, eg. Drosophila protein Polycomb (Pc); mammalian modifier 3; human Mi-2 autoantigenand and several yeast and Caenorhabditis elegans hypothetical proteins. In the third class paired tandem chromo domains are found, eg. in mammalian DNA-binding/helicase proteins CHD-1 to CHD-4 and yeast protein CHD1.; GO: 0005634 nucleus; PDB: 3Q6S_C 2FMM_B 3P7J_B 1E0B_B 3I3C_A 1DZ1_B 1S4Z_A 3KUP_D.
Probab=93.49 E-value=0.067 Score=41.89 Aligned_cols=50 Identities=24% Similarity=0.466 Sum_probs=40.6
Q ss_pred EEeeeccceeecCCcEEEEEeccCCCCCCCcccccccc-cchHHHHHHHHHhCC
Q psy14657 225 EVESVLDSLELTSDMTVYLVKWKNYDPEYNTWEPIENL-GNCAKKLAEFLKAGP 277 (605)
Q Consensus 225 ~vE~Ild~~~~~~~~~~YlVkW~gy~~~~~TWEp~~nl-~~~~~li~~f~~~~~ 277 (605)
..|.|++. ....|...|||||+|-+ +.++.|...+ ..||.+|-+|++++.
T Consensus 4 ~~E~Ivg~-~d~~G~l~~likwk~~~--~~~~v~~~~~~~k~Pq~vI~FYE~~l 54 (58)
T PF01393_consen 4 EWEKIVGA-TDTNGELMFLIKWKNSG--EKDLVPSKEANEKCPQKVIKFYESHL 54 (58)
T ss_dssp TEEEEEEE-EECTSSEEEEEEETTSS--SEEEEEHHHHHHHSHHHHHHHHHHTC
T ss_pred ChHHHhee-ecCCCcEEEEEEECCCC--ceEEeeHHHHHHHCcHHHHHHHHHHe
Confidence 46888888 67779999999999974 4678887765 369999999999874
No 30
>KOG2084|consensus
Probab=93.04 E-value=0.12 Score=57.04 Aligned_cols=62 Identities=27% Similarity=0.440 Sum_probs=42.6
Q ss_pred ccccCCCCCeeEEEEecCCCCeeEEEEEEcCCCCCCC-eEEEecCCCccccc-chhhhcCCCCeeeecCC
Q psy14657 528 FINHSCDPNLEVSRINNLNPDLHHVALFAKRDINKNE-ELSFCYLDLTKAKF-TSSKRKKLVRNECRCGS 595 (605)
Q Consensus 528 fiNHSC~PN~~~~~~~~~d~~~~~i~~fA~rdI~~GE-ELT~dY~~~~~~~~-~~~~~~~~~~~~C~Cgs 595 (605)
++||||.||+.+.+.. ....+.+...+.+++ ||++.|....+... +.......+.|.|.|+.
T Consensus 208 ~~~hsC~pn~~~~~~~------~~~~~~~~~~~~~~~~~l~~~y~~~~~~~~~r~~~l~~~~~f~c~c~r 271 (482)
T KOG2084|consen 208 LFNHSCFPNISVIFDG------RGLALLVPAGIDAGEEELTISYTDPLLSTASRQKQLRQSKLFSCQCPR 271 (482)
T ss_pred hcccCCCCCeEEEECC------ceeEEEeecccCCCCCEEEEeecccccCHHHHHHHHhhccceeeecCC
Confidence 7999999999965432 255667778888876 99999998765332 22233444447777754
No 31
>smart00300 ChSh Chromo Shadow Domain.
Probab=93.03 E-value=0.048 Score=43.15 Aligned_cols=49 Identities=18% Similarity=0.415 Sum_probs=39.8
Q ss_pred EEeeeccceeecCCcEEEEEeccCCCCCCCcccccccc-cchHHHHHHHHHhCC
Q psy14657 225 EVESVLDSLELTSDMTVYLVKWKNYDPEYNTWEPIENL-GNCAKKLAEFLKAGP 277 (605)
Q Consensus 225 ~vE~Ild~~~~~~~~~~YlVkW~gy~~~~~TWEp~~nl-~~~~~li~~f~~~~~ 277 (605)
..|+|+.. ...+|...||++|+| ++.++.|...+ ..||.+|-+|++++.
T Consensus 8 ~~e~Ivg~-~d~~G~l~flikwk~---~~~~lVp~~~~~~k~P~~vI~FYE~~l 57 (61)
T smart00300 8 SWEDIVGI-TKDDGELTFLIKWKD---DAASLVPNKEANVKCPQKVIRFYESHL 57 (61)
T ss_pred CHHHHhce-ecCCCeEEEEEEEeC---CcEEEEEHHHHHHHChHHHHHHHHHhC
Confidence 45777777 667999999999999 45678888777 369999999998874
No 32
>smart00570 AWS associated with SET domains. subdomain of PRESET
Probab=92.79 E-value=0.062 Score=40.82 Aligned_cols=24 Identities=33% Similarity=0.749 Sum_probs=21.3
Q ss_pred ceEecCCCCCCCCCCcCcccccCC
Q psy14657 427 PIYECNKKCACDETCLNRVVQKGI 450 (605)
Q Consensus 427 ~i~EC~~~C~C~~~C~Nrv~q~g~ 450 (605)
..+||++.|.|+..|.||.+|+..
T Consensus 27 l~~EC~~~C~~G~~C~NqrFqk~~ 50 (51)
T smart00570 27 LLIECSSDCPCGSYCSNQRFQKRQ 50 (51)
T ss_pred HhhhcCCCCCCCcCccCcccccCc
Confidence 458999999999999999999864
No 33
>PF11717 Tudor-knot: RNA binding activity-knot of a chromodomain ; PDB: 2EKO_A 2RO0_A 2RNZ_A 1WGS_A 3E9G_A 3E9F_A 2K3X_A 2K3Y_A 2EFI_A 2F5K_F ....
Probab=91.92 E-value=0.29 Score=37.79 Aligned_cols=41 Identities=15% Similarity=0.396 Sum_probs=33.9
Q ss_pred CceeEEeeeccceeecCCcEEEEEeccCCCCCCCcccccccc
Q psy14657 221 MTEYEVESVLDSLELTSDMTVYLVKWKNYDPEYNTWEPIENL 262 (605)
Q Consensus 221 ~~~y~vE~Ild~~~~~~~~~~YlVkW~gy~~~~~TWEp~~nl 262 (605)
.+.+---+|++. +..++..+|+|.+.||+...|-|.|.++|
T Consensus 13 ~~~~y~A~I~~~-r~~~~~~~YyVHY~g~nkR~DeWV~~~~i 53 (55)
T PF11717_consen 13 DGQWYEAKILDI-REKNGEPEYYVHYQGWNKRLDEWVPESRI 53 (55)
T ss_dssp TTEEEEEEEEEE-EECTTCEEEEEEETTSTGCC-EEEETTTE
T ss_pred CCcEEEEEEEEE-EecCCCEEEEEEcCCCCCCceeeecHHHc
Confidence 345555677788 77888899999999999999999999987
No 34
>PF14061 Mtf2_C: Polycomb-like MTF2 factor 2
Probab=89.75 E-value=0.33 Score=36.18 Aligned_cols=22 Identities=27% Similarity=0.445 Sum_probs=18.1
Q ss_pred eccceeecCCcEEEEEeccCCC
Q psy14657 229 VLDSLELTSDMTVYLVKWKNYD 250 (605)
Q Consensus 229 Ild~~~~~~~~~~YlVkW~gy~ 250 (605)
|+..|...+|+.+|||.|.|.+
T Consensus 28 VlArRV~~dG~vQYLvEWeg~~ 49 (50)
T PF14061_consen 28 VLARRVTPDGKVQYLVEWEGAT 49 (50)
T ss_pred EEEEEEcCCCcEEEEEEecCcC
Confidence 4666677789999999999974
No 35
>KOG0384|consensus
Probab=88.23 E-value=0.2 Score=60.02 Aligned_cols=39 Identities=33% Similarity=0.507 Sum_probs=31.0
Q ss_pred CcEEEEEeccCCCCCCCccccccccc--c--hHHHHHHHHHhC
Q psy14657 238 DMTVYLVKWKNYDPEYNTWEPIENLG--N--CAKKLAEFLKAG 276 (605)
Q Consensus 238 ~~~~YlVkW~gy~~~~~TWEp~~nl~--~--~~~li~~f~~~~ 276 (605)
+..+|||||+||+.-++|||++++|. + -...|+.|.++.
T Consensus 224 e~~qFlIKWkg~SyLHctWet~~~L~~~~~rG~kKv~nf~kK~ 266 (1373)
T KOG0384|consen 224 EEEQFLIKWKGWSYLHCTWETESELLEMNVRGLKKVDNFKKKV 266 (1373)
T ss_pred hhhhhheeeccccceeccccchHHHHhhhHHHHHHHHHHHHHH
Confidence 34689999999999999999999975 2 234577777774
No 36
>smart00570 AWS associated with SET domains. subdomain of PRESET
Probab=87.76 E-value=0.27 Score=37.40 Aligned_cols=23 Identities=35% Similarity=0.808 Sum_probs=20.5
Q ss_pred ccccCCCCcCCCcCCccceeeeC
Q psy14657 164 PIYECNKKCACDETCLNRVVQKG 186 (605)
Q Consensus 164 ~IyECn~~C~C~~~C~NRVvQ~G 186 (605)
..+||.+.|.|+..|.||.-|+.
T Consensus 27 l~~EC~~~C~~G~~C~NqrFqk~ 49 (51)
T smart00570 27 LLIECSSDCPCGSYCSNQRFQKR 49 (51)
T ss_pred HhhhcCCCCCCCcCccCcccccC
Confidence 36799999999999999999974
No 37
>KOG1337|consensus
Probab=73.18 E-value=2.7 Score=46.94 Aligned_cols=42 Identities=29% Similarity=0.298 Sum_probs=32.0
Q ss_pred ccccCCCCCeeEEEEecCCCCeeEEEEEEcCCCCCCCeEEEecCCCc
Q psy14657 528 FINHSCDPNLEVSRINNLNPDLHHVALFAKRDINKNEELSFCYLDLT 574 (605)
Q Consensus 528 fiNHSC~PN~~~~~~~~~d~~~~~i~~fA~rdI~~GEELT~dY~~~~ 574 (605)
+.||++.+... ..... .+.+.+++.++|.+||||++.||...
T Consensus 239 ~~NH~~~~~~~--~~~~~---d~~~~l~~~~~v~~geevfi~YG~~~ 280 (472)
T KOG1337|consen 239 LLNHSPEVIKA--GYNQE---DEAVELVAERDVSAGEEVFINYGPKS 280 (472)
T ss_pred hhccCchhccc--cccCC---CCcEEEEEeeeecCCCeEEEecCCCc
Confidence 68999998221 22222 23888999999999999999999854
No 38
>KOG1079|consensus
Probab=63.42 E-value=4.3 Score=46.16 Aligned_cols=39 Identities=21% Similarity=0.347 Sum_probs=29.4
Q ss_pred ccccCC-CCcCC----------CcCCccceeeeCceeeeeEEEeeccCCce
Q psy14657 164 PIYECN-KKCAC----------DETCLNRVVQKGNLVQDCHTTITSRDGSV 203 (605)
Q Consensus 164 ~IyECn-~~C~C----------~~~C~NRVvQ~G~~~~L~vf~~~~~kG~~ 203 (605)
..-||. ..|.+ ..+|.|--+|+|.+-++-+--.. =.|||
T Consensus 558 A~rECdPd~Cl~cg~~~~~d~~~~~C~N~~l~~~~qkr~llapSd-VaGwG 607 (739)
T KOG1079|consen 558 AVRECDPDVCLMCGNVDHFDSSKISCKNTNLQRGEQKRVLLAPSD-VAGWG 607 (739)
T ss_pred hccccCchHHhccCcccccccCccccccchhhhhhhcceeechhh-ccccc
Confidence 367898 66765 24799999999999998777533 34888
No 39
>KOG1338|consensus
Probab=44.10 E-value=15 Score=39.49 Aligned_cols=40 Identities=25% Similarity=0.424 Sum_probs=30.8
Q ss_pred cccccccC---CCCCeeEEEEecCCCCeeEEEEEEcCCCCCCCeEEEecCC
Q psy14657 525 VSHFINHS---CDPNLEVSRINNLNPDLHHVALFAKRDINKNEELSFCYLD 572 (605)
Q Consensus 525 ~~RfiNHS---C~PN~~~~~~~~~d~~~~~i~~fA~rdI~~GEELT~dY~~ 572 (605)
.+-|+||- |..|+.+ +..-+-+.|.|+|++|+|+.-.||-
T Consensus 218 ~ad~lNhd~~k~nanl~y--------~~NcL~mva~r~iekgdev~n~dg~ 260 (466)
T KOG1338|consen 218 IADFLNHDGLKANANLRY--------EDNCLEMVADRNIEKGDEVDNSDGL 260 (466)
T ss_pred hhhhhccchhhcccceec--------cCcceeeeecCCCCCcccccccccc
Confidence 56689995 5666555 2235678999999999999999973
No 40
>cd01395 HMT_MBD Methyl-CpG binding domains (MBD) present in putative histone methyltransferases (HMT) such as CLLD8 and SETDB1 proteins; CLLD8 contains a MBD, a PreSET and a bifurcated SET domain, suggesting that CLLD8 might be associated with methylation-mediated transcriptional repression. SETDB1 and other proteins in this group have a similar domain architecture. SETDB1 is a novel KAP-1-associated histone H3, lysine 9-specific methyltransferase that contributes to HP1-mediated silencing of euchromatic genes by KRAB zinc-finger proteins.
Probab=43.42 E-value=7.2 Score=30.77 Aligned_cols=32 Identities=16% Similarity=0.247 Sum_probs=24.9
Q ss_pred eeecCChhhhh--hHHHHHHhhhcccCCCcchhHHHHHHH
Q psy14657 20 TLIEPNADEVH--RTVSRFLFNRKLLSSAKDFAYAHMLLL 57 (605)
Q Consensus 20 ~~~~~~~~~~~--~~~~~~l~~~~~~~~~~~~~~~~~~~~ 57 (605)
.--.|..+-+| .||++||.+ + ++++..++|+|
T Consensus 27 ~Y~aPCGr~Lr~~~EV~~YL~~-----t-~~~L~~d~FsF 60 (60)
T cd01395 27 IYKAPCGRSLRNMSEVHRYLRE-----T-CSFLTVDNFSF 60 (60)
T ss_pred EEECCcchhhhcHHHHHHHHHh-----c-cccceeecccC
Confidence 34567777544 699999999 7 99999998864
No 41
>PF08666 SAF: SAF domain; InterPro: IPR013974 This entry includes a range of different proteins, such as antifreeze proteins, flagellar FlgA proteins, and CpaB pilus proteins. ; PDB: 1C89_A 3NLA_A 3RDN_A 1C8A_A 3FRN_A 1WVO_A 3K3S_H 3G8R_B 1XUU_A 1XUZ_A ....
Probab=43.07 E-value=15 Score=28.33 Aligned_cols=16 Identities=25% Similarity=0.345 Sum_probs=11.7
Q ss_pred EEEEcCCCCCCCeEEE
Q psy14657 553 ALFAKRDINKNEELSF 568 (605)
Q Consensus 553 ~~fA~rdI~~GEELT~ 568 (605)
.++|.|||++|+.|+-
T Consensus 3 vvVA~~di~~G~~i~~ 18 (63)
T PF08666_consen 3 VVVAARDIPAGTVITA 18 (63)
T ss_dssp EEEESSTB-TT-BECT
T ss_pred EEEEeCccCCCCEEcc
Confidence 4789999999999853
No 42
>KOG2155|consensus
Probab=31.70 E-value=24 Score=38.50 Aligned_cols=56 Identities=18% Similarity=0.361 Sum_probs=40.0
Q ss_pred ceEEEeeeeccCcccccccCCCCCeeEE-EEecCCCCeeEEEEEEcCCCCCCCeEEEecCCCc
Q psy14657 513 VAFVLDAARYGNVSHFINHSCDPNLEVS-RINNLNPDLHHVALFAKRDINKNEELSFCYLDLT 574 (605)
Q Consensus 513 ~~~~IDa~~~GN~~RfiNHSC~PN~~~~-~~~~~d~~~~~i~~fA~rdI~~GEELT~dY~~~~ 574 (605)
..|++|- ++.-+.||-.||..+. +. -+...-..-.++-+|+...|||+|-|+..+.
T Consensus 197 vWYvMDe-----fGsrvrHsdePnf~~aPf~-fmPq~vaYsimwp~k~~~tgeE~trDfasg~ 253 (631)
T KOG2155|consen 197 VWYVMDE-----FGSRVRHSDEPNFRIAPFM-FMPQNVAYSIMWPTKPVNTGEEITRDFASGV 253 (631)
T ss_pred eeEEHhh-----hhhhhccCCCCcceeeehe-ecchhcceeEEeeccCCCCchHHHHHHhhcC
Confidence 4556653 4556899999999988 21 1112344556899999999999999987654
No 43
>KOG0383|consensus
Probab=29.58 E-value=21 Score=41.71 Aligned_cols=55 Identities=18% Similarity=0.304 Sum_probs=44.2
Q ss_pred eeEEeeeccceeecCCcEEEEEeccCCCCCCCcccccc-cccchHHHHHHHHHhCC
Q psy14657 223 EYEVESVLDSLELTSDMTVYLVKWKNYDPEYNTWEPIE-NLGNCAKKLAEFLKAGP 277 (605)
Q Consensus 223 ~y~vE~Ild~~~~~~~~~~YlVkW~gy~~~~~TWEp~~-nl~~~~~li~~f~~~~~ 277 (605)
.-.+-.|+.++....+...|+|+|+.-+.++.+||... ++.+....+++|+....
T Consensus 208 ~~~i~rii~~~~s~~~~~~~~Vk~k~l~~d~~~~e~~~~~ip~~~~~~qe~~~~~~ 263 (696)
T KOG0383|consen 208 WMPIARIINRRSSQKGATDYLVKWKELSYDEQEWEVEDPDIPGYSSAGQEAWHHRE 263 (696)
T ss_pred ccccchhhhhhcccccceeeEeeeccCCccccCCCcCCCCcccCcccccccccccC
Confidence 34566777775677889999999999999999999998 66777778888876553
No 44
>PF00856 SET: SET domain; InterPro: IPR001214 The SET domain appears generally as one part of a larger multidomain protein, and recently there were described three structures of very different proteins with distinct domain compositions: Neurospora crassa DIM-5, a member of the Su(var) family of HKMTs which methylate histone H3 on lysine 9,human SET7 (also called SET9), which methylates H3 on lysine 4 and garden pea Rubisco LSMT, an enzyme that does not modify histones, but instead methylates lysine 14 in the flexible tail of the large subunit of the enzyme Rubisco. The SET domain itself turned out to be an uncommon structure. Although in all three studies, electron density maps revealed the location of the AdoMet or AdoHcy cofactor, the SET domain bears no similarity at all to the canonical/AdoMet-dependent methyltransferase fold. Strictly conserved in the C-terminal motif of the SET domain tyrosine could be involved in abstracting a proton from the protonated amino group of the substrate lysine, promoting its nucleophilic attack on the sulphonium methyl group of the AdoMet cofactor. In contrast to the AdoMet-dependent protein methyltranferases of the classical type, which tend to bind their polypeptide substrates on top of the cofactor, it is noted from the Rubisco LSMT structure that the AdoMet seems to bind in a separate cleft, suggesting how a polypeptide substrate could be subjected to multiple rounds of methylation without having to be released from the enzyme. In contrast, SET7/9 is able to add only a single methyl group to its substrate. It has been demonstrated that association of SET domain and myotubularin-related proteins modulates growth control []. The SET domain-containing Drosophila melanogaster (Fruit fly) protein, enhancer of zeste, has a function in segment determination and the mammalian homologue may be involved in the regulation of gene transcription and chromatin structure. Histone lysine methylation is part of the histone code that regulated chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception [], the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain [, ]. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities []. The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site []. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity [], []. ; GO: 0005515 protein binding; PDB: 3TG5_A 3S7F_A 3RIB_B 3TG4_A 3S7J_A 3S7D_A 3S7B_A 3H6L_A 3SMT_A 3K5K_A ....
Probab=24.99 E-value=38 Score=30.32 Aligned_cols=16 Identities=44% Similarity=0.659 Sum_probs=12.3
Q ss_pred EEEEcCCCCCCCeEEE
Q psy14657 553 ALFAKRDINKNEELSF 568 (605)
Q Consensus 553 ~~fA~rdI~~GEELT~ 568 (605)
++||+|||++||-|.+
T Consensus 3 Gl~At~dI~~Ge~I~~ 18 (162)
T PF00856_consen 3 GLFATRDIKAGEVILI 18 (162)
T ss_dssp EEEESS-B-TTEEEEE
T ss_pred EEEECccCCCCCEEEE
Confidence 6899999999998874
Done!