Query         psy14657
Match_columns 605
No_of_seqs    606 out of 2827
Neff          7.2 
Searched_HMMs 46136
Date          Fri Aug 16 17:19:24 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy14657.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14657hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1141|consensus              100.0 2.1E-50 4.5E-55  435.4   8.1  478   30-602   634-1261(1262)
  2 KOG1082|consensus              100.0 5.8E-42 1.2E-46  365.3  16.3  257  346-604    68-355 (364)
  3 KOG4442|consensus              100.0 2.1E-39 4.5E-44  350.3  12.7  156  428-603    94-260 (729)
  4 KOG1080|consensus               99.9 1.8E-27 3.9E-32  273.9   9.7  129  455-602   868-1004(1005)
  5 KOG1079|consensus               99.9 6.6E-25 1.4E-29  236.7   9.9  137  427-574   558-712 (739)
  6 smart00317 SET SET (Su(var)3-9  99.9 3.9E-21 8.5E-26  170.6  13.2  107  454-570     2-116 (116)
  7 KOG1083|consensus               99.8 1.3E-20 2.9E-25  210.5   2.3  123  441-574  1166-1297(1306)
  8 PF05033 Pre-SET:  Pre-SET moti  99.8 9.7E-20 2.1E-24  160.1   5.7   97   84-181     6-103 (103)
  9 smart00468 PreSET N-terminal t  99.8 2.5E-19 5.5E-24  156.1   7.2   88   85-173     9-98  (98)
 10 KOG1082|consensus               99.8 7.3E-19 1.6E-23  187.9   8.9  137   85-232    70-209 (364)
 11 PF05033 Pre-SET:  Pre-SET moti  99.7 5.9E-17 1.3E-21  142.5   7.1   98  345-444     4-103 (103)
 12 KOG1085|consensus               99.7 1.5E-16 3.2E-21  156.9   9.3  124  446-576   250-382 (392)
 13 smart00468 PreSET N-terminal t  99.6 1.8E-15 3.9E-20  131.8   6.9   90  346-436     7-98  (98)
 14 COG2940 Proteins containing SE  99.6 6.6E-16 1.4E-20  171.3   2.2  167  430-603   310-480 (480)
 15 KOG1141|consensus               99.5   3E-15 6.6E-20  164.0   1.7  156  346-506   681-855 (1262)
 16 PF00856 SET:  SET domain;  Int  99.4 3.7E-13 7.9E-18  125.1   9.1  106  464-571     1-162 (162)
 17 PF00385 Chromo:  Chromo (CHRro  99.3 1.6E-12 3.5E-17  100.8   4.9   51  224-275     1-55  (55)
 18 cd00024 CHROMO Chromatin organ  99.2 3.6E-11 7.8E-16   93.0   4.6   52  222-274     1-54  (55)
 19 KOG1911|consensus               99.0 2.2E-10 4.8E-15  117.9   4.0   67  213-280    38-104 (270)
 20 KOG2748|consensus               99.0 1.2E-10 2.5E-15  118.6   1.0   57  218-276     5-61  (369)
 21 smart00298 CHROMO Chromatin or  98.8 1.9E-09 4.1E-14   83.2   3.2   54  223-276     1-54  (55)
 22 KOG2589|consensus               98.7 7.6E-09 1.6E-13  106.1   4.4  103  462-575   136-241 (453)
 23 KOG1081|consensus               98.7 7.3E-09 1.6E-13  113.6   2.4  140  429-603   290-437 (463)
 24 KOG2461|consensus               98.2 1.3E-06 2.8E-11   93.9   5.6  116  450-575    26-147 (396)
 25 KOG4442|consensus               96.6  0.0017 3.7E-08   72.7   4.0   57  166-232    95-153 (729)
 26 smart00508 PostSET Cysteine-ri  96.4  0.0016 3.4E-08   42.1   1.3   16  588-603     2-17  (26)
 27 KOG0384|consensus               95.4  0.0084 1.8E-07   71.2   2.4   51  224-276   286-337 (1373)
 28 cd00034 ChSh Chromo Shadow Dom  94.0    0.04 8.6E-07   42.5   2.3   48  226-277     2-51  (54)
 29 PF01393 Chromo_shadow:  Chromo  93.5   0.067 1.5E-06   41.9   2.8   50  225-277     4-54  (58)
 30 KOG2084|consensus               93.0    0.12 2.6E-06   57.0   5.1   62  528-595   208-271 (482)
 31 smart00300 ChSh Chromo Shadow   93.0   0.048   1E-06   43.1   1.4   49  225-277     8-57  (61)
 32 smart00570 AWS associated with  92.8   0.062 1.3E-06   40.8   1.6   24  427-450    27-50  (51)
 33 PF11717 Tudor-knot:  RNA bindi  91.9    0.29 6.3E-06   37.8   4.5   41  221-262    13-53  (55)
 34 PF14061 Mtf2_C:  Polycomb-like  89.7    0.33 7.2E-06   36.2   2.7   22  229-250    28-49  (50)
 35 KOG0384|consensus               88.2     0.2 4.4E-06   60.0   1.2   39  238-276   224-266 (1373)
 36 smart00570 AWS associated with  87.8    0.27 5.8E-06   37.4   1.2   23  164-186    27-49  (51)
 37 KOG1337|consensus               73.2     2.7   6E-05   46.9   3.1   42  528-574   239-280 (472)
 38 KOG1079|consensus               63.4     4.3 9.4E-05   46.2   2.1   39  164-203   558-607 (739)
 39 KOG1338|consensus               44.1      15 0.00033   39.5   2.2   40  525-572   218-260 (466)
 40 cd01395 HMT_MBD Methyl-CpG bin  43.4     7.2 0.00016   30.8  -0.2   32   20-57     27-60  (60)
 41 PF08666 SAF:  SAF domain;  Int  43.1      15 0.00033   28.3   1.7   16  553-568     3-18  (63)
 42 KOG2155|consensus               31.7      24 0.00053   38.5   1.4   56  513-574   197-253 (631)
 43 KOG0383|consensus               29.6      21 0.00045   41.7   0.5   55  223-277   208-263 (696)
 44 PF00856 SET:  SET domain;  Int  25.0      38 0.00083   30.3   1.4   16  553-568     3-18  (162)

No 1  
>KOG1141|consensus
Probab=100.00  E-value=2.1e-50  Score=435.42  Aligned_cols=478  Identities=25%  Similarity=0.426  Sum_probs=320.7

Q ss_pred             hhHHHHHHhhhcccCCCcchhHHHHHHHHHHHhhhhHHHHHHHHHHHHHH-HhhcCCCCccEEEeCCCCCCCCCCcEEcc
Q psy14657         30 HRTVSRFLFNRKLLSSAKDFAYAHMLLLTHFLHGKRKQQLANIREAEERY-NAACETAARLTLENNFDLESPPMDFTYIP  108 (605)
Q Consensus        30 ~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~l~~~~~~~-n~~~~~~~~I~~~N~vD~~~~p~~F~Yi~  108 (605)
                      -.++++||++     ++|+|++..+|+++.|++..|..+..+--.-.-+| +.  ....||+.+|++|+..|| .|.|..
T Consensus       634 ~~el~ryL~e-----t~c~flf~~~f~~~~yV~~~r~~~p~kp~~~~~Di~~g--~e~vpis~~neids~~lp-q~ay~K  705 (1262)
T KOG1141|consen  634 RIELYRYLVE-----TRCKFLFVIGFDRAFYVVRHRAPNPLKPGNRCTDIPCG--REHVPISEKNEIDSHRLP-QAAYKK  705 (1262)
T ss_pred             HHHHHHHHHH-----hcCcEEEEeecccchheeecccCCCcCCcceeccccCC--ccccccceeecccCcCCc-cchhhe
Confidence            3688999999     99999999999999999887776544222222233 22  245599999999988877 899999


Q ss_pred             CCCCCCCccccC--CCcccccCCCCCcCCCcccccccc--------CCCccc--cccccceeeccCcccccCCCCcCCC-
Q psy14657        109 SSVPRDGVVVTD--DPVIWCECRGNCVSNRDACCSDLN--------DADFAY--SRRTKRLKLEKGTPIYECNKKCACD-  175 (605)
Q Consensus       109 ~~~~~~~~~~~~--~~~~gC~C~~~C~~~~~~cC~~~~--------~~~~~Y--~~~~~~l~~~~~~~IyECn~~C~C~-  175 (605)
                      +.|++.+..+..  +++++|+|..+|.+..+|+|.++.        ++..+-  +++++||.....+++|||+.+|+|. 
T Consensus       706 ~~ip~~~nl~n~~~~fl~scdc~~gcid~~kcachQltvk~~~t~p~~~v~~t~gykyKRl~e~~ptg~yEc~k~ckc~~  785 (1262)
T KOG1141|consen  706 HMIPTNNNLSNRRKDFLQSCDCPTGCIDSMKCACHQLTVKKKTTGPNQNVASTNGYKYKRLIEIRPTGPYECLKACKCCG  785 (1262)
T ss_pred             eeccCCCcccccChhhhhcCCCCcchhhhhhhhHHHHHHHhhccCCCcccccCcchhhHHHHHhcCCCHHHHHHhhccCc
Confidence            999999988643  678999999999998899997663        111111  2235788888889999999999998 


Q ss_pred             cCCccceeeeCceeeeeEEEeeccCCceeEEEeecccccccccCCCceeEEeeeccceeec-CCcEEEEEeccCCCCCCC
Q psy14657        176 ETCLNRVVQKGNLVQDCHTTITSRDGSVVLLRVADINLTKFSSDTMTEYEVESVLDSLELT-SDMTVYLVKWKNYDPEYN  254 (605)
Q Consensus       176 ~~C~NRVvQ~G~~~~L~vf~~~~~kG~~~~~~~~~I~i~~~~~~~~~~y~vE~Ild~~~~~-~~~~~YlVkW~gy~~~~~  254 (605)
                      +.|.||+||||.+|+||+|+ +.-+|||         ++-...+..+.|++=|--...... ..+       .||-.-. 
T Consensus       786 ~~C~nrmvqhg~qvRlq~fk-t~~kGWg---------~rclddi~~g~fVciy~g~~l~~~~sdk-------s~~~~~~-  847 (1262)
T KOG1141|consen  786 PDCLNRMVQHGYQVRLQRFK-TIHKGWG---------RRCLDDITGGNFVCIYPGGALLHQISDK-------SEYIHVT-  847 (1262)
T ss_pred             HHHHHHHhhcCceeEeeecc-ccccccc---------eEeeeecCCceEEEEecchhhhhhhchh-------hhhcccc-
Confidence            56999999999999999999 7778999         555556666666665543221000 000       0110000 


Q ss_pred             cccccccccchHHHHHHHHHhCCCCchhhHHHHhhccCCChhHHHHHHHHHHhcccchhhhhHHHHHHHHhhhhHHHHhh
Q psy14657        255 TWEPIENLGNCAKKLAEFLKAGPDQERTDFEKMKSFLSQHTEEEVESVLAKLRNKKDFAYAHMLLLTHFLHGKRKQQLAN  334 (605)
Q Consensus       255 TWEp~~nl~~~~~li~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~l~~kR~~ql~~  334 (605)
                        +...+| +|    ..|.+.                              +....+....              .....
T Consensus       848 --~~~~~i-d~----~~f~~~------------------------------~dt~~~~tvD--------------~~g~d  876 (1262)
T KOG1141|consen  848 --RSLLTI-DC----FSFDAR------------------------------IDTATYITVD--------------DKGLD  876 (1262)
T ss_pred             --hhhhcc-cc----cchhcc------------------------------ccccceeecc--------------ccccc
Confidence              000111 11    011111                              1111110000              00001


Q ss_pred             HHHHHHHHHhccCCCcceEEEcCCCCCCCCC------CcEEeeecccCCCccccCCCCcccccCCCCcCCCccc-ccccC
Q psy14657        335 IREAEERYNAACETAARLTLENNFDLESPPM------DFTYIPSSVPRDGVVVTDDPVIWCECRGNCVSNRDAC-CSDLN  407 (605)
Q Consensus       335 l~~we~~~n~~~~~~~pi~~~N~vd~~~~p~------~F~yi~~~~~~~~~~~~~~~~~gC~C~~~C~~~~~~C-c~~~~  407 (605)
                      ++.++     .+....|||.+|.+|.+.||.      .|.|..+.-.. +  .......||+|.+.|.+... | |....
T Consensus       877 ~~d~~-----~g~sg~~~p~~~~~d~~~~~~c~d~~~~~~~~~~~~~s-~--~~~~~~~~~s~d~hp~d~~~-~~~~~~~  947 (1262)
T KOG1141|consen  877 VADFS-----LGTSGIPIPLVNSVDNDEPPSCEDSKRRFQYNDQVDIS-S--VSRDFCSGCSCDGHPSDASK-CECQQLS  947 (1262)
T ss_pred             hhhhh-----ccccCCCCccccccccCCCccccccceeecccccchhh-h--hccccccccccCCCCcccCc-ccCCCCC
Confidence            11111     134568999999999888875      34443321111 1  23455679999988987766 5 43322


Q ss_pred             C-------CCccee-------cccCcccccCCCceEecCCCCCCCCCCcCcccccCCCcCEE--------EEEeCCCCce
Q psy14657        408 D-------ADFAYS-------RRTKRLKLEKGTPIYECNKKCACDETCLNRVVQKGITLPLT--------IFKTKNNRGW  465 (605)
Q Consensus       408 ~-------~~~~y~-------~~~~~l~~~~~~~i~EC~~~C~C~~~C~Nrv~q~g~~~~l~--------i~~t~~~kG~  465 (605)
                      .       ..+..+       ...... .......+||+..|.|...|.|+++|.+.+.+++        |+++ ...||
T Consensus       948 ~~~~~~cpp~~s~d~~~~~~eS~~~~n-s~~~~~f~e~~~hss~~~~e~~~~v~~~~~~~me~~s~~~l~i~~~-~~~~~ 1025 (1262)
T KOG1141|consen  948 IEAMKRCPPNLSFDGHDELYESSEKQN-SFLKLFFFECNDHSSCHRKEYNRVVQNNIKYPMEVSSFNDLQIFKT-AQSGW 1025 (1262)
T ss_pred             hhhhcCCCCccccCchhhhhhhhhhcc-hhhhccceeccccchhcccccchhhhcCCccceeeeeccccccccc-ccccc
Confidence            1       011111       000000 0011246899999999999999999998877665        5555 56799


Q ss_pred             eEEcCCccCCCcEEEEeeeeeecHHHHHhh---cCCceeeeccCCC----------------------------------
Q psy14657        466 GVRTPDKIKAGTFVCEYVGEILTHENALQR---TNQTYCFNLDFNQ----------------------------------  508 (605)
Q Consensus       466 Gv~A~~~I~~G~~I~ey~Gevi~~~e~~~r---~~~~y~f~l~~~~----------------------------------  508 (605)
                      |+++..||+.-+||++|+|...+..-+.+.   ..+.|.-++|...                                  
T Consensus      1026 ~~~edtD~~~~~~~~~~~~~ppt~~l~~~~r~aqad~~sn~~D~~~~~~l~es~~~~~T~~r~~t~~~~~~~~~d~dd~q 1105 (1262)
T KOG1141|consen 1026 GVREDTDIPQSTFICTYVGAPPTDDLADELRNAQADQYSNDLDLKDTVELEESREDHETDFRGDTSDYDDEEGSDGDDGQ 1105 (1262)
T ss_pred             cccccccCCCCcccccccCCCCchhhHHHHhhhhhccccCccchhhhhhhhhcccccccccCCCCCCCcccccccCccHH
Confidence            999999999999999999988766544432   0111111111000                                  


Q ss_pred             ----------------C---------------------------------------------------CCCceEEEeeee
Q psy14657        509 ----------------D---------------------------------------------------SNSVAFVLDAAR  521 (605)
Q Consensus       509 ----------------~---------------------------------------------------~~~~~~~IDa~~  521 (605)
                                      +                                                   ....-|+|||+.
T Consensus      1106 ~I~k~ve~qd~~~~~~~T~~~~RQ~~~~s~k~~~~~s~~~~~~ts~~~~~~dkges~~~~~~~~~~y~~~~~~yvIDAk~ 1185 (1262)
T KOG1141|consen 1106 DIMKMVERQDSSESGEETKRLTRQKRKQSKKSGKGGSVEKDDTTSRDSMEKDKGESKDEPVFNWDKYFEPFPLYVIDAKQ 1185 (1262)
T ss_pred             HHHHHhhcccccccccccchhhhhhhhhhhhcccCccccccccCccchhhhccCccCcccccchhhccCCCceEEEeccc
Confidence                            0                                                   012578999999


Q ss_pred             ccCcccccccCCCCCeeEE--EEecCCCCeeEEEEEEcCCCCCCCeEEEecCCCcccccchhhhcCCCCeeeecCCCCCc
Q psy14657        522 YGNVSHFINHSCDPNLEVS--RINNLNPDLHHVALFAKRDINKNEELSFCYLDLTKAKFTSSKRKKLVRNECRCGSSNCL  599 (605)
Q Consensus       522 ~GN~~RfiNHSC~PN~~~~--~~~~~d~~~~~i~~fA~rdI~~GEELT~dY~~~~~~~~~~~~~~~~~~~~C~Cgs~~Cr  599 (605)
                      .||++||+||||+||+.++  +++.+|..+|.++|||.+-|+||+||||||++..+..       ......|+||+.+||
T Consensus      1186 eGNlGRfLNHSC~PNl~VQnVfvdTHdlrfPwVAFFt~kyVkAgtELTWDY~Ye~g~v-------~~keL~C~CGa~~Cr 1258 (1262)
T KOG1141|consen 1186 EGNLGRFLNHSCDPNLHVQNVFVDTHDLRFPWVAFFTRKYVKAGTELTWDYQYEQGQV-------ATKELTCHCGAENCR 1258 (1262)
T ss_pred             ccchhhhhccCCCccceeeeeeeeccccCCchhhhhhhhhhccCceeeeecccccccc-------ccceEEEecChhhhh
Confidence            9999999999999999999  8999999999999999999999999999999976521       224578999999999


Q ss_pred             ccc
Q psy14657        600 GYY  602 (605)
Q Consensus       600 g~l  602 (605)
                      |.|
T Consensus      1259 grL 1261 (1262)
T KOG1141|consen 1259 GRL 1261 (1262)
T ss_pred             ccc
Confidence            987


No 2  
>KOG1082|consensus
Probab=100.00  E-value=5.8e-42  Score=365.31  Aligned_cols=257  Identities=33%  Similarity=0.589  Sum_probs=206.5

Q ss_pred             cCCCcceEEEcCCCCCCCCCCcEEeeecccCCCccccCCCCcccccCCCCcCCCcc--cccccCCCCcceecccCc-ccc
Q psy14657        346 CETAARLTLENNFDLESPPMDFTYIPSSVPRDGVVVTDDPVIWCECRGNCVSNRDA--CCSDLNDADFAYSRRTKR-LKL  422 (605)
Q Consensus       346 ~~~~~pi~~~N~vd~~~~p~~F~yi~~~~~~~~~~~~~~~~~gC~C~~~C~~~~~~--Cc~~~~~~~~~y~~~~~~-l~~  422 (605)
                      +.+..||+++|+||.+.+ ..|.|++..++..+.........+|.|...|......  +|...++..++|...... ...
T Consensus        68 ~~e~~~v~~~n~id~~~~-~~f~y~~~~~~~~~~~~~~~~~~~c~C~~~~~~~~~~~C~C~~~n~~~~~~~~~~~~~~~~  146 (364)
T KOG1082|consen   68 GSENLPVPLVNRIDEDAP-LYFQYIATEIVDPGELSDCENSTGCRCCSSCSSVLPLTCLCERHNGGLVAYTCDGDCGTLG  146 (364)
T ss_pred             ccccCceeeeeeccCCcc-ccceeccccccCccccccCccccCCCccCCCCCCCCccccChHhhCCccccccCCcccccc
Confidence            344569999999998877 8999999999887644445678899999888775431  377778888888654211 456


Q ss_pred             cCCCceEecCCCCCCCCCCcCcccccCCCcCEEEEEeCCCCceeEEcCCccCCCcEEEEeeeeeecHHHHHhh-cCCcee
Q psy14657        423 EKGTPIYECNKKCACDETCLNRVVQKGITLPLTIFKTKNNRGWGVRTPDKIKAGTFVCEYVGEILTHENALQR-TNQTYC  501 (605)
Q Consensus       423 ~~~~~i~EC~~~C~C~~~C~Nrv~q~g~~~~l~i~~t~~~kG~Gv~A~~~I~~G~~I~ey~Gevi~~~e~~~r-~~~~y~  501 (605)
                      ..+.++|||+..|+|+++|.|||+|+|.+.+|+||+|.+ +||||||++.|++|+||+||+||+++.++++.+ ....|.
T Consensus       147 ~~~~~i~EC~~~C~C~~~C~nRv~q~g~~~~leIfrt~~-kGwgvRs~~~I~~G~fvcEyaGe~~t~~e~~~~~~~~~~~  225 (364)
T KOG1082|consen  147 KFKEPVFECSVACGCHPDCANRVVQKGLQFHLEVFRTPE-KGWGVRTLDPIPAGEFVCEYAGEVLTSEEAQRRTHLREYL  225 (364)
T ss_pred             ccCccccccccCCCCCCcCcchhhccccccceEEEecCC-ceeeecccccccCCCeeEEEeeEecChHHhhhcccccccc
Confidence            677899999999999999999999999999999999954 999999999999999999999999999999977 222333


Q ss_pred             e---------eccCCC--------------CCCCceEEEeeeeccCcccccccCCCCCeeEE--EEecCCCCeeEEEEEE
Q psy14657        502 F---------NLDFNQ--------------DSNSVAFVLDAARYGNVSHFINHSCDPNLEVS--RINNLNPDLHHVALFA  556 (605)
Q Consensus       502 f---------~l~~~~--------------~~~~~~~~IDa~~~GN~~RfiNHSC~PN~~~~--~~~~~d~~~~~i~~fA  556 (605)
                      +         ..++..              ......+.|||...||++|||||||.||+.++  +.++.++..++++|||
T Consensus       226 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ida~~~GNv~RfinHSC~PN~~~~~v~~~~~~~~~~~i~ffa  305 (364)
T KOG1082|consen  226 DDDCDAYSIADREWVDESPVGNTFVAPSLPGGPGRELLIDAKPHGNVARFINHSCSPNLLYQAVFQDEFVLLYLRIGFFA  305 (364)
T ss_pred             ccccccchhhhccccccccccccccccccccCCCcceEEchhhcccccccccCCCCccceeeeeeecCCccchheeeeee
Confidence            3         222221              12357899999999999999999999999999  6667788899999999


Q ss_pred             cCCCCCCCeEEEecCCCcccccchhh--hcCCCCeeeecCCCCCcccccC
Q psy14657        557 KRDINKNEELSFCYLDLTKAKFTSSK--RKKLVRNECRCGSSNCLGYYYL  604 (605)
Q Consensus       557 ~rdI~~GEELT~dY~~~~~~~~~~~~--~~~~~~~~C~Cgs~~Crg~l~~  604 (605)
                      +++|.||||||+||+..+........  ........|.||+.+||+.++.
T Consensus       306 ~~~I~p~~ELT~dYg~~~~~~~~~~~~~~~~~~~~~c~c~~~~cr~~~~~  355 (364)
T KOG1082|consen  306 LRDISPGEELTLDYGKAYKLLVQDGANIYTPVMKKNCNCGLEKCRGLLGS  355 (364)
T ss_pred             ccccCCCcccchhhcccccccccccccccccccchhhcCCCHHhCcccCC
Confidence            99999999999999987531111111  1133667999999999999863


No 3  
>KOG4442|consensus
Probab=100.00  E-value=2.1e-39  Score=350.28  Aligned_cols=156  Identities=35%  Similarity=0.631  Sum_probs=139.2

Q ss_pred             eEecCC-CCC-CCCCCcCcccccCCCcCEEEEEeCCCCceeEEcCCccCCCcEEEEeeeeeecHHHHHhh--------cC
Q psy14657        428 IYECNK-KCA-CDETCLNRVVQKGITLPLTIFKTKNNRGWGVRTPDKIKAGTFVCEYVGEILTHENALQR--------TN  497 (605)
Q Consensus       428 i~EC~~-~C~-C~~~C~Nrv~q~g~~~~l~i~~t~~~kG~Gv~A~~~I~~G~~I~ey~Gevi~~~e~~~r--------~~  497 (605)
                      ..||++ .|. |+..|.|+.+|+....+++||.| .++||||+|.++|++|+||+||.||||+..++..|        ..
T Consensus        94 ~iECs~~~C~~cg~~C~NQRFQkkqyA~vevF~T-e~KG~GLRA~~dI~~g~FI~EY~GEVI~~~Ef~kR~~~Y~~d~~k  172 (729)
T KOG4442|consen   94 SIECSDRECPRCGVYCKNQRFQKKQYAKVEVFLT-EKKGCGLRAEEDIPKGQFILEYIGEVIEEKEFEKRVKRYAKDGIK  172 (729)
T ss_pred             hcccCCccCCCccccccchhhhhhccCceeEEEe-cCcccceeeccccCCCcEEeeeccccccHHHHHHHHHHHHhcCCc
Confidence            469977 998 99999999999999999999999 69999999999999999999999999999999988        33


Q ss_pred             CceeeeccCCCCCCCceEEEeeeeccCcccccccCCCCCeeEE-EEecCCCCeeEEEEEEcCCCCCCCeEEEecCCCccc
Q psy14657        498 QTYCFNLDFNQDSNSVAFVLDAARYGNVSHFINHSCDPNLEVS-RINNLNPDLHHVALFAKRDINKNEELSFCYLDLTKA  576 (605)
Q Consensus       498 ~~y~f~l~~~~~~~~~~~~IDa~~~GN~~RfiNHSC~PN~~~~-~~~~~d~~~~~i~~fA~rdI~~GEELT~dY~~~~~~  576 (605)
                      ..|.|.|..       .++|||+.+||.||||||||+|||.+. |...   +..||+|||.|+|++||||||||+.....
T Consensus       173 h~Yfm~L~~-------~e~IDAT~KGnlaRFiNHSC~PNa~~~KWtV~---~~lRvGiFakk~I~~GEEITFDYqf~rYG  242 (729)
T KOG4442|consen  173 HYYFMALQG-------GEYIDATKKGNLARFINHSCDPNAEVQKWTVP---DELRVGIFAKKVIKPGEEITFDYQFDRYG  242 (729)
T ss_pred             eEEEEEecC-------CceecccccCcHHHhhcCCCCCCceeeeeeeC---CeeEEEEeEecccCCCceeeEeccccccc
Confidence            455555543       469999999999999999999999999 9874   68999999999999999999999986421


Q ss_pred             ccchhhhcCCCCeeeecCCCCCccccc
Q psy14657        577 KFTSSKRKKLVRNECRCGSSNCLGYYY  603 (605)
Q Consensus       577 ~~~~~~~~~~~~~~C~Cgs~~Crg~l~  603 (605)
                               ....+|.||+++|||||.
T Consensus       243 ---------r~AQ~CyCgeanC~G~IG  260 (729)
T KOG4442|consen  243 ---------RDAQPCYCGEANCRGWIG  260 (729)
T ss_pred             ---------ccccccccCCcccccccC
Confidence                     145689999999999984


No 4  
>KOG1080|consensus
Probab=99.94  E-value=1.8e-27  Score=273.87  Aligned_cols=129  Identities=36%  Similarity=0.658  Sum_probs=111.7

Q ss_pred             EEEEeCCCCceeEEcCCccCCCcEEEEeeeeeecHHHHHhh--------cCCceeeeccCCCCCCCceEEEeeeeccCcc
Q psy14657        455 TIFKTKNNRGWGVRTPDKIKAGTFVCEYVGEILTHENALQR--------TNQTYCFNLDFNQDSNSVAFVLDAARYGNVS  526 (605)
Q Consensus       455 ~i~~t~~~kG~Gv~A~~~I~~G~~I~ey~Gevi~~~e~~~r--------~~~~y~f~l~~~~~~~~~~~~IDa~~~GN~~  526 (605)
                      .-|..+..+||||||++.|.+|++|+||+||+|...-|+.|        .++.|+|.+|.       .++|||++.||+|
T Consensus       868 ~~F~~s~iH~wglfa~~~i~~~dmViEY~Ge~vR~~iad~RE~~Y~~~gi~~sYlfrid~-------~~ViDAtk~gniA  940 (1005)
T KOG1080|consen  868 VKFGRSGIHGWGLFAMENIAAGDMVIEYRGELVRSSIADLREARYERMGIGDSYLFRIDD-------EVVVDATKKGNIA  940 (1005)
T ss_pred             hccccccccccceeeccCccccceEEEeeceehhhhHHHHHHHHHhccCcccceeeeccc-------ceEEeccccCchh
Confidence            34455589999999999999999999999999988877766        57899999985       4599999999999


Q ss_pred             cccccCCCCCeeEEEEecCCCCeeEEEEEEcCCCCCCCeEEEecCCCcccccchhhhcCCCCeeeecCCCCCcccc
Q psy14657        527 HFINHSCDPNLEVSRINNLNPDLHHVALFAKRDINKNEELSFCYLDLTKAKFTSSKRKKLVRNECRCGSSNCLGYY  602 (605)
Q Consensus       527 RfiNHSC~PN~~~~~~~~~d~~~~~i~~fA~rdI~~GEELT~dY~~~~~~~~~~~~~~~~~~~~C~Cgs~~Crg~l  602 (605)
                      |||||||+|||....+.. + +..+|+|||.|+|.+|||||+||.+...+          ....|+|||++|||+|
T Consensus       941 r~InHsC~PNCyakvi~V-~-g~~~IvIyakr~I~~~EElTYDYkF~~e~----------~kipClCgap~Crg~~ 1004 (1005)
T KOG1080|consen  941 RFINHSCNPNCYAKVITV-E-GDKRIVIYSKRDIAAGEELTYDYKFPTED----------DKIPCLCGAPNCRGFL 1004 (1005)
T ss_pred             heeecccCCCceeeEEEe-c-CeeEEEEEEecccccCceeeeeccccccc----------cccccccCCCcccccc
Confidence            999999999999984332 2 56699999999999999999999875431          3679999999999987


No 5  
>KOG1079|consensus
Probab=99.91  E-value=6.6e-25  Score=236.69  Aligned_cols=137  Identities=32%  Similarity=0.576  Sum_probs=119.6

Q ss_pred             ceEec-CCCCCC-C---------CCCcCcccccCCCcCEEEEEeCCCCceeEEcCCccCCCcEEEEeeeeeecHHHHHhh
Q psy14657        427 PIYEC-NKKCAC-D---------ETCLNRVVQKGITLPLTIFKTKNNRGWGVRTPDKIKAGTFVCEYVGEILTHENALQR  495 (605)
Q Consensus       427 ~i~EC-~~~C~C-~---------~~C~Nrv~q~g~~~~l~i~~t~~~kG~Gv~A~~~I~~G~~I~ey~Gevi~~~e~~~r  495 (605)
                      +..|| |..|.+ +         .+|.|--+|++.+.++.+..+ ...|||+|+.+...+++||.||+||+|+.+||++|
T Consensus       558 A~rECdPd~Cl~cg~~~~~d~~~~~C~N~~l~~~~qkr~llapS-dVaGwGlFlKe~v~KnefisEY~GE~IS~dEADrR  636 (739)
T KOG1079|consen  558 AVRECDPDVCLMCGNVDHFDSSKISCKNTNLQRGEQKRVLLAPS-DVAGWGLFLKESVSKNEFISEYTGEIISHDEADRR  636 (739)
T ss_pred             hccccCchHHhccCcccccccCccccccchhhhhhhcceeechh-hccccceeeccccCCCceeeeecceeccchhhhhc
Confidence            45699 478865 2         369999999999999998777 88999999999999999999999999999999999


Q ss_pred             ------cCCceeeeccCCCCCCCceEEEeeeeccCcccccccCCCCCeeEE-EEecCCCCeeEEEEEEcCCCCCCCeEEE
Q psy14657        496 ------TNQTYCFNLDFNQDSNSVAFVLDAARYGNVSHFINHSCDPNLEVS-RINNLNPDLHHVALFAKRDINKNEELSF  568 (605)
Q Consensus       496 ------~~~~y~f~l~~~~~~~~~~~~IDa~~~GN~~RfiNHSC~PN~~~~-~~~~~d~~~~~i~~fA~rdI~~GEELT~  568 (605)
                            ..-+|+|+|..       .|+|||++.||.+||+|||-+|||... .+..   +..+|+|||.|.|.+||||||
T Consensus       637 GkiYDr~~cSflFnln~-------dyviDs~rkGnk~rFANHS~nPNCYAkvm~V~---GdhRIGifAkRaIeagEELff  706 (739)
T KOG1079|consen  637 GKIYDRYMCSFLFNLNN-------DYVIDSTRKGNKIRFANHSFNPNCYAKVMMVA---GDHRIGIFAKRAIEAGEELFF  706 (739)
T ss_pred             ccccccccceeeeeccc-------cceEeeeeecchhhhccCCCCCCcEEEEEEec---CCcceeeeehhhcccCceeee
Confidence                  44567777643       489999999999999999999999988 4433   567999999999999999999


Q ss_pred             ecCCCc
Q psy14657        569 CYLDLT  574 (605)
Q Consensus       569 dY~~~~  574 (605)
                      ||.++-
T Consensus       707 DYrYs~  712 (739)
T KOG1079|consen  707 DYRYSP  712 (739)
T ss_pred             eeccCc
Confidence            999864


No 6  
>smart00317 SET SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues
Probab=99.86  E-value=3.9e-21  Score=170.57  Aligned_cols=107  Identities=44%  Similarity=0.799  Sum_probs=90.1

Q ss_pred             EEEEEeCCCCceeEEcCCccCCCcEEEEeeeeeecHHHHHhh------cC--CceeeeccCCCCCCCceEEEeeeeccCc
Q psy14657        454 LTIFKTKNNRGWGVRTPDKIKAGTFVCEYVGEILTHENALQR------TN--QTYCFNLDFNQDSNSVAFVLDAARYGNV  525 (605)
Q Consensus       454 l~i~~t~~~kG~Gv~A~~~I~~G~~I~ey~Gevi~~~e~~~r------~~--~~y~f~l~~~~~~~~~~~~IDa~~~GN~  525 (605)
                      ++++++ +++|+||+|+++|++|++|++|.|.++...++...      .+  ..|+|....       .++||+...||+
T Consensus         2 ~~~~~~-~~~G~gl~a~~~i~~g~~i~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~id~~~~~~~   73 (116)
T smart00317        2 LEVFKS-PGKGWGVRATEDIPKGEFIGEYVGEIITSEEAEERSKAYDTDGADSFYLFEIDS-------DLCIDARRKGNI   73 (116)
T ss_pred             cEEEec-CCCcEEEEECCccCCCCEEEEEEeEEECHHHHHHHHHHHHhcCCCCEEEEECCC-------CEEEeCCccCcH
Confidence            566777 79999999999999999999999999988776654      22  366666532       579999999999


Q ss_pred             ccccccCCCCCeeEEEEecCCCCeeEEEEEEcCCCCCCCeEEEec
Q psy14657        526 SHFINHSCDPNLEVSRINNLNPDLHHVALFAKRDINKNEELSFCY  570 (605)
Q Consensus       526 ~RfiNHSC~PN~~~~~~~~~d~~~~~i~~fA~rdI~~GEELT~dY  570 (605)
                      +|||||||.||+.+.+....  +..++.++|+|||++|||||+||
T Consensus        74 ~~~iNHsc~pN~~~~~~~~~--~~~~~~~~a~r~I~~GeEi~i~Y  116 (116)
T smart00317       74 ARFINHSCEPNCELLFVEVN--GDSRIVIFALRDIKPGEELTIDY  116 (116)
T ss_pred             HHeeCCCCCCCEEEEEEEEC--CCcEEEEEECCCcCCCCEEeecC
Confidence            99999999999999954432  34489999999999999999999


No 7  
>KOG1083|consensus
Probab=99.79  E-value=1.3e-20  Score=210.49  Aligned_cols=123  Identities=43%  Similarity=0.761  Sum_probs=106.1

Q ss_pred             CcCccccc-CCCcCEEEEEeCCCCceeEEcCCccCCCcEEEEeeeeeecHHHHHhh-------cCCceeeeccCCCCCCC
Q psy14657        441 CLNRVVQK-GITLPLTIFKTKNNRGWGVRTPDKIKAGTFVCEYVGEILTHENALQR-------TNQTYCFNLDFNQDSNS  512 (605)
Q Consensus       441 C~Nrv~q~-g~~~~l~i~~t~~~kG~Gv~A~~~I~~G~~I~ey~Gevi~~~e~~~r-------~~~~y~f~l~~~~~~~~  512 (605)
                      |.|+.+|+ +.-.+|++|+. +.+||||+|.++|++|+||+||+|+|++.++++.+       ....|...++       
T Consensus      1166 c~nqrm~r~e~cp~L~v~~g-p~~G~~v~tk~PikagtfI~EYvGeVit~ke~e~~mmtl~~~d~~~~cL~I~------- 1237 (1306)
T KOG1083|consen 1166 CSNQRMQRHEECPPLEVFRG-PKKGWGVRTKEPIKAGTFIMEYVGEVITEKEFEPRMMTLYHNDDDHYCLVID------- 1237 (1306)
T ss_pred             hhhHHhhhhccCCCcceecc-CCCCccccccccccccchHHHHHHHHHHHHhhcccccccCCCCCcccccccC-------
Confidence            78888776 55677999999 89999999999999999999999999999988876       2234444443       


Q ss_pred             ceEEEeeeeccCcccccccCCCCCeeEE-EEecCCCCeeEEEEEEcCCCCCCCeEEEecCCCc
Q psy14657        513 VAFVLDAARYGNVSHFINHSCDPNLEVS-RINNLNPDLHHVALFAKRDINKNEELSFCYLDLT  574 (605)
Q Consensus       513 ~~~~IDa~~~GN~~RfiNHSC~PN~~~~-~~~~~d~~~~~i~~fA~rdI~~GEELT~dY~~~~  574 (605)
                      ...+||+.++||.+||+||||.|||.++ |..+   +..|+.+||+|||.+|||||+||+...
T Consensus      1238 p~l~id~~R~~n~~RfinhscKPNc~~qkwSVN---G~~Rv~L~A~rDi~kGEELtYDYN~ks 1297 (1306)
T KOG1083|consen 1238 PGLFIDIPRMGNGARFINHSCKPNCEMQKWSVN---GEYRVGLFALRDLPKGEELTYDYNFKS 1297 (1306)
T ss_pred             ccccCChhhccccccccccccCCCCcccccccc---ceeeeeeeecCCCCCCceEEEeccccc
Confidence            3569999999999999999999999999 7654   889999999999999999999998743


No 8  
>PF05033 Pre-SET:  Pre-SET motif;  InterPro: IPR007728 This region is found in a number of histone lysine methyltransferases (HMTase), N-terminal to the SET domain; it is generally described as the pre-SET domain. Histone lysine methylation is part of the histone code that regulated chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception [], the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain [, ]. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities [].  The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils and stabilising the SET domain. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site [] when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site []. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity []. ; GO: 0008270 zinc ion binding, 0018024 histone-lysine N-methyltransferase activity, 0034968 histone lysine methylation, 0005634 nucleus; PDB: 3K5K_A 2O8J_D 3RJW_B 1ML9_A 1PEG_B 1MVH_A 1MVX_A 3BO5_A 2RFI_B 3MO5_B ....
Probab=99.79  E-value=9.7e-20  Score=160.15  Aligned_cols=97  Identities=33%  Similarity=0.776  Sum_probs=69.4

Q ss_pred             CCCCccEEEeCCCCCCCCCCcEEccCCCCCCCcc-ccCCCcccccCCCCCcCCCccccccccCCCccccccccceeeccC
Q psy14657         84 ETAARLTLENNFDLESPPMDFTYIPSSVPRDGVV-VTDDPVIWCECRGNCVSNRDACCSDLNDADFAYSRRTKRLKLEKG  162 (605)
Q Consensus        84 ~~~~~I~~~N~vD~~~~p~~F~Yi~~~~~~~~~~-~~~~~~~gC~C~~~C~~~~~~cC~~~~~~~~~Y~~~~~~l~~~~~  162 (605)
                      ++..||+|+|+||++.||++|+||+++++++++. ++.....||+|.+.|.+...|.|.+.++..++|+. .++|.+...
T Consensus         6 ~e~~pI~~~N~vd~~~~p~~F~Yi~~~~~~~~~~~~~~~~~~~C~C~~~C~~~~~C~C~~~~~~~~~Y~~-~g~l~~~~~   84 (103)
T PF05033_consen    6 KENVPIPVVNDVDDEPPPPNFEYIPENIYGEGVPDIDPEFLQGCDCSGDCSNPSNCECLQRNGGIFAYDS-NGRLRIPDK   84 (103)
T ss_dssp             SSSS-EEEEESSSS--SSTSSEE-SS-EESTTSS-TBGGGTS----SSSSTCTTTSHHHCCTSSS-SB-T-TSSBSSSST
T ss_pred             ccCCCEEEEeCCCCCCCCCCeEEeeeEEcCCCccccccccCccCccCCCCCCCCCCcCccccCccccccC-CCcCccCCC
Confidence            3566999999999999988999999999999887 44566789999999966667779999888899997 477777788


Q ss_pred             cccccCCCCcCCCcCCccc
Q psy14657        163 TPIYECNKKCACDETCLNR  181 (605)
Q Consensus       163 ~~IyECn~~C~C~~~C~NR  181 (605)
                      .+|||||+.|+|+++|+||
T Consensus        85 ~~i~EC~~~C~C~~~C~NR  103 (103)
T PF05033_consen   85 PPIFECNDNCGCSPSCRNR  103 (103)
T ss_dssp             SEEE---TTSSS-TTSTT-
T ss_pred             CeEEeCCCCCCCCCCCCCC
Confidence            9999999999999999998


No 9  
>smart00468 PreSET N-terminal to some SET domains. A Cys-rich putative Zn2+-binding domain that occurs N-terminal to some SET domains. Function is unknown. Unpublished.
Probab=99.78  E-value=2.5e-19  Score=156.10  Aligned_cols=88  Identities=34%  Similarity=0.740  Sum_probs=78.4

Q ss_pred             CCCccEEEeCCCCCCCCCCcEEccCCCCCCCcc--ccCCCcccccCCCCCcCCCccccccccCCCccccccccceeeccC
Q psy14657         85 TAARLTLENNFDLESPPMDFTYIPSSVPRDGVV--VTDDPVIWCECRGNCVSNRDACCSDLNDADFAYSRRTKRLKLEKG  162 (605)
Q Consensus        85 ~~~~I~~~N~vD~~~~p~~F~Yi~~~~~~~~~~--~~~~~~~gC~C~~~C~~~~~~cC~~~~~~~~~Y~~~~~~l~~~~~  162 (605)
                      +..||+++|+||++.||.+|+||+++++++|+.  .+..++.||+|.++|++...|+|.+++++.+||+. .+++.+..+
T Consensus         9 E~~pI~~vN~vD~~~~p~~F~Yi~~~~~~~gv~~~~~~~~~~gC~C~~~C~~~~~C~C~~~~~~~~~Y~~-~~~~~~~~~   87 (98)
T smart00468        9 ENVPVPLVNEVDEDPPPPDFEYISEYIYGQGVPIDRSPSPLVGCSCSGDCSSSNKCECARKNGGEFAYEL-NGGLRLKRK   87 (98)
T ss_pred             cCCCcceEecCCCCCCCCCcEECcceEcCCCcccccCCCCCCCCcCCCCCCCCCcCCcHhhcCCccCccc-CCCEEeCCC
Confidence            456999999999999988999999999999986  35678899999999999766889999999999963 478889999


Q ss_pred             cccccCCCCcC
Q psy14657        163 TPIYECNKKCA  173 (605)
Q Consensus       163 ~~IyECn~~C~  173 (605)
                      +||||||++|+
T Consensus        88 ~~IyECn~~C~   98 (98)
T smart00468       88 PLIYECNSRCS   98 (98)
T ss_pred             CEEEcCCCCCC
Confidence            99999999995


No 10 
>KOG1082|consensus
Probab=99.76  E-value=7.3e-19  Score=187.92  Aligned_cols=137  Identities=23%  Similarity=0.380  Sum_probs=116.1

Q ss_pred             CCCccEEEeCCCCCCCCCCcEEccCCCCCCCccccCCCcccccCCCCCcCCCc--cccccccCCCccccccccc-eeecc
Q psy14657         85 TAARLTLENNFDLESPPMDFTYIPSSVPRDGVVVTDDPVIWCECRGNCVSNRD--ACCSDLNDADFAYSRRTKR-LKLEK  161 (605)
Q Consensus        85 ~~~~I~~~N~vD~~~~p~~F~Yi~~~~~~~~~~~~~~~~~gC~C~~~C~~~~~--~cC~~~~~~~~~Y~~~~~~-l~~~~  161 (605)
                      +.-||+++|+||.+.+ .+|.|++..++.++...+.....+|.|.+.|.....  |.|...+++.++|+..... .....
T Consensus        70 e~~~v~~~n~id~~~~-~~f~y~~~~~~~~~~~~~~~~~~~c~C~~~~~~~~~~~C~C~~~n~~~~~~~~~~~~~~~~~~  148 (364)
T KOG1082|consen   70 ENLPVPLVNRIDEDAP-LYFQYIATEIVDPGELSDCENSTGCRCCSSCSSVLPLTCLCERHNGGLVAYTCDGDCGTLGKF  148 (364)
T ss_pred             ccCceeeeeeccCCcc-ccceeccccccCccccccCccccCCCccCCCCCCCCccccChHhhCCccccccCCcccccccc
Confidence            4449999999998777 799999999999875555577899999988876433  6799999999999854322 56677


Q ss_pred             CcccccCCCCcCCCcCCccceeeeCceeeeeEEEeeccCCceeEEEeecccccccccCCCceeEEeeeccc
Q psy14657        162 GTPIYECNKKCACDETCLNRVVQKGNLVQDCHTTITSRDGSVVLLRVADINLTKFSSDTMTEYEVESVLDS  232 (605)
Q Consensus       162 ~~~IyECn~~C~C~~~C~NRVvQ~G~~~~L~vf~~~~~kG~~~~~~~~~I~i~~~~~~~~~~y~vE~Ild~  232 (605)
                      +.+||||++.|+|+++|.|||||+|++++|+||++.. +|||         +++...++.++|++|++-..
T Consensus       149 ~~~i~EC~~~C~C~~~C~nRv~q~g~~~~leIfrt~~-kGwg---------vRs~~~I~~G~fvcEyaGe~  209 (364)
T KOG1082|consen  149 KEPVFECSVACGCHPDCANRVVQKGLQFHLEVFRTPE-KGWG---------VRTLDPIPAGEFVCEYAGEV  209 (364)
T ss_pred             CccccccccCCCCCCcCcchhhccccccceEEEecCC-ceee---------ecccccccCCCeeEEEeeEe
Confidence            8899999999999999999999999999999999555 8999         78888888999999988665


No 11 
>PF05033 Pre-SET:  Pre-SET motif;  InterPro: IPR007728 This region is found in a number of histone lysine methyltransferases (HMTase), N-terminal to the SET domain; it is generally described as the pre-SET domain. Histone lysine methylation is part of the histone code that regulated chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception [], the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain [, ]. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities [].  The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils and stabilising the SET domain. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site [] when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site []. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity []. ; GO: 0008270 zinc ion binding, 0018024 histone-lysine N-methyltransferase activity, 0034968 histone lysine methylation, 0005634 nucleus; PDB: 3K5K_A 2O8J_D 3RJW_B 1ML9_A 1PEG_B 1MVH_A 1MVX_A 3BO5_A 2RFI_B 3MO5_B ....
Probab=99.68  E-value=5.9e-17  Score=142.45  Aligned_cols=98  Identities=34%  Similarity=0.806  Sum_probs=69.0

Q ss_pred             ccCCCcceEEEcCCCCCCCCCCcEEeeecccCCCcc-ccCCCCcccccCCCCcCCCccc-ccccCCCCcceecccCcccc
Q psy14657        345 ACETAARLTLENNFDLESPPMDFTYIPSSVPRDGVV-VTDDPVIWCECRGNCVSNRDAC-CSDLNDADFAYSRRTKRLKL  422 (605)
Q Consensus       345 ~~~~~~pi~~~N~vd~~~~p~~F~yi~~~~~~~~~~-~~~~~~~gC~C~~~C~~~~~~C-c~~~~~~~~~y~~~~~~l~~  422 (605)
                      .+++..||+++|++|++.+|++|+||+++++++++. .+.....||+|.++|..... | |.+..+..++|+.. |+|..
T Consensus         4 ~g~e~~pI~~~N~vd~~~~p~~F~Yi~~~~~~~~~~~~~~~~~~~C~C~~~C~~~~~-C~C~~~~~~~~~Y~~~-g~l~~   81 (103)
T PF05033_consen    4 RGKENVPIPVVNDVDDEPPPPNFEYIPENIYGEGVPDIDPEFLQGCDCSGDCSNPSN-CECLQRNGGIFAYDSN-GRLRI   81 (103)
T ss_dssp             CTSSSS-EEEEESSSS--SSTSSEE-SS-EESTTSS-TBGGGTS----SSSSTCTTT-SHHHCCTSSS-SB-TT-SSBSS
T ss_pred             CCccCCCEEEEeCCCCCCCCCCeEEeeeEEcCCCccccccccCccCccCCCCCCCCC-CcCccccCccccccCC-CcCcc
Confidence            356779999999999999999999999999999876 44556789999988955554 6 77777777899974 88888


Q ss_pred             cCCCceEecCCCCCCCCCCcCc
Q psy14657        423 EKGTPIYECNKKCACDETCLNR  444 (605)
Q Consensus       423 ~~~~~i~EC~~~C~C~~~C~Nr  444 (605)
                      ....+||||++.|+|+.+|+||
T Consensus        82 ~~~~~i~EC~~~C~C~~~C~NR  103 (103)
T PF05033_consen   82 PDKPPIFECNDNCGCSPSCRNR  103 (103)
T ss_dssp             SSTSEEE---TTSSS-TTSTT-
T ss_pred             CCCCeEEeCCCCCCCCCCCCCC
Confidence            8899999999999999999997


No 12 
>KOG1085|consensus
Probab=99.67  E-value=1.5e-16  Score=156.87  Aligned_cols=124  Identities=31%  Similarity=0.478  Sum_probs=101.8

Q ss_pred             cccCCCcCEEEEEeCCCCceeEEcCCccCCCcEEEEeeeeeecHHHHHhh--------cCCceeeeccCCCCCCCceEEE
Q psy14657        446 VQKGITLPLTIFKTKNNRGWGVRTPDKIKAGTFVCEYVGEILTHENALQR--------TNQTYCFNLDFNQDSNSVAFVL  517 (605)
Q Consensus       446 ~q~g~~~~l~i~~t~~~kG~Gv~A~~~I~~G~~I~ey~Gevi~~~e~~~r--------~~~~y~f~l~~~~~~~~~~~~I  517 (605)
                      +..|....|.+... .+||.||+|...+.+|+||.||.|.+|.-.+|..|        ....|+|-..++    ...|+|
T Consensus       250 vl~g~~egl~~~~~-dgKGRGv~a~~~F~rgdFVVEY~Gdliei~eAk~rE~~Ya~De~~GcYMYyF~h~----sk~yCi  324 (392)
T KOG1085|consen  250 VLKGTNEGLLEVYK-DGKGRGVRAKVNFERGDFVVEYRGDLIEISEAKVREEQYANDEEIGCYMYYFEHN----SKKYCI  324 (392)
T ss_pred             HHhccccceeEEee-ccccceeEeecccccCceEEEEecceeeechHHHHHHHhccCcccceEEEeeecc----Ceeeee
Confidence            44566777777777 68999999999999999999999999988888776        334566655443    347999


Q ss_pred             eeeec-cCcccccccCCCCCeeEEEEecCCCCeeEEEEEEcCCCCCCCeEEEecCCCccc
Q psy14657        518 DAARY-GNVSHFINHSCDPNLEVSRINNLNPDLHHVALFAKRDINKNEELSFCYLDLTKA  576 (605)
Q Consensus       518 Da~~~-GN~~RfiNHSC~PN~~~~~~~~~d~~~~~i~~fA~rdI~~GEELT~dY~~~~~~  576 (605)
                      ||+.- +-++|.||||=.+|+....+.. | +.|++.++|.|||.+||||++|||+...+
T Consensus       325 DAT~et~~lGRLINHS~~gNl~TKvv~I-d-g~pHLiLvA~rdIa~GEELlYDYGDRSke  382 (392)
T KOG1085|consen  325 DATKETPWLGRLINHSVRGNLKTKVVEI-D-GSPHLILVARRDIAQGEELLYDYGDRSKE  382 (392)
T ss_pred             ecccccccchhhhcccccCcceeeEEEe-c-CCceEEEEeccccccchhhhhhccccchh
Confidence            99874 5589999999999999985442 2 78999999999999999999999997643


No 13 
>smart00468 PreSET N-terminal to some SET domains. A Cys-rich putative Zn2+-binding domain that occurs N-terminal to some SET domains. Function is unknown. Unpublished.
Probab=99.59  E-value=1.8e-15  Score=131.81  Aligned_cols=90  Identities=33%  Similarity=0.721  Sum_probs=77.7

Q ss_pred             cCCCcceEEEcCCCCCCCCCCcEEeeecccCCCcc--ccCCCCcccccCCCCcCCCcccccccCCCCcceecccCccccc
Q psy14657        346 CETAARLTLENNFDLESPPMDFTYIPSSVPRDGVV--VTDDPVIWCECRGNCVSNRDACCSDLNDADFAYSRRTKRLKLE  423 (605)
Q Consensus       346 ~~~~~pi~~~N~vd~~~~p~~F~yi~~~~~~~~~~--~~~~~~~gC~C~~~C~~~~~~Cc~~~~~~~~~y~~~~~~l~~~  423 (605)
                      +++..||+++|+||++.||++|+||++++++.++.  .+.++..||+|.++|.+....+|.+.+++.|+|+. .+++.+.
T Consensus         7 G~E~~pI~~vN~vD~~~~p~~F~Yi~~~~~~~gv~~~~~~~~~~gC~C~~~C~~~~~C~C~~~~~~~~~Y~~-~~~~~~~   85 (98)
T smart00468        7 GKENVPVPLVNEVDEDPPPPDFEYISEYIYGQGVPIDRSPSPLVGCSCSGDCSSSNKCECARKNGGEFAYEL-NGGLRLK   85 (98)
T ss_pred             CccCCCcceEecCCCCCCCCCcEECcceEcCCCcccccCCCCCCCCcCCCCCCCCCcCCcHhhcCCccCccc-CCCEEeC
Confidence            56779999999999999999999999999999975  35677899999999999874148888889999963 3778888


Q ss_pred             CCCceEecCCCCC
Q psy14657        424 KGTPIYECNKKCA  436 (605)
Q Consensus       424 ~~~~i~EC~~~C~  436 (605)
                      .+.+|||||+.|+
T Consensus        86 ~~~~IyECn~~C~   98 (98)
T smart00468       86 RKPLIYECNSRCS   98 (98)
T ss_pred             CCCEEEcCCCCCC
Confidence            9999999999985


No 14 
>COG2940 Proteins containing SET domain [General function prediction only]
Probab=99.56  E-value=6.6e-16  Score=171.25  Aligned_cols=167  Identities=28%  Similarity=0.425  Sum_probs=121.3

Q ss_pred             ecCCCCCCCCCCcCcccccCCCcCEEEEEeCCCCceeEEcCCccCCCcEEEEeeeeeecHHHHHhhcCCc----eeeecc
Q psy14657        430 ECNKKCACDETCLNRVVQKGITLPLTIFKTKNNRGWGVRTPDKIKAGTFVCEYVGEILTHENALQRTNQT----YCFNLD  505 (605)
Q Consensus       430 EC~~~C~C~~~C~Nrv~q~g~~~~l~i~~t~~~kG~Gv~A~~~I~~G~~I~ey~Gevi~~~e~~~r~~~~----y~f~l~  505 (605)
                      ++.........+.|.........+..+... ..+|||+||+..|++|++|.+|.|+++...++..+....    +.+.+.
T Consensus       310 ~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~g~fa~~~i~~~e~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  388 (480)
T COG2940         310 FSKSNVSKLKELLNSNGCKKRREPNVVQES-EIKGYGVFALESIKKGEFIIEYHGEIIRRKEAREREENYDLLGNEFSFG  388 (480)
T ss_pred             cccccCccccchhhhcccccccchhhhhhh-cccccceeehhhccchHHHHHhcCcccchHHHHhhhccccccccccchh
Confidence            344444444446555555555555555555 889999999999999999999999999988887772222    111111


Q ss_pred             CCCCCCCceEEEeeeeccCcccccccCCCCCeeEEEEecCCCCeeEEEEEEcCCCCCCCeEEEecCCCcccccchhhhcC
Q psy14657        506 FNQDSNSVAFVLDAARYGNVSHFINHSCDPNLEVSRINNLNPDLHHVALFAKRDINKNEELSFCYLDLTKAKFTSSKRKK  585 (605)
Q Consensus       506 ~~~~~~~~~~~IDa~~~GN~~RfiNHSC~PN~~~~~~~~~d~~~~~i~~fA~rdI~~GEELT~dY~~~~~~~~~~~~~~~  585 (605)
                      ..   .....++|+...|+++|||||||.||+....+...  +..++.++|+|||.+|||||+||+...+....+. ...
T Consensus       389 ~~---~~~~~~~d~~~~g~~~r~~nHS~~pN~~~~~~~~~--g~~~~~~~~~rDI~~geEl~~dy~~~~~~~~~~~-~~~  462 (480)
T COG2940         389 LL---EDKDKVRDSQKAGDVARFINHSCTPNCEASPIEVN--GIFKISIYAIRDIKAGEELTYDYGPSLEDNRELK-KLL  462 (480)
T ss_pred             hc---cccchhhhhhhcccccceeecCCCCCcceeccccc--ccceeeecccccchhhhhhccccccccccchhhh-hhh
Confidence            11   11267999999999999999999999999855432  3679999999999999999999999876432101 112


Q ss_pred             CCCeeeecCCCCCccccc
Q psy14657        586 LVRNECRCGSSNCLGYYY  603 (605)
Q Consensus       586 ~~~~~C~Cgs~~Crg~l~  603 (605)
                      .....|.||+..|+++|.
T Consensus       463 ~~~~~~~~~~~~~~~~~~  480 (480)
T COG2940         463 EKRWGCACGEDRCSHTMS  480 (480)
T ss_pred             hhhhccccCCCccCCCCC
Confidence            355789999999999873


No 15 
>KOG1141|consensus
Probab=99.51  E-value=3e-15  Score=164.05  Aligned_cols=156  Identities=31%  Similarity=0.586  Sum_probs=121.2

Q ss_pred             cCCCcceEEEcCCCCCCCCCCcEEeeecccCCCccc--cCCCCcccccCCCCcCCCccc-ccccCC--------CCc---
Q psy14657        346 CETAARLTLENNFDLESPPMDFTYIPSSVPRDGVVV--TDDPVIWCECRGNCVSNRDAC-CSDLND--------ADF---  411 (605)
Q Consensus       346 ~~~~~pi~~~N~vd~~~~p~~F~yi~~~~~~~~~~~--~~~~~~gC~C~~~C~~~~~~C-c~~~~~--------~~~---  411 (605)
                      +++.+||...|++|.-.+ +.|.|-...|++.+..+  ..+...+|+|..+|.+..+ | |.++..        +..   
T Consensus       681 g~e~vpis~~neids~~l-pq~ay~K~~ip~~~nl~n~~~~fl~scdc~~gcid~~k-cachQltvk~~~t~p~~~v~~t  758 (1262)
T KOG1141|consen  681 GREHVPISEKNEIDSHRL-PQAAYKKHMIPTNNNLSNRRKDFLQSCDCPTGCIDSMK-CACHQLTVKKKTTGPNQNVAST  758 (1262)
T ss_pred             CccccccceeecccCcCC-ccchhheeeccCCCcccccChhhhhcCCCCcchhhhhh-hhHHHHHHHhhccCCCcccccC
Confidence            466689999999997654 48999999999988654  3456889999999988776 5 443210        000   


Q ss_pred             -ceecccCcccccCCCceEecCCCCCCCCC-CcCcccccCCCcCEEEEEeCCCCceeEEcCCccCCCcEEEEeeeeeecH
Q psy14657        412 -AYSRRTKRLKLEKGTPIYECNKKCACDET-CLNRVVQKGITLPLTIFKTKNNRGWGVRTPDKIKAGTFVCEYVGEILTH  489 (605)
Q Consensus       412 -~y~~~~~~l~~~~~~~i~EC~~~C~C~~~-C~Nrv~q~g~~~~l~i~~t~~~kG~Gv~A~~~I~~G~~I~ey~Gevi~~  489 (605)
                       .|  .++||.-...+.+|||+..|+|.+. |.||++|+|.+.+|++|+| ..+|||++...+|.+|.|||.|.|.++++
T Consensus       759 ~gy--kyKRl~e~~ptg~yEc~k~ckc~~~~C~nrmvqhg~qvRlq~fkt-~~kGWg~rclddi~~g~fVciy~g~~l~~  835 (1262)
T KOG1141|consen  759 NGY--KYKRLIEIRPTGPYECLKACKCCGPDCLNRMVQHGYQVRLQRFKT-IHKGWGRRCLDDITGGNFVCIYPGGALLH  835 (1262)
T ss_pred             cch--hhHHHHHhcCCCHHHHHHhhccCcHHHHHHHhhcCceeEeeeccc-cccccceEeeeecCCceEEEEecchhhhh
Confidence             11  1245544445668999999999877 9999999999999999999 78999999999999999999999999887


Q ss_pred             HHHHhh---cCCceeeeccC
Q psy14657        490 ENALQR---TNQTYCFNLDF  506 (605)
Q Consensus       490 ~e~~~r---~~~~y~f~l~~  506 (605)
                      .-++..   .+..|+..+|.
T Consensus       836 ~~sdks~~~~~~~~~~~id~  855 (1262)
T KOG1141|consen  836 QISDKSEYIHVTRSLLTIDC  855 (1262)
T ss_pred             hhchhhhhcccchhhhcccc
Confidence            766554   44556655553


No 16 
>PF00856 SET:  SET domain;  InterPro: IPR001214 The SET domain appears generally as one part of a larger multidomain protein, and recently there were described three structures of very different proteins with distinct domain compositions: Neurospora crassa DIM-5, a member of the Su(var) family of HKMTs which methylate histone H3 on lysine 9,human SET7 (also called SET9), which methylates H3 on lysine 4 and garden pea Rubisco LSMT, an enzyme that does not modify histones, but instead methylates lysine 14 in the flexible tail of the large subunit of the enzyme Rubisco. The SET domain itself turned out to be an uncommon structure. Although in all three studies, electron density maps revealed the location of the AdoMet or AdoHcy cofactor, the SET domain bears no similarity at all to the canonical/AdoMet-dependent methyltransferase fold. Strictly conserved in the C-terminal motif of the SET domain tyrosine could be involved in abstracting a proton from the protonated amino group of the substrate lysine, promoting its nucleophilic attack on the sulphonium methyl group of the AdoMet cofactor. In contrast to the AdoMet-dependent protein methyltranferases of the classical type, which tend to bind their polypeptide substrates on top of the cofactor, it is noted from the Rubisco LSMT structure that the AdoMet seems to bind in a separate cleft, suggesting how a polypeptide substrate could be subjected to multiple rounds of methylation without having to be released from the enzyme. In contrast, SET7/9 is able to add only a single methyl group to its substrate. It has been demonstrated that association of SET domain and myotubularin-related proteins modulates growth control []. The SET domain-containing Drosophila melanogaster (Fruit fly) protein, enhancer of zeste, has a function in segment determination and the mammalian homologue may be involved in the regulation of gene transcription and chromatin structure. Histone lysine methylation is part of the histone code that regulated chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception [], the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain [, ]. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities [].  The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site []. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity [], []. ; GO: 0005515 protein binding; PDB: 3TG5_A 3S7F_A 3RIB_B 3TG4_A 3S7J_A 3S7D_A 3S7B_A 3H6L_A 3SMT_A 3K5K_A ....
Probab=99.43  E-value=3.7e-13  Score=125.14  Aligned_cols=106  Identities=28%  Similarity=0.303  Sum_probs=71.0

Q ss_pred             ceeEEcCCccCCCcEEEEeeeeeecHHHHHhh--------cCCc-------e----------------------------
Q psy14657        464 GWGVRTPDKIKAGTFVCEYVGEILTHENALQR--------TNQT-------Y----------------------------  500 (605)
Q Consensus       464 G~Gv~A~~~I~~G~~I~ey~Gevi~~~e~~~r--------~~~~-------y----------------------------  500 (605)
                      |+||+|+++|++|++|++..+.+++...+...        ....       .                            
T Consensus         1 GrGl~At~dI~~Ge~I~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   80 (162)
T PF00856_consen    1 GRGLFATRDIKAGEVILIPRPAILTPDEVSPQPELLRLQLSKALEEQSRSDFSIQKKQKAEKSERSPQLESLHSISLRSE   80 (162)
T ss_dssp             SEEEEESS-B-TTEEEEEESEEEEEHHHHHCHHHHSHHTTCSSSCSHHTTHHHHHHHHHHHHHHHHHHHHHHHHHCHTTT
T ss_pred             CEEEEECccCCCCCEEEEECcceEEehhhhhcccchhhhhhhhhcccccccccccccccccccccccccccccccccccc
Confidence            89999999999999999999999987766431        0000       0                            


Q ss_pred             -e----ee--c------cCCCCCCCceEEEeeeeccCcccccccCCCCCeeEEEEecCCCCeeEEEEEEcCCCCCCCeEE
Q psy14657        501 -C----FN--L------DFNQDSNSVAFVLDAARYGNVSHFINHSCDPNLEVSRINNLNPDLHHVALFAKRDINKNEELS  567 (605)
Q Consensus       501 -~----f~--l------~~~~~~~~~~~~IDa~~~GN~~RfiNHSC~PN~~~~~~~~~d~~~~~i~~fA~rdI~~GEELT  567 (605)
                       .    +.  .      .............++.....++.|+||||.|||.+.+...  .....+.|.|.|||++|||||
T Consensus        81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~p~~d~~NHsc~pn~~~~~~~~--~~~~~~~~~a~r~I~~GeEi~  158 (162)
T PF00856_consen   81 LQFSQAFQWSWFISWTRSDFSSRSFSEDDRDGIALYPFADMLNHSCDPNCEVSFDFD--GDGGCLVVRATRDIKKGEEIF  158 (162)
T ss_dssp             CCTCCHHHHHHHHHHHHHEEEEEEETTEEEEEEEEETGGGGSEEESSTSEEEEEEEE--TTTTEEEEEESS-B-TTSBEE
T ss_pred             ccccccccchhhccccceeeeccccccccccccccCcHhHheccccccccceeeEee--cccceEEEEECCccCCCCEEE
Confidence             0    00  0      0000001113355677788899999999999999996521  234599999999999999999


Q ss_pred             EecC
Q psy14657        568 FCYL  571 (605)
Q Consensus       568 ~dY~  571 (605)
                      ++||
T Consensus       159 isYG  162 (162)
T PF00856_consen  159 ISYG  162 (162)
T ss_dssp             EEST
T ss_pred             EEEC
Confidence            9997


No 17 
>PF00385 Chromo:  Chromo (CHRromatin Organisation MOdifier) domain;  InterPro: IPR023780 The CHROMO (CHRromatin Organization MOdifier) domain [, , , ] is a conserved region of around 60 amino acids, originally identified in Drosophila modifiers of variegation. These are proteins that alter the structure of chromatin to the condensed morphology of heterochromatin, a cytologically visible condition where gene expression is repressed. In one of these proteins, Polycomb, the chromo domain has been shown to be important for chromatin targeting.  Proteins that contain a chromo domain appear to fall into 3 classes. The first class includes proteins having an N-terminal chromo domain followed by a region termed the chromo shadow domain, with weak but significant sequence similarity to the N-terminal chromo domain,[], eg. Drosophila and human heterochromatin protein Su(var)205 (HP1). The second class includes proteins with a single chromo domain, eg. Drosophila protein Polycomb (Pc); mammalian modifier 3; human Mi-2 autoantigen and several yeast and Caenorhabditis elegans hypothetical proteins. In the third class paired tandem chromo domains are found, eg. in mammalian DNA-binding/helicase proteins CHD-1 to CHD-4 and yeast protein CHD1. Functional dissections of chromo domain proteins suggests a mechanistic role for chromo domains in targeting chromo domain proteins to specific regions of the nucleus. The mechanism of targeting may involve protein-protein and/or protein/nucleic acid interactions. Hence, several line of evidence show that the HP1 chromo domain is a methyl-specific histone binding module, whereas the chromo domain of two protein components of the drosophila dosage compensation complex, MSL3 and MOF, contain chromo domains that bind to RNA in vitro []. The high resolution structures of HP1-family protein chromo and chromo shadow domain reveal a conserved chromo domain fold motif consisting of three beta strands packed against an alpha helix. The chromo domain fold belongs to the OB (oligonucleotide/oligosaccharide binding)-fold class found in a variety of prokaryotic and eukaryotic nucleic acid binding protein [].; PDB: 2H1E_B 3MWY_W 2DY8_A 1KNE_A 1KNA_A 1Q3L_A 2EE1_A 1AP0_A 1GUW_A 1X3P_A ....
Probab=99.32  E-value=1.6e-12  Score=100.81  Aligned_cols=51  Identities=51%  Similarity=0.864  Sum_probs=45.7

Q ss_pred             eEEeeeccceeecCCcE---EEEEeccCCCCCCCcccccccccch-HHHHHHHHHh
Q psy14657        224 YEVESVLDSLELTSDMT---VYLVKWKNYDPEYNTWEPIENLGNC-AKKLAEFLKA  275 (605)
Q Consensus       224 y~vE~Ild~~~~~~~~~---~YlVkW~gy~~~~~TWEp~~nl~~~-~~li~~f~~~  275 (605)
                      |+||+||++ +..+++.   +|||||+||+.+++||||+++|.++ +++|++|.++
T Consensus         1 ~~Ve~Il~~-r~~~~~~~~~~ylVkW~g~~~~~~tWe~~~~l~~~~~~li~~f~~r   55 (55)
T PF00385_consen    1 YEVERILDH-RVVKGGNKVYEYLVKWKGYPYSENTWEPEENLKNCFPELIEEFEKR   55 (55)
T ss_dssp             EEEEEEEEE-EEETTEESEEEEEEEETTSSGGGEEEEEGGGCSSHCHHHHHHHHHH
T ss_pred             CEEEEEEEE-EEeCCCcccEEEEEEECCCCCCCCeEeeHHHHhHhhHHHHHHHhCC
Confidence            789999999 5555555   9999999999999999999999887 9999999875


No 18 
>cd00024 CHROMO Chromatin organization modifier (chromo) domain is a conserved region of around 50 amino acids found in a variety of chromosomal proteins, which appear to play a role in the functional organization of the eukaryotic nucleus. Experimental evidence implicates the chromo domain in the binding activity of these proteins to methylated histone tails and maybe RNA. May occur as single instance, in a tandem arrangement or followd by a related "chromo shadow" domain.
Probab=99.15  E-value=3.6e-11  Score=92.96  Aligned_cols=52  Identities=48%  Similarity=0.832  Sum_probs=46.8

Q ss_pred             ceeEEeeeccceeecC--CcEEEEEeccCCCCCCCcccccccccchHHHHHHHHH
Q psy14657        222 TEYEVESVLDSLELTS--DMTVYLVKWKNYDPEYNTWEPIENLGNCAKKLAEFLK  274 (605)
Q Consensus       222 ~~y~vE~Ild~~~~~~--~~~~YlVkW~gy~~~~~TWEp~~nl~~~~~li~~f~~  274 (605)
                      ++|+||+||++ +...  +..+|+|||+||+..++||||+++|..++.+|++|.+
T Consensus         1 ~e~~ve~Il~~-r~~~~~~~~~y~VkW~g~~~~~~tWe~~~~l~~~~~~i~~~~~   54 (55)
T cd00024           1 EEYEVEKILDH-RKKKDGGEYEYLVKWKGYSYSEDTWEPEENLEDCKELIDEFKK   54 (55)
T ss_pred             CCceEeeeeee-eecCCCCcEEEEEEECCCCCccCccccHHHhCchHHHHHHHHh
Confidence            46899999999 5555  8899999999999999999999999888899999975


No 19 
>KOG1911|consensus
Probab=98.99  E-value=2.2e-10  Score=117.90  Aligned_cols=67  Identities=33%  Similarity=0.552  Sum_probs=57.6

Q ss_pred             ccccccCCCceeEEeeeccceeecCCcEEEEEeccCCCCCCCcccccccccchHHHHHHHHHhCCCCc
Q psy14657        213 LTKFSSDTMTEYEVESVLDSLELTSDMTVYLVKWKNYDPEYNTWEPIENLGNCAKKLAEFLKAGPDQE  280 (605)
Q Consensus       213 i~~~~~~~~~~y~vE~Ild~~~~~~~~~~YlVkW~gy~~~~~TWEp~~nl~~~~~li~~f~~~~~~~~  280 (605)
                      .......+.++|+||.|+++ +..+|+.+|||+|+||++.+|||||+.++.+|+++|++|.+++....
T Consensus        38 ~~~~~~~~~~~~vvEki~~~-r~~~g~~eYlvkW~Gy~~~~ntWEPee~~~~C~~li~~~~~~~~~~k  104 (270)
T KOG1911|consen   38 VEASEEEEEEEYVVEKILKR-RKKNGKIEYLVKWKGYPDPDNTWEPEEHNLDCPELIDEFEKSQKKLK  104 (270)
T ss_pred             cccccccccchhhhhhhhhc-cccCCCceeeeecCCCCCccccCCchhhccccHHHHHHHHHHhcccC
Confidence            34445666778999999999 88888899999999999999999999966699999999999975433


No 20 
>KOG2748|consensus
Probab=98.98  E-value=1.2e-10  Score=118.59  Aligned_cols=57  Identities=33%  Similarity=0.610  Sum_probs=52.5

Q ss_pred             cCCCceeEEeeeccceeecCCcEEEEEeccCCCCCCCcccccccccchHHHHHHHHHhC
Q psy14657        218 SDTMTEYEVESVLDSLELTSDMTVYLVKWKNYDPEYNTWEPIENLGNCAKKLAEFLKAG  276 (605)
Q Consensus       218 ~~~~~~y~vE~Ild~~~~~~~~~~YlVkW~gy~~~~~TWEp~~nl~~~~~li~~f~~~~  276 (605)
                      +.++..|.+|.||.+ |.++|..+|||||+||+..+|||||++|| -.+.||..|.++.
T Consensus         5 avGe~VfAaEsIlkk-RirKGrvEYlVKWkGWs~kyNTWEPEENI-LDpRLi~AFe~rE   61 (369)
T KOG2748|consen    5 AVGERVFAAESILKK-RIRKGRVEYLVKWKGWSQKYNTWEPEENI-LDPRLIAAFEQRE   61 (369)
T ss_pred             cchhhHHHHHHHHHH-HhhccceEEEEEecccccccCccCccccc-cCHHHHHHHHhhh
Confidence            445677999999999 99999999999999999999999999999 6799999999885


No 21 
>smart00298 CHROMO Chromatin organization modifier domain.
Probab=98.84  E-value=1.9e-09  Score=83.18  Aligned_cols=54  Identities=50%  Similarity=0.777  Sum_probs=47.3

Q ss_pred             eeEEeeeccceeecCCcEEEEEeccCCCCCCCcccccccccchHHHHHHHHHhC
Q psy14657        223 EYEVESVLDSLELTSDMTVYLVKWKNYDPEYNTWEPIENLGNCAKKLAEFLKAG  276 (605)
Q Consensus       223 ~y~vE~Ild~~~~~~~~~~YlVkW~gy~~~~~TWEp~~nl~~~~~li~~f~~~~  276 (605)
                      .|.||+|++++...++..+|||||+|++..++||+|.++|..++++|.+|..+.
T Consensus         1 ~~~v~~Il~~r~~~~~~~~ylVkW~g~~~~~~tW~~~~~l~~~~~~v~~~~~~~   54 (55)
T smart00298        1 EYEVEKILDHRWKKKGELEYLVKWKGYSYSEDTWEPEENLLNCSKKLDNYKKKE   54 (55)
T ss_pred             CcchheeeeeeecCCCcEEEEEEECCCCCccCceeeHHHHHHHHHHHHHHHHhh
Confidence            367999999932778889999999999999999999999966999999998763


No 22 
>KOG2589|consensus
Probab=98.73  E-value=7.6e-09  Score=106.10  Aligned_cols=103  Identities=20%  Similarity=0.250  Sum_probs=73.8

Q ss_pred             CCceeEEcCCccCCCcEEEEeeeeeecHHHHHhh---cCCceeeeccCCCCCCCceEEEeeeeccCcccccccCCCCCee
Q psy14657        462 NRGWGVRTPDKIKAGTFVCEYVGEILTHENALQR---TNQTYCFNLDFNQDSNSVAFVLDAARYGNVSHFINHSCDPNLE  538 (605)
Q Consensus       462 ~kG~Gv~A~~~I~~G~~I~ey~Gevi~~~e~~~r---~~~~y~f~l~~~~~~~~~~~~IDa~~~GN~~RfiNHSC~PN~~  538 (605)
                      ..|--|.|++.+.+|+=|--.+|-|+...+++++   .+..-.|.+-+..-..-..+++      ..|+||||.|.|||.
T Consensus       136 ~~gAkivst~~w~~ndkIe~LvGcIaeLse~eE~~ll~~g~nDFSvmyStRk~caqLwL------GPaafINHDCrpnCk  209 (453)
T KOG2589|consen  136 QNGAKIVSTKSWSRNDKIELLVGCIAELSEAEERSLLRGGGNDFSVMYSTRKRCAQLWL------GPAAFINHDCRPNCK  209 (453)
T ss_pred             CCCceEEeeccccCCccHHHhhhhhhhcChhhhHHHHhccCCceeeeeecccchhhhee------ccHHhhcCCCCCCce
Confidence            4677788899999999999999999877666665   1122222222221111112223      257899999999999


Q ss_pred             EEEEecCCCCeeEEEEEEcCCCCCCCeEEEecCCCcc
Q psy14657        539 VSRINNLNPDLHHVALFAKRDINKNEELSFCYLDLTK  575 (605)
Q Consensus       539 ~~~~~~~d~~~~~i~~fA~rdI~~GEELT~dY~~~~~  575 (605)
                      +.-.     +..++.+.++|||+||||||--|+..+.
T Consensus       210 Fvs~-----g~~tacvkvlRDIePGeEITcFYgs~fF  241 (453)
T KOG2589|consen  210 FVST-----GRDTACVKVLRDIEPGEEITCFYGSGFF  241 (453)
T ss_pred             eecC-----CCceeeeehhhcCCCCceeEEeeccccc
Confidence            9622     4458999999999999999999999875


No 23 
>KOG1081|consensus
Probab=98.67  E-value=7.3e-09  Score=113.63  Aligned_cols=140  Identities=31%  Similarity=0.528  Sum_probs=102.5

Q ss_pred             Eec-CCCCCCCCCCcCcccccCCCcCEEEEEeCCCCceeEEcCCccCCCcEEEEeeeeeecHHHHHhh------cCCcee
Q psy14657        429 YEC-NKKCACDETCLNRVVQKGITLPLTIFKTKNNRGWGVRTPDKIKAGTFVCEYVGEILTHENALQR------TNQTYC  501 (605)
Q Consensus       429 ~EC-~~~C~C~~~C~Nrv~q~g~~~~l~i~~t~~~kG~Gv~A~~~I~~G~~I~ey~Gevi~~~e~~~r------~~~~y~  501 (605)
                      ++| +..|.+...|.|+-.-......       +..    +|..+|.+|      +|+++...+...+      ....-.
T Consensus       290 ~~~~p~~~~~~~~~~~~~~sk~~~~e-------~~~----~~~~~~~k~------vg~~i~~~e~~~~~~~~~~~~~~~~  352 (463)
T KOG1081|consen  290 YEVHPKVCSAEERCHNQQFSKESYPE-------PQK----TAKADIRKG------VGEVIDDKECKARLQRVKESDLVDF  352 (463)
T ss_pred             hhhcccccccccccccchhhhhcccc-------cch----hhHHhhhcc------cCcccchhhheeehhhhhccchhhh
Confidence            444 7889999889888764443333       111    788888888      8999988777655      111111


Q ss_pred             eeccCCCCCCCceEEEeeeeccCcccccccCCCCCeeEE-EEecCCCCeeEEEEEEcCCCCCCCeEEEecCCCcccccch
Q psy14657        502 FNLDFNQDSNSVAFVLDAARYGNVSHFINHSCDPNLEVS-RINNLNPDLHHVALFAKRDINKNEELSFCYLDLTKAKFTS  580 (605)
Q Consensus       502 f~l~~~~~~~~~~~~IDa~~~GN~~RfiNHSC~PN~~~~-~~~~~d~~~~~i~~fA~rdI~~GEELT~dY~~~~~~~~~~  580 (605)
                      +.....   .. . .||+..+||.+||+||||.||+.-. |...   +..++.++|.+.|++|+|||++|.....     
T Consensus       353 ~~~~~e---~~-~-~id~~~~~n~sr~~nh~~~~~v~~~k~~~~---~~t~~~~~a~~~i~~g~e~t~~~n~~~~-----  419 (463)
T KOG1081|consen  353 YMVFIQ---KD-R-IIDAGPKGNYSRFLNHSCQPNVETEKWQVI---GDTRVGLFAPRQIEAGEELTFNYNGNCE-----  419 (463)
T ss_pred             hhhhhh---cc-c-ccccccccchhhhhcccCCCceeechhhee---cccccccccccccccchhhhheeecccc-----
Confidence            111111   11 2 8999999999999999999999988 6543   5679999999999999999999987532     


Q ss_pred             hhhcCCCCeeeecCCCCCccccc
Q psy14657        581 SKRKKLVRNECRCGSSNCLGYYY  603 (605)
Q Consensus       581 ~~~~~~~~~~C~Cgs~~Crg~l~  603 (605)
                           ...+.|.||+.+|.+++-
T Consensus       420 -----~~~~~~~~~~e~~~~~~~  437 (463)
T KOG1081|consen  420 -----GNEKRCCCGSENCTETKG  437 (463)
T ss_pred             -----CCcceEeecccccccCCc
Confidence                 134689999999998763


No 24 
>KOG2461|consensus
Probab=98.22  E-value=1.3e-06  Score=93.87  Aligned_cols=116  Identities=23%  Similarity=0.254  Sum_probs=87.7

Q ss_pred             CCcCEEEEEeC-CCCceeEEcCCccCCCcEEEEeeeeeecHHHHHhhcCCceeeeccCCCCCCCceEEEeee--eccCcc
Q psy14657        450 ITLPLTIFKTK-NNRGWGVRTPDKIKAGTFVCEYVGEILTHENALQRTNQTYCFNLDFNQDSNSVAFVLDAA--RYGNVS  526 (605)
Q Consensus       450 ~~~~l~i~~t~-~~kG~Gv~A~~~I~~G~~I~ey~Gevi~~~e~~~r~~~~y~f~l~~~~~~~~~~~~IDa~--~~GN~~  526 (605)
                      ....|.|..++ +..|.||.+...|++|+--+-|.|+++ ..+..+..+..|++.+...   +...++||++  ...|+.
T Consensus        26 LP~~l~i~~Ssv~~~~lgV~s~~~i~~G~~FGP~~G~~~-~~~~~~~~n~~y~W~I~~~---d~~~~~iDg~d~~~sNWm  101 (396)
T KOG2461|consen   26 LPPELRIKPSSVPVTGLGVWSNASILPGTSFGPFEGEII-ASIDSKSANNRYMWEIFSS---DNGYEYIDGTDEEHSNWM  101 (396)
T ss_pred             CCCceEeeccccCCccccccccccccCcccccCccCccc-cccccccccCcceEEEEeC---CCceEEeccCChhhccee
Confidence            44556666654 667899999999999999999999982 2222233456677766542   2457899987  478999


Q ss_pred             cccccCCC---CCeeEEEEecCCCCeeEEEEEEcCCCCCCCeEEEecCCCcc
Q psy14657        527 HFINHSCD---PNLEVSRINNLNPDLHHVALFAKRDINKNEELSFCYLDLTK  575 (605)
Q Consensus       527 RfiNHSC~---PN~~~~~~~~~d~~~~~i~~fA~rdI~~GEELT~dY~~~~~  575 (605)
                      ||+|-.++   -|+...-+      ...|.++|+|+|.+||||.++|+.++.
T Consensus       102 RYV~~Ar~~eeQNL~A~Q~------~~~Ifyrt~r~I~p~eELlVWY~~e~~  147 (396)
T KOG2461|consen  102 RYVNSARSEEEQNLLAFQI------GENIFYRTIRDIRPNEELLVWYGSEYA  147 (396)
T ss_pred             eeecccCChhhhhHHHHhc------cCceEEEecccCCCCCeEEEEeccchH
Confidence            99999884   47766411      237889999999999999999999874


No 25 
>KOG4442|consensus
Probab=96.60  E-value=0.0017  Score=72.68  Aligned_cols=57  Identities=21%  Similarity=0.331  Sum_probs=49.6

Q ss_pred             ccCCC-CcC-CCcCCccceeeeCceeeeeEEEeeccCCceeEEEeecccccccccCCCceeEEeeeccc
Q psy14657        166 YECNK-KCA-CDETCLNRVVQKGNLVQDCHTTITSRDGSVVLLRVADINLTKFSSDTMTEYEVESVLDS  232 (605)
Q Consensus       166 yECn~-~C~-C~~~C~NRVvQ~G~~~~L~vf~~~~~kG~~~~~~~~~I~i~~~~~~~~~~y~vE~Ild~  232 (605)
                      .||++ .|. |+..|.|+--|+----+++||+ +.++|||         ++....++.+.|+.|||-..
T Consensus        95 iECs~~~C~~cg~~C~NQRFQkkqyA~vevF~-Te~KG~G---------LRA~~dI~~g~FI~EY~GEV  153 (729)
T KOG4442|consen   95 IECSDRECPRCGVYCKNQRFQKKQYAKVEVFL-TEKKGCG---------LRAEEDIPKGQFILEYIGEV  153 (729)
T ss_pred             cccCCccCCCccccccchhhhhhccCceeEEE-ecCcccc---------eeeccccCCCcEEeeecccc
Confidence            49999 999 9999999999999999999999 5589999         66677777888888887654


No 26 
>smart00508 PostSET Cysteine-rich motif following a subset of SET domains.
Probab=96.41  E-value=0.0016  Score=42.13  Aligned_cols=16  Identities=38%  Similarity=1.095  Sum_probs=14.2

Q ss_pred             CeeeecCCCCCccccc
Q psy14657        588 RNECRCGSSNCLGYYY  603 (605)
Q Consensus       588 ~~~C~Cgs~~Crg~l~  603 (605)
                      .+.|+|||++|||+|.
T Consensus         2 ~~~C~CGs~~CRG~l~   17 (26)
T smart00508        2 KQPCLCGAPNCRGFLG   17 (26)
T ss_pred             CeeeeCCCccccceec
Confidence            4689999999999994


No 27 
>KOG0384|consensus
Probab=95.39  E-value=0.0084  Score=71.17  Aligned_cols=51  Identities=27%  Similarity=0.561  Sum_probs=43.2

Q ss_pred             eEEeeeccceeecCCcEEEEEeccCCCCCCCcccccccccc-hHHHHHHHHHhC
Q psy14657        224 YEVESVLDSLELTSDMTVYLVKWKNYDPEYNTWEPIENLGN-CAKKLAEFLKAG  276 (605)
Q Consensus       224 y~vE~Ild~~~~~~~~~~YlVkW~gy~~~~~TWEp~~nl~~-~~~li~~f~~~~  276 (605)
                      -.||+|+++ ..... .+|||||+|-|-++.|||+++.+.. .+..+++|..+.
T Consensus       286 ~~VdRIia~-~~~~d-~eYLvKW~~LpY~e~TWE~~~~I~~~~~~~~~~~~~Re  337 (1373)
T KOG0384|consen  286 VIVDRIIAE-QTSKD-PEYLVKWRGLPYEECTWEDAEDIAKKAQEEIEEFQSRE  337 (1373)
T ss_pred             hhhhhhhhc-ccCCC-ceeEEEecCCCcccccccchhhhhhhHHHHHHHHhhhh
Confidence            389999999 54444 8999999999999999999999965 567788888875


No 28 
>cd00034 ChSh Chromo Shadow Domain,  found in association with N-terminal chromo (CHRromatin Organization MOdifier) domain; Chromo domains mediate the interaction of the heterochromatin with other heterochromatin proteins, thereby affecting chromatin structure (e.g. Drosophila and human heterochromatin protein (HP1) and mammalian modifier 1 and modifier 2)
Probab=94.00  E-value=0.04  Score=42.51  Aligned_cols=48  Identities=19%  Similarity=0.376  Sum_probs=36.8

Q ss_pred             EeeeccceeecC-CcEEEEEeccCCCCCCCcccccccc-cchHHHHHHHHHhCC
Q psy14657        226 VESVLDSLELTS-DMTVYLVKWKNYDPEYNTWEPIENL-GNCAKKLAEFLKAGP  277 (605)
Q Consensus       226 vE~Ild~~~~~~-~~~~YlVkW~gy~~~~~TWEp~~nl-~~~~~li~~f~~~~~  277 (605)
                      ++.|+.. ...+ |...||++|+|   ++.++.|...+ ..||.+|-+|++++.
T Consensus         2 ~~~I~ga-t~~~~g~l~fl~kwk~---~~~~lVp~~~~~~k~P~~vI~FYE~~l   51 (54)
T cd00034           2 VKPISGA-SKSDDGELTFLAKWKD---GQASLVPNKELNVKCPLLVISFYEEHL   51 (54)
T ss_pred             ceEEEEE-EEcCCCeEEEEEEEeC---CeEEEEEHHHHHhhCcHHHHHHHHHhc
Confidence            4666665 4444 89999999999   44568887777 369999999998864


No 29 
>PF01393 Chromo_shadow:  Chromo shadow domain Web page maintained by Rein Aasland;  InterPro: IPR008251 Chromo shadow domain is distantly related to chromo domain. It is always found in association with a chromo domain.  The CHROMO (CHRromatin Organization MOdifier) domain [, , , ] is a conserved region of around 60 amino acids, originally identified in Drosophila modifiers of variegation. These are proteins that alter the structure of chromatin to the condensed morphology of heterochromatin, a cytologically visible condition where gene expression is repressed. In one of these proteins, Polycomb, the chromo domain has been shown to be important for chromatin targeting. Proteins that contain a chromo domain appear to fall into 3 classes. The first class includes proteins having an N-terminal chromo domain followed by a region termed the chromo shadow domain [], eg. Drosophila and human heterochromatin protein Su(var)205 (HP1); and mammalian modifier 1 and modifier 2. The second class includes proteins with a single chromo domain, eg. Drosophila protein Polycomb (Pc); mammalian modifier 3; human Mi-2 autoantigenand and several yeast and Caenorhabditis elegans hypothetical proteins. In the third class paired tandem chromo domains are found, eg. in mammalian DNA-binding/helicase proteins CHD-1 to CHD-4 and yeast protein CHD1.; GO: 0005634 nucleus; PDB: 3Q6S_C 2FMM_B 3P7J_B 1E0B_B 3I3C_A 1DZ1_B 1S4Z_A 3KUP_D.
Probab=93.49  E-value=0.067  Score=41.89  Aligned_cols=50  Identities=24%  Similarity=0.466  Sum_probs=40.6

Q ss_pred             EEeeeccceeecCCcEEEEEeccCCCCCCCcccccccc-cchHHHHHHHHHhCC
Q psy14657        225 EVESVLDSLELTSDMTVYLVKWKNYDPEYNTWEPIENL-GNCAKKLAEFLKAGP  277 (605)
Q Consensus       225 ~vE~Ild~~~~~~~~~~YlVkW~gy~~~~~TWEp~~nl-~~~~~li~~f~~~~~  277 (605)
                      ..|.|++. ....|...|||||+|-+  +.++.|...+ ..||.+|-+|++++.
T Consensus         4 ~~E~Ivg~-~d~~G~l~~likwk~~~--~~~~v~~~~~~~k~Pq~vI~FYE~~l   54 (58)
T PF01393_consen    4 EWEKIVGA-TDTNGELMFLIKWKNSG--EKDLVPSKEANEKCPQKVIKFYESHL   54 (58)
T ss_dssp             TEEEEEEE-EECTSSEEEEEEETTSS--SEEEEEHHHHHHHSHHHHHHHHHHTC
T ss_pred             ChHHHhee-ecCCCcEEEEEEECCCC--ceEEeeHHHHHHHCcHHHHHHHHHHe
Confidence            46888888 67779999999999974  4678887765 369999999999874


No 30 
>KOG2084|consensus
Probab=93.04  E-value=0.12  Score=57.04  Aligned_cols=62  Identities=27%  Similarity=0.440  Sum_probs=42.6

Q ss_pred             ccccCCCCCeeEEEEecCCCCeeEEEEEEcCCCCCCC-eEEEecCCCccccc-chhhhcCCCCeeeecCC
Q psy14657        528 FINHSCDPNLEVSRINNLNPDLHHVALFAKRDINKNE-ELSFCYLDLTKAKF-TSSKRKKLVRNECRCGS  595 (605)
Q Consensus       528 fiNHSC~PN~~~~~~~~~d~~~~~i~~fA~rdI~~GE-ELT~dY~~~~~~~~-~~~~~~~~~~~~C~Cgs  595 (605)
                      ++||||.||+.+.+..      ....+.+...+.+++ ||++.|....+... +.......+.|.|.|+.
T Consensus       208 ~~~hsC~pn~~~~~~~------~~~~~~~~~~~~~~~~~l~~~y~~~~~~~~~r~~~l~~~~~f~c~c~r  271 (482)
T KOG2084|consen  208 LFNHSCFPNISVIFDG------RGLALLVPAGIDAGEEELTISYTDPLLSTASRQKQLRQSKLFSCQCPR  271 (482)
T ss_pred             hcccCCCCCeEEEECC------ceeEEEeecccCCCCCEEEEeecccccCHHHHHHHHhhccceeeecCC
Confidence            7999999999965432      255667778888876 99999998765332 22233444447777754


No 31 
>smart00300 ChSh Chromo Shadow Domain.
Probab=93.03  E-value=0.048  Score=43.15  Aligned_cols=49  Identities=18%  Similarity=0.415  Sum_probs=39.8

Q ss_pred             EEeeeccceeecCCcEEEEEeccCCCCCCCcccccccc-cchHHHHHHHHHhCC
Q psy14657        225 EVESVLDSLELTSDMTVYLVKWKNYDPEYNTWEPIENL-GNCAKKLAEFLKAGP  277 (605)
Q Consensus       225 ~vE~Ild~~~~~~~~~~YlVkW~gy~~~~~TWEp~~nl-~~~~~li~~f~~~~~  277 (605)
                      ..|+|+.. ...+|...||++|+|   ++.++.|...+ ..||.+|-+|++++.
T Consensus         8 ~~e~Ivg~-~d~~G~l~flikwk~---~~~~lVp~~~~~~k~P~~vI~FYE~~l   57 (61)
T smart00300        8 SWEDIVGI-TKDDGELTFLIKWKD---DAASLVPNKEANVKCPQKVIRFYESHL   57 (61)
T ss_pred             CHHHHhce-ecCCCeEEEEEEEeC---CcEEEEEHHHHHHHChHHHHHHHHHhC
Confidence            45777777 667999999999999   45678888777 369999999998874


No 32 
>smart00570 AWS associated with SET domains. subdomain of PRESET
Probab=92.79  E-value=0.062  Score=40.82  Aligned_cols=24  Identities=33%  Similarity=0.749  Sum_probs=21.3

Q ss_pred             ceEecCCCCCCCCCCcCcccccCC
Q psy14657        427 PIYECNKKCACDETCLNRVVQKGI  450 (605)
Q Consensus       427 ~i~EC~~~C~C~~~C~Nrv~q~g~  450 (605)
                      ..+||++.|.|+..|.||.+|+..
T Consensus        27 l~~EC~~~C~~G~~C~NqrFqk~~   50 (51)
T smart00570       27 LLIECSSDCPCGSYCSNQRFQKRQ   50 (51)
T ss_pred             HhhhcCCCCCCCcCccCcccccCc
Confidence            458999999999999999999864


No 33 
>PF11717 Tudor-knot:  RNA binding activity-knot of a chromodomain ; PDB: 2EKO_A 2RO0_A 2RNZ_A 1WGS_A 3E9G_A 3E9F_A 2K3X_A 2K3Y_A 2EFI_A 2F5K_F ....
Probab=91.92  E-value=0.29  Score=37.79  Aligned_cols=41  Identities=15%  Similarity=0.396  Sum_probs=33.9

Q ss_pred             CceeEEeeeccceeecCCcEEEEEeccCCCCCCCcccccccc
Q psy14657        221 MTEYEVESVLDSLELTSDMTVYLVKWKNYDPEYNTWEPIENL  262 (605)
Q Consensus       221 ~~~y~vE~Ild~~~~~~~~~~YlVkW~gy~~~~~TWEp~~nl  262 (605)
                      .+.+---+|++. +..++..+|+|.+.||+...|-|.|.++|
T Consensus        13 ~~~~y~A~I~~~-r~~~~~~~YyVHY~g~nkR~DeWV~~~~i   53 (55)
T PF11717_consen   13 DGQWYEAKILDI-REKNGEPEYYVHYQGWNKRLDEWVPESRI   53 (55)
T ss_dssp             TTEEEEEEEEEE-EECTTCEEEEEEETTSTGCC-EEEETTTE
T ss_pred             CCcEEEEEEEEE-EecCCCEEEEEEcCCCCCCceeeecHHHc
Confidence            345555677788 77888899999999999999999999987


No 34 
>PF14061 Mtf2_C:  Polycomb-like MTF2 factor 2
Probab=89.75  E-value=0.33  Score=36.18  Aligned_cols=22  Identities=27%  Similarity=0.445  Sum_probs=18.1

Q ss_pred             eccceeecCCcEEEEEeccCCC
Q psy14657        229 VLDSLELTSDMTVYLVKWKNYD  250 (605)
Q Consensus       229 Ild~~~~~~~~~~YlVkW~gy~  250 (605)
                      |+..|...+|+.+|||.|.|.+
T Consensus        28 VlArRV~~dG~vQYLvEWeg~~   49 (50)
T PF14061_consen   28 VLARRVTPDGKVQYLVEWEGAT   49 (50)
T ss_pred             EEEEEEcCCCcEEEEEEecCcC
Confidence            4666677789999999999974


No 35 
>KOG0384|consensus
Probab=88.23  E-value=0.2  Score=60.02  Aligned_cols=39  Identities=33%  Similarity=0.507  Sum_probs=31.0

Q ss_pred             CcEEEEEeccCCCCCCCccccccccc--c--hHHHHHHHHHhC
Q psy14657        238 DMTVYLVKWKNYDPEYNTWEPIENLG--N--CAKKLAEFLKAG  276 (605)
Q Consensus       238 ~~~~YlVkW~gy~~~~~TWEp~~nl~--~--~~~li~~f~~~~  276 (605)
                      +..+|||||+||+.-++|||++++|.  +  -...|+.|.++.
T Consensus       224 e~~qFlIKWkg~SyLHctWet~~~L~~~~~rG~kKv~nf~kK~  266 (1373)
T KOG0384|consen  224 EEEQFLIKWKGWSYLHCTWETESELLEMNVRGLKKVDNFKKKV  266 (1373)
T ss_pred             hhhhhheeeccccceeccccchHHHHhhhHHHHHHHHHHHHHH
Confidence            34689999999999999999999975  2  234577777774


No 36 
>smart00570 AWS associated with SET domains. subdomain of PRESET
Probab=87.76  E-value=0.27  Score=37.40  Aligned_cols=23  Identities=35%  Similarity=0.808  Sum_probs=20.5

Q ss_pred             ccccCCCCcCCCcCCccceeeeC
Q psy14657        164 PIYECNKKCACDETCLNRVVQKG  186 (605)
Q Consensus       164 ~IyECn~~C~C~~~C~NRVvQ~G  186 (605)
                      ..+||.+.|.|+..|.||.-|+.
T Consensus        27 l~~EC~~~C~~G~~C~NqrFqk~   49 (51)
T smart00570       27 LLIECSSDCPCGSYCSNQRFQKR   49 (51)
T ss_pred             HhhhcCCCCCCCcCccCcccccC
Confidence            36799999999999999999974


No 37 
>KOG1337|consensus
Probab=73.18  E-value=2.7  Score=46.94  Aligned_cols=42  Identities=29%  Similarity=0.298  Sum_probs=32.0

Q ss_pred             ccccCCCCCeeEEEEecCCCCeeEEEEEEcCCCCCCCeEEEecCCCc
Q psy14657        528 FINHSCDPNLEVSRINNLNPDLHHVALFAKRDINKNEELSFCYLDLT  574 (605)
Q Consensus       528 fiNHSC~PN~~~~~~~~~d~~~~~i~~fA~rdI~~GEELT~dY~~~~  574 (605)
                      +.||++.+...  .....   .+.+.+++.++|.+||||++.||...
T Consensus       239 ~~NH~~~~~~~--~~~~~---d~~~~l~~~~~v~~geevfi~YG~~~  280 (472)
T KOG1337|consen  239 LLNHSPEVIKA--GYNQE---DEAVELVAERDVSAGEEVFINYGPKS  280 (472)
T ss_pred             hhccCchhccc--cccCC---CCcEEEEEeeeecCCCeEEEecCCCc
Confidence            68999998221  22222   23888999999999999999999854


No 38 
>KOG1079|consensus
Probab=63.42  E-value=4.3  Score=46.16  Aligned_cols=39  Identities=21%  Similarity=0.347  Sum_probs=29.4

Q ss_pred             ccccCC-CCcCC----------CcCCccceeeeCceeeeeEEEeeccCCce
Q psy14657        164 PIYECN-KKCAC----------DETCLNRVVQKGNLVQDCHTTITSRDGSV  203 (605)
Q Consensus       164 ~IyECn-~~C~C----------~~~C~NRVvQ~G~~~~L~vf~~~~~kG~~  203 (605)
                      ..-||. ..|.+          ..+|.|--+|+|.+-++-+--.. =.|||
T Consensus       558 A~rECdPd~Cl~cg~~~~~d~~~~~C~N~~l~~~~qkr~llapSd-VaGwG  607 (739)
T KOG1079|consen  558 AVRECDPDVCLMCGNVDHFDSSKISCKNTNLQRGEQKRVLLAPSD-VAGWG  607 (739)
T ss_pred             hccccCchHHhccCcccccccCccccccchhhhhhhcceeechhh-ccccc
Confidence            367898 66765          24799999999999998777533 34888


No 39 
>KOG1338|consensus
Probab=44.10  E-value=15  Score=39.49  Aligned_cols=40  Identities=25%  Similarity=0.424  Sum_probs=30.8

Q ss_pred             cccccccC---CCCCeeEEEEecCCCCeeEEEEEEcCCCCCCCeEEEecCC
Q psy14657        525 VSHFINHS---CDPNLEVSRINNLNPDLHHVALFAKRDINKNEELSFCYLD  572 (605)
Q Consensus       525 ~~RfiNHS---C~PN~~~~~~~~~d~~~~~i~~fA~rdI~~GEELT~dY~~  572 (605)
                      .+-|+||-   |..|+.+        +..-+-+.|.|+|++|+|+.-.||-
T Consensus       218 ~ad~lNhd~~k~nanl~y--------~~NcL~mva~r~iekgdev~n~dg~  260 (466)
T KOG1338|consen  218 IADFLNHDGLKANANLRY--------EDNCLEMVADRNIEKGDEVDNSDGL  260 (466)
T ss_pred             hhhhhccchhhcccceec--------cCcceeeeecCCCCCcccccccccc
Confidence            56689995   5666555        2235678999999999999999973


No 40 
>cd01395 HMT_MBD Methyl-CpG binding domains (MBD) present in putative histone methyltransferases (HMT) such as CLLD8 and SETDB1 proteins; CLLD8 contains a MBD, a PreSET and a bifurcated SET domain, suggesting that CLLD8 might be associated with methylation-mediated transcriptional repression. SETDB1 and other proteins in this group have a similar domain architecture. SETDB1 is a novel KAP-1-associated histone H3, lysine 9-specific methyltransferase that contributes to HP1-mediated silencing of euchromatic genes by KRAB zinc-finger proteins.
Probab=43.42  E-value=7.2  Score=30.77  Aligned_cols=32  Identities=16%  Similarity=0.247  Sum_probs=24.9

Q ss_pred             eeecCChhhhh--hHHHHHHhhhcccCCCcchhHHHHHHH
Q psy14657         20 TLIEPNADEVH--RTVSRFLFNRKLLSSAKDFAYAHMLLL   57 (605)
Q Consensus        20 ~~~~~~~~~~~--~~~~~~l~~~~~~~~~~~~~~~~~~~~   57 (605)
                      .--.|..+-+|  .||++||.+     + ++++..++|+|
T Consensus        27 ~Y~aPCGr~Lr~~~EV~~YL~~-----t-~~~L~~d~FsF   60 (60)
T cd01395          27 IYKAPCGRSLRNMSEVHRYLRE-----T-CSFLTVDNFSF   60 (60)
T ss_pred             EEECCcchhhhcHHHHHHHHHh-----c-cccceeecccC
Confidence            34567777544  699999999     7 99999998864


No 41 
>PF08666 SAF:  SAF domain;  InterPro: IPR013974  This entry includes a range of different proteins, such as antifreeze proteins, flagellar FlgA proteins, and CpaB pilus proteins. ; PDB: 1C89_A 3NLA_A 3RDN_A 1C8A_A 3FRN_A 1WVO_A 3K3S_H 3G8R_B 1XUU_A 1XUZ_A ....
Probab=43.07  E-value=15  Score=28.33  Aligned_cols=16  Identities=25%  Similarity=0.345  Sum_probs=11.7

Q ss_pred             EEEEcCCCCCCCeEEE
Q psy14657        553 ALFAKRDINKNEELSF  568 (605)
Q Consensus       553 ~~fA~rdI~~GEELT~  568 (605)
                      .++|.|||++|+.|+-
T Consensus         3 vvVA~~di~~G~~i~~   18 (63)
T PF08666_consen    3 VVVAARDIPAGTVITA   18 (63)
T ss_dssp             EEEESSTB-TT-BECT
T ss_pred             EEEEeCccCCCCEEcc
Confidence            4789999999999853


No 42 
>KOG2155|consensus
Probab=31.70  E-value=24  Score=38.50  Aligned_cols=56  Identities=18%  Similarity=0.361  Sum_probs=40.0

Q ss_pred             ceEEEeeeeccCcccccccCCCCCeeEE-EEecCCCCeeEEEEEEcCCCCCCCeEEEecCCCc
Q psy14657        513 VAFVLDAARYGNVSHFINHSCDPNLEVS-RINNLNPDLHHVALFAKRDINKNEELSFCYLDLT  574 (605)
Q Consensus       513 ~~~~IDa~~~GN~~RfiNHSC~PN~~~~-~~~~~d~~~~~i~~fA~rdI~~GEELT~dY~~~~  574 (605)
                      ..|++|-     ++.-+.||-.||..+. +. -+...-..-.++-+|+...|||+|-|+..+.
T Consensus       197 vWYvMDe-----fGsrvrHsdePnf~~aPf~-fmPq~vaYsimwp~k~~~tgeE~trDfasg~  253 (631)
T KOG2155|consen  197 VWYVMDE-----FGSRVRHSDEPNFRIAPFM-FMPQNVAYSIMWPTKPVNTGEEITRDFASGV  253 (631)
T ss_pred             eeEEHhh-----hhhhhccCCCCcceeeehe-ecchhcceeEEeeccCCCCchHHHHHHhhcC
Confidence            4556653     4556899999999988 21 1112344556899999999999999987654


No 43 
>KOG0383|consensus
Probab=29.58  E-value=21  Score=41.71  Aligned_cols=55  Identities=18%  Similarity=0.304  Sum_probs=44.2

Q ss_pred             eeEEeeeccceeecCCcEEEEEeccCCCCCCCcccccc-cccchHHHHHHHHHhCC
Q psy14657        223 EYEVESVLDSLELTSDMTVYLVKWKNYDPEYNTWEPIE-NLGNCAKKLAEFLKAGP  277 (605)
Q Consensus       223 ~y~vE~Ild~~~~~~~~~~YlVkW~gy~~~~~TWEp~~-nl~~~~~li~~f~~~~~  277 (605)
                      .-.+-.|+.++....+...|+|+|+.-+.++.+||... ++.+....+++|+....
T Consensus       208 ~~~i~rii~~~~s~~~~~~~~Vk~k~l~~d~~~~e~~~~~ip~~~~~~qe~~~~~~  263 (696)
T KOG0383|consen  208 WMPIARIINRRSSQKGATDYLVKWKELSYDEQEWEVEDPDIPGYSSAGQEAWHHRE  263 (696)
T ss_pred             ccccchhhhhhcccccceeeEeeeccCCccccCCCcCCCCcccCcccccccccccC
Confidence            34566777775677889999999999999999999998 66777778888876553


No 44 
>PF00856 SET:  SET domain;  InterPro: IPR001214 The SET domain appears generally as one part of a larger multidomain protein, and recently there were described three structures of very different proteins with distinct domain compositions: Neurospora crassa DIM-5, a member of the Su(var) family of HKMTs which methylate histone H3 on lysine 9,human SET7 (also called SET9), which methylates H3 on lysine 4 and garden pea Rubisco LSMT, an enzyme that does not modify histones, but instead methylates lysine 14 in the flexible tail of the large subunit of the enzyme Rubisco. The SET domain itself turned out to be an uncommon structure. Although in all three studies, electron density maps revealed the location of the AdoMet or AdoHcy cofactor, the SET domain bears no similarity at all to the canonical/AdoMet-dependent methyltransferase fold. Strictly conserved in the C-terminal motif of the SET domain tyrosine could be involved in abstracting a proton from the protonated amino group of the substrate lysine, promoting its nucleophilic attack on the sulphonium methyl group of the AdoMet cofactor. In contrast to the AdoMet-dependent protein methyltranferases of the classical type, which tend to bind their polypeptide substrates on top of the cofactor, it is noted from the Rubisco LSMT structure that the AdoMet seems to bind in a separate cleft, suggesting how a polypeptide substrate could be subjected to multiple rounds of methylation without having to be released from the enzyme. In contrast, SET7/9 is able to add only a single methyl group to its substrate. It has been demonstrated that association of SET domain and myotubularin-related proteins modulates growth control []. The SET domain-containing Drosophila melanogaster (Fruit fly) protein, enhancer of zeste, has a function in segment determination and the mammalian homologue may be involved in the regulation of gene transcription and chromatin structure. Histone lysine methylation is part of the histone code that regulated chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception [], the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain [, ]. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities [].  The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site []. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity [], []. ; GO: 0005515 protein binding; PDB: 3TG5_A 3S7F_A 3RIB_B 3TG4_A 3S7J_A 3S7D_A 3S7B_A 3H6L_A 3SMT_A 3K5K_A ....
Probab=24.99  E-value=38  Score=30.32  Aligned_cols=16  Identities=44%  Similarity=0.659  Sum_probs=12.3

Q ss_pred             EEEEcCCCCCCCeEEE
Q psy14657        553 ALFAKRDINKNEELSF  568 (605)
Q Consensus       553 ~~fA~rdI~~GEELT~  568 (605)
                      ++||+|||++||-|.+
T Consensus         3 Gl~At~dI~~Ge~I~~   18 (162)
T PF00856_consen    3 GLFATRDIKAGEVILI   18 (162)
T ss_dssp             EEEESS-B-TTEEEEE
T ss_pred             EEEECccCCCCCEEEE
Confidence            6899999999998874


Done!