Query         psy7758
Match_columns 235
No_of_seqs    262 out of 1483
Neff          7.6 
Searched_HMMs 46136
Date          Fri Aug 16 18:03:58 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy7758.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7758hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4442|consensus              100.0 2.2E-48 4.7E-53  361.3  14.5  175    2-177    93-268 (729)
  2 KOG1082|consensus              100.0 5.7E-32 1.2E-36  244.7  11.1  171    1-172   150-357 (364)
  3 KOG1080|consensus              100.0 1.7E-31 3.8E-36  261.8  11.9  139   28-168   866-1004(1005)
  4 smart00317 SET SET (Su(var)3-9  99.9 6.8E-27 1.5E-31  176.5  15.0  116   29-144     1-116 (116)
  5 KOG1079|consensus               99.9 9.1E-28   2E-32  222.9   9.1  146    4-151   560-715 (739)
  6 KOG1083|consensus               99.9 1.5E-24 3.2E-29  208.7   1.8  137   15-152  1164-1301(1306)
  7 KOG1085|consensus               99.8 1.4E-20   3E-25  160.5   9.6  124   25-148   253-380 (392)
  8 KOG1141|consensus               99.8 4.8E-19   1E-23  167.6   7.9  165    3-168   981-1261(1262)
  9 PF00856 SET:  SET domain;  Int  99.8 1.9E-18   4E-23  136.1   7.1  107   39-145     1-162 (162)
 10 COG2940 Proteins containing SE  99.7 5.3E-19 1.1E-23  165.6   2.0  145   24-168   328-479 (480)
 11 KOG2589|consensus               99.6 8.5E-16 1.8E-20  135.1   7.1  123   36-168   135-257 (453)
 12 KOG1081|consensus               99.5 3.1E-15 6.8E-20  138.8   1.7  151    3-171   289-439 (463)
 13 KOG2461|consensus               98.7 2.1E-08 4.6E-13   91.6   5.5  115   26-150    26-148 (396)
 14 KOG1141|consensus               97.8 5.7E-06 1.2E-10   80.1   0.0   67    3-70    774-841 (1262)
 15 smart00570 AWS associated with  97.5 5.1E-05 1.1E-09   49.3   1.4   25    2-27     27-51  (51)
 16 smart00508 PostSET Cysteine-ri  96.4  0.0017 3.6E-08   36.2   1.4   18  154-171     2-19  (26)
 17 KOG2084|consensus               95.3   0.023 5.1E-07   52.5   4.6   44  104-151   208-252 (482)
 18 KOG1337|consensus               92.9   0.085 1.8E-06   49.7   3.1   41  103-146   238-278 (472)
 19 smart00317 SET SET (Su(var)3-9  76.3     4.3 9.4E-05   29.3   3.9   34   22-55     77-113 (116)
 20 PF08666 SAF:  SAF domain;  Int  75.5     1.6 3.5E-05   28.7   1.3   15  127-141     3-17  (63)
 21 KOG1338|consensus               67.5     4.6  0.0001   37.2   2.7   45  101-149   218-263 (466)
 22 smart00858 SAF This domain fam  51.3      10 0.00022   24.7   1.5   16  127-142     3-18  (64)
 23 TIGR02059 swm_rep_I cyanobacte  49.2      40 0.00087   25.0   4.5   24  124-147    75-98  (101)
 24 PF14100 PmoA:  Methane oxygena  47.8      29 0.00062   30.3   4.2  104   29-147   144-253 (271)
 25 KOG1338|consensus               43.8      16 0.00034   33.8   2.0   34   27-60     25-62  (466)
 26 KOG1081|consensus               40.2      16 0.00035   34.6   1.5   20  213-233    86-105 (463)
 27 PF00856 SET:  SET domain;  Int  38.8      49  0.0011   24.7   3.9   34   23-56    123-159 (162)
 28 KOG2155|consensus               38.3      17 0.00038   34.1   1.4   48  101-148   204-253 (631)
 29 COG1188 Ribosome-associated he  34.0      47   0.001   24.7   2.8   23  128-150    43-65  (100)
 30 PF00098 zf-CCHC:  Zinc knuckle  32.1      24 0.00052   17.6   0.7   10  218-228     2-11  (18)
 31 TIGR01164 rplP_bact ribosomal   30.3      80  0.0017   24.3   3.8   41   29-71     77-117 (126)
 32 smart00400 ZnF_CHCC zinc finge  29.1      15 0.00033   23.6  -0.4   15  215-230    22-36  (55)
 33 TIGR03569 NeuB_NnaB N-acetylne  25.8      34 0.00073   30.9   1.1   20  125-144   277-296 (329)
 34 PF05077 DUF678:  Protein of un  25.7      34 0.00073   23.8   0.8   15  209-223    15-29  (74)
 35 PF00628 PHD:  PHD-finger;  Int  24.1      50  0.0011   20.5   1.4   15  219-234     2-19  (51)
 36 KOG2084|consensus               22.9      69  0.0015   29.3   2.6   25   36-60     22-46  (482)
 37 PRK09203 rplP 50S ribosomal pr  22.1 1.3E+02  0.0029   23.4   3.7   42   29-72     78-119 (138)
 38 TIGR03170 flgA_cterm flagella   21.1      43 0.00093   25.0   0.7   16  128-143     2-17  (122)
 39 PF06718 DUF1203:  Protein of u  21.1      75  0.0016   24.2   2.0   18  129-146     6-24  (117)
 40 smart00249 PHD PHD zinc finger  21.1      58  0.0013   19.1   1.2   16  219-234     2-19  (47)
 41 CHL00044 rpl16 ribosomal prote  21.0 1.8E+02  0.0039   22.7   4.2   39   29-69     78-116 (135)

No 1  
>KOG4442|consensus
Probab=100.00  E-value=2.2e-48  Score=361.32  Aligned_cols=175  Identities=42%  Similarity=0.913  Sum_probs=169.0

Q ss_pred             ceEEcCCCCCC-CCCCCCCccCCCCCCCCEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHHhhccCCCC
Q psy7758           2 LYVECNPDSCP-ARTKCQNRDFETRNYPPLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRRDMDRNNDH   80 (235)
Q Consensus         2 ~~~eC~~~~C~-~~~~C~N~~~~~~~~~~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~~~~~~~~~   80 (235)
                      +..||+++.|+ ||.+|+|+.||+.++.+++||.+.++||||+|..+|++|+||+||.|++|+..++..|...|..++..
T Consensus        93 t~iECs~~~C~~cg~~C~NQRFQkkqyA~vevF~Te~KG~GLRA~~dI~~g~FI~EY~GEVI~~~Ef~kR~~~Y~~d~~k  172 (729)
T KOG4442|consen   93 TSIECSDRECPRCGVYCKNQRFQKKQYAKVEVFLTEKKGCGLRAEEDIPKGQFILEYIGEVIEEKEFEKRVKRYAKDGIK  172 (729)
T ss_pred             hhcccCCccCCCccccccchhhhhhccCceeEEEecCcccceeeccccCCCcEEeeeccccccHHHHHHHHHHHHhcCCc
Confidence            35799999999 99999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             ceeeeecCCCeEEeccccCCccccccCCCCCCceeEEEEECCceEEEEEEccCCCCCCeEEEecCCCccCCCCCeEEecC
Q psy7758          81 NYYFLSLDNSRYIDAGKKGNLARFMNHSCEPNCTAEKWTVSGDTRVGLFALRDVPAGTELVFNYELQKADNDGMRRCMCG  160 (235)
Q Consensus        81 ~~~~~~~~~~~~iDa~~~gn~aRfiNHSC~PN~~~~~~~~~~~~~~~i~A~rdI~~GEElt~~Y~~~~~~~~~~~~C~Cg  160 (235)
                      ++|++.+.++.+|||+.+||+||||||||+|||.++.|.|++..+|.|||.|+|++||||||||..+.++.. ...|.||
T Consensus       173 h~Yfm~L~~~e~IDAT~KGnlaRFiNHSC~PNa~~~KWtV~~~lRvGiFakk~I~~GEEITFDYqf~rYGr~-AQ~CyCg  251 (729)
T KOG4442|consen  173 HYYFMALQGGEYIDATKKGNLARFINHSCDPNAEVQKWTVPDELRVGIFAKKVIKPGEEITFDYQFDRYGRD-AQPCYCG  251 (729)
T ss_pred             eEEEEEecCCceecccccCcHHHhhcCCCCCCceeeeeeeCCeeEEEEeEecccCCCceeeEeccccccccc-ccccccC
Confidence            999999999999999999999999999999999999999999999999999999999999999999999988 8999999


Q ss_pred             CCCCceeccCCCCCCCC
Q psy7758         161 AASCSGFIGAKKAVPPT  177 (235)
Q Consensus       161 ~~~C~g~~~~~~~~~~~  177 (235)
                      +++|+|||++.++....
T Consensus       252 eanC~G~IGgk~q~da~  268 (729)
T KOG4442|consen  252 EANCRGWIGGKPQTDAS  268 (729)
T ss_pred             CcccccccCCCCccccc
Confidence            99999999998765433


No 2  
>KOG1082|consensus
Probab=99.97  E-value=5.7e-32  Score=244.71  Aligned_cols=171  Identities=34%  Similarity=0.632  Sum_probs=141.1

Q ss_pred             CceEEcCCCCCCCCCCCCCccCCCCCCCCEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHHhhccCCC-
Q psy7758           1 DLYVECNPDSCPARTKCQNRDFETRNYPPLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRRDMDRNND-   79 (235)
Q Consensus         1 ~~~~eC~~~~C~~~~~C~N~~~~~~~~~~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~~~~~~~~-   79 (235)
                      +++|||++ .|+|+..|.||+.|++...+|+|++++.+||||++...|++|++|+||.|++++.++++.+......... 
T Consensus       150 ~~i~EC~~-~C~C~~~C~nRv~q~g~~~~leIfrt~~kGwgvRs~~~I~~G~fvcEyaGe~~t~~e~~~~~~~~~~~~~~  228 (364)
T KOG1082|consen  150 EPVFECSV-ACGCHPDCANRVVQKGLQFHLEVFRTPEKGWGVRTLDPIPAGEFVCEYAGEVLTSEEAQRRTHLREYLDDD  228 (364)
T ss_pred             cccccccc-CCCCCCcCcchhhccccccceEEEecCCceeeecccccccCCCeeEEEeeEecChHHhhhccccccccccc
Confidence            46899998 8999999999999999999999999999999999999999999999999999999998876332222111 


Q ss_pred             -Cce------e---------------eeecCCCeEEeccccCCccccccCCCCCCceeEEEEECCc----eEEEEEEccC
Q psy7758          80 -HNY------Y---------------FLSLDNSRYIDAGKKGNLARFMNHSCEPNCTAEKWTVSGD----TRVGLFALRD  133 (235)
Q Consensus        80 -~~~------~---------------~~~~~~~~~iDa~~~gn~aRfiNHSC~PN~~~~~~~~~~~----~~~~i~A~rd  133 (235)
                       ..+      |               .......+.|||...||++|||||||.||..++.+..+..    .++.++|+++
T Consensus       229 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ida~~~GNv~RfinHSC~PN~~~~~v~~~~~~~~~~~i~ffa~~~  308 (364)
T KOG1082|consen  229 CDAYSIADREWVDESPVGNTFVAPSLPGGPGRELLIDAKPHGNVARFINHSCSPNLLYQAVFQDEFVLLYLRIGFFALRD  308 (364)
T ss_pred             cccchhhhccccccccccccccccccccCCCcceEEchhhcccccccccCCCCccceeeeeeecCCccchheeeeeeccc
Confidence             000      0               1112456789999999999999999999999887766532    7899999999


Q ss_pred             CCCCCeEEEecCCCcc-----C-----CCCCeEEecCCCCCceeccCCC
Q psy7758         134 VPAGTELVFNYELQKA-----D-----NDGMRRCMCGAASCSGFIGAKK  172 (235)
Q Consensus       134 I~~GEElt~~Y~~~~~-----~-----~~~~~~C~Cg~~~C~g~~~~~~  172 (235)
                      |.+|||||+||+..+.     +     ......|.|++..|++.+....
T Consensus       309 I~p~~ELT~dYg~~~~~~~~~~~~~~~~~~~~~c~c~~~~cr~~~~~~~  357 (364)
T KOG1082|consen  309 ISPGEELTLDYGKAYKLLVQDGANIYTPVMKKNCNCGLEKCRGLLGSAP  357 (364)
T ss_pred             cCCCcccchhhcccccccccccccccccccchhhcCCCHHhCcccCCCc
Confidence            9999999999997753     1     2247789999999999987643


No 3  
>KOG1080|consensus
Probab=99.97  E-value=1.7e-31  Score=261.79  Aligned_cols=139  Identities=38%  Similarity=0.796  Sum_probs=131.5

Q ss_pred             CCEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHHhhccCCCCceeeeecCCCeEEeccccCCccccccC
Q psy7758          28 PPLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRRDMDRNNDHNYYFLSLDNSRYIDAGKKGNLARFMNH  107 (235)
Q Consensus        28 ~~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~iDa~~~gn~aRfiNH  107 (235)
                      ..|...++.++||||||+++|.+|++|+||+|+++...-++.|+..|...+....|+|.++..++|||+..||+||||||
T Consensus       866 k~~~F~~s~iH~wglfa~~~i~~~dmViEY~Ge~vR~~iad~RE~~Y~~~gi~~sYlfrid~~~ViDAtk~gniAr~InH  945 (1005)
T KOG1080|consen  866 KYVKFGRSGIHGWGLFAMENIAAGDMVIEYRGELVRSSIADLREARYERMGIGDSYLFRIDDEVVVDATKKGNIARFINH  945 (1005)
T ss_pred             hhhccccccccccceeeccCccccceEEEeeceehhhhHHHHHHHHHhccCcccceeeecccceEEeccccCchhheeec
Confidence            34788889999999999999999999999999999999999999999999989999999999999999999999999999


Q ss_pred             CCCCCceeEEEEECCceEEEEEEccCCCCCCeEEEecCCCccCCCCCeEEecCCCCCceec
Q psy7758         108 SCEPNCTAEKWTVSGDTRVGLFALRDVPAGTELVFNYELQKADNDGMRRCMCGAASCSGFI  168 (235)
Q Consensus       108 SC~PN~~~~~~~~~~~~~~~i~A~rdI~~GEElt~~Y~~~~~~~~~~~~C~Cg~~~C~g~~  168 (235)
                      ||.|||....+.++|+.+|+|+|.|+|.+|||||+||-....+.  ...|+|||++|||++
T Consensus       946 sC~PNCyakvi~V~g~~~IvIyakr~I~~~EElTYDYkF~~e~~--kipClCgap~Crg~~ 1004 (1005)
T KOG1080|consen  946 SCNPNCYAKVITVEGDKRIVIYSKRDIAAGEELTYDYKFPTEDD--KIPCLCGAPNCRGFL 1004 (1005)
T ss_pred             ccCCCceeeEEEecCeeEEEEEEecccccCceeeeecccccccc--ccccccCCCcccccc
Confidence            99999999999999999999999999999999999998766554  899999999999987


No 4  
>smart00317 SET SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues
Probab=99.95  E-value=6.8e-27  Score=176.47  Aligned_cols=116  Identities=45%  Similarity=0.903  Sum_probs=100.4

Q ss_pred             CEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHHhhccCCCCceeeeecCCCeEEeccccCCccccccCC
Q psy7758          29 PLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRRDMDRNNDHNYYFLSLDNSRYIDAGKKGNLARFMNHS  108 (235)
Q Consensus        29 ~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~iDa~~~gn~aRfiNHS  108 (235)
                      +++++.++++|+||||+++|++|++|++|.|.++...+.......+........+++.......||+...++++||||||
T Consensus         1 ~~~~~~~~~~G~gl~a~~~i~~g~~i~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~id~~~~~~~~~~iNHs   80 (116)
T smart00317        1 KLEVFKSPGKGWGVRATEDIPKGEFIGEYVGEIITSEEAEERSKAYDTDGADSFYLFEIDSDLCIDARRKGNIARFINHS   80 (116)
T ss_pred             CcEEEecCCCcEEEEECCccCCCCEEEEEEeEEECHHHHHHHHHHHHhcCCCCEEEEECCCCEEEeCCccCcHHHeeCCC
Confidence            36788899999999999999999999999999999988877654333333324567777777899999999999999999


Q ss_pred             CCCCceeEEEEECCceEEEEEEccCCCCCCeEEEec
Q psy7758         109 CEPNCTAEKWTVSGDTRVGLFALRDVPAGTELVFNY  144 (235)
Q Consensus       109 C~PN~~~~~~~~~~~~~~~i~A~rdI~~GEElt~~Y  144 (235)
                      |.||+.+..+..++..++.++|+|||++|||||++|
T Consensus        81 c~pN~~~~~~~~~~~~~~~~~a~r~I~~GeEi~i~Y  116 (116)
T smart00317       81 CEPNCELLFVEVNGDSRIVIFALRDIKPGEELTIDY  116 (116)
T ss_pred             CCCCEEEEEEEECCCcEEEEEECCCcCCCCEEeecC
Confidence            999999987777666689999999999999999999


No 5  
>KOG1079|consensus
Probab=99.94  E-value=9.1e-28  Score=222.91  Aligned_cols=146  Identities=36%  Similarity=0.687  Sum_probs=136.4

Q ss_pred             EEcCCCCCCC-C---------CCCCCccCCCCCCCCEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHHh
Q psy7758           4 VECNPDSCPA-R---------TKCQNRDFETRNYPPLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRRD   73 (235)
Q Consensus         4 ~eC~~~~C~~-~---------~~C~N~~~~~~~~~~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~~   73 (235)
                      -||+|++|.+ +         -+|.|-.++.+.+.++.+.+|...|||||+++.+.++++|.||+|++|+.+|+++|...
T Consensus       560 rECdPd~Cl~cg~~~~~d~~~~~C~N~~l~~~~qkr~llapSdVaGwGlFlKe~v~KnefisEY~GE~IS~dEADrRGki  639 (739)
T KOG1079|consen  560 RECDPDVCLMCGNVDHFDSSKISCKNTNLQRGEQKRVLLAPSDVAGWGLFLKESVSKNEFISEYTGEIISHDEADRRGKI  639 (739)
T ss_pred             cccCchHHhccCcccccccCccccccchhhhhhhcceeechhhccccceeeccccCCCceeeeecceeccchhhhhcccc
Confidence            5899999986 3         39999999999999999999999999999999999999999999999999999999988


Q ss_pred             hccCCCCceeeeecCCCeEEeccccCCccccccCCCCCCceeEEEEECCceEEEEEEccCCCCCCeEEEecCCCccCC
Q psy7758          74 MDRNNDHNYYFLSLDNSRYIDAGKKGNLARFMNHSCEPNCTAEKWTVSGDTRVGLFALRDVPAGTELVFNYELQKADN  151 (235)
Q Consensus        74 ~~~~~~~~~~~~~~~~~~~iDa~~~gn~aRfiNHSC~PN~~~~~~~~~~~~~~~i~A~rdI~~GEElt~~Y~~~~~~~  151 (235)
                      ++..  ...|+|.+..+++||++..||.+||+|||-.|||....+.++|..+|.|+|.|+|.+|||||+||.++-...
T Consensus       640 YDr~--~cSflFnln~dyviDs~rkGnk~rFANHS~nPNCYAkvm~V~GdhRIGifAkRaIeagEELffDYrYs~~~~  715 (739)
T KOG1079|consen  640 YDRY--MCSFLFNLNNDYVIDSTRKGNKIRFANHSFNPNCYAKVMMVAGDHRIGIFAKRAIEAGEELFFDYRYSPEHA  715 (739)
T ss_pred             cccc--cceeeeeccccceEeeeeecchhhhccCCCCCCcEEEEEEecCCcceeeeehhhcccCceeeeeeccCcccc
Confidence            8876  556888999999999999999999999999999999999999999999999999999999999998876553


No 6  
>KOG1083|consensus
Probab=99.89  E-value=1.5e-24  Score=208.73  Aligned_cols=137  Identities=43%  Similarity=0.896  Sum_probs=121.7

Q ss_pred             CCCCCccCCCCC-CCCEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHHhhccCCCCceeeeecCCCeEE
Q psy7758          15 TKCQNRDFETRN-YPPLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRRDMDRNNDHNYYFLSLDNSRYI   93 (235)
Q Consensus        15 ~~C~N~~~~~~~-~~~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~i   93 (235)
                      +.|.|+.|++.. -++|+|++.+..||||.|.++|++||+|+||+|++++..+++.++.....+. ...|.+.+..+.++
T Consensus      1164 d~c~nqrm~r~e~cp~L~v~~gp~~G~~v~tk~PikagtfI~EYvGeVit~ke~e~~mmtl~~~d-~~~~cL~I~p~l~i 1242 (1306)
T KOG1083|consen 1164 DSCSNQRMQRHEECPPLEVFRGPKKGWGVRTKEPIKAGTFIMEYVGEVITEKEFEPRMMTLYHND-DDHYCLVIDPGLFI 1242 (1306)
T ss_pred             hhhhhHHhhhhccCCCcceeccCCCCccccccccccccchHHHHHHHHHHHHhhcccccccCCCC-CcccccccCccccC
Confidence            468899888654 5899999999999999999999999999999999999999888854444433 34578899999999


Q ss_pred             eccccCCccccccCCCCCCceeEEEEECCceEEEEEEccCCCCCCeEEEecCCCccCCC
Q psy7758          94 DAGKKGNLARFMNHSCEPNCTAEKWTVSGDTRVGLFALRDVPAGTELVFNYELQKADND  152 (235)
Q Consensus        94 Da~~~gn~aRfiNHSC~PN~~~~~~~~~~~~~~~i~A~rdI~~GEElt~~Y~~~~~~~~  152 (235)
                      |+..++|.+||+||+|.|||..+.|.++|..||.|||+|||.+|||||+||+...+...
T Consensus      1243 d~~R~~n~~RfinhscKPNc~~qkwSVNG~~Rv~L~A~rDi~kGEELtYDYN~ks~~~p 1301 (1306)
T KOG1083|consen 1243 DIPRMGNGARFINHSCKPNCEMQKWSVNGEYRVGLFALRDLPKGEELTYDYNFKSFNYP 1301 (1306)
T ss_pred             ChhhccccccccccccCCCCccccccccceeeeeeeecCCCCCCceEEEeccccccCCc
Confidence            99999999999999999999999999999999999999999999999999987655433


No 7  
>KOG1085|consensus
Probab=99.83  E-value=1.4e-20  Score=160.52  Aligned_cols=124  Identities=30%  Similarity=0.485  Sum_probs=109.3

Q ss_pred             CCCCCEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHHhhccCC---CCceeeeecCCCeEEeccccCC-
Q psy7758          25 RNYPPLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRRDMDRNN---DHNYYFLSLDNSRYIDAGKKGN-  100 (235)
Q Consensus        25 ~~~~~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~~~~~~~---~~~~~~~~~~~~~~iDa~~~gn-  100 (235)
                      +....+.+....++|+||+|++++.+|+||.||.|.+|...++..++..|..+.   .+.|||-+....++|||+...+ 
T Consensus       253 g~~egl~~~~~dgKGRGv~a~~~F~rgdFVVEY~Gdliei~eAk~rE~~Ya~De~~GcYMYyF~h~sk~yCiDAT~et~~  332 (392)
T KOG1085|consen  253 GTNEGLLEVYKDGKGRGVRAKVNFERGDFVVEYRGDLIEISEAKVREEQYANDEEIGCYMYYFEHNSKKYCIDATKETPW  332 (392)
T ss_pred             ccccceeEEeeccccceeEeecccccCceEEEEecceeeechHHHHHHHhccCcccceEEEeeeccCeeeeeeccccccc
Confidence            455567777788899999999999999999999999999999999988887654   3566666778899999996555 


Q ss_pred             ccccccCCCCCCceeEEEEECCceEEEEEEccCCCCCCeEEEecCCCc
Q psy7758         101 LARFMNHSCEPNCTAEKWTVSGDTRVGLFALRDVPAGTELVFNYELQK  148 (235)
Q Consensus       101 ~aRfiNHSC~PN~~~~~~~~~~~~~~~i~A~rdI~~GEElt~~Y~~~~  148 (235)
                      ++|.||||-.+|+....+.+++.++++++|.|||.+||||++||+.-.
T Consensus       333 lGRLINHS~~gNl~TKvv~Idg~pHLiLvA~rdIa~GEELlYDYGDRS  380 (392)
T KOG1085|consen  333 LGRLINHSVRGNLKTKVVEIDGSPHLILVARRDIAQGEELLYDYGDRS  380 (392)
T ss_pred             chhhhcccccCcceeeEEEecCCceEEEEeccccccchhhhhhccccc
Confidence            789999999999999999999999999999999999999999998754


No 8  
>KOG1141|consensus
Probab=99.77  E-value=4.8e-19  Score=167.60  Aligned_cols=165  Identities=30%  Similarity=0.549  Sum_probs=123.1

Q ss_pred             eEEcCCCCCCCCCCCCCccCCCCCCC--------CEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHHhh
Q psy7758           3 YVECNPDSCPARTKCQNRDFETRNYP--------PLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRRDM   74 (235)
Q Consensus         3 ~~eC~~~~C~~~~~C~N~~~~~~~~~--------~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~~~   74 (235)
                      ++||+. -|.|...|.|++++.....        .|.|+.+...|||+.+..||+.-++|.+|.|...+..-+.+.-...
T Consensus       981 f~e~~~-hss~~~~e~~~~v~~~~~~~me~~s~~~l~i~~~~~~~~~~~edtD~~~~~~~~~~~~~ppt~~l~~~~r~aq 1059 (1262)
T KOG1141|consen  981 FFECND-HSSCHRKEYNRVVQNNIKYPMEVSSFNDLQIFKTAQSGWGVREDTDIPQSTFICTYVGAPPTDDLADELRNAQ 1059 (1262)
T ss_pred             ceeccc-cchhcccccchhhhcCCccceeeeecccccccccccccccccccccCCCCcccccccCCCCchhhHHHHhhhh
Confidence            578985 7999999999998766443        4556666778999999999999999999998875543322111100


Q ss_pred             ccCC-------------------CCc------------------------------------------------------
Q psy7758          75 DRNN-------------------DHN------------------------------------------------------   81 (235)
Q Consensus        75 ~~~~-------------------~~~------------------------------------------------------   81 (235)
                      ....                   .+.                                                      
T Consensus      1060 ad~~sn~~D~~~~~~l~es~~~~~T~~r~~t~~~~~~~~~d~dd~q~I~k~ve~qd~~~~~~~T~~~~RQ~~~~s~k~~~ 1139 (1262)
T KOG1141|consen 1060 ADQYSNDLDLKDTVELEESREDHETDFRGDTSDYDDEEGSDGDDGQDIMKMVERQDSSESGEETKRLTRQKRKQSKKSGK 1139 (1262)
T ss_pred             hccccCccchhhhhhhhhcccccccccCCCCCCCcccccccCccHHHHHHHhhcccccccccccchhhhhhhhhhhhccc
Confidence            0000                   000                                                      


Q ss_pred             -----------e-------------------eeeecCCCeEEeccccCCccccccCCCCCCceeEEEEECCc----eEEE
Q psy7758          82 -----------Y-------------------YFLSLDNSRYIDAGKKGNLARFMNHSCEPNCTAEKWTVSGD----TRVG  127 (235)
Q Consensus        82 -----------~-------------------~~~~~~~~~~iDa~~~gn~aRfiNHSC~PN~~~~~~~~~~~----~~~~  127 (235)
                                 .                   +++....-++|||...||++||+||||.||..++.++++.+    +.|+
T Consensus      1140 ~~s~~~~~~ts~~~~~~dkges~~~~~~~~~~y~~~~~~yvIDAk~eGNlGRfLNHSC~PNl~VQnVfvdTHdlrfPwVA 1219 (1262)
T KOG1141|consen 1140 GGSVEKDDTTSRDSMEKDKGESKDEPVFNWDKYFEPFPLYVIDAKQEGNLGRFLNHSCDPNLHVQNVFVDTHDLRFPWVA 1219 (1262)
T ss_pred             CccccccccCccchhhhccCccCcccccchhhccCCCceEEEecccccchhhhhccCCCccceeeeeeeeccccCCchhh
Confidence                       0                   00001123579999999999999999999999999998765    7799


Q ss_pred             EEEccCCCCCCeEEEecCCCccCCC-CCeEEecCCCCCceec
Q psy7758         128 LFALRDVPAGTELVFNYELQKADND-GMRRCMCGAASCSGFI  168 (235)
Q Consensus       128 i~A~rdI~~GEElt~~Y~~~~~~~~-~~~~C~Cg~~~C~g~~  168 (235)
                      +||.+-|++|.||||||++.-.... ....|+||+.+|||.|
T Consensus      1220 FFt~kyVkAgtELTWDY~Ye~g~v~~keL~C~CGa~~CrgrL 1261 (1262)
T KOG1141|consen 1220 FFTRKYVKAGTELTWDYQYEQGQVATKELTCHCGAENCRGRL 1261 (1262)
T ss_pred             hhhhhhhccCceeeeeccccccccccceEEEecChhhhhccc
Confidence            9999999999999999988765433 3678999999999976


No 9  
>PF00856 SET:  SET domain;  InterPro: IPR001214 The SET domain appears generally as one part of a larger multidomain protein, and recently there were described three structures of very different proteins with distinct domain compositions: Neurospora crassa DIM-5, a member of the Su(var) family of HKMTs which methylate histone H3 on lysine 9,human SET7 (also called SET9), which methylates H3 on lysine 4 and garden pea Rubisco LSMT, an enzyme that does not modify histones, but instead methylates lysine 14 in the flexible tail of the large subunit of the enzyme Rubisco. The SET domain itself turned out to be an uncommon structure. Although in all three studies, electron density maps revealed the location of the AdoMet or AdoHcy cofactor, the SET domain bears no similarity at all to the canonical/AdoMet-dependent methyltransferase fold. Strictly conserved in the C-terminal motif of the SET domain tyrosine could be involved in abstracting a proton from the protonated amino group of the substrate lysine, promoting its nucleophilic attack on the sulphonium methyl group of the AdoMet cofactor. In contrast to the AdoMet-dependent protein methyltranferases of the classical type, which tend to bind their polypeptide substrates on top of the cofactor, it is noted from the Rubisco LSMT structure that the AdoMet seems to bind in a separate cleft, suggesting how a polypeptide substrate could be subjected to multiple rounds of methylation without having to be released from the enzyme. In contrast, SET7/9 is able to add only a single methyl group to its substrate. It has been demonstrated that association of SET domain and myotubularin-related proteins modulates growth control []. The SET domain-containing Drosophila melanogaster (Fruit fly) protein, enhancer of zeste, has a function in segment determination and the mammalian homologue may be involved in the regulation of gene transcription and chromatin structure. Histone lysine methylation is part of the histone code that regulated chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception [], the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain [, ]. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities [].  The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site []. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity [], []. ; GO: 0005515 protein binding; PDB: 3TG5_A 3S7F_A 3RIB_B 3TG4_A 3S7J_A 3S7D_A 3S7B_A 3H6L_A 3SMT_A 3K5K_A ....
Probab=99.75  E-value=1.9e-18  Score=136.07  Aligned_cols=107  Identities=21%  Similarity=0.270  Sum_probs=72.8

Q ss_pred             ceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHH---hhc------------------------------------cCCC
Q psy7758          39 GWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRR---DMD------------------------------------RNND   79 (235)
Q Consensus        39 G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~---~~~------------------------------------~~~~   79 (235)
                      |+||||+++|++|++|+...+.+++.........   ...                                    ....
T Consensus         1 GrGl~At~dI~~Ge~I~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   80 (162)
T PF00856_consen    1 GRGLFATRDIKAGEVILIPRPAILTPDEVSPQPELLRLQLSKALEEQSRSDFSIQKKQKAEKSERSPQLESLHSISLRSE   80 (162)
T ss_dssp             SEEEEESS-B-TTEEEEEESEEEEEHHHHHCHHHHSHHTTCSSSCSHHTTHHHHHHHHHHHHHHHHHHHHHHHHHCHTTT
T ss_pred             CEEEEECccCCCCCEEEEECcceEEehhhhhcccchhhhhhhhhcccccccccccccccccccccccccccccccccccc
Confidence            8999999999999999988888888776643100   000                                    0000


Q ss_pred             ----------------CceeeeecCCCeEEeccccCCccccccCCCCCCceeEEEEECCceEEEEEEccCCCCCCeEEEe
Q psy7758          80 ----------------HNYYFLSLDNSRYIDAGKKGNLARFMNHSCEPNCTAEKWTVSGDTRVGLFALRDVPAGTELVFN  143 (235)
Q Consensus        80 ----------------~~~~~~~~~~~~~iDa~~~gn~aRfiNHSC~PN~~~~~~~~~~~~~~~i~A~rdI~~GEElt~~  143 (235)
                                      ..............++....+++.|+||||.|||.+..........+.|+|+|||++|||||++
T Consensus        81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~p~~d~~NHsc~pn~~~~~~~~~~~~~~~~~a~r~I~~GeEi~is  160 (162)
T PF00856_consen   81 LQFSQAFQWSWFISWTRSDFSSRSFSEDDRDGIALYPFADMLNHSCDPNCEVSFDFDGDGGCLVVRATRDIKKGEEIFIS  160 (162)
T ss_dssp             CCTCCHHHHHHHHHHHHHEEEEEEETTEEEEEEEEETGGGGSEEESSTSEEEEEEEETTTTEEEEEESS-B-TTSBEEEE
T ss_pred             ccccccccchhhccccceeeeccccccccccccccCcHhHheccccccccceeeEeecccceEEEEECCccCCCCEEEEE
Confidence                            0001111112334555667778999999999999997765567799999999999999999999


Q ss_pred             cC
Q psy7758         144 YE  145 (235)
Q Consensus       144 Y~  145 (235)
                      ||
T Consensus       161 YG  162 (162)
T PF00856_consen  161 YG  162 (162)
T ss_dssp             ST
T ss_pred             EC
Confidence            97


No 10 
>COG2940 Proteins containing SET domain [General function prediction only]
Probab=99.74  E-value=5.3e-19  Score=165.56  Aligned_cols=145  Identities=35%  Similarity=0.667  Sum_probs=118.4

Q ss_pred             CCCCCCEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHHhhccCCCCceeeeecCCCeEEeccccCCccc
Q psy7758          24 TRNYPPLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRRDMDRNNDHNYYFLSLDNSRYIDAGKKGNLAR  103 (235)
Q Consensus        24 ~~~~~~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~iDa~~~gn~aR  103 (235)
                      .....+..+..+..+|+|+||.+.|++|++|.+|.|+++...++..+............+........++|+...|+.+|
T Consensus       328 ~~~~~~~~~~~~~~~~~g~fa~~~i~~~e~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~g~~~r  407 (480)
T COG2940         328 KKRREPNVVQESEIKGYGVFALESIKKGEFIIEYHGEIIRRKEAREREENYDLLGNEFSFGLLEDKDKVRDSQKAGDVAR  407 (480)
T ss_pred             ccccchhhhhhhcccccceeehhhccchHHHHHhcCcccchHHHHhhhccccccccccchhhccccchhhhhhhcccccc
Confidence            33445667777789999999999999999999999999999888888776644333222222222377899999999999


Q ss_pred             cccCCCCCCceeEEEEECCceEEEEEEccCCCCCCeEEEecCCCccCCC-------CCeEEecCCCCCceec
Q psy7758         104 FMNHSCEPNCTAEKWTVSGDTRVGLFALRDVPAGTELVFNYELQKADND-------GMRRCMCGAASCSGFI  168 (235)
Q Consensus       104 fiNHSC~PN~~~~~~~~~~~~~~~i~A~rdI~~GEElt~~Y~~~~~~~~-------~~~~C~Cg~~~C~g~~  168 (235)
                      |+||||.||+.......++...+.++|+|||.+||||+++|+...+...       ....|.|++..|++++
T Consensus       408 ~~nHS~~pN~~~~~~~~~g~~~~~~~~~rDI~~geEl~~dy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  479 (480)
T COG2940         408 FINHSCTPNCEASPIEVNGIFKISIYAIRDIKAGEELTYDYGPSLEDNRELKKLLEKRWGCACGEDRCSHTM  479 (480)
T ss_pred             eeecCCCCCcceecccccccceeeecccccchhhhhhccccccccccchhhhhhhhhhhccccCCCccCCCC
Confidence            9999999999987665555568999999999999999999999888755       4778999999999876


No 11 
>KOG2589|consensus
Probab=99.61  E-value=8.5e-16  Score=135.10  Aligned_cols=123  Identities=23%  Similarity=0.335  Sum_probs=94.1

Q ss_pred             CCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHHhhccCCCCceeeeecCCCeEEeccccCCccccccCCCCCCcee
Q psy7758          36 GSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRRDMDRNNDHNYYFLSLDNSRYIDAGKKGNLARFMNHSCEPNCTA  115 (235)
Q Consensus        36 ~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~iDa~~~gn~aRfiNHSC~PN~~~  115 (235)
                      ...|--|+|++.+.+|+-|...+|.|....+.+++........+++..+    ....-.+..+-.+++||||.|.|||.|
T Consensus       135 ~~~gAkivst~~w~~ndkIe~LvGcIaeLse~eE~~ll~~g~nDFSvmy----StRk~caqLwLGPaafINHDCrpnCkF  210 (453)
T KOG2589|consen  135 SQNGAKIVSTKSWSRNDKIELLVGCIAELSEAEERSLLRGGGNDFSVMY----STRKRCAQLWLGPAAFINHDCRPNCKF  210 (453)
T ss_pred             cCCCceEEeeccccCCccHHHhhhhhhhcChhhhHHHHhccCCceeeee----ecccchhhheeccHHhhcCCCCCCcee
Confidence            3579999999999999999999999987777777744444433332222    112233556778899999999999998


Q ss_pred             EEEEECCceEEEEEEccCCCCCCeEEEecCCCccCCCCCeEEecCCCCCceec
Q psy7758         116 EKWTVSGDTRVGLFALRDVPAGTELVFNYELQKADNDGMRRCMCGAASCSGFI  168 (235)
Q Consensus       116 ~~~~~~~~~~~~i~A~rdI~~GEElt~~Y~~~~~~~~~~~~C~Cg~~~C~g~~  168 (235)
                      +   ..|..++.|.++|||+||||||.-|+..+++.. +..|.|  ..|....
T Consensus       211 v---s~g~~tacvkvlRDIePGeEITcFYgs~fFG~~-N~~CeC--~TCER~g  257 (453)
T KOG2589|consen  211 V---STGRDTACVKVLRDIEPGEEITCFYGSGFFGEN-NEECEC--VTCERRG  257 (453)
T ss_pred             e---cCCCceeeeehhhcCCCCceeEEeecccccCCC-CceeEE--eeccccc
Confidence            4   334589999999999999999999999999988 777777  5665443


No 12 
>KOG1081|consensus
Probab=99.51  E-value=3.1e-15  Score=138.83  Aligned_cols=151  Identities=44%  Similarity=0.858  Sum_probs=129.2

Q ss_pred             eEEcCCCCCCCCCCCCCccCCCCCCCCEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHHhhccCCCCce
Q psy7758           3 YVECNPDSCPARTKCQNRDFETRNYPPLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRRDMDRNNDHNY   82 (235)
Q Consensus         3 ~~eC~~~~C~~~~~C~N~~~~~~~~~~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~~~~~~~~~~~   82 (235)
                      +++|.|..|+++..|.|+.+.+...+.      +..    +|.++|.+|      .|++++..+...+...........+
T Consensus       289 ~~~~~p~~~~~~~~~~~~~~sk~~~~e------~~~----~~~~~~~k~------vg~~i~~~e~~~~~~~~~~~~~~~~  352 (463)
T KOG1081|consen  289 AYEVHPKVCSAEERCHNQQFSKESYPE------PQK----TAKADIRKG------VGEVIDDKECKARLQRVKESDLVDF  352 (463)
T ss_pred             hhhhcccccccccccccchhhhhcccc------cch----hhHHhhhcc------cCcccchhhheeehhhhhccchhhh
Confidence            567888899999999999988887755      222    899999998      8889999998888888887777888


Q ss_pred             eeeecCCCeEEeccccCCccccccCCCCCCceeEEEEECCceEEEEEEccCCCCCCeEEEecCCCccCCCCCeEEecCCC
Q psy7758          83 YFLSLDNSRYIDAGKKGNLARFMNHSCEPNCTAEKWTVSGDTRVGLFALRDVPAGTELVFNYELQKADNDGMRRCMCGAA  162 (235)
Q Consensus        83 ~~~~~~~~~~iDa~~~gn~aRfiNHSC~PN~~~~~~~~~~~~~~~i~A~rdI~~GEElt~~Y~~~~~~~~~~~~C~Cg~~  162 (235)
                      |+.....+..||+..++|.+||+||||.||+....|.+.+..++.++|.+.|++|||||++|...-...  .+.|.|++.
T Consensus       353 ~~~~~e~~~~id~~~~~n~sr~~nh~~~~~v~~~k~~~~~~t~~~~~a~~~i~~g~e~t~~~n~~~~~~--~~~~~~~~e  430 (463)
T KOG1081|consen  353 YMVFIQKDRIIDAGPKGNYSRFLNHSCQPNVETEKWQVIGDTRVGLFAPRQIEAGEELTFNYNGNCEGN--EKRCCCGSE  430 (463)
T ss_pred             hhhhhhcccccccccccchhhhhcccCCCceeechhheecccccccccccccccchhhhheeeccccCC--cceEeeccc
Confidence            877777666999999999999999999999999999999999999999999999999999998765543  578999888


Q ss_pred             CCceeccCC
Q psy7758         163 SCSGFIGAK  171 (235)
Q Consensus       163 ~C~g~~~~~  171 (235)
                      +|.++++..
T Consensus       431 ~~~~~~~k~  439 (463)
T KOG1081|consen  431 NCTETKGKK  439 (463)
T ss_pred             ccccCCccc
Confidence            887776654


No 13 
>KOG2461|consensus
Probab=98.69  E-value=2.1e-08  Score=91.58  Aligned_cols=115  Identities=23%  Similarity=0.299  Sum_probs=82.9

Q ss_pred             CCCCEEEEEc--CCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHHhhccCCCCceeeeecC-CCeEEecc--ccCC
Q psy7758          26 NYPPLEVFNT--GSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRRDMDRNNDHNYYFLSLD-NSRYIDAG--KKGN  100 (235)
Q Consensus        26 ~~~~lev~~s--~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~~~~~~~~~~~~~~~~~-~~~~iDa~--~~gn  100 (235)
                      ..+.|.|..+  +..|.||++...|.+|+.-+.|.|+++....      ....+......++..+ ...+||++  ...|
T Consensus        26 LP~~l~i~~Ssv~~~~lgV~s~~~i~~G~~FGP~~G~~~~~~~------~~~~n~~y~W~I~~~d~~~~~iDg~d~~~sN   99 (396)
T KOG2461|consen   26 LPPELRIKPSSVPVTGLGVWSNASILPGTSFGPFEGEIIASID------SKSANNRYMWEIFSSDNGYEYIDGTDEEHSN   99 (396)
T ss_pred             CCCceEeeccccCCccccccccccccCcccccCccCccccccc------cccccCcceEEEEeCCCceEEeccCChhhcc
Confidence            5678888888  5688999999999999999999999711110      0011111222222332 44789987  4789


Q ss_pred             ccccccCCCCC---CceeEEEEECCceEEEEEEccCCCCCCeEEEecCCCccC
Q psy7758         101 LARFMNHSCEP---NCTAEKWTVSGDTRVGLFALRDVPAGTELVFNYELQKAD  150 (235)
Q Consensus       101 ~aRfiNHSC~P---N~~~~~~~~~~~~~~~i~A~rdI~~GEElt~~Y~~~~~~  150 (235)
                      +.||||-.+..   |...    ......|+++|+|+|.+||||.|+|+.++..
T Consensus       100 WmRYV~~Ar~~eeQNL~A----~Q~~~~Ifyrt~r~I~p~eELlVWY~~e~~~  148 (396)
T KOG2461|consen  100 WMRYVNSARSEEEQNLLA----FQIGENIFYRTIRDIRPNEELLVWYGSEYAE  148 (396)
T ss_pred             eeeeecccCChhhhhHHH----HhccCceEEEecccCCCCCeEEEEeccchHh
Confidence            99999988864   6543    2335789999999999999999999988743


No 14 
>KOG1141|consensus
Probab=97.76  E-value=5.7e-06  Score=80.10  Aligned_cols=67  Identities=28%  Similarity=0.538  Sum_probs=60.2

Q ss_pred             eEEcCCCCCCC-CCCCCCccCCCCCCCCEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHH
Q psy7758           3 YVECNPDSCPA-RTKCQNRDFETRNYPPLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRR   70 (235)
Q Consensus         3 ~~eC~~~~C~~-~~~C~N~~~~~~~~~~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~   70 (235)
                      +|||+. .|.| +.-|.||+++.+.+.+|.++.+..+|||++...+|..|.+|+.|.|.+++..-.+..
T Consensus       774 ~yEc~k-~ckc~~~~C~nrmvqhg~qvRlq~fkt~~kGWg~rclddi~~g~fVciy~g~~l~~~~sdks  841 (1262)
T KOG1141|consen  774 PYECLK-ACKCCGPDCLNRMVQHGYQVRLQRFKTIHKGWGRRCLDDITGGNFVCIYPGGALLHQISDKS  841 (1262)
T ss_pred             HHHHHH-hhccCcHHHHHHHhhcCceeEeeeccccccccceEeeeecCCceEEEEecchhhhhhhchhh
Confidence            589986 8986 579999999999999999999999999999999999999999999999887654443


No 15 
>smart00570 AWS associated with SET domains. subdomain of PRESET
Probab=97.48  E-value=5.1e-05  Score=49.34  Aligned_cols=25  Identities=40%  Similarity=1.069  Sum_probs=22.2

Q ss_pred             ceEEcCCCCCCCCCCCCCccCCCCCC
Q psy7758           2 LYVECNPDSCPARTKCQNRDFETRNY   27 (235)
Q Consensus         2 ~~~eC~~~~C~~~~~C~N~~~~~~~~   27 (235)
                      |++|| |..|+||+.|+||.|+++++
T Consensus        27 l~~EC-~~~C~~G~~C~NqrFqk~~y   51 (51)
T smart00570       27 LLIEC-SSDCPCGSYCSNQRFQKRQY   51 (51)
T ss_pred             Hhhhc-CCCCCCCcCccCcccccCcC
Confidence            57899 57999999999999998864


No 16 
>smart00508 PostSET Cysteine-rich motif following a subset of SET domains.
Probab=96.44  E-value=0.0017  Score=36.19  Aligned_cols=18  Identities=44%  Similarity=1.261  Sum_probs=15.6

Q ss_pred             CeEEecCCCCCceeccCC
Q psy7758         154 MRRCMCGAASCSGFIGAK  171 (235)
Q Consensus       154 ~~~C~Cg~~~C~g~~~~~  171 (235)
                      .+.|.||+++|||++...
T Consensus         2 ~~~C~CGs~~CRG~l~~~   19 (26)
T smart00508        2 KQPCLCGAPNCRGFLGXX   19 (26)
T ss_pred             CeeeeCCCccccceeccc
Confidence            579999999999999654


No 17 
>KOG2084|consensus
Probab=95.28  E-value=0.023  Score=52.51  Aligned_cols=44  Identities=27%  Similarity=0.468  Sum_probs=32.3

Q ss_pred             cccCCCCCCceeEEEEECCceEEEEEEccCCCCCC-eEEEecCCCccCC
Q psy7758         104 FMNHSCEPNCTAEKWTVSGDTRVGLFALRDVPAGT-ELVFNYELQKADN  151 (235)
Q Consensus       104 fiNHSC~PN~~~~~~~~~~~~~~~i~A~rdI~~GE-Elt~~Y~~~~~~~  151 (235)
                      ++||||.||+..   ..+ .....+.+..++.+++ ||+++|....++.
T Consensus       208 ~~~hsC~pn~~~---~~~-~~~~~~~~~~~~~~~~~~l~~~y~~~~~~~  252 (482)
T KOG2084|consen  208 LFNHSCFPNISV---IFD-GRGLALLVPAGIDAGEEELTISYTDPLLST  252 (482)
T ss_pred             hcccCCCCCeEE---EEC-CceeEEEeecccCCCCCEEEEeecccccCH
Confidence            789999999983   233 3445566777777776 9999998877653


No 18 
>KOG1337|consensus
Probab=92.90  E-value=0.085  Score=49.69  Aligned_cols=41  Identities=39%  Similarity=0.399  Sum_probs=31.5

Q ss_pred             ccccCCCCCCceeEEEEECCceEEEEEEccCCCCCCeEEEecCC
Q psy7758         103 RFMNHSCEPNCTAEKWTVSGDTRVGLFALRDVPAGTELVFNYEL  146 (235)
Q Consensus       103 RfiNHSC~PN~~~~~~~~~~~~~~~i~A~rdI~~GEElt~~Y~~  146 (235)
                      -+.||++.+...   .+......+.+++.++|.+||||+++||.
T Consensus       238 D~~NH~~~~~~~---~~~~~d~~~~l~~~~~v~~geevfi~YG~  278 (472)
T KOG1337|consen  238 DLLNHSPEVIKA---GYNQEDEAVELVAERDVSAGEEVFINYGP  278 (472)
T ss_pred             HhhccCchhccc---cccCCCCcEEEEEeeeecCCCeEEEecCC
Confidence            467999999222   11222348999999999999999999987


No 19 
>smart00317 SET SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues
Probab=76.25  E-value=4.3  Score=29.30  Aligned_cols=34  Identities=24%  Similarity=0.277  Sum_probs=25.5

Q ss_pred             CCCCCCCCEEEEEcCCCc---eEEEEceeCCCCCEEE
Q psy7758          22 FETRNYPPLEVFNTGSRG---WGLKALTDLKRGQFVV   55 (235)
Q Consensus        22 ~~~~~~~~lev~~s~~~G---~GlfA~~~I~~Ge~I~   55 (235)
                      +.+...|...+......|   ..++|+++|++||-|.
T Consensus        77 iNHsc~pN~~~~~~~~~~~~~~~~~a~r~I~~GeEi~  113 (116)
T smart00317       77 INHSCEPNCELLFVEVNGDSRIVIFALRDIKPGEELT  113 (116)
T ss_pred             eCCCCCCCEEEEEEEECCCcEEEEEECCCcCCCCEEe
Confidence            445556777776665544   8999999999999875


No 20 
>PF08666 SAF:  SAF domain;  InterPro: IPR013974  This entry includes a range of different proteins, such as antifreeze proteins, flagellar FlgA proteins, and CpaB pilus proteins. ; PDB: 1C89_A 3NLA_A 3RDN_A 1C8A_A 3FRN_A 1WVO_A 3K3S_H 3G8R_B 1XUU_A 1XUZ_A ....
Probab=75.55  E-value=1.6  Score=28.70  Aligned_cols=15  Identities=47%  Similarity=0.718  Sum_probs=11.3

Q ss_pred             EEEEccCCCCCCeEE
Q psy7758         127 GLFALRDVPAGTELV  141 (235)
Q Consensus       127 ~i~A~rdI~~GEElt  141 (235)
                      +++|.+||++|+.|+
T Consensus         3 vvVA~~di~~G~~i~   17 (63)
T PF08666_consen    3 VVVAARDIPAGTVIT   17 (63)
T ss_dssp             EEEESSTB-TT-BEC
T ss_pred             EEEEeCccCCCCEEc
Confidence            478999999999993


No 21 
>KOG1338|consensus
Probab=67.50  E-value=4.6  Score=37.18  Aligned_cols=45  Identities=24%  Similarity=0.324  Sum_probs=31.8

Q ss_pred             ccccccCCCC-CCceeEEEEECCceEEEEEEccCCCCCCeEEEecCCCcc
Q psy7758         101 LARFMNHSCE-PNCTAEKWTVSGDTRVGLFALRDVPAGTELVFNYELQKA  149 (235)
Q Consensus       101 ~aRfiNHSC~-PN~~~~~~~~~~~~~~~i~A~rdI~~GEElt~~Y~~~~~  149 (235)
                      .+-|+||.-. -|+...+    +..-+.++|.|+|.+|+|+.-.|+....
T Consensus       218 ~ad~lNhd~~k~nanl~y----~~NcL~mva~r~iekgdev~n~dg~~p~  263 (466)
T KOG1338|consen  218 IADFLNHDGLKANANLRY----EDNCLEMVADRNIEKGDEVDNSDGLKPM  263 (466)
T ss_pred             hhhhhccchhhcccceec----cCcceeeeecCCCCCccccccccccCcc
Confidence            4678999743 3443321    2355678999999999999999975443


No 22 
>smart00858 SAF This domain family includes a range of different proteins. Such as antifreeze proteins and flagellar FlgA proteins, and CpaB pilus proteins.
Probab=51.33  E-value=10  Score=24.69  Aligned_cols=16  Identities=38%  Similarity=0.544  Sum_probs=14.0

Q ss_pred             EEEEccCCCCCCeEEE
Q psy7758         127 GLFALRDVPAGTELVF  142 (235)
Q Consensus       127 ~i~A~rdI~~GEElt~  142 (235)
                      +++|.++|.+|+.|+-
T Consensus         3 v~va~~~i~~G~~i~~   18 (64)
T smart00858        3 VVVAARDLPAGEVITA   18 (64)
T ss_pred             EEEEeCccCCCCCcch
Confidence            4789999999999985


No 23 
>TIGR02059 swm_rep_I cyanobacterial long protein repeat. This domain appears in 29 copies in a large (10000 amino protein in Synechococcus sp. WH8102 associated with a novel flagellar system, as one of three different repeats. Similar domains are found in two different large (<3500) proteins of Synechocystis PCC6803.
Probab=49.22  E-value=40  Score=25.02  Aligned_cols=24  Identities=25%  Similarity=0.364  Sum_probs=20.9

Q ss_pred             eEEEEEEccCCCCCCeEEEecCCC
Q psy7758         124 TRVGLFALRDVPAGTELVFNYELQ  147 (235)
Q Consensus       124 ~~~~i~A~rdI~~GEElt~~Y~~~  147 (235)
                      ..+.|.-.+.|..||++|++|...
T Consensus        75 ktVTLTL~~~V~~Gq~VTVsYt~p   98 (101)
T TIGR02059        75 TTITLTLAQVVEDGDEVTLSYTKN   98 (101)
T ss_pred             cEEEEEecccccCCCEEEEEeeCC
Confidence            478888899999999999999654


No 24 
>PF14100 PmoA:  Methane oxygenase PmoA
Probab=47.80  E-value=29  Score=30.31  Aligned_cols=104  Identities=18%  Similarity=0.312  Sum_probs=52.5

Q ss_pred             CEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHHhhccCCCCceeeeecCCCeEEeccccCCccccccCC
Q psy7758          29 PLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRRDMDRNNDHNYYFLSLDNSRYIDAGKKGNLARFMNHS  108 (235)
Q Consensus        29 ~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~iDa~~~gn~aRfiNHS  108 (235)
                      +|.+....-.|+++++.+.+..|+++..- |       ................|.-..++.     ..... .-|++|.
T Consensus       144 ~v~l~~~~yGGl~~R~~~~~~~g~v~~s~-G-------~~g~~~~~g~~a~Wv~~~g~~~~~-----~~~~~-i~~~dhP  209 (271)
T PF14100_consen  144 PVTLGDPGYGGLFWRAARSWDGGTVLTSE-G-------KTGEEAAWGKRAPWVDYSGPIDGE-----DGTSG-IAILDHP  209 (271)
T ss_pred             ceEecCCCcceEEEEccCcccCCeEECCC-C-------CcCcccccCCccCceEEEeeeCCC-----cceEE-EEEEeCC
Confidence            56666555678999999988665555331 1       000000001111111111111111     00112 2377888


Q ss_pred             CCCCceeEEEEECCceEEEE------EEccCCCCCCeEEEecCCC
Q psy7758         109 CEPNCTAEKWTVSGDTRVGL------FALRDVPAGTELVFNYELQ  147 (235)
Q Consensus       109 C~PN~~~~~~~~~~~~~~~i------~A~rdI~~GEElt~~Y~~~  147 (235)
                      =+||-- ..|.+.+...+.+      ..--.|++||.|++.|..-
T Consensus       210 ~N~~~P-~~W~vR~~g~~~~~p~~~~~~~~~l~~G~~l~~rYr~~  253 (271)
T PF14100_consen  210 SNPNYP-TPWHVRGYGLFGANPAPAFDGPLTLPPGETLTLRYRVV  253 (271)
T ss_pred             CCCCCC-cceEEeccCcceecccccccCceecCCCCeEEEEEEEE
Confidence            887644 3355554444433      3445699999999999543


No 25 
>KOG1338|consensus
Probab=43.80  E-value=16  Score=33.80  Aligned_cols=34  Identities=21%  Similarity=0.376  Sum_probs=25.8

Q ss_pred             CCCEEEEEcC----CCceEEEEceeCCCCCEEEEeccE
Q psy7758          27 YPPLEVFNTG----SRGWGLKALTDLKRGQFVVEYVGE   60 (235)
Q Consensus        27 ~~~lev~~s~----~~G~GlfA~~~I~~Ge~I~ey~G~   60 (235)
                      +|++.+..-+    ..|.|++|+++|++|+.++.|.+.
T Consensus        25 SpKi~~ndl~~v~~~~G~g~vAtesIkkgE~Lf~~prd   62 (466)
T KOG1338|consen   25 SPKIDNNDLPWVERIAGAGIVATESIKKGESLFAYPRD   62 (466)
T ss_pred             cccccccccchhhhhcccceeeehhhcCCceEEEecCc
Confidence            3445554442    359999999999999999988665


No 26 
>KOG1081|consensus
Probab=40.22  E-value=16  Score=34.56  Aligned_cols=20  Identities=25%  Similarity=0.587  Sum_probs=17.8

Q ss_pred             ccccccccccCCCCCeEEEee
Q psy7758         213 PAVVEVCERCGAEGGEMLRCA  233 (235)
Q Consensus       213 ~~~~d~~~~c~~~~~~~~~~~  233 (235)
                      +..+++||.|. +||.+|.||
T Consensus        86 ~~~~~~c~vc~-~ggs~v~~~  105 (463)
T KOG1081|consen   86 KIEPSECFVCF-KGGSLVTCK  105 (463)
T ss_pred             CCCcchhcccc-CCCccceec
Confidence            44579999999 999999998


No 27 
>PF00856 SET:  SET domain;  InterPro: IPR001214 The SET domain appears generally as one part of a larger multidomain protein, and recently there were described three structures of very different proteins with distinct domain compositions: Neurospora crassa DIM-5, a member of the Su(var) family of HKMTs which methylate histone H3 on lysine 9,human SET7 (also called SET9), which methylates H3 on lysine 4 and garden pea Rubisco LSMT, an enzyme that does not modify histones, but instead methylates lysine 14 in the flexible tail of the large subunit of the enzyme Rubisco. The SET domain itself turned out to be an uncommon structure. Although in all three studies, electron density maps revealed the location of the AdoMet or AdoHcy cofactor, the SET domain bears no similarity at all to the canonical/AdoMet-dependent methyltransferase fold. Strictly conserved in the C-terminal motif of the SET domain tyrosine could be involved in abstracting a proton from the protonated amino group of the substrate lysine, promoting its nucleophilic attack on the sulphonium methyl group of the AdoMet cofactor. In contrast to the AdoMet-dependent protein methyltranferases of the classical type, which tend to bind their polypeptide substrates on top of the cofactor, it is noted from the Rubisco LSMT structure that the AdoMet seems to bind in a separate cleft, suggesting how a polypeptide substrate could be subjected to multiple rounds of methylation without having to be released from the enzyme. In contrast, SET7/9 is able to add only a single methyl group to its substrate. It has been demonstrated that association of SET domain and myotubularin-related proteins modulates growth control []. The SET domain-containing Drosophila melanogaster (Fruit fly) protein, enhancer of zeste, has a function in segment determination and the mammalian homologue may be involved in the regulation of gene transcription and chromatin structure. Histone lysine methylation is part of the histone code that regulated chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception [], the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain [, ]. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities [].  The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site []. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity [], []. ; GO: 0005515 protein binding; PDB: 3TG5_A 3S7F_A 3RIB_B 3TG4_A 3S7J_A 3S7D_A 3S7B_A 3H6L_A 3SMT_A 3K5K_A ....
Probab=38.77  E-value=49  Score=24.71  Aligned_cols=34  Identities=21%  Similarity=0.191  Sum_probs=24.4

Q ss_pred             CCCCCCCEEEEEc---CCCceEEEEceeCCCCCEEEE
Q psy7758          23 ETRNYPPLEVFNT---GSRGWGLKALTDLKRGQFVVE   56 (235)
Q Consensus        23 ~~~~~~~lev~~s---~~~G~GlfA~~~I~~Ge~I~e   56 (235)
                      ++...|...+...   ....+-++|+++|++|+-|.-
T Consensus       123 NHsc~pn~~~~~~~~~~~~~~~~~a~r~I~~GeEi~i  159 (162)
T PF00856_consen  123 NHSCDPNCEVSFDFDGDGGCLVVRATRDIKKGEEIFI  159 (162)
T ss_dssp             EEESSTSEEEEEEEETTTTEEEEEESS-B-TTSBEEE
T ss_pred             ccccccccceeeEeecccceEEEEECCccCCCCEEEE
Confidence            3344556777766   688999999999999998864


No 28 
>KOG2155|consensus
Probab=38.33  E-value=17  Score=34.10  Aligned_cols=48  Identities=17%  Similarity=0.249  Sum_probs=34.2

Q ss_pred             ccccccCCCCCCceeEEEEECCc--eEEEEEEccCCCCCCeEEEecCCCc
Q psy7758         101 LARFMNHSCEPNCTAEKWTVSGD--TRVGLFALRDVPAGTELVFNYELQK  148 (235)
Q Consensus       101 ~aRfiNHSC~PN~~~~~~~~~~~--~~~~i~A~rdI~~GEElt~~Y~~~~  148 (235)
                      ++.-+.||=.||..+.....-..  .--++.-+|++..|||||-|+....
T Consensus       204 fGsrvrHsdePnf~~aPf~fmPq~vaYsimwp~k~~~tgeE~trDfasg~  253 (631)
T KOG2155|consen  204 FGSRVRHSDEPNFRIAPFMFMPQNVAYSIMWPTKPVNTGEEITRDFASGV  253 (631)
T ss_pred             hhhhhccCCCCcceeeeheecchhcceeEEeeccCCCCchHHHHHHhhcC
Confidence            34568999999987654333222  2334688999999999999986544


No 29 
>COG1188 Ribosome-associated heat shock protein implicated in the recycling of the 50S subunit (S4 paralog) [Translation, ribosomal structure and biogenesis]
Probab=34.03  E-value=47  Score=24.65  Aligned_cols=23  Identities=13%  Similarity=0.189  Sum_probs=20.1

Q ss_pred             EEEccCCCCCCeEEEecCCCccC
Q psy7758         128 LFALRDVPAGTELVFNYELQKAD  150 (235)
Q Consensus       128 i~A~rdI~~GEElt~~Y~~~~~~  150 (235)
                      ..+..+++.|++|+|.|+...+.
T Consensus        43 aKpS~~VK~GD~l~i~~~~~~~~   65 (100)
T COG1188          43 AKPSKEVKVGDILTIRFGNKEFT   65 (100)
T ss_pred             cccccccCCCCEEEEEeCCcEEE
Confidence            38999999999999999887764


No 30 
>PF00098 zf-CCHC:  Zinc knuckle;  InterPro: IPR001878 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents the CysCysHisCys (CCHC) type zinc finger domains, and have the sequence:  C-X2-C-X4-H-X4-C  where X can be any amino acid, and number indicates the number of residues. These 18 residues CCHC zinc finger domains are mainly found in the nucleocapsid protein of retroviruses. It is required for viral genome packaging and for early infection process [, , ]. It is also found in eukaryotic proteins involved in RNA binding or single-stranded DNA binding []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding; PDB: 2L44_A 1A1T_A 1WWG_A 1U6P_A 1WWD_A 1WWE_A 1A6B_B 1F6U_A 1MFS_A 1NCP_C ....
Probab=32.11  E-value=24  Score=17.55  Aligned_cols=10  Identities=50%  Similarity=1.109  Sum_probs=7.8

Q ss_pred             cccccCCCCCe
Q psy7758         218 VCERCGAEGGE  228 (235)
Q Consensus       218 ~~~~c~~~~~~  228 (235)
                      .||.|| +-|-
T Consensus         2 ~C~~C~-~~GH   11 (18)
T PF00098_consen    2 KCFNCG-EPGH   11 (18)
T ss_dssp             BCTTTS-CSSS
T ss_pred             cCcCCC-CcCc
Confidence            599999 7764


No 31 
>TIGR01164 rplP_bact ribosomal protein L16, bacterial/organelle. This model describes bacterial and organellar ribosomal protein L16. The homologous protein of the eukaryotic cytosol is designated L10
Probab=30.34  E-value=80  Score=24.31  Aligned_cols=41  Identities=20%  Similarity=0.221  Sum_probs=29.2

Q ss_pred             CEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHH
Q psy7758          29 PLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRR   71 (235)
Q Consensus        29 ~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~   71 (235)
                      +++++...++|-=..-...|++|++|+|..|  ++.+.+..-.
T Consensus        77 ~~~~RMGkGKG~~~~~varV~~G~ilfEi~~--~~~~~a~~al  117 (126)
T TIGR01164        77 PLETRMGKGKGNPEYWVAVVKPGKILFEIAG--VPEEVAREAF  117 (126)
T ss_pred             chhccccCCCCCCCEEEEEECCCCEEEEEeC--CCHHHHHHHH
Confidence            4555666666666666778999999999999  5665554433


No 32 
>smart00400 ZnF_CHCC zinc finger.
Probab=29.09  E-value=15  Score=23.64  Aligned_cols=15  Identities=33%  Similarity=0.798  Sum_probs=12.0

Q ss_pred             ccccccccCCCCCeEE
Q psy7758         215 VVEVCERCGAEGGEML  230 (235)
Q Consensus       215 ~~d~~~~c~~~~~~~~  230 (235)
                      .-+.||.|| .||..+
T Consensus        22 n~~~Cf~cg-~gGd~i   36 (55)
T smart00400       22 QFFHCFGCG-AGGNVI   36 (55)
T ss_pred             CEEEEeCCC-CCCCHH
Confidence            347999999 898754


No 33 
>TIGR03569 NeuB_NnaB N-acetylneuraminate synthase. This family is a subset of the Pfam model pfam03102 and is believed to include only authentic NeuB N-acetylneuraminate (sialic acid) synthase enzymes. The majority of the genes identified by this model are observed adjacent to both the NeuA and NeuC genes which together effect the biosynthesis of CMP-N-acetylneuraminate from UDP-N-acetylglucosamine.
Probab=25.82  E-value=34  Score=30.87  Aligned_cols=20  Identities=25%  Similarity=0.350  Sum_probs=17.0

Q ss_pred             EEEEEEccCCCCCCeEEEec
Q psy7758         125 RVGLFALRDVPAGTELVFNY  144 (235)
Q Consensus       125 ~~~i~A~rdI~~GEElt~~Y  144 (235)
                      +-.|+|.+||++||.||.+-
T Consensus       277 rrsl~a~~di~~G~~lt~~~  296 (329)
T TIGR03569       277 RKSLVAAKDIKKGEIFTEDN  296 (329)
T ss_pred             ceEEEEccCcCCCCEecHHh
Confidence            55689999999999998754


No 34 
>PF05077 DUF678:  Protein of unknown function (DUF678);  InterPro: IPR007769 This family contains poxvirus proteins belonging to the A19 family. The proteins are of unknown function.
Probab=25.66  E-value=34  Score=23.82  Aligned_cols=15  Identities=27%  Similarity=0.523  Sum_probs=8.5

Q ss_pred             CCCcccccccccccC
Q psy7758         209 KGEVPAVVEVCERCG  223 (235)
Q Consensus       209 ~~~~~~~~d~~~~c~  223 (235)
                      +..+.+.||.|.-|.
T Consensus        15 ~t~v~~~~d~c~tCS   29 (74)
T PF05077_consen   15 KTTVDDEEDDCTTCS   29 (74)
T ss_pred             CceeecCCCCccchh
Confidence            335554467766664


No 35 
>PF00628 PHD:  PHD-finger;  InterPro: IPR019787 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents the PHD (homeodomain) zinc finger domain [,], which is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in chromatin-mediated transcriptional regulation. The PHD finger motif is reminiscent of, but distinct from the C3HC4 type RING finger. The function of this domain is not yet known but in analogy with the LIM domain it could be involved in protein-protein interaction and be important for the assembly or activity of multicomponent complexes involved in transcriptional activation or repression. Alternatively, the interactions could be intra-molecular and be important in maintaining the structural integrity of the protein. In similarity to the RING finger and the LIM domain, the PHD finger is thought to bind two zinc ions. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0005515 protein binding; PDB: 3ZVY_A 2LGG_A 3SOW_A 3SOU_B 3ASL_A 3ASK_A 3ZVZ_B 3T6R_A 2LGK_A 3SOX_B ....
Probab=24.14  E-value=50  Score=20.48  Aligned_cols=15  Identities=40%  Similarity=1.241  Sum_probs=10.8

Q ss_pred             ccccCCC---CCeEEEeec
Q psy7758         219 CERCGAE---GGEMLRCAL  234 (235)
Q Consensus       219 ~~~c~~~---~~~~~~~~~  234 (235)
                      |..|+ .   .+++|.||.
T Consensus         2 C~vC~-~~~~~~~~i~C~~   19 (51)
T PF00628_consen    2 CPVCG-QSDDDGDMIQCDS   19 (51)
T ss_dssp             BTTTT-SSCTTSSEEEBST
T ss_pred             CcCCC-CcCCCCCeEEcCC
Confidence            56666 4   688898874


No 36 
>KOG2084|consensus
Probab=22.88  E-value=69  Score=29.35  Aligned_cols=25  Identities=24%  Similarity=0.300  Sum_probs=20.9

Q ss_pred             CCCceEEEEceeCCCCCEEEEeccE
Q psy7758          36 GSRGWGLKALTDLKRGQFVVEYVGE   60 (235)
Q Consensus        36 ~~~G~GlfA~~~I~~Ge~I~ey~G~   60 (235)
                      +..|.|++|+++|.+|++|++-...
T Consensus        22 ~~~Gr~~~a~~~i~~g~~i~~e~p~   46 (482)
T KOG2084|consen   22 PELGRGLVATQAIEAGEVILEEEPL   46 (482)
T ss_pred             cccCcceeeecccCCCceEEecCcc
Confidence            6889999999999999988764333


No 37 
>PRK09203 rplP 50S ribosomal protein L16; Reviewed
Probab=22.12  E-value=1.3e+02  Score=23.44  Aligned_cols=42  Identities=21%  Similarity=0.251  Sum_probs=28.7

Q ss_pred             CEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHH
Q psy7758          29 PLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRR   72 (235)
Q Consensus        29 ~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~   72 (235)
                      +++++...++|-=-.-...|++|++|+|..|  ++.+.+.+-..
T Consensus        78 ~~~~RMGkGKG~~~~~varVk~G~iifEi~~--~~~~~a~~al~  119 (138)
T PRK09203         78 PAEVRMGKGKGSPEYWVAVVKPGRILFEIAG--VSEELAREALR  119 (138)
T ss_pred             hhhccccCCCCCCcEEEEEECCCCEEEEEeC--CCHHHHHHHHH
Confidence            4455555666655566668999999999999  56655544443


No 38 
>TIGR03170 flgA_cterm flagella basal body P-ring formation protein FlgA. This model describes a conserved C-terminal region of the flagellar basal body P-ring formation protein FlgA. This sequence region contains a SAF domain, now described by Pfam model pfam08666.
Probab=21.09  E-value=43  Score=25.04  Aligned_cols=16  Identities=19%  Similarity=0.216  Sum_probs=13.8

Q ss_pred             EEEccCCCCCCeEEEe
Q psy7758         128 LFALRDVPAGTELVFN  143 (235)
Q Consensus       128 i~A~rdI~~GEElt~~  143 (235)
                      ++|.++|++||-|+-+
T Consensus         2 ~Va~r~I~~G~~i~~~   17 (122)
T TIGR03170         2 VVAKRPLKRGEVISPE   17 (122)
T ss_pred             EEECcccCCCCCcCHH
Confidence            6899999999998764


No 39 
>PF06718 DUF1203:  Protein of unknown function (DUF1203);  InterPro: IPR009593 This family consists of several hypothetical bacterial proteins of around 155 residues in length. Family members are present in Rhizobium, Agrobacterium and Streptomyces species.
Probab=21.09  E-value=75  Score=24.23  Aligned_cols=18  Identities=39%  Similarity=0.711  Sum_probs=14.1

Q ss_pred             EEccCCCCCCeE-EEecCC
Q psy7758         129 FALRDVPAGTEL-VFNYEL  146 (235)
Q Consensus       129 ~A~rdI~~GEEl-t~~Y~~  146 (235)
                      +.++||++||++ .++|..
T Consensus         6 ~cL~~~~~Ge~~lLlsy~p   24 (117)
T PF06718_consen    6 HCLRDAEPGEELLLLSYRP   24 (117)
T ss_pred             EecccCCCCCeEEEEecCC
Confidence            479999999985 468854


No 40 
>smart00249 PHD PHD zinc finger. The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the CHL00044 rpl16 ribosomal protein L16
Probab=21.05  E-value=1.8e+02  Score=22.70  Aligned_cols=39  Identities=18%  Similarity=0.186  Sum_probs=27.6

Q ss_pred             CEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHH
Q psy7758          29 PLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNR   69 (235)
Q Consensus        29 ~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~   69 (235)
                      +.+++...++|-=..-...|++|++|+|..|  ++.+.+.+
T Consensus        78 p~e~RMGkGKG~~~~~va~V~~G~ilfEi~g--~~~~~ak~  116 (135)
T CHL00044         78 PAETRMGSGKGSPEYWVAVVKPGRILYEMGG--VSETIARA  116 (135)
T ss_pred             cccccccCCCCCccEEEEEECCCcEEEEEeC--CCHHHHHH
Confidence            3566666667766677778999999999998  34444333


Done!