Query         000554
Match_columns 1428
No_of_seqs    804 out of 4735
Neff          5.7 
Searched_HMMs 46136
Date          Mon Apr  1 19:04:17 2013
Command       hhsearch -i /work/01045/syshi/lefta3m/000554.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/leftcdd/000554hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1082 Histone H3 (Lys9) meth  99.9 7.6E-26 1.7E-30  267.6  12.4  169 1224-1428   52-227 (364)
  2 KOG1141 Predicted histone meth  99.9 3.7E-25   8E-30  263.8   2.2  239 1160-1421  569-841 (1262)
  3 KOG2462 C2H2-type Zn-finger pr  99.9 2.7E-22 5.8E-27  220.7   4.5  133  882-1038  131-263 (279)
  4 KOG2462 C2H2-type Zn-finger pr  99.8 1.3E-21 2.9E-26  215.1   4.2  137  915-1069  130-266 (279)
  5 KOG1074 Transcriptional repres  99.8 7.4E-21 1.6E-25  230.5   6.1  173  882-1070  606-931 (958)
  6 KOG1074 Transcriptional repres  99.8 3.4E-19 7.5E-24  216.2  12.5   88  846-942   604-694 (958)
  7 KOG3608 Zn finger proteins [Ge  99.8 1.5E-19 3.3E-24  201.8   4.3  199  833-1059  193-399 (467)
  8 PF05033 Pre-SET:  Pre-SET moti  99.7 1.8E-18 3.9E-23  169.8   7.6  103 1236-1371    1-103 (103)
  9 KOG3608 Zn finger proteins [Ge  99.7 4.9E-18 1.1E-22  189.9   0.5  188  851-1070  183-377 (467)
 10 smart00468 PreSET N-terminal t  99.7 6.6E-17 1.4E-21  157.6   7.8   96 1234-1363    1-98  (98)
 11 KOG3623 Homeobox transcription  99.6 1.4E-16   3E-21  190.4   3.3   79  983-1067  893-971 (1007)
 12 KOG4442 Clathrin coat binding   99.5 4.2E-15 9.1E-20  179.6   5.1   73 1353-1425   92-166 (729)
 13 KOG3576 Ovo and related transc  99.3   2E-13 4.4E-18  143.9  -0.4  118  915-1045  117-239 (267)
 14 KOG3576 Ovo and related transc  99.2 1.7E-12 3.6E-17  137.1   0.4   88  843-942   113-200 (267)
 15 KOG3623 Homeobox transcription  99.2 3.1E-12 6.6E-17  153.9   1.5  123  882-1042  211-333 (1007)
 16 KOG1079 Transcriptional repres  99.1 1.7E-11 3.7E-16  147.9   3.9   99 1294-1428  534-650 (739)
 17 KOG1141 Predicted histone meth  99.0 4.1E-10   9E-15  136.9   5.7  185 1216-1420  850-1054(1262)
 18 PLN03086 PRLI-interacting fact  98.8 6.1E-09 1.3E-13  128.0   8.0  144  848-1041  408-564 (567)
 19 PLN03086 PRLI-interacting fact  98.8 8.3E-09 1.8E-13  126.9   6.9  140  882-1064  408-559 (567)
 20 PHA00733 hypothetical protein   98.6 3.8E-08 8.2E-13  101.0   3.5   86  914-1043   39-124 (128)
 21 PF01352 KRAB:  KRAB box;  Inte  98.4 6.9E-08 1.5E-12   79.7   0.5   35  732-771     1-40  (41)
 22 PHA00733 hypothetical protein   98.3   3E-07 6.6E-12   94.3   2.4   93  835-940    28-124 (128)
 23 KOG3993 Transcription factor (  98.2 2.5E-07 5.4E-12  107.5   0.4   39  834-873   282-320 (500)
 24 PHA02768 hypothetical protein;  98.1 1.3E-06 2.8E-11   76.2   2.5   43  985-1035    6-48  (55)
 25 KOG3993 Transcription factor (  98.1   4E-07 8.7E-12  105.8  -1.2  181  847-1042  267-483 (500)
 26 KOG1083 Putative transcription  98.1 4.1E-07 8.9E-12  114.5  -2.8   56 1368-1423 1166-1222(1306)
 27 PHA02768 hypothetical protein;  97.9 1.9E-06 4.1E-11   75.2   0.2   45 1018-1064    5-49  (55)
 28 PF13465 zf-H2C2_2:  Zinc-finge  97.9 6.6E-06 1.4E-10   61.5   1.8   26  971-996     1-26  (26)
 29 PHA00732 hypothetical protein   97.4 9.7E-05 2.1E-09   69.9   3.2   48  984-1043    1-49  (79)
 30 PF13465 zf-H2C2_2:  Zinc-finge  97.4  0.0001 2.2E-09   55.2   1.9   26  999-1030    1-26  (26)
 31 PHA00616 hypothetical protein   97.2 5.9E-05 1.3E-09   63.1  -0.4   26  984-1010    1-26  (44)
 32 smart00317 SET SET (Su(var)3-9  97.2 0.00044 9.4E-09   67.9   5.0   43 1380-1422    1-43  (116)
 33 PHA00616 hypothetical protein   97.0 0.00027   6E-09   59.2   1.0   34 1018-1051    1-34  (44)
 34 PHA00732 hypothetical protein   96.9 0.00047   1E-08   65.3   2.2   45  881-936     1-45  (79)
 35 PF05605 zf-Di19:  Drought indu  96.8 0.00087 1.9E-08   58.9   2.8   52  984-1042    2-53  (54)
 36 KOG1085 Predicted methyltransf  96.7  0.0011 2.4E-08   74.6   3.7   53 1375-1427  252-304 (392)
 37 COG5189 SFP1 Putative transcri  96.6  0.0013 2.8E-08   75.0   2.7   57  982-1038  347-418 (423)
 38 KOG1080 Histone H3 (Lys4) meth  96.4   0.002 4.2E-08   85.0   3.0   45 1379-1423  866-910 (1005)
 39 PF05605 zf-Di19:  Drought indu  96.1  0.0021 4.6E-08   56.4   1.1   51  882-939     3-53  (54)
 40 PF00096 zf-C2H2:  Zinc finger,  95.6  0.0068 1.5E-07   43.6   1.6   23  985-1008    1-23  (23)
 41 PF00096 zf-C2H2:  Zinc finger,  95.6  0.0029 6.2E-08   45.6  -0.4   23 1019-1041    1-23  (23)
 42 COG5189 SFP1 Putative transcri  95.3  0.0063 1.4E-07   69.5   1.1   71  844-935   346-418 (423)
 43 PF12756 zf-C2H2_2:  C2H2 type   95.3  0.0069 1.5E-07   58.3   1.1   73  849-938     1-73  (100)
 44 PF12756 zf-C2H2_2:  C2H2 type   95.2   0.011 2.3E-07   57.1   2.3   71  917-1005    1-71  (100)
 45 COG5048 FOG: Zn-finger [Genera  94.8   0.028 6.2E-07   66.9   4.7   62  990-1055  394-455 (467)
 46 cd01395 HMT_MBD Methyl-CpG bin  94.6  0.0072 1.6E-07   54.3  -0.8   37 1184-1220    1-49  (60)
 47 KOG2231 Predicted E3 ubiquitin  94.4   0.031 6.8E-07   70.9   3.9  140  880-1050  114-275 (669)
 48 PF13912 zf-C2H2_6:  C2H2-type   94.2   0.016 3.5E-07   43.4   0.5   24  984-1008    1-24  (27)
 49 PF13912 zf-C2H2_6:  C2H2-type   94.0   0.033 7.2E-07   41.7   1.9   26 1018-1043    1-26  (27)
 50 PF13894 zf-C2H2_4:  C2H2-type   94.0   0.024 5.2E-07   40.5   1.0   22  883-904     2-23  (24)
 51 KOG2231 Predicted E3 ubiquitin  93.2   0.088 1.9E-06   66.9   4.7   74  917-1003  117-201 (669)
 52 PF13894 zf-C2H2_4:  C2H2-type   93.1    0.07 1.5E-06   38.0   2.2   18  985-1002    1-18  (24)
 53 COG5048 FOG: Zn-finger [Genera  92.7     0.1 2.2E-06   62.3   4.1  168  846-1036  288-463 (467)
 54 KOG1146 Homeobox protein [Gene  92.7   0.069 1.5E-06   71.2   2.8  157  884-1067  439-639 (1406)
 55 COG2940 Proteins containing SE  92.6   0.045 9.8E-07   68.4   1.0   72 1354-1425  307-378 (480)
 56 KOG1146 Homeobox protein [Gene  92.4    0.04 8.7E-07   73.3   0.2   84  849-937   438-540 (1406)
 57 PRK04860 hypothetical protein;  92.1   0.088 1.9E-06   56.5   2.3   38  984-1031  119-156 (160)
 58 PF09237 GAGA:  GAGA factor;  I  91.9   0.053 1.1E-06   46.9   0.3   30 1016-1045   22-51  (54)
 59 smart00355 ZnF_C2H2 zinc finge  91.4   0.072 1.6E-06   38.4   0.6   24  985-1009    1-24  (26)
 60 smart00355 ZnF_C2H2 zinc finge  91.1    0.14 3.1E-06   36.8   1.9   24 1019-1042    1-24  (26)
 61 smart00570 AWS associated with  90.9   0.093   2E-06   45.9   0.8   25 1353-1377   26-50  (51)
 62 cd05162 PWWP The PWWP domain,   90.3    0.26 5.7E-06   47.3   3.4   60  157-220     6-66  (87)
 63 PRK04860 hypothetical protein;  89.4    0.14   3E-06   55.1   0.8   39 1017-1059  118-156 (160)
 64 cd05840 SPBC215_ISWI_like The   88.9    0.31 6.6E-06   47.9   2.7   59  157-216     6-65  (93)
 65 PF09237 GAGA:  GAGA factor;  I  86.5    0.38 8.2E-06   41.9   1.5   29  880-908    23-51  (54)
 66 COG5236 Uncharacterized conser  85.3    0.43 9.4E-06   55.6   1.8  103  915-1039  151-272 (493)
 67 PF12874 zf-met:  Zinc-finger o  84.7    0.26 5.7E-06   36.1  -0.2   21 1019-1039    1-21  (25)
 68 PF11722 zf-TRM13_CCCH:  CCCH z  84.0    0.35 7.5E-06   38.1   0.2   29  533-561     2-30  (31)
 69 PF13909 zf-H2C2_5:  C2H2-type   81.8     0.6 1.3E-05   34.1   0.7   23  882-905     1-23  (24)
 70 PF12874 zf-met:  Zinc-finger o  81.4    0.63 1.4E-05   34.1   0.7   21  883-903     2-22  (25)
 71 cd07765 KRAB_A-box KRAB (Krupp  81.2    0.88 1.9E-05   32.7   1.5   28  732-764     1-28  (40)
 72 PF12171 zf-C2H2_jaz:  Zinc-fin  80.8    0.92   2E-05   34.2   1.4   22 1019-1040    2-23  (27)
 73 PF13909 zf-H2C2_5:  C2H2-type   77.1     1.8 3.9E-05   31.5   2.0   17  985-1002    1-17  (24)
 74 PF12171 zf-C2H2_jaz:  Zinc-fin  76.5     1.3 2.9E-05   33.3   1.2   21  882-902     2-22  (27)
 75 COG5236 Uncharacterized conser  75.6     1.8 3.9E-05   50.7   2.4  135  848-1010  152-307 (493)
 76 KOG2893 Zn finger protein [Gen  74.3     1.4 2.9E-05   49.4   1.0   46  987-1042   13-59  (341)
 77 KOG4173 Alpha-SNAP protein [In  74.1    0.64 1.4E-05   51.0  -1.5   91  880-1010   78-172 (253)
 78 KOG2482 Predicted C2H2-type Zn  72.8     3.7 8.1E-05   48.3   4.0   76  895-978   129-217 (423)
 79 KOG2482 Predicted C2H2-type Zn  70.2     2.7 5.9E-05   49.4   2.2   78  916-1008  280-357 (423)
 80 cd05837 MSH6_like The PWWP dom  67.5     5.7 0.00012   40.2   3.7   63  157-219     8-71  (110)
 81 KOG2893 Zn finger protein [Gen  62.2     3.1 6.7E-05   46.7   0.6   47  884-940    13-59  (341)
 82 KOG2785 C2H2-type Zn-finger pr  61.2     9.5  0.0002   46.0   4.4   55  983-1038  165-240 (390)
 83 smart00391 MBD Methyl-CpG bind  56.4     4.8  0.0001   38.3   0.8   36 1184-1219    3-52  (77)
 84 PF13913 zf-C2HC_2:  zinc-finge  55.2     8.3 0.00018   28.9   1.7   18  985-1003    3-20  (25)
 85 smart00451 ZnF_U1 U1-like zinc  54.9     4.3 9.3E-05   32.0   0.2   21 1018-1038    3-23  (35)
 86 PF13913 zf-C2HC_2:  zinc-finge  53.8       7 0.00015   29.3   1.1   19  883-902     4-22  (25)
 87 KOG4173 Alpha-SNAP protein [In  51.6     5.5 0.00012   44.0   0.4   93  952-1048   78-177 (253)
 88 smart00451 ZnF_U1 U1-like zinc  50.4     8.6 0.00019   30.3   1.2   22  915-936     3-24  (35)
 89 COG4049 Uncharacterized protei  47.4     8.3 0.00018   34.4   0.7   32  978-1009   11-42  (65)
 90 PF09986 DUF2225:  Uncharacteri  47.2     6.1 0.00013   44.6  -0.1   20  983-1002    4-23  (214)
 91 smart00293 PWWP domain with co  42.5      27 0.00059   31.7   3.3   56  157-215     6-62  (63)
 92 cd00350 rubredoxin_like Rubred  41.4      18  0.0004   28.8   1.8   11  985-995     2-12  (33)
 93 PF00855 PWWP:  PWWP domain;  I  41.1      26 0.00057   33.1   3.2   56  157-219     6-62  (86)
 94 COG1997 RPL43A Ribosomal prote  39.3      13 0.00028   36.2   0.8   34  983-1032   34-67  (89)
 95 PF06524 NOA36:  NOA36 protein;  38.9      32  0.0007   39.6   3.8   27 1016-1042  207-233 (314)
 96 TIGR02098 MJ0042_CXXC MJ0042 f  38.5      16 0.00035   29.6   1.1   34  985-1029    3-36  (38)
 97 cd05838 WHSC1_related The PWWP  38.0      25 0.00055   34.8   2.6   54  158-214     7-61  (95)
 98 TIGR00622 ssl1 transcription f  37.8      39 0.00085   34.6   3.9   48  883-939    57-104 (112)
 99 TIGR00373 conserved hypothetic  37.5      24 0.00052   38.1   2.6   40  974-1028   99-138 (158)
100 PF14353 CpXC:  CpXC protein     37.1      22 0.00049   36.6   2.2   50  986-1042    3-62  (128)
101 KOG3813 Uncharacterized conser  37.0      16 0.00035   45.4   1.2   19 1299-1318  307-325 (640)
102 PF09538 FYDLN_acid:  Protein o  37.0      19 0.00041   36.6   1.6   30  985-1031   10-39  (108)
103 smart00531 TFIIE Transcription  35.9      29 0.00062   36.9   2.8   39  980-1028   95-133 (147)
104 cd01397 HAT_MBD Methyl-CpG bin  35.2      13 0.00029   35.2   0.1   25 1194-1218   23-48  (73)
105 smart00834 CxxC_CXXC_SSSS Puta  35.2      14  0.0003   30.3   0.2   12  985-996     6-17  (41)
106 PRK00464 nrdR transcriptional   33.6      18 0.00039   39.0   0.8   19 1017-1035   27-45  (154)
107 COG1198 PriA Primosomal protei  33.3      27 0.00058   46.3   2.5   43 1111-1154  602-645 (730)
108 KOG2461 Transcription factor B  32.2      90   0.002   38.7   6.5   78  971-1054  318-395 (396)
109 PRK06266 transcription initiat  32.1      30 0.00065   38.1   2.3   35  980-1029  113-147 (178)
110 PF09723 Zn-ribbon_8:  Zinc rib  31.5      16 0.00034   30.8  -0.0   12  985-996     6-17  (42)
111 PHA00626 hypothetical protein   31.4      19 0.00041   32.3   0.4   13 1018-1030   23-35  (59)
112 cd00122 MBD MeCP2, MBD1, MBD2,  31.2      15 0.00033   33.3  -0.1   27 1194-1220   23-50  (62)
113 PF13891 zf-C3Hc3H:  Potential   31.0      15 0.00033   33.8  -0.2   23  587-609     3-25  (65)
114 PF12013 DUF3505:  Protein of u  30.8      52  0.0011   33.1   3.5   27 1017-1043   79-109 (109)
115 TIGR02605 CxxC_CxxC_SSSS putat  30.5      19  0.0004   31.3   0.3   12  985-996     6-17  (52)
116 cd05839 BR140_related The PWWP  30.1      78  0.0017   32.5   4.6   61  157-217     6-80  (111)
117 PF09986 DUF2225:  Uncharacteri  30.0      28  0.0006   39.4   1.6   42 1016-1057    3-59  (214)
118 cd00729 rubredoxin_SM Rubredox  29.6      35 0.00077   27.5   1.7   10  985-994     3-12  (34)
119 PF11722 zf-TRM13_CCCH:  CCCH z  29.4      31 0.00066   27.5   1.3   21  589-609    11-31  (31)
120 COG4049 Uncharacterized protei  27.8      15 0.00033   32.8  -0.7   31  841-871    11-41  (65)
121 KOG2186 Cell growth-regulating  27.8      21 0.00046   40.9   0.2   48  848-904     4-51  (276)
122 COG2888 Predicted Zn-ribbon RN  26.9      46   0.001   30.3   2.1   32  984-1026   27-58  (61)
123 PF13717 zinc_ribbon_4:  zinc-r  26.1      39 0.00084   27.7   1.4   33  985-1028    3-35  (36)
124 COG1996 RPC10 DNA-directed RNA  25.9      34 0.00075   30.1   1.1   29  983-1027    5-33  (49)
125 PF09723 Zn-ribbon_8:  Zinc rib  25.2      38 0.00083   28.5   1.2   13  848-860     6-18  (42)
126 PF02892 zf-BED:  BED zinc fing  25.1      54  0.0012   27.4   2.1   28  981-1008   13-44  (45)
127 TIGR02300 FYDLN_acid conserved  24.9      47   0.001   34.7   2.0   34  985-1035   10-43  (129)
128 KOG2186 Cell growth-regulating  24.0      39 0.00085   38.9   1.4   47  881-936     3-49  (276)
129 cd05834 HDGF_related The PWWP   23.4   1E+02  0.0022   29.9   3.9   52  157-218     8-60  (83)
130 PRK14890 putative Zn-ribbon RN  23.1      54  0.0012   29.9   1.8   32  983-1026   24-56  (59)
131 PF09845 DUF2072:  Zn-ribbon co  22.8      45 0.00096   35.0   1.4   15  984-998     1-15  (131)
132 TIGR00622 ssl1 transcription f  22.8      86  0.0019   32.2   3.4   50  848-905    56-105 (112)
133 PF03604 DNA_RNApol_7kD:  DNA d  22.7      44 0.00095   26.9   1.0   11  985-995     1-11  (32)
134 PF08879 WRC:  WRC;  InterPro:   22.6      30 0.00065   30.0   0.1   20  589-608    13-32  (46)
135 PF12013 DUF3505:  Protein of u  22.2   1E+02  0.0022   31.0   3.8   24  985-1008   81-108 (109)
136 PF13719 zinc_ribbon_5:  zinc-r  21.5      61  0.0013   26.6   1.7   32  986-1028    4-35  (37)
137 PRK00464 nrdR transcriptional   21.3      46   0.001   35.9   1.2   16  882-897    29-44  (154)
138 KOG2593 Transcription initiati  21.0      63  0.0014   39.9   2.4   42  977-1027  121-162 (436)
139 PF14353 CpXC:  CpXC protein     20.1      38 0.00083   34.9   0.3   15  848-862     2-16  (128)
140 PRK00398 rpoP DNA-directed RNA  20.0      46   0.001   28.3   0.8   13  984-996     3-15  (46)

No 1  
>KOG1082 consensus Histone H3 (Lys9) methyltransferase SUV39H1/Clr4, required for transcriptional silencing [Chromatin structure and dynamics; Transcription]
Probab=99.93  E-value=7.6e-26  Score=267.56  Aligned_cols=169  Identities=34%  Similarity=0.582  Sum_probs=137.1

Q ss_pred             CCCCCcCeeEeecCcCCCCCCCeEEEECCCCcccccccCCCCCcccccCCCCCCCcEEccccCCCCCCCCcccCCCCCcc
Q 000554         1224 RKPLLRGTVLCDDISSGLESVPVACVVDDGLLETLCISADSSDSQKTRCSMPWESFTYVTKPLLDQSLDLDAESLQLGCA 1303 (1428)
Q Consensus      1224 ~~~~~r~~vi~~DIS~G~E~~PV~~vnd~d~~~~~~~~g~~s~~~~~~~~~Pp~~F~Yit~~i~~~~~~~~~~~~~~gC~ 1303 (1428)
                      .....+...+.+||+.|.|++||+.+|++|++                  .| ..|+|++..++..+. ........+|.
T Consensus        52 ~~~~~~~~~~~~d~~~~~e~~~v~~~n~id~~------------------~~-~~f~y~~~~~~~~~~-~~~~~~~~~c~  111 (364)
T KOG1082|consen   52 DKDKLEAKSELEDIALGSENLPVPLVNRIDED------------------AP-LYFQYIATEIVDPGE-LSDCENSTGCR  111 (364)
T ss_pred             cccccccccccccccCccccCceeeeeeccCC------------------cc-ccceeccccccCccc-cccCccccCCC
Confidence            34456777889999999999999999999974                  12 579999999888852 22334467999


Q ss_pred             cCCCCcCCCC---CCccccccccccccccccCCCCCCCcccCCCCC--eeecCCccccccCcCCCCCCCCCCceeeccce
Q 000554         1304 CANSTCFPET---CDHVYLFDNDYEDAKDIDGKSVHGRFPYDQTGR--VILEEGYLIYECNHMCSCDRTCPNRVLQNGVR 1378 (1428)
Q Consensus      1304 C~~~~C~~~~---C~C~~l~~~~y~~~~~~~g~~~~~~~~Y~~~G~--l~~~~~~~IyECn~~C~C~~~C~NRvvQ~G~~ 1378 (1428)
                      |.+ .|....   |.|..               .+.+.++|..+|.  .....+.+||||+..|+|+.+|.|||+|+|++
T Consensus       112 C~~-~~~~~~~~~C~C~~---------------~n~~~~~~~~~~~~~~~~~~~~~i~EC~~~C~C~~~C~nRv~q~g~~  175 (364)
T KOG1082|consen  112 CCS-SCSSVLPLTCLCER---------------HNGGLVAYTCDGDCGTLGKFKEPVFECSVACGCHPDCANRVVQKGLQ  175 (364)
T ss_pred             ccC-CCCCCCCccccChH---------------hhCCccccccCCccccccccCccccccccCCCCCCcCcchhhccccc
Confidence            986 343332   77743               2345678877763  33456679999999999999999999999999


Q ss_pred             eeEEEEeecCCccceeecccCCCCCEEEEeeeEEcCHHHHHHHhhh--ccCC
Q 000554         1379 VKLEVFKTENKGWAVRAGQAILRGTFVCEYIGEVLDELETNKRRSR--LLFD 1428 (1428)
Q Consensus      1379 ~~LeVFkT~~kGWGVra~~~Ip~GtFIcEYvGEvIt~~Ea~~R~~~--YlFD 1428 (1428)
                      .+|+||||..+|||||++++||+|+|||||+|||++..|+++|...  |+||
T Consensus       176 ~~leIfrt~~kGwgvRs~~~I~~G~fvcEyaGe~~t~~e~~~~~~~~~~~~~  227 (364)
T KOG1082|consen  176 FHLEVFRTPEKGWGVRTLDPIPAGEFVCEYAGEVLTSEEAQRRTHLREYLDD  227 (364)
T ss_pred             cceEEEecCCceeeecccccccCCCeeEEEeeEecChHHhhhcccccccccc
Confidence            9999999999999999999999999999999999999999998543  6654


No 2  
>KOG1141 consensus Predicted histone methyl transferase [Chromatin structure and dynamics]
Probab=99.90  E-value=3.7e-25  Score=263.83  Aligned_cols=239  Identities=22%  Similarity=0.268  Sum_probs=181.0

Q ss_pred             ccceecccccCCCcccCCC--CCCCCCCC-CCcccC-----------CcccccccCCCCCCc-cccccceeee-ccC---
Q 000554         1160 VEWHREGFLCSNGCKIFKD--PHLPPHLE-PLPSVS-----------AGIRSSDSSDFVNNQ-WEVDECHCII-DSR--- 1220 (1428)
Q Consensus      1160 ~~wh~~~~~c~~g~~~~~~--~~~~~Pl~-p~~~~~-----------~~~k~v~~~~p~~~~-w~~~e~~~~l-~~~--- 1220 (1428)
                      +-.|.|...|-+.-....+  +.+-.||+ |..+.|           ...-.|.|.+|||.. +.|.|+.+|| +.+   
T Consensus       569 y~sh~cs~acl~~~~~~~~~~~~g~npl~lp~~~~F~r~~a~~rs~~~~~fhv~yktpcg~~lr~~~el~ryL~et~c~f  648 (1262)
T KOG1141|consen  569 YFSHKCSIACLNAAQIAIMVGQPGGNPLNLPYFLTFHRIRASHRSAYIRDFHVEYKTPCGMPLRMRIELYRYLVETRCKF  648 (1262)
T ss_pred             ccchhhHHHHHhccchhhhccCCCCCccccceEEEeeehhhhhhhhhhhcceeeccCCCccchHHHHHHHHHHHHhcCcE
Confidence            3467788778666555543  56778998 988888           233368899999988 8888877655 321   


Q ss_pred             ----cc---------CCCCCCcCeeEeecCcCCCCCCCeEEEECCCCcccccccCCCCCcccccCCCCCCCcEEccccCC
Q 000554         1221 ----HL---------GRKPLLRGTVLCDDISSGLESVPVACVVDDGLLETLCISADSSDSQKTRCSMPWESFTYVTKPLL 1287 (1428)
Q Consensus      1221 ----~~---------~~~~~~r~~vi~~DIS~G~E~~PV~~vnd~d~~~~~~~~g~~s~~~~~~~~~Pp~~F~Yit~~i~ 1287 (1428)
                          .|         +..++.++++.|-||++|+|.+||.++|++|..                   |++.|.|-.+.|.
T Consensus       649 lf~~~f~~~~yV~~~r~~~p~kp~~~~~Di~~g~e~vpis~~neids~-------------------~lpq~ay~K~~ip  709 (1262)
T KOG1141|consen  649 LFVIGFDRAFYVVRHRAPNPLKPGNRCTDIPCGREHVPISEKNEIDSH-------------------RLPQAAYKKHMIP  709 (1262)
T ss_pred             EEEeecccchheeecccCCCcCCcceeccccCCccccccceeecccCc-------------------CCccchhheeecc
Confidence                11         334578999999999999999999999999852                   3468999988887


Q ss_pred             CCCCCC-cccCCCCCcccCCCCcCCCCCCccccccccccccccccCCCCCCCcccCCCCCeeecCCccccccCcCCCCCC
Q 000554         1288 DQSLDL-DAESLQLGCACANSTCFPETCDHVYLFDNDYEDAKDIDGKSVHGRFPYDQTGRVILEEGYLIYECNHMCSCDR 1366 (1428)
Q Consensus      1288 ~~~~~~-~~~~~~~gC~C~~~~C~~~~C~C~~l~~~~y~~~~~~~g~~~~~~~~Y~~~G~l~~~~~~~IyECn~~C~C~~ 1366 (1428)
                      +...-. -.+.|..+|+|..||-+...|+|.++....-... .........++.|.   |++......+|||+.+|+|.+
T Consensus       710 ~~~nl~n~~~~fl~scdc~~gcid~~kcachQltvk~~~t~-p~~~v~~t~gykyK---Rl~e~~ptg~yEc~k~ckc~~  785 (1262)
T KOG1141|consen  710 TNNNLSNRRKDFLQSCDCPTGCIDSMKCACHQLTVKKKTTG-PNQNVASTNGYKYK---RLIEIRPTGPYECLKACKCCG  785 (1262)
T ss_pred             CCCcccccChhhhhcCCCCcchhhhhhhhHHHHHHHhhccC-CCcccccCcchhhH---HHHHhcCCCHHHHHHhhccCc
Confidence            765312 2366789999999877778999988743211100 01111122345553   444445678999999999986


Q ss_pred             -CCCCceeeccceeeEEEEeecCCccceeecccCCCCCEEEEeeeEEcCHHHHHHH
Q 000554         1367 -TCPNRVLQNGVRVKLEVFKTENKGWAVRAGQAILRGTFVCEYIGEVLDELETNKR 1421 (1428)
Q Consensus      1367 -~C~NRvvQ~G~~~~LeVFkT~~kGWGVra~~~Ip~GtFIcEYvGEvIt~~Ea~~R 1421 (1428)
                       .|.||++|+|.+++|++|+|.++|||+|++++|.+|.|||.|.|-+++++-+++-
T Consensus       786 ~~C~nrmvqhg~qvRlq~fkt~~kGWg~rclddi~~g~fVciy~g~~l~~~~sdks  841 (1262)
T KOG1141|consen  786 PDCLNRMVQHGYQVRLQRFKTIHKGWGRRCLDDITGGNFVCIYPGGALLHQISDKS  841 (1262)
T ss_pred             HHHHHHHhhcCceeEeeeccccccccceEeeeecCCceEEEEecchhhhhhhchhh
Confidence             6999999999999999999999999999999999999999999999998887765


No 3  
>KOG2462 consensus C2H2-type Zn-finger protein [Transcription]
Probab=99.85  E-value=2.7e-22  Score=220.67  Aligned_cols=133  Identities=20%  Similarity=0.302  Sum_probs=77.0

Q ss_pred             ccccccccccCChhhhhhhhhhcccccccccccccccccCCCCCCChhhhhhhhhhcccccccchhhhhccccccCcCCC
Q 000554          882 YACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDSP  961 (1428)
Q Consensus       882 ykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~  961 (1428)
                      |+|..|||.+.+.++|.+|.+.|-.-..   .+.+.|..|+|.|.+...|..|+| +|+           -++.|.+|  
T Consensus       131 ~~c~eCgk~ysT~snLsrHkQ~H~~~~s---~ka~~C~~C~K~YvSmpALkMHir-TH~-----------l~c~C~iC--  193 (279)
T KOG2462|consen  131 YKCPECGKSYSTSSNLSRHKQTHRSLDS---KKAFSCKYCGKVYVSMPALKMHIR-THT-----------LPCECGIC--  193 (279)
T ss_pred             eeccccccccccccccchhhcccccccc---cccccCCCCCceeeehHHHhhHhh-ccC-----------CCcccccc--
Confidence            3344444444444444444444332211   124455555555555555555544 443           23444444  


Q ss_pred             CccccCChhhhhhhhhhcCCccceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhccc
Q 000554          962 KKLELGYSASVENHSENLGSIRKFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRP 1038 (1428)
Q Consensus       962 k~~sf~sks~L~~H~rtHtGeKpykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~H~ 1038 (1428)
                       ++.|...--|+-|+|+|||||||.|+.|+|+|..+++|+. |+++|.+     .|+|+|..|+|+|..++.|.+|.
T Consensus       194 -GKaFSRPWLLQGHiRTHTGEKPF~C~hC~kAFADRSNLRA-HmQTHS~-----~K~~qC~~C~KsFsl~SyLnKH~  263 (279)
T KOG2462|consen  194 -GKAFSRPWLLQGHIRTHTGEKPFSCPHCGKAFADRSNLRA-HMQTHSD-----VKKHQCPRCGKSFALKSYLNKHS  263 (279)
T ss_pred             -cccccchHHhhcccccccCCCCccCCcccchhcchHHHHH-HHHhhcC-----CccccCcchhhHHHHHHHHHHhh
Confidence             3333333355555666777777777778888888888877 6777777     67778888888887777777776


No 4  
>KOG2462 consensus C2H2-type Zn-finger protein [Transcription]
Probab=99.83  E-value=1.3e-21  Score=215.15  Aligned_cols=137  Identities=15%  Similarity=0.095  Sum_probs=126.7

Q ss_pred             ccccccCCCCCCChhhhhhhhhhcccccccchhhhhccccccCcCCCCccccCChhhhhhhhhhcCCccceecCccCccc
Q 000554          915 LQQCIPCGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDSPKKLELGYSASVENHSENLGSIRKFICRFCGLKF  994 (1428)
Q Consensus       915 pfkC~~CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~k~~sf~sks~L~~H~rtHtGeKpykC~~CGKsF  994 (1428)
                      .|+|..|||.+.+.++|.+|.+ +|-.-      ..++.+.|.+|   ++.+.+.-.|+.|+|+|+  -+++|.+|||.|
T Consensus       130 r~~c~eCgk~ysT~snLsrHkQ-~H~~~------~s~ka~~C~~C---~K~YvSmpALkMHirTH~--l~c~C~iCGKaF  197 (279)
T KOG2462|consen  130 RYKCPECGKSYSTSSNLSRHKQ-THRSL------DSKKAFSCKYC---GKVYVSMPALKMHIRTHT--LPCECGICGKAF  197 (279)
T ss_pred             ceeccccccccccccccchhhc-ccccc------cccccccCCCC---CceeeehHHHhhHhhccC--CCcccccccccc
Confidence            6899999999999999999986 77432      22577999999   888888889999999998  789999999999


Q ss_pred             CChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhcccccccCCCccccCCCCCcCcChHHHHhhcCCC
Q 000554          995 DLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRPRFKKGLGAVSYRIRNRGAAGMKKRIQTLKPL 1069 (1428)
Q Consensus       995 s~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~H~r~H~gekpy~C~~C~ksf~~~~~l~~H~ksh 1069 (1428)
                      .+.--|+- |.|+|||     ||||.|+.|+|+|.++++|+.||++|.+.|+|+|..|+|+|..+..|.+|..+-
T Consensus       198 SRPWLLQG-HiRTHTG-----EKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES~  266 (279)
T KOG2462|consen  198 SRPWLLQG-HIRTHTG-----EKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSESA  266 (279)
T ss_pred             cchHHhhc-ccccccC-----CCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhhc
Confidence            99999999 8999999     999999999999999999999999999999999999999999999999998853


No 5  
>KOG1074 consensus Transcriptional repressor SALM [Transcription]
Probab=99.82  E-value=7.4e-21  Score=230.52  Aligned_cols=173  Identities=13%  Similarity=0.122  Sum_probs=142.6

Q ss_pred             ccccccccccCChhhhhhhhhhcccccccccccccccccCCCCCCChhhhhhhhhhcccccccchhhhhccccccC---c
Q 000554          882 YACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVG---E  958 (1428)
Q Consensus       882 ykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~---~  958 (1428)
                      -+|-+|-+...-++.|+.|.++|+|++      ||+|.+||+.|.++.+|+.|+- +|...     ..-+-++.|.   +
T Consensus       606 NqCiiC~rVlSC~saLqmHyrtHtGER------PFkCKiCgRAFtTkGNLkaH~~-vHka~-----p~~R~q~ScP~~~i  673 (958)
T KOG1074|consen  606 NQCIICLRVLSCPSALQMHYRTHTGER------PFKCKICGRAFTTKGNLKAHMS-VHKAK-----PPARVQFSCPSTFI  673 (958)
T ss_pred             cceeeeeecccchhhhhhhhhcccCcC------ccccccccchhccccchhhccc-ccccC-----ccccccccCCchhh
Confidence            489999999999999999999999997      9999999999999999999995 88643     1222467788   7


Q ss_pred             CCCCccccCChhhhhhhhhhcCCc-c------------ceecCccCcccCChhhHHHHHHhhccC---------------
Q 000554          959 DSPKKLELGYSASVENHSENLGSI-R------------KFICRFCGLKFDLLPDLGRHHQAAHMG--------------- 1010 (1428)
Q Consensus       959 C~~k~~sf~sks~L~~H~rtHtGe-K------------pykC~~CGKsFs~~s~L~rHHqrvHtg--------------- 1010 (1428)
                      |   ...|.+.-.|.+|+++|.+. .            .-+|..|.+.|.....+.. ++.-|.+               
T Consensus       674 c---~~kftn~V~lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~-~~se~~~~~s~~~~~~~~~t~t  749 (958)
T KOG1074|consen  674 C---QKKFTNAVTLPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQ-QISEQPSPESEPDEQMDERTET  749 (958)
T ss_pred             h---cccccccccccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchh-hhhccCCcccCCcccccccccc
Confidence            8   66777777899999999842 2            2469999999988877777 5555511               


Q ss_pred             --------------------------------------------------------CCCC--------------------
Q 000554         1011 --------------------------------------------------------PNLV-------------------- 1014 (1428)
Q Consensus      1011 --------------------------------------------------------e~~~-------------------- 1014 (1428)
                                                                              ++..                    
T Consensus       750 ~~~~~tp~~~e~~~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl  829 (958)
T KOG1074|consen  750 EELDVTPPPPENSCGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVL  829 (958)
T ss_pred             cccccCCCccccccccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchh
Confidence                                                                    0000                    


Q ss_pred             ----------------------------------------------CCCCcccCCCCcccCCchhhhcccccccCCCccc
Q 000554         1015 ----------------------------------------------NSRPHKKGIRFYAYKLKSGRLSRPRFKKGLGAVS 1048 (1428)
Q Consensus      1015 ----------------------------------------------~eKpykC~~CgKsFs~ks~L~~H~r~H~gekpy~ 1048 (1428)
                                                                    ......|.+|++.|...+.|..|+|+|+++|||.
T Consensus       830 ~~~~~~~l~eg~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHtg~KPF~  909 (958)
T KOG1074|consen  830 WNQETSMLNEGLATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHTGPKPFF  909 (958)
T ss_pred             hcccccccccccccccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCCCCCCcc
Confidence                                                          0223789999999999999999999999999999


Q ss_pred             cCCCCCcCcChHHHHhhcCCCC
Q 000554         1049 YRIRNRGAAGMKKRIQTLKPLA 1070 (1428)
Q Consensus      1049 C~~C~ksf~~~~~l~~H~ksh~ 1070 (1428)
                      |.+|+++|..+..|..|+.+|.
T Consensus       910 C~fC~~aFttrgnLKvHMgtH~  931 (958)
T KOG1074|consen  910 CHFCEEAFTTRGNLKVHMGTHM  931 (958)
T ss_pred             chhhhhhhhhhhhhhhhhcccc
Confidence            9999999999999999999886


No 6  
>KOG1074 consensus Transcriptional repressor SALM [Transcription]
Probab=99.79  E-value=3.4e-19  Score=216.24  Aligned_cols=88  Identities=27%  Similarity=0.498  Sum_probs=79.4

Q ss_pred             CcccCCCCCcccccccccccccccccchhhhcccCcccccccccccCChhhhhhhhhhcccccccccccccccc---cCC
Q 000554          846 KTHKCKICSQVFLHDQELGVHWMDNHKKEAQWLFRGYACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCI---PCG  922 (1428)
Q Consensus       846 kpykC~~CgK~F~s~s~L~~H~~r~Ht~e~~~l~KpykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~---~Cg  922 (1428)
                      .|-.|-+|-+....++.|+.| .++|++|     +||+|.+||+.|.++.+|+.|+-.|.......  ..|.|+   +|-
T Consensus       604 dPNqCiiC~rVlSC~saLqmH-yrtHtGE-----RPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R--~q~ScP~~~ic~  675 (958)
T KOG1074|consen  604 DPNQCIICLRVLSCPSALQMH-YRTHTGE-----RPFKCKICGRAFTTKGNLKAHMSVHKAKPPAR--VQFSCPSTFICQ  675 (958)
T ss_pred             Cccceeeeeecccchhhhhhh-hhcccCc-----CccccccccchhccccchhhcccccccCcccc--ccccCCchhhhc
Confidence            357899999999999999999 9999999     99999999999999999999999998765332  468999   999


Q ss_pred             CCCCChhhhhhhhhhccccc
Q 000554          923 SHFGNTEELWLHVQSVHAID  942 (1428)
Q Consensus       923 KsF~sks~L~~H~rsvHsgE  942 (1428)
                      +.|.+.-.|.+|++ +|.+.
T Consensus       676 ~kftn~V~lpQhIr-iH~~~  694 (958)
T KOG1074|consen  676 KKFTNAVTLPQHIR-IHLGG  694 (958)
T ss_pred             ccccccccccceEE-eecCC
Confidence            99999999999997 89843


No 7  
>KOG3608 consensus Zn finger proteins [General function prediction only]
Probab=99.77  E-value=1.5e-19  Score=201.77  Aligned_cols=199  Identities=17%  Similarity=0.254  Sum_probs=172.1

Q ss_pred             chhhhhhcccCCCCcccCCCCCcccccccccccccccccchhhhcccCcccccccccccCChhhhhhhhhhccccccccc
Q 000554          833 VLPLAIAGRSEDEKTHKCKICSQVFLHDQELGVHWMDNHKKEAQWLFRGYACAICLDSFTNKKVLESHVQERHHVQFVEQ  912 (1428)
Q Consensus       833 ~~L~~H~r~H~gekpykC~~CgK~F~s~s~L~~H~~r~Ht~e~~~l~KpykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~  912 (1428)
                      ..|.+|.+.|+++|...|+.||..|.++..|-.|+++ .+.-.   ..+|.|..|.|.|.+...|..|+..|-.      
T Consensus       193 ~~LreH~r~Hs~eKvvACp~Cg~~F~~~tkl~DH~rR-qt~l~---~n~fqC~~C~KrFaTeklL~~Hv~rHvn------  262 (467)
T KOG3608|consen  193 YRLREHIRTHSNEKVVACPHCGELFRTKTKLFDHLRR-QTELN---TNSFQCAQCFKRFATEKLLKSHVVRHVN------  262 (467)
T ss_pred             HHHHHHHHhcCCCeEEecchHHHHhccccHHHHHHHh-hhhhc---CCchHHHHHHHHHhHHHHHHHHHHHhhh------
Confidence            3599999999999999999999999999999999543 33221   1689999999999999999999998875      


Q ss_pred             ccccccccCCCCCCChhhhhhhhhhcccccccchhhhhccccccCcCCCCccccCChhhhhhhhhhcCCccceecCc--c
Q 000554          913 CMLQQCIPCGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDSPKKLELGYSASVENHSENLGSIRKFICRF--C  990 (1428)
Q Consensus       913 ~kpfkC~~CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~k~~sf~sks~L~~H~rtHtGeKpykC~~--C  990 (1428)
                        -|+|+.|..+....+.|..|++..|+.+         |||+|+.|   ...+.+.+.|.+|..+|+ +-.|.|+.  |
T Consensus       263 --~ykCplCdmtc~~~ssL~~H~r~rHs~d---------kpfKCd~C---d~~c~~esdL~kH~~~HS-~~~y~C~h~~C  327 (467)
T KOG3608|consen  263 --CYKCPLCDMTCSSASSLTTHIRYRHSKD---------KPFKCDEC---DTRCVRESDLAKHVQVHS-KTVYQCEHPDC  327 (467)
T ss_pred             --cccccccccCCCChHHHHHHHHhhhccC---------CCccccch---hhhhccHHHHHHHHHhcc-ccceecCCCCC
Confidence              6899999999999999999999889877         99999999   777888889999999998 77899988  9


Q ss_pred             CcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhcccc-ccc-----CCCccccCCCCCcCcCh
Q 000554          991 GLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRPR-FKK-----GLGAVSYRIRNRGAAGM 1059 (1428)
Q Consensus       991 GKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~H~r-~H~-----gekpy~C~~C~ksf~~~ 1059 (1428)
                      ..+|.....|++|...+|.|.+   +-+|.|..|++.|++..+|..|++ .|.     |-+.|.++.|..+|.++
T Consensus       328 ~~s~r~~~q~~~H~~evhEg~n---p~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f~~PsGh~RFtYk~~edG~mRL  399 (467)
T KOG3608|consen  328 HYSVRTYTQMRRHFLEVHEGNN---PILYACHCCDRFFTSGKSLSAHLMKKHGFRLPSGHKRFTYKVDEDGFMRL  399 (467)
T ss_pred             cHHHHHHHHHHHHHHHhccCCC---CCceeeecchhhhccchhHHHHHHHhhcccCCCCCCceeeeeccCceeee
Confidence            9999999999998888887854   458999999999999999999984 443     44566677777777543


No 8  
>PF05033 Pre-SET:  Pre-SET motif;  InterPro: IPR007728 This region is found in a number of histone lysine methyltransferases (HMTase), N-terminal to the SET domain; it is generally described as the pre-SET domain. Histone lysine methylation is part of the histone code that regulates chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines and in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception, the HMTases belong to the SET family, which can be classified according to the sequences surrounding the SET domain. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone target and the roles of the invariant residues in the SET domain in determining the methylation specificities.  The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form a triangular cluster, where each of the zinc ions is coordinated by four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils and stabilising the SET domain. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethionine binding and histone tail interactions. 
The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity.; GO: 0008270 zinc ion binding, 0018024 histone-lysine N-methyltransferase activity, 0034968 histone lysine methylation, 0005634 nucleus; PDB: 3K5K_A 2O8J_D 3RJW_B 1ML9_A 1PEG_B 1MVH_A 1MVX_A 3BO5_A 2RFI_B 3MO5_B ....
Probab=99.75  E-value=1.8e-18  Score=169.76  Aligned_cols=103  Identities=31%  Similarity=0.621  Sum_probs=71.1

Q ss_pred             cCcCCCCCCCeEEEECCCCcccccccCCCCCcccccCCCCCCCcEEccccCCCCCCCCcccCCCCCcccCCCCcCCCCCC
Q 000554         1236 DISSGLESVPVACVVDDGLLETLCISADSSDSQKTRCSMPWESFTYVTKPLLDQSLDLDAESLQLGCACANSTCFPETCD 1315 (1428)
Q Consensus      1236 DIS~G~E~~PV~~vnd~d~~~~~~~~g~~s~~~~~~~~~Pp~~F~Yit~~i~~~~~~~~~~~~~~gC~C~~~~C~~~~C~ 1315 (1428)
                      |||.|+|++||+++|++|++                  .||+.|+||+++++..++......+..||+|.++|-.+.+|.
T Consensus         1 Dis~g~e~~pI~~~N~vd~~------------------~~p~~F~Yi~~~~~~~~~~~~~~~~~~~C~C~~~C~~~~~C~   62 (103)
T PF05033_consen    1 DISRGKENVPIPVVNDVDDE------------------PPPPNFEYIPENIYGEGVPDIDPEFLQGCDCSGDCSNPSNCE   62 (103)
T ss_dssp             -TTCTSSSS-EEEEESSSS--------------------SSTSSEE-SS-EESTTSS-TBGGGTS----SSSSTCTTTSH
T ss_pred             CCCCCccCCCEEEEeCCCCC------------------CCCCCeEEeeeEEcCCCccccccccCccCccCCCCCCCCCCc
Confidence            89999999999999999975                  345799999999999987634466678999986433778999


Q ss_pred             ccccccccccccccccCCCCCCCcccCCCCCeeecCCccccccCcCCCCCCCCCCc
Q 000554         1316 HVYLFDNDYEDAKDIDGKSVHGRFPYDQTGRVILEEGYLIYECNHMCSCDRTCPNR 1371 (1428)
Q Consensus      1316 C~~l~~~~y~~~~~~~g~~~~~~~~Y~~~G~l~~~~~~~IyECn~~C~C~~~C~NR 1371 (1428)
                      |+.++               ++.++|+.+|+|.+....+|||||+.|.|+.+|+||
T Consensus        63 C~~~~---------------~~~~~Y~~~g~l~~~~~~~i~EC~~~C~C~~~C~NR  103 (103)
T PF05033_consen   63 CLQRN---------------GGIFAYDSNGRLRIPDKPPIFECNDNCGCSPSCRNR  103 (103)
T ss_dssp             HHCCT---------------SSS-SB-TTSSBSSSSTSEEE---TTSSS-TTSTT-
T ss_pred             Ccccc---------------CccccccCCCcCccCCCCeEEeCCCCCCCCCCCCCC
Confidence            97642               235799999998877889999999999999999998


No 9  
>KOG3608 consensus Zn finger proteins [General function prediction only]
Probab=99.68  E-value=4.9e-18  Score=189.87  Aligned_cols=188  Identities=19%  Similarity=0.208  Sum_probs=165.6

Q ss_pred             CCCCcccccccccccccccccchhhhcccCcccccccccccCChhhhhhhhhhcccccccccccccccccCCCCCCChhh
Q 000554          851 KICSQVFLHDQELGVHWMDNHKKEAQWLFRGYACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCGSHFGNTEE  930 (1428)
Q Consensus       851 ~~CgK~F~s~s~L~~H~~r~Ht~e~~~l~KpykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF~sks~  930 (1428)
                      ..|.+.|.++..|+.| .+.|+++     |...|+.||.-|.++..|..|++..+.-..    .+|.|..|.|.|.+...
T Consensus       183 ~~Ct~~~~~k~~LreH-~r~Hs~e-----KvvACp~Cg~~F~~~tkl~DH~rRqt~l~~----n~fqC~~C~KrFaTekl  252 (467)
T KOG3608|consen  183 AMCTKHMGNKYRLREH-IRTHSNE-----KVVACPHCGELFRTKTKLFDHLRRQTELNT----NSFQCAQCFKRFATEKL  252 (467)
T ss_pred             hhhhhhhccHHHHHHH-HHhcCCC-----eEEecchHHHHhccccHHHHHHHhhhhhcC----CchHHHHHHHHHhHHHH
Confidence            4699999999999999 8999999     999999999999999999999998765432    38999999999999999


Q ss_pred             hhhhhhhcccccccchhhhhccccccCcCCCCccccCChhhhhhhhhh-cCCccceecCccCcccCChhhHHHHHHhhcc
Q 000554          931 LWLHVQSVHAIDFKMSEVAQQHNQSVGEDSPKKLELGYSASVENHSEN-LGSIRKFICRFCGLKFDLLPDLGRHHQAAHM 1009 (1428)
Q Consensus       931 L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~k~~sf~sks~L~~H~rt-HtGeKpykC~~CGKsFs~~s~L~rHHqrvHt 1009 (1428)
                      |..|++ .|..           -|+|+.|   ..+.+..++|..|++. |...|||+|+.|++.|.+.++|.+ |..+|+
T Consensus       253 L~~Hv~-rHvn-----------~ykCplC---dmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~k-H~~~HS  316 (467)
T KOG3608|consen  253 LKSHVV-RHVN-----------CYKCPLC---DMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAK-HVQVHS  316 (467)
T ss_pred             HHHHHH-Hhhh-----------ccccccc---ccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHHHH-HHHhcc
Confidence            999997 6753           5889998   8888888999999985 889999999999999999999999 666998


Q ss_pred             CCCCCCCCCcccCC--CCcccCCchhhhcccc-cccCC--CccccCCCCCcCcChHHHHhhc-CCCC
Q 000554         1010 GPNLVNSRPHKKGI--RFYAYKLKSGRLSRPR-FKKGL--GAVSYRIRNRGAAGMKKRIQTL-KPLA 1070 (1428)
Q Consensus      1010 ge~~~~eKpykC~~--CgKsFs~ks~L~~H~r-~H~ge--kpy~C~~C~ksf~~~~~l~~H~-ksh~ 1070 (1428)
                      .      -.|+|..  |.++|+....|++|++ +|.|.  -+|.|..|.+.|.+-..|..|. |.|+
T Consensus       317 ~------~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~  377 (467)
T KOG3608|consen  317 K------TVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLMKKHG  377 (467)
T ss_pred             c------cceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccchhHHHHHHHhhc
Confidence            6      4799999  9999999999999995 55454  6899999999999888887664 3354


No 10 
>smart00468 PreSET N-terminal to some SET domains. A Cys-rich putative Zn2+-binding domain that occurs N-terminal to some SET domains. Function is unknown. Unpublished.
Probab=99.67  E-value=6.6e-17  Score=157.60  Aligned_cols=96  Identities=34%  Similarity=0.652  Sum_probs=79.5

Q ss_pred             eecCcCCCCCCCeEEEECCCCcccccccCCCCCcccccCCCCCCCcEEccccCCCCCCCC-cccCCCCCcccCCCCcCCC
Q 000554         1234 CDDISSGLESVPVACVVDDGLLETLCISADSSDSQKTRCSMPWESFTYVTKPLLDQSLDL-DAESLQLGCACANSTCFPE 1312 (1428)
Q Consensus      1234 ~~DIS~G~E~~PV~~vnd~d~~~~~~~~g~~s~~~~~~~~~Pp~~F~Yit~~i~~~~~~~-~~~~~~~gC~C~~~~C~~~ 1312 (1428)
                      +.|||+|+|++||++||++|++                  .||++|+||++++++.++.+ ....+..||+|.+ .|.+.
T Consensus         1 ~~Dis~G~E~~pI~~vN~vD~~------------------~~p~~F~Yi~~~~~~~gv~~~~~~~~~~gC~C~~-~C~~~   61 (98)
T smart00468        1 CLDISNGKENVPVPLVNEVDED------------------PPPPDFEYISEYIYGQGVPIDRSPSPLVGCSCSG-DCSSS   61 (98)
T ss_pred             CccccCCccCCCcceEecCCCC------------------CCCCCcEECcceEcCCCcccccCCCCCCCCcCCC-CCCCC
Confidence            3699999999999999999985                  23479999999999998753 4467788999998 57776


Q ss_pred             C-CCccccccccccccccccCCCCCCCcccCCCCCeeecCCccccccCcCCC
Q 000554         1313 T-CDHVYLFDNDYEDAKDIDGKSVHGRFPYDQTGRVILEEGYLIYECNHMCS 1363 (1428)
Q Consensus      1313 ~-C~C~~l~~~~y~~~~~~~g~~~~~~~~Y~~~G~l~~~~~~~IyECn~~C~ 1363 (1428)
                      . |.|+.+               .++.|+|+..+++++..+.+|||||+.|+
T Consensus        62 ~~C~C~~~---------------~~~~~~Y~~~~~~~~~~~~~IyECn~~C~   98 (98)
T smart00468       62 NKCECARK---------------NGGEFAYELNGGLRLKRKPLIYECNSRCS   98 (98)
T ss_pred             CcCCcHhh---------------cCCccCcccCCCEEeCCCCEEEcCCCCCC
Confidence            6 999754               24679997777788889999999999985


No 11 
>KOG3623 consensus Homeobox transcription factor SIP1 [Transcription]
Probab=99.62  E-value=1.4e-16  Score=190.41  Aligned_cols=79  Identities=22%  Similarity=0.255  Sum_probs=72.2

Q ss_pred             cceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhcccccccCCCccccCCCCCcCcChHHH
Q 000554          983 RKFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRPRFKKGLGAVSYRIRNRGAAGMKKR 1062 (1428)
Q Consensus       983 KpykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~H~r~H~gekpy~C~~C~ksf~~~~~l 1062 (1428)
                      .+|.|+.|+|.|...+.|.| |+--|+|     .|||+|.+|.|+|..+.+|..|+|.|.|+|||.|+.|+|.|+....-
T Consensus       893 gmyaCDqCDK~FqKqSSLaR-HKYEHsG-----qRPyqC~iCkKAFKHKHHLtEHkRLHSGEKPfQCdKClKRFSHSGSY  966 (1007)
T KOG3623|consen  893 GMYACDQCDKAFQKQSSLAR-HKYEHSG-----QRPYQCIICKKAFKHKHHLTEHKRLHSGEKPFQCDKCLKRFSHSGSY  966 (1007)
T ss_pred             ccchHHHHHHHHHhhHHHHH-hhhhhcC-----CCCcccchhhHhhhhhhhhhhhhhhccCCCcchhhhhhhhcccccch
Confidence            57999999999999999999 8999999     99999999999999999999999999999999999999999755444


Q ss_pred             HhhcC
Q 000554         1063 IQTLK 1067 (1428)
Q Consensus      1063 ~~H~k 1067 (1428)
                      .+|+.
T Consensus       967 SQHMN  971 (1007)
T KOG3623|consen  967 SQHMN  971 (1007)
T ss_pred             Hhhhc
Confidence            44444


No 12 
>KOG4442 consensus Clathrin coat binding protein/Huntingtin interacting protein HIP1, involved in regulation of endocytosis [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.53  E-value=4.2e-15  Score=179.58  Aligned_cols=73  Identities=42%  Similarity=0.715  Sum_probs=69.6

Q ss_pred             ccccccCc-CCC-CCCCCCCceeeccceeeEEEEeecCCccceeecccCCCCCEEEEeeeEEcCHHHHHHHhhhc
Q 000554         1353 YLIYECNH-MCS-CDRTCPNRVLQNGVRVKLEVFKTENKGWAVRAGQAILRGTFVCEYIGEVLDELETNKRRSRL 1425 (1428)
Q Consensus      1353 ~~IyECn~-~C~-C~~~C~NRvvQ~G~~~~LeVFkT~~kGWGVra~~~Ip~GtFIcEYvGEvIt~~Ea~~R~~~Y 1425 (1428)
                      ....||++ .|. |+..|.|+.+|+....+++||+|++|||||||..+||+|+||.||+||||+.+|+++|...|
T Consensus        92 ~t~iECs~~~C~~cg~~C~NQRFQkkqyA~vevF~Te~KG~GLRA~~dI~~g~FI~EY~GEVI~~~Ef~kR~~~Y  166 (729)
T KOG4442|consen   92 MTSIECSDRECPRCGVYCKNQRFQKKQYAKVEVFLTEKKGCGLRAEEDIPKGQFILEYIGEVIEEKEFEKRVKRY  166 (729)
T ss_pred             hhhcccCCccCCCccccccchhhhhhccCceeEEEecCcccceeeccccCCCcEEeeeccccccHHHHHHHHHHH
Confidence            35679988 999 99999999999999999999999999999999999999999999999999999999998865


No 13 
>KOG3576 consensus Ovo and related transcription factors [Transcription]
Probab=99.32  E-value=2e-13  Score=143.90  Aligned_cols=118  Identities=14%  Similarity=0.167  Sum_probs=95.4

Q ss_pred             ccccccCCCCCCChhhhhhhhhhcccccccchhhhhccccccCcCCCCccccCChhhhhhhhhhcCCccceecCccCccc
Q 000554          915 LQQCIPCGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDSPKKLELGYSASVENHSENLGSIRKFICRFCGLKF  994 (1428)
Q Consensus       915 pfkC~~CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~k~~sf~sks~L~~H~rtHtGeKpykC~~CGKsF  994 (1428)
                      .|.|.+|+|.|.-...|.+|++ .|..-         +.+.|..|   ++.|.....|++|+|+|+|.+||+|..|+|+|
T Consensus       117 ~ftCrvCgK~F~lQRmlnrh~k-ch~~v---------kr~lct~c---gkgfndtfdlkrh~rthtgvrpykc~~c~kaf  183 (267)
T KOG3576|consen  117 SFTCRVCGKKFGLQRMLNRHLK-CHSDV---------KRHLCTFC---GKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAF  183 (267)
T ss_pred             eeeeehhhhhhhHHHHHHHHhh-hccHH---------HHHHHhhc---cCcccchhhhhhhhccccCccccchhhhhHHH
Confidence            5778888888888888888876 67654         67778877   55555556899999999999999999999999


Q ss_pred             CChhhHHHHHHhhccCCCCC-----CCCCcccCCCCcccCCchhhhcccccccCCC
Q 000554          995 DLLPDLGRHHQAAHMGPNLV-----NSRPHKKGIRFYAYKLKSGRLSRPRFKKGLG 1045 (1428)
Q Consensus       995 s~~s~L~rHHqrvHtge~~~-----~eKpykC~~CgKsFs~ks~L~~H~r~H~gek 1045 (1428)
                      .++-.|..|.+++|.-...+     ..|.|.|..||++-.....+..|++.|+...
T Consensus       184 tqrcsleshl~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~S  239 (267)
T KOG3576|consen  184 TQRCSLESHLKKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFS  239 (267)
T ss_pred             HhhccHHHHHHHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCCC
Confidence            99999999888999764322     2567999999999999999999998887443


No 14 
>KOG3576 consensus Ovo and related transcription factors [Transcription]
Probab=99.22  E-value=1.7e-12  Score=137.11  Aligned_cols=88  Identities=23%  Similarity=0.500  Sum_probs=81.7

Q ss_pred             CCCCcccCCCCCcccccccccccccccccchhhhcccCcccccccccccCChhhhhhhhhhcccccccccccccccccCC
Q 000554          843 EDEKTHKCKICSQVFLHDQELGVHWMDNHKKEAQWLFRGYACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCG  922 (1428)
Q Consensus       843 ~gekpykC~~CgK~F~s~s~L~~H~~r~Ht~e~~~l~KpykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~Cg  922 (1428)
                      .+...|.|.+|+|.|....-|.+| ++.|...     |.|-|..|||.|...-.|++|+++|+|.+      ||+|..|+
T Consensus       113 sd~d~ftCrvCgK~F~lQRmlnrh-~kch~~v-----kr~lct~cgkgfndtfdlkrh~rthtgvr------pykc~~c~  180 (267)
T KOG3576|consen  113 SDQDSFTCRVCGKKFGLQRMLNRH-LKCHSDV-----KRHLCTFCGKGFNDTFDLKRHTRTHTGVR------PYKCSLCE  180 (267)
T ss_pred             CCCCeeeeehhhhhhhHHHHHHHH-hhhccHH-----HHHHHhhccCcccchhhhhhhhccccCcc------ccchhhhh
Confidence            445679999999999999999999 8999988     89999999999999999999999999997      99999999


Q ss_pred             CCCCChhhhhhhhhhccccc
Q 000554          923 SHFGNTEELWLHVQSVHAID  942 (1428)
Q Consensus       923 KsF~sks~L~~H~rsvHsgE  942 (1428)
                      |.|.+...|..|.+.+|...
T Consensus       181 kaftqrcsleshl~kvhgv~  200 (267)
T KOG3576|consen  181 KAFTQRCSLESHLKKVHGVQ  200 (267)
T ss_pred             HHHHhhccHHHHHHHHcCch
Confidence            99999999999999888643


No 15 
>KOG3623 consensus Homeobox transcription factor SIP1 [Transcription]
Probab=99.21  E-value=3.1e-12  Score=153.88  Aligned_cols=123  Identities=23%  Similarity=0.344  Sum_probs=100.5

Q ss_pred             ccccccccccCChhhhhhhhhhcccccccccccccccccCCCCCCChhhhhhhhhhcccccccchhhhhccccccCcCCC
Q 000554          882 YACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDSP  961 (1428)
Q Consensus       882 ykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~  961 (1428)
                      ..|++|.+.+.+...|+.|++..|....    ..|.|..|..+|..+..|.+|+. .|..-                   
T Consensus       211 ltcpycdrgykrltslkeHikyrhekne----~nfsC~lCsytFAyRtQLErhm~-~hkpg-------------------  266 (1007)
T KOG3623|consen  211 LTCPYCDRGYKRLTSLKEHIKYRHEKNE----PNFSCMLCSYTFAYRTQLERHMQ-LHKPG-------------------  266 (1007)
T ss_pred             hcchhHHHHHHHHHHHHHHHHHHHhhCC----CCCcchhhhhhhhhHHHHHHHHH-hhcCC-------------------
Confidence            5799999999999999999998776542    36899999999999999999996 67421                   


Q ss_pred             CccccCChhhhhhhhhhcCCccceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhcccccc
Q 000554          962 KKLELGYSASVENHSENLGSIRKFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRPRFK 1041 (1428)
Q Consensus       962 k~~sf~sks~L~~H~rtHtGeKpykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~H~r~H 1041 (1428)
                       +-.       .+|+-.-.+.|.|+|.+|||+|..+.+|+. |.|+|.|     +|||.|+.|+|.|+....+..||...
T Consensus       267 -~dq-------a~sltqsa~lRKFKCtECgKAFKfKHHLKE-HlRIHSG-----EKPfeCpnCkKRFSHSGSySSHmSSK  332 (1007)
T KOG3623|consen  267 -GDQ-------AISLTQSALLRKFKCTECGKAFKFKHHLKE-HLRIHSG-----EKPFECPNCKKRFSHSGSYSSHMSSK  332 (1007)
T ss_pred             -Ccc-------cccccchhhhccccccccchhhhhHHHHHh-hheeecC-----CCCcCCcccccccccCCccccccccc
Confidence             000       011222234588999999999999999999 8999999     89999999999999999999999665


Q ss_pred             c
Q 000554         1042 K 1042 (1428)
Q Consensus      1042 ~ 1042 (1428)
                      +
T Consensus       333 K  333 (1007)
T KOG3623|consen  333 K  333 (1007)
T ss_pred             c
Confidence            5


No 16 
>KOG1079 consensus Transcriptional repressor EZH1 [Transcription]
Probab=99.15  E-value=1.7e-11  Score=147.89  Aligned_cols=99  Identities=25%  Similarity=0.625  Sum_probs=83.3

Q ss_pred             cccCCCCCcccCCCCcCCCCCCccccccccccccccccCCCCCCCcccCCCCCeeecCCccccccC-cCCCC-C------
Q 000554         1294 DAESLQLGCACANSTCFPETCDHVYLFDNDYEDAKDIDGKSVHGRFPYDQTGRVILEEGYLIYECN-HMCSC-D------ 1365 (1428)
Q Consensus      1294 ~~~~~~~gC~C~~~~C~~~~C~C~~l~~~~y~~~~~~~g~~~~~~~~Y~~~G~l~~~~~~~IyECn-~~C~C-~------ 1365 (1428)
                      +-.+.+.||.| .+.|....|+|..                                   ...||. +.|.+ +      
T Consensus       534 dC~nrF~GC~C-k~QC~tkqCpC~~-----------------------------------A~rECdPd~Cl~cg~~~~~d  577 (739)
T KOG1079|consen  534 DCRNRFPGCRC-KAQCNTKQCPCYL-----------------------------------AVRECDPDVCLMCGNVDHFD  577 (739)
T ss_pred             HHHhcCCCCCc-ccccccCcCchhh-----------------------------------hccccCchHHhccCcccccc
Confidence            33556789999 4588888899842                                   245785 57754 2      


Q ss_pred             ---CCCCCceeeccceeeEEEEeecCCccceeecccCCCCCEEEEeeeEEcCHHHHHHHhhh-------ccCC
Q 000554         1366 ---RTCPNRVLQNGVRVKLEVFKTENKGWAVRAGQAILRGTFVCEYIGEVLDELETNKRRSR-------LLFD 1428 (1428)
Q Consensus      1366 ---~~C~NRvvQ~G~~~~LeVFkT~~kGWGVra~~~Ip~GtFIcEYvGEvIt~~Ea~~R~~~-------YlFD 1428 (1428)
                         -+|+|.-+|+|.+.++.|-.+.-.|||++..+++.+++||.||+||+|+++||++|+..       ||||
T Consensus       578 ~~~~~C~N~~l~~~~qkr~llapSdVaGwGlFlKe~v~KnefisEY~GE~IS~dEADrRGkiYDr~~cSflFn  650 (739)
T KOG1079|consen  578 SSKISCKNTNLQRGEQKRVLLAPSDVAGWGLFLKESVSKNEFISEYTGEIISHDEADRRGKIYDRYMCSFLFN  650 (739)
T ss_pred             cCccccccchhhhhhhcceeechhhccccceeeccccCCCceeeeecceeccchhhhhcccccccccceeeee
Confidence               27999999999999999999999999999999999999999999999999999999973       7775


No 17 
>KOG1141 consensus Predicted histone methyl transferase [Chromatin structure and dynamics]
Probab=98.97  E-value=4.1e-10  Score=136.87  Aligned_cols=185  Identities=23%  Similarity=0.399  Sum_probs=121.2

Q ss_pred             eeccCccCCCC---------CCcCeeEeecCcCCCCCCCeEEEECCCCcccccccCCCCCcccccCCCCCCCcEEccccC
Q 000554         1216 IIDSRHLGRKP---------LLRGTVLCDDISSGLESVPVACVVDDGLLETLCISADSSDSQKTRCSMPWESFTYVTKPL 1286 (1428)
Q Consensus      1216 ~l~~~~~~~~~---------~~r~~vi~~DIS~G~E~~PV~~vnd~d~~~~~~~~g~~s~~~~~~~~~Pp~~F~Yit~~i 1286 (1428)
                      +++.++|.|..         ....++-.+|.+.|.+.+|||.||.+|..+.+.-+    ++.        -.|.|..+..
T Consensus       850 ~~~id~~~f~~~~dt~~~~tvD~~g~d~~d~~~g~sg~~~p~~~~~d~~~~~~c~----d~~--------~~~~~~~~~~  917 (1262)
T KOG1141|consen  850 LLTIDCFSFDARIDTATYITVDDKGLDVADFSLGTSGIPIPLVNSVDNDEPPSCE----DSK--------RRFQYNDQVD  917 (1262)
T ss_pred             hhcccccchhccccccceeeccccccchhhhhccccCCCCccccccccCCCcccc----ccc--------eeecccccch
Confidence            44456665543         23455667899999999999999999875433211    111        1233433221


Q ss_pred             CCCCCCCcccCCCCCcccCCCCcCCCCCCccccccccccccc---cccCCCCCCCcccCCCCCeeecCCccccccCcCCC
Q 000554         1287 LDQSLDLDAESLQLGCACANSTCFPETCDHVYLFDNDYEDAK---DIDGKSVHGRFPYDQTGRVILEEGYLIYECNHMCS 1363 (1428)
Q Consensus      1287 ~~~~~~~~~~~~~~gC~C~~~~C~~~~C~C~~l~~~~y~~~~---~~~g~~~~~~~~Y~~~G~l~~~~~~~IyECn~~C~ 1363 (1428)
                      +    ......+..||.|.+++-+-+.|.|.++.........   ...|...--.-+|+.+..+    ....|||++.|.
T Consensus       918 ~----s~~~~~~~~~~s~d~hp~d~~~~~~~~~~~~~~~~cpp~~s~d~~~~~~eS~~~~ns~~----~~~f~e~~~hss  989 (1262)
T KOG1141|consen  918 I----SSVSRDFCSGCSCDGHPSDASKCECQQLSIEAMKRCPPNLSFDGHDELYESSEKQNSFL----KLFFFECNDHSS  989 (1262)
T ss_pred             h----hhhccccccccccCCCCcccCcccCCCCChhhhcCCCCccccCchhhhhhhhhhcchhh----hccceeccccch
Confidence            1    1123567789999876555677888765332221110   0111111111122222211    235789999999


Q ss_pred             CCCCCCCceeeccceee--------EEEEeecCCccceeecccCCCCCEEEEeeeEEcCHHHHHH
Q 000554         1364 CDRTCPNRVLQNGVRVK--------LEVFKTENKGWAVRAGQAILRGTFVCEYIGEVLDELETNK 1420 (1428)
Q Consensus      1364 C~~~C~NRvvQ~G~~~~--------LeVFkT~~kGWGVra~~~Ip~GtFIcEYvGEvIt~~Ea~~ 1420 (1428)
                      |...|.||++|++++++        |+||+|..-|||+|...+||.-+|||+|+|...++.-|++
T Consensus       990 ~~~~e~~~~v~~~~~~~me~~s~~~l~i~~~~~~~~~~~edtD~~~~~~~~~~~~~ppt~~l~~~ 1054 (1262)
T KOG1141|consen  990 CHRKEYNRVVQNNIKYPMEVSSFNDLQIFKTAQSGWGVREDTDIPQSTFICTYVGAPPTDDLADE 1054 (1262)
T ss_pred             hcccccchhhhcCCccceeeeecccccccccccccccccccccCCCCcccccccCCCCchhhHHH
Confidence            99999999999998876        5578888999999999999999999999999999988775


No 18 
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.81  E-value=6.1e-09  Score=128.04  Aligned_cols=144  Identities=19%  Similarity=0.307  Sum_probs=85.2

Q ss_pred             ccCCCCCcccccccccccccccccchhhhcccCcccccc--cccccCChhhhhhhhhhcccccccccccccccccCCCCC
Q 000554          848 HKCKICSQVFLHDQELGVHWMDNHKKEAQWLFRGYACAI--CLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCGSHF  925 (1428)
Q Consensus       848 ykC~~CgK~F~s~s~L~~H~~r~Ht~e~~~l~KpykC~~--CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF  925 (1428)
                      -.|+.|.+..... .|..| ...+ .-     ..-.|+.  |+..|. +..+..|               +.|+.|++.|
T Consensus       408 V~C~NC~~~i~l~-~l~lH-e~~C-~r-----~~V~Cp~~~Cg~v~~-r~el~~H---------------~~C~~Cgk~f  463 (567)
T PLN03086        408 VECRNCKHYIPSR-SIALH-EAYC-SR-----HNVVCPHDGCGIVLR-VEEAKNH---------------VHCEKCGQAF  463 (567)
T ss_pred             EECCCCCCccchh-HHHHH-HhhC-CC-----cceeCCcccccceee-ccccccC---------------ccCCCCCCcc
Confidence            3566666655433 24455 2221 11     2334663  777662 3333333               3477777777


Q ss_pred             CChhhhhhhhhhcccccccchhhhhccccccCcCCCCccccCChhhhhhhhhhcCCccceecCccCcccC----------
Q 000554          926 GNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDSPKKLELGYSASVENHSENLGSIRKFICRFCGLKFD----------  995 (1428)
Q Consensus       926 ~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~k~~sf~sks~L~~H~rtHtGeKpykC~~CGKsFs----------  995 (1428)
                      . ...|..|++. |.           +++.|. |   +..+ .+..|..|+++|.+++++.|++|++.|.          
T Consensus       464 ~-~s~LekH~~~-~H-----------kpv~Cp-C---g~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d  525 (567)
T PLN03086        464 Q-QGEMEKHMKV-FH-----------EPLQCP-C---GVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRD  525 (567)
T ss_pred             c-hHHHHHHHHh-cC-----------CCccCC-C---CCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhh
Confidence            4 4667777663 32           367776 6   3322 3457777777788888888888888774          


Q ss_pred             ChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhccc-ccc
Q 000554          996 LLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRP-RFK 1041 (1428)
Q Consensus       996 ~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~H~-r~H 1041 (1428)
                      ..+.|.. |..++ |     .+++.|..||+.|..+ .|..|+ ..|
T Consensus       526 ~~s~Lt~-HE~~C-G-----~rt~~C~~Cgk~Vrlr-dm~~H~~~~h  564 (567)
T PLN03086        526 RLRGMSE-HESIC-G-----SRTAPCDSCGRSVMLK-EMDIHQIAVH  564 (567)
T ss_pred             hhhhHHH-HHHhc-C-----CcceEccccCCeeeeh-hHHHHHHHhh
Confidence            2356777 56664 5     6788888888777654 355565 344


No 19 
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.76  E-value=8.3e-09  Score=126.87  Aligned_cols=140  Identities=16%  Similarity=0.159  Sum_probs=105.7

Q ss_pred             ccccccccccCChhhhhhhhhhccccccccccccccccc--CCCCCCChhhhhhhhhhcccccccchhhhhccccccCcC
Q 000554          882 YACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIP--CGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGED  959 (1428)
Q Consensus       882 ykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~--CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C  959 (1428)
                      -.|..|..... ...|..|.......       .-.|+.  ||..|. +..+..|                   +.|..|
T Consensus       408 V~C~NC~~~i~-l~~l~lHe~~C~r~-------~V~Cp~~~Cg~v~~-r~el~~H-------------------~~C~~C  459 (567)
T PLN03086        408 VECRNCKHYIP-SRSIALHEAYCSRH-------NVVCPHDGCGIVLR-VEEAKNH-------------------VHCEKC  459 (567)
T ss_pred             EECCCCCCccc-hhHHHHHHhhCCCc-------ceeCCcccccceee-ccccccC-------------------ccCCCC
Confidence            45999987654 45566887543332       356885  999883 3333333                   468888


Q ss_pred             CCCccccCChhhhhhhhhhcCCccceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCC---------
Q 000554          960 SPKKLELGYSASVENHSENLGSIRKFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKL--------- 1030 (1428)
Q Consensus       960 ~~k~~sf~sks~L~~H~rtHtGeKpykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~--------- 1030 (1428)
                         +..|. ...|..|+++|+  ++|.|+ ||+.| .+..|.. |+++|..     .+++.|++|++.|..         
T Consensus       460 ---gk~f~-~s~LekH~~~~H--kpv~Cp-Cg~~~-~R~~L~~-H~~thCp-----~Kpi~C~fC~~~v~~g~~~~d~~d  525 (567)
T PLN03086        460 ---GQAFQ-QGEMEKHMKVFH--EPLQCP-CGVVL-EKEQMVQ-HQASTCP-----LRLITCRFCGDMVQAGGSAMDVRD  525 (567)
T ss_pred             ---CCccc-hHHHHHHHHhcC--CCccCC-CCCCc-chhHHHh-hhhccCC-----CCceeCCCCCCccccCccccchhh
Confidence               55554 468999999986  899999 99765 6689999 7899999     899999999999952         


Q ss_pred             -chhhhcccccccCCCccccCCCCCcCcChHHHHh
Q 000554         1031 -KSGRLSRPRFKKGLGAVSYRIRNRGAAGMKKRIQ 1064 (1428)
Q Consensus      1031 -ks~L~~H~r~H~gekpy~C~~C~ksf~~~~~l~~ 1064 (1428)
                       .+.|..|...+ |.+++.|..|++.+..+..-.|
T Consensus       526 ~~s~Lt~HE~~C-G~rt~~C~~Cgk~Vrlrdm~~H  559 (567)
T PLN03086        526 RLRGMSEHESIC-GSRTAPCDSCGRSVMLKEMDIH  559 (567)
T ss_pred             hhhhHHHHHHhc-CCcceEccccCCeeeehhHHHH
Confidence             35899999886 9999999999999875544433


No 20 
>PHA00733 hypothetical protein
Probab=98.55  E-value=3.8e-08  Score=100.96  Aligned_cols=86  Identities=10%  Similarity=0.031  Sum_probs=65.4

Q ss_pred             cccccccCCCCCCChhhhhhhhhhcccccccchhhhhccccccCcCCCCccccCChhhhhhhhhhcCCccceecCccCcc
Q 000554          914 MLQQCIPCGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDSPKKLELGYSASVENHSENLGSIRKFICRFCGLK  993 (1428)
Q Consensus       914 kpfkC~~CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~k~~sf~sks~L~~H~rtHtGeKpykC~~CGKs  993 (1428)
                      +++.|.+|.+.|.....|..|.                                   .|.+|+..| +.+||.|+.||+.
T Consensus        39 ~~~~~~~~~~~~~~~~~l~~~~-----------------------------------~l~~~~~~~-~~kPy~C~~Cgk~   82 (128)
T PHA00733         39 KRLIRAVVKTLIYNPQLLDESS-----------------------------------YLYKLLTSK-AVSPYVCPLCLMP   82 (128)
T ss_pred             hhHHHHHHhhhccChhhhcchH-----------------------------------HHHhhcccC-CCCCccCCCCCCc
Confidence            3677777777777665555542                                   355565444 4789999999999


Q ss_pred             cCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhcccccccC
Q 000554          994 FDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRPRFKKG 1043 (1428)
Q Consensus       994 Fs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~H~r~H~g 1043 (1428)
                      |.+...|.+ |++.|+.       +|.|+.|++.|.....|.+|+..+++
T Consensus        83 Fss~s~L~~-H~r~h~~-------~~~C~~CgK~F~~~~sL~~H~~~~h~  124 (128)
T PHA00733         83 FSSSVSLKQ-HIRYTEH-------SKVCPVCGKEFRNTDSTLDHVCKKHN  124 (128)
T ss_pred             CCCHHHHHH-HHhcCCc-------CccCCCCCCccCCHHHHHHHHHHhcC
Confidence            999999999 6777643       68999999999999999999866553


No 21 
>PF01352 KRAB:  KRAB box;  InterPro: IPR001909 The Krueppel-associated box (KRAB) is a domain of around 75 amino acids that is found in the N-terminal part of about one third of eukaryotic Krueppel-type C2H2 zinc finger proteins (ZFPs). It is enriched in charged amino acids and can be divided into subregions A and B, which are predicted to fold into two amphipathic alpha-helices. The KRAB A and B boxes can be separated by variable spacer segments and many KRAB proteins contain only the A box. The functions currently known for members of the KRAB-containing protein family include transcriptional repression of RNA polymerase I, II, and III promoters, binding and splicing of RNA, and control of nucleolus function. The KRAB domain functions as a transcriptional repressor when tethered to the template DNA by a DNA-binding domain. A sequence of 45 amino acids in the KRAB A subdomain has been shown to be necessary and sufficient for transcriptional repression. The B box does not repress by itself but does potentiate the repression exerted by the KRAB A subdomain. Gene silencing requires the binding of the KRAB domain to the RING-B box-coiled coil (RBCC) domain of the KAP-1/TIF1-beta corepressor. As KAP-1 binds to the heterochromatin proteins HP1, it has been proposed that the KRAB-ZFP-bound target gene could be silenced following recruitment to heterochromatin. KRAB-ZFPs probably constitute the single largest class of transcription factors within the human genome. Although the function of KRAB-ZFPs is largely unknown, they appear to play important roles during cell differentiation and development. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B.; GO: 0003676 nucleic acid binding, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 1V65_A.
Probab=98.39  E-value=6.9e-08  Score=79.71  Aligned_cols=35  Identities=23%  Similarity=0.256  Sum_probs=20.5

Q ss_pred             eecceeeeecccccCChhhhcccchhhhhhhhc-----hhhHhhc
Q 000554          732 IISKEVFLELLKDCCSLEQKLHLHLACELFYKL-----LKSILSL  771 (1428)
Q Consensus       732 VTFkDVAV~F~r~c~SqEEW~~LdPaCrkLYrd-----y~nLvSH  771 (1428)
                      |||+||||+|     |+|||.+|+|+|+.+|++     |++++++
T Consensus         1 Vtf~Dvav~f-----s~eEW~~L~~~Qk~ly~dvm~Eny~~l~sl   40 (41)
T PF01352_consen    1 VTFEDVAVYF-----SQEEWELLDPAQKNLYRDVMLENYRNLVSL   40 (41)
T ss_dssp             ------TT--------HHHHHTS-HHHHHHHHHHHHHTTTS---S
T ss_pred             CeEEEEEEEc-----ChhhcccccceecccchhHHHHhhcccEec
Confidence            7999999999     999999999999999998     6777665


No 22 
>PHA00733 hypothetical protein
Probab=98.29  E-value=3e-07  Score=94.35  Aligned_cols=93  Identities=22%  Similarity=0.352  Sum_probs=73.0

Q ss_pred             hhhhhcccCCCCcccCCCCCcccccccccccc--c--ccccchhhhcccCcccccccccccCChhhhhhhhhhccccccc
Q 000554          835 PLAIAGRSEDEKTHKCKICSQVFLHDQELGVH--W--MDNHKKEAQWLFRGYACAICLDSFTNKKVLESHVQERHHVQFV  910 (1428)
Q Consensus       835 L~~H~r~H~gekpykC~~CgK~F~s~s~L~~H--~--~r~Ht~e~~~l~KpykC~~CgKsF~~ks~L~~H~r~Hhgek~~  910 (1428)
                      |..+......++++.|.+|.+.|.....|..|  +  ...+.+.     +||.|..|++.|.....|..|++.|  +.  
T Consensus        28 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~-----kPy~C~~Cgk~Fss~s~L~~H~r~h--~~--   98 (128)
T PHA00733         28 LKRYHSLTPEQKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAV-----SPYVCPLCLMPFSSSVSLKQHIRYT--EH--   98 (128)
T ss_pred             hhhhhcCChhhhhHHHHHHhhhccChhhhcchHHHHhhcccCCC-----CCccCCCCCCcCCCHHHHHHHHhcC--Cc--
Confidence            33333444557889999999999988777665  1  1122334     8999999999999999999999976  22  


Q ss_pred             ccccccccccCCCCCCChhhhhhhhhhccc
Q 000554          911 EQCMLQQCIPCGSHFGNTEELWLHVQSVHA  940 (1428)
Q Consensus       911 e~~kpfkC~~CgKsF~sks~L~~H~rsvHs  940 (1428)
                          +|.|..|++.|.....|..|+...|.
T Consensus        99 ----~~~C~~CgK~F~~~~sL~~H~~~~h~  124 (128)
T PHA00733         99 ----SKVCPVCGKEFRNTDSTLDHVCKKHN  124 (128)
T ss_pred             ----CccCCCCCCccCCHHHHHHHHHHhcC
Confidence                78999999999999999999986553


No 23 
>KOG3993 consensus Transcription factor (contains Zn finger) [Transcription]
Probab=98.23  E-value=2.5e-07  Score=107.53  Aligned_cols=39  Identities=21%  Similarity=0.206  Sum_probs=29.3

Q ss_pred             hhhhhhcccCCCCcccCCCCCcccccccccccccccccch
Q 000554          834 LPLAIAGRSEDEKTHKCKICSQVFLHDQELGVHWMDNHKK  873 (1428)
Q Consensus       834 ~L~~H~r~H~gekpykC~~CgK~F~s~s~L~~H~~r~Ht~  873 (1428)
                      .|.+|.-...----|+|++|+|.|+...+|..| ++.|..
T Consensus       282 ~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASH-RRWHKP  320 (500)
T KOG3993|consen  282 ALAQHRCPRIVHVEYRCPECDKVFSCPANLASH-RRWHKP  320 (500)
T ss_pred             HHhhccCCeeEEeeecCCcccccccCchhhhhh-hcccCC
Confidence            466665333333349999999999999999999 899964


No 24 
>PHA02768 hypothetical protein; Provisional
Probab=98.13  E-value=1.3e-06  Score=76.25  Aligned_cols=43  Identities=14%  Similarity=0.084  Sum_probs=33.5

Q ss_pred             eecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhh
Q 000554          985 FICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRL 1035 (1428)
Q Consensus       985 ykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~ 1035 (1428)
                      |+|+.||+.|.+.++|.. |+++|+.       +|+|..|++.|.+++.|.
T Consensus         6 y~C~~CGK~Fs~~~~L~~-H~r~H~k-------~~kc~~C~k~f~~~s~l~   48 (55)
T PHA02768          6 YECPICGEIYIKRKSMIT-HLRKHNT-------NLKLSNCKRISLRTGEYI   48 (55)
T ss_pred             cCcchhCCeeccHHHHHH-HHHhcCC-------cccCCcccceecccceeE
Confidence            778888888888888888 6777773       678888888888777664


No 25 
>KOG3993 consensus Transcription factor (contains Zn finger) [Transcription]
Probab=98.11  E-value=4e-07  Score=105.84  Aligned_cols=181  Identities=12%  Similarity=0.077  Sum_probs=107.0

Q ss_pred             cccCCCCCcccccccccccccccccchhhhcccCcccccccccccCChhhhhhhhhhccccccc----------------
Q 000554          847 THKCKICSQVFLHDQELGVHWMDNHKKEAQWLFRGYACAICLDSFTNKKVLESHVQERHHVQFV----------------  910 (1428)
Q Consensus       847 pykC~~CgK~F~s~s~L~~H~~r~Ht~e~~~l~KpykC~~CgKsF~~ks~L~~H~r~Hhgek~~----------------  910 (1428)
                      -|.|..|...|...-.|.+| +-..--.     --|+|++|+|.|.-..+|..|.|.|......                
T Consensus       267 dyiCqLCK~kYeD~F~LAQH-rC~RIV~-----vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~ra  340 (500)
T KOG3993|consen  267 DYICQLCKEKYEDAFALAQH-RCPRIVH-----VEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRA  340 (500)
T ss_pred             HHHHHHHHHhhhhHHHHhhc-cCCeeEE-----eeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhh
Confidence            39999999999999999999 3211111     3499999999999999999999998643211                


Q ss_pred             -----------ccccccccccCCCCCCChhhhhhhhhhcccccccc-------hhhhhccccccCcCCCCccccCChhhh
Q 000554          911 -----------EQCMLQQCIPCGSHFGNTEELWLHVQSVHAIDFKM-------SEVAQQHNQSVGEDSPKKLELGYSASV  972 (1428)
Q Consensus       911 -----------e~~kpfkC~~CgKsF~sks~L~~H~rsvHsgEf~~-------~s~~~~kp~~C~~C~~k~~sf~sks~L  972 (1428)
                                 ..+..|.|.+|+|.|.....|+.|+.+.|......       .+....-.+-|..|   .-.+.....-
T Consensus       341 e~~ea~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~---a~h~~a~~~~  417 (500)
T KOG3993|consen  341 EVQEAERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAV---ATHSSASDSH  417 (500)
T ss_pred             hhhhccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhccccccccccc---cccccccccc
Confidence                       11246999999999999999999987444333000       00000011223333   1111110000


Q ss_pred             hhhhhhcCC-ccceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhccc-cccc
Q 000554          973 ENHSENLGS-IRKFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRP-RFKK 1042 (1428)
Q Consensus       973 ~~H~rtHtG-eKpykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~H~-r~H~ 1042 (1428)
                      -.|...+.+ .....|+.||-.+..+..-.. +.+.-..     +.-|.|.+|.-+|....+|.+|+ +-|-
T Consensus       418 g~~vl~~a~sael~~pp~~~~ppsss~~sgg-~~rlg~~-----~q~f~~ky~~atfyss~~ltrhin~~Hp  483 (500)
T KOG3993|consen  418 GDEVLYVAGSAELELPPYDGSPPSSSGSSGG-YGRLGIA-----EQGFTCKYCPATFYSSPGLTRHINKCHP  483 (500)
T ss_pred             ccceeeeeccccccCCCCCCCCcccCCCCCc-cccccch-----hhccccccchHhhhcCcchHhHhhhcCh
Confidence            011111111 122346777766665554444 2222111     45677888888888888888877 3343


No 26 
>KOG1083 consensus Putative transcription factor ASH1/LIN-59 [Transcription]
Probab=98.05  E-value=4.1e-07  Score=114.53  Aligned_cols=56  Identities=43%  Similarity=0.732  Sum_probs=51.0

Q ss_pred             CCCceeec-cceeeEEEEeecCCccceeecccCCCCCEEEEeeeEEcCHHHHHHHhh
Q 000554         1368 CPNRVLQN-GVRVKLEVFKTENKGWAVRAGQAILRGTFVCEYIGEVLDELETNKRRS 1423 (1428)
Q Consensus      1368 C~NRvvQ~-G~~~~LeVFkT~~kGWGVra~~~Ip~GtFIcEYvGEvIt~~Ea~~R~~ 1423 (1428)
                      |.|+.+|+ +.-.+|+||++..+||||++..+|++|+|||||+||||+.++++.|+.
T Consensus      1166 c~nqrm~r~e~cp~L~v~~gp~~G~~v~tk~PikagtfI~EYvGeVit~ke~e~~mm 1222 (1306)
T KOG1083|consen 1166 CSNQRMQRHEECPPLEVFRGPKKGWGVRTKEPIKAGTFIMEYVGEVITEKEFEPRMM 1222 (1306)
T ss_pred             hhhHHhhhhccCCCcceeccCCCCccccccccccccchHHHHHHHHHHHHhhccccc
Confidence            88888876 456889999999999999999999999999999999999999998843


No 27 
>PHA02768 hypothetical protein; Provisional
Probab=97.95  E-value=1.9e-06  Score=75.21  Aligned_cols=45  Identities=11%  Similarity=-0.033  Sum_probs=41.1

Q ss_pred             CcccCCCCcccCCchhhhcccccccCCCccccCCCCCcCcChHHHHh
Q 000554         1018 PHKKGIRFYAYKLKSGRLSRPRFKKGLGAVSYRIRNRGAAGMKKRIQ 1064 (1428)
Q Consensus      1018 pykC~~CgKsFs~ks~L~~H~r~H~gekpy~C~~C~ksf~~~~~l~~ 1064 (1428)
                      -|+|+.||+.|++.++|..|+++|+  ++|+|..|++.|.....++.
T Consensus         5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~~   49 (55)
T PHA02768          5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYIE   49 (55)
T ss_pred             ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeEE
Confidence            5899999999999999999999999  79999999999997776653


No 28 
>PF13465 zf-H2C2_2:  Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=97.86  E-value=6.6e-06  Score=61.51  Aligned_cols=26  Identities=19%  Similarity=0.577  Sum_probs=19.2

Q ss_pred             hhhhhhhhcCCccceecCccCcccCC
Q 000554          971 SVENHSENLGSIRKFICRFCGLKFDL  996 (1428)
Q Consensus       971 ~L~~H~rtHtGeKpykC~~CGKsFs~  996 (1428)
                      +|.+|+++|+|+|||+|+.|+++|.+
T Consensus         1 ~l~~H~~~H~~~k~~~C~~C~k~F~~   26 (26)
T PF13465_consen    1 NLRRHMRTHTGEKPYKCPYCGKSFSN   26 (26)
T ss_dssp             HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred             CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence            36777777777777777777777753


No 29 
>PHA00732 hypothetical protein
Probab=97.42  E-value=9.7e-05  Score=69.87  Aligned_cols=48  Identities=21%  Similarity=0.214  Sum_probs=37.8

Q ss_pred             ceecCccCcccCChhhHHHHHHh-hccCCCCCCCCCcccCCCCcccCCchhhhcccccccC
Q 000554          984 KFICRFCGLKFDLLPDLGRHHQA-AHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRPRFKKG 1043 (1428)
Q Consensus       984 pykC~~CGKsFs~~s~L~rHHqr-vHtge~~~~eKpykC~~CgKsFs~ks~L~~H~r~H~g 1043 (1428)
                      ||.|+.||+.|.+.++|.+ |++ .|++        +.|+.|+++|.   .|..|++.+..
T Consensus         1 py~C~~Cgk~F~s~s~Lk~-H~r~~H~~--------~~C~~CgKsF~---~l~~H~~~~~~   49 (79)
T PHA00732          1 MFKCPICGFTTVTLFALKQ-HARRNHTL--------TKCPVCNKSYR---RLNQHFYSQYD   49 (79)
T ss_pred             CccCCCCCCccCCHHHHHH-HhhcccCC--------CccCCCCCEeC---ChhhhhcccCC
Confidence            5889999999999999999 555 4654        47999999997   58888866654


No 30 
>PF13465 zf-H2C2_2:  Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=97.36  E-value=0.0001  Score=55.24  Aligned_cols=26  Identities=27%  Similarity=0.349  Sum_probs=20.5

Q ss_pred             hHHHHHHhhccCCCCCCCCCcccCCCCcccCC
Q 000554          999 DLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKL 1030 (1428)
Q Consensus       999 ~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ 1030 (1428)
                      +|.+ |+++|+|     ++||+|++|+++|.+
T Consensus         1 ~l~~-H~~~H~~-----~k~~~C~~C~k~F~~   26 (26)
T PF13465_consen    1 NLRR-HMRTHTG-----EKPYKCPYCGKSFSN   26 (26)
T ss_dssp             HHHH-HHHHHSS-----SSSEEESSSSEEESS
T ss_pred             CHHH-HhhhcCC-----CCCCCCCCCcCeeCc
Confidence            4777 6778888     788888888888863


No 31 
>PHA00616 hypothetical protein
Probab=97.24  E-value=5.9e-05  Score=63.14  Aligned_cols=26  Identities=19%  Similarity=0.280  Sum_probs=13.9

Q ss_pred             ceecCccCcccCChhhHHHHHHhhccC
Q 000554          984 KFICRFCGLKFDLLPDLGRHHQAAHMG 1010 (1428)
Q Consensus       984 pykC~~CGKsFs~~s~L~rHHqrvHtg 1010 (1428)
                      ||+|+.||+.|.++++|.+ |.+.|+|
T Consensus         1 pYqC~~CG~~F~~~s~l~~-H~r~~hg   26 (44)
T PHA00616          1 MYQCLRCGGIFRKKKEVIE-HLLSVHK   26 (44)
T ss_pred             CCccchhhHHHhhHHHHHH-HHHHhcC
Confidence            3555555555555555555 4455555


No 32 
>smart00317 SET SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues
Probab=97.19  E-value=0.00044  Score=67.87  Aligned_cols=43  Identities=49%  Similarity=0.921  Sum_probs=39.5

Q ss_pred             eEEEEeecCCccceeecccCCCCCEEEEeeeEEcCHHHHHHHh
Q 000554         1380 KLEVFKTENKGWAVRAGQAILRGTFVCEYIGEVLDELETNKRR 1422 (1428)
Q Consensus      1380 ~LeVFkT~~kGWGVra~~~Ip~GtFIcEYvGEvIt~~Ea~~R~ 1422 (1428)
                      +++++++..+|+||+|..+|++|++|++|.|+++...++..+.
T Consensus         1 ~~~~~~~~~~G~gl~a~~~i~~g~~i~~~~g~~~~~~~~~~~~   43 (116)
T smart00317        1 KLEVFKSPGKGWGVRATEDIPKGEFIGEYVGEIITSEEAEERS   43 (116)
T ss_pred             CcEEEecCCCcEEEEECCccCCCCEEEEEEeEEECHHHHHHHH
Confidence            4688999999999999999999999999999999998888764


No 33 
>PHA00616 hypothetical protein
Probab=96.96  E-value=0.00027  Score=59.22  Aligned_cols=34  Identities=3%  Similarity=-0.224  Sum_probs=31.3

Q ss_pred             CcccCCCCcccCCchhhhcccccccCCCccccCC
Q 000554         1018 PHKKGIRFYAYKLKSGRLSRPRFKKGLGAVSYRI 1051 (1428)
Q Consensus      1018 pykC~~CgKsFs~ks~L~~H~r~H~gekpy~C~~ 1051 (1428)
                      ||+|+.||+.|..++.|.+|++.|+|++++.|+.
T Consensus         1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~   34 (44)
T PHA00616          1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEY   34 (44)
T ss_pred             CCccchhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence            6899999999999999999999999999998864


No 34 
>PHA00732 hypothetical protein
Probab=96.91  E-value=0.00047  Score=65.30  Aligned_cols=45  Identities=24%  Similarity=0.487  Sum_probs=35.6

Q ss_pred             cccccccccccCChhhhhhhhhhcccccccccccccccccCCCCCCChhhhhhhhh
Q 000554          881 GYACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCGSHFGNTEELWLHVQ  936 (1428)
Q Consensus       881 pykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF~sks~L~~H~r  936 (1428)
                      ||.|..|++.|.+...|..|++.+|.        ++.|+.|++.|.   .|..|++
T Consensus         1 py~C~~Cgk~F~s~s~Lk~H~r~~H~--------~~~C~~CgKsF~---~l~~H~~   45 (79)
T PHA00732          1 MFKCPICGFTTVTLFALKQHARRNHT--------LTKCPVCNKSYR---RLNQHFY   45 (79)
T ss_pred             CccCCCCCCccCCHHHHHHHhhcccC--------CCccCCCCCEeC---Chhhhhc
Confidence            57899999999999999999885432        346999999987   5788875


No 35 
>PF05605 zf-Di19:  Drought induced 19 protein (Di19), zinc-binding;  InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought, whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=96.79  E-value=0.00087  Score=58.86  Aligned_cols=52  Identities=17%  Similarity=0.198  Sum_probs=41.0

Q ss_pred             ceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhccccccc
Q 000554          984 KFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRPRFKK 1042 (1428)
Q Consensus       984 pykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~H~r~H~ 1042 (1428)
                      .|.|++|++. .....|..|....|..+    .+.+.|++|...+.  .+|.+|+..++
T Consensus         2 ~f~CP~C~~~-~~~~~L~~H~~~~H~~~----~~~v~CPiC~~~~~--~~l~~Hl~~~H   53 (54)
T PF05605_consen    2 SFTCPYCGKG-FSESSLVEHCEDEHRSE----SKNVVCPICSSRVT--DNLIRHLNSQH   53 (54)
T ss_pred             CcCCCCCCCc-cCHHHHHHHHHhHCcCC----CCCccCCCchhhhh--hHHHHHHHHhc
Confidence            4899999995 45678999888888884    45799999998655  48899986654


No 36 
>KOG1085 consensus Predicted methyltransferase (contains a SET domain) [General function prediction only]
Probab=96.74  E-value=0.0011  Score=74.59  Aligned_cols=53  Identities=30%  Similarity=0.429  Sum_probs=45.4

Q ss_pred             ccceeeEEEEeecCCccceeecccCCCCCEEEEeeeEEcCHHHHHHHhhhccC
Q 000554         1375 NGVRVKLEVFKTENKGWAVRAGQAILRGTFVCEYIGEVLDELETNKRRSRLLF 1427 (1428)
Q Consensus      1375 ~G~~~~LeVFkT~~kGWGVra~~~Ip~GtFIcEYvGEvIt~~Ea~~R~~~YlF 1427 (1428)
                      .|....|.+..-.+||-||++..++.+|+||.||.|.||.-.||..|+..|--
T Consensus       252 ~g~~egl~~~~~dgKGRGv~a~~~F~rgdFVVEY~Gdliei~eAk~rE~~Ya~  304 (392)
T KOG1085|consen  252 KGTNEGLLEVYKDGKGRGVRAKVNFERGDFVVEYRGDLIEISEAKVREEQYAN  304 (392)
T ss_pred             hccccceeEEeeccccceeEeecccccCceEEEEecceeeechHHHHHHHhcc
Confidence            45556667766677999999999999999999999999999999999986543


No 37 
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=96.57  E-value=0.0013  Score=74.95  Aligned_cols=57  Identities=18%  Similarity=0.244  Sum_probs=44.1

Q ss_pred             ccceecCc--cCcccCChhhHHHHHHhhccCCC-------------CCCCCCcccCCCCcccCCchhhhccc
Q 000554          982 IRKFICRF--CGLKFDLLPDLGRHHQAAHMGPN-------------LVNSRPHKKGIRFYAYKLKSGRLSRP 1038 (1428)
Q Consensus       982 eKpykC~~--CGKsFs~~s~L~rHHqrvHtge~-------------~~~eKpykC~~CgKsFs~ks~L~~H~ 1038 (1428)
                      +|||+|++  |.|++.....|+.|...-|...+             ..+.|||.|++|+|.|.....|+.|.
T Consensus       347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr  418 (423)
T COG5189         347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHR  418 (423)
T ss_pred             CceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceecc
Confidence            58899965  88999999999997555553321             11368999999999999999999986


No 38 
>KOG1080 consensus Histone H3 (Lys4) methyltransferase complex, subunit SET1 and related methyltransferases [Chromatin structure and dynamics; Transcription]
Probab=96.35  E-value=0.002  Score=84.96  Aligned_cols=45  Identities=24%  Similarity=0.394  Sum_probs=39.7

Q ss_pred             eeEEEEeecCCccceeecccCCCCCEEEEeeeEEcCHHHHHHHhh
Q 000554         1379 VKLEVFKTENKGWAVRAGQAILRGTFVCEYIGEVLDELETNKRRS 1423 (1428)
Q Consensus      1379 ~~LeVFkT~~kGWGVra~~~Ip~GtFIcEYvGEvIt~~Ea~~R~~ 1423 (1428)
                      ..|..-++.-.||||+|.++|.+|+||.||+||+|...=|+.|+.
T Consensus       866 k~~~F~~s~iH~wglfa~~~i~~~dmViEY~Ge~vR~~iad~RE~  910 (1005)
T KOG1080|consen  866 KYVKFGRSGIHGWGLFAMENIAAGDMVIEYRGELVRSSIADLREA  910 (1005)
T ss_pred             hhhccccccccccceeeccCccccceEEEeeceehhhhHHHHHHH
Confidence            336666777899999999999999999999999999888888876


No 39 
>PF05605 zf-Di19:  Drought induced 19 protein (Di19), zinc-binding;  InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought, whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=96.12  E-value=0.0021  Score=56.43  Aligned_cols=51  Identities=24%  Similarity=0.440  Sum_probs=28.2

Q ss_pred             ccccccccccCChhhhhhhhhhcccccccccccccccccCCCCCCChhhhhhhhhhcc
Q 000554          882 YACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCGSHFGNTEELWLHVQSVH  939 (1428)
Q Consensus       882 ykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF~sks~L~~H~rsvH  939 (1428)
                      |.|+.|++ ..+...|..|....|....    +.+.|++|...+.  .+|..|+...|
T Consensus         3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~----~~v~CPiC~~~~~--~~l~~Hl~~~H   53 (54)
T PF05605_consen    3 FTCPYCGK-GFSESSLVEHCEDEHRSES----KNVVCPICSSRVT--DNLIRHLNSQH   53 (54)
T ss_pred             cCCCCCCC-ccCHHHHHHHHHhHCcCCC----CCccCCCchhhhh--hHHHHHHHHhc
Confidence            56666666 3344566666655554321    3566666666543  26666665444


No 40 
>PF00096 zf-C2H2:  Zinc finger, C2H2 type;  InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers.; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=95.57  E-value=0.0068  Score=43.62  Aligned_cols=23  Identities=35%  Similarity=0.782  Sum_probs=14.6

Q ss_pred             eecCccCcccCChhhHHHHHHhhc
Q 000554          985 FICRFCGLKFDLLPDLGRHHQAAH 1008 (1428)
Q Consensus       985 ykC~~CGKsFs~~s~L~rHHqrvH 1008 (1428)
                      |+|+.|++.|.+...|.+ |++.|
T Consensus         1 y~C~~C~~~f~~~~~l~~-H~~~H   23 (23)
T PF00096_consen    1 YKCPICGKSFSSKSNLKR-HMRRH   23 (23)
T ss_dssp             EEETTTTEEESSHHHHHH-HHHHH
T ss_pred             CCCCCCCCccCCHHHHHH-HHhHC
Confidence            567777777777777777 44434


No 41 
>PF00096 zf-C2H2:  Zinc finger, C2H2 type;  InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers.; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=95.55  E-value=0.0029  Score=45.57  Aligned_cols=23  Identities=22%  Similarity=-0.042  Sum_probs=21.2

Q ss_pred             cccCCCCcccCCchhhhcccccc
Q 000554         1019 HKKGIRFYAYKLKSGRLSRPRFK 1041 (1428)
Q Consensus      1019 ykC~~CgKsFs~ks~L~~H~r~H 1041 (1428)
                      |+|+.|++.|..+..|.+|++.|
T Consensus         1 y~C~~C~~~f~~~~~l~~H~~~H   23 (23)
T PF00096_consen    1 YKCPICGKSFSSKSNLKRHMRRH   23 (23)
T ss_dssp             EEETTTTEEESSHHHHHHHHHHH
T ss_pred             CCCCCCCCccCCHHHHHHHHhHC
Confidence            78999999999999999999765


No 42 
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=95.33  E-value=0.0063  Score=69.52  Aligned_cols=71  Identities=20%  Similarity=0.327  Sum_probs=45.6

Q ss_pred             CCCcccCCC--CCcccccccccccccccccchhhhcccCcccccccccccCChhhhhhhhhhcccccccccccccccccC
Q 000554          844 DEKTHKCKI--CSQVFLHDQELGVHWMDNHKKEAQWLFRGYACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPC  921 (1428)
Q Consensus       844 gekpykC~~--CgK~F~s~s~L~~H~~r~Ht~e~~~l~KpykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~C  921 (1428)
                      ++|||+|++  |.|.++....|+-|+..-|...     +...-+          .-..|.-.      ..+.|||.|++|
T Consensus       346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~-----~~~~~p----------~p~~~~~F------~~~~KPYrCevC  404 (423)
T COG5189         346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQ-----KLHENP----------SPEKMNIF------SAKDKPYRCEVC  404 (423)
T ss_pred             cCceecCCCCCchhhhccccchhhhhhccccCc-----ccCCCC----------Cccccccc------cccCCceecccc
Confidence            359999987  9999999999999954444332     111111          11111111      112358888888


Q ss_pred             CCCCCChhhhhhhh
Q 000554          922 GSHFGNTEELWLHV  935 (1428)
Q Consensus       922 gKsF~sks~L~~H~  935 (1428)
                      +|.+++...|+-|.
T Consensus       405 ~KRYKNlNGLKYHr  418 (423)
T COG5189         405 DKRYKNLNGLKYHR  418 (423)
T ss_pred             chhhccCccceecc
Confidence            88888888888885


No 43 
>PF12756 zf-C2H2_2:  C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=95.27  E-value=0.0069  Score=58.34  Aligned_cols=73  Identities=19%  Similarity=0.303  Sum_probs=20.9

Q ss_pred             cCCCCCcccccccccccccccccchhhhcccCcccccccccccCChhhhhhhhhhcccccccccccccccccCCCCCCCh
Q 000554          849 KCKICSQVFLHDQELGVHWMDNHKKEAQWLFRGYACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCGSHFGNT  928 (1428)
Q Consensus       849 kC~~CgK~F~s~s~L~~H~~r~Ht~e~~~l~KpykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF~sk  928 (1428)
                      +|..|+..|.+...|..|+...|.-.     -+     ....+.....+..+.+.....       .+.|..|++.|.+.
T Consensus         1 ~C~~C~~~f~~~~~l~~H~~~~H~~~-----~~-----~~~~l~~~~~~~~~~~~~~~~-------~~~C~~C~~~f~s~   63 (100)
T PF12756_consen    1 QCLFCDESFSSVDDLLQHMKKKHGFD-----IP-----DQKYLVDPNRLLNYLRKKVKE-------SFRCPYCNKTFRSR   63 (100)
T ss_dssp             ----------------------------------------------------------S-------SEEBSSSS-EESSH
T ss_pred             Cccccccccccccccccccccccccc-----cc-----cccccccccccccccccccCC-------CCCCCccCCCCcCH
Confidence            58999999999999999976677543     11     222233444455554432222       58999999999999


Q ss_pred             hhhhhhhhhc
Q 000554          929 EELWLHVQSV  938 (1428)
Q Consensus       929 s~L~~H~rsv  938 (1428)
                      ..|..|++..
T Consensus        64 ~~l~~Hm~~~   73 (100)
T PF12756_consen   64 EALQEHMRSK   73 (100)
T ss_dssp             HHHHHHHHHT
T ss_pred             HHHHHHHcCc
Confidence            9999999854


No 44 
>PF12756 zf-C2H2_2:  C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=95.24  E-value=0.011  Score=57.06  Aligned_cols=71  Identities=23%  Similarity=0.440  Sum_probs=17.1

Q ss_pred             ccccCCCCCCChhhhhhhhhhcccccccchhhhhccccccCcCCCCccccCChhhhhhhhhhcCCccceecCccCcccCC
Q 000554          917 QCIPCGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDSPKKLELGYSASVENHSENLGSIRKFICRFCGLKFDL  996 (1428)
Q Consensus       917 kC~~CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~k~~sf~sks~L~~H~rtHtGeKpykC~~CGKsFs~  996 (1428)
                      +|..|+..|.+...|..|+...|.-.+         +    .    .........+..+.+.. -...+.|..|++.|..
T Consensus         1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~---------~----~----~~~l~~~~~~~~~~~~~-~~~~~~C~~C~~~f~s   62 (100)
T PF12756_consen    1 QCLFCDESFSSVDDLLQHMKKKHGFDI---------P----D----QKYLVDPNRLLNYLRKK-VKESFRCPYCNKTFRS   62 (100)
T ss_dssp             ------------------------------------------------------------------SSEEBSSSS-EESS
T ss_pred             Ccccccccccccccccccccccccccc---------c----c----ccccccccccccccccc-cCCCCCCCccCCCCcC
Confidence            488899999999999999887775330         0    0    00111111333333221 1126888888888888


Q ss_pred             hhhHHHHHH
Q 000554          997 LPDLGRHHQ 1005 (1428)
Q Consensus       997 ~s~L~rHHq 1005 (1428)
                      ...|..|..
T Consensus        63 ~~~l~~Hm~   71 (100)
T PF12756_consen   63 REALQEHMR   71 (100)
T ss_dssp             HHHHHHHHH
T ss_pred             HHHHHHHHc
Confidence            888888443


No 45 
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=94.78  E-value=0.028  Score=66.92  Aligned_cols=62  Identities=11%  Similarity=0.084  Sum_probs=40.4

Q ss_pred             cCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhcccccccCCCccccCCCCCc
Q 000554          990 CGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRPRFKKGLGAVSYRIRNRG 1055 (1428)
Q Consensus       990 CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~H~r~H~gekpy~C~~C~ks 1055 (1428)
                      |-..+.....+.. |...|....   ...+.+..|.+.|.....+..|++.|....+..|..+...
T Consensus       394 ~~~~~~~~~~~~~-~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  455 (467)
T COG5048         394 CIRNFKRDSNLSL-HIITHLSFR---PYNCKNPPCSKSFNRHYNLIPHKKIHTNHAPLLCSILKSF  455 (467)
T ss_pred             hhhhhcccccccc-ccccccccC---CcCCCCCcchhhccCcccccccccccccCCceeecccccc
Confidence            5566666666666 555555511   2256677788888888888888888877776666555443


No 46 
>cd01395 HMT_MBD Methyl-CpG binding domains (MBD) present in putative histone methyltransferases (HMT) such as CLLD8 and SETDB1 proteins; CLLD8 contains a MBD, a PreSET and a bifurcated SET domain, suggesting that CLLD8 might be associated with methylation-mediated transcriptional repression. SETDB1 and other proteins in this group have a similar domain architecture. SETDB1 is a novel KAP-1-associated histone H3, lysine 9-specific methyltransferase that contributes to HP1-mediated silencing of euchromatic genes by KRAB zinc-finger proteins.
Probab=94.59  E-value=0.0072  Score=54.32  Aligned_cols=37  Identities=14%  Similarity=0.039  Sum_probs=31.6

Q ss_pred             CCC-CCcccC----------CcccccccCCCCCCc-cccccceeeeccC
Q 000554         1184 HLE-PLPSVS----------AGIRSSDSSDFVNNQ-WEVDECHCIIDSR 1220 (1428)
Q Consensus      1184 Pl~-p~~~~~----------~~~k~v~~~~p~~~~-w~~~e~~~~l~~~ 1220 (1428)
                      ||+ |+.+||          +.++.|+|++|||.. ++|.|++.||...
T Consensus         1 PL~~Pll~gw~R~~~~~~~~~~k~~V~Y~aPCGr~Lr~~~EV~~YL~~t   49 (60)
T cd01395           1 PLHTPLLCGFQRMKYRARVGKVKKHVIYKAPCGRSLRNMSEVHRYLRET   49 (60)
T ss_pred             CcccccccCeEEEEEeccCCCcccceEEECCcchhhhcHHHHHHHHHhc
Confidence            677 889999          257789999999999 9999999988743


No 47 
>KOG2231 consensus Predicted E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones]
Probab=94.40  E-value=0.031  Score=70.86  Aligned_cols=140  Identities=20%  Similarity=0.247  Sum_probs=70.0

Q ss_pred             CcccccccccccCChhhhhhhhhhcccccccccccccccccCC---CCC------CChhhhhhhhhhcccccccchhhhh
Q 000554          880 RGYACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCG---SHF------GNTEELWLHVQSVHAIDFKMSEVAQ  950 (1428)
Q Consensus       880 KpykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~Cg---KsF------~sks~L~~H~rsvHsgEf~~~s~~~  950 (1428)
                      ..-.|..| -.|.....|+.|+...|.        .+.|..|-   +.|      -+...|.+|++.   ++.-..+..+
T Consensus       114 ~~~~~~~c-~~~~s~~~Lk~H~~~~H~--------~~~c~lC~~~~kif~~e~k~Yt~~el~~h~~~---gd~d~~s~rG  181 (669)
T KOG2231|consen  114 NKKECLHC-TEFKSVENLKNHMRDQHK--------LHLCSLCLQNLKIFINERKLYTRAELNLHLMF---GDPDDESCRG  181 (669)
T ss_pred             ccCCCccc-cchhHHHHHHHHHHHhhh--------hhccccccccceeeeeeeehehHHHHHHHHhc---CCCccccccC
Confidence            33456666 666677777777765554        34455442   222      234556666541   1100000000


Q ss_pred             ccccccCcCCCCccccCChhhhhhhhhhcCCccceecCccC------cccCChhhHHHHHHhhccCCCCCCCCCcccC--
Q 000554          951 QHNQSVGEDSPKKLELGYSASVENHSENLGSIRKFICRFCG------LKFDLLPDLGRHHQAAHMGPNLVNSRPHKKG-- 1022 (1428)
Q Consensus       951 ~kp~~C~~C~~k~~sf~sks~L~~H~rtHtGeKpykC~~CG------KsFs~~s~L~rHHqrvHtge~~~~eKpykC~-- 1022 (1428)
                        .-.|..|   ...|-....|.+|++.++    |.|.+|.      .-|.....|..|-+.-|          |.|.  
T Consensus       182 --hp~C~~C---~~~fld~~el~rH~~~~h----~~chfC~~~~~~neyy~~~~dLe~HfR~~H----------flCE~~  242 (669)
T KOG2231|consen  182 --HPLCKFC---HERFLDDDELYRHLRFDH----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH----------FLCEEE  242 (669)
T ss_pred             --Cccchhh---hhhhccHHHHHHhhccce----eheeecCcccccchhcccchHHHHHhhhcC----------cccccc
Confidence              1234444   445555556666666543    5666663      34666667777433333          2343  


Q ss_pred             CCC-----cccCCchhhhcccccccCCCccccC
Q 000554         1023 IRF-----YAYKLKSGRLSRPRFKKGLGAVSYR 1050 (1428)
Q Consensus      1023 ~Cg-----KsFs~ks~L~~H~r~H~gekpy~C~ 1050 (1428)
                      .|-     -.|.....|+.|.+.+.-++.|.|.
T Consensus       243 ~C~~~~f~~~~~~ei~lk~~~~~~~~e~~~~~~  275 (669)
T KOG2231|consen  243 FCRTKKFYVAFELEIELKAHNRFIQHEKCYICR  275 (669)
T ss_pred             ccccceeeehhHHHHHHHhhccccchheeccCC
Confidence            232     2334455566666655566666664


No 48 
>PF13912 zf-C2H2_6:  C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=94.24  E-value=0.016  Score=43.36  Aligned_cols=24  Identities=38%  Similarity=0.750  Sum_probs=12.0

Q ss_pred             ceecCccCcccCChhhHHHHHHhhc
Q 000554          984 KFICRFCGLKFDLLPDLGRHHQAAH 1008 (1428)
Q Consensus       984 pykC~~CGKsFs~~s~L~rHHqrvH 1008 (1428)
                      ||+|..|++.|.....|.. |++.|
T Consensus         1 ~~~C~~C~~~F~~~~~l~~-H~~~h   24 (27)
T PF13912_consen    1 PFECDECGKTFSSLSALRE-HKRSH   24 (27)
T ss_dssp             SEEETTTTEEESSHHHHHH-HHCTT
T ss_pred             CCCCCccCCccCChhHHHH-HhHHh
Confidence            3455555555555555555 34433


No 49 
>PF13912 zf-C2H2_6:  C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=94.03  E-value=0.033  Score=41.65  Aligned_cols=26  Identities=12%  Similarity=-0.074  Sum_probs=23.5

Q ss_pred             CcccCCCCcccCCchhhhcccccccC
Q 000554         1018 PHKKGIRFYAYKLKSGRLSRPRFKKG 1043 (1428)
Q Consensus      1018 pykC~~CgKsFs~ks~L~~H~r~H~g 1043 (1428)
                      ||+|..|++.|.....|..|++.|.+
T Consensus         1 ~~~C~~C~~~F~~~~~l~~H~~~h~~   26 (27)
T PF13912_consen    1 PFECDECGKTFSSLSALREHKRSHCS   26 (27)
T ss_dssp             SEEETTTTEEESSHHHHHHHHCTTTT
T ss_pred             CCCCCccCCccCChhHHHHHhHHhcC
Confidence            68999999999999999999988864


No 50 
>PF13894 zf-C2H2_4:  C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=93.96  E-value=0.024  Score=40.46  Aligned_cols=22  Identities=36%  Similarity=0.738  Sum_probs=9.8

Q ss_pred             cccccccccCChhhhhhhhhhc
Q 000554          883 ACAICLDSFTNKKVLESHVQER  904 (1428)
Q Consensus       883 kC~~CgKsF~~ks~L~~H~r~H  904 (1428)
                      .|++|++.|.+...|..|++.|
T Consensus         2 ~C~~C~~~~~~~~~l~~H~~~~   23 (24)
T PF13894_consen    2 QCPICGKSFRSKSELRQHMRTH   23 (24)
T ss_dssp             E-SSTS-EESSHHHHHHHHHHH
T ss_pred             CCcCCCCcCCcHHHHHHHHHhh
Confidence            4445555555555555554444


No 51 
>KOG2231 consensus Predicted E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones]
Probab=93.17  E-value=0.088  Score=66.94  Aligned_cols=74  Identities=22%  Similarity=0.280  Sum_probs=36.1

Q ss_pred             ccccCCCCCCChhhhhhhhhhcccccccchhhhhccccccCcCCCCccccC------ChhhhhhhhhhcC-Ccc----ce
Q 000554          917 QCIPCGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDSPKKLELG------YSASVENHSENLG-SIR----KF  985 (1428)
Q Consensus       917 kC~~CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~k~~sf~------sks~L~~H~rtHt-GeK----py  985 (1428)
                      .|.+| -.|.+...|+.|+...|            +.+.|..|..-.+.|.      ....|..|++.-. +++    .-
T Consensus       117 ~~~~c-~~~~s~~~Lk~H~~~~H------------~~~~c~lC~~~~kif~~e~k~Yt~~el~~h~~~gd~d~~s~rGhp  183 (669)
T KOG2231|consen  117 ECLHC-TEFKSVENLKNHMRDQH------------KLHLCSLCLQNLKIFINERKLYTRAELNLHLMFGDPDDESCRGHP  183 (669)
T ss_pred             CCccc-cchhHHHHHHHHHHHhh------------hhhccccccccceeeeeeeehehHHHHHHHHhcCCCccccccCCc
Confidence            36666 55666666666665555            2344544432222111      2345555554311 111    13


Q ss_pred             ecCccCcccCChhhHHHH
Q 000554          986 ICRFCGLKFDLLPDLGRH 1003 (1428)
Q Consensus       986 kC~~CGKsFs~~s~L~rH 1003 (1428)
                      .|..|...|-....|.+|
T Consensus       184 ~C~~C~~~fld~~el~rH  201 (669)
T KOG2231|consen  184 LCKFCHERFLDDDELYRH  201 (669)
T ss_pred             cchhhhhhhccHHHHHHh
Confidence            466666666666666664


No 52 
>PF13894 zf-C2H2_4:  C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=93.06  E-value=0.07  Score=38.02  Aligned_cols=18  Identities=33%  Similarity=0.794  Sum_probs=10.3

Q ss_pred             eecCccCcccCChhhHHH
Q 000554          985 FICRFCGLKFDLLPDLGR 1002 (1428)
Q Consensus       985 ykC~~CGKsFs~~s~L~r 1002 (1428)
                      |.|+.|++.|.+...|.+
T Consensus         1 ~~C~~C~~~~~~~~~l~~   18 (24)
T PF13894_consen    1 FQCPICGKSFRSKSELRQ   18 (24)
T ss_dssp             EE-SSTS-EESSHHHHHH
T ss_pred             CCCcCCCCcCCcHHHHHH
Confidence            456666666666666666


No 53 
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=92.70  E-value=0.1  Score=62.30  Aligned_cols=168  Identities=15%  Similarity=0.204  Sum_probs=109.4

Q ss_pred             CcccCCCCCccccccccccccccc--ccchhhhcccCccccc--ccccccCChhhhhhhhhhccccccccccccccccc-
Q 000554          846 KTHKCKICSQVFLHDQELGVHWMD--NHKKEAQWLFRGYACA--ICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIP-  920 (1428)
Q Consensus       846 kpykC~~CgK~F~s~s~L~~H~~r--~Ht~e~~~l~KpykC~--~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~-  920 (1428)
                      .++.|..|...|.....|..| .+  .|..+.   .+++.|+  .|++.|.+...+..|...|.+..      ++.|.. 
T Consensus       288 ~~~~~~~~~~~~s~~~~l~~~-~~~~~h~~~~---~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~  357 (467)
T COG5048         288 LPIKSKQCNISFSRSSPLTRH-LRSVNHSGES---LKPFSCPYSLCGKLFSRNDALKRHILLHTSIS------PAKEKLL  357 (467)
T ss_pred             cCCCCccccCCcccccccccc-cccccccccc---CCceeeeccCCCccccccccccCCcccccCCC------ccccccc
Confidence            578999999999999999999 66  787762   2689999  79999999999999999999876      455543 


Q ss_pred             -CCCCCCChhhhhhhhhhcccccccchhhhhccccccCcCCCCccccCChhhhhhhhhhcCCcc--ceecCccCcccCCh
Q 000554          921 -CGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDSPKKLELGYSASVENHSENLGSIR--KFICRFCGLKFDLL  997 (1428)
Q Consensus       921 -CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~k~~sf~sks~L~~H~rtHtGeK--pykC~~CGKsFs~~  997 (1428)
                       |.+.+.....-..+.. .+...    .....+.+.+..- .+...+.....+..|...|...+  .+.|..|.+.|...
T Consensus       358 ~~~~~~~~~~~~~~~~~-~~~~~----~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  431 (467)
T COG5048         358 NSSSKFSPLLNNEPPQS-LQQYK----DLKNDKKSETLSN-SCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRH  431 (467)
T ss_pred             cCccccccccCCCCccc-hhhcc----CccCCcccccccc-chhhhhccccccccccccccccCCcCCCCCcchhhccCc
Confidence             5555544433221211 11000    0011123333222 22444455556777777777665  57778999999999


Q ss_pred             hhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhc
Q 000554          998 PDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLS 1036 (1428)
Q Consensus       998 s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~ 1036 (1428)
                      ..|.. |.+.|..     ..++.|..+ +.|.....+..
T Consensus       432 ~~~~~-~~~~~~~-----~~~~~~~~~-~~~~~~~~~~~  463 (467)
T COG5048         432 YNLIP-HKKIHTN-----HAPLLCSIL-KSFRRDLDLSN  463 (467)
T ss_pred             ccccc-ccccccc-----CCceeeccc-cccchhhhhhc
Confidence            99999 7888887     455555444 34444444433


No 54 
>KOG1146 consensus Homeobox protein [General function prediction only]
Probab=92.67  E-value=0.069  Score=71.18  Aligned_cols=157  Identities=15%  Similarity=0.110  Sum_probs=94.5

Q ss_pred             ccccccccCChhhhhhhhhhcccccccccccccccccCCCCCCChhhhhhhhhhcccccccchhhhhccccccCcCCCCc
Q 000554          884 CAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDSPKK  963 (1428)
Q Consensus       884 C~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~k~  963 (1428)
                      |..|+..+.++..+..|+..-+...     +.|+|+.|+..|.....|..|+|..|..-         ..   ..|   .
T Consensus       439 ~~~~e~~~~s~r~~~~~t~~L~S~~-----kt~~cpkc~~~yk~a~~L~vhmRskhp~~---------~~---~~c---~  498 (1406)
T KOG1146|consen  439 LTKAEPLLESKRSLEGQTVVLHSFF-----KTLKCPKCNWHYKLAQTLGVHMRSKHPES---------QS---AYC---K  498 (1406)
T ss_pred             ccchhhhhhhhcccccceeeeeccc-----ccccCCccchhhhhHHHhhhccccccccc---------ch---hHh---H
Confidence            4556666666677777666555443     46788888888888888888887666532         00   222   0


Q ss_pred             cccCChhhhhhhhhhc------CCccceecCccCcccCChhhHHHHHHhh-ccCC-------------------------
Q 000554          964 LELGYSASVENHSENL------GSIRKFICRFCGLKFDLLPDLGRHHQAA-HMGP------------------------- 1011 (1428)
Q Consensus       964 ~sf~sks~L~~H~rtH------tGeKpykC~~CGKsFs~~s~L~rHHqrv-Htge------------------------- 1011 (1428)
                             ....|.+.-      .+.++|.|..|..+|....+|.+|.+.. |..+                         
T Consensus       499 -------~gq~~~~~arg~~~~~~~~p~~C~~C~~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~  571 (1406)
T KOG1146|consen  499 -------AGQNHPRLARGEVYRCPGKPYPCRACNYSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPE  571 (1406)
T ss_pred             -------hccccccccccccccCCCCcccceeeeeeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcc
Confidence                   111222211      2347888888888888888888864432 2110                         


Q ss_pred             ----------C-CCCCCCcccCCCCcccCCchhhhcccc-cccCCCccccCCCCCcCcChHHHHhhcC
Q 000554         1012 ----------N-LVNSRPHKKGIRFYAYKLKSGRLSRPR-FKKGLGAVSYRIRNRGAAGMKKRIQTLK 1067 (1428)
Q Consensus      1012 ----------~-~~~eKpykC~~CgKsFs~ks~L~~H~r-~H~gekpy~C~~C~ksf~~~~~l~~H~k 1067 (1428)
                                . +...-.+.|.+|++--.-..+|+.||. .|+-..|.-|-.|+-.+.....+..+.+
T Consensus       572 ~ag~~~~ags~~pktkP~~~C~vc~yetniarnlrihmtss~~s~~p~~~Lq~~it~~l~~~~~~~~~  639 (1406)
T KOG1146|consen  572 EAGLGPSAGSSGPKTKPSWRCEVCSYETNIARNLRIHMTASPSSSPPSLVLQQNITSSLASLLGGQGR  639 (1406)
T ss_pred             cccCCCCCCCCCCCCCCCcchhhhcchhhhhhccccccccCCCCCChHHHhhhcchhhccccccCcCC
Confidence                      0 111335899999999999999999993 4444444556555555444333333333


No 55 
>COG2940 Proteins containing SET domain [General function prediction only]
Probab=92.60  E-value=0.045  Score=68.44  Aligned_cols=72  Identities=25%  Similarity=0.215  Sum_probs=59.8

Q ss_pred             cccccCcCCCCCCCCCCceeeccceeeEEEEeecCCccceeecccCCCCCEEEEeeeEEcCHHHHHHHhhhc
Q 000554         1354 LIYECNHMCSCDRTCPNRVLQNGVRVKLEVFKTENKGWAVRAGQAILRGTFVCEYIGEVLDELETNKRRSRL 1425 (1428)
Q Consensus      1354 ~IyECn~~C~C~~~C~NRvvQ~G~~~~LeVFkT~~kGWGVra~~~Ip~GtFIcEYvGEvIt~~Ea~~R~~~Y 1425 (1428)
                      .+.+++..+.....+.|...+.....+..+..+..+||||+++..|++|+||.+|.|+++...++..|...|
T Consensus       307 ~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~fa~~~i~~~e~i~~~~~~~~~~~~~~~~~~~~  378 (480)
T COG2940         307 SSDFSKSNVSKLKELLNSNGCKKRREPNVVQESEIKGYGVFALESIKKGEFIIEYHGEIIRRKEAREREENY  378 (480)
T ss_pred             ccccccccCccccchhhhcccccccchhhhhhhcccccceeehhhccchHHHHHhcCcccchHHHHhhhccc
Confidence            344455555555567777777788888888999999999999999999999999999999999999887753


No 56 
>KOG1146 consensus Homeobox protein [General function prediction only]
Probab=92.37  E-value=0.04  Score=73.29  Aligned_cols=84  Identities=15%  Similarity=0.200  Sum_probs=65.4

Q ss_pred             cCCCCCcccccccccccccccccchhhhcccCcccccccccccCChhhhhhhhhhcccccccc-----------------
Q 000554          849 KCKICSQVFLHDQELGVHWMDNHKKEAQWLFRGYACAICLDSFTNKKVLESHVQERHHVQFVE-----------------  911 (1428)
Q Consensus       849 kC~~CgK~F~s~s~L~~H~~r~Ht~e~~~l~KpykC~~CgKsF~~ks~L~~H~r~Hhgek~~e-----------------  911 (1428)
                      .|..|+..|.....+..|+...|...     +.|+|+.|+..|+....|..|||..|.+-...                 
T Consensus       438 e~~~~e~~~~s~r~~~~~t~~L~S~~-----kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~  512 (1406)
T KOG1146|consen  438 ELTKAEPLLESKRSLEGQTVVLHSFF-----KTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVY  512 (1406)
T ss_pred             cccchhhhhhhhcccccceeeeeccc-----ccccCCccchhhhhHHHhhhcccccccccchhHhHhccccccccccccc
Confidence            35667777777777888866667665     88999999999999999999999855432111                 


Q ss_pred             --cccccccccCCCCCCChhhhhhhhhh
Q 000554          912 --QCMLQQCIPCGSHFGNTEELWLHVQS  937 (1428)
Q Consensus       912 --~~kpfkC~~CgKsF~sks~L~~H~rs  937 (1428)
                        .-++|.|..|...+..+.+|.+|++.
T Consensus       513 ~~~~~p~~C~~C~~stttng~LsihlqS  540 (1406)
T KOG1146|consen  513 RCPGKPYPCRACNYSTTTNGNLSIHLQS  540 (1406)
T ss_pred             cCCCCcccceeeeeeeecchHHHHHHHH
Confidence              11679999999999999999999874


No 57 
>PRK04860 hypothetical protein; Provisional
Probab=92.06  E-value=0.088  Score=56.54  Aligned_cols=38  Identities=16%  Similarity=0.168  Sum_probs=23.7

Q ss_pred             ceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCc
Q 000554          984 KFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLK 1031 (1428)
Q Consensus       984 pykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~k 1031 (1428)
                      +|.|. |++   ....+++ |.++|++     +++|.|..|+..|...
T Consensus       119 ~Y~C~-C~~---~~~~~rr-H~ri~~g-----~~~YrC~~C~~~l~~~  156 (160)
T PRK04860        119 PYRCK-CQE---HQLTVRR-HNRVVRG-----EAVYRCRRCGETLVFK  156 (160)
T ss_pred             EEEcC-CCC---eeCHHHH-HHHHhcC-----CccEECCCCCceeEEe
Confidence            56665 665   5555666 6666666     5666666666666543


No 58 
>PF09237 GAGA:  GAGA factor;  InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=91.88  E-value=0.053  Score=46.94  Aligned_cols=30  Identities=7%  Similarity=-0.115  Sum_probs=11.9

Q ss_pred             CCCcccCCCCcccCCchhhhcccccccCCC
Q 000554         1016 SRPHKKGIRFYAYKLKSGRLSRPRFKKGLG 1045 (1428)
Q Consensus      1016 eKpykC~~CgKsFs~ks~L~~H~r~H~gek 1045 (1428)
                      +.|..|++|+..+++..+|++|+.++++.|
T Consensus        22 ~~PatCP~C~a~~~~srnLrRHle~~H~~k   51 (54)
T PF09237_consen   22 EQPATCPICGAVIRQSRNLRRHLEIRHFKK   51 (54)
T ss_dssp             S--EE-TTT--EESSHHHHHHHHHHHTTTS
T ss_pred             CCCCCCCcchhhccchhhHHHHHHHHhccc
Confidence            444445555555555555555544444433


No 59 
>smart00355 ZnF_C2H2 zinc finger.
Probab=91.42  E-value=0.072  Score=38.37  Aligned_cols=24  Identities=29%  Similarity=0.544  Sum_probs=13.8

Q ss_pred             eecCccCcccCChhhHHHHHHhhcc
Q 000554          985 FICRFCGLKFDLLPDLGRHHQAAHM 1009 (1428)
Q Consensus       985 ykC~~CGKsFs~~s~L~rHHqrvHt 1009 (1428)
                      |+|..|++.|.....|.. |++.|.
T Consensus         1 ~~C~~C~~~f~~~~~l~~-H~~~H~   24 (26)
T smart00355        1 YRCPECGKVFKSKSALKE-HMRTHX   24 (26)
T ss_pred             CCCCCCcchhCCHHHHHH-HHHHhc
Confidence            456666666666666666 444443


No 60 
>smart00355 ZnF_C2H2 zinc finger.
Probab=91.13  E-value=0.14  Score=36.76  Aligned_cols=24  Identities=17%  Similarity=-0.100  Sum_probs=21.1

Q ss_pred             cccCCCCcccCCchhhhccccccc
Q 000554         1019 HKKGIRFYAYKLKSGRLSRPRFKK 1042 (1428)
Q Consensus      1019 ykC~~CgKsFs~ks~L~~H~r~H~ 1042 (1428)
                      |+|+.|+++|.....|..|++.|.
T Consensus         1 ~~C~~C~~~f~~~~~l~~H~~~H~   24 (26)
T smart00355        1 YRCPECGKVFKSKSALKEHMRTHX   24 (26)
T ss_pred             CCCCCCcchhCCHHHHHHHHHHhc
Confidence            679999999999999999998775


No 61 
>smart00570 AWS associated with SET domains. subdomain of PRESET
Probab=90.90  E-value=0.093  Score=45.87  Aligned_cols=25  Identities=32%  Similarity=0.777  Sum_probs=22.5

Q ss_pred             ccccccCcCCCCCCCCCCceeeccc
Q 000554         1353 YLIYECNHMCSCDRTCPNRVLQNGV 1377 (1428)
Q Consensus      1353 ~~IyECn~~C~C~~~C~NRvvQ~G~ 1377 (1428)
                      .+.+||+..|+|+..|.||.+|+..
T Consensus        26 ~l~~EC~~~C~~G~~C~NqrFqk~~   50 (51)
T smart00570       26 MLLIECSSDCPCGSYCSNQRFQKRQ   50 (51)
T ss_pred             HHhhhcCCCCCCCcCccCcccccCc
Confidence            5679999999999999999999863


No 62 
>cd05162 PWWP The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids.  The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation.  Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.  The function of the PWWP domain is still not known precisely; however, based on the fact that other regions of PWWP-domain proteins are responsible for nuclear localization and DNA-binding, is likely that the PWWP domain acts as a site for protein-protein binding interactions, influencing chromatin remodeling and thereby regulating transcriptional processes.  Some PWWP-domain proteins have been linked to cancer or other diseases; some are known to function as growth factors.
Probab=90.32  E-value=0.26  Score=47.32  Aligned_cols=60  Identities=18%  Similarity=0.474  Sum_probs=47.7

Q ss_pred             EEEEEecc-ccccceeeeeccCCCccccccccCCCccEEEEEeccCCcchhhhhhccccccCCCc
Q 000554          157 ALWVKWRG-KWQAGIRCARADWPLPTLKAKPTHDRKKYFVIFFPHTRNYSWADMLLVRSINEFPQ  220 (1428)
Q Consensus       157 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  220 (1428)
                      -+|+|.+| -|--|+-+...+.+...   .+......|.|.||+ +++|.||+---|.+..++-.
T Consensus         6 lVwaK~~g~pwWPa~V~~~~~~~~~~---~~~~~~~~~~V~Ffg-~~~~~wv~~~~l~pf~~~~~   66 (87)
T cd05162           6 LVWAKMKGYPWWPALVVDPPKDSKKA---KKKAKEGKVLVLFFG-DKTFAWVGAERLKPFTEHKE   66 (87)
T ss_pred             EEEEeCCCCCCCCEEEccccccchhh---hccCCCCEEEEEEeC-CCcEEEeCccceeeccchHH
Confidence            48999999 78888888777776543   233345789999999 99999999999988887653


No 63 
>PRK04860 hypothetical protein; Provisional
Probab=89.39  E-value=0.14  Score=55.09  Aligned_cols=39  Identities=10%  Similarity=-0.119  Sum_probs=34.8

Q ss_pred             CCcccCCCCcccCCchhhhcccccccCCCccccCCCCCcCcCh
Q 000554         1017 RPHKKGIRFYAYKLKSGRLSRPRFKKGLGAVSYRIRNRGAAGM 1059 (1428)
Q Consensus      1017 KpykC~~CgKsFs~ks~L~~H~r~H~gekpy~C~~C~ksf~~~ 1059 (1428)
                      -+|.|. |++   ....+++|.++|+++++|.|..|+..+...
T Consensus       118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~  156 (160)
T PRK04860        118 FPYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFK  156 (160)
T ss_pred             EEEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEe
Confidence            379998 998   888899999999999999999999987643


No 64 
>cd05840 SPBC215_ISWI_like The PWWP domain is a component of the S. pombe hypothetical protein SPBC215, as well as ISWI complex protein 4.  The ISWI (imitation switch) proteins are ATPases responsible for chromatin remodeling in eukaryotes, and SPBC215 is proposed to also bind chromatin.   The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding,  proteins that function as transcription factors regulating a variety of developmental processes.
Probab=88.90  E-value=0.31  Score=47.89  Aligned_cols=59  Identities=24%  Similarity=0.430  Sum_probs=48.9

Q ss_pred             EEEEEeccc-cccceeeeeccCCCccccccccCCCccEEEEEeccCCcchhhhhhcccccc
Q 000554          157 ALWVKWRGK-WQAGIRCARADWPLPTLKAKPTHDRKKYFVIFFPHTRNYSWADMLLVRSIN  216 (1428)
Q Consensus       157 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  216 (1428)
                      -+|.|-+|- |=-|+=|...+-|-.-|++++......|.|.||+. ++|.|++--.+.+..
T Consensus         6 lVwaK~~GyPwWPA~V~~~~~~p~~~l~~~~~~~~~~~~V~FFg~-~~~~Wv~~~~l~pl~   65 (93)
T cd05840           6 RVLAKVKGFPAWPAIVVPEEMLPDSVLKGKKKKNKRTYPVMFFPD-GDYYWVPNKDLKPLT   65 (93)
T ss_pred             EEEEeCCCCCCCCEEECChHHCCHHHHhcccCCCCCeEEEEEeCC-CcEEEEChhhcccCC
Confidence            389999994 66677777777888888888888899999999995 699999887777665


No 65 
>PF09237 GAGA:  GAGA factor;  InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=86.52  E-value=0.38  Score=41.90  Aligned_cols=29  Identities=24%  Similarity=0.500  Sum_probs=17.8

Q ss_pred             CcccccccccccCChhhhhhhhhhccccc
Q 000554          880 RGYACAICLDSFTNKKVLESHVQERHHVQ  908 (1428)
Q Consensus       880 KpykC~~CgKsF~~ks~L~~H~r~Hhgek  908 (1428)
                      .|-.|++|+..+.+..+|++|+..+|+.+
T Consensus        23 ~PatCP~C~a~~~~srnLrRHle~~H~~k   51 (54)
T PF09237_consen   23 QPATCPICGAVIRQSRNLRRHLEIRHFKK   51 (54)
T ss_dssp             --EE-TTT--EESSHHHHHHHHHHHTTTS
T ss_pred             CCCCCCcchhhccchhhHHHHHHHHhccc
Confidence            66777777777777777777777777655


No 66 
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=85.33  E-value=0.43  Score=55.58  Aligned_cols=103  Identities=21%  Similarity=0.224  Sum_probs=58.1

Q ss_pred             cccccc--CCCCCCChhhhhhhhhhcccccccchhhhhccccccCcCCCCccccC------ChhhhhhhhhhcCCccc--
Q 000554          915 LQQCIP--CGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDSPKKLELG------YSASVENHSENLGSIRK--  984 (1428)
Q Consensus       915 pfkC~~--CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~k~~sf~------sks~L~~H~rtHtGeKp--  984 (1428)
                      .|.|+.  |.........|..|.+..|.            .+.|.+|......|.      ++..|..|...-..+.-  
T Consensus       151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~------------~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFK  218 (493)
T COG5236         151 SFKCPKSKCHRRCGSLKELKKHYKAQHG------------FVLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFK  218 (493)
T ss_pred             HhcCCchhhhhhhhhHHHHHHHHHhhcC------------cEEhHhhhcCcccCccceeeeecccccccccCCccccCcC
Confidence            356654  55555556667777765552            455666643333333      23456666543332222  


Q ss_pred             --eecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcc-------cCCchhhhcccc
Q 000554          985 --FICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYA-------YKLKSGRLSRPR 1039 (1428)
Q Consensus       985 --ykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKs-------Fs~ks~L~~H~r 1039 (1428)
                        -.|.+|.+.|-.-..|.+|.+..|..          |.+|++.       |..-..|..|.+
T Consensus       219 GHP~C~FC~~~FYdDDEL~~HcR~~HE~----------ChICD~v~p~~~QYFK~Y~~Le~HF~  272 (493)
T COG5236         219 GHPLCIFCKIYFYDDDELRRHCRLRHEA----------CHICDMVGPIRYQYFKSYEDLEAHFR  272 (493)
T ss_pred             CCchhhhccceecChHHHHHHHHhhhhh----------hhhhhccCccchhhhhCHHHHHHHhh
Confidence              24788888888888888854444544          6666653       555566666653


No 67 
>PF12874 zf-met:  Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=84.73  E-value=0.26  Score=36.10  Aligned_cols=21  Identities=10%  Similarity=-0.042  Sum_probs=11.0

Q ss_pred             cccCCCCcccCCchhhhcccc
Q 000554         1019 HKKGIRFYAYKLKSGRLSRPR 1039 (1428)
Q Consensus      1019 ykC~~CgKsFs~ks~L~~H~r 1039 (1428)
                      |.|.+|++.|.....|..|++
T Consensus         1 ~~C~~C~~~f~s~~~~~~H~~   21 (25)
T PF12874_consen    1 FYCDICNKSFSSENSLRQHLR   21 (25)
T ss_dssp             EEETTTTEEESSHHHHHHHHT
T ss_pred             CCCCCCCCCcCCHHHHHHHHC
Confidence            345555555555555555554


No 68 
>PF11722 zf-TRM13_CCCH:  CCCH zinc finger in TRM13 protein;  InterPro: IPR021721  This domain is found at the N terminus of TRM13 methyltransferase proteins. It is presumed to be a zinc binding domain. ; GO: 0008168 methyltransferase activity
Probab=83.99  E-value=0.35  Score=38.14  Aligned_cols=29  Identities=28%  Similarity=0.619  Sum_probs=27.0

Q ss_pred             cccchhhhhcCceeeEeecCCceEEEEec
Q 000554          533 RQCTAFIESKGRQCVRWANEGDVYCCVHL  561 (1428)
Q Consensus       533 ~~c~a~~~~kgrqc~r~a~~~~~ycc~h~  561 (1428)
                      -+|.-||+.|.|.|.=.+..|..||--|+
T Consensus         2 ~~C~f~l~~K~R~C~m~~~~g~~fC~~H~   30 (31)
T PF11722_consen    2 GRCEFFLPRKKRFCKMTRKPGSRFCGEHM   30 (31)
T ss_pred             CcceEECCccccccCCeecCcCCccccCC
Confidence            37999999999999999999999999885


No 69 
>PF13909 zf-H2C2_5:  C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=81.78  E-value=0.6  Score=34.05  Aligned_cols=23  Identities=35%  Similarity=0.689  Sum_probs=11.9

Q ss_pred             ccccccccccCChhhhhhhhhhcc
Q 000554          882 YACAICLDSFTNKKVLESHVQERH  905 (1428)
Q Consensus       882 ykC~~CgKsF~~ks~L~~H~r~Hh  905 (1428)
                      |+|+.|+.... +..|..|++.||
T Consensus         1 y~C~~C~y~t~-~~~l~~H~~~~H   23 (24)
T PF13909_consen    1 YKCPHCSYSTS-KSNLKRHLKRHH   23 (24)
T ss_dssp             EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred             CCCCCCCCcCC-HHHHHHHHHhhC
Confidence            45566665555 555666665554


No 70 
>PF12874 zf-met:  Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=81.43  E-value=0.63  Score=34.07  Aligned_cols=21  Identities=33%  Similarity=0.773  Sum_probs=11.1

Q ss_pred             cccccccccCChhhhhhhhhh
Q 000554          883 ACAICLDSFTNKKVLESHVQE  903 (1428)
Q Consensus       883 kC~~CgKsF~~ks~L~~H~r~  903 (1428)
                      .|.+|++.|.+...|..|++.
T Consensus         2 ~C~~C~~~f~s~~~~~~H~~s   22 (25)
T PF12874_consen    2 YCDICNKSFSSENSLRQHLRS   22 (25)
T ss_dssp             EETTTTEEESSHHHHHHHHTT
T ss_pred             CCCCCCCCcCCHHHHHHHHCc
Confidence            455555555555555555543


No 71 
>cd07765 KRAB_A-box KRAB (Kruppel-associated box) domain -A box. The KRAB domain is a transcription repression module, found in a subgroup of the zinc finger proteins (ZFPs) of the C2H2 family, KRAB-ZFPs. KRAB-ZFPs comprise the largest group of transcriptional regulators in mammals, and are only found in tetrapods. These proteins have been shown to play important roles in cell differentiation and organ development, and in regulating viral replication and transcription. A KRAB domain may consist of an A-box, or of an A-box plus either a B-box, a divergent B-box (b), or a C-box. Only the A-box is included in this model. The A-box is needed for repression, the B- and C- boxes are not. KRAB-ZFPs have one or two KRAB domains at their amino-terminal end, and multiple C2H2 zinc finger motifs at their C-termini. Some KRAB-ZFPs also contain a SCAN domain which mediates homo- and hetero-oligomerization. The KRAB domain is a protein-protein interaction module which represses transcription through 
Probab=81.21  E-value=0.88  Score=32.73  Aligned_cols=28  Identities=21%  Similarity=0.127  Sum_probs=25.3

Q ss_pred             eecceeeeecccccCChhhhcccchhhhhhhhc
Q 000554          732 IISKEVFLELLKDCCSLEQKLHLHLACELFYKL  764 (1428)
Q Consensus       732 VTFkDVAV~F~r~c~SqEEW~~LdPaCrkLYrd  764 (1428)
                      ++|+||++.|     +.++|.++.+.++.+|.+
T Consensus         1 ~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~   28 (40)
T cd07765           1 VTFEDVAVYF-----SQEEWELLDPAQRDLYRD   28 (40)
T ss_pred             Ccceeeeeec-----CHHHHhcCCHHHHHHHHH
Confidence            3678999999     999999999999999886


No 72 
>PF12171 zf-C2H2_jaz:  Zinc-finger double-stranded RNA-binding;  InterPro: IPR022755  This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation.   This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=80.75  E-value=0.92  Score=34.19  Aligned_cols=22  Identities=0%  Similarity=-0.277  Sum_probs=17.3

Q ss_pred             cccCCCCcccCCchhhhccccc
Q 000554         1019 HKKGIRFYAYKLKSGRLSRPRF 1040 (1428)
Q Consensus      1019 ykC~~CgKsFs~ks~L~~H~r~ 1040 (1428)
                      |.|..|++.|.+...|..|++.
T Consensus         2 ~~C~~C~k~f~~~~~~~~H~~s   23 (27)
T PF12171_consen    2 FYCDACDKYFSSENQLKQHMKS   23 (27)
T ss_dssp             CBBTTTTBBBSSHHHHHCCTTS
T ss_pred             CCcccCCCCcCCHHHHHHHHcc
Confidence            6788888888888888888754


No 73 
>PF13909 zf-H2C2_5:  C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=77.11  E-value=1.8  Score=31.53  Aligned_cols=17  Identities=24%  Similarity=0.626  Sum_probs=7.8

Q ss_pred             eecCccCcccCChhhHHH
Q 000554          985 FICRFCGLKFDLLPDLGR 1002 (1428)
Q Consensus       985 ykC~~CGKsFs~~s~L~r 1002 (1428)
                      |+|+.|+.... ...|.+
T Consensus         1 y~C~~C~y~t~-~~~l~~   17 (24)
T PF13909_consen    1 YKCPHCSYSTS-KSNLKR   17 (24)
T ss_dssp             EE-SSSS-EES-HHHHHH
T ss_pred             CCCCCCCCcCC-HHHHHH
Confidence            44555555554 555555


No 74 
>PF12171 zf-C2H2_jaz:  Zinc-finger double-stranded RNA-binding;  InterPro: IPR022755  This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation.   This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=76.45  E-value=1.3  Score=33.28  Aligned_cols=21  Identities=24%  Similarity=0.695  Sum_probs=10.3

Q ss_pred             ccccccccccCChhhhhhhhh
Q 000554          882 YACAICLDSFTNKKVLESHVQ  902 (1428)
Q Consensus       882 ykC~~CgKsF~~ks~L~~H~r  902 (1428)
                      |.|..|++.|.+...|..|++
T Consensus         2 ~~C~~C~k~f~~~~~~~~H~~   22 (27)
T PF12171_consen    2 FYCDACDKYFSSENQLKQHMK   22 (27)
T ss_dssp             CBBTTTTBBBSSHHHHHCCTT
T ss_pred             CCcccCCCCcCCHHHHHHHHc
Confidence            345555555555555555544


No 75 
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=75.57  E-value=1.8  Score=50.69  Aligned_cols=135  Identities=22%  Similarity=0.313  Sum_probs=70.0

Q ss_pred             ccCCC--CCcccccccccccccccccchhhhcccCcccccccc---cccC------Chhhhhhhhhhccccccccccccc
Q 000554          848 HKCKI--CSQVFLHDQELGVHWMDNHKKEAQWLFRGYACAICL---DSFT------NKKVLESHVQERHHVQFVEQCMLQ  916 (1428)
Q Consensus       848 ykC~~--CgK~F~s~s~L~~H~~r~Ht~e~~~l~KpykC~~Cg---KsF~------~ks~L~~H~r~Hhgek~~e~~kpf  916 (1428)
                      |.|+.  |.........|+.|.+..|.        .+-|.+|-   +.|.      ++..|..|...-..+....  .-=
T Consensus       152 F~CP~skc~~~C~~~k~lk~H~K~~H~--------~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFK--GHP  221 (493)
T COG5236         152 FKCPKSKCHRRCGSLKELKKHYKAQHG--------FVLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFK--GHP  221 (493)
T ss_pred             hcCCchhhhhhhhhHHHHHHHHHhhcC--------cEEhHhhhcCcccCccceeeeecccccccccCCccccCcC--CCc
Confidence            56654  44444445556666332332        24566663   2333      3344555544322221000  122


Q ss_pred             ccccCCCCCCChhhhhhhhhhcccccccchhhhhccccccCcCC----CCccccCChhhhhhhhhhcCCccceecCc--c
Q 000554          917 QCIPCGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDS----PKKLELGYSASVENHSENLGSIRKFICRF--C  990 (1428)
Q Consensus       917 kC~~CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~----~k~~sf~sks~L~~H~rtHtGeKpykC~~--C  990 (1428)
                      .|..|...|-+-..|..|+|..|.              .|.+|-    ..-..|.+-..|..|.+.-+    |.|.+  |
T Consensus       222 ~C~FC~~~FYdDDEL~~HcR~~HE--------------~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h----y~ct~qtc  283 (493)
T COG5236         222 LCIFCKIYFYDDDELRRHCRLRHE--------------ACHICDMVGPIRYQYFKSYEDLEAHFRNAH----YCCTFQTC  283 (493)
T ss_pred             hhhhccceecChHHHHHHHHhhhh--------------hhhhhhccCccchhhhhCHHHHHHHhhcCc----eEEEEEEE
Confidence            588888888888888888876553              344441    11122445556666664322    55532  3


Q ss_pred             C----cccCChhhHHHHHHhhccC
Q 000554          991 G----LKFDLLPDLGRHHQAAHMG 1010 (1428)
Q Consensus       991 G----KsFs~~s~L~rHHqrvHtg 1010 (1428)
                      -    ..|.....|..|..+.|..
T Consensus       284 ~~~k~~vf~~~~el~~h~~~~h~~  307 (493)
T COG5236         284 RVGKCYVFPYHTELLEHLTRFHKV  307 (493)
T ss_pred             ecCcEEEeccHHHHHHHHHHHhhc
Confidence            2    3577777777776666755


No 76 
>KOG2893 consensus Zn finger protein [General function prediction only]
Probab=74.34  E-value=1.4  Score=49.37  Aligned_cols=46  Identities=24%  Similarity=0.247  Sum_probs=35.9

Q ss_pred             cCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhccc-cccc
Q 000554          987 CRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRP-RFKK 1042 (1428)
Q Consensus       987 C~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~H~-r~H~ 1042 (1428)
                      |-+|.+.|....-|.+ |++         .|-|+|.+|.|...+.-.|..|- ++|+
T Consensus        13 cwycnrefddekiliq-hqk---------akhfkchichkkl~sgpglsihcmqvhk   59 (341)
T KOG2893|consen   13 CWYCNREFDDEKILIQ-HQK---------AKHFKCHICHKKLFSGPGLSIHCMQVHK   59 (341)
T ss_pred             eeecccccchhhhhhh-hhh---------hccceeeeehhhhccCCCceeehhhhhh
Confidence            8888888888888888 443         46788888888888888888874 6665


No 77 
>KOG4173 consensus Alpha-SNAP protein [Intracellular trafficking, secretion, and vesicular transport]
Probab=74.06  E-value=0.64  Score=50.98  Aligned_cols=91  Identities=23%  Similarity=0.325  Sum_probs=67.7

Q ss_pred             Ccccccc--cccccCChhhhhhhhhhcccccccccccccccccCCCCCCChhhhhhhhhhcccccccchhhhhccccccC
Q 000554          880 RGYACAI--CLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVG  957 (1428)
Q Consensus       880 KpykC~~--CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~  957 (1428)
                      ..|.|++  |...|........|-...|+..         |..|.+.|.+...|..|+...|..-|.             
T Consensus        78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~~s---------Cs~C~r~~Pt~hLLd~HI~E~HDs~Fq-------------  135 (253)
T KOG4173|consen   78 PAFACQVAGCCQVFDALDDYEHHYHTLHGNS---------CSFCKRAFPTGHLLDAHILEWHDSLFQ-------------  135 (253)
T ss_pred             ccccccccchHHHHhhhhhHHHhhhhcccch---------hHHHHHhCCchhhhhHHHHHHHHHHHH-------------
Confidence            3477876  7788888888888887777764         999999999999999998766631100             


Q ss_pred             cCCCCccccCChhhhhhhhhhcCCccceec--CccCcccCChhhHHHHHHhhccC
Q 000554          958 EDSPKKLELGYSASVENHSENLGSIRKFIC--RFCGLKFDLLPDLGRHHQAAHMG 1010 (1428)
Q Consensus       958 ~C~~k~~sf~sks~L~~H~rtHtGeKpykC--~~CGKsFs~~s~L~rHHqrvHtg 1010 (1428)
                                        ..+-.|.-.|+|  ..|+..|.+...-+.|..+.|.=
T Consensus       136 ------------------a~veRG~dMy~ClvEgCt~KFkT~r~RkdH~I~~Hk~  172 (253)
T KOG4173|consen  136 ------------------ALVERGQDMYQCLVEGCTEKFKTSRDRKDHMIRMHKY  172 (253)
T ss_pred             ------------------HHHHcCccHHHHHHHhhhhhhhhhhhhhhHHHHhccC
Confidence                              112334556888  56999999999999988888876


No 78 
>KOG2482 consensus Predicted C2H2-type Zn-finger protein [Transcription]
Probab=72.78  E-value=3.7  Score=48.34  Aligned_cols=76  Identities=20%  Similarity=0.286  Sum_probs=40.0

Q ss_pred             hhhhhhhhhcccccccccccccccccCCCCC-CChhhhhhhhhhcccccccc----------hhhh--hccccccCcCCC
Q 000554          895 KVLESHVQERHHVQFVEQCMLQQCIPCGSHF-GNTEELWLHVQSVHAIDFKM----------SEVA--QQHNQSVGEDSP  961 (1428)
Q Consensus       895 s~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF-~sks~L~~H~rsvHsgEf~~----------~s~~--~~kp~~C~~C~~  961 (1428)
                      ..|..|++...+..     ...+|-.|...+ .+.+....|+-.+|.-....          ....  +-..+.|-.|  
T Consensus       129 eaLeqqQ~Eredt~-----fslqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLyC--  201 (423)
T KOG2482|consen  129 EALEQQQKEREDTI-----FSLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLYC--  201 (423)
T ss_pred             HHHHHHHHHhcCCe-----eeeEEEEecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeeee--
Confidence            44555655554433     245677776544 34455666655455321000          0001  1124667666  


Q ss_pred             CccccCChhhhhhhhhh
Q 000554          962 KKLELGYSASVENHSEN  978 (1428)
Q Consensus       962 k~~sf~sks~L~~H~rt  978 (1428)
                       .+.|+.+..|+.|||.
T Consensus       202 -ekifrdkntLkeHMrk  217 (423)
T KOG2482|consen  202 -EKIFRDKNTLKEHMRK  217 (423)
T ss_pred             -ccccCCcHHHHHHHHh
Confidence             7777777788888853


No 79 
>KOG2482 consensus Predicted C2H2-type Zn-finger protein [Transcription]
Probab=70.18  E-value=2.7  Score=49.41  Aligned_cols=78  Identities=24%  Similarity=0.334  Sum_probs=47.6

Q ss_pred             cccccCCCCCCChhhhhhhhhhcccccccchhhhhccccccCcCCCCccccCChhhhhhhhhhcCCccceecCccCcccC
Q 000554          916 QQCIPCGSHFGNTEELWLHVQSVHAIDFKMSEVAQQHNQSVGEDSPKKLELGYSASVENHSENLGSIRKFICRFCGLKFD  995 (1428)
Q Consensus       916 fkC~~CgKsF~sks~L~~H~rsvHsgEf~~~s~~~~kp~~C~~C~~k~~sf~sks~L~~H~rtHtGeKpykC~~CGKsFs  995 (1428)
                      ..|-.|.....+...|..||+.+|.-++..  .      +-    ..+..|-..-.+....|.  ..+.-.|-.|.-.|.
T Consensus       280 v~CLfC~~~~en~~~l~eHmk~vHe~Dl~K--i------~s----d~~Ln~YqrvrviNyiRk--q~~~~~c~~cd~~F~  345 (423)
T KOG2482|consen  280 VVCLFCTNFYENPVFLFEHMKIVHEFDLLK--I------QS----DYSLNFYQRVRVINYIRK--QKKKSRCAECDLSFW  345 (423)
T ss_pred             eEEEeeccchhhHHHHHHHHHHHHHhhHHh--h------cc----ccccchhhhhhHHHHHHH--Hhhcccccccccccc
Confidence            589999999999999999999999644100  0      00    111222221222222221  113356788889999


Q ss_pred             ChhhHHHHHHhhc
Q 000554          996 LLPDLGRHHQAAH 1008 (1428)
Q Consensus       996 ~~s~L~rHHqrvH 1008 (1428)
                      ....|.. |+.-|
T Consensus       346 ~e~~l~~-hm~e~  357 (423)
T KOG2482|consen  346 KEPGLLI-HMVED  357 (423)
T ss_pred             Ccchhhh-hcccc
Confidence            9999999 44433


No 80 
>cd05837 MSH6_like The PWWP domain is present in MSH6, a mismatch repair protein homologous to bacterial MutS.   The PWWP domain of histone-lysine N-methyltransferase, also known as Nuclear SET domain-containing protein 3, is also included. Mutations in MSH6 have been linked to increased cancer susceptibility, particularly in hereditary nonpolyposis colorectal cancer in humans.  The role of the PWWP domain in MSH6 is not clear; MSH6 orthologs found in S. cerevisiae, Caenorhabditis elegans and Arabidopsis thaliana lack the PWWP domain.   Histone methyltransferases (HMTases) induce the posttranslational methylation of lysine residues in histones and play a role in apoptosis.  In the HMTase Whistle, the PWWP domain is necessary for HMTase activity. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain pro
Probab=67.51  E-value=5.7  Score=40.23  Aligned_cols=63  Identities=17%  Similarity=0.374  Sum_probs=45.8

Q ss_pred             EEEEEeccc-cccceeeeeccCCCccccccccCCCccEEEEEeccCCcchhhhhhccccccCCC
Q 000554          157 ALWVKWRGK-WQAGIRCARADWPLPTLKAKPTHDRKKYFVIFFPHTRNYSWADMLLVRSINEFP  219 (1428)
Q Consensus       157 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  219 (1428)
                      -+|.|=+|- |--|+-+...+=|..+.+..+....+.|.|.||..+.+|.||.---+.++.+.-
T Consensus         8 lVWaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~~V~FFG~~~~~aWv~~~~l~pf~~~~   71 (110)
T cd05837           8 LVWAKVSGYPWWPCMVCSDPLLGTYTKTKRNKRKPRQYHVQFFGDNPERAWISEKSLKPFKGSK   71 (110)
T ss_pred             EEEEeCCCCCCCCEEEecccccchhhhhhhccCCCCeEEEEEcCCCCCEEEecHHHccccCCch
Confidence            479999884 666666654444444444445555689999999999999999988888877654


No 81 
>KOG2893 consensus Zn finger protein [General function prediction only]
Probab=62.23  E-value=3.1  Score=46.67  Aligned_cols=47  Identities=26%  Similarity=0.497  Sum_probs=35.8

Q ss_pred             ccccccccCChhhhhhhhhhcccccccccccccccccCCCCCCChhhhhhhhhhccc
Q 000554          884 CAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCGSHFGNTEELWLHVQSVHA  940 (1428)
Q Consensus       884 C~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF~sks~L~~H~rsvHs  940 (1428)
                      |-+|++.|....-|..|++..          -|+|.+|.|..-+--.|..|-..+|.
T Consensus        13 cwycnrefddekiliqhqkak----------hfkchichkkl~sgpglsihcmqvhk   59 (341)
T KOG2893|consen   13 CWYCNREFDDEKILIQHQKAK----------HFKCHICHKKLFSGPGLSIHCMQVHK   59 (341)
T ss_pred             eeecccccchhhhhhhhhhhc----------cceeeeehhhhccCCCceeehhhhhh
Confidence            888888888888888887753          47788888877777777777655664


No 82 
>KOG2785 consensus C2H2-type Zn-finger protein [General function prediction only]
Probab=61.20  E-value=9.5  Score=45.97  Aligned_cols=55  Identities=13%  Similarity=-0.028  Sum_probs=41.2

Q ss_pred             cceecCccCcccCChhhHHHHHHhhccCCCCC------------------CCCCcccCCCC---cccCCchhhhccc
Q 000554          983 RKFICRFCGLKFDLLPDLGRHHQAAHMGPNLV------------------NSRPHKKGIRF---YAYKLKSGRLSRP 1038 (1428)
Q Consensus       983 KpykC~~CGKsFs~~s~L~rHHqrvHtge~~~------------------~eKpykC~~Cg---KsFs~ks~L~~H~ 1038 (1428)
                      -|-.|-+|++.|.+...-..| +..|.|.-..                  ...-|.|-.|+   +.|.+-...+.||
T Consensus       165 ~Pt~CLfC~~~~k~~e~~~~H-M~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM  240 (390)
T KOG2785|consen  165 IPTDCLFCDKKSKSLEENLKH-MFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHM  240 (390)
T ss_pred             CCcceeecCCCcccHHHHHHH-HhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHH
Confidence            357899999999999999994 5444441100                  03468888898   8999999999999


No 83 
>smart00391 MBD Methyl-CpG binding domain. Methyl-CpG binding domain, also known as the TAM (TTF-IIP5, ARBP, MeCP1) domain
Probab=56.38  E-value=4.8  Score=38.32  Aligned_cols=36  Identities=19%  Similarity=0.106  Sum_probs=28.5

Q ss_pred             CCC-CCcccC------------CcccccccCCCCCCc-cccccceeeecc
Q 000554         1184 HLE-PLPSVS------------AGIRSSDSSDFVNNQ-WEVDECHCIIDS 1219 (1428)
Q Consensus      1184 Pl~-p~~~~~------------~~~k~v~~~~p~~~~-w~~~e~~~~l~~ 1219 (1428)
                      |+. |++.||            .++..|.|..|||.. +.+.|+..||..
T Consensus         3 ~~~~Plp~GW~R~~~~r~~g~~~~~~dV~Y~sP~GkklRs~~ev~~YL~~   52 (77)
T smart00391        3 PLRLPLPCGWRRETKQRKSGRSAGKFDVYYISPCGKKLRSKSELARYLHK   52 (77)
T ss_pred             cccCCCCCCcEEEEEEecCCCCCCcccEEEECCCCCeeeCHHHHHHHHHh
Confidence            444 677777            135678999999999 999999998863


No 84 
>PF13913 zf-C2HC_2:  zinc-finger of a C2HC-type
Probab=55.24  E-value=8.3  Score=28.93  Aligned_cols=18  Identities=39%  Similarity=0.755  Sum_probs=12.9

Q ss_pred             eecCccCcccCChhhHHHH
Q 000554          985 FICRFCGLKFDLLPDLGRH 1003 (1428)
Q Consensus       985 ykC~~CGKsFs~~s~L~rH 1003 (1428)
                      ..|+.||+.| ....|.+|
T Consensus         3 ~~C~~CgR~F-~~~~l~~H   20 (25)
T PF13913_consen    3 VPCPICGRKF-NPDRLEKH   20 (25)
T ss_pred             CcCCCCCCEE-CHHHHHHH
Confidence            4688888888 56677773


No 85 
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=54.89  E-value=4.3  Score=31.99  Aligned_cols=21  Identities=0%  Similarity=-0.236  Sum_probs=13.6

Q ss_pred             CcccCCCCcccCCchhhhccc
Q 000554         1018 PHKKGIRFYAYKLKSGRLSRP 1038 (1428)
Q Consensus      1018 pykC~~CgKsFs~ks~L~~H~ 1038 (1428)
                      +|.|.+|++.|.....+..|+
T Consensus         3 ~~~C~~C~~~~~~~~~~~~H~   23 (35)
T smart00451        3 GFYCKLCNVTFTDEISVEAHL   23 (35)
T ss_pred             CeEccccCCccCCHHHHHHHH
Confidence            456666666666666666666


No 86 
>PF13913 zf-C2HC_2:  zinc-finger of a C2HC-type
Probab=53.76  E-value=7  Score=29.34  Aligned_cols=19  Identities=42%  Similarity=0.789  Sum_probs=10.0

Q ss_pred             cccccccccCChhhhhhhhh
Q 000554          883 ACAICLDSFTNKKVLESHVQ  902 (1428)
Q Consensus       883 kC~~CgKsF~~ks~L~~H~r  902 (1428)
                      .|+.||+.| ....|.+|++
T Consensus         4 ~C~~CgR~F-~~~~l~~H~~   22 (25)
T PF13913_consen    4 PCPICGRKF-NPDRLEKHEK   22 (25)
T ss_pred             cCCCCCCEE-CHHHHHHHHH
Confidence            355555555 4455555543


No 87 
>KOG4173 consensus Alpha-SNAP protein [Intracellular trafficking, secretion, and vesicular transport]
Probab=51.59  E-value=5.5  Score=44.03  Aligned_cols=93  Identities=15%  Similarity=0.015  Sum_probs=66.9

Q ss_pred             cccccCcCCCCccccCChhhhhhhhhhcCCccceecCccCcccCChhhHHHHHHhhccC----CCCCCCCCcccCC--CC
Q 000554          952 HNQSVGEDSPKKLELGYSASVENHSENLGSIRKFICRFCGLKFDLLPDLGRHHQAAHMG----PNLVNSRPHKKGI--RF 1025 (1428)
Q Consensus       952 kp~~C~~C~~k~~sf~sks~L~~H~rtHtGeKpykC~~CGKsFs~~s~L~rHHqrvHtg----e~~~~eKpykC~~--Cg 1025 (1428)
                      ..+.|.+- +|...+.+...+..|..+-+|   -.|.+|.+.|.+..-|..|....|..    ....+.-.|+|-+  |+
T Consensus        78 ~~~~cqva-gc~~~~d~lD~~E~hY~~~h~---~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt  153 (253)
T KOG4173|consen   78 PAFACQVA-GCCQVFDALDDYEHHYHTLHG---NSCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCT  153 (253)
T ss_pred             cccccccc-chHHHHhhhhhHHHhhhhccc---chhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhh
Confidence            45778776 666777766666777644333   38999999999999999987666732    0001145799955  99


Q ss_pred             cccCCchhhhccc-ccccCCCccc
Q 000554         1026 YAYKLKSGRLSRP-RFKKGLGAVS 1048 (1428)
Q Consensus      1026 KsFs~ks~L~~H~-r~H~gekpy~ 1048 (1428)
                      ..|.+....+.|+ ++|.--..|.
T Consensus       154 ~KFkT~r~RkdH~I~~Hk~Pa~fr  177 (253)
T KOG4173|consen  154 EKFKTSRDRKDHMIRMHKYPADFR  177 (253)
T ss_pred             hhhhhhhhhhhHHHHhccCCccee
Confidence            9999999999999 7887544444


No 88 
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=50.37  E-value=8.6  Score=30.26  Aligned_cols=22  Identities=18%  Similarity=0.306  Sum_probs=19.2

Q ss_pred             ccccccCCCCCCChhhhhhhhh
Q 000554          915 LQQCIPCGSHFGNTEELWLHVQ  936 (1428)
Q Consensus       915 pfkC~~CgKsF~sks~L~~H~r  936 (1428)
                      +|.|..|++.|.+...+..|++
T Consensus         3 ~~~C~~C~~~~~~~~~~~~H~~   24 (35)
T smart00451        3 GFYCKLCNVTFTDEISVEAHLK   24 (35)
T ss_pred             CeEccccCCccCCHHHHHHHHC
Confidence            5789999999999889988876


No 89 
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=47.40  E-value=8.3  Score=34.36  Aligned_cols=32  Identities=28%  Similarity=0.385  Sum_probs=22.3

Q ss_pred             hcCCccceecCccCcccCChhhHHHHHHhhcc
Q 000554          978 NLGSIRKFICRFCGLKFDLLPDLGRHHQAAHM 1009 (1428)
Q Consensus       978 tHtGeKpykC~~CGKsFs~~s~L~rHHqrvHt 1009 (1428)
                      .-.||--+.|+.||+.|....+..+|.-+.|.
T Consensus        11 ~RDGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~   42 (65)
T COG4049          11 DRDGEEFLRCPRCGMVFRRRKDYIRHVNKAHG   42 (65)
T ss_pred             ccCCceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence            34566677788888888877777776555553


No 90 
>PF09986 DUF2225:  Uncharacterized protein conserved in bacteria (DUF2225);  InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=47.25  E-value=6.1  Score=44.60  Aligned_cols=20  Identities=25%  Similarity=0.534  Sum_probs=13.5

Q ss_pred             cceecCccCcccCChhhHHH
Q 000554          983 RKFICRFCGLKFDLLPDLGR 1002 (1428)
Q Consensus       983 KpykC~~CGKsFs~~s~L~r 1002 (1428)
                      |.++|+.|++.|....-+..
T Consensus         4 k~~~CPvC~~~F~~~~vrs~   23 (214)
T PF09986_consen    4 KKITCPVCGKEFKTKKVRSG   23 (214)
T ss_pred             CceECCCCCCeeeeeEEEcC
Confidence            56778888888876544333


No 91 
>smart00293 PWWP domain with conserved PWWP motif. conservation of Pro-Trp-Trp-Pro residues
Probab=42.52  E-value=27  Score=31.74  Aligned_cols=56  Identities=20%  Similarity=0.433  Sum_probs=38.5

Q ss_pred             EEEEEecc-ccccceeeeeccCCCccccccccCCCccEEEEEeccCCcchhhhhhccccc
Q 000554          157 ALWVKWRG-KWQAGIRCARADWPLPTLKAKPTHDRKKYFVIFFPHTRNYSWADMLLVRSI  215 (1428)
Q Consensus       157 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  215 (1428)
                      -+|.|=+| -|--|+-+...+-|...++  +.-..+.|.|.||.. .+|.|++--.+.++
T Consensus         6 lVwaK~~G~p~WPa~V~~~~~~~~~~~~--~~~~~~~~~V~Ffg~-~~~awv~~~~l~p~   62 (63)
T smart00293        6 LVWAKMKGFPWWPALVVSPKETPDNIRK--RKRFENLYPVLFFGD-KDTAWISSSKLFPL   62 (63)
T ss_pred             EEEEECCCCCCCCeEEcCcccCChhHhh--ccCCCCEEEEEEeCC-CCEEEECccceeeC
Confidence            37999999 7777777766665554332  334456788888875 55699987766654


No 92 
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc, and is believed to be involved in electron transfer.  Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain.  Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=41.44  E-value=18  Score=28.81  Aligned_cols=11  Identities=36%  Similarity=1.352  Sum_probs=6.7

Q ss_pred             eecCccCcccC
Q 000554          985 FICRFCGLKFD  995 (1428)
Q Consensus       985 ykC~~CGKsFs  995 (1428)
                      |+|..||..+.
T Consensus         2 ~~C~~CGy~y~   12 (33)
T cd00350           2 YVCPVCGYIYD   12 (33)
T ss_pred             EECCCCCCEEC
Confidence            56666666544


No 93 
>PF00855 PWWP:  PWWP domain;  InterPro: IPR000313 Upon characterisation of WHSC1, a gene mapping to the Wolf-Hirschhorn syndrome critical region and at its C terminus similar to the Drosophila melanogaster ASH1/trithorax group proteins, a novel protein domain designated PWWP domain was identified []. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. It is present in proteins of nuclear origin and plays a role in cell growth and differentiation. Due to its position, the composition of amino acids close to the PWWP motif and the pattern of other domains present it has been suggested that the domain is involved in protein-protein interactions [].; PDB: 3LYI_B 2L89_A 2NLU_A 1RI0_A 1KHC_A 3QKJ_C 2DAQ_A 1N27_A 3PFS_B 3QJ6_A ....
Probab=41.14  E-value=26  Score=33.13  Aligned_cols=56  Identities=23%  Similarity=0.577  Sum_probs=38.4

Q ss_pred             EEEEEecc-ccccceeeeeccCCCccccccccCCCccEEEEEeccCCcchhhhhhccccccCCC
Q 000554          157 ALWVKWRG-KWQAGIRCARADWPLPTLKAKPTHDRKKYFVIFFPHTRNYSWADMLLVRSINEFP  219 (1428)
Q Consensus       157 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  219 (1428)
                      -+|+|=+| -|=-|+=|...+.+-     + ......|.|.||... +|.|++.-.|.+.+++-
T Consensus         6 lVWaK~~g~pwWPa~V~~~~~~~~-----~-~~~~~~~~V~Ffg~~-~~~wv~~~~i~~f~~~~   62 (86)
T PF00855_consen    6 LVWAKLKGYPWWPARVCDPDEKSK-----K-KRKDGHVLVRFFGDN-DYAWVKPSNIKPFSEFK   62 (86)
T ss_dssp             EEEEEETTSEEEEEEEEECCHCTS-----C-SSSSTEEEEEETTTT-EEEEEEGGGEEECCHHH
T ss_pred             EEEEEeCCCCCCceEEeecccccc-----c-CCCCCEEEEEecCCC-CEEEECHHHhhChhhhH
Confidence            48999987 355666666664443     1 334466777777766 99999998888877544


No 94 
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=39.33  E-value=13  Score=36.16  Aligned_cols=34  Identities=21%  Similarity=0.263  Sum_probs=23.2

Q ss_pred             cceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCch
Q 000554          983 RKFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKS 1032 (1428)
Q Consensus       983 KpykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks 1032 (1428)
                      .+|.|+.|++. .        +.|+-+|       -+.|..|++.|.-..
T Consensus        34 ~~~~Cp~C~~~-~--------VkR~a~G-------IW~C~kCg~~fAGga   67 (89)
T COG1997          34 AKHVCPFCGRT-T--------VKRIATG-------IWKCRKCGAKFAGGA   67 (89)
T ss_pred             cCCcCCCCCCc-c--------eeeeccC-------eEEcCCCCCeecccc
Confidence            46788888876 1        4555555       788888888876443


No 95 
>PF06524 NOA36:  NOA36 protein;  InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=38.94  E-value=32  Score=39.62  Aligned_cols=27  Identities=15%  Similarity=-0.012  Sum_probs=21.1

Q ss_pred             CCCcccCCCCcccCCchhhhccccccc
Q 000554         1016 SRPHKKGIRFYAYKLKSGRLSRPRFKK 1042 (1428)
Q Consensus      1016 eKpykC~~CgKsFs~ks~L~~H~r~H~ 1042 (1428)
                      .+++.|+.|+........|..-.|.|.
T Consensus       207 ~k~~PCPKCg~et~eTkdLSmStR~hk  233 (314)
T PF06524_consen  207 GKPIPCPKCGYETQETKDLSMSTRSHK  233 (314)
T ss_pred             CCCCCCCCCCCcccccccceeeeecch
Confidence            578889999888888777877666665


No 96 
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=38.52  E-value=16  Score=29.64  Aligned_cols=34  Identities=12%  Similarity=0.123  Sum_probs=19.9

Q ss_pred             eecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccC
Q 000554          985 FICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYK 1029 (1428)
Q Consensus       985 ykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs 1029 (1428)
                      ++|+.|+..|.-......      ..     .....|+.|+..|.
T Consensus         3 ~~CP~C~~~~~v~~~~~~------~~-----~~~v~C~~C~~~~~   36 (38)
T TIGR02098         3 IQCPNCKTSFRVVDSQLG------AN-----GGKVRCGKCGHVWY   36 (38)
T ss_pred             EECCCCCCEEEeCHHHcC------CC-----CCEEECCCCCCEEE
Confidence            567777777766544322      11     22467777777663


No 97 
>cd05838 WHSC1_related The PWWP domain was first identified in the WHSC1 (Wolf-Hirschhorn syndrome candidate 1) protein, a protein implicated in Wolf-Hirschhorn syndrome (WHS).  When translocated, WHSC1 plays a role in lymphoid multiple myeloma (MM) disease, also known as plasmacytoma. WHSC1 proteins typically contain two copies of the PWWP domain.  The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=38.00  E-value=25  Score=34.75  Aligned_cols=54  Identities=26%  Similarity=0.543  Sum_probs=34.1

Q ss_pred             EEEEecc-ccccceeeeeccCCCccccccccCCCccEEEEEeccCCcchhhhhhcccc
Q 000554          158 LWVKWRG-KWQAGIRCARADWPLPTLKAKPTHDRKKYFVIFFPHTRNYSWADMLLVRS  214 (1428)
Q Consensus       158 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  214 (1428)
                      +|+|-+| -|=-|+-|-..+=|-..+..+  +....|.|.|| .+++|.|++--.|-+
T Consensus         7 VWaK~~g~pwWPa~V~~~~~~p~~~~~~~--~~~~~~~V~Ff-gs~~y~Wv~~~~l~p   61 (95)
T cd05838           7 VWAKLGNFRWWPAIICDPREVPPNIQVLR--HCIGEFCVMFF-GTHDYYWVHRGRVFP   61 (95)
T ss_pred             EEEECCCCCCCCeEEcChhhcChhHhhcc--CCCCeEEEEEe-CCCCEEEeccccccc
Confidence            7999998 455666665543333222211  23356888888 589999999744443


No 98 
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=37.85  E-value=39  Score=34.59  Aligned_cols=48  Identities=19%  Similarity=0.390  Sum_probs=29.3

Q ss_pred             cccccccccCChhhhhhhhhhcccccccccccccccccCCCCCCChhhhhhhhhhcc
Q 000554          883 ACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCGSHFGNTEELWLHVQSVH  939 (1428)
Q Consensus       883 kC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF~sks~L~~H~rsvH  939 (1428)
                      .|--|.+.|.......      .++  ......|+|+.|...|-..-+...|.. .|
T Consensus        57 ~C~~C~~~f~~~~~~~------~~~--~~~~~~y~C~~C~~~FC~dCD~fiHe~-Lh  104 (112)
T TIGR00622        57 FCFGCQGPFPKPPVSP------FDE--LKDSHRYVCAVCKNVFCVDCDVFVHES-LH  104 (112)
T ss_pred             cccCcCCCCCCccccc------ccc--cccccceeCCCCCCccccccchhhhhh-cc
Confidence            3777888887653211      110  001136888888888888888887863 45


No 99 
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=37.53  E-value=24  Score=38.09  Aligned_cols=40  Identities=13%  Similarity=-0.029  Sum_probs=28.1

Q ss_pred             hhhhhcCCccceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCccc
Q 000554          974 NHSENLGSIRKFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAY 1028 (1428)
Q Consensus       974 ~H~rtHtGeKpykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsF 1028 (1428)
                      .-+.......-|.|+.|+..|+....+..               -|.|+.||...
T Consensus        99 ~~l~~e~~~~~Y~Cp~c~~r~tf~eA~~~---------------~F~Cp~Cg~~L  138 (158)
T TIGR00373        99 EKLEFETNNMFFICPNMCVRFTFNEAMEL---------------NFTCPRCGAML  138 (158)
T ss_pred             HHHhhccCCCeEECCCCCcEeeHHHHHHc---------------CCcCCCCCCEe
Confidence            33334455567889999988887776643               58899998653


No 100
>PF14353 CpXC:  CpXC protein
Probab=37.15  E-value=22  Score=36.60  Aligned_cols=50  Identities=22%  Similarity=0.264  Sum_probs=32.5

Q ss_pred             ecCccCcccCC----------hhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhccccccc
Q 000554          986 ICRFCGLKFDL----------LPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRPRFKK 1042 (1428)
Q Consensus       986 kC~~CGKsFs~----------~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~H~r~H~ 1042 (1428)
                      .|+.||..|..          ...|+.   ++-.|.    --.|.|+.||+.|.-...+..|-..|.
T Consensus         3 tCP~C~~~~~~~v~~~I~~~~~p~l~e---~il~g~----l~~~~CP~Cg~~~~~~~p~lY~D~~~~   62 (128)
T PF14353_consen    3 TCPHCGHEFEFEVWTSINADEDPELKE---KILDGS----LFSFTCPSCGHKFRLEYPLLYHDPEKK   62 (128)
T ss_pred             CCCCCCCeeEEEEEeEEcCcCCHHHHH---HHHcCC----cCEEECCCCCCceecCCCEEEEcCCCC
Confidence            57888877753          223332   233442    346889999999988888888765543


No 101
>KOG3813 consensus Uncharacterized conserved protein (tumor-suppressor AXUD1 in humans) [General function prediction only]
Probab=37.04  E-value=16  Score=45.37  Aligned_cols=19  Identities=42%  Similarity=1.027  Sum_probs=16.5

Q ss_pred             CCCcccCCCCcCCCCCCccc
Q 000554         1299 QLGCACANSTCFPETCDHVY 1318 (1428)
Q Consensus      1299 ~~gC~C~~~~C~~~~C~C~~ 1318 (1428)
                      .+||+|.. -|+|++|+|.+
T Consensus       307 eCGCsCr~-~CdPETCaCSq  325 (640)
T KOG3813|consen  307 ECGCSCRG-VCDPETCACSQ  325 (640)
T ss_pred             hhCCcccc-eeChhhcchhc
Confidence            57999994 89999999964


No 102
>PF09538 FYDLN_acid:  Protein of unknown function (FYDLN_acid);  InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=37.02  E-value=19  Score=36.58  Aligned_cols=30  Identities=23%  Similarity=0.222  Sum_probs=22.0

Q ss_pred             eecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCc
Q 000554          985 FICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLK 1031 (1428)
Q Consensus       985 ykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~k 1031 (1428)
                      ..|+.||++|--.   .              ..|..|+.||..|.-.
T Consensus        10 R~Cp~CG~kFYDL---n--------------k~PivCP~CG~~~~~~   39 (108)
T PF09538_consen   10 RTCPSCGAKFYDL---N--------------KDPIVCPKCGTEFPPE   39 (108)
T ss_pred             ccCCCCcchhccC---C--------------CCCccCCCCCCccCcc
Confidence            5788888888643   2              2477888888888766


No 103
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=35.90  E-value=29  Score=36.93  Aligned_cols=39  Identities=13%  Similarity=0.059  Sum_probs=25.0

Q ss_pred             CCccceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCccc
Q 000554          980 GSIRKFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAY 1028 (1428)
Q Consensus       980 tGeKpykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsF 1028 (1428)
                      ....-|.|+.|++.|.....+..    .+.      ...|.|+.||...
T Consensus        95 ~~~~~Y~Cp~C~~~y~~~ea~~~----~d~------~~~f~Cp~Cg~~l  133 (147)
T smart00531       95 TNNAYYKCPNCQSKYTFLEANQL----LDM------DGTFTCPRCGEEL  133 (147)
T ss_pred             cCCcEEECcCCCCEeeHHHHHHh----cCC------CCcEECCCCCCEE
Confidence            34456899999988886544332    111      2348999998764


No 104
>cd01397 HAT_MBD Methyl-CpG binding domains (MBD) present in putative chromatin remodelling factor such as BAZ2A; BAZ2A contains a MBD, DDT, PHD-type zinc finger and Bromo domain suggesting that BAZ2A might be associated with histone acetyltransferase (HAT) activity. The Drosophila melanogaster toutatis protein, a putative subunit of the chromatin-remodeling complex, and other such proteins in this group share a similar domain architecture with BAZ2A, as does the Caenorhabditis elegans flectin homolog.
Probab=35.18  E-value=13  Score=35.16  Aligned_cols=25  Identities=4%  Similarity=-0.186  Sum_probs=21.4

Q ss_pred             cccccccCCCCCCc-cccccceeeec
Q 000554         1194 GIRSSDSSDFVNNQ-WEVDECHCIID 1218 (1428)
Q Consensus      1194 ~~k~v~~~~p~~~~-w~~~e~~~~l~ 1218 (1428)
                      ++..|.|.+|||.. +++.|++.||.
T Consensus        23 ~~~dV~Y~aPcGKklRs~~ev~~yL~   48 (73)
T cd01397          23 IQGEVAYYAPCGKKLRQYPEVIKYLS   48 (73)
T ss_pred             ccceEEEECCCCcccccHHHHHHHHH
Confidence            34468899999999 99999998886


No 105
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=35.17  E-value=14  Score=30.27  Aligned_cols=12  Identities=33%  Similarity=1.052  Sum_probs=8.4

Q ss_pred             eecCccCcccCC
Q 000554          985 FICRFCGLKFDL  996 (1428)
Q Consensus       985 ykC~~CGKsFs~  996 (1428)
                      |+|..||+.|..
T Consensus         6 y~C~~Cg~~fe~   17 (41)
T smart00834        6 YRCEDCGHTFEV   17 (41)
T ss_pred             EEcCCCCCEEEE
Confidence            677777777754


No 106
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=33.63  E-value=18  Score=39.02  Aligned_cols=19  Identities=5%  Similarity=-0.303  Sum_probs=13.8

Q ss_pred             CCcccCCCCcccCCchhhh
Q 000554         1017 RPHKKGIRFYAYKLKSGRL 1035 (1428)
Q Consensus      1017 KpykC~~CgKsFs~ks~L~ 1035 (1428)
                      +.++|+.||++|..-..+.
T Consensus        27 ~~~~c~~c~~~f~~~e~~~   45 (154)
T PRK00464         27 RRRECLACGKRFTTFERVE   45 (154)
T ss_pred             eeeeccccCCcceEeEecc
Confidence            3488888888887665544


No 107
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=33.28  E-value=27  Score=46.25  Aligned_cols=43  Identities=21%  Similarity=0.178  Sum_probs=28.4

Q ss_pred             CCCChhhhhhhhhhhhHHHHHHHHhhh-cCCCCcccccccccccc
Q 000554         1111 RPNSHEILSMARLACCKVSLKASLEEK-YGALPENICLKAAKLCS 1154 (1428)
Q Consensus      1111 ~P~n~diLsiars~CcK~~l~~~L~~k-~g~lpe~l~~~aakl~~ 1154 (1428)
                      .|.+..|..+-.. =.-.|..+.|..+ -..+||--++-+...-+
T Consensus       602 ~P~hp~i~~~~~~-dy~~F~~~El~~Rk~~~~PPf~~l~~v~~~~  645 (730)
T COG1198         602 NPDHPAIQALKRG-DYEAFYEQELAERKELGLPPFSRLAAVIASA  645 (730)
T ss_pred             CCCcHHHHHHHhc-CHHHHHHHHHHHHHhcCCCChhhheeeEecC
Confidence            3666666555554 3446777888777 68889988776655543


No 108
>KOG2461 consensus Transcription factor BLIMP-1/PRDI-BF1, contains C2H2-type Zn-finger and SET domains [Transcription]
Probab=32.17  E-value=90  Score=38.68  Aligned_cols=78  Identities=0%  Similarity=-0.293  Sum_probs=53.8

Q ss_pred             hhhhhhhhcCCccceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhhcccccccCCCccccC
Q 000554          971 SVENHSENLGSIRKFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRLSRPRFKKGLGAVSYR 1050 (1428)
Q Consensus       971 ~L~~H~rtHtGeKpykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~~H~r~H~gekpy~C~ 1050 (1428)
                      .+..|...|++..++.+..+.+.+.....+.. +...|.+     +.++.+..+...+.....+..+..+|...+.+.+.
T Consensus       318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  391 (396)
T KOG2461|consen  318 VLDQSEVPATVSVWTGETIPVRTPAGQLIYTQ-SHSMEVA-----EPTDMAPNQIWKIYHTGVLGFLIITTDESECNNMS  391 (396)
T ss_pred             ccccccccccccccCcCcccccccccccchhh-hhhcccC-----CCCcccccccccceeccccceeeeecccccccccc
Confidence            55667777888888888888888888778888 6666776     55555555555555556666666677766667666


Q ss_pred             CCCC
Q 000554         1051 IRNR 1054 (1428)
Q Consensus      1051 ~C~k 1054 (1428)
                      .|.+
T Consensus       392 ~~~~  395 (396)
T KOG2461|consen  392 FVCK  395 (396)
T ss_pred             ccCC
Confidence            6554


No 109
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=32.12  E-value=30  Score=38.14  Aligned_cols=35  Identities=11%  Similarity=0.125  Sum_probs=24.7

Q ss_pred             CCccceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccC
Q 000554          980 GSIRKFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYK 1029 (1428)
Q Consensus       980 tGeKpykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs 1029 (1428)
                      ....-|.|+.|++.|+....+..               -|.|+.||....
T Consensus       113 ~~~~~Y~Cp~C~~rytf~eA~~~---------------~F~Cp~Cg~~L~  147 (178)
T PRK06266        113 ENNMFFFCPNCHIRFTFDEAMEY---------------GFRCPQCGEMLE  147 (178)
T ss_pred             cCCCEEECCCCCcEEeHHHHhhc---------------CCcCCCCCCCCe
Confidence            34456889888888887765532               588888886543


No 110
>PF09723 Zn-ribbon_8:  Zinc ribbon domain;  InterPro: IPR013429  This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=31.47  E-value=16  Score=30.80  Aligned_cols=12  Identities=33%  Similarity=1.099  Sum_probs=8.1

Q ss_pred             eecCccCcccCC
Q 000554          985 FICRFCGLKFDL  996 (1428)
Q Consensus       985 ykC~~CGKsFs~  996 (1428)
                      |+|..||..|..
T Consensus         6 y~C~~Cg~~fe~   17 (42)
T PF09723_consen    6 YRCEECGHEFEV   17 (42)
T ss_pred             EEeCCCCCEEEE
Confidence            667777776654


No 111
>PHA00626 hypothetical protein
Probab=31.38  E-value=19  Score=32.33  Aligned_cols=13  Identities=8%  Similarity=-0.455  Sum_probs=7.9

Q ss_pred             CcccCCCCcccCC
Q 000554         1018 PHKKGIRFYAYKL 1030 (1428)
Q Consensus      1018 pykC~~CgKsFs~ 1030 (1428)
                      .|+|+.||+.|+.
T Consensus        23 rYkCkdCGY~ft~   35 (59)
T PHA00626         23 DYVCCDCGYNDSK   35 (59)
T ss_pred             ceEcCCCCCeech
Confidence            5666666666653


No 112
>cd00122 MBD MeCP2, MBD1, MBD2, MBD3, MBD4, CLLD8-like, and BAZ2A-like proteins constitute a family of proteins that share the methyl-CpG-binding domain (MBD). The MBD consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin.  MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1. The MBDs present in putative chromatin remodelling subunit, BAZ2A, and putative histone methyltransferase, CLLD8, represent two phylogenetically distinct groups within the MBD protein family.
Probab=31.17  E-value=15  Score=33.32  Aligned_cols=27  Identities=7%  Similarity=-0.031  Sum_probs=22.7

Q ss_pred             cccccccCCCCCCc-cccccceeeeccC
Q 000554         1194 GIRSSDSSDFVNNQ-WEVDECHCIIDSR 1220 (1428)
Q Consensus      1194 ~~k~v~~~~p~~~~-w~~~e~~~~l~~~ 1220 (1428)
                      ++..|.|..|+|.. +.+.|+..||..+
T Consensus        23 ~k~dv~Y~sP~Gk~~Rs~~ev~~yL~~~   50 (62)
T cd00122          23 GKGDVYYYSPCGKKLRSKPEVARYLEKT   50 (62)
T ss_pred             CcceEEEECCCCceecCHHHHHHHHHhC
Confidence            45578999999988 9999999988754


No 113
>PF13891 zf-C3Hc3H:  Potential DNA-binding domain
Probab=31.05  E-value=15  Score=33.77  Aligned_cols=23  Identities=39%  Similarity=0.686  Sum_probs=20.4

Q ss_pred             eeccCcccccccCCCcccccCCC
Q 000554          587 TVLGTRCKHRALYGSSFCKKHRP  609 (1428)
Q Consensus       587 ~~~g~~ckh~~~~~~~~c~~~~~  609 (1428)
                      +..|+.|+.+++||+.||-+|-.
T Consensus         3 ~~~~~~C~~~~lp~~~yC~~HIl   25 (65)
T PF13891_consen    3 TYSGRGCSQPALPGSKYCIRHIL   25 (65)
T ss_pred             CCCCCCcCcccCchhhHHHHHhc
Confidence            45789999999999999999874


No 114
>PF12013 DUF3505:  Protein of unknown function (DUF3505);  InterPro: IPR022698  This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains. 
Probab=30.77  E-value=52  Score=33.06  Aligned_cols=27  Identities=15%  Similarity=-0.070  Sum_probs=22.8

Q ss_pred             CCccc----CCCCcccCCchhhhcccccccC
Q 000554         1017 RPHKK----GIRFYAYKLKSGRLSRPRFKKG 1043 (1428)
Q Consensus      1017 KpykC----~~CgKsFs~ks~L~~H~r~H~g 1043 (1428)
                      .-|.|    ..|++.+.+...+++|.+.++|
T Consensus        79 ~G~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg  109 (109)
T PF12013_consen   79 DGYRCQCDPPHCGYITRSKKTMRKHWRKEHG  109 (109)
T ss_pred             CCeeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence            45889    9999999999999999977654


No 115
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=30.55  E-value=19  Score=31.28  Aligned_cols=12  Identities=33%  Similarity=1.168  Sum_probs=7.7

Q ss_pred             eecCccCcccCC
Q 000554          985 FICRFCGLKFDL  996 (1428)
Q Consensus       985 ykC~~CGKsFs~  996 (1428)
                      |+|..||..|..
T Consensus         6 y~C~~Cg~~fe~   17 (52)
T TIGR02605         6 YRCTACGHRFEV   17 (52)
T ss_pred             EEeCCCCCEeEE
Confidence            666666666653


No 116
>cd05839 BR140_related The PWWP domain is found in the BR140 family, which includes peregrin and BR140-like proteins 1 and 2.   BR140 is the only family to contain the PWWP domain at the C terminus, with PHD and bromo domains in the N-terminal region.  In myeloid leukemias, BR140 is disrupted by chromosomal translocations, similar to translocations of WHSC1 in lymphoid multiple myeloma.  The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding proteins, that function as transcription factors regulating a variety of developmental processes.
Probab=30.07  E-value=78  Score=32.48  Aligned_cols=61  Identities=20%  Similarity=0.359  Sum_probs=40.7

Q ss_pred             EEEEEeccc-cccceeeeec----cCC-----Ccccc----ccccCCCccEEEEEeccCCcchhhhhhccccccC
Q 000554          157 ALWVKWRGK-WQAGIRCARA----DWP-----LPTLK----AKPTHDRKKYFVIFFPHTRNYSWADMLLVRSINE  217 (1428)
Q Consensus       157 ~~~~~~~~~-~~~~~~~~~~----~~~-----~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  217 (1428)
                      -||.|-+|- |.-|+-.-..    ..+     ++-|+    .+.-.+.+.|+|-||=.+++|.|++---+.+..+
T Consensus         6 lVwaK~~g~P~wPa~iidp~~~~~~~~~~~~p~~~l~~~~~~~~~~~~~~~lV~FFd~~~s~~Wv~~~~l~pl~~   80 (111)
T cd05839           6 LVWAKCRGYPSYPALIIDPKMPRDGVFHNGVPPDVLTLGEARAQNADERLYLVLFFDNKRTWQWLPGDKLEPLGV   80 (111)
T ss_pred             EeeeeecCCCCCCeEeeCCCCCCcccccCCCCchhhhHHHHHhccCCCcEEEEEEecCCCcceecCHHHCccccc
Confidence            379998883 6666554422    111     12222    2334688889999999999999999887776654


No 117
>PF09986 DUF2225:  Uncharacterized protein conserved in bacteria (DUF2225);  InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=29.98  E-value=28  Score=39.41  Aligned_cols=42  Identities=17%  Similarity=0.059  Sum_probs=30.7

Q ss_pred             CCCcccCCCCcccCCchhhhcccccc----------cCCCccc-----cCCCCCcCc
Q 000554         1016 SRPHKKGIRFYAYKLKSGRLSRPRFK----------KGLGAVS-----YRIRNRGAA 1057 (1428)
Q Consensus      1016 eKpykC~~CgKsFs~ks~L~~H~r~H----------~gekpy~-----C~~C~ksf~ 1057 (1428)
                      .+.+.||+|++.|..+.-+....++-          .+..|+-     |+.||.++.
T Consensus         3 ~k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~   59 (214)
T PF09986_consen    3 DKKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAF   59 (214)
T ss_pred             CCceECCCCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCccc
Confidence            57889999999999987777666431          2233332     999998875


No 118
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and  believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=29.61  E-value=35  Score=27.54  Aligned_cols=10  Identities=30%  Similarity=1.109  Sum_probs=6.1

Q ss_pred             eecCccCccc
Q 000554          985 FICRFCGLKF  994 (1428)
Q Consensus       985 ykC~~CGKsF  994 (1428)
                      |+|..||..+
T Consensus         3 ~~C~~CG~i~   12 (34)
T cd00729           3 WVCPVCGYIH   12 (34)
T ss_pred             EECCCCCCEe
Confidence            5666666543


No 119
>PF11722 zf-TRM13_CCCH:  CCCH zinc finger in TRM13 protein;  InterPro: IPR021721  This domain is found at the N terminus of TRM13 methyltransferase proteins. It is presumed to be a zinc binding domain. ; GO: 0008168 methyltransferase activity
Probab=29.38  E-value=31  Score=27.52  Aligned_cols=21  Identities=38%  Similarity=0.634  Sum_probs=18.4

Q ss_pred             ccCcccccccCCCcccccCCC
Q 000554          589 LGTRCKHRALYGSSFCKKHRP  609 (1428)
Q Consensus       589 ~g~~ckh~~~~~~~~c~~~~~  609 (1428)
                      -.|.|+-...+|+.||.-|.|
T Consensus        11 K~R~C~m~~~~g~~fC~~H~~   31 (31)
T PF11722_consen   11 KKRFCKMTRKPGSRFCGEHMP   31 (31)
T ss_pred             cccccCCeecCcCCccccCCC
Confidence            357899999999999999975


No 120
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=27.83  E-value=15  Score=32.80  Aligned_cols=31  Identities=23%  Similarity=0.417  Sum_probs=18.5

Q ss_pred             ccCCCCcccCCCCCccccccccccccccccc
Q 000554          841 RSEDEKTHKCKICSQVFLHDQELGVHWMDNH  871 (1428)
Q Consensus       841 ~H~gekpykC~~CgK~F~s~s~L~~H~~r~H  871 (1428)
                      ...|+..++|+.|+..|.....+.+|.-+.|
T Consensus        11 ~RDGE~~lrCPRC~~~FR~~K~Y~RHVNKaH   41 (65)
T COG4049          11 DRDGEEFLRCPRCGMVFRRRKDYIRHVNKAH   41 (65)
T ss_pred             ccCCceeeeCCchhHHHHHhHHHHHHhhHHh
Confidence            3445556666666666666666666644444


No 121
>KOG2186 consensus Cell growth-regulating nucleolar protein [Cell cycle control, cell division, chromosome partitioning]
Probab=27.78  E-value=21  Score=40.93  Aligned_cols=48  Identities=17%  Similarity=0.463  Sum_probs=22.8

Q ss_pred             ccCCCCCcccccccccccccccccchhhhcccCcccccccccccCChhhhhhhhhhc
Q 000554          848 HKCKICSQVFLHDQELGVHWMDNHKKEAQWLFRGYACAICLDSFTNKKVLESHVQER  904 (1428)
Q Consensus       848 ykC~~CgK~F~s~s~L~~H~~r~Ht~e~~~l~KpykC~~CgKsF~~ks~L~~H~r~H  904 (1428)
                      |.|..||.....+ .+.+| +....+      .-|.|-.|++.|.. ..+..|.+--
T Consensus         4 FtCnvCgEsvKKp-~vekH-~srCrn------~~fSCIDC~k~F~~-~sYknH~kCI   51 (276)
T KOG2186|consen    4 FTCNVCGESVKKP-QVEKH-MSRCRN------AYFSCIDCGKTFER-VSYKNHTKCI   51 (276)
T ss_pred             Eehhhhhhhcccc-chHHH-HHhccC------CeeEEeeccccccc-chhhhhhhhc
Confidence            4555555554433 24445 222222      23555556655555 4455554433


No 122
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=26.89  E-value=46  Score=30.34  Aligned_cols=32  Identities=22%  Similarity=0.104  Sum_probs=19.6

Q ss_pred             ceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCc
Q 000554          984 KFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFY 1026 (1428)
Q Consensus       984 pykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgK 1026 (1428)
                      .|.|+.||..-..+..--+    .+.       .+|.|+.||.
T Consensus        27 ~F~CPnCGe~~I~Rc~~CR----k~g-------~~Y~Cp~CGF   58 (61)
T COG2888          27 KFPCPNCGEVEIYRCAKCR----KLG-------NPYRCPKCGF   58 (61)
T ss_pred             EeeCCCCCceeeehhhhHH----HcC-------CceECCCcCc
Confidence            5888888866554433222    233       4888888873


No 123
>PF13717 zinc_ribbon_4:  zinc-ribbon domain
Probab=26.08  E-value=39  Score=27.66  Aligned_cols=33  Identities=12%  Similarity=0.146  Sum_probs=19.2

Q ss_pred             eecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCccc
Q 000554          985 FICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAY 1028 (1428)
Q Consensus       985 ykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsF 1028 (1428)
                      ..|+.|+..|.-......      ..     .+..+|+.|+..|
T Consensus         3 i~Cp~C~~~y~i~d~~ip------~~-----g~~v~C~~C~~~f   35 (36)
T PF13717_consen    3 ITCPNCQAKYEIDDEKIP------PK-----GRKVRCSKCGHVF   35 (36)
T ss_pred             EECCCCCCEEeCCHHHCC------CC-----CcEEECCCCCCEe
Confidence            457777777766554332      11     3456777777665


No 124
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=25.91  E-value=34  Score=30.06  Aligned_cols=29  Identities=14%  Similarity=0.135  Sum_probs=19.8

Q ss_pred             cceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcc
Q 000554          983 RKFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYA 1027 (1428)
Q Consensus       983 KpykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKs 1027 (1428)
                      ..|+|-.||+.|.   .+..             .....|+.||..
T Consensus         5 ~~Y~C~~Cg~~~~---~~~~-------------~~~irCp~Cg~r   33 (49)
T COG1996           5 MEYKCARCGREVE---LDQE-------------TRGIRCPYCGSR   33 (49)
T ss_pred             EEEEhhhcCCeee---hhhc-------------cCceeCCCCCcE
Confidence            4588999999882   1222             456789998854


No 125
>PF09723 Zn-ribbon_8:  Zinc ribbon domain;  InterPro: IPR013429  This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=25.25  E-value=38  Score=28.48  Aligned_cols=13  Identities=23%  Similarity=0.511  Sum_probs=8.7

Q ss_pred             ccCCCCCcccccc
Q 000554          848 HKCKICSQVFLHD  860 (1428)
Q Consensus       848 ykC~~CgK~F~s~  860 (1428)
                      |+|..||..|...
T Consensus         6 y~C~~Cg~~fe~~   18 (42)
T PF09723_consen    6 YRCEECGHEFEVL   18 (42)
T ss_pred             EEeCCCCCEEEEE
Confidence            6777777776544


No 126
>PF02892 zf-BED:  BED zinc finger;  InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. 
The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=25.12  E-value=54  Score=27.44  Aligned_cols=28  Identities=29%  Similarity=0.642  Sum_probs=15.4

Q ss_pred             CccceecCccCcccCCh----hhHHHHHHhhc
Q 000554          981 SIRKFICRFCGLKFDLL----PDLGRHHQAAH 1008 (1428)
Q Consensus       981 GeKpykC~~CGKsFs~~----s~L~rHHqrvH 1008 (1428)
                      +....+|..|++.+...    +.|.+|..+.|
T Consensus        13 ~~~~a~C~~C~~~~~~~~~~ts~l~~HL~~~h   44 (45)
T PF02892_consen   13 DKKKAKCKYCGKVIKYSSGGTSNLKRHLKKKH   44 (45)
T ss_dssp             CSS-EEETTTTEE-----SSTHHHHHHHHHTT
T ss_pred             CcCeEEeCCCCeEEeeCCCcHHHHHHhhhhhC
Confidence            34557788888777664    67777544555


No 127
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=24.87  E-value=47  Score=34.67  Aligned_cols=34  Identities=24%  Similarity=0.188  Sum_probs=23.1

Q ss_pred             eecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcccCCchhhh
Q 000554          985 FICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAYKLKSGRL 1035 (1428)
Q Consensus       985 ykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsFs~ks~L~ 1035 (1428)
                      ..|+.||++|-..   .              ..|..|+.||..|.....++
T Consensus        10 r~Cp~cg~kFYDL---n--------------k~p~vcP~cg~~~~~~~~~~   43 (129)
T TIGR02300        10 RICPNTGSKFYDL---N--------------RRPAVSPYTGEQFPPEEALK   43 (129)
T ss_pred             ccCCCcCcccccc---C--------------CCCccCCCcCCccCcchhhc
Confidence            5788888888642   2              35788888888876553333


No 128
>KOG2186 consensus Cell growth-regulating nucleolar protein [Cell cycle control, cell division, chromosome partitioning]
Probab=23.99  E-value=39  Score=38.88  Aligned_cols=47  Identities=26%  Similarity=0.551  Sum_probs=39.1

Q ss_pred             cccccccccccCChhhhhhhhhhcccccccccccccccccCCCCCCChhhhhhhhh
Q 000554          881 GYACAICLDSFTNKKVLESHVQERHHVQFVEQCMLQQCIPCGSHFGNTEELWLHVQ  936 (1428)
Q Consensus       881 pykC~~CgKsF~~ks~L~~H~r~Hhgek~~e~~kpfkC~~CgKsF~sks~L~~H~r  936 (1428)
                      -|.|..||.... +..+.+|+-.-++.       -|.|-.|++.|.. .....|..
T Consensus         3 ~FtCnvCgEsvK-Kp~vekH~srCrn~-------~fSCIDC~k~F~~-~sYknH~k   49 (276)
T KOG2186|consen    3 FFTCNVCGESVK-KPQVEKHMSRCRNA-------YFSCIDCGKTFER-VSYKNHTK   49 (276)
T ss_pred             EEehhhhhhhcc-ccchHHHHHhccCC-------eeEEeeccccccc-chhhhhhh
Confidence            488999999876 45577799888885       5899999999988 78888876


No 129
>cd05834 HDGF_related The PWWP domain is an essential part of the Hepatoma Derived Growth Factor (HDGF) family of proteins, and is necessary for DNA binding by HDGF. This family of endogenous nuclear-targeted mitogens includes HRP (HDGF-related proteins 1, 2, 3, 4, or HPR1, HPR2, HPR3, HPR4, respectively) and lens epithelium-derived growth factor, LEDGF. Members of the HDGF family have been linked to human diseases, and HDGF is a prognostic factor in several types of cancer. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=23.41  E-value=1e+02  Score=29.86  Aligned_cols=52  Identities=23%  Similarity=0.233  Sum_probs=35.6

Q ss_pred             EEEEEeccc-cccceeeeeccCCCccccccccCCCccEEEEEeccCCcchhhhhhccccccCC
Q 000554          157 ALWVKWRGK-WQAGIRCARADWPLPTLKAKPTHDRKKYFVIFFPHTRNYSWADMLLVRSINEF  218 (1428)
Q Consensus       157 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  218 (1428)
                      -+|.|=+|- |=-|+=|...+.         +-..++|.|.||. |..|.||..-.+.++.++
T Consensus         8 lVwaK~kGyp~WPa~I~~~~~~---------~~~~~~~~V~FfG-t~~~a~v~~~~l~pf~~~   60 (83)
T cd05834           8 LVFAKVKGYPAWPARVDEPEDW---------KPPGKKYPVYFFG-THETAFLKPEDLFPYTEN   60 (83)
T ss_pred             EEEEecCCCCCCCEEEeccccc---------CCCCCEEEEEEeC-CCCEeEECHHHceecccc
Confidence            368887773 333444444332         2235789999999 789999998888888775


No 130
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=23.12  E-value=54  Score=29.90  Aligned_cols=32  Identities=22%  Similarity=0.285  Sum_probs=18.0

Q ss_pred             cceecCccCcc-cCChhhHHHHHHhhccCCCCCCCCCcccCCCCc
Q 000554          983 RKFICRFCGLK-FDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFY 1026 (1428)
Q Consensus       983 KpykC~~CGKs-Fs~~s~L~rHHqrvHtge~~~~eKpykC~~CgK 1026 (1428)
                      -.|.|+.||+. -.+-..-++     +       ..+|.|+.||.
T Consensus        24 ~~F~CPnCG~~~I~RC~~CRk-----~-------~~~Y~CP~CGF   56 (59)
T PRK14890         24 VKFLCPNCGEVIIYRCEKCRK-----Q-------SNPYTCPKCGF   56 (59)
T ss_pred             CEeeCCCCCCeeEeechhHHh-----c-------CCceECCCCCC
Confidence            34777777776 333222222     2       34788888874


No 131
>PF09845 DUF2072:  Zn-ribbon containing protein (DUF2072);  InterPro: IPR018645  These archaeal zinc-ribbon-containing proteins have no known function.
Probab=22.83  E-value=45  Score=35.03  Aligned_cols=15  Identities=27%  Similarity=0.474  Sum_probs=12.0

Q ss_pred             ceecCccCcccCChh
Q 000554          984 KFICRFCGLKFDLLP  998 (1428)
Q Consensus       984 pykC~~CGKsFs~~s  998 (1428)
                      |++|..||+.|...+
T Consensus         1 PH~Ct~Cg~~f~dgs   15 (131)
T PF09845_consen    1 PHQCTKCGRVFEDGS   15 (131)
T ss_pred             CcccCcCCCCcCCCc
Confidence            578888888888765


No 132
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=22.81  E-value=86  Score=32.22  Aligned_cols=50  Identities=20%  Similarity=0.276  Sum_probs=34.9

Q ss_pred             ccCCCCCcccccccccccccccccchhhhcccCcccccccccccCChhhhhhhhhhcc
Q 000554          848 HKCKICSQVFLHDQELGVHWMDNHKKEAQWLFRGYACAICLDSFTNKKVLESHVQERH  905 (1428)
Q Consensus       848 ykC~~CgK~F~s~s~L~~H~~r~Ht~e~~~l~KpykC~~CgKsF~~ks~L~~H~r~Hh  905 (1428)
                      ..|--|.+.|........- .  -...     ..|.|+.|...|-..-+...|...|.
T Consensus        56 ~~C~~C~~~f~~~~~~~~~-~--~~~~-----~~y~C~~C~~~FC~dCD~fiHe~Lh~  105 (112)
T TIGR00622        56 RFCFGCQGPFPKPPVSPFD-E--LKDS-----HRYVCAVCKNVFCVDCDVFVHESLHC  105 (112)
T ss_pred             CcccCcCCCCCCccccccc-c--cccc-----cceeCCCCCCccccccchhhhhhccC
Confidence            3599999999865422211 0  0112     56999999999999999999976554


No 133
>PF03604 DNA_RNApol_7kD:  DNA directed RNA polymerase, 7 kDa subunit;  InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=22.74  E-value=44  Score=26.86  Aligned_cols=11  Identities=36%  Similarity=1.108  Sum_probs=6.9

Q ss_pred             eecCccCcccC
Q 000554          985 FICRFCGLKFD  995 (1428)
Q Consensus       985 ykC~~CGKsFs  995 (1428)
                      |.|..||..+.
T Consensus         1 Y~C~~Cg~~~~   11 (32)
T PF03604_consen    1 YICGECGAEVE   11 (32)
T ss_dssp             EBESSSSSSE-
T ss_pred             CCCCcCCCeeE
Confidence            56777777665


No 134
>PF08879 WRC:  WRC;  InterPro: IPR014977 WRC is named after the conserved Trp-Arg-Cys motif, it contains two distinctive features: a putative nuclear localisation signal and a zinc-finger motif (C3H). It is suggested that WRC functions in DNA binding []. ; GO: 0005515 protein binding
Probab=22.57  E-value=30  Score=30.02  Aligned_cols=20  Identities=50%  Similarity=0.865  Sum_probs=18.1

Q ss_pred             ccCcccccccCCCcccccCC
Q 000554          589 LGTRCKHRALYGSSFCKKHR  608 (1428)
Q Consensus       589 ~g~~ckh~~~~~~~~c~~~~  608 (1428)
                      -|=||+..+++|.++|.+|.
T Consensus        13 K~WrC~~~a~~g~~~Ce~H~   32 (46)
T PF08879_consen   13 KGWRCSRRALPGYSLCEHHL   32 (46)
T ss_pred             CccccCCccCCCccHHHHHH
Confidence            45699999999999999997


No 135
>PF12013 DUF3505:  Protein of unknown function (DUF3505);  InterPro: IPR022698  This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains. 
Probab=22.22  E-value=1e+02  Score=31.01  Aligned_cols=24  Identities=21%  Similarity=0.538  Sum_probs=20.2

Q ss_pred             eec----CccCcccCChhhHHHHHHhhc
Q 000554          985 FIC----RFCGLKFDLLPDLGRHHQAAH 1008 (1428)
Q Consensus       985 ykC----~~CGKsFs~~s~L~rHHqrvH 1008 (1428)
                      |.|    ..|+..+.+...+.+|....|
T Consensus        81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~H  108 (109)
T PF12013_consen   81 YRCQCDPPHCGYITRSKKTMRKHWRKEH  108 (109)
T ss_pred             eeeecCCCCCCcEeccHHHHHHHHHHhc
Confidence            899    999999999999999544444


No 136
>PF13719 zinc_ribbon_5:  zinc-ribbon domain
Probab=21.49  E-value=61  Score=26.61  Aligned_cols=32  Identities=19%  Similarity=0.229  Sum_probs=16.6

Q ss_pred             ecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCccc
Q 000554          986 ICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYAY 1028 (1428)
Q Consensus       986 kC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKsF 1028 (1428)
                      .|+.|+..|.-..+-..      .+     .+..+|+.|+..|
T Consensus         4 ~CP~C~~~f~v~~~~l~------~~-----~~~vrC~~C~~~f   35 (37)
T PF13719_consen    4 TCPNCQTRFRVPDDKLP------AG-----GRKVRCPKCGHVF   35 (37)
T ss_pred             ECCCCCceEEcCHHHcc------cC-----CcEEECCCCCcEe
Confidence            56666666665443211      11     3456666666655


No 137
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=21.33  E-value=46  Score=35.93  Aligned_cols=16  Identities=25%  Similarity=0.476  Sum_probs=9.4

Q ss_pred             ccccccccccCChhhh
Q 000554          882 YACAICLDSFTNKKVL  897 (1428)
Q Consensus       882 ykC~~CgKsF~~ks~L  897 (1428)
                      ++|+.||++|.+...+
T Consensus        29 ~~c~~c~~~f~~~e~~   44 (154)
T PRK00464         29 RECLACGKRFTTFERV   44 (154)
T ss_pred             eeccccCCcceEeEec
Confidence            5666666666654443


No 138
>KOG2593 consensus Transcription initiation factor IIE, alpha subunit [Transcription]
Probab=20.96  E-value=63  Score=39.93  Aligned_cols=42  Identities=12%  Similarity=0.105  Sum_probs=29.1

Q ss_pred             hhcCCccceecCccCcccCChhhHHHHHHhhccCCCCCCCCCcccCCCCcc
Q 000554          977 ENLGSIRKFICRFCGLKFDLLPDLGRHHQAAHMGPNLVNSRPHKKGIRFYA 1027 (1428)
Q Consensus       977 rtHtGeKpykC~~CGKsFs~~s~L~rHHqrvHtge~~~~eKpykC~~CgKs 1027 (1428)
                      +.-+...-|.|+.|.++|+....|+-    +-..     ...|.|..|+--
T Consensus       121 ~d~t~~~~Y~Cp~C~kkyt~Lea~~L----~~~~-----~~~F~C~~C~ge  162 (436)
T KOG2593|consen  121 RDDTNVAGYVCPNCQKKYTSLEALQL----LDNE-----TGEFHCENCGGE  162 (436)
T ss_pred             hhccccccccCCccccchhhhHHHHh----hccc-----CceEEEecCCCc
Confidence            33445567999999999988777655    2221     347999999743


No 139
>PF14353 CpXC:  CpXC protein
Probab=20.12  E-value=38  Score=34.92  Aligned_cols=15  Identities=20%  Similarity=0.419  Sum_probs=9.3

Q ss_pred             ccCCCCCcccccccc
Q 000554          848 HKCKICSQVFLHDQE  862 (1428)
Q Consensus       848 ykC~~CgK~F~s~s~  862 (1428)
                      ..|+.|+..|.....
T Consensus         2 itCP~C~~~~~~~v~   16 (128)
T PF14353_consen    2 ITCPHCGHEFEFEVW   16 (128)
T ss_pred             cCCCCCCCeeEEEEE
Confidence            357777777765443


No 140
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=20.01  E-value=46  Score=28.35  Aligned_cols=13  Identities=31%  Similarity=0.917  Sum_probs=7.9

Q ss_pred             ceecCccCcccCC
Q 000554          984 KFICRFCGLKFDL  996 (1428)
Q Consensus       984 pykC~~CGKsFs~  996 (1428)
                      .|+|+.||..|..
T Consensus         3 ~y~C~~CG~~~~~   15 (46)
T PRK00398          3 EYKCARCGREVEL   15 (46)
T ss_pred             EEECCCCCCEEEE
Confidence            4666666666543


Done!