Query         psy6598
Match_columns 154
No_of_seqs    216 out of 1616
Neff          10.3
Searched_HMMs 46136
Date          Fri Aug 16 22:11:12 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy6598.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6598hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG2462|consensus               99.9 2.7E-25 5.8E-30  152.9   5.8  105    2-112   160-264 (279)
  2 KOG2462|consensus               99.9 3.8E-24 8.1E-29  147.3   2.8   79    3-85    187-265 (279)
  3 KOG3576|consensus               99.8 1.2E-19 2.5E-24  120.2   0.9   84    3-90    117-200 (267)
  4 KOG3623|consensus               99.7 3.3E-19 7.2E-24  136.4   1.8   79    2-84    893-971 (1007)
  5 KOG1074|consensus               99.6 3.1E-16 6.7E-21  121.8   1.1   52    3-58    353-404 (958)
  6 KOG3623|consensus               99.5 5.2E-15 1.1E-19  113.8   2.6   78    4-85    241-331 (1007)
  7 KOG3576|consensus               99.5 1.5E-14 3.3E-19   96.1   2.5   85    2-90    144-239 (267)
  8 PHA00733 hypothetical protein   99.4 2.2E-13 4.7E-18   86.4   3.7   83    1-89     38-125 (128)
  9 KOG1074|consensus               99.3 2.1E-13 4.6E-18  106.3   1.4   85    3-91    605-696 (958)
 10 KOG3608|consensus               99.3 2.2E-12 4.8E-17   92.3   3.8   79    4-87    264-345 (467)
 11 KOG3608|consensus               99.2 1.2E-11 2.5E-16   88.7   5.1   88    1-91    290-380 (467)
 12 PHA02768 hypothetical protein;  99.2   9E-12 1.9E-16   66.3   2.0   42   34-79      6-47  (55)
 13 PHA02768 hypothetical protein;  99.1 5.2E-11 1.1E-15   63.4   2.0   44    3-52      5-48  (55)
 14 PLN03086 PRLI-interacting fact  99.1   4E-10 8.7E-15   86.7   6.5   92    1-102   451-552 (567)
 15 PF13465 zf-H2C2_2:  Zinc-finge  99.0 2.1E-10 4.6E-15   52.4   2.5   24   51-74      2-25  (26)
 16 KOG3993|consensus               98.9 7.8E-10 1.7E-14   81.0   2.7   86    4-93    268-386 (500)
 17 PF13465 zf-H2C2_2:  Zinc-finge  98.9 2.2E-09 4.8E-14   49.0   3.0   26   20-47      1-26  (26)
 18 PHA00733 hypothetical protein   98.8   6E-09 1.3E-13   66.2   4.5   54    1-60     71-124 (128)
 19 PHA00616 hypothetical protein   98.8 4.5E-09 9.7E-14   53.4   1.9   34    3-38      1-34  (44)
 20 PLN03086 PRLI-interacting fact  98.7 2.6E-08 5.7E-13   76.9   6.0   79    2-88    477-565 (567)
 21 PHA00732 hypothetical protein   98.5 6.4E-08 1.4E-12   56.2   2.5   48    3-60      1-49  (79)
 22 KOG3993|consensus               98.5 8.2E-08 1.8E-12   70.7   2.8   52    3-58    295-379 (500)
 23 PHA00732 hypothetical protein   98.5 1.6E-07 3.5E-12   54.5   3.0   46   33-86      1-47  (79)
 24 PHA00616 hypothetical protein   98.4 7.1E-08 1.5E-12   49.0   1.0   34   33-68      1-34  (44)
 25 PF00096 zf-C2H2:  Zinc finger,  98.3 3.7E-07 8.1E-12   40.3   2.0   22   64-85      1-22  (23)
 26 COG5189 SFP1 Putative transcri  98.3 2.6E-07 5.7E-12   65.8   1.3   54   31-84    347-419 (423)
 27 PF00096 zf-C2H2:  Zinc finger,  98.3 1.1E-06 2.3E-11   38.7   2.5   23    4-28      1-23  (23)
 28 PF05605 zf-Di19:  Drought indu  98.2 2.2E-06 4.7E-11   46.2   3.9   49   34-87      3-53  (54)
 29 PF13894 zf-C2H2_4:  C2H2-type   98.2 1.2E-06 2.7E-11   38.8   2.3   24   64-87      1-24  (24)
 30 PF13912 zf-C2H2_6:  C2H2-type   98.1 1.8E-06 3.8E-11   39.6   1.7   25   63-87      1-25  (27)
 31 PF12756 zf-C2H2_2:  C2H2 type   98.0 3.3E-06 7.2E-11   51.1   2.5   74    5-87      1-74  (100)
 32 PF05605 zf-Di19:  Drought indu  98.0 1.2E-05 2.7E-10   43.2   3.9   50    3-59      2-53  (54)
 33 PF13912 zf-C2H2_6:  C2H2-type   97.9   1E-05 2.2E-10   37.0   2.1   25    3-29      1-25  (27)
 34 COG5189 SFP1 Putative transcri  97.9 4.7E-06   1E-10   59.6   0.9   54    1-56    347-419 (423)
 35 PF13894 zf-C2H2_4:  C2H2-type   97.9 1.8E-05 3.9E-10   34.9   2.6   23    4-28      1-23  (24)
 36 PF09237 GAGA:  GAGA factor;  I  97.6 4.8E-05   1E-09   39.6   2.2   40   52-91     12-52  (54)
 37 smart00355 ZnF_C2H2 zinc finge  97.4 0.00011 2.5E-09   32.7   2.0   23   64-86      1-23  (26)
 38 PF09237 GAGA:  GAGA factor;  I  97.4 0.00026 5.7E-09   36.9   3.0   33   29-63     20-52  (54)
 39 PF13909 zf-H2C2_5:  C2H2-type   97.4 0.00018 3.8E-09   31.8   2.0   23   64-87      1-23  (24)
 40 smart00355 ZnF_C2H2 zinc finge  97.3 0.00024 5.2E-09   31.5   2.4   24    4-29      1-24  (26)
 41 PRK04860 hypothetical protein;  97.2 0.00026 5.6E-09   46.7   2.5   39   33-77    119-157 (160)
 42 PF12874 zf-met:  Zinc-finger o  97.2 0.00029 6.3E-09   31.4   1.6   22   64-85      1-22  (25)
 43 PF12874 zf-met:  Zinc-finger o  97.1  0.0005 1.1E-08   30.6   2.0   22    4-27      1-22  (25)
 44 PF13909 zf-H2C2_5:  C2H2-type   96.9  0.0013 2.8E-08   29.0   2.3   23    4-29      1-23  (24)
 45 PF12171 zf-C2H2_jaz:  Zinc-fin  96.5 0.00069 1.5E-08   30.8   0.1   22   64-85      2-23  (27)
 46 PF13913 zf-C2HC_2:  zinc-finge  96.3  0.0035 7.5E-08   28.0   1.7   21   64-85      3-23  (25)
 47 PF12171 zf-C2H2_jaz:  Zinc-fin  96.3  0.0011 2.3E-08   30.2  -0.0   21    4-26      2-22  (27)
 48 COG5048 FOG: Zn-finger [Genera  96.0  0.0018 3.9E-08   48.8  -0.1   57   10-66    294-354 (467)
 49 COG5236 Uncharacterized conser  95.5   0.055 1.2E-06   39.9   5.9   83    5-94    222-312 (493)
 50 smart00451 ZnF_U1 U1-like zinc  95.4   0.012 2.5E-07   28.3   1.7   23   63-85      3-25  (35)
 51 COG5048 FOG: Zn-finger [Genera  95.2  0.0066 1.4E-07   45.7   0.5   62   32-95    288-355 (467)
 52 smart00451 ZnF_U1 U1-like zinc  95.0   0.021 4.7E-07   27.3   1.9   22    3-26      3-24  (35)
 53 KOG1146|consensus               94.7   0.013 2.9E-07   49.8   1.0   75   10-86    441-541 (1406)
 54 PF13913 zf-C2HC_2:  zinc-finge  94.5    0.04 8.6E-07   24.5   2.0   20    4-26      3-22  (25)
 55 KOG2482|consensus               94.2   0.071 1.5E-06   39.2   3.6   23    3-27    195-217 (423)
 56 COG5236 Uncharacterized conser  94.1   0.038 8.2E-07   40.7   2.1   27    3-29    151-177 (493)
 57 KOG1146|consensus               93.8   0.016 3.5E-07   49.3  -0.2   81    4-93   1261-1358(1406)
 58 KOG2893|consensus               93.7   0.021 4.6E-07   39.8   0.3   52   31-89      9-60  (341)
 59 PF12013 DUF3505:  Protein of u  93.6    0.21 4.5E-06   30.8   4.6   25   64-88     81-109 (109)
 60 KOG2231|consensus               93.6   0.084 1.8E-06   42.4   3.4   13   17-29    126-139 (669)
 61 COG4049 Uncharacterized protei  93.4    0.04 8.6E-07   29.3   0.9   33   58-90     12-44  (65)
 62 PF12756 zf-C2H2_2:  C2H2 type   93.4   0.098 2.1E-06   31.2   2.8   20   35-56     52-71  (100)
 63 KOG4173|consensus               92.9   0.065 1.4E-06   36.6   1.6   81    4-89     80-172 (253)
 64 cd00350 rubredoxin_like Rubred  92.4     0.1 2.2E-06   24.8   1.6   10   62-71     16-25  (33)
 65 KOG2231|consensus               92.0    0.21 4.6E-06   40.2   3.7   71    5-85    184-261 (669)
 66 KOG2893|consensus               92.0   0.052 1.1E-06   37.9   0.3   45    8-58     13-58  (341)
 67 PRK04860 hypothetical protein;  91.0     0.2 4.4E-06   33.2   2.3   36   63-102   119-154 (160)
 68 PF02892 zf-BED:  BED zinc fing  90.7     0.3 6.4E-06   24.8   2.4   26   60-85     13-42  (45)
 69 KOG4173|consensus               90.1   0.078 1.7E-06   36.2  -0.2   53    5-59    108-171 (253)
 70 PF12013 DUF3505:  Protein of u  90.0    0.93   2E-05   27.9   4.5   25   34-60     81-109 (109)
 71 KOG2482|consensus               89.0    0.41 8.9E-06   35.4   2.7   81    4-88    145-304 (423)
 72 smart00614 ZnF_BED BED zinc fi  88.6    0.43 9.2E-06   24.9   2.0   21   64-84     19-44  (50)
 73 PF05443 ROS_MUCR:  ROS/MUCR tr  86.6    0.46 9.9E-06   30.4   1.6   26   62-90     71-96  (132)
 74 KOG2186|consensus               85.9    0.56 1.2E-05   33.2   1.8   47    4-57      4-50  (276)
 75 KOG2785|consensus               85.7     1.2 2.6E-05   33.4   3.5   74    3-85    166-242 (390)
 76 cd00729 rubredoxin_SM Rubredox  85.1    0.67 1.4E-05   22.2   1.4    9   63-71     18-26  (34)
 77 KOG2785|consensus               84.8     1.1 2.4E-05   33.7   3.0   22    3-26     68-89  (390)
 78 TIGR00373 conserved hypothetic  84.2     1.6 3.4E-05   28.9   3.3   39   28-77    104-142 (158)
 79 PF09986 DUF2225:  Uncharacteri  83.4    0.43 9.3E-06   33.2   0.4   13   64-76     49-61  (214)
 80 PF13717 zinc_ribbon_4:  zinc-r  82.5     1.8   4E-05   20.9   2.3   32   35-73      4-35  (36)
 81 COG4049 Uncharacterized protei  82.3    0.77 1.7E-05   24.5   1.0   27   28-56     12-38  (65)
 82 COG1592 Rubrerythrin [Energy p  82.3       1 2.2E-05   30.0   1.8   25   32-71    133-157 (166)
 83 PF04959 ARS2:  Arsenite-resist  81.7    0.63 1.4E-05   32.4   0.7   29   62-90     76-104 (214)
 84 smart00659 RPOLCX RNA polymera  81.6     1.6 3.4E-05   22.3   2.0   26   34-72      3-28  (44)
 85 PF13719 zinc_ribbon_5:  zinc-r  81.6     1.7 3.8E-05   21.1   2.1   28   41-73      8-35  (37)
 86 smart00531 TFIIE Transcription  81.5     2.3 4.9E-05   27.7   3.3   41   30-76     96-136 (147)
 87 smart00734 ZnF_Rad18 Rad18-lik  81.2     1.3 2.9E-05   19.7   1.5   21   64-85      2-22  (26)
 88 KOG2186|consensus               81.0     1.3 2.7E-05   31.5   2.0   58   33-95      3-60  (276)
 89 TIGR02098 MJ0042_CXXC MJ0042 f  80.6     1.9 4.2E-05   20.9   2.1   10   64-73     26-35  (38)
 90 PRK06266 transcription initiat  80.5     2.1 4.5E-05   28.9   2.9   37   30-77    114-150 (178)
 91 KOG4167|consensus               80.4    0.37   8E-06   39.1  -0.8   26   63-88    792-817 (907)
 92 TIGR00622 ssl1 transcription f  80.3     3.2   7E-05   25.7   3.4   79    3-87     15-105 (112)
 93 KOG4124|consensus               79.8    0.29 6.3E-06   36.2  -1.5   53   31-83    347-418 (442)
 94 PRK00398 rpoP DNA-directed RNA  76.9       1 2.2E-05   23.0   0.5   11   63-73     21-31  (46)
 95 PF05290 Baculo_IE-1:  Baculovi  76.0     1.5 3.2E-05   28.0   1.1   25   49-78    112-136 (140)
 96 PF09986 DUF2225:  Uncharacteri  75.8    0.57 1.2E-05   32.6  -0.9   44    1-48      3-61  (214)
 97 KOG4167|consensus               75.6    0.79 1.7E-05   37.3  -0.3   24    4-29    793-816 (907)
 98 PF02176 zf-TRAF:  TRAF-type zi  75.0     2.7 5.8E-05   22.5   1.8   44   31-75      7-54  (60)
 99 PF15269 zf-C2H2_7:  Zinc-finge  74.9     2.7 5.8E-05   21.4   1.6   22    4-27     21-42  (54)
100 TIGR02605 CxxC_CxxC_SSSS putat  72.7     1.4   3E-05   23.0   0.3    7   64-70     27-33  (52)
101 PF10571 UPF0547:  Uncharacteri  71.6     2.1 4.6E-05   19.1   0.7   10   65-74     16-25  (26)
102 PF09538 FYDLN_acid:  Protein o  71.1     2.2 4.8E-05   26.3   1.0   30   34-76     10-39  (108)
103 COG1996 RPC10 DNA-directed RNA  70.4     3.6 7.8E-05   21.5   1.5   11   33-45      6-16  (49)
104 COG1997 RPL43A Ribosomal prote  70.2     2.1 4.5E-05   25.2   0.7   33   32-76     34-66  (89)
105 PF05443 ROS_MUCR:  ROS/MUCR tr  70.1     3.1 6.8E-05   26.6   1.5   21    5-30     74-94  (132)
106 KOG4377|consensus               69.4       5 0.00011   30.7   2.6   26   64-89    402-429 (480)
107 COG2888 Predicted Zn-ribbon RN  68.7     6.6 0.00014   21.3   2.3   33   32-71     26-58  (61)
108 PRK09678 DNA-binding transcrip  64.1     1.8 3.8E-05   24.6  -0.4   18   32-49     26-43  (72)
109 PF08790 zf-LYAR:  LYAR-type C2  61.7     1.6 3.4E-05   19.9  -0.7   19    4-25      1-19  (28)
110 KOG1842|consensus               60.5     5.1 0.00011   30.9   1.3   30   62-91     14-43  (505)
111 PF11789 zf-Nse:  Zinc-finger o  60.1     7.3 0.00016   21.0   1.6   31   32-68     23-53  (57)
112 KOG2593|consensus               59.5      12 0.00026   28.9   3.1   38   29-71    124-161 (436)
113 PF01428 zf-AN1:  AN1-like Zinc  59.1     4.4 9.6E-05   20.3   0.6   15   62-76     12-26  (43)
114 PRK00464 nrdR transcriptional   57.5     1.2 2.7E-05   29.3  -2.0   16   34-51     29-44  (154)
115 TIGR00373 conserved hypothetic  57.2     6.7 0.00015   25.9   1.3   31    4-47    110-140 (158)
116 PRK14890 putative Zn-ribbon RN  57.2      13 0.00028   20.2   2.2   33   32-71     24-56  (59)
117 TIGR02300 FYDLN_acid conserved  56.8     6.3 0.00014   25.0   1.1   33   34-79     10-42  (129)
118 PRK06266 transcription initiat  54.7     7.2 0.00016   26.4   1.2   33    3-48    117-149 (178)
119 PF03604 DNA_RNApol_7kD:  DNA d  52.9      12 0.00026   17.6   1.5   10   63-72     17-26  (32)
120 COG4957 Predicted transcriptio  52.1      12 0.00026   24.1   1.7   20    6-30     79-98  (148)
121 PF04959 ARS2:  Arsenite-resist  51.9     5.5 0.00012   27.8   0.3   27   30-58     74-100 (214)
122 smart00154 ZnF_AN1 AN1-like Zi  51.7       8 0.00017   19.0   0.8   14    3-18     12-25  (39)
123 KOG3408|consensus               51.6     9.3  0.0002   24.0   1.2   25   61-85     55-79  (129)
124 KOG4124|consensus               51.5       2 4.4E-05   32.0  -1.9   53    1-55    347-418 (442)
125 COG1198 PriA Primosomal protei  51.0      10 0.00022   31.6   1.6   13   60-72    472-484 (730)
126 KOG3408|consensus               49.3      11 0.00024   23.7   1.3   16   41-56     63-78  (129)
127 KOG2071|consensus               49.2      12 0.00026   30.0   1.8   27   61-87    416-442 (579)
128 PF10276 zf-CHCC:  Zinc-finger   49.2     9.2  0.0002   19.0   0.8   12   62-73     28-39  (40)
129 PF13878 zf-C2H2_3:  zinc-finge  49.1      23 0.00049   17.6   2.2   24    4-29     14-39  (41)
130 PLN02294 cytochrome c oxidase   49.0      10 0.00022   25.4   1.1   21   55-76    134-154 (174)
131 PLN03238 probable histone acet  48.9      20 0.00043   26.2   2.7   29   61-89     46-74  (290)
132 PF09723 Zn-ribbon_8:  Zinc rib  48.6     8.8 0.00019   19.1   0.7   11    4-16      6-16  (42)
133 PHA00626 hypothetical protein   48.1     9.8 0.00021   20.4   0.8   13    3-17     23-35  (59)
134 cd00924 Cyt_c_Oxidase_Vb Cytoc  48.0      11 0.00024   22.8   1.1   20   55-75     72-91  (97)
135 COG5151 SSL1 RNA polymerase II  47.6      14 0.00031   27.2   1.8   48   36-87    365-412 (421)
136 PF06220 zf-U1:  U1 zinc finger  46.9      17 0.00036   17.8   1.5   12    2-15      2-13  (38)
137 smart00834 CxxC_CXXC_SSSS Puta  46.9     9.6 0.00021   18.5   0.7   11    4-16      6-16  (41)
138 PF15135 UPF0515:  Uncharacteri  46.0      13 0.00029   26.5   1.4   15   62-76    154-168 (278)
139 PF04780 DUF629:  Protein of un  44.9      18 0.00038   28.5   2.0   32   59-90     53-84  (466)
140 PF09845 DUF2072:  Zn-ribbon co  44.8      11 0.00023   24.2   0.7   15   33-49      1-15  (131)
141 COG3677 Transposase and inacti  44.2      11 0.00025   24.0   0.8   46   22-76     21-66  (129)
142 PF07754 DUF1610:  Domain of un  43.5      11 0.00024   16.5   0.5    8    3-12     16-23  (24)
143 KOG0717|consensus               42.8      14 0.00031   28.9   1.2   21   64-84    293-313 (508)
144 COG4888 Uncharacterized Zn rib  41.7      10 0.00022   23.0   0.3   39   31-75     20-58  (104)
145 PF08274 PhnA_Zn_Ribbon:  PhnA   41.5      16 0.00034   16.9   0.8    9   63-71     19-27  (30)
146 PRK14873 primosome assembly pr  41.0      18  0.0004   29.8   1.7   10   63-72    422-431 (665)
147 KOG0782|consensus               40.9     3.9 8.4E-05   32.8  -2.0   53   18-77    238-290 (1004)
148 KOG0717|consensus               40.3      20 0.00043   28.1   1.7   21    4-26    293-313 (508)
149 PF05191 ADK_lid:  Adenylate ki  40.2     6.6 0.00014   19.0  -0.6    9    5-15      3-11  (36)
150 PLN00104 MYST -like histone ac  39.4      24 0.00051   27.6   2.0   30   61-90    196-225 (450)
151 KOG2593|consensus               39.3      26 0.00055   27.2   2.1   36    3-45    128-163 (436)
152 PRK03824 hypA hydrogenase nick  39.2      15 0.00032   23.7   0.8   15   32-48     69-83  (135)
153 PF04423 Rad50_zn_hook:  Rad50   39.1      14  0.0003   19.5   0.5   12   65-76     22-33  (54)
154 PTZ00064 histone acetyltransfe  39.0      28  0.0006   27.7   2.3   30   61-90    278-307 (552)
155 COG0068 HypF Hydrogenase matur  38.6     4.2 9.1E-05   33.4  -2.1   57    6-72    126-182 (750)
156 PF13451 zf-trcl:  Probable zin  38.2      15 0.00032   19.2   0.5   12    2-15      3-14  (49)
157 TIGR00100 hypA hydrogenase nic  37.7      20 0.00043   22.3   1.2   24   34-71     71-94  (115)
158 PF05495 zf-CHY:  CHY zinc fing  37.7     9.7 0.00021   21.5  -0.2   31   32-72     40-70  (71)
159 PF01780 Ribosomal_L37ae:  Ribo  37.3      18 0.00039   21.5   0.9   31   32-74     34-64  (90)
160 KOG2636|consensus               36.3      25 0.00055   27.4   1.7   28   56-83    394-422 (497)
161 PF12760 Zn_Tnp_IS1595:  Transp  36.2      49  0.0011   16.7   2.3   11   61-71     35-45  (46)
162 KOG2747|consensus               36.1      22 0.00047   27.3   1.4   33   61-93    156-188 (396)
163 COG3364 Zn-ribbon containing p  35.9      22 0.00049   21.6   1.1   13    3-17      2-14  (112)
164 KOG2807|consensus               35.9      77  0.0017   23.8   4.0   25   63-87    345-369 (378)
165 COG1773 Rubredoxin [Energy pro  35.6      19 0.00042   19.3   0.7   13    2-16      2-14  (55)
166 PTZ00255 60S ribosomal protein  35.4      19 0.00041   21.5   0.7   32   32-75     35-66  (90)
167 COG1655 Uncharacterized protei  35.2     9.7 0.00021   26.9  -0.5   39   31-71     17-70  (267)
168 COG4530 Uncharacterized protei  35.0      16 0.00036   22.5   0.5   25   39-74     13-37  (129)
169 KOG2071|consensus               34.4      26 0.00057   28.2   1.6   26    2-29    417-442 (579)
170 PF06524 NOA36:  NOA36 protein;  33.9      29 0.00064   25.0   1.6   24   62-85    208-231 (314)
171 COG4391 Uncharacterized protei  33.8      19 0.00041   19.7   0.5   43   24-75     15-60  (62)
172 PF09963 DUF2197:  Uncharacteri  32.9      37 0.00081   18.3   1.6    6   10-15      7-12  (56)
173 PLN03238 probable histone acet  32.5      49  0.0011   24.3   2.6   25   31-57     46-70  (290)
174 PF01215 COX5B:  Cytochrome c o  32.3      20 0.00043   23.2   0.5   19   57-76    107-125 (136)
175 TIGR00280 L37a ribosomal prote  32.3      20 0.00044   21.4   0.5   33   32-76     34-66  (91)
176 PRK12380 hydrogenase nickel in  32.2      29 0.00063   21.5   1.3   24   34-71     71-94  (113)
177 smart00440 ZnF_C2C2 C2C2 Zinc   32.1     8.8 0.00019   19.0  -0.9   10   64-73     29-38  (40)
178 PF10537 WAC_Acf1_DNA_bd:  ATP-  32.0 1.1E+02  0.0023   18.7   3.7   42    1-47      1-42  (102)
179 PF04780 DUF629:  Protein of un  31.7      42 0.00091   26.5   2.3   25    4-30     58-83  (466)
180 PTZ00448 hypothetical protein;  31.7      30 0.00064   26.3   1.4   24   63-86    314-337 (373)
181 PF14446 Prok-RING_1:  Prokaryo  31.5      21 0.00045   19.1   0.5    9   64-72     22-30  (54)
182 COG1675 TFA1 Transcription ini  30.6      74  0.0016   21.6   3.0   40   25-75    105-144 (176)
183 KOG1994|consensus               30.3      34 0.00074   24.1   1.4   23   32-56    238-260 (268)
184 PF08209 Sgf11:  Sgf11 (transcr  30.2      69  0.0015   15.1   2.1   24   63-87      4-27  (33)
185 smart00647 IBR In Between Ring  30.1      41 0.00089   17.8   1.6   34   35-76     20-53  (64)
186 KOG4377|consensus               29.8      18 0.00039   27.8   0.1   25   34-58    402-426 (480)
187 PF10263 SprT-like:  SprT-like   29.8      20 0.00044   23.2   0.3   32   33-74    123-154 (157)
188 KOG4727|consensus               29.5      35 0.00077   22.9   1.4   22    3-26     75-96  (193)
189 COG3091 SprT Zn-dependent meta  29.4      41 0.00089   22.2   1.6   32   33-71    117-148 (156)
190 PLN03239 histone acetyltransfe  28.9      36 0.00078   25.7   1.5   25   61-85    104-128 (351)
191 PF03145 Sina:  Seven in absent  28.2      83  0.0018   21.4   3.1   47   40-89     23-73  (198)
192 PTZ00043 cytochrome c oxidase   28.1      31 0.00066   24.4   0.9   17   60-76    178-194 (268)
193 PRK00564 hypA hydrogenase nick  27.9      34 0.00073   21.4   1.1   13   34-48     72-84  (117)
194 PF07975 C1_4:  TFIIH C1-like d  27.7      19 0.00042   18.9  -0.1   26   31-58     19-44  (51)
195 PF04606 Ogr_Delta:  Ogr/Delta-  27.6      12 0.00026   19.2  -0.9   17   32-48     24-40  (47)
196 PF13821 DUF4187:  Domain of un  27.3   1E+02  0.0023   16.4   2.7   19   34-54     28-46  (55)
197 PF07150 DUF1390:  Protein of u  27.1      60  0.0013   23.0   2.2   21   64-85      8-28  (229)
198 PF14447 Prok-RING_4:  Prokaryo  26.8      35 0.00076   18.3   0.8   13   64-76     40-52  (55)
199 KOG1280|consensus               26.6      71  0.0015   24.2   2.6   41   28-70     74-116 (381)
200 PLN02748 tRNA dimethylallyltra  26.5      48   0.001   26.2   1.8   21   34-56    419-440 (468)
201 PRK03976 rpl37ae 50S ribosomal  26.5      28  0.0006   20.8   0.4   32   32-75     35-66  (90)
202 PRK00432 30S ribosomal protein  26.4      48   0.001   17.3   1.3   12   62-73     36-47  (50)
203 KOG1813|consensus               26.3      41 0.00089   24.8   1.3   22   65-86    277-302 (313)
204 PF09082 DUF1922:  Domain of un  26.2      33 0.00072   19.2   0.7   16   57-73     14-29  (68)
205 PF01155 HypA:  Hydrogenase exp  26.1      36 0.00078   21.1   0.9   25   34-72     71-95  (113)
206 TIGR00595 priA primosomal prot  25.1      22 0.00048   28.2  -0.2   11   61-71    251-261 (505)
207 PF12230 PRP21_like_P:  Pre-mRN  24.9      24 0.00053   24.7   0.0   34   61-95    166-199 (229)
208 KOG2807|consensus               24.8 1.5E+02  0.0032   22.4   3.9   30   33-70    345-374 (378)
209 PF00301 Rubredoxin:  Rubredoxi  24.7      32  0.0007   17.7   0.5    7   64-70     35-41  (47)
210 COG5027 SAS2 Histone acetyltra  24.7      47   0.001   25.1   1.4   31   62-92    157-187 (395)
211 PTZ00448 hypothetical protein;  24.3      59  0.0013   24.8   1.9   21    4-26    315-335 (373)
212 PRK03681 hypA hydrogenase nick  24.2      35 0.00075   21.2   0.6   25   34-71     71-95  (114)
213 PF07800 DUF1644:  Protein of u  24.2 2.3E+02  0.0049   19.0   5.6   81    8-92     29-137 (162)
214 cd00065 FYVE FYVE domain; Zinc  24.0      54  0.0012   17.0   1.3    7   10-16      7-13  (57)
215 KOG2636|consensus               23.6      68  0.0015   25.2   2.1   29   26-56    394-423 (497)
216 PF04641 Rtf2:  Rtf2 RING-finge  23.2      92   0.002   22.4   2.7   36   33-76    128-163 (260)
217 PF03811 Zn_Tnp_IS1:  InsA N-te  23.0      31 0.00068   16.6   0.2   19   51-69     17-35  (36)
218 TIGR00627 tfb4 transcription f  23.0      64  0.0014   23.6   1.8   11   64-74    256-266 (279)
219 PF14311 DUF4379:  Domain of un  22.9      70  0.0015   16.7   1.6   13   34-48     29-41  (55)
220 KOG2312|consensus               22.9      26 0.00057   28.9  -0.2   24    4-27    644-667 (847)
221 PF13824 zf-Mss51:  Zinc-finger  22.8      64  0.0014   17.3   1.3   10   62-71     13-22  (55)
222 PTZ00064 histone acetyltransfe  22.6      75  0.0016   25.4   2.2   25   31-57    278-302 (552)
223 TIGR01206 lysW lysine biosynth  22.5      40 0.00088   17.9   0.6   13   64-76      3-15  (54)
224 smart00064 FYVE Protein presen  22.4      56  0.0012   17.8   1.2   10   64-73     27-36  (68)
225 PF06397 Desulfoferrod_N:  Desu  22.3      42  0.0009   16.3   0.5   26    3-43      6-31  (36)
226 PLN00104 MYST -like histone ac  22.0      71  0.0015   25.1   2.0   24   32-57    197-220 (450)
227 cd00730 rubredoxin Rubredoxin;  21.9      41 0.00088   17.6   0.5    7   64-70     35-41  (50)
228 PF12907 zf-met2:  Zinc-binding  21.7      51  0.0011   16.4   0.8   25    4-30      2-30  (40)
229 KOG2907|consensus               21.6      57  0.0012   20.3   1.2   38   34-75     75-114 (116)
230 PF14634 zf-RING_5:  zinc-RING   21.4 1.1E+02  0.0025   15.0   2.1   11   61-71     34-44  (44)
231 KOG0696|consensus               21.0      34 0.00074   26.9   0.1   47   21-70     81-128 (683)
232 KOG0978|consensus               20.2      36 0.00077   28.3   0.1   21   62-82    677-697 (698)

No 1  
>KOG2462|consensus
Probab=99.92  E-value=2.7e-25  Score=152.94  Aligned_cols=105  Identities=25%  Similarity=0.464  Sum_probs=97.1

Q ss_pred             CCeecccccccccccCHHHHHHHHHHhcCCCceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccCchhHHH
Q psy6598           2 KPFFYFPWDCKISFASATNYKNHMRIHSGEKPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQLCTLNV   81 (154)
Q Consensus         2 k~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~   81 (154)
                      +.+.|+  +|||.|.+...|..|+++|+  -+++|.+  |||.|.+...|+.|+|+|+|+|||.|+.|+|.|.-+++|+.
T Consensus       160 ka~~C~--~C~K~YvSmpALkMHirTH~--l~c~C~i--CGKaFSRPWLLQGHiRTHTGEKPF~C~hC~kAFADRSNLRA  233 (279)
T KOG2462|consen  160 KAFSCK--YCGKVYVSMPALKMHIRTHT--LPCECGI--CGKAFSRPWLLQGHIRTHTGEKPFSCPHCGKAFADRSNLRA  233 (279)
T ss_pred             ccccCC--CCCceeeehHHHhhHhhccC--CCccccc--ccccccchHHhhcccccccCCCCccCCcccchhcchHHHHH
Confidence            568899  99999999999999999996  6799999  99999999999999999999999999999999999999999


Q ss_pred             HHHhhcCcCCCCCCCCCcchhhhHHhhhhcC
Q psy6598          82 HKKTNHRESKNKKPSNRNNIVKQECVQKINS  112 (154)
Q Consensus        82 H~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  112 (154)
                      ||++|...+++.+..+.+.+....-+.++..
T Consensus       234 HmQTHS~~K~~qC~~C~KsFsl~SyLnKH~E  264 (279)
T KOG2462|consen  234 HMQTHSDVKKHQCPRCGKSFALKSYLNKHSE  264 (279)
T ss_pred             HHHhhcCCccccCcchhhHHHHHHHHHHhhh
Confidence            9999999999999999998887766666544


No 2  
>KOG2462|consensus
Probab=99.89  E-value=3.8e-24  Score=147.30  Aligned_cols=79  Identities=34%  Similarity=0.640  Sum_probs=76.7

Q ss_pred             CeecccccccccccCHHHHHHHHHHhcCCCceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccCchhHHHH
Q psy6598           3 PFFYFPWDCKISFASATNYKNHMRIHSGEKPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQLCTLNVH   82 (154)
Q Consensus         3 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H   82 (154)
                      ++.|.  +|||.|...+.|+.|+|+|+|+|||.|..  |+++|..++.|+.|+++|.+.|+|+|..|+|.|.+.+.|.+|
T Consensus       187 ~c~C~--iCGKaFSRPWLLQGHiRTHTGEKPF~C~h--C~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH  262 (279)
T KOG2462|consen  187 PCECG--ICGKAFSRPWLLQGHIRTHTGEKPFSCPH--CGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKH  262 (279)
T ss_pred             Ccccc--cccccccchHHhhcccccccCCCCccCCc--ccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHh
Confidence            67899  99999999999999999999999999988  999999999999999999999999999999999999999999


Q ss_pred             HHh
Q psy6598          83 KKT   85 (154)
Q Consensus        83 ~~~   85 (154)
                      ...
T Consensus       263 ~ES  265 (279)
T KOG2462|consen  263 SES  265 (279)
T ss_pred             hhh
Confidence            875


No 3  
>KOG3576|consensus
Probab=99.76  E-value=1.2e-19  Score=120.15  Aligned_cols=84  Identities=30%  Similarity=0.604  Sum_probs=80.6

Q ss_pred             CeecccccccccccCHHHHHHHHHHhcCCCceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccCchhHHHH
Q psy6598           3 PFFYFPWDCKISFASATNYKNHMRIHSGEKPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQLCTLNVH   82 (154)
Q Consensus         3 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H   82 (154)
                      .|.|.  +|+|.|.-...|.+|++.|...+.|.|..  ||++|...-.|.+|+++|+|.+||+|..|+|.|.+.-+|..|
T Consensus       117 ~ftCr--vCgK~F~lQRmlnrh~kch~~vkr~lct~--cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcslesh  192 (267)
T KOG3576|consen  117 SFTCR--VCGKKFGLQRMLNRHLKCHSDVKRHLCTF--CGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESH  192 (267)
T ss_pred             eeeee--hhhhhhhHHHHHHHHhhhccHHHHHHHhh--ccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHH
Confidence            58999  99999999999999999999999999999  999999999999999999999999999999999999999999


Q ss_pred             HHhhcCcC
Q psy6598          83 KKTNHRES   90 (154)
Q Consensus        83 ~~~~~~~~   90 (154)
                      ++..|+..
T Consensus       193 l~kvhgv~  200 (267)
T KOG3576|consen  193 LKKVHGVQ  200 (267)
T ss_pred             HHHHcCch
Confidence            99888754


No 4  
>KOG3623|consensus
Probab=99.74  E-value=3.3e-19  Score=136.37  Aligned_cols=79  Identities=28%  Similarity=0.636  Sum_probs=76.7

Q ss_pred             CCeecccccccccccCHHHHHHHHHHhcCCCceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccCchhHHH
Q psy6598           2 KPFFYFPWDCKISFASATNYKNHMRIHSGEKPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQLCTLNV   81 (154)
Q Consensus         2 k~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~   81 (154)
                      -+|-|+  .|+|+|...+.|.+|.--|+|.+||.|.+  |.|+|..+-+|..|.|.|.|+|||+|+.|+|+|+...++.+
T Consensus       893 gmyaCD--qCDK~FqKqSSLaRHKYEHsGqRPyqC~i--CkKAFKHKHHLtEHkRLHSGEKPfQCdKClKRFSHSGSYSQ  968 (1007)
T KOG3623|consen  893 GMYACD--QCDKAFQKQSSLARHKYEHSGQRPYQCII--CKKAFKHKHHLTEHKRLHSGEKPFQCDKCLKRFSHSGSYSQ  968 (1007)
T ss_pred             ccchHH--HHHHHHHhhHHHHHhhhhhcCCCCcccch--hhHhhhhhhhhhhhhhhccCCCcchhhhhhhhcccccchHh
Confidence            368999  99999999999999999999999999999  99999999999999999999999999999999999999999


Q ss_pred             HHH
Q psy6598          82 HKK   84 (154)
Q Consensus        82 H~~   84 (154)
                      ||.
T Consensus       969 HMN  971 (1007)
T KOG3623|consen  969 HMN  971 (1007)
T ss_pred             hhc
Confidence            986


No 5  
>KOG1074|consensus
Probab=99.58  E-value=3.1e-16  Score=121.80  Aligned_cols=52  Identities=33%  Similarity=0.603  Sum_probs=46.4

Q ss_pred             CeecccccccccccCHHHHHHHHHHhcCCCceeecccccCcccCChhhHHHHhhhh
Q psy6598           3 PFFYFPWDCKISFASATNYKNHMRIHSGEKPYVCQVRDCQKRFTEYSSLYKHTLVH   58 (154)
Q Consensus         3 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h   58 (154)
                      +++|.  .|.++|.+.+.|+.|+|.|+|++||+|.+  ||..|.++..|..|...|
T Consensus       353 khkCr--~CakvfgS~SaLqiHlRSHTGERPfqCnv--CG~~FSTkGNLKvH~~rH  404 (958)
T KOG1074|consen  353 KHKCR--FCAKVFGSDSALQIHLRSHTGERPFQCNV--CGNRFSTKGNLKVHFQRH  404 (958)
T ss_pred             cchhh--hhHhhcCchhhhhhhhhccCCCCCeeecc--cccccccccceeeeeeec
Confidence            46799  99999999999999999999999999998  999999999998887665


No 6  
>KOG3623|consensus
Probab=99.51  E-value=5.2e-15  Score=113.78  Aligned_cols=78  Identities=27%  Similarity=0.571  Sum_probs=59.4

Q ss_pred             eecccccccccccCHHHHHHHHHHhcC-------------CCceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCc
Q psy6598           4 FFYFPWDCKISFASATNYKNHMRIHSG-------------EKPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCP   70 (154)
Q Consensus         4 ~~C~~~~C~~~f~~~~~l~~H~~~h~~-------------~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~   70 (154)
                      |.|.  .|..+|.+...|.+|+..|..             .+.|+|..  |||+|..+-+|..|+|+|.|+|||.|+-|+
T Consensus       241 fsC~--lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKCtE--CgKAFKfKHHLKEHlRIHSGEKPfeCpnCk  316 (1007)
T KOG3623|consen  241 FSCM--LCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKCTE--CGKAFKFKHHLKEHLRIHSGEKPFECPNCK  316 (1007)
T ss_pred             Ccch--hhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhccccccc--cchhhhhHHHHHhhheeecCCCCcCCcccc
Confidence            6677  788888888888888777742             25578866  888888888888888888888888888888


Q ss_pred             ccccCchhHHHHHHh
Q psy6598          71 RSYRQLCTLNVHKKT   85 (154)
Q Consensus        71 ~~f~~~~~l~~H~~~   85 (154)
                      |+|+...++..||..
T Consensus       317 KRFSHSGSySSHmSS  331 (1007)
T KOG3623|consen  317 KRFSHSGSYSSHMSS  331 (1007)
T ss_pred             cccccCCcccccccc
Confidence            888888777777654


No 7  
>KOG3576|consensus
Probab=99.48  E-value=1.5e-14  Score=96.06  Aligned_cols=85  Identities=29%  Similarity=0.555  Sum_probs=76.1

Q ss_pred             CCeecccccccccccCHHHHHHHHHHhcCCCceeecccccCcccCChhhHHHHhhh-hc----------CCCceeCCCCc
Q psy6598           2 KPFFYFPWDCKISFASATNYKNHMRIHSGEKPYVCQVRDCQKRFTEYSSLYKHTLV-HS----------DIRPFICDRCP   70 (154)
Q Consensus         2 k~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~-h~----------~~~~~~C~~C~   70 (154)
                      |.|-|.  .||+.|.+.-.|++|+++|+|.+||+|..  |+++|+.+..|..|.+. |.          ..|.|.|..||
T Consensus       144 kr~lct--~cgkgfndtfdlkrh~rthtgvrpykc~~--c~kaftqrcsleshl~kvhgv~~~yaykerr~kl~vcedcg  219 (267)
T KOG3576|consen  144 KRHLCT--FCGKGFNDTFDLKRHTRTHTGVRPYKCSL--CEKAFTQRCSLESHLKKVHGVQHQYAYKERRAKLYVCEDCG  219 (267)
T ss_pred             HHHHHh--hccCcccchhhhhhhhccccCccccchhh--hhHHHHhhccHHHHHHHHcCchHHHHHHHhhhheeeecccC
Confidence            457799  99999999999999999999999999999  99999999999999873 42          24779999999


Q ss_pred             ccccCchhHHHHHHhhcCcC
Q psy6598          71 RSYRQLCTLNVHKKTNHRES   90 (154)
Q Consensus        71 ~~f~~~~~l~~H~~~~~~~~   90 (154)
                      .+-.....+..|+..||...
T Consensus       220 ~t~~~~e~~~~h~~~~hp~S  239 (267)
T KOG3576|consen  220 YTSERPEVYYLHLKLHHPFS  239 (267)
T ss_pred             CCCCChhHHHHHHHhcCCCC
Confidence            99999999999999987653


No 8  
>PHA00733 hypothetical protein
Probab=99.40  E-value=2.2e-13  Score=86.45  Aligned_cols=83  Identities=19%  Similarity=0.297  Sum_probs=70.7

Q ss_pred             CCCeecccccccccccCHHHHHHH--HH---HhcCCCceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccC
Q psy6598           1 MKPFFYFPWDCKISFASATNYKNH--MR---IHSGEKPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQ   75 (154)
Q Consensus         1 ~k~~~C~~~~C~~~f~~~~~l~~H--~~---~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~   75 (154)
                      +|++.|.  .|++.|.....|..+  ++   .+.+.++|.|..  |++.|.....|..|++.|  ..+|.|..|++.|..
T Consensus        38 ~~~~~~~--~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~--Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~  111 (128)
T PHA00733         38 QKRLIRA--VVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPL--CLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRN  111 (128)
T ss_pred             hhhHHHH--HHhhhccChhhhcchHHHHhhcccCCCCCccCCC--CCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCC
Confidence            3678899  999998888777665  22   234578999988  999999999999999977  457999999999999


Q ss_pred             chhHHHHHHhhcCc
Q psy6598          76 LCTLNVHKKTNHRE   89 (154)
Q Consensus        76 ~~~l~~H~~~~~~~   89 (154)
                      ...|..|+...|+.
T Consensus       112 ~~sL~~H~~~~h~~  125 (128)
T PHA00733        112 TDSTLDHVCKKHNI  125 (128)
T ss_pred             HHHHHHHHHHhcCc
Confidence            99999999988764


No 9  
>KOG1074|consensus
Probab=99.35  E-value=2.1e-13  Score=106.33  Aligned_cols=85  Identities=26%  Similarity=0.529  Sum_probs=77.4

Q ss_pred             CeecccccccccccCHHHHHHHHHHhcCCCceeecccccCcccCChhhHHHHhhhhcCC----CceeCC---CCcccccC
Q psy6598           3 PFFYFPWDCKISFASATNYKNHMRIHSGEKPYVCQVRDCQKRFTEYSSLYKHTLVHSDI----RPFICD---RCPRSYRQ   75 (154)
Q Consensus         3 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~----~~~~C~---~C~~~f~~   75 (154)
                      |-+|.  .|-++..-.+.|+.|++.|+|+|||+|.+  ||++|+++..|+.|+-+|...    .+|.|+   +|-+.|..
T Consensus       605 PNqCi--iC~rVlSC~saLqmHyrtHtGERPFkCKi--CgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn  680 (958)
T KOG1074|consen  605 PNQCI--ICLRVLSCPSALQMHYRTHTGERPFKCKI--CGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTN  680 (958)
T ss_pred             cccee--eeeecccchhhhhhhhhcccCcCcccccc--ccchhccccchhhcccccccCccccccccCCchhhhcccccc
Confidence            45799  99999999999999999999999999999  999999999999999988643    458999   99999999


Q ss_pred             chhHHHHHHhhcCcCC
Q psy6598          76 LCTLNVHKKTNHRESK   91 (154)
Q Consensus        76 ~~~l~~H~~~~~~~~~   91 (154)
                      .-.|..|++.|.+...
T Consensus       681 ~V~lpQhIriH~~~~~  696 (958)
T KOG1074|consen  681 AVTLPQHIRIHLGGQI  696 (958)
T ss_pred             cccccceEEeecCCCC
Confidence            9999999999985543


No 10 
>KOG3608|consensus
Probab=99.29  E-value=2.2e-12  Score=92.32  Aligned_cols=79  Identities=32%  Similarity=0.565  Sum_probs=47.8

Q ss_pred             eecccccccccccCHHHHHHHHH-HhcCCCceeecccccCcccCChhhHHHHhhhhcCCCceeCCC--CcccccCchhHH
Q psy6598           4 FFYFPWDCKISFASATNYKNHMR-IHSGEKPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDR--CPRSYRQLCTLN   80 (154)
Q Consensus         4 ~~C~~~~C~~~f~~~~~l~~H~~-~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~--C~~~f~~~~~l~   80 (154)
                      |+|+  .|+.+....+.|.+|++ .|..++||+|+.  |.+.|.+.+.|.+|...|. +.-|.|..  |...|.+...|.
T Consensus       264 ykCp--lCdmtc~~~ssL~~H~r~rHs~dkpfKCd~--Cd~~c~~esdL~kH~~~HS-~~~y~C~h~~C~~s~r~~~q~~  338 (467)
T KOG3608|consen  264 YKCP--LCDMTCSSASSLTTHIRYRHSKDKPFKCDE--CDTRCVRESDLAKHVQVHS-KTVYQCEHPDCHYSVRTYTQMR  338 (467)
T ss_pred             cccc--ccccCCCChHHHHHHHHhhhccCCCccccc--hhhhhccHHHHHHHHHhcc-ccceecCCCCCcHHHHHHHHHH
Confidence            5555  55555555555666655 355566666665  6666666666666666555 34466655  666666666666


Q ss_pred             HHHHhhc
Q psy6598          81 VHKKTNH   87 (154)
Q Consensus        81 ~H~~~~~   87 (154)
                      +|++.+|
T Consensus       339 ~H~~evh  345 (467)
T KOG3608|consen  339 RHFLEVH  345 (467)
T ss_pred             HHHHHhc
Confidence            6666655


No 11 
>KOG3608|consensus
Probab=99.24  E-value=1.2e-11  Score=88.67  Aligned_cols=88  Identities=23%  Similarity=0.448  Sum_probs=79.4

Q ss_pred             CCCeecccccccccccCHHHHHHHHHHhcCCCceeecccccCcccCChhhHHHHhhhhc-C--CCceeCCCCcccccCch
Q psy6598           1 MKPFFYFPWDCKISFASATNYKNHMRIHSGEKPYVCQVRDCQKRFTEYSSLYKHTLVHS-D--IRPFICDRCPRSYRQLC   77 (154)
Q Consensus         1 ~k~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~-~--~~~~~C~~C~~~f~~~~   77 (154)
                      .|||+|+  .|++.|...+.|..|...|. +..|.|..++|...|.....+++|++.+. |  +.+|.|-.|.+.|.+..
T Consensus       290 dkpfKCd--~Cd~~c~~esdL~kH~~~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~  366 (467)
T KOG3608|consen  290 DKPFKCD--ECDTRCVRESDLAKHVQVHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGK  366 (467)
T ss_pred             CCCcccc--chhhhhccHHHHHHHHHhcc-ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccch
Confidence            4899999  99999999999999999998 67899999999999999999999998644 4  46799999999999999


Q ss_pred             hHHHHHHhhcCcCC
Q psy6598          78 TLNVHKKTNHRESK   91 (154)
Q Consensus        78 ~l~~H~~~~~~~~~   91 (154)
                      +|..|++..|+-+.
T Consensus       367 ~L~~HL~kkH~f~~  380 (467)
T KOG3608|consen  367 SLSAHLMKKHGFRL  380 (467)
T ss_pred             hHHHHHHHhhcccC
Confidence            99999998776543


No 12 
>PHA02768 hypothetical protein; Provisional
Probab=99.19  E-value=9e-12  Score=66.29  Aligned_cols=42  Identities=19%  Similarity=0.411  Sum_probs=25.5

Q ss_pred             eeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccCchhH
Q psy6598          34 YVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQLCTL   79 (154)
Q Consensus        34 ~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l   79 (154)
                      |.|+.  ||+.|...+.|.+|+++|+  ++|+|..|++.|.+.+.|
T Consensus         6 y~C~~--CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l   47 (55)
T PHA02768          6 YECPI--CGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEY   47 (55)
T ss_pred             cCcch--hCCeeccHHHHHHHHHhcC--CcccCCcccceeccccee
Confidence            55655  6666666666666666665  456666666666655444


No 13 
>PHA02768 hypothetical protein; Provisional
Probab=99.09  E-value=5.2e-11  Score=63.39  Aligned_cols=44  Identities=11%  Similarity=0.156  Sum_probs=39.6

Q ss_pred             CeecccccccccccCHHHHHHHHHHhcCCCceeecccccCcccCChhhHH
Q psy6598           3 PFFYFPWDCKISFASATNYKNHMRIHSGEKPYVCQVRDCQKRFTEYSSLY   52 (154)
Q Consensus         3 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~   52 (154)
                      .|.|+  .||+.|...+.|..|+++|+  ++|+|..  |++.|...+.|.
T Consensus         5 ~y~C~--~CGK~Fs~~~~L~~H~r~H~--k~~kc~~--C~k~f~~~s~l~   48 (55)
T PHA02768          5 GYECP--ICGEIYIKRKSMITHLRKHN--TNLKLSN--CKRISLRTGEYI   48 (55)
T ss_pred             ccCcc--hhCCeeccHHHHHHHHHhcC--CcccCCc--ccceecccceeE
Confidence            48999  99999999999999999998  6899977  999999877664


No 14 
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.06  E-value=4e-10  Score=86.67  Aligned_cols=92  Identities=16%  Similarity=0.338  Sum_probs=75.4

Q ss_pred             CCCeecccccccccccCHHHHHHHHHHhcCCCceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccC-----
Q psy6598           1 MKPFFYFPWDCKISFASATNYKNHMRIHSGEKPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQ-----   75 (154)
Q Consensus         1 ~k~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~-----   75 (154)
                      ++.+.|+  .|++.|. ...|..|+..|+  +++.|.   ||+.+ .+..|..|+.+|.+.+++.|.+|++.|..     
T Consensus       451 ~~H~~C~--~Cgk~f~-~s~LekH~~~~H--kpv~Cp---Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~  521 (567)
T PLN03086        451 KNHVHCE--KCGQAFQ-QGEMEKHMKVFH--EPLQCP---CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAM  521 (567)
T ss_pred             ccCccCC--CCCCccc-hHHHHHHHHhcC--CCccCC---CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCcccc
Confidence            3567899  9999995 688999999875  789994   99755 66899999999999999999999999952     


Q ss_pred             -----chhHHHHHHhhcCcCCCCCCCCCcchh
Q psy6598          76 -----LCTLNVHKKTNHRESKNKKPSNRNNIV  102 (154)
Q Consensus        76 -----~~~l~~H~~~~~~~~~~~~~~~~~~~~  102 (154)
                           ...|..|+..+ +.+...|..|...+.
T Consensus       522 d~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk~Vr  552 (567)
T PLN03086        522 DVRDRLRGMSEHESIC-GSRTAPCDSCGRSVM  552 (567)
T ss_pred             chhhhhhhHHHHHHhc-CCcceEccccCCeee
Confidence                 35799999884 777777776665443


No 15 
>PF13465 zf-H2C2_2:  Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=99.03  E-value=2.1e-10  Score=52.44  Aligned_cols=24  Identities=29%  Similarity=0.764  Sum_probs=12.6

Q ss_pred             HHHHhhhhcCCCceeCCCCccccc
Q psy6598          51 LYKHTLVHSDIRPFICDRCPRSYR   74 (154)
Q Consensus        51 l~~H~~~h~~~~~~~C~~C~~~f~   74 (154)
                      |.+|+++|++++||.|+.|++.|.
T Consensus         2 l~~H~~~H~~~k~~~C~~C~k~F~   25 (26)
T PF13465_consen    2 LRRHMRTHTGEKPYKCPYCGKSFS   25 (26)
T ss_dssp             HHHHHHHHSSSSSEEESSSSEEES
T ss_pred             HHHHhhhcCCCCCCCCCCCcCeeC
Confidence            445555555555555555555553


No 16 
>KOG3993|consensus
Probab=98.90  E-value=7.8e-10  Score=81.04  Aligned_cols=86  Identities=24%  Similarity=0.400  Sum_probs=71.8

Q ss_pred             eecccccccccccCHHHHHHHHHHhcCCCceeecccccCcccCChhhHHHHhhhhcC-----------------------
Q psy6598           4 FFYFPWDCKISFASATNYKNHMRIHSGEKPYVCQVRDCQKRFTEYSSLYKHTLVHSD-----------------------   60 (154)
Q Consensus         4 ~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~-----------------------   60 (154)
                      |.|.  .|...|.+.-.|.+|.-...-..-|+|..  |+|.|....+|..|.|+|..                       
T Consensus       268 yiCq--LCK~kYeD~F~LAQHrC~RIV~vEYrCPE--C~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~  343 (500)
T KOG3993|consen  268 YICQ--LCKEKYEDAFALAQHRCPRIVHVEYRCPE--CDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQ  343 (500)
T ss_pred             HHHH--HHHHhhhhHHHHhhccCCeeEEeeecCCc--ccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhh
Confidence            7899  99999999999999965433334599976  99999999999999999931                       


Q ss_pred             ----------CCceeCCCCcccccCchhHHHHHHhhcCcCCCC
Q psy6598          61 ----------IRPFICDRCPRSYRQLCTLNVHKKTNHRESKNK   93 (154)
Q Consensus        61 ----------~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~   93 (154)
                                +-.|.|..|+|.|.+...|++|+.+|+......
T Consensus       344 ea~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k  386 (500)
T KOG3993|consen  344 EAERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAK  386 (500)
T ss_pred             hccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccch
Confidence                      123999999999999999999999988665444


No 17 
>PF13465 zf-H2C2_2:  Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.88  E-value=2.2e-09  Score=49.03  Aligned_cols=26  Identities=54%  Similarity=1.138  Sum_probs=23.8

Q ss_pred             HHHHHHHHhcCCCceeecccccCcccCC
Q psy6598          20 NYKNHMRIHSGEKPYVCQVRDCQKRFTE   47 (154)
Q Consensus        20 ~l~~H~~~h~~~~~~~C~~~~C~~~f~~   47 (154)
                      +|.+|++.|+|++||.|..  |++.|..
T Consensus         1 ~l~~H~~~H~~~k~~~C~~--C~k~F~~   26 (26)
T PF13465_consen    1 NLRRHMRTHTGEKPYKCPY--CGKSFSN   26 (26)
T ss_dssp             HHHHHHHHHSSSSSEEESS--SSEEESS
T ss_pred             CHHHHhhhcCCCCCCCCCC--CcCeeCc
Confidence            5899999999999999988  9999964


No 18 
>PHA00733 hypothetical protein
Probab=98.82  E-value=6e-09  Score=66.19  Aligned_cols=54  Identities=30%  Similarity=0.585  Sum_probs=47.9

Q ss_pred             CCCeecccccccccccCHHHHHHHHHHhcCCCceeecccccCcccCChhhHHHHhhhhcC
Q psy6598           1 MKPFFYFPWDCKISFASATNYKNHMRIHSGEKPYVCQVRDCQKRFTEYSSLYKHTLVHSD   60 (154)
Q Consensus         1 ~k~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~   60 (154)
                      ++||.|.  .||+.|.+...|..|++.|  ..+|.|..  |++.|.....|..|+...++
T Consensus        71 ~kPy~C~--~Cgk~Fss~s~L~~H~r~h--~~~~~C~~--CgK~F~~~~sL~~H~~~~h~  124 (128)
T PHA00733         71 VSPYVCP--LCLMPFSSSVSLKQHIRYT--EHSKVCPV--CGKEFRNTDSTLDHVCKKHN  124 (128)
T ss_pred             CCCccCC--CCCCcCCCHHHHHHHHhcC--CcCccCCC--CCCccCCHHHHHHHHHHhcC
Confidence            3689999  9999999999999999987  35799988  99999999999999886543


No 19 
>PHA00616 hypothetical protein
Probab=98.76  E-value=4.5e-09  Score=53.36  Aligned_cols=34  Identities=9%  Similarity=0.127  Sum_probs=28.6

Q ss_pred             CeecccccccccccCHHHHHHHHHHhcCCCceeecc
Q psy6598           3 PFFYFPWDCKISFASATNYKNHMRIHSGEKPYVCQV   38 (154)
Q Consensus         3 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~   38 (154)
                      ||+|.  .||+.|..++.|..|++.|+|++++.|++
T Consensus         1 pYqC~--~CG~~F~~~s~l~~H~r~~hg~~~~~~~~   34 (44)
T PHA00616          1 MYQCL--RCGGIFRKKKEVIEHLLSVHKQNKLTLEY   34 (44)
T ss_pred             CCccc--hhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence            68888  88888888888888888888888888764


No 20 
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.72  E-value=2.6e-08  Score=76.86  Aligned_cols=79  Identities=18%  Similarity=0.379  Sum_probs=65.9

Q ss_pred             CCeecccccccccccCHHHHHHHHHHhcCCCceeecccccCcccCC----------hhhHHHHhhhhcCCCceeCCCCcc
Q psy6598           2 KPFFYFPWDCKISFASATNYKNHMRIHSGEKPYVCQVRDCQKRFTE----------YSSLYKHTLVHSDIRPFICDRCPR   71 (154)
Q Consensus         2 k~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~----------~~~l~~H~~~h~~~~~~~C~~C~~   71 (154)
                      +++.|+   ||+.+ ....|..|+..|.+.+++.|.+  |++.|..          ...|..|...+ |.+++.|..||+
T Consensus       477 kpv~Cp---Cg~~~-~R~~L~~H~~thCp~Kpi~C~f--C~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk  549 (567)
T PLN03086        477 EPLQCP---CGVVL-EKEQMVQHQASTCPLRLITCRF--CGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDSCGR  549 (567)
T ss_pred             CCccCC---CCCCc-chhHHHhhhhccCCCCceeCCC--CCCccccCccccchhhhhhhHHHHHHhc-CCcceEccccCC
Confidence            577886   99755 6789999999999999999998  9999852          35799998885 899999999999


Q ss_pred             cccCchhHHHHHHhhcC
Q psy6598          72 SYRQLCTLNVHKKTNHR   88 (154)
Q Consensus        72 ~f~~~~~l~~H~~~~~~   88 (154)
                      .|..+ .|..|+...|.
T Consensus       550 ~Vrlr-dm~~H~~~~h~  565 (567)
T PLN03086        550 SVMLK-EMDIHQIAVHQ  565 (567)
T ss_pred             eeeeh-hHHHHHHHhhc
Confidence            87655 57888877664


No 21 
>PHA00732 hypothetical protein
Probab=98.53  E-value=6.4e-08  Score=56.16  Aligned_cols=48  Identities=27%  Similarity=0.406  Sum_probs=39.1

Q ss_pred             CeecccccccccccCHHHHHHHHHH-hcCCCceeecccccCcccCChhhHHHHhhhhcC
Q psy6598           3 PFFYFPWDCKISFASATNYKNHMRI-HSGEKPYVCQVRDCQKRFTEYSSLYKHTLVHSD   60 (154)
Q Consensus         3 ~~~C~~~~C~~~f~~~~~l~~H~~~-h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~   60 (154)
                      ||.|.  .||+.|.+...|..|++. |.+   +.|..  |++.|.   .|..|.+++..
T Consensus         1 py~C~--~Cgk~F~s~s~Lk~H~r~~H~~---~~C~~--CgKsF~---~l~~H~~~~~~   49 (79)
T PHA00732          1 MFKCP--ICGFTTVTLFALKQHARRNHTL---TKCPV--CNKSYR---RLNQHFYSQYD   49 (79)
T ss_pred             CccCC--CCCCccCCHHHHHHHhhcccCC---CccCC--CCCEeC---ChhhhhcccCC
Confidence            68899  999999999999999984 653   57987  999998   47888876543


No 22 
>KOG3993|consensus
Probab=98.49  E-value=8.2e-08  Score=70.70  Aligned_cols=52  Identities=29%  Similarity=0.578  Sum_probs=45.9

Q ss_pred             CeecccccccccccCHHHHHHHHHHhcC--------CC-------------------------ceeecccccCcccCChh
Q psy6598           3 PFFYFPWDCKISFASATNYKNHMRIHSG--------EK-------------------------PYVCQVRDCQKRFTEYS   49 (154)
Q Consensus         3 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~--------~~-------------------------~~~C~~~~C~~~f~~~~   49 (154)
                      -|+|.  .|+|+|.-..+|..|+|.|..        ..                         .|.|..  |++.|.+..
T Consensus       295 EYrCP--EC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea~rsg~dss~gi~~C~~--C~KkFrRqA  370 (500)
T KOG3993|consen  295 EYRCP--ECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEAERSGDDSSSGIFSCHT--CGKKFRRQA  370 (500)
T ss_pred             eecCC--cccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhccccCCcccCceeecHH--hhhhhHHHH
Confidence            38999  999999999999999999942        11                         289998  999999999


Q ss_pred             hHHHHhhhh
Q psy6598          50 SLYKHTLVH   58 (154)
Q Consensus        50 ~l~~H~~~h   58 (154)
                      .|+.|+.+|
T Consensus       371 YLrKHqlth  379 (500)
T KOG3993|consen  371 YLRKHQLTH  379 (500)
T ss_pred             HHHHhHHhh
Confidence            999998877


No 23 
>PHA00732 hypothetical protein
Probab=98.46  E-value=1.6e-07  Score=54.49  Aligned_cols=46  Identities=28%  Similarity=0.501  Sum_probs=39.1

Q ss_pred             ceeecccccCcccCChhhHHHHhhh-hcCCCceeCCCCcccccCchhHHHHHHhh
Q psy6598          33 PYVCQVRDCQKRFTEYSSLYKHTLV-HSDIRPFICDRCPRSYRQLCTLNVHKKTN   86 (154)
Q Consensus        33 ~~~C~~~~C~~~f~~~~~l~~H~~~-h~~~~~~~C~~C~~~f~~~~~l~~H~~~~   86 (154)
                      ||.|..  |++.|.....|..|++. |.+   +.|+.|++.|.   .|..|.++.
T Consensus         1 py~C~~--Cgk~F~s~s~Lk~H~r~~H~~---~~C~~CgKsF~---~l~~H~~~~   47 (79)
T PHA00732          1 MFKCPI--CGFTTVTLFALKQHARRNHTL---TKCPVCNKSYR---RLNQHFYSQ   47 (79)
T ss_pred             CccCCC--CCCccCCHHHHHHHhhcccCC---CccCCCCCEeC---Chhhhhccc
Confidence            588988  99999999999999984 653   68999999998   588888653


No 24 
>PHA00616 hypothetical protein
Probab=98.44  E-value=7.1e-08  Score=48.99  Aligned_cols=34  Identities=18%  Similarity=0.314  Sum_probs=23.3

Q ss_pred             ceeecccccCcccCChhhHHHHhhhhcCCCceeCCC
Q psy6598          33 PYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDR   68 (154)
Q Consensus        33 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~   68 (154)
                      ||.|..  ||+.|..+..|.+|++.|++++++.|+.
T Consensus         1 pYqC~~--CG~~F~~~s~l~~H~r~~hg~~~~~~~~   34 (44)
T PHA00616          1 MYQCLR--CGGIFRKKKEVIEHLLSVHKQNKLTLEY   34 (44)
T ss_pred             CCccch--hhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence            466755  7777777777777777777777776654


No 25 
>PF00096 zf-C2H2:  Zinc finger, C2H2 type;  InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain.  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=98.34  E-value=3.7e-07  Score=40.28  Aligned_cols=22  Identities=23%  Similarity=0.602  Sum_probs=15.1

Q ss_pred             eeCCCCcccccCchhHHHHHHh
Q psy6598          64 FICDRCPRSYRQLCTLNVHKKT   85 (154)
Q Consensus        64 ~~C~~C~~~f~~~~~l~~H~~~   85 (154)
                      |.|+.|++.|.....|..|++.
T Consensus         1 y~C~~C~~~f~~~~~l~~H~~~   22 (23)
T PF00096_consen    1 YKCPICGKSFSSKSNLKRHMRR   22 (23)
T ss_dssp             EEETTTTEEESSHHHHHHHHHH
T ss_pred             CCCCCCCCccCCHHHHHHHHhH
Confidence            5666777777777777777665


No 26 
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=98.29  E-value=2.6e-07  Score=65.84  Aligned_cols=54  Identities=35%  Similarity=0.818  Sum_probs=44.8

Q ss_pred             CCceeecccccCcccCChhhHHHHhhh-h------------------cCCCceeCCCCcccccCchhHHHHHH
Q psy6598          31 EKPYVCQVRDCQKRFTEYSSLYKHTLV-H------------------SDIRPFICDRCPRSYRQLCTLNVHKK   84 (154)
Q Consensus        31 ~~~~~C~~~~C~~~f~~~~~l~~H~~~-h------------------~~~~~~~C~~C~~~f~~~~~l~~H~~   84 (154)
                      +|||+|.+++|.|.|+....|.-|+.- |                  ...|||.|.+|+|.|+..-.|+.|+.
T Consensus       347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr~  419 (423)
T COG5189         347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHRK  419 (423)
T ss_pred             CceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceeccc
Confidence            488999888999999988888888762 3                  12489999999999999999988865


No 27 
>PF00096 zf-C2H2:  Zinc finger, C2H2 type;  InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain.  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=98.25  E-value=1.1e-06  Score=38.75  Aligned_cols=23  Identities=43%  Similarity=0.602  Sum_probs=17.0

Q ss_pred             eecccccccccccCHHHHHHHHHHh
Q psy6598           4 FFYFPWDCKISFASATNYKNHMRIH   28 (154)
Q Consensus         4 ~~C~~~~C~~~f~~~~~l~~H~~~h   28 (154)
                      |+|+  .|++.|.+...|..|++.|
T Consensus         1 y~C~--~C~~~f~~~~~l~~H~~~H   23 (23)
T PF00096_consen    1 YKCP--ICGKSFSSKSNLKRHMRRH   23 (23)
T ss_dssp             EEET--TTTEEESSHHHHHHHHHHH
T ss_pred             CCCC--CCCCccCCHHHHHHHHhHC
Confidence            5677  7777777777777777654


No 28 
>PF05605 zf-Di19:  Drought induced 19 protein (Di19), zinc-binding;  InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.23  E-value=2.2e-06  Score=46.22  Aligned_cols=49  Identities=27%  Similarity=0.442  Sum_probs=24.4

Q ss_pred             eeecccccCcccCChhhHHHHhhh-hcC-CCceeCCCCcccccCchhHHHHHHhhc
Q psy6598          34 YVCQVRDCQKRFTEYSSLYKHTLV-HSD-IRPFICDRCPRSYRQLCTLNVHKKTNH   87 (154)
Q Consensus        34 ~~C~~~~C~~~f~~~~~l~~H~~~-h~~-~~~~~C~~C~~~f~~~~~l~~H~~~~~   87 (154)
                      |.|.+  |++ ......|..|... |.. .+.+.|++|...+.  .+|..|+..+|
T Consensus         3 f~CP~--C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H   53 (54)
T PF05605_consen    3 FTCPY--CGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH   53 (54)
T ss_pred             cCCCC--CCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence            45555  555 3334455555442 332 23455666655433  35666665554


No 29 
>PF13894 zf-C2H2_4:  C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=98.21  E-value=1.2e-06  Score=38.77  Aligned_cols=24  Identities=38%  Similarity=0.778  Sum_probs=16.0

Q ss_pred             eeCCCCcccccCchhHHHHHHhhc
Q psy6598          64 FICDRCPRSYRQLCTLNVHKKTNH   87 (154)
Q Consensus        64 ~~C~~C~~~f~~~~~l~~H~~~~~   87 (154)
                      |.|++|++.|.....|..|++++|
T Consensus         1 ~~C~~C~~~~~~~~~l~~H~~~~H   24 (24)
T PF13894_consen    1 FQCPICGKSFRSKSELRQHMRTHH   24 (24)
T ss_dssp             EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred             CCCcCCCCcCCcHHHHHHHHHhhC
Confidence            567777777777777777777664


No 30 
>PF13912 zf-C2H2_6:  C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=98.11  E-value=1.8e-06  Score=39.63  Aligned_cols=25  Identities=36%  Similarity=0.907  Sum_probs=17.3

Q ss_pred             ceeCCCCcccccCchhHHHHHHhhc
Q psy6598          63 PFICDRCPRSYRQLCTLNVHKKTNH   87 (154)
Q Consensus        63 ~~~C~~C~~~f~~~~~l~~H~~~~~   87 (154)
                      ||.|..|++.|.....|..|++.|+
T Consensus         1 ~~~C~~C~~~F~~~~~l~~H~~~h~   25 (27)
T PF13912_consen    1 PFECDECGKTFSSLSALREHKRSHC   25 (27)
T ss_dssp             SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred             CCCCCccCCccCChhHHHHHhHHhc
Confidence            4667777777777777777776654


No 31 
>PF12756 zf-C2H2_2:  C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=98.05  E-value=3.3e-06  Score=51.10  Aligned_cols=74  Identities=23%  Similarity=0.348  Sum_probs=22.4

Q ss_pred             ecccccccccccCHHHHHHHHHHhcCCCceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccCchhHHHHHH
Q psy6598           5 FYFPWDCKISFASATNYKNHMRIHSGEKPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQLCTLNVHKK   84 (154)
Q Consensus         5 ~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~   84 (154)
                      +|.  .|+..|.+...|..|+...++-..-      ....+.....+..+.+.-. ...+.|..|++.|.....|..||+
T Consensus         1 ~C~--~C~~~f~~~~~l~~H~~~~H~~~~~------~~~~l~~~~~~~~~~~~~~-~~~~~C~~C~~~f~s~~~l~~Hm~   71 (100)
T PF12756_consen    1 QCL--FCDESFSSVDDLLQHMKKKHGFDIP------DQKYLVDPNRLLNYLRKKV-KESFRCPYCNKTFRSREALQEHMR   71 (100)
T ss_dssp             ---------------------------------------------------------SSEEBSSSS-EESSHHHHHHHHH
T ss_pred             Ccc--ccccccccccccccccccccccccc------ccccccccccccccccccc-CCCCCCCccCCCCcCHHHHHHHHc
Confidence            488  9999999999999999754432211      1122233444444443221 226999999999999999999999


Q ss_pred             hhc
Q psy6598          85 TNH   87 (154)
Q Consensus        85 ~~~   87 (154)
                      .++
T Consensus        72 ~~~   74 (100)
T PF12756_consen   72 SKH   74 (100)
T ss_dssp             HTT
T ss_pred             Ccc
Confidence            753


No 32 
>PF05605 zf-Di19:  Drought induced 19 protein (Di19), zinc-binding;  InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.00  E-value=1.2e-05  Score=43.22  Aligned_cols=50  Identities=28%  Similarity=0.418  Sum_probs=39.8

Q ss_pred             CeecccccccccccCHHHHHHHHH-HhcCC-CceeecccccCcccCChhhHHHHhhhhc
Q psy6598           3 PFFYFPWDCKISFASATNYKNHMR-IHSGE-KPYVCQVRDCQKRFTEYSSLYKHTLVHS   59 (154)
Q Consensus         3 ~~~C~~~~C~~~f~~~~~l~~H~~-~h~~~-~~~~C~~~~C~~~f~~~~~l~~H~~~h~   59 (154)
                      .|.|+  +|++ ..+...|..|.. .|..+ +.+.|.+  |...+.  ..|.+|+..+.
T Consensus         2 ~f~CP--~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPi--C~~~~~--~~l~~Hl~~~H   53 (54)
T PF05605_consen    2 SFTCP--YCGK-GFSESSLVEHCEDEHRSESKNVVCPI--CSSRVT--DNLIRHLNSQH   53 (54)
T ss_pred             CcCCC--CCCC-ccCHHHHHHHHHhHCcCCCCCccCCC--chhhhh--hHHHHHHHHhc
Confidence            58999  9999 667889999977 46553 5799988  998655  48999988654


No 33 
>PF13912 zf-C2H2_6:  C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.90  E-value=1e-05  Score=37.04  Aligned_cols=25  Identities=32%  Similarity=0.610  Sum_probs=16.8

Q ss_pred             CeecccccccccccCHHHHHHHHHHhc
Q psy6598           3 PFFYFPWDCKISFASATNYKNHMRIHS   29 (154)
Q Consensus         3 ~~~C~~~~C~~~f~~~~~l~~H~~~h~   29 (154)
                      +|+|.  .|++.|.+...|..|++.|.
T Consensus         1 ~~~C~--~C~~~F~~~~~l~~H~~~h~   25 (27)
T PF13912_consen    1 PFECD--ECGKTFSSLSALREHKRSHC   25 (27)
T ss_dssp             SEEET--TTTEEESSHHHHHHHHCTTT
T ss_pred             CCCCC--ccCCccCChhHHHHHhHHhc
Confidence            46677  77777777777777766553


No 34 
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.86  E-value=4.7e-06  Score=59.64  Aligned_cols=54  Identities=31%  Similarity=0.575  Sum_probs=46.6

Q ss_pred             CCCeecccccccccccCHHHHHHHHH-Hhc------------------CCCceeecccccCcccCChhhHHHHhh
Q psy6598           1 MKPFFYFPWDCKISFASATNYKNHMR-IHS------------------GEKPYVCQVRDCQKRFTEYSSLYKHTL   56 (154)
Q Consensus         1 ~k~~~C~~~~C~~~f~~~~~l~~H~~-~h~------------------~~~~~~C~~~~C~~~f~~~~~l~~H~~   56 (154)
                      +|||+|+-..|.|+|+....|+.|+. -|.                  ..|||.|.+  |+|.|+....|.-|+.
T Consensus       347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCev--C~KRYKNlNGLKYHr~  419 (423)
T COG5189         347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEV--CDKRYKNLNGLKYHRK  419 (423)
T ss_pred             CceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccc--cchhhccCccceeccc
Confidence            58999998899999999999999976 341                  248999999  9999999999988864


No 35 
>PF13894 zf-C2H2_4:  C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.86  E-value=1.8e-05  Score=34.86  Aligned_cols=23  Identities=39%  Similarity=0.598  Sum_probs=14.7

Q ss_pred             eecccccccccccCHHHHHHHHHHh
Q psy6598           4 FFYFPWDCKISFASATNYKNHMRIH   28 (154)
Q Consensus         4 ~~C~~~~C~~~f~~~~~l~~H~~~h   28 (154)
                      |.|+  .|++.|.+...|..|++.|
T Consensus         1 ~~C~--~C~~~~~~~~~l~~H~~~~   23 (24)
T PF13894_consen    1 FQCP--ICGKSFRSKSELRQHMRTH   23 (24)
T ss_dssp             EE-S--STS-EESSHHHHHHHHHHH
T ss_pred             CCCc--CCCCcCCcHHHHHHHHHhh
Confidence            5677  7777777777777777654


No 36 
>PF09237 GAGA:  GAGA factor;  InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=97.63  E-value=4.8e-05  Score=39.59  Aligned_cols=40  Identities=23%  Similarity=0.277  Sum_probs=23.5

Q ss_pred             HHHhhh-hcCCCceeCCCCcccccCchhHHHHHHhhcCcCC
Q psy6598          52 YKHTLV-HSDIRPFICDRCPRSYRQLCTLNVHKKTNHRESK   91 (154)
Q Consensus        52 ~~H~~~-h~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~   91 (154)
                      ..+.+. +..+.|-.|++|+..+.+..+|.+|+.+.|+.++
T Consensus        12 ~~~~k~~~~S~~PatCP~C~a~~~~srnLrRHle~~H~~k~   52 (54)
T PF09237_consen   12 TKKPKSKSQSEQPATCPICGAVIRQSRNLRRHLEIRHFKKP   52 (54)
T ss_dssp             ----CCCCTTS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred             hhHHHHhhccCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence            334433 3456788999999999999999999988887654


No 37 
>smart00355 ZnF_C2H2 zinc finger.
Probab=97.45  E-value=0.00011  Score=32.68  Aligned_cols=23  Identities=22%  Similarity=0.667  Sum_probs=13.9

Q ss_pred             eeCCCCcccccCchhHHHHHHhh
Q psy6598          64 FICDRCPRSYRQLCTLNVHKKTN   86 (154)
Q Consensus        64 ~~C~~C~~~f~~~~~l~~H~~~~   86 (154)
                      |.|..|++.|.....|..|++.|
T Consensus         1 ~~C~~C~~~f~~~~~l~~H~~~H   23 (26)
T smart00355        1 YRCPECGKVFKSKSALKEHMRTH   23 (26)
T ss_pred             CCCCCCcchhCCHHHHHHHHHHh
Confidence            35666666666666666666543


No 38 
>PF09237 GAGA:  GAGA factor;  InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=97.37  E-value=0.00026  Score=36.89  Aligned_cols=33  Identities=21%  Similarity=0.495  Sum_probs=20.3

Q ss_pred             cCCCceeecccccCcccCChhhHHHHhhhhcCCCc
Q psy6598          29 SGEKPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRP   63 (154)
Q Consensus        29 ~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~   63 (154)
                      ..+.|..|.+  |+..+.....|.+|+..+++.||
T Consensus        20 ~S~~PatCP~--C~a~~~~srnLrRHle~~H~~k~   52 (54)
T PF09237_consen   20 QSEQPATCPI--CGAVIRQSRNLRRHLEIRHFKKP   52 (54)
T ss_dssp             TTS--EE-TT--T--EESSHHHHHHHHHHHTTTS-
T ss_pred             ccCCCCCCCc--chhhccchhhHHHHHHHHhcccC
Confidence            3456788877  88888888888888877666654


No 39 
>PF13909 zf-H2C2_5:  C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=97.36  E-value=0.00018  Score=31.82  Aligned_cols=23  Identities=30%  Similarity=0.580  Sum_probs=14.4

Q ss_pred             eeCCCCcccccCchhHHHHHHhhc
Q psy6598          64 FICDRCPRSYRQLCTLNVHKKTNH   87 (154)
Q Consensus        64 ~~C~~C~~~f~~~~~l~~H~~~~~   87 (154)
                      |+|+.|+.... ...|..|++.+|
T Consensus         1 y~C~~C~y~t~-~~~l~~H~~~~H   23 (24)
T PF13909_consen    1 YKCPHCSYSTS-KSNLKRHLKRHH   23 (24)
T ss_dssp             EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred             CCCCCCCCcCC-HHHHHHHHHhhC
Confidence            56777776666 667777777665


No 40 
>smart00355 ZnF_C2H2 zinc finger.
Probab=97.33  E-value=0.00024  Score=31.55  Aligned_cols=24  Identities=33%  Similarity=0.588  Sum_probs=16.2

Q ss_pred             eecccccccccccCHHHHHHHHHHhc
Q psy6598           4 FFYFPWDCKISFASATNYKNHMRIHS   29 (154)
Q Consensus         4 ~~C~~~~C~~~f~~~~~l~~H~~~h~   29 (154)
                      |.|.  .|++.|.....|..|++.|.
T Consensus         1 ~~C~--~C~~~f~~~~~l~~H~~~H~   24 (26)
T smart00355        1 YRCP--ECGKVFKSKSALKEHMRTHX   24 (26)
T ss_pred             CCCC--CCcchhCCHHHHHHHHHHhc
Confidence            4566  77777777777777766553


No 41 
>PRK04860 hypothetical protein; Provisional
Probab=97.22  E-value=0.00026  Score=46.72  Aligned_cols=39  Identities=26%  Similarity=0.554  Sum_probs=32.9

Q ss_pred             ceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccCch
Q psy6598          33 PYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQLC   77 (154)
Q Consensus        33 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~   77 (154)
                      +|.|.   |+.   ....+.+|.++|+++++|.|..|+..|....
T Consensus       119 ~Y~C~---C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~~  157 (160)
T PRK04860        119 PYRCK---CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFKG  157 (160)
T ss_pred             EEEcC---CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEec
Confidence            58885   886   6778899999999999999999999887653


No 42 
>PF12874 zf-met:  Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=97.16  E-value=0.00029  Score=31.39  Aligned_cols=22  Identities=32%  Similarity=0.735  Sum_probs=13.9

Q ss_pred             eeCCCCcccccCchhHHHHHHh
Q psy6598          64 FICDRCPRSYRQLCTLNVHKKT   85 (154)
Q Consensus        64 ~~C~~C~~~f~~~~~l~~H~~~   85 (154)
                      |.|.+|++.|.....|..|++.
T Consensus         1 ~~C~~C~~~f~s~~~~~~H~~s   22 (25)
T PF12874_consen    1 FYCDICNKSFSSENSLRQHLRS   22 (25)
T ss_dssp             EEETTTTEEESSHHHHHHHHTT
T ss_pred             CCCCCCCCCcCCHHHHHHHHCc
Confidence            4566666666666666666654


No 43 
>PF12874 zf-met:  Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=97.07  E-value=0.0005  Score=30.58  Aligned_cols=22  Identities=32%  Similarity=0.644  Sum_probs=14.3

Q ss_pred             eecccccccccccCHHHHHHHHHH
Q psy6598           4 FFYFPWDCKISFASATNYKNHMRI   27 (154)
Q Consensus         4 ~~C~~~~C~~~f~~~~~l~~H~~~   27 (154)
                      |.|.  .|++.|.+...|..|++.
T Consensus         1 ~~C~--~C~~~f~s~~~~~~H~~s   22 (25)
T PF12874_consen    1 FYCD--ICNKSFSSENSLRQHLRS   22 (25)
T ss_dssp             EEET--TTTEEESSHHHHHHHHTT
T ss_pred             CCCC--CCCCCcCCHHHHHHHHCc
Confidence            4566  666666666666666654


No 44 
>PF13909 zf-H2C2_5:  C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.86  E-value=0.0013  Score=28.95  Aligned_cols=23  Identities=26%  Similarity=0.482  Sum_probs=14.3

Q ss_pred             eecccccccccccCHHHHHHHHHHhc
Q psy6598           4 FFYFPWDCKISFASATNYKNHMRIHS   29 (154)
Q Consensus         4 ~~C~~~~C~~~f~~~~~l~~H~~~h~   29 (154)
                      |+|.  .|+.... ...|..|++.|.
T Consensus         1 y~C~--~C~y~t~-~~~l~~H~~~~H   23 (24)
T PF13909_consen    1 YKCP--HCSYSTS-KSNLKRHLKRHH   23 (24)
T ss_dssp             EE-S--SSS-EES-HHHHHHHHHHHH
T ss_pred             CCCC--CCCCcCC-HHHHHHHHHhhC
Confidence            5677  7776666 667777776653


No 45 
>PF12171 zf-C2H2_jaz:  Zinc-finger double-stranded RNA-binding;  InterPro: IPR022755  This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation.   This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=96.52  E-value=0.00069  Score=30.82  Aligned_cols=22  Identities=32%  Similarity=0.706  Sum_probs=13.3

Q ss_pred             eeCCCCcccccCchhHHHHHHh
Q psy6598          64 FICDRCPRSYRQLCTLNVHKKT   85 (154)
Q Consensus        64 ~~C~~C~~~f~~~~~l~~H~~~   85 (154)
                      |.|..|++.|.....|..|++.
T Consensus         2 ~~C~~C~k~f~~~~~~~~H~~s   23 (27)
T PF12171_consen    2 FYCDACDKYFSSENQLKQHMKS   23 (27)
T ss_dssp             CBBTTTTBBBSSHHHHHCCTTS
T ss_pred             CCcccCCCCcCCHHHHHHHHcc
Confidence            4566666666666666666543


No 46 
>PF13913 zf-C2HC_2:  zinc-finger of a C2HC-type
Probab=96.28  E-value=0.0035  Score=27.97  Aligned_cols=21  Identities=29%  Similarity=0.536  Sum_probs=14.6

Q ss_pred             eeCCCCcccccCchhHHHHHHh
Q psy6598          64 FICDRCPRSYRQLCTLNVHKKT   85 (154)
Q Consensus        64 ~~C~~C~~~f~~~~~l~~H~~~   85 (154)
                      ..|+.||+.| ..+.|..|+..
T Consensus         3 ~~C~~CgR~F-~~~~l~~H~~~   23 (25)
T PF13913_consen    3 VPCPICGRKF-NPDRLEKHEKI   23 (25)
T ss_pred             CcCCCCCCEE-CHHHHHHHHHh
Confidence            4677777777 56677777654


No 47 
>PF12171 zf-C2H2_jaz:  Zinc-finger double-stranded RNA-binding;  InterPro: IPR022755  This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation.   This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=96.26  E-value=0.0011  Score=30.17  Aligned_cols=21  Identities=33%  Similarity=0.686  Sum_probs=13.9

Q ss_pred             eecccccccccccCHHHHHHHHH
Q psy6598           4 FFYFPWDCKISFASATNYKNHMR   26 (154)
Q Consensus         4 ~~C~~~~C~~~f~~~~~l~~H~~   26 (154)
                      |.|.  .|++.|.+...|..|++
T Consensus         2 ~~C~--~C~k~f~~~~~~~~H~~   22 (27)
T PF12171_consen    2 FYCD--ACDKYFSSENQLKQHMK   22 (27)
T ss_dssp             CBBT--TTTBBBSSHHHHHCCTT
T ss_pred             CCcc--cCCCCcCCHHHHHHHHc
Confidence            5566  66766666666666655


No 48 
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=95.96  E-value=0.0018  Score=48.82  Aligned_cols=57  Identities=39%  Similarity=0.747  Sum_probs=26.8

Q ss_pred             cccccccCHHHHHHHHH--HhcCC--CceeecccccCcccCChhhHHHHhhhhcCCCceeC
Q psy6598          10 DCKISFASATNYKNHMR--IHSGE--KPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFIC   66 (154)
Q Consensus        10 ~C~~~f~~~~~l~~H~~--~h~~~--~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C   66 (154)
                      .|...|.....|.+|.+  .|.++  +++.|.+..|++.|.....+.+|...|.+..++.+
T Consensus       294 ~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  354 (467)
T COG5048         294 QCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKE  354 (467)
T ss_pred             cccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCcccc
Confidence            44445544444555544  44444  44544411155555555555555444444444444


No 49 
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=95.50  E-value=0.055  Score=39.86  Aligned_cols=83  Identities=20%  Similarity=0.341  Sum_probs=60.3

Q ss_pred             ecccccccccccCHHHHHHHHHHhcCCCceeecccc--cCcccCChhhHHHHhhhhcCCCceeCCC--C--c--ccccCc
Q psy6598           5 FYFPWDCKISFASATNYKNHMRIHSGEKPYVCQVRD--CQKRFTEYSSLYKHTLVHSDIRPFICDR--C--P--RSYRQL   76 (154)
Q Consensus         5 ~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~--C~~~f~~~~~l~~H~~~h~~~~~~~C~~--C--~--~~f~~~   76 (154)
                      .|.  .|.+.|-+...|..|++..+ ++.|.|+..+  -...|.....|.+|.+.-    -|.|.+  |  |  ..|...
T Consensus       222 ~C~--FC~~~FYdDDEL~~HcR~~H-E~ChICD~v~p~~~QYFK~Y~~Le~HF~~~----hy~ct~qtc~~~k~~vf~~~  294 (493)
T COG5236         222 LCI--FCKIYFYDDDELRRHCRLRH-EACHICDMVGPIRYQYFKSYEDLEAHFRNA----HYCCTFQTCRVGKCYVFPYH  294 (493)
T ss_pred             hhh--hccceecChHHHHHHHHhhh-hhhhhhhccCccchhhhhCHHHHHHHhhcC----ceEEEEEEEecCcEEEeccH
Confidence            488  99999999999999998543 5677776511  113688889999998732    266653  4  2  368889


Q ss_pred             hhHHHHHHhhcCcCCCCC
Q psy6598          77 CTLNVHKKTNHRESKNKK   94 (154)
Q Consensus        77 ~~l~~H~~~~~~~~~~~~   94 (154)
                      ..|..|+-..|+......
T Consensus       295 ~el~~h~~~~h~~~~~~~  312 (493)
T COG5236         295 TELLEHLTRFHKVNARLS  312 (493)
T ss_pred             HHHHHHHHHHhhcccccC
Confidence            999999988887654443


No 50 
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=95.43  E-value=0.012  Score=28.26  Aligned_cols=23  Identities=22%  Similarity=0.534  Sum_probs=18.0

Q ss_pred             ceeCCCCcccccCchhHHHHHHh
Q psy6598          63 PFICDRCPRSYRQLCTLNVHKKT   85 (154)
Q Consensus        63 ~~~C~~C~~~f~~~~~l~~H~~~   85 (154)
                      +|.|.+|++.|.....+..|+..
T Consensus         3 ~~~C~~C~~~~~~~~~~~~H~~g   25 (35)
T smart00451        3 GFYCKLCNVTFTDEISVEAHLKG   25 (35)
T ss_pred             CeEccccCCccCCHHHHHHHHCh
Confidence            47788888888888888888764


No 51 
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=95.19  E-value=0.0066  Score=45.74  Aligned_cols=62  Identities=26%  Similarity=0.405  Sum_probs=55.3

Q ss_pred             CceeecccccCcccCChhhHHHHhh--hhcCC--CceeCC--CCcccccCchhHHHHHHhhcCcCCCCCC
Q psy6598          32 KPYVCQVRDCQKRFTEYSSLYKHTL--VHSDI--RPFICD--RCPRSYRQLCTLNVHKKTNHRESKNKKP   95 (154)
Q Consensus        32 ~~~~C~~~~C~~~f~~~~~l~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~l~~H~~~~~~~~~~~~~   95 (154)
                      .++.|..  |...|.....|.+|.+  .|.++  +++.|+  .|++.|.+...+..|...|.+.......
T Consensus       288 ~~~~~~~--~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  355 (467)
T COG5048         288 LPIKSKQ--CNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEK  355 (467)
T ss_pred             cCCCCcc--ccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccc
Confidence            3678877  9999999999999999  89999  999999  7999999999999999999887755443


No 52 
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=95.03  E-value=0.021  Score=27.30  Aligned_cols=22  Identities=18%  Similarity=0.596  Sum_probs=15.5

Q ss_pred             CeecccccccccccCHHHHHHHHH
Q psy6598           3 PFFYFPWDCKISFASATNYKNHMR   26 (154)
Q Consensus         3 ~~~C~~~~C~~~f~~~~~l~~H~~   26 (154)
                      +|.|+  .|++.|.+...+..|+.
T Consensus         3 ~~~C~--~C~~~~~~~~~~~~H~~   24 (35)
T smart00451        3 GFYCK--LCNVTFTDEISVEAHLK   24 (35)
T ss_pred             CeEcc--ccCCccCCHHHHHHHHC
Confidence            46677  77777777777777765


No 53 
>KOG1146|consensus
Probab=94.69  E-value=0.013  Score=49.77  Aligned_cols=75  Identities=20%  Similarity=0.325  Sum_probs=47.6

Q ss_pred             cccccccCHHHHHHHHH-HhcCCCceeecccccCcccCChhhHHHHhhh-h------------------------cCCCc
Q psy6598          10 DCKISFASATNYKNHMR-IHSGEKPYVCQVRDCQKRFTEYSSLYKHTLV-H------------------------SDIRP   63 (154)
Q Consensus        10 ~C~~~f~~~~~l~~H~~-~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~-h------------------------~~~~~   63 (154)
                      .|+..+..+..+-.|+. .|.-.+.|+|..  |+..|+....|..|+|. |                        .+.++
T Consensus       441 ~~e~~~~s~r~~~~~t~~L~S~~kt~~cpk--c~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~p  518 (1406)
T KOG1146|consen  441 KAEPLLESKRSLEGQTVVLHSFFKTLKCPK--CNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGKP  518 (1406)
T ss_pred             chhhhhhhhcccccceeeeecccccccCCc--cchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCCc
Confidence            34444444444444433 233345566644  66666666666666654 1                        23478


Q ss_pred             eeCCCCcccccCchhHHHHHHhh
Q psy6598          64 FICDRCPRSYRQLCTLNVHKKTN   86 (154)
Q Consensus        64 ~~C~~C~~~f~~~~~l~~H~~~~   86 (154)
                      |.|..|...+....+|..||+.-
T Consensus       519 ~~C~~C~~stttng~LsihlqS~  541 (1406)
T KOG1146|consen  519 YPCRACNYSTTTNGNLSIHLQSD  541 (1406)
T ss_pred             ccceeeeeeeecchHHHHHHHHH
Confidence            99999999999999999998853


No 54 
>PF13913 zf-C2HC_2:  zinc-finger of a C2HC-type
Probab=94.54  E-value=0.04  Score=24.48  Aligned_cols=20  Identities=15%  Similarity=0.227  Sum_probs=14.0

Q ss_pred             eecccccccccccCHHHHHHHHH
Q psy6598           4 FFYFPWDCKISFASATNYKNHMR   26 (154)
Q Consensus         4 ~~C~~~~C~~~f~~~~~l~~H~~   26 (154)
                      .+|.  .||+.| ....|..|+.
T Consensus         3 ~~C~--~CgR~F-~~~~l~~H~~   22 (25)
T PF13913_consen    3 VPCP--ICGRKF-NPDRLEKHEK   22 (25)
T ss_pred             CcCC--CCCCEE-CHHHHHHHHH
Confidence            4577  888888 4566777765


No 55 
>KOG2482|consensus
Probab=94.16  E-value=0.071  Score=39.20  Aligned_cols=23  Identities=26%  Similarity=0.434  Sum_probs=20.9

Q ss_pred             CeecccccccccccCHHHHHHHHHH
Q psy6598           3 PFFYFPWDCKISFASATNYKNHMRI   27 (154)
Q Consensus         3 ~~~C~~~~C~~~f~~~~~l~~H~~~   27 (154)
                      .+.|-  .|.+.|..+..|..||+.
T Consensus       195 r~~CL--yCekifrdkntLkeHMrk  217 (423)
T KOG2482|consen  195 RLRCL--YCEKIFRDKNTLKEHMRK  217 (423)
T ss_pred             hheee--eeccccCCcHHHHHHHHh
Confidence            47899  999999999999999985


No 56 
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=94.06  E-value=0.038  Score=40.66  Aligned_cols=27  Identities=19%  Similarity=0.252  Sum_probs=19.2

Q ss_pred             CeecccccccccccCHHHHHHHHHHhc
Q psy6598           3 PFFYFPWDCKISFASATNYKNHMRIHS   29 (154)
Q Consensus         3 ~~~C~~~~C~~~f~~~~~l~~H~~~h~   29 (154)
                      .|.|+...|..+......|..|.+..+
T Consensus       151 ~F~CP~skc~~~C~~~k~lk~H~K~~H  177 (493)
T COG5236         151 SFKCPKSKCHRRCGSLKELKKHYKAQH  177 (493)
T ss_pred             HhcCCchhhhhhhhhHHHHHHHHHhhc
Confidence            367777777777777777888877543


No 57 
>KOG1146|consensus
Probab=93.81  E-value=0.016  Score=49.29  Aligned_cols=81  Identities=15%  Similarity=0.230  Sum_probs=58.0

Q ss_pred             eecccccccccccCHHHHHHHHHHhcCCCceeecccccCcccCChhhHHHHhhh-----------------hcCCCceeC
Q psy6598           4 FFYFPWDCKISFASATNYKNHMRIHSGEKPYVCQVRDCQKRFTEYSSLYKHTLV-----------------HSDIRPFIC   66 (154)
Q Consensus         4 ~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~-----------------h~~~~~~~C   66 (154)
                      +.|.  .|++.|.....+. |+..   ..+|.|..  |...|.....|..|.+.                 +....+| |
T Consensus      1261 ~~c~--~~~~~~~~~~~~~-~l~~---~~~~~~~~--~~~~~~~~~~l~~~~~k~~~~~~~~~~~~~~~l~~~d~~~~-c 1331 (1406)
T KOG1146|consen 1261 GECG--AVDELLTPSFGIS-TLDV---THRYLCRQ--CKMAFDGEAPLTAHQRKFCFAGRGSGGSMPPPLRVPDCTYH-C 1331 (1406)
T ss_pred             chhh--hccccccCcccee-eccc---chhHHHHH--HHhhhcchhHHHHHHHHHHhccCccccCCCCcccCcccccc-c
Confidence            5577  7777776666665 5443   34577877  88888888888877642                 2223456 9


Q ss_pred             CCCcccccCchhHHHHHHhhcCcCCCC
Q psy6598          67 DRCPRSYRQLCTLNVHKKTNHRESKNK   93 (154)
Q Consensus        67 ~~C~~~f~~~~~l~~H~~~~~~~~~~~   93 (154)
                      ..|...|.....|..||+..++..+..
T Consensus      1332 ~~c~~~~~~~~alqihm~~~~~~~kt~ 1358 (1406)
T KOG1146|consen 1332 LACEVLLSGREALQIHMRSSAHRRKTA 1358 (1406)
T ss_pred             hHHHhhcchhHHHHHHHHHhhhcccCC
Confidence            999999999999999999876655443


No 58 
>KOG2893|consensus
Probab=93.72  E-value=0.021  Score=39.79  Aligned_cols=52  Identities=27%  Similarity=0.586  Sum_probs=34.6

Q ss_pred             CCceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccCchhHHHHHHhhcCc
Q psy6598          31 EKPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQLCTLNVHKKTNHRE   89 (154)
Q Consensus        31 ~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~   89 (154)
                      .++| |=+  |++.|....-|.+|++    .|-|+|.+|.|..-+...|..|-...|++
T Consensus         9 ~kpw-cwy--cnrefddekiliqhqk----akhfkchichkkl~sgpglsihcmqvhke   60 (341)
T KOG2893|consen    9 DKPW-CWY--CNREFDDEKILIQHQK----AKHFKCHICHKKLFSGPGLSIHCMQVHKE   60 (341)
T ss_pred             CCce-eee--cccccchhhhhhhhhh----hccceeeeehhhhccCCCceeehhhhhhh
Confidence            3455 545  7788888777777766    33488888877776777777775544444


No 59 
>PF12013 DUF3505:  Protein of unknown function (DUF3505);  InterPro: IPR022698  This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains. 
Probab=93.59  E-value=0.21  Score=30.78  Aligned_cols=25  Identities=24%  Similarity=0.449  Sum_probs=23.4

Q ss_pred             eeC----CCCcccccCchhHHHHHHhhcC
Q psy6598          64 FIC----DRCPRSYRQLCTLNVHKKTNHR   88 (154)
Q Consensus        64 ~~C----~~C~~~f~~~~~l~~H~~~~~~   88 (154)
                      |.|    ..|+....+...+..|.+.+|+
T Consensus        81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg  109 (109)
T PF12013_consen   81 YRCQCDPPHCGYITRSKKTMRKHWRKEHG  109 (109)
T ss_pred             eeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence            999    9999999999999999999875


No 60 
>KOG2231|consensus
Probab=93.58  E-value=0.084  Score=42.40  Aligned_cols=13  Identities=62%  Similarity=0.825  Sum_probs=9.3

Q ss_pred             CHHHHHHHHH-Hhc
Q psy6598          17 SATNYKNHMR-IHS   29 (154)
Q Consensus        17 ~~~~l~~H~~-~h~   29 (154)
                      ....|+.|+. .|.
T Consensus       126 s~~~Lk~H~~~~H~  139 (669)
T KOG2231|consen  126 SVENLKNHMRDQHK  139 (669)
T ss_pred             HHHHHHHHHHHhhh
Confidence            6778888884 453


No 61 
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=93.40  E-value=0.04  Score=29.33  Aligned_cols=33  Identities=18%  Similarity=0.228  Sum_probs=23.7

Q ss_pred             hcCCCceeCCCCcccccCchhHHHHHHhhcCcC
Q psy6598          58 HSDIRPFICDRCPRSYRQLCTLNVHKKTNHRES   90 (154)
Q Consensus        58 h~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~   90 (154)
                      ..|+..+.|+-||..|.......+|....|+-.
T Consensus        12 RDGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~~~   44 (65)
T COG4049          12 RDGEEFLRCPRCGMVFRRRKDYIRHVNKAHGWL   44 (65)
T ss_pred             cCCceeeeCCchhHHHHHhHHHHHHhhHHhhhh
Confidence            446667788888888888888888877665543


No 62 
>PF12756 zf-C2H2_2:  C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=93.37  E-value=0.098  Score=31.17  Aligned_cols=20  Identities=30%  Similarity=0.610  Sum_probs=10.0

Q ss_pred             eecccccCcccCChhhHHHHhh
Q psy6598          35 VCQVRDCQKRFTEYSSLYKHTL   56 (154)
Q Consensus        35 ~C~~~~C~~~f~~~~~l~~H~~   56 (154)
                      .|.+  |++.|.+...|..|++
T Consensus        52 ~C~~--C~~~f~s~~~l~~Hm~   71 (100)
T PF12756_consen   52 RCPY--CNKTFRSREALQEHMR   71 (100)
T ss_dssp             EBSS--SS-EESSHHHHHHHHH
T ss_pred             CCCc--cCCCCcCHHHHHHHHc
Confidence            4544  5555555555555554


No 63 
>KOG4173|consensus
Probab=92.93  E-value=0.065  Score=36.59  Aligned_cols=81  Identities=21%  Similarity=0.382  Sum_probs=61.1

Q ss_pred             eecccccccccccCHHHHHHHHHHhcCCCceeecccccCcccCChhhHHHHhh-hh---------cCCCceeCC--CCcc
Q psy6598           4 FFYFPWDCKISFASATNYKNHMRIHSGEKPYVCQVRDCQKRFTEYSSLYKHTL-VH---------SDIRPFICD--RCPR   71 (154)
Q Consensus         4 ~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~-~h---------~~~~~~~C~--~C~~   71 (154)
                      |-|+-..|...|........|...-++   -.|.+  |.+.|.+.-.|..|+. +|         .|.--|.|-  .|+-
T Consensus        80 ~~cqvagc~~~~d~lD~~E~hY~~~h~---~sCs~--C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~  154 (253)
T KOG4173|consen   80 FACQVAGCCQVFDALDDYEHHYHTLHG---NSCSF--CKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTE  154 (253)
T ss_pred             ccccccchHHHHhhhhhHHHhhhhccc---chhHH--HHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhh
Confidence            557655777888888888888764333   25988  9999999999999976 34         233448885  4999


Q ss_pred             cccCchhHHHHHHhhcCc
Q psy6598          72 SYRQLCTLNVHKKTNHRE   89 (154)
Q Consensus        72 ~f~~~~~l~~H~~~~~~~   89 (154)
                      .|.+......||-..|.-
T Consensus       155 KFkT~r~RkdH~I~~Hk~  172 (253)
T KOG4173|consen  155 KFKTSRDRKDHMIRMHKY  172 (253)
T ss_pred             hhhhhhhhhhHHHHhccC
Confidence            999999999998887754


No 64 
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer.  Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain.  Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=92.42  E-value=0.1  Score=24.78  Aligned_cols=10  Identities=30%  Similarity=1.059  Sum_probs=5.0

Q ss_pred             CceeCCCCcc
Q psy6598          62 RPFICDRCPR   71 (154)
Q Consensus        62 ~~~~C~~C~~   71 (154)
                      .++.|+.||.
T Consensus        16 ~~~~CP~Cg~   25 (33)
T cd00350          16 APWVCPVCGA   25 (33)
T ss_pred             CCCcCcCCCC
Confidence            3455555543


No 65 
>KOG2231|consensus
Probab=92.04  E-value=0.21  Score=40.23  Aligned_cols=71  Identities=18%  Similarity=0.435  Sum_probs=49.0

Q ss_pred             ecccccccccccCHHHHHHHHHHhcCCCceeecc----cccCcccCChhhHHHHhhhhcCCCceeCC--CCc-ccccCch
Q psy6598           5 FYFPWDCKISFASATNYKNHMRIHSGEKPYVCQV----RDCQKRFTEYSSLYKHTLVHSDIRPFICD--RCP-RSYRQLC   77 (154)
Q Consensus         5 ~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~----~~C~~~f~~~~~l~~H~~~h~~~~~~~C~--~C~-~~f~~~~   77 (154)
                      .|.  .|...|.....|.+|++.++    |.|.+    .+++..|.....|..|.+.++    |.|.  .|. +.|....
T Consensus       184 ~C~--~C~~~fld~~el~rH~~~~h----~~chfC~~~~~~neyy~~~~dLe~HfR~~H----flCE~~~C~~~~f~~~~  253 (669)
T KOG2231|consen  184 LCK--FCHERFLDDDELYRHLRFDH----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH----FLCEEEFCRTKKFYVAF  253 (669)
T ss_pred             cch--hhhhhhccHHHHHHhhccce----eheeecCcccccchhcccchHHHHHhhhcC----ccccccccccceeeehh
Confidence            377  89999999999999988654    44544    123456777788988888543    7787  564 4555555


Q ss_pred             hHHHHHHh
Q psy6598          78 TLNVHKKT   85 (154)
Q Consensus        78 ~l~~H~~~   85 (154)
                      .+..|++.
T Consensus       254 ~~ei~lk~  261 (669)
T KOG2231|consen  254 ELEIELKA  261 (669)
T ss_pred             HHHHHHHh
Confidence            66666663


No 66 
>KOG2893|consensus
Probab=92.01  E-value=0.052  Score=37.94  Aligned_cols=45  Identities=27%  Similarity=0.614  Sum_probs=36.8

Q ss_pred             cccccccccCHHHHHHHHHHhcCCCceeecccccCcccCChhhHHHHhh-hh
Q psy6598           8 PWDCKISFASATNYKNHMRIHSGEKPYVCQVRDCQKRFTEYSSLYKHTL-VH   58 (154)
Q Consensus         8 ~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~-~h   58 (154)
                      +|+|++-|.+..-|..|++.    +-|+|.+  |-+...+-..|..|.. +|
T Consensus        13 cwycnrefddekiliqhqka----khfkchi--chkkl~sgpglsihcmqvh   58 (341)
T KOG2893|consen   13 CWYCNREFDDEKILIQHQKA----KHFKCHI--CHKKLFSGPGLSIHCMQVH   58 (341)
T ss_pred             eeecccccchhhhhhhhhhh----ccceeee--ehhhhccCCCceeehhhhh
Confidence            34999999999999999885    4489999  9998888888887754 44


No 67 
>PRK04860 hypothetical protein; Provisional
Probab=91.03  E-value=0.2  Score=33.17  Aligned_cols=36  Identities=19%  Similarity=0.434  Sum_probs=30.7

Q ss_pred             ceeCCCCcccccCchhHHHHHHhhcCcCCCCCCCCCcchh
Q psy6598          63 PFICDRCPRSYRQLCTLNVHKKTNHRESKNKKPSNRNNIV  102 (154)
Q Consensus        63 ~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~~~~~~~  102 (154)
                      +|.|. |+.   ....+.+|.++|.++.++.+..|...+.
T Consensus       119 ~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~  154 (160)
T PRK04860        119 PYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLV  154 (160)
T ss_pred             EEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeE
Confidence            69998 997   7788999999999999999988776543


No 68 
>PF02892 zf-BED:  BED zinc finger;  InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=90.74  E-value=0.3  Score=24.75  Aligned_cols=26  Identities=23%  Similarity=0.397  Sum_probs=14.0

Q ss_pred             CCCceeCCCCcccccCc----hhHHHHHHh
Q psy6598          60 DIRPFICDRCPRSYRQL----CTLNVHKKT   85 (154)
Q Consensus        60 ~~~~~~C~~C~~~f~~~----~~l~~H~~~   85 (154)
                      +.....|..|++.+...    ++|..|++.
T Consensus        13 ~~~~a~C~~C~~~~~~~~~~ts~l~~HL~~   42 (45)
T PF02892_consen   13 DKKKAKCKYCGKVIKYSSGGTSNLKRHLKK   42 (45)
T ss_dssp             CSS-EEETTTTEE-----SSTHHHHHHHHH
T ss_pred             CcCeEEeCCCCeEEeeCCCcHHHHHHhhhh
Confidence            34456777777776653    567777744


No 69 
>KOG4173|consensus
Probab=90.12  E-value=0.078  Score=36.20  Aligned_cols=53  Identities=25%  Similarity=0.500  Sum_probs=44.8

Q ss_pred             ecccccccccccCHHHHHHHHH-Hhc---------CCCceeecccccCcccCChhhHHHHhh-hhc
Q psy6598           5 FYFPWDCKISFASATNYKNHMR-IHS---------GEKPYVCQVRDCQKRFTEYSSLYKHTL-VHS   59 (154)
Q Consensus         5 ~C~~~~C~~~f~~~~~l~~H~~-~h~---------~~~~~~C~~~~C~~~f~~~~~l~~H~~-~h~   59 (154)
                      .|.  .|.+.|.+...|..|+. .|.         |...|.|-+++|+..|.+...-..|+- .|.
T Consensus       108 sCs--~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~KFkT~r~RkdH~I~~Hk  171 (253)
T KOG4173|consen  108 SCS--FCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTEKFKTSRDRKDHMIRMHK  171 (253)
T ss_pred             hhH--HHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhhhhhhhhhhhhHHHHhcc
Confidence            688  99999999999999976 452         556799999999999999999889975 564


No 70 
>PF12013 DUF3505:  Protein of unknown function (DUF3505);  InterPro: IPR022698  This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains. 
Probab=89.99  E-value=0.93  Score=27.85  Aligned_cols=25  Identities=20%  Similarity=0.288  Sum_probs=21.7

Q ss_pred             eee----cccccCcccCChhhHHHHhhhhcC
Q psy6598          34 YVC----QVRDCQKRFTEYSSLYKHTLVHSD   60 (154)
Q Consensus        34 ~~C----~~~~C~~~f~~~~~l~~H~~~h~~   60 (154)
                      |.|    ..  |+..+.+...+.+|.+.++|
T Consensus        81 ~~C~~~~~~--C~y~~~~~~~m~~H~~~~Hg  109 (109)
T PF12013_consen   81 YRCQCDPPH--CGYITRSKKTMRKHWRKEHG  109 (109)
T ss_pred             eeeecCCCC--CCcEeccHHHHHHHHHHhcC
Confidence            899    66  99999999999999987653


No 71 
>KOG2482|consensus
Probab=88.99  E-value=0.41  Score=35.37  Aligned_cols=81  Identities=25%  Similarity=0.428  Sum_probs=58.6

Q ss_pred             eeccccccccccc-CHHHHHHHHH-Hhc---C------------------CCceeecccccCcccCChhhHHHHhhh--h
Q psy6598           4 FFYFPWDCKISFA-SATNYKNHMR-IHS---G------------------EKPYVCQVRDCQKRFTEYSSLYKHTLV--H   58 (154)
Q Consensus         4 ~~C~~~~C~~~f~-~~~~l~~H~~-~h~---~------------------~~~~~C~~~~C~~~f~~~~~l~~H~~~--h   58 (154)
                      .+|-  .|..-+. ..+..-.|+- .|.   |                  -..+.|-+  |.+.|..+..|+.||+.  |
T Consensus       145 lqCl--FCn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLy--CekifrdkntLkeHMrkK~H  220 (423)
T KOG2482|consen  145 LQCL--FCNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLY--CEKIFRDKNTLKEHMRKKRH  220 (423)
T ss_pred             eEEE--EecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeee--eccccCCcHHHHHHHHhccC
Confidence            4677  8876664 3455555643 342   1                  12388988  99999999999999983  5


Q ss_pred             cCCCc------------------------------------------------------eeCCCCcccccCchhHHHHHH
Q psy6598          59 SDIRP------------------------------------------------------FICDRCPRSYRQLCTLNVHKK   84 (154)
Q Consensus        59 ~~~~~------------------------------------------------------~~C~~C~~~f~~~~~l~~H~~   84 (154)
                      ....|                                                      ..|-+|....-....|..||+
T Consensus       221 rrinPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~ed~a~a~~v~CLfC~~~~en~~~l~eHmk  300 (423)
T KOG2482|consen  221 RRINPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNEDDAEALSVVCLFCTNFYENPVFLFEHMK  300 (423)
T ss_pred             cccCCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhcCCCCccceEEEeeccchhhHHHHHHHHH
Confidence            22111                                                      589999998888999999999


Q ss_pred             hhcC
Q psy6598          85 TNHR   88 (154)
Q Consensus        85 ~~~~   88 (154)
                      ..|.
T Consensus       301 ~vHe  304 (423)
T KOG2482|consen  301 IVHE  304 (423)
T ss_pred             HHHH
Confidence            8874


No 72 
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=88.58  E-value=0.43  Score=24.93  Aligned_cols=21  Identities=19%  Similarity=0.433  Sum_probs=13.6

Q ss_pred             eeCCCCcccccCc-----hhHHHHHH
Q psy6598          64 FICDRCPRSYRQL-----CTLNVHKK   84 (154)
Q Consensus        64 ~~C~~C~~~f~~~-----~~l~~H~~   84 (154)
                      -.|..|++.+...     ++|.+|++
T Consensus        19 a~C~~C~~~l~~~~~~gTs~L~rHl~   44 (50)
T smart00614       19 AKCKYCGKKLSRSSKGGTSNLRRHLR   44 (50)
T ss_pred             EEecCCCCEeeeCCCCCcHHHHHHHH
Confidence            5667777766554     46777766


No 73 
>PF05443 ROS_MUCR:  ROS/MUCR transcriptional regulator protein;  InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=86.64  E-value=0.46  Score=30.41  Aligned_cols=26  Identities=27%  Similarity=0.600  Sum_probs=16.6

Q ss_pred             CceeCCCCcccccCchhHHHHHHhhcCcC
Q psy6598          62 RPFICDRCPRSYRQLCTLNVHKKTNHRES   90 (154)
Q Consensus        62 ~~~~C~~C~~~f~~~~~l~~H~~~~~~~~   90 (154)
                      .-..|-+||+.|..   |.+|++.|||..
T Consensus        71 d~i~clecGk~~k~---LkrHL~~~~glt   96 (132)
T PF05443_consen   71 DYIICLECGKKFKT---LKRHLRTHHGLT   96 (132)
T ss_dssp             S-EE-TBT--EESB---HHHHHHHTT-S-
T ss_pred             CeeEEccCCcccch---HHHHHHHccCCC
Confidence            34899999998865   599999998753


No 74 
>KOG2186|consensus
Probab=85.92  E-value=0.56  Score=33.18  Aligned_cols=47  Identities=26%  Similarity=0.362  Sum_probs=33.6

Q ss_pred             eecccccccccccCHHHHHHHHHHhcCCCceeecccccCcccCChhhHHHHhhh
Q psy6598           4 FFYFPWDCKISFASATNYKNHMRIHSGEKPYVCQVRDCQKRFTEYSSLYKHTLV   57 (154)
Q Consensus         4 ~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~   57 (154)
                      |.|.  .||...... .+.+|+..-.+ ..|.|.-  |++.|.. ..+..|..-
T Consensus         4 FtCn--vCgEsvKKp-~vekH~srCrn-~~fSCID--C~k~F~~-~sYknH~kC   50 (276)
T KOG2186|consen    4 FTCN--VCGESVKKP-QVEKHMSRCRN-AYFSCID--CGKTFER-VSYKNHTKC   50 (276)
T ss_pred             Eehh--hhhhhcccc-chHHHHHhccC-CeeEEee--ccccccc-chhhhhhhh
Confidence            6788  888776544 46668776555 6688876  8888888 666777653


No 75 
>KOG2785|consensus
Probab=85.70  E-value=1.2  Score=33.45  Aligned_cols=74  Identities=22%  Similarity=0.253  Sum_probs=47.7

Q ss_pred             CeecccccccccccCHHHHHHHHHHhcCCCceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCc---ccccCchhH
Q psy6598           3 PFFYFPWDCKISFASATNYKNHMRIHSGEKPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCP---RSYRQLCTL   79 (154)
Q Consensus         3 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~---~~f~~~~~l   79 (154)
                      |-.|-  .|++.|.+...-..||..++|-  |.-+.    ...+....|..-+..-. ..-|.|-.|+   +.|.+..+.
T Consensus       166 Pt~CL--fC~~~~k~~e~~~~HM~~~Hgf--fIPdr----eYL~D~~GLl~YLgeKV-~~~~~CL~CN~~~~~f~sleav  236 (390)
T KOG2785|consen  166 PTDCL--FCDKKSKSLEENLKHMFKEHGF--FIPDR----EYLTDEKGLLKYLGEKV-GIGFICLFCNELGRPFSSLEAV  236 (390)
T ss_pred             Cccee--ecCCCcccHHHHHHHHhhccCC--cCCch----HhhhchhHHHHHHHHHh-ccCceEEEeccccCcccccHHH
Confidence            34688  9999999999999999987763  22111    11222223333222111 2347888888   889999999


Q ss_pred             HHHHHh
Q psy6598          80 NVHKKT   85 (154)
Q Consensus        80 ~~H~~~   85 (154)
                      ..||..
T Consensus       237 r~HM~~  242 (390)
T KOG2785|consen  237 RAHMRD  242 (390)
T ss_pred             HHHHhh
Confidence            999875


No 76 
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and  believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=85.13  E-value=0.67  Score=22.16  Aligned_cols=9  Identities=33%  Similarity=0.715  Sum_probs=4.9

Q ss_pred             ceeCCCCcc
Q psy6598          63 PFICDRCPR   71 (154)
Q Consensus        63 ~~~C~~C~~   71 (154)
                      |..|+.||.
T Consensus        18 p~~CP~Cg~   26 (34)
T cd00729          18 PEKCPICGA   26 (34)
T ss_pred             CCcCcCCCC
Confidence            445666653


No 77 
>KOG2785|consensus
Probab=84.84  E-value=1.1  Score=33.67  Aligned_cols=22  Identities=32%  Similarity=0.563  Sum_probs=16.4

Q ss_pred             CeecccccccccccCHHHHHHHHH
Q psy6598           3 PFFYFPWDCKISFASATNYKNHMR   26 (154)
Q Consensus         3 ~~~C~~~~C~~~f~~~~~l~~H~~   26 (154)
                      ++.|.  .|.+.|.+......|+.
T Consensus        68 ~~~c~--~c~k~~~s~~a~~~hl~   89 (390)
T KOG2785|consen   68 VVYCE--ACNKSFASPKAHENHLK   89 (390)
T ss_pred             ceehH--HhhccccChhhHHHHHH
Confidence            56788  88888887777776654


No 78 
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=84.16  E-value=1.6  Score=28.91  Aligned_cols=39  Identities=23%  Similarity=0.395  Sum_probs=29.1

Q ss_pred             hcCCCceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccCch
Q psy6598          28 HSGEKPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQLC   77 (154)
Q Consensus        28 h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~   77 (154)
                      .....-|.|..  |+..|+....+.         .-|.|+.||.......
T Consensus       104 e~~~~~Y~Cp~--c~~r~tf~eA~~---------~~F~Cp~Cg~~L~~~d  142 (158)
T TIGR00373       104 ETNNMFFICPN--MCVRFTFNEAME---------LNFTCPRCGAMLDYLD  142 (158)
T ss_pred             ccCCCeEECCC--CCcEeeHHHHHH---------cCCcCCCCCCEeeecc
Confidence            34556799965  999999988875         2499999998755443


No 79 
>PF09986 DUF2225:  Uncharacterized protein conserved in bacteria (DUF2225);  InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=83.43  E-value=0.43  Score=33.20  Aligned_cols=13  Identities=15%  Similarity=0.534  Sum_probs=10.0

Q ss_pred             eeCCCCcccccCc
Q psy6598          64 FICDRCPRSYRQL   76 (154)
Q Consensus        64 ~~C~~C~~~f~~~   76 (154)
                      ..|+.||.+|...
T Consensus        49 ~vCP~CgyA~~~~   61 (214)
T PF09986_consen   49 WVCPHCGYAAFEE   61 (214)
T ss_pred             EECCCCCCccccc
Confidence            6899999876644


No 80 
>PF13717 zinc_ribbon_4:  zinc-ribbon domain
Probab=82.50  E-value=1.8  Score=20.91  Aligned_cols=32  Identities=19%  Similarity=0.515  Sum_probs=17.3

Q ss_pred             eecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccc
Q psy6598          35 VCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSY   73 (154)
Q Consensus        35 ~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f   73 (154)
                      .|..  |+..|.-.+...     -......+|+.|+..|
T Consensus         4 ~Cp~--C~~~y~i~d~~i-----p~~g~~v~C~~C~~~f   35 (36)
T PF13717_consen    4 TCPN--CQAKYEIDDEKI-----PPKGRKVRCSKCGHVF   35 (36)
T ss_pred             ECCC--CCCEEeCCHHHC-----CCCCcEEECCCCCCEe
Confidence            4543  666666655431     1223346777776665


No 81 
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=82.33  E-value=0.77  Score=24.52  Aligned_cols=27  Identities=22%  Similarity=0.422  Sum_probs=18.8

Q ss_pred             hcCCCceeecccccCcccCChhhHHHHhh
Q psy6598          28 HSGEKPYVCQVRDCQKRFTEYSSLYKHTL   56 (154)
Q Consensus        28 h~~~~~~~C~~~~C~~~f~~~~~l~~H~~   56 (154)
                      ..|+.-+.|.-  |+..|.......+|..
T Consensus        12 RDGE~~lrCPR--C~~~FR~~K~Y~RHVN   38 (65)
T COG4049          12 RDGEEFLRCPR--CGMVFRRRKDYIRHVN   38 (65)
T ss_pred             cCCceeeeCCc--hhHHHHHhHHHHHHhh
Confidence            35666677755  7777777777777765


No 82 
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=82.27  E-value=1  Score=30.01  Aligned_cols=25  Identities=36%  Similarity=0.790  Sum_probs=19.9

Q ss_pred             CceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcc
Q psy6598          32 KPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPR   71 (154)
Q Consensus        32 ~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~   71 (154)
                      +-|.|.+  ||..             |.++-|-.|++||.
T Consensus       133 ~~~vC~v--CGy~-------------~~ge~P~~CPiCga  157 (166)
T COG1592         133 KVWVCPV--CGYT-------------HEGEAPEVCPICGA  157 (166)
T ss_pred             CEEEcCC--CCCc-------------ccCCCCCcCCCCCC
Confidence            3699988  9865             55678899999984


No 83 
>PF04959 ARS2:  Arsenite-resistance protein 2;  InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=81.74  E-value=0.63  Score=32.35  Aligned_cols=29  Identities=17%  Similarity=0.344  Sum_probs=18.3

Q ss_pred             CceeCCCCcccccCchhHHHHHHhhcCcC
Q psy6598          62 RPFICDRCPRSYRQLCTLNVHKKTNHRES   90 (154)
Q Consensus        62 ~~~~C~~C~~~f~~~~~l~~H~~~~~~~~   90 (154)
                      ..|.|..|+|.|.-..-..+|+..-|.+.
T Consensus        76 ~K~~C~lc~KlFkg~eFV~KHI~nKH~e~  104 (214)
T PF04959_consen   76 DKWRCPLCGKLFKGPEFVRKHIFNKHPEK  104 (214)
T ss_dssp             EEEEE-SSS-EESSHHHHHHHHHHH-HHH
T ss_pred             CEECCCCCCcccCChHHHHHHHhhcCHHH
Confidence            34777777777777777777777666543


No 84 
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=81.62  E-value=1.6  Score=22.25  Aligned_cols=26  Identities=19%  Similarity=0.496  Sum_probs=15.2

Q ss_pred             eeecccccCcccCChhhHHHHhhhhcCCCceeCCCCccc
Q psy6598          34 YVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRS   72 (154)
Q Consensus        34 ~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~   72 (154)
                      |.|..  ||..|...           ...+..|+.||..
T Consensus         3 Y~C~~--Cg~~~~~~-----------~~~~irC~~CG~r   28 (44)
T smart00659        3 YICGE--CGRENEIK-----------SKDVVRCRECGYR   28 (44)
T ss_pred             EECCC--CCCEeecC-----------CCCceECCCCCce
Confidence            66655  77665543           2344677777654


No 85 
>PF13719 zinc_ribbon_5:  zinc-ribbon domain
Probab=81.58  E-value=1.7  Score=21.09  Aligned_cols=28  Identities=25%  Similarity=0.560  Sum_probs=14.4

Q ss_pred             cCcccCChhhHHHHhhhhcCCCceeCCCCcccc
Q psy6598          41 CQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSY   73 (154)
Q Consensus        41 C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f   73 (154)
                      |+..|.-...-     .-.+....+|+.|+..|
T Consensus         8 C~~~f~v~~~~-----l~~~~~~vrC~~C~~~f   35 (37)
T PF13719_consen    8 CQTRFRVPDDK-----LPAGGRKVRCPKCGHVF   35 (37)
T ss_pred             CCceEEcCHHH-----cccCCcEEECCCCCcEe
Confidence            66666655431     11223346666666655


No 86 
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=81.50  E-value=2.3  Score=27.72  Aligned_cols=41  Identities=22%  Similarity=0.417  Sum_probs=27.1

Q ss_pred             CCCceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccCc
Q psy6598          30 GEKPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQL   76 (154)
Q Consensus        30 ~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~   76 (154)
                      ...-|.|..  |+..|.....+..-   .. ...|.|+.||......
T Consensus        96 ~~~~Y~Cp~--C~~~y~~~ea~~~~---d~-~~~f~Cp~Cg~~l~~~  136 (147)
T smart00531       96 NNAYYKCPN--CQSKYTFLEANQLL---DM-DGTFTCPRCGEELEED  136 (147)
T ss_pred             CCcEEECcC--CCCEeeHHHHHHhc---CC-CCcEECCCCCCEEEEc
Confidence            455799966  99999876554321   11 3339999999875443


No 87 
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=81.18  E-value=1.3  Score=19.71  Aligned_cols=21  Identities=24%  Similarity=0.381  Sum_probs=15.1

Q ss_pred             eeCCCCcccccCchhHHHHHHh
Q psy6598          64 FICDRCPRSYRQLCTLNVHKKT   85 (154)
Q Consensus        64 ~~C~~C~~~f~~~~~l~~H~~~   85 (154)
                      ..|++|++.+ ....+..|+..
T Consensus         2 v~CPiC~~~v-~~~~in~HLD~   22 (26)
T smart00734        2 VQCPVCFREV-PENLINSHLDS   22 (26)
T ss_pred             CcCCCCcCcc-cHHHHHHHHHH
Confidence            4688888887 55677777753


No 88 
>KOG2186|consensus
Probab=80.95  E-value=1.3  Score=31.50  Aligned_cols=58  Identities=21%  Similarity=0.429  Sum_probs=44.4

Q ss_pred             ceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccCchhHHHHHHhhcCcCCCCCC
Q psy6598          33 PYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQLCTLNVHKKTNHRESKNKKP   95 (154)
Q Consensus        33 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~   95 (154)
                      .|.|..  ||..... ..+.+|+...++ .-|.|--|++.|.+ .....|..--.....|...
T Consensus         3 ~FtCnv--CgEsvKK-p~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~kCITEaQKYg~~   60 (276)
T KOG2186|consen    3 FFTCNV--CGESVKK-PQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTKCITEAQKYGKK   60 (276)
T ss_pred             EEehhh--hhhhccc-cchHHHHHhccC-CeeEEeeccccccc-chhhhhhhhcchHHHhhhh
Confidence            378988  9987665 456779887777 56999999999998 6778887766665555544


No 89 
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=80.56  E-value=1.9  Score=20.85  Aligned_cols=10  Identities=20%  Similarity=0.853  Sum_probs=5.8

Q ss_pred             eeCCCCcccc
Q psy6598          64 FICDRCPRSY   73 (154)
Q Consensus        64 ~~C~~C~~~f   73 (154)
                      +.|+.|+..|
T Consensus        26 v~C~~C~~~~   35 (38)
T TIGR02098        26 VRCGKCGHVW   35 (38)
T ss_pred             EECCCCCCEE
Confidence            5566666554


No 90 
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=80.53  E-value=2.1  Score=28.94  Aligned_cols=37  Identities=22%  Similarity=0.462  Sum_probs=27.8

Q ss_pred             CCCceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccCch
Q psy6598          30 GEKPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQLC   77 (154)
Q Consensus        30 ~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~   77 (154)
                      ...-|.|..  |+..|+....+.         .-|.|+.||.......
T Consensus       114 ~~~~Y~Cp~--C~~rytf~eA~~---------~~F~Cp~Cg~~L~~~d  150 (178)
T PRK06266        114 NNMFFFCPN--CHIRFTFDEAME---------YGFRCPQCGEMLEEYD  150 (178)
T ss_pred             CCCEEECCC--CCcEEeHHHHhh---------cCCcCCCCCCCCeecc
Confidence            446699965  999999888763         2599999998765543


No 91 
>KOG4167|consensus
Probab=80.44  E-value=0.37  Score=39.12  Aligned_cols=26  Identities=27%  Similarity=0.580  Sum_probs=23.7

Q ss_pred             ceeCCCCcccccCchhHHHHHHhhcC
Q psy6598          63 PFICDRCPRSYRQLCTLNVHKKTNHR   88 (154)
Q Consensus        63 ~~~C~~C~~~f~~~~~l~~H~~~~~~   88 (154)
                      .|.|..|+|.|-.-.++..||++|.-
T Consensus       792 iFpCreC~kvF~KiKSrNAHMK~Hr~  817 (907)
T KOG4167|consen  792 IFPCRECGKVFFKIKSRNAHMKTHRQ  817 (907)
T ss_pred             eeehHHHHHHHHHHhhhhHHHHHHHH
Confidence            39999999999999999999999853


No 92 
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=80.34  E-value=3.2  Score=25.74  Aligned_cols=79  Identities=18%  Similarity=0.276  Sum_probs=46.4

Q ss_pred             CeecccccccccccCHHHHHHHHHHhcCCCce------------eecccccCcccCChhhHHHHhhhhcCCCceeCCCCc
Q psy6598           3 PFFYFPWDCKISFASATNYKNHMRIHSGEKPY------------VCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCP   70 (154)
Q Consensus         3 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~------------~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~   70 (154)
                      |-.|+  .||-+......|.+-..--...++|            .|--  |...|........  ..-.....|.|+.|.
T Consensus        15 P~~Cp--iCgLtLVss~HLARSyHHLfPl~~f~ev~~~~~~~~~~C~~--C~~~f~~~~~~~~--~~~~~~~~y~C~~C~   88 (112)
T TIGR00622        15 PVECP--ICGLTLILSTHLARSYHHLFPLKAFQEIPLEEYNGSRFCFG--CQGPFPKPPVSPF--DELKDSHRYVCAVCK   88 (112)
T ss_pred             CCcCC--cCCCEEeccchHHHhhhccCCCcccccccccccCCCCcccC--cCCCCCCcccccc--cccccccceeCCCCC
Confidence            55677  8888777777776532111111111            3755  8888876431110  001223458999999


Q ss_pred             ccccCchhHHHHHHhhc
Q psy6598          71 RSYRQLCTLNVHKKTNH   87 (154)
Q Consensus        71 ~~f~~~~~l~~H~~~~~   87 (154)
                      ..|-..-..-.|...|.
T Consensus        89 ~~FC~dCD~fiHe~Lh~  105 (112)
T TIGR00622        89 NVFCVDCDVFVHESLHC  105 (112)
T ss_pred             CccccccchhhhhhccC
Confidence            98888877777877654


No 93 
>KOG4124|consensus
Probab=79.81  E-value=0.29  Score=36.21  Aligned_cols=53  Identities=34%  Similarity=0.839  Sum_probs=38.4

Q ss_pred             CCceeecccccCcccCChhhHHHHhhh-h------------------cCCCceeCCCCcccccCchhHHHHH
Q psy6598          31 EKPYVCQVRDCQKRFTEYSSLYKHTLV-H------------------SDIRPFICDRCPRSYRQLCTLNVHK   83 (154)
Q Consensus        31 ~~~~~C~~~~C~~~f~~~~~l~~H~~~-h------------------~~~~~~~C~~C~~~f~~~~~l~~H~   83 (154)
                      .++|.|.++.|.+.+.....|.-|... |                  ...|+|+|++|.+.++-...|..|.
T Consensus       347 ~~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i~~~s~~~~ph~~~~~~nk~~r~~i~~~~~k~~~~l~~~~  418 (442)
T KOG4124|consen  347 DKPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPITTPTPAPIPHQGFVVENKPYRCEVCSKRYKNLNGLKYHR  418 (442)
T ss_pred             cCCCCCCCCcchhhcccCcceeeccccCcCCCCCCCCCCCCCcceeeeccCcccChhhhhhhccCCCCCcee
Confidence            467888888888888877777666432 2                  1247899999999888777665553


No 94 
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=76.89  E-value=1  Score=23.00  Aligned_cols=11  Identities=18%  Similarity=0.374  Sum_probs=6.0

Q ss_pred             ceeCCCCcccc
Q psy6598          63 PFICDRCPRSY   73 (154)
Q Consensus        63 ~~~C~~C~~~f   73 (154)
                      .+.|+.||..+
T Consensus        21 ~~~Cp~CG~~~   31 (46)
T PRK00398         21 GVRCPYCGYRI   31 (46)
T ss_pred             ceECCCCCCeE
Confidence            35666666443


No 95 
>PF05290 Baculo_IE-1:  Baculovirus immediate-early protein (IE-0);  InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=76.02  E-value=1.5  Score=27.99  Aligned_cols=25  Identities=20%  Similarity=0.505  Sum_probs=16.1

Q ss_pred             hhHHHHhhhhcCCCceeCCCCcccccCchh
Q psy6598          49 SSLYKHTLVHSDIRPFICDRCPRSYRQLCT   78 (154)
Q Consensus        49 ~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~   78 (154)
                      ..|..|-..|     =.|+.|.-+|++.+.
T Consensus       112 a~LWK~~~~y-----pvCPvCkTSFKss~~  136 (140)
T PF05290_consen  112 ANLWKFCNLY-----PVCPVCKTSFKSSSS  136 (140)
T ss_pred             HHHHHHcccC-----CCCCccccccccccc
Confidence            4455555444     478888888877643


No 96 
>PF09986 DUF2225:  Uncharacterized protein conserved in bacteria (DUF2225);  InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=75.83  E-value=0.57  Score=32.60  Aligned_cols=44  Identities=23%  Similarity=0.378  Sum_probs=30.2

Q ss_pred             CCCeecccccccccccCHHHHHHHHHH-h---------cCCCc-----eeecccccCcccCCh
Q psy6598           1 MKPFFYFPWDCKISFASATNYKNHMRI-H---------SGEKP-----YVCQVRDCQKRFTEY   48 (154)
Q Consensus         1 ~k~~~C~~~~C~~~f~~~~~l~~H~~~-h---------~~~~~-----~~C~~~~C~~~f~~~   48 (154)
                      +|.++|+  +|++.|..+.-.....+. .         .+..|     ..|..  ||.+|...
T Consensus         3 ~k~~~CP--vC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~--CgyA~~~~   61 (214)
T PF09986_consen    3 DKKITCP--VCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPH--CGYAAFEE   61 (214)
T ss_pred             CCceECC--CCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCC--CCCccccc
Confidence            4778999  999999887655554432 1         12223     68966  99887765


No 97 
>KOG4167|consensus
Probab=75.64  E-value=0.79  Score=37.32  Aligned_cols=24  Identities=25%  Similarity=0.437  Sum_probs=19.7

Q ss_pred             eecccccccccccCHHHHHHHHHHhc
Q psy6598           4 FFYFPWDCKISFASATNYKNHMRIHS   29 (154)
Q Consensus         4 ~~C~~~~C~~~f~~~~~l~~H~~~h~   29 (154)
                      |.|.  .|+|+|.....+..||+.|.
T Consensus       793 FpCr--eC~kvF~KiKSrNAHMK~Hr  816 (907)
T KOG4167|consen  793 FPCR--ECGKVFFKIKSRNAHMKTHR  816 (907)
T ss_pred             eehH--HHHHHHHHHhhhhHHHHHHH
Confidence            7788  88888888888888888874


No 98 
>PF02176 zf-TRAF:  TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=75.01  E-value=2.7  Score=22.52  Aligned_cols=44  Identities=20%  Similarity=0.384  Sum_probs=23.4

Q ss_pred             CCceeecccccCcccCChhhHHHHhhhhcCCCceeCCC----CcccccC
Q psy6598          31 EKPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDR----CPRSYRQ   75 (154)
Q Consensus        31 ~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~----C~~~f~~   75 (154)
                      ..+-.|.. +|+..-..+..|..|...--..++..|++    |+..+.+
T Consensus         7 ~~~v~C~~-~cc~~~i~r~~l~~H~~~~C~~~~v~C~~~~~GC~~~~~~   54 (60)
T PF02176_consen    7 FRPVPCPN-GCCNEMIPRKELDDHLENECPKRPVPCPYSPYGCKERVPR   54 (60)
T ss_dssp             TSEEE-TT---S-BEEECCCHHHHHHTTSTTSEEE-SS----S--EEEH
T ss_pred             CCEeeCCC-CCcccceeHHHHHHHHHccCCCCcEECCCCCCCCCCccch
Confidence            34556653 34444344567888887666667788888    7776543


No 99 
>PF15269 zf-C2H2_7:  Zinc-finger
Probab=74.85  E-value=2.7  Score=21.38  Aligned_cols=22  Identities=23%  Similarity=0.558  Sum_probs=17.2

Q ss_pred             eecccccccccccCHHHHHHHHHH
Q psy6598           4 FFYFPWDCKISFASATNYKNHMRI   27 (154)
Q Consensus         4 ~~C~~~~C~~~f~~~~~l~~H~~~   27 (154)
                      |+|-  +|..+...++.|-.||+.
T Consensus        21 ykcf--qcpftc~~kshl~nhmky   42 (54)
T PF15269_consen   21 YKCF--QCPFTCNEKSHLFNHMKY   42 (54)
T ss_pred             ceee--cCCcccchHHHHHHHHHH
Confidence            5676  888888888888888774


No 100
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=72.71  E-value=1.4  Score=23.02  Aligned_cols=7  Identities=29%  Similarity=0.866  Sum_probs=3.4

Q ss_pred             eeCCCCc
Q psy6598          64 FICDRCP   70 (154)
Q Consensus        64 ~~C~~C~   70 (154)
                      ..|+.||
T Consensus        27 ~~CP~Cg   33 (52)
T TIGR02605        27 ATCPECG   33 (52)
T ss_pred             CCCCCCC
Confidence            3455554


No 101
>PF10571 UPF0547:  Uncharacterised protein family UPF0547;  InterPro: IPR018886  This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases. 
Probab=71.60  E-value=2.1  Score=19.11  Aligned_cols=10  Identities=20%  Similarity=0.883  Sum_probs=6.0

Q ss_pred             eCCCCccccc
Q psy6598          65 ICDRCPRSYR   74 (154)
Q Consensus        65 ~C~~C~~~f~   74 (154)
                      .|+.||..|.
T Consensus        16 ~Cp~CG~~F~   25 (26)
T PF10571_consen   16 FCPHCGYDFE   25 (26)
T ss_pred             cCCCCCCCCc
Confidence            5666666653


No 102
>PF09538 FYDLN_acid:  Protein of unknown function (FYDLN_acid);  InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=71.13  E-value=2.2  Score=26.33  Aligned_cols=30  Identities=20%  Similarity=0.710  Sum_probs=22.6

Q ss_pred             eeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccCc
Q psy6598          34 YVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQL   76 (154)
Q Consensus        34 ~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~   76 (154)
                      ..|..  ||+.|.-..           ..|..|+.||..|...
T Consensus        10 R~Cp~--CG~kFYDLn-----------k~PivCP~CG~~~~~~   39 (108)
T PF09538_consen   10 RTCPS--CGAKFYDLN-----------KDPIVCPKCGTEFPPE   39 (108)
T ss_pred             ccCCC--CcchhccCC-----------CCCccCCCCCCccCcc
Confidence            46855  998887633           2578899999988776


No 103
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=70.38  E-value=3.6  Score=21.47  Aligned_cols=11  Identities=27%  Similarity=0.866  Sum_probs=7.3

Q ss_pred             ceeecccccCccc
Q psy6598          33 PYVCQVRDCQKRF   45 (154)
Q Consensus        33 ~~~C~~~~C~~~f   45 (154)
                      .|.|..  ||+.|
T Consensus         6 ~Y~C~~--Cg~~~   16 (49)
T COG1996           6 EYKCAR--CGREV   16 (49)
T ss_pred             EEEhhh--cCCee
Confidence            467765  77766


No 104
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=70.15  E-value=2.1  Score=25.19  Aligned_cols=33  Identities=15%  Similarity=0.452  Sum_probs=21.4

Q ss_pred             CceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccCc
Q psy6598          32 KPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQL   76 (154)
Q Consensus        32 ~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~   76 (154)
                      ..|.|..  |++.        .+.+.-+  ..+.|..||..|.=.
T Consensus        34 ~~~~Cp~--C~~~--------~VkR~a~--GIW~C~kCg~~fAGg   66 (89)
T COG1997          34 AKHVCPF--CGRT--------TVKRIAT--GIWKCRKCGAKFAGG   66 (89)
T ss_pred             cCCcCCC--CCCc--------ceeeecc--CeEEcCCCCCeeccc
Confidence            4588977  9865        2233333  348999999887543


No 105
>PF05443 ROS_MUCR:  ROS/MUCR transcriptional regulator protein;  InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=70.08  E-value=3.1  Score=26.65  Aligned_cols=21  Identities=33%  Similarity=0.801  Sum_probs=9.4

Q ss_pred             ecccccccccccCHHHHHHHHHHhcC
Q psy6598           5 FYFPWDCKISFASATNYKNHMRIHSG   30 (154)
Q Consensus         5 ~C~~~~C~~~f~~~~~l~~H~~~h~~   30 (154)
                      .|-  .||+.|+.   |.+|++.|+|
T Consensus        74 ~cl--ecGk~~k~---LkrHL~~~~g   94 (132)
T PF05443_consen   74 ICL--ECGKKFKT---LKRHLRTHHG   94 (132)
T ss_dssp             E-T--BT--EESB---HHHHHHHTT-
T ss_pred             EEc--cCCcccch---HHHHHHHccC
Confidence            355  66666643   3566666644


No 106
>KOG4377|consensus
Probab=69.37  E-value=5  Score=30.68  Aligned_cols=26  Identities=27%  Similarity=0.689  Sum_probs=18.9

Q ss_pred             eeCC--CCcccccCchhHHHHHHhhcCc
Q psy6598          64 FICD--RCPRSYRQLCTLNVHKKTNHRE   89 (154)
Q Consensus        64 ~~C~--~C~~~f~~~~~l~~H~~~~~~~   89 (154)
                      |.|.  -|+..|...+.+..|.|.|.+.
T Consensus       402 fhc~r~Gc~~tl~s~sqm~shkrkheRq  429 (480)
T KOG4377|consen  402 FHCDRLGCEATLYSVSQMASHKRKHERQ  429 (480)
T ss_pred             eeecccCCceEEEehhhhhhhhhhhhhh
Confidence            4453  3788888888888888887554


No 107
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=68.66  E-value=6.6  Score=21.35  Aligned_cols=33  Identities=15%  Similarity=0.334  Sum_probs=19.7

Q ss_pred             CceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcc
Q psy6598          32 KPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPR   71 (154)
Q Consensus        32 ~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~   71 (154)
                      -.|.|..  ||..-..+..     +-.....+|.|+.||.
T Consensus        26 v~F~CPn--CGe~~I~Rc~-----~CRk~g~~Y~Cp~CGF   58 (61)
T COG2888          26 VKFPCPN--CGEVEIYRCA-----KCRKLGNPYRCPKCGF   58 (61)
T ss_pred             eEeeCCC--CCceeeehhh-----hHHHcCCceECCCcCc
Confidence            3478844  9866544432     2222356799988873


No 108
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=64.09  E-value=1.8  Score=24.64  Aligned_cols=18  Identities=28%  Similarity=0.621  Sum_probs=8.0

Q ss_pred             CceeecccccCcccCChh
Q psy6598          32 KPYVCQVRDCQKRFTEYS   49 (154)
Q Consensus        32 ~~~~C~~~~C~~~f~~~~   49 (154)
                      +-+.|.-.+||..|....
T Consensus        26 ~Y~qC~N~eCg~tF~t~e   43 (72)
T PRK09678         26 RYHQCQNVNCSATFITYE   43 (72)
T ss_pred             eeeecCCCCCCCEEEEEE
Confidence            334454223555554443


No 109
>PF08790 zf-LYAR:  LYAR-type C2HC zinc finger ;  InterPro: IPR014898 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This C2HC zinc finger domain is found in LYAR proteins such as Q08288 from SWISSPROT, which are involved in cell growth regulation.  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1WJV_A.
Probab=61.74  E-value=1.6  Score=19.92  Aligned_cols=19  Identities=37%  Similarity=0.800  Sum_probs=9.1

Q ss_pred             eecccccccccccCHHHHHHHH
Q psy6598           4 FFYFPWDCKISFASATNYKNHM   25 (154)
Q Consensus         4 ~~C~~~~C~~~f~~~~~l~~H~   25 (154)
                      |.|.  .|++.| .....+.|.
T Consensus         1 ~sCi--DC~~~F-~~~~y~~Ht   19 (28)
T PF08790_consen    1 FSCI--DCSKDF-DGDSYKSHT   19 (28)
T ss_dssp             EEET--TTTEEE-EGGGTTT--
T ss_pred             Ceee--cCCCCc-CcCCcCCCC
Confidence            4566  666666 333344443


No 110
>KOG1842|consensus
Probab=60.48  E-value=5.1  Score=30.94  Aligned_cols=30  Identities=30%  Similarity=0.515  Sum_probs=23.8

Q ss_pred             CceeCCCCcccccCchhHHHHHHhhcCcCC
Q psy6598          62 RPFICDRCPRSYRQLCTLNVHKKTNHRESK   91 (154)
Q Consensus        62 ~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~   91 (154)
                      ..|.|++|...|.....|..|.-.-|.+..
T Consensus        14 egflCPiC~~dl~~~~~L~~H~d~eH~~ed   43 (505)
T KOG1842|consen   14 EGFLCPICLLDLPNLSALNDHLDVEHFEED   43 (505)
T ss_pred             hcccCchHhhhhhhHHHHHHHHhhhccccc
Confidence            458888888888888888888877666554


No 111
>PF11789 zf-Nse:  Zinc-finger of the MIZ type in Nse subunit; PDB: 2YU4_A 3HTK_C.
Probab=60.07  E-value=7.3  Score=20.96  Aligned_cols=31  Identities=13%  Similarity=0.207  Sum_probs=13.2

Q ss_pred             CceeecccccCcccCChhhHHHHhhhhcCCCceeCCC
Q psy6598          32 KPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDR   68 (154)
Q Consensus        32 ~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~   68 (154)
                      .|.....  |+..|....- ...+   ...+...|+.
T Consensus        23 ~PV~s~~--C~H~fek~aI-~~~i---~~~~~~~CPv   53 (57)
T PF11789_consen   23 DPVKSKK--CGHTFEKEAI-LQYI---QRNGSKRCPV   53 (57)
T ss_dssp             SEEEESS--S--EEEHHHH-HHHC---TTTS-EE-SC
T ss_pred             CCcCcCC--CCCeecHHHH-HHHH---HhcCCCCCCC
Confidence            4555544  7766665433 2233   2344566665


No 112
>KOG2593|consensus
Probab=59.49  E-value=12  Score=28.88  Aligned_cols=38  Identities=29%  Similarity=0.698  Sum_probs=25.9

Q ss_pred             cCCCceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcc
Q psy6598          29 SGEKPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPR   71 (154)
Q Consensus        29 ~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~   71 (154)
                      +...-|.|..  |.+.|+....++-   .-...-.|.|..|+-
T Consensus       124 t~~~~Y~Cp~--C~kkyt~Lea~~L---~~~~~~~F~C~~C~g  161 (436)
T KOG2593|consen  124 TNVAGYVCPN--CQKKYTSLEALQL---LDNETGEFHCENCGG  161 (436)
T ss_pred             cccccccCCc--cccchhhhHHHHh---hcccCceEEEecCCC
Confidence            4456799976  9999988877643   222233499998864


No 113
>PF01428 zf-AN1:  AN1-like Zinc finger;  InterPro: IPR000058 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents the AN1-type zinc finger domain, which has a dimetal (zinc)-bound alpha/beta fold. This domain was first identified as a zinc finger at the C terminus of AN1 Q91889 from SWISSPROT, a ubiquitin-like protein in Xenopus laevis []. The AN1-type zinc finger contains six conserved cysteines and two histidines that could potentially coordinate 2 zinc atoms. Certain stress-associated proteins (SAP) contain AN1 domain, often in combination with A20 zinc finger domains (SAP8) or C2H2 domains (SAP16) []. For example, the human protein Znf216 has an A20 zinc-finger at the N terminus and an AN1 zinc-finger at the C terminus, acting to negatively regulate the NFkappaB activation pathway and to interact with components of the immune response like RIP, IKKgamma and TRAF6. The interact of Znf216 with IKK-gamma and RIP is mediated by the A20 zinc-finger domain, while its interaction with TRAF6 is mediated by the AN1 zinc-finger domain; therefore, both zinc-finger domains are involved in regulating the immune response []. The AN1 zinc finger domain is also found in proteins containing a ubiquitin-like domain, which are involved in the ubiquitination pathway []. Proteins containing an AN1-type zinc finger include:   Ascidian posterior end mark 6 (pem-6) protein []. Human AWP1 protein (associated with PRK1), which is expressed during early embryogenesis []. Human immunoglobulin mu binding protein 2 (SMUBP-2), mutations in which cause muscular atrophy with respiratory distress type 1 [].   More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 1WFP_A 1WYS_A 1WG2_A 1WFH_A 1X4W_A 1WFE_A 1WFL_A 1X4V_A.
Probab=59.05  E-value=4.4  Score=20.33  Aligned_cols=15  Identities=33%  Similarity=0.820  Sum_probs=9.5

Q ss_pred             CceeCCCCcccccCc
Q psy6598          62 RPFICDRCPRSYRQL   76 (154)
Q Consensus        62 ~~~~C~~C~~~f~~~   76 (154)
                      .||.|..|++.|=..
T Consensus        12 ~~~~C~~C~~~FC~~   26 (43)
T PF01428_consen   12 LPFKCKHCGKSFCLK   26 (43)
T ss_dssp             SHEE-TTTS-EE-TT
T ss_pred             CCeECCCCCcccCcc
Confidence            479999999988544


No 114
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=57.54  E-value=1.2  Score=29.26  Aligned_cols=16  Identities=38%  Similarity=0.854  Sum_probs=10.0

Q ss_pred             eeecccccCcccCChhhH
Q psy6598          34 YVCQVRDCQKRFTEYSSL   51 (154)
Q Consensus        34 ~~C~~~~C~~~f~~~~~l   51 (154)
                      +.|..  ||+.|.....+
T Consensus        29 ~~c~~--c~~~f~~~e~~   44 (154)
T PRK00464         29 RECLA--CGKRFTTFERV   44 (154)
T ss_pred             eeccc--cCCcceEeEec
Confidence            66754  77777666544


No 115
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=57.22  E-value=6.7  Score=25.93  Aligned_cols=31  Identities=13%  Similarity=0.316  Sum_probs=24.1

Q ss_pred             eecccccccccccCHHHHHHHHHHhcCCCceeecccccCcccCC
Q psy6598           4 FFYFPWDCKISFASATNYKNHMRIHSGEKPYVCQVRDCQKRFTE   47 (154)
Q Consensus         4 ~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~   47 (154)
                      |.|+  .|+..|.....+.         .-|.|+.  ||.....
T Consensus       110 Y~Cp--~c~~r~tf~eA~~---------~~F~Cp~--Cg~~L~~  140 (158)
T TIGR00373       110 FICP--NMCVRFTFNEAME---------LNFTCPR--CGAMLDY  140 (158)
T ss_pred             EECC--CCCcEeeHHHHHH---------cCCcCCC--CCCEeee
Confidence            8899  9999988777664         2599977  9986543


No 116
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=57.16  E-value=13  Score=20.23  Aligned_cols=33  Identities=15%  Similarity=0.505  Sum_probs=19.7

Q ss_pred             CceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcc
Q psy6598          32 KPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPR   71 (154)
Q Consensus        32 ~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~   71 (154)
                      -.|.|..  ||.....+-.     +-.....+|.|+.||.
T Consensus        24 ~~F~CPn--CG~~~I~RC~-----~CRk~~~~Y~CP~CGF   56 (59)
T PRK14890         24 VKFLCPN--CGEVIIYRCE-----KCRKQSNPYTCPKCGF   56 (59)
T ss_pred             CEeeCCC--CCCeeEeech-----hHHhcCCceECCCCCC
Confidence            4588954  9876333321     2222356799999984


No 117
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=56.82  E-value=6.3  Score=25.02  Aligned_cols=33  Identities=15%  Similarity=0.304  Sum_probs=21.9

Q ss_pred             eeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccCchhH
Q psy6598          34 YVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQLCTL   79 (154)
Q Consensus        34 ~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l   79 (154)
                      ..|..  ||+.|.-..           ..|..|+.||..|.....+
T Consensus        10 r~Cp~--cg~kFYDLn-----------k~p~vcP~cg~~~~~~~~~   42 (129)
T TIGR02300        10 RICPN--TGSKFYDLN-----------RRPAVSPYTGEQFPPEEAL   42 (129)
T ss_pred             ccCCC--cCccccccC-----------CCCccCCCcCCccCcchhh
Confidence            46755  888876532           2568888888887665443


No 118
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=54.74  E-value=7.2  Score=26.37  Aligned_cols=33  Identities=27%  Similarity=0.623  Sum_probs=24.5

Q ss_pred             CeecccccccccccCHHHHHHHHHHhcCCCceeecccccCcccCCh
Q psy6598           3 PFFYFPWDCKISFASATNYKNHMRIHSGEKPYVCQVRDCQKRFTEY   48 (154)
Q Consensus         3 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~   48 (154)
                      -|.|+  .|+..|.....+.         .-|.|+.  ||......
T Consensus       117 ~Y~Cp--~C~~rytf~eA~~---------~~F~Cp~--Cg~~L~~~  149 (178)
T PRK06266        117 FFFCP--NCHIRFTFDEAME---------YGFRCPQ--CGEMLEEY  149 (178)
T ss_pred             EEECC--CCCcEEeHHHHhh---------cCCcCCC--CCCCCeec
Confidence            38899  9999987776553         3599977  99865543


No 119
>PF03604 DNA_RNApol_7kD:  DNA directed RNA polymerase, 7 kDa subunit;  InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=52.90  E-value=12  Score=17.57  Aligned_cols=10  Identities=30%  Similarity=0.923  Sum_probs=5.5

Q ss_pred             ceeCCCCccc
Q psy6598          63 PFICDRCPRS   72 (154)
Q Consensus        63 ~~~C~~C~~~   72 (154)
                      +.+|+.||..
T Consensus        17 ~irC~~CG~R   26 (32)
T PF03604_consen   17 PIRCPECGHR   26 (32)
T ss_dssp             TSSBSSSS-S
T ss_pred             cEECCcCCCe
Confidence            4566666654


No 120
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=52.09  E-value=12  Score=24.07  Aligned_cols=20  Identities=25%  Similarity=0.461  Sum_probs=11.3

Q ss_pred             cccccccccccCHHHHHHHHHHhcC
Q psy6598           6 YFPWDCKISFASATNYKNHMRIHSG   30 (154)
Q Consensus         6 C~~~~C~~~f~~~~~l~~H~~~h~~   30 (154)
                      |-  .+|+.|+   .|++|+.+|.|
T Consensus        79 cL--EDGkkfK---SLKRHL~t~~g   98 (148)
T COG4957          79 CL--EDGKKFK---SLKRHLTTHYG   98 (148)
T ss_pred             Ee--ccCcchH---HHHHHHhcccC
Confidence            55  5666653   35666666554


No 121
>PF04959 ARS2:  Arsenite-resistance protein 2;  InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=51.87  E-value=5.5  Score=27.80  Aligned_cols=27  Identities=26%  Similarity=0.493  Sum_probs=20.4

Q ss_pred             CCCceeecccccCcccCChhhHHHHhhhh
Q psy6598          30 GEKPYVCQVRDCQKRFTEYSSLYKHTLVH   58 (154)
Q Consensus        30 ~~~~~~C~~~~C~~~f~~~~~l~~H~~~h   58 (154)
                      .+..|.|..  |+|.|.....+..|+..-
T Consensus        74 ~~~K~~C~l--c~KlFkg~eFV~KHI~nK  100 (214)
T PF04959_consen   74 DEDKWRCPL--CGKLFKGPEFVRKHIFNK  100 (214)
T ss_dssp             SSEEEEE-S--SS-EESSHHHHHHHHHHH
T ss_pred             cCCEECCCC--CCcccCChHHHHHHHhhc
Confidence            455799987  999999999999998753


No 122
>smart00154 ZnF_AN1 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis.
Probab=51.66  E-value=8  Score=19.02  Aligned_cols=14  Identities=21%  Similarity=0.292  Sum_probs=9.3

Q ss_pred             CeecccccccccccCH
Q psy6598           3 PFFYFPWDCKISFASA   18 (154)
Q Consensus         3 ~~~C~~~~C~~~f~~~   18 (154)
                      ||+|.  .|++.|-..
T Consensus        12 ~f~C~--~C~~~FC~~   25 (39)
T smart00154       12 GFKCR--HCGNLFCGE   25 (39)
T ss_pred             CeECC--ccCCccccc
Confidence            56777  777777543


No 123
>KOG3408|consensus
Probab=51.57  E-value=9.3  Score=24.04  Aligned_cols=25  Identities=32%  Similarity=0.490  Sum_probs=21.5

Q ss_pred             CCceeCCCCcccccCchhHHHHHHh
Q psy6598          61 IRPFICDRCPRSYRQLCTLNVHKKT   85 (154)
Q Consensus        61 ~~~~~C~~C~~~f~~~~~l~~H~~~   85 (154)
                      .-.|-|-.|.+-|.....|..|.++
T Consensus        55 ~GqfyCi~CaRyFi~~~~l~~H~kt   79 (129)
T KOG3408|consen   55 GGQFYCIECARYFIDAKALKTHFKT   79 (129)
T ss_pred             CceeehhhhhhhhcchHHHHHHHhc
Confidence            3458999999999999999999876


No 124
>KOG4124|consensus
Probab=51.54  E-value=2  Score=31.98  Aligned_cols=53  Identities=30%  Similarity=0.565  Sum_probs=40.4

Q ss_pred             CCCeecccccccccccCHHHHHHHHH-Hhc------------------CCCceeecccccCcccCChhhHHHHh
Q psy6598           1 MKPFFYFPWDCKISFASATNYKNHMR-IHS------------------GEKPYVCQVRDCQKRFTEYSSLYKHT   55 (154)
Q Consensus         1 ~k~~~C~~~~C~~~f~~~~~l~~H~~-~h~------------------~~~~~~C~~~~C~~~f~~~~~l~~H~   55 (154)
                      .++|+|.-+.|.+.+....+|..|.. .|.                  ..|+|.|.+  |.+.++....|.-|.
T Consensus       347 ~~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i~~~s~~~~ph~~~~~~nk~~r~~i--~~~~~k~~~~l~~~~  418 (442)
T KOG4124|consen  347 DKPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPITTPTPAPIPHQGFVVENKPYRCEV--CSKRYKNLNGLKYHR  418 (442)
T ss_pred             cCCCCCCCCcchhhcccCcceeeccccCcCCCCCCCCCCCCCcceeeeccCcccChh--hhhhhccCCCCCcee
Confidence            47899998899999998888887743 232                  147899988  998888777665553


No 125
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=50.98  E-value=10  Score=31.58  Aligned_cols=13  Identities=23%  Similarity=0.565  Sum_probs=9.6

Q ss_pred             CCCceeCCCCccc
Q psy6598          60 DIRPFICDRCPRS   72 (154)
Q Consensus        60 ~~~~~~C~~C~~~   72 (154)
                      ...|..|+.||..
T Consensus       472 ~~~p~~Cp~Cgs~  484 (730)
T COG1198         472 EPIPQSCPECGSE  484 (730)
T ss_pred             CCCCCCCCCCCCC
Confidence            4667888888864


No 126
>KOG3408|consensus
Probab=49.34  E-value=11  Score=23.70  Aligned_cols=16  Identities=25%  Similarity=0.501  Sum_probs=9.2

Q ss_pred             cCcccCChhhHHHHhh
Q psy6598          41 CQKRFTEYSSLYKHTL   56 (154)
Q Consensus        41 C~~~f~~~~~l~~H~~   56 (154)
                      |.+.|.....|..|.+
T Consensus        63 CaRyFi~~~~l~~H~k   78 (129)
T KOG3408|consen   63 CARYFIDAKALKTHFK   78 (129)
T ss_pred             hhhhhcchHHHHHHHh
Confidence            5555555555555554


No 127
>KOG2071|consensus
Probab=49.25  E-value=12  Score=30.01  Aligned_cols=27  Identities=15%  Similarity=0.267  Sum_probs=21.3

Q ss_pred             CCceeCCCCcccccCchhHHHHHHhhc
Q psy6598          61 IRPFICDRCPRSYRQLCTLNVHKKTNH   87 (154)
Q Consensus        61 ~~~~~C~~C~~~f~~~~~l~~H~~~~~   87 (154)
                      ..|-.|..||.+|........||..|-
T Consensus       416 ~~pnqC~~CG~R~~~~ee~sk~md~H~  442 (579)
T KOG2071|consen  416 DSPNQCKSCGLRFDDSEERSKHMDIHD  442 (579)
T ss_pred             CCcchhcccccccccchhhhhHhhhhh
Confidence            456889999999988888777777664


No 128
>PF10276 zf-CHCC:  Zinc-finger domain;  InterPro: IPR019401 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.   This entry represents a short conserved zinc-finger domain. It contains the sequence motif Cx8Hx14Cx2C. ; PDB: 2JVM_A 2JRR_A 2JZ8_A.
Probab=49.24  E-value=9.2  Score=19.02  Aligned_cols=12  Identities=33%  Similarity=1.002  Sum_probs=9.4

Q ss_pred             CceeCCCCcccc
Q psy6598          62 RPFICDRCPRSY   73 (154)
Q Consensus        62 ~~~~C~~C~~~f   73 (154)
                      ++..|++|+..|
T Consensus        28 ~~~~CpYCg~~y   39 (40)
T PF10276_consen   28 GPVVCPYCGTRY   39 (40)
T ss_dssp             CEEEETTTTEEE
T ss_pred             CeEECCCCCCEE
Confidence            467888888877


No 129
>PF13878 zf-C2H2_3:  zinc-finger of acetyl-transferase ESCO
Probab=49.14  E-value=23  Score=17.60  Aligned_cols=24  Identities=21%  Similarity=0.328  Sum_probs=16.1

Q ss_pred             eeccccccccccc--CHHHHHHHHHHhc
Q psy6598           4 FFYFPWDCKISFA--SATNYKNHMRIHS   29 (154)
Q Consensus         4 ~~C~~~~C~~~f~--~~~~l~~H~~~h~   29 (154)
                      -+|.  .||..|.  ....-..|.+.|.
T Consensus        14 ~~C~--~CgM~Y~~~~~eD~~~H~~yH~   39 (41)
T PF13878_consen   14 TTCP--TCGMLYSPGSPEDEKLHKKYHD   39 (41)
T ss_pred             cCCC--CCCCEECCCCHHHHHHHHHHHh
Confidence            4688  8887774  4456667777663


No 130
>PLN02294 cytochrome c oxidase subunit Vb
Probab=49.01  E-value=10  Score=25.43  Aligned_cols=21  Identities=14%  Similarity=0.373  Sum_probs=15.3

Q ss_pred             hhhhcCCCceeCCCCcccccCc
Q psy6598          55 TLVHSDIRPFICDRCPRSYRQL   76 (154)
Q Consensus        55 ~~~h~~~~~~~C~~C~~~f~~~   76 (154)
                      +..+. .+|+.|+.||..|...
T Consensus       134 f~L~k-Gkp~RCpeCG~~fkL~  154 (174)
T PLN02294        134 FWLEK-GKSFECPVCTQYFELE  154 (174)
T ss_pred             EEecC-CCceeCCCCCCEEEEE
Confidence            33444 4789999999998654


No 131
>PLN03238 probable histone acetyltransferase MYST; Provisional
Probab=48.92  E-value=20  Score=26.24  Aligned_cols=29  Identities=21%  Similarity=0.382  Sum_probs=23.6

Q ss_pred             CCceeCCCCcccccCchhHHHHHHhhcCc
Q psy6598          61 IRPFICDRCPRSYRQLCTLNVHKKTNHRE   89 (154)
Q Consensus        61 ~~~~~C~~C~~~f~~~~~l~~H~~~~~~~   89 (154)
                      ...|.|..|-+.|.....|.+|+....-.
T Consensus        46 ~~lyiCe~Clky~~~~~~l~~H~~~C~~r   74 (290)
T PLN03238         46 TKLYICEYCLKYMRKKKSLLRHLAKCDIR   74 (290)
T ss_pred             CeEEEcCCCcchhCCHHHHHHHHHhCCCC
Confidence            55699999999999999999998865433


No 132
>PF09723 Zn-ribbon_8:  Zinc ribbon domain;  InterPro: IPR013429  This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=48.55  E-value=8.8  Score=19.13  Aligned_cols=11  Identities=18%  Similarity=0.474  Sum_probs=6.1

Q ss_pred             eeccccccccccc
Q psy6598           4 FFYFPWDCKISFA   16 (154)
Q Consensus         4 ~~C~~~~C~~~f~   16 (154)
                      |+|.  .||..|.
T Consensus         6 y~C~--~Cg~~fe   16 (42)
T PF09723_consen    6 YRCE--ECGHEFE   16 (42)
T ss_pred             EEeC--CCCCEEE
Confidence            5555  5665554


No 133
>PHA00626 hypothetical protein
Probab=48.09  E-value=9.8  Score=20.43  Aligned_cols=13  Identities=15%  Similarity=0.391  Sum_probs=8.6

Q ss_pred             CeecccccccccccC
Q psy6598           3 PFFYFPWDCKISFAS   17 (154)
Q Consensus         3 ~~~C~~~~C~~~f~~   17 (154)
                      .|.|.  .||..|..
T Consensus        23 rYkCk--dCGY~ft~   35 (59)
T PHA00626         23 DYVCC--DCGYNDSK   35 (59)
T ss_pred             ceEcC--CCCCeech
Confidence            47777  77776653


No 134
>cd00924 Cyt_c_Oxidase_Vb Cytochrome c oxidase subunit Vb.  Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes.  It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane.  The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome.  Found only in eukaryotes, subunit Vb is one of three mammalian subunits that lacks a transmembrane region.  Subunit Vb is located on the matrix side of the membrane and binds the regulatory subunit of protein kinase A.  The abnormally extended conformation is stable only in the CcO assembly.
Probab=48.04  E-value=11  Score=22.81  Aligned_cols=20  Identities=15%  Similarity=0.544  Sum_probs=15.3

Q ss_pred             hhhhcCCCceeCCCCcccccC
Q psy6598          55 TLVHSDIRPFICDRCPRSYRQ   75 (154)
Q Consensus        55 ~~~h~~~~~~~C~~C~~~f~~   75 (154)
                      +..+.+ ++.+|..||..|..
T Consensus        72 ~~l~~g-~~~rC~eCG~~fkL   91 (97)
T cd00924          72 MWLEKG-KPKRCPECGHVFKL   91 (97)
T ss_pred             EEEeCC-CceeCCCCCcEEEE
Confidence            334555 79999999999864


No 135
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=47.63  E-value=14  Score=27.25  Aligned_cols=48  Identities=19%  Similarity=0.446  Sum_probs=30.6

Q ss_pred             ecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccCchhHHHHHHhhc
Q psy6598          36 CQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQLCTLNVHKKTNH   87 (154)
Q Consensus        36 C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~~   87 (154)
                      |-.  |...|.....-..-  .-+..-.|.|..|...|-..-..-.|...|.
T Consensus       365 Cf~--CQ~~fp~~~~~~~~--~~~ss~rY~Ce~CK~~FC~dCdvfiHe~Lh~  412 (421)
T COG5151         365 CFV--CQGPFPKPPVSPFD--ESTSSGRYQCELCKSTFCSDCDVFIHETLHF  412 (421)
T ss_pred             cee--ccCCCCCCCCCccc--ccccccceechhhhhhhhhhhHHHHHHHHhh
Confidence            555  66666654321111  1122345999999999998888888887764


No 136
>PF06220 zf-U1:  U1 zinc finger;  InterPro: IPR013085 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  C2H2-type (classical) zinc fingers (Znf) were the first class to be characterised. They contain a short beta hairpin and an alpha helix (beta/beta/alpha structure), where a single zinc atom is held in place by Cys(2)His(2) (C2H2) residues in a tetrahedral array. C2H2 Znf's can be divided into three groups based on the number and pattern of fingers: triple-C2H2 (binds single ligand), multiple-adjacent-C2H2 (binds multiple ligands), and separated paired-C2H2 []. C2H2 Znf's are the most common DNA-binding motifs found in eukaryotic transcription factors, and have also been identified in prokaryotes []. Transcription factors usually contain several Znf's (each with a conserved beta/beta/alpha structure) capable of making multiple contacts along the DNA, where the C2H2 Znf motifs recognise DNA sequences by binding to the major groove of DNA via a short alpha-helix in the Znf, the Znf spanning 3-4 bases of the DNA []. C2H2 Znf's can also bind to RNA and protein targets []. This entry represents a C2H2-type zinc finger motif found in several U1 small nuclear ribonucleoprotein C (U1-C) proteins. Some proteins contain multiple copies of this motif. The U1 small nuclear ribonucleoprotein (U1 snRNP) binds to the pre-mRNA 5' splice site at early stages of spliceosome assembly. Recruitment of U1 to a class of weak 5' splice site is promoted by binding of the protein TIA-1 to uridine-rich sequences immediately downstream from the 5' splice site. Binding of TIA-1 in the vicinity of a 5' splice site helps to stabilise U1 snRNP recruitment, at least in part, via a direct interaction with U1-C, thus providing one molecular mechanism for the function of this splicing regulator []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 2VRD_A.
Probab=46.89  E-value=17  Score=17.78  Aligned_cols=12  Identities=17%  Similarity=0.379  Sum_probs=4.0

Q ss_pred             CCeecccccccccc
Q psy6598           2 KPFFYFPWDCKISF   15 (154)
Q Consensus         2 k~~~C~~~~C~~~f   15 (154)
                      +.|-|+  .|...|
T Consensus         2 ~ryyCd--yC~~~~   13 (38)
T PF06220_consen    2 PRYYCD--YCKKYL   13 (38)
T ss_dssp             -S-B-T--TT--B-
T ss_pred             cCeecc--ccccee
Confidence            456667  777666


No 137
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=46.86  E-value=9.6  Score=18.46  Aligned_cols=11  Identities=27%  Similarity=0.573  Sum_probs=6.2

Q ss_pred             eeccccccccccc
Q psy6598           4 FFYFPWDCKISFA   16 (154)
Q Consensus         4 ~~C~~~~C~~~f~   16 (154)
                      |+|.  .||..|.
T Consensus         6 y~C~--~Cg~~fe   16 (41)
T smart00834        6 YRCE--DCGHTFE   16 (41)
T ss_pred             EEcC--CCCCEEE
Confidence            5555  6666553


No 138
>PF15135 UPF0515:  Uncharacterised protein UPF0515
Probab=45.97  E-value=13  Score=26.47  Aligned_cols=15  Identities=27%  Similarity=0.890  Sum_probs=11.6

Q ss_pred             CceeCCCCcccccCc
Q psy6598          62 RPFICDRCPRSYRQL   76 (154)
Q Consensus        62 ~~~~C~~C~~~f~~~   76 (154)
                      --|.|+.|+..|.-.
T Consensus       154 aef~C~~C~h~F~G~  168 (278)
T PF15135_consen  154 AEFHCPKCRHNFRGF  168 (278)
T ss_pred             eeeecccccccchhh
Confidence            348999999888754


No 139
>PF04780 DUF629:  Protein of unknown function (DUF629);  InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=44.90  E-value=18  Score=28.47  Aligned_cols=32  Identities=22%  Similarity=0.494  Sum_probs=25.4

Q ss_pred             cCCCceeCCCCcccccCchhHHHHHHhhcCcC
Q psy6598          59 SDIRPFICDRCPRSYRQLCTLNVHKKTNHRES   90 (154)
Q Consensus        59 ~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~   90 (154)
                      ..-+-+.|+.|.+.|.....+..|+..-|...
T Consensus        53 ~sWrFWiCp~CskkF~d~~~~~~H~~~eH~~~   84 (466)
T PF04780_consen   53 KSWRFWICPRCSKKFSDAESCLSHMEQEHPAG   84 (466)
T ss_pred             CceeEeeCCcccceeCCHHHHHHHHHHhhhhh
Confidence            33455899999999999999999998665443


No 140
>PF09845 DUF2072:  Zn-ribbon containing protein (DUF2072);  InterPro: IPR018645  This archaeal Zinc-ribbon containing proteins have no known function. 
Probab=44.85  E-value=11  Score=24.15  Aligned_cols=15  Identities=33%  Similarity=0.727  Sum_probs=8.8

Q ss_pred             ceeecccccCcccCChh
Q psy6598          33 PYVCQVRDCQKRFTEYS   49 (154)
Q Consensus        33 ~~~C~~~~C~~~f~~~~   49 (154)
                      |+.|..  ||+.|..-+
T Consensus         1 PH~Ct~--Cg~~f~dgs   15 (131)
T PF09845_consen    1 PHQCTK--CGRVFEDGS   15 (131)
T ss_pred             CcccCc--CCCCcCCCc
Confidence            455655  666666544


No 141
>COG3677 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Probab=44.17  E-value=11  Score=23.99  Aligned_cols=46  Identities=13%  Similarity=0.325  Sum_probs=25.5

Q ss_pred             HHHHHHhcCCCceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccCc
Q psy6598          22 KNHMRIHSGEKPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQL   76 (154)
Q Consensus        22 ~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~   76 (154)
                      ..+++...+.  -.|..  |+.....     .+-..-.+...|+|+.|++.|...
T Consensus        21 ~~~~~~~~~~--~~cP~--C~s~~~~-----k~g~~~~~~qRyrC~~C~~tf~~~   66 (129)
T COG3677          21 AYAIRMQITK--VNCPR--CKSSNVV-----KIGGIRRGHQRYKCKSCGSTFTVE   66 (129)
T ss_pred             HHHHhhhccc--CcCCC--CCcccee-----eECCccccccccccCCcCcceeee
Confidence            3455554432  24755  7755311     111122235569999999998754


No 142
>PF07754 DUF1610:  Domain of unknown function (DUF1610);  InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=43.53  E-value=11  Score=16.48  Aligned_cols=8  Identities=38%  Similarity=0.795  Sum_probs=5.0

Q ss_pred             Ceeccccccc
Q psy6598           3 PFFYFPWDCK   12 (154)
Q Consensus         3 ~~~C~~~~C~   12 (154)
                      +|.|+  .||
T Consensus        16 ~f~CP--nCG   23 (24)
T PF07754_consen   16 PFPCP--NCG   23 (24)
T ss_pred             eEeCC--CCC
Confidence            36666  666


No 143
>KOG0717|consensus
Probab=42.77  E-value=14  Score=28.87  Aligned_cols=21  Identities=24%  Similarity=0.578  Sum_probs=17.6

Q ss_pred             eeCCCCcccccCchhHHHHHH
Q psy6598          64 FICDRCPRSYRQLCTLNVHKK   84 (154)
Q Consensus        64 ~~C~~C~~~f~~~~~l~~H~~   84 (154)
                      +-|.+|.|.|.+..+|..|..
T Consensus       293 lyC~vCnKsFKseKq~kNHEn  313 (508)
T KOG0717|consen  293 LYCVVCNKSFKSEKQLKNHEN  313 (508)
T ss_pred             eEEeeccccccchHHHHhhHH
Confidence            788889999998888888854


No 144
>COG4888 Uncharacterized Zn ribbon-containing protein [General function prediction only]
Probab=41.68  E-value=10  Score=23.04  Aligned_cols=39  Identities=18%  Similarity=0.454  Sum_probs=22.9

Q ss_pred             CCceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccC
Q psy6598          31 EKPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQ   75 (154)
Q Consensus        31 ~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~   75 (154)
                      ++.|.|..  |+..-.....+.    .-...-...|..||..|..
T Consensus        20 ~k~FtCp~--Cghe~vs~ctvk----k~~~~g~~~Cg~CGls~e~   58 (104)
T COG4888          20 PKTFTCPR--CGHEKVSSCTVK----KTVNIGTAVCGNCGLSFEC   58 (104)
T ss_pred             CceEecCc--cCCeeeeEEEEE----ecCceeEEEcccCcceEEE
Confidence            45688877  887655443322    1122234688888887743


No 145
>PF08274 PhnA_Zn_Ribbon:  PhnA Zinc-Ribbon ;  InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=41.55  E-value=16  Score=16.94  Aligned_cols=9  Identities=22%  Similarity=0.907  Sum_probs=4.8

Q ss_pred             ceeCCCCcc
Q psy6598          63 PFICDRCPR   71 (154)
Q Consensus        63 ~~~C~~C~~   71 (154)
                      .|.|+.|+.
T Consensus        19 ~~vCp~C~~   27 (30)
T PF08274_consen   19 LLVCPECGH   27 (30)
T ss_dssp             SEEETTTTE
T ss_pred             EEeCCcccc
Confidence            355555554


No 146
>PRK14873 primosome assembly protein PriA; Provisional
Probab=40.99  E-value=18  Score=29.81  Aligned_cols=10  Identities=30%  Similarity=0.873  Sum_probs=7.6

Q ss_pred             ceeCCCCccc
Q psy6598          63 PFICDRCPRS   72 (154)
Q Consensus        63 ~~~C~~C~~~   72 (154)
                      |+.|+.||..
T Consensus       422 p~~Cp~Cgs~  431 (665)
T PRK14873        422 DWRCPRCGSD  431 (665)
T ss_pred             CccCCCCcCC
Confidence            5788888764


No 147
>KOG0782|consensus
Probab=40.88  E-value=3.9  Score=32.77  Aligned_cols=53  Identities=19%  Similarity=0.433  Sum_probs=36.5

Q ss_pred             HHHHHHHHHHhcCCCceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccCch
Q psy6598          18 ATNYKNHMRIHSGEKPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQLC   77 (154)
Q Consensus        18 ~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~   77 (154)
                      .+.+.+|--+|.-..--+|..  ||++|..+-.+..-.     .....|..|...|-.+-
T Consensus       238 E~~fvrHHWVHrrRqeGkC~~--CgKgFQQKf~FhsKE-----ivAisCSWCKqayH~Kv  290 (1004)
T KOG0782|consen  238 ESGFVRHHWVHRRRQEGKCNT--CGKGFQQKFFFHSKE-----IVAISCSWCKQAYHLKV  290 (1004)
T ss_pred             cccchHHhHhhHhhhccccch--hhhhhhhheeecccc-----EEEEEehHHHHHhhcch
Confidence            346777766665555567987  999998876653322     34578888988876664


No 148
>KOG0717|consensus
Probab=40.31  E-value=20  Score=28.11  Aligned_cols=21  Identities=33%  Similarity=0.589  Sum_probs=17.7

Q ss_pred             eecccccccccccCHHHHHHHHH
Q psy6598           4 FFYFPWDCKISFASATNYKNHMR   26 (154)
Q Consensus         4 ~~C~~~~C~~~f~~~~~l~~H~~   26 (154)
                      +-|.  .|.++|.+...|..|..
T Consensus       293 lyC~--vCnKsFKseKq~kNHEn  313 (508)
T KOG0717|consen  293 LYCV--VCNKSFKSEKQLKNHEN  313 (508)
T ss_pred             eEEe--eccccccchHHHHhhHH
Confidence            6788  99999999888888865


No 149
>PF05191 ADK_lid:  Adenylate kinase, active site lid;  InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=40.19  E-value=6.6  Score=19.00  Aligned_cols=9  Identities=11%  Similarity=0.316  Sum_probs=5.2

Q ss_pred             ecccccccccc
Q psy6598           5 FYFPWDCKISF   15 (154)
Q Consensus         5 ~C~~~~C~~~f   15 (154)
                      .|.  .||+.|
T Consensus         3 ~C~--~Cg~~Y   11 (36)
T PF05191_consen    3 ICP--KCGRIY   11 (36)
T ss_dssp             EET--TTTEEE
T ss_pred             CcC--CCCCcc
Confidence            455  666655


No 150
>PLN00104 MYST -like histone acetyltransferase; Provisional
Probab=39.39  E-value=24  Score=27.63  Aligned_cols=30  Identities=17%  Similarity=0.353  Sum_probs=24.5

Q ss_pred             CCceeCCCCcccccCchhHHHHHHhhcCcC
Q psy6598          61 IRPFICDRCPRSYRQLCTLNVHKKTNHRES   90 (154)
Q Consensus        61 ~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~   90 (154)
                      ...|.|..|-+-|.....|.+|+....-..
T Consensus       196 ~~lyiCe~Cl~y~~~~~~~~~H~~~C~~~~  225 (450)
T PLN00104        196 SKLYFCEFCLKFMKRKEQLQRHMKKCDLKH  225 (450)
T ss_pred             CeEEEchhhhhhhcCHHHHHHHHhcCCCCC
Confidence            345999999999999999999998654333


No 151
>KOG2593|consensus
Probab=39.25  E-value=26  Score=27.17  Aligned_cols=36  Identities=11%  Similarity=0.372  Sum_probs=25.0

Q ss_pred             CeecccccccccccCHHHHHHHHHHhcCCCceeecccccCccc
Q psy6598           3 PFFYFPWDCKISFASATNYKNHMRIHSGEKPYVCQVRDCQKRF   45 (154)
Q Consensus         3 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f   45 (154)
                      .|.|+  .|.+.|.....++.-   -...-.|.|..  |+.-.
T Consensus       128 ~Y~Cp--~C~kkyt~Lea~~L~---~~~~~~F~C~~--C~gel  163 (436)
T KOG2593|consen  128 GYVCP--NCQKKYTSLEALQLL---DNETGEFHCEN--CGGEL  163 (436)
T ss_pred             cccCC--ccccchhhhHHHHhh---cccCceEEEec--CCCch
Confidence            59999  999999876655532   22233599988  98543


No 152
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=39.18  E-value=15  Score=23.66  Aligned_cols=15  Identities=13%  Similarity=0.419  Sum_probs=10.7

Q ss_pred             CceeecccccCcccCCh
Q psy6598          32 KPYVCQVRDCQKRFTEY   48 (154)
Q Consensus        32 ~~~~C~~~~C~~~f~~~   48 (154)
                      ..+.|..  ||..|...
T Consensus        69 ~~~~C~~--CG~~~~~~   83 (135)
T PRK03824         69 AVLKCRN--CGNEWSLK   83 (135)
T ss_pred             eEEECCC--CCCEEecc
Confidence            3488866  99887664


No 153
>PF04423 Rad50_zn_hook:  Rad50 zinc hook motif;  InterPro: IPR007517 The Mre11 complex (Mre11 Rad50 Nbs1) is central to chromosomal maintenance and functions in homologous recombination, telomere maintenance and sister chromatid association. The Rad50 coiled-coil region contains a dimer interface at the apex of the coiled coils in which pairs of conserved Cys-X-X-Cys motifs form interlocking hooks that bind one Zn ion. This alignment includes the zinc hook motif and a short stretch of coiled-coil on either side.; GO: 0004518 nuclease activity, 0005524 ATP binding, 0008270 zinc ion binding, 0006281 DNA repair; PDB: 1L8D_B.
Probab=39.08  E-value=14  Score=19.46  Aligned_cols=12  Identities=25%  Similarity=0.587  Sum_probs=6.4

Q ss_pred             eCCCCcccccCc
Q psy6598          65 ICDRCPRSYRQL   76 (154)
Q Consensus        65 ~C~~C~~~f~~~   76 (154)
                      .|+.|++.|...
T Consensus        22 ~CPlC~r~l~~e   33 (54)
T PF04423_consen   22 CCPLCGRPLDEE   33 (54)
T ss_dssp             E-TTT--EE-HH
T ss_pred             cCCCCCCCCCHH
Confidence            899999988644


No 154
>PTZ00064 histone acetyltransferase; Provisional
Probab=38.96  E-value=28  Score=27.69  Aligned_cols=30  Identities=13%  Similarity=0.088  Sum_probs=24.6

Q ss_pred             CCceeCCCCcccccCchhHHHHHHhhcCcC
Q psy6598          61 IRPFICDRCPRSYRQLCTLNVHKKTNHRES   90 (154)
Q Consensus        61 ~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~   90 (154)
                      ...|.|..|-+.|.....|.+|+....-..
T Consensus       278 d~LYICEfCLkY~~s~~~l~rH~~~C~~rh  307 (552)
T PTZ00064        278 DTLHFCEYCLDFFCFEDELIRHLSRCQLRH  307 (552)
T ss_pred             CeEEEccchhhhhCCHHHHHHHHhcCCCCC
Confidence            345999999999999999999988654333


No 155
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=38.58  E-value=4.2  Score=33.35  Aligned_cols=57  Identities=18%  Similarity=0.354  Sum_probs=35.9

Q ss_pred             cccccccccccCHHHHHHHHHHhcCCCceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCccc
Q psy6598           6 YFPWDCKISFASATNYKNHMRIHSGEKPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRS   72 (154)
Q Consensus         6 C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~   72 (154)
                      |.  .||-.|+-...|-.-+..-+-..--.|+.  |.+.|.....-+-    |  ..|..|+.||..
T Consensus       126 CT--~CGPRfTIi~alPYDR~nTsM~~F~lC~~--C~~EY~dP~nRRf----H--AQp~aCp~CGP~  182 (750)
T COG0068         126 CT--NCGPRFTIIEALPYDRENTSMADFPLCPF--CDKEYKDPLNRRF----H--AQPIACPKCGPH  182 (750)
T ss_pred             cC--CCCcceeeeccCCCCcccCccccCcCCHH--HHHHhcCcccccc----c--cccccCcccCCC
Confidence            88  89988876665544322111112235876  8888877765333    4  457899999974


No 156
>PF13451 zf-trcl:  Probable zinc-binding domain
Probab=38.18  E-value=15  Score=19.21  Aligned_cols=12  Identities=33%  Similarity=0.578  Sum_probs=5.8

Q ss_pred             CCeecccccccccc
Q psy6598           2 KPFFYFPWDCKISF   15 (154)
Q Consensus         2 k~~~C~~~~C~~~f   15 (154)
                      +.+.|.  .||..|
T Consensus         3 k~l~C~--dCg~~F   14 (49)
T PF13451_consen    3 KTLTCK--DCGAEF   14 (49)
T ss_pred             eeEEcc--cCCCeE
Confidence            344555  555444


No 157
>TIGR00100 hypA hydrogenase nickel insertion protein HypA. In Hpylori, hypA mutant abolished hydrogenase activity and decrease in urease activity. Nickel supplementation in media restored urease activity and partial hydrogenase activity. HypA probably involved in inserting Ni in enzymes.
Probab=37.68  E-value=20  Score=22.33  Aligned_cols=24  Identities=17%  Similarity=0.578  Sum_probs=14.3

Q ss_pred             eeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcc
Q psy6598          34 YVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPR   71 (154)
Q Consensus        34 ~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~   71 (154)
                      ..|..  |+..|.....            .+.|+.||.
T Consensus        71 ~~C~~--Cg~~~~~~~~------------~~~CP~Cgs   94 (115)
T TIGR00100        71 CECED--CSEEVSPEID------------LYRCPKCHG   94 (115)
T ss_pred             EEccc--CCCEEecCCc------------CccCcCCcC
Confidence            66755  7766655432            256777764


No 158
>PF05495 zf-CHY:  CHY zinc finger;  InterPro: IPR008913 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  Pirh2 is an eukaryotic ubiquitin protein ligase, which has been shown to promote p53 degradation in mammals. Pirh2 physically interacts with p53 and promotes ubiquitination of p53 independently of MDM2. Like MDM2, Pirh2 is thought to participate in an autoregulatory feedback loop that controls p53 function. Pirh2 proteins contain three distinct zinc fingers, the CHY-type, the CTCHY-type which is C-terminal to the CHY-type zinc finger and a RING finger. The CHY-type zinc finger has no currently known function []. As well as Pirh2, the CHY-type zinc finger is also found in the following proteins:   Yeast helper of Tim protein 13. Hot13 may have a role in the assembly and recycling of the small Tims, a complex of the mitochondrial intermembrane space that participates in the TIM22 import pathway for assembly of the inner membrane [] Several plant hypothetical proteins that also contain haemerythrin cation binding domains Several protozoan hypothetical proteins that also contain a Myb domain   The solution structure of this zinc finger has been solved and binds 3 zinc atoms as shown in the following schematic representation:   ++---------+-----+ || | | CXHYxxxxxxxxxCCxxxxxCxxCHxxxxxHxxxxxxxxxxxCxxCxxxxxxxxxCxxC | | | | | | | | +-+-----------------+--+ +--+---------+--+ 'C': conserved cysteine involved in the binding of one zinc atom. 'H': conserved histidine involved in the binding of one zinc atom.  More information about these proteins can be found at Protein of the Month: Zinc Fingers []; GO: 0008270 zinc ion binding; PDB: 2DKT_A 2K2C_A.
Probab=37.67  E-value=9.7  Score=21.47  Aligned_cols=31  Identities=16%  Similarity=0.387  Sum_probs=13.1

Q ss_pred             CceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCccc
Q psy6598          32 KPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRS   72 (154)
Q Consensus        32 ~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~   72 (154)
                      ..-.|..  |+..+.....       ..+ ..+.|+.|+..
T Consensus        40 ~~v~Cg~--C~~~~~~~~~-------~c~-~~~~C~~C~~~   70 (71)
T PF05495_consen   40 KRVICGK--CRTEQPIDEY-------SCG-ADYFCPICGLY   70 (71)
T ss_dssp             -EEEETT--T--EEES-SB-------TT---SEEETTTTEE
T ss_pred             cCeECCC--CCCccChhhh-------hcC-CCccCcCcCCC
Confidence            3456655  6665554433       011 34666666544


No 159
>PF01780 Ribosomal_L37ae:  Ribosomal L37ae protein family;  InterPro: IPR002674 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits.  Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This ribosomal protein is found in archaebacteria and eukaryotes []. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A1E_Y 4A17_Y 4A1C_Y 4A1A_Y 3O58_g 3IZS_m 3O5H_g 1S1I_9 3IZR_m 1YSH_D ....
Probab=37.29  E-value=18  Score=21.53  Aligned_cols=31  Identities=23%  Similarity=0.676  Sum_probs=20.3

Q ss_pred             CceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCccccc
Q psy6598          32 KPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYR   74 (154)
Q Consensus        32 ~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~   74 (154)
                      ..|.|.+  |++.-..        +...|  .+.|..|++.|.
T Consensus        34 ~ky~Cp~--Cgk~~vk--------R~a~G--IW~C~~C~~~~A   64 (90)
T PF01780_consen   34 AKYTCPF--CGKTSVK--------RVATG--IWKCKKCGKKFA   64 (90)
T ss_dssp             S-BEESS--SSSSEEE--------EEETT--EEEETTTTEEEE
T ss_pred             CCCcCCC--CCCceeE--------EeeeE--EeecCCCCCEEe
Confidence            4589988  9976322        22333  389999998875


No 160
>KOG2636|consensus
Probab=36.29  E-value=25  Score=27.37  Aligned_cols=28  Identities=18%  Similarity=0.318  Sum_probs=22.6

Q ss_pred             hhhcCCCceeCCCCc-ccccCchhHHHHH
Q psy6598          56 LVHSDIRPFICDRCP-RSYRQLCTLNVHK   83 (154)
Q Consensus        56 ~~h~~~~~~~C~~C~-~~f~~~~~l~~H~   83 (154)
                      +.|--..-|.|.+|| +++.=..++.+|-
T Consensus       394 KLHGL~~ey~CEICGNy~Y~GrkaF~RHF  422 (497)
T KOG2636|consen  394 KLHGLDIEYNCEICGNYVYKGRKAFDRHF  422 (497)
T ss_pred             hhcCCCcccceeeccCccccCcHHHHHHh
Confidence            345556679999999 8899999999994


No 161
>PF12760 Zn_Tnp_IS1595:  Transposase zinc-ribbon domain;  InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=36.16  E-value=49  Score=16.67  Aligned_cols=11  Identities=18%  Similarity=0.700  Sum_probs=7.8

Q ss_pred             CCceeCCCCcc
Q psy6598          61 IRPFICDRCPR   71 (154)
Q Consensus        61 ~~~~~C~~C~~   71 (154)
                      ...|+|..|++
T Consensus        35 ~~~~~C~~C~~   45 (46)
T PF12760_consen   35 RGRYRCKACRK   45 (46)
T ss_pred             CCeEECCCCCC
Confidence            45688888765


No 162
>KOG2747|consensus
Probab=36.14  E-value=22  Score=27.27  Aligned_cols=33  Identities=18%  Similarity=0.328  Sum_probs=26.8

Q ss_pred             CCceeCCCCcccccCchhHHHHHHhhcCcCCCC
Q psy6598          61 IRPFICDRCPRSYRQLCTLNVHKKTNHRESKNK   93 (154)
Q Consensus        61 ~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~   93 (154)
                      ...|.|.+|.+.+.....|.+|+....-..++.
T Consensus       156 ~~lYiCEfCLkY~~s~~~l~rH~~kC~~rhPPG  188 (396)
T KOG2747|consen  156 DKLYICEFCLKYMKSRTSLQRHLKKCKLRHPPG  188 (396)
T ss_pred             CeEEEehHHHhHhchHHHHHHHHHhcCCCCCCc
Confidence            456999999999999999999999875544443


No 163
>COG3364 Zn-ribbon containing protein [General function prediction only]
Probab=35.89  E-value=22  Score=21.63  Aligned_cols=13  Identities=23%  Similarity=0.406  Sum_probs=5.9

Q ss_pred             CeecccccccccccC
Q psy6598           3 PFFYFPWDCKISFAS   17 (154)
Q Consensus         3 ~~~C~~~~C~~~f~~   17 (154)
                      |++|.  .||..|.+
T Consensus         2 pH~Ct--rCG~vf~~   14 (112)
T COG3364           2 PHQCT--RCGEVFDD   14 (112)
T ss_pred             Cceec--cccccccc
Confidence            34444  44444443


No 164
>KOG2807|consensus
Probab=35.87  E-value=77  Score=23.81  Aligned_cols=25  Identities=16%  Similarity=0.600  Sum_probs=18.8

Q ss_pred             ceeCCCCcccccCchhHHHHHHhhc
Q psy6598          63 PFICDRCPRSYRQLCTLNVHKKTNH   87 (154)
Q Consensus        63 ~~~C~~C~~~f~~~~~l~~H~~~~~   87 (154)
                      .|.|..|...|-..-..-.|...|.
T Consensus       345 ~y~C~~Ck~~FCldCDv~iHesLh~  369 (378)
T KOG2807|consen  345 RYRCESCKNVFCLDCDVFIHESLHN  369 (378)
T ss_pred             cEEchhccceeeccchHHHHhhhhc
Confidence            4888888888887777777777653


No 165
>COG1773 Rubredoxin [Energy production and conversion]
Probab=35.57  E-value=19  Score=19.27  Aligned_cols=13  Identities=15%  Similarity=0.258  Sum_probs=8.0

Q ss_pred             CCeeccccccccccc
Q psy6598           2 KPFFYFPWDCKISFA   16 (154)
Q Consensus         2 k~~~C~~~~C~~~f~   16 (154)
                      +.|+|.  .||..|.
T Consensus         2 ~~~~C~--~CG~vYd   14 (55)
T COG1773           2 KRWRCS--VCGYVYD   14 (55)
T ss_pred             CceEec--CCceEec
Confidence            356666  6666664


No 166
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=35.35  E-value=19  Score=21.47  Aligned_cols=32  Identities=19%  Similarity=0.537  Sum_probs=20.5

Q ss_pred             CceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccC
Q psy6598          32 KPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQ   75 (154)
Q Consensus        32 ~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~   75 (154)
                      ..|.|.+  |++.-..+        ..  .-.+.|..|++.|.=
T Consensus        35 a~y~Cpf--Cgk~~vkR--------~a--~GIW~C~~C~~~~AG   66 (90)
T PTZ00255         35 AKYFCPF--CGKHAVKR--------QA--VGIWRCKGCKKTVAG   66 (90)
T ss_pred             CCccCCC--CCCCceee--------ee--eEEEEcCCCCCEEeC
Confidence            4588988  98642221        11  234899999988754


No 167
>COG1655 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=35.24  E-value=9.7  Score=26.87  Aligned_cols=39  Identities=21%  Similarity=0.430  Sum_probs=24.2

Q ss_pred             CCceeecccccCcccCChhhHHHHhhhhcCCC----------c-----eeCCCCcc
Q psy6598          31 EKPYVCQVRDCQKRFTEYSSLYKHTLVHSDIR----------P-----FICDRCPR   71 (154)
Q Consensus        31 ~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~----------~-----~~C~~C~~   71 (154)
                      ++.+.|++  |+..|....-+..-.|+-.|+-          |     ..|+.|+.
T Consensus        17 kk~ieCPv--C~tkFkkeev~tgsiRiiagDld~~lkygninP~fY~VvvCP~C~y   70 (267)
T COG1655          17 KKTIECPV--CNTKFKKEEVKTGSIRIIAGDLDFFLKYGNINPYFYDVVVCPICYY   70 (267)
T ss_pred             hceeccCc--ccchhhhhheeccceeEecccccceeeccccCCceeEEEEcchhhH
Confidence            34577877  8877777666555555433321          1     58999975


No 168
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=34.95  E-value=16  Score=22.52  Aligned_cols=25  Identities=20%  Similarity=0.417  Sum_probs=17.5

Q ss_pred             cccCcccCChhhHHHHhhhhcCCCceeCCCCccccc
Q psy6598          39 RDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYR   74 (154)
Q Consensus        39 ~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~   74 (154)
                      |+||+.|.-..           ..|..|++||++|.
T Consensus        13 Petg~KFYDLN-----------rdPiVsPytG~s~P   37 (129)
T COG4530          13 PETGKKFYDLN-----------RDPIVSPYTGKSYP   37 (129)
T ss_pred             ccccchhhccC-----------CCccccCcccccch
Confidence            34888776532           35788899999884


No 169
>KOG2071|consensus
Probab=34.44  E-value=26  Score=28.24  Aligned_cols=26  Identities=27%  Similarity=0.519  Sum_probs=20.0

Q ss_pred             CCeecccccccccccCHHHHHHHHHHhc
Q psy6598           2 KPFFYFPWDCKISFASATNYKNHMRIHS   29 (154)
Q Consensus         2 k~~~C~~~~C~~~f~~~~~l~~H~~~h~   29 (154)
                      ++-.|.  .||..|........||..|.
T Consensus       417 ~pnqC~--~CG~R~~~~ee~sk~md~H~  442 (579)
T KOG2071|consen  417 SPNQCK--SCGLRFDDSEERSKHMDIHD  442 (579)
T ss_pred             Ccchhc--ccccccccchhhhhHhhhhh
Confidence            467899  99999988877777666653


No 170
>PF06524 NOA36:  NOA36 protein;  InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=33.94  E-value=29  Score=25.04  Aligned_cols=24  Identities=17%  Similarity=0.533  Sum_probs=11.6

Q ss_pred             CceeCCCCcccccCchhHHHHHHh
Q psy6598          62 RPFICDRCPRSYRQLCTLNVHKKT   85 (154)
Q Consensus        62 ~~~~C~~C~~~f~~~~~l~~H~~~   85 (154)
                      +++.|+.||........|..-.|+
T Consensus       208 k~~PCPKCg~et~eTkdLSmStR~  231 (314)
T PF06524_consen  208 KPIPCPKCGYETQETKDLSMSTRS  231 (314)
T ss_pred             CCCCCCCCCCcccccccceeeeec
Confidence            455555555554444444433333


No 171
>COG4391 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=33.83  E-value=19  Score=19.75  Aligned_cols=43  Identities=28%  Similarity=0.528  Sum_probs=25.5

Q ss_pred             HHHHhcCCCceeecccccCcccCChhhHHHHhhhh---cCCCceeCCCCcccccC
Q psy6598          24 HMRIHSGEKPYVCQVRDCQKRFTEYSSLYKHTLVH---SDIRPFICDRCPRSYRQ   75 (154)
Q Consensus        24 H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h---~~~~~~~C~~C~~~f~~   75 (154)
                      |..++.+..++.|.-  =+-.|       .|-++.   -.+.-..|++|+..|..
T Consensus        15 ~~~I~~~~~~l~C~g--~~~p~-------~HPrV~L~mg~~gev~CPYC~t~y~l   60 (62)
T COG4391          15 HETIEIGDLPLMCPG--PEPPN-------DHPRVFLDMGDEGEVVCPYCSTRYRL   60 (62)
T ss_pred             ceEEEeCCeeEEcCC--CCCCC-------CCCEEEEEcCCCCcEecCccccEEEe
Confidence            555667788888854  32222       232221   22344799999998864


No 172
>PF09963 DUF2197:  Uncharacterized protein conserved in bacteria (DUF2197);  InterPro: IPR019241  This family represents various hypothetical bacterial proteins with no known function. 
Probab=32.91  E-value=37  Score=18.27  Aligned_cols=6  Identities=17%  Similarity=0.119  Sum_probs=2.7

Q ss_pred             cccccc
Q psy6598          10 DCKISF   15 (154)
Q Consensus        10 ~C~~~f   15 (154)
                      .|++.+
T Consensus         7 lCdk~~   12 (56)
T PF09963_consen    7 LCDKKE   12 (56)
T ss_pred             ecCCEE
Confidence            454443


No 173
>PLN03238 probable histone acetyltransferase MYST; Provisional
Probab=32.53  E-value=49  Score=24.29  Aligned_cols=25  Identities=32%  Similarity=0.619  Sum_probs=20.7

Q ss_pred             CCceeecccccCcccCChhhHHHHhhh
Q psy6598          31 EKPYVCQVRDCQKRFTEYSSLYKHTLV   57 (154)
Q Consensus        31 ~~~~~C~~~~C~~~f~~~~~l~~H~~~   57 (154)
                      ..-|.|..  |-+.|.+...|.+|+..
T Consensus        46 ~~lyiCe~--Clky~~~~~~l~~H~~~   70 (290)
T PLN03238         46 TKLYICEY--CLKYMRKKKSLLRHLAK   70 (290)
T ss_pred             CeEEEcCC--CcchhCCHHHHHHHHHh
Confidence            34589988  99999999999999763


No 174
>PF01215 COX5B:  Cytochrome c oxidase subunit Vb This family consists of chains F and S ;  InterPro: IPR002124 Cytochrome c oxidase (1.9.3.1 from EC) is an oligomeric enzymatic complex which is a component of the respiratory chain complex and is involved in the transfer of electrons from cytochrome c to oxygen []. In eukaryotes this enzyme complex is located in the mitochondrial inner membrane; in aerobic prokaryotes it is found in the plasma membrane.  In eukaryotes, in addition to the three large subunits, I, II and III, that form the catalytic centre of the enzyme complex, there are a variable number of small polypeptidic subunits. One of these subunits, which is known as Vb in mammals, V in Dictyostelium discoideum (Slime mold) and IV in yeast, binds a zinc atom. The sequence of subunit Vb is well conserved and includes three conserved cysteines that coordinate the zinc ion [, ]. Two of these cysteines are clustered in the C-terminal section of the subunit.; GO: 0004129 cytochrome-c oxidase activity, 0005740 mitochondrial envelope; PDB: 2EIL_S 2ZXW_S 3ASN_S 1OCO_S 3AG4_S 3ABK_S 1OCZ_S 1OCC_F 3ASO_S 3ABL_S ....
Probab=32.33  E-value=20  Score=23.17  Aligned_cols=19  Identities=21%  Similarity=0.684  Sum_probs=14.6

Q ss_pred             hhcCCCceeCCCCcccccCc
Q psy6598          57 VHSDIRPFICDRCPRSYRQL   76 (154)
Q Consensus        57 ~h~~~~~~~C~~C~~~f~~~   76 (154)
                      .+.+ ++.+|..||..|...
T Consensus       107 l~~g-~~~RCpeCG~~fkL~  125 (136)
T PF01215_consen  107 LHKG-KPQRCPECGQVFKLK  125 (136)
T ss_dssp             EETT-SEEEETTTEEEEEEE
T ss_pred             EeCC-CccCCCCCCeEEEEE
Confidence            4444 589999999998754


No 175
>TIGR00280 L37a ribosomal protein L37a. This model finds eukaryotic ribosomal protein L37a and its archaeal orthologs. The nomeclature is tricky because eukaryotes have proteins called both L37 and L37a.
Probab=32.27  E-value=20  Score=21.39  Aligned_cols=33  Identities=21%  Similarity=0.531  Sum_probs=20.8

Q ss_pred             CceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccCc
Q psy6598          32 KPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQL   76 (154)
Q Consensus        32 ~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~   76 (154)
                      ..|.|.+  |++.-..        +.-  .-.+.|..|++.|.=.
T Consensus        34 a~y~Cpf--Cgk~~vk--------R~a--~GIW~C~~C~~~~AGG   66 (91)
T TIGR00280        34 AKYVCPF--CGKKTVK--------RGS--TGIWTCRKCGAKFAGG   66 (91)
T ss_pred             cCccCCC--CCCCceE--------EEe--eEEEEcCCCCCEEeCC
Confidence            4588988  9865222        112  3348999999887543


No 176
>PRK12380 hydrogenase nickel incorporation protein HybF; Provisional
Probab=32.17  E-value=29  Score=21.51  Aligned_cols=24  Identities=17%  Similarity=0.336  Sum_probs=14.0

Q ss_pred             eeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcc
Q psy6598          34 YVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPR   71 (154)
Q Consensus        34 ~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~   71 (154)
                      +.|..  ||..|....            ..+.|+.||.
T Consensus        71 ~~C~~--Cg~~~~~~~------------~~~~CP~Cgs   94 (113)
T PRK12380         71 AWCWD--CSQVVEIHQ------------HDAQCPHCHG   94 (113)
T ss_pred             EEccc--CCCEEecCC------------cCccCcCCCC
Confidence            66755  776655432            2255777773


No 177
>smart00440 ZnF_C2C2 C2C2 Zinc finger. Nucleic-acid-binding motif in transcriptional elongation factor TFIIS and RNA polymerases.
Probab=32.10  E-value=8.8  Score=18.96  Aligned_cols=10  Identities=20%  Similarity=1.225  Sum_probs=4.5

Q ss_pred             eeCCCCcccc
Q psy6598          64 FICDRCPRSY   73 (154)
Q Consensus        64 ~~C~~C~~~f   73 (154)
                      |.|..|+..|
T Consensus        29 y~C~~C~~~w   38 (40)
T smart00440       29 YVCTKCGHRW   38 (40)
T ss_pred             EEeCCCCCEe
Confidence            4444444433


No 178
>PF10537 WAC_Acf1_DNA_bd:  ATP-utilising chromatin assembly and remodelling N-terminal;  InterPro: IPR013136 ACF (for ATP-utilising chromatin assembly and remodeling factor) is a chromatin-remodeling complex that catalyzes the ATP-dependent assembly of periodic nucleosome arrays. This reaction utilises the energy of ATP hydrolysis by ISWI, the smaller of the two subunits of ACF. Acf1, the large subunit of ACF, is essential for the full activity of the complex. The WAC (WSTF/Acf1/cbp146) domain is an ~110-residue module present at the N-termini of Acf1-related proteins in a variety of organisms. It is found in association with other domains such as the bromodomain, the PHD-type zinc finger, DDT or WAKS. The DNA-binding region of Acf1 includes the WAC domain, which is necessary for the efficient binding of ACF complex to DNA. It seems probable that the WAC domain will be involved in DNA binding in other related factors [, ].   Some proteins known to contain a WAC domain are the Drosophila melanogaster (Fruit fly) ATP-dependent chromatin assembly factor large subunit Acf1, human WSTF (Williams syndrome transcription factor), mouse cbp146, yeast imitation switch two complex protein 1 (ITC1 or YGL133w), and yeast protein YPL216w. 
Probab=32.00  E-value=1.1e+02  Score=18.72  Aligned_cols=42  Identities=14%  Similarity=0.239  Sum_probs=27.4

Q ss_pred             CCCeecccccccccccCHHHHHHHHHHhcCCCceeecccccCcccCC
Q psy6598           1 MKPFFYFPWDCKISFASATNYKNHMRIHSGEKPYVCQVRDCQKRFTE   47 (154)
Q Consensus         1 ~k~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~~f~~   47 (154)
                      |..|.|+  .-|..|.+...+...+..- ..+.|.|..  .|+.-.+
T Consensus         1 eeVf~i~--~T~EiF~dYe~Y~~R~~~y-~~~vwtC~~--TGk~~LT   42 (102)
T PF10537_consen    1 EEVFYIP--FTGEIFRDYEEYLKRMILY-NQRVWTCEI--TGKSNLT   42 (102)
T ss_pred             CceEEeC--CCCcccCCHHHHHHHHHHH-hCCeeEEec--CCCCCCC
Confidence            4567777  7888887776666555443 356688877  7764333


No 179
>PF04780 DUF629:  Protein of unknown function (DUF629);  InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=31.72  E-value=42  Score=26.48  Aligned_cols=25  Identities=24%  Similarity=0.462  Sum_probs=21.5

Q ss_pred             eecccccccccccCHHHHHHHHH-HhcC
Q psy6598           4 FFYFPWDCKISFASATNYKNHMR-IHSG   30 (154)
Q Consensus         4 ~~C~~~~C~~~f~~~~~l~~H~~-~h~~   30 (154)
                      +.|+  .|.+.|.+...+..|+. .|.+
T Consensus        58 WiCp--~CskkF~d~~~~~~H~~~eH~~   83 (466)
T PF04780_consen   58 WICP--RCSKKFSDAESCLSHMEQEHPA   83 (466)
T ss_pred             eeCC--cccceeCCHHHHHHHHHHhhhh
Confidence            5688  99999999999999987 5654


No 180
>PTZ00448 hypothetical protein; Provisional
Probab=31.70  E-value=30  Score=26.31  Aligned_cols=24  Identities=13%  Similarity=0.433  Sum_probs=21.1

Q ss_pred             ceeCCCCcccccCchhHHHHHHhh
Q psy6598          63 PFICDRCPRSYRQLCTLNVHKKTN   86 (154)
Q Consensus        63 ~~~C~~C~~~f~~~~~l~~H~~~~   86 (154)
                      .|.|..|+-.|......+.|+++-
T Consensus       314 ~~tC~~C~v~F~~~~~qR~H~KSD  337 (373)
T PTZ00448        314 MLLCRKCNIQLMDHNAFKQHYRSE  337 (373)
T ss_pred             CccccccccccCCHHHHHHHhhhh
Confidence            588999999999889999998864


No 181
>PF14446 Prok-RING_1:  Prokaryotic RING finger family 1
Probab=31.52  E-value=21  Score=19.07  Aligned_cols=9  Identities=22%  Similarity=0.892  Sum_probs=4.4

Q ss_pred             eeCCCCccc
Q psy6598          64 FICDRCPRS   72 (154)
Q Consensus        64 ~~C~~C~~~   72 (154)
                      ..|+.|+..
T Consensus        22 VvCp~Cgap   30 (54)
T PF14446_consen   22 VVCPECGAP   30 (54)
T ss_pred             EECCCCCCc
Confidence            455555543


No 182
>COG1675 TFA1 Transcription initiation factor IIE, alpha subunit [Transcription]
Probab=30.57  E-value=74  Score=21.57  Aligned_cols=40  Identities=18%  Similarity=0.544  Sum_probs=24.6

Q ss_pred             HHHhcCCCceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccC
Q psy6598          25 MRIHSGEKPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQ   75 (154)
Q Consensus        25 ~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~   75 (154)
                      +..-....-|.|..  |.-.|+.-.+..      .   .|.|+.||-....
T Consensus       105 le~~~~~~~y~C~~--~~~r~sfdeA~~------~---~F~Cp~Cg~~L~~  144 (176)
T COG1675         105 LEKETENNYYVCPN--CHVKYSFDEAME------L---GFTCPKCGEDLEE  144 (176)
T ss_pred             HHhhccCCceeCCC--CCCcccHHHHHH------h---CCCCCCCCchhhh
Confidence            33345567799954  776666544432      1   2999999975433


No 183
>KOG1994|consensus
Probab=30.33  E-value=34  Score=24.15  Aligned_cols=23  Identities=26%  Similarity=0.710  Sum_probs=19.7

Q ss_pred             CceeecccccCcccCChhhHHHHhh
Q psy6598          32 KPYVCQVRDCQKRFTEYSSLYKHTL   56 (154)
Q Consensus        32 ~~~~C~~~~C~~~f~~~~~l~~H~~   56 (154)
                      -.|-|-+  ||..|.....|..|..
T Consensus       238 eh~YC~f--CG~~y~~~edl~ehCP  260 (268)
T KOG1994|consen  238 EHYYCFF--CGIKYKDEEDLYEHCP  260 (268)
T ss_pred             cceEEEE--eccccCCHHHHHHhCC
Confidence            3488999  9999999999998853


No 184
>PF08209 Sgf11:  Sgf11 (transcriptional regulation protein);  InterPro: IPR013246 The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae (Baker's yeast). The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation [].; PDB: 3M99_B 2LO2_A 3MHH_C 3MHS_C.
Probab=30.21  E-value=69  Score=15.13  Aligned_cols=24  Identities=17%  Similarity=0.271  Sum_probs=13.4

Q ss_pred             ceeCCCCcccccCchhHHHHHHhhc
Q psy6598          63 PFICDRCPRSYRQLCTLNVHKKTNH   87 (154)
Q Consensus        63 ~~~C~~C~~~f~~~~~l~~H~~~~~   87 (154)
                      -|.|+.|++.+. ...+..|+...-
T Consensus         4 ~~~C~nC~R~v~-a~RfA~HLekCm   27 (33)
T PF08209_consen    4 YVECPNCGRPVA-ASRFAPHLEKCM   27 (33)
T ss_dssp             EEE-TTTSSEEE-GGGHHHHHHHHT
T ss_pred             eEECCCCcCCcc-hhhhHHHHHHHH
Confidence            367777777543 344566666543


No 185
>smart00647 IBR In Between Ring fingers. the domains occurs between pairs og RING fingers
Probab=30.06  E-value=41  Score=17.79  Aligned_cols=34  Identities=18%  Similarity=0.385  Sum_probs=20.9

Q ss_pred             eecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccCc
Q psy6598          35 VCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQL   76 (154)
Q Consensus        35 ~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~   76 (154)
                      -|..++|+.......        ..+.....|+.|+..|=..
T Consensus        20 ~CP~~~C~~~~~~~~--------~~~~~~v~C~~C~~~fC~~   53 (64)
T smart00647       20 WCPAPDCSAAIIVTE--------EEGCNRVTCPKCGFSFCFR   53 (64)
T ss_pred             CCCCCCCcceEEecC--------CCCCCeeECCCCCCeECCC
Confidence            376456877655432        2234568888888877544


No 186
>KOG4377|consensus
Probab=29.83  E-value=18  Score=27.83  Aligned_cols=25  Identities=20%  Similarity=0.461  Sum_probs=17.6

Q ss_pred             eeecccccCcccCChhhHHHHhhhh
Q psy6598          34 YVCQVRDCQKRFTEYSSLYKHTLVH   58 (154)
Q Consensus        34 ~~C~~~~C~~~f~~~~~l~~H~~~h   58 (154)
                      |.|.+++|+..+...+.+..|.+.|
T Consensus       402 fhc~r~Gc~~tl~s~sqm~shkrkh  426 (480)
T KOG4377|consen  402 FHCDRLGCEATLYSVSQMASHKRKH  426 (480)
T ss_pred             eeecccCCceEEEehhhhhhhhhhh
Confidence            5666677777777777777776655


No 187
>PF10263 SprT-like:  SprT-like family;  InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases. 
Probab=29.78  E-value=20  Score=23.20  Aligned_cols=32  Identities=19%  Similarity=0.564  Sum_probs=17.4

Q ss_pred             ceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCccccc
Q psy6598          33 PYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYR   74 (154)
Q Consensus        33 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~   74 (154)
                      .|.|..  |+..+...      .+.  ....|.|..|+..|.
T Consensus       123 ~~~C~~--C~~~~~r~------~~~--~~~~~~C~~C~~~l~  154 (157)
T PF10263_consen  123 VYRCPS--CGREYKRH------RRS--KRKRYRCGRCGGPLV  154 (157)
T ss_pred             EEEcCC--CCCEeeee------ccc--chhhEECCCCCCEEE
Confidence            467754  77665332      222  223377887876553


No 188
>KOG4727|consensus
Probab=29.49  E-value=35  Score=22.93  Aligned_cols=22  Identities=14%  Similarity=0.442  Sum_probs=19.0

Q ss_pred             CeecccccccccccCHHHHHHHHH
Q psy6598           3 PFFYFPWDCKISFASATNYKNHMR   26 (154)
Q Consensus         3 ~~~C~~~~C~~~f~~~~~l~~H~~   26 (154)
                      -|-|.  +|+-+|...-++..|+.
T Consensus        75 GyyCd--VCdcvvKDSinflDHiN   96 (193)
T KOG4727|consen   75 GYYCD--VCDCVVKDSINFLDHIN   96 (193)
T ss_pred             ceeee--ecceeehhhHHHHHHhc
Confidence            38899  99999999988888875


No 189
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=29.37  E-value=41  Score=22.17  Aligned_cols=32  Identities=31%  Similarity=0.857  Sum_probs=17.2

Q ss_pred             ceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcc
Q psy6598          33 PYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPR   71 (154)
Q Consensus        33 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~   71 (154)
                      +|.|.   |+..|.+   ..+|-..-.|+ .|.|..|+-
T Consensus       117 ~Y~C~---C~q~~l~---~RRhn~~~~g~-~YrC~~C~g  148 (156)
T COG3091         117 PYRCQ---CQQHYLR---IRRHNTVRRGE-VYRCGKCGG  148 (156)
T ss_pred             eEEee---cCCccch---hhhcccccccc-eEEeccCCc
Confidence            46664   6665444   23343444444 577777754


No 190
>PLN03239 histone acetyltransferase; Provisional
Probab=28.87  E-value=36  Score=25.70  Aligned_cols=25  Identities=8%  Similarity=0.103  Sum_probs=22.2

Q ss_pred             CCceeCCCCcccccCchhHHHHHHh
Q psy6598          61 IRPFICDRCPRSYRQLCTLNVHKKT   85 (154)
Q Consensus        61 ~~~~~C~~C~~~f~~~~~l~~H~~~   85 (154)
                      ...|.|..|.+-|.+...|.+|+..
T Consensus       104 ~~lYiCE~Clky~~~~~~l~~H~~~  128 (351)
T PLN03239        104 DVLYVCEFSFGFFARKSELLRFQAK  128 (351)
T ss_pred             ceEEEeccchhhhcCHHHHHHHHHh
Confidence            4679999999999999999999864


No 191
>PF03145 Sina:  Seven in absentia protein family;  InterPro: IPR018121 The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results in the transformation of the R7 precursor cell to a non-neuronal cell type. The Sina protein contains an N-terminal RING finger domain C3HC4-type. Through this domain, Sina binds E2 ubiquitin-conjugating enzymes (UbcD1) Sina also interacts with Tramtrack (TTK88) via PHYL. Tramtrack is a transcriptional repressor that blocks photoreceptor determination, while PHYL down-regulates the activity of TTK88. In turn, the activity of PHYL requires the activation of the Sevenless receptor tyrosine kinase, a process essential for R7 determination. It is thought that Sina targets TTK88 for degradation, therefore promoting the R7 pathway. Murine and human homologues of Sina have also been identified. The human homologue Siah-1 [] also binds E2 enzymes (UbcH5) and through a series of physical interactions, targets beta-catenin for ubiquitin degradation. Siah-1 expression is enhanced by p53, itself promoted by DNA damage. Thus this pathway links DNA damage to beta-catenin degradation [, ]. Sina proteins, therefore, physically interact with a variety of proteins. The N-terminal RING finger domain that binds ubiquitin conjugating enzymes is a C3HC4-type, and does not form part of the alignment for this family. The remainder C-terminal part is involved in interactions with other proteins, and is included in this alignment. In addition to the Drosophila protein and mammalian homologues, whose similarity was noted previously, this family also includes putative homologues from Caenorhabditis elegans, Arabidopsis thaliana.; GO: 0006511 ubiquitin-dependent protein catabolic process, 0007275 multicellular organismal development, 0005634 nucleus; PDB: 2A25_A 1K2F_B 2AN6_C.
Probab=28.24  E-value=83  Score=21.42  Aligned_cols=47  Identities=23%  Similarity=0.334  Sum_probs=17.5

Q ss_pred             ccCcccCChhhHHHHhhhhcCCCceeCCC----CcccccCchhHHHHHHhhcCc
Q psy6598          40 DCQKRFTEYSSLYKHTLVHSDIRPFICDR----CPRSYRQLCTLNVHKKTNHRE   89 (154)
Q Consensus        40 ~C~~~f~~~~~l~~H~~~h~~~~~~~C~~----C~~~f~~~~~l~~H~~~~~~~   89 (154)
                      ||...+.. .....|.... .-+|+.|+.    |+-. .....|..|....|..
T Consensus        23 GC~~~~~~-~~~~~HE~~C-~~~p~~CP~~~~~C~~~-G~~~~l~~Hl~~~H~~   73 (198)
T PF03145_consen   23 GCTETFPY-SEKREHEEEC-PFRPCSCPFPGSGCDWQ-GSYKELLDHLRDKHSW   73 (198)
T ss_dssp             T---EE-G-GGHHHHHHT--TTSEEE-SSSSTT---E-EECCCHHHHHHHHTTT
T ss_pred             CCcccccc-cChhhHhccC-CCcCCcCCCCCCCcccc-CCHHHHHHHHHHHCCC
Confidence            35444322 2344454432 234566655    4321 2334566666655554


No 192
>PTZ00043 cytochrome c oxidase subunit; Provisional
Probab=28.10  E-value=31  Score=24.38  Aligned_cols=17  Identities=12%  Similarity=0.423  Sum_probs=13.9

Q ss_pred             CCCceeCCCCcccccCc
Q psy6598          60 DIRPFICDRCPRSYRQL   76 (154)
Q Consensus        60 ~~~~~~C~~C~~~f~~~   76 (154)
                      ..++.+|..||..|...
T Consensus       178 EGkpqRCpECGqVFKLV  194 (268)
T PTZ00043        178 EGFLYRCGECDQIFMLV  194 (268)
T ss_pred             CCCCccCCCCCcEEEEE
Confidence            45689999999999764


No 193
>PRK00564 hypA hydrogenase nickel incorporation protein; Provisional
Probab=27.93  E-value=34  Score=21.36  Aligned_cols=13  Identities=23%  Similarity=0.608  Sum_probs=7.5

Q ss_pred             eeecccccCcccCCh
Q psy6598          34 YVCQVRDCQKRFTEY   48 (154)
Q Consensus        34 ~~C~~~~C~~~f~~~   48 (154)
                      +.|..  ||..|...
T Consensus        72 ~~C~~--Cg~~~~~~   84 (117)
T PRK00564         72 LECKD--CSHVFKPN   84 (117)
T ss_pred             EEhhh--CCCccccC
Confidence            56655  77655543


No 194
>PF07975 C1_4:  TFIIH C1-like domain;  InterPro: IPR004595 All proteins in this domain for which functions are known are components of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. It includes the yeast transcription factor Ssl1 (Suppressor of stem-loop protein 1) that is essential for translation initiation and affects UV resistance. The C-terminal region is essential for transcription activity. This regions binds three zinc atoms through two independent domain. The first contains a C4 zinc finger motif, whereas the second is characterised by a CX(2)CX(2-4)FCADCD motif. The solution structure of the second C-terminal domain revealed homology with the regulatory domain of protein kinase C [].; GO: 0006281 DNA repair, 0005634 nucleus; PDB: 1Z60_A.
Probab=27.65  E-value=19  Score=18.94  Aligned_cols=26  Identities=23%  Similarity=0.616  Sum_probs=11.8

Q ss_pred             CCceeecccccCcccCChhhHHHHhhhh
Q psy6598          31 EKPYVCQVRDCQKRFTEYSSLYKHTLVH   58 (154)
Q Consensus        31 ~~~~~C~~~~C~~~f~~~~~l~~H~~~h   58 (154)
                      ...|.|..  |+..|-..-.+-.|...|
T Consensus        19 ~~~y~C~~--C~~~FC~dCD~fiHE~LH   44 (51)
T PF07975_consen   19 SSRYRCPK--CKNHFCIDCDVFIHETLH   44 (51)
T ss_dssp             -EEE--TT--TT--B-HHHHHTTTTTS-
T ss_pred             CCeEECCC--CCCccccCcChhhhcccc
Confidence            34577765  777777766666664443


No 195
>PF04606 Ogr_Delta:  Ogr/Delta-like zinc finger;  InterPro: IPR007684 This entry is represented by Bacteriophage P2, Ogr. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This is a viral family of phage zinc-binding transcriptional activators, which also contains cryptic members in some bacterial genomes []. The P4 phage delta protein contains two such domains attached covalently, while the P2 phage Ogr proteins possess one domain but function as dimers. All the members of this family have the following consensus sequence: C-X(2)-C-X(3)-A-(X)2-R-X(15)-C-X(4)-C-X(3)-F [].; GO: 0006355 regulation of transcription, DNA-dependent
Probab=27.63  E-value=12  Score=19.17  Aligned_cols=17  Identities=18%  Similarity=0.401  Sum_probs=10.1

Q ss_pred             CceeecccccCcccCCh
Q psy6598          32 KPYVCQVRDCQKRFTEY   48 (154)
Q Consensus        32 ~~~~C~~~~C~~~f~~~   48 (154)
                      .-|.|....||..|...
T Consensus        24 ~Y~qC~N~~Cg~tfv~~   40 (47)
T PF04606_consen   24 LYCQCTNPECGHTFVAN   40 (47)
T ss_pred             EEEEECCCcCCCEEEEE
Confidence            34666655677766543


No 196
>PF13821 DUF4187:  Domain of unknown function (DUF4187)
Probab=27.30  E-value=1e+02  Score=16.38  Aligned_cols=19  Identities=16%  Similarity=0.494  Sum_probs=15.0

Q ss_pred             eeecccccCcccCChhhHHHH
Q psy6598          34 YVCQVRDCQKRFTEYSSLYKH   54 (154)
Q Consensus        34 ~~C~~~~C~~~f~~~~~l~~H   54 (154)
                      +-|-+  ||..|.....|.++
T Consensus        28 ~YC~~--Cg~~Y~d~~dL~~~   46 (55)
T PF13821_consen   28 NYCFW--CGTKYDDEEDLERN   46 (55)
T ss_pred             ceeee--eCCccCCHHHHHhC
Confidence            56888  99888888887654


No 197
>PF07150 DUF1390:  Protein of unknown function (DUF1390);  InterPro: IPR009820 This entry is represented by Paramecium bursaria Chlorella virus 1, A79R; it is a family of uncharacterised viral proteins.
Probab=27.13  E-value=60  Score=23.00  Aligned_cols=21  Identities=29%  Similarity=0.494  Sum_probs=10.2

Q ss_pred             eeCCCCcccccCchhHHHHHHh
Q psy6598          64 FICDRCPRSYRQLCTLNVHKKT   85 (154)
Q Consensus        64 ~~C~~C~~~f~~~~~l~~H~~~   85 (154)
                      |.| .||+.-...++..+|.++
T Consensus         8 Y~C-~CGYkT~~~gnAsKHkKt   28 (229)
T PF07150_consen    8 YTC-GCGYKTTNKGNASKHKKT   28 (229)
T ss_pred             EEe-cCCceecCCccccccCCC
Confidence            445 255544444555555443


No 198
>PF14447 Prok-RING_4:  Prokaryotic RING finger family 4
Probab=26.82  E-value=35  Score=18.28  Aligned_cols=13  Identities=15%  Similarity=0.232  Sum_probs=8.7

Q ss_pred             eeCCCCcccccCc
Q psy6598          64 FICDRCPRSYRQL   76 (154)
Q Consensus        64 ~~C~~C~~~f~~~   76 (154)
                      =-|++|++.|...
T Consensus        40 ngCPfC~~~~~~~   52 (55)
T PF14447_consen   40 NGCPFCGTPFEFD   52 (55)
T ss_pred             cCCCCCCCcccCC
Confidence            4677888777543


No 199
>KOG1280|consensus
Probab=26.59  E-value=71  Score=24.16  Aligned_cols=41  Identities=20%  Similarity=0.378  Sum_probs=0.0

Q ss_pred             hcCCCceeecccccCcccCChhhHHHHhhhhcCCCc--eeCCCCc
Q psy6598          28 HSGEKPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRP--FICDRCP   70 (154)
Q Consensus        28 h~~~~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~--~~C~~C~   70 (154)
                      |.....|.|.+  |+..-.+...|..|...-+.+-.  ..|++|+
T Consensus        74 ~y~~qSftCPy--C~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c~  116 (381)
T KOG1280|consen   74 HYDPQSFTCPY--CGIMGFTERQFGTHVLSQHPEASTSVICPLCA  116 (381)
T ss_pred             ccccccccCCc--ccccccchhHHHHHhhhcCcccCcceeeeccc


No 200
>PLN02748 tRNA dimethylallyltransferase
Probab=26.52  E-value=48  Score=26.23  Aligned_cols=21  Identities=24%  Similarity=0.559  Sum_probs=10.7

Q ss_pred             eeecccccCc-ccCChhhHHHHhh
Q psy6598          34 YVCQVRDCQK-RFTEYSSLYKHTL   56 (154)
Q Consensus        34 ~~C~~~~C~~-~f~~~~~l~~H~~   56 (154)
                      |.|..  |++ .+........|++
T Consensus       419 ~~Ce~--C~~~~~~G~~eW~~Hlk  440 (468)
T PLN02748        419 YVCEA--CGNKVLRGAHEWEQHKQ  440 (468)
T ss_pred             ccccC--CCCcccCCHHHHHHHhc
Confidence            34555  554 4555555555544


No 201
>PRK03976 rpl37ae 50S ribosomal protein L37Ae; Reviewed
Probab=26.50  E-value=28  Score=20.77  Aligned_cols=32  Identities=22%  Similarity=0.509  Sum_probs=20.1

Q ss_pred             CceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccC
Q psy6598          32 KPYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQ   75 (154)
Q Consensus        32 ~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~   75 (154)
                      ..|.|.+  |++.-..+        ..  .-.+.|..|++.|.-
T Consensus        35 a~y~Cpf--Cgk~~vkR--------~a--~GIW~C~~C~~~~AG   66 (90)
T PRK03976         35 AKHVCPV--CGRPKVKR--------VG--TGIWECRKCGAKFAG   66 (90)
T ss_pred             cCccCCC--CCCCceEE--------EE--EEEEEcCCCCCEEeC
Confidence            4588987  97652221        12  234889999988753


No 202
>PRK00432 30S ribosomal protein S27ae; Validated
Probab=26.42  E-value=48  Score=17.27  Aligned_cols=12  Identities=17%  Similarity=0.595  Sum_probs=8.2

Q ss_pred             CceeCCCCcccc
Q psy6598          62 RPFICDRCPRSY   73 (154)
Q Consensus        62 ~~~~C~~C~~~f   73 (154)
                      ..+.|..||..+
T Consensus        36 ~r~~C~~Cgyt~   47 (50)
T PRK00432         36 DRWHCGKCGYTE   47 (50)
T ss_pred             CcEECCCcCCEE
Confidence            457888887654


No 203
>KOG1813|consensus
Probab=26.26  E-value=41  Score=24.76  Aligned_cols=22  Identities=27%  Similarity=0.377  Sum_probs=12.3

Q ss_pred             eCCCCcc----cccCchhHHHHHHhh
Q psy6598          65 ICDRCPR----SYRQLCTLNVHKKTN   86 (154)
Q Consensus        65 ~C~~C~~----~f~~~~~l~~H~~~~   86 (154)
                      .|.+|++    .|.....|..-|...
T Consensus       277 ~c~vC~~~t~g~~~~akeL~~~L~~k  302 (313)
T KOG1813|consen  277 KCYVCSQQTHGSFNVAKELLVSLKLK  302 (313)
T ss_pred             cceecccccccccchHHHHHHHHHhh
Confidence            5556665    355555666555543


No 204
>PF09082 DUF1922:  Domain of unknown function (DUF1922);  InterPro: IPR015166 Members of this family consist of a beta-sheet region followed by an alpha-helix and an unstructured C terminus. The beta-sheet region contains a CXCX...XCXC sequence with Cys residues located in two proximal loops and pointing towards each other. This precise function of this set of bacterial proteins is, as yet, unknown []. ; PDB: 1GH9_A.
Probab=26.17  E-value=33  Score=19.25  Aligned_cols=16  Identities=13%  Similarity=0.443  Sum_probs=9.1

Q ss_pred             hhcCCCceeCCCCcccc
Q psy6598          57 VHSDIRPFICDRCPRSY   73 (154)
Q Consensus        57 ~h~~~~~~~C~~C~~~f   73 (154)
                      ...+.+.-+| .||+..
T Consensus        14 a~e~~kTkkC-~CG~~l   29 (68)
T PF09082_consen   14 AKEGAKTKKC-VCGKTL   29 (68)
T ss_dssp             EETT-SEEEE-TTTEEE
T ss_pred             ecCCcceeEe-cCCCee
Confidence            3444566677 777753


No 205
>PF01155 HypA:  Hydrogenase expression/synthesis hypA family;  InterPro: IPR000688 Bacterial membrane-bound nickel-dependent hydrogenases requires a number of accessory proteins which are involved in their maturation. The exact role of these proteins is not yet clear, but some seem to be required for the incorporation of the nickel ions []. One of these proteins is generally known as hypA. It is a protein of about 12 to 14 kDa that contains, in its C-terminal region, four conserved cysteines that form a zinc-finger like motif. Escherichia coli has two proteins that belong to this family, hypA and hybF. A homologue, MJ0214, has also been found in a number of archaeal species, including the genome of Methanocaldococcus jannaschii (Methanococcus jannaschii).; GO: 0016151 nickel ion binding, 0006464 protein modification process; PDB: 2KDX_A 3A44_D 3A43_B.
Probab=26.14  E-value=36  Score=21.05  Aligned_cols=25  Identities=28%  Similarity=0.603  Sum_probs=14.2

Q ss_pred             eeecccccCcccCChhhHHHHhhhhcCCCceeCCCCccc
Q psy6598          34 YVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRS   72 (154)
Q Consensus        34 ~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~   72 (154)
                      +.|..  ||..|......            +.|+.||..
T Consensus        71 ~~C~~--Cg~~~~~~~~~------------~~CP~Cgs~   95 (113)
T PF01155_consen   71 ARCRD--CGHEFEPDEFD------------FSCPRCGSP   95 (113)
T ss_dssp             EEETT--TS-EEECHHCC------------HH-SSSSSS
T ss_pred             EECCC--CCCEEecCCCC------------CCCcCCcCC
Confidence            67766  88777664321            448888754


No 206
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=25.11  E-value=22  Score=28.22  Aligned_cols=11  Identities=27%  Similarity=0.628  Sum_probs=7.2

Q ss_pred             CCceeCCCCcc
Q psy6598          61 IRPFICDRCPR   71 (154)
Q Consensus        61 ~~~~~C~~C~~   71 (154)
                      ..|..|+.||.
T Consensus       251 ~~~~~Cp~C~s  261 (505)
T TIGR00595       251 PIPKTCPQCGS  261 (505)
T ss_pred             CCCCCCCCCCC
Confidence            34567777765


No 207
>PF12230 PRP21_like_P:  Pre-mRNA splicing factor PRP21 like protein;  InterPro: IPR022030  This domain family is found in eukaryotes, and is typically between 212 and 238 amino acids in length. The family is found in association with PF01805 from PFAM. There are two completely conserved residues (W and H) that may be functionally important. PRP21 is required for assembly of the prespliceosome and it interacts with U2 snRNP and/or pre-mRNA in the prespliceosome. This family also contains proteins similar to PRP21, such as the mammalian SF3a. SF3a also interacts with U2 snRNP from the prespliceosome, converting it to its active form. ; PDB: 4DGW_B.
Probab=24.86  E-value=24  Score=24.72  Aligned_cols=34  Identities=12%  Similarity=0.147  Sum_probs=0.0

Q ss_pred             CCceeCCCCcccccCchhHHHHHHhhcCcCCCCCC
Q psy6598          61 IRPFICDRCPRSYRQLCTLNVHKKTNHRESKNKKP   95 (154)
Q Consensus        61 ~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~   95 (154)
                      ..-..|++||.... .+.|..|||+..-+..++..
T Consensus       166 ~~~~~cPitGe~IP-~~e~~eHmRi~LlDP~wkEq  199 (229)
T PF12230_consen  166 EKMIICPITGEMIP-ADEMDEHMRIELLDPRWKEQ  199 (229)
T ss_dssp             -----------------------------------
T ss_pred             cccccccccccccc-cccccccccccccccccccc
Confidence            34589999998754 56799999999888877643


No 208
>KOG2807|consensus
Probab=24.77  E-value=1.5e+02  Score=22.42  Aligned_cols=30  Identities=27%  Similarity=0.720  Sum_probs=21.9

Q ss_pred             ceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCc
Q psy6598          33 PYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCP   70 (154)
Q Consensus        33 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~   70 (154)
                      .|.|..  |...|...-....|...|      .|+.|.
T Consensus       345 ~y~C~~--Ck~~FCldCDv~iHesLh------~CpgCe  374 (378)
T KOG2807|consen  345 RYRCES--CKNVFCLDCDVFIHESLH------NCPGCE  374 (378)
T ss_pred             cEEchh--ccceeeccchHHHHhhhh------cCCCcC
Confidence            377766  888888888877776655      566664


No 209
>PF00301 Rubredoxin:  Rubredoxin;  InterPro: IPR004039 Rubredoxin is a low molecular weight iron-containing bacterial protein involved in electron transfer [, ], sometimes replacing ferredoxin as an electron carrier []. The 3-D structures of a number of rubredoxins have been solved [, ]. The fold belongs to the alpha+beta class, with 2 alpha-helices and 2-3 beta-strands. Its active site contains an iron ion which is co-ordinated by the sulphurs of four conserved cysteine residues forming an almost regular tetrahedron. The conserved cysteines reside on two loops, which are the most conserved regions of the protein. In addition, a ring of acidic residues in the proximity of the [Fe(Cys)4] centre is also well-conserved []. ; GO: 0009055 electron carrier activity, 0046872 metal ion binding; PDB: 2RDV_C 1RDV_A 1S24_A 1T9O_B 1B2J_A 1SMW_A 2PVE_B 1BFY_A 1T9P_C 1C09_C ....
Probab=24.74  E-value=32  Score=17.71  Aligned_cols=7  Identities=29%  Similarity=1.208  Sum_probs=2.7

Q ss_pred             eeCCCCc
Q psy6598          64 FICDRCP   70 (154)
Q Consensus        64 ~~C~~C~   70 (154)
                      |.|+.|+
T Consensus        35 w~CP~C~   41 (47)
T PF00301_consen   35 WVCPVCG   41 (47)
T ss_dssp             -B-TTTS
T ss_pred             CcCcCCC
Confidence            5555554


No 210
>COG5027 SAS2 Histone acetyltransferase (MYST family) [Chromatin structure and dynamics]
Probab=24.69  E-value=47  Score=25.14  Aligned_cols=31  Identities=23%  Similarity=0.430  Sum_probs=25.0

Q ss_pred             CceeCCCCcccccCchhHHHHHHhhcCcCCC
Q psy6598          62 RPFICDRCPRSYRQLCTLNVHKKTNHRESKN   92 (154)
Q Consensus        62 ~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~   92 (154)
                      .-|.|..|-+.|.....|.+|.+.+.-..++
T Consensus       157 ~vyICefClkY~~s~~~~~rH~~kC~~~~pP  187 (395)
T COG5027         157 IVYICEFCLKYYGSQTSLVRHRKKCSLQHPP  187 (395)
T ss_pred             eEEEhhhhHHHhcchhHHHHHHhcCcCcCCC
Confidence            3499999999999999999999876544333


No 211
>PTZ00448 hypothetical protein; Provisional
Probab=24.26  E-value=59  Score=24.78  Aligned_cols=21  Identities=24%  Similarity=0.604  Sum_probs=9.9

Q ss_pred             eecccccccccccCHHHHHHHHH
Q psy6598           4 FFYFPWDCKISFASATNYKNHMR   26 (154)
Q Consensus         4 ~~C~~~~C~~~f~~~~~l~~H~~   26 (154)
                      |.|.  .|+..|.+....+.|++
T Consensus       315 ~tC~--~C~v~F~~~~~qR~H~K  335 (373)
T PTZ00448        315 LLCR--KCNIQLMDHNAFKQHYR  335 (373)
T ss_pred             cccc--ccccccCCHHHHHHHhh
Confidence            4455  55555544444444443


No 212
>PRK03681 hypA hydrogenase nickel incorporation protein; Validated
Probab=24.25  E-value=35  Score=21.21  Aligned_cols=25  Identities=28%  Similarity=0.594  Sum_probs=14.0

Q ss_pred             eeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcc
Q psy6598          34 YVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPR   71 (154)
Q Consensus        34 ~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~   71 (154)
                      +.|..  |+..|.....           ..+.|+.||.
T Consensus        71 ~~C~~--Cg~~~~~~~~-----------~~~~CP~Cgs   95 (114)
T PRK03681         71 CWCET--CQQYVTLLTQ-----------RVRRCPQCHG   95 (114)
T ss_pred             EEccc--CCCeeecCCc-----------cCCcCcCcCC
Confidence            66755  8765544221           1156888874


No 213
>PF07800 DUF1644:  Protein of unknown function (DUF1644);  InterPro: IPR012866 This family consists of sequences found in a number of hypothetical plant proteins of unknown function. The region of interest contains nine highly conserved cysteine residues and is approximately 160 amino acids in length, which probably represent a zinc-binding domain. 
Probab=24.24  E-value=2.3e+02  Score=18.97  Aligned_cols=81  Identities=14%  Similarity=0.369  Sum_probs=45.5

Q ss_pred             cccccccccCHHHHHHHHHHhcCC--------------------------CceeecccccCcccCChhhHHHHhhhhcCC
Q psy6598           8 PWDCKISFASATNYKNHMRIHSGE--------------------------KPYVCQVRDCQKRFTEYSSLYKHTLVHSDI   61 (154)
Q Consensus         8 ~~~C~~~f~~~~~l~~H~~~h~~~--------------------------~~~~C~~~~C~~~f~~~~~l~~H~~~h~~~   61 (154)
                      +..|+..|....-|.+-.+.+...                          ....|..  |.....-..-.. --+.+...
T Consensus        29 pymc~Ts~rhSNCLdqfkka~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~CPL--CRG~V~GWtvve-~AR~~LN~  105 (162)
T PF07800_consen   29 PYMCDTSYRHSNCLDQFKKAYGKSSSSSSQSSSSAPSDSSSSESSESQEQPELACPL--CRGEVKGWTVVE-PARRFLNA  105 (162)
T ss_pred             ccccCCccchhHHHHHHHHHhcCCCCccccccccCcCCCcccccccccccccccCcc--ccCceeceEEch-HHHHHhcc
Confidence            345666777666666555544321                          1356766  765433333222 23455556


Q ss_pred             CceeCCC--CcccccCchhHHHHHHhhcCcCCC
Q psy6598          62 RPFICDR--CPRSYRQLCTLNVHKKTNHRESKN   92 (154)
Q Consensus        62 ~~~~C~~--C~~~f~~~~~l~~H~~~~~~~~~~   92 (154)
                      |+-.|..  |... .+...|.+|.+.-|....+
T Consensus       106 K~RsC~~e~C~F~-GtY~eLrKHar~~HP~~rP  137 (162)
T PF07800_consen  106 KKRSCSQESCSFS-GTYSELRKHARSEHPSARP  137 (162)
T ss_pred             CCccCcccccccc-cCHHHHHHHHHhhCCCCCC
Confidence            6666654  4422 3667899999987765443


No 214
>cd00065 FYVE FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;
Probab=24.03  E-value=54  Score=17.02  Aligned_cols=7  Identities=29%  Similarity=0.781  Sum_probs=3.7

Q ss_pred             ccccccc
Q psy6598          10 DCKISFA   16 (154)
Q Consensus        10 ~C~~~f~   16 (154)
                      .|++.|.
T Consensus         7 ~C~~~F~   13 (57)
T cd00065           7 GCGKPFT   13 (57)
T ss_pred             ccCcccc
Confidence            5555554


No 215
>KOG2636|consensus
Probab=23.56  E-value=68  Score=25.17  Aligned_cols=29  Identities=17%  Similarity=0.488  Sum_probs=24.4

Q ss_pred             HHhcCCCceeecccccC-cccCChhhHHHHhh
Q psy6598          26 RIHSGEKPYVCQVRDCQ-KRFTEYSSLYKHTL   56 (154)
Q Consensus        26 ~~h~~~~~~~C~~~~C~-~~f~~~~~l~~H~~   56 (154)
                      +.|.-...|.|.+  || +++.-+..+.+|..
T Consensus       394 KLHGL~~ey~CEI--CGNy~Y~GrkaF~RHF~  423 (497)
T KOG2636|consen  394 KLHGLDIEYNCEI--CGNYVYKGRKAFDRHFN  423 (497)
T ss_pred             hhcCCCcccceee--ccCccccCcHHHHHHhH
Confidence            4566677899999  99 89999999999976


No 216
>PF04641 Rtf2:  Rtf2 RING-finger
Probab=23.22  E-value=92  Score=22.38  Aligned_cols=36  Identities=19%  Similarity=0.404  Sum_probs=19.7

Q ss_pred             ceeecccccCcccCChhhHHHHhhhhcCCCceeCCCCcccccCc
Q psy6598          33 PYVCQVRDCQKRFTEYSSLYKHTLVHSDIRPFICDRCPRSYRQL   76 (154)
Q Consensus        33 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~   76 (154)
                      .|.+. ..||-.|..... .. +.     +--.|+.|+..|...
T Consensus       128 ~fv~l-~~cG~V~s~~al-ke-~k-----~~~~Cp~c~~~f~~~  163 (260)
T PF04641_consen  128 KFVYL-RPCGCVFSEKAL-KE-LK-----KSKKCPVCGKPFTEE  163 (260)
T ss_pred             eEEEE-cCCCCEeeHHHH-Hh-hc-----ccccccccCCccccC
Confidence            34444 257766655432 11 11     224599999988755


No 217
>PF03811 Zn_Tnp_IS1:  InsA N-terminal domain;  InterPro: IPR003220 Insertion elements are mobile elements in DNA, usually encoding proteins required for transposition, for example transposases. Protein InsA is absolutely required for transposition of insertion element 1. This entry represents a short zinc binding domain found in IS1 InsA family protein. It is found at the N terminus of the protein and may be a DNA-binding domain.; GO: 0006313 transposition, DNA-mediated
Probab=23.01  E-value=31  Score=16.63  Aligned_cols=19  Identities=21%  Similarity=0.447  Sum_probs=9.9

Q ss_pred             HHHHhhhhcCCCceeCCCC
Q psy6598          51 LYKHTLVHSDIRPFICDRC   69 (154)
Q Consensus        51 l~~H~~~h~~~~~~~C~~C   69 (154)
                      +.+|-+...|...|.|..|
T Consensus        17 v~k~G~~~~G~qryrC~~C   35 (36)
T PF03811_consen   17 VKKNGKSPSGHQRYRCKDC   35 (36)
T ss_pred             ceeCCCCCCCCEeEecCcC
Confidence            3444444455555666655


No 218
>TIGR00627 tfb4 transcription factor tfb4. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=22.97  E-value=64  Score=23.63  Aligned_cols=11  Identities=27%  Similarity=0.785  Sum_probs=6.2

Q ss_pred             eeCCCCccccc
Q psy6598          64 FICDRCPRSYR   74 (154)
Q Consensus        64 ~~C~~C~~~f~   74 (154)
                      |.|+.|.-.|-
T Consensus       256 yvCs~Clsi~C  266 (279)
T TIGR00627       256 FVCSVCLSVLC  266 (279)
T ss_pred             EECCCccCCcC
Confidence            66666655443


No 219
>PF14311 DUF4379:  Domain of unknown function (DUF4379)
Probab=22.94  E-value=70  Score=16.73  Aligned_cols=13  Identities=15%  Similarity=0.649  Sum_probs=5.9

Q ss_pred             eeecccccCcccCCh
Q psy6598          34 YVCQVRDCQKRFTEY   48 (154)
Q Consensus        34 ~~C~~~~C~~~f~~~   48 (154)
                      ++|..  ||..|...
T Consensus        29 W~C~~--Cgh~w~~~   41 (55)
T PF14311_consen   29 WKCPK--CGHEWKAS   41 (55)
T ss_pred             EECCC--CCCeeEcc
Confidence            44544  54444443


No 220
>KOG2312|consensus
Probab=22.91  E-value=26  Score=28.94  Aligned_cols=24  Identities=21%  Similarity=0.314  Sum_probs=20.1

Q ss_pred             eecccccccccccCHHHHHHHHHH
Q psy6598           4 FFYFPWDCKISFASATNYKNHMRI   27 (154)
Q Consensus         4 ~~C~~~~C~~~f~~~~~l~~H~~~   27 (154)
                      |.|.|..|.+.|...+.+-.|...
T Consensus       644 ~mc~w~~c~~~~~tPs~vl~h~~~  667 (847)
T KOG2312|consen  644 FMCLWQSCKKWFQTPSQVLYHAAT  667 (847)
T ss_pred             eeecccccchhhcCcchhheecch
Confidence            778888888888888888888764


No 221
>PF13824 zf-Mss51:  Zinc-finger of mitochondrial splicing suppressor 51
Probab=22.76  E-value=64  Score=17.31  Aligned_cols=10  Identities=30%  Similarity=0.647  Sum_probs=5.6

Q ss_pred             CceeCCCCcc
Q psy6598          62 RPFICDRCPR   71 (154)
Q Consensus        62 ~~~~C~~C~~   71 (154)
                      ..|.|+.||-
T Consensus        13 v~~~Cp~cGi   22 (55)
T PF13824_consen   13 VNFECPDCGI   22 (55)
T ss_pred             cCCcCCCCCC
Confidence            3466666664


No 222
>PTZ00064 histone acetyltransferase; Provisional
Probab=22.62  E-value=75  Score=25.43  Aligned_cols=25  Identities=20%  Similarity=0.333  Sum_probs=21.1

Q ss_pred             CCceeecccccCcccCChhhHHHHhhh
Q psy6598          31 EKPYVCQVRDCQKRFTEYSSLYKHTLV   57 (154)
Q Consensus        31 ~~~~~C~~~~C~~~f~~~~~l~~H~~~   57 (154)
                      +.-|.|.+  |-+.|.....|.+|+..
T Consensus       278 d~LYICEf--CLkY~~s~~~l~rH~~~  302 (552)
T PTZ00064        278 DTLHFCEY--CLDFFCFEDELIRHLSR  302 (552)
T ss_pred             CeEEEccc--hhhhhCCHHHHHHHHhc
Confidence            34599988  99999999999999863


No 223
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=22.55  E-value=40  Score=17.94  Aligned_cols=13  Identities=23%  Similarity=0.503  Sum_probs=9.8

Q ss_pred             eeCCCCcccccCc
Q psy6598          64 FICDRCPRSYRQL   76 (154)
Q Consensus        64 ~~C~~C~~~f~~~   76 (154)
                      |.|+.||..+.-.
T Consensus         3 ~~CP~CG~~iev~   15 (54)
T TIGR01206         3 FECPDCGAEIELE   15 (54)
T ss_pred             cCCCCCCCEEecC
Confidence            7899999876543


No 224
>smart00064 FYVE Protein present in Fab1, YOTB, Vac1, and EEA1. The FYVE zinc finger is named after four proteins where it was first found: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn2+ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. The FYVE finger is structurally related to the PF06397 Desulfoferrod_N:  Desulfoferrodoxin, N-terminal domain;  InterPro: IPR004462 This domain is found as essentially the full length of desulforedoxin, a 37-residue homodimeric non-haem iron protein. It is also found as the N-terminal domain of desulfoferrodoxin (rbo), a homodimeric non-haem iron protein with 2 Fe atoms per monomer in different oxidation states. This domain binds the ferric rather than the ferrous Fe of desulfoferrodoxin. Neelaredoxin, a monomeric blue non-haem iron protein, lacks this domain.; GO: 0005506 iron ion binding; PDB: 1DFX_A 1VZI_B 2JI2_D 1VZH_B 2JI3_C 2JI1_C 1VZG_A 1CFW_A 2LK5_B 1DHG_B ....
Probab=22.28  E-value=42  Score=16.27  Aligned_cols=26  Identities=27%  Similarity=0.595  Sum_probs=12.1

Q ss_pred             CeecccccccccccCHHHHHHHHHHhcCCCceeecccccCc
Q psy6598           3 PFFYFPWDCKISFASATNYKNHMRIHSGEKPYVCQVRDCQK   43 (154)
Q Consensus         3 ~~~C~~~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~~~C~~   43 (154)
                      -|+|.  .||....         -.+.|.-+..|    ||.
T Consensus         6 ~YkC~--~CGniVe---------v~~~g~g~lvC----CGe   31 (36)
T PF06397_consen    6 FYKCE--HCGNIVE---------VVHDGGGPLVC----CGE   31 (36)
T ss_dssp             EEE-T--TT--EEE---------EEE--SS-EEE----TTE
T ss_pred             EEEcc--CCCCEEE---------EEECCCCCEEe----CCc
Confidence            48888  8987642         23445555666    664


No 226
>PLN00104 MYST -like histone acetyltransferase; Provisional
Probab=22.03  E-value=71  Score=25.14  Aligned_cols=24  Identities=29%  Similarity=0.542  Sum_probs=20.8

Q ss_pred             CceeecccccCcccCChhhHHHHhhh
Q psy6598          32 KPYVCQVRDCQKRFTEYSSLYKHTLV   57 (154)
Q Consensus        32 ~~~~C~~~~C~~~f~~~~~l~~H~~~   57 (154)
                      .-|.|.+  |-+.|.....|.+|+..
T Consensus       197 ~lyiCe~--Cl~y~~~~~~~~~H~~~  220 (450)
T PLN00104        197 KLYFCEF--CLKFMKRKEQLQRHMKK  220 (450)
T ss_pred             eEEEchh--hhhhhcCHHHHHHHHhc
Confidence            4599988  99999999999999863


No 227
>cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer.
Probab=21.88  E-value=41  Score=17.58  Aligned_cols=7  Identities=29%  Similarity=1.208  Sum_probs=3.9

Q ss_pred             eeCCCCc
Q psy6598          64 FICDRCP   70 (154)
Q Consensus        64 ~~C~~C~   70 (154)
                      |.|+.|+
T Consensus        35 w~CP~C~   41 (50)
T cd00730          35 WVCPVCG   41 (50)
T ss_pred             CCCCCCC
Confidence            5555554


No 228
>PF12907 zf-met2:  Zinc-binding
Probab=21.75  E-value=51  Score=16.38  Aligned_cols=25  Identities=20%  Similarity=0.390  Sum_probs=14.3

Q ss_pred             eeccccccccccc---CHHHHHHHHH-HhcC
Q psy6598           4 FFYFPWDCKISFA---SATNYKNHMR-IHSG   30 (154)
Q Consensus         4 ~~C~~~~C~~~f~---~~~~l~~H~~-~h~~   30 (154)
                      +.|.  +|-.+|.   ....|..|.. .|..
T Consensus         2 i~C~--iC~qtF~~t~~~~~L~eH~enKHpK   30 (40)
T PF12907_consen    2 IICK--ICRQTFMQTTNEPQLKEHAENKHPK   30 (40)
T ss_pred             cCcH--HhhHHHHhcCCHHHHHHHHHccCCC
Confidence            4567  7775553   3455777755 4544


No 229
>KOG2907|consensus
Probab=21.65  E-value=57  Score=20.31  Aligned_cols=38  Identities=16%  Similarity=0.413  Sum_probs=21.0

Q ss_pred             eeecccccCcccCChhhHHHHhhhhc--CCCceeCCCCcccccC
Q psy6598          34 YVCQVRDCQKRFTEYSSLYKHTLVHS--DIRPFICDRCPRSYRQ   75 (154)
Q Consensus        34 ~~C~~~~C~~~f~~~~~l~~H~~~h~--~~~~~~C~~C~~~f~~   75 (154)
                      .+|..  ||..=.....|+.  |+.-  .-.-|.|+.|++.|..
T Consensus        75 ~kCpk--Cghe~m~Y~T~Ql--RSADEGQTVFYTC~kC~~k~~e  114 (116)
T KOG2907|consen   75 HKCPK--CGHEEMSYHTLQL--RSADEGQTVFYTCPKCKYKFTE  114 (116)
T ss_pred             ccCcc--cCCchhhhhhhhc--ccccCCceEEEEcCccceeeec
Confidence            67855  8865333333322  2221  1234899999888764


No 230
>PF14634 zf-RING_5:  zinc-RING finger domain
Probab=21.43  E-value=1.1e+02  Score=14.96  Aligned_cols=11  Identities=18%  Similarity=0.555  Sum_probs=6.9

Q ss_pred             CCceeCCCCcc
Q psy6598          61 IRPFICDRCPR   71 (154)
Q Consensus        61 ~~~~~C~~C~~   71 (154)
                      .....|++|++
T Consensus        34 ~~~~~CP~C~k   44 (44)
T PF14634_consen   34 GKSVKCPICRK   44 (44)
T ss_pred             CCCCCCcCCCC
Confidence            44567777753


No 231
>KOG0696|consensus
Probab=21.01  E-value=34  Score=26.87  Aligned_cols=47  Identities=26%  Similarity=0.460  Sum_probs=27.0

Q ss_pred             HHHHHHHhcCCCceeecccccCcccCChhhHHHHh-hhhcCCCceeCCCCc
Q psy6598          21 YKNHMRIHSGEKPYVCQVRDCQKRFTEYSSLYKHT-LVHSDIRPFICDRCP   70 (154)
Q Consensus        21 l~~H~~~h~~~~~~~C~~~~C~~~f~~~~~l~~H~-~~h~~~~~~~C~~C~   70 (154)
                      +..|.+.|. =..|.|  |+-++.+.+-..-..|. +.|+-.-|--|+.||
T Consensus        81 fvvHkrChe-fVtF~C--PGadkg~dtDdpr~kHkf~~~tYssPTFCDhCG  128 (683)
T KOG0696|consen   81 FVVHKRCHE-FVTFSC--PGADKGPDTDDPRSKHKFKIHTYSSPTFCDHCG  128 (683)
T ss_pred             ehhhhhhcc-eEEEEC--CCCCCCCCCCCcccccceeeeecCCCchhhhHH
Confidence            567777774 235777  34666666665555553 245555555555555


No 232
>KOG0978|consensus
Probab=20.24  E-value=36  Score=28.28  Aligned_cols=21  Identities=19%  Similarity=0.461  Sum_probs=15.8

Q ss_pred             CceeCCCCcccccCchhHHHH
Q psy6598          62 RPFICDRCPRSYRQLCTLNVH   82 (154)
Q Consensus        62 ~~~~C~~C~~~f~~~~~l~~H   82 (154)
                      +.-+||.|+..|...+-+..|
T Consensus       677 RqRKCP~Cn~aFganDv~~I~  697 (698)
T KOG0978|consen  677 RQRKCPKCNAAFGANDVHRIH  697 (698)
T ss_pred             hcCCCCCCCCCCCcccccccC
Confidence            345899999999887765544


Done!