Query         psy4572
Match_columns 345
No_of_seqs    430 out of 1637
Neff          8.0 
Searched_HMMs 46136
Date          Fri Aug 16 20:27:13 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy4572.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4572hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG2462|consensus               99.9 1.3E-26 2.9E-31  207.0   8.3  132   75-211   131-265 (279)
  2 KOG2462|consensus               99.9 1.1E-26 2.4E-31  207.6   6.1  137   43-181   127-266 (279)
  3 KOG3608|consensus               99.8 2.3E-21   5E-26  177.7   7.2  182   32-216   193-379 (467)
  4 KOG1074|consensus               99.8 2.9E-21 6.3E-26  193.3   3.9  171   42-215   601-933 (958)
  5 KOG3608|consensus               99.8   6E-20 1.3E-24  168.4  10.4  179   47-237   178-363 (467)
  6 KOG1074|consensus               99.8 4.6E-20   1E-24  184.8   3.7   53   45-97    352-404 (958)
  7 KOG3576|consensus               99.7 2.9E-17 6.3E-22  140.6   4.2  120  100-219   115-242 (267)
  8 KOG3576|consensus               99.6 2.5E-16 5.3E-21  134.9   6.7  123   44-183   115-237 (267)
  9 PLN03086 PRLI-interacting fact  99.5 3.2E-13 6.9E-18  134.7  10.9  149   45-215   406-566 (567)
 10 KOG3623|consensus               99.4   2E-14 4.3E-19  142.5   1.8  119   76-212   212-332 (1007)
 11 KOG3623|consensus               99.4 6.1E-14 1.3E-18  139.2   3.6  112   40-151   204-330 (1007)
 12 PLN03086 PRLI-interacting fact  99.2 4.5E-11 9.7E-16  119.4   6.8  119   47-182   434-564 (567)
 13 PHA00733 hypothetical protein   99.1 6.2E-11 1.3E-15   97.6   4.4   85  127-216    37-126 (128)
 14 KOG3993|consensus               98.9 6.5E-11 1.4E-15  111.7  -2.9  173   45-220   266-489 (500)
 15 PHA00733 hypothetical protein   98.9   2E-09 4.3E-14   88.6   5.5   53  128-182    71-123 (128)
 16 PHA02768 hypothetical protein;  98.6 9.9E-09 2.1E-13   70.7   1.6   44  158-206     5-48  (55)
 17 PHA02768 hypothetical protein;  98.6 2.6E-08 5.6E-13   68.6   1.7   43   46-90      5-47  (55)
 18 KOG3993|consensus               98.5 7.9E-09 1.7E-13   97.8  -2.9  152   33-184   282-484 (500)
 19 PF13465 zf-H2C2_2:  Zinc-finge  98.2   9E-07 1.9E-11   52.0   2.6   25   61-85      1-25  (26)
 20 PF13465 zf-H2C2_2:  Zinc-finge  98.2 1.3E-06 2.8E-11   51.3   2.5   25  173-200     1-25  (26)
 21 PF05605 zf-Di19:  Drought indu  98.2 2.8E-06 6.2E-11   59.1   4.2   53  158-214     2-54  (54)
 22 PHA00616 hypothetical protein   98.1 1.8E-06 3.8E-11   56.8   1.8   34   46-79      1-34  (44)
 23 PHA00732 hypothetical protein   98.0 2.6E-06 5.5E-11   64.0   2.0   44  158-209     1-44  (79)
 24 PHA00616 hypothetical protein   98.0 3.2E-06 6.8E-11   55.6   1.8   34  158-194     1-34  (44)
 25 PHA00732 hypothetical protein   97.9   1E-05 2.2E-10   60.8   2.8   48  130-182     1-48  (79)
 26 PF05605 zf-Di19:  Drought indu  97.7 6.4E-05 1.4E-09   52.2   4.3   50   46-98      2-53  (54)
 27 PF00096 zf-C2H2:  Zinc finger,  97.6 4.6E-05   1E-09   43.1   2.6   22  190-211     1-22  (23)
 28 PF12756 zf-C2H2_2:  C2H2 type   97.6 4.4E-05 9.5E-10   59.3   3.2   79  132-218     1-79  (100)
 29 PF13894 zf-C2H2_4:  C2H2-type   97.6 6.8E-05 1.5E-09   42.4   2.7   24  190-213     1-24  (24)
 30 KOG2231|consensus               97.4 0.00028 6.1E-09   72.1   7.0  121   76-210   117-260 (669)
 31 PF13912 zf-C2H2_6:  C2H2-type   97.4 7.9E-05 1.7E-09   43.9   1.6   26  189-214     1-26  (27)
 32 COG5189 SFP1 Putative transcri  97.4 4.5E-05 9.7E-10   70.3   0.3   58  156-213   347-422 (423)
 33 PF00096 zf-C2H2:  Zinc finger,  97.3 0.00017 3.6E-09   40.7   2.3   22   47-68      1-22  (23)
 34 PF12756 zf-C2H2_2:  C2H2 type   97.2 0.00023 4.9E-09   55.2   2.3   72   48-124     1-72  (100)
 35 PF13894 zf-C2H2_4:  C2H2-type   97.1 0.00047   1E-08   38.8   2.4   23   47-69      1-23  (24)
 36 COG5189 SFP1 Putative transcri  96.8  0.0006 1.3E-08   63.0   2.0   68   44-123   347-419 (423)
 37 COG5236 Uncharacterized conser  96.8  0.0016 3.6E-08   60.8   4.9  131   75-218   152-310 (493)
 38 PF13912 zf-C2H2_6:  C2H2-type   96.8 0.00075 1.6E-08   39.6   1.7   23   47-69      2-24  (27)
 39 KOG2231|consensus               96.8  0.0027 5.9E-08   65.1   6.6  133   75-229   100-252 (669)
 40 PF09237 GAGA:  GAGA factor;  I  96.5  0.0021 4.7E-08   43.3   2.7   40  175-216    12-51  (54)
 41 PF13909 zf-H2C2_5:  C2H2-type   96.4   0.003 6.5E-08   35.9   2.4   24  190-214     1-24  (24)
 42 smart00355 ZnF_C2H2 zinc finge  96.2  0.0038 8.2E-08   35.5   2.0   23  190-212     1-23  (26)
 43 PF09237 GAGA:  GAGA factor;  I  96.0   0.011 2.4E-07   39.9   3.8   31   44-74     22-52  (54)
 44 COG5048 FOG: Zn-finger [Genera  96.0  0.0021 4.4E-08   62.5   0.3  138   45-182   288-442 (467)
 45 smart00355 ZnF_C2H2 zinc finge  95.9  0.0058 1.3E-07   34.7   2.1   24  159-182     1-24  (26)
 46 COG5236 Uncharacterized conser  95.8   0.011 2.5E-07   55.4   4.4  129   46-184   151-307 (493)
 47 PF12874 zf-met:  Zinc-finger o  95.8  0.0066 1.4E-07   34.8   1.7   23  190-212     1-23  (25)
 48 KOG1146|consensus               95.4  0.0074 1.6E-07   65.5   1.8   75   49-123   439-539 (1406)
 49 PRK04860 hypothetical protein;  95.2   0.013 2.7E-07   50.1   2.5   37   46-86    119-155 (160)
 50 KOG2785|consensus               94.9   0.063 1.4E-06   51.3   6.2   56  157-212   165-243 (390)
 51 KOG1146|consensus               94.9  0.0053 1.1E-07   66.6  -1.0  138   77-217   439-617 (1406)
 52 PF13909 zf-H2C2_5:  C2H2-type   94.8   0.021 4.5E-07   32.4   1.8   22   47-69      1-22  (24)
 53 PF12874 zf-met:  Zinc-finger o  94.7   0.019 4.2E-07   32.7   1.5   20   48-67      2-21  (25)
 54 PRK04860 hypothetical protein;  94.7   0.015 3.3E-07   49.6   1.5   37  158-201   119-155 (160)
 55 COG5048 FOG: Zn-finger [Genera  94.7   0.003 6.6E-08   61.3  -3.3  144   73-217   288-446 (467)
 56 KOG2785|consensus               94.6   0.083 1.8E-06   50.5   6.3   27  189-215   166-192 (390)
 57 KOG2482|consensus               94.0    0.17 3.6E-06   47.7   6.7   54  159-212   280-357 (423)
 58 PF12171 zf-C2H2_jaz:  Zinc-fin  93.8   0.014 3.1E-07   34.2  -0.4   22  190-211     2-23  (27)
 59 PF13913 zf-C2HC_2:  zinc-finge  92.7   0.094   2E-06   30.2   1.9   21  190-211     3-23  (25)
 60 PF12171 zf-C2H2_jaz:  Zinc-fin  92.6   0.035 7.7E-07   32.4   0.1   22  159-180     2-23  (27)
 61 KOG2482|consensus               92.4    0.19 4.2E-06   47.3   4.7   26  190-215   280-305 (423)
 62 TIGR00622 ssl1 transcription f  91.9    0.19 4.1E-06   40.1   3.5   25  157-181    80-104 (112)
 63 KOG4173|consensus               91.3    0.11 2.4E-06   45.6   1.6   77   75-154    80-170 (253)
 64 KOG4173|consensus               91.1    0.11 2.4E-06   45.5   1.4   77  101-180    78-168 (253)
 65 KOG2893|consensus               91.1    0.06 1.3E-06   48.2  -0.2   47  161-214    13-59  (341)
 66 smart00451 ZnF_U1 U1-like zinc  90.9    0.16 3.5E-06   31.3   1.8   24  189-212     3-26  (35)
 67 TIGR00622 ssl1 transcription f  90.6    0.26 5.6E-06   39.4   3.0   85   73-165    14-110 (112)
 68 PF12013 DUF3505:  Protein of u  90.2    0.33 7.2E-06   38.5   3.4   85  129-214    10-109 (109)
 69 PF13913 zf-C2HC_2:  zinc-finge  90.0    0.25 5.4E-06   28.4   1.8   19  160-179     4-22  (25)
 70 KOG2893|consensus               89.1     0.1 2.2E-06   46.7  -0.4   44   47-94     11-54  (341)
 71 smart00451 ZnF_U1 U1-like zinc  88.8    0.34 7.4E-06   29.8   2.0   22   46-67      3-24  (35)
 72 COG4049 Uncharacterized protei  88.5    0.27 5.9E-06   33.8   1.5   31  185-215    13-43  (65)
 73 cd00350 rubredoxin_like Rubred  86.0    0.37 7.9E-06   29.7   0.9   11   47-57      2-12  (33)
 74 PF09986 DUF2225:  Uncharacteri  84.2    0.18 3.8E-06   45.3  -1.6   46  157-202     4-61  (214)
 75 PF12013 DUF3505:  Protein of u  83.8     1.7 3.7E-05   34.4   4.1   24  159-182    81-108 (109)
 76 PF09538 FYDLN_acid:  Protein o  83.7    0.54 1.2E-05   37.4   1.1   30  159-202    10-39  (108)
 77 KOG2186|consensus               81.3    0.87 1.9E-05   41.3   1.7   47   47-96      4-50  (276)
 78 PF09538 FYDLN_acid:  Protein o  81.2    0.88 1.9E-05   36.2   1.5   30   47-87     10-39  (108)
 79 COG4049 Uncharacterized protei  81.1    0.83 1.8E-05   31.5   1.1   32   40-71     11-42  (65)
 80 PRK04023 DNA polymerase II lar  79.7     2.3 4.9E-05   46.0   4.3   24   44-83    624-647 (1121)
 81 PF02892 zf-BED:  BED zinc fing  78.4     1.6 3.5E-05   28.5   1.9   27  187-213    14-44  (45)
 82 PF13719 zinc_ribbon_5:  zinc-r  78.4     1.6 3.4E-05   27.6   1.7   34   47-85      3-36  (37)
 83 TIGR02098 MJ0042_CXXC MJ0042 f  77.6     1.7 3.7E-05   27.4   1.7   33   47-84      3-35  (38)
 84 PRK04023 DNA polymerase II lar  77.0     2.1 4.5E-05   46.2   3.1   22   74-111   626-647 (1121)
 85 PRK00464 nrdR transcriptional   76.1     0.6 1.3E-05   39.6  -0.9   20  190-209    29-48  (154)
 86 PF13717 zinc_ribbon_4:  zinc-r  75.5     2.4 5.3E-05   26.6   2.0   33   47-84      3-35  (36)
 87 smart00531 TFIIE Transcription  75.5     1.7 3.6E-05   36.6   1.6   38  155-199    96-133 (147)
 88 KOG2807|consensus               74.6     3.7   8E-05   38.7   3.7   26  156-181   343-368 (378)
 89 PF09986 DUF2225:  Uncharacteri  74.6     0.5 1.1E-05   42.4  -1.9   21   45-65      4-24  (214)
 90 TIGR02300 FYDLN_acid conserved  73.1     1.7 3.8E-05   35.3   1.1   30  159-202    10-39  (129)
 91 COG1198 PriA Primosomal protei  71.6       2 4.3E-05   45.4   1.4   12  187-198   473-484 (730)
 92 smart00531 TFIIE Transcription  71.0       7 0.00015   32.8   4.4   37   72-112    97-133 (147)
 93 PF10571 UPF0547:  Uncharacteri  70.8     2.1 4.6E-05   24.9   0.8    9  160-168    16-24  (26)
 94 smart00659 RPOLCX RNA polymera  70.8     2.7 5.9E-05   27.7   1.4   11   47-57      3-13  (44)
 95 TIGR00373 conserved hypothetic  70.1     5.3 0.00012   34.0   3.5   15   75-89    110-124 (158)
 96 PF04959 ARS2:  Arsenite-resist  69.4     4.8  0.0001   36.1   3.2   30  185-214    73-102 (214)
 97 TIGR00373 conserved hypothetic  68.4     6.5 0.00014   33.5   3.7   42   90-140    97-138 (158)
 98 COG5151 SSL1 RNA polymerase II  68.4     2.6 5.7E-05   39.4   1.3   48  132-181   364-411 (421)
 99 cd00729 rubredoxin_SM Rubredox  68.1     2.5 5.4E-05   26.2   0.8   11   47-57      3-13  (34)
100 TIGR02300 FYDLN_acid conserved  67.7     3.2   7E-05   33.8   1.6   30   47-87     10-39  (129)
101 PF05443 ROS_MUCR:  ROS/MUCR tr  67.7     3.5 7.6E-05   34.0   1.8   26  189-217    72-97  (132)
102 COG2888 Predicted Zn-ribbon RN  67.3     3.5 7.6E-05   28.9   1.4    7  190-196    51-57  (61)
103 KOG4167|consensus               67.2     3.6 7.9E-05   42.8   2.1   29   43-71    789-817 (907)
104 TIGR02605 CxxC_CxxC_SSSS putat  65.7     1.4   3E-05   29.8  -0.8   12   47-58      6-17  (52)
105 KOG2807|consensus               65.6     7.7 0.00017   36.7   3.8   24  189-212   345-368 (378)
106 COG1198 PriA Primosomal protei  65.4     3.6 7.8E-05   43.6   1.8   10   47-56    436-445 (730)
107 PRK06266 transcription initiat  64.8     6.5 0.00014   34.2   3.0   13  103-115   118-130 (178)
108 smart00834 CxxC_CXXC_SSSS Puta  64.7     1.5 3.2E-05   28.0  -0.8   12   47-58      6-17  (41)
109 PRK00464 nrdR transcriptional   64.2     1.1 2.4E-05   38.0  -1.8   15   75-89     29-43  (154)
110 PRK06266 transcription initiat  63.4       8 0.00017   33.6   3.3   34   71-113   114-147 (178)
111 KOG2186|consensus               63.0     4.7  0.0001   36.7   1.8   47   74-123     3-49  (276)
112 PRK14890 putative Zn-ribbon RN  62.8     4.2 9.1E-05   28.5   1.2    7  190-196    49-55  (59)
113 COG1997 RPL43A Ribosomal prote  62.3     3.9 8.4E-05   31.0   1.0   12  189-200    53-64  (89)
114 COG1592 Rubrerythrin [Energy p  61.8     4.8  0.0001   34.6   1.6   10  128-137   147-156 (166)
115 PRK00398 rpoP DNA-directed RNA  61.6     2.4 5.2E-05   28.0  -0.2   12   47-58      4-15  (46)
116 smart00734 ZnF_Rad18 Rad18-lik  61.5     6.5 0.00014   22.7   1.7   21  190-211     2-22  (26)
117 PF05443 ROS_MUCR:  ROS/MUCR tr  60.7     6.5 0.00014   32.5   2.1   27   44-73     70-96  (132)
118 COG1592 Rubrerythrin [Energy p  59.6     5.4 0.00012   34.2   1.5   11  155-165   146-156 (166)
119 PF14353 CpXC:  CpXC protein     59.2     6.9 0.00015   31.8   2.1   20  189-208    38-57  (128)
120 smart00614 ZnF_BED BED zinc fi  59.2     7.1 0.00015   26.2   1.8   26  189-214    18-48  (50)
121 COG4957 Predicted transcriptio  57.7     4.8  0.0001   33.1   0.9   25   45-72     75-99  (148)
122 PRK14714 DNA polymerase II lar  57.7      14  0.0003   41.2   4.6   10   46-55    667-676 (1337)
123 KOG2593|consensus               56.5      10 0.00022   37.2   3.1   37  155-197   125-161 (436)
124 PF09723 Zn-ribbon_8:  Zinc rib  56.2     2.7 5.9E-05   27.3  -0.7   12   47-58      6-17  (42)
125 PF04959 ARS2:  Arsenite-resist  54.9     4.3 9.2E-05   36.4   0.1   32   41-72     72-103 (214)
126 COG1996 RPC10 DNA-directed RNA  52.3     8.1 0.00018   26.1   1.1   10   47-56      7-16  (49)
127 COG0068 HypF Hydrogenase matur  51.5       2 4.2E-05   44.8  -2.8   28  133-166   154-181 (750)
128 PF15135 UPF0515:  Uncharacteri  51.0      10 0.00022   34.5   1.9   74   58-144    91-169 (278)
129 PF02176 zf-TRAF:  TRAF-type zi  50.5     6.7 0.00015   27.1   0.6   39   46-85      9-53  (60)
130 PRK14873 primosome assembly pr  49.5      11 0.00023   39.8   2.1    8   48-55    385-392 (665)
131 PF12760 Zn_Tnp_IS1595:  Transp  47.7      15 0.00032   24.2   1.9   10  188-197    36-45  (46)
132 PF03604 DNA_RNApol_7kD:  DNA d  47.4      12 0.00026   22.9   1.2    9   48-56      2-10  (32)
133 COG4530 Uncharacterized protei  47.4     8.1 0.00018   30.6   0.6   27  160-200    11-37  (129)
134 COG5151 SSL1 RNA polymerase II  47.3      11 0.00023   35.5   1.5   50  159-213   363-412 (421)
135 PRK14714 DNA polymerase II lar  46.0      26 0.00057   39.2   4.4   63  189-256   692-754 (1337)
136 COG4957 Predicted transcriptio  45.6      14 0.00031   30.4   1.8   26  190-218    77-102 (148)
137 PRK09678 DNA-binding transcrip  45.3     4.3 9.4E-05   29.8  -1.1    8   48-55      3-10  (72)
138 TIGR00595 priA primosomal prot  44.9      13 0.00029   37.7   2.0    9  102-110   253-261 (505)
139 KOG2593|consensus               44.6      22 0.00047   35.1   3.2   39   70-111   124-162 (436)
140 PF03833 PolC_DP2:  DNA polymer  44.6     7.3 0.00016   41.5   0.0   11   45-55    654-664 (900)
141 PF06524 NOA36:  NOA36 protein;  44.5     8.4 0.00018   35.3   0.4   90   97-209   137-229 (314)
142 KOG0782|consensus               43.1     8.2 0.00018   39.2   0.1   58  143-208   238-296 (1004)
143 PF08209 Sgf11:  Sgf11 (transcr  42.1      18 0.00038   22.3   1.4   28  189-217     4-31  (33)
144 KOG3408|consensus               42.0      34 0.00073   27.7   3.3   27   42-68     53-79  (129)
145 PHA00626 hypothetical protein   41.7      13 0.00029   25.7   0.9   16  187-202    21-36  (59)
146 COG1571 Predicted DNA-binding   41.2      11 0.00024   37.1   0.7   29  160-202   352-380 (421)
147 PF06524 NOA36:  NOA36 protein;  40.7       7 0.00015   35.8  -0.7   26  156-181   207-232 (314)
148 smart00661 RPOL9 RNA polymeras  39.6      18 0.00039   24.1   1.3   12  189-200    20-31  (52)
149 PF15269 zf-C2H2_7:  Zinc-finge  39.2      36 0.00078   22.5   2.6   23   46-68     20-42  (54)
150 COG0068 HypF Hydrogenase matur  38.5     1.9   4E-05   44.9  -5.3   56  132-197   125-181 (750)
151 TIGR00244 transcriptional regu  37.5     7.3 0.00016   32.7  -1.1   16  159-174    29-44  (147)
152 KOG1280|consensus               37.4      28  0.0006   33.3   2.6   25  158-182    79-103 (381)
153 COG3357 Predicted transcriptio  37.1      17 0.00038   27.8   1.0   10  132-141    60-69  (97)
154 PF07754 DUF1610:  Domain of un  36.8      22 0.00048   20.2   1.2   10  188-197    15-24  (24)
155 PF13878 zf-C2H2_3:  zinc-finge  36.2      34 0.00073   22.0   2.1   24  190-213    14-39  (41)
156 PRK03824 hypA hydrogenase nick  36.1      14 0.00029   30.6   0.3   11   47-57     71-81  (135)
157 KOG4167|consensus               35.9      11 0.00025   39.3  -0.2   28  189-216   792-819 (907)
158 PF08274 PhnA_Zn_Ribbon:  PhnA   35.8      14 0.00031   22.2   0.3    9   74-82     19-27  (30)
159 PF07975 C1_4:  TFIIH C1-like d  35.7     5.7 0.00012   27.1  -1.6   20  158-177    21-40  (51)
160 PF05290 Baculo_IE-1:  Baculovi  35.3      23 0.00049   29.2   1.5   56  127-203    77-135 (140)
161 PF09845 DUF2072:  Zn-ribbon co  35.1      19 0.00041   29.6   1.0   15   74-88      1-15  (131)
162 PTZ00255 60S ribosomal protein  35.0      18 0.00038   27.8   0.8   12  189-200    54-65  (90)
163 KOG4124|consensus               34.8     5.6 0.00012   37.9  -2.3   56  156-211   347-420 (442)
164 PF01363 FYVE:  FYVE zinc finge  34.3      24 0.00053   25.1   1.4   10  160-169    11-20  (69)
165 PRK05580 primosome assembly pr  34.2      26 0.00057   37.1   2.2    9  102-110   421-429 (679)
166 PF04216 FdhE:  Protein involve  33.8       4 8.7E-05   38.3  -3.6   77  103-200   173-249 (290)
167 TIGR01206 lysW lysine biosynth  33.7      17 0.00037   25.1   0.5   30  159-199     3-32  (54)
168 TIGR00280 L37a ribosomal prote  33.5      17 0.00036   27.9   0.4   12  189-200    53-64  (91)
169 PF07282 OrfB_Zn_ribbon:  Putat  33.5      35 0.00075   24.3   2.1   30  160-202    30-59  (69)
170 PRK14559 putative protein seri  33.4      41 0.00089   35.4   3.4    8   48-55      3-10  (645)
171 KOG2907|consensus               33.4      24 0.00052   28.1   1.3   13  189-201   102-114 (116)
172 COG1327 Predicted transcriptio  33.1      10 0.00022   31.9  -0.9   15  159-173    29-43  (156)
173 PRK14873 primosome assembly pr  32.6      12 0.00025   39.6  -0.8    6  104-109   385-390 (665)
174 COG1655 Uncharacterized protei  32.6      11 0.00024   33.9  -0.8   19  156-174    17-35  (267)
175 PF13240 zinc_ribbon_2:  zinc-r  32.1      18 0.00039   20.2   0.3    6  161-166    16-21  (23)
176 PF13451 zf-trcl:  Probable zin  32.0      35 0.00075   23.1   1.7    7  158-164    33-39  (49)
177 PF15135 UPF0515:  Uncharacteri  31.8      38 0.00081   31.0   2.4   73   86-171    91-168 (278)
178 KOG4377|consensus               30.7      14 0.00029   36.1  -0.6   26  190-215   402-429 (480)
179 PRK14559 putative protein seri  30.6      50  0.0011   34.7   3.5   12  160-171    43-54  (645)
180 PF04780 DUF629:  Protein of un  30.6      38 0.00082   34.0   2.5   28  189-216    57-84  (466)
181 PF12907 zf-met2:  Zinc-binding  30.4      33  0.0007   22.2   1.3   27  190-216     2-31  (40)
182 PRK12380 hydrogenase nickel in  30.3      23 0.00049   28.3   0.7   26   45-82     69-94  (113)
183 KOG1842|consensus               30.3      29 0.00063   34.3   1.6   31  188-218    14-44  (505)
184 PTZ00303 phosphatidylinositol   30.0      22 0.00047   37.7   0.7   13  131-143   461-473 (1374)
185 PF14446 Prok-RING_1:  Prokaryo  29.3      37 0.00081   23.4   1.5   26   47-84      6-31  (54)
186 PF01780 Ribosomal_L37ae:  Ribo  29.2      11 0.00024   28.9  -1.2   12  189-200    53-64  (90)
187 PF13453 zf-TFIIB:  Transcripti  28.9      28  0.0006   22.2   0.8   14  104-117    21-34  (41)
188 KOG1280|consensus               28.5      61  0.0013   31.1   3.3   28   45-72     78-105 (381)
189 COG3091 SprT Zn-dependent meta  27.8      31 0.00068   29.1   1.2   33   45-82    116-148 (156)
190 PRK00432 30S ribosomal protein  27.8      33 0.00071   23.2   1.1   11  189-199    37-47  (50)
191 KOG2907|consensus               27.4      29 0.00062   27.7   0.8   40   47-86     75-114 (116)
192 PRK03976 rpl37ae 50S ribosomal  27.4      24 0.00052   27.1   0.4   12  189-200    54-65  (90)
193 COG4888 Uncharacterized Zn rib  27.2      22 0.00047   27.8   0.1   39   43-85     19-57  (104)
194 TIGR00100 hypA hydrogenase nic  27.0      26 0.00056   28.1   0.5   27   45-83     69-95  (115)
195 COG3364 Zn-ribbon containing p  26.6      39 0.00085   26.5   1.4   12   75-86      3-14  (112)
196 PRK03564 formate dehydrogenase  25.4      31 0.00068   32.7   0.8   79  101-201   186-264 (309)
197 TIGR00595 priA primosomal prot  24.7      21 0.00046   36.3  -0.5   12  128-139   238-249 (505)
198 KOG2272|consensus               24.5      42 0.00091   30.7   1.4  126   47-176   100-239 (332)
199 KOG3408|consensus               24.4      51  0.0011   26.7   1.7   27  188-214    56-82  (129)
200 KOG4377|consensus               24.4      39 0.00085   33.1   1.3   18   80-97    279-296 (480)
201 PF08271 TF_Zn_Ribbon:  TFIIB z  24.1      31 0.00067   22.2   0.4    9   75-83     20-28  (43)
202 PRK03681 hypA hydrogenase nick  23.7      46   0.001   26.6   1.4   14   45-58     69-82  (114)
203 smart00154 ZnF_AN1 AN1-like Zi  23.6      38 0.00083   21.5   0.7   13  189-201    12-24  (39)
204 COG1571 Predicted DNA-binding   23.0      44 0.00095   33.0   1.4   13   74-86    367-379 (421)
205 PF03811 Zn_Tnp_IS1:  InsA N-te  22.8      14  0.0003   23.3  -1.4    8  188-195    28-35  (36)
206 PRK12496 hypothetical protein;  22.1      44 0.00095   28.6   1.0   12  130-141   127-138 (164)
207 PF05495 zf-CHY:  CHY zinc fing  21.9      17 0.00037   26.4  -1.3   12   47-58     11-22  (71)
208 PF13821 DUF4187:  Domain of un  21.7      95  0.0021   21.4   2.5   18  190-207    28-45  (55)
209 PRK00564 hypA hydrogenase nick  21.5      39 0.00084   27.2   0.6   14   45-58     70-83  (117)
210 PF04606 Ogr_Delta:  Ogr/Delta-  21.0      24 0.00052   23.4  -0.6   37  160-201     1-39  (47)
211 TIGR00686 phnA alkylphosphonat  21.0      44 0.00094   26.5   0.7   30  160-203     4-33  (109)
212 PF08790 zf-LYAR:  LYAR-type C2  20.9      21 0.00046   21.1  -0.8   18  190-208     1-18  (28)
213 KOG0320|consensus               20.9      82  0.0018   27.4   2.4   19   42-60    127-145 (187)
214 PF05191 ADK_lid:  Adenylate ki  20.9      27 0.00058   21.9  -0.4    8   77-84      4-11  (36)
215 PRK10220 hypothetical protein;  20.8      55  0.0012   26.0   1.2   30  160-203     5-34  (111)
216 PF04423 Rad50_zn_hook:  Rad50   20.5      40 0.00086   22.9   0.4   12  191-202    22-33  (54)
217 cd00924 Cyt_c_Oxidase_Vb Cytoc  20.4      47   0.001   25.9   0.8   14  187-200    77-90  (97)
218 PF01428 zf-AN1:  AN1-like Zinc  20.4      40 0.00087   21.8   0.4   14  188-201    12-25  (43)
219 PF01155 HypA:  Hydrogenase exp  20.2      41  0.0009   26.8   0.5   27   45-83     69-95  (113)
220 PLN02294 cytochrome c oxidase   20.2      48   0.001   28.5   0.9   16  186-201   138-153 (174)
221 COG2331 Uncharacterized protei  20.2      44 0.00095   24.7   0.5    9   47-55     13-21  (82)

No 1  
>KOG2462|consensus
Probab=99.93  E-value=1.3e-26  Score=207.04  Aligned_cols=132  Identities=27%  Similarity=0.527  Sum_probs=66.2

Q ss_pred             eeccccccccCCHHHHHHHHHhcCC---CCCccCCCCcccccCHHHHHHHHHHhcCCCccccCCCCCCcCCHHHHHHHHH
Q psy4572          75 FPCTVCLKMFKTKAQREEHIASHVT---DSEFKCDQCPYVAKRIKYLRYHMKKHRKEYSAYCEPCQEGFFSRDKLETHNI  151 (345)
Q Consensus        75 ~~C~~C~k~F~~~~~L~~H~~~h~~---~k~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~k~F~~~~~l~~H~~  151 (345)
                      |+|+.||+.+.+..+|.+|.++|..   .+.+.|+.|++.+.+...|+.|+++|.  -+++|.+|||.|....-|+.|+|
T Consensus       131 ~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~--l~c~C~iCGKaFSRPWLLQGHiR  208 (279)
T KOG2462|consen  131 YKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT--LPCECGICGKAFSRPWLLQGHIR  208 (279)
T ss_pred             eeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC--CCcccccccccccchHHhhcccc
Confidence            4444444444444444444444421   234455555555555555555555543  34455555555555555555555


Q ss_pred             HhcCCCCCCCcchhhccCCHHHHHHHHhhhCCCCcCceeecCccccccCCHHHHHHHHHH
Q psy4572         152 AKHGAQPYQCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCELCGKSFNNKQTFSLSNMY  211 (345)
Q Consensus       152 ~hh~~k~y~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~Cgk~F~~~~~L~~H~~~  211 (345)
                      +|+|||||.|+.|+|.|.++++|+.|+++|.   +.|+|+|+.|+|.|..++.|.+|...
T Consensus       209 THTGEKPF~C~hC~kAFADRSNLRAHmQTHS---~~K~~qC~~C~KsFsl~SyLnKH~ES  265 (279)
T KOG2462|consen  209 THTGEKPFSCPHCGKAFADRSNLRAHMQTHS---DVKKHQCPRCGKSFALKSYLNKHSES  265 (279)
T ss_pred             cccCCCCccCCcccchhcchHHHHHHHHhhc---CCccccCcchhhHHHHHHHHHHhhhh
Confidence            5555555555555555555555555555554   33345555555555555555555443


No 2  
>KOG2462|consensus
Probab=99.93  E-value=1.1e-26  Score=207.64  Aligned_cols=137  Identities=27%  Similarity=0.505  Sum_probs=129.2

Q ss_pred             CCCceeccccccccCCHHHHHHHHHHcCC---CCceeccccccccCCHHHHHHHHHhcCCCCCccCCCCcccccCHHHHH
Q psy4572          43 LDGPHKCEHCEKEFENRRALKRHEKNHTQ---EKTFPCTVCLKMFKTKAQREEHIASHVTDSEFKCDQCPYVAKRIKYLR  119 (345)
Q Consensus        43 ~~~~~~C~~C~k~F~~~~~L~~H~~~h~~---~k~~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~~~f~~~~~L~  119 (345)
                      ....|+|+.||+.+.+.+.|.+|.++|-.   .+.+.|+.|++.|.+...|+.|+++|.  -+++|.+||+.|.+.+-|+
T Consensus       127 ~~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~--l~c~C~iCGKaFSRPWLLQ  204 (279)
T KOG2462|consen  127 KHPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT--LPCECGICGKAFSRPWLLQ  204 (279)
T ss_pred             cCCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC--CCcccccccccccchHHhh
Confidence            34569999999999999999999999853   467999999999999999999999986  5799999999999999999


Q ss_pred             HHHHHhcCCCccccCCCCCCcCCHHHHHHHHHHhcCCCCCCCcchhhccCCHHHHHHHHhhh
Q psy4572         120 YHMKKHRKEYSAYCEPCQEGFFSRDKLETHNIAKHGAQPYQCETCRKLFLSKYSLFTHNKKH  181 (345)
Q Consensus       120 ~H~~~h~~~~~~~C~~C~k~F~~~~~l~~H~~~hh~~k~y~C~~Cgk~F~s~s~L~~H~~~H  181 (345)
                      .|+|+|+||+||.|+.|++.|.++++|+.|+++|.+.|+|+|+.|+|+|..++-|.+|...-
T Consensus       205 GHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES~  266 (279)
T KOG2462|consen  205 GHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSESA  266 (279)
T ss_pred             cccccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhhc
Confidence            99999999999999999999999999999999999999999999999999999999998753


No 3  
>KOG3608|consensus
Probab=99.84  E-value=2.3e-21  Score=177.66  Aligned_cols=182  Identities=21%  Similarity=0.429  Sum_probs=131.3

Q ss_pred             cccccccCCCCCCCceeccccccccCCHHHHHHHHHHcC--CCCceeccccccccCCHHHHHHHHHhcCCCCCccCCCCc
Q psy4572          32 KKRSRKSTPQKLDGPHKCEHCEKEFENRRALKRHEKNHT--QEKTFPCTVCLKMFKTKAQREEHIASHVTDSEFKCDQCP  109 (345)
Q Consensus        32 ~~~~~~~~~~~~~~~~~C~~C~k~F~~~~~L~~H~~~h~--~~k~~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~  109 (345)
                      .++.++...+.+++...|+.||..|.++..|-.|.+..+  ...+|.|..|.+.|.+...|..|+..|..  .|+|+.|+
T Consensus       193 ~~LreH~r~Hs~eKvvACp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHvn--~ykCplCd  270 (467)
T KOG3608|consen  193 YRLREHIRTHSNEKVVACPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHVN--CYKCPLCD  270 (467)
T ss_pred             HHHHHHHHhcCCCeEEecchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhhh--cccccccc
Confidence            344555566666666666666666666666666655433  33466777777777777777777666643  37777777


Q ss_pred             ccccCHHHHHHHHHH-hcCCCccccCCCCCCcCCHHHHHHHHHHhcCCCCCCCcc--hhhccCCHHHHHHHHhhhCCCCc
Q psy4572         110 YVAKRIKYLRYHMKK-HRKEYSAYCEPCQEGFFSRDKLETHNIAKHGAQPYQCET--CRKLFLSKYSLFTHNKKHLPIEE  186 (345)
Q Consensus       110 ~~f~~~~~L~~H~~~-h~~~~~~~C~~C~k~F~~~~~l~~H~~~hh~~k~y~C~~--Cgk~F~s~s~L~~H~~~H~~~~~  186 (345)
                      -+....+.|..|++. |...+||+|+.|++.|.+..+|.+|...|. +..|+|+.  |..+|.+...+++|++.++....
T Consensus       271 mtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~n  349 (467)
T KOG3608|consen  271 MTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQVHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNN  349 (467)
T ss_pred             cCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHHHHHHHhcc-ccceecCCCCCcHHHHHHHHHHHHHHHhccCCC
Confidence            777777777777663 566778888888888888888888877655 66788876  88888888888888887776556


Q ss_pred             CceeecCccccccCCHHHHHHHHHHhcCCC
Q psy4572         187 RLHYQCELCGKSFNNKQTFSLSNMYVLSLS  216 (345)
Q Consensus       187 ~k~y~C~~Cgk~F~~~~~L~~H~~~~H~~~  216 (345)
                      ..+|.|..|++.|++-.+|.+|++..|++.
T Consensus       350 p~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f~  379 (467)
T KOG3608|consen  350 PILYACHCCDRFFTSGKSLSAHLMKKHGFR  379 (467)
T ss_pred             CCceeeecchhhhccchhHHHHHHHhhccc
Confidence            677888888888888888888888888753


No 4  
>KOG1074|consensus
Probab=99.82  E-value=2.9e-21  Score=193.35  Aligned_cols=171  Identities=23%  Similarity=0.465  Sum_probs=148.6

Q ss_pred             CCCCceeccccccccCCHHHHHHHHHHcCCCCceeccccccccCCHHHHHHHHHhcCCC----CCccCC---CCcccccC
Q psy4572          42 KLDGPHKCEHCEKEFENRRALKRHEKNHTQEKTFPCTVCLKMFKTKAQREEHIASHVTD----SEFKCD---QCPYVAKR  114 (345)
Q Consensus        42 ~~~~~~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~k~F~~~~~L~~H~~~h~~~----k~~~C~---~C~~~f~~  114 (345)
                      +...+-+|-+|.++...++.|+.|.++|+|++||+|.+||+.|+++++|+.|+..|...    -.|.|+   +|.+.|..
T Consensus       601 ~~TdPNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn  680 (958)
T KOG1074|consen  601 KRTDPNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTN  680 (958)
T ss_pred             ccCCccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhcccccc
Confidence            33467899999999999999999999999999999999999999999999999988654    348899   99999999


Q ss_pred             HHHHHHHHHHhcCC-C------------ccccCCCCCCcCCHHHHHHHHHHhcCC-------------------------
Q psy4572         115 IKYLRYHMKKHRKE-Y------------SAYCEPCQEGFFSRDKLETHNIAKHGA-------------------------  156 (345)
Q Consensus       115 ~~~L~~H~~~h~~~-~------------~~~C~~C~k~F~~~~~l~~H~~~hh~~-------------------------  156 (345)
                      .-.+.+|++.|.+. .            .-+|..|.+.|.....+..++..|.+.                         
T Consensus       681 ~V~lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~~~tp~~~e  760 (958)
T KOG1074|consen  681 AVTLPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEELDVTPPPPE  760 (958)
T ss_pred             cccccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCcccccccccccccccCCCccc
Confidence            99999999998632 1            135889999998887777776665210                         


Q ss_pred             --------------------------------------------------------------------------------
Q psy4572         157 --------------------------------------------------------------------------------  156 (345)
Q Consensus       157 --------------------------------------------------------------------------------  156 (345)
                                                                                                      
T Consensus       761 ~~~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l~eg  840 (958)
T KOG1074|consen  761 NSCGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSMLNEG  840 (958)
T ss_pred             cccccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhcccccccccc
Confidence                                                                                            


Q ss_pred             -------------------------------------CCCCCcchhhccCCHHHHHHHHhhhCCCCcCceeecCcccccc
Q psy4572         157 -------------------------------------QPYQCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCELCGKSF  199 (345)
Q Consensus       157 -------------------------------------k~y~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~Cgk~F  199 (345)
                                                           ....|..||+.|.+.+.|+.|+++|+   ++|||.|..|++.|
T Consensus       841 ~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHt---g~KPF~C~fC~~aF  917 (958)
T KOG1074|consen  841 LATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHT---GPKPFFCHFCEEAF  917 (958)
T ss_pred             cccccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCC---CCCCccchhhhhhh
Confidence                                                 00679999999999999999999998   88899999999999


Q ss_pred             CCHHHHHHHHHHhcCC
Q psy4572         200 NNKQTFSLSNMYVLSL  215 (345)
Q Consensus       200 ~~~~~L~~H~~~~H~~  215 (345)
                      ..+.+|+.||.+|+..
T Consensus       918 ttrgnLKvHMgtH~w~  933 (958)
T KOG1074|consen  918 TTRGNLKVHMGTHMWV  933 (958)
T ss_pred             hhhhhhhhhhcccccc
Confidence            9999999999988764


No 5  
>KOG3608|consensus
Probab=99.81  E-value=6e-20  Score=168.41  Aligned_cols=179  Identities=25%  Similarity=0.477  Sum_probs=158.0

Q ss_pred             eec--cccccccCCHHHHHHHHHHcCCCCceeccccccccCCHHHHHHHHHhcC--CCCCccCCCCcccccCHHHHHHHH
Q psy4572          47 HKC--EHCEKEFENRRALKRHEKNHTQEKTFPCTVCLKMFKTKAQREEHIASHV--TDSEFKCDQCPYVAKRIKYLRYHM  122 (345)
Q Consensus        47 ~~C--~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~k~F~~~~~L~~H~~~h~--~~k~~~C~~C~~~f~~~~~L~~H~  122 (345)
                      +.|  .-|-+.|.++..|+.|.+.|+++|...|+.||.-|.++..|-.|.+..+  ...+|.|..|.+.|.+...|..|+
T Consensus       178 ~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv  257 (467)
T KOG3608|consen  178 TMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHV  257 (467)
T ss_pred             eeccchhhhhhhccHHHHHHHHHhcCCCeEEecchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHH
Confidence            556  4699999999999999999999999999999999999999999997643  456899999999999999999999


Q ss_pred             HHhcCCCccccCCCCCCcCCHHHHHHHHHHhc-CCCCCCCcchhhccCCHHHHHHHHhhhCCCCcCceeecCc--ccccc
Q psy4572         123 KKHRKEYSAYCEPCQEGFFSRDKLETHNIAKH-GAQPYQCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCEL--CGKSF  199 (345)
Q Consensus       123 ~~h~~~~~~~C~~C~k~F~~~~~l~~H~~~hh-~~k~y~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~--Cgk~F  199 (345)
                      ..|-  ..|+|+.|+.+...+++|.+|++..| ..+||+|+.|.+.|.+.+.|.+|...|.    +..|+|+.  |..+|
T Consensus       258 ~rHv--n~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~~HS----~~~y~C~h~~C~~s~  331 (467)
T KOG3608|consen  258 VRHV--NCYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQVHS----KTVYQCEHPDCHYSV  331 (467)
T ss_pred             HHhh--hcccccccccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHHHHHHHhcc----ccceecCCCCCcHHH
Confidence            9886  46899999999999999999998755 6899999999999999999999999885    45699988  99999


Q ss_pred             CCHHHHHHHHHHhcCCCCcccccccceeeechhhHHHH
Q psy4572         200 NNKQTFSLSNMYVLSLSNMYVLSSSNMYVLSLSNMYVL  237 (345)
Q Consensus       200 ~~~~~L~~H~~~~H~~~~~~~~s~~~~~~~~l~~~~~~  237 (345)
                      .+...+++|++.+|...+      ...|.|-++|++..
T Consensus       332 r~~~q~~~H~~evhEg~n------p~~Y~CH~Cdr~ft  363 (467)
T KOG3608|consen  332 RTYTQMRRHFLEVHEGNN------PILYACHCCDRFFT  363 (467)
T ss_pred             HHHHHHHHHHHHhccCCC------CCceeeecchhhhc
Confidence            999999999999994322      23566777776653


No 6  
>KOG1074|consensus
Probab=99.78  E-value=4.6e-20  Score=184.79  Aligned_cols=53  Identities=40%  Similarity=0.710  Sum_probs=50.6

Q ss_pred             CceeccccccccCCHHHHHHHHHHcCCCCceeccccccccCCHHHHHHHHHhc
Q psy4572          45 GPHKCEHCEKEFENRRALKRHEKNHTQEKTFPCTVCLKMFKTKAQREEHIASH   97 (345)
Q Consensus        45 ~~~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~k~F~~~~~L~~H~~~h   97 (345)
                      .+++|.+|.+.|...+.|+.|.|.|+|++||+|.+||..|.++.+|+.|...|
T Consensus       352 ~khkCr~CakvfgS~SaLqiHlRSHTGERPfqCnvCG~~FSTkGNLKvH~~rH  404 (958)
T KOG1074|consen  352 FKHKCRFCAKVFGSDSALQIHLRSHTGERPFQCNVCGNRFSTKGNLKVHFQRH  404 (958)
T ss_pred             ccchhhhhHhhcCchhhhhhhhhccCCCCCeeecccccccccccceeeeeeec
Confidence            45889999999999999999999999999999999999999999999998877


No 7  
>KOG3576|consensus
Probab=99.67  E-value=2.9e-17  Score=140.57  Aligned_cols=120  Identities=27%  Similarity=0.471  Sum_probs=103.9

Q ss_pred             CCCccCCCCcccccCHHHHHHHHHHhcCCCccccCCCCCCcCCHHHHHHHHHHhcCCCCCCCcchhhccCCHHHHHHHHh
Q psy4572         100 DSEFKCDQCPYVAKRIKYLRYHMKKHRKEYSAYCEPCQEGFFSRDKLETHNIAKHGAQPYQCETCRKLFLSKYSLFTHNK  179 (345)
Q Consensus       100 ~k~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~k~F~~~~~l~~H~~~hh~~k~y~C~~Cgk~F~s~s~L~~H~~  179 (345)
                      ...|.|..|++.|.-...|.+|++-|...+.+-|..||+.|.+..+|++|.++|+|.+||+|..|+|.|+.+-.|..|.+
T Consensus       115 ~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl~  194 (267)
T KOG3576|consen  115 QDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHLK  194 (267)
T ss_pred             CCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHHH
Confidence            44588888888888888888898888888888899999999999999999999999999999999999999999999988


Q ss_pred             hhCCC--------CcCceeecCccccccCCHHHHHHHHHHhcCCCCcc
Q psy4572         180 KHLPI--------EERLHYQCELCGKSFNNKQTFSLSNMYVLSLSNMY  219 (345)
Q Consensus       180 ~H~~~--------~~~k~y~C~~Cgk~F~~~~~L~~H~~~~H~~~~~~  219 (345)
                      ..|+.        +..|.|.|+.||..-...+....|++.||..+.-+
T Consensus       195 kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~Spal  242 (267)
T KOG3576|consen  195 KVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFSPAL  242 (267)
T ss_pred             HHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCCCHHH
Confidence            76654        24567999999999999999999999999866543


No 8  
>KOG3576|consensus
Probab=99.64  E-value=2.5e-16  Score=134.92  Aligned_cols=123  Identities=28%  Similarity=0.435  Sum_probs=90.4

Q ss_pred             CCceeccccccccCCHHHHHHHHHHcCCCCceeccccccccCCHHHHHHHHHhcCCCCCccCCCCcccccCHHHHHHHHH
Q psy4572          44 DGPHKCEHCEKEFENRRALKRHEKNHTQEKTFPCTVCLKMFKTKAQREEHIASHVTDSEFKCDQCPYVAKRIKYLRYHMK  123 (345)
Q Consensus        44 ~~~~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~~~f~~~~~L~~H~~  123 (345)
                      ...|.|.+|++.|.-...|.+|++-|...+.+-|..||+.|....+|++|+++|+|.+||+|..|++.|..+-.|..|.+
T Consensus       115 ~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl~  194 (267)
T KOG3576|consen  115 QDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHLK  194 (267)
T ss_pred             CCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHHH
Confidence            34588888888888888888888888888888888888888888888888888888888888888888888888888876


Q ss_pred             HhcCCCccccCCCCCCcCCHHHHHHHHHHhcCCCCCCCcchhhccCCHHHHHHHHhhhCC
Q psy4572         124 KHRKEYSAYCEPCQEGFFSRDKLETHNIAKHGAQPYQCETCRKLFLSKYSLFTHNKKHLP  183 (345)
Q Consensus       124 ~h~~~~~~~C~~C~k~F~~~~~l~~H~~~hh~~k~y~C~~Cgk~F~s~s~L~~H~~~H~~  183 (345)
                      .-++...            ...+     ....++-|.|+.||.+-.....+..|++.||+
T Consensus       195 kvhgv~~------------~yay-----kerr~kl~vcedcg~t~~~~e~~~~h~~~~hp  237 (267)
T KOG3576|consen  195 KVHGVQH------------QYAY-----KERRAKLYVCEDCGYTSERPEVYYLHLKLHHP  237 (267)
T ss_pred             HHcCchH------------HHHH-----HHhhhheeeecccCCCCCChhHHHHHHHhcCC
Confidence            5443210            0000     01124556666666666666666666666663


No 9  
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.45  E-value=3.2e-13  Score=134.66  Aligned_cols=149  Identities=17%  Similarity=0.397  Sum_probs=119.6

Q ss_pred             CceeccccccccCCHHHHHHHHHHcCCCCceeccc--cccccCCHHHHHHHHHhcCCCCCccCCCCcccccCHHHHHHHH
Q psy4572          45 GPHKCEHCEKEFENRRALKRHEKNHTQEKTFPCTV--CLKMFKTKAQREEHIASHVTDSEFKCDQCPYVAKRIKYLRYHM  122 (345)
Q Consensus        45 ~~~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~--C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~~~f~~~~~L~~H~  122 (345)
                      ..-.|+.|...... ..|..|.... ......|+.  |+..|.. ..+..|         +.|+.|++.|. ...+..|+
T Consensus       406 ~~V~C~NC~~~i~l-~~l~lHe~~C-~r~~V~Cp~~~Cg~v~~r-~el~~H---------~~C~~Cgk~f~-~s~LekH~  472 (567)
T PLN03086        406 DTVECRNCKHYIPS-RSIALHEAYC-SRHNVVCPHDGCGIVLRV-EEAKNH---------VHCEKCGQAFQ-QGEMEKHM  472 (567)
T ss_pred             CeEECCCCCCccch-hHHHHHHhhC-CCcceeCCcccccceeec-cccccC---------ccCCCCCCccc-hHHHHHHH
Confidence            34689999987655 5567887554 334567985  9998833 333433         58999999996 67899999


Q ss_pred             HHhcCCCccccCCCCCCcCCHHHHHHHHHHhcCCCCCCCcchhhccCC----------HHHHHHHHhhhCCCCcCceeec
Q psy4572         123 KKHRKEYSAYCEPCQEGFFSRDKLETHNIAKHGAQPYQCETCRKLFLS----------KYSLFTHNKKHLPIEERLHYQC  192 (345)
Q Consensus       123 ~~h~~~~~~~C~~C~k~F~~~~~l~~H~~~hh~~k~y~C~~Cgk~F~s----------~s~L~~H~~~H~~~~~~k~y~C  192 (345)
                      +.|+  +++.|+ |++.+ .+..|..|...|.+.+++.|+.|++.|..          ...|..|...+    +.+++.|
T Consensus       473 ~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C----G~rt~~C  544 (567)
T PLN03086        473 KVFH--EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC----GSRTAPC  544 (567)
T ss_pred             HhcC--CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhc----CCcceEc
Confidence            9985  789999 99765 67899999999999999999999999952          45788888885    6779999


Q ss_pred             CccccccCCHHHHHHHHHHhcCC
Q psy4572         193 ELCGKSFNNKQTFSLSNMYVLSL  215 (345)
Q Consensus       193 ~~Cgk~F~~~~~L~~H~~~~H~~  215 (345)
                      ..||+.|..+ .+..|+..+|.+
T Consensus       545 ~~Cgk~Vrlr-dm~~H~~~~h~~  566 (567)
T PLN03086        545 DSCGRSVMLK-EMDIHQIAVHQK  566 (567)
T ss_pred             cccCCeeeeh-hHHHHHHHhhcC
Confidence            9999999865 799999999975


No 10 
>KOG3623|consensus
Probab=99.45  E-value=2e-14  Score=142.55  Aligned_cols=119  Identities=23%  Similarity=0.393  Sum_probs=78.1

Q ss_pred             eccccccccCCHHHHHHHHHhcC--CCCCccCCCCcccccCHHHHHHHHHHhcCCCccccCCCCCCcCCHHHHHHHHHHh
Q psy4572          76 PCTVCLKMFKTKAQREEHIASHV--TDSEFKCDQCPYVAKRIKYLRYHMKKHRKEYSAYCEPCQEGFFSRDKLETHNIAK  153 (345)
Q Consensus        76 ~C~~C~k~F~~~~~L~~H~~~h~--~~k~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~k~F~~~~~l~~H~~~h  153 (345)
                      .|+.|.+.+.....++.|++.-+  .+..|.|..|.++|..+..|.+|+..|....+-.               .+.-.-
T Consensus       212 tcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa---------------~sltqs  276 (1007)
T KOG3623|consen  212 TCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQA---------------ISLTQS  276 (1007)
T ss_pred             cchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCccc---------------ccccch
Confidence            45555555555555555554322  2233555555555555555555555553211100               111111


Q ss_pred             cCCCCCCCcchhhccCCHHHHHHHHhhhCCCCcCceeecCccccccCCHHHHHHHHHHh
Q psy4572         154 HGAQPYQCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCELCGKSFNNKQTFSLSNMYV  212 (345)
Q Consensus       154 h~~k~y~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~Cgk~F~~~~~L~~H~~~~  212 (345)
                      ..-+.|+|.+|||.|+.+.+|+.|+|+|.   |+|||.|+.|+|+|+....+..|+-..
T Consensus       277 a~lRKFKCtECgKAFKfKHHLKEHlRIHS---GEKPfeCpnCkKRFSHSGSySSHmSSK  332 (1007)
T KOG3623|consen  277 ALLRKFKCTECGKAFKFKHHLKEHLRIHS---GEKPFECPNCKKRFSHSGSYSSHMSSK  332 (1007)
T ss_pred             hhhccccccccchhhhhHHHHHhhheeec---CCCCcCCcccccccccCCccccccccc
Confidence            23467999999999999999999999998   889999999999999999999998543


No 11 
>KOG3623|consensus
Probab=99.43  E-value=6.1e-14  Score=139.16  Aligned_cols=112  Identities=26%  Similarity=0.513  Sum_probs=93.8

Q ss_pred             CCCCCCceeccccccccCCHHHHHHHHHHcCC--CCceeccccccccCCHHHHHHHHHhcCC-------------CCCcc
Q psy4572          40 PQKLDGPHKCEHCEKEFENRRALKRHEKNHTQ--EKTFPCTVCLKMFKTKAQREEHIASHVT-------------DSEFK  104 (345)
Q Consensus        40 ~~~~~~~~~C~~C~k~F~~~~~L~~H~~~h~~--~k~~~C~~C~k~F~~~~~L~~H~~~h~~-------------~k~~~  104 (345)
                      +........|+.|++.+.....|+.|++..+.  +..|.|..|..+|..+..|.+|+..|..             .+.|+
T Consensus       204 pdAfsqlltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFK  283 (1007)
T KOG3623|consen  204 PDAFSQLLTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFK  283 (1007)
T ss_pred             cchhhhhhcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhcccc
Confidence            33334557899999999999999999887543  4559999999999999999999998843             35588


Q ss_pred             CCCCcccccCHHHHHHHHHHhcCCCccccCCCCCCcCCHHHHHHHHH
Q psy4572         105 CDQCPYVAKRIKYLRYHMKKHRKEYSAYCEPCQEGFFSRDKLETHNI  151 (345)
Q Consensus       105 C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~k~F~~~~~l~~H~~  151 (345)
                      |..||+.|..+-.|+.|+|+|.|++||.|+.|+|+|.....+..|+.
T Consensus       284 CtECgKAFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmS  330 (1007)
T KOG3623|consen  284 CTECGKAFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMS  330 (1007)
T ss_pred             ccccchhhhhHHHHHhhheeecCCCCcCCcccccccccCCccccccc
Confidence            88888888888888888888888888888888888888888887764


No 12 
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.16  E-value=4.5e-11  Score=119.45  Aligned_cols=119  Identities=18%  Similarity=0.374  Sum_probs=97.5

Q ss_pred             eeccc--cccccCCHHHHHHHHHHcCCCCceeccccccccCCHHHHHHHHHhcCCCCCccCCCCcccccCHHHHHHHHHH
Q psy4572          47 HKCEH--CEKEFENRRALKRHEKNHTQEKTFPCTVCLKMFKTKAQREEHIASHVTDSEFKCDQCPYVAKRIKYLRYHMKK  124 (345)
Q Consensus        47 ~~C~~--C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~~~f~~~~~L~~H~~~  124 (345)
                      -.|+.  |+..|. +..+..|         +.|+.|++.|. ...+..|+..|+  +++.|+ ||+.+ ....|..|+..
T Consensus       434 V~Cp~~~Cg~v~~-r~el~~H---------~~C~~Cgk~f~-~s~LekH~~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~t  498 (567)
T PLN03086        434 VVCPHDGCGIVLR-VEEAKNH---------VHCEKCGQAFQ-QGEMEKHMKVFH--EPLQCP-CGVVL-EKEQMVQHQAS  498 (567)
T ss_pred             eeCCcccccceee-ccccccC---------ccCCCCCCccc-hHHHHHHHHhcC--CCccCC-CCCCc-chhHHHhhhhc
Confidence            46874  999883 3334443         68999999996 678999999986  789999 99765 66899999999


Q ss_pred             hcCCCccccCCCCCCcCC----------HHHHHHHHHHhcCCCCCCCcchhhccCCHHHHHHHHhhhC
Q psy4572         125 HRKEYSAYCEPCQEGFFS----------RDKLETHNIAKHGAQPYQCETCRKLFLSKYSLFTHNKKHL  182 (345)
Q Consensus       125 h~~~~~~~C~~C~k~F~~----------~~~l~~H~~~hh~~k~y~C~~Cgk~F~s~s~L~~H~~~H~  182 (345)
                      |.+++++.|+.|++.|..          ...|..|.... |.+++.|..||+.+..+ .+..|+...|
T Consensus       499 hCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk~Vrlr-dm~~H~~~~h  564 (567)
T PLN03086        499 TCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDSCGRSVMLK-EMDIHQIAVH  564 (567)
T ss_pred             cCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhc-CCcceEccccCCeeeeh-hHHHHHHHhh
Confidence            999999999999999852          35799998884 89999999999998765 5788877665


No 13 
>PHA00733 hypothetical protein
Probab=99.11  E-value=6.2e-11  Score=97.56  Aligned_cols=85  Identities=19%  Similarity=0.286  Sum_probs=65.7

Q ss_pred             CCCccccCCCCCCcCCHHHHHHH--H---HHhcCCCCCCCcchhhccCCHHHHHHHHhhhCCCCcCceeecCccccccCC
Q psy4572         127 KEYSAYCEPCQEGFFSRDKLETH--N---IAKHGAQPYQCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCELCGKSFNN  201 (345)
Q Consensus       127 ~~~~~~C~~C~k~F~~~~~l~~H--~---~~hh~~k~y~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~Cgk~F~~  201 (345)
                      .++++.|.+|++.|.....+..|  .   ..+++.+||.|+.||+.|.+...|..|++.|.     .+|.|+.|++.|..
T Consensus        37 ~~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h~-----~~~~C~~CgK~F~~  111 (128)
T PHA00733         37 EQKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYTE-----HSKVCPVCGKEFRN  111 (128)
T ss_pred             hhhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcCC-----cCccCCCCCCccCC
Confidence            45566677776666665554444  1   23345789999999999999999999998752     35899999999999


Q ss_pred             HHHHHHHHHHhcCCC
Q psy4572         202 KQTFSLSNMYVLSLS  216 (345)
Q Consensus       202 ~~~L~~H~~~~H~~~  216 (345)
                      ...|.+|+...|+++
T Consensus       112 ~~sL~~H~~~~h~~~  126 (128)
T PHA00733        112 TDSTLDHVCKKHNIC  126 (128)
T ss_pred             HHHHHHHHHHhcCcc
Confidence            999999999999875


No 14 
>KOG3993|consensus
Probab=98.92  E-value=6.5e-11  Score=111.70  Aligned_cols=173  Identities=18%  Similarity=0.236  Sum_probs=111.8

Q ss_pred             CceeccccccccCCHHHHHHHHHHcCCCCceeccccccccCCHHHHHHHHHhcCCC------------------------
Q psy4572          45 GPHKCEHCEKEFENRRALKRHEKNHTQEKTFPCTVCLKMFKTKAQREEHIASHVTD------------------------  100 (345)
Q Consensus        45 ~~~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~k~F~~~~~L~~H~~~h~~~------------------------  100 (345)
                      +-|.|..|...|.+.-.|.+|.....-...|+|++|+|.|+-..+|..|.+.|...                        
T Consensus       266 GdyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea  345 (500)
T KOG3993|consen  266 GDYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEA  345 (500)
T ss_pred             HHHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhc
Confidence            34999999999999999999976544445599999999999999999999988421                        


Q ss_pred             ---------CCccCCCCcccccCHHHHHHHHHHhcCCCc-----------------cccCCCCCCcCCHHHHHHHHHHhc
Q psy4572         101 ---------SEFKCDQCPYVAKRIKYLRYHMKKHRKEYS-----------------AYCEPCQEGFFSRDKLETHNIAKH  154 (345)
Q Consensus       101 ---------k~~~C~~C~~~f~~~~~L~~H~~~h~~~~~-----------------~~C~~C~k~F~~~~~l~~H~~~hh  154 (345)
                               ..|.|..|++.|.+...|+.|+.+|+....                 +.|+.|...+...+.-..+...+.
T Consensus       346 ~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~vl~~a  425 (500)
T KOG3993|consen  346 ERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEVLYVA  425 (500)
T ss_pred             cccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccceeeee
Confidence                     125666666666666666666655543110                 112222222222111111111110


Q ss_pred             C-CCCCCCcchhhccCCHHHHHHHHhhhCCCCcCceeecCccccccCCHHHHHHHHHHhcCCCCccc
Q psy4572         155 G-AQPYQCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCELCGKSFNNKQTFSLSNMYVLSLSNMYV  220 (345)
Q Consensus       155 ~-~k~y~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~Cgk~F~~~~~L~~H~~~~H~~~~~~~  220 (345)
                      + .....|+.||..+.+...-..|.+.-+   .+..|.|.+|.-.|.....|.+|+...|.-..+++
T Consensus       426 ~sael~~pp~~~~ppsss~~sgg~~rlg~---~~q~f~~ky~~atfyss~~ltrhin~~Hpse~rqv  489 (500)
T KOG3993|consen  426 GSAELELPPYDGSPPSSSGSSGGYGRLGI---AEQGFTCKYCPATFYSSPGLTRHINKCHPSELRQV  489 (500)
T ss_pred             ccccccCCCCCCCCcccCCCCCccccccc---hhhccccccchHhhhcCcchHhHhhhcChHHhhhh
Confidence            0 122456777766666655444544333   33459999999999999999999999998655543


No 15 
>PHA00733 hypothetical protein
Probab=98.90  E-value=2e-09  Score=88.65  Aligned_cols=53  Identities=25%  Similarity=0.280  Sum_probs=27.1

Q ss_pred             CCccccCCCCCCcCCHHHHHHHHHHhcCCCCCCCcchhhccCCHHHHHHHHhhhC
Q psy4572         128 EYSAYCEPCQEGFFSRDKLETHNIAKHGAQPYQCETCRKLFLSKYSLFTHNKKHL  182 (345)
Q Consensus       128 ~~~~~C~~C~k~F~~~~~l~~H~~~hh~~k~y~C~~Cgk~F~s~s~L~~H~~~H~  182 (345)
                      +++|.|+.|++.|.+...|..|++.+  +.+|.|+.|++.|.....|..|+...|
T Consensus        71 ~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL~~H~~~~h  123 (128)
T PHA00733         71 VSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDSTLDHVCKKH  123 (128)
T ss_pred             CCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHHHHHHHHhc
Confidence            44455555555555555555554433  234555555555555555555555544


No 16 
>PHA02768 hypothetical protein; Provisional
Probab=98.65  E-value=9.9e-09  Score=70.67  Aligned_cols=44  Identities=20%  Similarity=0.508  Sum_probs=34.2

Q ss_pred             CCCCcchhhccCCHHHHHHHHhhhCCCCcCceeecCccccccCCHHHHH
Q psy4572         158 PYQCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCELCGKSFNNKQTFS  206 (345)
Q Consensus       158 ~y~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~Cgk~F~~~~~L~  206 (345)
                      -|+|+.||+.|...++|..|+++|+     ++|+|..|++.|.+.+.|.
T Consensus         5 ~y~C~~CGK~Fs~~~~L~~H~r~H~-----k~~kc~~C~k~f~~~s~l~   48 (55)
T PHA02768          5 GYECPICGEIYIKRKSMITHLRKHN-----TNLKLSNCKRISLRTGEYI   48 (55)
T ss_pred             ccCcchhCCeeccHHHHHHHHHhcC-----CcccCCcccceecccceeE
Confidence            3678888888888888888888875     3688888888887766553


No 17 
>PHA02768 hypothetical protein; Provisional
Probab=98.56  E-value=2.6e-08  Score=68.63  Aligned_cols=43  Identities=14%  Similarity=0.393  Sum_probs=31.1

Q ss_pred             ceeccccccccCCHHHHHHHHHHcCCCCceeccccccccCCHHHH
Q psy4572          46 PHKCEHCEKEFENRRALKRHEKNHTQEKTFPCTVCLKMFKTKAQR   90 (345)
Q Consensus        46 ~~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~k~F~~~~~L   90 (345)
                      .|+|+.||+.|...+.|..|+++|+  ++++|..|++.|.....|
T Consensus         5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l   47 (55)
T PHA02768          5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEY   47 (55)
T ss_pred             ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceeccccee
Confidence            3677777777777777777777776  467777777777655543


No 18 
>KOG3993|consensus
Probab=98.50  E-value=7.9e-09  Score=97.85  Aligned_cols=152  Identities=17%  Similarity=0.182  Sum_probs=102.7

Q ss_pred             ccccccCCCCCCCceeccccccccCCHHHHHHHHHHcCC---------------------------------CCceeccc
Q psy4572          33 KRSRKSTPQKLDGPHKCEHCEKEFENRRALKRHEKNHTQ---------------------------------EKTFPCTV   79 (345)
Q Consensus        33 ~~~~~~~~~~~~~~~~C~~C~k~F~~~~~L~~H~~~h~~---------------------------------~k~~~C~~   79 (345)
                      .+.+|.-..-..-.|+|++|+|+|....+|..|.|.|..                                 +..|.|.+
T Consensus       282 ~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea~rsg~dss~gi~~C~~  361 (500)
T KOG3993|consen  282 ALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEAERSGDDSSSGIFSCHT  361 (500)
T ss_pred             HHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhccccCCcccCceeecHH
Confidence            445555444444569999999999999999999998831                                 12489999


Q ss_pred             cccccCCHHHHHHHHHhcCCCCC-----------------ccCCCCcccccCHHHHHHHHHHhcC-CCccccCCCCCCcC
Q psy4572          80 CLKMFKTKAQREEHIASHVTDSE-----------------FKCDQCPYVAKRIKYLRYHMKKHRK-EYSAYCEPCQEGFF  141 (345)
Q Consensus        80 C~k~F~~~~~L~~H~~~h~~~k~-----------------~~C~~C~~~f~~~~~L~~H~~~h~~-~~~~~C~~C~k~F~  141 (345)
                      |++.|.+..-|+.|+.+|+....                 +-|+.|...+.....-..+...+.+ .....|+.|+..+.
T Consensus       362 C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~vl~~a~sael~~pp~~~~pps  441 (500)
T KOG3993|consen  362 CGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEVLYVAGSAELELPPYDGSPPS  441 (500)
T ss_pred             hhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccceeeeeccccccCCCCCCCCcc
Confidence            99999999999999988754211                 2233333322222111111111111 11234788887777


Q ss_pred             CHHHHHHHHHHhcCCCCCCCcchhhccCCHHHHHHHHhhhCCC
Q psy4572         142 SRDKLETHNIAKHGAQPYQCETCRKLFLSKYSLFTHNKKHLPI  184 (345)
Q Consensus       142 ~~~~l~~H~~~hh~~k~y~C~~Cgk~F~s~s~L~~H~~~H~~~  184 (345)
                      +...-..|.+.-+.+.-|.|.+|.-+|.+...|.+|+...|..
T Consensus       442 ss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~~Hps  484 (500)
T KOG3993|consen  442 SSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINKCHPS  484 (500)
T ss_pred             cCCCCCccccccchhhccccccchHhhhcCcchHhHhhhcChH
Confidence            6665555555555677899999999999999999999987743


No 19 
>PF13465 zf-H2C2_2:  Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.23  E-value=9e-07  Score=51.96  Aligned_cols=25  Identities=44%  Similarity=0.849  Sum_probs=19.2

Q ss_pred             HHHHHHHHcCCCCceeccccccccC
Q psy4572          61 ALKRHEKNHTQEKTFPCTVCLKMFK   85 (345)
Q Consensus        61 ~L~~H~~~h~~~k~~~C~~C~k~F~   85 (345)
                      +|..|++.|++++||.|+.|++.|.
T Consensus         1 ~l~~H~~~H~~~k~~~C~~C~k~F~   25 (26)
T PF13465_consen    1 NLRRHMRTHTGEKPYKCPYCGKSFS   25 (26)
T ss_dssp             HHHHHHHHHSSSSSEEESSSSEEES
T ss_pred             CHHHHhhhcCCCCCCCCCCCcCeeC
Confidence            3677888888888888888887775


No 20 
>PF13465 zf-H2C2_2:  Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.18  E-value=1.3e-06  Score=51.26  Aligned_cols=25  Identities=40%  Similarity=0.812  Sum_probs=18.0

Q ss_pred             HHHHHHhhhCCCCcCceeecCccccccC
Q psy4572         173 SLFTHNKKHLPIEERLHYQCELCGKSFN  200 (345)
Q Consensus       173 ~L~~H~~~H~~~~~~k~y~C~~Cgk~F~  200 (345)
                      +|.+|+++|+   +++||.|+.|++.|.
T Consensus         1 ~l~~H~~~H~---~~k~~~C~~C~k~F~   25 (26)
T PF13465_consen    1 NLRRHMRTHT---GEKPYKCPYCGKSFS   25 (26)
T ss_dssp             HHHHHHHHHS---SSSSEEESSSSEEES
T ss_pred             CHHHHhhhcC---CCCCCCCCCCcCeeC
Confidence            3677777776   666777777777775


No 21 
>PF05605 zf-Di19:  Drought induced 19 protein (Di19), zinc-binding;  InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.15  E-value=2.8e-06  Score=59.09  Aligned_cols=53  Identities=19%  Similarity=0.310  Sum_probs=41.9

Q ss_pred             CCCCcchhhccCCHHHHHHHHhhhCCCCcCceeecCccccccCCHHHHHHHHHHhcC
Q psy4572         158 PYQCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCELCGKSFNNKQTFSLSNMYVLS  214 (345)
Q Consensus       158 ~y~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~Cgk~F~~~~~L~~H~~~~H~  214 (345)
                      .|.||.|++ ..+...|..|....|..+ .+.+.||+|...+.  .+|.+|+..+|+
T Consensus         2 ~f~CP~C~~-~~~~~~L~~H~~~~H~~~-~~~v~CPiC~~~~~--~~l~~Hl~~~H~   54 (54)
T PF05605_consen    2 SFTCPYCGK-GFSESSLVEHCEDEHRSE-SKNVVCPICSSRVT--DNLIRHLNSQHR   54 (54)
T ss_pred             CcCCCCCCC-ccCHHHHHHHHHhHCcCC-CCCccCCCchhhhh--hHHHHHHHHhcC
Confidence            478999999 556788999988777553 44689999998765  489999998884


No 22 
>PHA00616 hypothetical protein
Probab=98.07  E-value=1.8e-06  Score=56.76  Aligned_cols=34  Identities=12%  Similarity=0.296  Sum_probs=22.9

Q ss_pred             ceeccccccccCCHHHHHHHHHHcCCCCceeccc
Q psy4572          46 PHKCEHCEKEFENRRALKRHEKNHTQEKTFPCTV   79 (345)
Q Consensus        46 ~~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~   79 (345)
                      ||+|+.||+.|...+.+..|++.||+++++.|+.
T Consensus         1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~   34 (44)
T PHA00616          1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEY   34 (44)
T ss_pred             CCccchhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence            4667777777777777777777776666666654


No 23 
>PHA00732 hypothetical protein
Probab=98.02  E-value=2.6e-06  Score=64.01  Aligned_cols=44  Identities=18%  Similarity=0.515  Sum_probs=28.5

Q ss_pred             CCCCcchhhccCCHHHHHHHHhhhCCCCcCceeecCccccccCCHHHHHHHH
Q psy4572         158 PYQCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCELCGKSFNNKQTFSLSN  209 (345)
Q Consensus       158 ~y~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~Cgk~F~~~~~L~~H~  209 (345)
                      ||.|+.||+.|.+...|+.|++.+|.     ++.|+.||+.|.   .+..|.
T Consensus         1 py~C~~Cgk~F~s~s~Lk~H~r~~H~-----~~~C~~CgKsF~---~l~~H~   44 (79)
T PHA00732          1 MFKCPICGFTTVTLFALKQHARRNHT-----LTKCPVCNKSYR---RLNQHF   44 (79)
T ss_pred             CccCCCCCCccCCHHHHHHHhhcccC-----CCccCCCCCEeC---Chhhhh
Confidence            46677777777777777777764321     246777777776   366666


No 24 
>PHA00616 hypothetical protein
Probab=98.00  E-value=3.2e-06  Score=55.55  Aligned_cols=34  Identities=24%  Similarity=0.366  Sum_probs=20.4

Q ss_pred             CCCCcchhhccCCHHHHHHHHhhhCCCCcCceeecCc
Q psy4572         158 PYQCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCEL  194 (345)
Q Consensus       158 ~y~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~  194 (345)
                      ||+|+.||+.|...+.+.+|++.||   +++++.|+.
T Consensus         1 pYqC~~CG~~F~~~s~l~~H~r~~h---g~~~~~~~~   34 (44)
T PHA00616          1 MYQCLRCGGIFRKKKEVIEHLLSVH---KQNKLTLEY   34 (44)
T ss_pred             CCccchhhHHHhhHHHHHHHHHHhc---CCCccceeE
Confidence            4566666666666666666666666   444555543


No 25 
>PHA00732 hypothetical protein
Probab=97.86  E-value=1e-05  Score=60.77  Aligned_cols=48  Identities=21%  Similarity=0.286  Sum_probs=31.6

Q ss_pred             ccccCCCCCCcCCHHHHHHHHHHhcCCCCCCCcchhhccCCHHHHHHHHhhhC
Q psy4572         130 SAYCEPCQEGFFSRDKLETHNIAKHGAQPYQCETCRKLFLSKYSLFTHNKKHL  182 (345)
Q Consensus       130 ~~~C~~C~k~F~~~~~l~~H~~~hh~~k~y~C~~Cgk~F~s~s~L~~H~~~H~  182 (345)
                      ||.|+.|++.|.+...++.|++.+|.  ++.|+.||+.|.   .+..|.+++-
T Consensus         1 py~C~~Cgk~F~s~s~Lk~H~r~~H~--~~~C~~CgKsF~---~l~~H~~~~~   48 (79)
T PHA00732          1 MFKCPICGFTTVTLFALKQHARRNHT--LTKCPVCNKSYR---RLNQHFYSQY   48 (79)
T ss_pred             CccCCCCCCccCCHHHHHHHhhcccC--CCccCCCCCEeC---ChhhhhcccC
Confidence            45677777777777777777764332  246788887776   4667776553


No 26 
>PF05605 zf-Di19:  Drought induced 19 protein (Di19), zinc-binding;  InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.68  E-value=6.4e-05  Score=52.23  Aligned_cols=50  Identities=20%  Similarity=0.385  Sum_probs=27.9

Q ss_pred             ceeccccccccCCHHHHHHHHHHcCC--CCceeccccccccCCHHHHHHHHHhcC
Q psy4572          46 PHKCEHCEKEFENRRALKRHEKNHTQ--EKTFPCTVCLKMFKTKAQREEHIASHV   98 (345)
Q Consensus        46 ~~~C~~C~k~F~~~~~L~~H~~~h~~--~k~~~C~~C~k~F~~~~~L~~H~~~h~   98 (345)
                      .|.||.|++ ..+...|..|....|.  .+.+.||+|...+.  .+|..|+..+|
T Consensus         2 ~f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H   53 (54)
T PF05605_consen    2 SFTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH   53 (54)
T ss_pred             CcCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence            366677766 3445666666555332  23466666665433  25666665543


No 27 
>PF00096 zf-C2H2:  Zinc finger, C2H2 type;  InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain.  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.64  E-value=4.6e-05  Score=43.06  Aligned_cols=22  Identities=36%  Similarity=0.807  Sum_probs=18.2

Q ss_pred             eecCccccccCCHHHHHHHHHH
Q psy4572         190 YQCELCGKSFNNKQTFSLSNMY  211 (345)
Q Consensus       190 y~C~~Cgk~F~~~~~L~~H~~~  211 (345)
                      |.|+.|++.|.++..|.+|++.
T Consensus         1 y~C~~C~~~f~~~~~l~~H~~~   22 (23)
T PF00096_consen    1 YKCPICGKSFSSKSNLKRHMRR   22 (23)
T ss_dssp             EEETTTTEEESSHHHHHHHHHH
T ss_pred             CCCCCCCCccCCHHHHHHHHhH
Confidence            6788888888888888888876


No 28 
>PF12756 zf-C2H2_2:  C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.63  E-value=4.4e-05  Score=59.33  Aligned_cols=79  Identities=23%  Similarity=0.366  Sum_probs=23.5

Q ss_pred             ccCCCCCCcCCHHHHHHHHHHhcCCCCCCCcchhhccCCHHHHHHHHhhhCCCCcCceeecCccccccCCHHHHHHHHHH
Q psy4572         132 YCEPCQEGFFSRDKLETHNIAKHGAQPYQCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCELCGKSFNNKQTFSLSNMY  211 (345)
Q Consensus       132 ~C~~C~k~F~~~~~l~~H~~~hh~~k~y~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~Cgk~F~~~~~L~~H~~~  211 (345)
                      +|..|+..|.+...+..|+...|+...-    ....+.....+..+.+...    ...+.|..|++.|.+...|..|++.
T Consensus         1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~----~~~~l~~~~~~~~~~~~~~----~~~~~C~~C~~~f~s~~~l~~Hm~~   72 (100)
T PF12756_consen    1 QCLFCDESFSSVDDLLQHMKKKHGFDIP----DQKYLVDPNRLLNYLRKKV----KESFRCPYCNKTFRSREALQEHMRS   72 (100)
T ss_dssp             --------------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHH
T ss_pred             Cccccccccccccccccccccccccccc----ccccccccccccccccccc----CCCCCCCccCCCCcCHHHHHHHHcC
Confidence            3777777777777788887766653211    1122223444444443321    2258999999999999999999988


Q ss_pred             hcCCCCc
Q psy4572         212 VLSLSNM  218 (345)
Q Consensus       212 ~H~~~~~  218 (345)
                      +++....
T Consensus        73 ~~H~~~~   79 (100)
T PF12756_consen   73 KHHKKRN   79 (100)
T ss_dssp             TTTTC-S
T ss_pred             ccCCCcc
Confidence            7655443


No 29 
>PF13894 zf-C2H2_4:  C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.57  E-value=6.8e-05  Score=42.41  Aligned_cols=24  Identities=33%  Similarity=0.652  Sum_probs=17.9

Q ss_pred             eecCccccccCCHHHHHHHHHHhc
Q psy4572         190 YQCELCGKSFNNKQTFSLSNMYVL  213 (345)
Q Consensus       190 y~C~~Cgk~F~~~~~L~~H~~~~H  213 (345)
                      |.|++|++.|.++..|..|++.+|
T Consensus         1 ~~C~~C~~~~~~~~~l~~H~~~~H   24 (24)
T PF13894_consen    1 FQCPICGKSFRSKSELRQHMRTHH   24 (24)
T ss_dssp             EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred             CCCcCCCCcCCcHHHHHHHHHhhC
Confidence            678888888888888888888776


No 30 
>KOG2231|consensus
Probab=97.45  E-value=0.00028  Score=72.07  Aligned_cols=121  Identities=22%  Similarity=0.402  Sum_probs=69.7

Q ss_pred             eccccccccCCHHHHHHHHHhcCCCCCccCCCCc---------ccccCHHHHHHHHHHhcC-CC----ccccCCCCCCcC
Q psy4572          76 PCTVCLKMFKTKAQREEHIASHVTDSEFKCDQCP---------YVAKRIKYLRYHMKKHRK-EY----SAYCEPCQEGFF  141 (345)
Q Consensus        76 ~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~---------~~f~~~~~L~~H~~~h~~-~~----~~~C~~C~k~F~  141 (345)
                      .|..| ..|.....|+.|+...|+  .+.|..|-         ...-+...|..|++.-.. +.    .-.|+.|...|.
T Consensus       117 ~~~~c-~~~~s~~~Lk~H~~~~H~--~~~c~lC~~~~kif~~e~k~Yt~~el~~h~~~gd~d~~s~rGhp~C~~C~~~fl  193 (669)
T KOG2231|consen  117 ECLHC-TEFKSVENLKNHMRDQHK--LHLCSLCLQNLKIFINERKLYTRAELNLHLMFGDPDDESCRGHPLCKFCHERFL  193 (669)
T ss_pred             CCccc-cchhHHHHHHHHHHHhhh--hhccccccccceeeeeeeehehHHHHHHHHhcCCCccccccCCccchhhhhhhc
Confidence            34445 445577778888754332  23444332         222345666777654221 11    135788888888


Q ss_pred             CHHHHHHHHHHhcCCCCCCCcch------hhccCCHHHHHHHHhhhCCCCcCceeecC--ccc-cccCCHHHHHHHHH
Q psy4572         142 SRDKLETHNIAKHGAQPYQCETC------RKLFLSKYSLFTHNKKHLPIEERLHYQCE--LCG-KSFNNKQTFSLSNM  210 (345)
Q Consensus       142 ~~~~l~~H~~~hh~~k~y~C~~C------gk~F~s~s~L~~H~~~H~~~~~~k~y~C~--~Cg-k~F~~~~~L~~H~~  210 (345)
                      ....+.+|++.+|    |.|..|      +.-|.....|..|.+.+|       |.|.  .|. +.|.....+..|++
T Consensus       194 d~~el~rH~~~~h----~~chfC~~~~~~neyy~~~~dLe~HfR~~H-------flCE~~~C~~~~f~~~~~~ei~lk  260 (669)
T KOG2231|consen  194 DDDELYRHLRFDH----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH-------FLCEEEFCRTKKFYVAFELEIELK  260 (669)
T ss_pred             cHHHHHHhhccce----eheeecCcccccchhcccchHHHHHhhhcC-------ccccccccccceeeehhHHHHHHH
Confidence            8888888877766    344444      456677778888888777       6666  453 34444444455544


No 31 
>PF13912 zf-C2H2_6:  C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.41  E-value=7.9e-05  Score=43.87  Aligned_cols=26  Identities=23%  Similarity=0.540  Sum_probs=20.0

Q ss_pred             eeecCccccccCCHHHHHHHHHHhcC
Q psy4572         189 HYQCELCGKSFNNKQTFSLSNMYVLS  214 (345)
Q Consensus       189 ~y~C~~Cgk~F~~~~~L~~H~~~~H~  214 (345)
                      ||+|+.|++.|.+...|..|++.|++
T Consensus         1 ~~~C~~C~~~F~~~~~l~~H~~~h~~   26 (27)
T PF13912_consen    1 PFECDECGKTFSSLSALREHKRSHCS   26 (27)
T ss_dssp             SEEETTTTEEESSHHHHHHHHCTTTT
T ss_pred             CCCCCccCCccCChhHHHHHhHHhcC
Confidence            47788888888888888888777664


No 32 
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.37  E-value=4.5e-05  Score=70.30  Aligned_cols=58  Identities=22%  Similarity=0.480  Sum_probs=47.7

Q ss_pred             CCCCCCcc--hhhccCCHHHHHHHHhhhCCC----------------CcCceeecCccccccCCHHHHHHHHHHhc
Q psy4572         156 AQPYQCET--CRKLFLSKYSLFTHNKKHLPI----------------EERLHYQCELCGKSFNNKQTFSLSNMYVL  213 (345)
Q Consensus       156 ~k~y~C~~--Cgk~F~s~s~L~~H~~~H~~~----------------~~~k~y~C~~Cgk~F~~~~~L~~H~~~~H  213 (345)
                      ++||+|++  |.|+++....|+.|+.--|..                .+.|||+|++|+|++.....|+.|++.-|
T Consensus       347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr~Hsh  422 (423)
T COG5189         347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHRKHSH  422 (423)
T ss_pred             CceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceecccccC
Confidence            58999987  999999999999998743310                13589999999999999999999976544


No 33 
>PF00096 zf-C2H2:  Zinc finger, C2H2 type;  InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain.  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.33  E-value=0.00017  Score=40.70  Aligned_cols=22  Identities=41%  Similarity=0.804  Sum_probs=12.1

Q ss_pred             eeccccccccCCHHHHHHHHHH
Q psy4572          47 HKCEHCEKEFENRRALKRHEKN   68 (345)
Q Consensus        47 ~~C~~C~k~F~~~~~L~~H~~~   68 (345)
                      |+|+.|++.|.++..|..|++.
T Consensus         1 y~C~~C~~~f~~~~~l~~H~~~   22 (23)
T PF00096_consen    1 YKCPICGKSFSSKSNLKRHMRR   22 (23)
T ss_dssp             EEETTTTEEESSHHHHHHHHHH
T ss_pred             CCCCCCCCccCCHHHHHHHHhH
Confidence            4555555555555555555554


No 34 
>PF12756 zf-C2H2_2:  C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.17  E-value=0.00023  Score=55.23  Aligned_cols=72  Identities=18%  Similarity=0.344  Sum_probs=14.0

Q ss_pred             eccccccccCCHHHHHHHHHHcCCCCceeccccccccCCHHHHHHHHHhcCCCCCccCCCCcccccCHHHHHHHHHH
Q psy4572          48 KCEHCEKEFENRRALKRHEKNHTQEKTFPCTVCLKMFKTKAQREEHIASHVTDSEFKCDQCPYVAKRIKYLRYHMKK  124 (345)
Q Consensus        48 ~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~~~f~~~~~L~~H~~~  124 (345)
                      +|..|+..|.+...+..|+...|+-.   .+ ....+.....+..+.+.. -...+.|..|++.|.+...|..|++.
T Consensus         1 ~C~~C~~~f~~~~~l~~H~~~~H~~~---~~-~~~~l~~~~~~~~~~~~~-~~~~~~C~~C~~~f~s~~~l~~Hm~~   72 (100)
T PF12756_consen    1 QCLFCDESFSSVDDLLQHMKKKHGFD---IP-DQKYLVDPNRLLNYLRKK-VKESFRCPYCNKTFRSREALQEHMRS   72 (100)
T ss_dssp             -----------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHH
T ss_pred             Cccccccccccccccccccccccccc---cc-cccccccccccccccccc-cCCCCCCCccCCCCcCHHHHHHHHcC
Confidence            46677777777777777766554421   11 111111222233332221 11246666666666666666666654


No 35 
>PF13894 zf-C2H2_4:  C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.08  E-value=0.00047  Score=38.82  Aligned_cols=23  Identities=30%  Similarity=0.819  Sum_probs=11.6

Q ss_pred             eeccccccccCCHHHHHHHHHHc
Q psy4572          47 HKCEHCEKEFENRRALKRHEKNH   69 (345)
Q Consensus        47 ~~C~~C~k~F~~~~~L~~H~~~h   69 (345)
                      |+|+.|++.|.+...|+.|+++|
T Consensus         1 ~~C~~C~~~~~~~~~l~~H~~~~   23 (24)
T PF13894_consen    1 FQCPICGKSFRSKSELRQHMRTH   23 (24)
T ss_dssp             EE-SSTS-EESSHHHHHHHHHHH
T ss_pred             CCCcCCCCcCCcHHHHHHHHHhh
Confidence            45555555555555555555544


No 36 
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=96.82  E-value=0.0006  Score=63.04  Aligned_cols=68  Identities=28%  Similarity=0.576  Sum_probs=42.0

Q ss_pred             CCceeccc--cccccCCHHHHHHHHHHcC-CCCceecc--ccccccCCHHHHHHHHHhcCCCCCccCCCCcccccCHHHH
Q psy4572          44 DGPHKCEH--CEKEFENRRALKRHEKNHT-QEKTFPCT--VCLKMFKTKAQREEHIASHVTDSEFKCDQCPYVAKRIKYL  118 (345)
Q Consensus        44 ~~~~~C~~--C~k~F~~~~~L~~H~~~h~-~~k~~~C~--~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~~~f~~~~~L  118 (345)
                      ++||+|++  |+|.|++...|+-|+..-| .++...-+  +=-..|            -..+|||+|+.|++++.....|
T Consensus       347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F------------~~~~KPYrCevC~KRYKNlNGL  414 (423)
T COG5189         347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIF------------SAKDKPYRCEVCDKRYKNLNGL  414 (423)
T ss_pred             CceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccc------------cccCCceeccccchhhccCccc
Confidence            47788865  8888888888887765422 11111111  111111            2356888888888888888888


Q ss_pred             HHHHH
Q psy4572         119 RYHMK  123 (345)
Q Consensus       119 ~~H~~  123 (345)
                      +.|+.
T Consensus       415 KYHr~  419 (423)
T COG5189         415 KYHRK  419 (423)
T ss_pred             eeccc
Confidence            88764


No 37 
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=96.82  E-value=0.0016  Score=60.83  Aligned_cols=131  Identities=21%  Similarity=0.355  Sum_probs=94.6

Q ss_pred             eeccc--cccccCCHHHHHHHHHhcCCCCCccCCCCc---cccc------CHHHHHHHHHHhcCCCcc----ccCCCCCC
Q psy4572          75 FPCTV--CLKMFKTKAQREEHIASHVTDSEFKCDQCP---YVAK------RIKYLRYHMKKHRKEYSA----YCEPCQEG  139 (345)
Q Consensus        75 ~~C~~--C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~---~~f~------~~~~L~~H~~~h~~~~~~----~C~~C~k~  139 (345)
                      |.||.  |..+......++.|.+..|+.  +.|..|-   +.|.      +...|..|...-..+..|    .|..|...
T Consensus       152 F~CP~skc~~~C~~~k~lk~H~K~~H~~--~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~~  229 (493)
T COG5236         152 FKCPKSKCHRRCGSLKELKKHYKAQHGF--VLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCKIY  229 (493)
T ss_pred             hcCCchhhhhhhhhHHHHHHHHHhhcCc--EEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhhccce
Confidence            77887  788888888999999875543  4565553   2332      445567776543323222    59999999


Q ss_pred             cCCHHHHHHHHHHhcCCCCCCCcchh-------hccCCHHHHHHHHhhhCCCCcCceeecCc--cc----cccCCHHHHH
Q psy4572         140 FFSRDKLETHNIAKHGAQPYQCETCR-------KLFLSKYSLFTHNKKHLPIEERLHYQCEL--CG----KSFNNKQTFS  206 (345)
Q Consensus       140 F~~~~~l~~H~~~hh~~k~y~C~~Cg-------k~F~s~s~L~~H~~~H~~~~~~k~y~C~~--Cg----k~F~~~~~L~  206 (345)
                      |.+-.+|.+|.+..| ++   |-+|.       .-|++...|..|.+.-|       |.|..  |-    ..|.....|.
T Consensus       230 FYdDDEL~~HcR~~H-E~---ChICD~v~p~~~QYFK~Y~~Le~HF~~~h-------y~ct~qtc~~~k~~vf~~~~el~  298 (493)
T COG5236         230 FYDDDELRRHCRLRH-EA---CHICDMVGPIRYQYFKSYEDLEAHFRNAH-------YCCTFQTCRVGKCYVFPYHTELL  298 (493)
T ss_pred             ecChHHHHHHHHhhh-hh---hhhhhccCccchhhhhCHHHHHHHhhcCc-------eEEEEEEEecCcEEEeccHHHHH
Confidence            999999999999887 43   55554       34788889999988765       77754  42    5799999999


Q ss_pred             HHHHHhcCCCCc
Q psy4572         207 LSNMYVLSLSNM  218 (345)
Q Consensus       207 ~H~~~~H~~~~~  218 (345)
                      .|+...|+.-.+
T Consensus       299 ~h~~~~h~~~~~  310 (493)
T COG5236         299 EHLTRFHKVNAR  310 (493)
T ss_pred             HHHHHHhhcccc
Confidence            999999986444


No 38 
>PF13912 zf-C2H2_6:  C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.81  E-value=0.00075  Score=39.59  Aligned_cols=23  Identities=35%  Similarity=0.891  Sum_probs=12.8

Q ss_pred             eeccccccccCCHHHHHHHHHHc
Q psy4572          47 HKCEHCEKEFENRRALKRHEKNH   69 (345)
Q Consensus        47 ~~C~~C~k~F~~~~~L~~H~~~h   69 (345)
                      |+|..|++.|.+...|..|++.|
T Consensus         2 ~~C~~C~~~F~~~~~l~~H~~~h   24 (27)
T PF13912_consen    2 FECDECGKTFSSLSALREHKRSH   24 (27)
T ss_dssp             EEETTTTEEESSHHHHHHHHCTT
T ss_pred             CCCCccCCccCChhHHHHHhHHh
Confidence            55555555555555555555544


No 39 
>KOG2231|consensus
Probab=96.79  E-value=0.0027  Score=65.05  Aligned_cols=133  Identities=20%  Similarity=0.421  Sum_probs=92.3

Q ss_pred             eeccccccccCCHHHHHHHHHhcCCCCCccCCCCcccccCHHHHHHHHHHhcCCCccccCCCC---------CCcCCHHH
Q psy4572          75 FPCTVCLKMFKTKAQREEHIASHVTDSEFKCDQCPYVAKRIKYLRYHMKKHRKEYSAYCEPCQ---------EGFFSRDK  145 (345)
Q Consensus        75 ~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~---------k~F~~~~~  145 (345)
                      +.|.+|++.|.-..            ..-.|..| ..|.....|+.|+...|+  .+.|..|-         ....++.+
T Consensus       100 ~~C~~C~~~~~~~~------------~~~~~~~c-~~~~s~~~Lk~H~~~~H~--~~~c~lC~~~~kif~~e~k~Yt~~e  164 (669)
T KOG2231|consen  100 HSCHICDRRFRALY------------NKKECLHC-TEFKSVENLKNHMRDQHK--LHLCSLCLQNLKIFINERKLYTRAE  164 (669)
T ss_pred             hhcCccccchhhhc------------ccCCCccc-cchhHHHHHHHHHHHhhh--hhccccccccceeeeeeeehehHHH
Confidence            56777776653221            12368888 788899999999854332  23444442         22335677


Q ss_pred             HHHHHHHhcC-C----CCCCCcchhhccCCHHHHHHHHhhhCCCCcCceeecCcc------ccccCCHHHHHHHHHHhcC
Q psy4572         146 LETHNIAKHG-A----QPYQCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCELC------GKSFNNKQTFSLSNMYVLS  214 (345)
Q Consensus       146 l~~H~~~hh~-~----k~y~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~C------gk~F~~~~~L~~H~~~~H~  214 (345)
                      +..|+..-.. +    ..-.|..|...|-....|.+|++.+|       |.|..|      +..|..-+.|..|.+..|-
T Consensus       165 l~~h~~~gd~d~~s~rGhp~C~~C~~~fld~~el~rH~~~~h-------~~chfC~~~~~~neyy~~~~dLe~HfR~~Hf  237 (669)
T KOG2231|consen  165 LNLHLMFGDPDDESCRGHPLCKFCHERFLDDDELYRHLRFDH-------EFCHFCDYKTGQNEYYNDYDDLEEHFRKGHF  237 (669)
T ss_pred             HHHHHhcCCCccccccCCccchhhhhhhccHHHHHHhhccce-------eheeecCcccccchhcccchHHHHHhhhcCc
Confidence            8888764322 1    12569999999999999999999887       777777      4678899999999999999


Q ss_pred             CCCcccccccceeee
Q psy4572         215 LSNMYVLSSSNMYVL  229 (345)
Q Consensus       215 ~~~~~~~s~~~~~~~  229 (345)
                      .+..-.....-.+..
T Consensus       238 lCE~~~C~~~~f~~~  252 (669)
T KOG2231|consen  238 LCEEEFCRTKKFYVA  252 (669)
T ss_pred             cccccccccceeeeh
Confidence            988665544444433


No 40 
>PF09237 GAGA:  GAGA factor;  InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.55  E-value=0.0021  Score=43.29  Aligned_cols=40  Identities=10%  Similarity=0.139  Sum_probs=24.7

Q ss_pred             HHHHhhhCCCCcCceeecCccccccCCHHHHHHHHHHhcCCC
Q psy4572         175 FTHNKKHLPIEERLHYQCELCGKSFNNKQTFSLSNMYVLSLS  216 (345)
Q Consensus       175 ~~H~~~H~~~~~~k~y~C~~Cgk~F~~~~~L~~H~~~~H~~~  216 (345)
                      ..+.+.+.  ..+.|-.|++|+..+++..+|++|+...|..+
T Consensus        12 ~~~~k~~~--~S~~PatCP~C~a~~~~srnLrRHle~~H~~k   51 (54)
T PF09237_consen   12 TKKPKSKS--QSEQPATCPICGAVIRQSRNLRRHLEIRHFKK   51 (54)
T ss_dssp             ----CCCC--TTS--EE-TTT--EESSHHHHHHHHHHHTTTS
T ss_pred             hhHHHHhh--ccCCCCCCCcchhhccchhhHHHHHHHHhccc
Confidence            44555554  35668899999999999999999998888764


No 41 
>PF13909 zf-H2C2_5:  C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.40  E-value=0.003  Score=35.94  Aligned_cols=24  Identities=21%  Similarity=0.321  Sum_probs=18.1

Q ss_pred             eecCccccccCCHHHHHHHHHHhcC
Q psy4572         190 YQCELCGKSFNNKQTFSLSNMYVLS  214 (345)
Q Consensus       190 y~C~~Cgk~F~~~~~L~~H~~~~H~  214 (345)
                      |+|+.|+.... +..|.+|++.+|+
T Consensus         1 y~C~~C~y~t~-~~~l~~H~~~~H~   24 (24)
T PF13909_consen    1 YKCPHCSYSTS-KSNLKRHLKRHHP   24 (24)
T ss_dssp             EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred             CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence            67888888887 8888888888874


No 42 
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.17  E-value=0.0038  Score=35.47  Aligned_cols=23  Identities=30%  Similarity=0.606  Sum_probs=17.5

Q ss_pred             eecCccccccCCHHHHHHHHHHh
Q psy4572         190 YQCELCGKSFNNKQTFSLSNMYV  212 (345)
Q Consensus       190 y~C~~Cgk~F~~~~~L~~H~~~~  212 (345)
                      |.|+.|++.|.....|..|++.|
T Consensus         1 ~~C~~C~~~f~~~~~l~~H~~~H   23 (26)
T smart00355        1 YRCPECGKVFKSKSALKEHMRTH   23 (26)
T ss_pred             CCCCCCcchhCCHHHHHHHHHHh
Confidence            46888888888888888887743


No 43 
>PF09237 GAGA:  GAGA factor;  InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.00  E-value=0.011  Score=39.90  Aligned_cols=31  Identities=26%  Similarity=0.509  Sum_probs=16.9

Q ss_pred             CCceeccccccccCCHHHHHHHHHHcCCCCc
Q psy4572          44 DGPHKCEHCEKEFENRRALKRHEKNHTQEKT   74 (345)
Q Consensus        44 ~~~~~C~~C~k~F~~~~~L~~H~~~h~~~k~   74 (345)
                      +.|..||+|+..+....+|++|+..+|+.+|
T Consensus        22 ~~PatCP~C~a~~~~srnLrRHle~~H~~k~   52 (54)
T PF09237_consen   22 EQPATCPICGAVIRQSRNLRRHLEIRHFKKP   52 (54)
T ss_dssp             S--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred             CCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence            4556777777777777777777766665543


No 44 
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=95.97  E-value=0.0021  Score=62.50  Aligned_cols=138  Identities=25%  Similarity=0.392  Sum_probs=92.3

Q ss_pred             CceeccccccccCCHHHHHHHHH--HcCCC--Cceecc--ccccccCCHHHHHHHHHhcCCCCCccCCCC--cccccCHH
Q psy4572          45 GPHKCEHCEKEFENRRALKRHEK--NHTQE--KTFPCT--VCLKMFKTKAQREEHIASHVTDSEFKCDQC--PYVAKRIK  116 (345)
Q Consensus        45 ~~~~C~~C~k~F~~~~~L~~H~~--~h~~~--k~~~C~--~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C--~~~f~~~~  116 (345)
                      .++.|..|...|.....+..|.+  .|.++  +++.|+  .|++.|.+...+..|...|.+..++.|...  ...+....
T Consensus       288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  367 (467)
T COG5048         288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLL  367 (467)
T ss_pred             cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCcccccccc
Confidence            36788889999999999999988  78888  889998  699999998888888888888777776543  33332221


Q ss_pred             HH-----HHHHHHhcCCCcccc--CCCCCCcCCHHHHHHHHHHhcCCC--CCCCcchhhccCCHHHHHHHHhhhC
Q psy4572         117 YL-----RYHMKKHRKEYSAYC--EPCQEGFFSRDKLETHNIAKHGAQ--PYQCETCRKLFLSKYSLFTHNKKHL  182 (345)
Q Consensus       117 ~L-----~~H~~~h~~~~~~~C--~~C~k~F~~~~~l~~H~~~hh~~k--~y~C~~Cgk~F~s~s~L~~H~~~H~  182 (345)
                      .-     ..............+  ..|...+.....+..|...+-...  .+.+..|.+.|.....+..|.+.|.
T Consensus       368 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  442 (467)
T COG5048         368 NNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHT  442 (467)
T ss_pred             CCCCccchhhccCccCCccccccccchhhhhccccccccccccccccCCcCCCCCcchhhccCcccccccccccc
Confidence            11     111111112222333  335556666666666666555444  4566778888888888888888775


No 45 
>smart00355 ZnF_C2H2 zinc finger.
Probab=95.95  E-value=0.0058  Score=34.65  Aligned_cols=24  Identities=42%  Similarity=0.805  Sum_probs=19.9

Q ss_pred             CCCcchhhccCCHHHHHHHHhhhC
Q psy4572         159 YQCETCRKLFLSKYSLFTHNKKHL  182 (345)
Q Consensus       159 y~C~~Cgk~F~s~s~L~~H~~~H~  182 (345)
                      |+|+.|++.|.....+..|++.|.
T Consensus         1 ~~C~~C~~~f~~~~~l~~H~~~H~   24 (26)
T smart00355        1 YRCPECGKVFKSKSALKEHMRTHX   24 (26)
T ss_pred             CCCCCCcchhCCHHHHHHHHHHhc
Confidence            578889999998888988888663


No 46 
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=95.82  E-value=0.011  Score=55.38  Aligned_cols=129  Identities=25%  Similarity=0.451  Sum_probs=91.7

Q ss_pred             ceeccc--cccccCCHHHHHHHHHHcCCCCceeccccc---cccC------CHHHHHHHHHhcCCCCCc----cCCCCcc
Q psy4572          46 PHKCEH--CEKEFENRRALKRHEKNHTQEKTFPCTVCL---KMFK------TKAQREEHIASHVTDSEF----KCDQCPY  110 (345)
Q Consensus        46 ~~~C~~--C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~---k~F~------~~~~L~~H~~~h~~~k~~----~C~~C~~  110 (345)
                      .|.||.  |+.+......|..|.+..|+.  +.|.+|-   +.|.      +...|..|...-..+..|    .|..|..
T Consensus       151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~~--~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~  228 (493)
T COG5236         151 SFKCPKSKCHRRCGSLKELKKHYKAQHGF--VLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCKI  228 (493)
T ss_pred             HhcCCchhhhhhhhhHHHHHHHHHhhcCc--EEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhhccc
Confidence            488975  888777788999999987764  7788873   3443      344566666432222223    5999999


Q ss_pred             cccCHHHHHHHHHHhcCCCccccCCCCC-------CcCCHHHHHHHHHHhcCCCCCCCcc--h--h--hccCCHHHHHHH
Q psy4572         111 VAKRIKYLRYHMKKHRKEYSAYCEPCQE-------GFFSRDKLETHNIAKHGAQPYQCET--C--R--KLFLSKYSLFTH  177 (345)
Q Consensus       111 ~f~~~~~L~~H~~~h~~~~~~~C~~C~k-------~F~~~~~l~~H~~~hh~~k~y~C~~--C--g--k~F~s~s~L~~H  177 (345)
                      .|-+.+.|..|.+..|.    .|.+|++       -|.+...|..|.+..|    |.|..  |  |  ..|.....|+.|
T Consensus       229 ~FYdDDEL~~HcR~~HE----~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h----y~ct~qtc~~~k~~vf~~~~el~~h  300 (493)
T COG5236         229 YFYDDDELRRHCRLRHE----ACHICDMVGPIRYQYFKSYEDLEAHFRNAH----YCCTFQTCRVGKCYVFPYHTELLEH  300 (493)
T ss_pred             eecChHHHHHHHHhhhh----hhhhhhccCccchhhhhCHHHHHHHhhcCc----eEEEEEEEecCcEEEeccHHHHHHH
Confidence            99999999999987553    4666654       3778888999987655    55543  4  2  468889999999


Q ss_pred             HhhhCCC
Q psy4572         178 NKKHLPI  184 (345)
Q Consensus       178 ~~~H~~~  184 (345)
                      +...|+.
T Consensus       301 ~~~~h~~  307 (493)
T COG5236         301 LTRFHKV  307 (493)
T ss_pred             HHHHhhc
Confidence            8876643


No 47 
>PF12874 zf-met:  Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.75  E-value=0.0066  Score=34.76  Aligned_cols=23  Identities=22%  Similarity=0.603  Sum_probs=17.8

Q ss_pred             eecCccccccCCHHHHHHHHHHh
Q psy4572         190 YQCELCGKSFNNKQTFSLSNMYV  212 (345)
Q Consensus       190 y~C~~Cgk~F~~~~~L~~H~~~~  212 (345)
                      |.|++|++.|.+...+..|++.+
T Consensus         1 ~~C~~C~~~f~s~~~~~~H~~s~   23 (25)
T PF12874_consen    1 FYCDICNKSFSSENSLRQHLRSK   23 (25)
T ss_dssp             EEETTTTEEESSHHHHHHHHTTH
T ss_pred             CCCCCCCCCcCCHHHHHHHHCcC
Confidence            57888888888888888887643


No 48 
>KOG1146|consensus
Probab=95.39  E-value=0.0074  Score=65.51  Aligned_cols=75  Identities=23%  Similarity=0.398  Sum_probs=37.9

Q ss_pred             ccccccccCCHHHHHHHHHH-cCCCCceeccccccccCCHHHHHHHHHhcC-------------------------CCCC
Q psy4572          49 CEHCEKEFENRRALKRHEKN-HTQEKTFPCTVCLKMFKTKAQREEHIASHV-------------------------TDSE  102 (345)
Q Consensus        49 C~~C~k~F~~~~~L~~H~~~-h~~~k~~~C~~C~k~F~~~~~L~~H~~~h~-------------------------~~k~  102 (345)
                      |..|+..+.....+..|+.. |...+.++|+.|+..|+....|..|++..|                         +.++
T Consensus       439 ~~~~e~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~p  518 (1406)
T KOG1146|consen  439 LTKAEPLLESKRSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGKP  518 (1406)
T ss_pred             ccchhhhhhhhcccccceeeeecccccccCCccchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCCc
Confidence            33344444444444444332 233345555555555555555555555411                         1244


Q ss_pred             ccCCCCcccccCHHHHHHHHH
Q psy4572         103 FKCDQCPYVAKRIKYLRYHMK  123 (345)
Q Consensus       103 ~~C~~C~~~f~~~~~L~~H~~  123 (345)
                      |.|..|...+.....|..|++
T Consensus       519 ~~C~~C~~stttng~Lsihlq  539 (1406)
T KOG1146|consen  519 YPCRACNYSTTTNGNLSIHLQ  539 (1406)
T ss_pred             ccceeeeeeeecchHHHHHHH
Confidence            666666666666666666654


No 49 
>PRK04860 hypothetical protein; Provisional
Probab=95.25  E-value=0.013  Score=50.15  Aligned_cols=37  Identities=22%  Similarity=0.651  Sum_probs=24.5

Q ss_pred             ceeccccccccCCHHHHHHHHHHcCCCCceeccccccccCC
Q psy4572          46 PHKCEHCEKEFENRRALKRHEKNHTQEKTFPCTVCLKMFKT   86 (345)
Q Consensus        46 ~~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~k~F~~   86 (345)
                      +|.|. |++   ....+++|.++|.++++|.|..|+..|..
T Consensus       119 ~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~  155 (160)
T PRK04860        119 PYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVF  155 (160)
T ss_pred             EEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEE
Confidence            56776 766   45556677777777777777777766643


No 50 
>KOG2785|consensus
Probab=94.89  E-value=0.063  Score=51.30  Aligned_cols=56  Identities=18%  Similarity=0.237  Sum_probs=40.9

Q ss_pred             CCCCCcchhhccCCHHHHHHHHhhhCCCC--------------------cCceeecCccc---cccCCHHHHHHHHHHh
Q psy4572         157 QPYQCETCRKLFLSKYSLFTHNKKHLPIE--------------------ERLHYQCELCG---KSFNNKQTFSLSNMYV  212 (345)
Q Consensus       157 k~y~C~~Cgk~F~s~s~L~~H~~~H~~~~--------------------~~k~y~C~~Cg---k~F~~~~~L~~H~~~~  212 (345)
                      -|-.|-.|++.+.+...-..||..+|+.-                    ...-|.|-.|+   +.|.+....++||...
T Consensus       165 ~Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~~K  243 (390)
T KOG2785|consen  165 IPTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMRDK  243 (390)
T ss_pred             CCcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHhhc
Confidence            34667778888888777778877776421                    11237888888   9999999999998654


No 51 
>KOG1146|consensus
Probab=94.89  E-value=0.0053  Score=66.58  Aligned_cols=138  Identities=20%  Similarity=0.349  Sum_probs=84.1

Q ss_pred             ccccccccCCHHHHHHHHHh-cCCCCCccCCCCcccccCHHHHHHHHHHhcCCCccccCCCCCCcCCHHHHHHHHHHhcC
Q psy4572          77 CTVCLKMFKTKAQREEHIAS-HVTDSEFKCDQCPYVAKRIKYLRYHMKKHRKEYSAYCEPCQEGFFSRDKLETHNIAKHG  155 (345)
Q Consensus        77 C~~C~k~F~~~~~L~~H~~~-h~~~k~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~k~F~~~~~l~~H~~~hh~  155 (345)
                      |..|+..+.++..+..|+.. +...+.++|+.|+..+.....|..|+|.-+.+..-  ..|... .....+.+-..-..+
T Consensus       439 ~~~~e~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~--~~c~~g-q~~~~~arg~~~~~~  515 (1406)
T KOG1146|consen  439 LTKAEPLLESKRSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQS--AYCKAG-QNHPRLARGEVYRCP  515 (1406)
T ss_pred             ccchhhhhhhhcccccceeeeecccccccCCccchhhhhHHHhhhcccccccccch--hHhHhc-cccccccccccccCC
Confidence            33344444444444444432 44557789999999999999999998863322111  111111 110011110111123


Q ss_pred             CCCCCCcchhhccCCHHHHHHHHhhhC--C--------------------------------------CCcCceeecCcc
Q psy4572         156 AQPYQCETCRKLFLSKYSLFTHNKKHL--P--------------------------------------IEERLHYQCELC  195 (345)
Q Consensus       156 ~k~y~C~~Cgk~F~s~s~L~~H~~~H~--~--------------------------------------~~~~k~y~C~~C  195 (345)
                      .++|.|..|...++.+.+|..|++.--  .                                      ...+-++.|.+|
T Consensus       516 ~~p~~C~~C~~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~~~C~vc  595 (1406)
T KOG1146|consen  516 GKPYPCRACNYSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPSWRCEVC  595 (1406)
T ss_pred             CCcccceeeeeeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCCcchhhh
Confidence            567888888888888888888875320  0                                      011235899999


Q ss_pred             ccccCCHHHHHHHHHHhcCCCC
Q psy4572         196 GKSFNNKQTFSLSNMYVLSLSN  217 (345)
Q Consensus       196 gk~F~~~~~L~~H~~~~H~~~~  217 (345)
                      ++.-+-..+|..|+...++...
T Consensus       596 ~yetniarnlrihmtss~~s~~  617 (1406)
T KOG1146|consen  596 SYETNIARNLRIHMTASPSSSP  617 (1406)
T ss_pred             cchhhhhhccccccccCCCCCC
Confidence            9999999999999998887766


No 52 
>PF13909 zf-H2C2_5:  C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=94.85  E-value=0.021  Score=32.38  Aligned_cols=22  Identities=45%  Similarity=0.797  Sum_probs=10.0

Q ss_pred             eeccccccccCCHHHHHHHHHHc
Q psy4572          47 HKCEHCEKEFENRRALKRHEKNH   69 (345)
Q Consensus        47 ~~C~~C~k~F~~~~~L~~H~~~h   69 (345)
                      |+|+.|+.... ...|.+|++.+
T Consensus         1 y~C~~C~y~t~-~~~l~~H~~~~   22 (24)
T PF13909_consen    1 YKCPHCSYSTS-KSNLKRHLKRH   22 (24)
T ss_dssp             EE-SSSS-EES-HHHHHHHHHHH
T ss_pred             CCCCCCCCcCC-HHHHHHHHHhh
Confidence            34555555444 44555555543


No 53 
>PF12874 zf-met:  Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=94.72  E-value=0.019  Score=32.74  Aligned_cols=20  Identities=30%  Similarity=0.828  Sum_probs=10.1

Q ss_pred             eccccccccCCHHHHHHHHH
Q psy4572          48 KCEHCEKEFENRRALKRHEK   67 (345)
Q Consensus        48 ~C~~C~k~F~~~~~L~~H~~   67 (345)
                      .|..|++.|.+...++.|++
T Consensus         2 ~C~~C~~~f~s~~~~~~H~~   21 (25)
T PF12874_consen    2 YCDICNKSFSSENSLRQHLR   21 (25)
T ss_dssp             EETTTTEEESSHHHHHHHHT
T ss_pred             CCCCCCCCcCCHHHHHHHHC
Confidence            45555555555555555543


No 54 
>PRK04860 hypothetical protein; Provisional
Probab=94.70  E-value=0.015  Score=49.62  Aligned_cols=37  Identities=27%  Similarity=0.761  Sum_probs=22.6

Q ss_pred             CCCCcchhhccCCHHHHHHHHhhhCCCCcCceeecCccccccCC
Q psy4572         158 PYQCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCELCGKSFNN  201 (345)
Q Consensus       158 ~y~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~Cgk~F~~  201 (345)
                      +|.|. |++   ....+.+|.++|.   ++++|.|..|+..|..
T Consensus       119 ~Y~C~-C~~---~~~~~rrH~ri~~---g~~~YrC~~C~~~l~~  155 (160)
T PRK04860        119 PYRCK-CQE---HQLTVRRHNRVVR---GEAVYRCRRCGETLVF  155 (160)
T ss_pred             EEEcC-CCC---eeCHHHHHHHHhc---CCccEECCCCCceeEE
Confidence            46665 665   4555666666665   5556666666666543


No 55 
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=94.65  E-value=0.003  Score=61.30  Aligned_cols=144  Identities=17%  Similarity=0.200  Sum_probs=104.4

Q ss_pred             CceeccccccccCCHHHHHHHHH--hcCCC--CCccCC--CCcccccCHHHHHHHHHHhcCCCccccCC--CCCCcCCHH
Q psy4572          73 KTFPCTVCLKMFKTKAQREEHIA--SHVTD--SEFKCD--QCPYVAKRIKYLRYHMKKHRKEYSAYCEP--CQEGFFSRD  144 (345)
Q Consensus        73 k~~~C~~C~k~F~~~~~L~~H~~--~h~~~--k~~~C~--~C~~~f~~~~~L~~H~~~h~~~~~~~C~~--C~k~F~~~~  144 (345)
                      .++.|..|...|.....+..|.+  .|.++  +++.|+  .|++.|.+...+..|...|.+..+..|..  |...+....
T Consensus       288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  367 (467)
T COG5048         288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLL  367 (467)
T ss_pred             cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCcccccccc
Confidence            36889999999999999999999  79898  899999  79999999999999999998877776643  333333322


Q ss_pred             H-----HHHHHHHhcCCCCCCCc--chhhccCCHHHHHHHHhhhCCCCcCceeecCccccccCCHHHHHHHHHHhcCCCC
Q psy4572         145 K-----LETHNIAKHGAQPYQCE--TCRKLFLSKYSLFTHNKKHLPIEERLHYQCELCGKSFNNKQTFSLSNMYVLSLSN  217 (345)
Q Consensus       145 ~-----l~~H~~~hh~~k~y~C~--~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~Cgk~F~~~~~L~~H~~~~H~~~~  217 (345)
                      .     ...............|.  .|...+.....+..|...|... +...+.|..|.+.|.....+..|.+.+.....
T Consensus       368 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  446 (467)
T COG5048         368 NNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSF-RPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAP  446 (467)
T ss_pred             CCCCccchhhccCccCCccccccccchhhhhcccccccccccccccc-CCcCCCCCcchhhccCcccccccccccccCCc
Confidence            2     11111112223444443  3777888888888887777532 22357889999999999999999887765433


No 56 
>KOG2785|consensus
Probab=94.64  E-value=0.083  Score=50.49  Aligned_cols=27  Identities=11%  Similarity=0.129  Sum_probs=22.5

Q ss_pred             eeecCccccccCCHHHHHHHHHHhcCC
Q psy4572         189 HYQCELCGKSFNNKQTFSLSNMYVLSL  215 (345)
Q Consensus       189 ~y~C~~Cgk~F~~~~~L~~H~~~~H~~  215 (345)
                      |-.|-.|++.|.+-..-..||..+|++
T Consensus       166 Pt~CLfC~~~~k~~e~~~~HM~~~Hgf  192 (390)
T KOG2785|consen  166 PTDCLFCDKKSKSLEENLKHMFKEHGF  192 (390)
T ss_pred             CcceeecCCCcccHHHHHHHHhhccCC
Confidence            567888888888888888888888884


No 57 
>KOG2482|consensus
Probab=93.98  E-value=0.17  Score=47.74  Aligned_cols=54  Identities=19%  Similarity=0.332  Sum_probs=36.4

Q ss_pred             CCCcchhhccCCHHHHHHHHhhhCCC------------------------CcCceeecCccccccCCHHHHHHHHHHh
Q psy4572         159 YQCETCRKLFLSKYSLFTHNKKHLPI------------------------EERLHYQCELCGKSFNNKQTFSLSNMYV  212 (345)
Q Consensus       159 y~C~~Cgk~F~s~s~L~~H~~~H~~~------------------------~~~k~y~C~~Cgk~F~~~~~L~~H~~~~  212 (345)
                      ..|-.|....-+...|..|++..|.-                        +..+.-.|-.|.-.|-....|..|+..+
T Consensus       280 v~CLfC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~e~  357 (423)
T KOG2482|consen  280 VVCLFCTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMVED  357 (423)
T ss_pred             eEEEeeccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhcccc
Confidence            35666666666666666666654411                        0122356888999999999999998644


No 58 
>PF12171 zf-C2H2_jaz:  Zinc-finger double-stranded RNA-binding;  InterPro: IPR022755  This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation.   This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=93.77  E-value=0.014  Score=34.18  Aligned_cols=22  Identities=18%  Similarity=0.528  Sum_probs=15.0

Q ss_pred             eecCccccccCCHHHHHHHHHH
Q psy4572         190 YQCELCGKSFNNKQTFSLSNMY  211 (345)
Q Consensus       190 y~C~~Cgk~F~~~~~L~~H~~~  211 (345)
                      |.|..|++.|.+...+..|++.
T Consensus         2 ~~C~~C~k~f~~~~~~~~H~~s   23 (27)
T PF12171_consen    2 FYCDACDKYFSSENQLKQHMKS   23 (27)
T ss_dssp             CBBTTTTBBBSSHHHHHCCTTS
T ss_pred             CCcccCCCCcCCHHHHHHHHcc
Confidence            5677777777777777776543


No 59 
>PF13913 zf-C2HC_2:  zinc-finger of a C2HC-type
Probab=92.68  E-value=0.094  Score=30.21  Aligned_cols=21  Identities=24%  Similarity=0.683  Sum_probs=13.8

Q ss_pred             eecCccccccCCHHHHHHHHHH
Q psy4572         190 YQCELCGKSFNNKQTFSLSNMY  211 (345)
Q Consensus       190 y~C~~Cgk~F~~~~~L~~H~~~  211 (345)
                      ..|+.||+.| ..+.|.+|.+.
T Consensus         3 ~~C~~CgR~F-~~~~l~~H~~~   23 (25)
T PF13913_consen    3 VPCPICGRKF-NPDRLEKHEKI   23 (25)
T ss_pred             CcCCCCCCEE-CHHHHHHHHHh
Confidence            3577777777 56667777653


No 60 
>PF12171 zf-C2H2_jaz:  Zinc-finger double-stranded RNA-binding;  InterPro: IPR022755  This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation.   This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=92.59  E-value=0.035  Score=32.45  Aligned_cols=22  Identities=36%  Similarity=0.720  Sum_probs=16.2

Q ss_pred             CCCcchhhccCCHHHHHHHHhh
Q psy4572         159 YQCETCRKLFLSKYSLFTHNKK  180 (345)
Q Consensus       159 y~C~~Cgk~F~s~s~L~~H~~~  180 (345)
                      |.|..|++.|.+...+..|++.
T Consensus         2 ~~C~~C~k~f~~~~~~~~H~~s   23 (27)
T PF12171_consen    2 FYCDACDKYFSSENQLKQHMKS   23 (27)
T ss_dssp             CBBTTTTBBBSSHHHHHCCTTS
T ss_pred             CCcccCCCCcCCHHHHHHHHcc
Confidence            5677777777777777777664


No 61 
>KOG2482|consensus
Probab=92.44  E-value=0.19  Score=47.32  Aligned_cols=26  Identities=15%  Similarity=0.227  Sum_probs=23.2

Q ss_pred             eecCccccccCCHHHHHHHHHHhcCC
Q psy4572         190 YQCELCGKSFNNKQTFSLSNMYVLSL  215 (345)
Q Consensus       190 y~C~~Cgk~F~~~~~L~~H~~~~H~~  215 (345)
                      ..|-.|.....+...|..|++.+|..
T Consensus       280 v~CLfC~~~~en~~~l~eHmk~vHe~  305 (423)
T KOG2482|consen  280 VVCLFCTNFYENPVFLFEHMKIVHEF  305 (423)
T ss_pred             eEEEeeccchhhHHHHHHHHHHHHHh
Confidence            58999999999999999999998863


No 62 
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=91.92  E-value=0.19  Score=40.09  Aligned_cols=25  Identities=28%  Similarity=0.794  Sum_probs=13.9

Q ss_pred             CCCCCcchhhccCCHHHHHHHHhhh
Q psy4572         157 QPYQCETCRKLFLSKYSLFTHNKKH  181 (345)
Q Consensus       157 k~y~C~~Cgk~F~s~s~L~~H~~~H  181 (345)
                      ..|+|+.|+..|--.=+...|...|
T Consensus        80 ~~y~C~~C~~~FC~dCD~fiHe~Lh  104 (112)
T TIGR00622        80 HRYVCAVCKNVFCVDCDVFVHESLH  104 (112)
T ss_pred             cceeCCCCCCccccccchhhhhhcc
Confidence            3466666666665555555554444


No 63 
>KOG4173|consensus
Probab=91.32  E-value=0.11  Score=45.55  Aligned_cols=77  Identities=26%  Similarity=0.468  Sum_probs=42.8

Q ss_pred             eeccc--cccccCCHHHHHHHHHhcCCCCCccCCCCcccccCHHHHHHHHHHh----------cCCCcccc--CCCCCCc
Q psy4572          75 FPCTV--CLKMFKTKAQREEHIASHVTDSEFKCDQCPYVAKRIKYLRYHMKKH----------RKEYSAYC--EPCQEGF  140 (345)
Q Consensus        75 ~~C~~--C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~~~f~~~~~L~~H~~~h----------~~~~~~~C--~~C~k~F  140 (345)
                      +.|++  |.+.|........|..+-|+.   .|..|.+.|.+..-|..|+...          .|...|.|  +.|+..|
T Consensus        80 ~~cqvagc~~~~d~lD~~E~hY~~~h~~---sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~KF  156 (253)
T KOG4173|consen   80 FACQVAGCCQVFDALDDYEHHYHTLHGN---SCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTEKF  156 (253)
T ss_pred             ccccccchHHHHhhhhhHHHhhhhcccc---hhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhhhh
Confidence            55665  666666666666665543332   5666666666666666665422          23444555  4455555


Q ss_pred             CCHHHHHHHHHHhc
Q psy4572         141 FSRDKLETHNIAKH  154 (345)
Q Consensus       141 ~~~~~l~~H~~~hh  154 (345)
                      .+....+.|+...|
T Consensus       157 kT~r~RkdH~I~~H  170 (253)
T KOG4173|consen  157 KTSRDRKDHMIRMH  170 (253)
T ss_pred             hhhhhhhhHHHHhc
Confidence            55555555544433


No 64 
>KOG4173|consensus
Probab=91.07  E-value=0.11  Score=45.51  Aligned_cols=77  Identities=25%  Similarity=0.481  Sum_probs=46.4

Q ss_pred             CCccCCC--CcccccCHHHHHHHHHHhcCCCccccCCCCCCcCCHHHHHHHHHHhc----------CCCCCCC--cchhh
Q psy4572         101 SEFKCDQ--CPYVAKRIKYLRYHMKKHRKEYSAYCEPCQEGFFSRDKLETHNIAKH----------GAQPYQC--ETCRK  166 (345)
Q Consensus       101 k~~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~k~F~~~~~l~~H~~~hh----------~~k~y~C--~~Cgk  166 (345)
                      ..|.|+.  |...|.....+..|..+-++.   .|..|.+.|.+..-|..|+...|          |...|+|  ..|+.
T Consensus        78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~~---sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~  154 (253)
T KOG4173|consen   78 PAFACQVAGCCQVFDALDDYEHHYHTLHGN---SCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTE  154 (253)
T ss_pred             ccccccccchHHHHhhhhhHHHhhhhcccc---hhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhh
Confidence            4467764  666777777777776554432   57777777777777777766554          2333555  23555


Q ss_pred             ccCCHHHHHHHHhh
Q psy4572         167 LFLSKYSLFTHNKK  180 (345)
Q Consensus       167 ~F~s~s~L~~H~~~  180 (345)
                      .|.+...-..|+-.
T Consensus       155 KFkT~r~RkdH~I~  168 (253)
T KOG4173|consen  155 KFKTSRDRKDHMIR  168 (253)
T ss_pred             hhhhhhhhhhHHHH
Confidence            55555555555443


No 65 
>KOG2893|consensus
Probab=91.07  E-value=0.06  Score=48.17  Aligned_cols=47  Identities=26%  Similarity=0.514  Sum_probs=27.6

Q ss_pred             CcchhhccCCHHHHHHHHhhhCCCCcCceeecCccccccCCHHHHHHHHHHhcC
Q psy4572         161 CETCRKLFLSKYSLFTHNKKHLPIEERLHYQCELCGKSFNNKQTFSLSNMYVLS  214 (345)
Q Consensus       161 C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~Cgk~F~~~~~L~~H~~~~H~  214 (345)
                      |=+|++.|....-|.+|++..|       |+|.+|.|..-+--.|..|-..+|.
T Consensus        13 cwycnrefddekiliqhqkakh-------fkchichkkl~sgpglsihcmqvhk   59 (341)
T KOG2893|consen   13 CWYCNREFDDEKILIQHQKAKH-------FKCHICHKKLFSGPGLSIHCMQVHK   59 (341)
T ss_pred             eeecccccchhhhhhhhhhhcc-------ceeeeehhhhccCCCceeehhhhhh
Confidence            5556666666666666655544       6666666655555555555444443


No 66 
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=90.95  E-value=0.16  Score=31.27  Aligned_cols=24  Identities=17%  Similarity=0.414  Sum_probs=20.4

Q ss_pred             eeecCccccccCCHHHHHHHHHHh
Q psy4572         189 HYQCELCGKSFNNKQTFSLSNMYV  212 (345)
Q Consensus       189 ~y~C~~Cgk~F~~~~~L~~H~~~~  212 (345)
                      +|.|++|++.|.+...+..|++..
T Consensus         3 ~~~C~~C~~~~~~~~~~~~H~~gk   26 (35)
T smart00451        3 GFYCKLCNVTFTDEISVEAHLKGK   26 (35)
T ss_pred             CeEccccCCccCCHHHHHHHHChH
Confidence            588999999999999999997644


No 67 
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=90.65  E-value=0.26  Score=39.36  Aligned_cols=85  Identities=12%  Similarity=0.208  Sum_probs=46.6

Q ss_pred             CceeccccccccCCHHHHHHHHHhcCC------------CCCccCCCCcccccCHHHHHHHHHHhcCCCccccCCCCCCc
Q psy4572          73 KTFPCTVCLKMFKTKAQREEHIASHVT------------DSEFKCDQCPYVAKRIKYLRYHMKKHRKEYSAYCEPCQEGF  140 (345)
Q Consensus        73 k~~~C~~C~k~F~~~~~L~~H~~~h~~------------~k~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~k~F  140 (345)
                      -|..|+.|+-..-....|.+...--..            ...-.|-.|...|.......  ...-.....|+|+.|...|
T Consensus        14 LP~~CpiCgLtLVss~HLARSyHHLfPl~~f~ev~~~~~~~~~~C~~C~~~f~~~~~~~--~~~~~~~~~y~C~~C~~~F   91 (112)
T TIGR00622        14 LPVECPICGLTLILSTHLARSYHHLFPLKAFQEIPLEEYNGSRFCFGCQGPFPKPPVSP--FDELKDSHRYVCAVCKNVF   91 (112)
T ss_pred             CCCcCCcCCCEEeccchHHHhhhccCCCcccccccccccCCCCcccCcCCCCCCccccc--ccccccccceeCCCCCCcc
Confidence            356677777666665555543210000            11124888888776543111  0001233467888888888


Q ss_pred             CCHHHHHHHHHHhcCCCCCCCcchh
Q psy4572         141 FSRDKLETHNIAKHGAQPYQCETCR  165 (345)
Q Consensus       141 ~~~~~l~~H~~~hh~~k~y~C~~Cg  165 (345)
                      =-.-+.-.|...|      .|+.|.
T Consensus        92 C~dCD~fiHe~Lh------~CPGC~  110 (112)
T TIGR00622        92 CVDCDVFVHESLH------CCPGCI  110 (112)
T ss_pred             ccccchhhhhhcc------CCcCCC
Confidence            7776666775554      377665


No 68 
>PF12013 DUF3505:  Protein of unknown function (DUF3505);  InterPro: IPR022698  This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains. 
Probab=90.21  E-value=0.33  Score=38.54  Aligned_cols=85  Identities=16%  Similarity=0.225  Sum_probs=47.5

Q ss_pred             CccccCCCCCCcCCHHHHHHHHHHhcCCC-CCCCcchhhccCCHHHHHH-HHhhhCCCCc---------Cceeec----C
Q psy4572         129 YSAYCEPCQEGFFSRDKLETHNIAKHGAQ-PYQCETCRKLFLSKYSLFT-HNKKHLPIEE---------RLHYQC----E  193 (345)
Q Consensus       129 ~~~~C~~C~k~F~~~~~l~~H~~~hh~~k-~y~C~~Cgk~F~s~s~L~~-H~~~H~~~~~---------~k~y~C----~  193 (345)
                      +...|..|+..... .++..|.+..|... ......=.+.+.....+.. ......+...         ..-|.|    +
T Consensus        10 ~vlIC~~C~~av~~-~~v~~HL~~~H~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~p~~~~~Pi~gLp~~~G~~C~~~~~   88 (109)
T PF12013_consen   10 RVLICRQCQYAVQP-SEVESHLRKRHHILKSQERQRIVEAIRQWPDLLPDPDDLQIPPDPSPPIPGLPVYDGYRCQCDPP   88 (109)
T ss_pred             CEEEeCCCCcccCc-hHHHHHHHHhcccccHHHHHHHHHHHHhhhhcccCccccCCCCCCCCcCCCCCCCCCeeeecCCC
Confidence            34579999988766 88999998655332 1111100011111110100 0000000000         112999    9


Q ss_pred             ccccccCCHHHHHHHHHHhcC
Q psy4572         194 LCGKSFNNKQTFSLSNMYVLS  214 (345)
Q Consensus       194 ~Cgk~F~~~~~L~~H~~~~H~  214 (345)
                      .|++.+.+...+.+|.+.+|+
T Consensus        89 ~C~y~~~~~~~m~~H~~~~Hg  109 (109)
T PF12013_consen   89 HCGYITRSKKTMRKHWRKEHG  109 (109)
T ss_pred             CCCcEeccHHHHHHHHHHhcC
Confidence            999999999999999999996


No 69 
>PF13913 zf-C2HC_2:  zinc-finger of a C2HC-type
Probab=90.02  E-value=0.25  Score=28.43  Aligned_cols=19  Identities=32%  Similarity=0.630  Sum_probs=11.4

Q ss_pred             CCcchhhccCCHHHHHHHHh
Q psy4572         160 QCETCRKLFLSKYSLFTHNK  179 (345)
Q Consensus       160 ~C~~Cgk~F~s~s~L~~H~~  179 (345)
                      .|+.||+.| ..+.|.+|..
T Consensus         4 ~C~~CgR~F-~~~~l~~H~~   22 (25)
T PF13913_consen    4 PCPICGRKF-NPDRLEKHEK   22 (25)
T ss_pred             cCCCCCCEE-CHHHHHHHHH
Confidence            466666666 4555666654


No 70 
>KOG2893|consensus
Probab=89.05  E-value=0.1  Score=46.74  Aligned_cols=44  Identities=30%  Similarity=0.613  Sum_probs=36.6

Q ss_pred             eeccccccccCCHHHHHHHHHHcCCCCceeccccccccCCHHHHHHHH
Q psy4572          47 HKCEHCEKEFENRRALKRHEKNHTQEKTFPCTVCLKMFKTKAQREEHI   94 (345)
Q Consensus        47 ~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~k~F~~~~~L~~H~   94 (345)
                      -.|-.|++.|.++.-|.+|++.    +.|+|.+|.+...+--.|..|.
T Consensus        11 pwcwycnrefddekiliqhqka----khfkchichkkl~sgpglsihc   54 (341)
T KOG2893|consen   11 PWCWYCNREFDDEKILIQHQKA----KHFKCHICHKKLFSGPGLSIHC   54 (341)
T ss_pred             ceeeecccccchhhhhhhhhhh----ccceeeeehhhhccCCCceeeh
Confidence            4799999999999999999887    3499999998877766666664


No 71 
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=88.81  E-value=0.34  Score=29.78  Aligned_cols=22  Identities=23%  Similarity=0.525  Sum_probs=13.9

Q ss_pred             ceeccccccccCCHHHHHHHHH
Q psy4572          46 PHKCEHCEKEFENRRALKRHEK   67 (345)
Q Consensus        46 ~~~C~~C~k~F~~~~~L~~H~~   67 (345)
                      +|.|..|++.|.+...+..|++
T Consensus         3 ~~~C~~C~~~~~~~~~~~~H~~   24 (35)
T smart00451        3 GFYCKLCNVTFTDEISVEAHLK   24 (35)
T ss_pred             CeEccccCCccCCHHHHHHHHC
Confidence            4666666666666666666654


No 72 
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=88.52  E-value=0.27  Score=33.83  Aligned_cols=31  Identities=13%  Similarity=0.319  Sum_probs=25.5

Q ss_pred             CcCceeecCccccccCCHHHHHHHHHHhcCC
Q psy4572         185 EERLHYQCELCGKSFNNKQTFSLSNMYVLSL  215 (345)
Q Consensus       185 ~~~k~y~C~~Cgk~F~~~~~L~~H~~~~H~~  215 (345)
                      +++.-+.||-||..|..+....+|....|+.
T Consensus        13 DGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~~   43 (65)
T COG4049          13 DGEEFLRCPRCGMVFRRRKDYIRHVNKAHGW   43 (65)
T ss_pred             CCceeeeCCchhHHHHHhHHHHHHhhHHhhh
Confidence            3566788999999999999999998877764


No 73 
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer.  Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain.  Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=86.04  E-value=0.37  Score=29.68  Aligned_cols=11  Identities=18%  Similarity=0.570  Sum_probs=5.8

Q ss_pred             eeccccccccC
Q psy4572          47 HKCEHCEKEFE   57 (345)
Q Consensus        47 ~~C~~C~k~F~   57 (345)
                      |+|..||..+.
T Consensus         2 ~~C~~CGy~y~   12 (33)
T cd00350           2 YVCPVCGYIYD   12 (33)
T ss_pred             EECCCCCCEEC
Confidence            45555555544


No 74 
>PF09986 DUF2225:  Uncharacterized protein conserved in bacteria (DUF2225);  InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=84.17  E-value=0.18  Score=45.33  Aligned_cols=46  Identities=17%  Similarity=0.251  Sum_probs=24.8

Q ss_pred             CCCCCcchhhccCCHHHHHHHHhhhC-C------CCcCc-----eeecCccccccCCH
Q psy4572         157 QPYQCETCRKLFLSKYSLFTHNKKHL-P------IEERL-----HYQCELCGKSFNNK  202 (345)
Q Consensus       157 k~y~C~~Cgk~F~s~s~L~~H~~~H~-~------~~~~k-----~y~C~~Cgk~F~~~  202 (345)
                      +.+.||+|++.|.+..-.....+.-. .      -++-.     ...||.||..|...
T Consensus         4 k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~~   61 (214)
T PF09986_consen    4 KKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFEE   61 (214)
T ss_pred             CceECCCCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCccccc
Confidence            45667777777776543333333210 0      01111     25699999987654


No 75 
>PF12013 DUF3505:  Protein of unknown function (DUF3505);  InterPro: IPR022698  This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains. 
Probab=83.80  E-value=1.7  Score=34.43  Aligned_cols=24  Identities=29%  Similarity=0.530  Sum_probs=22.7

Q ss_pred             CCC----cchhhccCCHHHHHHHHhhhC
Q psy4572         159 YQC----ETCRKLFLSKYSLFTHNKKHL  182 (345)
Q Consensus       159 y~C----~~Cgk~F~s~s~L~~H~~~H~  182 (345)
                      |.|    +.|+..+.+...+.+|.+.+|
T Consensus        81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~H  108 (109)
T PF12013_consen   81 YRCQCDPPHCGYITRSKKTMRKHWRKEH  108 (109)
T ss_pred             eeeecCCCCCCcEeccHHHHHHHHHHhc
Confidence            889    999999999999999999887


No 76 
>PF09538 FYDLN_acid:  Protein of unknown function (FYDLN_acid);  InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=83.70  E-value=0.54  Score=37.45  Aligned_cols=30  Identities=23%  Similarity=0.602  Sum_probs=18.5

Q ss_pred             CCCcchhhccCCHHHHHHHHhhhCCCCcCceeecCccccccCCH
Q psy4572         159 YQCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCELCGKSFNNK  202 (345)
Q Consensus       159 y~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~Cgk~F~~~  202 (345)
                      ..|+.||++|-..              ++.|-.||.||..|.-.
T Consensus        10 R~Cp~CG~kFYDL--------------nk~PivCP~CG~~~~~~   39 (108)
T PF09538_consen   10 RTCPSCGAKFYDL--------------NKDPIVCPKCGTEFPPE   39 (108)
T ss_pred             ccCCCCcchhccC--------------CCCCccCCCCCCccCcc
Confidence            3577777776553              33466677777776544


No 77 
>KOG2186|consensus
Probab=81.33  E-value=0.87  Score=41.33  Aligned_cols=47  Identities=19%  Similarity=0.504  Sum_probs=30.0

Q ss_pred             eeccccccccCCHHHHHHHHHHcCCCCceeccccccccCCHHHHHHHHHh
Q psy4572          47 HKCEHCEKEFENRRALKRHEKNHTQEKTFPCTVCLKMFKTKAQREEHIAS   96 (345)
Q Consensus        47 ~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~k~F~~~~~L~~H~~~   96 (345)
                      |.|..||....... +.+|+-..++ .-|.|-.|++.|.. .+...|..-
T Consensus         4 FtCnvCgEsvKKp~-vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~kC   50 (276)
T KOG2186|consen    4 FTCNVCGESVKKPQ-VEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTKC   50 (276)
T ss_pred             Eehhhhhhhccccc-hHHHHHhccC-CeeEEeeccccccc-chhhhhhhh
Confidence            66777777655533 4557766666 44777777777766 556666543


No 78 
>PF09538 FYDLN_acid:  Protein of unknown function (FYDLN_acid);  InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=81.22  E-value=0.88  Score=36.24  Aligned_cols=30  Identities=20%  Similarity=0.576  Sum_probs=18.6

Q ss_pred             eeccccccccCCHHHHHHHHHHcCCCCceeccccccccCCH
Q psy4572          47 HKCEHCEKEFENRRALKRHEKNHTQEKTFPCTVCLKMFKTK   87 (345)
Q Consensus        47 ~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~k~F~~~   87 (345)
                      ..|+.||+.|...           +..|..||.||..|.-.
T Consensus        10 R~Cp~CG~kFYDL-----------nk~PivCP~CG~~~~~~   39 (108)
T PF09538_consen   10 RTCPSCGAKFYDL-----------NKDPIVCPKCGTEFPPE   39 (108)
T ss_pred             ccCCCCcchhccC-----------CCCCccCCCCCCccCcc
Confidence            4677777776652           23566677777766554


No 79 
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=81.10  E-value=0.83  Score=31.51  Aligned_cols=32  Identities=19%  Similarity=0.374  Sum_probs=21.1

Q ss_pred             CCCCCCceeccccccccCCHHHHHHHHHHcCC
Q psy4572          40 PQKLDGPHKCEHCEKEFENRRALKRHEKNHTQ   71 (345)
Q Consensus        40 ~~~~~~~~~C~~C~k~F~~~~~L~~H~~~h~~   71 (345)
                      ...++.-+.||.|+..|.......+|...-|+
T Consensus        11 ~RDGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~   42 (65)
T COG4049          11 DRDGEEFLRCPRCGMVFRRRKDYIRHVNKAHG   42 (65)
T ss_pred             ccCCceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence            34455567777777777777777777665443


No 80 
>PRK04023 DNA polymerase II large subunit; Validated
Probab=79.67  E-value=2.3  Score=45.99  Aligned_cols=24  Identities=33%  Similarity=0.557  Sum_probs=15.4

Q ss_pred             CCceeccccccccCCHHHHHHHHHHcCCCCceeccccccc
Q psy4572          44 DGPHKCEHCEKEFENRRALKRHEKNHTQEKTFPCTVCLKM   83 (345)
Q Consensus        44 ~~~~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~k~   83 (345)
                      .....|+.||...                ..+.|+.||..
T Consensus       624 Vg~RfCpsCG~~t----------------~~frCP~CG~~  647 (1121)
T PRK04023        624 IGRRKCPSCGKET----------------FYRRCPFCGTH  647 (1121)
T ss_pred             ccCccCCCCCCcC----------------CcccCCCCCCC
Confidence            3456788888762                12677777765


No 81 
>PF02892 zf-BED:  BED zinc finger;  InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=78.39  E-value=1.6  Score=28.47  Aligned_cols=27  Identities=15%  Similarity=0.301  Sum_probs=16.4

Q ss_pred             CceeecCccccccCCH----HHHHHHHHHhc
Q psy4572         187 RLHYQCELCGKSFNNK----QTFSLSNMYVL  213 (345)
Q Consensus       187 ~k~y~C~~Cgk~F~~~----~~L~~H~~~~H  213 (345)
                      +...+|..|++.+...    +.|.+|++..|
T Consensus        14 ~~~a~C~~C~~~~~~~~~~ts~l~~HL~~~h   44 (45)
T PF02892_consen   14 KKKAKCKYCGKVIKYSSGGTSNLKRHLKKKH   44 (45)
T ss_dssp             SS-EEETTTTEE-----SSTHHHHHHHHHTT
T ss_pred             cCeEEeCCCCeEEeeCCCcHHHHHHhhhhhC
Confidence            4457788888877664    78888886655


No 82 
>PF13719 zinc_ribbon_5:  zinc-ribbon domain
Probab=78.37  E-value=1.6  Score=27.61  Aligned_cols=34  Identities=18%  Similarity=0.501  Sum_probs=18.1

Q ss_pred             eeccccccccCCHHHHHHHHHHcCCCCceeccccccccC
Q psy4572          47 HKCEHCEKEFENRRALKRHEKNHTQEKTFPCTVCLKMFK   85 (345)
Q Consensus        47 ~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~k~F~   85 (345)
                      ..||.|+..|.-...-     ...+.+..+|+.|+..|.
T Consensus         3 i~CP~C~~~f~v~~~~-----l~~~~~~vrC~~C~~~f~   36 (37)
T PF13719_consen    3 ITCPNCQTRFRVPDDK-----LPAGGRKVRCPKCGHVFR   36 (37)
T ss_pred             EECCCCCceEEcCHHH-----cccCCcEEECCCCCcEee
Confidence            3577777766654331     112233466777766653


No 83 
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=77.56  E-value=1.7  Score=27.37  Aligned_cols=33  Identities=15%  Similarity=0.442  Sum_probs=16.3

Q ss_pred             eeccccccccCCHHHHHHHHHHcCCCCceecccccccc
Q psy4572          47 HKCEHCEKEFENRRALKRHEKNHTQEKTFPCTVCLKMF   84 (345)
Q Consensus        47 ~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~k~F   84 (345)
                      +.|+.|+..|.-.....     ..+.....|+.|+..|
T Consensus         3 ~~CP~C~~~~~v~~~~~-----~~~~~~v~C~~C~~~~   35 (38)
T TIGR02098         3 IQCPNCKTSFRVVDSQL-----GANGGKVRCGKCGHVW   35 (38)
T ss_pred             EECCCCCCEEEeCHHHc-----CCCCCEEECCCCCCEE
Confidence            45666666665443321     0111235666666655


No 84 
>PRK04023 DNA polymerase II large subunit; Validated
Probab=77.00  E-value=2.1  Score=46.22  Aligned_cols=22  Identities=23%  Similarity=0.318  Sum_probs=16.0

Q ss_pred             ceeccccccccCCHHHHHHHHHhcCCCCCccCCCCccc
Q psy4572          74 TFPCTVCLKMFKTKAQREEHIASHVTDSEFKCDQCPYV  111 (345)
Q Consensus        74 ~~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~~~  111 (345)
                      ...|+.||...                ..+.|+.||..
T Consensus       626 ~RfCpsCG~~t----------------~~frCP~CG~~  647 (1121)
T PRK04023        626 RRKCPSCGKET----------------FYRRCPFCGTH  647 (1121)
T ss_pred             CccCCCCCCcC----------------CcccCCCCCCC
Confidence            46799999873                23688888865


No 85 
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=76.13  E-value=0.6  Score=39.61  Aligned_cols=20  Identities=30%  Similarity=0.619  Sum_probs=10.9

Q ss_pred             eecCccccccCCHHHHHHHH
Q psy4572         190 YQCELCGKSFNNKQTFSLSN  209 (345)
Q Consensus       190 y~C~~Cgk~F~~~~~L~~H~  209 (345)
                      ++|+.||+.|.+-..+..=+
T Consensus        29 ~~c~~c~~~f~~~e~~~~~~   48 (154)
T PRK00464         29 RECLACGKRFTTFERVELVP   48 (154)
T ss_pred             eeccccCCcceEeEeccCcc
Confidence            66666666665544443333


No 86 
>PF13717 zinc_ribbon_4:  zinc-ribbon domain
Probab=75.53  E-value=2.4  Score=26.58  Aligned_cols=33  Identities=18%  Similarity=0.551  Sum_probs=18.0

Q ss_pred             eeccccccccCCHHHHHHHHHHcCCCCceecccccccc
Q psy4572          47 HKCEHCEKEFENRRALKRHEKNHTQEKTFPCTVCLKMF   84 (345)
Q Consensus        47 ~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~k~F   84 (345)
                      ..|+.|+..|.-.....     -.+....+|+.|+..|
T Consensus         3 i~Cp~C~~~y~i~d~~i-----p~~g~~v~C~~C~~~f   35 (36)
T PF13717_consen    3 ITCPNCQAKYEIDDEKI-----PPKGRKVRCSKCGHVF   35 (36)
T ss_pred             EECCCCCCEEeCCHHHC-----CCCCcEEECCCCCCEe
Confidence            45777777666544321     1222346677776665


No 87 
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=75.47  E-value=1.7  Score=36.58  Aligned_cols=38  Identities=16%  Similarity=0.446  Sum_probs=21.4

Q ss_pred             CCCCCCCcchhhccCCHHHHHHHHhhhCCCCcCceeecCcccccc
Q psy4572         155 GAQPYQCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCELCGKSF  199 (345)
Q Consensus       155 ~~k~y~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~Cgk~F  199 (345)
                      ...-|.|+.|+..|.....+..-   ..    ...|.||.||...
T Consensus        96 ~~~~Y~Cp~C~~~y~~~ea~~~~---d~----~~~f~Cp~Cg~~l  133 (147)
T smart00531       96 NNAYYKCPNCQSKYTFLEANQLL---DM----DGTFTCPRCGEEL  133 (147)
T ss_pred             CCcEEECcCCCCEeeHHHHHHhc---CC----CCcEECCCCCCEE
Confidence            34457788888777754333220   01    1238888888654


No 88 
>KOG2807|consensus
Probab=74.58  E-value=3.7  Score=38.74  Aligned_cols=26  Identities=31%  Similarity=0.832  Sum_probs=18.3

Q ss_pred             CCCCCCcchhhccCCHHHHHHHHhhh
Q psy4572         156 AQPYQCETCRKLFLSKYSLFTHNKKH  181 (345)
Q Consensus       156 ~k~y~C~~Cgk~F~s~s~L~~H~~~H  181 (345)
                      ...|+|+.|...|-..-+...|...|
T Consensus       343 ~~~y~C~~Ck~~FCldCDv~iHesLh  368 (378)
T KOG2807|consen  343 SGRYRCESCKNVFCLDCDVFIHESLH  368 (378)
T ss_pred             CCcEEchhccceeeccchHHHHhhhh
Confidence            45577888877777776666776665


No 89 
>PF09986 DUF2225:  Uncharacterized protein conserved in bacteria (DUF2225);  InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=74.57  E-value=0.5  Score=42.38  Aligned_cols=21  Identities=24%  Similarity=0.547  Sum_probs=13.2

Q ss_pred             CceeccccccccCCHHHHHHH
Q psy4572          45 GPHKCEHCEKEFENRRALKRH   65 (345)
Q Consensus        45 ~~~~C~~C~k~F~~~~~L~~H   65 (345)
                      +...||.|++.|.........
T Consensus         4 k~~~CPvC~~~F~~~~vrs~~   24 (214)
T PF09986_consen    4 KKITCPVCGKEFKTKKVRSGK   24 (214)
T ss_pred             CceECCCCCCeeeeeEEEcCC
Confidence            456777777777765444333


No 90 
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=73.09  E-value=1.7  Score=35.33  Aligned_cols=30  Identities=17%  Similarity=0.244  Sum_probs=17.5

Q ss_pred             CCCcchhhccCCHHHHHHHHhhhCCCCcCceeecCccccccCCH
Q psy4572         159 YQCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCELCGKSFNNK  202 (345)
Q Consensus       159 y~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~Cgk~F~~~  202 (345)
                      ..|+.||++|-..              ++.|-.|+.||..|.-.
T Consensus        10 r~Cp~cg~kFYDL--------------nk~p~vcP~cg~~~~~~   39 (129)
T TIGR02300        10 RICPNTGSKFYDL--------------NRRPAVSPYTGEQFPPE   39 (129)
T ss_pred             ccCCCcCcccccc--------------CCCCccCCCcCCccCcc
Confidence            3566666666543              33456677777666544


No 91 
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=71.57  E-value=2  Score=45.43  Aligned_cols=12  Identities=25%  Similarity=0.719  Sum_probs=6.7

Q ss_pred             CceeecCccccc
Q psy4572         187 RLHYQCELCGKS  198 (345)
Q Consensus       187 ~k~y~C~~Cgk~  198 (345)
                      ..|..|+.||-.
T Consensus       473 ~~p~~Cp~Cgs~  484 (730)
T COG1198         473 PIPQSCPECGSE  484 (730)
T ss_pred             CCCCCCCCCCCC
Confidence            345666666643


No 92 
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=71.03  E-value=7  Score=32.78  Aligned_cols=37  Identities=16%  Similarity=0.426  Sum_probs=20.7

Q ss_pred             CCceeccccccccCCHHHHHHHHHhcCCCCCccCCCCcccc
Q psy4572          72 EKTFPCTVCLKMFKTKAQREEHIASHVTDSEFKCDQCPYVA  112 (345)
Q Consensus        72 ~k~~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~~~f  112 (345)
                      ..-|.|+.|+..|.....+..   .. ....|.|+.||...
T Consensus        97 ~~~Y~Cp~C~~~y~~~ea~~~---~d-~~~~f~Cp~Cg~~l  133 (147)
T smart00531       97 NAYYKCPNCQSKYTFLEANQL---LD-MDGTFTCPRCGEEL  133 (147)
T ss_pred             CcEEECcCCCCEeeHHHHHHh---cC-CCCcEECCCCCCEE
Confidence            345778888877764332221   01 23337777777654


No 93 
>PF10571 UPF0547:  Uncharacterised protein family UPF0547;  InterPro: IPR018886  This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases. 
Probab=70.77  E-value=2.1  Score=24.87  Aligned_cols=9  Identities=33%  Similarity=0.722  Sum_probs=4.4

Q ss_pred             CCcchhhcc
Q psy4572         160 QCETCRKLF  168 (345)
Q Consensus       160 ~C~~Cgk~F  168 (345)
                      .|+.||..|
T Consensus        16 ~Cp~CG~~F   24 (26)
T PF10571_consen   16 FCPHCGYDF   24 (26)
T ss_pred             cCCCCCCCC
Confidence            355555444


No 94 
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=70.76  E-value=2.7  Score=27.72  Aligned_cols=11  Identities=36%  Similarity=0.736  Sum_probs=6.2

Q ss_pred             eeccccccccC
Q psy4572          47 HKCEHCEKEFE   57 (345)
Q Consensus        47 ~~C~~C~k~F~   57 (345)
                      |.|..||..|.
T Consensus         3 Y~C~~Cg~~~~   13 (44)
T smart00659        3 YICGECGRENE   13 (44)
T ss_pred             EECCCCCCEee
Confidence            55666665554


No 95 
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=70.11  E-value=5.3  Score=34.00  Aligned_cols=15  Identities=20%  Similarity=0.301  Sum_probs=6.6

Q ss_pred             eeccccccccCCHHH
Q psy4572          75 FPCTVCLKMFKTKAQ   89 (345)
Q Consensus        75 ~~C~~C~k~F~~~~~   89 (345)
                      |.|+.|+..|+....
T Consensus       110 Y~Cp~c~~r~tf~eA  124 (158)
T TIGR00373       110 FICPNMCVRFTFNEA  124 (158)
T ss_pred             EECCCCCcEeeHHHH
Confidence            444444444444333


No 96 
>PF04959 ARS2:  Arsenite-resistance protein 2;  InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=69.40  E-value=4.8  Score=36.08  Aligned_cols=30  Identities=23%  Similarity=0.468  Sum_probs=22.0

Q ss_pred             CcCceeecCccccccCCHHHHHHHHHHhcC
Q psy4572         185 EERLHYQCELCGKSFNNKQTFSLSNMYVLS  214 (345)
Q Consensus       185 ~~~k~y~C~~Cgk~F~~~~~L~~H~~~~H~  214 (345)
                      +++..|.|+.|+|.|.-..-..+|+...|.
T Consensus        73 ~~~~K~~C~lc~KlFkg~eFV~KHI~nKH~  102 (214)
T PF04959_consen   73 EDEDKWRCPLCGKLFKGPEFVRKHIFNKHP  102 (214)
T ss_dssp             SSSEEEEE-SSS-EESSHHHHHHHHHHH-H
T ss_pred             HcCCEECCCCCCcccCChHHHHHHHhhcCH
Confidence            455578899999999888888888888885


No 97 
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=68.43  E-value=6.5  Score=33.48  Aligned_cols=42  Identities=12%  Similarity=0.155  Sum_probs=27.6

Q ss_pred             HHHHHHhcCCCCCccCCCCcccccCHHHHHHHHHHhcCCCccccCCCCCCc
Q psy4572          90 REEHIASHVTDSEFKCDQCPYVAKRIKYLRYHMKKHRKEYSAYCEPCQEGF  140 (345)
Q Consensus        90 L~~H~~~h~~~k~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~k~F  140 (345)
                      +..-+....+...|.|+.|+..|....++.         ..|.||.||...
T Consensus        97 lk~~l~~e~~~~~Y~Cp~c~~r~tf~eA~~---------~~F~Cp~Cg~~L  138 (158)
T TIGR00373        97 LREKLEFETNNMFFICPNMCVRFTFNEAME---------LNFTCPRCGAML  138 (158)
T ss_pred             HHHHHhhccCCCeEECCCCCcEeeHHHHHH---------cCCcCCCCCCEe
Confidence            334444445566688888888877776664         257888888653


No 98 
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=68.36  E-value=2.6  Score=39.39  Aligned_cols=48  Identities=29%  Similarity=0.522  Sum_probs=29.6

Q ss_pred             ccCCCCCCcCCHHHHHHHHHHhcCCCCCCCcchhhccCCHHHHHHHHhhh
Q psy4572         132 YCEPCQEGFFSRDKLETHNIAKHGAQPYQCETCRKLFLSKYSLFTHNKKH  181 (345)
Q Consensus       132 ~C~~C~k~F~~~~~l~~H~~~hh~~k~y~C~~Cgk~F~s~s~L~~H~~~H  181 (345)
                      .|-.|.-.|.-...-..-  .-.....|+|+.|...|-..-....|...|
T Consensus       364 ~Cf~CQ~~fp~~~~~~~~--~~~ss~rY~Ce~CK~~FC~dCdvfiHe~Lh  411 (421)
T COG5151         364 HCFVCQGPFPKPPVSPFD--ESTSSGRYQCELCKSTFCSDCDVFIHETLH  411 (421)
T ss_pred             cceeccCCCCCCCCCccc--ccccccceechhhhhhhhhhhHHHHHHHHh
Confidence            366666666543221000  011245688998988888888888887776


No 99 
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and  believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=68.13  E-value=2.5  Score=26.17  Aligned_cols=11  Identities=27%  Similarity=0.483  Sum_probs=5.7

Q ss_pred             eeccccccccC
Q psy4572          47 HKCEHCEKEFE   57 (345)
Q Consensus        47 ~~C~~C~k~F~   57 (345)
                      |.|..||..+.
T Consensus         3 ~~C~~CG~i~~   13 (34)
T cd00729           3 WVCPVCGYIHE   13 (34)
T ss_pred             EECCCCCCEeE
Confidence            45555555443


No 100
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=67.73  E-value=3.2  Score=33.77  Aligned_cols=30  Identities=10%  Similarity=0.190  Sum_probs=18.5

Q ss_pred             eeccccccccCCHHHHHHHHHHcCCCCceeccccccccCCH
Q psy4572          47 HKCEHCEKEFENRRALKRHEKNHTQEKTFPCTVCLKMFKTK   87 (345)
Q Consensus        47 ~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~k~F~~~   87 (345)
                      ..|+.||+.|...           +..|..|+.||..|...
T Consensus        10 r~Cp~cg~kFYDL-----------nk~p~vcP~cg~~~~~~   39 (129)
T TIGR02300        10 RICPNTGSKFYDL-----------NRRPAVSPYTGEQFPPE   39 (129)
T ss_pred             ccCCCcCcccccc-----------CCCCccCCCcCCccCcc
Confidence            4677777776652           23566777777666544


No 101
>PF05443 ROS_MUCR:  ROS/MUCR transcriptional regulator protein;  InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=67.66  E-value=3.5  Score=34.02  Aligned_cols=26  Identities=23%  Similarity=0.322  Sum_probs=15.5

Q ss_pred             eeecCccccccCCHHHHHHHHHHhcCCCC
Q psy4572         189 HYQCELCGKSFNNKQTFSLSNMYVLSLSN  217 (345)
Q Consensus       189 ~y~C~~Cgk~F~~~~~L~~H~~~~H~~~~  217 (345)
                      ...|-+||+.|..   |.+|++.||++..
T Consensus        72 ~i~clecGk~~k~---LkrHL~~~~gltp   97 (132)
T PF05443_consen   72 YIICLECGKKFKT---LKRHLRTHHGLTP   97 (132)
T ss_dssp             -EE-TBT--EESB---HHHHHHHTT-S-H
T ss_pred             eeEEccCCcccch---HHHHHHHccCCCH
Confidence            3678889998864   4889988888653


No 102
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=67.28  E-value=3.5  Score=28.88  Aligned_cols=7  Identities=57%  Similarity=1.725  Sum_probs=3.0

Q ss_pred             eecCccc
Q psy4572         190 YQCELCG  196 (345)
Q Consensus       190 y~C~~Cg  196 (345)
                      |+|+.||
T Consensus        51 Y~Cp~CG   57 (61)
T COG2888          51 YRCPKCG   57 (61)
T ss_pred             eECCCcC
Confidence            4444443


No 103
>KOG4167|consensus
Probab=67.16  E-value=3.6  Score=42.78  Aligned_cols=29  Identities=31%  Similarity=0.516  Sum_probs=22.7

Q ss_pred             CCCceeccccccccCCHHHHHHHHHHcCC
Q psy4572          43 LDGPHKCEHCEKEFENRRALKRHEKNHTQ   71 (345)
Q Consensus        43 ~~~~~~C~~C~k~F~~~~~L~~H~~~h~~   71 (345)
                      .++-|.|.+|++.|.....+..||++|..
T Consensus       789 ~~giFpCreC~kvF~KiKSrNAHMK~Hr~  817 (907)
T KOG4167|consen  789 PTGIFPCRECGKVFFKIKSRNAHMKTHRQ  817 (907)
T ss_pred             CCceeehHHHHHHHHHHhhhhHHHHHHHH
Confidence            34568888888888888888888888743


No 104
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=65.67  E-value=1.4  Score=29.85  Aligned_cols=12  Identities=33%  Similarity=0.866  Sum_probs=6.5

Q ss_pred             eeccccccccCC
Q psy4572          47 HKCEHCEKEFEN   58 (345)
Q Consensus        47 ~~C~~C~k~F~~   58 (345)
                      |+|+.|+..|..
T Consensus         6 y~C~~Cg~~fe~   17 (52)
T TIGR02605         6 YRCTACGHRFEV   17 (52)
T ss_pred             EEeCCCCCEeEE
Confidence            555555555553


No 105
>KOG2807|consensus
Probab=65.59  E-value=7.7  Score=36.69  Aligned_cols=24  Identities=21%  Similarity=0.429  Sum_probs=20.0

Q ss_pred             eeecCccccccCCHHHHHHHHHHh
Q psy4572         189 HYQCELCGKSFNNKQTFSLSNMYV  212 (345)
Q Consensus       189 ~y~C~~Cgk~F~~~~~L~~H~~~~  212 (345)
                      .|+|+.|...|-..-+...|...|
T Consensus       345 ~y~C~~Ck~~FCldCDv~iHesLh  368 (378)
T KOG2807|consen  345 RYRCESCKNVFCLDCDVFIHESLH  368 (378)
T ss_pred             cEEchhccceeeccchHHHHhhhh
Confidence            599999999998888888887655


No 106
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=65.41  E-value=3.6  Score=43.58  Aligned_cols=10  Identities=20%  Similarity=0.398  Sum_probs=5.7

Q ss_pred             eecccccccc
Q psy4572          47 HKCEHCEKEF   56 (345)
Q Consensus        47 ~~C~~C~k~F   56 (345)
                      ..|..||..+
T Consensus       436 l~C~~Cg~v~  445 (730)
T COG1198         436 LLCRDCGYIA  445 (730)
T ss_pred             eecccCCCcc
Confidence            4466666553


No 107
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=64.85  E-value=6.5  Score=34.21  Aligned_cols=13  Identities=23%  Similarity=0.352  Sum_probs=5.4

Q ss_pred             ccCCCCcccccCH
Q psy4572         103 FKCDQCPYVAKRI  115 (345)
Q Consensus       103 ~~C~~C~~~f~~~  115 (345)
                      |.|+.|+..|...
T Consensus       118 Y~Cp~C~~rytf~  130 (178)
T PRK06266        118 FFCPNCHIRFTFD  130 (178)
T ss_pred             EECCCCCcEEeHH
Confidence            4444444444333


No 108
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=64.66  E-value=1.5  Score=27.98  Aligned_cols=12  Identities=42%  Similarity=0.988  Sum_probs=6.4

Q ss_pred             eeccccccccCC
Q psy4572          47 HKCEHCEKEFEN   58 (345)
Q Consensus        47 ~~C~~C~k~F~~   58 (345)
                      |+|+.||..|..
T Consensus         6 y~C~~Cg~~fe~   17 (41)
T smart00834        6 YRCEDCGHTFEV   17 (41)
T ss_pred             EEcCCCCCEEEE
Confidence            555555555543


No 109
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=64.19  E-value=1.1  Score=38.02  Aligned_cols=15  Identities=33%  Similarity=0.512  Sum_probs=9.2

Q ss_pred             eeccccccccCCHHH
Q psy4572          75 FPCTVCLKMFKTKAQ   89 (345)
Q Consensus        75 ~~C~~C~k~F~~~~~   89 (345)
                      ++|+.||++|.+...
T Consensus        29 ~~c~~c~~~f~~~e~   43 (154)
T PRK00464         29 RECLACGKRFTTFER   43 (154)
T ss_pred             eeccccCCcceEeEe
Confidence            566666666665543


No 110
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=63.40  E-value=8  Score=33.65  Aligned_cols=34  Identities=26%  Similarity=0.518  Sum_probs=24.7

Q ss_pred             CCCceeccccccccCCHHHHHHHHHhcCCCCCccCCCCccccc
Q psy4572          71 QEKTFPCTVCLKMFKTKAQREEHIASHVTDSEFKCDQCPYVAK  113 (345)
Q Consensus        71 ~~k~~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~~~f~  113 (345)
                      ...-|.|+.|+..|+....+.         ..|.|+.||....
T Consensus       114 ~~~~Y~Cp~C~~rytf~eA~~---------~~F~Cp~Cg~~L~  147 (178)
T PRK06266        114 NNMFFFCPNCHIRFTFDEAME---------YGFRCPQCGEMLE  147 (178)
T ss_pred             CCCEEECCCCCcEEeHHHHhh---------cCCcCCCCCCCCe
Confidence            345688999998887776542         3589999987543


No 111
>KOG2186|consensus
Probab=63.05  E-value=4.7  Score=36.73  Aligned_cols=47  Identities=30%  Similarity=0.525  Sum_probs=38.9

Q ss_pred             ceeccccccccCCHHHHHHHHHhcCCCCCccCCCCcccccCHHHHHHHHH
Q psy4572          74 TFPCTVCLKMFKTKAQREEHIASHVTDSEFKCDQCPYVAKRIKYLRYHMK  123 (345)
Q Consensus        74 ~~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~~~f~~~~~L~~H~~  123 (345)
                      -|.|..||...... .+.+|+.+.++ .-|.|-.|++.|.+ .....|..
T Consensus         3 ~FtCnvCgEsvKKp-~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~k   49 (276)
T KOG2186|consen    3 FFTCNVCGESVKKP-QVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTK   49 (276)
T ss_pred             EEehhhhhhhcccc-chHHHHHhccC-CeeEEeeccccccc-chhhhhhh
Confidence            38899999887654 58889988877 67999999999998 67788865


No 112
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=62.79  E-value=4.2  Score=28.51  Aligned_cols=7  Identities=57%  Similarity=1.654  Sum_probs=3.4

Q ss_pred             eecCccc
Q psy4572         190 YQCELCG  196 (345)
Q Consensus       190 y~C~~Cg  196 (345)
                      |.|+.||
T Consensus        49 Y~CP~CG   55 (59)
T PRK14890         49 YTCPKCG   55 (59)
T ss_pred             eECCCCC
Confidence            4444444


No 113
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=62.31  E-value=3.9  Score=31.01  Aligned_cols=12  Identities=33%  Similarity=1.046  Sum_probs=8.0

Q ss_pred             eeecCccccccC
Q psy4572         189 HYQCELCGKSFN  200 (345)
Q Consensus       189 ~y~C~~Cgk~F~  200 (345)
                      .+.|..||..|.
T Consensus        53 IW~C~kCg~~fA   64 (89)
T COG1997          53 IWKCRKCGAKFA   64 (89)
T ss_pred             eEEcCCCCCeec
Confidence            467777777664


No 114
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=61.82  E-value=4.8  Score=34.57  Aligned_cols=10  Identities=30%  Similarity=0.654  Sum_probs=4.5

Q ss_pred             CCccccCCCC
Q psy4572         128 EYSAYCEPCQ  137 (345)
Q Consensus       128 ~~~~~C~~C~  137 (345)
                      +.|..||.||
T Consensus       147 e~P~~CPiCg  156 (166)
T COG1592         147 EAPEVCPICG  156 (166)
T ss_pred             CCCCcCCCCC
Confidence            3444444444


No 115
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=61.61  E-value=2.4  Score=28.04  Aligned_cols=12  Identities=42%  Similarity=0.855  Sum_probs=6.5

Q ss_pred             eeccccccccCC
Q psy4572          47 HKCEHCEKEFEN   58 (345)
Q Consensus        47 ~~C~~C~k~F~~   58 (345)
                      |.|+.||..|..
T Consensus         4 y~C~~CG~~~~~   15 (46)
T PRK00398          4 YKCARCGREVEL   15 (46)
T ss_pred             EECCCCCCEEEE
Confidence            556666655443


No 116
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=61.51  E-value=6.5  Score=22.72  Aligned_cols=21  Identities=14%  Similarity=0.241  Sum_probs=15.7

Q ss_pred             eecCccccccCCHHHHHHHHHH
Q psy4572         190 YQCELCGKSFNNKQTFSLSNMY  211 (345)
Q Consensus       190 y~C~~Cgk~F~~~~~L~~H~~~  211 (345)
                      ..||+|++.+ ....+.+|+..
T Consensus         2 v~CPiC~~~v-~~~~in~HLD~   22 (26)
T smart00734        2 VQCPVCFREV-PENLINSHLDS   22 (26)
T ss_pred             CcCCCCcCcc-cHHHHHHHHHH
Confidence            3588888888 56778888764


No 117
>PF05443 ROS_MUCR:  ROS/MUCR transcriptional regulator protein;  InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=60.70  E-value=6.5  Score=32.48  Aligned_cols=27  Identities=33%  Similarity=0.559  Sum_probs=17.2

Q ss_pred             CCceeccccccccCCHHHHHHHHHHcCCCC
Q psy4572          44 DGPHKCEHCEKEFENRRALKRHEKNHTQEK   73 (345)
Q Consensus        44 ~~~~~C~~C~k~F~~~~~L~~H~~~h~~~k   73 (345)
                      +....|-+||+.|..   |++|++.|+|-.
T Consensus        70 ~d~i~clecGk~~k~---LkrHL~~~~glt   96 (132)
T PF05443_consen   70 PDYIICLECGKKFKT---LKRHLRTHHGLT   96 (132)
T ss_dssp             SS-EE-TBT--EESB---HHHHHHHTT-S-
T ss_pred             cCeeEEccCCcccch---HHHHHHHccCCC
Confidence            345789999999988   589999997754


No 118
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=59.63  E-value=5.4  Score=34.25  Aligned_cols=11  Identities=36%  Similarity=0.930  Sum_probs=6.0

Q ss_pred             CCCCCCCcchh
Q psy4572         155 GAQPYQCETCR  165 (345)
Q Consensus       155 ~~k~y~C~~Cg  165 (345)
                      |+.|-+||.||
T Consensus       146 ge~P~~CPiCg  156 (166)
T COG1592         146 GEAPEVCPICG  156 (166)
T ss_pred             CCCCCcCCCCC
Confidence            34555566665


No 119
>PF14353 CpXC:  CpXC protein
Probab=59.19  E-value=6.9  Score=31.81  Aligned_cols=20  Identities=20%  Similarity=0.604  Sum_probs=12.5

Q ss_pred             eeecCccccccCCHHHHHHH
Q psy4572         189 HYQCELCGKSFNNKQTFSLS  208 (345)
Q Consensus       189 ~y~C~~Cgk~F~~~~~L~~H  208 (345)
                      .|.||.||..|.-...+.-|
T Consensus        38 ~~~CP~Cg~~~~~~~p~lY~   57 (128)
T PF14353_consen   38 SFTCPSCGHKFRLEYPLLYH   57 (128)
T ss_pred             EEECCCCCCceecCCCEEEE
Confidence            47777777777655444444


No 120
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=59.17  E-value=7.1  Score=26.21  Aligned_cols=26  Identities=15%  Similarity=0.356  Sum_probs=18.7

Q ss_pred             eeecCccccccCCH-----HHHHHHHHHhcC
Q psy4572         189 HYQCELCGKSFNNK-----QTFSLSNMYVLS  214 (345)
Q Consensus       189 ~y~C~~Cgk~F~~~-----~~L~~H~~~~H~  214 (345)
                      .-.|..|++.+...     +.|.+|++..|+
T Consensus        18 ~a~C~~C~~~l~~~~~~gTs~L~rHl~~~h~   48 (50)
T smart00614       18 RAKCKYCGKKLSRSSKGGTSNLRRHLRRKHP   48 (50)
T ss_pred             EEEecCCCCEeeeCCCCCcHHHHHHHHhHCc
Confidence            45688888877655     588888885543


No 121
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=57.73  E-value=4.8  Score=33.08  Aligned_cols=25  Identities=32%  Similarity=0.365  Sum_probs=19.7

Q ss_pred             CceeccccccccCCHHHHHHHHHHcCCC
Q psy4572          45 GPHKCEHCEKEFENRRALKRHEKNHTQE   72 (345)
Q Consensus        45 ~~~~C~~C~k~F~~~~~L~~H~~~h~~~   72 (345)
                      .-..|-++|+.|++   |++|+.+|++-
T Consensus        75 D~IicLEDGkkfKS---LKRHL~t~~gm   99 (148)
T COG4957          75 DYIICLEDGKKFKS---LKRHLTTHYGL   99 (148)
T ss_pred             CeEEEeccCcchHH---HHHHHhcccCC
Confidence            34679999998876   88899888764


No 122
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=57.70  E-value=14  Score=41.22  Aligned_cols=10  Identities=30%  Similarity=0.919  Sum_probs=7.5

Q ss_pred             ceeccccccc
Q psy4572          46 PHKCEHCEKE   55 (345)
Q Consensus        46 ~~~C~~C~k~   55 (345)
                      .++|+.||..
T Consensus       667 ~rkCPkCG~~  676 (1337)
T PRK14714        667 RRRCPSCGTE  676 (1337)
T ss_pred             EEECCCCCCc
Confidence            3788888874


No 123
>KOG2593|consensus
Probab=56.54  E-value=10  Score=37.21  Aligned_cols=37  Identities=30%  Similarity=0.621  Sum_probs=25.3

Q ss_pred             CCCCCCCcchhhccCCHHHHHHHHhhhCCCCcCceeecCcccc
Q psy4572         155 GAQPYQCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCELCGK  197 (345)
Q Consensus       155 ~~k~y~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~Cgk  197 (345)
                      ....|.|+.|.+.|+....++-=-.      ....|.|..|+-
T Consensus       125 ~~~~Y~Cp~C~kkyt~Lea~~L~~~------~~~~F~C~~C~g  161 (436)
T KOG2593|consen  125 NVAGYVCPNCQKKYTSLEALQLLDN------ETGEFHCENCGG  161 (436)
T ss_pred             ccccccCCccccchhhhHHHHhhcc------cCceEEEecCCC
Confidence            4566999999999988776543211      223589999973


No 124
>PF09723 Zn-ribbon_8:  Zinc ribbon domain;  InterPro: IPR013429  This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=56.16  E-value=2.7  Score=27.28  Aligned_cols=12  Identities=50%  Similarity=1.090  Sum_probs=6.9

Q ss_pred             eeccccccccCC
Q psy4572          47 HKCEHCEKEFEN   58 (345)
Q Consensus        47 ~~C~~C~k~F~~   58 (345)
                      |+|..||..|..
T Consensus         6 y~C~~Cg~~fe~   17 (42)
T PF09723_consen    6 YRCEECGHEFEV   17 (42)
T ss_pred             EEeCCCCCEEEE
Confidence            556666655554


No 125
>PF04959 ARS2:  Arsenite-resistance protein 2;  InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=54.86  E-value=4.3  Score=36.41  Aligned_cols=32  Identities=22%  Similarity=0.456  Sum_probs=21.3

Q ss_pred             CCCCCceeccccccccCCHHHHHHHHHHcCCC
Q psy4572          41 QKLDGPHKCEHCEKEFENRRALKRHEKNHTQE   72 (345)
Q Consensus        41 ~~~~~~~~C~~C~k~F~~~~~L~~H~~~h~~~   72 (345)
                      ...+..|.|+.|+|.|....-..+|+...|.+
T Consensus        72 e~~~~K~~C~lc~KlFkg~eFV~KHI~nKH~e  103 (214)
T PF04959_consen   72 EEDEDKWRCPLCGKLFKGPEFVRKHIFNKHPE  103 (214)
T ss_dssp             SSSSEEEEE-SSS-EESSHHHHHHHHHHH-HH
T ss_pred             HHcCCEECCCCCCcccCChHHHHHHHhhcCHH
Confidence            34455688888888888888888888776654


No 126
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=52.28  E-value=8.1  Score=26.09  Aligned_cols=10  Identities=40%  Similarity=1.006  Sum_probs=4.7

Q ss_pred             eecccccccc
Q psy4572          47 HKCEHCEKEF   56 (345)
Q Consensus        47 ~~C~~C~k~F   56 (345)
                      |.|..||+.|
T Consensus         7 Y~C~~Cg~~~   16 (49)
T COG1996           7 YKCARCGREV   16 (49)
T ss_pred             EEhhhcCCee
Confidence            4444444444


No 127
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=51.48  E-value=2  Score=44.76  Aligned_cols=28  Identities=29%  Similarity=0.631  Sum_probs=15.7

Q ss_pred             cCCCCCCcCCHHHHHHHHHHhcCCCCCCCcchhh
Q psy4572         133 CEPCQEGFFSRDKLETHNIAKHGAQPYQCETCRK  166 (345)
Q Consensus       133 C~~C~k~F~~~~~l~~H~~~hh~~k~y~C~~Cgk  166 (345)
                      |+.|.+.|.++.+.+-|      .+|..|+.||-
T Consensus       154 C~~C~~EY~dP~nRRfH------AQp~aCp~CGP  181 (750)
T COG0068         154 CPFCDKEYKDPLNRRFH------AQPIACPKCGP  181 (750)
T ss_pred             CHHHHHHhcCccccccc------cccccCcccCC
Confidence            66666665555554433      44556666664


No 128
>PF15135 UPF0515:  Uncharacterised protein UPF0515
Probab=51.03  E-value=10  Score=34.53  Aligned_cols=74  Identities=15%  Similarity=0.321  Sum_probs=43.5

Q ss_pred             CHHHHHHHHHHcCCC-----CceeccccccccCCHHHHHHHHHhcCCCCCccCCCCcccccCHHHHHHHHHHhcCCCccc
Q psy4572          58 NRRALKRHEKNHTQE-----KTFPCTVCLKMFKTKAQREEHIASHVTDSEFKCDQCPYVAKRIKYLRYHMKKHRKEYSAY  132 (345)
Q Consensus        58 ~~~~L~~H~~~h~~~-----k~~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~  132 (345)
                      +...|+.+.+.+.+.     +.|.|..|....  .   +   +.-...+.-+|..|.+.|.-...     ..-.|..-|.
T Consensus        91 Te~Nlrm~d~a~~~~ip~~drqFaC~~Cd~~W--w---R---rvp~rKeVSRCr~C~~rYDPVP~-----dkmwG~aef~  157 (278)
T PF15135_consen   91 TEENLRMFDDAQENLIPSVDRQFACSSCDHMW--W---R---RVPQRKEVSRCRKCRKRYDPVPC-----DKMWGIAEFH  157 (278)
T ss_pred             hHHHHHHhhhhhhccccccceeeeccccchHH--H---h---ccCcccccccccccccccCCCcc-----ccccceeeee
Confidence            445565555554443     679999996432  1   1   11122344689999887754321     1123555688


Q ss_pred             cCCCCCCcCCHH
Q psy4572         133 CEPCQEGFFSRD  144 (345)
Q Consensus       133 C~~C~k~F~~~~  144 (345)
                      |+.|+..|+...
T Consensus       158 C~~C~h~F~G~~  169 (278)
T PF15135_consen  158 CPKCRHNFRGFA  169 (278)
T ss_pred             cccccccchhhh
Confidence            999999998653


No 129
>PF02176 zf-TRAF:  TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=50.49  E-value=6.7  Score=27.06  Aligned_cols=39  Identities=26%  Similarity=0.679  Sum_probs=17.4

Q ss_pred             ceeccc--cccccCCHHHHHHHHHHcCCCCceeccc----cccccC
Q psy4572          46 PHKCEH--CEKEFENRRALKRHEKNHTQEKTFPCTV----CLKMFK   85 (345)
Q Consensus        46 ~~~C~~--C~k~F~~~~~L~~H~~~h~~~k~~~C~~----C~k~F~   85 (345)
                      +..|+.  |...+. +..|..|....-..++..|+.    |+..+.
T Consensus         9 ~v~C~~~cc~~~i~-r~~l~~H~~~~C~~~~v~C~~~~~GC~~~~~   53 (60)
T PF02176_consen    9 PVPCPNGCCNEMIP-RKELDDHLENECPKRPVPCPYSPYGCKERVP   53 (60)
T ss_dssp             EEE-TT--S-BEEE-CCCHHHHHHTTSTTSEEE-SS----S--EEE
T ss_pred             EeeCCCCCccccee-HHHHHHHHHccCCCCcEECCCCCCCCCCccc
Confidence            455655  333333 345666666444455566666    655543


No 130
>PRK14873 primosome assembly protein PriA; Provisional
Probab=49.47  E-value=11  Score=39.78  Aligned_cols=8  Identities=25%  Similarity=0.966  Sum_probs=4.5

Q ss_pred             eccccccc
Q psy4572          48 KCEHCEKE   55 (345)
Q Consensus        48 ~C~~C~k~   55 (345)
                      .|..||..
T Consensus       385 ~C~~Cg~~  392 (665)
T PRK14873        385 ACARCRTP  392 (665)
T ss_pred             EhhhCcCe
Confidence            56666554


No 131
>PF12760 Zn_Tnp_IS1595:  Transposase zinc-ribbon domain;  InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=47.66  E-value=15  Score=24.18  Aligned_cols=10  Identities=40%  Similarity=1.022  Sum_probs=6.8

Q ss_pred             ceeecCcccc
Q psy4572         188 LHYQCELCGK  197 (345)
Q Consensus       188 k~y~C~~Cgk  197 (345)
                      ..|+|..|++
T Consensus        36 ~~~~C~~C~~   45 (46)
T PF12760_consen   36 GRYRCKACRK   45 (46)
T ss_pred             CeEECCCCCC
Confidence            4577777765


No 132
>PF03604 DNA_RNApol_7kD:  DNA directed RNA polymerase, 7 kDa subunit;  InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=47.38  E-value=12  Score=22.87  Aligned_cols=9  Identities=33%  Similarity=0.851  Sum_probs=3.9

Q ss_pred             ecccccccc
Q psy4572          48 KCEHCEKEF   56 (345)
Q Consensus        48 ~C~~C~k~F   56 (345)
                      .|..|+..+
T Consensus         2 ~C~~Cg~~~   10 (32)
T PF03604_consen    2 ICGECGAEV   10 (32)
T ss_dssp             BESSSSSSE
T ss_pred             CCCcCCCee
Confidence            344444443


No 133
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=47.35  E-value=8.1  Score=30.58  Aligned_cols=27  Identities=22%  Similarity=0.161  Sum_probs=17.3

Q ss_pred             CCcchhhccCCHHHHHHHHhhhCCCCcCceeecCccccccC
Q psy4572         160 QCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCELCGKSFN  200 (345)
Q Consensus       160 ~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~Cgk~F~  200 (345)
                      .|+.||+.|-..              .+.|..||.||++|.
T Consensus        11 idPetg~KFYDL--------------NrdPiVsPytG~s~P   37 (129)
T COG4530          11 IDPETGKKFYDL--------------NRDPIVSPYTGKSYP   37 (129)
T ss_pred             cCccccchhhcc--------------CCCccccCcccccch
Confidence            467777766543              345677777777773


No 134
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=47.27  E-value=11  Score=35.46  Aligned_cols=50  Identities=20%  Similarity=0.355  Sum_probs=33.2

Q ss_pred             CCCcchhhccCCHHHHHHHHhhhCCCCcCceeecCccccccCCHHHHHHHHHHhc
Q psy4572         159 YQCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCELCGKSFNNKQTFSLSNMYVL  213 (345)
Q Consensus       159 y~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~Cgk~F~~~~~L~~H~~~~H  213 (345)
                      -.|-.|.-.|.....   |.....  .....|+|+.|...|-..-+...|...|-
T Consensus       363 ~~Cf~CQ~~fp~~~~---~~~~~~--~ss~rY~Ce~CK~~FC~dCdvfiHe~Lh~  412 (421)
T COG5151         363 THCFVCQGPFPKPPV---SPFDES--TSSGRYQCELCKSTFCSDCDVFIHETLHF  412 (421)
T ss_pred             ccceeccCCCCCCCC---Cccccc--ccccceechhhhhhhhhhhHHHHHHHHhh
Confidence            348888877765421   111111  11235999999999999999999987764


No 135
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=45.98  E-value=26  Score=39.17  Aligned_cols=63  Identities=11%  Similarity=0.163  Sum_probs=34.1

Q ss_pred             eeecCccccccCCHHHHHHHHHHhcCCCCcccccccceeeechhhHHHHHHhHhhhcccCCeEEEeec
Q psy4572         189 HYQCELCGKSFNNKQTFSLSNMYVLSLSNMYVLSSSNMYVLSLSNMYVLSLSKVRLNSSSNMYVLSLS  256 (345)
Q Consensus       189 ~y~C~~Cgk~F~~~~~L~~H~~~~H~~~~~~~~s~~~~~~~~l~~~~~~~~s~~~~~~~~~~~~~~~~  256 (345)
                      +|.|+.||........-    ...++.|+... .......+++.+.+...+..+..+..+=--|-||.
T Consensus       692 vy~CPsCGaev~~des~----a~~CP~CGtpl-v~~~~~~i~~~~~~~~A~~~~g~~~~~~~~~KgVk  754 (1337)
T PRK14714        692 VYVCPDCGAEVPPDESG----RVECPRCDVEL-TPYQRRTINVKEEYRSALENVGERENAFDLLKGVK  754 (1337)
T ss_pred             ceeCccCCCccCCCccc----cccCCCCCCcc-cccceEEecHHHHHHHHHHHhCcccccccccccce
Confidence            36677776654332110    33345444432 23345577888888888887777665433344443


No 136
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=45.59  E-value=14  Score=30.38  Aligned_cols=26  Identities=19%  Similarity=0.184  Sum_probs=20.4

Q ss_pred             eecCccccccCCHHHHHHHHHHhcCCCCc
Q psy4572         190 YQCELCGKSFNNKQTFSLSNMYVLSLSNM  218 (345)
Q Consensus       190 y~C~~Cgk~F~~~~~L~~H~~~~H~~~~~  218 (345)
                      ..|-.||+.|   ..|++|+.+||+..-.
T Consensus        77 IicLEDGkkf---KSLKRHL~t~~gmTPd  102 (148)
T COG4957          77 IICLEDGKKF---KSLKRHLTTHYGLTPD  102 (148)
T ss_pred             EEEeccCcch---HHHHHHHhcccCCCHH
Confidence            4688899888   4688999988887544


No 137
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=45.28  E-value=4.3  Score=29.79  Aligned_cols=8  Identities=25%  Similarity=0.978  Sum_probs=3.8

Q ss_pred             eccccccc
Q psy4572          48 KCEHCEKE   55 (345)
Q Consensus        48 ~C~~C~k~   55 (345)
                      .|+.||..
T Consensus         3 ~CP~Cg~~   10 (72)
T PRK09678          3 HCPLCQHA   10 (72)
T ss_pred             cCCCCCCc
Confidence            45555543


No 138
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=44.88  E-value=13  Score=37.72  Aligned_cols=9  Identities=33%  Similarity=0.659  Sum_probs=4.5

Q ss_pred             CccCCCCcc
Q psy4572         102 EFKCDQCPY  110 (345)
Q Consensus       102 ~~~C~~C~~  110 (345)
                      +..|+.|+.
T Consensus       253 ~~~Cp~C~s  261 (505)
T TIGR00595       253 PKTCPQCGS  261 (505)
T ss_pred             CCCCCCCCC
Confidence            345555543


No 139
>KOG2593|consensus
Probab=44.61  E-value=22  Score=35.06  Aligned_cols=39  Identities=21%  Similarity=0.457  Sum_probs=26.3

Q ss_pred             CCCCceeccccccccCCHHHHHHHHHhcCCCCCccCCCCccc
Q psy4572          70 TQEKTFPCTVCLKMFKTKAQREEHIASHVTDSEFKCDQCPYV  111 (345)
Q Consensus        70 ~~~k~~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~~~  111 (345)
                      +....|.|+.|.+.|+....++   ..-.....|.|..|+..
T Consensus       124 t~~~~Y~Cp~C~kkyt~Lea~~---L~~~~~~~F~C~~C~ge  162 (436)
T KOG2593|consen  124 TNVAGYVCPNCQKKYTSLEALQ---LLDNETGEFHCENCGGE  162 (436)
T ss_pred             cccccccCCccccchhhhHHHH---hhcccCceEEEecCCCc
Confidence            4456699999999987765443   22333456899988753


No 140
>PF03833 PolC_DP2:  DNA polymerase II large subunit DP2;  InterPro: IPR016033 DP2 is the large subunit of a two-subunit novel archaebacterial replicative DNA polymerase first characterised for Pyrococcus furiosus. The structure of DP2 appears to be organised as a ~950 residue component separated from a ~300 residue component by a ~150 residue intein. The other subunit, DP1, has sequence similarity to the eukaryotic DNA polymerase delta small subunit. This entry represents the N-terminal ~950 residue component of DP2.; GO: 0003887 DNA-directed DNA polymerase activity; PDB: 3O59_X.
Probab=44.60  E-value=7.3  Score=41.51  Aligned_cols=11  Identities=45%  Similarity=1.153  Sum_probs=0.0

Q ss_pred             Cceeccccccc
Q psy4572          45 GPHKCEHCEKE   55 (345)
Q Consensus        45 ~~~~C~~C~k~   55 (345)
                      ...+|+.||..
T Consensus       654 ~~r~Cp~Cg~~  664 (900)
T PF03833_consen  654 GRRRCPKCGKE  664 (900)
T ss_dssp             -----------
T ss_pred             ecccCcccCCc
Confidence            34567777653


No 141
>PF06524 NOA36:  NOA36 protein;  InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=44.53  E-value=8.4  Score=35.29  Aligned_cols=90  Identities=20%  Similarity=0.407  Sum_probs=45.9

Q ss_pred             cCCCCCccCCCCcccccCHHHHHHHHHHhc--CCCccccCCCCCCcCCHHHHHHHHHHhcCCCCCCCcchhhccCCHHHH
Q psy4572          97 HVTDSEFKCDQCPYVAKRIKYLRYHMKKHR--KEYSAYCEPCQEGFFSRDKLETHNIAKHGAQPYQCETCRKLFLSKYSL  174 (345)
Q Consensus        97 h~~~k~~~C~~C~~~f~~~~~L~~H~~~h~--~~~~~~C~~C~k~F~~~~~l~~H~~~hh~~k~y~C~~Cgk~F~s~s~L  174 (345)
                      .||.+.|+|..|....---. --.|+....  ....|+|.-|++.                 ..|.|-.|.-.|-.    
T Consensus       137 ~hGGrif~CsfC~~flCEDD-QFEHQAsCQvLe~E~~KC~SCNrl-----------------Gq~sCLRCK~cfCd----  194 (314)
T PF06524_consen  137 DHGGRIFKCSFCDNFLCEDD-QFEHQASCQVLESETFKCQSCNRL-----------------GQYSCLRCKICFCD----  194 (314)
T ss_pred             cCCCeEEEeecCCCeeeccc-hhhhhhhhhhhhcccccccccccc-----------------cchhhhheeeeehh----
Confidence            45778899999975433333 334544321  2345777777653                 12334444433432    


Q ss_pred             HHHHhhh-CCCCcCceeecCccccccCCHHHHHHHH
Q psy4572         175 FTHNKKH-LPIEERLHYQCELCGKSFNNKQTFSLSN  209 (345)
Q Consensus       175 ~~H~~~H-~~~~~~k~y~C~~Cgk~F~~~~~L~~H~  209 (345)
                       .|.+.. ...+..+++.||.||........|..-.
T Consensus       195 -dHvrrKg~ky~k~k~~PCPKCg~et~eTkdLSmSt  229 (314)
T PF06524_consen  195 -DHVRRKGFKYEKGKPIPCPKCGYETQETKDLSMST  229 (314)
T ss_pred             -hhhhhcccccccCCCCCCCCCCCcccccccceeee
Confidence             233321 1112335677777776665555554443


No 142
>KOG0782|consensus
Probab=43.12  E-value=8.2  Score=39.20  Aligned_cols=58  Identities=26%  Similarity=0.502  Sum_probs=40.8

Q ss_pred             HHHHHHHHHHhcCCCCCCCcchhhccCCHHHHHHHHhhhCCCCcCceeecCccccccCCHH-HHHHH
Q psy4572         143 RDKLETHNIAKHGAQPYQCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCELCGKSFNNKQ-TFSLS  208 (345)
Q Consensus       143 ~~~l~~H~~~hh~~k~y~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~Cgk~F~~~~-~L~~H  208 (345)
                      ...+.+|...|.....=+|..|||.|..+..+      |.  ++-....|.+|.+.|..+. .|..|
T Consensus       238 E~~fvrHHWVHrrRqeGkC~~CgKgFQQKf~F------hs--KEivAisCSWCKqayH~KvtCFml~  296 (1004)
T KOG0782|consen  238 ESGFVRHHWVHRRRQEGKCNTCGKGFQQKFFF------HS--KEIVAISCSWCKQAYHLKVTCFMLD  296 (1004)
T ss_pred             cccchHHhHhhHhhhccccchhhhhhhhheee------cc--ccEEEEEehHHHHHhhcchhhhhhh
Confidence            34677787777666666899999999877554      21  1233578999999998876 44444


No 143
>PF08209 Sgf11:  Sgf11 (transcriptional regulation protein);  InterPro: IPR013246 The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae (Baker's yeast). The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation [].; PDB: 3M99_B 2LO2_A 3MHH_C 3MHS_C.
Probab=42.06  E-value=18  Score=22.31  Aligned_cols=28  Identities=14%  Similarity=0.325  Sum_probs=18.4

Q ss_pred             eeecCccccccCCHHHHHHHHHHhcCCCC
Q psy4572         189 HYQCELCGKSFNNKQTFSLSNMYVLSLSN  217 (345)
Q Consensus       189 ~y~C~~Cgk~F~~~~~L~~H~~~~H~~~~  217 (345)
                      .+.|+.|++.+. ...+..|+...-+...
T Consensus         4 ~~~C~nC~R~v~-a~RfA~HLekCmg~g~   31 (33)
T PF08209_consen    4 YVECPNCGRPVA-ASRFAPHLEKCMGRGR   31 (33)
T ss_dssp             EEE-TTTSSEEE-GGGHHHHHHHHTCH--
T ss_pred             eEECCCCcCCcc-hhhhHHHHHHHHccCC
Confidence            478888888775 5678888877766443


No 144
>KOG3408|consensus
Probab=41.99  E-value=34  Score=27.71  Aligned_cols=27  Identities=33%  Similarity=0.541  Sum_probs=22.5

Q ss_pred             CCCCceeccccccccCCHHHHHHHHHH
Q psy4572          42 KLDGPHKCEHCEKEFENRRALKRHEKN   68 (345)
Q Consensus        42 ~~~~~~~C~~C~k~F~~~~~L~~H~~~   68 (345)
                      .+-+.|.|-.|.+.|.+...|..|.++
T Consensus        53 PG~GqfyCi~CaRyFi~~~~l~~H~kt   79 (129)
T KOG3408|consen   53 PGGGQFYCIECARYFIDAKALKTHFKT   79 (129)
T ss_pred             CCCceeehhhhhhhhcchHHHHHHHhc
Confidence            445679999999999999999999775


No 145
>PHA00626 hypothetical protein
Probab=41.68  E-value=13  Score=25.71  Aligned_cols=16  Identities=25%  Similarity=0.451  Sum_probs=12.8

Q ss_pred             CceeecCccccccCCH
Q psy4572         187 RLHYQCELCGKSFNNK  202 (345)
Q Consensus       187 ~k~y~C~~Cgk~F~~~  202 (345)
                      ...|.|+.||+.|+..
T Consensus        21 snrYkCkdCGY~ft~~   36 (59)
T PHA00626         21 SDDYVCCDCGYNDSKD   36 (59)
T ss_pred             CcceEcCCCCCeechh
Confidence            3469999999999743


No 146
>COG1571 Predicted DNA-binding protein containing a Zn-ribbon domain [General function prediction only]
Probab=41.21  E-value=11  Score=37.10  Aligned_cols=29  Identities=24%  Similarity=0.592  Sum_probs=19.4

Q ss_pred             CCcchhhccCCHHHHHHHHhhhCCCCcCceeecCccccccCCH
Q psy4572         160 QCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCELCGKSFNNK  202 (345)
Q Consensus       160 ~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~Cgk~F~~~  202 (345)
                      .|+.||.+..+.              ++.-|+|+.||+.+...
T Consensus       352 ~Cp~Cg~~m~S~--------------G~~g~rC~kCg~~~~~~  380 (421)
T COG1571         352 VCPRCGGRMKSA--------------GRNGFRCKKCGTRARET  380 (421)
T ss_pred             CCCccCCchhhc--------------CCCCcccccccccCCcc
Confidence            588888665443              33358888888877654


No 147
>PF06524 NOA36:  NOA36 protein;  InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=40.72  E-value=7  Score=35.79  Aligned_cols=26  Identities=19%  Similarity=0.386  Sum_probs=14.7

Q ss_pred             CCCCCCcchhhccCCHHHHHHHHhhh
Q psy4572         156 AQPYQCETCRKLFLSKYSLFTHNKKH  181 (345)
Q Consensus       156 ~k~y~C~~Cgk~F~s~s~L~~H~~~H  181 (345)
                      .+++.||.||........|..-.|+|
T Consensus       207 ~k~~PCPKCg~et~eTkdLSmStR~h  232 (314)
T PF06524_consen  207 GKPIPCPKCGYETQETKDLSMSTRSH  232 (314)
T ss_pred             CCCCCCCCCCCcccccccceeeeecc
Confidence            46677777776655555554444443


No 148
>smart00661 RPOL9 RNA polymerase subunit 9.
Probab=39.60  E-value=18  Score=24.13  Aligned_cols=12  Identities=25%  Similarity=0.706  Sum_probs=8.3

Q ss_pred             eeecCccccccC
Q psy4572         189 HYQCELCGKSFN  200 (345)
Q Consensus       189 ~y~C~~Cgk~F~  200 (345)
                      .|.|+.||+.+.
T Consensus        20 ~~vC~~Cg~~~~   31 (52)
T smart00661       20 RFVCRKCGYEEP   31 (52)
T ss_pred             EEECCcCCCeEE
Confidence            577888886544


No 149
>PF15269 zf-C2H2_7:  Zinc-finger
Probab=39.23  E-value=36  Score=22.47  Aligned_cols=23  Identities=26%  Similarity=0.382  Sum_probs=18.9

Q ss_pred             ceeccccccccCCHHHHHHHHHH
Q psy4572          46 PHKCEHCEKEFENRRALKRHEKN   68 (345)
Q Consensus        46 ~~~C~~C~k~F~~~~~L~~H~~~   68 (345)
                      .|+|-.|..+...++.|-.||+-
T Consensus        20 ~ykcfqcpftc~~kshl~nhmky   42 (54)
T PF15269_consen   20 KYKCFQCPFTCNEKSHLFNHMKY   42 (54)
T ss_pred             cceeecCCcccchHHHHHHHHHH
Confidence            47888888888888888888864


No 150
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=38.54  E-value=1.9  Score=44.91  Aligned_cols=56  Identities=20%  Similarity=0.395  Sum_probs=35.0

Q ss_pred             ccCCCCCCcCCHHHHHHHHHHhcCCCC-CCCcchhhccCCHHHHHHHHhhhCCCCcCceeecCcccc
Q psy4572         132 YCEPCQEGFFSRDKLETHNIAKHGAQP-YQCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCELCGK  197 (345)
Q Consensus       132 ~C~~C~k~F~~~~~l~~H~~~hh~~k~-y~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~Cgk  197 (345)
                      .|..||-+|+-...+---. ..+.-+. --|+.|.+.|....+-+-|.+         |.-|+.||=
T Consensus       125 ~CT~CGPRfTIi~alPYDR-~nTsM~~F~lC~~C~~EY~dP~nRRfHAQ---------p~aCp~CGP  181 (750)
T COG0068         125 NCTNCGPRFTIIEALPYDR-ENTSMADFPLCPFCDKEYKDPLNRRFHAQ---------PIACPKCGP  181 (750)
T ss_pred             ccCCCCcceeeeccCCCCc-ccCccccCcCCHHHHHHhcCccccccccc---------cccCcccCC
Confidence            4888888877544332111 1122222 359999999998877555533         577999985


No 151
>TIGR00244 transcriptional regulator NrdR. Members of this almost entirely bacterial family contain an ATP cone domain (PFAM:PF03477). There is never more than one member per genome. Common gene symbols given include nrdR, ybaD, ribX and ytcG. The member from Streptomyces coelicolor is found upstream in the operon of the class II oxygen-independent ribonucleotide reductase gene nrdJ and was shown to repress nrdJ expression. Many members of this family are found near genes for riboflavin biosynthesis in Gram-negative bacteria, suggesting a role in that pathway. However, a phylogenetic profiling study associates members of this family with the presence of a palindromic signal with consensus acaCwAtATaTwGtgt, termed the NrdR-box, an upstream element for most operons for ribonucleotide reductase of all three classes in bacterial genomes.
Probab=37.50  E-value=7.3  Score=32.66  Aligned_cols=16  Identities=19%  Similarity=0.405  Sum_probs=8.9

Q ss_pred             CCCcchhhccCCHHHH
Q psy4572         159 YQCETCRKLFLSKYSL  174 (345)
Q Consensus       159 y~C~~Cgk~F~s~s~L  174 (345)
                      -+|..||+.|++....
T Consensus        29 ReC~~C~~RFTTyErv   44 (147)
T TIGR00244        29 RECLECHERFTTFERA   44 (147)
T ss_pred             ccCCccCCccceeeec
Confidence            4566666666655433


No 152
>KOG1280|consensus
Probab=37.38  E-value=28  Score=33.34  Aligned_cols=25  Identities=16%  Similarity=0.323  Sum_probs=15.8

Q ss_pred             CCCCcchhhccCCHHHHHHHHhhhC
Q psy4572         158 PYQCETCRKLFLSKYSLFTHNKKHL  182 (345)
Q Consensus       158 ~y~C~~Cgk~F~s~s~L~~H~~~H~  182 (345)
                      .|.|++|++.-.+...+..|....|
T Consensus        79 SftCPyC~~~Gfte~~f~~Hv~s~H  103 (381)
T KOG1280|consen   79 SFTCPYCGIMGFTERQFGTHVLSQH  103 (381)
T ss_pred             cccCCcccccccchhHHHHHhhhcC
Confidence            4666666666666666666665544


No 153
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=37.11  E-value=17  Score=27.77  Aligned_cols=10  Identities=30%  Similarity=0.663  Sum_probs=4.4

Q ss_pred             ccCCCCCCcC
Q psy4572         132 YCEPCQEGFF  141 (345)
Q Consensus       132 ~C~~C~k~F~  141 (345)
                      .|..||..|.
T Consensus        60 ~CkkCGfef~   69 (97)
T COG3357          60 RCKKCGFEFR   69 (97)
T ss_pred             hhcccCcccc
Confidence            3444444443


No 154
>PF07754 DUF1610:  Domain of unknown function (DUF1610);  InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=36.84  E-value=22  Score=20.22  Aligned_cols=10  Identities=30%  Similarity=0.946  Sum_probs=7.3

Q ss_pred             ceeecCcccc
Q psy4572         188 LHYQCELCGK  197 (345)
Q Consensus       188 k~y~C~~Cgk  197 (345)
                      .+|.||.||+
T Consensus        15 v~f~CPnCG~   24 (24)
T PF07754_consen   15 VPFPCPNCGF   24 (24)
T ss_pred             ceEeCCCCCC
Confidence            4688888874


No 155
>PF13878 zf-C2H2_3:  zinc-finger of acetyl-transferase ESCO
Probab=36.19  E-value=34  Score=22.03  Aligned_cols=24  Identities=21%  Similarity=0.440  Sum_probs=16.8

Q ss_pred             eecCccccccC--CHHHHHHHHHHhc
Q psy4572         190 YQCELCGKSFN--NKQTFSLSNMYVL  213 (345)
Q Consensus       190 y~C~~Cgk~F~--~~~~L~~H~~~~H  213 (345)
                      -+|+.||-.|.  ....-..|.+.|-
T Consensus        14 ~~C~~CgM~Y~~~~~eD~~~H~~yH~   39 (41)
T PF13878_consen   14 TTCPTCGMLYSPGSPEDEKLHKKYHD   39 (41)
T ss_pred             cCCCCCCCEECCCCHHHHHHHHHHHh
Confidence            47888886664  5677777777764


No 156
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=36.13  E-value=14  Score=30.65  Aligned_cols=11  Identities=36%  Similarity=1.008  Sum_probs=5.2

Q ss_pred             eeccccccccC
Q psy4572          47 HKCEHCEKEFE   57 (345)
Q Consensus        47 ~~C~~C~k~F~   57 (345)
                      +.|+.||..|.
T Consensus        71 ~~C~~CG~~~~   81 (135)
T PRK03824         71 LKCRNCGNEWS   81 (135)
T ss_pred             EECCCCCCEEe
Confidence            44555554443


No 157
>KOG4167|consensus
Probab=35.86  E-value=11  Score=39.33  Aligned_cols=28  Identities=18%  Similarity=0.277  Sum_probs=25.2

Q ss_pred             eeecCccccccCCHHHHHHHHHHhcCCC
Q psy4572         189 HYQCELCGKSFNNKQTFSLSNMYVLSLS  216 (345)
Q Consensus       189 ~y~C~~Cgk~F~~~~~L~~H~~~~H~~~  216 (345)
                      .|.|..|+|.|-....+..|+++|--..
T Consensus       792 iFpCreC~kvF~KiKSrNAHMK~Hr~q~  819 (907)
T KOG4167|consen  792 IFPCRECGKVFFKIKSRNAHMKTHRQQE  819 (907)
T ss_pred             eeehHHHHHHHHHHhhhhHHHHHHHHHH
Confidence            4999999999999999999999987654


No 158
>PF08274 PhnA_Zn_Ribbon:  PhnA Zinc-Ribbon ;  InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=35.80  E-value=14  Score=22.19  Aligned_cols=9  Identities=22%  Similarity=0.604  Sum_probs=4.8

Q ss_pred             ceecccccc
Q psy4572          74 TFPCTVCLK   82 (345)
Q Consensus        74 ~~~C~~C~k   82 (345)
                      .+.|+.|+.
T Consensus        19 ~~vCp~C~~   27 (30)
T PF08274_consen   19 LLVCPECGH   27 (30)
T ss_dssp             SEEETTTTE
T ss_pred             EEeCCcccc
Confidence            355665553


No 159
>PF07975 C1_4:  TFIIH C1-like domain;  InterPro: IPR004595 All proteins in this domain for which functions are known are components of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. It includes the yeast transcription factor Ssl1 (Suppressor of stem-loop protein 1) that is essential for translation initiation and affects UV resistance. The C-terminal region is essential for transcription activity. This regions binds three zinc atoms through two independent domain. The first contains a C4 zinc finger motif, whereas the second is characterised by a CX(2)CX(2-4)FCADCD motif. The solution structure of the second C-terminal domain revealed homology with the regulatory domain of protein kinase C [].; GO: 0006281 DNA repair, 0005634 nucleus; PDB: 1Z60_A.
Probab=35.68  E-value=5.7  Score=27.10  Aligned_cols=20  Identities=30%  Similarity=0.850  Sum_probs=6.4

Q ss_pred             CCCCcchhhccCCHHHHHHH
Q psy4572         158 PYQCETCRKLFLSKYSLFTH  177 (345)
Q Consensus       158 ~y~C~~Cgk~F~s~s~L~~H  177 (345)
                      .|+|+.|+..|--.=....|
T Consensus        21 ~y~C~~C~~~FC~dCD~fiH   40 (51)
T PF07975_consen   21 RYRCPKCKNHFCIDCDVFIH   40 (51)
T ss_dssp             EE--TTTT--B-HHHHHTTT
T ss_pred             eEECCCCCCccccCcChhhh
Confidence            34555555444433333333


No 160
>PF05290 Baculo_IE-1:  Baculovirus immediate-early protein (IE-0);  InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=35.32  E-value=23  Score=29.21  Aligned_cols=56  Identities=20%  Similarity=0.399  Sum_probs=34.9

Q ss_pred             CCCccccCCCCCCcCCHHHHHHHHHHhcCCCCCCC---cchhhccCCHHHHHHHHhhhCCCCcCceeecCccccccCCHH
Q psy4572         127 KEYSAYCEPCQEGFFSRDKLETHNIAKHGAQPYQC---ETCRKLFLSKYSLFTHNKKHLPIEERLHYQCELCGKSFNNKQ  203 (345)
Q Consensus       127 ~~~~~~C~~C~k~F~~~~~l~~H~~~hh~~k~y~C---~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~Cgk~F~~~~  203 (345)
                      ..+.|+|.+|..+...+.-          -+|-+|   .+|+.-   -..|++|-..|.        .||.|..+|.+..
T Consensus        77 d~~lYeCnIC~etS~ee~F----------LKPneCCgY~iCn~C---ya~LWK~~~~yp--------vCPvCkTSFKss~  135 (140)
T PF05290_consen   77 DPKLYECNICKETSAEERF----------LKPNECCGYSICNAC---YANLWKFCNLYP--------VCPVCKTSFKSSS  135 (140)
T ss_pred             CCCceeccCcccccchhhc----------CCcccccchHHHHHH---HHHHHHHcccCC--------CCCcccccccccc
Confidence            4467888888877655422          344455   123322   456777766653        6999999987654


No 161
>PF09845 DUF2072:  Zn-ribbon containing protein (DUF2072);  InterPro: IPR018645  This archaeal Zinc-ribbon containing proteins have no known function. 
Probab=35.13  E-value=19  Score=29.56  Aligned_cols=15  Identities=27%  Similarity=0.704  Sum_probs=9.8

Q ss_pred             ceeccccccccCCHH
Q psy4572          74 TFPCTVCLKMFKTKA   88 (345)
Q Consensus        74 ~~~C~~C~k~F~~~~   88 (345)
                      |++|..||+.|....
T Consensus         1 PH~Ct~Cg~~f~dgs   15 (131)
T PF09845_consen    1 PHQCTKCGRVFEDGS   15 (131)
T ss_pred             CcccCcCCCCcCCCc
Confidence            456777777776544


No 162
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=35.00  E-value=18  Score=27.79  Aligned_cols=12  Identities=25%  Similarity=0.739  Sum_probs=7.8

Q ss_pred             eeecCccccccC
Q psy4572         189 HYQCELCGKSFN  200 (345)
Q Consensus       189 ~y~C~~Cgk~F~  200 (345)
                      .+.|..|++.|.
T Consensus        54 IW~C~~C~~~~A   65 (90)
T PTZ00255         54 IWRCKGCKKTVA   65 (90)
T ss_pred             EEEcCCCCCEEe
Confidence            466777776664


No 163
>KOG4124|consensus
Probab=34.84  E-value=5.6  Score=37.88  Aligned_cols=56  Identities=25%  Similarity=0.531  Sum_probs=40.5

Q ss_pred             CCCCCCcc--hhhccCCHHHHHHHHhhhCCC----------------CcCceeecCccccccCCHHHHHHHHHH
Q psy4572         156 AQPYQCET--CRKLFLSKYSLFTHNKKHLPI----------------EERLHYQCELCGKSFNNKQTFSLSNMY  211 (345)
Q Consensus       156 ~k~y~C~~--Cgk~F~s~s~L~~H~~~H~~~----------------~~~k~y~C~~Cgk~F~~~~~L~~H~~~  211 (345)
                      .++|+|++  |.+.++....+..|...-|..                ...|+|+|++|.+++.....|.-|...
T Consensus       347 ~~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i~~~s~~~~ph~~~~~~nk~~r~~i~~~~~k~~~~l~~~~~~  420 (442)
T KOG4124|consen  347 DKPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPITTPTPAPIPHQGFVVENKPYRCEVCSKRYKNLNGLKYHRTH  420 (442)
T ss_pred             cCCCCCCCCcchhhcccCcceeeccccCcCCCCCCCCCCCCCcceeeeccCcccChhhhhhhccCCCCCceeeh
Confidence            46788865  999999888888886543311                135789999999998877666666433


No 164
>PF01363 FYVE:  FYVE zinc finger;  InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=34.30  E-value=24  Score=25.06  Aligned_cols=10  Identities=40%  Similarity=0.870  Sum_probs=3.5

Q ss_pred             CCcchhhccC
Q psy4572         160 QCETCRKLFL  169 (345)
Q Consensus       160 ~C~~Cgk~F~  169 (345)
                      .|..|++.|.
T Consensus        11 ~C~~C~~~F~   20 (69)
T PF01363_consen   11 NCMICGKKFS   20 (69)
T ss_dssp             B-TTT--B-B
T ss_pred             cCcCcCCcCC
Confidence            4666666664


No 165
>PRK05580 primosome assembly protein PriA; Validated
Probab=34.20  E-value=26  Score=37.06  Aligned_cols=9  Identities=22%  Similarity=0.604  Sum_probs=5.1

Q ss_pred             CccCCCCcc
Q psy4572         102 EFKCDQCPY  110 (345)
Q Consensus       102 ~~~C~~C~~  110 (345)
                      +..|+.|+.
T Consensus       421 ~~~Cp~Cg~  429 (679)
T PRK05580        421 PKACPECGS  429 (679)
T ss_pred             CCCCCCCcC
Confidence            445666654


No 166
>PF04216 FdhE:  Protein involved in formate dehydrogenase formation;  InterPro: IPR006452 This family of sequences describe an accessory protein required for the assembly of formate dehydrogenase of certain proteobacteria although not present in the final complex []. The exact nature of the function of FdhE in the assembly of the complex is unknown, but considering the presence of selenocysteine, molybdopterin, iron-sulphur clusters and cytochrome b556, it is likely to be involved in the insertion of cofactors. ; GO: 0005737 cytoplasm; PDB: 2FIY_B.
Probab=33.85  E-value=4  Score=38.29  Aligned_cols=77  Identities=19%  Similarity=0.230  Sum_probs=30.2

Q ss_pred             ccCCCCcccccCHHHHHHHHHHhcCCCccccCCCCCCcCCHHHHHHHHHHhcCCCCCCCcchhhccCCHHHHHHHHhhhC
Q psy4572         103 FKCDQCPYVAKRIKYLRYHMKKHRKEYSAYCEPCQEGFFSRDKLETHNIAKHGAQPYQCETCRKLFLSKYSLFTHNKKHL  182 (345)
Q Consensus       103 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~k~F~~~~~l~~H~~~hh~~k~y~C~~Cgk~F~s~s~L~~H~~~H~  182 (345)
                      -.|+.||..-.-. .+..-..  .|.+-..|..|+....              -....|+.||..-...-....    ..
T Consensus       173 g~CPvCGs~P~~s-~l~~~~~--~G~R~L~Cs~C~t~W~--------------~~R~~Cp~Cg~~~~~~l~~~~----~e  231 (290)
T PF04216_consen  173 GYCPVCGSPPVLS-VLRGGER--EGKRYLHCSLCGTEWR--------------FVRIKCPYCGNTDHEKLEYFT----VE  231 (290)
T ss_dssp             SS-TTT---EEEE-EEE--------EEEEEETTT--EEE----------------TTS-TTT---SS-EEE---------
T ss_pred             CcCCCCCCcCceE-EEecCCC--CccEEEEcCCCCCeee--------------ecCCCCcCCCCCCCcceeeEe----cC
Confidence            4789998532111 0110000  2455667888876543              234579999976544322211    11


Q ss_pred             CCCcCceeecCccccccC
Q psy4572         183 PIEERLHYQCELCGKSFN  200 (345)
Q Consensus       183 ~~~~~k~y~C~~Cgk~F~  200 (345)
                      +..+.+-+.|..|+..+.
T Consensus       232 ~~~~~rve~C~~C~~YlK  249 (290)
T PF04216_consen  232 GEPAYRVEVCESCGSYLK  249 (290)
T ss_dssp             ---SEEEEEETTTTEEEE
T ss_pred             CCCcEEEEECCcccchHH
Confidence            122455688999986554


No 167
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=33.67  E-value=17  Score=25.05  Aligned_cols=30  Identities=17%  Similarity=0.431  Sum_probs=16.0

Q ss_pred             CCCcchhhccCCHHHHHHHHhhhCCCCcCceeecCcccccc
Q psy4572         159 YQCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCELCGKSF  199 (345)
Q Consensus       159 y~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~Cgk~F  199 (345)
                      ++|+.||..+.-...          ..+ ....|+.||..+
T Consensus         3 ~~CP~CG~~iev~~~----------~~G-eiV~Cp~CGael   32 (54)
T TIGR01206         3 FECPDCGAEIELENP----------ELG-ELVICDECGAEL   32 (54)
T ss_pred             cCCCCCCCEEecCCC----------ccC-CEEeCCCCCCEE
Confidence            567777765532211          111 246777777655


No 168
>TIGR00280 L37a ribosomal protein L37a. This model finds eukaryotic ribosomal protein L37a and its archaeal orthologs. The nomeclature is tricky because eukaryotes have proteins called both L37 and L37a.
Probab=33.50  E-value=17  Score=27.95  Aligned_cols=12  Identities=33%  Similarity=1.005  Sum_probs=8.1

Q ss_pred             eeecCccccccC
Q psy4572         189 HYQCELCGKSFN  200 (345)
Q Consensus       189 ~y~C~~Cgk~F~  200 (345)
                      .+.|..|++.|.
T Consensus        53 IW~C~~C~~~~A   64 (91)
T TIGR00280        53 IWTCRKCGAKFA   64 (91)
T ss_pred             EEEcCCCCCEEe
Confidence            467777777664


No 169
>PF07282 OrfB_Zn_ribbon:  Putative transposase DNA-binding domain;  InterPro: IPR010095 This entry represents a region of a sequence similarity between a family of putative transposases of Thermoanaerobacter tengcongensis, smaller related proteins from Bacillus anthracis, putative transposes described by IPR001959 from INTERPRO, and other proteins. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=33.49  E-value=35  Score=24.27  Aligned_cols=30  Identities=17%  Similarity=0.449  Sum_probs=18.1

Q ss_pred             CCcchhhccCCHHHHHHHHhhhCCCCcCceeecCccccccCCH
Q psy4572         160 QCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCELCGKSFNNK  202 (345)
Q Consensus       160 ~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~Cgk~F~~~  202 (345)
                      .|+.||..-..             ....+.|.|+.||..+...
T Consensus        30 ~C~~CG~~~~~-------------~~~~r~~~C~~Cg~~~~rD   59 (69)
T PF07282_consen   30 TCPRCGHRNKK-------------RRSGRVFTCPNCGFEMDRD   59 (69)
T ss_pred             CccCccccccc-------------ccccceEEcCCCCCEECcH
Confidence            47777755443             0133468888888776543


No 170
>PRK14559 putative protein serine/threonine phosphatase; Provisional
Probab=33.45  E-value=41  Score=35.36  Aligned_cols=8  Identities=38%  Similarity=0.982  Sum_probs=3.9

Q ss_pred             eccccccc
Q psy4572          48 KCEHCEKE   55 (345)
Q Consensus        48 ~C~~C~k~   55 (345)
                      .|+.||..
T Consensus         3 ~Cp~Cg~~   10 (645)
T PRK14559          3 ICPQCQFE   10 (645)
T ss_pred             cCCCCCCc
Confidence            35555543


No 171
>KOG2907|consensus
Probab=33.44  E-value=24  Score=28.13  Aligned_cols=13  Identities=31%  Similarity=0.856  Sum_probs=10.1

Q ss_pred             eeecCccccccCC
Q psy4572         189 HYQCELCGKSFNN  201 (345)
Q Consensus       189 ~y~C~~Cgk~F~~  201 (345)
                      -|.|+.|++.|+.
T Consensus       102 FYTC~kC~~k~~e  114 (116)
T KOG2907|consen  102 FYTCPKCKYKFTE  114 (116)
T ss_pred             EEEcCccceeeec
Confidence            4788888888864


No 172
>COG1327 Predicted transcriptional regulator, consists of a Zn-ribbon and ATP-cone domains [Transcription]
Probab=33.14  E-value=10  Score=31.92  Aligned_cols=15  Identities=20%  Similarity=0.423  Sum_probs=9.5

Q ss_pred             CCCcchhhccCCHHH
Q psy4572         159 YQCETCRKLFLSKYS  173 (345)
Q Consensus       159 y~C~~Cgk~F~s~s~  173 (345)
                      -.|+.||+.|++...
T Consensus        29 ReC~~C~~RFTTfE~   43 (156)
T COG1327          29 RECLECGERFTTFER   43 (156)
T ss_pred             hcccccccccchhhe
Confidence            457777777776543


No 173
>PRK14873 primosome assembly protein PriA; Provisional
Probab=32.63  E-value=12  Score=39.57  Aligned_cols=6  Identities=33%  Similarity=1.270  Sum_probs=2.9

Q ss_pred             cCCCCc
Q psy4572         104 KCDQCP  109 (345)
Q Consensus       104 ~C~~C~  109 (345)
                      .|..||
T Consensus       385 ~C~~Cg  390 (665)
T PRK14873        385 ACARCR  390 (665)
T ss_pred             EhhhCc
Confidence            455554


No 174
>COG1655 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=32.57  E-value=11  Score=33.93  Aligned_cols=19  Identities=16%  Similarity=0.412  Sum_probs=11.5

Q ss_pred             CCCCCCcchhhccCCHHHH
Q psy4572         156 AQPYQCETCRKLFLSKYSL  174 (345)
Q Consensus       156 ~k~y~C~~Cgk~F~s~s~L  174 (345)
                      ++.+.||+|+..|....-+
T Consensus        17 kk~ieCPvC~tkFkkeev~   35 (267)
T COG1655          17 KKTIECPVCNTKFKKEEVK   35 (267)
T ss_pred             hceeccCcccchhhhhhee
Confidence            4456677777776655433


No 175
>PF13240 zinc_ribbon_2:  zinc-ribbon domain
Probab=32.07  E-value=18  Score=20.21  Aligned_cols=6  Identities=33%  Similarity=1.226  Sum_probs=2.8

Q ss_pred             Ccchhh
Q psy4572         161 CETCRK  166 (345)
Q Consensus       161 C~~Cgk  166 (345)
                      |+.||.
T Consensus        16 C~~CG~   21 (23)
T PF13240_consen   16 CPNCGT   21 (23)
T ss_pred             hhhhCC
Confidence            444443


No 176
>PF13451 zf-trcl:  Probable zinc-binding domain
Probab=32.02  E-value=35  Score=23.09  Aligned_cols=7  Identities=43%  Similarity=1.474  Sum_probs=3.0

Q ss_pred             CCCCcch
Q psy4572         158 PYQCETC  164 (345)
Q Consensus       158 ~y~C~~C  164 (345)
                      |-.|+.|
T Consensus        33 p~RC~~C   39 (49)
T PF13451_consen   33 PKRCPSC   39 (49)
T ss_pred             CccCHHH
Confidence            3344444


No 177
>PF15135 UPF0515:  Uncharacterised protein UPF0515
Probab=31.80  E-value=38  Score=30.96  Aligned_cols=73  Identities=18%  Similarity=0.386  Sum_probs=43.9

Q ss_pred             CHHHHHHHHHhcCC-----CCCccCCCCcccccCHHHHHHHHHHhcCCCccccCCCCCCcCCHHHHHHHHHHhcCCCCCC
Q psy4572          86 TKAQREEHIASHVT-----DSEFKCDQCPYVAKRIKYLRYHMKKHRKEYSAYCEPCQEGFFSRDKLETHNIAKHGAQPYQ  160 (345)
Q Consensus        86 ~~~~L~~H~~~h~~-----~k~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~k~F~~~~~l~~H~~~hh~~k~y~  160 (345)
                      +..+++.+-+.+.+     ++.|.|..|+.....        +.......-.|..|.+.|.-...-+     --|-..|.
T Consensus        91 Te~Nlrm~d~a~~~~ip~~drqFaC~~Cd~~WwR--------rvp~rKeVSRCr~C~~rYDPVP~dk-----mwG~aef~  157 (278)
T PF15135_consen   91 TEENLRMFDDAQENLIPSVDRQFACSSCDHMWWR--------RVPQRKEVSRCRKCRKRYDPVPCDK-----MWGIAEFH  157 (278)
T ss_pred             hHHHHHHhhhhhhccccccceeeeccccchHHHh--------ccCcccccccccccccccCCCcccc-----ccceeeee
Confidence            44556655554433     367999999754322        3333344567999988876433100     11344589


Q ss_pred             CcchhhccCCH
Q psy4572         161 CETCRKLFLSK  171 (345)
Q Consensus       161 C~~Cgk~F~s~  171 (345)
                      |+.|+..|+..
T Consensus       158 C~~C~h~F~G~  168 (278)
T PF15135_consen  158 CPKCRHNFRGF  168 (278)
T ss_pred             cccccccchhh
Confidence            99999999855


No 178
>KOG4377|consensus
Probab=30.74  E-value=14  Score=36.15  Aligned_cols=26  Identities=8%  Similarity=0.241  Sum_probs=19.1

Q ss_pred             eecC--ccccccCCHHHHHHHHHHhcCC
Q psy4572         190 YQCE--LCGKSFNNKQTFSLSNMYVLSL  215 (345)
Q Consensus       190 y~C~--~Cgk~F~~~~~L~~H~~~~H~~  215 (345)
                      |.|.  -|+..|..-+....|.+.|-..
T Consensus       402 fhc~r~Gc~~tl~s~sqm~shkrkheRq  429 (480)
T KOG4377|consen  402 FHCDRLGCEATLYSVSQMASHKRKHERQ  429 (480)
T ss_pred             eeecccCCceEEEehhhhhhhhhhhhhh
Confidence            4553  4888888888888887777544


No 179
>PRK14559 putative protein serine/threonine phosphatase; Provisional
Probab=30.64  E-value=50  Score=34.73  Aligned_cols=12  Identities=17%  Similarity=0.357  Sum_probs=6.7

Q ss_pred             CCcchhhccCCH
Q psy4572         160 QCETCRKLFLSK  171 (345)
Q Consensus       160 ~C~~Cgk~F~s~  171 (345)
                      .|+.||......
T Consensus        43 fC~~CG~~~~~~   54 (645)
T PRK14559         43 HCPNCGAETGTI   54 (645)
T ss_pred             cccccCCcccch
Confidence            466666555443


No 180
>PF04780 DUF629:  Protein of unknown function (DUF629);  InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=30.56  E-value=38  Score=33.99  Aligned_cols=28  Identities=14%  Similarity=0.284  Sum_probs=23.8

Q ss_pred             eeecCccccccCCHHHHHHHHHHhcCCC
Q psy4572         189 HYQCELCGKSFNNKQTFSLSNMYVLSLS  216 (345)
Q Consensus       189 ~y~C~~Cgk~F~~~~~L~~H~~~~H~~~  216 (345)
                      -+.|+.|.+.|.....+..|+...|.-.
T Consensus        57 FWiCp~CskkF~d~~~~~~H~~~eH~~~   84 (466)
T PF04780_consen   57 FWICPRCSKKFSDAESCLSHMEQEHPAG   84 (466)
T ss_pred             EeeCCcccceeCCHHHHHHHHHHhhhhh
Confidence            3679999999999999999998877643


No 181
>PF12907 zf-met2:  Zinc-binding
Probab=30.37  E-value=33  Score=22.16  Aligned_cols=27  Identities=15%  Similarity=0.368  Sum_probs=17.5

Q ss_pred             eecCcccccc---CCHHHHHHHHHHhcCCC
Q psy4572         190 YQCELCGKSF---NNKQTFSLSNMYVLSLS  216 (345)
Q Consensus       190 y~C~~Cgk~F---~~~~~L~~H~~~~H~~~  216 (345)
                      +.|.+|...|   .+...|..|....|++.
T Consensus         2 i~C~iC~qtF~~t~~~~~L~eH~enKHpK~   31 (40)
T PF12907_consen    2 IICKICRQTFMQTTNEPQLKEHAENKHPKN   31 (40)
T ss_pred             cCcHHhhHHHHhcCCHHHHHHHHHccCCCC
Confidence            4677777555   34566777777777663


No 182
>PRK12380 hydrogenase nickel incorporation protein HybF; Provisional
Probab=30.35  E-value=23  Score=28.33  Aligned_cols=26  Identities=19%  Similarity=0.438  Sum_probs=16.5

Q ss_pred             CceeccccccccCCHHHHHHHHHHcCCCCceecccccc
Q psy4572          45 GPHKCEHCEKEFENRRALKRHEKNHTQEKTFPCTVCLK   82 (345)
Q Consensus        45 ~~~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~k   82 (345)
                      ....|+.|+..|....            ..+.||.||.
T Consensus        69 ~~~~C~~Cg~~~~~~~------------~~~~CP~Cgs   94 (113)
T PRK12380         69 AQAWCWDCSQVVEIHQ------------HDAQCPHCHG   94 (113)
T ss_pred             cEEEcccCCCEEecCC------------cCccCcCCCC
Confidence            4578888887765532            2255888874


No 183
>KOG1842|consensus
Probab=30.30  E-value=29  Score=34.32  Aligned_cols=31  Identities=10%  Similarity=0.054  Sum_probs=24.0

Q ss_pred             ceeecCccccccCCHHHHHHHHHHhcCCCCc
Q psy4572         188 LHYQCELCGKSFNNKQTFSLSNMYVLSLSNM  218 (345)
Q Consensus       188 k~y~C~~Cgk~F~~~~~L~~H~~~~H~~~~~  218 (345)
                      .-|.||+|...|..-..|..|....|+..+.
T Consensus        14 egflCPiC~~dl~~~~~L~~H~d~eH~~ed~   44 (505)
T KOG1842|consen   14 EGFLCPICLLDLPNLSALNDHLDVEHFEEDE   44 (505)
T ss_pred             hcccCchHhhhhhhHHHHHHHHhhhccccch
Confidence            3578888888888888888888888876664


No 184
>PTZ00303 phosphatidylinositol kinase; Provisional
Probab=29.96  E-value=22  Score=37.71  Aligned_cols=13  Identities=31%  Similarity=0.733  Sum_probs=9.8

Q ss_pred             cccCCCCCCcCCH
Q psy4572         131 AYCEPCQEGFFSR  143 (345)
Q Consensus       131 ~~C~~C~k~F~~~  143 (345)
                      -.|..|++.|...
T Consensus       461 dtC~~C~kkFfSl  473 (1374)
T PTZ00303        461 DSCPSCGRAFISL  473 (1374)
T ss_pred             CcccCcCCccccc
Confidence            4588899888654


No 185
>PF14446 Prok-RING_1:  Prokaryotic RING finger family 1
Probab=29.30  E-value=37  Score=23.42  Aligned_cols=26  Identities=27%  Similarity=0.710  Sum_probs=15.0

Q ss_pred             eeccccccccCCHHHHHHHHHHcCCCCceecccccccc
Q psy4572          47 HKCEHCEKEFENRRALKRHEKNHTQEKTFPCTVCLKMF   84 (345)
Q Consensus        47 ~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~k~F   84 (345)
                      -+|+.|++.|..            ++....|+.|+..+
T Consensus         6 ~~C~~Cg~~~~~------------~dDiVvCp~Cgapy   31 (54)
T PF14446_consen    6 CKCPVCGKKFKD------------GDDIVVCPECGAPY   31 (54)
T ss_pred             ccChhhCCcccC------------CCCEEECCCCCCcc
Confidence            467777776654            22335677776554


No 186
>PF01780 Ribosomal_L37ae:  Ribosomal L37ae protein family;  InterPro: IPR002674 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits.  Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This ribosomal protein is found in archaebacteria and eukaryotes []. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A1E_Y 4A17_Y 4A1C_Y 4A1A_Y 3O58_g 3IZS_m 3O5H_g 1S1I_9 3IZR_m 1YSH_D ....
Probab=29.22  E-value=11  Score=28.89  Aligned_cols=12  Identities=42%  Similarity=1.168  Sum_probs=7.5

Q ss_pred             eeecCccccccC
Q psy4572         189 HYQCELCGKSFN  200 (345)
Q Consensus       189 ~y~C~~Cgk~F~  200 (345)
                      -++|..|++.|.
T Consensus        53 IW~C~~C~~~~A   64 (90)
T PF01780_consen   53 IWKCKKCGKKFA   64 (90)
T ss_dssp             EEEETTTTEEEE
T ss_pred             EeecCCCCCEEe
Confidence            366777766663


No 187
>PF13453 zf-TFIIB:  Transcription factor zinc-finger
Probab=28.86  E-value=28  Score=22.23  Aligned_cols=14  Identities=14%  Similarity=0.157  Sum_probs=5.7

Q ss_pred             cCCCCcccccCHHH
Q psy4572         104 KCDQCPYVAKRIKY  117 (345)
Q Consensus       104 ~C~~C~~~f~~~~~  117 (345)
                      .|+.|+-.+.+...
T Consensus        21 ~C~~C~G~W~d~~e   34 (41)
T PF13453_consen   21 VCPSCGGIWFDAGE   34 (41)
T ss_pred             ECCCCCeEEccHHH
Confidence            34444444433333


No 188
>KOG1280|consensus
Probab=28.51  E-value=61  Score=31.08  Aligned_cols=28  Identities=18%  Similarity=0.228  Sum_probs=16.6

Q ss_pred             CceeccccccccCCHHHHHHHHHHcCCC
Q psy4572          45 GPHKCEHCEKEFENRRALKRHEKNHTQE   72 (345)
Q Consensus        45 ~~~~C~~C~k~F~~~~~L~~H~~~h~~~   72 (345)
                      ..|.|+.|++.-.+...+..|....|.+
T Consensus        78 qSftCPyC~~~Gfte~~f~~Hv~s~Hpd  105 (381)
T KOG1280|consen   78 QSFTCPYCGIMGFTERQFGTHVLSQHPE  105 (381)
T ss_pred             ccccCCcccccccchhHHHHHhhhcCcc
Confidence            3566666666666666666666554443


No 189
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=27.77  E-value=31  Score=29.06  Aligned_cols=33  Identities=24%  Similarity=0.674  Sum_probs=17.1

Q ss_pred             CceeccccccccCCHHHHHHHHHHcCCCCceecccccc
Q psy4572          45 GPHKCEHCEKEFENRRALKRHEKNHTQEKTFPCTVCLK   82 (345)
Q Consensus        45 ~~~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~k   82 (345)
                      -+|.|. |+..|.+.   ++|-..-.|+ .|.|..|+.
T Consensus       116 ~~Y~C~-C~q~~l~~---RRhn~~~~g~-~YrC~~C~g  148 (156)
T COG3091         116 YPYRCQ-CQQHYLRI---RRHNTVRRGE-VYRCGKCGG  148 (156)
T ss_pred             eeEEee-cCCccchh---hhcccccccc-eEEeccCCc
Confidence            356666 66665442   3343333444 566666654


No 190
>PRK00432 30S ribosomal protein S27ae; Validated
Probab=27.77  E-value=33  Score=23.21  Aligned_cols=11  Identities=27%  Similarity=0.851  Sum_probs=7.2

Q ss_pred             eeecCcccccc
Q psy4572         189 HYQCELCGKSF  199 (345)
Q Consensus       189 ~y~C~~Cgk~F  199 (345)
                      .+.|..||..+
T Consensus        37 r~~C~~Cgyt~   47 (50)
T PRK00432         37 RWHCGKCGYTE   47 (50)
T ss_pred             cEECCCcCCEE
Confidence            46777777654


No 191
>KOG2907|consensus
Probab=27.45  E-value=29  Score=27.70  Aligned_cols=40  Identities=25%  Similarity=0.517  Sum_probs=18.2

Q ss_pred             eeccccccccCCHHHHHHHHHHcCCCCceeccccccccCC
Q psy4572          47 HKCEHCEKEFENRRALKRHEKNHTQEKTFPCTVCLKMFKT   86 (345)
Q Consensus        47 ~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~k~F~~   86 (345)
                      .+||.||..--....++.--.......-|.|+.|++.|..
T Consensus        75 ~kCpkCghe~m~Y~T~QlRSADEGQTVFYTC~kC~~k~~e  114 (116)
T KOG2907|consen   75 HKCPKCGHEEMSYHTLQLRSADEGQTVFYTCPKCKYKFTE  114 (116)
T ss_pred             ccCcccCCchhhhhhhhcccccCCceEEEEcCccceeeec
Confidence            5677777543222222211111111123677777776653


No 192
>PRK03976 rpl37ae 50S ribosomal protein L37Ae; Reviewed
Probab=27.38  E-value=24  Score=27.06  Aligned_cols=12  Identities=33%  Similarity=1.052  Sum_probs=7.7

Q ss_pred             eeecCccccccC
Q psy4572         189 HYQCELCGKSFN  200 (345)
Q Consensus       189 ~y~C~~Cgk~F~  200 (345)
                      .+.|..|++.|.
T Consensus        54 IW~C~~C~~~~A   65 (90)
T PRK03976         54 IWECRKCGAKFA   65 (90)
T ss_pred             EEEcCCCCCEEe
Confidence            466777766664


No 193
>COG4888 Uncharacterized Zn ribbon-containing protein [General function prediction only]
Probab=27.18  E-value=22  Score=27.76  Aligned_cols=39  Identities=23%  Similarity=0.373  Sum_probs=21.3

Q ss_pred             CCCceeccccccccCCHHHHHHHHHHcCCCCceeccccccccC
Q psy4572          43 LDGPHKCEHCEKEFENRRALKRHEKNHTQEKTFPCTVCLKMFK   85 (345)
Q Consensus        43 ~~~~~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~k~F~   85 (345)
                      -++.|.|+.|+..-...-.++    ...+.....|..||..|.
T Consensus        19 L~k~FtCp~Cghe~vs~ctvk----k~~~~g~~~Cg~CGls~e   57 (104)
T COG4888          19 LPKTFTCPRCGHEKVSSCTVK----KTVNIGTAVCGNCGLSFE   57 (104)
T ss_pred             CCceEecCccCCeeeeEEEEE----ecCceeEEEcccCcceEE
Confidence            456788888886543322211    112223356777877664


No 194
>TIGR00100 hypA hydrogenase nickel insertion protein HypA. In Hpylori, hypA mutant abolished hydrogenase activity and decrease in urease activity. Nickel supplementation in media restored urease activity and partial hydrogenase activity. HypA probably involved in inserting Ni in enzymes.
Probab=26.99  E-value=26  Score=28.11  Aligned_cols=27  Identities=22%  Similarity=0.605  Sum_probs=16.9

Q ss_pred             CceeccccccccCCHHHHHHHHHHcCCCCceeccccccc
Q psy4572          45 GPHKCEHCEKEFENRRALKRHEKNHTQEKTFPCTVCLKM   83 (345)
Q Consensus        45 ~~~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~k~   83 (345)
                      ....|+.|+..|.....            .+.||.||..
T Consensus        69 ~~~~C~~Cg~~~~~~~~------------~~~CP~Cgs~   95 (115)
T TIGR00100        69 VECECEDCSEEVSPEID------------LYRCPKCHGI   95 (115)
T ss_pred             cEEEcccCCCEEecCCc------------CccCcCCcCC
Confidence            44778888877665321            2567777743


No 195
>COG3364 Zn-ribbon containing protein [General function prediction only]
Probab=26.57  E-value=39  Score=26.46  Aligned_cols=12  Identities=33%  Similarity=0.808  Sum_probs=6.2

Q ss_pred             eeccccccccCC
Q psy4572          75 FPCTVCLKMFKT   86 (345)
Q Consensus        75 ~~C~~C~k~F~~   86 (345)
                      +.|..||..|.+
T Consensus         3 H~CtrCG~vf~~   14 (112)
T COG3364           3 HQCTRCGEVFDD   14 (112)
T ss_pred             ceeccccccccc
Confidence            445555555544


No 196
>PRK03564 formate dehydrogenase accessory protein FdhE; Provisional
Probab=25.38  E-value=31  Score=32.73  Aligned_cols=79  Identities=13%  Similarity=0.244  Sum_probs=42.2

Q ss_pred             CCccCCCCcccccCHHHHHHHHHHhcCCCccccCCCCCCcCCHHHHHHHHHHhcCCCCCCCcchhhccCCHHHHHHHHhh
Q psy4572         101 SEFKCDQCPYVAKRIKYLRYHMKKHRKEYSAYCEPCQEGFFSRDKLETHNIAKHGAQPYQCETCRKLFLSKYSLFTHNKK  180 (345)
Q Consensus       101 k~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~k~F~~~~~l~~H~~~hh~~k~y~C~~Cgk~F~s~s~L~~H~~~  180 (345)
                      ..-.|+.||..-.-.  +. +.....|.+-..|..|+....              -.+.+|+.||..    ..+..+ ..
T Consensus       186 ~~~~CPvCGs~P~~s--~v-~~~~~~G~RyL~CslC~teW~--------------~~R~~C~~Cg~~----~~l~y~-~~  243 (309)
T PRK03564        186 QRQFCPVCGSMPVSS--VV-QIGTTQGLRYLHCNLCESEWH--------------VVRVKCSNCEQS----GKLHYW-SL  243 (309)
T ss_pred             CCCCCCCCCCcchhh--ee-eccCCCCceEEEcCCCCCccc--------------ccCccCCCCCCC----Cceeee-ee
Confidence            346799998653211  11 222334566677988876532              234579999952    112111 00


Q ss_pred             hCCCCcCceeecCccccccCC
Q psy4572         181 HLPIEERLHYQCELCGKSFNN  201 (345)
Q Consensus       181 H~~~~~~k~y~C~~Cgk~F~~  201 (345)
                      .......+...|..|+.....
T Consensus       244 ~~~~~~~r~e~C~~C~~YlK~  264 (309)
T PRK03564        244 DSEQAAVKAESCGDCGTYLKI  264 (309)
T ss_pred             cCCCcceEeeeccccccccee
Confidence            110113356789999876654


No 197
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=24.74  E-value=21  Score=36.30  Aligned_cols=12  Identities=25%  Similarity=0.343  Sum_probs=6.0

Q ss_pred             CCccccCCCCCC
Q psy4572         128 EYSAYCEPCQEG  139 (345)
Q Consensus       128 ~~~~~C~~C~k~  139 (345)
                      +....|..||..
T Consensus       238 ~~~l~Ch~Cg~~  249 (505)
T TIGR00595       238 EGKLRCHYCGYQ  249 (505)
T ss_pred             CCeEEcCCCcCc
Confidence            334556666543


No 198
>KOG2272|consensus
Probab=24.54  E-value=42  Score=30.72  Aligned_cols=126  Identities=16%  Similarity=0.287  Sum_probs=0.0

Q ss_pred             eeccccccccCCHHHHH---HHHHHcCCCC-------ceeccccccccCCHHHHHHHHHhcCCCCCccCCCCcccccCHH
Q psy4572          47 HKCEHCEKEFENRRALK---RHEKNHTQEK-------TFPCTVCLKMFKTKAQREEHIASHVTDSEFKCDQCPYVAKRIK  116 (345)
Q Consensus        47 ~~C~~C~k~F~~~~~L~---~H~~~h~~~k-------~~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~~~f~~~~  116 (345)
                      |.|.+|.+...+...++   +++-.--.++       .|.|..|...... ..|.-.-...|+.. |+|..|++...+..
T Consensus       100 F~Cd~Cn~~Lad~gf~rnqgr~LC~~Cn~k~Ka~~~g~YvC~KCh~~iD~-~~l~fr~d~yH~yH-FkCt~C~keL~sda  177 (332)
T KOG2272|consen  100 FRCDLCNKHLADQGFYRNQGRALCRECNQKEKAKGRGRYVCQKCHAHIDE-QPLTFRGDPYHPYH-FKCTTCGKELTSDA  177 (332)
T ss_pred             chhHHHHHHHhhhhhHhhcchHHhhhhhhhhcccccceeehhhhhhhccc-ccccccCCCCCccc-eecccccccccchh


Q ss_pred             HHHHH----HHHhcCCCccccCCCCCCcCCHHHHHHHHHHhcCCCCCCCcchhhccCCHHHHHH
Q psy4572         117 YLRYH----MKKHRKEYSAYCEPCQEGFFSRDKLETHNIAKHGAQPYQCETCRKLFLSKYSLFT  176 (345)
Q Consensus       117 ~L~~H----~~~h~~~~~~~C~~C~k~F~~~~~l~~H~~~hh~~k~y~C~~Cgk~F~s~s~L~~  176 (345)
                      .-..-    .+-|..-..-.|..|.+...  .....-+-.|=....|.|..|.+-|-......+
T Consensus       178 Revk~eLyClrChD~mgipiCgaC~rpIe--ervi~amgKhWHveHFvCa~CekPFlGHrHYEk  239 (332)
T KOG2272|consen  178 REVKGELYCLRCHDKMGIPICGACRRPIE--ERVIFAMGKHWHVEHFVCAKCEKPFLGHRHYEK  239 (332)
T ss_pred             hhhccceeccccccccCCcccccccCchH--HHHHHHhccccchhheeehhcCCcccchhhhhh


No 199
>KOG3408|consensus
Probab=24.44  E-value=51  Score=26.71  Aligned_cols=27  Identities=11%  Similarity=0.263  Sum_probs=22.7

Q ss_pred             ceeecCccccccCCHHHHHHHHHHhcC
Q psy4572         188 LHYQCELCGKSFNNKQTFSLSNMYVLS  214 (345)
Q Consensus       188 k~y~C~~Cgk~F~~~~~L~~H~~~~H~  214 (345)
                      ..|.|-.|.+-|.+...|..|.+..-+
T Consensus        56 GqfyCi~CaRyFi~~~~l~~H~ktK~H   82 (129)
T KOG3408|consen   56 GQFYCIECARYFIDAKALKTHFKTKVH   82 (129)
T ss_pred             ceeehhhhhhhhcchHHHHHHHhccHH
Confidence            359999999999999999999876544


No 200
>KOG4377|consensus
Probab=24.40  E-value=39  Score=33.07  Aligned_cols=18  Identities=22%  Similarity=0.344  Sum_probs=8.1

Q ss_pred             cccccCCHHHHHHHHHhc
Q psy4572          80 CLKMFKTKAQREEHIASH   97 (345)
Q Consensus        80 C~k~F~~~~~L~~H~~~h   97 (345)
                      |+..+..+.++.+|.+.|
T Consensus       279 C~ykr~~k~DvirH~~~h  296 (480)
T KOG4377|consen  279 CFYKRGQKNDVIRHVEIH  296 (480)
T ss_pred             ccccccchhhhHHHHHHH
Confidence            444444444444444443


No 201
>PF08271 TF_Zn_Ribbon:  TFIIB zinc-binding;  InterPro: IPR013137 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents a zinc finger motif found in transcription factor IIB (TFIIB). In eukaryotes the initiation of transcription of protein encoding genes by the polymerase II complexe (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least seven different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, and -IIH [].  TFIIB and TFIID are responsible for promoter recognition and interaction with pol II; together with Pol II, they form a minimal initiation complex capable of transcription under certain conditions. The TATA box of a Pol II promoter is bound in the initiation complex by the TBP subunit of TFIID, which bends the DNA around the C-terminal domain of TFIIB whereas the N-terminal zinc finger of TFIIB interacts with Pol II [, ]. The TFIIB zinc finger adopts a zinc ribbon fold characterised by two beta-hairpins forming two structurally similar zinc-binding sub-sites []. The zinc finger contacts the rbp1 subunit of Pol II through its dock domain, a conserved region of about 70 amino acids located close to the polymerase active site []. In the Pol II complex this surface is located near the RNA exit groove. Interestingly this sequence is best conserved in the three polymerases that utilise a TFIIB-like general transcription factor (Pol II, Pol III, and archaeal RNA polymerase) but not in Pol I [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1VD4_A 1PFT_A 3K1F_M 3K7A_M 1RO4_A 1RLY_A 1DL6_A.
Probab=24.08  E-value=31  Score=22.24  Aligned_cols=9  Identities=22%  Similarity=0.600  Sum_probs=4.2

Q ss_pred             eeccccccc
Q psy4572          75 FPCTVCLKM   83 (345)
Q Consensus        75 ~~C~~C~k~   83 (345)
                      +.|+.||..
T Consensus        20 ~vC~~CG~V   28 (43)
T PF08271_consen   20 LVCPNCGLV   28 (43)
T ss_dssp             EEETTT-BB
T ss_pred             EECCCCCCE
Confidence            455555543


No 202
>PRK03681 hypA hydrogenase nickel incorporation protein; Validated
Probab=23.67  E-value=46  Score=26.59  Aligned_cols=14  Identities=21%  Similarity=0.454  Sum_probs=9.5

Q ss_pred             CceeccccccccCC
Q psy4572          45 GPHKCEHCEKEFEN   58 (345)
Q Consensus        45 ~~~~C~~C~k~F~~   58 (345)
                      ....|..|+..|..
T Consensus        69 ~~~~C~~Cg~~~~~   82 (114)
T PRK03681         69 AECWCETCQQYVTL   82 (114)
T ss_pred             cEEEcccCCCeeec
Confidence            45778888876554


No 203
>smart00154 ZnF_AN1 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis.
Probab=23.64  E-value=38  Score=21.53  Aligned_cols=13  Identities=31%  Similarity=0.963  Sum_probs=10.9

Q ss_pred             eeecCccccccCC
Q psy4572         189 HYQCELCGKSFNN  201 (345)
Q Consensus       189 ~y~C~~Cgk~F~~  201 (345)
                      ||+|..|++.|-.
T Consensus        12 ~f~C~~C~~~FC~   24 (39)
T smart00154       12 GFKCRHCGNLFCG   24 (39)
T ss_pred             CeECCccCCcccc
Confidence            7899999998854


No 204
>COG1571 Predicted DNA-binding protein containing a Zn-ribbon domain [General function prediction only]
Probab=22.96  E-value=44  Score=33.04  Aligned_cols=13  Identities=23%  Similarity=0.531  Sum_probs=6.4

Q ss_pred             ceeccccccccCC
Q psy4572          74 TFPCTVCLKMFKT   86 (345)
Q Consensus        74 ~~~C~~C~k~F~~   86 (345)
                      -|+|+.|+..+..
T Consensus       367 g~rC~kCg~~~~~  379 (421)
T COG1571         367 GFRCKKCGTRARE  379 (421)
T ss_pred             CcccccccccCCc
Confidence            3555555554443


No 205
>PF03811 Zn_Tnp_IS1:  InsA N-terminal domain;  InterPro: IPR003220 Insertion elements are mobile elements in DNA, usually encoding proteins required for transposition, for example transposases. Protein InsA is absolutely required for transposition of insertion element 1. This entry represents a short zinc binding domain found in IS1 InsA family protein. It is found at the N terminus of the protein and may be a DNA-binding domain.; GO: 0006313 transposition, DNA-mediated
Probab=22.78  E-value=14  Score=23.26  Aligned_cols=8  Identities=38%  Similarity=1.186  Sum_probs=3.8

Q ss_pred             ceeecCcc
Q psy4572         188 LHYQCELC  195 (345)
Q Consensus       188 k~y~C~~C  195 (345)
                      ..|.|..|
T Consensus        28 qryrC~~C   35 (36)
T PF03811_consen   28 QRYRCKDC   35 (36)
T ss_pred             EeEecCcC
Confidence            34555544


No 206
>PRK12496 hypothetical protein; Provisional
Probab=22.07  E-value=44  Score=28.60  Aligned_cols=12  Identities=17%  Similarity=0.661  Sum_probs=7.4

Q ss_pred             ccccCCCCCCcC
Q psy4572         130 SAYCEPCQEGFF  141 (345)
Q Consensus       130 ~~~C~~C~k~F~  141 (345)
                      .+.|+.|++.|.
T Consensus       127 ~~~C~gC~~~~~  138 (164)
T PRK12496        127 RKVCKGCKKKYP  138 (164)
T ss_pred             eEECCCCCcccc
Confidence            355777776664


No 207
>PF05495 zf-CHY:  CHY zinc finger;  InterPro: IPR008913 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  Pirh2 is an eukaryotic ubiquitin protein ligase, which has been shown to promote p53 degradation in mammals. Pirh2 physically interacts with p53 and promotes ubiquitination of p53 independently of MDM2. Like MDM2, Pirh2 is thought to participate in an autoregulatory feedback loop that controls p53 function. Pirh2 proteins contain three distinct zinc fingers, the CHY-type, the CTCHY-type which is C-terminal to the CHY-type zinc finger and a RING finger. The CHY-type zinc finger has no currently known function []. As well as Pirh2, the CHY-type zinc finger is also found in the following proteins:   Yeast helper of Tim protein 13. Hot13 may have a role in the assembly and recycling of the small Tims, a complex of the mitochondrial intermembrane space that participates in the TIM22 import pathway for assembly of the inner membrane [] Several plant hypothetical proteins that also contain haemerythrin cation binding domains Several protozoan hypothetical proteins that also contain a Myb domain   The solution structure of this zinc finger has been solved and binds 3 zinc atoms as shown in the following schematic representation:   ++---------+-----+ || | | CXHYxxxxxxxxxCCxxxxxCxxCHxxxxxHxxxxxxxxxxxCxxCxxxxxxxxxCxxC | | | | | | | | +-+-----------------+--+ +--+---------+--+ 'C': conserved cysteine involved in the binding of one zinc atom. 'H': conserved histidine involved in the binding of one zinc atom.  More information about these proteins can be found at Protein of the Month: Zinc Fingers []; GO: 0008270 zinc ion binding; PDB: 2DKT_A 2K2C_A.
Probab=21.89  E-value=17  Score=26.43  Aligned_cols=12  Identities=17%  Similarity=0.332  Sum_probs=6.7

Q ss_pred             eeccccccccCC
Q psy4572          47 HKCEHCEKEFEN   58 (345)
Q Consensus        47 ~~C~~C~k~F~~   58 (345)
                      ++|+-|++.|.-
T Consensus        11 ~~~~cC~~~y~C   22 (71)
T PF05495_consen   11 IRFPCCGKYYPC   22 (71)
T ss_dssp             EEETTTTEEESS
T ss_pred             EECCcccCeecH
Confidence            455566665554


No 208
>PF13821 DUF4187:  Domain of unknown function (DUF4187)
Probab=21.72  E-value=95  Score=21.42  Aligned_cols=18  Identities=17%  Similarity=0.639  Sum_probs=12.2

Q ss_pred             eecCccccccCCHHHHHH
Q psy4572         190 YQCELCGKSFNNKQTFSL  207 (345)
Q Consensus       190 y~C~~Cgk~F~~~~~L~~  207 (345)
                      +-|-.||..|.+...|.+
T Consensus        28 ~YC~~Cg~~Y~d~~dL~~   45 (55)
T PF13821_consen   28 NYCFWCGTKYDDEEDLER   45 (55)
T ss_pred             ceeeeeCCccCCHHHHHh
Confidence            567777777777766654


No 209
>PRK00564 hypA hydrogenase nickel incorporation protein; Provisional
Probab=21.49  E-value=39  Score=27.18  Aligned_cols=14  Identities=21%  Similarity=0.627  Sum_probs=9.9

Q ss_pred             CceeccccccccCC
Q psy4572          45 GPHKCEHCEKEFEN   58 (345)
Q Consensus        45 ~~~~C~~C~k~F~~   58 (345)
                      ..+.|..|+..|..
T Consensus        70 ~~~~C~~Cg~~~~~   83 (117)
T PRK00564         70 VELECKDCSHVFKP   83 (117)
T ss_pred             CEEEhhhCCCcccc
Confidence            45788888876655


No 210
>PF04606 Ogr_Delta:  Ogr/Delta-like zinc finger;  InterPro: IPR007684 This entry is represented by Bacteriophage P2, Ogr. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This is a viral family of phage zinc-binding transcriptional activators, which also contains cryptic members in some bacterial genomes []. The P4 phage delta protein contains two such domains attached covalently, while the P2 phage Ogr proteins possess one domain but function as dimers. All the members of this family have the following consensus sequence: C-X(2)-C-X(3)-A-(X)2-R-X(15)-C-X(4)-C-X(3)-F [].; GO: 0006355 regulation of transcription, DNA-dependent
Probab=20.98  E-value=24  Score=23.36  Aligned_cols=37  Identities=24%  Similarity=0.416  Sum_probs=18.7

Q ss_pred             CCcchhhccCCHHHHHHHHhhhCCCCcCceeecCc--cccccCC
Q psy4572         160 QCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCEL--CGKSFNN  201 (345)
Q Consensus       160 ~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~--Cgk~F~~  201 (345)
                      .||.||....-......+...     .+.-|+|..  ||..|..
T Consensus         1 ~CP~Cg~~a~ir~S~~~s~~~-----~~~Y~qC~N~~Cg~tfv~   39 (47)
T PF04606_consen    1 RCPHCGSKARIRTSRQLSPLT-----RELYCQCTNPECGHTFVA   39 (47)
T ss_pred             CcCCCCCeeEEEEchhhCcce-----EEEEEEECCCcCCCEEEE
Confidence            377777654433332222111     233467765  8877753


No 211
>TIGR00686 phnA alkylphosphonate utilization operon protein PhnA. The protein family includes an uncharacterized member designated phnA in Escherichia coli, part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage. This protein is not related to the characterized phosphonoacetate hydrolase designated PhnA by Kulakova, et al. (2001, 1997).
Probab=20.97  E-value=44  Score=26.51  Aligned_cols=30  Identities=17%  Similarity=0.524  Sum_probs=20.8

Q ss_pred             CCcchhhccCCHHHHHHHHhhhCCCCcCceeecCccccccCCHH
Q psy4572         160 QCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCELCGKSFNNKQ  203 (345)
Q Consensus       160 ~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~Cgk~F~~~~  203 (345)
                      .||.|+-.|...              ....|.||.|+..+....
T Consensus         4 ~CP~C~seytY~--------------dg~~~iCpeC~~EW~~~~   33 (109)
T TIGR00686         4 PCPKCNSEYTYH--------------DGTQLICPSCLYEWNENE   33 (109)
T ss_pred             cCCcCCCcceEe--------------cCCeeECccccccccccc
Confidence            488887766543              223589999998887654


No 212
>PF08790 zf-LYAR:  LYAR-type C2HC zinc finger ;  InterPro: IPR014898 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This C2HC zinc finger domain is found in LYAR proteins such as Q08288 from SWISSPROT, which are involved in cell growth regulation.  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1WJV_A.
Probab=20.95  E-value=21  Score=21.13  Aligned_cols=18  Identities=22%  Similarity=0.724  Sum_probs=9.8

Q ss_pred             eecCccccccCCHHHHHHH
Q psy4572         190 YQCELCGKSFNNKQTFSLS  208 (345)
Q Consensus       190 y~C~~Cgk~F~~~~~L~~H  208 (345)
                      |.|-.|++.| .....+.|
T Consensus         1 ~sCiDC~~~F-~~~~y~~H   18 (28)
T PF08790_consen    1 FSCIDCSKDF-DGDSYKSH   18 (28)
T ss_dssp             EEETTTTEEE-EGGGTTT-
T ss_pred             CeeecCCCCc-CcCCcCCC
Confidence            4566777777 34444444


No 213
>KOG0320|consensus
Probab=20.92  E-value=82  Score=27.38  Aligned_cols=19  Identities=21%  Similarity=0.546  Sum_probs=11.9

Q ss_pred             CCCCceeccccccccCCHH
Q psy4572          42 KLDGPHKCEHCEKEFENRR   60 (345)
Q Consensus        42 ~~~~~~~C~~C~k~F~~~~   60 (345)
                      ..+..|.|++|-..|..+.
T Consensus       127 ~~~~~~~CPiCl~~~sek~  145 (187)
T KOG0320|consen  127 RKEGTYKCPICLDSVSEKV  145 (187)
T ss_pred             ccccccCCCceecchhhcc
Confidence            3345588888877665543


No 214
>PF05191 ADK_lid:  Adenylate kinase, active site lid;  InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=20.87  E-value=27  Score=21.88  Aligned_cols=8  Identities=25%  Similarity=1.136  Sum_probs=3.6

Q ss_pred             cccccccc
Q psy4572          77 CTVCLKMF   84 (345)
Q Consensus        77 C~~C~k~F   84 (345)
                      |+.||+.|
T Consensus         4 C~~Cg~~Y   11 (36)
T PF05191_consen    4 CPKCGRIY   11 (36)
T ss_dssp             ETTTTEEE
T ss_pred             cCCCCCcc
Confidence            44444444


No 215
>PRK10220 hypothetical protein; Provisional
Probab=20.76  E-value=55  Score=26.00  Aligned_cols=30  Identities=20%  Similarity=0.662  Sum_probs=20.8

Q ss_pred             CCcchhhccCCHHHHHHHHhhhCCCCcCceeecCccccccCCHH
Q psy4572         160 QCETCRKLFLSKYSLFTHNKKHLPIEERLHYQCELCGKSFNNKQ  203 (345)
Q Consensus       160 ~C~~Cgk~F~s~s~L~~H~~~H~~~~~~k~y~C~~Cgk~F~~~~  203 (345)
                      .|+.|+-.|...              ....|.|+.|+..+....
T Consensus         5 ~CP~C~seytY~--------------d~~~~vCpeC~hEW~~~~   34 (111)
T PRK10220          5 HCPKCNSEYTYE--------------DNGMYICPECAHEWNDAE   34 (111)
T ss_pred             cCCCCCCcceEc--------------CCCeEECCcccCcCCccc
Confidence            488887766543              223588999998887654


No 216
>PF04423 Rad50_zn_hook:  Rad50 zinc hook motif;  InterPro: IPR007517 The Mre11 complex (Mre11 Rad50 Nbs1) is central to chromosomal maintenance and functions in homologous recombination, telomere maintenance and sister chromatid association. The Rad50 coiled-coil region contains a dimer interface at the apex of the coiled coils in which pairs of conserved Cys-X-X-Cys motifs form interlocking hooks that bind one Zn ion. This alignment includes the zinc hook motif and a short stretch of coiled-coil on either side.; GO: 0004518 nuclease activity, 0005524 ATP binding, 0008270 zinc ion binding, 0006281 DNA repair; PDB: 1L8D_B.
Probab=20.50  E-value=40  Score=22.89  Aligned_cols=12  Identities=33%  Similarity=1.113  Sum_probs=6.5

Q ss_pred             ecCccccccCCH
Q psy4572         191 QCELCGKSFNNK  202 (345)
Q Consensus       191 ~C~~Cgk~F~~~  202 (345)
                      .||+|++.|...
T Consensus        22 ~CPlC~r~l~~e   33 (54)
T PF04423_consen   22 CCPLCGRPLDEE   33 (54)
T ss_dssp             E-TTT--EE-HH
T ss_pred             cCCCCCCCCCHH
Confidence            899999999753


No 217
>cd00924 Cyt_c_Oxidase_Vb Cytochrome c oxidase subunit Vb.  Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes.  It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane.  The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome.  Found only in eukaryotes, subunit Vb is one of three mammalian subunits that lacks a transmembrane region.  Subunit Vb is located on the matrix side of the membrane and binds the regulatory subunit of protein kinase A.  The abnormally extended conformation is stable only in the CcO assembly.
Probab=20.44  E-value=47  Score=25.85  Aligned_cols=14  Identities=29%  Similarity=0.688  Sum_probs=11.3

Q ss_pred             CceeecCccccccC
Q psy4572         187 RLHYQCELCGKSFN  200 (345)
Q Consensus       187 ~k~y~C~~Cgk~F~  200 (345)
                      .++.+|+.||..|.
T Consensus        77 g~~~rC~eCG~~fk   90 (97)
T cd00924          77 GKPKRCPECGHVFK   90 (97)
T ss_pred             CCceeCCCCCcEEE
Confidence            36889999998885


No 218
>PF01428 zf-AN1:  AN1-like Zinc finger;  InterPro: IPR000058 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents the AN1-type zinc finger domain, which has a dimetal (zinc)-bound alpha/beta fold. This domain was first identified as a zinc finger at the C terminus of AN1 Q91889 from SWISSPROT, a ubiquitin-like protein in Xenopus laevis []. The AN1-type zinc finger contains six conserved cysteines and two histidines that could potentially coordinate 2 zinc atoms. Certain stress-associated proteins (SAP) contain AN1 domain, often in combination with A20 zinc finger domains (SAP8) or C2H2 domains (SAP16) []. For example, the human protein Znf216 has an A20 zinc-finger at the N terminus and an AN1 zinc-finger at the C terminus, acting to negatively regulate the NFkappaB activation pathway and to interact with components of the immune response like RIP, IKKgamma and TRAF6. The interact of Znf216 with IKK-gamma and RIP is mediated by the A20 zinc-finger domain, while its interaction with TRAF6 is mediated by the AN1 zinc-finger domain; therefore, both zinc-finger domains are involved in regulating the immune response []. The AN1 zinc finger domain is also found in proteins containing a ubiquitin-like domain, which are involved in the ubiquitination pathway []. Proteins containing an AN1-type zinc finger include:   Ascidian posterior end mark 6 (pem-6) protein []. Human AWP1 protein (associated with PRK1), which is expressed during early embryogenesis []. Human immunoglobulin mu binding protein 2 (SMUBP-2), mutations in which cause muscular atrophy with respiratory distress type 1 [].   More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 1WFP_A 1WYS_A 1WG2_A 1WFH_A 1X4W_A 1WFE_A 1WFL_A 1X4V_A.
Probab=20.42  E-value=40  Score=21.78  Aligned_cols=14  Identities=50%  Similarity=1.094  Sum_probs=9.0

Q ss_pred             ceeecCccccccCC
Q psy4572         188 LHYQCELCGKSFNN  201 (345)
Q Consensus       188 k~y~C~~Cgk~F~~  201 (345)
                      -|+.|+.|++.|=.
T Consensus        12 ~~~~C~~C~~~FC~   25 (43)
T PF01428_consen   12 LPFKCKHCGKSFCL   25 (43)
T ss_dssp             SHEE-TTTS-EE-T
T ss_pred             CCeECCCCCcccCc
Confidence            47999999998853


No 219
>PF01155 HypA:  Hydrogenase expression/synthesis hypA family;  InterPro: IPR000688 Bacterial membrane-bound nickel-dependent hydrogenases requires a number of accessory proteins which are involved in their maturation. The exact role of these proteins is not yet clear, but some seem to be required for the incorporation of the nickel ions []. One of these proteins is generally known as hypA. It is a protein of about 12 to 14 kDa that contains, in its C-terminal region, four conserved cysteines that form a zinc-finger like motif. Escherichia coli has two proteins that belong to this family, hypA and hybF. A homologue, MJ0214, has also been found in a number of archaeal species, including the genome of Methanocaldococcus jannaschii (Methanococcus jannaschii).; GO: 0016151 nickel ion binding, 0006464 protein modification process; PDB: 2KDX_A 3A44_D 3A43_B.
Probab=20.23  E-value=41  Score=26.78  Aligned_cols=27  Identities=30%  Similarity=0.716  Sum_probs=15.0

Q ss_pred             CceeccccccccCCHHHHHHHHHHcCCCCceeccccccc
Q psy4572          45 GPHKCEHCEKEFENRRALKRHEKNHTQEKTFPCTVCLKM   83 (345)
Q Consensus        45 ~~~~C~~C~k~F~~~~~L~~H~~~h~~~k~~~C~~C~k~   83 (345)
                      ....|..|+..|.-...            .+.||.|+..
T Consensus        69 ~~~~C~~Cg~~~~~~~~------------~~~CP~Cgs~   95 (113)
T PF01155_consen   69 ARARCRDCGHEFEPDEF------------DFSCPRCGSP   95 (113)
T ss_dssp             -EEEETTTS-EEECHHC------------CHH-SSSSSS
T ss_pred             CcEECCCCCCEEecCCC------------CCCCcCCcCC
Confidence            34778888887765432            1447777754


No 220
>PLN02294 cytochrome c oxidase subunit Vb
Probab=20.16  E-value=48  Score=28.52  Aligned_cols=16  Identities=19%  Similarity=0.675  Sum_probs=12.4

Q ss_pred             cCceeecCccccccCC
Q psy4572         186 ERLHYQCELCGKSFNN  201 (345)
Q Consensus       186 ~~k~y~C~~Cgk~F~~  201 (345)
                      ..++.+|+.||..|.-
T Consensus       138 kGkp~RCpeCG~~fkL  153 (174)
T PLN02294        138 KGKSFECPVCTQYFEL  153 (174)
T ss_pred             CCCceeCCCCCCEEEE
Confidence            4468889999998853


No 221
>COG2331 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.16  E-value=44  Score=24.74  Aligned_cols=9  Identities=22%  Similarity=0.840  Sum_probs=4.4

Q ss_pred             eeccccccc
Q psy4572          47 HKCEHCEKE   55 (345)
Q Consensus        47 ~~C~~C~k~   55 (345)
                      |+|..|+..
T Consensus        13 Y~c~~cg~~   21 (82)
T COG2331          13 YECTECGNR   21 (82)
T ss_pred             EeecccchH
Confidence            455555543


Done!