Query         044013
Match_columns 153
No_of_seqs    265 out of 2146
Neff          8.0 
Searched_HMMs 46136
Date          Fri Mar 29 10:37:17 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/044013.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/044013hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG2462 C2H2-type Zn-finger pr  99.8   2E-21 4.3E-26  150.7   0.7   92   12-116   177-268 (279)
  2 KOG2462 C2H2-type Zn-finger pr  99.6 1.5E-16 3.2E-21  123.7   3.8   85   10-105   144-230 (279)
  3 KOG3623 Homeobox transcription  99.4 5.5E-14 1.2E-18  120.9   2.1   78   23-110   892-969 (1007)
  4 KOG3576 Ovo and related transc  99.3 1.2E-13 2.7E-18  103.6  -1.5   78   23-110   115-192 (267)
  5 KOG3623 Homeobox transcription  99.1   1E-11 2.2E-16  107.2   0.4   63   11-81    909-971 (1007)
  6 KOG1074 Transcriptional repres  99.1 6.5E-11 1.4E-15  103.5   3.2   78   24-110   604-687 (958)
  7 PHA02768 hypothetical protein;  99.1 9.5E-11 2.1E-15   70.7   2.7   44   25-77      5-48  (55)
  8 KOG1074 Transcriptional repres  99.0 4.6E-10   1E-14   98.3   3.7   57   25-89    353-409 (958)
  9 KOG3576 Ovo and related transc  99.0 3.2E-11 6.9E-16   90.7  -3.0   69   12-88    133-201 (267)
 10 PHA00733 hypothetical protein   98.6   2E-08 4.3E-13   71.3   3.2   72   23-105    38-114 (128)
 11 KOG3608 Zn finger proteins [Ge  98.6 4.1E-09 8.9E-14   85.0  -1.2  104   10-123   277-385 (467)
 12 PHA00733 hypothetical protein   98.5 2.9E-08 6.3E-13   70.4   1.1   52   21-81     69-120 (128)
 13 PHA00616 hypothetical protein   98.5 4.8E-08 1.1E-12   56.3   1.5   33   25-57      1-33  (44)
 14 PF13465 zf-H2C2_2:  Zinc-finge  98.5 2.7E-08 5.8E-13   51.4   0.4   23   40-62      1-23  (26)
 15 PHA02768 hypothetical protein;  98.4 1.1E-07 2.3E-12   57.5   1.3   42   53-105     5-46  (55)
 16 PHA00732 hypothetical protein   98.3 7.2E-07 1.6E-11   58.1   2.9   44   25-81      1-45  (79)
 17 PLN03086 PRLI-interacting fact  98.3 9.5E-07 2.1E-11   76.1   4.3   67   23-103   451-517 (567)
 18 KOG3608 Zn finger proteins [Ge  98.1 3.8E-07 8.2E-12   73.9  -0.7   68   24-101   236-303 (467)
 19 PLN03086 PRLI-interacting fact  98.1   1E-06 2.2E-11   75.9   1.3   74   12-100   468-551 (567)
 20 PF13465 zf-H2C2_2:  Zinc-finge  98.1 5.5E-07 1.2E-11   46.4  -0.3   25   75-101     1-25  (26)
 21 PF00096 zf-C2H2:  Zinc finger,  98.0 4.2E-06 9.2E-11   41.4   2.2   23   26-48      1-23  (23)
 22 PF13912 zf-C2H2_6:  C2H2-type   97.6 3.8E-05 8.3E-10   39.4   2.0   25   25-49      1-25  (27)
 23 COG5189 SFP1 Putative transcri  97.6   3E-05 6.5E-10   62.2   1.7   52   23-81    347-419 (423)
 24 PF13894 zf-C2H2_4:  C2H2-type   97.5   8E-05 1.7E-09   36.6   2.2   23   26-48      1-23  (24)
 25 PF05605 zf-Di19:  Drought indu  97.5  0.0001 2.2E-09   44.4   2.7   47   25-81      2-50  (54)
 26 PHA00616 hypothetical protein   97.5 1.1E-05 2.5E-10   46.5  -1.5   34   53-95      1-34  (44)
 27 KOG3993 Transcription factor (  97.2 3.7E-05 8.1E-10   63.7  -1.5   70    7-85    278-380 (500)
 28 PF09237 GAGA:  GAGA factor;  I  97.2 0.00054 1.2E-08   40.6   3.3   37   17-53     16-52  (54)
 29 smart00355 ZnF_C2H2 zinc finge  97.1 0.00041 8.8E-09   34.4   2.1   24   26-49      1-24  (26)
 30 PHA00732 hypothetical protein   97.0 0.00042 9.2E-09   45.1   2.1   39   53-102     1-39  (79)
 31 PRK04860 hypothetical protein;  97.0 0.00039 8.5E-09   51.1   1.9   36   23-62    117-152 (160)
 32 PF12874 zf-met:  Zinc-finger o  96.7  0.0013 2.8E-08   32.9   1.7   23   26-48      1-23  (25)
 33 PF00096 zf-C2H2:  Zinc finger,  95.9 0.00043 9.4E-09   34.0  -2.6   23   54-84      1-23  (23)
 34 COG5048 FOG: Zn-finger [Genera  95.6  0.0054 1.2E-07   49.7   1.3   60   24-91    288-353 (467)
 35 PF12171 zf-C2H2_jaz:  Zinc-fin  95.4   0.005 1.1E-07   31.5   0.4   22   26-47      2-23  (27)
 36 PF12756 zf-C2H2_2:  C2H2 type   95.3  0.0023   5E-08   42.1  -1.4   21   27-47      1-21  (100)
 37 PF13909 zf-H2C2_5:  C2H2-type   95.1    0.02 4.3E-07   28.2   2.1   23   26-49      1-23  (24)
 38 COG5189 SFP1 Putative transcri  95.0   0.014 3.1E-07   47.2   1.9   45   50-100   346-408 (423)
 39 PF13913 zf-C2HC_2:  zinc-finge  94.9   0.021 4.5E-07   28.8   1.7   21   26-47      3-23  (25)
 40 KOG3993 Transcription factor (  94.8  0.0065 1.4E-07   50.7  -0.6   53   25-85    267-319 (500)
 41 PF13912 zf-C2H2_6:  C2H2-type   94.6  0.0036 7.9E-08   31.8  -1.6   25   53-85      1-25  (27)
 42 smart00355 ZnF_C2H2 zinc finge  94.3   0.016 3.5E-07   28.2   0.5   24   54-85      1-24  (26)
 43 PF13894 zf-C2H2_4:  C2H2-type   94.1  0.0042   9E-08   30.1  -2.0   21   54-81      1-21  (24)
 44 smart00451 ZnF_U1 U1-like zinc  93.5   0.059 1.3E-06   28.8   1.8   23   25-47      3-25  (35)
 45 PF09237 GAGA:  GAGA factor;  I  93.3   0.011 2.5E-07   35.0  -1.3   33   49-89     20-52  (54)
 46 PF05605 zf-Di19:  Drought indu  93.3   0.048 1.1E-06   32.5   1.4   40   53-100     2-41  (54)
 47 PRK04860 hypothetical protein;  92.8   0.038 8.2E-07   40.7   0.4   36   53-101   119-154 (160)
 48 PF12756 zf-C2H2_2:  C2H2 type   92.0    0.14 2.9E-06   33.5   2.4   25   25-49     50-74  (100)
 49 cd00350 rubredoxin_like Rubred  91.1    0.19 4.2E-06   26.9   1.9   25   26-62      2-26  (33)
 50 KOG2893 Zn finger protein [Gen  87.5    0.35 7.6E-06   37.9   1.7   50   23-85      9-59  (341)
 51 KOG1146 Homeobox protein [Gene  86.0    0.33 7.2E-06   45.9   1.0   52   23-81    463-539 (1406)
 52 COG1592 Rubrerythrin [Energy p  84.6    0.69 1.5E-05   34.2   2.0   23   25-60    134-156 (166)
 53 COG5048 FOG: Zn-finger [Genera  84.6     1.2 2.7E-05   35.9   3.7   61   22-88     30-90  (467)
 54 cd00729 rubredoxin_SM Rubredox  82.8    0.95 2.1E-05   24.4   1.6   26   25-62      2-27  (34)
 55 TIGR00373 conserved hypothetic  80.9       1 2.2E-05   32.9   1.6   31   23-62    107-137 (158)
 56 PRK06266 transcription initiat  80.7       1 2.2E-05   33.7   1.6   30   24-62    116-145 (178)
 57 PF09986 DUF2225:  Uncharacteri  79.5    0.53 1.1E-05   36.1  -0.3   24   23-46      3-26  (214)
 58 smart00531 TFIIE Transcription  79.3    0.96 2.1E-05   32.5   1.1   35   24-62     98-132 (147)
 59 PF09723 Zn-ribbon_8:  Zinc rib  78.9     1.1 2.3E-05   25.3   1.0   31   25-62      5-35  (42)
 60 TIGR02605 CxxC_CxxC_SSSS putat  78.2    0.82 1.8E-05   26.7   0.4   31   25-62      5-35  (52)
 61 smart00834 CxxC_CXXC_SSSS Puta  77.4    0.88 1.9E-05   25.0   0.3   30   25-61      5-34  (41)
 62 PF02892 zf-BED:  BED zinc fing  75.3       3 6.5E-05   23.4   2.2   25   22-46     13-41  (45)
 63 PF05443 ROS_MUCR:  ROS/MUCR tr  73.5     1.9 4.2E-05   30.7   1.3   27   24-53     71-97  (132)
 64 COG4049 Uncharacterized protei  73.2     1.9 4.1E-05   26.1   1.0   26   22-47     14-39  (65)
 65 smart00614 ZnF_BED BED zinc fi  73.1       3 6.6E-05   24.2   1.9   25   25-49     18-48  (50)
 66 PF09538 FYDLN_acid:  Protein o  72.3     2.6 5.7E-05   28.9   1.7   30   26-73     10-39  (108)
 67 COG1996 RPC10 DNA-directed RNA  67.6     3.2 6.8E-05   24.4   1.1   30   23-62      4-33  (49)
 68 COG2888 Predicted Zn-ribbon RN  67.2     3.2 6.9E-05   25.4   1.1   31   25-60     27-57  (61)
 69 PF13719 zinc_ribbon_5:  zinc-r  66.8     3.2 6.9E-05   22.7   1.0   32   26-62      3-34  (37)
 70 PRK00398 rpoP DNA-directed RNA  66.7     2.6 5.6E-05   24.0   0.6   29   24-62      2-30  (46)
 71 smart00659 RPOLCX RNA polymera  66.6       4 8.6E-05   23.3   1.4   27   25-62      2-28  (44)
 72 TIGR02098 MJ0042_CXXC MJ0042 f  66.4       3 6.5E-05   22.6   0.8   32   26-62      3-34  (38)
 73 TIGR02300 FYDLN_acid conserved  63.9     5.4 0.00012   28.2   1.9   34   26-77     10-43  (129)
 74 PRK14890 putative Zn-ribbon RN  63.9     5.3 0.00012   24.4   1.6   30   25-60     25-55  (59)
 75 PF13717 zinc_ribbon_4:  zinc-r  59.4     6.6 0.00014   21.3   1.4   32   26-62      3-34  (36)
 76 PF04959 ARS2:  Arsenite-resist  58.9     8.5 0.00018   29.6   2.4   25   23-47     75-99  (214)
 77 PRK00464 nrdR transcriptional   58.6     1.9 4.1E-05   31.5  -1.2   10   26-35      1-10  (154)
 78 COG3357 Predicted transcriptio  56.0     6.5 0.00014   26.2   1.1   14   24-37     57-70  (97)
 79 smart00734 ZnF_Rad18 Rad18-lik  56.0     9.3  0.0002   19.2   1.5   19   27-46      3-21  (26)
 80 COG3364 Zn-ribbon containing p  55.9     7.9 0.00017   26.4   1.5   28   25-62      2-29  (112)
 81 COG4957 Predicted transcriptio  54.8     6.5 0.00014   28.1   1.0   25   26-53     77-101 (148)
 82 PRK03824 hypA hydrogenase nick  52.0     7.5 0.00016   27.6   1.0   15   24-38     69-83  (135)
 83 KOG2593 Transcription initiati  48.4     5.7 0.00012   33.6  -0.1   37   23-62    126-162 (436)
 84 TIGR00100 hypA hydrogenase nic  47.7      12 0.00025   25.8   1.4   26   25-62     70-95  (115)
 85 KOG4167 Predicted DNA-binding   47.3     5.2 0.00011   36.2  -0.5   26   25-50    792-817 (907)
 86 COG3091 SprT Zn-dependent meta  47.1     8.4 0.00018   28.1   0.6   35   23-62    115-149 (156)
 87 KOG4124 Putative transcription  46.9     5.9 0.00013   32.8  -0.2   23   24-46    348-372 (442)
 88 KOG3214 Uncharacterized Zn rib  46.2     4.4 9.6E-05   27.4  -0.9   20   90-109    47-66  (109)
 89 PF12013 DUF3505:  Protein of u  45.4      19  0.0004   24.3   2.1   24   26-49     81-108 (109)
 90 KOG3408 U1-like Zn-finger-cont  45.1      14  0.0003   26.0   1.4   26   22-47     54-79  (129)
 91 smart00154 ZnF_AN1 AN1-like Zi  44.9      11 0.00023   20.9   0.7   13   25-37     12-24  (39)
 92 PRK12380 hydrogenase nickel in  43.6      16 0.00035   25.1   1.6   26   25-62     70-95  (113)
 93 PRK00564 hypA hydrogenase nick  43.1      15 0.00033   25.3   1.4   14   25-38     71-84  (117)
 94 PF07754 DUF1610:  Domain of un  42.0      12 0.00027   18.6   0.6   10   24-33     15-24  (24)
 95 PF06524 NOA36:  NOA36 protein;  41.3      54  0.0012   26.3   4.3   24   51-81    207-230 (314)
 96 PF01428 zf-AN1:  AN1-like Zinc  40.1      12 0.00026   21.0   0.4   15   24-38     12-26  (43)
 97 PF07800 DUF1644:  Protein of u  38.6      22 0.00048   26.1   1.7   58   25-90     80-138 (162)
 98 PF03604 DNA_RNApol_7kD:  DNA d  38.4      25 0.00054   18.6   1.5   26   26-62      1-26  (32)
 99 PF10571 UPF0547:  Uncharacteri  38.4      15 0.00032   18.5   0.5   11   26-36     15-25  (26)
100 KOG1146 Homeobox protein [Gene  36.6     7.9 0.00017   37.2  -1.1   51   27-84    438-489 (1406)
101 COG1655 Uncharacterized protei  36.2     7.8 0.00017   30.3  -1.0   29   23-51     17-45  (267)
102 KOG2186 Cell growth-regulating  36.0      16 0.00035   28.9   0.7   54   26-90      4-57  (276)
103 PHA00626 hypothetical protein   36.0      17 0.00038   22.0   0.7   15   24-38     22-36  (59)
104 PF01155 HypA:  Hydrogenase exp  35.8      17 0.00037   24.9   0.8   26   25-62     70-95  (113)
105 PF09845 DUF2072:  Zn-ribbon co  34.5      17 0.00037   25.8   0.6   15   25-39      1-15  (131)
106 KOG0717 Molecular chaperone (D  34.2      25 0.00054   30.3   1.6   22   26-47    293-314 (508)
107 PF07295 DUF1451:  Protein of u  33.9      33 0.00071   24.8   2.0   10   25-34    112-121 (146)
108 smart00064 FYVE Protein presen  33.9      26 0.00057   21.2   1.3   30   26-67     11-40  (68)
109 KOG2785 C2H2-type Zn-finger pr  33.7      37  0.0008   28.5   2.5   54   24-81    165-241 (390)
110 PF07975 C1_4:  TFIIH C1-like d  33.7      14  0.0003   21.9  -0.0   25   24-48     20-44  (51)
111 KOG2071 mRNA cleavage and poly  33.2      25 0.00054   31.1   1.5   26   24-49    417-442 (579)
112 PRK03681 hypA hydrogenase nick  32.9      22 0.00047   24.5   0.9   27   25-62     70-96  (114)
113 COG0068 HypF Hydrogenase matur  32.8      11 0.00023   34.1  -0.8   29   70-100   133-161 (750)
114 cd00065 FYVE FYVE domain; Zinc  31.8      35 0.00077   19.7   1.6   29   27-67      4-32  (57)
115 KOG2482 Predicted C2H2-type Zn  30.9      30 0.00066   28.7   1.5   23   25-47    195-217 (423)
116 PF08790 zf-LYAR:  LYAR-type C2  30.3      17 0.00038   18.7   0.0   19   26-45      1-19  (28)
117 COG1997 RPL43A Ribosomal prote  29.8      21 0.00046   23.6   0.4   28   25-62     35-62  (89)
118 PF10013 DUF2256:  Uncharacteri  27.7      25 0.00054   19.9   0.4   14   27-40     10-23  (42)
119 PF13824 zf-Mss51:  Zinc-finger  27.7      49  0.0011   19.9   1.7   12   50-61     11-22  (55)
120 PF10276 zf-CHCC:  Zinc-finger   27.2      32 0.00069   19.2   0.8   12   24-35     28-39  (40)
121 PRK09678 DNA-binding transcrip  26.6      17 0.00037   23.1  -0.5   42   26-76      2-45  (72)
122 PF14311 DUF4379:  Domain of un  25.9      60  0.0013   18.9   1.9   14   26-39     29-42  (55)
123 PRK00762 hypA hydrogenase nick  25.8      34 0.00074   23.8   0.9   13   25-38     70-82  (124)
124 COG1675 TFA1 Transcription ini  25.8      46   0.001   24.9   1.6   14   25-38    113-126 (176)
125 PF01363 FYVE:  FYVE zinc finge  25.7      19 0.00041   21.9  -0.4   30   26-67     10-39  (69)
126 KOG0696 Serine/threonine prote  25.6      30 0.00066   29.9   0.7   59   24-99     72-130 (683)
127 PF11789 zf-Nse:  Zinc-finger o  25.5      37  0.0008   20.3   0.9   34   24-61     23-56  (57)
128 KOG2636 Splicing factor 3a, su  25.4      39 0.00084   29.0   1.3   24   23-46    399-423 (497)
129 PF04423 Rad50_zn_hook:  Rad50   25.3      28 0.00061   20.3   0.3   12   27-38     22-33  (54)
130 KOG0978 E3 ubiquitin ligase in  25.1      57  0.0012   29.6   2.3   11   26-36    644-654 (698)
131 PF11672 DUF3268:  Protein of u  25.1      24 0.00053   23.9   0.0   11   24-34      1-11  (102)
132 PF13878 zf-C2H2_3:  zinc-finge  24.9      71  0.0015   17.7   1.9   24   26-49     14-39  (41)
133 PF05191 ADK_lid:  Adenylate ki  24.7      40 0.00086   18.3   0.8   12   26-37      2-13  (36)
134 PF13451 zf-trcl:  Probable zin  24.2      35 0.00076   20.0   0.6   18   23-40      2-19  (49)
135 PLN02294 cytochrome c oxidase   23.8      34 0.00073   25.5   0.6   14   24-37    140-153 (174)
136 KOG2461 Transcription factor B  23.8 1.4E+02   0.003   25.3   4.2   15   41-55    347-361 (396)
137 PF11931 DUF3449:  Domain of un  23.6      27 0.00057   26.6   0.0   21   24-44    100-121 (196)
138 KOG4602 Nanos and related prot  23.6      48   0.001   26.5   1.4   12   23-34    231-242 (318)
139 PRK01343 zinc-binding protein;  22.7      62  0.0013   19.6   1.5   11   26-36     10-20  (57)
140 cd02335 ZZ_ADA2 Zinc finger, Z  22.7      47   0.001   19.0   1.0   28   52-81     14-42  (49)
141 COG4530 Uncharacterized protei  22.4      39 0.00084   23.4   0.6   25   27-62     11-35  (129)
142 cd00924 Cyt_c_Oxidase_Vb Cytoc  22.4      37 0.00079   22.8   0.5   14   24-37     78-91  (97)
143 PRK05978 hypothetical protein;  22.3      65  0.0014   23.4   1.8   11   92-102    54-64  (148)
144 COG1198 PriA Primosomal protei  21.9      86  0.0019   28.7   2.9   25   74-100   461-485 (730)
145 PTZ00303 phosphatidylinositol   21.8      70  0.0015   29.8   2.2   66   26-100   461-529 (1374)
146 PF06397 Desulfoferrod_N:  Desu  21.2      43 0.00093   18.3   0.5   12   24-35      5-16  (36)
147 COG1773 Rubredoxin [Energy pro  21.1      44 0.00096   20.1   0.6   15   25-39      3-17  (55)
148 PTZ00255 60S ribosomal protein  21.0      25 0.00055   23.3  -0.5    8   25-32     36-43  (90)
149 PF05741 zf-nanos:  Nanos RNA b  20.8      63  0.0014   19.4   1.2   23   12-34     14-42  (55)

No 1  
>KOG2462 consensus C2H2-type Zn-finger protein [Transcription]
Probab=99.82  E-value=2e-21  Score=150.70  Aligned_cols=92  Identities=17%  Similarity=0.330  Sum_probs=77.1

Q ss_pred             CCCcccccCCCCcceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCCCCCCCccCCCCChhhhccccccCCCCCCC
Q 044013           12 SSSDDETDAGTGRSYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSSISDKLTFFPSPSTWDDHQRSVHTNNNNHS   91 (153)
Q Consensus        12 ~~~h~~~~~~gekp~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~~Cgk~~~F~~~~~L~~H~~~~Ht~~kp~~   91 (153)
                      |..|.++|+   -|++|.+|||.|..++-|++|+|+|||||||.|+.|+++       |+..++|..||+ +|.+.|+| 
T Consensus       177 LkMHirTH~---l~c~C~iCGKaFSRPWLLQGHiRTHTGEKPF~C~hC~kA-------FADRSNLRAHmQ-THS~~K~~-  244 (279)
T KOG2462|consen  177 LKMHIRTHT---LPCECGICGKAFSRPWLLQGHIRTHTGEKPFSCPHCGKA-------FADRSNLRAHMQ-THSDVKKH-  244 (279)
T ss_pred             HhhHhhccC---CCcccccccccccchHHhhcccccccCCCCccCCcccch-------hcchHHHHHHHH-hhcCCccc-
Confidence            445555555   489999999999999999999999999999999955555       999999999999 99999999 


Q ss_pred             CCCCCccccccCCcccchhhhcCCC
Q 044013           92 NYDSYGQNYHQHLNMIGDEIWRGNR  116 (153)
Q Consensus        92 ~~~~~~~~f~~~~~~~~~~~~~~~~  116 (153)
                      .|..|+|+|++.. .++.+....++
T Consensus       245 qC~~C~KsFsl~S-yLnKH~ES~C~  268 (279)
T KOG2462|consen  245 QCPRCGKSFALKS-YLNKHSESACL  268 (279)
T ss_pred             cCcchhhHHHHHH-HHHHhhhhccc
Confidence            8999999996643 36666554443


No 2  
>KOG2462 consensus C2H2-type Zn-finger protein [Transcription]
Probab=99.63  E-value=1.5e-16  Score=123.69  Aligned_cols=85  Identities=22%  Similarity=0.416  Sum_probs=72.5

Q ss_pred             CCCCCcccccCC--CCcceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCCCCCCCccCCCCChhhhccccccCCC
Q 044013           10 SKSSSDDETDAG--TGRSYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSSISDKLTFFPSPSTWDDHQRSVHTNN   87 (153)
Q Consensus        10 ~~~~~h~~~~~~--gekp~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~~Cgk~~~F~~~~~L~~H~~~~Ht~~   87 (153)
                      +.|+.|...|-.  ..+.+.|.+|+|.|....+|+.|+|+|+  -|++|.     +|||+  |+++=-|+.|+| +||||
T Consensus       144 snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~--l~c~C~-----iCGKa--FSRPWLLQGHiR-THTGE  213 (279)
T KOG2462|consen  144 SNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT--LPCECG-----ICGKA--FSRPWLLQGHIR-THTGE  213 (279)
T ss_pred             cccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC--CCcccc-----ccccc--ccchHHhhcccc-cccCC
Confidence            457788765554  3577999999999999999999999999  578888     56666  999999999999 99999


Q ss_pred             CCCCCCCCCccccccCCc
Q 044013           88 NNHSNYDSYGQNYHQHLN  105 (153)
Q Consensus        88 kp~~~~~~~~~~f~~~~~  105 (153)
                      ||| .|..|+|+|....+
T Consensus       214 KPF-~C~hC~kAFADRSN  230 (279)
T KOG2462|consen  214 KPF-SCPHCGKAFADRSN  230 (279)
T ss_pred             CCc-cCCcccchhcchHH
Confidence            999 89999999965444


No 3  
>KOG3623 consensus Homeobox transcription factor SIP1 [Transcription]
Probab=99.42  E-value=5.5e-14  Score=120.94  Aligned_cols=78  Identities=19%  Similarity=0.364  Sum_probs=72.2

Q ss_pred             CcceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCCCCCCCccCCCCChhhhccccccCCCCCCCCCCCCcccccc
Q 044013           23 GRSYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSSISDKLTFFPSPSTWDDHQRSVHTNNNNHSNYDSYGQNYHQ  102 (153)
Q Consensus        23 ekp~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~~Cgk~~~F~~~~~L~~H~~~~Ht~~kp~~~~~~~~~~f~~  102 (153)
                      .-+|.|+.|.|.|...+.|.+|.--|+|.+||+|.     ||.|+  |..+..|..|+| .|.|+||| .|+-|++.|+.
T Consensus       892 ~gmyaCDqCDK~FqKqSSLaRHKYEHsGqRPyqC~-----iCkKA--FKHKHHLtEHkR-LHSGEKPf-QCdKClKRFSH  962 (1007)
T KOG3623|consen  892 DGMYACDQCDKAFQKQSSLARHKYEHSGQRPYQCI-----ICKKA--FKHKHHLTEHKR-LHSGEKPF-QCDKCLKRFSH  962 (1007)
T ss_pred             cccchHHHHHHHHHhhHHHHHhhhhhcCCCCcccc-----hhhHh--hhhhhhhhhhhh-hccCCCcc-hhhhhhhhccc
Confidence            35899999999999999999999999999999999     66666  999999999999 99999999 79999999998


Q ss_pred             CCcccchh
Q 044013          103 HLNMIGDE  110 (153)
Q Consensus       103 ~~~~~~~~  110 (153)
                      +++ +.++
T Consensus       963 SGS-YSQH  969 (1007)
T KOG3623|consen  963 SGS-YSQH  969 (1007)
T ss_pred             ccc-hHhh
Confidence            888 6666


No 4  
>KOG3576 consensus Ovo and related transcription factors [Transcription]
Probab=99.32  E-value=1.2e-13  Score=103.60  Aligned_cols=78  Identities=17%  Similarity=0.300  Sum_probs=70.1

Q ss_pred             CcceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCCCCCCCccCCCCChhhhccccccCCCCCCCCCCCCcccccc
Q 044013           23 GRSYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSSISDKLTFFPSPSTWDDHQRSVHTNNNNHSNYDSYGQNYHQ  102 (153)
Q Consensus        23 ekp~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~~Cgk~~~F~~~~~L~~H~~~~Ht~~kp~~~~~~~~~~f~~  102 (153)
                      ...|.|.+|+|.|....-|.+|+..|...|.|.|..||+.       |.....|++|++ +|||.+|| .|..|.++|.+
T Consensus       115 ~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkg-------fndtfdlkrh~r-thtgvrpy-kc~~c~kaftq  185 (267)
T KOG3576|consen  115 QDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKG-------FNDTFDLKRHTR-THTGVRPY-KCSLCEKAFTQ  185 (267)
T ss_pred             CCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCc-------ccchhhhhhhhc-cccCcccc-chhhhhHHHHh
Confidence            3569999999999999999999999999999999966655       999999999999 99999999 89999999988


Q ss_pred             CCcccchh
Q 044013          103 HLNMIGDE  110 (153)
Q Consensus       103 ~~~~~~~~  110 (153)
                      ..+ +..+
T Consensus       186 rcs-lesh  192 (267)
T KOG3576|consen  186 RCS-LESH  192 (267)
T ss_pred             hcc-HHHH
Confidence            777 4443


No 5  
>KOG3623 consensus Homeobox transcription factor SIP1 [Transcription]
Probab=99.12  E-value=1e-11  Score=107.22  Aligned_cols=63  Identities=19%  Similarity=0.484  Sum_probs=56.1

Q ss_pred             CCCCcccccCCCCcceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCCCCCCCccCCCCChhhhccc
Q 044013           11 KSSSDDETDAGTGRSYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSSISDKLTFFPSPSTWDDHQR   81 (153)
Q Consensus        11 ~~~~h~~~~~~gekp~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~~Cgk~~~F~~~~~L~~H~~   81 (153)
                      .|.+|.=.|++ .|||+|.+|.|+|..+-.|..|+|.|.|||||.|..|++.       |+...+...||.
T Consensus       909 SLaRHKYEHsG-qRPyqC~iCkKAFKHKHHLtEHkRLHSGEKPfQCdKClKR-------FSHSGSYSQHMN  971 (1007)
T KOG3623|consen  909 SLARHKYEHSG-QRPYQCIICKKAFKHKHHLTEHKRLHSGEKPFQCDKCLKR-------FSHSGSYSQHMN  971 (1007)
T ss_pred             HHHHhhhhhcC-CCCcccchhhHhhhhhhhhhhhhhhccCCCcchhhhhhhh-------cccccchHhhhc
Confidence            35555545555 9999999999999999999999999999999999988888       999999999996


No 6  
>KOG1074 consensus Transcriptional repressor SALM [Transcription]
Probab=99.07  E-value=6.5e-11  Score=103.47  Aligned_cols=78  Identities=17%  Similarity=0.339  Sum_probs=65.4

Q ss_pred             cceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCCCCCCCccCCCCChhhhccccccCCCCCCC---CCC---CCc
Q 044013           24 RSYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSSISDKLTFFPSPSTWDDHQRSVHTNNNNHS---NYD---SYG   97 (153)
Q Consensus        24 kp~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~~Cgk~~~F~~~~~L~~H~~~~Ht~~kp~~---~~~---~~~   97 (153)
                      -|-+|.+|.+..+-+++|+.|.|+|+||+||+|.     |||++  |+++.+|+.||. +|....|+.   +|.   +|-
T Consensus       604 dPNqCiiC~rVlSC~saLqmHyrtHtGERPFkCK-----iCgRA--FtTkGNLkaH~~-vHka~p~~R~q~ScP~~~ic~  675 (958)
T KOG1074|consen  604 DPNQCIICLRVLSCPSALQMHYRTHTGERPFKCK-----ICGRA--FTTKGNLKAHMS-VHKAKPPARVQFSCPSTFICQ  675 (958)
T ss_pred             CccceeeeeecccchhhhhhhhhcccCcCccccc-----cccch--hccccchhhccc-ccccCccccccccCCchhhhc
Confidence            4789999999999999999999999999999999     66666  999999999999 998877653   677   888


Q ss_pred             cccccCCcccchh
Q 044013           98 QNYHQHLNMIGDE  110 (153)
Q Consensus        98 ~~f~~~~~~~~~~  110 (153)
                      +.|..... +.++
T Consensus       676 ~kftn~V~-lpQh  687 (958)
T KOG1074|consen  676 KKFTNAVT-LPQH  687 (958)
T ss_pred             cccccccc-ccce
Confidence            88854443 4444


No 7  
>PHA02768 hypothetical protein; Provisional
Probab=99.07  E-value=9.5e-11  Score=70.74  Aligned_cols=44  Identities=14%  Similarity=0.326  Sum_probs=38.3

Q ss_pred             ceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCCCCCCCccCCCCChhh
Q 044013           25 SYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSSISDKLTFFPSPSTWD   77 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~~Cgk~~~F~~~~~L~   77 (153)
                      -|+|++||+.|+..++|..|+++|+  +||+|..     |+|+  |.+.+.|.
T Consensus         5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~-----C~k~--f~~~s~l~   48 (55)
T PHA02768          5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSN-----CKRI--SLRTGEYI   48 (55)
T ss_pred             ccCcchhCCeeccHHHHHHHHHhcC--CcccCCc-----ccce--ecccceeE
Confidence            4899999999999999999999999  7999984     4555  99888775


No 8  
>KOG1074 consensus Transcriptional repressor SALM [Transcription]
Probab=98.95  E-value=4.6e-10  Score=98.25  Aligned_cols=57  Identities=21%  Similarity=0.363  Sum_probs=51.5

Q ss_pred             ceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCCCCCCCccCCCCChhhhccccccCCCCC
Q 044013           25 SYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSSISDKLTFFPSPSTWDDHQRSVHTNNNN   89 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~~Cgk~~~F~~~~~L~~H~~~~Ht~~kp   89 (153)
                      +++|.+|+|.|...++|+.|.|.|||++||+|.+|+.+       |+++.+|+.|.. .|+-+-|
T Consensus       353 khkCr~CakvfgS~SaLqiHlRSHTGERPfqCnvCG~~-------FSTkGNLKvH~~-rH~e~~p  409 (958)
T KOG1074|consen  353 KHKCRFCAKVFGSDSALQIHLRSHTGERPFQCNVCGNR-------FSTKGNLKVHFQ-RHREKYP  409 (958)
T ss_pred             cchhhhhHhhcCchhhhhhhhhccCCCCCeeecccccc-------cccccceeeeee-eccccCC
Confidence            47899999999999999999999999999999977777       999999999998 7776554


No 9  
>KOG3576 consensus Ovo and related transcription factors [Transcription]
Probab=98.95  E-value=3.2e-11  Score=90.73  Aligned_cols=69  Identities=19%  Similarity=0.342  Sum_probs=57.0

Q ss_pred             CCCcccccCCCCcceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCCCCCCCccCCCCChhhhccccccCCCC
Q 044013           12 SSSDDETDAGTGRSYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSSISDKLTFFPSPSTWDDHQRSVHTNNN   88 (153)
Q Consensus        12 ~~~h~~~~~~gekp~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~~Cgk~~~F~~~~~L~~H~~~~Ht~~k   88 (153)
                      |..|++-+.. .|.|.|..|||.|...-.|++|+|+|||.+||+|..|+++       |.+..+|..|.+.+|.-..
T Consensus       133 lnrh~kch~~-vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~ka-------ftqrcsleshl~kvhgv~~  201 (267)
T KOG3576|consen  133 LNRHLKCHSD-VKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKA-------FTQRCSLESHLKKVHGVQH  201 (267)
T ss_pred             HHHHhhhccH-HHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHH-------HHhhccHHHHHHHHcCchH
Confidence            3445555544 5679999999999999999999999999999999966666       9999999999875776443


No 10 
>PHA00733 hypothetical protein
Probab=98.65  E-value=2e-08  Score=71.27  Aligned_cols=72  Identities=13%  Similarity=0.118  Sum_probs=51.1

Q ss_pred             CcceeccccccccCChhHHHHH--HH---hhcCCCCCCCCCCCCCCCCCCccCCCCChhhhccccccCCCCCCCCCCCCc
Q 044013           23 GRSYECVFCKRGFTTAQALGGH--MN---IHRKDRAKNRPPSSSSISDKLTFFPSPSTWDDHQRSVHTNNNNHSNYDSYG   97 (153)
Q Consensus        23 ekp~~C~~Cgk~F~~~~~L~~H--~r---~H~gekp~~C~~c~~~~Cgk~~~F~~~~~L~~H~~~~Ht~~kp~~~~~~~~   97 (153)
                      .+++.|.+|.+.|..+..|..+  ++   .+.+.+||.|+.|++.       |.....|..|++ +|  ..+| .|..|+
T Consensus        38 ~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~-------Fss~s~L~~H~r-~h--~~~~-~C~~Cg  106 (128)
T PHA00733         38 QKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMP-------FSSSVSLKQHIR-YT--EHSK-VCPVCG  106 (128)
T ss_pred             hhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCc-------CCCHHHHHHHHh-cC--CcCc-cCCCCC
Confidence            4678888888888877766655  21   2234778888855555       888888888888 66  3466 788888


Q ss_pred             cccccCCc
Q 044013           98 QNYHQHLN  105 (153)
Q Consensus        98 ~~f~~~~~  105 (153)
                      +.|.....
T Consensus       107 K~F~~~~s  114 (128)
T PHA00733        107 KEFRNTDS  114 (128)
T ss_pred             CccCCHHH
Confidence            88865554


No 11 
>KOG3608 consensus Zn finger proteins [General function prediction only]
Probab=98.62  E-value=4.1e-09  Score=85.03  Aligned_cols=104  Identities=13%  Similarity=0.200  Sum_probs=84.1

Q ss_pred             CCCCCcccccCCCCcceeccccccccCChhHHHHHHHhhcCCCCCCCCC--CCCCCCCCCccCCCCChhhhccccccCCC
Q 044013           10 SKSSSDDETDAGTGRSYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPP--SSSSISDKLTFFPSPSTWDDHQRSVHTNN   87 (153)
Q Consensus        10 ~~~~~h~~~~~~gekp~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~--c~~~~Cgk~~~F~~~~~L~~H~~~~Ht~~   87 (153)
                      +.|..|++.+++..|||+|+.|.+.|.+.+.|..|..+|+ +..|.|..  |..+       |.+..++.+|++.+|.|.
T Consensus       277 ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~~HS-~~~y~C~h~~C~~s-------~r~~~q~~~H~~evhEg~  348 (467)
T KOG3608|consen  277 SSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQVHS-KTVYQCEHPDCHYS-------VRTYTQMRRHFLEVHEGN  348 (467)
T ss_pred             HHHHHHHHhhhccCCCccccchhhhhccHHHHHHHHHhcc-ccceecCCCCCcHH-------HHHHHHHHHHHHHhccCC
Confidence            4467788888888999999999999999999999999999 77899986  5555       899999999998788777


Q ss_pred             CC--CCCCCCCccccccCCcccchhhhc-CCCCcccCCC
Q 044013           88 NN--HSNYDSYGQNYHQHLNMIGDEIWR-GNRSLRVGPS  123 (153)
Q Consensus        88 kp--~~~~~~~~~~f~~~~~~~~~~~~~-~~~sl~~~~~  123 (153)
                      .|  | .|-.|.|.|.++.+ +..++.. ..+.+-.|-.
T Consensus       349 np~~Y-~CH~Cdr~ft~G~~-L~~HL~kkH~f~~PsGh~  385 (467)
T KOG3608|consen  349 NPILY-ACHCCDRFFTSGKS-LSAHLMKKHGFRLPSGHK  385 (467)
T ss_pred             CCCce-eeecchhhhccchh-HHHHHHHhhcccCCCCCC
Confidence            65  6 78999999987777 6665444 4555555543


No 12 
>PHA00733 hypothetical protein
Probab=98.52  E-value=2.9e-08  Score=70.43  Aligned_cols=52  Identities=21%  Similarity=0.379  Sum_probs=45.9

Q ss_pred             CCCcceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCCCCCCCccCCCCChhhhccc
Q 044013           21 GTGRSYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSSISDKLTFFPSPSTWDDHQR   81 (153)
Q Consensus        21 ~gekp~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~~Cgk~~~F~~~~~L~~H~~   81 (153)
                      ++.+||.|+.|++.|.....|..|+++|  +.+|.|+.|++.       |.....|..|+.
T Consensus        69 ~~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~-------F~~~~sL~~H~~  120 (128)
T PHA00733         69 KAVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKE-------FRNTDSTLDHVC  120 (128)
T ss_pred             CCCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCc-------cCCHHHHHHHHH
Confidence            3478999999999999999999999987  468999955555       999999999987


No 13 
>PHA00616 hypothetical protein
Probab=98.50  E-value=4.8e-08  Score=56.31  Aligned_cols=33  Identities=18%  Similarity=0.303  Sum_probs=31.1

Q ss_pred             ceeccccccccCChhHHHHHHHhhcCCCCCCCC
Q 044013           25 SYECVFCKRGFTTAQALGGHMNIHRKDRAKNRP   57 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~   57 (153)
                      ||+|..||+.|...+.|..|++.|+|++|+.|+
T Consensus         1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~   33 (44)
T PHA00616          1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLE   33 (44)
T ss_pred             CCccchhhHHHhhHHHHHHHHHHhcCCCcccee
Confidence            789999999999999999999999999999886


No 14 
>PF13465 zf-H2C2_2:  Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.50  E-value=2.7e-08  Score=51.37  Aligned_cols=23  Identities=26%  Similarity=0.377  Sum_probs=18.2

Q ss_pred             HHHHHHHhhcCCCCCCCCCCCCC
Q 044013           40 ALGGHMNIHRKDRAKNRPPSSSS   62 (153)
Q Consensus        40 ~L~~H~r~H~gekp~~C~~c~~~   62 (153)
                      +|..|+++|++++||.|+.|++.
T Consensus         1 ~l~~H~~~H~~~k~~~C~~C~k~   23 (26)
T PF13465_consen    1 NLRRHMRTHTGEKPYKCPYCGKS   23 (26)
T ss_dssp             HHHHHHHHHSSSSSEEESSSSEE
T ss_pred             CHHHHhhhcCCCCCCCCCCCcCe
Confidence            47888888888888888855555


No 15 
>PHA02768 hypothetical protein; Provisional
Probab=98.40  E-value=1.1e-07  Score=57.48  Aligned_cols=42  Identities=12%  Similarity=0.074  Sum_probs=35.0

Q ss_pred             CCCCCCCCCCCCCCCccCCCCChhhhccccccCCCCCCCCCCCCccccccCCc
Q 044013           53 AKNRPPSSSSISDKLTFFPSPSTWDDHQRSVHTNNNNHSNYDSYGQNYHQHLN  105 (153)
Q Consensus        53 p~~C~~c~~~~Cgk~~~F~~~~~L~~H~~~~Ht~~kp~~~~~~~~~~f~~~~~  105 (153)
                      .|.|+.|++.       |+..++|..|++ +|+  +|| .|..|++.|++...
T Consensus         5 ~y~C~~CGK~-------Fs~~~~L~~H~r-~H~--k~~-kc~~C~k~f~~~s~   46 (55)
T PHA02768          5 GYECPICGEI-------YIKRKSMITHLR-KHN--TNL-KLSNCKRISLRTGE   46 (55)
T ss_pred             ccCcchhCCe-------eccHHHHHHHHH-hcC--Ccc-cCCcccceecccce
Confidence            4789955555       999999999999 999  688 78999999976544


No 16 
>PHA00732 hypothetical protein
Probab=98.26  E-value=7.2e-07  Score=58.11  Aligned_cols=44  Identities=23%  Similarity=0.348  Sum_probs=36.5

Q ss_pred             ceeccccccccCChhHHHHHHH-hhcCCCCCCCCCCCCCCCCCCccCCCCChhhhccc
Q 044013           25 SYECVFCKRGFTTAQALGGHMN-IHRKDRAKNRPPSSSSISDKLTFFPSPSTWDDHQR   81 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~~L~~H~r-~H~gekp~~C~~c~~~~Cgk~~~F~~~~~L~~H~~   81 (153)
                      ||.|..|++.|.....|..|++ .|+   ++.|+.|++.       |.   .|..|+.
T Consensus         1 py~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKs-------F~---~l~~H~~   45 (79)
T PHA00732          1 MFKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKS-------YR---RLNQHFY   45 (79)
T ss_pred             CccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCE-------eC---Chhhhhc
Confidence            6899999999999999999998 466   4679966666       87   5888876


No 17 
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.26  E-value=9.5e-07  Score=76.07  Aligned_cols=67  Identities=13%  Similarity=0.191  Sum_probs=55.7

Q ss_pred             CcceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCCCCCCCccCCCCChhhhccccccCCCCCCCCCCCCcccccc
Q 044013           23 GRSYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSSISDKLTFFPSPSTWDDHQRSVHTNNNNHSNYDSYGQNYHQ  102 (153)
Q Consensus        23 ekp~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~~Cgk~~~F~~~~~L~~H~~~~Ht~~kp~~~~~~~~~~f~~  102 (153)
                      ++++.|+.|++.|. ...|..|+.+|+  +|+.|+ |+..       + ....|..|+. +|..++|+ .|.+|+..|.+
T Consensus       451 ~~H~~C~~Cgk~f~-~s~LekH~~~~H--kpv~Cp-Cg~~-------~-~R~~L~~H~~-thCp~Kpi-~C~fC~~~v~~  516 (567)
T PLN03086        451 KNHVHCEKCGQAFQ-QGEMEKHMKVFH--EPLQCP-CGVV-------L-EKEQMVQHQA-STCPLRLI-TCRFCGDMVQA  516 (567)
T ss_pred             ccCccCCCCCCccc-hHHHHHHHHhcC--CCccCC-CCCC-------c-chhHHHhhhh-ccCCCCce-eCCCCCCcccc
Confidence            45678999999996 577999999985  899998 7754       6 5578999998 99999999 79999999854


Q ss_pred             C
Q 044013          103 H  103 (153)
Q Consensus       103 ~  103 (153)
                      +
T Consensus       517 g  517 (567)
T PLN03086        517 G  517 (567)
T ss_pred             C
Confidence            3


No 18 
>KOG3608 consensus Zn finger proteins [General function prediction only]
Probab=98.13  E-value=3.8e-07  Score=73.85  Aligned_cols=68  Identities=22%  Similarity=0.369  Sum_probs=49.6

Q ss_pred             cceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCCCCCCCccCCCCChhhhccccccCCCCCCCCCCCCccccc
Q 044013           24 RSYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSSISDKLTFFPSPSTWDDHQRSVHTNNNNHSNYDSYGQNYH  101 (153)
Q Consensus        24 kp~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~~Cgk~~~F~~~~~L~~H~~~~Ht~~kp~~~~~~~~~~f~  101 (153)
                      .+|.|..|.|.|.+...|..|++.|.  .-|+|+.|...       .+..++|..|++..|...||| .|+.|...|-
T Consensus       236 n~fqC~~C~KrFaTeklL~~Hv~rHv--n~ykCplCdmt-------c~~~ssL~~H~r~rHs~dkpf-KCd~Cd~~c~  303 (467)
T KOG3608|consen  236 NSFQCAQCFKRFATEKLLKSHVVRHV--NCYKCPLCDMT-------CSSASSLTTHIRYRHSKDKPF-KCDECDTRCV  303 (467)
T ss_pred             CchHHHHHHHHHhHHHHHHHHHHHhh--hcccccccccC-------CCChHHHHHHHHhhhccCCCc-cccchhhhhc
Confidence            46666666666666666666666654  25677766666       788888888888568888888 7888888873


No 19 
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.10  E-value=1e-06  Score=75.86  Aligned_cols=74  Identities=11%  Similarity=0.113  Sum_probs=59.1

Q ss_pred             CCCcccccCCCCcceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCCCCCCCccCCC----------CChhhhccc
Q 044013           12 SSSDDETDAGTGRSYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSSISDKLTFFPS----------PSTWDDHQR   81 (153)
Q Consensus        12 ~~~h~~~~~~gekp~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~~Cgk~~~F~~----------~~~L~~H~~   81 (153)
                      |..|++..+   +|+.|+ ||+.+ ....|..|+++|...+|+.|+.|+..       |..          ...|..|..
T Consensus       468 LekH~~~~H---kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~-------v~~g~~~~d~~d~~s~Lt~HE~  535 (567)
T PLN03086        468 MEKHMKVFH---EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDM-------VQAGGSAMDVRDRLRGMSEHES  535 (567)
T ss_pred             HHHHHHhcC---CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCc-------cccCccccchhhhhhhHHHHHH
Confidence            445555542   789999 99765 66899999999999999999966666       742          347999998


Q ss_pred             cccCCCCCCCCCCCCcccc
Q 044013           82 SVHTNNNNHSNYDSYGQNY  100 (153)
Q Consensus        82 ~~Ht~~kp~~~~~~~~~~f  100 (153)
                       +. |.+++ .|..|++.+
T Consensus       536 -~C-G~rt~-~C~~Cgk~V  551 (567)
T PLN03086        536 -IC-GSRTA-PCDSCGRSV  551 (567)
T ss_pred             -hc-CCcce-EccccCCee
Confidence             64 88998 789999997


No 20 
>PF13465 zf-H2C2_2:  Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.09  E-value=5.5e-07  Score=46.36  Aligned_cols=25  Identities=20%  Similarity=0.368  Sum_probs=23.0

Q ss_pred             hhhhccccccCCCCCCCCCCCCccccc
Q 044013           75 TWDDHQRSVHTNNNNHSNYDSYGQNYH  101 (153)
Q Consensus        75 ~L~~H~~~~Ht~~kp~~~~~~~~~~f~  101 (153)
                      +|..|++ +|++++|| .|..|+++|.
T Consensus         1 ~l~~H~~-~H~~~k~~-~C~~C~k~F~   25 (26)
T PF13465_consen    1 NLRRHMR-THTGEKPY-KCPYCGKSFS   25 (26)
T ss_dssp             HHHHHHH-HHSSSSSE-EESSSSEEES
T ss_pred             CHHHHhh-hcCCCCCC-CCCCCcCeeC
Confidence            5889999 99999999 8999999984


No 21 
>PF00096 zf-C2H2:  Zinc finger, C2H2 type;  InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain.  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=98.02  E-value=4.2e-06  Score=41.44  Aligned_cols=23  Identities=35%  Similarity=0.852  Sum_probs=21.4

Q ss_pred             eeccccccccCChhHHHHHHHhh
Q 044013           26 YECVFCKRGFTTAQALGGHMNIH   48 (153)
Q Consensus        26 ~~C~~Cgk~F~~~~~L~~H~r~H   48 (153)
                      |.|++|++.|..+..|..|++.|
T Consensus         1 y~C~~C~~~f~~~~~l~~H~~~H   23 (23)
T PF00096_consen    1 YKCPICGKSFSSKSNLKRHMRRH   23 (23)
T ss_dssp             EEETTTTEEESSHHHHHHHHHHH
T ss_pred             CCCCCCCCccCCHHHHHHHHhHC
Confidence            78999999999999999999875


No 22 
>PF13912 zf-C2H2_6:  C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.63  E-value=3.8e-05  Score=39.41  Aligned_cols=25  Identities=32%  Similarity=0.618  Sum_probs=23.0

Q ss_pred             ceeccccccccCChhHHHHHHHhhc
Q 044013           25 SYECVFCKRGFTTAQALGGHMNIHR   49 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~~L~~H~r~H~   49 (153)
                      ||+|..|++.|.+...|..|++.|.
T Consensus         1 ~~~C~~C~~~F~~~~~l~~H~~~h~   25 (27)
T PF13912_consen    1 PFECDECGKTFSSLSALREHKRSHC   25 (27)
T ss_dssp             SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred             CCCCCccCCccCChhHHHHHhHHhc
Confidence            6899999999999999999998775


No 23 
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.59  E-value=3e-05  Score=62.23  Aligned_cols=52  Identities=21%  Similarity=0.459  Sum_probs=39.8

Q ss_pred             Ccceeccc--cccccCChhHHHHHHH-hh------------------cCCCCCCCCCCCCCCCCCCccCCCCChhhhccc
Q 044013           23 GRSYECVF--CKRGFTTAQALGGHMN-IH------------------RKDRAKNRPPSSSSISDKLTFFPSPSTWDDHQR   81 (153)
Q Consensus        23 ekp~~C~~--Cgk~F~~~~~L~~H~r-~H------------------~gekp~~C~~c~~~~Cgk~~~F~~~~~L~~H~~   81 (153)
                      +|||+|++  |.|.|++...|+-|+. -|                  ...|||+|++|++.       |....-|+.|..
T Consensus       347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KR-------YKNlNGLKYHr~  419 (423)
T COG5189         347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKR-------YKNLNGLKYHRK  419 (423)
T ss_pred             CceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchh-------hccCccceeccc
Confidence            59999997  9999999999999975 23                  12378888866666       777777777754


No 24 
>PF13894 zf-C2H2_4:  C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.52  E-value=8e-05  Score=36.58  Aligned_cols=23  Identities=30%  Similarity=0.827  Sum_probs=19.2

Q ss_pred             eeccccccccCChhHHHHHHHhh
Q 044013           26 YECVFCKRGFTTAQALGGHMNIH   48 (153)
Q Consensus        26 ~~C~~Cgk~F~~~~~L~~H~r~H   48 (153)
                      |.|++|++.|.+...|..|+++|
T Consensus         1 ~~C~~C~~~~~~~~~l~~H~~~~   23 (24)
T PF13894_consen    1 FQCPICGKSFRSKSELRQHMRTH   23 (24)
T ss_dssp             EE-SSTS-EESSHHHHHHHHHHH
T ss_pred             CCCcCCCCcCCcHHHHHHHHHhh
Confidence            78999999999999999999876


No 25 
>PF05605 zf-Di19:  Drought induced 19 protein (Di19), zinc-binding;  InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.48  E-value=0.0001  Score=44.39  Aligned_cols=47  Identities=26%  Similarity=0.467  Sum_probs=36.2

Q ss_pred             ceeccccccccCChhHHHHHHH-hhcCC-CCCCCCCCCCCCCCCCccCCCCChhhhccc
Q 044013           25 SYECVFCKRGFTTAQALGGHMN-IHRKD-RAKNRPPSSSSISDKLTFFPSPSTWDDHQR   81 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~~L~~H~r-~H~ge-kp~~C~~c~~~~Cgk~~~F~~~~~L~~H~~   81 (153)
                      .|.|++|++. .....|..|.. .|..+ +.+.||.|...       +..  +|..|+.
T Consensus         2 ~f~CP~C~~~-~~~~~L~~H~~~~H~~~~~~v~CPiC~~~-------~~~--~l~~Hl~   50 (54)
T PF05605_consen    2 SFTCPYCGKG-FSESSLVEHCEDEHRSESKNVVCPICSSR-------VTD--NLIRHLN   50 (54)
T ss_pred             CcCCCCCCCc-cCHHHHHHHHHhHCcCCCCCccCCCchhh-------hhh--HHHHHHH
Confidence            4899999994 55788999977 46654 57999977655       553  8999987


No 26 
>PHA00616 hypothetical protein
Probab=97.47  E-value=1.1e-05  Score=46.46  Aligned_cols=34  Identities=9%  Similarity=-0.018  Sum_probs=28.7

Q ss_pred             CCCCCCCCCCCCCCCccCCCCChhhhccccccCCCCCCCCCCC
Q 044013           53 AKNRPPSSSSISDKLTFFPSPSTWDDHQRSVHTNNNNHSNYDS   95 (153)
Q Consensus        53 p~~C~~c~~~~Cgk~~~F~~~~~L~~H~~~~Ht~~kp~~~~~~   95 (153)
                      ||+|+.||+.       |...++|..|++ .|+|++|+ .++.
T Consensus         1 pYqC~~CG~~-------F~~~s~l~~H~r-~~hg~~~~-~~~~   34 (44)
T PHA00616          1 MYQCLRCGGI-------FRKKKEVIEHLL-SVHKQNKL-TLEY   34 (44)
T ss_pred             CCccchhhHH-------HhhHHHHHHHHH-HhcCCCcc-ceeE
Confidence            7899955555       999999999999 99999998 5543


No 27 
>KOG3993 consensus Transcription factor (contains Zn finger) [Transcription]
Probab=97.25  E-value=3.7e-05  Score=63.67  Aligned_cols=70  Identities=20%  Similarity=0.249  Sum_probs=51.3

Q ss_pred             CCCCCCCCcccccCCCCcceeccccccccCChhHHHHHHHhhc--------CCCC-------------------------
Q 044013            7 EDYSKSSSDDETDAGTGRSYECVFCKRGFTTAQALGGHMNIHR--------KDRA-------------------------   53 (153)
Q Consensus         7 ~~~~~~~~h~~~~~~gekp~~C~~Cgk~F~~~~~L~~H~r~H~--------gekp-------------------------   53 (153)
                      +|.+.|..|.=.++- .--|+|+.|+|.|+-+.+|..|.|-|.        +..|                         
T Consensus       278 eD~F~LAQHrC~RIV-~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea~rsg~dss~gi  356 (500)
T KOG3993|consen  278 EDAFALAQHRCPRIV-HVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEAERSGDDSSSGI  356 (500)
T ss_pred             hhHHHHhhccCCeeE-EeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhccccCCcccCce
Confidence            355677777655554 335999999999999999999999873        2222                         


Q ss_pred             CCCCCCCCCCCCCCccCCCCChhhhccccccC
Q 044013           54 KNRPPSSSSISDKLTFFPSPSTWDDHQRSVHT   85 (153)
Q Consensus        54 ~~C~~c~~~~Cgk~~~F~~~~~L~~H~~~~Ht   85 (153)
                      |.|.     +|+|.  |.+...|+.|+. +|.
T Consensus       357 ~~C~-----~C~Kk--FrRqAYLrKHql-thq  380 (500)
T KOG3993|consen  357 FSCH-----TCGKK--FRRQAYLRKHQL-THQ  380 (500)
T ss_pred             eecH-----Hhhhh--hHHHHHHHHhHH-hhh
Confidence            5566     55555  999999999987 543


No 28 
>PF09237 GAGA:  GAGA factor;  InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=97.18  E-value=0.00054  Score=40.60  Aligned_cols=37  Identities=14%  Similarity=0.371  Sum_probs=26.0

Q ss_pred             cccCCCCcceeccccccccCChhHHHHHHHhhcCCCC
Q 044013           17 ETDAGTGRSYECVFCKRGFTTAQALGGHMNIHRKDRA   53 (153)
Q Consensus        17 ~~~~~gekp~~C~~Cgk~F~~~~~L~~H~r~H~gekp   53 (153)
                      +.+...+.|..|++|+..++...+|++|+.++.+.||
T Consensus        16 k~~~~S~~PatCP~C~a~~~~srnLrRHle~~H~~k~   52 (54)
T PF09237_consen   16 KSKSQSEQPATCPICGAVIRQSRNLRRHLEIRHFKKP   52 (54)
T ss_dssp             CCCCTTS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred             HHhhccCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence            3445557899999999999999999999998888776


No 29 
>smart00355 ZnF_C2H2 zinc finger.
Probab=97.11  E-value=0.00041  Score=34.36  Aligned_cols=24  Identities=38%  Similarity=0.728  Sum_probs=20.4

Q ss_pred             eeccccccccCChhHHHHHHHhhc
Q 044013           26 YECVFCKRGFTTAQALGGHMNIHR   49 (153)
Q Consensus        26 ~~C~~Cgk~F~~~~~L~~H~r~H~   49 (153)
                      |.|..|++.|.....|..|+++|.
T Consensus         1 ~~C~~C~~~f~~~~~l~~H~~~H~   24 (26)
T smart00355        1 YRCPECGKVFKSKSALKEHMRTHX   24 (26)
T ss_pred             CCCCCCcchhCCHHHHHHHHHHhc
Confidence            578889999999999999988765


No 30 
>PHA00732 hypothetical protein
Probab=97.03  E-value=0.00042  Score=45.08  Aligned_cols=39  Identities=15%  Similarity=0.144  Sum_probs=31.0

Q ss_pred             CCCCCCCCCCCCCCCccCCCCChhhhccccccCCCCCCCCCCCCcccccc
Q 044013           53 AKNRPPSSSSISDKLTFFPSPSTWDDHQRSVHTNNNNHSNYDSYGQNYHQ  102 (153)
Q Consensus        53 p~~C~~c~~~~Cgk~~~F~~~~~L~~H~~~~Ht~~kp~~~~~~~~~~f~~  102 (153)
                      ||.|..|++.       |.+..+|..|++..|++   + .|..|+++|.+
T Consensus         1 py~C~~Cgk~-------F~s~s~Lk~H~r~~H~~---~-~C~~CgKsF~~   39 (79)
T PHA00732          1 MFKCPICGFT-------TVTLFALKQHARRNHTL---T-KCPVCNKSYRR   39 (79)
T ss_pred             CccCCCCCCc-------cCCHHHHHHHhhcccCC---C-ccCCCCCEeCC
Confidence            6889966555       99999999999723654   4 69999999953


No 31 
>PRK04860 hypothetical protein; Provisional
Probab=97.00  E-value=0.00039  Score=51.11  Aligned_cols=36  Identities=14%  Similarity=0.309  Sum_probs=30.3

Q ss_pred             CcceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCC
Q 044013           23 GRSYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSS   62 (153)
Q Consensus        23 ekp~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~   62 (153)
                      .-+|.|. |++   ....+.+|.++|+++++|.|..|+..
T Consensus       117 ~~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~  152 (160)
T PRK04860        117 TFPYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGET  152 (160)
T ss_pred             EEEEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCce
Confidence            3579998 987   56678999999999999999966665


No 32 
>PF12874 zf-met:  Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.66  E-value=0.0013  Score=32.91  Aligned_cols=23  Identities=22%  Similarity=0.712  Sum_probs=20.9

Q ss_pred             eeccccccccCChhHHHHHHHhh
Q 044013           26 YECVFCKRGFTTAQALGGHMNIH   48 (153)
Q Consensus        26 ~~C~~Cgk~F~~~~~L~~H~r~H   48 (153)
                      |.|.+|++.|.....|..|++.+
T Consensus         1 ~~C~~C~~~f~s~~~~~~H~~s~   23 (25)
T PF12874_consen    1 FYCDICNKSFSSENSLRQHLRSK   23 (25)
T ss_dssp             EEETTTTEEESSHHHHHHHHTTH
T ss_pred             CCCCCCCCCcCCHHHHHHHHCcC
Confidence            68999999999999999999865


No 33 
>PF00096 zf-C2H2:  Zinc finger, C2H2 type;  InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain.  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=95.87  E-value=0.00043  Score=33.96  Aligned_cols=23  Identities=35%  Similarity=0.424  Sum_probs=17.8

Q ss_pred             CCCCCCCCCCCCCCccCCCCChhhhcccccc
Q 044013           54 KNRPPSSSSISDKLTFFPSPSTWDDHQRSVH   84 (153)
Q Consensus        54 ~~C~~c~~~~Cgk~~~F~~~~~L~~H~~~~H   84 (153)
                      |.|+.|++.       |.....|..|++ .|
T Consensus         1 y~C~~C~~~-------f~~~~~l~~H~~-~H   23 (23)
T PF00096_consen    1 YKCPICGKS-------FSSKSNLKRHMR-RH   23 (23)
T ss_dssp             EEETTTTEE-------ESSHHHHHHHHH-HH
T ss_pred             CCCCCCCCc-------cCCHHHHHHHHh-HC
Confidence            468855555       999999999997 44


No 34 
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=95.59  E-value=0.0054  Score=49.72  Aligned_cols=60  Identities=17%  Similarity=0.274  Sum_probs=54.3

Q ss_pred             cceeccccccccCChhHHHHHHH--hhcCC--CCCCCC--CCCCCCCCCCccCCCCChhhhccccccCCCCCCC
Q 044013           24 RSYECVFCKRGFTTAQALGGHMN--IHRKD--RAKNRP--PSSSSISDKLTFFPSPSTWDDHQRSVHTNNNNHS   91 (153)
Q Consensus        24 kp~~C~~Cgk~F~~~~~L~~H~r--~H~ge--kp~~C~--~c~~~~Cgk~~~F~~~~~L~~H~~~~Ht~~kp~~   91 (153)
                      .++.|..|...|.....|..|.+  .|+++  +|+.|+  .|++.       |.+...+..|.. +|++.+++.
T Consensus       288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~-------~~~~~~~~~~~~-~~~~~~~~~  353 (467)
T COG5048         288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKL-------FSRNDALKRHIL-LHTSISPAK  353 (467)
T ss_pred             cCCCCccccCCccccccccccccccccccccCCceeeeccCCCcc-------ccccccccCCcc-cccCCCccc
Confidence            47899999999999999999999  89999  999999  56666       999999999999 999988763


No 35 
>PF12171 zf-C2H2_jaz:  Zinc-finger double-stranded RNA-binding;  InterPro: IPR022755  This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation.   This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=95.44  E-value=0.005  Score=31.47  Aligned_cols=22  Identities=27%  Similarity=0.623  Sum_probs=18.2

Q ss_pred             eeccccccccCChhHHHHHHHh
Q 044013           26 YECVFCKRGFTTAQALGGHMNI   47 (153)
Q Consensus        26 ~~C~~Cgk~F~~~~~L~~H~r~   47 (153)
                      |.|..|++.|.+...|..|+++
T Consensus         2 ~~C~~C~k~f~~~~~~~~H~~s   23 (27)
T PF12171_consen    2 FYCDACDKYFSSENQLKQHMKS   23 (27)
T ss_dssp             CBBTTTTBBBSSHHHHHCCTTS
T ss_pred             CCcccCCCCcCCHHHHHHHHcc
Confidence            6788888888888888888764


No 36 
>PF12756 zf-C2H2_2:  C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=95.33  E-value=0.0023  Score=42.14  Aligned_cols=21  Identities=33%  Similarity=0.842  Sum_probs=0.0

Q ss_pred             eccccccccCChhHHHHHHHh
Q 044013           27 ECVFCKRGFTTAQALGGHMNI   47 (153)
Q Consensus        27 ~C~~Cgk~F~~~~~L~~H~r~   47 (153)
                      +|.+|+..|.....|..|+..
T Consensus         1 ~C~~C~~~f~~~~~l~~H~~~   21 (100)
T PF12756_consen    1 QCLFCDESFSSVDDLLQHMKK   21 (100)
T ss_dssp             ---------------------
T ss_pred             Ccccccccccccccccccccc
Confidence            478888888888888888763


No 37 
>PF13909 zf-H2C2_5:  C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=95.13  E-value=0.02  Score=28.22  Aligned_cols=23  Identities=26%  Similarity=0.628  Sum_probs=17.6

Q ss_pred             eeccccccccCChhHHHHHHHhhc
Q 044013           26 YECVFCKRGFTTAQALGGHMNIHR   49 (153)
Q Consensus        26 ~~C~~Cgk~F~~~~~L~~H~r~H~   49 (153)
                      |+|+.|..... ...|..|++.|.
T Consensus         1 y~C~~C~y~t~-~~~l~~H~~~~H   23 (24)
T PF13909_consen    1 YKCPHCSYSTS-KSNLKRHLKRHH   23 (24)
T ss_dssp             EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred             CCCCCCCCcCC-HHHHHHHHHhhC
Confidence            78999998887 889999998753


No 38 
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=95.00  E-value=0.014  Score=47.18  Aligned_cols=45  Identities=11%  Similarity=0.178  Sum_probs=34.8

Q ss_pred             CCCCCCCCCCCCCCCCCCccCCCCChhhhccccccC------------------CCCCCCCCCCCcccc
Q 044013           50 KDRAKNRPPSSSSISDKLTFFPSPSTWDDHQRSVHT------------------NNNNHSNYDSYGQNY  100 (153)
Q Consensus        50 gekp~~C~~c~~~~Cgk~~~F~~~~~L~~H~~~~Ht------------------~~kp~~~~~~~~~~f  100 (153)
                      ++|||+|++-   .|.|+  |....-|+.|+...|.                  ..||| .|.+|++.|
T Consensus       346 d~KpykCpV~---gC~K~--YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPY-rCevC~KRY  408 (423)
T COG5189         346 DGKPYKCPVE---GCNKK--YKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPY-RCEVCDKRY  408 (423)
T ss_pred             cCceecCCCC---Cchhh--hccccchhhhhhccccCcccCCCCCccccccccccCCce-eccccchhh
Confidence            4699999875   45577  9999999999863451                  12688 799999998


No 39 
>PF13913 zf-C2HC_2:  zinc-finger of a C2HC-type
Probab=94.90  E-value=0.021  Score=28.85  Aligned_cols=21  Identities=33%  Similarity=0.669  Sum_probs=17.3

Q ss_pred             eeccccccccCChhHHHHHHHh
Q 044013           26 YECVFCKRGFTTAQALGGHMNI   47 (153)
Q Consensus        26 ~~C~~Cgk~F~~~~~L~~H~r~   47 (153)
                      ..|++||+.| ....|..|+.+
T Consensus         3 ~~C~~CgR~F-~~~~l~~H~~~   23 (25)
T PF13913_consen    3 VPCPICGRKF-NPDRLEKHEKI   23 (25)
T ss_pred             CcCCCCCCEE-CHHHHHHHHHh
Confidence            4799999999 67889989764


No 40 
>KOG3993 consensus Transcription factor (contains Zn finger) [Transcription]
Probab=94.76  E-value=0.0065  Score=50.73  Aligned_cols=53  Identities=30%  Similarity=0.502  Sum_probs=44.1

Q ss_pred             ceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCCCCCCCccCCCCChhhhccccccC
Q 044013           25 SYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSSISDKLTFFPSPSTWDDHQRSVHT   85 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~~Cgk~~~F~~~~~L~~H~~~~Ht   85 (153)
                      -|.|..|.-.|-..-.|.+|.-.-.-..-|+|+     .|+|+  |+...+|..|.| -|.
T Consensus       267 dyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCP-----EC~KV--FsCPANLASHRR-WHK  319 (500)
T KOG3993|consen  267 DYICQLCKEKYEDAFALAQHRCPRIVHVEYRCP-----ECDKV--FSCPANLASHRR-WHK  319 (500)
T ss_pred             HHHHHHHHHhhhhHHHHhhccCCeeEEeeecCC-----ccccc--ccCchhhhhhhc-ccC
Confidence            399999999999999999997654545579999     66666  999999999988 664


No 41 
>PF13912 zf-C2H2_6:  C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=94.62  E-value=0.0036  Score=31.80  Aligned_cols=25  Identities=24%  Similarity=0.336  Sum_probs=20.3

Q ss_pred             CCCCCCCCCCCCCCCccCCCCChhhhccccccC
Q 044013           53 AKNRPPSSSSISDKLTFFPSPSTWDDHQRSVHT   85 (153)
Q Consensus        53 p~~C~~c~~~~Cgk~~~F~~~~~L~~H~~~~Ht   85 (153)
                      ||.|..|+..       |.....|..|++ .|.
T Consensus         1 ~~~C~~C~~~-------F~~~~~l~~H~~-~h~   25 (27)
T PF13912_consen    1 PFECDECGKT-------FSSLSALREHKR-SHC   25 (27)
T ss_dssp             SEEETTTTEE-------ESSHHHHHHHHC-TTT
T ss_pred             CCCCCccCCc-------cCChhHHHHHhH-Hhc
Confidence            6788866666       999999999998 654


No 42 
>smart00355 ZnF_C2H2 zinc finger.
Probab=94.33  E-value=0.016  Score=28.24  Aligned_cols=24  Identities=29%  Similarity=0.391  Sum_probs=19.3

Q ss_pred             CCCCCCCCCCCCCCccCCCCChhhhccccccC
Q 044013           54 KNRPPSSSSISDKLTFFPSPSTWDDHQRSVHT   85 (153)
Q Consensus        54 ~~C~~c~~~~Cgk~~~F~~~~~L~~H~~~~Ht   85 (153)
                      |.|..|++.       |.....|..|++ .|.
T Consensus         1 ~~C~~C~~~-------f~~~~~l~~H~~-~H~   24 (26)
T smart00355        1 YRCPECGKV-------FKSKSALKEHMR-THX   24 (26)
T ss_pred             CCCCCCcch-------hCCHHHHHHHHH-Hhc
Confidence            568866655       999999999998 765


No 43 
>PF13894 zf-C2H2_4:  C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=94.13  E-value=0.0042  Score=30.13  Aligned_cols=21  Identities=33%  Similarity=0.357  Sum_probs=15.6

Q ss_pred             CCCCCCCCCCCCCCccCCCCChhhhccc
Q 044013           54 KNRPPSSSSISDKLTFFPSPSTWDDHQR   81 (153)
Q Consensus        54 ~~C~~c~~~~Cgk~~~F~~~~~L~~H~~   81 (153)
                      |.|+.|+..       |.....|..|++
T Consensus         1 ~~C~~C~~~-------~~~~~~l~~H~~   21 (24)
T PF13894_consen    1 FQCPICGKS-------FRSKSELRQHMR   21 (24)
T ss_dssp             EE-SSTS-E-------ESSHHHHHHHHH
T ss_pred             CCCcCCCCc-------CCcHHHHHHHHH
Confidence            468855555       999999999997


No 44 
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=93.50  E-value=0.059  Score=28.78  Aligned_cols=23  Identities=22%  Similarity=0.567  Sum_probs=19.9

Q ss_pred             ceeccccccccCChhHHHHHHHh
Q 044013           25 SYECVFCKRGFTTAQALGGHMNI   47 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~~L~~H~r~   47 (153)
                      +|.|.+|++.|.....+..|+..
T Consensus         3 ~~~C~~C~~~~~~~~~~~~H~~g   25 (35)
T smart00451        3 GFYCKLCNVTFTDEISVEAHLKG   25 (35)
T ss_pred             CeEccccCCccCCHHHHHHHHCh
Confidence            58899999999988888888764


No 45 
>PF09237 GAGA:  GAGA factor;  InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=93.32  E-value=0.011  Score=35.02  Aligned_cols=33  Identities=6%  Similarity=0.168  Sum_probs=21.2

Q ss_pred             cCCCCCCCCCCCCCCCCCCccCCCCChhhhccccccCCCCC
Q 044013           49 RKDRAKNRPPSSSSISDKLTFFPSPSTWDDHQRSVHTNNNN   89 (153)
Q Consensus        49 ~gekp~~C~~c~~~~Cgk~~~F~~~~~L~~H~~~~Ht~~kp   89 (153)
                      ..+.|-.|+.|+     ..  +++..+|.+|+. ++.+.+|
T Consensus        20 ~S~~PatCP~C~-----a~--~~~srnLrRHle-~~H~~k~   52 (54)
T PF09237_consen   20 QSEQPATCPICG-----AV--IRQSRNLRRHLE-IRHFKKP   52 (54)
T ss_dssp             TTS--EE-TTT-------E--ESSHHHHHHHHH-HHTTTS-
T ss_pred             ccCCCCCCCcch-----hh--ccchhhHHHHHH-HHhcccC
Confidence            356788999554     44  999999999998 6666655


No 46 
>PF05605 zf-Di19:  Drought induced 19 protein (Di19), zinc-binding;  InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=93.30  E-value=0.048  Score=32.52  Aligned_cols=40  Identities=13%  Similarity=0.154  Sum_probs=30.1

Q ss_pred             CCCCCCCCCCCCCCCccCCCCChhhhccccccCCCCCCCCCCCCcccc
Q 044013           53 AKNRPPSSSSISDKLTFFPSPSTWDDHQRSVHTNNNNHSNYDSYGQNY  100 (153)
Q Consensus        53 p~~C~~c~~~~Cgk~~~F~~~~~L~~H~~~~Ht~~kp~~~~~~~~~~f  100 (153)
                      .|.||.|++.       |.. ..|..|....|..+.....|++|...+
T Consensus         2 ~f~CP~C~~~-------~~~-~~L~~H~~~~H~~~~~~v~CPiC~~~~   41 (54)
T PF05605_consen    2 SFTCPYCGKG-------FSE-SSLVEHCEDEHRSESKNVVCPICSSRV   41 (54)
T ss_pred             CcCCCCCCCc-------cCH-HHHHHHHHhHCcCCCCCccCCCchhhh
Confidence            4789977665       876 469999875788765445899999864


No 47 
>PRK04860 hypothetical protein; Provisional
Probab=92.78  E-value=0.038  Score=40.66  Aligned_cols=36  Identities=14%  Similarity=0.075  Sum_probs=29.5

Q ss_pred             CCCCCCCCCCCCCCCccCCCCChhhhccccccCCCCCCCCCCCCccccc
Q 044013           53 AKNRPPSSSSISDKLTFFPSPSTWDDHQRSVHTNNNNHSNYDSYGQNYH  101 (153)
Q Consensus        53 p~~C~~c~~~~Cgk~~~F~~~~~L~~H~~~~Ht~~kp~~~~~~~~~~f~  101 (153)
                      +|.|. |+.          ....+..|.+ +|++++|| .|..|++.|.
T Consensus       119 ~Y~C~-C~~----------~~~~~rrH~r-i~~g~~~Y-rC~~C~~~l~  154 (160)
T PRK04860        119 PYRCK-CQE----------HQLTVRRHNR-VVRGEAVY-RCRRCGETLV  154 (160)
T ss_pred             EEEcC-CCC----------eeCHHHHHHH-HhcCCccE-ECCCCCceeE
Confidence            67885 543          3667899999 99999999 7889999873


No 48 
>PF12756 zf-C2H2_2:  C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=92.05  E-value=0.14  Score=33.48  Aligned_cols=25  Identities=32%  Similarity=0.805  Sum_probs=21.5

Q ss_pred             ceeccccccccCChhHHHHHHHhhc
Q 044013           25 SYECVFCKRGFTTAQALGGHMNIHR   49 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~~L~~H~r~H~   49 (153)
                      ++.|.+|++.|.....|..|++.+.
T Consensus        50 ~~~C~~C~~~f~s~~~l~~Hm~~~~   74 (100)
T PF12756_consen   50 SFRCPYCNKTFRSREALQEHMRSKH   74 (100)
T ss_dssp             SEEBSSSS-EESSHHHHHHHHHHTT
T ss_pred             CCCCCccCCCCcCHHHHHHHHcCcc
Confidence            6899999999999999999999653


No 49 
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer.  Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain.  Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=91.10  E-value=0.19  Score=26.90  Aligned_cols=25  Identities=16%  Similarity=0.336  Sum_probs=18.1

Q ss_pred             eeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCC
Q 044013           26 YECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSS   62 (153)
Q Consensus        26 ~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~   62 (153)
                      |.|.+||..+....            .|+.||.|+..
T Consensus         2 ~~C~~CGy~y~~~~------------~~~~CP~Cg~~   26 (33)
T cd00350           2 YVCPVCGYIYDGEE------------APWVCPVCGAP   26 (33)
T ss_pred             EECCCCCCEECCCc------------CCCcCcCCCCc
Confidence            78999997765432            68899977644


No 50 
>KOG2893 consensus Zn finger protein [General function prediction only]
Probab=87.52  E-value=0.35  Score=37.90  Aligned_cols=50  Identities=24%  Similarity=0.352  Sum_probs=37.2

Q ss_pred             CcceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCCCCCCCccCCCCChhhhc-cccccC
Q 044013           23 GRSYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSSISDKLTFFPSPSTWDDH-QRSVHT   85 (153)
Q Consensus        23 ekp~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~~Cgk~~~F~~~~~L~~H-~~~~Ht   85 (153)
                      .|| .|=+|.+.|-...-|.+|+.    .|-|+|-     ||.|.  .-+..-|..| |+ +|.
T Consensus         9 ~kp-wcwycnrefddekiliqhqk----akhfkch-----ichkk--l~sgpglsihcmq-vhk   59 (341)
T KOG2893|consen    9 DKP-WCWYCNREFDDEKILIQHQK----AKHFKCH-----ICHKK--LFSGPGLSIHCMQ-VHK   59 (341)
T ss_pred             CCc-eeeecccccchhhhhhhhhh----hccceee-----eehhh--hccCCCceeehhh-hhh
Confidence            344 58899999999999998875    6789999     55555  6666667777 45 553


No 51 
>KOG1146 consensus Homeobox protein [General function prediction only]
Probab=86.02  E-value=0.33  Score=45.95  Aligned_cols=52  Identities=21%  Similarity=0.288  Sum_probs=42.7

Q ss_pred             CcceeccccccccCChhHHHHHHHh-h------------------------cCCCCCCCCCCCCCCCCCCccCCCCChhh
Q 044013           23 GRSYECVFCKRGFTTAQALGGHMNI-H------------------------RKDRAKNRPPSSSSISDKLTFFPSPSTWD   77 (153)
Q Consensus        23 ekp~~C~~Cgk~F~~~~~L~~H~r~-H------------------------~gekp~~C~~c~~~~Cgk~~~F~~~~~L~   77 (153)
                      .|.|.|+.|+-.|+....|..|+|. |                        -+.+||.|..|..+       ++...+|.
T Consensus       463 ~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~p~~C~~C~~s-------tttng~Ls  535 (1406)
T KOG1146|consen  463 FKTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGKPYPCRACNYS-------TTTNGNLS  535 (1406)
T ss_pred             cccccCCccchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCCcccceeeeee-------eecchHHH
Confidence            3789999999999999999999998 1                        11357788877777       88888998


Q ss_pred             hccc
Q 044013           78 DHQR   81 (153)
Q Consensus        78 ~H~~   81 (153)
                      .|+.
T Consensus       536 ihlq  539 (1406)
T KOG1146|consen  536 IHLQ  539 (1406)
T ss_pred             HHHH
Confidence            8875


No 52 
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=84.65  E-value=0.69  Score=34.22  Aligned_cols=23  Identities=17%  Similarity=0.421  Sum_probs=18.3

Q ss_pred             ceeccccccccCChhHHHHHHHhhcCCCCCCCCCCC
Q 044013           25 SYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSS   60 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~   60 (153)
                      -|.|++||.             ++.|+-|-.||.|+
T Consensus       134 ~~vC~vCGy-------------~~~ge~P~~CPiCg  156 (166)
T COG1592         134 VWVCPVCGY-------------THEGEAPEVCPICG  156 (166)
T ss_pred             EEEcCCCCC-------------cccCCCCCcCCCCC
Confidence            689999985             45668899999666


No 53 
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=84.64  E-value=1.2  Score=35.87  Aligned_cols=61  Identities=26%  Similarity=0.443  Sum_probs=49.7

Q ss_pred             CCcceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCCCCCCCccCCCCChhhhccccccCCCC
Q 044013           22 TGRSYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSSISDKLTFFPSPSTWDDHQRSVHTNNN   88 (153)
Q Consensus        22 gekp~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~~Cgk~~~F~~~~~L~~H~~~~Ht~~k   88 (153)
                      ..+++.|+.|...|........|.++|++++++.|..+++.   +.  |.....+..|.+ .+....
T Consensus        30 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~--~~~~~~~~~~~~-~~~~~~   90 (467)
T COG5048          30 APRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCD---KS--FSRPLELSRHLR-THHNNP   90 (467)
T ss_pred             CCchhhcccccccccccchhhhhcccccccCCccccccccc---cc--cCCcchhhhhcc-cccccc
Confidence            35678999999999999999999999999999999866433   55  788888888887 555443


No 54 
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and  believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=82.82  E-value=0.95  Score=24.41  Aligned_cols=26  Identities=12%  Similarity=0.296  Sum_probs=17.6

Q ss_pred             ceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCC
Q 044013           25 SYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSS   62 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~   62 (153)
                      .|.|.+||..+...            +.|..|+.|+..
T Consensus         2 ~~~C~~CG~i~~g~------------~~p~~CP~Cg~~   27 (34)
T cd00729           2 VWVCPVCGYIHEGE------------EAPEKCPICGAP   27 (34)
T ss_pred             eEECCCCCCEeECC------------cCCCcCcCCCCc
Confidence            48899999764332            257789977643


No 55 
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=80.89  E-value=1  Score=32.94  Aligned_cols=31  Identities=16%  Similarity=0.117  Sum_probs=22.7

Q ss_pred             CcceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCC
Q 044013           23 GRSYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSS   62 (153)
Q Consensus        23 ekp~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~   62 (153)
                      ..-|.|+.|+..|+...++.         --|.|+.||..
T Consensus       107 ~~~Y~Cp~c~~r~tf~eA~~---------~~F~Cp~Cg~~  137 (158)
T TIGR00373       107 NMFFICPNMCVRFTFNEAME---------LNFTCPRCGAM  137 (158)
T ss_pred             CCeEECCCCCcEeeHHHHHH---------cCCcCCCCCCE
Confidence            34588999998888887774         26889966544


No 56 
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=80.71  E-value=1  Score=33.68  Aligned_cols=30  Identities=20%  Similarity=0.283  Sum_probs=21.0

Q ss_pred             cceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCC
Q 044013           24 RSYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSS   62 (153)
Q Consensus        24 kp~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~   62 (153)
                      .-|.|+.|++.|+...++.         --|.|+.|+..
T Consensus       116 ~~Y~Cp~C~~rytf~eA~~---------~~F~Cp~Cg~~  145 (178)
T PRK06266        116 MFFFCPNCHIRFTFDEAME---------YGFRCPQCGEM  145 (178)
T ss_pred             CEEECCCCCcEEeHHHHhh---------cCCcCCCCCCC
Confidence            4588888888888777653         35888855544


No 57 
>PF09986 DUF2225:  Uncharacterized protein conserved in bacteria (DUF2225);  InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=79.48  E-value=0.53  Score=36.13  Aligned_cols=24  Identities=21%  Similarity=0.600  Sum_probs=18.5

Q ss_pred             CcceeccccccccCChhHHHHHHH
Q 044013           23 GRSYECVFCKRGFTTAQALGGHMN   46 (153)
Q Consensus        23 ekp~~C~~Cgk~F~~~~~L~~H~r   46 (153)
                      .+.+.|++|++.|.+........+
T Consensus         3 ~k~~~CPvC~~~F~~~~vrs~~~r   26 (214)
T PF09986_consen    3 DKKITCPVCGKEFKTKKVRSGKIR   26 (214)
T ss_pred             CCceECCCCCCeeeeeEEEcCCce
Confidence            356899999999998866655554


No 58 
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=79.34  E-value=0.96  Score=32.54  Aligned_cols=35  Identities=20%  Similarity=0.272  Sum_probs=20.7

Q ss_pred             cceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCC
Q 044013           24 RSYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSS   62 (153)
Q Consensus        24 kp~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~   62 (153)
                      .-|.|+.|+..|....++..-   .. ..-|.|+.|+..
T Consensus        98 ~~Y~Cp~C~~~y~~~ea~~~~---d~-~~~f~Cp~Cg~~  132 (147)
T smart00531       98 AYYKCPNCQSKYTFLEANQLL---DM-DGTFTCPRCGEE  132 (147)
T ss_pred             cEEECcCCCCEeeHHHHHHhc---CC-CCcEECCCCCCE
Confidence            458888888888765543320   11 233888855544


No 59 
>PF09723 Zn-ribbon_8:  Zinc ribbon domain;  InterPro: IPR013429  This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=78.95  E-value=1.1  Score=25.31  Aligned_cols=31  Identities=29%  Similarity=0.530  Sum_probs=21.3

Q ss_pred             ceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCC
Q 044013           25 SYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSS   62 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~   62 (153)
                      -|.|..||..|.....      +.. ..+-.|+.|+..
T Consensus         5 ey~C~~Cg~~fe~~~~------~~~-~~~~~CP~Cg~~   35 (42)
T PF09723_consen    5 EYRCEECGHEFEVLQS------ISE-DDPVPCPECGST   35 (42)
T ss_pred             EEEeCCCCCEEEEEEE------cCC-CCCCcCCCCCCC
Confidence            4899999998876543      222 456679977763


No 60 
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=78.24  E-value=0.82  Score=26.70  Aligned_cols=31  Identities=23%  Similarity=0.502  Sum_probs=20.4

Q ss_pred             ceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCC
Q 044013           25 SYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSS   62 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~   62 (153)
                      -|.|..||..|.....+      .. ..+-.|+.|+..
T Consensus         5 ey~C~~Cg~~fe~~~~~------~~-~~~~~CP~Cg~~   35 (52)
T TIGR02605         5 EYRCTACGHRFEVLQKM------SD-DPLATCPECGGE   35 (52)
T ss_pred             EEEeCCCCCEeEEEEec------CC-CCCCCCCCCCCC
Confidence            38899999988764322      11 345569977753


No 61 
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=77.43  E-value=0.88  Score=25.00  Aligned_cols=30  Identities=23%  Similarity=0.470  Sum_probs=19.7

Q ss_pred             ceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCC
Q 044013           25 SYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSS   61 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~   61 (153)
                      -|+|..||..|......      .. ..+-.|+.|+.
T Consensus         5 ~y~C~~Cg~~fe~~~~~------~~-~~~~~CP~Cg~   34 (41)
T smart00834        5 EYRCEDCGHTFEVLQKI------SD-DPLATCPECGG   34 (41)
T ss_pred             EEEcCCCCCEEEEEEec------CC-CCCCCCCCCCC
Confidence            48899999988754432      11 34556887765


No 62 
>PF02892 zf-BED:  BED zinc finger;  InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=75.34  E-value=3  Score=23.36  Aligned_cols=25  Identities=16%  Similarity=0.496  Sum_probs=16.0

Q ss_pred             CCcceeccccccccCCh----hHHHHHHH
Q 044013           22 TGRSYECVFCKRGFTTA----QALGGHMN   46 (153)
Q Consensus        22 gekp~~C~~Cgk~F~~~----~~L~~H~r   46 (153)
                      +..-..|.+|++.+...    ++|..|++
T Consensus        13 ~~~~a~C~~C~~~~~~~~~~ts~l~~HL~   41 (45)
T PF02892_consen   13 DKKKAKCKYCGKVIKYSSGGTSNLKRHLK   41 (45)
T ss_dssp             CSS-EEETTTTEE-----SSTHHHHHHHH
T ss_pred             CcCeEEeCCCCeEEeeCCCcHHHHHHhhh
Confidence            35567899999988764    68888883


No 63 
>PF05443 ROS_MUCR:  ROS/MUCR transcriptional regulator protein;  InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=73.51  E-value=1.9  Score=30.68  Aligned_cols=27  Identities=26%  Similarity=0.463  Sum_probs=17.0

Q ss_pred             cceeccccccccCChhHHHHHHHhhcCCCC
Q 044013           24 RSYECVFCKRGFTTAQALGGHMNIHRKDRA   53 (153)
Q Consensus        24 kp~~C~~Cgk~F~~~~~L~~H~r~H~gekp   53 (153)
                      ..-.|-+|||.|..   |++|++.|.|-.|
T Consensus        71 d~i~clecGk~~k~---LkrHL~~~~gltp   97 (132)
T PF05443_consen   71 DYIICLECGKKFKT---LKRHLRTHHGLTP   97 (132)
T ss_dssp             S-EE-TBT--EESB---HHHHHHHTT-S-H
T ss_pred             CeeEEccCCcccch---HHHHHHHccCCCH
Confidence            34679999999987   6999999976544


No 64 
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=73.25  E-value=1.9  Score=26.14  Aligned_cols=26  Identities=19%  Similarity=0.245  Sum_probs=21.4

Q ss_pred             CCcceeccccccccCChhHHHHHHHh
Q 044013           22 TGRSYECVFCKRGFTTAQALGGHMNI   47 (153)
Q Consensus        22 gekp~~C~~Cgk~F~~~~~L~~H~r~   47 (153)
                      ||.-+.|+.|+..|.......+|.+.
T Consensus        14 GE~~lrCPRC~~~FR~~K~Y~RHVNK   39 (65)
T COG4049          14 GEEFLRCPRCGMVFRRRKDYIRHVNK   39 (65)
T ss_pred             CceeeeCCchhHHHHHhHHHHHHhhH
Confidence            47778899999999888888888764


No 65 
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=73.12  E-value=3  Score=24.18  Aligned_cols=25  Identities=20%  Similarity=0.588  Sum_probs=19.0

Q ss_pred             ceeccccccccCCh-----hHHHHHHH-hhc
Q 044013           25 SYECVFCKRGFTTA-----QALGGHMN-IHR   49 (153)
Q Consensus        25 p~~C~~Cgk~F~~~-----~~L~~H~r-~H~   49 (153)
                      --.|..|++.++..     ++|.+|++ .|.
T Consensus        18 ~a~C~~C~~~l~~~~~~gTs~L~rHl~~~h~   48 (50)
T smart00614       18 RAKCKYCGKKLSRSSKGGTSNLRRHLRRKHP   48 (50)
T ss_pred             EEEecCCCCEeeeCCCCCcHHHHHHHHhHCc
Confidence            46799999988765     57888887 443


No 66 
>PF09538 FYDLN_acid:  Protein of unknown function (FYDLN_acid);  InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=72.28  E-value=2.6  Score=28.94  Aligned_cols=30  Identities=20%  Similarity=0.401  Sum_probs=21.4

Q ss_pred             eeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCCCCCCCccCCCC
Q 044013           26 YECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSSISDKLTFFPSP   73 (153)
Q Consensus        26 ~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~~Cgk~~~F~~~   73 (153)
                      ..|+.||++|.-..           ..|-.||.||..       |...
T Consensus        10 R~Cp~CG~kFYDLn-----------k~PivCP~CG~~-------~~~~   39 (108)
T PF09538_consen   10 RTCPSCGAKFYDLN-----------KDPIVCPKCGTE-------FPPE   39 (108)
T ss_pred             ccCCCCcchhccCC-----------CCCccCCCCCCc-------cCcc
Confidence            47999999886531           357789966666       7666


No 67 
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=67.55  E-value=3.2  Score=24.44  Aligned_cols=30  Identities=23%  Similarity=0.401  Sum_probs=20.5

Q ss_pred             CcceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCC
Q 044013           23 GRSYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSS   62 (153)
Q Consensus        23 ekp~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~   62 (153)
                      ..-|.|..||+.|...          .......|+.|+..
T Consensus         4 ~~~Y~C~~Cg~~~~~~----------~~~~~irCp~Cg~r   33 (49)
T COG1996           4 MMEYKCARCGREVELD----------QETRGIRCPYCGSR   33 (49)
T ss_pred             eEEEEhhhcCCeeehh----------hccCceeCCCCCcE
Confidence            3569999999998321          12445689988765


No 68 
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=67.16  E-value=3.2  Score=25.43  Aligned_cols=31  Identities=10%  Similarity=0.162  Sum_probs=16.9

Q ss_pred             ceeccccccccCChhHHHHHHHhhcCCCCCCCCCCC
Q 044013           25 SYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSS   60 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~   60 (153)
                      .|.|+.||..--..     -.+.-.-..||.|+.||
T Consensus        27 ~F~CPnCGe~~I~R-----c~~CRk~g~~Y~Cp~CG   57 (61)
T COG2888          27 KFPCPNCGEVEIYR-----CAKCRKLGNPYRCPKCG   57 (61)
T ss_pred             EeeCCCCCceeeeh-----hhhHHHcCCceECCCcC
Confidence            47888888543222     12222235688888554


No 69 
>PF13719 zinc_ribbon_5:  zinc-ribbon domain
Probab=66.82  E-value=3.2  Score=22.67  Aligned_cols=32  Identities=16%  Similarity=0.347  Sum_probs=19.9

Q ss_pred             eeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCC
Q 044013           26 YECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSS   62 (153)
Q Consensus        26 ~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~   62 (153)
                      ..|+.|+..|.-...     ++..+.+..+|+.|+..
T Consensus         3 i~CP~C~~~f~v~~~-----~l~~~~~~vrC~~C~~~   34 (37)
T PF13719_consen    3 ITCPNCQTRFRVPDD-----KLPAGGRKVRCPKCGHV   34 (37)
T ss_pred             EECCCCCceEEcCHH-----HcccCCcEEECCCCCcE
Confidence            368888888876554     23334456678866554


No 70 
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=66.71  E-value=2.6  Score=24.05  Aligned_cols=29  Identities=17%  Similarity=0.334  Sum_probs=19.2

Q ss_pred             cceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCC
Q 044013           24 RSYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSS   62 (153)
Q Consensus        24 kp~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~   62 (153)
                      ..|.|..||..|.....          .....|+.|+..
T Consensus         2 ~~y~C~~CG~~~~~~~~----------~~~~~Cp~CG~~   30 (46)
T PRK00398          2 AEYKCARCGREVELDEY----------GTGVRCPYCGYR   30 (46)
T ss_pred             CEEECCCCCCEEEECCC----------CCceECCCCCCe
Confidence            35889999987765322          115689977755


No 71 
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=66.63  E-value=4  Score=23.34  Aligned_cols=27  Identities=15%  Similarity=0.144  Sum_probs=19.3

Q ss_pred             ceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCC
Q 044013           25 SYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSS   62 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~   62 (153)
                      .|.|..||..|...           ..-+..|+.|+..
T Consensus         2 ~Y~C~~Cg~~~~~~-----------~~~~irC~~CG~r   28 (44)
T smart00659        2 IYICGECGRENEIK-----------SKDVVRCRECGYR   28 (44)
T ss_pred             EEECCCCCCEeecC-----------CCCceECCCCCce
Confidence            38899999887643           2346789977765


No 72 
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=66.40  E-value=3  Score=22.60  Aligned_cols=32  Identities=13%  Similarity=0.173  Sum_probs=18.9

Q ss_pred             eeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCC
Q 044013           26 YECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSS   62 (153)
Q Consensus        26 ~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~   62 (153)
                      +.|+.|+..|.-.....     .....-..|+.|+..
T Consensus         3 ~~CP~C~~~~~v~~~~~-----~~~~~~v~C~~C~~~   34 (38)
T TIGR02098         3 IQCPNCKTSFRVVDSQL-----GANGGKVRCGKCGHV   34 (38)
T ss_pred             EECCCCCCEEEeCHHHc-----CCCCCEEECCCCCCE
Confidence            57888988887665422     111224568866544


No 73 
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=63.90  E-value=5.4  Score=28.23  Aligned_cols=34  Identities=18%  Similarity=0.273  Sum_probs=24.2

Q ss_pred             eeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCCCCCCCccCCCCChhh
Q 044013           26 YECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSSISDKLTFFPSPSTWD   77 (153)
Q Consensus        26 ~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~~Cgk~~~F~~~~~L~   77 (153)
                      ..|+.||+.|.-.           +..|-.|+.|+..       |.....++
T Consensus        10 r~Cp~cg~kFYDL-----------nk~p~vcP~cg~~-------~~~~~~~~   43 (129)
T TIGR02300        10 RICPNTGSKFYDL-----------NRRPAVSPYTGEQ-------FPPEEALK   43 (129)
T ss_pred             ccCCCcCcccccc-----------CCCCccCCCcCCc-------cCcchhhc
Confidence            5899999998652           2368889977766       76664444


No 74 
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=63.88  E-value=5.3  Score=24.39  Aligned_cols=30  Identities=10%  Similarity=0.300  Sum_probs=16.6

Q ss_pred             ceeccccccc-cCChhHHHHHHHhhcCCCCCCCCCCC
Q 044013           25 SYECVFCKRG-FTTAQALGGHMNIHRKDRAKNRPPSS   60 (153)
Q Consensus        25 p~~C~~Cgk~-F~~~~~L~~H~r~H~gekp~~C~~c~   60 (153)
                      .|.|+.||.. -..-.      +--.-..+|.|+.|+
T Consensus        25 ~F~CPnCG~~~I~RC~------~CRk~~~~Y~CP~CG   55 (59)
T PRK14890         25 KFLCPNCGEVIIYRCE------KCRKQSNPYTCPKCG   55 (59)
T ss_pred             EeeCCCCCCeeEeech------hHHhcCCceECCCCC
Confidence            4888888865 22211      112224678888554


No 75 
>PF13717 zinc_ribbon_4:  zinc-ribbon domain
Probab=59.43  E-value=6.6  Score=21.30  Aligned_cols=32  Identities=16%  Similarity=0.263  Sum_probs=19.3

Q ss_pred             eeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCC
Q 044013           26 YECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSS   62 (153)
Q Consensus        26 ~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~   62 (153)
                      ..|+.|+..|.-....     +....+-.+|+.|+..
T Consensus         3 i~Cp~C~~~y~i~d~~-----ip~~g~~v~C~~C~~~   34 (36)
T PF13717_consen    3 ITCPNCQAKYEIDDEK-----IPPKGRKVRCSKCGHV   34 (36)
T ss_pred             EECCCCCCEEeCCHHH-----CCCCCcEEECCCCCCE
Confidence            4688888888766542     2233445677755544


No 76 
>PF04959 ARS2:  Arsenite-resistance protein 2;  InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=58.86  E-value=8.5  Score=29.65  Aligned_cols=25  Identities=16%  Similarity=0.426  Sum_probs=20.5

Q ss_pred             CcceeccccccccCChhHHHHHHHh
Q 044013           23 GRSYECVFCKRGFTTAQALGGHMNI   47 (153)
Q Consensus        23 ekp~~C~~Cgk~F~~~~~L~~H~r~   47 (153)
                      +..|.|..|+|.|........|+..
T Consensus        75 ~~K~~C~lc~KlFkg~eFV~KHI~n   99 (214)
T PF04959_consen   75 EDKWRCPLCGKLFKGPEFVRKHIFN   99 (214)
T ss_dssp             SEEEEE-SSS-EESSHHHHHHHHHH
T ss_pred             CCEECCCCCCcccCChHHHHHHHhh
Confidence            5569999999999999999999884


No 77 
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=58.63  E-value=1.9  Score=31.51  Aligned_cols=10  Identities=30%  Similarity=0.726  Sum_probs=7.4

Q ss_pred             eecccccccc
Q 044013           26 YECVFCKRGF   35 (153)
Q Consensus        26 ~~C~~Cgk~F   35 (153)
                      .+|++||-.+
T Consensus         1 m~cp~c~~~~   10 (154)
T PRK00464          1 MRCPFCGHPD   10 (154)
T ss_pred             CcCCCCCCCC
Confidence            3699998655


No 78 
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=55.96  E-value=6.5  Score=26.19  Aligned_cols=14  Identities=29%  Similarity=0.548  Sum_probs=9.7

Q ss_pred             cceeccccccccCC
Q 044013           24 RSYECVFCKRGFTT   37 (153)
Q Consensus        24 kp~~C~~Cgk~F~~   37 (153)
                      +|-.|..||..|..
T Consensus        57 ~Pa~CkkCGfef~~   70 (97)
T COG3357          57 RPARCKKCGFEFRD   70 (97)
T ss_pred             cChhhcccCccccc
Confidence            46677777777754


No 79 
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=55.95  E-value=9.3  Score=19.17  Aligned_cols=19  Identities=21%  Similarity=0.665  Sum_probs=11.8

Q ss_pred             eccccccccCChhHHHHHHH
Q 044013           27 ECVFCKRGFTTAQALGGHMN   46 (153)
Q Consensus        27 ~C~~Cgk~F~~~~~L~~H~r   46 (153)
                      .|++|++.+ ....+..|..
T Consensus         3 ~CPiC~~~v-~~~~in~HLD   21 (26)
T smart00734        3 QCPVCFREV-PENLINSHLD   21 (26)
T ss_pred             cCCCCcCcc-cHHHHHHHHH
Confidence            577887776 4445555543


No 80 
>COG3364 Zn-ribbon containing protein [General function prediction only]
Probab=55.95  E-value=7.9  Score=26.37  Aligned_cols=28  Identities=18%  Similarity=0.349  Sum_probs=19.4

Q ss_pred             ceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCC
Q 044013           25 SYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSS   62 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~   62 (153)
                      |+.|..||..|...+..     +-.     -|+.||..
T Consensus         2 pH~CtrCG~vf~~g~~~-----il~-----GCp~CG~n   29 (112)
T COG3364           2 PHQCTRCGEVFDDGSEE-----ILS-----GCPKCGCN   29 (112)
T ss_pred             CceecccccccccccHH-----HHc-----cCccccch
Confidence            78999999999875432     222     38877655


No 81 
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=54.83  E-value=6.5  Score=28.15  Aligned_cols=25  Identities=20%  Similarity=0.291  Sum_probs=21.1

Q ss_pred             eeccccccccCChhHHHHHHHhhcCCCC
Q 044013           26 YECVFCKRGFTTAQALGGHMNIHRKDRA   53 (153)
Q Consensus        26 ~~C~~Cgk~F~~~~~L~~H~r~H~gekp   53 (153)
                      ..|-++||.|..   |++|+.+|.|-.|
T Consensus        77 IicLEDGkkfKS---LKRHL~t~~gmTP  101 (148)
T COG4957          77 IICLEDGKKFKS---LKRHLTTHYGLTP  101 (148)
T ss_pred             EEEeccCcchHH---HHHHHhcccCCCH
Confidence            569999999974   9999999988665


No 82 
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=51.99  E-value=7.5  Score=27.60  Aligned_cols=15  Identities=13%  Similarity=0.430  Sum_probs=11.6

Q ss_pred             cceeccccccccCCh
Q 044013           24 RSYECVFCKRGFTTA   38 (153)
Q Consensus        24 kp~~C~~Cgk~F~~~   38 (153)
                      ..+.|..||..|...
T Consensus        69 ~~~~C~~CG~~~~~~   83 (135)
T PRK03824         69 AVLKCRNCGNEWSLK   83 (135)
T ss_pred             eEEECCCCCCEEecc
Confidence            358899999888664


No 83 
>KOG2593 consensus Transcription initiation factor IIE, alpha subunit [Transcription]
Probab=48.41  E-value=5.7  Score=33.63  Aligned_cols=37  Identities=16%  Similarity=0.302  Sum_probs=26.7

Q ss_pred             CcceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCC
Q 044013           23 GRSYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSS   62 (153)
Q Consensus        23 ekp~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~   62 (153)
                      ..-|.|+.|.+.|+...++.-   +-...-.|.|..|+-.
T Consensus       126 ~~~Y~Cp~C~kkyt~Lea~~L---~~~~~~~F~C~~C~ge  162 (436)
T KOG2593|consen  126 VAGYVCPNCQKKYTSLEALQL---LDNETGEFHCENCGGE  162 (436)
T ss_pred             cccccCCccccchhhhHHHHh---hcccCceEEEecCCCc
Confidence            567999999999988766542   3333457889988743


No 84 
>TIGR00100 hypA hydrogenase nickel insertion protein HypA. In Hpylori, hypA mutant abolished hydrogenase activity and decrease in urease activity. Nickel supplementation in media restored urease activity and partial hydrogenase activity. HypA probably involved in inserting Ni in enzymes.
Probab=47.65  E-value=12  Score=25.84  Aligned_cols=26  Identities=15%  Similarity=0.250  Sum_probs=17.6

Q ss_pred             ceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCC
Q 044013           25 SYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSS   62 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~   62 (153)
                      ...|..|++.|.....            -+.||.|+..
T Consensus        70 ~~~C~~Cg~~~~~~~~------------~~~CP~Cgs~   95 (115)
T TIGR00100        70 ECECEDCSEEVSPEID------------LYRCPKCHGI   95 (115)
T ss_pred             EEEcccCCCEEecCCc------------CccCcCCcCC
Confidence            3789999987765422            3678877654


No 85 
>KOG4167 consensus Predicted DNA-binding protein, contains SANT and ELM2 domains [Transcription]
Probab=47.30  E-value=5.2  Score=36.21  Aligned_cols=26  Identities=27%  Similarity=0.594  Sum_probs=23.4

Q ss_pred             ceeccccccccCChhHHHHHHHhhcC
Q 044013           25 SYECVFCKRGFTTAQALGGHMNIHRK   50 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~~L~~H~r~H~g   50 (153)
                      -|.|..|+|.|-.-..+..||.+|.-
T Consensus       792 iFpCreC~kvF~KiKSrNAHMK~Hr~  817 (907)
T KOG4167|consen  792 IFPCRECGKVFFKIKSRNAHMKTHRQ  817 (907)
T ss_pred             eeehHHHHHHHHHHhhhhHHHHHHHH
Confidence            49999999999998999999999973


No 86 
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=47.12  E-value=8.4  Score=28.12  Aligned_cols=35  Identities=14%  Similarity=0.405  Sum_probs=24.0

Q ss_pred             CcceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCC
Q 044013           23 GRSYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSS   62 (153)
Q Consensus        23 ekp~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~   62 (153)
                      .-+|.|. |+..|.+   ..+|-.+-.|+ .|.|..|+-.
T Consensus       115 ~~~Y~C~-C~q~~l~---~RRhn~~~~g~-~YrC~~C~gk  149 (156)
T COG3091         115 TYPYRCQ-CQQHYLR---IRRHNTVRRGE-VYRCGKCGGK  149 (156)
T ss_pred             ceeEEee-cCCccch---hhhcccccccc-eEEeccCCce
Confidence            5689999 9987654   34455555666 8989877543


No 87 
>KOG4124 consensus Putative transcriptional repressor regulating G2/M transition [Transcription; Cell cycle control, cell division, chromosome partitioning]
Probab=46.86  E-value=5.9  Score=32.84  Aligned_cols=23  Identities=22%  Similarity=0.642  Sum_probs=18.2

Q ss_pred             cceeccc--cccccCChhHHHHHHH
Q 044013           24 RSYECVF--CKRGFTTAQALGGHMN   46 (153)
Q Consensus        24 kp~~C~~--Cgk~F~~~~~L~~H~r   46 (153)
                      +||+|.+  |.+.++....|+.|..
T Consensus       348 ~~~~~~vp~~~~~~~n~ng~~~~~~  372 (442)
T KOG4124|consen  348 KPYKCPVPNCDKAYKNQNGLKYHKL  372 (442)
T ss_pred             CCCCCCCCcchhhcccCcceeeccc
Confidence            7899975  9999888877777753


No 88 
>KOG3214 consensus Uncharacterized Zn ribbon-containing protein [Function unknown]
Probab=46.16  E-value=4.4  Score=27.45  Aligned_cols=20  Identities=0%  Similarity=0.265  Sum_probs=14.7

Q ss_pred             CCCCCCCccccccCCcccch
Q 044013           90 HSNYDSYGQNYHQHLNMIGD  109 (153)
Q Consensus        90 ~~~~~~~~~~f~~~~~~~~~  109 (153)
                      +.+|.+|+.+|+.....+..
T Consensus        47 ~~sC~iC~esFqt~it~Lse   66 (109)
T KOG3214|consen   47 KASCRICEESFQTTITALSE   66 (109)
T ss_pred             eeeeeehhhhhccchHhhcc
Confidence            45799999999766655544


No 89 
>PF12013 DUF3505:  Protein of unknown function (DUF3505);  InterPro: IPR022698  This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains. 
Probab=45.45  E-value=19  Score=24.27  Aligned_cols=24  Identities=17%  Similarity=0.384  Sum_probs=20.8

Q ss_pred             eec----cccccccCChhHHHHHHHhhc
Q 044013           26 YEC----VFCKRGFTTAQALGGHMNIHR   49 (153)
Q Consensus        26 ~~C----~~Cgk~F~~~~~L~~H~r~H~   49 (153)
                      |.|    ..|+..+.+...+..|.+.+.
T Consensus        81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~H  108 (109)
T PF12013_consen   81 YRCQCDPPHCGYITRSKKTMRKHWRKEH  108 (109)
T ss_pred             eeeecCCCCCCcEeccHHHHHHHHHHhc
Confidence            889    899999999999999988654


No 90 
>KOG3408 consensus U1-like Zn-finger-containing protein, probabl erole in RNA processing/splicing [RNA processing and modification]
Probab=45.07  E-value=14  Score=25.96  Aligned_cols=26  Identities=35%  Similarity=0.666  Sum_probs=21.9

Q ss_pred             CCcceeccccccccCChhHHHHHHHh
Q 044013           22 TGRSYECVFCKRGFTTAQALGGHMNI   47 (153)
Q Consensus        22 gekp~~C~~Cgk~F~~~~~L~~H~r~   47 (153)
                      |--.|-|-.|.+-|.+..+|..|.++
T Consensus        54 G~GqfyCi~CaRyFi~~~~l~~H~kt   79 (129)
T KOG3408|consen   54 GGGQFYCIECARYFIDAKALKTHFKT   79 (129)
T ss_pred             CCceeehhhhhhhhcchHHHHHHHhc
Confidence            44569999999999999999999764


No 91 
>smart00154 ZnF_AN1 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis.
Probab=44.90  E-value=11  Score=20.88  Aligned_cols=13  Identities=23%  Similarity=0.713  Sum_probs=11.1

Q ss_pred             ceeccccccccCC
Q 044013           25 SYECVFCKRGFTT   37 (153)
Q Consensus        25 p~~C~~Cgk~F~~   37 (153)
                      ||+|..|++.|-.
T Consensus        12 ~f~C~~C~~~FC~   24 (39)
T smart00154       12 GFKCRHCGNLFCG   24 (39)
T ss_pred             CeECCccCCcccc
Confidence            8899999988865


No 92 
>PRK12380 hydrogenase nickel incorporation protein HybF; Provisional
Probab=43.63  E-value=16  Score=25.06  Aligned_cols=26  Identities=15%  Similarity=0.189  Sum_probs=17.0

Q ss_pred             ceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCC
Q 044013           25 SYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSS   62 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~   62 (153)
                      .+.|..||..|....            ..+.||.|+..
T Consensus        70 ~~~C~~Cg~~~~~~~------------~~~~CP~Cgs~   95 (113)
T PRK12380         70 QAWCWDCSQVVEIHQ------------HDAQCPHCHGE   95 (113)
T ss_pred             EEEcccCCCEEecCC------------cCccCcCCCCC
Confidence            478999997775432            34568877643


No 93 
>PRK00564 hypA hydrogenase nickel incorporation protein; Provisional
Probab=43.06  E-value=15  Score=25.34  Aligned_cols=14  Identities=29%  Similarity=0.593  Sum_probs=10.2

Q ss_pred             ceeccccccccCCh
Q 044013           25 SYECVFCKRGFTTA   38 (153)
Q Consensus        25 p~~C~~Cgk~F~~~   38 (153)
                      .+.|..||..|...
T Consensus        71 ~~~C~~Cg~~~~~~   84 (117)
T PRK00564         71 ELECKDCSHVFKPN   84 (117)
T ss_pred             EEEhhhCCCccccC
Confidence            37888998777653


No 94 
>PF07754 DUF1610:  Domain of unknown function (DUF1610);  InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=41.95  E-value=12  Score=18.56  Aligned_cols=10  Identities=20%  Similarity=0.561  Sum_probs=7.9

Q ss_pred             cceecccccc
Q 044013           24 RSYECVFCKR   33 (153)
Q Consensus        24 kp~~C~~Cgk   33 (153)
                      -+|.|+.||+
T Consensus        15 v~f~CPnCG~   24 (24)
T PF07754_consen   15 VPFPCPNCGF   24 (24)
T ss_pred             ceEeCCCCCC
Confidence            3699999984


No 95 
>PF06524 NOA36:  NOA36 protein;  InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=41.30  E-value=54  Score=26.25  Aligned_cols=24  Identities=8%  Similarity=0.044  Sum_probs=14.1

Q ss_pred             CCCCCCCCCCCCCCCCCccCCCCChhhhccc
Q 044013           51 DRAKNRPPSSSSISDKLTFFPSPSTWDDHQR   81 (153)
Q Consensus        51 ekp~~C~~c~~~~Cgk~~~F~~~~~L~~H~~   81 (153)
                      .+|+.||.|+..       ......|..-.|
T Consensus       207 ~k~~PCPKCg~e-------t~eTkdLSmStR  230 (314)
T PF06524_consen  207 GKPIPCPKCGYE-------TQETKDLSMSTR  230 (314)
T ss_pred             CCCCCCCCCCCc-------ccccccceeeee
Confidence            356777766655       555555555444


No 96 
>PF01428 zf-AN1:  AN1-like Zinc finger;  InterPro: IPR000058 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents the AN1-type zinc finger domain, which has a dimetal (zinc)-bound alpha/beta fold. This domain was first identified as a zinc finger at the C terminus of AN1 Q91889 from SWISSPROT, a ubiquitin-like protein in Xenopus laevis []. The AN1-type zinc finger contains six conserved cysteines and two histidines that could potentially coordinate 2 zinc atoms. Certain stress-associated proteins (SAP) contain AN1 domain, often in combination with A20 zinc finger domains (SAP8) or C2H2 domains (SAP16) []. For example, the human protein Znf216 has an A20 zinc-finger at the N terminus and an AN1 zinc-finger at the C terminus, acting to negatively regulate the NFkappaB activation pathway and to interact with components of the immune response like RIP, IKKgamma and TRAF6. The interact of Znf216 with IKK-gamma and RIP is mediated by the A20 zinc-finger domain, while its interaction with TRAF6 is mediated by the AN1 zinc-finger domain; therefore, both zinc-finger domains are involved in regulating the immune response []. The AN1 zinc finger domain is also found in proteins containing a ubiquitin-like domain, which are involved in the ubiquitination pathway []. Proteins containing an AN1-type zinc finger include:   Ascidian posterior end mark 6 (pem-6) protein []. Human AWP1 protein (associated with PRK1), which is expressed during early embryogenesis []. Human immunoglobulin mu binding protein 2 (SMUBP-2), mutations in which cause muscular atrophy with respiratory distress type 1 [].   More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 1WFP_A 1WYS_A 1WG2_A 1WFH_A 1X4W_A 1WFE_A 1WFL_A 1X4V_A.
Probab=40.08  E-value=12  Score=20.97  Aligned_cols=15  Identities=20%  Similarity=0.722  Sum_probs=9.7

Q ss_pred             cceeccccccccCCh
Q 044013           24 RSYECVFCKRGFTTA   38 (153)
Q Consensus        24 kp~~C~~Cgk~F~~~   38 (153)
                      -||.|..|++.|-..
T Consensus        12 ~~~~C~~C~~~FC~~   26 (43)
T PF01428_consen   12 LPFKCKHCGKSFCLK   26 (43)
T ss_dssp             SHEE-TTTS-EE-TT
T ss_pred             CCeECCCCCcccCcc
Confidence            589999999998653


No 97 
>PF07800 DUF1644:  Protein of unknown function (DUF1644);  InterPro: IPR012866 This family consists of sequences found in a number of hypothetical plant proteins of unknown function. The region of interest contains nine highly conserved cysteine residues and is approximately 160 amino acids in length, which probably represent a zinc-binding domain. 
Probab=38.56  E-value=22  Score=26.14  Aligned_cols=58  Identities=17%  Similarity=0.277  Sum_probs=41.2

Q ss_pred             ceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCCCCCCCccCC-CCChhhhccccccCCCCCC
Q 044013           25 SYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSSISDKLTFFP-SPSTWDDHQRSVHTNNNNH   90 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~~Cgk~~~F~-~~~~L~~H~~~~Ht~~kp~   90 (153)
                      .-.|+.|--.-.-+... .--|.+...|+..|..=++.       |. +-..|.+|.+..|...+|-
T Consensus        80 ~L~CPLCRG~V~GWtvv-e~AR~~LN~K~RsC~~e~C~-------F~GtY~eLrKHar~~HP~~rP~  138 (162)
T PF07800_consen   80 ELACPLCRGEVKGWTVV-EPARRFLNAKKRSCSQESCS-------FSGTYSELRKHARSEHPSARPS  138 (162)
T ss_pred             cccCccccCceeceEEc-hHHHHHhccCCccCcccccc-------cccCHHHHHHHHHhhCCCCCCc
Confidence            46799986544433333 33567778899888877777       75 5578999998678888875


No 98 
>PF03604 DNA_RNApol_7kD:  DNA directed RNA polymerase, 7 kDa subunit;  InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=38.42  E-value=25  Score=18.62  Aligned_cols=26  Identities=15%  Similarity=0.255  Sum_probs=14.5

Q ss_pred             eeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCC
Q 044013           26 YECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSS   62 (153)
Q Consensus        26 ~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~   62 (153)
                      |.|..||..+....           .-+-.|+.||..
T Consensus         1 Y~C~~Cg~~~~~~~-----------~~~irC~~CG~R   26 (32)
T PF03604_consen    1 YICGECGAEVELKP-----------GDPIRCPECGHR   26 (32)
T ss_dssp             EBESSSSSSE-BST-----------SSTSSBSSSS-S
T ss_pred             CCCCcCCCeeEcCC-----------CCcEECCcCCCe
Confidence            67778887665221           235578866654


No 99 
>PF10571 UPF0547:  Uncharacterised protein family UPF0547;  InterPro: IPR018886  This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases. 
Probab=38.36  E-value=15  Score=18.54  Aligned_cols=11  Identities=27%  Similarity=0.582  Sum_probs=7.8

Q ss_pred             eeccccccccC
Q 044013           26 YECVFCKRGFT   36 (153)
Q Consensus        26 ~~C~~Cgk~F~   36 (153)
                      -.|+.||..|.
T Consensus        15 ~~Cp~CG~~F~   25 (26)
T PF10571_consen   15 KFCPHCGYDFE   25 (26)
T ss_pred             CcCCCCCCCCc
Confidence            36788887774


No 100
>KOG1146 consensus Homeobox protein [General function prediction only]
Probab=36.55  E-value=7.9  Score=37.22  Aligned_cols=51  Identities=20%  Similarity=0.212  Sum_probs=38.4

Q ss_pred             eccccccccCChhHHHHHHH-hhcCCCCCCCCCCCCCCCCCCccCCCCChhhhcccccc
Q 044013           27 ECVFCKRGFTTAQALGGHMN-IHRKDRAKNRPPSSSSISDKLTFFPSPSTWDDHQRSVH   84 (153)
Q Consensus        27 ~C~~Cgk~F~~~~~L~~H~r-~H~gekp~~C~~c~~~~Cgk~~~F~~~~~L~~H~~~~H   84 (153)
                      .|.-|+..+.....+..|+. .|+-.+-|+|+.|+-.       |.....|-.|||+.|
T Consensus       438 e~~~~e~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~-------yk~a~~L~vhmRskh  489 (1406)
T KOG1146|consen  438 ELTKAEPLLESKRSLEGQTVVLHSFFKTLKCPKCNWH-------YKLAQTLGVHMRSKH  489 (1406)
T ss_pred             cccchhhhhhhhcccccceeeeecccccccCCccchh-------hhhHHHhhhcccccc
Confidence            35556666666666777665 4666789999977777       999999999999533


No 101
>COG1655 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=36.21  E-value=7.8  Score=30.30  Aligned_cols=29  Identities=24%  Similarity=0.536  Sum_probs=22.7

Q ss_pred             CcceeccccccccCChhHHHHHHHhhcCC
Q 044013           23 GRSYECVFCKRGFTTAQALGGHMNIHRKD   51 (153)
Q Consensus        23 ekp~~C~~Cgk~F~~~~~L~~H~r~H~ge   51 (153)
                      ++.+.|++|+-.|.....+..-.|+-.|+
T Consensus        17 kk~ieCPvC~tkFkkeev~tgsiRiiagD   45 (267)
T COG1655          17 KKTIECPVCNTKFKKEEVKTGSIRIIAGD   45 (267)
T ss_pred             hceeccCcccchhhhhheeccceeEeccc
Confidence            35689999999999888777777766554


No 102
>KOG2186 consensus Cell growth-regulating nucleolar protein [Cell cycle control, cell division, chromosome partitioning]
Probab=36.05  E-value=16  Score=28.90  Aligned_cols=54  Identities=15%  Similarity=0.327  Sum_probs=38.4

Q ss_pred             eeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCCCCCCCccCCCCChhhhccccccCCCCCC
Q 044013           26 YECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSSISDKLTFFPSPSTWDDHQRSVHTNNNNH   90 (153)
Q Consensus        26 ~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~~Cgk~~~F~~~~~L~~H~~~~Ht~~kp~   90 (153)
                      |.|.+||-...-+ .+..|+-.-++ .-|.|--|+..       |.. .....|.. .-+-..-|
T Consensus         4 FtCnvCgEsvKKp-~vekH~srCrn-~~fSCIDC~k~-------F~~-~sYknH~k-CITEaQKY   57 (276)
T KOG2186|consen    4 FTCNVCGESVKKP-QVEKHMSRCRN-AYFSCIDCGKT-------FER-VSYKNHTK-CITEAQKY   57 (276)
T ss_pred             Eehhhhhhhcccc-chHHHHHhccC-CeeEEeecccc-------ccc-chhhhhhh-hcchHHHh
Confidence            7899999876654 46668877776 67888855555       988 56788877 55555555


No 103
>PHA00626 hypothetical protein
Probab=36.03  E-value=17  Score=21.96  Aligned_cols=15  Identities=20%  Similarity=0.421  Sum_probs=12.4

Q ss_pred             cceeccccccccCCh
Q 044013           24 RSYECVFCKRGFTTA   38 (153)
Q Consensus        24 kp~~C~~Cgk~F~~~   38 (153)
                      ..|+|..||..|+.-
T Consensus        22 nrYkCkdCGY~ft~~   36 (59)
T PHA00626         22 DDYVCCDCGYNDSKD   36 (59)
T ss_pred             cceEcCCCCCeechh
Confidence            469999999988753


No 104
>PF01155 HypA:  Hydrogenase expression/synthesis hypA family;  InterPro: IPR000688 Bacterial membrane-bound nickel-dependent hydrogenases requires a number of accessory proteins which are involved in their maturation. The exact role of these proteins is not yet clear, but some seem to be required for the incorporation of the nickel ions []. One of these proteins is generally known as hypA. It is a protein of about 12 to 14 kDa that contains, in its C-terminal region, four conserved cysteines that form a zinc-finger like motif. Escherichia coli has two proteins that belong to this family, hypA and hybF. A homologue, MJ0214, has also been found in a number of archaeal species, including the genome of Methanocaldococcus jannaschii (Methanococcus jannaschii).; GO: 0016151 nickel ion binding, 0006464 protein modification process; PDB: 2KDX_A 3A44_D 3A43_B.
Probab=35.81  E-value=17  Score=24.86  Aligned_cols=26  Identities=19%  Similarity=0.264  Sum_probs=16.2

Q ss_pred             ceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCC
Q 044013           25 SYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSS   62 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~   62 (153)
                      .+.|..||+.|......            +.||.|+..
T Consensus        70 ~~~C~~Cg~~~~~~~~~------------~~CP~Cgs~   95 (113)
T PF01155_consen   70 RARCRDCGHEFEPDEFD------------FSCPRCGSP   95 (113)
T ss_dssp             EEEETTTS-EEECHHCC------------HH-SSSSSS
T ss_pred             cEECCCCCCEEecCCCC------------CCCcCCcCC
Confidence            37899999998765432            349977654


No 105
>PF09845 DUF2072:  Zn-ribbon containing protein (DUF2072);  InterPro: IPR018645  This archaeal Zinc-ribbon containing proteins have no known function. 
Probab=34.54  E-value=17  Score=25.84  Aligned_cols=15  Identities=27%  Similarity=0.611  Sum_probs=12.3

Q ss_pred             ceeccccccccCChh
Q 044013           25 SYECVFCKRGFTTAQ   39 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~   39 (153)
                      |++|..||+.|...+
T Consensus         1 PH~Ct~Cg~~f~dgs   15 (131)
T PF09845_consen    1 PHQCTKCGRVFEDGS   15 (131)
T ss_pred             CcccCcCCCCcCCCc
Confidence            688999999988654


No 106
>KOG0717 consensus Molecular chaperone (DnaJ superfamily) [Posttranslational modification, protein turnover, chaperones]
Probab=34.16  E-value=25  Score=30.30  Aligned_cols=22  Identities=32%  Similarity=0.711  Sum_probs=20.3

Q ss_pred             eeccccccccCChhHHHHHHHh
Q 044013           26 YECVFCKRGFTTAQALGGHMNI   47 (153)
Q Consensus        26 ~~C~~Cgk~F~~~~~L~~H~r~   47 (153)
                      +-|.+|.|.|.+...|..|..+
T Consensus       293 lyC~vCnKsFKseKq~kNHEnS  314 (508)
T KOG0717|consen  293 LYCVVCNKSFKSEKQLKNHENS  314 (508)
T ss_pred             eEEeeccccccchHHHHhhHHH
Confidence            7899999999999999999874


No 107
>PF07295 DUF1451:  Protein of unknown function (DUF1451);  InterPro: IPR009912 This family consists of several hypothetical bacterial proteins of around 160 residues in length. Members of this family contain four highly conserved cysteine resides toward the C-terminal region of the protein. The function of this family is unknown.
Probab=33.88  E-value=33  Score=24.80  Aligned_cols=10  Identities=20%  Similarity=0.544  Sum_probs=7.3

Q ss_pred             ceeccccccc
Q 044013           25 SYECVFCKRG   34 (153)
Q Consensus        25 p~~C~~Cgk~   34 (153)
                      .|.|..||..
T Consensus       112 ~l~C~~Cg~~  121 (146)
T PF07295_consen  112 TLVCENCGHE  121 (146)
T ss_pred             eEecccCCCE
Confidence            4788888764


No 108
>smart00064 FYVE Protein present in Fab1, YOTB, Vac1, and EEA1. The FYVE zinc finger is named after four proteins where it was first found: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn2+ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. The FYVE finger is structurally related to the KOG2785 consensus C2H2-type Zn-finger protein [General function prediction only]
Probab=33.74  E-value=37  Score=28.45  Aligned_cols=54  Identities=17%  Similarity=0.137  Sum_probs=40.3

Q ss_pred             cceeccccccccCChhHHHHHHHhhcCC-----------------------CCCCCCCCCCCCCCCCccCCCCChhhhcc
Q 044013           24 RSYECVFCKRGFTTAQALGGHMNIHRKD-----------------------RAKNRPPSSSSISDKLTFFPSPSTWDDHQ   80 (153)
Q Consensus        24 kp~~C~~Cgk~F~~~~~L~~H~r~H~ge-----------------------kp~~C~~c~~~~Cgk~~~F~~~~~L~~H~   80 (153)
                      .|-.|-+|++.|........||..+.|.                       .-|.|-.|.-.  ++.  |....+...||
T Consensus       165 ~Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~--~~~--f~sleavr~HM  240 (390)
T KOG2785|consen  165 IPTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNEL--GRP--FSSLEAVRAHM  240 (390)
T ss_pred             CCcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccc--cCc--ccccHHHHHHH
Confidence            4678999999999999889999877662                       12347766611  233  99999999998


Q ss_pred             c
Q 044013           81 R   81 (153)
Q Consensus        81 ~   81 (153)
                      +
T Consensus       241 ~  241 (390)
T KOG2785|consen  241 R  241 (390)
T ss_pred             h
Confidence            7


No 110
>PF07975 C1_4:  TFIIH C1-like domain;  InterPro: IPR004595 All proteins in this domain for which functions are known are components of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. It includes the yeast transcription factor Ssl1 (Suppressor of stem-loop protein 1) that is essential for translation initiation and affects UV resistance. The C-terminal region is essential for transcription activity. This regions binds three zinc atoms through two independent domain. The first contains a C4 zinc finger motif, whereas the second is characterised by a CX(2)CX(2-4)FCADCD motif. The solution structure of the second C-terminal domain revealed homology with the regulatory domain of protein kinase C [].; GO: 0006281 DNA repair, 0005634 nucleus; PDB: 1Z60_A.
Probab=33.69  E-value=14  Score=21.89  Aligned_cols=25  Identities=28%  Similarity=0.482  Sum_probs=11.6

Q ss_pred             cceeccccccccCChhHHHHHHHhh
Q 044013           24 RSYECVFCKRGFTTAQALGGHMNIH   48 (153)
Q Consensus        24 kp~~C~~Cgk~F~~~~~L~~H~r~H   48 (153)
                      ..|+|+.|+..|-.-=.+-.|..+|
T Consensus        20 ~~y~C~~C~~~FC~dCD~fiHE~LH   44 (51)
T PF07975_consen   20 SRYRCPKCKNHFCIDCDVFIHETLH   44 (51)
T ss_dssp             EEE--TTTT--B-HHHHHTTTTTS-
T ss_pred             CeEECCCCCCccccCcChhhhcccc
Confidence            4577777777776655555554444


No 111
>KOG2071 consensus mRNA cleavage and polyadenylation factor I/II complex, subunit Pcf11 [RNA processing and modification]
Probab=33.20  E-value=25  Score=31.05  Aligned_cols=26  Identities=27%  Similarity=0.562  Sum_probs=22.8

Q ss_pred             cceeccccccccCChhHHHHHHHhhc
Q 044013           24 RSYECVFCKRGFTTAQALGGHMNIHR   49 (153)
Q Consensus        24 kp~~C~~Cgk~F~~~~~L~~H~r~H~   49 (153)
                      +|..|..||.+|........||.+|.
T Consensus       417 ~pnqC~~CG~R~~~~ee~sk~md~H~  442 (579)
T KOG2071|consen  417 SPNQCKSCGLRFDDSEERSKHMDIHD  442 (579)
T ss_pred             CcchhcccccccccchhhhhHhhhhh
Confidence            57899999999999998888888875


No 112
>PRK03681 hypA hydrogenase nickel incorporation protein; Validated
Probab=32.92  E-value=22  Score=24.47  Aligned_cols=27  Identities=19%  Similarity=0.335  Sum_probs=16.5

Q ss_pred             ceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCC
Q 044013           25 SYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSS   62 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~   62 (153)
                      .+.|..||..|.....           ..+.||.|+..
T Consensus        70 ~~~C~~Cg~~~~~~~~-----------~~~~CP~Cgs~   96 (114)
T PRK03681         70 ECWCETCQQYVTLLTQ-----------RVRRCPQCHGD   96 (114)
T ss_pred             EEEcccCCCeeecCCc-----------cCCcCcCcCCC
Confidence            4789999976654211           11568877643


No 113
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=32.76  E-value=11  Score=34.13  Aligned_cols=29  Identities=14%  Similarity=0.116  Sum_probs=17.0

Q ss_pred             CCCCChhhhccccccCCCCCCCCCCCCcccc
Q 044013           70 FPSPSTWDDHQRSVHTNNNNHSNYDSYGQNY  100 (153)
Q Consensus        70 F~~~~~L~~H~~~~Ht~~kp~~~~~~~~~~f  100 (153)
                      |+--..|..- | -+|.-+.|.-|+.|-+.|
T Consensus       133 fTIi~alPYD-R-~nTsM~~F~lC~~C~~EY  161 (750)
T COG0068         133 FTIIEALPYD-R-ENTSMADFPLCPFCDKEY  161 (750)
T ss_pred             eeeeccCCCC-c-ccCccccCcCCHHHHHHh
Confidence            6665555332 3 455566666677776666


No 114
>cd00065 FYVE FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;
Probab=31.83  E-value=35  Score=19.71  Aligned_cols=29  Identities=21%  Similarity=0.361  Sum_probs=18.9

Q ss_pred             eccccccccCChhHHHHHHHhhcCCCCCCCCCCCCCCCCCC
Q 044013           27 ECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSSISDKL   67 (153)
Q Consensus        27 ~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~~Cgk~   67 (153)
                      .|..|++.|..            ..+.+.|..|+..+|+..
T Consensus         4 ~C~~C~~~F~~------------~~rk~~Cr~Cg~~~C~~C   32 (57)
T cd00065           4 SCMGCGKPFTL------------TRRRHHCRNCGRIFCSKC   32 (57)
T ss_pred             cCcccCccccC------------CccccccCcCcCCcChHH
Confidence            57888888875            134466887776655444


No 115
>KOG2482 consensus Predicted C2H2-type Zn-finger protein [Transcription]
Probab=30.90  E-value=30  Score=28.71  Aligned_cols=23  Identities=26%  Similarity=0.693  Sum_probs=20.7

Q ss_pred             ceeccccccccCChhHHHHHHHh
Q 044013           25 SYECVFCKRGFTTAQALGGHMNI   47 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~~L~~H~r~   47 (153)
                      .+.|-.|.|.|+.+..|+.|||.
T Consensus       195 r~~CLyCekifrdkntLkeHMrk  217 (423)
T KOG2482|consen  195 RLRCLYCEKIFRDKNTLKEHMRK  217 (423)
T ss_pred             hheeeeeccccCCcHHHHHHHHh
Confidence            47899999999999999999984


No 116
>PF08790 zf-LYAR:  LYAR-type C2HC zinc finger ;  InterPro: IPR014898 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This C2HC zinc finger domain is found in LYAR proteins such as Q08288 from SWISSPROT, which are involved in cell growth regulation.  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1WJV_A.
Probab=30.26  E-value=17  Score=18.74  Aligned_cols=19  Identities=21%  Similarity=0.772  Sum_probs=10.8

Q ss_pred             eeccccccccCChhHHHHHH
Q 044013           26 YECVFCKRGFTTAQALGGHM   45 (153)
Q Consensus        26 ~~C~~Cgk~F~~~~~L~~H~   45 (153)
                      |.|-.|++.|. ....+.|.
T Consensus         1 ~sCiDC~~~F~-~~~y~~Ht   19 (28)
T PF08790_consen    1 FSCIDCSKDFD-GDSYKSHT   19 (28)
T ss_dssp             EEETTTTEEEE-GGGTTT--
T ss_pred             CeeecCCCCcC-cCCcCCCC
Confidence            56778888883 34444443


No 117
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=29.83  E-value=21  Score=23.57  Aligned_cols=28  Identities=18%  Similarity=0.228  Sum_probs=16.0

Q ss_pred             ceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCC
Q 044013           25 SYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSS   62 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~   62 (153)
                      +|.|+.|++.        .+.|+-+  .-+.|..|+..
T Consensus        35 ~~~Cp~C~~~--------~VkR~a~--GIW~C~kCg~~   62 (89)
T COG1997          35 KHVCPFCGRT--------TVKRIAT--GIWKCRKCGAK   62 (89)
T ss_pred             CCcCCCCCCc--------ceeeecc--CeEEcCCCCCe
Confidence            5778888764        1233333  34667766555


No 118
>PF10013 DUF2256:  Uncharacterized protein conserved in bacteria (DUF2256);  InterPro: IPR017136 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=27.73  E-value=25  Score=19.94  Aligned_cols=14  Identities=36%  Similarity=0.646  Sum_probs=10.7

Q ss_pred             eccccccccCChhH
Q 044013           27 ECVFCKRGFTTAQA   40 (153)
Q Consensus        27 ~C~~Cgk~F~~~~~   40 (153)
                      .|.+||+.|+-...
T Consensus        10 ~C~~C~rpf~WRKK   23 (42)
T PF10013_consen   10 ICPVCGRPFTWRKK   23 (42)
T ss_pred             cCcccCCcchHHHH
Confidence            68999999876543


No 119
>PF13824 zf-Mss51:  Zinc-finger of mitochondrial splicing suppressor 51
Probab=27.72  E-value=49  Score=19.90  Aligned_cols=12  Identities=8%  Similarity=-0.033  Sum_probs=6.7

Q ss_pred             CCCCCCCCCCCC
Q 044013           50 KDRAKNRPPSSS   61 (153)
Q Consensus        50 gekp~~C~~c~~   61 (153)
                      +...|.|+.||-
T Consensus        11 ~~v~~~Cp~cGi   22 (55)
T PF13824_consen   11 AHVNFECPDCGI   22 (55)
T ss_pred             cccCCcCCCCCC
Confidence            345677775543


No 120
>PF10276 zf-CHCC:  Zinc-finger domain;  InterPro: IPR019401 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.   This entry represents a short conserved zinc-finger domain. It contains the sequence motif Cx8Hx14Cx2C. ; PDB: 2JVM_A 2JRR_A 2JZ8_A.
Probab=27.21  E-value=32  Score=19.23  Aligned_cols=12  Identities=17%  Similarity=0.708  Sum_probs=10.2

Q ss_pred             cceecccccccc
Q 044013           24 RSYECVFCKRGF   35 (153)
Q Consensus        24 kp~~C~~Cgk~F   35 (153)
                      ++-.|++|+..|
T Consensus        28 ~~~~CpYCg~~y   39 (40)
T PF10276_consen   28 GPVVCPYCGTRY   39 (40)
T ss_dssp             CEEEETTTTEEE
T ss_pred             CeEECCCCCCEE
Confidence            478999999887


No 121
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=26.65  E-value=17  Score=23.10  Aligned_cols=42  Identities=14%  Similarity=0.228  Sum_probs=20.2

Q ss_pred             eeccccccccCChhHHHHHHHhhcCCCCCCCC--CCCCCCCCCCccCCCCChh
Q 044013           26 YECVFCKRGFTTAQALGGHMNIHRKDRAKNRP--PSSSSISDKLTFFPSPSTW   76 (153)
Q Consensus        26 ~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~--~c~~~~Cgk~~~F~~~~~L   76 (153)
                      +.|+.||........-....  -+.++-+.|.  .|+..       |.....+
T Consensus         2 m~CP~Cg~~a~irtSr~~s~--~~~~~Y~qC~N~eCg~t-------F~t~es~   45 (72)
T PRK09678          2 FHCPLCQHAAHARTSRYITD--TTKERYHQCQNVNCSAT-------FITYESV   45 (72)
T ss_pred             ccCCCCCCccEEEEChhcCh--hhheeeeecCCCCCCCE-------EEEEEEE
Confidence            46778875543322221111  1334556665  45555       6655444


No 122
>PF14311 DUF4379:  Domain of unknown function (DUF4379)
Probab=25.92  E-value=60  Score=18.91  Aligned_cols=14  Identities=14%  Similarity=0.615  Sum_probs=10.2

Q ss_pred             eeccccccccCChh
Q 044013           26 YECVFCKRGFTTAQ   39 (153)
Q Consensus        26 ~~C~~Cgk~F~~~~   39 (153)
                      .+|..||..|...-
T Consensus        29 W~C~~Cgh~w~~~v   42 (55)
T PF14311_consen   29 WKCPKCGHEWKASV   42 (55)
T ss_pred             EECCCCCCeeEccH
Confidence            77888887776543


No 123
>PRK00762 hypA hydrogenase nickel incorporation protein; Provisional
Probab=25.77  E-value=34  Score=23.85  Aligned_cols=13  Identities=23%  Similarity=0.335  Sum_probs=9.3

Q ss_pred             ceeccccccccCCh
Q 044013           25 SYECVFCKRGFTTA   38 (153)
Q Consensus        25 p~~C~~Cgk~F~~~   38 (153)
                      .+.| .||..|...
T Consensus        70 ~~~C-~Cg~~~~~~   82 (124)
T PRK00762         70 EIEC-ECGYEGVVD   82 (124)
T ss_pred             eEEe-eCcCccccc
Confidence            4788 888777654


No 124
>COG1675 TFA1 Transcription initiation factor IIE, alpha subunit [Transcription]
Probab=25.75  E-value=46  Score=24.85  Aligned_cols=14  Identities=21%  Similarity=0.565  Sum_probs=8.5

Q ss_pred             ceeccccccccCCh
Q 044013           25 SYECVFCKRGFTTA   38 (153)
Q Consensus        25 p~~C~~Cgk~F~~~   38 (153)
                      -|.|+.|.-.|+.-
T Consensus       113 ~y~C~~~~~r~sfd  126 (176)
T COG1675         113 YYVCPNCHVKYSFD  126 (176)
T ss_pred             ceeCCCCCCcccHH
Confidence            47777776655543


No 125
>PF01363 FYVE:  FYVE zinc finger;  InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=25.67  E-value=19  Score=21.92  Aligned_cols=30  Identities=13%  Similarity=0.370  Sum_probs=12.7

Q ss_pred             eeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCCCCCCC
Q 044013           26 YECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSSISDKL   67 (153)
Q Consensus        26 ~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~~Cgk~   67 (153)
                      -.|.+|++.|...            .+.+.|..||..+|..-
T Consensus        10 ~~C~~C~~~F~~~------------~rrhhCr~CG~~vC~~C   39 (69)
T PF01363_consen   10 SNCMICGKKFSLF------------RRRHHCRNCGRVVCSSC   39 (69)
T ss_dssp             SB-TTT--B-BSS------------S-EEE-TTT--EEECCC
T ss_pred             CcCcCcCCcCCCc------------eeeEccCCCCCEECCch
Confidence            4688888888531            33445887766655444


No 126
>KOG0696 consensus Serine/threonine protein kinase [Signal transduction mechanisms]
Probab=25.60  E-value=30  Score=29.87  Aligned_cols=59  Identities=17%  Similarity=0.264  Sum_probs=36.3

Q ss_pred             cceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCCCCCCCccCCCCChhhhccccccCCCCCCCCCCCCccc
Q 044013           24 RSYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSSISDKLTFFPSPSTWDDHQRSVHTNNNNHSNYDSYGQN   99 (153)
Q Consensus        24 kp~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~~Cgk~~~F~~~~~L~~H~~~~Ht~~kp~~~~~~~~~~   99 (153)
                      .-|+|.+|-        +..|+|.|. -..|.|+--.+.       +..-..-..|...+||=.-|- -|+-|+.-
T Consensus        72 QGfQCqvC~--------fvvHkrChe-fVtF~CPGadkg-------~dtDdpr~kHkf~~~tYssPT-FCDhCGsL  130 (683)
T KOG0696|consen   72 QGFQCQVCC--------FVVHKRCHE-FVTFSCPGADKG-------PDTDDPRSKHKFKIHTYSSPT-FCDHCGSL  130 (683)
T ss_pred             CceeeeEEe--------ehhhhhhcc-eEEEECCCCCCC-------CCCCCcccccceeeeecCCCc-hhhhHHHH
Confidence            345666652        457888775 567888855555       777777777765366655453 35555543


No 127
>PF11789 zf-Nse:  Zinc-finger of the MIZ type in Nse subunit; PDB: 2YU4_A 3HTK_C.
Probab=25.49  E-value=37  Score=20.32  Aligned_cols=34  Identities=12%  Similarity=0.246  Sum_probs=15.6

Q ss_pred             cceeccccccccCChhHHHHHHHhhcCCCCCCCCCCCC
Q 044013           24 RSYECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSS   61 (153)
Q Consensus        24 kp~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~   61 (153)
                      .|+....|+-.|.... +...+   ...+...||+.|+
T Consensus        23 ~PV~s~~C~H~fek~a-I~~~i---~~~~~~~CPv~GC   56 (57)
T PF11789_consen   23 DPVKSKKCGHTFEKEA-ILQYI---QRNGSKRCPVAGC   56 (57)
T ss_dssp             SEEEESSS--EEEHHH-HHHHC---TTTS-EE-SCCC-
T ss_pred             CCcCcCCCCCeecHHH-HHHHH---HhcCCCCCCCCCC
Confidence            4666667777776443 33333   2345567776543


No 128
>KOG2636 consensus Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=25.37  E-value=39  Score=29.01  Aligned_cols=24  Identities=25%  Similarity=0.589  Sum_probs=19.5

Q ss_pred             Ccceeccccc-cccCChhHHHHHHH
Q 044013           23 GRSYECVFCK-RGFTTAQALGGHMN   46 (153)
Q Consensus        23 ekp~~C~~Cg-k~F~~~~~L~~H~r   46 (153)
                      .+-|.|.||| +++.-+.++.+|-.
T Consensus       399 ~~ey~CEICGNy~Y~GrkaF~RHF~  423 (497)
T KOG2636|consen  399 DIEYNCEICGNYVYKGRKAFDRHFN  423 (497)
T ss_pred             CcccceeeccCccccCcHHHHHHhH
Confidence            4569999999 78888888888865


No 129
>PF04423 Rad50_zn_hook:  Rad50 zinc hook motif;  InterPro: IPR007517 The Mre11 complex (Mre11 Rad50 Nbs1) is central to chromosomal maintenance and functions in homologous recombination, telomere maintenance and sister chromatid association. The Rad50 coiled-coil region contains a dimer interface at the apex of the coiled coils in which pairs of conserved Cys-X-X-Cys motifs form interlocking hooks that bind one Zn ion. This alignment includes the zinc hook motif and a short stretch of coiled-coil on either side.; GO: 0004518 nuclease activity, 0005524 ATP binding, 0008270 zinc ion binding, 0006281 DNA repair; PDB: 1L8D_B.
Probab=25.27  E-value=28  Score=20.32  Aligned_cols=12  Identities=25%  Similarity=0.670  Sum_probs=6.5

Q ss_pred             eccccccccCCh
Q 044013           27 ECVFCKRGFTTA   38 (153)
Q Consensus        27 ~C~~Cgk~F~~~   38 (153)
                      .|+.|++.|...
T Consensus        22 ~CPlC~r~l~~e   33 (54)
T PF04423_consen   22 CCPLCGRPLDEE   33 (54)
T ss_dssp             E-TTT--EE-HH
T ss_pred             cCCCCCCCCCHH
Confidence            899999998753


No 130
>KOG0978 consensus E3 ubiquitin ligase involved in syntaxin degradation [Posttranslational modification, protein turnover, chaperones]
Probab=25.13  E-value=57  Score=29.59  Aligned_cols=11  Identities=18%  Similarity=0.806  Sum_probs=7.0

Q ss_pred             eeccccccccC
Q 044013           26 YECVFCKRGFT   36 (153)
Q Consensus        26 ~~C~~Cgk~F~   36 (153)
                      -.|+.|...+.
T Consensus       644 LkCs~Cn~R~K  654 (698)
T KOG0978|consen  644 LKCSVCNTRWK  654 (698)
T ss_pred             eeCCCccCchh
Confidence            46888875443


No 131
>PF11672 DUF3268:  Protein of unknown function (DUF3268);  InterPro: IPR021686 This entry is represented by Listeria phage P100, Gp150. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=25.09  E-value=24  Score=23.93  Aligned_cols=11  Identities=18%  Similarity=0.676  Sum_probs=8.0

Q ss_pred             cceeccccccc
Q 044013           24 RSYECVFCKRG   34 (153)
Q Consensus        24 kp~~C~~Cgk~   34 (153)
                      ||-.|+.||..
T Consensus         1 ~p~~CpYCg~~   11 (102)
T PF11672_consen    1 KPIICPYCGGP   11 (102)
T ss_pred             CCcccCCCCCe
Confidence            46678888854


No 132
>PF13878 zf-C2H2_3:  zinc-finger of acetyl-transferase ESCO
Probab=24.89  E-value=71  Score=17.68  Aligned_cols=24  Identities=17%  Similarity=0.283  Sum_probs=17.5

Q ss_pred             eeccccccccCC--hhHHHHHHHhhc
Q 044013           26 YECVFCKRGFTT--AQALGGHMNIHR   49 (153)
Q Consensus        26 ~~C~~Cgk~F~~--~~~L~~H~r~H~   49 (153)
                      -.|+.||-.|..  +..-..|.+.|.
T Consensus        14 ~~C~~CgM~Y~~~~~eD~~~H~~yH~   39 (41)
T PF13878_consen   14 TTCPTCGMLYSPGSPEDEKLHKKYHD   39 (41)
T ss_pred             cCCCCCCCEECCCCHHHHHHHHHHHh
Confidence            489999987765  445677777765


No 133
>PF05191 ADK_lid:  Adenylate kinase, active site lid;  InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=24.69  E-value=40  Score=18.27  Aligned_cols=12  Identities=25%  Similarity=0.465  Sum_probs=7.9

Q ss_pred             eeccccccccCC
Q 044013           26 YECVFCKRGFTT   37 (153)
Q Consensus        26 ~~C~~Cgk~F~~   37 (153)
                      ..|..||+.|..
T Consensus         2 r~C~~Cg~~Yh~   13 (36)
T PF05191_consen    2 RICPKCGRIYHI   13 (36)
T ss_dssp             EEETTTTEEEET
T ss_pred             cCcCCCCCcccc
Confidence            357778777643


No 134
>PF13451 zf-trcl:  Probable zinc-binding domain
Probab=24.21  E-value=35  Score=20.01  Aligned_cols=18  Identities=17%  Similarity=0.449  Sum_probs=13.1

Q ss_pred             CcceeccccccccCChhH
Q 044013           23 GRSYECVFCKRGFTTAQA   40 (153)
Q Consensus        23 ekp~~C~~Cgk~F~~~~~   40 (153)
                      .+++.|..||..|.-...
T Consensus         2 Dk~l~C~dCg~~FvfTa~   19 (49)
T PF13451_consen    2 DKTLTCKDCGAEFVFTAG   19 (49)
T ss_pred             CeeEEcccCCCeEEEehh
Confidence            357889999988775443


No 135
>PLN02294 cytochrome c oxidase subunit Vb
Probab=23.83  E-value=34  Score=25.50  Aligned_cols=14  Identities=36%  Similarity=0.906  Sum_probs=11.6

Q ss_pred             cceeccccccccCC
Q 044013           24 RSYECVFCKRGFTT   37 (153)
Q Consensus        24 kp~~C~~Cgk~F~~   37 (153)
                      +|++|..||..|..
T Consensus       140 kp~RCpeCG~~fkL  153 (174)
T PLN02294        140 KSFECPVCTQYFEL  153 (174)
T ss_pred             CceeCCCCCCEEEE
Confidence            68899999988864


No 136
>KOG2461 consensus Transcription factor BLIMP-1/PRDI-BF1, contains C2H2-type Zn-finger and SET domains [Transcription]
Probab=23.75  E-value=1.4e+02  Score=25.28  Aligned_cols=15  Identities=0%  Similarity=-0.031  Sum_probs=5.7

Q ss_pred             HHHHHHhhcCCCCCC
Q 044013           41 LGGHMNIHRKDRAKN   55 (153)
Q Consensus        41 L~~H~r~H~gekp~~   55 (153)
                      +..+...+.++.++.
T Consensus       347 ~~~~~~~~~~~~~~~  361 (396)
T KOG2461|consen  347 YTQSHSMEVAEPTDM  361 (396)
T ss_pred             hhhhhhcccCCCCcc
Confidence            333333333333333


No 137
>PF11931 DUF3449:  Domain of unknown function (DUF3449);  InterPro: IPR024598 This presumed domain is functionally uncharacterised. It has two conserved sequence motifs: PIP and CEICG and contains a zinc-finger of the C2H2-type.; PDB: 4DGW_A.
Probab=23.63  E-value=27  Score=26.61  Aligned_cols=21  Identities=24%  Similarity=0.651  Sum_probs=0.0

Q ss_pred             cceeccccccc-cCChhHHHHH
Q 044013           24 RSYECVFCKRG-FTTAQALGGH   44 (153)
Q Consensus        24 kp~~C~~Cgk~-F~~~~~L~~H   44 (153)
                      +.|.|.+||.. |.-+.++..|
T Consensus       100 ~ey~CEICGN~~Y~GrkaFekH  121 (196)
T PF11931_consen  100 VEYKCEICGNQSYKGRKAFEKH  121 (196)
T ss_dssp             ----------------------
T ss_pred             CeeeeEeCCCcceecHHHHHHh
Confidence            46899999953 3334444444


No 138
>KOG4602 consensus Nanos and related proteins [General function prediction only]
Probab=23.56  E-value=48  Score=26.46  Aligned_cols=12  Identities=42%  Similarity=0.869  Sum_probs=7.8

Q ss_pred             Ccceeccccccc
Q 044013           23 GRSYECVFCKRG   34 (153)
Q Consensus        23 ekp~~C~~Cgk~   34 (153)
                      .++..|.+|...
T Consensus       231 ~r~l~CvFC~nN  242 (318)
T KOG4602|consen  231 PRPLCCVFCFNN  242 (318)
T ss_pred             CCceeEEeecCC
Confidence            356678888543


No 139
>PRK01343 zinc-binding protein; Provisional
Probab=22.71  E-value=62  Score=19.60  Aligned_cols=11  Identities=27%  Similarity=0.446  Sum_probs=6.4

Q ss_pred             eeccccccccC
Q 044013           26 YECVFCKRGFT   36 (153)
Q Consensus        26 ~~C~~Cgk~F~   36 (153)
                      ..|++|++.|.
T Consensus        10 ~~CP~C~k~~~   20 (57)
T PRK01343         10 RPCPECGKPST   20 (57)
T ss_pred             CcCCCCCCcCc
Confidence            45666666553


No 140
>cd02335 ZZ_ADA2 Zinc finger, ZZ type. Zinc finger present in ADA2, a putative transcriptional adaptor, and related proteins. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding.
Probab=22.70  E-value=47  Score=19.03  Aligned_cols=28  Identities=7%  Similarity=0.058  Sum_probs=15.1

Q ss_pred             CCCCCCCC-CCCCCCCCccCCCCChhhhccc
Q 044013           52 RAKNRPPS-SSSISDKLTFFPSPSTWDDHQR   81 (153)
Q Consensus        52 kp~~C~~c-~~~~Cgk~~~F~~~~~L~~H~~   81 (153)
                      .-|+|..| ...+|..-  |........|..
T Consensus        14 ~r~~C~~C~d~dLC~~C--f~~g~~~~~H~~   42 (49)
T cd02335          14 IRIKCAECPDFDLCLEC--FSAGAEIGKHRN   42 (49)
T ss_pred             cEEECCCCCCcchhHHh--hhCcCCCCCCCC
Confidence            34566655 34455555  666655555554


No 141
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=22.38  E-value=39  Score=23.40  Aligned_cols=25  Identities=12%  Similarity=-0.039  Sum_probs=17.8

Q ss_pred             eccccccccCChhHHHHHHHhhcCCCCCCCCCCCCC
Q 044013           27 ECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSS   62 (153)
Q Consensus        27 ~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~   62 (153)
                      .|+.||+.|.-..           ..|-.|+.|+++
T Consensus        11 idPetg~KFYDLN-----------rdPiVsPytG~s   35 (129)
T COG4530          11 IDPETGKKFYDLN-----------RDPIVSPYTGKS   35 (129)
T ss_pred             cCccccchhhccC-----------CCccccCccccc
Confidence            6788888887665           356778866666


No 142
>cd00924 Cyt_c_Oxidase_Vb Cytochrome c oxidase subunit Vb.  Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes.  It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane.  The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome.  Found only in eukaryotes, subunit Vb is one of three mammalian subunits that lacks a transmembrane region.  Subunit Vb is located on the matrix side of the membrane and binds the regulatory subunit of protein kinase A.  The abnormally extended conformation is stable only in the CcO assembly.
Probab=22.38  E-value=37  Score=22.83  Aligned_cols=14  Identities=21%  Similarity=0.494  Sum_probs=11.9

Q ss_pred             cceeccccccccCC
Q 044013           24 RSYECVFCKRGFTT   37 (153)
Q Consensus        24 kp~~C~~Cgk~F~~   37 (153)
                      +|++|..||.-|..
T Consensus        78 ~~~rC~eCG~~fkL   91 (97)
T cd00924          78 KPKRCPECGHVFKL   91 (97)
T ss_pred             CceeCCCCCcEEEE
Confidence            69999999988864


No 143
>PRK05978 hypothetical protein; Provisional
Probab=22.29  E-value=65  Score=23.36  Aligned_cols=11  Identities=9%  Similarity=0.531  Sum_probs=8.1

Q ss_pred             CCCCCcccccc
Q 044013           92 NYDSYGQNYHQ  102 (153)
Q Consensus        92 ~~~~~~~~f~~  102 (153)
                      .|+.|+-.|..
T Consensus        54 ~C~~CG~~~~~   64 (148)
T PRK05978         54 HCAACGEDFTH   64 (148)
T ss_pred             CccccCCcccc
Confidence            48888888743


No 144
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=21.92  E-value=86  Score=28.69  Aligned_cols=25  Identities=8%  Similarity=-0.061  Sum_probs=13.8

Q ss_pred             ChhhhccccccCCCCCCCCCCCCcccc
Q 044013           74 STWDDHQRSVHTNNNNHSNYDSYGQNY  100 (153)
Q Consensus        74 ~~L~~H~~~~Ht~~kp~~~~~~~~~~f  100 (153)
                      ..|.-|.- .|....|. .|+-|+...
T Consensus       461 ~~L~CH~C-g~~~~~p~-~Cp~Cgs~~  485 (730)
T COG1198         461 GQLRCHYC-GYQEPIPQ-SCPECGSEH  485 (730)
T ss_pred             CeeEeCCC-CCCCCCCC-CCCCCCCCe
Confidence            34444444 34444455 678887774


No 145
>PTZ00303 phosphatidylinositol kinase; Provisional
Probab=21.78  E-value=70  Score=29.82  Aligned_cols=66  Identities=12%  Similarity=0.143  Sum_probs=36.4

Q ss_pred             eeccccccccCChhHHHHHHHhhcCCCCCCCCCCCCCCCCCCccCCCCChhhhc-cc--cccCCCCCCCCCCCCcccc
Q 044013           26 YECVFCKRGFTTAQALGGHMNIHRKDRAKNRPPSSSSISDKLTFFPSPSTWDDH-QR--SVHTNNNNHSNYDSYGQNY  100 (153)
Q Consensus        26 ~~C~~Cgk~F~~~~~L~~H~r~H~gekp~~C~~c~~~~Cgk~~~F~~~~~L~~H-~~--~~Ht~~kp~~~~~~~~~~f  100 (153)
                      -.|..|++.|.....+       .+.+.+.|..||...|++.  -.....+..- ..  ..-....|+..|..|-+.+
T Consensus       461 dtC~~C~kkFfSlsK~-------L~~RKHHCRkCGrVFC~~C--SSnRs~yp~aKLpKPgsseE~ppRRVCD~CYdq~  529 (1374)
T PTZ00303        461 DSCPSCGRAFISLSRP-------LGTRAHHCRSCGIRLCVFC--ITKRAHYSFAKLAKPGSSDEAEERLVCDTCYKEY  529 (1374)
T ss_pred             CcccCcCCcccccccc-------cccccccccCCccccCccc--cCCcccCcccccCCCCCcccccccchhHHHHHHH
Confidence            4699999999764210       1234566998888877776  4443322111 11  0111223555677777655


No 146
>PF06397 Desulfoferrod_N:  Desulfoferrodoxin, N-terminal domain;  InterPro: IPR004462 This domain is found as essentially the full length of desulforedoxin, a 37-residue homodimeric non-haem iron protein. It is also found as the N-terminal domain of desulfoferrodoxin (rbo), a homodimeric non-haem iron protein with 2 Fe atoms per monomer in different oxidation states. This domain binds the ferric rather than the ferrous Fe of desulfoferrodoxin. Neelaredoxin, a monomeric blue non-haem iron protein, lacks this domain.; GO: 0005506 iron ion binding; PDB: 1DFX_A 1VZI_B 2JI2_D 1VZH_B 2JI3_C 2JI1_C 1VZG_A 1CFW_A 2LK5_B 1DHG_B ....
Probab=21.16  E-value=43  Score=18.30  Aligned_cols=12  Identities=25%  Similarity=0.622  Sum_probs=6.7

Q ss_pred             cceecccccccc
Q 044013           24 RSYECVFCKRGF   35 (153)
Q Consensus        24 kp~~C~~Cgk~F   35 (153)
                      +-|+|..||...
T Consensus         5 ~~YkC~~CGniV   16 (36)
T PF06397_consen    5 EFYKCEHCGNIV   16 (36)
T ss_dssp             EEEE-TTT--EE
T ss_pred             cEEEccCCCCEE
Confidence            469999998754


No 147
>COG1773 Rubredoxin [Energy production and conversion]
Probab=21.09  E-value=44  Score=20.08  Aligned_cols=15  Identities=13%  Similarity=0.598  Sum_probs=11.2

Q ss_pred             ceeccccccccCChh
Q 044013           25 SYECVFCKRGFTTAQ   39 (153)
Q Consensus        25 p~~C~~Cgk~F~~~~   39 (153)
                      .|+|..||..|--..
T Consensus         3 ~~~C~~CG~vYd~e~   17 (55)
T COG1773           3 RWRCSVCGYVYDPEK   17 (55)
T ss_pred             ceEecCCceEecccc
Confidence            588999988776543


No 148
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=21.01  E-value=25  Score=23.34  Aligned_cols=8  Identities=50%  Similarity=1.294  Sum_probs=3.9

Q ss_pred             ceeccccc
Q 044013           25 SYECVFCK   32 (153)
Q Consensus        25 p~~C~~Cg   32 (153)
                      .|.|++|+
T Consensus        36 ~y~CpfCg   43 (90)
T PTZ00255         36 KYFCPFCG   43 (90)
T ss_pred             CccCCCCC
Confidence            34555554


No 149
>PF05741 zf-nanos:  Nanos RNA binding domain;  InterPro: IPR024161 Nanos is a highly conserved RNA-binding protein in higher eukaryotes and functions as a key regulatory protein in translational control using a 3' untranslated region during the development and maintenance of germ cells. Nanos comprises a non-conserved amino-terminus and highly conserved carboxy- terminal regions. The C-terminal region has two conserved Cys-Cys-His-Cys (CCHC)-type zinc-finger motifs that are indispensable for nanos function [, , ]. The structure of the nanos-type zinc finger is composed of two independent zinc-finger (ZF) lobes, the N-terminal ZF1 and the C-terminal ZF2, which are connected by a linker helix []. These lobes create a large cleft. Zinc ions in ZF1 and ZF2 are bound to the CCHC motif by tetrahedral coordination.; PDB: 3ALR_B.
Probab=20.77  E-value=63  Score=19.40  Aligned_cols=23  Identities=30%  Similarity=0.484  Sum_probs=7.5

Q ss_pred             CCCcccccCCCC------cceeccccccc
Q 044013           12 SSSDDETDAGTG------RSYECVFCKRG   34 (153)
Q Consensus        12 ~~~h~~~~~~ge------kp~~C~~Cgk~   34 (153)
                      .++|.-+...|.      +.|.|++||..
T Consensus        14 y~sH~lk~~~G~v~CPvLr~y~Cp~CgAt   42 (55)
T PF05741_consen   14 YSSHTLKDPDGRVTCPVLRKYVCPICGAT   42 (55)
T ss_dssp             HTSB-SB-TTS-B--TTGGG---TTT---
T ss_pred             EcccEEECCCCCEeCHHHhcCcCCCCcCc
Confidence            455554444433      45777777753


Done!