Query         032068
Match_columns 148
No_of_seqs    115 out of 1519
Neff          10.2
Searched_HMMs 46136
Date          Fri Mar 29 09:10:49 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/032068.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/032068hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG2462 C2H2-type Zn-finger pr  99.9 3.8E-23 8.3E-28  141.7   4.1  116    7-123   128-263 (279)
  2 KOG2462 C2H2-type Zn-finger pr  99.8 8.6E-20 1.9E-24  125.3   3.3   98    7-108   159-265 (279)
  3 PLN03086 PRLI-interacting fact  99.4 2.9E-13 6.2E-18  103.0   7.3  123    8-131   406-563 (567)
  4 KOG1074 Transcriptional repres  99.4 1.2E-13 2.7E-18  107.1   1.3   49   60-108   605-655 (958)
  5 KOG3623 Homeobox transcription  99.3 5.4E-13 1.2E-17  102.4   1.9   97   10-107   211-330 (1007)
  6 KOG3608 Zn finger proteins [Ge  99.3 1.4E-12   3E-17   92.9   1.3  107    2-108   200-314 (467)
  7 KOG3623 Homeobox transcription  99.3   1E-12 2.2E-17  100.9   0.7   78    6-107   891-971 (1007)
  8 KOG3608 Zn finger proteins [Ge  99.2 4.9E-11 1.1E-15   85.2   6.4  131    9-141   263-406 (467)
  9 KOG3576 Ovo and related transc  99.2 4.4E-12 9.6E-17   84.1   1.1   77   32-108   114-195 (267)
 10 KOG3576 Ovo and related transc  99.1 3.5E-11 7.7E-16   79.9   2.6  103    5-108   113-234 (267)
 11 PLN03086 PRLI-interacting fact  99.1 4.8E-10   1E-14   85.8   7.1  100    6-109   450-563 (567)
 12 KOG1074 Transcriptional repres  99.0 1.9E-10   4E-15   89.8   1.1   99    7-106   603-729 (958)
 13 PHA02768 hypothetical protein;  98.7   1E-08 2.2E-13   54.5   1.6   41   60-102     5-47  (55)
 14 PHA00733 hypothetical protein   98.6 3.9E-08 8.6E-13   62.2   2.5   74   33-108    38-121 (128)
 15 PF13465 zf-H2C2_2:  Zinc-finge  98.5   4E-08 8.8E-13   44.6   0.8   25   74-98      1-25  (26)
 16 PHA00733 hypothetical protein   98.2 8.3E-07 1.8E-11   56.2   2.3   77    6-82     37-122 (128)
 17 PHA02768 hypothetical protein;  98.2 8.6E-07 1.9E-11   47.1   2.0   37   10-48      6-44  (55)
 18 PF13465 zf-H2C2_2:  Zinc-finge  98.1 1.9E-06 4.1E-11   39.0   1.5   24   23-46      1-25  (26)
 19 PHA00616 hypothetical protein   98.0 2.7E-06 5.9E-11   43.1   1.5   33   60-92      1-34  (44)
 20 PHA00616 hypothetical protein   97.7 1.4E-05   3E-10   40.5   0.8   32    9-40      1-34  (44)
 21 PHA00732 hypothetical protein   97.7 3.8E-05 8.3E-10   44.4   2.5   42   60-106     1-44  (79)
 22 PHA00732 hypothetical protein   97.6 7.2E-05 1.6E-09   43.2   2.9   41    9-54      1-44  (79)
 23 KOG3993 Transcription factor (  97.5 2.7E-05 5.9E-10   57.5   0.7   29   87-115   356-385 (500)
 24 PF05605 zf-Di19:  Drought indu  97.4  0.0003 6.4E-09   37.6   3.7   46   61-107     3-50  (54)
 25 PF05605 zf-Di19:  Drought indu  97.4 0.00033 7.2E-09   37.5   3.8   45    9-54      2-49  (54)
 26 PF00096 zf-C2H2:  Zinc finger,  97.2 0.00016 3.6E-09   31.4   1.0   21   88-108     1-22  (23)
 27 PF00096 zf-C2H2:  Zinc finger,  96.9 0.00031 6.6E-09   30.5   0.5   21   61-81      1-22  (23)
 28 PF13913 zf-C2HC_2:  zinc-finge  96.8  0.0011 2.4E-08   29.5   2.0   22   88-109     3-24  (25)
 29 PF13912 zf-C2H2_6:  C2H2-type   96.7 0.00077 1.7E-08   30.4   0.8   22   61-82      2-24  (27)
 30 PF13894 zf-C2H2_4:  C2H2-type   96.5  0.0018   4E-08   28.0   1.6   21   88-108     1-22  (24)
 31 KOG3993 Transcription factor (  96.4  0.0005 1.1E-08   51.0  -0.9   20  109-128   362-381 (500)
 32 PF13912 zf-C2H2_6:  C2H2-type   96.2  0.0011 2.5E-08   29.9  -0.1   23    9-31      1-25  (27)
 33 PF13894 zf-C2H2_4:  C2H2-type   95.9  0.0037 8.1E-08   26.9   0.8   21   61-81      1-22  (24)
 34 PRK04860 hypothetical protein;  95.8  0.0051 1.1E-07   40.5   1.5   36   60-98    119-154 (160)
 35 smart00355 ZnF_C2H2 zinc finge  95.7   0.011 2.4E-07   25.7   2.0   20   88-107     1-21  (26)
 36 PF12756 zf-C2H2_2:  C2H2 type   95.7  0.0072 1.6E-07   36.0   1.8   16   12-27      2-18  (100)
 37 PF02176 zf-TRAF:  TRAF-type zi  95.6   0.003 6.5E-08   34.3  -0.2   46   60-105     9-60  (60)
 38 PF02176 zf-TRAF:  TRAF-type zi  95.4  0.0068 1.5E-07   32.9   0.9   45    7-51      7-58  (60)
 39 PRK04860 hypothetical protein;  94.5   0.035 7.6E-07   36.5   2.5   35    9-46    119-154 (160)
 40 COG5189 SFP1 Putative transcri  94.4   0.025 5.5E-07   40.9   1.7   15   84-98    395-409 (423)
 41 COG5189 SFP1 Putative transcri  94.4  0.0099 2.1E-07   42.9  -0.3   64    7-79    347-418 (423)
 42 PF09237 GAGA:  GAGA factor;  I  94.2   0.031 6.7E-07   29.2   1.4   21    7-27     22-43  (54)
 43 KOG2186 Cell growth-regulating  93.9   0.041 8.9E-07   38.5   1.9   47    9-55      3-49  (276)
 44 PF09237 GAGA:  GAGA factor;  I  93.8    0.04 8.8E-07   28.8   1.3   28   60-87     24-52  (54)
 45 COG2888 Predicted Zn-ribbon RN  93.4     0.1 2.3E-06   28.1   2.5   10   35-44     27-36  (61)
 46 PF12756 zf-C2H2_2:  C2H2 type   93.4   0.043 9.3E-07   32.6   1.2   71   37-109     1-73  (100)
 47 PF12874 zf-met:  Zinc-finger o  92.9   0.036 7.9E-07   24.2   0.3   20   88-107     1-21  (25)
 48 PF12171 zf-C2H2_jaz:  Zinc-fin  92.8   0.061 1.3E-06   24.1   1.0   20   61-80      2-22  (27)
 49 PF13909 zf-H2C2_5:  C2H2-type   91.6    0.15 3.3E-06   21.9   1.5   21   88-108     1-21  (24)
 50 smart00734 ZnF_Rad18 Rad18-lik  91.2    0.29 6.3E-06   21.8   2.2   21   89-109     3-23  (26)
 51 PRK14890 putative Zn-ribbon RN  91.0    0.33 7.1E-06   26.2   2.6   10   86-95     47-56  (59)
 52 smart00451 ZnF_U1 U1-like zinc  89.0    0.32   7E-06   22.9   1.5   21   87-107     3-24  (35)
 53 KOG2186 Cell growth-regulating  88.7    0.25 5.3E-06   34.7   1.3   46   35-80      3-49  (276)
 54 PF08209 Sgf11:  Sgf11 (transcr  87.8     0.4 8.7E-06   22.7   1.4   25   87-111     4-28  (33)
 55 PF10571 UPF0547:  Uncharacteri  86.4    0.68 1.5E-05   20.6   1.6    9   37-45     16-24  (26)
 56 PRK00398 rpoP DNA-directed RNA  81.6    0.95 2.1E-05   23.0   1.2   29    9-46      3-32  (46)
 57 PF09538 FYDLN_acid:  Protein o  81.5     1.2 2.6E-05   27.3   1.8   13   60-72     26-38  (108)
 58 cd00350 rubredoxin_like Rubred  81.0     1.2 2.6E-05   20.9   1.4   23   10-43      2-25  (33)
 59 TIGR02098 MJ0042_CXXC MJ0042 f  81.0    0.83 1.8E-05   22.1   0.8   11   10-20      3-13  (38)
 60 PF13717 zinc_ribbon_4:  zinc-r  80.3    0.47   1E-05   22.9  -0.2    6   89-94     27-32  (36)
 61 PF06524 NOA36:  NOA36 protein;  79.8     1.9 4.1E-05   30.7   2.5   73   31-103   138-226 (314)
 62 KOG2231 Predicted E3 ubiquitin  78.5     3.6 7.9E-05   33.3   4.0   20   61-80    183-203 (669)
 63 COG1592 Rubrerythrin [Energy p  77.2     1.4 3.1E-05   29.2   1.3   23   60-94    134-156 (166)
 64 PF13451 zf-trcl:  Probable zin  76.7     1.5 3.2E-05   22.9   1.0   14    7-20      2-15  (49)
 65 cd00729 rubredoxin_SM Rubredox  76.2     2.1 4.6E-05   20.3   1.4   25    9-43      2-26  (34)
 66 COG1198 PriA Primosomal protei  76.1     1.3 2.9E-05   36.2   1.1   12   85-96    473-484 (730)
 67 TIGR00622 ssl1 transcription f  75.2     5.5 0.00012   24.6   3.3   19    8-26     14-33  (112)
 68 PF09723 Zn-ribbon_8:  Zinc rib  75.1     1.6 3.5E-05   21.8   0.9   11   10-20      6-16  (42)
 69 TIGR02605 CxxC_CxxC_SSSS putat  74.2     1.3 2.7E-05   23.1   0.4   11   10-20      6-16  (52)
 70 smart00659 RPOLCX RNA polymera  74.1     2.3   5E-05   21.5   1.3   27   10-45      3-29  (44)
 71 TIGR00373 conserved hypothetic  74.0     2.3 5.1E-05   27.9   1.7   28   60-96    109-137 (158)
 72 COG4049 Uncharacterized protei  73.7     1.7 3.8E-05   23.1   0.8   32   82-113    12-45  (65)
 73 PF03145 Sina:  Seven in absent  73.6     1.8   4E-05   29.4   1.1   46   14-59     23-72  (198)
 74 smart00834 CxxC_CXXC_SSSS Puta  73.1     1.5 3.3E-05   21.4   0.5   11   10-20      6-16  (41)
 75 PRK06266 transcription initiat  71.4     2.8   6E-05   28.2   1.6   29   60-97    117-146 (178)
 76 KOG0297 TNF receptor-associate  71.2     7.2 0.00016   29.7   3.9   80   13-96     94-177 (391)
 77 smart00531 TFIIE Transcription  70.7     3.3 7.1E-05   26.8   1.8   34   60-97     99-133 (147)
 78 KOG1146 Homeobox protein [Gene  70.2     1.8 3.9E-05   37.5   0.6   79   30-109   460-541 (1406)
 79 PF03145 Sina:  Seven in absent  69.5     3.5 7.6E-05   28.0   1.8   44   64-108    22-69  (198)
 80 PF14353 CpXC:  CpXC protein     69.4     3.2   7E-05   26.1   1.5   11   60-70     38-48  (128)
 81 smart00614 ZnF_BED BED zinc fi  68.2     3.9 8.5E-05   21.1   1.4   23   88-110    19-48  (50)
 82 TIGR02300 FYDLN_acid conserved  65.4     5.7 0.00012   25.1   2.0   13   60-72     26-38  (129)
 83 PF03604 DNA_RNApol_7kD:  DNA d  64.7     5.3 0.00011   18.7   1.3   11   10-20      1-11  (32)
 84 PF02892 zf-BED:  BED zinc fing  63.5     4.3 9.4E-05   20.2   1.0   21   86-106    15-40  (45)
 85 KOG4173 Alpha-SNAP protein [In  61.2     3.8 8.2E-05   28.1   0.7   71   36-107    80-167 (253)
 86 PF07754 DUF1610:  Domain of un  60.4     4.7  0.0001   17.6   0.7    8   87-94     16-23  (24)
 87 PF01530 zf-C2HC:  Zinc finger,  60.2     4.5 9.8E-05   18.8   0.6   12  136-147    16-27  (31)
 88 PHA00626 hypothetical protein   59.2     6.5 0.00014   21.0   1.2   18   86-103    22-39  (59)
 89 KOG2893 Zn finger protein [Gen  58.9     2.7 5.8E-05   29.6  -0.3   28   40-69     15-43  (341)
 90 COG1996 RPC10 DNA-directed RNA  58.6     4.1 8.9E-05   21.2   0.4   11   60-70      6-16  (49)
 91 COG5048 FOG: Zn-finger [Genera  58.3     2.9 6.2E-05   31.2  -0.3   49   60-108   289-345 (467)
 92 KOG2893 Zn finger protein [Gen  57.2     3.8 8.3E-05   28.8   0.2   30   63-96     13-43  (341)
 93 KOG1146 Homeobox protein [Gene  56.7     7.2 0.00016   34.1   1.7   26    4-29    460-487 (1406)
 94 PRK00464 nrdR transcriptional   55.6     4.6 9.9E-05   26.5   0.3   11   36-46     29-39  (154)
 95 KOG0297 TNF receptor-associate  51.6      26 0.00057   26.7   3.9   63   64-129    94-162 (391)
 96 PF15135 UPF0515:  Uncharacteri  51.0     9.9 0.00021   27.0   1.4   13    5-17    108-120 (278)
 97 COG3357 Predicted transcriptio  50.4      12 0.00027   22.1   1.5   28   34-68     57-84  (97)
 98 PF09845 DUF2072:  Zn-ribbon co  49.9      13 0.00028   23.6   1.7   12    9-20      1-12  (131)
 99 PF12230 PRP21_like_P:  Pre-mRN  49.8     5.4 0.00012   27.8   0.0   25   84-108   165-189 (229)
100 COG1997 RPL43A Ribosomal prote  49.3      14  0.0003   21.8   1.6   32   34-72     34-65  (89)
101 PF13248 zf-ribbon_3:  zinc-rib  48.6      14 0.00029   16.2   1.2    8   11-18      4-11  (26)
102 PF14369 zf-RING_3:  zinc-finge  47.4      17 0.00037   17.3   1.6   30    9-46      2-32  (35)
103 PRK03824 hypA hydrogenase nick  46.6     7.9 0.00017   24.7   0.4   12    9-20     70-81  (135)
104 KOG2907 RNA polymerase I trans  44.4      13 0.00029   22.9   1.1   38   60-98     74-113 (116)
105 PF05253 zf-U11-48K:  U11-48K-l  44.1      21 0.00046   15.8   1.5   22   89-110     4-26  (27)
106 COG5236 Uncharacterized conser  42.9      17 0.00037   27.2   1.6   59   49-107   168-241 (493)
107 PRK04023 DNA polymerase II lar  42.2      40 0.00087   29.1   3.7   10   87-96    663-672 (1121)
108 PF09963 DUF2197:  Uncharacteri  41.2      12 0.00025   20.1   0.4   34   11-44      4-40  (56)
109 PF12907 zf-met2:  Zinc-binding  40.8     4.9 0.00011   19.9  -1.0   18   10-27      2-23  (40)
110 PRK12380 hydrogenase nickel in  39.9      15 0.00033   22.6   0.9    9   37-45     72-80  (113)
111 PF13878 zf-C2H2_3:  zinc-finge  39.8      30 0.00065   17.1   1.8   11   10-20     14-24  (41)
112 PF02591 DUF164:  Putative zinc  39.7      32 0.00069   18.1   2.0   33   60-95     22-54  (56)
113 PLN03238 probable histone acet  39.1      27 0.00059   25.4   2.2   38   84-121    45-83  (290)
114 smart00440 ZnF_C2C2 C2C2 Zinc   38.8     5.4 0.00012   19.7  -1.0   10   88-97     29-38  (40)
115 KOG2231 Predicted E3 ubiquitin  37.3      91   0.002   25.8   4.9   19   89-107   184-203 (669)
116 PRK14714 DNA polymerase II lar  36.6      39 0.00084   29.9   2.9   27   60-96    692-718 (1337)
117 PF07975 C1_4:  TFIIH C1-like d  36.6      13 0.00029   19.5   0.2   14   35-48     21-34  (51)
118 PRK00564 hypA hydrogenase nick  36.5      20 0.00044   22.2   1.1    9   37-45     73-81  (117)
119 KOG2593 Transcription initiati  36.0      28 0.00062   26.8   1.9   16   33-48    126-141 (436)
120 TIGR00100 hypA hydrogenase nic  35.8      22 0.00048   22.0   1.2    9   37-45     72-80  (115)
121 TIGR00595 priA primosomal prot  35.0      12 0.00025   29.6  -0.2    8   60-67    253-260 (505)
122 PRK03681 hypA hydrogenase nick  34.2      21 0.00046   22.0   0.9    8   37-44     72-79  (114)
123 PF01155 HypA:  Hydrogenase exp  33.2      12 0.00026   23.0  -0.3    9   37-45     72-80  (113)
124 PF14446 Prok-RING_1:  Prokaryo  32.3      28 0.00061   18.5   1.0   10   11-20      7-16  (54)
125 PF14205 Cys_rich_KTR:  Cystein  32.2      13 0.00028   19.8  -0.3   34   35-69      4-37  (55)
126 COG2331 Uncharacterized protei  31.3      15 0.00032   21.0  -0.1   30   10-44     13-42  (82)
127 COG1579 Zn-ribbon protein, pos  30.3      35 0.00076   24.2   1.5   11   60-70    221-231 (239)
128 PF05191 ADK_lid:  Adenylate ki  30.2      18 0.00038   17.4   0.1    9   62-70      3-11  (36)
129 PF10013 DUF2256:  Uncharacteri  28.9      48   0.001   16.6   1.5   16   89-104    10-26  (42)
130 COG4530 Uncharacterized protei  28.7      42 0.00091   20.7   1.5   19   86-104    25-43  (129)
131 smart00154 ZnF_AN1 AN1-like Zi  28.5      32 0.00068   16.8   0.8   12    9-20     12-23  (39)
132 PRK09678 DNA-binding transcrip  26.0      33 0.00072   19.4   0.7   12   60-71     27-40  (72)
133 PTZ00064 histone acetyltransfe  25.9      41 0.00088   26.7   1.3   37   85-121   278-315 (552)
134 PF05290 Baculo_IE-1:  Baculovi  25.9      68  0.0015   20.6   2.1   16    5-20     76-91  (140)
135 PRK05978 hypothetical protein;  25.5      28 0.00061   22.7   0.4    7   62-68     35-41  (148)
136 PF01428 zf-AN1:  AN1-like Zinc  25.3      20 0.00043   17.8  -0.3   12   86-97     12-23  (43)
137 PF08790 zf-LYAR:  LYAR-type C2  25.3      16 0.00035   16.6  -0.5    9   63-71      3-11  (28)
138 PF12760 Zn_Tnp_IS1595:  Transp  24.6      26 0.00056   17.6   0.1    9   86-94     36-44  (46)
139 COG0068 HypF Hydrogenase matur  24.3      13 0.00027   30.6  -1.6   11   38-48    126-136 (750)
140 PF12773 DZR:  Double zinc ribb  23.9   1E+02  0.0022   15.4   2.3    9   60-68     29-37  (50)
141 COG1656 Uncharacterized conser  23.4      68  0.0015   21.4   1.8   11   88-98    131-141 (165)
142 PF14445 Prok-RING_2:  Prokaryo  23.3      36 0.00078   17.8   0.4   18   87-104     7-24  (57)
143 KOG2482 Predicted C2H2-type Zn  23.2      50  0.0011   24.8   1.3   22   87-108   195-217 (423)
144 PF11789 zf-Nse:  Zinc-finger o  23.2      55  0.0012   17.4   1.2   31   59-92     23-53  (57)
145 COG4896 Uncharacterized protei  23.0      28 0.00061   19.0   0.0   35   11-45      4-41  (68)
146 PF11672 DUF3268:  Protein of u  23.0      44 0.00096   20.3   0.9   12    9-20      2-13  (102)
147 PF04959 ARS2:  Arsenite-resist  22.8      50  0.0011   23.0   1.2   25   84-108    74-99  (214)
148 PF01096 TFIIS_C:  Transcriptio  22.8     2.7 5.8E-05   20.6  -3.7    9   88-96     29-37  (39)
149 PF07800 DUF1644:  Protein of u  22.7      72  0.0016   21.1   1.8   48   60-108    80-130 (162)
150 KOG3408 U1-like Zn-finger-cont  22.2      46 0.00099   20.9   0.8   22   60-81     57-79  (129)
151 PF04423 Rad50_zn_hook:  Rad50   22.0      30 0.00066   18.0   0.0   12   37-48     22-33  (54)
152 PLN00104 MYST -like histone ac  20.8      52  0.0011   25.7   1.0   37   85-121   196-233 (450)
153 COG1327 Predicted transcriptio  20.7      30 0.00064   22.7  -0.2   13   60-72     28-40  (156)
154 KOG3002 Zn finger protein [Gen  20.7 1.8E+02   0.004   21.4   3.8   96   10-108    49-161 (299)
155 COG5048 FOG: Zn-finger [Genera  20.6      59  0.0013   24.1   1.3   50   59-108    32-85  (467)
156 KOG2612 Predicted integral mem  20.4      65  0.0014   19.0   1.1   22   89-110    76-97  (103)
157 PF01286 XPA_N:  XPA protein N-  20.3      12 0.00025   17.9  -1.6   16   62-77      5-20  (34)
158 PF07295 DUF1451:  Protein of u  20.2      35 0.00076   22.2   0.0   31   60-99    112-143 (146)
159 COG5151 SSL1 RNA polymerase II  20.1      57  0.0012   24.2   1.1   12   87-98    388-399 (421)

No 1  
>KOG2462 consensus C2H2-type Zn-finger protein [Transcription]
Probab=99.87  E-value=3.8e-23  Score=141.67  Aligned_cols=116  Identities=23%  Similarity=0.443  Sum_probs=102.7

Q ss_pred             CCcccCCCCCCCCC-cchHHHH-HhhcC---CCCccccccCCccChh-hHHHHHHcCCC-CccCccchhccch-HHHHhh
Q 032068            7 ETTKICSHCDRAIP-SSNIDLH-FAHCS---RNLERCKVCGDMVPRK-YAEEHFLNTHA-PVACSQCSETMER-EILAIH   78 (148)
Q Consensus         7 ~~~~~C~~C~~~~~-~~~l~~H-~~~~~---~~~~~C~~C~~~f~~~-~l~~h~~~~h~-~~~C~~C~~~~~~-~~l~~h   78 (148)
                      ...|.|..|++.+. .++|..| +.|+.   .+.+.|+.|++.+... .+.-|+ ++|+ +..|..||+.|.+ =.|+.|
T Consensus       128 ~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHi-rTH~l~c~C~iCGKaFSRPWLLQGH  206 (279)
T KOG2462|consen  128 HPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHI-RTHTLPCECGICGKAFSRPWLLQGH  206 (279)
T ss_pred             CCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHh-hccCCCcccccccccccchHHhhcc
Confidence            33589999999999 9999999 78886   5678999999999877 566675 7776 9999999999998 567999


Q ss_pred             HhhcCCCCcccCCCCCCCCC-hhhHHHHHHH-----------hcccccCCcchhhhh
Q 032068           79 KGENCPQRIVTCDFCEFPLP-AVDLAEHQVA-----------IVNNFENGKTIADLS  123 (148)
Q Consensus        79 ~~~~~~~~~~~C~~C~~~f~-~~~l~~H~~~-----------c~~~~~~~~~~~~~~  123 (148)
                      +|+|+|||||.|+.|+++|. +++|.+|+++           |++.|..-+.|.+..
T Consensus       207 iRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~  263 (279)
T KOG2462|consen  207 IRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHS  263 (279)
T ss_pred             cccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhh
Confidence            99999999999999999999 9999999986           899998888887766


No 2  
>KOG2462 consensus C2H2-type Zn-finger protein [Transcription]
Probab=99.78  E-value=8.6e-20  Score=125.32  Aligned_cols=98  Identities=18%  Similarity=0.409  Sum_probs=87.6

Q ss_pred             CCcccCCCCCCCCC-cchHHHH-HhhcCCCCccccccCCccChh-----hHHHHHHcCCCCccCccchhccch-HHHHhh
Q 032068            7 ETTKICSHCDRAIP-SSNIDLH-FAHCSRNLERCKVCGDMVPRK-----YAEEHFLNTHAPVACSQCSETMER-EILAIH   78 (148)
Q Consensus         7 ~~~~~C~~C~~~~~-~~~l~~H-~~~~~~~~~~C~~C~~~f~~~-----~l~~h~~~~h~~~~C~~C~~~~~~-~~l~~h   78 (148)
                      .+.+.|..|+++|. ...|+.| ++|.-  +..|..||+.|+..     ++++|.++  +||.|..|++.|.. ++|+.|
T Consensus       159 ~ka~~C~~C~K~YvSmpALkMHirTH~l--~c~C~iCGKaFSRPWLLQGHiRTHTGE--KPF~C~hC~kAFADRSNLRAH  234 (279)
T KOG2462|consen  159 KKAFSCKYCGKVYVSMPALKMHIRTHTL--PCECGICGKAFSRPWLLQGHIRTHTGE--KPFSCPHCGKAFADRSNLRAH  234 (279)
T ss_pred             cccccCCCCCceeeehHHHhhHhhccCC--CcccccccccccchHHhhcccccccCC--CCccCCcccchhcchHHHHHH
Confidence            56689999999999 9999999 78884  78999999999977     34444444  69999999999988 999999


Q ss_pred             HhhcCCCCcccCCCCCCCCC-hhhHHHHHHH
Q 032068           79 KGENCPQRIVTCDFCEFPLP-AVDLAEHQVA  108 (148)
Q Consensus        79 ~~~~~~~~~~~C~~C~~~f~-~~~l~~H~~~  108 (148)
                      +++|.+.++|.|+.|++.|. .+-|..|...
T Consensus       235 mQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES  265 (279)
T KOG2462|consen  235 MQTHSDVKKHQCPRCGKSFALKSYLNKHSES  265 (279)
T ss_pred             HHhhcCCccccCcchhhHHHHHHHHHHhhhh
Confidence            99999999999999999999 9999999875


No 3  
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.44  E-value=2.9e-13  Score=103.05  Aligned_cols=123  Identities=28%  Similarity=0.537  Sum_probs=99.4

Q ss_pred             CcccCCCCCCCCCcchHHHHHhhcCCC--------------------CccccccCCccChhhHHHHHHcCCCCccCccch
Q 032068            8 TTKICSHCDRAIPSSNIDLHFAHCSRN--------------------LERCKVCGDMVPRKYAEEHFLNTHAPVACSQCS   67 (148)
Q Consensus         8 ~~~~C~~C~~~~~~~~l~~H~~~~~~~--------------------~~~C~~C~~~f~~~~l~~h~~~~h~~~~C~~C~   67 (148)
                      ....|..|.+.++..+|..|+.+|...                    .+.|+.|+..|....+..|....|+++.|+ |+
T Consensus       406 ~~V~C~NC~~~i~l~~l~lHe~~C~r~~V~Cp~~~Cg~v~~r~el~~H~~C~~Cgk~f~~s~LekH~~~~Hkpv~Cp-Cg  484 (567)
T PLN03086        406 DTVECRNCKHYIPSRSIALHEAYCSRHNVVCPHDGCGIVLRVEEAKNHVHCEKCGQAFQQGEMEKHMKVFHEPLQCP-CG  484 (567)
T ss_pred             CeEECCCCCCccchhHHHHHHhhCCCcceeCCcccccceeeccccccCccCCCCCCccchHHHHHHHHhcCCCccCC-CC
Confidence            346799999999989999997777643                    345678888887667888876667899999 99


Q ss_pred             hccchHHHHhhHhhcCCCCcccCCCCCCCCC-----------hhhHHHHHHHhcccccCCc----chhhhhhhccCcce
Q 032068           68 ETMEREILAIHKGENCPQRIVTCDFCEFPLP-----------AVDLAEHQVAIVNNFENGK----TIADLSYLGVMLCC  131 (148)
Q Consensus        68 ~~~~~~~l~~h~~~~~~~~~~~C~~C~~~f~-----------~~~l~~H~~~c~~~~~~~~----~~~~~~~~~~~~~~  131 (148)
                      ..+....|..|..++++++++.|++|++.|.           .+.|..|..+||.....+.    .+...+++.|+..|
T Consensus       485 ~~~~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~CG~rt~~C~~Cgk~Vrlrdm~~H~~~~  563 (567)
T PLN03086        485 VVLEKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESICGSRTAPCDSCGRSVMLKEMDIHQIAV  563 (567)
T ss_pred             CCcchhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhcCCcceEccccCCeeeehhHHHHHHHh
Confidence            7666699999999999999999999999885           2479999999988887754    35667777777654


No 4  
>KOG1074 consensus Transcriptional repressor SALM [Transcription]
Probab=99.37  E-value=1.2e-13  Score=107.06  Aligned_cols=49  Identities=20%  Similarity=0.438  Sum_probs=45.7

Q ss_pred             CccCccchhccch-HHHHhhHhhcCCCCcccCCCCCCCCC-hhhHHHHHHH
Q 032068           60 PVACSQCSETMER-EILAIHKGENCPQRIVTCDFCEFPLP-AVDLAEHQVA  108 (148)
Q Consensus        60 ~~~C~~C~~~~~~-~~l~~h~~~~~~~~~~~C~~C~~~f~-~~~l~~H~~~  108 (148)
                      |-+|..|.+...- +.|+-|.++|+||+||+|++||++|. +.+|+.|+-+
T Consensus       605 PNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~v  655 (958)
T KOG1074|consen  605 PNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSV  655 (958)
T ss_pred             ccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccc
Confidence            6689999999887 99999999999999999999999999 9999999854


No 5  
>KOG3623 consensus Homeobox transcription factor SIP1 [Transcription]
Probab=99.32  E-value=5.4e-13  Score=102.39  Aligned_cols=97  Identities=19%  Similarity=0.382  Sum_probs=81.8

Q ss_pred             ccCCCCCCCCC-cchHHHH-H-hhcC-CCCccccccCCccChh-hHHHHHHcCC----------------CCccCccchh
Q 032068           10 KICSHCDRAIP-SSNIDLH-F-AHCS-RNLERCKVCGDMVPRK-YAEEHFLNTH----------------APVACSQCSE   68 (148)
Q Consensus        10 ~~C~~C~~~~~-~~~l~~H-~-~~~~-~~~~~C~~C~~~f~~~-~l~~h~~~~h----------------~~~~C~~C~~   68 (148)
                      .+|+.|.+.+. ...|+.| + +|-. +..|.|..|...|... .|..|. ..|                +.|.|..||+
T Consensus       211 ltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm-~~hkpg~dqa~sltqsa~lRKFKCtECgK  289 (1007)
T KOG3623|consen  211 LTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHM-QLHKPGGDQAISLTQSALLRKFKCTECGK  289 (1007)
T ss_pred             hcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHH-HhhcCCCcccccccchhhhccccccccch
Confidence            47899999999 8899999 6 4443 4468999999999988 344443 222                3899999999


Q ss_pred             ccch-HHHHhhHhhcCCCCcccCCCCCCCCC-hhhHHHHHH
Q 032068           69 TMER-EILAIHKGENCPQRIVTCDFCEFPLP-AVDLAEHQV  107 (148)
Q Consensus        69 ~~~~-~~l~~h~~~~~~~~~~~C~~C~~~f~-~~~l~~H~~  107 (148)
                      .|.. ..|..|+|+|.||+||.|+.|+++|. +.....|+-
T Consensus       290 AFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmS  330 (1007)
T KOG3623|consen  290 AFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMS  330 (1007)
T ss_pred             hhhhHHHHHhhheeecCCCCcCCcccccccccCCccccccc
Confidence            9999 89999999999999999999999999 889999974


No 6  
>KOG3608 consensus Zn finger proteins [General function prediction only]
Probab=99.26  E-value=1.4e-12  Score=92.90  Aligned_cols=107  Identities=21%  Similarity=0.344  Sum_probs=63.2

Q ss_pred             CccCCCCcccCCCCCCCCC-cchHHHH-H--hhcCCCCccccccCCccChh-hHHHHHHcCCCCccCccchhccch-HHH
Q 032068            2 AMTSDETTKICSHCDRAIP-SSNIDLH-F--AHCSRNLERCKVCGDMVPRK-YAEEHFLNTHAPVACSQCSETMER-EIL   75 (148)
Q Consensus         2 ~~h~~~~~~~C~~C~~~~~-~~~l~~H-~--~~~~~~~~~C~~C~~~f~~~-~l~~h~~~~h~~~~C~~C~~~~~~-~~l   75 (148)
                      ++|+++|...|+.|+.-|. ...|-+| +  .-...++|.|..|.+.|.++ -|..|..+--..|.|+.|.-..+. +.|
T Consensus       200 r~Hs~eKvvACp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHvn~ykCplCdmtc~~~ssL  279 (467)
T KOG3608|consen  200 RTHSNEKVVACPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHVNCYKCPLCDMTCSSASSL  279 (467)
T ss_pred             HhcCCCeEEecchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCCChHHH
Confidence            4677777777777777777 7777777 4  33345567777777777666 233443211124555555555544 555


Q ss_pred             HhhHhh-cCCCCcccCCCCCCCCC-hhhHHHHHHH
Q 032068           76 AIHKGE-NCPQRIVTCDFCEFPLP-AVDLAEHQVA  108 (148)
Q Consensus        76 ~~h~~~-~~~~~~~~C~~C~~~f~-~~~l~~H~~~  108 (148)
                      ..|++. |...+|++|+.|+..+. -+.|+.|..+
T Consensus       280 ~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~~  314 (467)
T KOG3608|consen  280 TTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQV  314 (467)
T ss_pred             HHHHHhhhccCCCccccchhhhhccHHHHHHHHHh
Confidence            555542 44556666666666555 6666666544


No 7  
>KOG3623 consensus Homeobox transcription factor SIP1 [Transcription]
Probab=99.26  E-value=1e-12  Score=100.91  Aligned_cols=78  Identities=22%  Similarity=0.481  Sum_probs=55.2

Q ss_pred             CCCcccCCCCCCCCC-cchHHHH-HhhcCCCCccccccCCccChhhHHHHHHcCCCCccCccchhccchHHHHhhHhhcC
Q 032068            6 DETTKICSHCDRAIP-SSNIDLH-FAHCSRNLERCKVCGDMVPRKYAEEHFLNTHAPVACSQCSETMEREILAIHKGENC   83 (148)
Q Consensus         6 ~~~~~~C~~C~~~~~-~~~l~~H-~~~~~~~~~~C~~C~~~f~~~~l~~h~~~~h~~~~C~~C~~~~~~~~l~~h~~~~~   83 (148)
                      .+-.|.|..|+++|. .+.|.+| -.|+|.+|++|..|.+.|..+                        ..|..|.|.|.
T Consensus       891 e~gmyaCDqCDK~FqKqSSLaRHKYEHsGqRPyqC~iCkKAFKHK------------------------HHLtEHkRLHS  946 (1007)
T KOG3623|consen  891 EDGMYACDQCDKAFQKQSSLARHKYEHSGQRPYQCIICKKAFKHK------------------------HHLTEHKRLHS  946 (1007)
T ss_pred             ccccchHHHHHHHHHhhHHHHHhhhhhcCCCCcccchhhHhhhhh------------------------hhhhhhhhhcc
Confidence            344566666666666 6666666 466666666666666655544                        34455778899


Q ss_pred             CCCcccCCCCCCCCC-hhhHHHHHH
Q 032068           84 PQRIVTCDFCEFPLP-AVDLAEHQV  107 (148)
Q Consensus        84 ~~~~~~C~~C~~~f~-~~~l~~H~~  107 (148)
                      ||+||.|+.|+++|. +.....|+.
T Consensus       947 GEKPfQCdKClKRFSHSGSYSQHMN  971 (1007)
T KOG3623|consen  947 GEKPFQCDKCLKRFSHSGSYSQHMN  971 (1007)
T ss_pred             CCCcchhhhhhhhcccccchHhhhc
Confidence            999999999999999 888888864


No 8  
>KOG3608 consensus Zn finger proteins [General function prediction only]
Probab=99.20  E-value=4.9e-11  Score=85.15  Aligned_cols=131  Identities=18%  Similarity=0.270  Sum_probs=96.3

Q ss_pred             cccCCCCCCCCC-cchHHHH-H-hhcCCCCccccccCCccChh-hHHHHHHcCCC--CccCcc--chhccch-HHHHhhH
Q 032068            9 TKICSHCDRAIP-SSNIDLH-F-AHCSRNLERCKVCGDMVPRK-YAEEHFLNTHA--PVACSQ--CSETMER-EILAIHK   79 (148)
Q Consensus         9 ~~~C~~C~~~~~-~~~l~~H-~-~~~~~~~~~C~~C~~~f~~~-~l~~h~~~~h~--~~~C~~--C~~~~~~-~~l~~h~   79 (148)
                      -|+|+.|+.+.+ .+.|..| + .|...+||+|..|+..+... +|..|.. +|.  .|.|..  |..++.. .+|..|.
T Consensus       263 ~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~-~HS~~~y~C~h~~C~~s~r~~~q~~~H~  341 (467)
T KOG3608|consen  263 CYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQ-VHSKTVYQCEHPDCHYSVRTYTQMRRHF  341 (467)
T ss_pred             cccccccccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHHHHHHH-hccccceecCCCCCcHHHHHHHHHHHHH
Confidence            367888888877 8888888 6 77778888888888777766 5666653 442  677755  7777766 7777776


Q ss_pred             hhc-CC--CCcccCCCCCCCCC-hhhHHHHHHHhcccccCCcchhhhhhhccCcceeeeccchhhh
Q 032068           80 GEN-CP--QRIVTCDFCEFPLP-AVDLAEHQVAIVNNFENGKTIADLSYLGVMLCCFVIVGSLWES  141 (148)
Q Consensus        80 ~~~-~~--~~~~~C~~C~~~f~-~~~l~~H~~~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  141 (148)
                      +.+ .|  ..+|.|-.|.+.|. ...|..|+.. ...|..-+...++.|..+.|+.-++...++||
T Consensus       342 ~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~k-kH~f~~PsGh~RFtYk~~edG~mRLqt~rles  406 (467)
T KOG3608|consen  342 LEVHEGNNPILYACHCCDRFFTSGKSLSAHLMK-KHGFRLPSGHKRFTYKVDEDGFMRLQTTRLES  406 (467)
T ss_pred             HHhccCCCCCceeeecchhhhccchhHHHHHHH-hhcccCCCCCCceeeeeccCceeeeeeeehhh
Confidence            532 23  34578888888888 7788888753 24577777888999999999999998888876


No 9  
>KOG3576 consensus Ovo and related transcription factors [Transcription]
Probab=99.20  E-value=4.4e-12  Score=84.14  Aligned_cols=77  Identities=21%  Similarity=0.393  Sum_probs=63.5

Q ss_pred             CCCCccccccCCccChhhHHHHHHcCCC---CccCccchhccch-HHHHhhHhhcCCCCcccCCCCCCCCC-hhhHHHHH
Q 032068           32 SRNLERCKVCGDMVPRKYAEEHFLNTHA---PVACSQCSETMER-EILAIHKGENCPQRIVTCDFCEFPLP-AVDLAEHQ  106 (148)
Q Consensus        32 ~~~~~~C~~C~~~f~~~~l~~h~~~~h~---~~~C~~C~~~~~~-~~l~~h~~~~~~~~~~~C~~C~~~f~-~~~l~~H~  106 (148)
                      +...+.|..|+++|....|...+..+|.   .+.|..||+.|.. -+|..|.++|+|.+||+|..|+++|. +-.|..|+
T Consensus       114 d~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl  193 (267)
T KOG3576|consen  114 DQDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHL  193 (267)
T ss_pred             CCCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHH
Confidence            3456889999999987754433346663   7789999999988 88999999999999999999999999 88999998


Q ss_pred             HH
Q 032068          107 VA  108 (148)
Q Consensus       107 ~~  108 (148)
                      +.
T Consensus       194 ~k  195 (267)
T KOG3576|consen  194 KK  195 (267)
T ss_pred             HH
Confidence            75


No 10 
>KOG3576 consensus Ovo and related transcription factors [Transcription]
Probab=99.12  E-value=3.5e-11  Score=79.90  Aligned_cols=103  Identities=20%  Similarity=0.390  Sum_probs=82.7

Q ss_pred             CCCCcccCCCCCCCCC-cchHHHH-HhhcCCCCccccccCCccChh-hHHHHHHcCC---CCccCccchhccch-HHHHh
Q 032068            5 SDETTKICSHCDRAIP-SSNIDLH-FAHCSRNLERCKVCGDMVPRK-YAEEHFLNTH---APVACSQCSETMER-EILAI   77 (148)
Q Consensus         5 ~~~~~~~C~~C~~~~~-~~~l~~H-~~~~~~~~~~C~~C~~~f~~~-~l~~h~~~~h---~~~~C~~C~~~~~~-~~l~~   77 (148)
                      .+...+.|..|++.|. ..-|.+| ..|...+.+.|..|++.|-.. ++..|. +.|   +||.|..|++.|.. -.|..
T Consensus       113 sd~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~-rthtgvrpykc~~c~kaftqrcsles  191 (267)
T KOG3576|consen  113 SDQDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHT-RTHTGVRPYKCSLCEKAFTQRCSLES  191 (267)
T ss_pred             CCCCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhh-ccccCccccchhhhhHHHHhhccHHH
Confidence            4566799999999999 8888899 699999999999999999877 677665 555   49999999999987 67777


Q ss_pred             hHhhcC-----------CCCcccCCCCCCCCC-hhhHHHHHHH
Q 032068           78 HKGENC-----------PQRIVTCDFCEFPLP-AVDLAEHQVA  108 (148)
Q Consensus        78 h~~~~~-----------~~~~~~C~~C~~~f~-~~~l~~H~~~  108 (148)
                      |...-.           ..+.|.|+.||..-. ...+..|.+.
T Consensus       192 hl~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~  234 (267)
T KOG3576|consen  192 HLKKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKL  234 (267)
T ss_pred             HHHHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHh
Confidence            765322           346799999998766 6667777664


No 11 
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.06  E-value=4.8e-10  Score=85.76  Aligned_cols=100  Identities=20%  Similarity=0.486  Sum_probs=80.5

Q ss_pred             CCCcccCCCCCCCCCcchHHHH-HhhcCCCCccccccCCccChhhHHHHHHc-C-CCCccCccchhccc----------h
Q 032068            6 DETTKICSHCDRAIPSSNIDLH-FAHCSRNLERCKVCGDMVPRKYAEEHFLN-T-HAPVACSQCSETME----------R   72 (148)
Q Consensus         6 ~~~~~~C~~C~~~~~~~~l~~H-~~~~~~~~~~C~~C~~~f~~~~l~~h~~~-~-h~~~~C~~C~~~~~----------~   72 (148)
                      -++.+.|..|++.|....|..| ..+.  .++.|+ |+..+....+..|... + .+++.|..|+..|.          .
T Consensus       450 l~~H~~C~~Cgk~f~~s~LekH~~~~H--kpv~Cp-Cg~~~~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~  526 (567)
T PLN03086        450 AKNHVHCEKCGQAFQQGEMEKHMKVFH--EPLQCP-CGVVLEKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDR  526 (567)
T ss_pred             cccCccCCCCCCccchHHHHHHHHhcC--CCccCC-CCCCcchhHHHhhhhccCCCCceeCCCCCCccccCccccchhhh
Confidence            3567899999999997789999 5553  689999 9977655577777532 1 25999999999884          2


Q ss_pred             -HHHHhhHhhcCCCCcccCCCCCCCCChhhHHHHHHHh
Q 032068           73 -EILAIHKGENCPQRIVTCDFCEFPLPAVDLAEHQVAI  109 (148)
Q Consensus        73 -~~l~~h~~~~~~~~~~~C~~C~~~f~~~~l~~H~~~c  109 (148)
                       +.|..|+.. +|.+++.|..||+.|+...+..|+-.|
T Consensus       527 ~s~Lt~HE~~-CG~rt~~C~~Cgk~Vrlrdm~~H~~~~  563 (567)
T PLN03086        527 LRGMSEHESI-CGSRTAPCDSCGRSVMLKEMDIHQIAV  563 (567)
T ss_pred             hhhHHHHHHh-cCCcceEccccCCeeeehhHHHHHHHh
Confidence             579999976 799999999999999988889998655


No 12 
>KOG1074 consensus Transcriptional repressor SALM [Transcription]
Probab=98.95  E-value=1.9e-10  Score=89.83  Aligned_cols=99  Identities=20%  Similarity=0.444  Sum_probs=80.2

Q ss_pred             CCcccCCCCCCCCC-cchHHHH-HhhcCCCCccccccCCccChh-hHHHHHHcCCC-------CccCc---cchhccch-
Q 032068            7 ETTKICSHCDRAIP-SSNIDLH-FAHCSRNLERCKVCGDMVPRK-YAEEHFLNTHA-------PVACS---QCSETMER-   72 (148)
Q Consensus         7 ~~~~~C~~C~~~~~-~~~l~~H-~~~~~~~~~~C~~C~~~f~~~-~l~~h~~~~h~-------~~~C~---~C~~~~~~-   72 (148)
                      -.|.+|..|.++.. .+.|+.| +.|.+++||+|..|++.|..+ +|+.|. .+|+       .+.|+   .|.+.|.. 
T Consensus       603 TdPNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~-~vHka~p~~R~q~ScP~~~ic~~kftn~  681 (958)
T KOG1074|consen  603 TDPNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHM-SVHKAKPPARVQFSCPSTFICQKKFTNA  681 (958)
T ss_pred             CCccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcc-cccccCccccccccCCchhhhccccccc
Confidence            45789999999999 9999999 899999999999999999999 788886 5553       77898   99999988 


Q ss_pred             HHHHhhHhhcCCC-C------------cccCCCCCCCCC-hhhHHHHH
Q 032068           73 EILAIHKGENCPQ-R------------IVTCDFCEFPLP-AVDLAEHQ  106 (148)
Q Consensus        73 ~~l~~h~~~~~~~-~------------~~~C~~C~~~f~-~~~l~~H~  106 (148)
                      -.|.+|+++|.+. .            ...|..|.+.|. -..+..++
T Consensus       682 V~lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~  729 (958)
T KOG1074|consen  682 VTLPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQI  729 (958)
T ss_pred             ccccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhh
Confidence            7889999988722 2            235777888875 44444443


No 13 
>PHA02768 hypothetical protein; Provisional
Probab=98.67  E-value=1e-08  Score=54.49  Aligned_cols=41  Identities=12%  Similarity=0.194  Sum_probs=35.2

Q ss_pred             CccCccchhccch-HHHHhhHhhcCCCCcccCCCCCCCCC-hhhH
Q 032068           60 PVACSQCSETMER-EILAIHKGENCPQRIVTCDFCEFPLP-AVDL  102 (148)
Q Consensus        60 ~~~C~~C~~~~~~-~~l~~h~~~~~~~~~~~C~~C~~~f~-~~~l  102 (148)
                      .|.|+.||+.|.. +.|..|+++|.  +++.|..|++.|. .+.|
T Consensus         5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l   47 (55)
T PHA02768          5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEY   47 (55)
T ss_pred             ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceeccccee
Confidence            5789999999988 89999999988  6889999999888 5544


No 14 
>PHA00733 hypothetical protein
Probab=98.57  E-value=3.9e-08  Score=62.16  Aligned_cols=74  Identities=14%  Similarity=0.201  Sum_probs=56.5

Q ss_pred             CCCccccccCCccChhhHHHHH------HcCC--CCccCccchhccch-HHHHhhHhhcCCCCcccCCCCCCCCC-hhhH
Q 032068           33 RNLERCKVCGDMVPRKYAEEHF------LNTH--APVACSQCSETMER-EILAIHKGENCPQRIVTCDFCEFPLP-AVDL  102 (148)
Q Consensus        33 ~~~~~C~~C~~~f~~~~l~~h~------~~~h--~~~~C~~C~~~~~~-~~l~~h~~~~~~~~~~~C~~C~~~f~-~~~l  102 (148)
                      .+++.|..|...|....+..+.      ...+  .+|.|+.|++.|.. ..|..|++.+  +.++.|+.|++.|. ...|
T Consensus        38 ~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL  115 (128)
T PHA00733         38 QKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDST  115 (128)
T ss_pred             hhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHH
Confidence            4567898998888766332221      1111  39999999999988 8999999865  46799999999999 8999


Q ss_pred             HHHHHH
Q 032068          103 AEHQVA  108 (148)
Q Consensus       103 ~~H~~~  108 (148)
                      ..|...
T Consensus       116 ~~H~~~  121 (128)
T PHA00733        116 LDHVCK  121 (128)
T ss_pred             HHHHHH
Confidence            999763


No 15 
>PF13465 zf-H2C2_2:  Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.49  E-value=4e-08  Score=44.64  Aligned_cols=25  Identities=16%  Similarity=0.371  Sum_probs=22.6

Q ss_pred             HHHhhHhhcCCCCcccCCCCCCCCC
Q 032068           74 ILAIHKGENCPQRIVTCDFCEFPLP   98 (148)
Q Consensus        74 ~l~~h~~~~~~~~~~~C~~C~~~f~   98 (148)
                      +|..|+++|.+++||.|+.|++.|.
T Consensus         1 ~l~~H~~~H~~~k~~~C~~C~k~F~   25 (26)
T PF13465_consen    1 NLRRHMRTHTGEKPYKCPYCGKSFS   25 (26)
T ss_dssp             HHHHHHHHHSSSSSEEESSSSEEES
T ss_pred             CHHHHhhhcCCCCCCCCCCCcCeeC
Confidence            4788999999999999999998874


No 16 
>PHA00733 hypothetical protein
Probab=98.22  E-value=8.3e-07  Score=56.15  Aligned_cols=77  Identities=13%  Similarity=0.181  Sum_probs=58.2

Q ss_pred             CCCcccCCCCCCCCC-cchHHHH------HhhcCCCCccccccCCccChh-hHHHHHHcCCCCccCccchhccch-HHHH
Q 032068            6 DETTKICSHCDRAIP-SSNIDLH------FAHCSRNLERCKVCGDMVPRK-YAEEHFLNTHAPVACSQCSETMER-EILA   76 (148)
Q Consensus         6 ~~~~~~C~~C~~~~~-~~~l~~H------~~~~~~~~~~C~~C~~~f~~~-~l~~h~~~~h~~~~C~~C~~~~~~-~~l~   76 (148)
                      .++++.|..|++.|. ...|..+      +...+.+|+.|+.|++.|... .+..|....-.++.|+.|++.|.. ..|.
T Consensus        37 ~~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h~~~~~C~~CgK~F~~~~sL~  116 (128)
T PHA00733         37 EQKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYTEHSKVCPVCGKEFRNTDSTL  116 (128)
T ss_pred             hhhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcCCcCccCCCCCCccCCHHHHH
Confidence            467789999998887 6666654      123347799999999999988 566776421147999999999988 8899


Q ss_pred             hhHhhc
Q 032068           77 IHKGEN   82 (148)
Q Consensus        77 ~h~~~~   82 (148)
                      .|+...
T Consensus       117 ~H~~~~  122 (128)
T PHA00733        117 DHVCKK  122 (128)
T ss_pred             HHHHHh
Confidence            888643


No 17 
>PHA02768 hypothetical protein; Provisional
Probab=98.21  E-value=8.6e-07  Score=47.14  Aligned_cols=37  Identities=16%  Similarity=0.231  Sum_probs=27.2

Q ss_pred             ccCCCCCCCCC-cchHHHH-HhhcCCCCccccccCCccChh
Q 032068           10 KICSHCDRAIP-SSNIDLH-FAHCSRNLERCKVCGDMVPRK   48 (148)
Q Consensus        10 ~~C~~C~~~~~-~~~l~~H-~~~~~~~~~~C~~C~~~f~~~   48 (148)
                      |+|+.||+.|. .++|..| +.|.  ++++|..|++.|...
T Consensus         6 y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~   44 (55)
T PHA02768          6 YECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRT   44 (55)
T ss_pred             cCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceeccc
Confidence            67888888888 7788888 5666  467777777777644


No 18 
>PF13465 zf-H2C2_2:  Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.09  E-value=1.9e-06  Score=39.05  Aligned_cols=24  Identities=25%  Similarity=0.553  Sum_probs=20.4

Q ss_pred             hHHHH-HhhcCCCCccccccCCccC
Q 032068           23 NIDLH-FAHCSRNLERCKVCGDMVP   46 (148)
Q Consensus        23 ~l~~H-~~~~~~~~~~C~~C~~~f~   46 (148)
                      +|..| +.|.+++|+.|+.|++.|.
T Consensus         1 ~l~~H~~~H~~~k~~~C~~C~k~F~   25 (26)
T PF13465_consen    1 NLRRHMRTHTGEKPYKCPYCGKSFS   25 (26)
T ss_dssp             HHHHHHHHHSSSSSEEESSSSEEES
T ss_pred             CHHHHhhhcCCCCCCCCCCCcCeeC
Confidence            47888 6899999999999998875


No 19 
>PHA00616 hypothetical protein
Probab=98.02  E-value=2.7e-06  Score=43.09  Aligned_cols=33  Identities=15%  Similarity=0.267  Sum_probs=29.0

Q ss_pred             CccCccchhccch-HHHHhhHhhcCCCCcccCCC
Q 032068           60 PVACSQCSETMER-EILAIHKGENCPQRIVTCDF   92 (148)
Q Consensus        60 ~~~C~~C~~~~~~-~~l~~h~~~~~~~~~~~C~~   92 (148)
                      ||+|+.||+.|.. +.|..|++.+.+++++.|+.
T Consensus         1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~   34 (44)
T PHA00616          1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEY   34 (44)
T ss_pred             CCccchhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence            6889999999988 89999999999999988764


No 20 
>PHA00616 hypothetical protein
Probab=97.70  E-value=1.4e-05  Score=40.54  Aligned_cols=32  Identities=13%  Similarity=0.185  Sum_probs=28.1

Q ss_pred             cccCCCCCCCCC-cchHHHH-HhhcCCCCccccc
Q 032068            9 TKICSHCDRAIP-SSNIDLH-FAHCSRNLERCKV   40 (148)
Q Consensus         9 ~~~C~~C~~~~~-~~~l~~H-~~~~~~~~~~C~~   40 (148)
                      ||+|..||+.|. .++|..| ..+.++.++.|+.
T Consensus         1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~   34 (44)
T PHA00616          1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEY   34 (44)
T ss_pred             CCccchhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence            689999999999 9999999 7888888888763


No 21 
>PHA00732 hypothetical protein
Probab=97.67  E-value=3.8e-05  Score=44.37  Aligned_cols=42  Identities=21%  Similarity=0.388  Sum_probs=26.4

Q ss_pred             CccCccchhccch-HHHHhhHhh-cCCCCcccCCCCCCCCChhhHHHHH
Q 032068           60 PVACSQCSETMER-EILAIHKGE-NCPQRIVTCDFCEFPLPAVDLAEHQ  106 (148)
Q Consensus        60 ~~~C~~C~~~~~~-~~l~~h~~~-~~~~~~~~C~~C~~~f~~~~l~~H~  106 (148)
                      ||.|..|++.|.. ..|..|++. |.   ++.|+.|++.|.  .+..|.
T Consensus         1 py~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~--~l~~H~   44 (79)
T PHA00732          1 MFKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR--RLNQHF   44 (79)
T ss_pred             CccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC--Chhhhh
Confidence            4667777777766 677777653 43   246777777765  345554


No 22 
>PHA00732 hypothetical protein
Probab=97.59  E-value=7.2e-05  Score=43.25  Aligned_cols=41  Identities=24%  Similarity=0.470  Sum_probs=30.2

Q ss_pred             cccCCCCCCCCC-cchHHHH-H-hhcCCCCccccccCCccChhhHHHHH
Q 032068            9 TKICSHCDRAIP-SSNIDLH-F-AHCSRNLERCKVCGDMVPRKYAEEHF   54 (148)
Q Consensus         9 ~~~C~~C~~~~~-~~~l~~H-~-~~~~~~~~~C~~C~~~f~~~~l~~h~   54 (148)
                      ||.|..|++.|. .+.|..| . .|.+   +.|+.|++.|..  +..|.
T Consensus         1 py~C~~Cgk~F~s~s~Lk~H~r~~H~~---~~C~~CgKsF~~--l~~H~   44 (79)
T PHA00732          1 MFKCPICGFTTVTLFALKQHARRNHTL---TKCPVCNKSYRR--LNQHF   44 (79)
T ss_pred             CccCCCCCCccCCHHHHHHHhhcccCC---CccCCCCCEeCC--hhhhh
Confidence            578899999998 8888888 4 3543   478888888873  44454


No 23 
>KOG3993 consensus Transcription factor (contains Zn finger) [Transcription]
Probab=97.53  E-value=2.7e-05  Score=57.49  Aligned_cols=29  Identities=21%  Similarity=0.293  Sum_probs=23.5

Q ss_pred             cccCCCCCCCCC-hhhHHHHHHHhcccccC
Q 032068           87 IVTCDFCEFPLP-AVDLAEHQVAIVNNFEN  115 (148)
Q Consensus        87 ~~~C~~C~~~f~-~~~l~~H~~~c~~~~~~  115 (148)
                      .|.|..|++.|+ ..-|..|+..+.+....
T Consensus       356 i~~C~~C~KkFrRqAYLrKHqlthq~~~~~  385 (500)
T KOG3993|consen  356 IFSCHTCGKKFRRQAYLRKHQLTHQRAPLA  385 (500)
T ss_pred             eeecHHhhhhhHHHHHHHHhHHhhhccccc
Confidence            589999999999 88999998876554444


No 24 
>PF05605 zf-Di19:  Drought induced 19 protein (Di19), zinc-binding;  InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.42  E-value=0.0003  Score=37.64  Aligned_cols=46  Identities=20%  Similarity=0.445  Sum_probs=25.5

Q ss_pred             ccCccchhccchHHHHhhHhh-cCCC-CcccCCCCCCCCChhhHHHHHH
Q 032068           61 VACSQCSETMEREILAIHKGE-NCPQ-RIVTCDFCEFPLPAVDLAEHQV  107 (148)
Q Consensus        61 ~~C~~C~~~~~~~~l~~h~~~-~~~~-~~~~C~~C~~~f~~~~l~~H~~  107 (148)
                      |.|+.|++.|....|..|... |..+ +.+.||.|...+. .+|..|+.
T Consensus         3 f~CP~C~~~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~-~~l~~Hl~   50 (54)
T PF05605_consen    3 FTCPYCGKGFSESSLVEHCEDEHRSESKNVVCPICSSRVT-DNLIRHLN   50 (54)
T ss_pred             cCCCCCCCccCHHHHHHHHHhHCcCCCCCccCCCchhhhh-hHHHHHHH
Confidence            566666665444556666432 3332 4567777765433 36666654


No 25 
>PF05605 zf-Di19:  Drought induced 19 protein (Di19), zinc-binding;  InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.40  E-value=0.00033  Score=37.45  Aligned_cols=45  Identities=22%  Similarity=0.425  Sum_probs=23.8

Q ss_pred             cccCCCCCCCCCcchHHHH-H-hhcCC-CCccccccCCccChhhHHHHH
Q 032068            9 TKICSHCDRAIPSSNIDLH-F-AHCSR-NLERCKVCGDMVPRKYAEEHF   54 (148)
Q Consensus         9 ~~~C~~C~~~~~~~~l~~H-~-~~~~~-~~~~C~~C~~~f~~~~l~~h~   54 (148)
                      .|.|+.|++.|....|..| . .|..+ +.+.||.|...+.. ++..|+
T Consensus         2 ~f~CP~C~~~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~~-~l~~Hl   49 (54)
T PF05605_consen    2 SFTCPYCGKGFSESSLVEHCEDEHRSESKNVVCPICSSRVTD-NLIRHL   49 (54)
T ss_pred             CcCCCCCCCccCHHHHHHHHHhHCcCCCCCccCCCchhhhhh-HHHHHH
Confidence            3566666664445566666 4 44443 34566666654332 344443


No 26 
>PF00096 zf-C2H2:  Zinc finger, C2H2 type;  InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain.  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.20  E-value=0.00016  Score=31.39  Aligned_cols=21  Identities=19%  Similarity=0.449  Sum_probs=15.0

Q ss_pred             ccCCCCCCCCC-hhhHHHHHHH
Q 032068           88 VTCDFCEFPLP-AVDLAEHQVA  108 (148)
Q Consensus        88 ~~C~~C~~~f~-~~~l~~H~~~  108 (148)
                      |.|+.|++.|. ...|..|++.
T Consensus         1 y~C~~C~~~f~~~~~l~~H~~~   22 (23)
T PF00096_consen    1 YKCPICGKSFSSKSNLKRHMRR   22 (23)
T ss_dssp             EEETTTTEEESSHHHHHHHHHH
T ss_pred             CCCCCCCCccCCHHHHHHHHhH
Confidence            46777777777 7777777654


No 27 
>PF00096 zf-C2H2:  Zinc finger, C2H2 type;  InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain.  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=96.92  E-value=0.00031  Score=30.51  Aligned_cols=21  Identities=19%  Similarity=0.434  Sum_probs=15.2

Q ss_pred             ccCccchhccch-HHHHhhHhh
Q 032068           61 VACSQCSETMER-EILAIHKGE   81 (148)
Q Consensus        61 ~~C~~C~~~~~~-~~l~~h~~~   81 (148)
                      |.|+.|++.|.. ..|..|++.
T Consensus         1 y~C~~C~~~f~~~~~l~~H~~~   22 (23)
T PF00096_consen    1 YKCPICGKSFSSKSNLKRHMRR   22 (23)
T ss_dssp             EEETTTTEEESSHHHHHHHHHH
T ss_pred             CCCCCCCCccCCHHHHHHHHhH
Confidence            467778888866 777777754


No 28 
>PF13913 zf-C2HC_2:  zinc-finger of a C2HC-type
Probab=96.83  E-value=0.0011  Score=29.51  Aligned_cols=22  Identities=23%  Similarity=0.451  Sum_probs=15.7

Q ss_pred             ccCCCCCCCCChhhHHHHHHHh
Q 032068           88 VTCDFCEFPLPAVDLAEHQVAI  109 (148)
Q Consensus        88 ~~C~~C~~~f~~~~l~~H~~~c  109 (148)
                      ..|+.||+.|..+.|..|+.+|
T Consensus         3 ~~C~~CgR~F~~~~l~~H~~~C   24 (25)
T PF13913_consen    3 VPCPICGRKFNPDRLEKHEKIC   24 (25)
T ss_pred             CcCCCCCCEECHHHHHHHHHhc
Confidence            3577777777777777777665


No 29 
>PF13912 zf-C2H2_6:  C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.67  E-value=0.00077  Score=30.44  Aligned_cols=22  Identities=27%  Similarity=0.617  Sum_probs=11.9

Q ss_pred             ccCccchhccch-HHHHhhHhhc
Q 032068           61 VACSQCSETMER-EILAIHKGEN   82 (148)
Q Consensus        61 ~~C~~C~~~~~~-~~l~~h~~~~   82 (148)
                      +.|..|++.|.. ..|..|.+.|
T Consensus         2 ~~C~~C~~~F~~~~~l~~H~~~h   24 (27)
T PF13912_consen    2 FECDECGKTFSSLSALREHKRSH   24 (27)
T ss_dssp             EEETTTTEEESSHHHHHHHHCTT
T ss_pred             CCCCccCCccCChhHHHHHhHHh
Confidence            455555555555 5555555443


No 30 
>PF13894 zf-C2H2_4:  C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=96.52  E-value=0.0018  Score=27.95  Aligned_cols=21  Identities=19%  Similarity=0.490  Sum_probs=13.6

Q ss_pred             ccCCCCCCCCC-hhhHHHHHHH
Q 032068           88 VTCDFCEFPLP-AVDLAEHQVA  108 (148)
Q Consensus        88 ~~C~~C~~~f~-~~~l~~H~~~  108 (148)
                      |.|+.|++.|. ...|..|++.
T Consensus         1 ~~C~~C~~~~~~~~~l~~H~~~   22 (24)
T PF13894_consen    1 FQCPICGKSFRSKSELRQHMRT   22 (24)
T ss_dssp             EE-SSTS-EESSHHHHHHHHHH
T ss_pred             CCCcCCCCcCCcHHHHHHHHHh
Confidence            46777777777 7777777653


No 31 
>KOG3993 consensus Transcription factor (contains Zn finger) [Transcription]
Probab=96.44  E-value=0.0005  Score=51.04  Aligned_cols=20  Identities=5%  Similarity=-0.052  Sum_probs=12.8

Q ss_pred             hcccccCCcchhhhhhhccC
Q 032068          109 IVNNFENGKTIADLSYLGVM  128 (148)
Q Consensus       109 c~~~~~~~~~~~~~~~~~~~  128 (148)
                      |++.|.....|+...+.-+|
T Consensus       362 C~KkFrRqAYLrKHqlthq~  381 (500)
T KOG3993|consen  362 CGKKFRRQAYLRKHQLTHQR  381 (500)
T ss_pred             hhhhhHHHHHHHHhHHhhhc
Confidence            66777666666666555555


No 32 
>PF13912 zf-C2H2_6:  C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.21  E-value=0.0011  Score=29.86  Aligned_cols=23  Identities=26%  Similarity=0.634  Sum_probs=15.4

Q ss_pred             cccCCCCCCCCC-cchHHHH-Hhhc
Q 032068            9 TKICSHCDRAIP-SSNIDLH-FAHC   31 (148)
Q Consensus         9 ~~~C~~C~~~~~-~~~l~~H-~~~~   31 (148)
                      ||+|..|++.|. ...|..| +.|+
T Consensus         1 ~~~C~~C~~~F~~~~~l~~H~~~h~   25 (27)
T PF13912_consen    1 PFECDECGKTFSSLSALREHKRSHC   25 (27)
T ss_dssp             SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred             CCCCCccCCccCChhHHHHHhHHhc
Confidence            467777777777 7777777 3443


No 33 
>PF13894 zf-C2H2_4:  C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=95.87  E-value=0.0037  Score=26.91  Aligned_cols=21  Identities=19%  Similarity=0.458  Sum_probs=13.4

Q ss_pred             ccCccchhccch-HHHHhhHhh
Q 032068           61 VACSQCSETMER-EILAIHKGE   81 (148)
Q Consensus        61 ~~C~~C~~~~~~-~~l~~h~~~   81 (148)
                      |.|+.|+..|.. ..|..|+..
T Consensus         1 ~~C~~C~~~~~~~~~l~~H~~~   22 (24)
T PF13894_consen    1 FQCPICGKSFRSKSELRQHMRT   22 (24)
T ss_dssp             EE-SSTS-EESSHHHHHHHHHH
T ss_pred             CCCcCCCCcCCcHHHHHHHHHh
Confidence            457777777766 777777754


No 34 
>PRK04860 hypothetical protein; Provisional
Probab=95.80  E-value=0.0051  Score=40.46  Aligned_cols=36  Identities=22%  Similarity=0.445  Sum_probs=30.5

Q ss_pred             CccCccchhccchHHHHhhHhhcCCCCcccCCCCCCCCC
Q 032068           60 PVACSQCSETMEREILAIHKGENCPQRIVTCDFCEFPLP   98 (148)
Q Consensus        60 ~~~C~~C~~~~~~~~l~~h~~~~~~~~~~~C~~C~~~f~   98 (148)
                      +|.|. |+.  ....+..|.+++.++++|.|..|+..|.
T Consensus       119 ~Y~C~-C~~--~~~~~rrH~ri~~g~~~YrC~~C~~~l~  154 (160)
T PRK04860        119 PYRCK-CQE--HQLTVRRHNRVVRGEAVYRCRRCGETLV  154 (160)
T ss_pred             EEEcC-CCC--eeCHHHHHHHHhcCCccEECCCCCceeE
Confidence            78997 886  2366788999999999999999998876


No 35 
>smart00355 ZnF_C2H2 zinc finger.
Probab=95.69  E-value=0.011  Score=25.73  Aligned_cols=20  Identities=25%  Similarity=0.353  Sum_probs=16.3

Q ss_pred             ccCCCCCCCCC-hhhHHHHHH
Q 032068           88 VTCDFCEFPLP-AVDLAEHQV  107 (148)
Q Consensus        88 ~~C~~C~~~f~-~~~l~~H~~  107 (148)
                      |.|+.|++.|. ...|..|++
T Consensus         1 ~~C~~C~~~f~~~~~l~~H~~   21 (26)
T smart00355        1 YRCPECGKVFKSKSALKEHMR   21 (26)
T ss_pred             CCCCCCcchhCCHHHHHHHHH
Confidence            46888888888 888888876


No 36 
>PF12756 zf-C2H2_2:  C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=95.68  E-value=0.0072  Score=36.01  Aligned_cols=16  Identities=25%  Similarity=0.694  Sum_probs=0.0

Q ss_pred             CCCCCCCCC-cchHHHH
Q 032068           12 CSHCDRAIP-SSNIDLH   27 (148)
Q Consensus        12 C~~C~~~~~-~~~l~~H   27 (148)
                      |..|+..|. ...|..|
T Consensus         2 C~~C~~~f~~~~~l~~H   18 (100)
T PF12756_consen    2 CLFCDESFSSVDDLLQH   18 (100)
T ss_dssp             -----------------
T ss_pred             ccccccccccccccccc
Confidence            555555555 5555555


No 37 
>PF02176 zf-TRAF:  TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=95.58  E-value=0.003  Score=34.30  Aligned_cols=46  Identities=41%  Similarity=0.902  Sum_probs=24.8

Q ss_pred             CccCcc--chhccchHHHHhhHhhcCCCCcccCCC----CCCCCChhhHHHH
Q 032068           60 PVACSQ--CSETMEREILAIHKGENCPQRIVTCDF----CEFPLPAVDLAEH  105 (148)
Q Consensus        60 ~~~C~~--C~~~~~~~~l~~h~~~~~~~~~~~C~~----C~~~f~~~~l~~H  105 (148)
                      +..|+.  |...+....|..|....+..++..|++    |+..+.+..|.+|
T Consensus         9 ~v~C~~~cc~~~i~r~~l~~H~~~~C~~~~v~C~~~~~GC~~~~~~~~l~~H   60 (60)
T PF02176_consen    9 PVPCPNGCCNEMIPRKELDDHLENECPKRPVPCPYSPYGCKERVPREDLEEH   60 (60)
T ss_dssp             EEE-TT--S-BEEECCCHHHHHHTTSTTSEEE-SS----S--EEEHHHHHHC
T ss_pred             EeeCCCCCcccceeHHHHHHHHHccCCCCcEECCCCCCCCCCccchhHHhCC
Confidence            445554  334455566666666566666777777    7666666666554


No 38 
>PF02176 zf-TRAF:  TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=95.43  E-value=0.0068  Score=32.87  Aligned_cols=45  Identities=29%  Similarity=0.621  Sum_probs=24.1

Q ss_pred             CCcccCCC--CCCCCCcchHHHH-HhhcCCCCccccc----cCCccChhhHH
Q 032068            7 ETTKICSH--CDRAIPSSNIDLH-FAHCSRNLERCKV----CGDMVPRKYAE   51 (148)
Q Consensus         7 ~~~~~C~~--C~~~~~~~~l~~H-~~~~~~~~~~C~~----C~~~f~~~~l~   51 (148)
                      ..+..|+.  |...+.+..|..| ...++.++..|+.    |...+...++.
T Consensus         7 ~~~v~C~~~cc~~~i~r~~l~~H~~~~C~~~~v~C~~~~~GC~~~~~~~~l~   58 (60)
T PF02176_consen    7 FRPVPCPNGCCNEMIPRKELDDHLENECPKRPVPCPYSPYGCKERVPREDLE   58 (60)
T ss_dssp             TSEEE-TT--S-BEEECCCHHHHHHTTSTTSEEE-SS----S--EEEHHHHH
T ss_pred             CCEeeCCCCCcccceeHHHHHHHHHccCCCCcEECCCCCCCCCCccchhHHh
Confidence            34566655  4444556777777 4577777777776    66655544443


No 39 
>PRK04860 hypothetical protein; Provisional
Probab=94.48  E-value=0.035  Score=36.53  Aligned_cols=35  Identities=20%  Similarity=0.510  Sum_probs=22.5

Q ss_pred             cccCCCCCCCCCcchHHHH-HhhcCCCCccccccCCccC
Q 032068            9 TKICSHCDRAIPSSNIDLH-FAHCSRNLERCKVCGDMVP   46 (148)
Q Consensus         9 ~~~C~~C~~~~~~~~l~~H-~~~~~~~~~~C~~C~~~f~   46 (148)
                      +|.|. |+.  ....+..| +.+.+++++.|..|+..|.
T Consensus       119 ~Y~C~-C~~--~~~~~rrH~ri~~g~~~YrC~~C~~~l~  154 (160)
T PRK04860        119 PYRCK-CQE--HQLTVRRHNRVVRGEAVYRCRRCGETLV  154 (160)
T ss_pred             EEEcC-CCC--eeCHHHHHHHHhcCCccEECCCCCceeE
Confidence            46676 665  24556667 5777777777777776654


No 40 
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=94.39  E-value=0.025  Score=40.89  Aligned_cols=15  Identities=13%  Similarity=0.507  Sum_probs=8.1

Q ss_pred             CCCcccCCCCCCCCC
Q 032068           84 PQRIVTCDFCEFPLP   98 (148)
Q Consensus        84 ~~~~~~C~~C~~~f~   98 (148)
                      ..+||.|+.|++++.
T Consensus       395 ~~KPYrCevC~KRYK  409 (423)
T COG5189         395 KDKPYRCEVCDKRYK  409 (423)
T ss_pred             cCCceeccccchhhc
Confidence            345555555555554


No 41 
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=94.37  E-value=0.0099  Score=42.90  Aligned_cols=64  Identities=16%  Similarity=0.399  Sum_probs=42.5

Q ss_pred             CCcccC--CCCCCCCC-cchHHHH-H-hhcCCCCccccc--cCCccChhhHHHHHHcCCCCccCccchhccch-HHHHhh
Q 032068            7 ETTKIC--SHCDRAIP-SSNIDLH-F-AHCSRNLERCKV--CGDMVPRKYAEEHFLNTHAPVACSQCSETMER-EILAIH   78 (148)
Q Consensus         7 ~~~~~C--~~C~~~~~-~~~l~~H-~-~~~~~~~~~C~~--C~~~f~~~~l~~h~~~~h~~~~C~~C~~~~~~-~~l~~h   78 (148)
                      +|||.|  +.|.+++. ...|+-| + .|+..+...-+.  -...|         ....+||.|..|++.+.. ..|+=|
T Consensus       347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F---------~~~~KPYrCevC~KRYKNlNGLKYH  417 (423)
T COG5189         347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIF---------SAKDKPYRCEVCDKRYKNLNGLKYH  417 (423)
T ss_pred             CceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccc---------cccCCceeccccchhhccCccceec
Confidence            589999  45999998 8888888 4 666655544331  11111         111269999999999977 666655


Q ss_pred             H
Q 032068           79 K   79 (148)
Q Consensus        79 ~   79 (148)
                      .
T Consensus       418 r  418 (423)
T COG5189         418 R  418 (423)
T ss_pred             c
Confidence            4


No 42 
>PF09237 GAGA:  GAGA factor;  InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=94.21  E-value=0.031  Score=29.17  Aligned_cols=21  Identities=33%  Similarity=0.615  Sum_probs=9.3

Q ss_pred             CCcccCCCCCCCCC-cchHHHH
Q 032068            7 ETTKICSHCDRAIP-SSNIDLH   27 (148)
Q Consensus         7 ~~~~~C~~C~~~~~-~~~l~~H   27 (148)
                      +.|..|+.|+..+. ..+|..|
T Consensus        22 ~~PatCP~C~a~~~~srnLrRH   43 (54)
T PF09237_consen   22 EQPATCPICGAVIRQSRNLRRH   43 (54)
T ss_dssp             S--EE-TTT--EESSHHHHHHH
T ss_pred             CCCCCCCcchhhccchhhHHHH
Confidence            34555666666555 5556655


No 43 
>KOG2186 consensus Cell growth-regulating nucleolar protein [Cell cycle control, cell division, chromosome partitioning]
Probab=93.90  E-value=0.041  Score=38.49  Aligned_cols=47  Identities=19%  Similarity=0.484  Sum_probs=40.1

Q ss_pred             cccCCCCCCCCCcchHHHHHhhcCCCCccccccCCccChhhHHHHHH
Q 032068            9 TKICSHCDRAIPSSNIDLHFAHCSRNLERCKVCGDMVPRKYAEEHFL   55 (148)
Q Consensus         9 ~~~C~~C~~~~~~~~l~~H~~~~~~~~~~C~~C~~~f~~~~l~~h~~   55 (148)
                      -|.|..||.......+..|+.-|...-|.|..|+..|-..+...|..
T Consensus         3 ~FtCnvCgEsvKKp~vekH~srCrn~~fSCIDC~k~F~~~sYknH~k   49 (276)
T KOG2186|consen    3 FFTCNVCGESVKKPQVEKHMSRCRNAYFSCIDCGKTFERVSYKNHTK   49 (276)
T ss_pred             EEehhhhhhhccccchHHHHHhccCCeeEEeecccccccchhhhhhh
Confidence            37899999999988899998778778899999999998877777764


No 44 
>PF09237 GAGA:  GAGA factor;  InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=93.79  E-value=0.04  Score=28.75  Aligned_cols=28  Identities=18%  Similarity=0.363  Sum_probs=14.6

Q ss_pred             CccCccchhccch-HHHHhhHhhcCCCCc
Q 032068           60 PVACSQCSETMER-EILAIHKGENCPQRI   87 (148)
Q Consensus        60 ~~~C~~C~~~~~~-~~l~~h~~~~~~~~~   87 (148)
                      |-.|+.|+..+.. .+|+.|+++..+.+|
T Consensus        24 PatCP~C~a~~~~srnLrRHle~~H~~k~   52 (54)
T PF09237_consen   24 PATCPICGAVIRQSRNLRRHLEIRHFKKP   52 (54)
T ss_dssp             -EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred             CCCCCcchhhccchhhHHHHHHHHhcccC
Confidence            5666777666665 666666665555544


No 45 
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=93.38  E-value=0.1  Score=28.07  Aligned_cols=10  Identities=30%  Similarity=0.806  Sum_probs=4.9

Q ss_pred             CccccccCCc
Q 032068           35 LERCKVCGDM   44 (148)
Q Consensus        35 ~~~C~~C~~~   44 (148)
                      .|.|+.|++.
T Consensus        27 ~F~CPnCGe~   36 (61)
T COG2888          27 KFPCPNCGEV   36 (61)
T ss_pred             EeeCCCCCce
Confidence            3455555543


No 46 
>PF12756 zf-C2H2_2:  C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=93.36  E-value=0.043  Score=32.57  Aligned_cols=71  Identities=14%  Similarity=0.237  Sum_probs=20.0

Q ss_pred             cccccCCccChh-hHHHHHHcCCCCccCccchhccchHHHHhhHhhcCCCCcccCCCCCCCCC-hhhHHHHHHHh
Q 032068           37 RCKVCGDMVPRK-YAEEHFLNTHAPVACSQCSETMEREILAIHKGENCPQRIVTCDFCEFPLP-AVDLAEHQVAI  109 (148)
Q Consensus        37 ~C~~C~~~f~~~-~l~~h~~~~h~~~~C~~C~~~~~~~~l~~h~~~~~~~~~~~C~~C~~~f~-~~~l~~H~~~c  109 (148)
                      +|..|+..|... .+..|+...|. +..+..........+..+.+. .-...+.|..|++.|. ...|..|++.-
T Consensus         1 ~C~~C~~~f~~~~~l~~H~~~~H~-~~~~~~~~l~~~~~~~~~~~~-~~~~~~~C~~C~~~f~s~~~l~~Hm~~~   73 (100)
T PF12756_consen    1 QCLFCDESFSSVDDLLQHMKKKHG-FDIPDQKYLVDPNRLLNYLRK-KVKESFRCPYCNKTFRSREALQEHMRSK   73 (100)
T ss_dssp             --------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred             Cccccccccccccccccccccccc-ccccccccccccccccccccc-ccCCCCCCCccCCCCcCHHHHHHHHcCc
Confidence            488899998877 56677755553 111111111111333333322 2223689999999999 99999999863


No 47 
>PF12874 zf-met:  Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=92.90  E-value=0.036  Score=24.20  Aligned_cols=20  Identities=25%  Similarity=0.552  Sum_probs=12.0

Q ss_pred             ccCCCCCCCCC-hhhHHHHHH
Q 032068           88 VTCDFCEFPLP-AVDLAEHQV  107 (148)
Q Consensus        88 ~~C~~C~~~f~-~~~l~~H~~  107 (148)
                      |.|..|+..|. ...+..|++
T Consensus         1 ~~C~~C~~~f~s~~~~~~H~~   21 (25)
T PF12874_consen    1 FYCDICNKSFSSENSLRQHLR   21 (25)
T ss_dssp             EEETTTTEEESSHHHHHHHHT
T ss_pred             CCCCCCCCCcCCHHHHHHHHC
Confidence            34666666666 666666653


No 48 
>PF12171 zf-C2H2_jaz:  Zinc-finger double-stranded RNA-binding;  InterPro: IPR022755  This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation.   This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=92.83  E-value=0.061  Score=24.10  Aligned_cols=20  Identities=20%  Similarity=0.448  Sum_probs=13.7

Q ss_pred             ccCccchhccch-HHHHhhHh
Q 032068           61 VACSQCSETMER-EILAIHKG   80 (148)
Q Consensus        61 ~~C~~C~~~~~~-~~l~~h~~   80 (148)
                      |.|..|++.|.. ..+..|+.
T Consensus         2 ~~C~~C~k~f~~~~~~~~H~~   22 (27)
T PF12171_consen    2 FYCDACDKYFSSENQLKQHMK   22 (27)
T ss_dssp             CBBTTTTBBBSSHHHHHCCTT
T ss_pred             CCcccCCCCcCCHHHHHHHHc
Confidence            567777777766 67777664


No 49 
>PF13909 zf-H2C2_5:  C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=91.58  E-value=0.15  Score=21.94  Aligned_cols=21  Identities=19%  Similarity=0.523  Sum_probs=13.1

Q ss_pred             ccCCCCCCCCChhhHHHHHHH
Q 032068           88 VTCDFCEFPLPAVDLAEHQVA  108 (148)
Q Consensus        88 ~~C~~C~~~f~~~~l~~H~~~  108 (148)
                      |.|+.|+.......|..|++.
T Consensus         1 y~C~~C~y~t~~~~l~~H~~~   21 (24)
T PF13909_consen    1 YKCPHCSYSTSKSNLKRHLKR   21 (24)
T ss_dssp             EE-SSSS-EESHHHHHHHHHH
T ss_pred             CCCCCCCCcCCHHHHHHHHHh
Confidence            568888866555677887654


No 50 
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=91.17  E-value=0.29  Score=21.83  Aligned_cols=21  Identities=19%  Similarity=0.354  Sum_probs=14.9

Q ss_pred             cCCCCCCCCChhhHHHHHHHh
Q 032068           89 TCDFCEFPLPAVDLAEHQVAI  109 (148)
Q Consensus        89 ~C~~C~~~f~~~~l~~H~~~c  109 (148)
                      .||.|++.+....+..|...|
T Consensus         3 ~CPiC~~~v~~~~in~HLD~C   23 (26)
T smart00734        3 QCPVCFREVPENLINSHLDSC   23 (26)
T ss_pred             cCCCCcCcccHHHHHHHHHHh
Confidence            577777777666777777655


No 51 
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=90.98  E-value=0.33  Score=26.21  Aligned_cols=10  Identities=40%  Similarity=0.810  Sum_probs=5.7

Q ss_pred             CcccCCCCCC
Q 032068           86 RIVTCDFCEF   95 (148)
Q Consensus        86 ~~~~C~~C~~   95 (148)
                      .+|.|+.||+
T Consensus        47 ~~Y~CP~CGF   56 (59)
T PRK14890         47 NPYTCPKCGF   56 (59)
T ss_pred             CceECCCCCC
Confidence            4556666654


No 52 
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=88.97  E-value=0.32  Score=22.92  Aligned_cols=21  Identities=14%  Similarity=0.485  Sum_probs=15.9

Q ss_pred             cccCCCCCCCCC-hhhHHHHHH
Q 032068           87 IVTCDFCEFPLP-AVDLAEHQV  107 (148)
Q Consensus        87 ~~~C~~C~~~f~-~~~l~~H~~  107 (148)
                      +|.|+.|+..|. ...+..|..
T Consensus         3 ~~~C~~C~~~~~~~~~~~~H~~   24 (35)
T smart00451        3 GFYCKLCNVTFTDEISVEAHLK   24 (35)
T ss_pred             CeEccccCCccCCHHHHHHHHC
Confidence            467888888888 777777764


No 53 
>KOG2186 consensus Cell growth-regulating nucleolar protein [Cell cycle control, cell division, chromosome partitioning]
Probab=88.65  E-value=0.25  Score=34.75  Aligned_cols=46  Identities=28%  Similarity=0.573  Sum_probs=34.0

Q ss_pred             CccccccCCccChhhHHHHHHcCCC-CccCccchhccchHHHHhhHh
Q 032068           35 LERCKVCGDMVPRKYAEEHFLNTHA-PVACSQCSETMEREILAIHKG   80 (148)
Q Consensus        35 ~~~C~~C~~~f~~~~l~~h~~~~h~-~~~C~~C~~~~~~~~l~~h~~   80 (148)
                      .|.|..|+.......+..|..+++. .|.|..|++.|.+..+..|..
T Consensus         3 ~FtCnvCgEsvKKp~vekH~srCrn~~fSCIDC~k~F~~~sYknH~k   49 (276)
T KOG2186|consen    3 FFTCNVCGESVKKPQVEKHMSRCRNAYFSCIDCGKTFERVSYKNHTK   49 (276)
T ss_pred             EEehhhhhhhccccchHHHHHhccCCeeEEeecccccccchhhhhhh
Confidence            3678888888777767667766653 778888888887766777764


No 54 
>PF08209 Sgf11:  Sgf11 (transcriptional regulation protein);  InterPro: IPR013246 The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae (Baker's yeast). The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation [].; PDB: 3M99_B 2LO2_A 3MHH_C 3MHS_C.
Probab=87.77  E-value=0.4  Score=22.74  Aligned_cols=25  Identities=28%  Similarity=0.474  Sum_probs=18.4

Q ss_pred             cccCCCCCCCCChhhHHHHHHHhcc
Q 032068           87 IVTCDFCEFPLPAVDLAEHQVAIVN  111 (148)
Q Consensus        87 ~~~C~~C~~~f~~~~l~~H~~~c~~  111 (148)
                      .+.|+.|++.+..+-+..|+..|..
T Consensus         4 ~~~C~nC~R~v~a~RfA~HLekCmg   28 (33)
T PF08209_consen    4 YVECPNCGRPVAASRFAPHLEKCMG   28 (33)
T ss_dssp             EEE-TTTSSEEEGGGHHHHHHHHTC
T ss_pred             eEECCCCcCCcchhhhHHHHHHHHc
Confidence            4578888888877788888888743


No 55 
>PF10571 UPF0547:  Uncharacterised protein family UPF0547;  InterPro: IPR018886  This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases. 
Probab=86.38  E-value=0.68  Score=20.65  Aligned_cols=9  Identities=33%  Similarity=0.670  Sum_probs=5.3

Q ss_pred             cccccCCcc
Q 032068           37 RCKVCGDMV   45 (148)
Q Consensus        37 ~C~~C~~~f   45 (148)
                      .|+.||..|
T Consensus        16 ~Cp~CG~~F   24 (26)
T PF10571_consen   16 FCPHCGYDF   24 (26)
T ss_pred             cCCCCCCCC
Confidence            466666555


No 56 
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=81.57  E-value=0.95  Score=23.03  Aligned_cols=29  Identities=24%  Similarity=0.566  Sum_probs=16.3

Q ss_pred             cccCCCCCCCCC-cchHHHHHhhcCCCCccccccCCccC
Q 032068            9 TKICSHCDRAIP-SSNIDLHFAHCSRNLERCKVCGDMVP   46 (148)
Q Consensus         9 ~~~C~~C~~~~~-~~~l~~H~~~~~~~~~~C~~C~~~f~   46 (148)
                      .|.|+.|+..+. ...         .....|+.|+..+.
T Consensus         3 ~y~C~~CG~~~~~~~~---------~~~~~Cp~CG~~~~   32 (46)
T PRK00398          3 EYKCARCGREVELDEY---------GTGVRCPYCGYRIL   32 (46)
T ss_pred             EEECCCCCCEEEECCC---------CCceECCCCCCeEE
Confidence            467777777655 221         11456777765543


No 57 
>PF09538 FYDLN_acid:  Protein of unknown function (FYDLN_acid);  InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=81.49  E-value=1.2  Score=27.33  Aligned_cols=13  Identities=23%  Similarity=0.925  Sum_probs=7.1

Q ss_pred             CccCccchhccch
Q 032068           60 PVACSQCSETMER   72 (148)
Q Consensus        60 ~~~C~~C~~~~~~   72 (148)
                      |..|++||..|..
T Consensus        26 PivCP~CG~~~~~   38 (108)
T PF09538_consen   26 PIVCPKCGTEFPP   38 (108)
T ss_pred             CccCCCCCCccCc
Confidence            5555555555543


No 58 
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer.  Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain.  Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=81.03  E-value=1.2  Score=20.94  Aligned_cols=23  Identities=26%  Similarity=0.611  Sum_probs=12.9

Q ss_pred             ccCCCCCCCCC-cchHHHHHhhcCCCCccccccCC
Q 032068           10 KICSHCDRAIP-SSNIDLHFAHCSRNLERCKVCGD   43 (148)
Q Consensus        10 ~~C~~C~~~~~-~~~l~~H~~~~~~~~~~C~~C~~   43 (148)
                      |.|..||..+. ..           .+..||.|+.
T Consensus         2 ~~C~~CGy~y~~~~-----------~~~~CP~Cg~   25 (33)
T cd00350           2 YVCPVCGYIYDGEE-----------APWVCPVCGA   25 (33)
T ss_pred             EECCCCCCEECCCc-----------CCCcCcCCCC
Confidence            55666666543 21           3556666664


No 59 
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=80.98  E-value=0.83  Score=22.11  Aligned_cols=11  Identities=18%  Similarity=0.670  Sum_probs=5.7

Q ss_pred             ccCCCCCCCCC
Q 032068           10 KICSHCDRAIP   20 (148)
Q Consensus        10 ~~C~~C~~~~~   20 (148)
                      +.|+.|+..|.
T Consensus         3 ~~CP~C~~~~~   13 (38)
T TIGR02098         3 IQCPNCKTSFR   13 (38)
T ss_pred             EECCCCCCEEE
Confidence            34555555544


No 60 
>PF13717 zinc_ribbon_4:  zinc-ribbon domain
Probab=80.27  E-value=0.47  Score=22.93  Aligned_cols=6  Identities=33%  Similarity=1.065  Sum_probs=2.3

Q ss_pred             cCCCCC
Q 032068           89 TCDFCE   94 (148)
Q Consensus        89 ~C~~C~   94 (148)
                      .|+.|+
T Consensus        27 ~C~~C~   32 (36)
T PF13717_consen   27 RCSKCG   32 (36)
T ss_pred             ECCCCC
Confidence            333333


No 61 
>PF06524 NOA36:  NOA36 protein;  InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=79.78  E-value=1.9  Score=30.67  Aligned_cols=73  Identities=16%  Similarity=0.352  Sum_probs=42.5

Q ss_pred             cCCCCccccccCCccChhhHHHHHHcCC----CCccCccchhccchHHH-------HhhHhh----cCCCCcccCCCCCC
Q 032068           31 CSRNLERCKVCGDMVPRKYAEEHFLNTH----APVACSQCSETMEREIL-------AIHKGE----NCPQRIVTCDFCEF   95 (148)
Q Consensus        31 ~~~~~~~C~~C~~~f~~~~l~~h~~~~h----~~~~C~~C~~~~~~~~l-------~~h~~~----~~~~~~~~C~~C~~   95 (148)
                      .+.+.|.|..|.....-.+...|+..++    ..|.|..|.+.-..+-|       ..|.+.    ....+++.||.|+.
T Consensus       138 hGGrif~CsfC~~flCEDDQFEHQAsCQvLe~E~~KC~SCNrlGq~sCLRCK~cfCddHvrrKg~ky~k~k~~PCPKCg~  217 (314)
T PF06524_consen  138 HGGRIFKCSFCDNFLCEDDQFEHQASCQVLESETFKCQSCNRLGQYSCLRCKICFCDDHVRRKGFKYEKGKPIPCPKCGY  217 (314)
T ss_pred             CCCeEEEeecCCCeeeccchhhhhhhhhhhhcccccccccccccchhhhheeeeehhhhhhhcccccccCCCCCCCCCCC
Confidence            3567788888865444345666765543    26777777654332322       244432    22347889999997


Q ss_pred             CCC-hhhHH
Q 032068           96 PLP-AVDLA  103 (148)
Q Consensus        96 ~f~-~~~l~  103 (148)
                      ... ...|.
T Consensus       218 et~eTkdLS  226 (314)
T PF06524_consen  218 ETQETKDLS  226 (314)
T ss_pred             cccccccce
Confidence            755 44443


No 62 
>KOG2231 consensus Predicted E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones]
Probab=78.55  E-value=3.6  Score=33.31  Aligned_cols=20  Identities=25%  Similarity=0.360  Sum_probs=10.1

Q ss_pred             ccCccchhccch-HHHHhhHh
Q 032068           61 VACSQCSETMER-EILAIHKG   80 (148)
Q Consensus        61 ~~C~~C~~~~~~-~~l~~h~~   80 (148)
                      -.|+.|...|.. ..|..|.+
T Consensus       183 p~C~~C~~~fld~~el~rH~~  203 (669)
T KOG2231|consen  183 PLCKFCHERFLDDDELYRHLR  203 (669)
T ss_pred             ccchhhhhhhccHHHHHHhhc
Confidence            345555555544 45555544


No 63 
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=77.20  E-value=1.4  Score=29.16  Aligned_cols=23  Identities=17%  Similarity=0.430  Sum_probs=17.9

Q ss_pred             CccCccchhccchHHHHhhHhhcCCCCcccCCCCC
Q 032068           60 PVACSQCSETMEREILAIHKGENCPQRIVTCDFCE   94 (148)
Q Consensus        60 ~~~C~~C~~~~~~~~l~~h~~~~~~~~~~~C~~C~   94 (148)
                      .|.|+.||            -++.|+.|-.||.||
T Consensus       134 ~~vC~vCG------------y~~~ge~P~~CPiCg  156 (166)
T COG1592         134 VWVCPVCG------------YTHEGEAPEVCPICG  156 (166)
T ss_pred             EEEcCCCC------------CcccCCCCCcCCCCC
Confidence            48899987            245667888999998


No 64 
>PF13451 zf-trcl:  Probable zinc-binding domain
Probab=76.67  E-value=1.5  Score=22.85  Aligned_cols=14  Identities=21%  Similarity=0.591  Sum_probs=9.4

Q ss_pred             CCcccCCCCCCCCC
Q 032068            7 ETTKICSHCDRAIP   20 (148)
Q Consensus         7 ~~~~~C~~C~~~~~   20 (148)
                      ++++.|..||..|.
T Consensus         2 Dk~l~C~dCg~~Fv   15 (49)
T PF13451_consen    2 DKTLTCKDCGAEFV   15 (49)
T ss_pred             CeeEEcccCCCeEE
Confidence            45667777777766


No 65 
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and  believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=76.15  E-value=2.1  Score=20.30  Aligned_cols=25  Identities=24%  Similarity=0.698  Sum_probs=13.7

Q ss_pred             cccCCCCCCCCCcchHHHHHhhcCCCCccccccCC
Q 032068            9 TKICSHCDRAIPSSNIDLHFAHCSRNLERCKVCGD   43 (148)
Q Consensus         9 ~~~C~~C~~~~~~~~l~~H~~~~~~~~~~C~~C~~   43 (148)
                      .|.|..||..+..          ...|..|+.|+.
T Consensus         2 ~~~C~~CG~i~~g----------~~~p~~CP~Cg~   26 (34)
T cd00729           2 VWVCPVCGYIHEG----------EEAPEKCPICGA   26 (34)
T ss_pred             eEECCCCCCEeEC----------CcCCCcCcCCCC
Confidence            3667777765430          112456777764


No 66 
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=76.10  E-value=1.3  Score=36.21  Aligned_cols=12  Identities=17%  Similarity=0.254  Sum_probs=7.5

Q ss_pred             CCcccCCCCCCC
Q 032068           85 QRIVTCDFCEFP   96 (148)
Q Consensus        85 ~~~~~C~~C~~~   96 (148)
                      ..|..|+.||..
T Consensus       473 ~~p~~Cp~Cgs~  484 (730)
T COG1198         473 PIPQSCPECGSE  484 (730)
T ss_pred             CCCCCCCCCCCC
Confidence            356677777754


No 67 
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=75.20  E-value=5.5  Score=24.63  Aligned_cols=19  Identities=16%  Similarity=0.343  Sum_probs=10.5

Q ss_pred             CcccCCCCCCCCC-cchHHH
Q 032068            8 TTKICSHCDRAIP-SSNIDL   26 (148)
Q Consensus         8 ~~~~C~~C~~~~~-~~~l~~   26 (148)
                      -|..|+.|+-... ..+|.+
T Consensus        14 LP~~CpiCgLtLVss~HLAR   33 (112)
T TIGR00622        14 LPVECPICGLTLILSTHLAR   33 (112)
T ss_pred             CCCcCCcCCCEEeccchHHH
Confidence            4556666666555 444444


No 68 
>PF09723 Zn-ribbon_8:  Zinc ribbon domain;  InterPro: IPR013429  This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=75.12  E-value=1.6  Score=21.76  Aligned_cols=11  Identities=18%  Similarity=0.616  Sum_probs=7.1

Q ss_pred             ccCCCCCCCCC
Q 032068           10 KICSHCDRAIP   20 (148)
Q Consensus        10 ~~C~~C~~~~~   20 (148)
                      |.|..|+..|-
T Consensus         6 y~C~~Cg~~fe   16 (42)
T PF09723_consen    6 YRCEECGHEFE   16 (42)
T ss_pred             EEeCCCCCEEE
Confidence            66666666655


No 69 
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=74.22  E-value=1.3  Score=23.09  Aligned_cols=11  Identities=18%  Similarity=0.600  Sum_probs=6.8

Q ss_pred             ccCCCCCCCCC
Q 032068           10 KICSHCDRAIP   20 (148)
Q Consensus        10 ~~C~~C~~~~~   20 (148)
                      |.|..|+..|.
T Consensus         6 y~C~~Cg~~fe   16 (52)
T TIGR02605         6 YRCTACGHRFE   16 (52)
T ss_pred             EEeCCCCCEeE
Confidence            56666666554


No 70 
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=74.07  E-value=2.3  Score=21.53  Aligned_cols=27  Identities=33%  Similarity=0.817  Sum_probs=14.8

Q ss_pred             ccCCCCCCCCCcchHHHHHhhcCCCCccccccCCcc
Q 032068           10 KICSHCDRAIPSSNIDLHFAHCSRNLERCKVCGDMV   45 (148)
Q Consensus        10 ~~C~~C~~~~~~~~l~~H~~~~~~~~~~C~~C~~~f   45 (148)
                      |.|..|+..+...         ...+..|+.|+...
T Consensus         3 Y~C~~Cg~~~~~~---------~~~~irC~~CG~rI   29 (44)
T smart00659        3 YICGECGRENEIK---------SKDVVRCRECGYRI   29 (44)
T ss_pred             EECCCCCCEeecC---------CCCceECCCCCceE
Confidence            6677777765511         12355666666443


No 71 
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=73.98  E-value=2.3  Score=27.93  Aligned_cols=28  Identities=18%  Similarity=0.243  Sum_probs=19.6

Q ss_pred             CccCccchhccch-HHHHhhHhhcCCCCcccCCCCCCC
Q 032068           60 PVACSQCSETMER-EILAIHKGENCPQRIVTCDFCEFP   96 (148)
Q Consensus        60 ~~~C~~C~~~~~~-~~l~~h~~~~~~~~~~~C~~C~~~   96 (148)
                      -|.|+.|+..|+. ..+.         .-|.||.||..
T Consensus       109 ~Y~Cp~c~~r~tf~eA~~---------~~F~Cp~Cg~~  137 (158)
T TIGR00373       109 FFICPNMCVRFTFNEAME---------LNFTCPRCGAM  137 (158)
T ss_pred             eEECCCCCcEeeHHHHHH---------cCCcCCCCCCE
Confidence            6678888877777 5543         25788888865


No 72 
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=73.70  E-value=1.7  Score=23.11  Aligned_cols=32  Identities=16%  Similarity=0.191  Sum_probs=23.4

Q ss_pred             cCCCCcccCCCCCCCCC-hhhHHHHHHH-hcccc
Q 032068           82 NCPQRIVTCDFCEFPLP-AVDLAEHQVA-IVNNF  113 (148)
Q Consensus        82 ~~~~~~~~C~~C~~~f~-~~~l~~H~~~-c~~~~  113 (148)
                      --||.-+.||-|+..|. +.....|... .+..|
T Consensus        12 RDGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~~~~   45 (65)
T COG4049          12 RDGEEFLRCPRCGMVFRRRKDYIRHVNKAHGWLF   45 (65)
T ss_pred             cCCceeeeCCchhHHHHHhHHHHHHhhHHhhhhh
Confidence            34677789999999999 8888888754 34444


No 73 
>PF03145 Sina:  Seven in absentia protein family;  InterPro: IPR018121 The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results in the transformation of the R7 precursor cell to a non-neuronal cell type. The Sina protein contains an N-terminal RING finger domain C3HC4-type. Through this domain, Sina binds E2 ubiquitin-conjugating enzymes (UbcD1) Sina also interacts with Tramtrack (TTK88) via PHYL. Tramtrack is a transcriptional repressor that blocks photoreceptor determination, while PHYL down-regulates the activity of TTK88. In turn, the activity of PHYL requires the activation of the Sevenless receptor tyrosine kinase, a process essential for R7 determination. It is thought that Sina targets TTK88 for degradation, therefore promoting the R7 pathway. Murine and human homologues of Sina have also been identified. The human homologue Siah-1 [] also binds E2 enzymes (UbcH5) and through a series of physical interactions, targets beta-catenin for ubiquitin degradation. Siah-1 expression is enhanced by p53, itself promoted by DNA damage. Thus this pathway links DNA damage to beta-catenin degradation [, ]. Sina proteins, therefore, physically interact with a variety of proteins. The N-terminal RING finger domain that binds ubiquitin conjugating enzymes is a C3HC4-type, and does not form part of the alignment for this family. The remainder C-terminal part is involved in interactions with other proteins, and is included in this alignment. In addition to the Drosophila protein and mammalian homologues, whose similarity was noted previously, this family also includes putative homologues from Caenorhabditis elegans, Arabidopsis thaliana.; GO: 0006511 ubiquitin-dependent protein catabolic process, 0007275 multicellular organismal development, 0005634 nucleus; PDB: 2A25_A 1K2F_B 2AN6_C.
Probab=73.58  E-value=1.8  Score=29.39  Aligned_cols=46  Identities=22%  Similarity=0.348  Sum_probs=23.9

Q ss_pred             CCCCCCCcchHHHHHhhcCCCCccccc----cCCccChhhHHHHHHcCCC
Q 032068           14 HCDRAIPSSNIDLHFAHCSRNLERCKV----CGDMVPRKYAEEHFLNTHA   59 (148)
Q Consensus        14 ~C~~~~~~~~l~~H~~~~~~~~~~C~~----C~~~f~~~~l~~h~~~~h~   59 (148)
                      .|...++......|+..|..+|+.||.    |+..-....+..|....|.
T Consensus        23 GC~~~~~~~~~~~HE~~C~~~p~~CP~~~~~C~~~G~~~~l~~Hl~~~H~   72 (198)
T PF03145_consen   23 GCTETFPYSEKREHEEECPFRPCSCPFPGSGCDWQGSYKELLDHLRDKHS   72 (198)
T ss_dssp             T---EE-GGGHHHHHHT-TTSEEE-SSSSTT---EEECCCHHHHHHHHTT
T ss_pred             CCcccccccChhhHhccCCCcCCcCCCCCCCccccCCHHHHHHHHHHHCC
Confidence            377776666777787777777888876    4333333366667655553


No 74 
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=73.11  E-value=1.5  Score=21.36  Aligned_cols=11  Identities=18%  Similarity=0.634  Sum_probs=6.8

Q ss_pred             ccCCCCCCCCC
Q 032068           10 KICSHCDRAIP   20 (148)
Q Consensus        10 ~~C~~C~~~~~   20 (148)
                      |.|..|+..|.
T Consensus         6 y~C~~Cg~~fe   16 (41)
T smart00834        6 YRCEDCGHTFE   16 (41)
T ss_pred             EEcCCCCCEEE
Confidence            56666666554


No 75 
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=71.38  E-value=2.8  Score=28.19  Aligned_cols=29  Identities=21%  Similarity=0.336  Sum_probs=21.2

Q ss_pred             CccCccchhccch-HHHHhhHhhcCCCCcccCCCCCCCC
Q 032068           60 PVACSQCSETMER-EILAIHKGENCPQRIVTCDFCEFPL   97 (148)
Q Consensus        60 ~~~C~~C~~~~~~-~~l~~h~~~~~~~~~~~C~~C~~~f   97 (148)
                      -|.|+.|+..|+. ..+.         .-|.|+.||...
T Consensus       117 ~Y~Cp~C~~rytf~eA~~---------~~F~Cp~Cg~~L  146 (178)
T PRK06266        117 FFFCPNCHIRFTFDEAME---------YGFRCPQCGEML  146 (178)
T ss_pred             EEECCCCCcEEeHHHHhh---------cCCcCCCCCCCC
Confidence            6788888888877 5442         357889888664


No 76 
>KOG0297 consensus TNF receptor-associated factor [Signal transduction mechanisms]
Probab=71.25  E-value=7.2  Score=29.67  Aligned_cols=80  Identities=26%  Similarity=0.582  Sum_probs=50.5

Q ss_pred             CCCCCCCCcchHHHHHhhcCCCCccccc-cCCccChhhHHHHHHc-C-CCCccCccchhccchHHHHhhHhhcCCCCccc
Q 032068           13 SHCDRAIPSSNIDLHFAHCSRNLERCKV-CGDMVPRKYAEEHFLN-T-HAPVACSQCSETMEREILAIHKGENCPQRIVT   89 (148)
Q Consensus        13 ~~C~~~~~~~~l~~H~~~~~~~~~~C~~-C~~~f~~~~l~~h~~~-~-h~~~~C~~C~~~~~~~~l~~h~~~~~~~~~~~   89 (148)
                      ..|.+......++.|...+  .+.+|+. |+..+...++..|++. + -....|..|+..+....+..|...  +..+..
T Consensus        94 ~GC~~~~~l~~~~~Hl~~c--~~~~C~~~C~~~~~~~d~~~hl~~~C~~~~~~c~~~~~~~~~~~~~~h~~~--~~~~~~  169 (391)
T KOG0297|consen   94 RGCRADLELEALQGHLSTC--DPLKCPHRCGVQVPRDDLEDHLEAECPRRSLKCSLCQSDSILILLEAHEEN--PQAEVS  169 (391)
T ss_pred             CCccccccHHHHHhHhccC--CcccCccccccccchHHHHHHHhcccccccccchhhcCccchhhhhhcCCC--CCcccc
Confidence            3577766667777775444  6777876 8877777777777532 1 136667777766666565666543  455666


Q ss_pred             CCC-CCCC
Q 032068           90 CDF-CEFP   96 (148)
Q Consensus        90 C~~-C~~~   96 (148)
                      |.. |+..
T Consensus       170 c~~k~~~~  177 (391)
T KOG0297|consen  170 CELKCGKQ  177 (391)
T ss_pred             ccccchhh
Confidence            665 6554


No 77 
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=70.73  E-value=3.3  Score=26.84  Aligned_cols=34  Identities=21%  Similarity=0.358  Sum_probs=22.3

Q ss_pred             CccCccchhccch-HHHHhhHhhcCCCCcccCCCCCCCC
Q 032068           60 PVACSQCSETMER-EILAIHKGENCPQRIVTCDFCEFPL   97 (148)
Q Consensus        60 ~~~C~~C~~~~~~-~~l~~h~~~~~~~~~~~C~~C~~~f   97 (148)
                      -|.|+.|+..|.. ..+.. .  . ....|.||.||..+
T Consensus        99 ~Y~Cp~C~~~y~~~ea~~~-~--d-~~~~f~Cp~Cg~~l  133 (147)
T smart00531       99 YYKCPNCQSKYTFLEANQL-L--D-MDGTFTCPRCGEEL  133 (147)
T ss_pred             EEECcCCCCEeeHHHHHHh-c--C-CCCcEECCCCCCEE
Confidence            7789999988876 43322 1  1 13348999998764


No 78 
>KOG1146 consensus Homeobox protein [General function prediction only]
Probab=70.23  E-value=1.8  Score=37.51  Aligned_cols=79  Identities=18%  Similarity=0.273  Sum_probs=43.6

Q ss_pred             hcCCCCccccccCCccChh-hHHHHHHcCCCCccCccchhccch-HHHHhhHhhcCCCCcccCCCCCCCCC-hhhHHHHH
Q 032068           30 HCSRNLERCKVCGDMVPRK-YAEEHFLNTHAPVACSQCSETMER-EILAIHKGENCPQRIVTCDFCEFPLP-AVDLAEHQ  106 (148)
Q Consensus        30 ~~~~~~~~C~~C~~~f~~~-~l~~h~~~~h~~~~C~~C~~~~~~-~~l~~h~~~~~~~~~~~C~~C~~~f~-~~~l~~H~  106 (148)
                      +.-.+.+.|+.|+..+... .+..|.+..|-......|. .+.. ..+..-....++.++|.|..|...+. ...|..|+
T Consensus       460 ~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~-~gq~~~~~arg~~~~~~~~p~~C~~C~~stttng~Lsihl  538 (1406)
T KOG1146|consen  460 HSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCK-AGQNHPRLARGEVYRCPGKPYPCRACNYSTTTNGNLSIHL  538 (1406)
T ss_pred             ecccccccCCccchhhhhHHHhhhcccccccccchhHhH-hccccccccccccccCCCCcccceeeeeeeecchHHHHHH
Confidence            4445677788887777766 4555554422111111111 0000 11111111234557899999999988 88999998


Q ss_pred             HHh
Q 032068          107 VAI  109 (148)
Q Consensus       107 ~~c  109 (148)
                      ..+
T Consensus       539 qS~  541 (1406)
T KOG1146|consen  539 QSD  541 (1406)
T ss_pred             HHH
Confidence            764


No 79 
>PF03145 Sina:  Seven in absentia protein family;  InterPro: IPR018121 The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results in the transformation of the R7 precursor cell to a non-neuronal cell type. The Sina protein contains an N-terminal RING finger domain C3HC4-type. Through this domain, Sina binds E2 ubiquitin-conjugating enzymes (UbcD1) Sina also interacts with Tramtrack (TTK88) via PHYL. Tramtrack is a transcriptional repressor that blocks photoreceptor determination, while PHYL down-regulates the activity of TTK88. In turn, the activity of PHYL requires the activation of the Sevenless receptor tyrosine kinase, a process essential for R7 determination. It is thought that Sina targets TTK88 for degradation, therefore promoting the R7 pathway. Murine and human homologues of Sina have also been identified. The human homologue Siah-1 [] also binds E2 enzymes (UbcH5) and through a series of physical interactions, targets beta-catenin for ubiquitin degradation. Siah-1 expression is enhanced by p53, itself promoted by DNA damage. Thus this pathway links DNA damage to beta-catenin degradation [, ]. Sina proteins, therefore, physically interact with a variety of proteins. The N-terminal RING finger domain that binds ubiquitin conjugating enzymes is a C3HC4-type, and does not form part of the alignment for this family. The remainder C-terminal part is involved in interactions with other proteins, and is included in this alignment. In addition to the Drosophila protein and mammalian homologues, whose similarity was noted previously, this family also includes putative homologues from Caenorhabditis elegans, Arabidopsis thaliana.; GO: 0006511 ubiquitin-dependent protein catabolic process, 0007275 multicellular organismal development, 0005634 nucleus; PDB: 2A25_A 1K2F_B 2AN6_C.
Probab=69.54  E-value=3.5  Score=28.01  Aligned_cols=44  Identities=27%  Similarity=0.552  Sum_probs=24.1

Q ss_pred             ccchhccchHHHHhhHhhcCCCCcccCCC----CCCCCChhhHHHHHHH
Q 032068           64 SQCSETMEREILAIHKGENCPQRIVTCDF----CEFPLPAVDLAEHQVA  108 (148)
Q Consensus        64 ~~C~~~~~~~~l~~h~~~~~~~~~~~C~~----C~~~f~~~~l~~H~~~  108 (148)
                      .+|...++......|+. .+.-+|+.||.    |+..-....|..|...
T Consensus        22 ~GC~~~~~~~~~~~HE~-~C~~~p~~CP~~~~~C~~~G~~~~l~~Hl~~   69 (198)
T PF03145_consen   22 YGCTETFPYSEKREHEE-ECPFRPCSCPFPGSGCDWQGSYKELLDHLRD   69 (198)
T ss_dssp             GT---EE-GGGHHHHHH-T-TTSEEE-SSSSTT---EEECCCHHHHHHH
T ss_pred             CCCcccccccChhhHhc-cCCCcCCcCCCCCCCccccCCHHHHHHHHHH
Confidence            45777766677777875 57778888877    5544445677788765


No 80 
>PF14353 CpXC:  CpXC protein
Probab=69.42  E-value=3.2  Score=26.06  Aligned_cols=11  Identities=18%  Similarity=0.824  Sum_probs=6.1

Q ss_pred             CccCccchhcc
Q 032068           60 PVACSQCSETM   70 (148)
Q Consensus        60 ~~~C~~C~~~~   70 (148)
                      .+.|+.||..+
T Consensus        38 ~~~CP~Cg~~~   48 (128)
T PF14353_consen   38 SFTCPSCGHKF   48 (128)
T ss_pred             EEECCCCCCce
Confidence            45556665555


No 81 
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=68.24  E-value=3.9  Score=21.10  Aligned_cols=23  Identities=22%  Similarity=0.473  Sum_probs=14.7

Q ss_pred             ccCCCCCCCCC-h-----hhHHHHHH-Hhc
Q 032068           88 VTCDFCEFPLP-A-----VDLAEHQV-AIV  110 (148)
Q Consensus        88 ~~C~~C~~~f~-~-----~~l~~H~~-~c~  110 (148)
                      ..|..|++.+. .     +.|..|++ .|+
T Consensus        19 a~C~~C~~~l~~~~~~gTs~L~rHl~~~h~   48 (50)
T smart00614       19 AKCKYCGKKLSRSSKGGTSNLRRHLRRKHP   48 (50)
T ss_pred             EEecCCCCEeeeCCCCCcHHHHHHHHhHCc
Confidence            46777776654 2     57777776 454


No 82 
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=65.45  E-value=5.7  Score=25.09  Aligned_cols=13  Identities=15%  Similarity=0.266  Sum_probs=7.9

Q ss_pred             CccCccchhccch
Q 032068           60 PVACSQCSETMER   72 (148)
Q Consensus        60 ~~~C~~C~~~~~~   72 (148)
                      |..|++||..|..
T Consensus        26 p~vcP~cg~~~~~   38 (129)
T TIGR02300        26 PAVSPYTGEQFPP   38 (129)
T ss_pred             CccCCCcCCccCc
Confidence            6666666666544


No 83 
>PF03604 DNA_RNApol_7kD:  DNA directed RNA polymerase, 7 kDa subunit;  InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=64.67  E-value=5.3  Score=18.74  Aligned_cols=11  Identities=27%  Similarity=0.842  Sum_probs=5.1

Q ss_pred             ccCCCCCCCCC
Q 032068           10 KICSHCDRAIP   20 (148)
Q Consensus        10 ~~C~~C~~~~~   20 (148)
                      |.|..|+..+.
T Consensus         1 Y~C~~Cg~~~~   11 (32)
T PF03604_consen    1 YICGECGAEVE   11 (32)
T ss_dssp             EBESSSSSSE-
T ss_pred             CCCCcCCCeeE
Confidence            34555555543


No 84 
>PF02892 zf-BED:  BED zinc finger;  InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=63.50  E-value=4.3  Score=20.21  Aligned_cols=21  Identities=19%  Similarity=0.578  Sum_probs=10.6

Q ss_pred             CcccCCCCCCCCC-h----hhHHHHH
Q 032068           86 RIVTCDFCEFPLP-A----VDLAEHQ  106 (148)
Q Consensus        86 ~~~~C~~C~~~f~-~----~~l~~H~  106 (148)
                      ....|..|++.+. .    +.|..|+
T Consensus        15 ~~a~C~~C~~~~~~~~~~ts~l~~HL   40 (45)
T PF02892_consen   15 KKAKCKYCGKVIKYSSGGTSNLKRHL   40 (45)
T ss_dssp             S-EEETTTTEE-----SSTHHHHHHH
T ss_pred             CeEEeCCCCeEEeeCCCcHHHHHHhh
Confidence            3446666766554 2    4666666


No 85 
>KOG4173 consensus Alpha-SNAP protein [Intracellular trafficking, secretion, and vesicular transport]
Probab=61.17  E-value=3.8  Score=28.15  Aligned_cols=71  Identities=24%  Similarity=0.417  Sum_probs=44.0

Q ss_pred             ccccc--cCCccChhh-HHHHHHcCCCCccCccchhccch-HHHHhhHhh----------cCCCCcccCC--CCCCCCC-
Q 032068           36 ERCKV--CGDMVPRKY-AEEHFLNTHAPVACSQCSETMER-EILAIHKGE----------NCPQRIVTCD--FCEFPLP-   98 (148)
Q Consensus        36 ~~C~~--C~~~f~~~~-l~~h~~~~h~~~~C~~C~~~~~~-~~l~~h~~~----------~~~~~~~~C~--~C~~~f~-   98 (148)
                      +.|+.  |.+.|...+ ...|....| .-.|..|.+.|+. -.|..|+..          .-|.-.|.|-  .|+..|. 
T Consensus        80 ~~cqvagc~~~~d~lD~~E~hY~~~h-~~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~KFkT  158 (253)
T KOG4173|consen   80 FACQVAGCCQVFDALDDYEHHYHTLH-GNSCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTEKFKT  158 (253)
T ss_pred             ccccccchHHHHhhhhhHHHhhhhcc-cchhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhhhhhh
Confidence            45764  444444332 222221222 4579999999977 555666532          3456678884  4999998 


Q ss_pred             hhhHHHHHH
Q 032068           99 AVDLAEHQV  107 (148)
Q Consensus        99 ~~~l~~H~~  107 (148)
                      ...-.+|+-
T Consensus       159 ~r~RkdH~I  167 (253)
T KOG4173|consen  159 SRDRKDHMI  167 (253)
T ss_pred             hhhhhhHHH
Confidence            777788864


No 86 
>PF07754 DUF1610:  Domain of unknown function (DUF1610);  InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=60.37  E-value=4.7  Score=17.60  Aligned_cols=8  Identities=25%  Similarity=0.667  Sum_probs=4.3

Q ss_pred             cccCCCCC
Q 032068           87 IVTCDFCE   94 (148)
Q Consensus        87 ~~~C~~C~   94 (148)
                      +|.||.||
T Consensus        16 ~f~CPnCG   23 (24)
T PF07754_consen   16 PFPCPNCG   23 (24)
T ss_pred             eEeCCCCC
Confidence            45555554


No 87 
>PF01530 zf-C2HC:  Zinc finger, C2HC type;  InterPro: IPR002515 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents the CysCysHisCys (C2HC) type zinc finger domain found in eukaryotes. Proteins containing these domains include:   MYST family histone acetyltransferases [, [] Myelin transcription factor Myt1 [] Suppressor of tumourigenicity protein 18 (ST18) []   More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 2CS8_A 1PXE_A 2JX1_A 2JYD_A.
Probab=60.22  E-value=4.5  Score=18.83  Aligned_cols=12  Identities=33%  Similarity=0.601  Sum_probs=8.6

Q ss_pred             cchhhhhccccC
Q 032068          136 GSLWESHRTLSS  147 (148)
Q Consensus       136 ~~~~~~~~~~~~  147 (148)
                      .-.|++|+||++
T Consensus        16 ~G~~~~H~s~sg   27 (31)
T PF01530_consen   16 TGKYATHRSLSG   27 (31)
T ss_dssp             TSSSSB-SSTTT
T ss_pred             cCCccCccCccc
Confidence            348999999875


No 88 
>PHA00626 hypothetical protein
Probab=59.18  E-value=6.5  Score=21.03  Aligned_cols=18  Identities=11%  Similarity=0.165  Sum_probs=13.6

Q ss_pred             CcccCCCCCCCCChhhHH
Q 032068           86 RIVTCDFCEFPLPAVDLA  103 (148)
Q Consensus        86 ~~~~C~~C~~~f~~~~l~  103 (148)
                      ..|.|+.||..|....+.
T Consensus        22 nrYkCkdCGY~ft~~~~~   39 (59)
T PHA00626         22 DDYVCCDCGYNDSKDAFG   39 (59)
T ss_pred             cceEcCCCCCeechhhhh
Confidence            578999999888855443


No 89 
>KOG2893 consensus Zn finger protein [General function prediction only]
Probab=58.90  E-value=2.7  Score=29.57  Aligned_cols=28  Identities=14%  Similarity=0.308  Sum_probs=12.7

Q ss_pred             ccCCccChhh-HHHHHHcCCCCccCccchhc
Q 032068           40 VCGDMVPRKY-AEEHFLNTHAPVACSQCSET   69 (148)
Q Consensus        40 ~C~~~f~~~~-l~~h~~~~h~~~~C~~C~~~   69 (148)
                      .|.+.|.... |.+|+..  +.|.|.+|.+.
T Consensus        15 ycnrefddekiliqhqka--khfkchichkk   43 (341)
T KOG2893|consen   15 YCNREFDDEKILIQHQKA--KHFKCHICHKK   43 (341)
T ss_pred             ecccccchhhhhhhhhhh--ccceeeeehhh
Confidence            4555555442 2334333  34555555443


No 90 
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=58.60  E-value=4.1  Score=21.18  Aligned_cols=11  Identities=18%  Similarity=0.785  Sum_probs=6.6

Q ss_pred             CccCccchhcc
Q 032068           60 PVACSQCSETM   70 (148)
Q Consensus        60 ~~~C~~C~~~~   70 (148)
                      .|.|..||+.|
T Consensus         6 ~Y~C~~Cg~~~   16 (49)
T COG1996           6 EYKCARCGREV   16 (49)
T ss_pred             EEEhhhcCCee
Confidence            45566666655


No 91 
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=58.35  E-value=2.9  Score=31.20  Aligned_cols=49  Identities=20%  Similarity=0.430  Sum_probs=39.2

Q ss_pred             CccCccchhccch-HHHHhhHh--hcCCC--CcccCC--CCCCCCC-hhhHHHHHHH
Q 032068           60 PVACSQCSETMER-EILAIHKG--ENCPQ--RIVTCD--FCEFPLP-AVDLAEHQVA  108 (148)
Q Consensus        60 ~~~C~~C~~~~~~-~~l~~h~~--~~~~~--~~~~C~--~C~~~f~-~~~l~~H~~~  108 (148)
                      ++.|..|...|.. ..+..|.+  .|.++  +++.|+  .|++.|. ...+..|...
T Consensus       289 ~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~  345 (467)
T COG5048         289 PIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILL  345 (467)
T ss_pred             CCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCccc
Confidence            4677888888877 78888888  78888  899998  6998888 7767777654


No 92 
>KOG2893 consensus Zn finger protein [General function prediction only]
Probab=57.19  E-value=3.8  Score=28.81  Aligned_cols=30  Identities=23%  Similarity=0.641  Sum_probs=16.2

Q ss_pred             Cccchhccch-HHHHhhHhhcCCCCcccCCCCCCC
Q 032068           63 CSQCSETMER-EILAIHKGENCPQRIVTCDFCEFP   96 (148)
Q Consensus        63 C~~C~~~~~~-~~l~~h~~~~~~~~~~~C~~C~~~   96 (148)
                      |=.|.+.|.. ..|.+|+..    +-|+|-.|.+.
T Consensus        13 cwycnrefddekiliqhqka----khfkchichkk   43 (341)
T KOG2893|consen   13 CWYCNREFDDEKILIQHQKA----KHFKCHICHKK   43 (341)
T ss_pred             eeecccccchhhhhhhhhhh----ccceeeeehhh
Confidence            4455666655 455555532    44566666654


No 93 
>KOG1146 consensus Homeobox protein [General function prediction only]
Probab=56.72  E-value=7.2  Score=34.14  Aligned_cols=26  Identities=19%  Similarity=0.319  Sum_probs=22.4

Q ss_pred             cCCCCcccCCCCCCCCC-cchHHHH-Hh
Q 032068            4 TSDETTKICSHCDRAIP-SSNIDLH-FA   29 (148)
Q Consensus         4 h~~~~~~~C~~C~~~~~-~~~l~~H-~~   29 (148)
                      ++..+.+.|..|++.|. ...|..| +.
T Consensus       460 ~S~~kt~~cpkc~~~yk~a~~L~vhmRs  487 (1406)
T KOG1146|consen  460 HSFFKTLKCPKCNWHYKLAQTLGVHMRS  487 (1406)
T ss_pred             ecccccccCCccchhhhhHHHhhhcccc
Confidence            45568899999999999 9999999 54


No 94 
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=55.62  E-value=4.6  Score=26.51  Aligned_cols=11  Identities=27%  Similarity=0.830  Sum_probs=5.3

Q ss_pred             ccccccCCccC
Q 032068           36 ERCKVCGDMVP   46 (148)
Q Consensus        36 ~~C~~C~~~f~   46 (148)
                      .+|+.|+..|.
T Consensus        29 ~~c~~c~~~f~   39 (154)
T PRK00464         29 RECLACGKRFT   39 (154)
T ss_pred             eeccccCCcce
Confidence            44555554443


No 95 
>KOG0297 consensus TNF receptor-associated factor [Signal transduction mechanisms]
Probab=51.64  E-value=26  Score=26.69  Aligned_cols=63  Identities=19%  Similarity=0.267  Sum_probs=39.1

Q ss_pred             ccchhccchHHHHhhHhhcCCCCcccCCC-CCCCCChhhHHHHH-HHhcccccCCc----chhhhhhhccCc
Q 032068           64 SQCSETMEREILAIHKGENCPQRIVTCDF-CEFPLPAVDLAEHQ-VAIVNNFENGK----TIADLSYLGVML  129 (148)
Q Consensus        64 ~~C~~~~~~~~l~~h~~~~~~~~~~~C~~-C~~~f~~~~l~~H~-~~c~~~~~~~~----~~~~~~~~~~~~  129 (148)
                      .+|........++.|... +  .+.+|+. |+..+.+..+.+|+ +.|+..-..+.    ......++.|.+
T Consensus        94 ~GC~~~~~l~~~~~Hl~~-c--~~~~C~~~C~~~~~~~d~~~hl~~~C~~~~~~c~~~~~~~~~~~~~~h~~  162 (391)
T KOG0297|consen   94 RGCRADLELEALQGHLST-C--DPLKCPHRCGVQVPRDDLEDHLEAECPRRSLKCSLCQSDSILILLEAHEE  162 (391)
T ss_pred             CCccccccHHHHHhHhcc-C--CcccCccccccccchHHHHHHHhcccccccccchhhcCccchhhhhhcCC
Confidence            456555555777777753 2  6778877 88888888888887 45665544433    233344444544


No 96 
>PF15135 UPF0515:  Uncharacterised protein UPF0515
Probab=51.03  E-value=9.9  Score=26.97  Aligned_cols=13  Identities=38%  Similarity=0.718  Sum_probs=7.7

Q ss_pred             CCCCcccCCCCCC
Q 032068            5 SDETTKICSHCDR   17 (148)
Q Consensus         5 ~~~~~~~C~~C~~   17 (148)
                      +.++.|.|..|..
T Consensus       108 ~~drqFaC~~Cd~  120 (278)
T PF15135_consen  108 SVDRQFACSSCDH  120 (278)
T ss_pred             ccceeeeccccch
Confidence            3445677777654


No 97 
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=50.36  E-value=12  Score=22.10  Aligned_cols=28  Identities=29%  Similarity=0.657  Sum_probs=18.5

Q ss_pred             CCccccccCCccChhhHHHHHHcCCCCccCccchh
Q 032068           34 NLERCKVCGDMVPRKYAEEHFLNTHAPVACSQCSE   68 (148)
Q Consensus        34 ~~~~C~~C~~~f~~~~l~~h~~~~h~~~~C~~C~~   68 (148)
                      +|-.|..||..|....+       ..|..|+.|..
T Consensus        57 ~Pa~CkkCGfef~~~~i-------k~pSRCP~CKS   84 (97)
T COG3357          57 RPARCKKCGFEFRDDKI-------KKPSRCPKCKS   84 (97)
T ss_pred             cChhhcccCcccccccc-------CCcccCCcchh
Confidence            46678889877754221       14778888853


No 98 
>PF09845 DUF2072:  Zn-ribbon containing protein (DUF2072);  InterPro: IPR018645  This archaeal Zinc-ribbon containing proteins have no known function. 
Probab=49.91  E-value=13  Score=23.63  Aligned_cols=12  Identities=25%  Similarity=0.814  Sum_probs=9.0

Q ss_pred             cccCCCCCCCCC
Q 032068            9 TKICSHCDRAIP   20 (148)
Q Consensus         9 ~~~C~~C~~~~~   20 (148)
                      |++|..|++.|.
T Consensus         1 PH~Ct~Cg~~f~   12 (131)
T PF09845_consen    1 PHQCTKCGRVFE   12 (131)
T ss_pred             CcccCcCCCCcC
Confidence            567777888777


No 99 
>PF12230 PRP21_like_P:  Pre-mRNA splicing factor PRP21 like protein;  InterPro: IPR022030  This domain family is found in eukaryotes, and is typically between 212 and 238 amino acids in length. The family is found in association with PF01805 from PFAM. There are two completely conserved residues (W and H) that may be functionally important. PRP21 is required for assembly of the prespliceosome and it interacts with U2 snRNP and/or pre-mRNA in the prespliceosome. This family also contains proteins similar to PRP21, such as the mammalian SF3a. SF3a also interacts with U2 snRNP from the prespliceosome, converting it to its active form. ; PDB: 4DGW_B.
Probab=49.84  E-value=5.4  Score=27.81  Aligned_cols=25  Identities=20%  Similarity=0.453  Sum_probs=0.0

Q ss_pred             CCCcccCCCCCCCCChhhHHHHHHH
Q 032068           84 PQRIVTCDFCEFPLPAVDLAEHQVA  108 (148)
Q Consensus        84 ~~~~~~C~~C~~~f~~~~l~~H~~~  108 (148)
                      +.....|+.||..++-+.+..|+++
T Consensus       165 ~~~~~~cPitGe~IP~~e~~eHmRi  189 (229)
T PF12230_consen  165 KEKMIICPITGEMIPADEMDEHMRI  189 (229)
T ss_dssp             -------------------------
T ss_pred             ccccccccccccccccccccccccc
Confidence            3455799999999999999999997


No 100
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=49.31  E-value=14  Score=21.75  Aligned_cols=32  Identities=19%  Similarity=0.445  Sum_probs=18.1

Q ss_pred             CCccccccCCccChhhHHHHHHcCCCCccCccchhccch
Q 032068           34 NLERCKVCGDMVPRKYAEEHFLNTHAPVACSQCSETMER   72 (148)
Q Consensus        34 ~~~~C~~C~~~f~~~~l~~h~~~~h~~~~C~~C~~~~~~   72 (148)
                      ..+.||.|++..- +-+    ..  .-+.|..|+..|..
T Consensus        34 ~~~~Cp~C~~~~V-kR~----a~--GIW~C~kCg~~fAG   65 (89)
T COG1997          34 AKHVCPFCGRTTV-KRI----AT--GIWKCRKCGAKFAG   65 (89)
T ss_pred             cCCcCCCCCCcce-eee----cc--CeEEcCCCCCeecc
Confidence            3567888876511 100    00  25778888877743


No 101
>PF13248 zf-ribbon_3:  zinc-ribbon domain
Probab=48.59  E-value=14  Score=16.20  Aligned_cols=8  Identities=25%  Similarity=1.045  Sum_probs=4.0

Q ss_pred             cCCCCCCC
Q 032068           11 ICSHCDRA   18 (148)
Q Consensus        11 ~C~~C~~~   18 (148)
                      .|+.|+..
T Consensus         4 ~Cp~Cg~~   11 (26)
T PF13248_consen    4 FCPNCGAE   11 (26)
T ss_pred             CCcccCCc
Confidence            45555553


No 102
>PF14369 zf-RING_3:  zinc-finger
Probab=47.36  E-value=17  Score=17.30  Aligned_cols=30  Identities=17%  Similarity=0.418  Sum_probs=15.3

Q ss_pred             cccCCCCCCCCC-cchHHHHHhhcCCCCccccccCCccC
Q 032068            9 TKICSHCDRAIP-SSNIDLHFAHCSRNLERCKVCGDMVP   46 (148)
Q Consensus         9 ~~~C~~C~~~~~-~~~l~~H~~~~~~~~~~C~~C~~~f~   46 (148)
                      .|-|-.|.+.+. ...        +.....|+.|+..|.
T Consensus         2 ~ywCh~C~~~V~~~~~--------~~~~~~CP~C~~gFv   32 (35)
T PF14369_consen    2 RYWCHQCNRFVRIAPS--------PDSDVACPRCHGGFV   32 (35)
T ss_pred             CEeCccCCCEeEeCcC--------CCCCcCCcCCCCcEe
Confidence            356667766654 211        111224777776663


No 103
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=46.57  E-value=7.9  Score=24.73  Aligned_cols=12  Identities=17%  Similarity=0.523  Sum_probs=7.4

Q ss_pred             cccCCCCCCCCC
Q 032068            9 TKICSHCDRAIP   20 (148)
Q Consensus         9 ~~~C~~C~~~~~   20 (148)
                      .+.|..|+..|.
T Consensus        70 ~~~C~~CG~~~~   81 (135)
T PRK03824         70 VLKCRNCGNEWS   81 (135)
T ss_pred             EEECCCCCCEEe
Confidence            356666666655


No 104
>KOG2907 consensus RNA polymerase I transcription factor TFIIS, subunit A12.2/RPA12 [Transcription]
Probab=44.38  E-value=13  Score=22.93  Aligned_cols=38  Identities=18%  Similarity=0.466  Sum_probs=21.8

Q ss_pred             CccCccchhccch-HHHHhhHhhcCCC-CcccCCCCCCCCC
Q 032068           60 PVACSQCSETMER-EILAIHKGENCPQ-RIVTCDFCEFPLP   98 (148)
Q Consensus        60 ~~~C~~C~~~~~~-~~l~~h~~~~~~~-~~~~C~~C~~~f~   98 (148)
                      ...|+.||..=+. ..++-.- .-.|+ .-|.|+.|++.|.
T Consensus        74 ~~kCpkCghe~m~Y~T~QlRS-ADEGQTVFYTC~kC~~k~~  113 (116)
T KOG2907|consen   74 KHKCPKCGHEEMSYHTLQLRS-ADEGQTVFYTCPKCKYKFT  113 (116)
T ss_pred             hccCcccCCchhhhhhhhccc-ccCCceEEEEcCccceeee
Confidence            4578888865544 3333221 12222 3478999988775


No 105
>PF05253 zf-U11-48K:  U11-48K-like CHHC zinc finger;  InterPro: IPR022776  This zinc binding domain [] has four conserved zinc chelating residues in a CHHC pattern. This domain is predicted to have an RNA-binding function []. ; PDB: 2VY5_A 2VY4_A.
Probab=44.11  E-value=21  Score=15.79  Aligned_cols=22  Identities=23%  Similarity=0.412  Sum_probs=12.3

Q ss_pred             cCCCCCCC-CChhhHHHHHHHhc
Q 032068           89 TCDFCEFP-LPAVDLAEHQVAIV  110 (148)
Q Consensus        89 ~C~~C~~~-f~~~~l~~H~~~c~  110 (148)
                      .|++-..- ++...|..|+..|.
T Consensus         4 ~CPyn~~H~v~~~~l~~Hi~~C~   26 (27)
T PF05253_consen    4 RCPYNPSHRVPASELQKHIKKCP   26 (27)
T ss_dssp             E-TTTSS-EEEGGGHHHHHHHHH
T ss_pred             eCCCCCCcCcCHHHHHHHHHHcC
Confidence            45543333 33677888887775


No 106
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=42.85  E-value=17  Score=27.18  Aligned_cols=59  Identities=24%  Similarity=0.408  Sum_probs=35.1

Q ss_pred             hHHHHHHcCCCCccCccch---hcc-------chHHHHhhHhhcCCCCcc----cCCCCCCCCC-hhhHHHHHH
Q 032068           49 YAEEHFLNTHAPVACSQCS---ETM-------EREILAIHKGENCPQRIV----TCDFCEFPLP-AVDLAEHQV  107 (148)
Q Consensus        49 ~l~~h~~~~h~~~~C~~C~---~~~-------~~~~l~~h~~~~~~~~~~----~C~~C~~~f~-~~~l~~H~~  107 (148)
                      .+..|....|..+.|..|-   +.|       ....|..|......+.-|    .|.+|...|- ...|..|.+
T Consensus       168 ~lk~H~K~~H~~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~~FYdDDEL~~HcR  241 (493)
T COG5236         168 ELKKHYKAQHGFVLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCKIYFYDDDELRRHCR  241 (493)
T ss_pred             HHHHHHHhhcCcEEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhhccceecChHHHHHHHH
Confidence            5666665545556666653   223       235677776543333222    5778887777 778888865


No 107
>PRK04023 DNA polymerase II large subunit; Validated
Probab=42.17  E-value=40  Score=29.13  Aligned_cols=10  Identities=20%  Similarity=0.408  Sum_probs=5.7

Q ss_pred             cccCCCCCCC
Q 032068           87 IVTCDFCEFP   96 (148)
Q Consensus        87 ~~~C~~C~~~   96 (148)
                      ++.|+.|+..
T Consensus       663 ~y~CPKCG~E  672 (1121)
T PRK04023        663 EDECEKCGRE  672 (1121)
T ss_pred             CCcCCCCCCC
Confidence            3556666654


No 108
>PF09963 DUF2197:  Uncharacterized protein conserved in bacteria (DUF2197);  InterPro: IPR019241  This family represents various hypothetical bacterial proteins with no known function. 
Probab=41.15  E-value=12  Score=20.10  Aligned_cols=34  Identities=15%  Similarity=0.365  Sum_probs=15.6

Q ss_pred             cCCCCCCCCC--cchHHHH-HhhcCCCCccccccCCc
Q 032068           11 ICSHCDRAIP--SSNIDLH-FAHCSRNLERCKVCGDM   44 (148)
Q Consensus        11 ~C~~C~~~~~--~~~l~~H-~~~~~~~~~~C~~C~~~   44 (148)
                      .|..|++.+.  ..++..- +...+..+|.|+.|...
T Consensus         4 kC~lCdk~~~Id~~~~~aKrLrnrPi~tYmC~eC~~R   40 (56)
T PF09963_consen    4 KCILCDKKEEIDEDTPEAKRLRNRPIHTYMCDECKER   40 (56)
T ss_pred             EEEecCCEEEeccCCHHHHHhhcCCCcceeChhHHHH
Confidence            4555555544  2233333 23344445555555433


No 109
>PF12907 zf-met2:  Zinc-binding
Probab=40.79  E-value=4.9  Score=19.94  Aligned_cols=18  Identities=22%  Similarity=0.599  Sum_probs=9.3

Q ss_pred             ccCCCCCCCCC-c---chHHHH
Q 032068           10 KICSHCDRAIP-S---SNIDLH   27 (148)
Q Consensus        10 ~~C~~C~~~~~-~---~~l~~H   27 (148)
                      +.|..|..+|. .   ..|.+|
T Consensus         2 i~C~iC~qtF~~t~~~~~L~eH   23 (40)
T PF12907_consen    2 IICKICRQTFMQTTNEPQLKEH   23 (40)
T ss_pred             cCcHHhhHHHHhcCCHHHHHHH
Confidence            45666664444 2   345566


No 110
>PRK12380 hydrogenase nickel incorporation protein HybF; Provisional
Probab=39.94  E-value=15  Score=22.65  Aligned_cols=9  Identities=33%  Similarity=0.789  Sum_probs=4.3

Q ss_pred             cccccCCcc
Q 032068           37 RCKVCGDMV   45 (148)
Q Consensus        37 ~C~~C~~~f   45 (148)
                      .|..|+..|
T Consensus        72 ~C~~Cg~~~   80 (113)
T PRK12380         72 WCWDCSQVV   80 (113)
T ss_pred             EcccCCCEE
Confidence            455555443


No 111
>PF13878 zf-C2H2_3:  zinc-finger of acetyl-transferase ESCO
Probab=39.78  E-value=30  Score=17.06  Aligned_cols=11  Identities=18%  Similarity=0.591  Sum_probs=6.3

Q ss_pred             ccCCCCCCCCC
Q 032068           10 KICSHCDRAIP   20 (148)
Q Consensus        10 ~~C~~C~~~~~   20 (148)
                      ..|+.|+..|.
T Consensus        14 ~~C~~CgM~Y~   24 (41)
T PF13878_consen   14 TTCPTCGMLYS   24 (41)
T ss_pred             cCCCCCCCEEC
Confidence            45666666555


No 112
>PF02591 DUF164:  Putative zinc ribbon domain;  InterPro: IPR003743 This entry describes proteins of unknown function.
Probab=39.67  E-value=32  Score=18.09  Aligned_cols=33  Identities=24%  Similarity=0.344  Sum_probs=19.2

Q ss_pred             CccCccchhccchHHHHhhHhhcCCCCcccCCCCCC
Q 032068           60 PVACSQCSETMEREILAIHKGENCPQRIVTCDFCEF   95 (148)
Q Consensus        60 ~~~C~~C~~~~~~~~l~~h~~~~~~~~~~~C~~C~~   95 (148)
                      .-.|..|+-..+...+..   ...++....|+.||+
T Consensus        22 ~~~C~gC~~~l~~~~~~~---i~~~~~i~~Cp~CgR   54 (56)
T PF02591_consen   22 GGTCSGCHMELPPQELNE---IRKGDEIVFCPNCGR   54 (56)
T ss_pred             CCccCCCCEEcCHHHHHH---HHcCCCeEECcCCCc
Confidence            346777776666543332   123356677888875


No 113
>PLN03238 probable histone acetyltransferase MYST; Provisional
Probab=39.10  E-value=27  Score=25.41  Aligned_cols=38  Identities=16%  Similarity=0.227  Sum_probs=29.4

Q ss_pred             CCCcccCCCCCCCCC-hhhHHHHHHHhcccccCCcchhh
Q 032068           84 PQRIVTCDFCEFPLP-AVDLAEHQVAIVNNFENGKTIAD  121 (148)
Q Consensus        84 ~~~~~~C~~C~~~f~-~~~l~~H~~~c~~~~~~~~~~~~  121 (148)
                      ....|.|+.|=+-|. ...+..|...|..+..++..+.+
T Consensus        45 ~~~lyiCe~Clky~~~~~~l~~H~~~C~~r~PPG~eIYr   83 (290)
T PLN03238         45 CTKLYICEYCLKYMRKKKSLLRHLAKCDIRQPPGGGIYG   83 (290)
T ss_pred             CCeEEEcCCCcchhCCHHHHHHHHHhCCCCCCCcCEeEe
Confidence            346788888887777 88889998888877777666665


No 114
>smart00440 ZnF_C2C2 C2C2 Zinc finger. Nucleic-acid-binding motif in transcriptional elongation factor TFIIS and RNA polymerases.
Probab=38.84  E-value=5.4  Score=19.66  Aligned_cols=10  Identities=20%  Similarity=0.537  Sum_probs=5.9

Q ss_pred             ccCCCCCCCC
Q 032068           88 VTCDFCEFPL   97 (148)
Q Consensus        88 ~~C~~C~~~f   97 (148)
                      |.|..|+..+
T Consensus        29 y~C~~C~~~w   38 (40)
T smart00440       29 YVCTKCGHRW   38 (40)
T ss_pred             EEeCCCCCEe
Confidence            5666666543


No 115
>KOG2231 consensus Predicted E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones]
Probab=37.28  E-value=91  Score=25.76  Aligned_cols=19  Identities=26%  Similarity=0.502  Sum_probs=9.8

Q ss_pred             cCCCCCCCCC-hhhHHHHHH
Q 032068           89 TCDFCEFPLP-AVDLAEHQV  107 (148)
Q Consensus        89 ~C~~C~~~f~-~~~l~~H~~  107 (148)
                      .|++|...|. ...|..|++
T Consensus       184 ~C~~C~~~fld~~el~rH~~  203 (669)
T KOG2231|consen  184 LCKFCHERFLDDDELYRHLR  203 (669)
T ss_pred             cchhhhhhhccHHHHHHhhc
Confidence            3445555555 555555554


No 116
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=36.61  E-value=39  Score=29.94  Aligned_cols=27  Identities=19%  Similarity=0.573  Sum_probs=15.0

Q ss_pred             CccCccchhccchHHHHhhHhhcCCCCcccCCCCCCC
Q 032068           60 PVACSQCSETMEREILAIHKGENCPQRIVTCDFCEFP   96 (148)
Q Consensus        60 ~~~C~~C~~~~~~~~l~~h~~~~~~~~~~~C~~C~~~   96 (148)
                      .+.|+.||......        .++  ...|+.|+..
T Consensus       692 vy~CPsCGaev~~d--------es~--a~~CP~CGtp  718 (1337)
T PRK14714        692 VYVCPDCGAEVPPD--------ESG--RVECPRCDVE  718 (1337)
T ss_pred             ceeCccCCCccCCC--------ccc--cccCCCCCCc
Confidence            56777777554330        011  3368888854


No 117
>PF07975 C1_4:  TFIIH C1-like domain;  InterPro: IPR004595 All proteins in this domain for which functions are known are components of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. It includes the yeast transcription factor Ssl1 (Suppressor of stem-loop protein 1) that is essential for translation initiation and affects UV resistance. The C-terminal region is essential for transcription activity. This regions binds three zinc atoms through two independent domain. The first contains a C4 zinc finger motif, whereas the second is characterised by a CX(2)CX(2-4)FCADCD motif. The solution structure of the second C-terminal domain revealed homology with the regulatory domain of protein kinase C [].; GO: 0006281 DNA repair, 0005634 nucleus; PDB: 1Z60_A.
Probab=36.56  E-value=13  Score=19.46  Aligned_cols=14  Identities=21%  Similarity=0.368  Sum_probs=4.3

Q ss_pred             CccccccCCccChh
Q 032068           35 LERCKVCGDMVPRK   48 (148)
Q Consensus        35 ~~~C~~C~~~f~~~   48 (148)
                      .+.|+.|...|-..
T Consensus        21 ~y~C~~C~~~FC~d   34 (51)
T PF07975_consen   21 RYRCPKCKNHFCID   34 (51)
T ss_dssp             EE--TTTT--B-HH
T ss_pred             eEECCCCCCccccC
Confidence            34455555555444


No 118
>PRK00564 hypA hydrogenase nickel incorporation protein; Provisional
Probab=36.55  E-value=20  Score=22.24  Aligned_cols=9  Identities=33%  Similarity=0.970  Sum_probs=4.0

Q ss_pred             cccccCCcc
Q 032068           37 RCKVCGDMV   45 (148)
Q Consensus        37 ~C~~C~~~f   45 (148)
                      .|..|+..|
T Consensus        73 ~C~~Cg~~~   81 (117)
T PRK00564         73 ECKDCSHVF   81 (117)
T ss_pred             EhhhCCCcc
Confidence            344454333


No 119
>KOG2593 consensus Transcription initiation factor IIE, alpha subunit [Transcription]
Probab=35.95  E-value=28  Score=26.79  Aligned_cols=16  Identities=13%  Similarity=0.077  Sum_probs=9.4

Q ss_pred             CCCccccccCCccChh
Q 032068           33 RNLERCKVCGDMVPRK   48 (148)
Q Consensus        33 ~~~~~C~~C~~~f~~~   48 (148)
                      ...|.|+.|.++|...
T Consensus       126 ~~~Y~Cp~C~kkyt~L  141 (436)
T KOG2593|consen  126 VAGYVCPNCQKKYTSL  141 (436)
T ss_pred             cccccCCccccchhhh
Confidence            3456666666666554


No 120
>TIGR00100 hypA hydrogenase nickel insertion protein HypA. In Hpylori, hypA mutant abolished hydrogenase activity and decrease in urease activity. Nickel supplementation in media restored urease activity and partial hydrogenase activity. HypA probably involved in inserting Ni in enzymes.
Probab=35.80  E-value=22  Score=21.97  Aligned_cols=9  Identities=33%  Similarity=0.970  Sum_probs=4.4

Q ss_pred             cccccCCcc
Q 032068           37 RCKVCGDMV   45 (148)
Q Consensus        37 ~C~~C~~~f   45 (148)
                      .|..|+..|
T Consensus        72 ~C~~Cg~~~   80 (115)
T TIGR00100        72 ECEDCSEEV   80 (115)
T ss_pred             EcccCCCEE
Confidence            455555444


No 121
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=34.98  E-value=12  Score=29.58  Aligned_cols=8  Identities=50%  Similarity=1.373  Sum_probs=3.8

Q ss_pred             CccCccch
Q 032068           60 PVACSQCS   67 (148)
Q Consensus        60 ~~~C~~C~   67 (148)
                      |..|+.|+
T Consensus       253 ~~~Cp~C~  260 (505)
T TIGR00595       253 PKTCPQCG  260 (505)
T ss_pred             CCCCCCCC
Confidence            44455554


No 122
>PRK03681 hypA hydrogenase nickel incorporation protein; Validated
Probab=34.20  E-value=21  Score=22.03  Aligned_cols=8  Identities=25%  Similarity=0.929  Sum_probs=3.9

Q ss_pred             cccccCCc
Q 032068           37 RCKVCGDM   44 (148)
Q Consensus        37 ~C~~C~~~   44 (148)
                      .|..|+..
T Consensus        72 ~C~~Cg~~   79 (114)
T PRK03681         72 WCETCQQY   79 (114)
T ss_pred             EcccCCCe
Confidence            45555543


No 123
>PF01155 HypA:  Hydrogenase expression/synthesis hypA family;  InterPro: IPR000688 Bacterial membrane-bound nickel-dependent hydrogenases requires a number of accessory proteins which are involved in their maturation. The exact role of these proteins is not yet clear, but some seem to be required for the incorporation of the nickel ions []. One of these proteins is generally known as hypA. It is a protein of about 12 to 14 kDa that contains, in its C-terminal region, four conserved cysteines that form a zinc-finger like motif. Escherichia coli has two proteins that belong to this family, hypA and hybF. A homologue, MJ0214, has also been found in a number of archaeal species, including the genome of Methanocaldococcus jannaschii (Methanococcus jannaschii).; GO: 0016151 nickel ion binding, 0006464 protein modification process; PDB: 2KDX_A 3A44_D 3A43_B.
Probab=33.24  E-value=12  Score=23.05  Aligned_cols=9  Identities=44%  Similarity=1.206  Sum_probs=4.1

Q ss_pred             cccccCCcc
Q 032068           37 RCKVCGDMV   45 (148)
Q Consensus        37 ~C~~C~~~f   45 (148)
                      .|..|+..|
T Consensus        72 ~C~~Cg~~~   80 (113)
T PF01155_consen   72 RCRDCGHEF   80 (113)
T ss_dssp             EETTTS-EE
T ss_pred             ECCCCCCEE
Confidence            455555444


No 124
>PF14446 Prok-RING_1:  Prokaryotic RING finger family 1
Probab=32.33  E-value=28  Score=18.50  Aligned_cols=10  Identities=20%  Similarity=0.737  Sum_probs=5.9

Q ss_pred             cCCCCCCCCC
Q 032068           11 ICSHCDRAIP   20 (148)
Q Consensus        11 ~C~~C~~~~~   20 (148)
                      .|..|++.|.
T Consensus         7 ~C~~Cg~~~~   16 (54)
T PF14446_consen    7 KCPVCGKKFK   16 (54)
T ss_pred             cChhhCCccc
Confidence            4666666654


No 125
>PF14205 Cys_rich_KTR:  Cysteine-rich KTR
Probab=32.16  E-value=13  Score=19.79  Aligned_cols=34  Identities=24%  Similarity=0.584  Sum_probs=16.5

Q ss_pred             CccccccCCccChhhHHHHHHcCCCCccCccchhc
Q 032068           35 LERCKVCGDMVPRKYAEEHFLNTHAPVACSQCSET   69 (148)
Q Consensus        35 ~~~C~~C~~~f~~~~l~~h~~~~h~~~~C~~C~~~   69 (148)
                      ...||.|+.+...+-.. .......|..|+.|.+.
T Consensus         4 Wi~CP~CgnKTR~kir~-DT~LkNfPlyCpKCK~E   37 (55)
T PF14205_consen    4 WILCPICGNKTRLKIRE-DTVLKNFPLYCPKCKQE   37 (55)
T ss_pred             EEECCCCCCccceeeec-CceeccccccCCCCCce
Confidence            45677777665433100 10010137777777544


No 126
>COG2331 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=31.32  E-value=15  Score=21.02  Aligned_cols=30  Identities=20%  Similarity=0.538  Sum_probs=15.7

Q ss_pred             ccCCCCCCCCCcchHHHHHhhcCCCCccccccCCc
Q 032068           10 KICSHCDRAIPSSNIDLHFAHCSRNLERCKVCGDM   44 (148)
Q Consensus        10 ~~C~~C~~~~~~~~l~~H~~~~~~~~~~C~~C~~~   44 (148)
                      |+|.+|+..|.   +.+++...+  ...|+.|+..
T Consensus        13 Y~c~~cg~~~d---vvq~~~ddp--lt~ce~c~a~   42 (82)
T COG2331          13 YECTECGNRFD---VVQAMTDDP--LTTCEECGAR   42 (82)
T ss_pred             EeecccchHHH---HHHhcccCc--cccChhhChH
Confidence            67777777643   322322222  3457777654


No 127
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=30.30  E-value=35  Score=24.20  Aligned_cols=11  Identities=27%  Similarity=0.703  Sum_probs=6.8

Q ss_pred             CccCccchhcc
Q 032068           60 PVACSQCSETM   70 (148)
Q Consensus        60 ~~~C~~C~~~~   70 (148)
                      ...||.||+..
T Consensus       221 iv~CP~CgRIL  231 (239)
T COG1579         221 IVFCPYCGRIL  231 (239)
T ss_pred             CccCCccchHH
Confidence            55677777543


No 128
>PF05191 ADK_lid:  Adenylate kinase, active site lid;  InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=30.17  E-value=18  Score=17.39  Aligned_cols=9  Identities=22%  Similarity=0.892  Sum_probs=4.5

Q ss_pred             cCccchhcc
Q 032068           62 ACSQCSETM   70 (148)
Q Consensus        62 ~C~~C~~~~   70 (148)
                      .|+.||+.|
T Consensus         3 ~C~~Cg~~Y   11 (36)
T PF05191_consen    3 ICPKCGRIY   11 (36)
T ss_dssp             EETTTTEEE
T ss_pred             CcCCCCCcc
Confidence            355555544


No 129
>PF10013 DUF2256:  Uncharacterized protein conserved in bacteria (DUF2256);  InterPro: IPR017136 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=28.94  E-value=48  Score=16.62  Aligned_cols=16  Identities=25%  Similarity=0.580  Sum_probs=12.1

Q ss_pred             cCCCCCCCCC-hhhHHH
Q 032068           89 TCDFCEFPLP-AVDLAE  104 (148)
Q Consensus        89 ~C~~C~~~f~-~~~l~~  104 (148)
                      .|+.|++.|. +.....
T Consensus        10 ~C~~C~rpf~WRKKW~~   26 (42)
T PF10013_consen   10 ICPVCGRPFTWRKKWAR   26 (42)
T ss_pred             cCcccCCcchHHHHHHH
Confidence            5888888888 766654


No 130
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=28.71  E-value=42  Score=20.70  Aligned_cols=19  Identities=11%  Similarity=0.165  Sum_probs=12.7

Q ss_pred             CcccCCCCCCCCChhhHHH
Q 032068           86 RIVTCDFCEFPLPAVDLAE  104 (148)
Q Consensus        86 ~~~~C~~C~~~f~~~~l~~  104 (148)
                      .|..|++||++|+++-|..
T Consensus        25 dPiVsPytG~s~P~s~fe~   43 (129)
T COG4530          25 DPIVSPYTGKSYPRSYFEE   43 (129)
T ss_pred             CccccCcccccchHHHHHh
Confidence            4667777888777655544


No 131
>smart00154 ZnF_AN1 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis.
Probab=28.55  E-value=32  Score=16.78  Aligned_cols=12  Identities=25%  Similarity=0.539  Sum_probs=8.8

Q ss_pred             cccCCCCCCCCC
Q 032068            9 TKICSHCDRAIP   20 (148)
Q Consensus         9 ~~~C~~C~~~~~   20 (148)
                      |+.|..|++.|-
T Consensus        12 ~f~C~~C~~~FC   23 (39)
T smart00154       12 GFKCRHCGNLFC   23 (39)
T ss_pred             CeECCccCCccc
Confidence            677777777765


No 132
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=26.04  E-value=33  Score=19.40  Aligned_cols=12  Identities=33%  Similarity=0.617  Sum_probs=7.3

Q ss_pred             CccCc--cchhccc
Q 032068           60 PVACS--QCSETME   71 (148)
Q Consensus        60 ~~~C~--~C~~~~~   71 (148)
                      -+.|.  .||..|.
T Consensus        27 Y~qC~N~eCg~tF~   40 (72)
T PRK09678         27 YHQCQNVNCSATFI   40 (72)
T ss_pred             eeecCCCCCCCEEE
Confidence            45564  6776664


No 133
>PTZ00064 histone acetyltransferase; Provisional
Probab=25.91  E-value=41  Score=26.66  Aligned_cols=37  Identities=16%  Similarity=0.237  Sum_probs=30.1

Q ss_pred             CCcccCCCCCCCCC-hhhHHHHHHHhcccccCCcchhh
Q 032068           85 QRIVTCDFCEFPLP-AVDLAEHQVAIVNNFENGKTIAD  121 (148)
Q Consensus        85 ~~~~~C~~C~~~f~-~~~l~~H~~~c~~~~~~~~~~~~  121 (148)
                      ...|.|++|=+-|. ...|..|...|.....++..+.+
T Consensus       278 d~LYICEfCLkY~~s~~~l~rH~~~C~~rhPPG~EIYR  315 (552)
T PTZ00064        278 DTLHFCEYCLDFFCFEDELIRHLSRCQLRHPPGNEIYR  315 (552)
T ss_pred             CeEEEccchhhhhCCHHHHHHHHhcCCCCCCCCCeEEE
Confidence            45788999988888 89999999999887777776654


No 134
>PF05290 Baculo_IE-1:  Baculovirus immediate-early protein (IE-0);  InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=25.88  E-value=68  Score=20.61  Aligned_cols=16  Identities=19%  Similarity=0.372  Sum_probs=11.8

Q ss_pred             CCCCcccCCCCCCCCC
Q 032068            5 SDETTKICSHCDRAIP   20 (148)
Q Consensus         5 ~~~~~~~C~~C~~~~~   20 (148)
                      .+.+-|+|..|.++..
T Consensus        76 ~d~~lYeCnIC~etS~   91 (140)
T PF05290_consen   76 LDPKLYECNICKETSA   91 (140)
T ss_pred             cCCCceeccCcccccc
Confidence            3456799999988754


No 135
>PRK05978 hypothetical protein; Provisional
Probab=25.47  E-value=28  Score=22.71  Aligned_cols=7  Identities=43%  Similarity=1.265  Sum_probs=3.4

Q ss_pred             cCccchh
Q 032068           62 ACSQCSE   68 (148)
Q Consensus        62 ~C~~C~~   68 (148)
                      .|+.||+
T Consensus        35 rCP~CG~   41 (148)
T PRK05978         35 RCPACGE   41 (148)
T ss_pred             cCCCCCC
Confidence            4555543


No 136
>PF01428 zf-AN1:  AN1-like Zinc finger;  InterPro: IPR000058 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents the AN1-type zinc finger domain, which has a dimetal (zinc)-bound alpha/beta fold. This domain was first identified as a zinc finger at the C terminus of AN1 Q91889 from SWISSPROT, a ubiquitin-like protein in Xenopus laevis []. The AN1-type zinc finger contains six conserved cysteines and two histidines that could potentially coordinate 2 zinc atoms. Certain stress-associated proteins (SAP) contain AN1 domain, often in combination with A20 zinc finger domains (SAP8) or C2H2 domains (SAP16) []. For example, the human protein Znf216 has an A20 zinc-finger at the N terminus and an AN1 zinc-finger at the C terminus, acting to negatively regulate the NFkappaB activation pathway and to interact with components of the immune response like RIP, IKKgamma and TRAF6. The interact of Znf216 with IKK-gamma and RIP is mediated by the A20 zinc-finger domain, while its interaction with TRAF6 is mediated by the AN1 zinc-finger domain; therefore, both zinc-finger domains are involved in regulating the immune response []. The AN1 zinc finger domain is also found in proteins containing a ubiquitin-like domain, which are involved in the ubiquitination pathway []. Proteins containing an AN1-type zinc finger include:   Ascidian posterior end mark 6 (pem-6) protein []. Human AWP1 protein (associated with PRK1), which is expressed during early embryogenesis []. Human immunoglobulin mu binding protein 2 (SMUBP-2), mutations in which cause muscular atrophy with respiratory distress type 1 [].   More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 1WFP_A 1WYS_A 1WG2_A 1WFH_A 1X4W_A 1WFE_A 1WFL_A 1X4V_A.
Probab=25.34  E-value=20  Score=17.78  Aligned_cols=12  Identities=17%  Similarity=0.476  Sum_probs=7.1

Q ss_pred             CcccCCCCCCCC
Q 032068           86 RIVTCDFCEFPL   97 (148)
Q Consensus        86 ~~~~C~~C~~~f   97 (148)
                      .|+.|+.|+..|
T Consensus        12 ~~~~C~~C~~~F   23 (43)
T PF01428_consen   12 LPFKCKHCGKSF   23 (43)
T ss_dssp             SHEE-TTTS-EE
T ss_pred             CCeECCCCCccc
Confidence            467788887765


No 137
>PF08790 zf-LYAR:  LYAR-type C2HC zinc finger ;  InterPro: IPR014898 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This C2HC zinc finger domain is found in LYAR proteins such as Q08288 from SWISSPROT, which are involved in cell growth regulation.  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1WJV_A.
Probab=25.32  E-value=16  Score=16.55  Aligned_cols=9  Identities=33%  Similarity=1.103  Sum_probs=3.8

Q ss_pred             Cccchhccc
Q 032068           63 CSQCSETME   71 (148)
Q Consensus        63 C~~C~~~~~   71 (148)
                      |..|++.|.
T Consensus         3 CiDC~~~F~   11 (28)
T PF08790_consen    3 CIDCSKDFD   11 (28)
T ss_dssp             ETTTTEEEE
T ss_pred             eecCCCCcC
Confidence            334444443


No 138
>PF12760 Zn_Tnp_IS1595:  Transposase zinc-ribbon domain;  InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=24.64  E-value=26  Score=17.64  Aligned_cols=9  Identities=22%  Similarity=0.626  Sum_probs=4.0

Q ss_pred             CcccCCCCC
Q 032068           86 RIVTCDFCE   94 (148)
Q Consensus        86 ~~~~C~~C~   94 (148)
                      ..|.|..|+
T Consensus        36 ~~~~C~~C~   44 (46)
T PF12760_consen   36 GRYRCKACR   44 (46)
T ss_pred             CeEECCCCC
Confidence            344444443


No 139
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=24.26  E-value=13  Score=30.60  Aligned_cols=11  Identities=27%  Similarity=0.631  Sum_probs=6.2

Q ss_pred             ccccCCccChh
Q 032068           38 CKVCGDMVPRK   48 (148)
Q Consensus        38 C~~C~~~f~~~   48 (148)
                      |..||-.|+.-
T Consensus       126 CT~CGPRfTIi  136 (750)
T COG0068         126 CTNCGPRFTII  136 (750)
T ss_pred             cCCCCcceeee
Confidence            66666665533


No 140
>PF12773 DZR:  Double zinc ribbon
Probab=23.89  E-value=1e+02  Score=15.43  Aligned_cols=9  Identities=22%  Similarity=0.792  Sum_probs=4.7

Q ss_pred             CccCccchh
Q 032068           60 PVACSQCSE   68 (148)
Q Consensus        60 ~~~C~~C~~   68 (148)
                      ...|+.|+.
T Consensus        29 ~~~C~~Cg~   37 (50)
T PF12773_consen   29 KKICPNCGA   37 (50)
T ss_pred             CCCCcCCcC
Confidence            445555553


No 141
>COG1656 Uncharacterized conserved protein [Function unknown]
Probab=23.36  E-value=68  Score=21.36  Aligned_cols=11  Identities=18%  Similarity=0.184  Sum_probs=5.3

Q ss_pred             ccCCCCCCCCC
Q 032068           88 VTCDFCEFPLP   98 (148)
Q Consensus        88 ~~C~~C~~~f~   98 (148)
                      +.|+.|++.|=
T Consensus       131 ~~C~~CgkiYW  141 (165)
T COG1656         131 YRCPKCGKIYW  141 (165)
T ss_pred             eECCCCccccc
Confidence            34555555443


No 142
>PF14445 Prok-RING_2:  Prokaryotic RING finger family 2
Probab=23.30  E-value=36  Score=17.80  Aligned_cols=18  Identities=28%  Similarity=0.702  Sum_probs=11.3

Q ss_pred             cccCCCCCCCCChhhHHH
Q 032068           87 IVTCDFCEFPLPAVDLAE  104 (148)
Q Consensus        87 ~~~C~~C~~~f~~~~l~~  104 (148)
                      .|.|+.|...|+.+.|..
T Consensus         7 ry~CDLCn~~~p~~~LRQ   24 (57)
T PF14445_consen    7 RYSCDLCNSSHPISELRQ   24 (57)
T ss_pred             hHhHHhhcccCcHHHHHH
Confidence            466777777777555443


No 143
>KOG2482 consensus Predicted C2H2-type Zn-finger protein [Transcription]
Probab=23.19  E-value=50  Score=24.81  Aligned_cols=22  Identities=27%  Similarity=0.445  Sum_probs=19.7

Q ss_pred             cccCCCCCCCCC-hhhHHHHHHH
Q 032068           87 IVTCDFCEFPLP-AVDLAEHQVA  108 (148)
Q Consensus        87 ~~~C~~C~~~f~-~~~l~~H~~~  108 (148)
                      ...|-.|.+.|+ +..|++|++.
T Consensus       195 r~~CLyCekifrdkntLkeHMrk  217 (423)
T KOG2482|consen  195 RLRCLYCEKIFRDKNTLKEHMRK  217 (423)
T ss_pred             hheeeeeccccCCcHHHHHHHHh
Confidence            467889999999 9999999986


No 144
>PF11789 zf-Nse:  Zinc-finger of the MIZ type in Nse subunit; PDB: 2YU4_A 3HTK_C.
Probab=23.17  E-value=55  Score=17.44  Aligned_cols=31  Identities=23%  Similarity=0.394  Sum_probs=15.7

Q ss_pred             CCccCccchhccchHHHHhhHhhcCCCCcccCCC
Q 032068           59 APVACSQCSETMEREILAIHKGENCPQRIVTCDF   92 (148)
Q Consensus        59 ~~~~C~~C~~~~~~~~l~~h~~~~~~~~~~~C~~   92 (148)
                      .|+....|+..|..+.+...+   .......||.
T Consensus        23 ~PV~s~~C~H~fek~aI~~~i---~~~~~~~CPv   53 (57)
T PF11789_consen   23 DPVKSKKCGHTFEKEAILQYI---QRNGSKRCPV   53 (57)
T ss_dssp             SEEEESSS--EEEHHHHHHHC---TTTS-EE-SC
T ss_pred             CCcCcCCCCCeecHHHHHHHH---HhcCCCCCCC
Confidence            366666777777775555544   2234556665


No 145
>COG4896 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=22.98  E-value=28  Score=18.98  Aligned_cols=35  Identities=20%  Similarity=0.366  Sum_probs=15.7

Q ss_pred             cCCCCCCCCC--cchHHHH-HhhcCCCCccccccCCcc
Q 032068           11 ICSHCDRAIP--SSNIDLH-FAHCSRNLERCKVCGDMV   45 (148)
Q Consensus        11 ~C~~C~~~~~--~~~l~~H-~~~~~~~~~~C~~C~~~f   45 (148)
                      .|..|++.-.  ...+..- ....+.+.|.|++|....
T Consensus         4 kCiiCd~v~~iD~rt~~tKrLrN~PIrtymC~eC~~Rv   41 (68)
T COG4896           4 KCIICDRVDEIDNRTFKTKRLRNKPIRTYMCPECEHRV   41 (68)
T ss_pred             eEEEecceeeecchhHHHHHhhCCCceeEechhhHhhh
Confidence            3445544422  3333333 344455555566555433


No 146
>PF11672 DUF3268:  Protein of unknown function (DUF3268);  InterPro: IPR021686 This entry is represented by Listeria phage P100, Gp150. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=22.96  E-value=44  Score=20.32  Aligned_cols=12  Identities=25%  Similarity=0.592  Sum_probs=6.7

Q ss_pred             cccCCCCCCCCC
Q 032068            9 TKICSHCDRAIP   20 (148)
Q Consensus         9 ~~~C~~C~~~~~   20 (148)
                      |..|+.|+....
T Consensus         2 p~~CpYCg~~~~   13 (102)
T PF11672_consen    2 PIICPYCGGPAE   13 (102)
T ss_pred             CcccCCCCCeeE
Confidence            455666665543


No 147
>PF04959 ARS2:  Arsenite-resistance protein 2;  InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=22.76  E-value=50  Score=23.02  Aligned_cols=25  Identities=12%  Similarity=0.206  Sum_probs=16.8

Q ss_pred             CCCcccCCCCCCCCC-hhhHHHHHHH
Q 032068           84 PQRIVTCDFCEFPLP-AVDLAEHQVA  108 (148)
Q Consensus        84 ~~~~~~C~~C~~~f~-~~~l~~H~~~  108 (148)
                      .+.-|.|+.|+|.|. ..=+..|+..
T Consensus        74 ~~~K~~C~lc~KlFkg~eFV~KHI~n   99 (214)
T PF04959_consen   74 DEDKWRCPLCGKLFKGPEFVRKHIFN   99 (214)
T ss_dssp             SSEEEEE-SSS-EESSHHHHHHHHHH
T ss_pred             cCCEECCCCCCcccCChHHHHHHHhh
Confidence            455689999999998 6666777653


No 148
>PF01096 TFIIS_C:  Transcription factor S-II (TFIIS);  InterPro: IPR001222 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents a zinc finger motif found in transcription factor IIs (TFIIS). In eukaryotes the initiation of transcription of protein encoding genes by polymerase II (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least eight different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, -IIH and -IIS []. During mRNA elongation, Pol II can encounter DNA sequences that cause reverse movement of the enzyme. Such backtracking involves extrusion of the RNA 3'-end into the pore, and can lead to transcriptional arrest. Escape from arrest requires cleavage of the extruded RNA with the help of TFIIS, which induces mRNA cleavage by enhancing the intrinsic nuclease activity of RNA polymerase (Pol) II, past template-encoded pause sites []. TFIIS extends from the polymerase surface via a pore to the internal active site. Two essential and invariant acidic residues in a TFIIS loop complement the Pol II active site and could position a metal ion and a water molecule for hydrolytic RNA cleavage. TFIIS also induces extensive structural changes in Pol II that would realign nucleic acids in the active centre.  TFIIS is a protein of about 300 amino acids. It contains three regions: a variable N-terminal domain not required for TFIIS activity; a conserved central domain required for Pol II binding; and a conserved C-terminal C4-type zinc finger essential for RNA cleavage. The zinc finger folds in a conformation termed a zinc ribbon [] characterised by a three-stranded antiparallel beta-sheet and two beta-hairpins. A backbone model for Pol II-TFIIS complex was obtained from X-ray analysis. It shows that a beta hairpin protrudes from the zinc finger and complements the pol II active site [].  Some viral proteins also contain the TFIIS zinc ribbon C-terminal domain. The Vaccinia virus protein, unlike its eukaryotic homologue, is an integral RNA polymerase subunit rather than a readily separable transcription factor []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding, 0006351 transcription, DNA-dependent; PDB: 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I 3I4M_I ....
Probab=22.76  E-value=2.7  Score=20.62  Aligned_cols=9  Identities=22%  Similarity=0.512  Sum_probs=4.9

Q ss_pred             ccCCCCCCC
Q 032068           88 VTCDFCEFP   96 (148)
Q Consensus        88 ~~C~~C~~~   96 (148)
                      |.|..|+..
T Consensus        29 y~C~~C~~~   37 (39)
T PF01096_consen   29 YVCCNCGHR   37 (39)
T ss_dssp             EEESSSTEE
T ss_pred             EEeCCCCCe
Confidence            556666543


No 149
>PF07800 DUF1644:  Protein of unknown function (DUF1644);  InterPro: IPR012866 This family consists of sequences found in a number of hypothetical plant proteins of unknown function. The region of interest contains nine highly conserved cysteine residues and is approximately 160 amino acids in length, which probably represent a zinc-binding domain. 
Probab=22.71  E-value=72  Score=21.14  Aligned_cols=48  Identities=17%  Similarity=0.394  Sum_probs=29.9

Q ss_pred             CccCccchhccch-HHHHhhHhhcCCCCcccCCC--CCCCCChhhHHHHHHH
Q 032068           60 PVACSQCSETMER-EILAIHKGENCPQRIVTCDF--CEFPLPAVDLAEHQVA  108 (148)
Q Consensus        60 ~~~C~~C~~~~~~-~~l~~h~~~~~~~~~~~C~~--C~~~f~~~~l~~H~~~  108 (148)
                      ...|+.|.-.+.. .... ..+.+-..|+-.|..  |.+......|..|.+.
T Consensus        80 ~L~CPLCRG~V~GWtvve-~AR~~LN~K~RsC~~e~C~F~GtY~eLrKHar~  130 (162)
T PF07800_consen   80 ELACPLCRGEVKGWTVVE-PARRFLNAKKRSCSQESCSFSGTYSELRKHARS  130 (162)
T ss_pred             cccCccccCceeceEEch-HHHHHhccCCccCcccccccccCHHHHHHHHHh
Confidence            6678888655543 2221 133334445556654  8777678899999876


No 150
>KOG3408 consensus U1-like Zn-finger-containing protein, probabl erole in RNA processing/splicing [RNA processing and modification]
Probab=22.15  E-value=46  Score=20.94  Aligned_cols=22  Identities=18%  Similarity=0.360  Sum_probs=18.1

Q ss_pred             CccCccchhccch-HHHHhhHhh
Q 032068           60 PVACSQCSETMER-EILAIHKGE   81 (148)
Q Consensus        60 ~~~C~~C~~~~~~-~~l~~h~~~   81 (148)
                      -|.|..|.+-|.. +.|..|.++
T Consensus        57 qfyCi~CaRyFi~~~~l~~H~kt   79 (129)
T KOG3408|consen   57 QFYCIECARYFIDAKALKTHFKT   79 (129)
T ss_pred             eeehhhhhhhhcchHHHHHHHhc
Confidence            5789999999987 888888764


No 151
>PF04423 Rad50_zn_hook:  Rad50 zinc hook motif;  InterPro: IPR007517 The Mre11 complex (Mre11 Rad50 Nbs1) is central to chromosomal maintenance and functions in homologous recombination, telomere maintenance and sister chromatid association. The Rad50 coiled-coil region contains a dimer interface at the apex of the coiled coils in which pairs of conserved Cys-X-X-Cys motifs form interlocking hooks that bind one Zn ion. This alignment includes the zinc hook motif and a short stretch of coiled-coil on either side.; GO: 0004518 nuclease activity, 0005524 ATP binding, 0008270 zinc ion binding, 0006281 DNA repair; PDB: 1L8D_B.
Probab=21.99  E-value=30  Score=18.01  Aligned_cols=12  Identities=25%  Similarity=0.794  Sum_probs=6.7

Q ss_pred             cccccCCccChh
Q 032068           37 RCKVCGDMVPRK   48 (148)
Q Consensus        37 ~C~~C~~~f~~~   48 (148)
                      .||.|++.|...
T Consensus        22 ~CPlC~r~l~~e   33 (54)
T PF04423_consen   22 CCPLCGRPLDEE   33 (54)
T ss_dssp             E-TTT--EE-HH
T ss_pred             cCCCCCCCCCHH
Confidence            799999998766


No 152
>PLN00104 MYST -like histone acetyltransferase; Provisional
Probab=20.78  E-value=52  Score=25.70  Aligned_cols=37  Identities=19%  Similarity=0.307  Sum_probs=29.3

Q ss_pred             CCcccCCCCCCCCC-hhhHHHHHHHhcccccCCcchhh
Q 032068           85 QRIVTCDFCEFPLP-AVDLAEHQVAIVNNFENGKTIAD  121 (148)
Q Consensus        85 ~~~~~C~~C~~~f~-~~~l~~H~~~c~~~~~~~~~~~~  121 (148)
                      ...|.|++|=+-|. ...|..|+..|...-.++..+.+
T Consensus       196 ~~lyiCe~Cl~y~~~~~~~~~H~~~C~~~~PPG~eIYr  233 (450)
T PLN00104        196 SKLYFCEFCLKFMKRKEQLQRHMKKCDLKHPPGDEIYR  233 (450)
T ss_pred             CeEEEchhhhhhhcCHHHHHHHHhcCCCCCCCcCeEEE
Confidence            45788999988888 89999999999777666665554


No 153
>COG1327 Predicted transcriptional regulator, consists of a Zn-ribbon and ATP-cone domains [Transcription]
Probab=20.71  E-value=30  Score=22.68  Aligned_cols=13  Identities=23%  Similarity=0.636  Sum_probs=10.3

Q ss_pred             CccCccchhccch
Q 032068           60 PVACSQCSETMER   72 (148)
Q Consensus        60 ~~~C~~C~~~~~~   72 (148)
                      .-.|+.|+..|..
T Consensus        28 RReC~~C~~RFTT   40 (156)
T COG1327          28 RRECLECGERFTT   40 (156)
T ss_pred             hhcccccccccch
Confidence            5679999988865


No 154
>KOG3002 consensus Zn finger protein [General function prediction only]
Probab=20.68  E-value=1.8e+02  Score=21.42  Aligned_cols=96  Identities=20%  Similarity=0.407  Sum_probs=54.2

Q ss_pred             ccCCCCCCCCCcchHHH---H-H-hhcC-CCCccccccCCccChh-hH-HHHHHcCCCCccC----ccchhccch-HHHH
Q 032068           10 KICSHCDRAIPSSNIDL---H-F-AHCS-RNLERCKVCGDMVPRK-YA-EEHFLNTHAPVAC----SQCSETMER-EILA   76 (148)
Q Consensus        10 ~~C~~C~~~~~~~~l~~---H-~-~~~~-~~~~~C~~C~~~f~~~-~l-~~h~~~~h~~~~C----~~C~~~~~~-~~l~   76 (148)
                      ..|+.|...+...-++-   | . ..+- +....||.|...+... .+ ..+... ...+.|    .+|.+.|.- .. .
T Consensus        49 leCPvC~~~l~~Pi~QC~nGHlaCssC~~~~~~~CP~Cr~~~g~~R~~amEkV~e-~~~vpC~~~~~GC~~~~~Y~~~-~  126 (299)
T KOG3002|consen   49 LDCPVCFNPLSPPIFQCDNGHLACSSCRTKVSNKCPTCRLPIGNIRCRAMEKVAE-AVLVPCKNAKLGCTKSFPYGEK-S  126 (299)
T ss_pred             ccCchhhccCcccceecCCCcEehhhhhhhhcccCCccccccccHHHHHHHHHHH-hceecccccccCCceeeccccc-c
Confidence            46777766654333332   3 2 2232 6677888888776633 11 111111 123444    578888876 44 6


Q ss_pred             hhHhhcCCCCcccCCC----CCCCCChhhHHHHHHH
Q 032068           77 IHKGENCPQRIVTCDF----CEFPLPAVDLAEHQVA  108 (148)
Q Consensus        77 ~h~~~~~~~~~~~C~~----C~~~f~~~~l~~H~~~  108 (148)
                      .|++. +.-.|+.||.    |...-....|..|.+.
T Consensus       127 ~HE~~-C~f~~~~CP~p~~~C~~~G~~~~l~~H~~~  161 (299)
T KOG3002|consen  127 KHEKV-CEFRPCSCPVPGAECKYTGSYKDLYAHLND  161 (299)
T ss_pred             ccccc-cccCCcCCCCCcccCCccCcHHHHHHHHHh
Confidence            77764 4448888876    4444446778888765


No 155
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=20.56  E-value=59  Score=24.13  Aligned_cols=50  Identities=22%  Similarity=0.615  Sum_probs=37.4

Q ss_pred             CCccCccchhccch-HHHHhhHhhcCCCCcccCCC--CCCCCC-hhhHHHHHHH
Q 032068           59 APVACSQCSETMER-EILAIHKGENCPQRIVTCDF--CEFPLP-AVDLAEHQVA  108 (148)
Q Consensus        59 ~~~~C~~C~~~~~~-~~l~~h~~~~~~~~~~~C~~--C~~~f~-~~~l~~H~~~  108 (148)
                      ..+.|+.|...|.. .....|.+.+.+++++.|..  |...+. ...+..|...
T Consensus        32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   85 (467)
T COG5048          32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRT   85 (467)
T ss_pred             chhhcccccccccccchhhhhcccccccCCccccccccccccCCcchhhhhccc
Confidence            36788899888877 88888899999999999987  444455 5555555543


No 156
>KOG2612 consensus Predicted integral membrane protein [Function unknown]
Probab=20.37  E-value=65  Score=18.96  Aligned_cols=22  Identities=23%  Similarity=0.495  Sum_probs=16.7

Q ss_pred             cCCCCCCCCChhhHHHHHHHhc
Q 032068           89 TCDFCEFPLPAVDLAEHQVAIV  110 (148)
Q Consensus        89 ~C~~C~~~f~~~~l~~H~~~c~  110 (148)
                      .|+.|++.+.-.-+..|+..|.
T Consensus        76 hCeNC~RdVaAaR~AaHLekCl   97 (103)
T KOG2612|consen   76 HCENCDRDVAAARFAAHLEKCL   97 (103)
T ss_pred             cCCCCccHHHHHHHHHHHHHHH
Confidence            5677888777667788888774


No 157
>PF01286 XPA_N:  XPA protein N-terminal;  InterPro: IPR022652 Xeroderma pigmentosum (XP) [] is a human autosomal recessive disease, characterised by a high incidence of sunlight-induced skin cancer. Skin cells of individual's with this condition are hypersensitive to ultraviolet light, due to defects in the incision step of DNA excision repair. There are a minimum of seven genetic complementation groups involved in this pathway: XP-A to XP-G. XP-A is the most severe form of the disease and is due to defects in a 30 kDa nuclear protein called XPA (or XPAC) []. The sequence of the XPA protein is conserved from higher eukaryotes [] to yeast (gene RAD14) []. XPA is a hydrophilic protein of 247 to 296 amino-acid residues which has a C4-type zinc finger motif in its central section. This entry contains the zinc-finger containing region in the XPA protein. It is found N-terminal to PF05181 from PFAM ; PDB: 1D4U_A 1XPA_A.
Probab=20.30  E-value=12  Score=17.88  Aligned_cols=16  Identities=19%  Similarity=0.528  Sum_probs=7.5

Q ss_pred             cCccchhccchHHHHh
Q 032068           62 ACSQCSETMEREILAI   77 (148)
Q Consensus        62 ~C~~C~~~~~~~~l~~   77 (148)
                      .|..|++.|..+.|..
T Consensus         5 ~C~eC~~~f~dSyL~~   20 (34)
T PF01286_consen    5 KCDECGKPFMDSYLLN   20 (34)
T ss_dssp             E-TTT--EES-SSCCC
T ss_pred             hHhHhCCHHHHHHHHH
Confidence            5777777777655443


No 158
>PF07295 DUF1451:  Protein of unknown function (DUF1451);  InterPro: IPR009912 This family consists of several hypothetical bacterial proteins of around 160 residues in length. Members of this family contain four highly conserved cysteine resides toward the C-terminal region of the protein. The function of this family is unknown.
Probab=20.25  E-value=35  Score=22.20  Aligned_cols=31  Identities=19%  Similarity=0.590  Sum_probs=16.5

Q ss_pred             CccCccchhccchHHHHhhHhhcCCCCcccCCCCCCC-CCh
Q 032068           60 PVACSQCSETMEREILAIHKGENCPQRIVTCDFCEFP-LPA   99 (148)
Q Consensus        60 ~~~C~~C~~~~~~~~l~~h~~~~~~~~~~~C~~C~~~-f~~   99 (148)
                      .+.|..||.....        .+. ...-.|+.|+.. |.+
T Consensus       112 ~l~C~~Cg~~~~~--------~~~-~~l~~Cp~C~~~~F~R  143 (146)
T PF07295_consen  112 TLVCENCGHEVEL--------THP-ERLPPCPKCGHTEFTR  143 (146)
T ss_pred             eEecccCCCEEEe--------cCC-CcCCCCCCCCCCeeee
Confidence            5677777744322        112 223368888754 543


No 159
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=20.13  E-value=57  Score=24.19  Aligned_cols=12  Identities=17%  Similarity=0.600  Sum_probs=5.7

Q ss_pred             cccCCCCCCCCC
Q 032068           87 IVTCDFCEFPLP   98 (148)
Q Consensus        87 ~~~C~~C~~~f~   98 (148)
                      .|.|+.|...|-
T Consensus       388 rY~Ce~CK~~FC  399 (421)
T COG5151         388 RYQCELCKSTFC  399 (421)
T ss_pred             ceechhhhhhhh
Confidence            455555544443


Done!