Query         psy2838
Match_columns 458
No_of_seqs    673 out of 2761
Neff          9.3 
Searched_HMMs 46136
Date          Fri Aug 16 18:11:07 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy2838.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2838hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1074|consensus               99.9 1.6E-28 3.5E-33  245.4   2.6   59  321-379   880-938 (958)
  2 KOG2462|consensus               99.9 1.8E-27 3.8E-32  211.6   6.0  133  237-371   131-266 (279)
  3 KOG2462|consensus               99.9 1.9E-27 4.2E-32  211.4   5.4  135  178-314   128-265 (279)
  4 KOG1074|consensus               99.9 1.7E-26 3.7E-31  231.0   3.6  141  209-349   606-936 (958)
  5 KOG3608|consensus               99.9 1.6E-24 3.4E-29  197.1   6.9  225  147-374   131-378 (467)
  6 KOG3608|consensus               99.9 1.3E-23 2.7E-28  191.3   9.0  208  179-389   133-365 (467)
  7 KOG3623|consensus               99.9 3.4E-24 7.4E-29  210.9   3.8  109  149-257   209-330 (1007)
  8 KOG3623|consensus               99.9 2.6E-22 5.6E-27  197.8   5.1  105  180-284   210-329 (1007)
  9 KOG3576|consensus               99.7 2.1E-18 4.5E-23  145.8   2.7  115  262-376   115-240 (267)
 10 KOG3576|consensus               99.7 3.1E-17 6.6E-22  138.8   2.3   86  177-262   114-199 (267)
 11 PLN03086 PRLI-interacting fact  99.4 1.7E-12 3.6E-17  130.4  10.4  146  209-373   408-565 (567)
 12 PLN03086 PRLI-interacting fact  99.4 6.2E-13 1.3E-17  133.5   7.0  148  179-345   406-565 (567)
 13 KOG3993|consensus               99.1 5.2E-12 1.1E-16  118.7  -0.9  195  180-374   267-484 (500)
 14 PF01352 KRAB:  KRAB box;  Inte  99.1 1.2E-12 2.7E-17   83.8  -4.1   36   43-78      1-37  (41)
 15 PHA00733 hypothetical protein   99.1 1.7E-10 3.8E-15   94.9   4.9   80  292-374    40-125 (128)
 16 KOG3993|consensus               99.0 1.9E-11 4.2E-16  114.9  -2.8  192  150-345   267-483 (500)
 17 PHA00733 hypothetical protein   99.0 3.5E-10 7.5E-15   93.2   3.9   81  263-345    39-124 (128)
 18 PHA02768 hypothetical protein;  98.8 3.2E-09 6.9E-14   71.8   1.6   42  293-336     6-47  (55)
 19 PHA02768 hypothetical protein;  98.7 9.2E-09   2E-13   69.6   2.4   43  209-253     6-48  (55)
 20 PF13465 zf-H2C2_2:  Zinc-finge  98.4   2E-07 4.2E-12   53.8   1.6   26  307-332     1-26  (26)
 21 PF13465 zf-H2C2_2:  Zinc-finge  98.3 5.7E-07 1.2E-11   51.9   2.5   25  223-247     1-25  (26)
 22 PF06524 NOA36:  NOA36 protein;  98.1 2.6E-06 5.6E-11   75.7   4.8   95  259-372   137-233 (314)
 23 PHA00732 hypothetical protein   98.1 1.6E-06 3.4E-11   64.7   2.4   45  293-342     2-46  (79)
 24 PHA00616 hypothetical protein   98.1 1.5E-06 3.2E-11   56.0   1.9   31  349-379     2-32  (44)
 25 PHA00616 hypothetical protein   98.1 1.6E-06 3.6E-11   55.7   1.4   32  293-324     2-33  (44)
 26 PF05605 zf-Di19:  Drought indu  98.0 7.6E-06 1.7E-10   56.6   4.4   51  320-373     2-54  (54)
 27 PHA00732 hypothetical protein   98.0 4.4E-06 9.5E-11   62.3   2.7   44  209-257     2-45  (79)
 28 PF05605 zf-Di19:  Drought indu  97.9 1.7E-05 3.6E-10   54.9   4.4   50  180-232     2-53  (54)
 29 KOG2231|consensus               97.6 0.00023 5.1E-09   73.2   9.5   48  293-344   183-236 (669)
 30 PF06524 NOA36:  NOA36 protein;  97.6 0.00012 2.5E-09   65.5   5.8   85  286-393   136-226 (314)
 31 PF12756 zf-C2H2_2:  C2H2 type   97.5 4.8E-05   1E-09   60.1   2.5   74  294-372     1-74  (100)
 32 PF00096 zf-C2H2:  Zinc finger,  97.5 7.6E-05 1.7E-09   41.6   2.0   21  350-370     2-22  (23)
 33 KOG1146|consensus               97.5 1.2E-05 2.5E-10   86.9  -2.7   49  321-369   735-790 (1406)
 34 PF13894 zf-C2H2_4:  C2H2-type   97.4 0.00012 2.7E-09   41.0   2.3   24  349-372     1-24  (24)
 35 PF12756 zf-C2H2_2:  C2H2 type   97.3 0.00014   3E-09   57.4   2.8   22  264-285    50-71  (100)
 36 PF13894 zf-C2H2_4:  C2H2-type   97.3 0.00012 2.7E-09   41.0   1.7   23  321-343     1-23  (24)
 37 COG5236 Uncharacterized conser  97.3 0.00034 7.5E-09   65.0   5.2   78  293-374   221-307 (493)
 38 KOG2231|consensus               97.3 0.00041   9E-09   71.4   5.7  104  210-324   117-240 (669)
 39 COG5189 SFP1 Putative transcri  97.1 0.00013 2.9E-09   66.9   0.4   23  317-339   395-417 (423)
 40 KOG1146|consensus               97.0 0.00071 1.5E-08   73.7   5.0  106  153-259   439-612 (1406)
 41 COG5236 Uncharacterized conser  97.0  0.0015 3.2E-08   60.9   5.8  128  209-346   152-307 (493)
 42 PF13912 zf-C2H2_6:  C2H2-type   97.0 0.00042 9.1E-09   40.2   1.5   20  321-340     2-21  (27)
 43 PF13912 zf-C2H2_6:  C2H2-type   96.9 0.00058 1.3E-08   39.6   1.6   24  180-203     1-24  (27)
 44 PF13909 zf-H2C2_5:  C2H2-type   96.6  0.0013 2.8E-08   36.9   1.7   24  349-373     1-24  (24)
 45 COG5189 SFP1 Putative transcri  96.6  0.0011 2.4E-08   61.1   1.8   70  178-257   347-419 (423)
 46 KOG2482|consensus               96.5  0.0036 7.9E-08   58.4   4.8   51  321-371   280-357 (423)
 47 KOG2785|consensus               96.2  0.0073 1.6E-07   57.6   5.3   52  320-371   166-243 (390)
 48 PF09237 GAGA:  GAGA factor;  I  96.1  0.0042 9.1E-08   41.0   2.1   28  348-375    24-51  (54)
 49 PF09237 GAGA:  GAGA factor;  I  96.1  0.0029 6.2E-08   41.8   1.3   31  317-347    21-51  (54)
 50 smart00355 ZnF_C2H2 zinc finge  96.1  0.0057 1.2E-07   34.6   2.5   23  349-371     1-23  (26)
 51 COG5048 FOG: Zn-finger [Genera  96.0  0.0019 4.2E-08   65.3   0.2  144  208-351   289-449 (467)
 52 PF13909 zf-H2C2_5:  C2H2-type   95.9  0.0042   9E-08   34.8   1.4   23  321-344     1-23  (24)
 53 KOG2482|consensus               95.9   0.017 3.6E-07   54.1   5.9   22  321-342   335-356 (423)
 54 smart00355 ZnF_C2H2 zinc finge  95.9  0.0063 1.4E-07   34.4   2.1   20  322-341     2-21  (26)
 55 KOG2785|consensus               95.6   0.028 6.1E-07   53.7   6.4   50  293-342   167-242 (390)
 56 PF12874 zf-met:  Zinc-finger o  95.6   0.008 1.7E-07   34.0   1.8   20  294-313     2-21  (25)
 57 COG5048 FOG: Zn-finger [Genera  95.6   0.002 4.4E-08   65.1  -1.3  159  179-338   288-463 (467)
 58 PF12874 zf-met:  Zinc-finger o  95.6  0.0058 1.3E-07   34.6   1.1   22  349-370     1-22  (25)
 59 PRK04860 hypothetical protein;  95.2   0.012 2.6E-07   50.4   2.2   37  292-332   119-155 (160)
 60 PF12171 zf-C2H2_jaz:  Zinc-fin  95.0  0.0062 1.3E-07   35.3  -0.2   21  349-369     2-22  (27)
 61 PRK04860 hypothetical protein;  94.9   0.017 3.7E-07   49.5   2.2   26  222-247   129-154 (160)
 62 PF12171 zf-C2H2_jaz:  Zinc-fin  94.4   0.017 3.7E-07   33.4   0.7   21  293-313     2-22  (27)
 63 KOG4173|consensus               94.2   0.021 4.6E-07   49.5   1.2   29  317-345   141-171 (253)
 64 KOG2893|consensus               94.1    0.02 4.4E-07   50.7   1.0   49  209-261    11-59  (341)
 65 KOG4173|consensus               93.1   0.031 6.8E-07   48.5   0.4   52  178-232    77-130 (253)
 66 cd07765 KRAB_A-box KRAB (Krupp  93.1  0.0079 1.7E-07   36.2  -2.6   29   43-71      1-29  (40)
 67 TIGR00622 ssl1 transcription f  92.5    0.14 3.1E-06   40.6   3.3   23  293-315    82-104 (112)
 68 KOG2893|consensus               92.5   0.028 6.1E-07   49.9  -0.7   46  295-344    13-58  (341)
 69 PF12013 DUF3505:  Protein of u  91.8    0.26 5.6E-06   39.5   4.2   25  349-373    81-109 (109)
 70 smart00451 ZnF_U1 U1-like zinc  91.2    0.17 3.6E-06   31.1   1.9   23  348-370     3-25  (35)
 71 PF13913 zf-C2HC_2:  zinc-finge  90.9    0.21 4.6E-06   28.2   2.0   19  350-369     4-22  (25)
 72 PF13913 zf-C2HC_2:  zinc-finge  90.4    0.25 5.5E-06   27.9   2.0   18  210-228     4-21  (25)
 73 PF12013 DUF3505:  Protein of u  89.5    0.45 9.7E-06   38.1   3.7   24  209-233    12-35  (109)
 74 PF09538 FYDLN_acid:  Protein o  89.3    0.74 1.6E-05   36.6   4.6   29  321-360    10-38  (108)
 75 smart00451 ZnF_U1 U1-like zinc  88.0    0.43 9.4E-06   29.2   2.1   22  293-314     4-25  (35)
 76 COG4049 Uncharacterized protei  87.6    0.25 5.3E-06   33.3   0.8   27  347-373    16-42  (65)
 77 cd00350 rubredoxin_like Rubred  87.2    0.36 7.7E-06   29.4   1.3   11  293-303     2-12  (33)
 78 TIGR00622 ssl1 transcription f  86.0     1.3 2.8E-05   35.2   4.2   45  238-284    57-101 (112)
 79 COG4049 Uncharacterized protei  85.5    0.34 7.4E-06   32.7   0.6   30  316-345    13-42  (65)
 80 KOG2186|consensus               85.0    0.36 7.7E-06   43.7   0.7   46  209-257     4-49  (276)
 81 PF09538 FYDLN_acid:  Protein o  84.2    0.57 1.2E-05   37.2   1.5   34  289-333     6-39  (108)
 82 KOG0943|consensus               82.0     1.5 3.3E-05   48.2   3.9   10  178-187  1255-1264(3015)
 83 PF02892 zf-BED:  BED zinc fing  80.8     1.1 2.3E-05   29.3   1.6   24  349-372    17-44  (45)
 84 KOG1832|consensus               80.7    0.72 1.6E-05   49.2   1.1   14  352-365  1303-1316(1516)
 85 PF09986 DUF2225:  Uncharacteri  78.7    0.47   1E-05   43.0  -0.9   13  293-305     6-18  (214)
 86 KOG2186|consensus               78.7       1 2.2E-05   40.8   1.3   55  264-321     3-57  (276)
 87 TIGR02300 FYDLN_acid conserved  74.5     1.8   4E-05   35.0   1.5   14  348-361    26-39  (129)
 88 PF09986 DUF2225:  Uncharacteri  73.4     1.7 3.8E-05   39.3   1.3   23  235-257     4-26  (214)
 89 PF04959 ARS2:  Arsenite-resist  73.2     1.5 3.2E-05   39.5   0.8   26  320-345    77-102 (214)
 90 cd00729 rubredoxin_SM Rubredox  72.6     2.2 4.7E-05   26.1   1.2   10  293-302     3-12  (34)
 91 PF13717 zinc_ribbon_4:  zinc-r  72.6     3.3 7.1E-05   25.7   2.0   10  238-247     4-13  (36)
 92 PF13719 zinc_ribbon_5:  zinc-r  72.4     3.2   7E-05   25.9   2.0    9  239-247     5-13  (37)
 93 TIGR02300 FYDLN_acid conserved  72.4     7.5 0.00016   31.5   4.5   31  346-387     7-37  (129)
 94 PF10571 UPF0547:  Uncharacteri  72.3     2.5 5.5E-05   24.1   1.3   10  350-359    16-25  (26)
 95 COG2888 Predicted Zn-ribbon RN  71.7     3.1 6.7E-05   28.7   1.9    9  348-356    50-58  (61)
 96 KOG4124|consensus               69.8    0.74 1.6E-05   43.6  -1.9   51  206-257   176-233 (442)
 97 PF02892 zf-BED:  BED zinc fing  69.4     3.3 7.2E-05   26.9   1.7   24  319-342    15-42  (45)
 98 smart00531 TFIIE Transcription  69.3     5.2 0.00011   33.9   3.2   11  293-303   100-110 (147)
 99 TIGR02098 MJ0042_CXXC MJ0042 f  69.2     3.7 8.1E-05   25.6   1.8    9  238-246     4-12  (38)
100 KOG1189|consensus               69.0     4.8  0.0001   42.4   3.4   20   44-63    434-453 (960)
101 PRK00464 nrdR transcriptional   67.8     2.4 5.1E-05   36.1   0.8   16  237-252    29-44  (154)
102 PRK00464 nrdR transcriptional   67.5     1.2 2.6E-05   37.9  -1.0   12  349-360    29-40  (154)
103 PRK14890 putative Zn-ribbon RN  66.8     3.7 8.1E-05   28.4   1.5    9  348-356    48-56  (59)
104 PF02176 zf-TRAF:  TRAF-type zi  65.6     5.1 0.00011   27.8   2.1    6  264-269    38-43  (60)
105 smart00659 RPOLCX RNA polymera  64.6     3.4 7.4E-05   26.9   0.9   11  293-303     3-13  (44)
106 PF12907 zf-met2:  Zinc-binding  64.2     5.2 0.00011   25.4   1.7   12  334-345    18-29  (40)
107 KOG2807|consensus               64.2     5.5 0.00012   37.7   2.5   85  236-355   276-374 (378)
108 TIGR00373 conserved hypothetic  64.0     6.7 0.00014   33.7   2.9   16  293-308   110-125 (158)
109 smart00614 ZnF_BED BED zinc fi  64.0     5.2 0.00011   26.8   1.8   25  349-373    19-48  (50)
110 PF02176 zf-TRAF:  TRAF-type zi  61.8     2.9 6.3E-05   29.1   0.2   24  335-358    25-52  (60)
111 COG1592 Rubrerythrin [Energy p  61.2       5 0.00011   34.5   1.6   23  292-327   134-156 (166)
112 PF15269 zf-C2H2_7:  Zinc-finge  60.0     6.9 0.00015   25.2   1.6   23  349-371    21-43  (54)
113 PF09723 Zn-ribbon_8:  Zinc rib  59.5     2.9 6.4E-05   26.9  -0.1   11  293-303     6-16  (42)
114 TIGR00373 conserved hypothetic  58.9     8.7 0.00019   33.0   2.7   16  209-224   110-125 (158)
115 KOG1832|consensus               58.5     3.5 7.7E-05   44.2   0.3    9  326-334  1305-1313(1516)
116 PHA00626 hypothetical protein   58.5     5.8 0.00013   27.0   1.2   10  265-274    24-33  (59)
117 PRK06266 transcription initiat  58.4     7.9 0.00017   33.9   2.4   15  293-307   118-132 (178)
118 PRK00398 rpoP DNA-directed RNA  58.1     6.8 0.00015   25.7   1.5    9  237-245     4-12  (46)
119 TIGR02605 CxxC_CxxC_SSSS putat  56.3     5.5 0.00012   26.8   0.9   10  237-246     6-15  (52)
120 smart00834 CxxC_CXXC_SSSS Puta  55.5     5.8 0.00013   25.1   0.8   11  237-247     6-16  (41)
121 PF15269 zf-C2H2_7:  Zinc-finge  55.3     8.8 0.00019   24.7   1.6   23  180-202    20-42  (54)
122 COG5151 SSL1 RNA polymerase II  55.1     3.8 8.2E-05   38.3  -0.1   74  263-341   321-409 (421)
123 PRK04023 DNA polymerase II lar  54.9      10 0.00022   41.7   2.9   24   67-97    408-431 (1121)
124 KOG1701|consensus               54.3     3.1 6.6E-05   40.8  -0.9   41  210-250   276-316 (468)
125 smart00734 ZnF_Rad18 Rad18-lik  54.2      12 0.00025   21.3   1.8   19  350-369     3-21  (26)
126 COG1996 RPC10 DNA-directed RNA  54.1     6.8 0.00015   26.1   1.0   10  293-302     7-16  (49)
127 PRK04023 DNA polymerase II lar  54.1      11 0.00024   41.4   3.1   11  179-189   625-635 (1121)
128 smart00531 TFIIE Transcription  53.8      16 0.00034   30.9   3.4   12  209-220   100-111 (147)
129 KOG1280|consensus               52.9      11 0.00023   36.1   2.4   26  293-318    80-105 (381)
130 PRK06266 transcription initiat  52.1      12 0.00026   32.8   2.5   13  209-221   118-130 (178)
131 COG1997 RPL43A Ribosomal prote  51.7     4.8  0.0001   30.2  -0.0   10  321-330    54-63  (89)
132 KOG3408|consensus               51.3     8.5 0.00018   30.8   1.3   26  347-372    56-81  (129)
133 PRK09678 DNA-binding transcrip  51.2     2.9 6.4E-05   30.5  -1.2   42  209-252     2-45  (72)
134 PF03604 DNA_RNApol_7kD:  DNA d  51.1      11 0.00024   22.7   1.4    7  209-215    18-24  (32)
135 KOG1701|consensus               50.7     1.8 3.8E-05   42.4  -3.1   12  180-191   302-313 (468)
136 PF05443 ROS_MUCR:  ROS/MUCR tr  47.7      10 0.00023   31.3   1.3   25  348-375    72-96  (132)
137 KOG2593|consensus               47.6      16 0.00036   36.1   2.9   33  293-328   129-161 (436)
138 PRK09678 DNA-binding transcrip  47.4     7.6 0.00017   28.3   0.5   39  237-277     2-42  (72)
139 PF09845 DUF2072:  Zn-ribbon co  46.9      31 0.00067   28.3   3.9   12  350-361     3-14  (131)
140 PF04959 ARS2:  Arsenite-resist  46.9      10 0.00023   34.1   1.3   28  292-319    77-104 (214)
141 KOG2807|consensus               46.1      25 0.00053   33.5   3.6   32  346-383   343-374 (378)
142 KOG1044|consensus               45.7     3.9 8.5E-05   41.7  -1.7   92  264-364   133-234 (670)
143 PF14353 CpXC:  CpXC protein     45.6     5.7 0.00012   32.7  -0.5   21  237-257    39-59  (128)
144 TIGR01384 TFS_arch transcripti  44.6     4.4 9.6E-05   32.0  -1.3   12  236-247    16-27  (104)
145 KOG0943|consensus               44.0      17 0.00037   40.7   2.5    9  149-157  1256-1264(3015)
146 PF05443 ROS_MUCR:  ROS/MUCR tr  44.0      14 0.00031   30.5   1.6   23  293-318    73-95  (132)
147 KOG1280|consensus               43.6      18 0.00038   34.7   2.3   36  180-215    79-116 (381)
148 KOG1834|consensus               43.1      15 0.00033   38.2   1.9    8  103-110   371-378 (952)
149 COG1592 Rubrerythrin [Energy p  42.7      15 0.00033   31.6   1.7   23  236-271   134-156 (166)
150 KOG4377|consensus               42.7      14 0.00031   36.1   1.6   21  353-373   408-428 (480)
151 KOG1189|consensus               42.4      18  0.0004   38.3   2.4   10   44-53    381-390 (960)
152 PF03344 Daxx:  Daxx Family;  I  41.9     8.5 0.00018   41.3   0.0    9   63-71     81-89  (713)
153 COG5188 PRP9 Splicing factor 3  41.5      16 0.00034   35.0   1.6   47  293-339   239-394 (470)
154 COG4957 Predicted transcriptio  41.1      15 0.00032   30.1   1.2   23  209-234    77-99  (148)
155 KOG4167|consensus               40.1     7.1 0.00015   41.0  -0.9   25  209-233   793-817 (907)
156 PF12760 Zn_Tnp_IS1595:  Transp  39.9      47   0.001   21.7   3.3    9  263-271    36-44  (46)
157 KOG3214|consensus               39.6      11 0.00024   29.0   0.3   40  319-362    22-61  (109)
158 PRK14714 DNA polymerase II lar  38.8      28  0.0006   39.5   3.2   25   67-98    428-452 (1337)
159 KOG4377|consensus               37.5      19 0.00042   35.2   1.6   24  321-344   402-427 (480)
160 PF08274 PhnA_Zn_Ribbon:  PhnA   37.4      14  0.0003   21.9   0.4    8  348-355    19-26  (30)
161 KOG4167|consensus               37.3     9.6 0.00021   40.1  -0.5   27  179-205   791-817 (907)
162 KOG1924|consensus               36.8      33 0.00071   36.7   3.2    6  104-109   644-649 (1102)
163 PF05285 SDA1:  SDA1;  InterPro  36.7      49  0.0011   32.1   4.3   13  362-374    25-37  (324)
164 KOG0262|consensus               36.1      31 0.00067   38.9   3.0   22   16-37    676-697 (1640)
165 COG4530 Uncharacterized protei  36.0      19 0.00041   28.3   1.0   11  348-358    26-36  (129)
166 PF14283 DUF4366:  Domain of un  35.9      11 0.00024   34.2  -0.3    6  278-283   108-113 (218)
167 COG1198 PriA Primosomal protei  35.8      22 0.00048   38.4   1.9   14  260-273   471-484 (730)
168 COG1198 PriA Primosomal protei  35.0      10 0.00022   40.8  -0.7   46  181-245   436-484 (730)
169 PF04889 Cwf_Cwc_15:  Cwf15/Cwc  35.0      28  0.0006   32.2   2.2    6  363-368    58-63  (244)
170 PF07975 C1_4:  TFIIH C1-like d  34.8      16 0.00035   24.7   0.4   20  293-312    22-41  (51)
171 PTZ00303 phosphatidylinositol   32.7      15 0.00033   39.0   0.2   11  182-192   462-472 (1374)
172 smart00440 ZnF_C2C2 C2C2 Zinc   32.6      20 0.00043   22.8   0.6   11  264-274    28-38  (40)
173 PRK14714 DNA polymerase II lar  32.4      36 0.00079   38.7   2.9   11  236-246   692-702 (1337)
174 PF05191 ADK_lid:  Adenylate ki  31.4      23 0.00049   22.0   0.7    9  238-246     3-11  (36)
175 KOG3540|consensus               31.2      31 0.00067   34.6   1.9    9  260-268   112-120 (615)
176 KOG2593|consensus               31.1      58  0.0013   32.4   3.7   34  209-245   129-162 (436)
177 PF01096 TFIIS_C:  Transcriptio  30.8      13 0.00029   23.4  -0.4   12  263-274    27-38  (39)
178 smart00349 KRAB krueppel assoc  30.6       6 0.00013   25.0  -2.3   29   43-71      1-29  (61)
179 COG3357 Predicted transcriptio  30.5      27 0.00059   26.5   1.1   12  209-220    59-70  (97)
180 PRK00432 30S ribosomal protein  30.5      18  0.0004   24.2   0.2   11  348-358    37-47  (50)
181 PF07754 DUF1610:  Domain of un  30.3      22 0.00048   19.8   0.4    8  320-327    16-23  (24)
182 PF06220 zf-U1:  U1 zinc finger  30.0      38 0.00083   21.2   1.6   12  348-359     3-14  (38)
183 COG1998 RPS31 Ribosomal protei  29.9      30 0.00064   23.0   1.0   10  348-357    37-46  (51)
184 smart00735 ZM ZASP-like motif.  29.5      27 0.00058   19.8   0.7   22    2-23      3-24  (26)
185 COG0068 HypF Hydrogenase matur  29.5      10 0.00022   40.0  -1.7   81  151-245   102-182 (750)
186 KOG3555|consensus               29.2      45 0.00098   32.1   2.5   25  215-239   155-179 (434)
187 KOG0717|consensus               29.0      32 0.00069   34.5   1.6   22  349-370   293-314 (508)
188 KOG2907|consensus               28.9      32  0.0007   27.2   1.3    8  183-190    28-35  (116)
189 COG5406 Nucleosome binding fac  28.4      42  0.0009   35.0   2.3   30   41-70    475-504 (1001)
190 KOG0782|consensus               28.4      21 0.00045   36.5   0.2   27  279-305   240-266 (1004)
191 KOG3214|consensus               28.3      26 0.00056   27.0   0.7    7  349-355    24-30  (109)
192 COG3091 SprT Zn-dependent meta  28.1      26 0.00056   29.5   0.7    8  348-355   140-147 (156)
193 PRK03824 hypA hydrogenase nick  27.3      36 0.00078   28.3   1.4   10  237-246    71-80  (135)
194 PF13878 zf-C2H2_3:  zinc-finge  27.0      65  0.0014   20.5   2.3   10  182-191    15-24  (41)
195 PF05495 zf-CHY:  CHY zinc fing  26.4      21 0.00046   25.9  -0.1   11  236-246    41-51  (71)
196 PF13821 DUF4187:  Domain of un  26.1      65  0.0014   22.1   2.3   18  349-366    28-45  (55)
197 COG4957 Predicted transcriptio  25.8      43 0.00092   27.5   1.5   22  293-317    77-98  (148)
198 TIGR01206 lysW lysine biosynth  25.4      25 0.00053   24.1   0.1    7  238-244    24-30  (54)
199 PRK03564 formate dehydrogenase  25.2      39 0.00085   32.4   1.5   49  179-244   186-234 (309)
200 COG5112 UFD2 U1-like Zn-finger  25.2      29 0.00062   27.1   0.4   24  348-371    55-78  (126)
201 PF13453 zf-TFIIB:  Transcripti  24.9      41 0.00088   21.3   1.1   12  351-362    22-33  (41)
202 PF08790 zf-LYAR:  LYAR-type C2  24.8      38 0.00082   19.7   0.8    9  238-246     2-10  (28)
203 COG2879 Uncharacterized small   24.7      69  0.0015   22.5   2.2   18  360-377    24-41  (65)
204 PF04780 DUF629:  Protein of un  24.4      52  0.0011   33.4   2.2   26  350-375    59-84  (466)
205 PF09416 UPF1_Zn_bind:  RNA hel  23.7      78  0.0017   26.8   2.8   10  235-244    59-68  (152)
206 PF14311 DUF4379:  Domain of un  23.0      47   0.001   22.6   1.2   10  293-302    29-38  (55)
207 COG5151 SSL1 RNA polymerase II  22.6      88  0.0019   29.6   3.1   91  209-315   309-411 (421)
208 PF04780 DUF629:  Protein of un  22.4      54  0.0012   33.3   1.9   25  321-345    58-82  (466)
209 COG3364 Zn-ribbon containing p  22.4      49  0.0011   25.7   1.2   14  180-193     2-15  (112)
210 PF15135 UPF0515:  Uncharacteri  22.3      78  0.0017   29.0   2.6   14  263-276   154-167 (278)
211 PF03115 Astro_capsid:  Astrovi  22.3      29 0.00063   37.7   0.0   54   12-71    152-205 (787)
212 KOG4124|consensus               21.4      25 0.00054   33.7  -0.6   21  178-199   211-231 (442)
213 COG3091 SprT Zn-dependent meta  21.3      45 0.00098   28.1   0.9   11  264-275   117-127 (156)
214 COG2331 Uncharacterized protei  21.1      41 0.00088   24.6   0.5   33  179-218    11-43  (82)
215 PF01363 FYVE:  FYVE zinc finge  21.0      60  0.0013   23.1   1.4    9  210-218    11-19  (69)
216 KOG0978|consensus               20.6      48   0.001   35.4   1.1   18  238-255   680-697 (698)
217 PF13451 zf-trcl:  Probable zin  20.5      45 0.00098   22.3   0.6   14  320-333     4-17  (49)
218 PRK12380 hydrogenase nickel in  20.5      49  0.0011   26.5   0.9   10  293-302    71-80  (113)
219 TIGR00100 hypA hydrogenase nic  20.2      47   0.001   26.7   0.8   10  293-302    71-80  (115)
220 COG1655 Uncharacterized protei  20.2      44 0.00095   30.2   0.6   18  235-252    18-35  (267)
221 TIGR00244 transcriptional regu  20.1      49  0.0011   27.8   0.9   15  237-251    29-43  (147)
222 PF01780 Ribosomal_L37ae:  Ribo  20.1      35 0.00075   26.1  -0.0   10  321-330    54-63  (90)
223 KOG3408|consensus               20.0      67  0.0015   25.9   1.6   27  176-202    53-79  (129)

No 1  
>KOG1074|consensus
Probab=99.94  E-value=1.6e-28  Score=245.40  Aligned_cols=59  Identities=17%  Similarity=0.277  Sum_probs=54.0

Q ss_pred             ccCccccccCCCHHHHHHHHhhccCCCccccCcCcccCCCHHHHHHHHHHhCCCCCccc
Q psy2838         321 YKCADCGRLFAQEHTLRTHTLSVHSGVRLTCRTCGLFCDSKRALTAHVREAHPGCEYKK  379 (458)
Q Consensus       321 ~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~~  379 (458)
                      ..|..|++.|...+.|..|+++|.+.++|.|.+|++.|..+.+|+.||.+|+...+...
T Consensus       880 h~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~fC~~aFttrgnLKvHMgtH~w~q~~sr  938 (958)
T KOG1074|consen  880 HVCNVCGKQFSSSAALEIHMRTHTGPKPFFCHFCEEAFTTRGNLKVHMGTHMWVQPPSR  938 (958)
T ss_pred             hhhccchhcccchHHHHHhhhcCCCCCCccchhhhhhhhhhhhhhhhhccccccCCCcc
Confidence            67999999999999999999999999999999999999999999999999987655443


No 2  
>KOG2462|consensus
Probab=99.94  E-value=1.8e-27  Score=211.64  Aligned_cols=133  Identities=26%  Similarity=0.555  Sum_probs=105.2

Q ss_pred             eecCccccccCCchhhhhccchhcc---CcceecCcccccccCHHHHHHHHHHhhcCceeecCccCcccCChhHHHHHHh
Q psy2838         237 FKCDHCVKSFNQKPNLISHIKSVHE---GIKYKCEECTKTFAQKQNLLSHVQAIHMGIKYTCEVCSKSYNQVTNLLSHMT  313 (458)
Q Consensus       237 ~~C~~C~~~f~~~~~L~~H~~~~~~---~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f~~~~~L~~H~~  313 (458)
                      |+|+.|++.+.+.++|.+|..+|..   .+.+.|.+|+|.|.+...|+.|+++|.  .++.|.+|||.|.+...|+.|+|
T Consensus       131 ~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~--l~c~C~iCGKaFSRPWLLQGHiR  208 (279)
T KOG2462|consen  131 YKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT--LPCECGICGKAFSRPWLLQGHIR  208 (279)
T ss_pred             eeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC--CCcccccccccccchHHhhcccc
Confidence            5555555555555555555554432   366888888888888888888886654  45888889999988888999999


Q ss_pred             hhccCccccCccccccCCCHHHHHHHHhhccCCCccccCcCcccCCCHHHHHHHHHHh
Q psy2838         314 TVHQAQKYKCADCGRLFAQEHTLRTHTLSVHSGVRLTCRTCGLFCDSKRALTAHVREA  371 (458)
Q Consensus       314 ~h~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f~~~~~L~~H~~~~  371 (458)
                      +|+|+|||.|+.|++.|..+++|+.||++|...++|+|..|+|.|..++.|.+|....
T Consensus       209 THTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES~  266 (279)
T KOG2462|consen  209 THTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSESA  266 (279)
T ss_pred             cccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhhc
Confidence            9999999999999999999999999998888888888999999999999999987643


No 3  
>KOG2462|consensus
Probab=99.94  E-value=1.9e-27  Score=211.38  Aligned_cols=135  Identities=27%  Similarity=0.564  Sum_probs=121.4

Q ss_pred             CCcccccccccccCCHHHHHHHHHhhcC---CCccccccCccccCChHHHHHHHHHhcCCceeecCccccccCCchhhhh
Q psy2838         178 VKKFPCDQCGKSFTLKQNLTYHVNSIHL---GVRIKCDFCDKTFAHKPNLIAHMNSAHNGVRFKCDHCVKSFNQKPNLIS  254 (458)
Q Consensus       178 ~k~~~C~~C~~~f~~~~~L~~H~~~~h~---~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~  254 (458)
                      ...|+|..|++.|.+.++|.+|.+.|-.   .+.+.|++|+|.|.+...|+.|+++|.  .+++|.+|||.|...+.|+-
T Consensus       128 ~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~--l~c~C~iCGKaFSRPWLLQG  205 (279)
T KOG2462|consen  128 HPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT--LPCECGICGKAFSRPWLLQG  205 (279)
T ss_pred             CCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC--CCcccccccccccchHHhhc
Confidence            4579999999999999999999986643   234899999999999999999999997  68999999999999999999


Q ss_pred             ccchhccCcceecCcccccccCHHHHHHHHHHhhcCceeecCccCcccCChhHHHHHHhh
Q psy2838         255 HIKSVHEGIKYKCEECTKTFAQKQNLLSHVQAIHMGIKYTCEVCSKSYNQVTNLLSHMTT  314 (458)
Q Consensus       255 H~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f~~~~~L~~H~~~  314 (458)
                      |+|+|+|+|||.|..|++.|..+++|+.||++|...++|+|..|+|+|...+.|.+|...
T Consensus       206 HiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES  265 (279)
T KOG2462|consen  206 HIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSES  265 (279)
T ss_pred             ccccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhh
Confidence            999999999999999999999999999999988888889999999999988888888764


No 4  
>KOG1074|consensus
Probab=99.92  E-value=1.7e-26  Score=230.96  Aligned_cols=141  Identities=23%  Similarity=0.447  Sum_probs=125.8

Q ss_pred             cccccCccccCChHHHHHHHHHhcCCceeecCccccccCCchhhhhccchhccC----cceecC---cccccccCHHHHH
Q psy2838         209 IKCDFCDKTFAHKPNLIAHMNSAHNGVRFKCDHCVKSFNQKPNLISHIKSVHEG----IKYKCE---ECTKTFAQKQNLL  281 (458)
Q Consensus       209 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~~~~~----~~~~C~---~C~~~f~~~~~L~  281 (458)
                      -.|-+|.++....+.|+.|.++|+|++||+|.+|++.|.++.+|+.||-.|...    ..|.|+   +|.+.|.+.-.|.
T Consensus       606 NqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~lp  685 (958)
T KOG1074|consen  606 NQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVTLP  685 (958)
T ss_pred             cceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhccccccccccc
Confidence            469999999999999999999999999999999999999999999999865543    458899   9999999999999


Q ss_pred             HHHHHhhcCc----------------------------------------------------------------------
Q psy2838         282 SHVQAIHMGI----------------------------------------------------------------------  291 (458)
Q Consensus       282 ~H~~~~h~~~----------------------------------------------------------------------  291 (458)
                      .|++.|.++.                                                                      
T Consensus       686 QhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~~~tp~~~e~~~~~  765 (958)
T KOG1074|consen  686 QHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEELDVTPPPPENSCGR  765 (958)
T ss_pred             ceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCcccccccccccccccCCCcccccccc
Confidence            9986654211                                                                      


Q ss_pred             --------------------------------------------------------------------------------
Q psy2838         292 --------------------------------------------------------------------------------  291 (458)
Q Consensus       292 --------------------------------------------------------------------------------  291 (458)
                                                                                                      
T Consensus       766 ~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l~eg~~t~~  845 (958)
T KOG1074|consen  766 ELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSMLNEGLATKT  845 (958)
T ss_pred             ccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhccccccccccccccc
Confidence                                                                                            


Q ss_pred             ---------------------------------eeecCccCcccCChhHHHHHHhhhccCccccCccccccCCCHHHHHH
Q psy2838         292 ---------------------------------KYTCEVCSKSYNQVTNLLSHMTTVHQAQKYKCADCGRLFAQEHTLRT  338 (458)
Q Consensus       292 ---------------------------------~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~  338 (458)
                                                       +..|.+|++.|...+.|..|+++|+++|||.|.+|++.|..+.+|+.
T Consensus       846 n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~fC~~aFttrgnLKv  925 (958)
T KOG1074|consen  846 NEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHTGPKPFFCHFCEEAFTTRGNLKV  925 (958)
T ss_pred             ccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCCCCCCccchhhhhhhhhhhhhhh
Confidence                                             26899999999999999999999999999999999999999999999


Q ss_pred             HHhhccCCCcc
Q psy2838         339 HTLSVHSGVRL  349 (458)
Q Consensus       339 H~~~~h~~~~~  349 (458)
                      ||.+|+-..++
T Consensus       926 HMgtH~w~q~~  936 (958)
T KOG1074|consen  926 HMGTHMWVQPP  936 (958)
T ss_pred             hhccccccCCC
Confidence            99988765543


No 5  
>KOG3608|consensus
Probab=99.90  E-value=1.6e-24  Score=197.11  Aligned_cols=225  Identities=24%  Similarity=0.443  Sum_probs=188.8

Q ss_pred             cCcccchh--hhhhhhcCCcccccccc----------CCCCCCC-Cccccc--ccccccCCHHHHHHHHHhhcCCCcccc
Q psy2838         147 AGLEFNRS--QFEKTFQNYSMTTTGHR----------GGKGPPV-KKFPCD--QCGKSFTLKQNLTYHVNSIHLGVRIKC  211 (458)
Q Consensus       147 ~~~~~~c~--~c~~~~~~~~~~~~~~~----------~~~~~~~-k~~~C~--~C~~~f~~~~~L~~H~~~~h~~~~~~C  211 (458)
                      .+..|.|.  .|+..|.+...+..|..          ......+ ..+.|.  .|-+.+.++..|+.|++.|.+++...|
T Consensus       131 ~g~~f~C~WedCe~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvAC  210 (467)
T KOG3608|consen  131 LGQNFRCGWEDCEREFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVAC  210 (467)
T ss_pred             chhhhccChhhcCCcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeEEec
Confidence            44556664  47777776544433311          1122222 236675  599999999999999998888888999


Q ss_pred             ccCccccCChHHHHHHHHHhc--CCceeecCccccccCCchhhhhccchhccCcceecCcccccccCHHHHHHHHHHhhc
Q psy2838         212 DFCDKTFAHKPNLIAHMNSAH--NGVRFKCDHCVKSFNQKPNLISHIKSVHEGIKYKCEECTKTFAQKQNLLSHVQAIHM  289 (458)
Q Consensus       212 ~~C~~~f~~~~~L~~H~~~h~--~~~~~~C~~C~~~f~~~~~L~~H~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~  289 (458)
                      +.||..|.++..|..|++..+  ...+|.|..|.+.|.+...|..|+..|.  .-|+|+.|..+....+.|.+|++..|.
T Consensus       211 p~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHv--n~ykCplCdmtc~~~ssL~~H~r~rHs  288 (467)
T KOG3608|consen  211 PHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHV--NCYKCPLCDMTCSSASSLTTHIRYRHS  288 (467)
T ss_pred             chHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhh--hcccccccccCCCChHHHHHHHHhhhc
Confidence            999999999999999998654  4568999999999999999999998665  469999999999999999999987775


Q ss_pred             C-ceeecCccCcccCChhHHHHHHhhhccCccccCcc--ccccCCCHHHHHHHHhhccCC---CccccCcCcccCCCHHH
Q psy2838         290 G-IKYTCEVCSKSYNQVTNLLSHMTTVHQAQKYKCAD--CGRLFAQEHTLRTHTLSVHSG---VRLTCRTCGLFCDSKRA  363 (458)
Q Consensus       290 ~-~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~--C~~~f~~~~~L~~H~~~~h~~---~~~~C~~C~~~f~~~~~  363 (458)
                      . ++|+|..|++.|.+.+.|.+|...|. +..|+|..  |.++|++...|++|++.+|.+   .+|.|..|.+.|++-.+
T Consensus       289 ~dkpfKCd~Cd~~c~~esdL~kH~~~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~  367 (467)
T KOG3608|consen  289 KDKPFKCDECDTRCVRESDLAKHVQVHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKS  367 (467)
T ss_pred             cCCCccccchhhhhccHHHHHHHHHhcc-ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccchh
Confidence            4 56999999999999999999999877 67799987  999999999999999999954   35889999999999999


Q ss_pred             HHHHHHHhCCC
Q psy2838         364 LTAHVREAHPG  374 (458)
Q Consensus       364 L~~H~~~~h~~  374 (458)
                      |.+|+++.|+-
T Consensus       368 L~~HL~kkH~f  378 (467)
T KOG3608|consen  368 LSAHLMKKHGF  378 (467)
T ss_pred             HHHHHHHhhcc
Confidence            99999999863


No 6  
>KOG3608|consensus
Probab=99.89  E-value=1.3e-23  Score=191.25  Aligned_cols=208  Identities=25%  Similarity=0.487  Sum_probs=180.5

Q ss_pred             Cccccc--ccccccCCHHHHHHHHHhhcC------------CCc-cccc--cCccccCChHHHHHHHHHhcCCceeecCc
Q psy2838         179 KKFPCD--QCGKSFTLKQNLTYHVNSIHL------------GVR-IKCD--FCDKTFAHKPNLIAHMNSAHNGVRFKCDH  241 (458)
Q Consensus       179 k~~~C~--~C~~~f~~~~~L~~H~~~~h~------------~~~-~~C~--~C~~~f~~~~~L~~H~~~h~~~~~~~C~~  241 (458)
                      ..|.|.  .|+..|.+...|..|+..|..            +++ +.|.  .|-+.|.++..|+.|++.|.+++...|+.
T Consensus       133 ~~f~C~WedCe~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~  212 (467)
T KOG3608|consen  133 QNFRCGWEDCEREFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPH  212 (467)
T ss_pred             hhhccChhhcCCcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeEEecch
Confidence            347785  599999999999999874432            112 5665  49999999999999999999999999999


Q ss_pred             cccccCCchhhhhccchhcc--CcceecCcccccccCHHHHHHHHHHhhcCceeecCccCcccCChhHHHHHHhh-hccC
Q psy2838         242 CVKSFNQKPNLISHIKSVHE--GIKYKCEECTKTFAQKQNLLSHVQAIHMGIKYTCEVCSKSYNQVTNLLSHMTT-VHQA  318 (458)
Q Consensus       242 C~~~f~~~~~L~~H~~~~~~--~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f~~~~~L~~H~~~-h~~~  318 (458)
                      |+..|.++..|..|++..+.  ..+|.|..|.|.|.+...|..|+..|-.  -|+|+.|+.+....++|..|++. |...
T Consensus       213 Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHvn--~ykCplCdmtc~~~ssL~~H~r~rHs~d  290 (467)
T KOG3608|consen  213 CGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHVN--CYKCPLCDMTCSSASSLTTHIRYRHSKD  290 (467)
T ss_pred             HHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhhh--cccccccccCCCChHHHHHHHHhhhccC
Confidence            99999999999999986553  3689999999999999999999865543  49999999999999999999986 4568


Q ss_pred             ccccCccccccCCCHHHHHHHHhhccCCCccccCc--CcccCCCHHHHHHHHHHhCCC---CCcccCCCCCCCCcC
Q psy2838         319 QKYKCADCGRLFAQEHTLRTHTLSVHSGVRLTCRT--CGLFCDSKRALTAHVREAHPG---CEYKKRGRGRPKLEV  389 (458)
Q Consensus       319 ~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~--C~~~f~~~~~L~~H~~~~h~~---~~~~~~~~~~~~~~~  389 (458)
                      +||+|..|++.|.+.++|.+|.. .|++..|.|..  |.++|.+...|++|++.+|.+   .+|.|-.|.+.....
T Consensus       291 kpfKCd~Cd~~c~~esdL~kH~~-~HS~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G  365 (467)
T KOG3608|consen  291 KPFKCDECDTRCVRESDLAKHVQ-VHSKTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSG  365 (467)
T ss_pred             CCccccchhhhhccHHHHHHHHH-hccccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccc
Confidence            99999999999999999999976 67788899988  999999999999999999943   579999998766443


No 7  
>KOG3623|consensus
Probab=99.89  E-value=3.4e-24  Score=210.90  Aligned_cols=109  Identities=19%  Similarity=0.382  Sum_probs=92.2

Q ss_pred             cccchhhhhhhhcCCccccccccCCCCCCCCcccccccccccCCHHHHHHHHHhhcC-------------CCccccccCc
Q psy2838         149 LEFNRSQFEKTFQNYSMTTTGHRGGKGPPVKKFPCDQCGKSFTLKQNLTYHVNSIHL-------------GVRIKCDFCD  215 (458)
Q Consensus       149 ~~~~c~~c~~~~~~~~~~~~~~~~~~~~~~k~~~C~~C~~~f~~~~~L~~H~~~~h~-------------~~~~~C~~C~  215 (458)
                      ....|.+|...+.....+..|........+..|.|..|..+|.....|.+||..|..             .++|+|..|+
T Consensus       209 qlltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKCtECg  288 (1007)
T KOG3623|consen  209 QLLTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKCTECG  288 (1007)
T ss_pred             hhhcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhccccccccc
Confidence            456788998887776666666543444456679999999999999999999986543             2459999999


Q ss_pred             cccCChHHHHHHHHHhcCCceeecCccccccCCchhhhhccc
Q psy2838         216 KTFAHKPNLIAHMNSAHNGVRFKCDHCVKSFNQKPNLISHIK  257 (458)
Q Consensus       216 ~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~  257 (458)
                      |.|..+..|+.|+|+|.|++||.|+.|+|.|+...++..||.
T Consensus       289 KAFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmS  330 (1007)
T KOG3623|consen  289 KAFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMS  330 (1007)
T ss_pred             hhhhhHHHHHhhheeecCCCCcCCcccccccccCCccccccc
Confidence            999999999999999999999999999999999999999974


No 8  
>KOG3623|consensus
Probab=99.85  E-value=2.6e-22  Score=197.80  Aligned_cols=105  Identities=28%  Similarity=0.577  Sum_probs=96.2

Q ss_pred             cccccccccccCCHHHHHHHHHhhcCCC--ccccccCccccCChHHHHHHHHHhcCC-------------ceeecCcccc
Q psy2838         180 KFPCDQCGKSFTLKQNLTYHVNSIHLGV--RIKCDFCDKTFAHKPNLIAHMNSAHNG-------------VRFKCDHCVK  244 (458)
Q Consensus       180 ~~~C~~C~~~f~~~~~L~~H~~~~h~~~--~~~C~~C~~~f~~~~~L~~H~~~h~~~-------------~~~~C~~C~~  244 (458)
                      ...|++|.+.+.....|+.|++..|...  .|.|..|.++|..+..|.+||.+|...             +.|+|..|+|
T Consensus       210 lltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKCtECgK  289 (1007)
T KOG3623|consen  210 LLTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKCTECGK  289 (1007)
T ss_pred             hhcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhccccccccch
Confidence            4789999999999999999998766543  399999999999999999999998642             4599999999


Q ss_pred             ccCCchhhhhccchhccCcceecCcccccccCHHHHHHHH
Q psy2838         245 SFNQKPNLISHIKSVHEGIKYKCEECTKTFAQKQNLLSHV  284 (458)
Q Consensus       245 ~f~~~~~L~~H~~~~~~~~~~~C~~C~~~f~~~~~L~~H~  284 (458)
                      .|.-+..|+.|+|+|.|+|||.|+.|+|.|.....+..||
T Consensus       290 AFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHm  329 (1007)
T KOG3623|consen  290 AFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHM  329 (1007)
T ss_pred             hhhhHHHHHhhheeecCCCCcCCcccccccccCCcccccc
Confidence            9999999999999999999999999999999999998887


No 9  
>KOG3576|consensus
Probab=99.72  E-value=2.1e-18  Score=145.83  Aligned_cols=115  Identities=25%  Similarity=0.493  Sum_probs=100.4

Q ss_pred             CcceecCcccccccCHHHHHHHHHHhhcCceeecCccCcccCChhHHHHHHhhhccCccccCccccccCCCHHHHHHHHh
Q psy2838         262 GIKYKCEECTKTFAQKQNLLSHVQAIHMGIKYTCEVCSKSYNQVTNLLSHMTTVHQAQKYKCADCGRLFAQEHTLRTHTL  341 (458)
Q Consensus       262 ~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~  341 (458)
                      ...|.|.+|+|.|.-...|.+|++-|..-++|.|..||+.|.....|++|+++|++.+||+|..|++.|+++-+|..|++
T Consensus       115 ~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl~  194 (267)
T KOG3576|consen  115 QDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHLK  194 (267)
T ss_pred             CCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHHH
Confidence            45688888888888888888888766666678999999999999999999999999999999999999999999999999


Q ss_pred             hccCC-----------CccccCcCcccCCCHHHHHHHHHHhCCCCC
Q psy2838         342 SVHSG-----------VRLTCRTCGLFCDSKRALTAHVREAHPGCE  376 (458)
Q Consensus       342 ~~h~~-----------~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~  376 (458)
                      ++|+.           +-|.|..||++-.....+..|++.||+.-+
T Consensus       195 kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~Sp  240 (267)
T KOG3576|consen  195 KVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFSP  240 (267)
T ss_pred             HHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCCCH
Confidence            98874           236799999999999999999999998644


No 10 
>KOG3576|consensus
Probab=99.65  E-value=3.1e-17  Score=138.81  Aligned_cols=86  Identities=34%  Similarity=0.586  Sum_probs=61.7

Q ss_pred             CCCcccccccccccCCHHHHHHHHHhhcCCCccccccCccccCChHHHHHHHHHhcCCceeecCccccccCCchhhhhcc
Q psy2838         177 PVKKFPCDQCGKSFTLKQNLTYHVNSIHLGVRIKCDFCDKTFAHKPNLIAHMNSAHNGVRFKCDHCVKSFNQKPNLISHI  256 (458)
Q Consensus       177 ~~k~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~  256 (458)
                      +...|.|.+|+|.|.....|.+|++-|..-+++-|..||+.|.....|++|+++|++.+||+|..|++.|.++-+|..|+
T Consensus       114 d~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl  193 (267)
T KOG3576|consen  114 DQDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHL  193 (267)
T ss_pred             CCCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHH
Confidence            34567777777777777777777775444455777777777777777777777777777777777777777777777777


Q ss_pred             chhccC
Q psy2838         257 KSVHEG  262 (458)
Q Consensus       257 ~~~~~~  262 (458)
                      +..|+.
T Consensus       194 ~kvhgv  199 (267)
T KOG3576|consen  194 KKVHGV  199 (267)
T ss_pred             HHHcCc
Confidence            666653


No 11 
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.38  E-value=1.7e-12  Score=130.41  Aligned_cols=146  Identities=18%  Similarity=0.432  Sum_probs=115.2

Q ss_pred             cccccCccccCChHHHHHHHHHhcCCceeecCc--cccccCCchhhhhccchhccCcceecCcccccccCHHHHHHHHHH
Q psy2838         209 IKCDFCDKTFAHKPNLIAHMNSAHNGVRFKCDH--CVKSFNQKPNLISHIKSVHEGIKYKCEECTKTFAQKQNLLSHVQA  286 (458)
Q Consensus       209 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~--C~~~f~~~~~L~~H~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~  286 (458)
                      ..|+.|..... ...|..|.... ......|+.  |+..|. +..+..|         +.|+.|++.|. ...|..|+..
T Consensus       408 V~C~NC~~~i~-l~~l~lHe~~C-~r~~V~Cp~~~Cg~v~~-r~el~~H---------~~C~~Cgk~f~-~s~LekH~~~  474 (567)
T PLN03086        408 VECRNCKHYIP-SRSIALHEAYC-SRHNVVCPHDGCGIVLR-VEEAKNH---------VHCEKCGQAFQ-QGEMEKHMKV  474 (567)
T ss_pred             EECCCCCCccc-hhHHHHHHhhC-CCcceeCCcccccceee-ccccccC---------ccCCCCCCccc-hHHHHHHHHh
Confidence            57999988754 55777888543 335567885  999883 4445555         48999999996 6789999987


Q ss_pred             hhcCceeecCccCcccCChhHHHHHHhhhccCccccCccccccCC----------CHHHHHHHHhhccCCCccccCcCcc
Q psy2838         287 IHMGIKYTCEVCSKSYNQVTNLLSHMTTVHQAQKYKCADCGRLFA----------QEHTLRTHTLSVHSGVRLTCRTCGL  356 (458)
Q Consensus       287 ~h~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~----------~~~~L~~H~~~~h~~~~~~C~~C~~  356 (458)
                      +|  .++.|+ |++.+ .+..|..|+.+|.+.+++.|..|++.|.          ....|..|... ++.+.+.|..|++
T Consensus       475 ~H--kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~-CG~rt~~C~~Cgk  549 (567)
T PLN03086        475 FH--EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESI-CGSRTAPCDSCGR  549 (567)
T ss_pred             cC--CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHh-cCCcceEccccCC
Confidence            76  679999 99755 6689999999999999999999999985          24589999655 6888889999999


Q ss_pred             cCCCHHHHHHHHHHhCC
Q psy2838         357 FCDSKRALTAHVREAHP  373 (458)
Q Consensus       357 ~f~~~~~L~~H~~~~h~  373 (458)
                      .+..+ .|..|+...|.
T Consensus       550 ~Vrlr-dm~~H~~~~h~  565 (567)
T PLN03086        550 SVMLK-EMDIHQIAVHQ  565 (567)
T ss_pred             eeeeh-hHHHHHHHhhc
Confidence            98655 58999988875


No 12 
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.38  E-value=6.2e-13  Score=133.53  Aligned_cols=148  Identities=18%  Similarity=0.475  Sum_probs=119.3

Q ss_pred             CcccccccccccCCHHHHHHHHHhhcCCCcccccc--CccccCChHHHHHHHHHhcCCceeecCccccccCCchhhhhcc
Q psy2838         179 KKFPCDQCGKSFTLKQNLTYHVNSIHLGVRIKCDF--CDKTFAHKPNLIAHMNSAHNGVRFKCDHCVKSFNQKPNLISHI  256 (458)
Q Consensus       179 k~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~  256 (458)
                      ..-.|..|..... ..+|..|.. +.......|+.  |+..|. +..+..|         +.|+.|++.|. ...|..|+
T Consensus       406 ~~V~C~NC~~~i~-l~~l~lHe~-~C~r~~V~Cp~~~Cg~v~~-r~el~~H---------~~C~~Cgk~f~-~s~LekH~  472 (567)
T PLN03086        406 DTVECRNCKHYIP-SRSIALHEA-YCSRHNVVCPHDGCGIVLR-VEEAKNH---------VHCEKCGQAFQ-QGEMEKHM  472 (567)
T ss_pred             CeEECCCCCCccc-hhHHHHHHh-hCCCcceeCCcccccceee-ccccccC---------ccCCCCCCccc-hHHHHHHH
Confidence            3457999988764 556778986 45555578985  999884 3444444         58999999996 67899999


Q ss_pred             chhccCcceecCcccccccCHHHHHHHHHHhhcCceeecCccCcccC----------ChhHHHHHHhhhccCccccCccc
Q psy2838         257 KSVHEGIKYKCEECTKTFAQKQNLLSHVQAIHMGIKYTCEVCSKSYN----------QVTNLLSHMTTVHQAQKYKCADC  326 (458)
Q Consensus       257 ~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f~----------~~~~L~~H~~~h~~~~~~~C~~C  326 (458)
                      +.+|  +++.|+ |++.+ .+..|..|+..+...+++.|..|++.|.          ..+.|..|..+. +.+++.|..|
T Consensus       473 ~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~C  547 (567)
T PLN03086        473 KVFH--EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDSC  547 (567)
T ss_pred             HhcC--CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhc-CCcceEcccc
Confidence            9876  789999 99765 6789999998888888899999999995          245899999885 8999999999


Q ss_pred             cccCCCHHHHHHHHhhccC
Q psy2838         327 GRLFAQEHTLRTHTLSVHS  345 (458)
Q Consensus       327 ~~~f~~~~~L~~H~~~~h~  345 (458)
                      |+.|..+ .|..|+...|.
T Consensus       548 gk~Vrlr-dm~~H~~~~h~  565 (567)
T PLN03086        548 GRSVMLK-EMDIHQIAVHQ  565 (567)
T ss_pred             CCeeeeh-hHHHHHHHhhc
Confidence            9988776 68899888775


No 13 
>KOG3993|consensus
Probab=99.14  E-value=5.2e-12  Score=118.71  Aligned_cols=195  Identities=16%  Similarity=0.185  Sum_probs=115.0

Q ss_pred             cccccccccccCCHHHHHHHHHhhcCCCccccccCccccCChHHHHHHHHHhcCCceeec---CccccccCCchhhhhcc
Q psy2838         180 KFPCDQCGKSFTLKQNLTYHVNSIHLGVRIKCDFCDKTFAHKPNLIAHMNSAHNGVRFKC---DHCVKSFNQKPNLISHI  256 (458)
Q Consensus       180 ~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C---~~C~~~f~~~~~L~~H~  256 (458)
                      .|.|..|...|.+...|.+|.-..-...-|+|+.|+|.|....+|..|.|.|....--.=   +-=.+...+....+.--
T Consensus       267 dyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea~  346 (500)
T KOG3993|consen  267 DYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEAE  346 (500)
T ss_pred             HHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhcc
Confidence            478888888888888888775432222347788888888888888888777753211000   00000000000000000


Q ss_pred             ch--hccCcceecCcccccccCHHHHHHHHHHhhcCce-----------------eecCccCcccCChhHHHHHHhhhcc
Q psy2838         257 KS--VHEGIKYKCEECTKTFAQKQNLLSHVQAIHMGIK-----------------YTCEVCSKSYNQVTNLLSHMTTVHQ  317 (458)
Q Consensus       257 ~~--~~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~-----------------~~C~~C~~~f~~~~~L~~H~~~h~~  317 (458)
                      +.  ......|.|.+|+|.|.+...|+.|+..|+....                 +.|..|.-.+.....--.+...+.+
T Consensus       347 rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~vl~~a~  426 (500)
T KOG3993|consen  347 RSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEVLYVAG  426 (500)
T ss_pred             ccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccceeeeec
Confidence            00  0112368888888888888888888766664221                 3445555444433322222222221


Q ss_pred             -CccccCccccccCCCHHHHHHHHhhccCCCccccCcCcccCCCHHHHHHHHHHhCCC
Q psy2838         318 -AQKYKCADCGRLFAQEHTLRTHTLSVHSGVRLTCRTCGLFCDSKRALTAHVREAHPG  374 (458)
Q Consensus       318 -~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~  374 (458)
                       .....|+.|+-.+.++..--.|.+.-+....|.|.+|.-.|.+...|.+|+.+.|+.
T Consensus       427 sael~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~~Hps  484 (500)
T KOG3993|consen  427 SAELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINKCHPS  484 (500)
T ss_pred             cccccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhhcChH
Confidence             122457788877777665555554455556688999999999999999999999974


No 14 
>PF01352 KRAB:  KRAB box;  InterPro: IPR001909 The Krueppel-associated box (KRAB) is a domain of around 75 amino acids that is found in the N-terminal part of about one third of eukaryotic Krueppel-type C2H2 zinc finger proteins (ZFPs) []. It is enriched in charged amino acids and can be divided into subregions A and B, which are predicted to fold into two amphipathic alpha-helices. The KRAB A and B boxes can be separated by variable spacer segments and many KRAB proteins contain only the A box []. The functions currently known for members of the KRAB-containing protein family include transcriptional repression of RNA polymerase I, II, and III promoters, binding and splicing of RNA, and control of nucleolus function. The KRAB domain functions as a transcriptional repressor when tethered to the template DNA by a DNA-binding domain. A sequence of 45 amino acids in the KRAB A subdomain has been shown to be necessary and sufficient for transcriptional repression. The B box does not repress by itself but does potentiate the repression exerted by the KRAB A subdomain [, ]. Gene silencing requires the binding of the KRAB domain to the RING-B box-coiled coil (RBCC) domain of the KAP-1/TIF1-beta corepressor. As KAP-1 binds to the heterochromatin proteins HP1, it has been proposed that the KRAB-ZFP-bound target gene could be silenced following recruitment to heterochromatin [, ]. KRAB-ZFPs probably constitute the single largest class of transcription factors within the human genome []. Although the function of KRAB-ZFPs is largely unknown, they appear to play important roles during cell differentiation and development. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B.; GO: 0003676 nucleic acid binding, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 1V65_A.
Probab=99.12  E-value=1.2e-12  Score=83.84  Aligned_cols=36  Identities=11%  Similarity=0.032  Sum_probs=21.4

Q ss_pred             ceeeecchhHHHHHHHhcCCCcCCCCCCCCC-CCCCC
Q psy2838          43 YNYYQLSSDEKFEILKSISASFTPGYGGFLP-NTPPL   78 (458)
Q Consensus        43 ~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~-~~~~~   78 (458)
                      |+|+||||+||+|||++|+|+||.+|++++. ++..+
T Consensus         1 Vtf~Dvav~fs~eEW~~L~~~Qk~ly~dvm~Eny~~l   37 (41)
T PF01352_consen    1 VTFEDVAVYFSQEEWELLDPAQKNLYRDVMLENYRNL   37 (41)
T ss_dssp             ------TT---HHHHHTS-HHHHHHHHHHHHHTTTS-
T ss_pred             CeEEEEEEEcChhhcccccceecccchhHHHHhhccc
Confidence            7999999999999999999999999999953 44333


No 15 
>PHA00733 hypothetical protein
Probab=99.06  E-value=1.7e-10  Score=94.94  Aligned_cols=80  Identities=19%  Similarity=0.230  Sum_probs=58.6

Q ss_pred             eeecCccCcccCChhHHH------HHHhhhccCccccCccccccCCCHHHHHHHHhhccCCCccccCcCcccCCCHHHHH
Q psy2838         292 KYTCEVCSKSYNQVTNLL------SHMTTVHQAQKYKCADCGRLFAQEHTLRTHTLSVHSGVRLTCRTCGLFCDSKRALT  365 (458)
Q Consensus       292 ~~~C~~C~~~f~~~~~L~------~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f~~~~~L~  365 (458)
                      ++.|.+|.+.|.+...|.      +|+. +++.++|.|+.|++.|.+...|..|++.+  ..+|.|..|++.|.....|.
T Consensus        40 ~~~~~~~~~~~~~~~~l~~~~~l~~~~~-~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL~  116 (128)
T PHA00733         40 RLIRAVVKTLIYNPQLLDESSYLYKLLT-SKAVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDSTL  116 (128)
T ss_pred             hHHHHHHhhhccChhhhcchHHHHhhcc-cCCCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHHH
Confidence            355555555555544433      3433 34578899999999999999999997653  34688999999999999999


Q ss_pred             HHHHHhCCC
Q psy2838         366 AHVREAHPG  374 (458)
Q Consensus       366 ~H~~~~h~~  374 (458)
                      .|++.+|+.
T Consensus       117 ~H~~~~h~~  125 (128)
T PHA00733        117 DHVCKKHNI  125 (128)
T ss_pred             HHHHHhcCc
Confidence            999988864


No 16 
>KOG3993|consensus
Probab=99.01  E-value=1.9e-11  Score=114.93  Aligned_cols=192  Identities=13%  Similarity=0.111  Sum_probs=115.9

Q ss_pred             ccchhhhhhhhcCCccccccccCCCCCCCCcccccccccccCCHHHHHHHHHhhcCCCcccccc----CccccCChHHHH
Q psy2838         150 EFNRSQFEKTFQNYSMTTTGHRGGKGPPVKKFPCDQCGKSFTLKQNLTYHVNSIHLGVRIKCDF----CDKTFAHKPNLI  225 (458)
Q Consensus       150 ~~~c~~c~~~~~~~~~~~~~~~~~~~~~~k~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~----C~~~f~~~~~L~  225 (458)
                      .|.|..|...|.+...|..|.-.+.  -.-.|+|+.|+|+|....+|..|.|. |..++-.=.-    =.+...+. .+.
T Consensus       267 dyiCqLCK~kYeD~F~LAQHrC~RI--V~vEYrCPEC~KVFsCPANLASHRRW-HKPR~eaa~a~~~P~k~~~~~r-ae~  342 (500)
T KOG3993|consen  267 DYICQLCKEKYEDAFALAQHRCPRI--VHVEYRCPECDKVFSCPANLASHRRW-HKPRPEAAKAGSPPPKQAVETR-AEV  342 (500)
T ss_pred             HHHHHHHHHhhhhHHHHhhccCCee--EEeeecCCcccccccCchhhhhhhcc-cCCchhhhhcCCCChhhhhhhh-hhh
Confidence            6889999999998888877642111  12359999999999999999999984 4433211000    00000000 111


Q ss_pred             HHH-HH--hcCCceeecCccccccCCchhhhhccchhccCcc-----------------eecCcccccccCHHHHHHHHH
Q psy2838         226 AHM-NS--AHNGVRFKCDHCVKSFNQKPNLISHIKSVHEGIK-----------------YKCEECTKTFAQKQNLLSHVQ  285 (458)
Q Consensus       226 ~H~-~~--h~~~~~~~C~~C~~~f~~~~~L~~H~~~~~~~~~-----------------~~C~~C~~~f~~~~~L~~H~~  285 (458)
                      +-. +.  -.....|.|.+|++.|.....|+.|+.+|+....                 +-|..|.-.+.....--.+..
T Consensus       343 ~ea~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~vl  422 (500)
T KOG3993|consen  343 QEAERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEVL  422 (500)
T ss_pred             hhccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhccccccccccccccccccccccccee
Confidence            000 00  0112358888888888888888888776664311                 223333333322111111110


Q ss_pred             Hhh-cCceeecCccCcccCChhHHHHHHhhhccCccccCccccccCCCHHHHHHHHhhccC
Q psy2838         286 AIH-MGIKYTCEVCSKSYNQVTNLLSHMTTVHQAQKYKCADCGRLFAQEHTLRTHTLSVHS  345 (458)
Q Consensus       286 ~~h-~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~  345 (458)
                      .+. ......|++|+..+.++..--.|.+.-+.+..|.|.+|.-.|.+...|.+|+...|.
T Consensus       423 ~~a~sael~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~~Hp  483 (500)
T KOG3993|consen  423 YVAGSAELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINKCHP  483 (500)
T ss_pred             eeeccccccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhhcCh
Confidence            000 011256888888887776666666666677789999999999999999999887775


No 17 
>PHA00733 hypothetical protein
Probab=98.98  E-value=3.5e-10  Score=93.18  Aligned_cols=81  Identities=16%  Similarity=0.283  Sum_probs=58.1

Q ss_pred             cceecCcccccccCHHHHHHH--H---HHhhcCceeecCccCcccCChhHHHHHHhhhccCccccCccccccCCCHHHHH
Q psy2838         263 IKYKCEECTKTFAQKQNLLSH--V---QAIHMGIKYTCEVCSKSYNQVTNLLSHMTTVHQAQKYKCADCGRLFAQEHTLR  337 (458)
Q Consensus       263 ~~~~C~~C~~~f~~~~~L~~H--~---~~~h~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~  337 (458)
                      +++.|.+|.+.|.....|..|  +   ..++...+|.|+.|++.|.+...|..|++.|  ..+|.|..|++.|.....|.
T Consensus        39 ~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL~  116 (128)
T PHA00733         39 KRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDSTL  116 (128)
T ss_pred             hhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHHH
Confidence            556666666666655555444  1   1234456688888888888888888888865  34688888888888888888


Q ss_pred             HHHhhccC
Q psy2838         338 THTLSVHS  345 (458)
Q Consensus       338 ~H~~~~h~  345 (458)
                      +|+...|+
T Consensus       117 ~H~~~~h~  124 (128)
T PHA00733        117 DHVCKKHN  124 (128)
T ss_pred             HHHHHhcC
Confidence            88887775


No 18 
>PHA02768 hypothetical protein; Provisional
Probab=98.75  E-value=3.2e-09  Score=71.78  Aligned_cols=42  Identities=19%  Similarity=0.581  Sum_probs=26.0

Q ss_pred             eecCccCcccCChhHHHHHHhhhccCccccCccccccCCCHHHH
Q psy2838         293 YTCEVCSKSYNQVTNLLSHMTTVHQAQKYKCADCGRLFAQEHTL  336 (458)
Q Consensus       293 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L  336 (458)
                      |.|+.||+.|...++|..|+++|+  ++|+|..|++.|.+.+.|
T Consensus         6 y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l   47 (55)
T PHA02768          6 YECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEY   47 (55)
T ss_pred             cCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceeccccee
Confidence            566666666666666666666665  456666666666655544


No 19 
>PHA02768 hypothetical protein; Provisional
Probab=98.69  E-value=9.2e-09  Score=69.58  Aligned_cols=43  Identities=16%  Similarity=0.504  Sum_probs=30.6

Q ss_pred             cccccCccccCChHHHHHHHHHhcCCceeecCccccccCCchhhh
Q psy2838         209 IKCDFCDKTFAHKPNLIAHMNSAHNGVRFKCDHCVKSFNQKPNLI  253 (458)
Q Consensus       209 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~  253 (458)
                      |.|+.||+.|...++|..|+++|+  ++|+|..|++.|...+.|.
T Consensus         6 y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~   48 (55)
T PHA02768          6 YECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYI   48 (55)
T ss_pred             cCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeE
Confidence            677777777777777777777776  4677777777777666553


No 20 
>PF13465 zf-H2C2_2:  Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.35  E-value=2e-07  Score=53.83  Aligned_cols=26  Identities=42%  Similarity=0.809  Sum_probs=19.9

Q ss_pred             HHHHHHhhhccCccccCccccccCCC
Q psy2838         307 NLLSHMTTVHQAQKYKCADCGRLFAQ  332 (458)
Q Consensus       307 ~L~~H~~~h~~~~~~~C~~C~~~f~~  332 (458)
                      +|.+|+++|++++||.|+.|++.|.+
T Consensus         1 ~l~~H~~~H~~~k~~~C~~C~k~F~~   26 (26)
T PF13465_consen    1 NLRRHMRTHTGEKPYKCPYCGKSFSN   26 (26)
T ss_dssp             HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred             CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence            46778888888888888888887753


No 21 
>PF13465 zf-H2C2_2:  Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.29  E-value=5.7e-07  Score=51.89  Aligned_cols=25  Identities=40%  Similarity=0.683  Sum_probs=18.4

Q ss_pred             HHHHHHHHhcCCceeecCccccccC
Q psy2838         223 NLIAHMNSAHNGVRFKCDHCVKSFN  247 (458)
Q Consensus       223 ~L~~H~~~h~~~~~~~C~~C~~~f~  247 (458)
                      +|.+|+++|++++||.|+.|++.|.
T Consensus         1 ~l~~H~~~H~~~k~~~C~~C~k~F~   25 (26)
T PF13465_consen    1 NLRRHMRTHTGEKPYKCPYCGKSFS   25 (26)
T ss_dssp             HHHHHHHHHSSSSSEEESSSSEEES
T ss_pred             CHHHHhhhcCCCCCCCCCCCcCeeC
Confidence            4677777777777777777777775


No 22 
>PF06524 NOA36:  NOA36 protein;  InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=98.15  E-value=2.6e-06  Score=75.71  Aligned_cols=95  Identities=21%  Similarity=0.557  Sum_probs=53.1

Q ss_pred             hccCcceecCcccccccCHHHHHHHHHHhh--cCceeecCccCcccCChhHHHHHHhhhccCccccCccccccCCCHHHH
Q psy2838         259 VHEGIKYKCEECTKTFAQKQNLLSHVQAIH--MGIKYTCEVCSKSYNQVTNLLSHMTTVHQAQKYKCADCGRLFAQEHTL  336 (458)
Q Consensus       259 ~~~~~~~~C~~C~~~f~~~~~L~~H~~~~h--~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L  336 (458)
                      .|+.+.|+|..|....-.- .-..|+....  ....|+|.-|++.                 -.|.|-.|...|-.-...
T Consensus       137 ~hGGrif~CsfC~~flCED-DQFEHQAsCQvLe~E~~KC~SCNrl-----------------Gq~sCLRCK~cfCddHvr  198 (314)
T PF06524_consen  137 DHGGRIFKCSFCDNFLCED-DQFEHQASCQVLESETFKCQSCNRL-----------------GQYSCLRCKICFCDDHVR  198 (314)
T ss_pred             cCCCeEEEeecCCCeeecc-chhhhhhhhhhhhcccccccccccc-----------------cchhhhheeeeehhhhhh
Confidence            4666777777776533322 2223432221  2233777766642                 124555555555443322


Q ss_pred             HHHHhhccCCCccccCcCcccCCCHHHHHHHHHHhC
Q psy2838         337 RTHTLSVHSGVRLTCRTCGLFCDSKRALTAHVREAH  372 (458)
Q Consensus       337 ~~H~~~~h~~~~~~C~~C~~~f~~~~~L~~H~~~~h  372 (458)
                      ++- .+.-.++++.|+.|++.......|..-.|+|-
T Consensus       199 rKg-~ky~k~k~~PCPKCg~et~eTkdLSmStR~hk  233 (314)
T PF06524_consen  199 RKG-FKYEKGKPIPCPKCGYETQETKDLSMSTRSHK  233 (314)
T ss_pred             hcc-cccccCCCCCCCCCCCcccccccceeeeecch
Confidence            222 22334567889999999888888887777664


No 23 
>PHA00732 hypothetical protein
Probab=98.12  E-value=1.6e-06  Score=64.73  Aligned_cols=45  Identities=24%  Similarity=0.524  Sum_probs=29.5

Q ss_pred             eecCccCcccCChhHHHHHHhhhccCccccCccccccCCCHHHHHHHHhh
Q psy2838         293 YTCEVCSKSYNQVTNLLSHMTTVHQAQKYKCADCGRLFAQEHTLRTHTLS  342 (458)
Q Consensus       293 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~  342 (458)
                      |.|..|++.|.+...|..|++.+|.  ++.|+.|++.|.   .|..|+++
T Consensus         2 y~C~~Cgk~F~s~s~Lk~H~r~~H~--~~~C~~CgKsF~---~l~~H~~~   46 (79)
T PHA00732          2 FKCPICGFTTVTLFALKQHARRNHT--LTKCPVCNKSYR---RLNQHFYS   46 (79)
T ss_pred             ccCCCCCCccCCHHHHHHHhhcccC--CCccCCCCCEeC---Chhhhhcc
Confidence            5677777777777777777764322  256777777776   36667543


No 24 
>PHA00616 hypothetical protein
Probab=98.11  E-value=1.5e-06  Score=56.00  Aligned_cols=31  Identities=19%  Similarity=0.405  Sum_probs=16.3

Q ss_pred             cccCcCcccCCCHHHHHHHHHHhCCCCCccc
Q psy2838         349 LTCRTCGLFCDSKRALTAHVREAHPGCEYKK  379 (458)
Q Consensus       349 ~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~~  379 (458)
                      |+|+.||+.|..++.|.+|+++||+..++.|
T Consensus         2 YqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~   32 (44)
T PHA00616          2 YQCLRCGGIFRKKKEVIEHLLSVHKQNKLTL   32 (44)
T ss_pred             CccchhhHHHhhHHHHHHHHHHhcCCCccce
Confidence            4455555555555555555555555544443


No 25 
>PHA00616 hypothetical protein
Probab=98.06  E-value=1.6e-06  Score=55.75  Aligned_cols=32  Identities=22%  Similarity=0.591  Sum_probs=16.5

Q ss_pred             eecCccCcccCChhHHHHHHhhhccCccccCc
Q psy2838         293 YTCEVCSKSYNQVTNLLSHMTTVHQAQKYKCA  324 (458)
Q Consensus       293 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~  324 (458)
                      |+|+.||+.|...+.|..|++.||+++++.|.
T Consensus         2 YqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~   33 (44)
T PHA00616          2 YQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLE   33 (44)
T ss_pred             CccchhhHHHhhHHHHHHHHHHhcCCCcccee
Confidence            44555555555555555555555555555443


No 26 
>PF05605 zf-Di19:  Drought induced 19 protein (Di19), zinc-binding;  InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.03  E-value=7.6e-06  Score=56.61  Aligned_cols=51  Identities=24%  Similarity=0.486  Sum_probs=35.1

Q ss_pred             cccCccccccCCCHHHHHHHHhhccCCC--ccccCcCcccCCCHHHHHHHHHHhCC
Q psy2838         320 KYKCADCGRLFAQEHTLRTHTLSVHSGV--RLTCRTCGLFCDSKRALTAHVREAHP  373 (458)
Q Consensus       320 ~~~C~~C~~~f~~~~~L~~H~~~~h~~~--~~~C~~C~~~f~~~~~L~~H~~~~h~  373 (458)
                      .|.|+.|++ ..+...|..|....|...  .+.|++|...+.  .+|..|+..+|+
T Consensus         2 ~f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H~   54 (54)
T PF05605_consen    2 SFTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQHR   54 (54)
T ss_pred             CcCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhcC
Confidence            377888888 455667888877766643  356888877644  378888887763


No 27 
>PHA00732 hypothetical protein
Probab=97.99  E-value=4.4e-06  Score=62.33  Aligned_cols=44  Identities=32%  Similarity=0.499  Sum_probs=21.7

Q ss_pred             cccccCccccCChHHHHHHHHHhcCCceeecCccccccCCchhhhhccc
Q psy2838         209 IKCDFCDKTFAHKPNLIAHMNSAHNGVRFKCDHCVKSFNQKPNLISHIK  257 (458)
Q Consensus       209 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~  257 (458)
                      |.|..|++.|.+...|..|++.+|.  ++.|+.|++.|.   .|..|++
T Consensus         2 y~C~~Cgk~F~s~s~Lk~H~r~~H~--~~~C~~CgKsF~---~l~~H~~   45 (79)
T PHA00732          2 FKCPICGFTTVTLFALKQHARRNHT--LTKCPVCNKSYR---RLNQHFY   45 (79)
T ss_pred             ccCCCCCCccCCHHHHHHHhhcccC--CCccCCCCCEeC---Chhhhhc
Confidence            4455555555555555555543221  235555555554   3555554


No 28 
>PF05605 zf-Di19:  Drought induced 19 protein (Di19), zinc-binding;  InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.92  E-value=1.7e-05  Score=54.91  Aligned_cols=50  Identities=34%  Similarity=0.640  Sum_probs=32.5

Q ss_pred             cccccccccccCCHHHHHHHHHhhcCCC--ccccccCccccCChHHHHHHHHHhc
Q psy2838         180 KFPCDQCGKSFTLKQNLTYHVNSIHLGV--RIKCDFCDKTFAHKPNLIAHMNSAH  232 (458)
Q Consensus       180 ~~~C~~C~~~f~~~~~L~~H~~~~h~~~--~~~C~~C~~~f~~~~~L~~H~~~h~  232 (458)
                      .|.|++|++ ..+...|..|+...|...  .+.|++|...+.  .+|..|+..+|
T Consensus         2 ~f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H   53 (54)
T PF05605_consen    2 SFTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH   53 (54)
T ss_pred             CcCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence            367777777 445667777776666543  377777776543  37777776665


No 29 
>KOG2231|consensus
Probab=97.64  E-value=0.00023  Score=73.21  Aligned_cols=48  Identities=21%  Similarity=0.415  Sum_probs=31.2

Q ss_pred             eecCccCcccCChhHHHHHHhhhccCccccCccc------cccCCCHHHHHHHHhhcc
Q psy2838         293 YTCEVCSKSYNQVTNLLSHMTTVHQAQKYKCADC------GRLFAQEHTLRTHTLSVH  344 (458)
Q Consensus       293 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C------~~~f~~~~~L~~H~~~~h  344 (458)
                      -.|..|...|.....|.+|++.+|    |.|..|      +..|.....|..|.+..|
T Consensus       183 p~C~~C~~~fld~~el~rH~~~~h----~~chfC~~~~~~neyy~~~~dLe~HfR~~H  236 (669)
T KOG2231|consen  183 PLCKFCHERFLDDDELYRHLRFDH----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH  236 (669)
T ss_pred             ccchhhhhhhccHHHHHHhhccce----eheeecCcccccchhcccchHHHHHhhhcC
Confidence            456777777777777777776655    445555      345666677777776666


No 30 
>PF06524 NOA36:  NOA36 protein;  InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=97.61  E-value=0.00012  Score=65.46  Aligned_cols=85  Identities=21%  Similarity=0.393  Sum_probs=48.6

Q ss_pred             HhhcCceeecCccCcccCChhHHHHHHhhh--ccCccccCccccccCCCHHHHHHHHhhccCCCccccCcCcccCCCHHH
Q psy2838         286 AIHMGIKYTCEVCSKSYNQVTNLLSHMTTV--HQAQKYKCADCGRLFAQEHTLRTHTLSVHSGVRLTCRTCGLFCDSKRA  363 (458)
Q Consensus       286 ~~h~~~~~~C~~C~~~f~~~~~L~~H~~~h--~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f~~~~~  363 (458)
                      ..|+++.|+|.+|....-.-. --.|+.+-  .....|+|.-|++.                 ..|.|-.|...|-    
T Consensus       136 w~hGGrif~CsfC~~flCEDD-QFEHQAsCQvLe~E~~KC~SCNrl-----------------Gq~sCLRCK~cfC----  193 (314)
T PF06524_consen  136 WDHGGRIFKCSFCDNFLCEDD-QFEHQASCQVLESETFKCQSCNRL-----------------GQYSCLRCKICFC----  193 (314)
T ss_pred             ccCCCeEEEeecCCCeeeccc-hhhhhhhhhhhhcccccccccccc-----------------cchhhhheeeeeh----
Confidence            357788888888875443322 23454432  12345888888752                 1233444544453    


Q ss_pred             HHHHHHHh----CCCCCcccCCCCCCCCcCCCCC
Q psy2838         364 LTAHVREA----HPGCEYKKRGRGRPKLEVPGGG  393 (458)
Q Consensus       364 L~~H~~~~----h~~~~~~~~~~~~~~~~~~~~~  393 (458)
                       ..|+|..    -.++++.|++|+-....+-..+
T Consensus       194 -ddHvrrKg~ky~k~k~~PCPKCg~et~eTkdLS  226 (314)
T PF06524_consen  194 -DDHVRRKGFKYEKGKPIPCPKCGYETQETKDLS  226 (314)
T ss_pred             -hhhhhhcccccccCCCCCCCCCCCcccccccce
Confidence             3455442    3457899999997765544433


No 31 
>PF12756 zf-C2H2_2:  C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.55  E-value=4.8e-05  Score=60.07  Aligned_cols=74  Identities=24%  Similarity=0.421  Sum_probs=19.6

Q ss_pred             ecCccCcccCChhHHHHHHhhhccCccccCccccccCCCHHHHHHHHhhccCCCccccCcCcccCCCHHHHHHHHHHhC
Q psy2838         294 TCEVCSKSYNQVTNLLSHMTTVHQAQKYKCADCGRLFAQEHTLRTHTLSVHSGVRLTCRTCGLFCDSKRALTAHVREAH  372 (458)
Q Consensus       294 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f~~~~~L~~H~~~~h  372 (458)
                      +|.+|+..|.+...|..|+...|+...-    ....+.....+..++.. .....+.|..|++.|.+...|..||+.++
T Consensus         1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~----~~~~l~~~~~~~~~~~~-~~~~~~~C~~C~~~f~s~~~l~~Hm~~~~   74 (100)
T PF12756_consen    1 QCLFCDESFSSVDDLLQHMKKKHGFDIP----DQKYLVDPNRLLNYLRK-KVKESFRCPYCNKTFRSREALQEHMRSKH   74 (100)
T ss_dssp             -----------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHTT
T ss_pred             Cccccccccccccccccccccccccccc----ccccccccccccccccc-ccCCCCCCCccCCCCcCHHHHHHHHcCcc
Confidence            4777777777777777777666653211    11122233444444322 12225778888888888888888888753


No 32 
>PF00096 zf-C2H2:  Zinc finger, C2H2 type;  InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain.  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.47  E-value=7.6e-05  Score=41.59  Aligned_cols=21  Identities=38%  Similarity=0.700  Sum_probs=11.4

Q ss_pred             ccCcCcccCCCHHHHHHHHHH
Q psy2838         350 TCRTCGLFCDSKRALTAHVRE  370 (458)
Q Consensus       350 ~C~~C~~~f~~~~~L~~H~~~  370 (458)
                      .|+.|++.|.++..|.+|++.
T Consensus         2 ~C~~C~~~f~~~~~l~~H~~~   22 (23)
T PF00096_consen    2 KCPICGKSFSSKSNLKRHMRR   22 (23)
T ss_dssp             EETTTTEEESSHHHHHHHHHH
T ss_pred             CCCCCCCccCCHHHHHHHHhH
Confidence            455555555555555555554


No 33 
>KOG1146|consensus
Probab=97.46  E-value=1.2e-05  Score=86.94  Aligned_cols=49  Identities=16%  Similarity=0.295  Sum_probs=33.0

Q ss_pred             ccCcccccc-CCCHHHHHHHHhhc------cCCCccccCcCcccCCCHHHHHHHHH
Q psy2838         321 YKCADCGRL-FAQEHTLRTHTLSV------HSGVRLTCRTCGLFCDSKRALTAHVR  369 (458)
Q Consensus       321 ~~C~~C~~~-f~~~~~L~~H~~~~------h~~~~~~C~~C~~~f~~~~~L~~H~~  369 (458)
                      |.|-.|+.- ..+...|+.|..+-      ..+..|.|.+|.+....+.++..|.+
T Consensus       735 ~~civcd~~st~~l~~l~~h~~~~rs~ke~v~g~~~~c~l~~y~t~~kanfqlh~K  790 (1406)
T KOG1146|consen  735 FDCIVCDVFSTDRLDQLWFHNTRERSRKEQVPGDVPSCKLKPYATNTKANFQLHNK  790 (1406)
T ss_pred             HHHhhhhhhhhhhHHHHhhcchhhhhhhhcccCCCCcceecccccccchhhhhhcc
Confidence            788888853 33445666674321      12345789999999888888877765


No 34 
>PF13894 zf-C2H2_4:  C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.40  E-value=0.00012  Score=41.03  Aligned_cols=24  Identities=38%  Similarity=0.699  Sum_probs=12.4

Q ss_pred             cccCcCcccCCCHHHHHHHHHHhC
Q psy2838         349 LTCRTCGLFCDSKRALTAHVREAH  372 (458)
Q Consensus       349 ~~C~~C~~~f~~~~~L~~H~~~~h  372 (458)
                      |.|++|++.|.+...|..|++.+|
T Consensus         1 ~~C~~C~~~~~~~~~l~~H~~~~H   24 (24)
T PF13894_consen    1 FQCPICGKSFRSKSELRQHMRTHH   24 (24)
T ss_dssp             EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred             CCCcCCCCcCCcHHHHHHHHHhhC
Confidence            345555555555555555555554


No 35 
>PF12756 zf-C2H2_2:  C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.35  E-value=0.00014  Score=57.40  Aligned_cols=22  Identities=32%  Similarity=0.853  Sum_probs=10.7

Q ss_pred             ceecCcccccccCHHHHHHHHH
Q psy2838         264 KYKCEECTKTFAQKQNLLSHVQ  285 (458)
Q Consensus       264 ~~~C~~C~~~f~~~~~L~~H~~  285 (458)
                      .+.|..|++.|.+...|..|++
T Consensus        50 ~~~C~~C~~~f~s~~~l~~Hm~   71 (100)
T PF12756_consen   50 SFRCPYCNKTFRSREALQEHMR   71 (100)
T ss_dssp             SEEBSSSS-EESSHHHHHHHHH
T ss_pred             CCCCCccCCCCcCHHHHHHHHc
Confidence            3455555555555555555554


No 36 
>PF13894 zf-C2H2_4:  C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.33  E-value=0.00012  Score=41.03  Aligned_cols=23  Identities=30%  Similarity=0.756  Sum_probs=10.3

Q ss_pred             ccCccccccCCCHHHHHHHHhhc
Q psy2838         321 YKCADCGRLFAQEHTLRTHTLSV  343 (458)
Q Consensus       321 ~~C~~C~~~f~~~~~L~~H~~~~  343 (458)
                      |.|++|++.|.+...|+.|+.++
T Consensus         1 ~~C~~C~~~~~~~~~l~~H~~~~   23 (24)
T PF13894_consen    1 FQCPICGKSFRSKSELRQHMRTH   23 (24)
T ss_dssp             EE-SSTS-EESSHHHHHHHHHHH
T ss_pred             CCCcCCCCcCCcHHHHHHHHHhh
Confidence            34555555555555555554443


No 37 
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=97.31  E-value=0.00034  Score=64.99  Aligned_cols=78  Identities=21%  Similarity=0.334  Sum_probs=48.5

Q ss_pred             eecCccCcccCChhHHHHHHhhhccCccccCccccc-------cCCCHHHHHHHHhhccCCCccc-cCcC-cccCCCHHH
Q psy2838         293 YTCEVCSKSYNQVTNLLSHMTTVHQAQKYKCADCGR-------LFAQEHTLRTHTLSVHSGVRLT-CRTC-GLFCDSKRA  363 (458)
Q Consensus       293 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~-------~f~~~~~L~~H~~~~h~~~~~~-C~~C-~~~f~~~~~  363 (458)
                      -.|..|...|..-..|.+|++..|.    .|-+|++       -|.+..+|.+|.+..|---.++ |..= -+.|.....
T Consensus       221 P~C~FC~~~FYdDDEL~~HcR~~HE----~ChICD~v~p~~~QYFK~Y~~Le~HF~~~hy~ct~qtc~~~k~~vf~~~~e  296 (493)
T COG5236         221 PLCIFCKIYFYDDDELRRHCRLRHE----ACHICDMVGPIRYQYFKSYEDLEAHFRNAHYCCTFQTCRVGKCYVFPYHTE  296 (493)
T ss_pred             chhhhccceecChHHHHHHHHhhhh----hhhhhhccCccchhhhhCHHHHHHHhhcCceEEEEEEEecCcEEEeccHHH
Confidence            3577777777777777777776553    3555544       3677778888866554222222 2110 145777788


Q ss_pred             HHHHHHHhCCC
Q psy2838         364 LTAHVREAHPG  374 (458)
Q Consensus       364 L~~H~~~~h~~  374 (458)
                      |..|+...|+.
T Consensus       297 l~~h~~~~h~~  307 (493)
T COG5236         297 LLEHLTRFHKV  307 (493)
T ss_pred             HHHHHHHHhhc
Confidence            88888887764


No 38 
>KOG2231|consensus
Probab=97.26  E-value=0.00041  Score=71.44  Aligned_cols=104  Identities=25%  Similarity=0.530  Sum_probs=63.1

Q ss_pred             ccccCccccCChHHHHHHHHHhcCCceeecCcccc---------ccCCchhhhhccchhcc-C----cceecCccccccc
Q psy2838         210 KCDFCDKTFAHKPNLIAHMNSAHNGVRFKCDHCVK---------SFNQKPNLISHIKSVHE-G----IKYKCEECTKTFA  275 (458)
Q Consensus       210 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~---------~f~~~~~L~~H~~~~~~-~----~~~~C~~C~~~f~  275 (458)
                      .|..| ..|.....|+.|+..-|.  -+.|..|-.         ...+...|..|++.--. .    -.-.|..|...|.
T Consensus       117 ~~~~c-~~~~s~~~Lk~H~~~~H~--~~~c~lC~~~~kif~~e~k~Yt~~el~~h~~~gd~d~~s~rGhp~C~~C~~~fl  193 (669)
T KOG2231|consen  117 ECLHC-TEFKSVENLKNHMRDQHK--LHLCSLCLQNLKIFINERKLYTRAELNLHLMFGDPDDESCRGHPLCKFCHERFL  193 (669)
T ss_pred             CCccc-cchhHHHHHHHHHHHhhh--hhccccccccceeeeeeeehehHHHHHHHHhcCCCccccccCCccchhhhhhhc
Confidence            45555 555666777777754443  244444421         12234556666553221 1    1246888888888


Q ss_pred             CHHHHHHHHHHhhcCceeecCcc------CcccCChhHHHHHHhhhccCccccCc
Q psy2838         276 QKQNLLSHVQAIHMGIKYTCEVC------SKSYNQVTNLLSHMTTVHQAQKYKCA  324 (458)
Q Consensus       276 ~~~~L~~H~~~~h~~~~~~C~~C------~~~f~~~~~L~~H~~~h~~~~~~~C~  324 (458)
                      ....|.+|++..|    |.|.+|      +.-|.....|..|.+.+|    |.|.
T Consensus       194 d~~el~rH~~~~h----~~chfC~~~~~~neyy~~~~dLe~HfR~~H----flCE  240 (669)
T KOG2231|consen  194 DDDELYRHLRFDH----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH----FLCE  240 (669)
T ss_pred             cHHHHHHhhccce----eheeecCcccccchhcccchHHHHHhhhcC----cccc
Confidence            8888888886655    556666      344666778888888777    5665


No 39 
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.11  E-value=0.00013  Score=66.92  Aligned_cols=23  Identities=22%  Similarity=0.678  Sum_probs=17.2

Q ss_pred             cCccccCccccccCCCHHHHHHH
Q psy2838         317 QAQKYKCADCGRLFAQEHTLRTH  339 (458)
Q Consensus       317 ~~~~~~C~~C~~~f~~~~~L~~H  339 (458)
                      ..|||+|++|+|+|.....|+-|
T Consensus       395 ~~KPYrCevC~KRYKNlNGLKYH  417 (423)
T COG5189         395 KDKPYRCEVCDKRYKNLNGLKYH  417 (423)
T ss_pred             cCCceeccccchhhccCccceec
Confidence            34677787788777777777777


No 40 
>KOG1146|consensus
Probab=97.03  E-value=0.00071  Score=73.69  Aligned_cols=106  Identities=23%  Similarity=0.351  Sum_probs=76.4

Q ss_pred             hhhhhhhhcCCccccccccCCCCCCCCcccccccccccCCHHHHHHHHHhhcC-------------------------CC
Q psy2838         153 RSQFEKTFQNYSMTTTGHRGGKGPPVKKFPCDQCGKSFTLKQNLTYHVNSIHL-------------------------GV  207 (458)
Q Consensus       153 c~~c~~~~~~~~~~~~~~~~~~~~~~k~~~C~~C~~~f~~~~~L~~H~~~~h~-------------------------~~  207 (458)
                      +..++..+...+.+..+.+ ..+.-.+.|+|+.|+..|+....|..|||..|.                         .+
T Consensus       439 ~~~~e~~~~s~r~~~~~t~-~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~  517 (1406)
T KOG1146|consen  439 LTKAEPLLESKRSLEGQTV-VLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGK  517 (1406)
T ss_pred             ccchhhhhhhhccccccee-eeecccccccCCccchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCC
Confidence            3444445544444333322 223334889999999999999999999997442                         34


Q ss_pred             ccccccCccccCChHHHHHHHHHhc--C-----------------------------------------CceeecCcccc
Q psy2838         208 RIKCDFCDKTFAHKPNLIAHMNSAH--N-----------------------------------------GVRFKCDHCVK  244 (458)
Q Consensus       208 ~~~C~~C~~~f~~~~~L~~H~~~h~--~-----------------------------------------~~~~~C~~C~~  244 (458)
                      +|.|..|...+..+.+|.+|+..-.  .                                         +-.+.|..|++
T Consensus       518 p~~C~~C~~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~~~C~vc~y  597 (1406)
T KOG1146|consen  518 PYPCRACNYSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPSWRCEVCSY  597 (1406)
T ss_pred             cccceeeeeeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCCcchhhhcc
Confidence            5889999999999999999986421  0                                         01288999999


Q ss_pred             ccCCchhhhhccchh
Q psy2838         245 SFNQKPNLISHIKSV  259 (458)
Q Consensus       245 ~f~~~~~L~~H~~~~  259 (458)
                      .-+-..+|+.|+..-
T Consensus       598 etniarnlrihmtss  612 (1406)
T KOG1146|consen  598 ETNIARNLRIHMTAS  612 (1406)
T ss_pred             hhhhhhccccccccC
Confidence            988889999998643


No 41 
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=96.97  E-value=0.0015  Score=60.94  Aligned_cols=128  Identities=24%  Similarity=0.474  Sum_probs=93.4

Q ss_pred             ccccc--CccccCChHHHHHHHHHhcCCceeecCccc---cccC------CchhhhhccchhccCcc----eecCccccc
Q psy2838         209 IKCDF--CDKTFAHKPNLIAHMNSAHNGVRFKCDHCV---KSFN------QKPNLISHIKSVHEGIK----YKCEECTKT  273 (458)
Q Consensus       209 ~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~---~~f~------~~~~L~~H~~~~~~~~~----~~C~~C~~~  273 (458)
                      |.|+.  |..+......|+.|.+..|+  .+.|.+|-   +.|.      ++..|..|...-..+.-    -.|..|...
T Consensus       152 F~CP~skc~~~C~~~k~lk~H~K~~H~--~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~~  229 (493)
T COG5236         152 FKCPKSKCHRRCGSLKELKKHYKAQHG--FVLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCKIY  229 (493)
T ss_pred             hcCCchhhhhhhhhHHHHHHHHHhhcC--cEEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhhccce
Confidence            77875  77777778899999998876  37788875   3343      34556667543222222    369999999


Q ss_pred             ccCHHHHHHHHHHhhcCceeecCccCcc-------cCChhHHHHHHhhhccCccccCcc--cc----ccCCCHHHHHHHH
Q psy2838         274 FAQKQNLLSHVQAIHMGIKYTCEVCSKS-------YNQVTNLLSHMTTVHQAQKYKCAD--CG----RLFAQEHTLRTHT  340 (458)
Q Consensus       274 f~~~~~L~~H~~~~h~~~~~~C~~C~~~-------f~~~~~L~~H~~~h~~~~~~~C~~--C~----~~f~~~~~L~~H~  340 (458)
                      |-....|..|++..|.    .|-+|++.       |.+...|..|.+..|    |.|..  |-    ..|.....|..|+
T Consensus       230 FYdDDEL~~HcR~~HE----~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h----y~ct~qtc~~~k~~vf~~~~el~~h~  301 (493)
T COG5236         230 FYDDDELRRHCRLRHE----ACHICDMVGPIRYQYFKSYEDLEAHFRNAH----YCCTFQTCRVGKCYVFPYHTELLEHL  301 (493)
T ss_pred             ecChHHHHHHHHhhhh----hhhhhhccCccchhhhhCHHHHHHHhhcCc----eEEEEEEEecCcEEEeccHHHHHHHH
Confidence            9999999999987774    56677654       778889999987654    66643  42    3799999999999


Q ss_pred             hhccCC
Q psy2838         341 LSVHSG  346 (458)
Q Consensus       341 ~~~h~~  346 (458)
                      ...|+.
T Consensus       302 ~~~h~~  307 (493)
T COG5236         302 TRFHKV  307 (493)
T ss_pred             HHHhhc
Confidence            888764


No 42 
>PF13912 zf-C2H2_6:  C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.96  E-value=0.00042  Score=40.24  Aligned_cols=20  Identities=35%  Similarity=1.004  Sum_probs=8.1

Q ss_pred             ccCccccccCCCHHHHHHHH
Q psy2838         321 YKCADCGRLFAQEHTLRTHT  340 (458)
Q Consensus       321 ~~C~~C~~~f~~~~~L~~H~  340 (458)
                      |.|..|++.|.+...|..|+
T Consensus         2 ~~C~~C~~~F~~~~~l~~H~   21 (27)
T PF13912_consen    2 FECDECGKTFSSLSALREHK   21 (27)
T ss_dssp             EEETTTTEEESSHHHHHHHH
T ss_pred             CCCCccCCccCChhHHHHHh
Confidence            33444444444444444443


No 43 
>PF13912 zf-C2H2_6:  C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.88  E-value=0.00058  Score=39.63  Aligned_cols=24  Identities=42%  Similarity=0.785  Sum_probs=14.3

Q ss_pred             cccccccccccCCHHHHHHHHHhh
Q psy2838         180 KFPCDQCGKSFTLKQNLTYHVNSI  203 (458)
Q Consensus       180 ~~~C~~C~~~f~~~~~L~~H~~~~  203 (458)
                      +|.|..|++.|.+...|..|++.+
T Consensus         1 ~~~C~~C~~~F~~~~~l~~H~~~h   24 (27)
T PF13912_consen    1 PFECDECGKTFSSLSALREHKRSH   24 (27)
T ss_dssp             SEEETTTTEEESSHHHHHHHHCTT
T ss_pred             CCCCCccCCccCChhHHHHHhHHh
Confidence            355666666666666666666543


No 44 
>PF13909 zf-H2C2_5:  C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.61  E-value=0.0013  Score=36.94  Aligned_cols=24  Identities=29%  Similarity=0.686  Sum_probs=14.7

Q ss_pred             cccCcCcccCCCHHHHHHHHHHhCC
Q psy2838         349 LTCRTCGLFCDSKRALTAHVREAHP  373 (458)
Q Consensus       349 ~~C~~C~~~f~~~~~L~~H~~~~h~  373 (458)
                      |+|+.|++... ...|.+|++.+|+
T Consensus         1 y~C~~C~y~t~-~~~l~~H~~~~H~   24 (24)
T PF13909_consen    1 YKCPHCSYSTS-KSNLKRHLKRHHP   24 (24)
T ss_dssp             EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred             CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence            45777777766 6677777776653


No 45 
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=96.56  E-value=0.0011  Score=61.06  Aligned_cols=70  Identities=20%  Similarity=0.357  Sum_probs=38.8

Q ss_pred             CCcccccc--cccccCCHHHHHHHHHhhcCCCc-cccccCccccCChHHHHHHHHHhcCCceeecCccccccCCchhhhh
Q psy2838         178 VKKFPCDQ--CGKSFTLKQNLTYHVNSIHLGVR-IKCDFCDKTFAHKPNLIAHMNSAHNGVRFKCDHCVKSFNQKPNLIS  254 (458)
Q Consensus       178 ~k~~~C~~--C~~~f~~~~~L~~H~~~~h~~~~-~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~  254 (458)
                      .|||+|++  |.|.|++...|+.|+..-|.... ..=+          .-..|.-.....+||+|..|+|.+.+...|+-
T Consensus       347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p----------~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKY  416 (423)
T COG5189         347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENP----------SPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKY  416 (423)
T ss_pred             CceecCCCCCchhhhccccchhhhhhccccCcccCCCC----------CccccccccccCCceeccccchhhccCcccee
Confidence            47888876  88888888888888762221111 0000          00001111233467777777777777777776


Q ss_pred             ccc
Q psy2838         255 HIK  257 (458)
Q Consensus       255 H~~  257 (458)
                      |+.
T Consensus       417 Hr~  419 (423)
T COG5189         417 HRK  419 (423)
T ss_pred             ccc
Confidence            654


No 46 
>KOG2482|consensus
Probab=96.50  E-value=0.0036  Score=58.37  Aligned_cols=51  Identities=24%  Similarity=0.337  Sum_probs=41.0

Q ss_pred             ccCccccccCCCHHHHHHHHhhccCC---------------------------CccccCcCcccCCCHHHHHHHHHHh
Q psy2838         321 YKCADCGRLFAQEHTLRTHTLSVHSG---------------------------VRLTCRTCGLFCDSKRALTAHVREA  371 (458)
Q Consensus       321 ~~C~~C~~~f~~~~~L~~H~~~~h~~---------------------------~~~~C~~C~~~f~~~~~L~~H~~~~  371 (458)
                      ..|-.|....-+...|..||...|.-                           ..-.|-.|.-.|.....|..||..+
T Consensus       280 v~CLfC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~e~  357 (423)
T KOG2482|consen  280 VVCLFCTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMVED  357 (423)
T ss_pred             eEEEeeccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhcccc
Confidence            58999998888899999999887730                           0123899999999999999998754


No 47 
>KOG2785|consensus
Probab=96.24  E-value=0.0073  Score=57.58  Aligned_cols=52  Identities=23%  Similarity=0.397  Sum_probs=41.2

Q ss_pred             cccCccccccCCCHHHHHHHHhhccC-----------------------CCccccCcCc---ccCCCHHHHHHHHHHh
Q psy2838         320 KYKCADCGRLFAQEHTLRTHTLSVHS-----------------------GVRLTCRTCG---LFCDSKRALTAHVREA  371 (458)
Q Consensus       320 ~~~C~~C~~~f~~~~~L~~H~~~~h~-----------------------~~~~~C~~C~---~~f~~~~~L~~H~~~~  371 (458)
                      |-.|-.|++.+.+...-..||..+|+                       +.-+.|-.|+   +.|.+....+.||+..
T Consensus       166 Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~~K  243 (390)
T KOG2785|consen  166 PTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMRDK  243 (390)
T ss_pred             CcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHhhc
Confidence            45678888888888888888877775                       1235699998   9999999999999864


No 48 
>PF09237 GAGA:  GAGA factor;  InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.10  E-value=0.0042  Score=41.01  Aligned_cols=28  Identities=29%  Similarity=0.443  Sum_probs=15.4

Q ss_pred             ccccCcCcccCCCHHHHHHHHHHhCCCC
Q psy2838         348 RLTCRTCGLFCDSKRALTAHVREAHPGC  375 (458)
Q Consensus       348 ~~~C~~C~~~f~~~~~L~~H~~~~h~~~  375 (458)
                      +..|++|+..+....+|++|+...|..+
T Consensus        24 PatCP~C~a~~~~srnLrRHle~~H~~k   51 (54)
T PF09237_consen   24 PATCPICGAVIRQSRNLRRHLEIRHFKK   51 (54)
T ss_dssp             -EE-TTT--EESSHHHHHHHHHHHTTTS
T ss_pred             CCCCCcchhhccchhhHHHHHHHHhccc
Confidence            4456666666666666666666666543


No 49 
>PF09237 GAGA:  GAGA factor;  InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.10  E-value=0.0029  Score=41.77  Aligned_cols=31  Identities=29%  Similarity=0.419  Sum_probs=16.8

Q ss_pred             cCccccCccccccCCCHHHHHHHHhhccCCC
Q psy2838         317 QAQKYKCADCGRLFAQEHTLRTHTLSVHSGV  347 (458)
Q Consensus       317 ~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~  347 (458)
                      .+.|-.|++|+..+.+..+|++|+...|..+
T Consensus        21 S~~PatCP~C~a~~~~srnLrRHle~~H~~k   51 (54)
T PF09237_consen   21 SEQPATCPICGAVIRQSRNLRRHLEIRHFKK   51 (54)
T ss_dssp             TS--EE-TTT--EESSHHHHHHHHHHHTTTS
T ss_pred             cCCCCCCCcchhhccchhhHHHHHHHHhccc
Confidence            3456667777777777777777766666543


No 50 
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.08  E-value=0.0057  Score=34.60  Aligned_cols=23  Identities=39%  Similarity=0.713  Sum_probs=17.0

Q ss_pred             cccCcCcccCCCHHHHHHHHHHh
Q psy2838         349 LTCRTCGLFCDSKRALTAHVREA  371 (458)
Q Consensus       349 ~~C~~C~~~f~~~~~L~~H~~~~  371 (458)
                      |.|..|++.|.....|..|++.|
T Consensus         1 ~~C~~C~~~f~~~~~l~~H~~~H   23 (26)
T smart00355        1 YRCPECGKVFKSKSALKEHMRTH   23 (26)
T ss_pred             CCCCCCcchhCCHHHHHHHHHHh
Confidence            45777788887777788777754


No 51 
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=95.98  E-value=0.0019  Score=65.27  Aligned_cols=144  Identities=22%  Similarity=0.317  Sum_probs=85.5

Q ss_pred             ccccccCccccCChHHHHHHHH--HhcCC--ceeecC--ccccccCCchhhhhccchhccCcceecCc--ccccccCHHH
Q psy2838         208 RIKCDFCDKTFAHKPNLIAHMN--SAHNG--VRFKCD--HCVKSFNQKPNLISHIKSVHEGIKYKCEE--CTKTFAQKQN  279 (458)
Q Consensus       208 ~~~C~~C~~~f~~~~~L~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~L~~H~~~~~~~~~~~C~~--C~~~f~~~~~  279 (458)
                      ++.|..|...|.....|..|.+  .|.++  +++.|+  .|++.|.....+..|...+....++.|..  |...+.....
T Consensus       289 ~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  368 (467)
T COG5048         289 PIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLLN  368 (467)
T ss_pred             CCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCccccccccC
Confidence            4677777777777777777777  67777  778887  68888888888888877776666555544  3333332222


Q ss_pred             H-----HHHHHHhhcCceeecCc--cCcccCChhHHHHHHhhhccCc--cccCccccccCCCHHHHHHHHhhccCCCccc
Q psy2838         280 L-----LSHVQAIHMGIKYTCEV--CSKSYNQVTNLLSHMTTVHQAQ--KYKCADCGRLFAQEHTLRTHTLSVHSGVRLT  350 (458)
Q Consensus       280 L-----~~H~~~~h~~~~~~C~~--C~~~f~~~~~L~~H~~~h~~~~--~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~  350 (458)
                      -     ............+.|..  |...+.....+..|...|....  .+.+..|.+.|.....+..|++.+.....+.
T Consensus       369 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  448 (467)
T COG5048         369 NEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAPLL  448 (467)
T ss_pred             CCCccchhhccCccCCccccccccchhhhhccccccccccccccccCCcCCCCCcchhhccCcccccccccccccCCcee
Confidence            1     01101112222233322  5556666666666666665544  3555677777777777777755544433333


Q ss_pred             c
Q psy2838         351 C  351 (458)
Q Consensus       351 C  351 (458)
                      |
T Consensus       449 ~  449 (467)
T COG5048         449 C  449 (467)
T ss_pred             e
Confidence            3


No 52 
>PF13909 zf-H2C2_5:  C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=95.95  E-value=0.0042  Score=34.84  Aligned_cols=23  Identities=30%  Similarity=0.645  Sum_probs=10.6

Q ss_pred             ccCccccccCCCHHHHHHHHhhcc
Q psy2838         321 YKCADCGRLFAQEHTLRTHTLSVH  344 (458)
Q Consensus       321 ~~C~~C~~~f~~~~~L~~H~~~~h  344 (458)
                      |+|+.|++... ...|.+|++.+|
T Consensus         1 y~C~~C~y~t~-~~~l~~H~~~~H   23 (24)
T PF13909_consen    1 YKCPHCSYSTS-KSNLKRHLKRHH   23 (24)
T ss_dssp             EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred             CCCCCCCCcCC-HHHHHHHHHhhC
Confidence            34555555554 555555554444


No 53 
>KOG2482|consensus
Probab=95.92  E-value=0.017  Score=54.12  Aligned_cols=22  Identities=32%  Similarity=0.600  Sum_probs=19.2

Q ss_pred             ccCccccccCCCHHHHHHHHhh
Q psy2838         321 YKCADCGRLFAQEHTLRTHTLS  342 (458)
Q Consensus       321 ~~C~~C~~~f~~~~~L~~H~~~  342 (458)
                      -.|-.|...|.....|..||..
T Consensus       335 ~~c~~cd~~F~~e~~l~~hm~e  356 (423)
T KOG2482|consen  335 SRCAECDLSFWKEPGLLIHMVE  356 (423)
T ss_pred             cccccccccccCcchhhhhccc
Confidence            4688899999999999999854


No 54 
>smart00355 ZnF_C2H2 zinc finger.
Probab=95.88  E-value=0.0063  Score=34.39  Aligned_cols=20  Identities=30%  Similarity=0.966  Sum_probs=9.3

Q ss_pred             cCccccccCCCHHHHHHHHh
Q psy2838         322 KCADCGRLFAQEHTLRTHTL  341 (458)
Q Consensus       322 ~C~~C~~~f~~~~~L~~H~~  341 (458)
                      .|..|++.|.....|..|++
T Consensus         2 ~C~~C~~~f~~~~~l~~H~~   21 (26)
T smart00355        2 RCPECGKVFKSKSALKEHMR   21 (26)
T ss_pred             CCCCCcchhCCHHHHHHHHH
Confidence            34444444444444444443


No 55 
>KOG2785|consensus
Probab=95.64  E-value=0.028  Score=53.71  Aligned_cols=50  Identities=26%  Similarity=0.472  Sum_probs=38.3

Q ss_pred             eecCccCcccCChhHHHHHHhhhccCc-----------------------cccCcccc---ccCCCHHHHHHHHhh
Q psy2838         293 YTCEVCSKSYNQVTNLLSHMTTVHQAQ-----------------------KYKCADCG---RLFAQEHTLRTHTLS  342 (458)
Q Consensus       293 ~~C~~C~~~f~~~~~L~~H~~~h~~~~-----------------------~~~C~~C~---~~f~~~~~L~~H~~~  342 (458)
                      -.|-.|++.+.+...-..||..+|+-.                       -|.|-.|+   +.|.+....+.||..
T Consensus       167 t~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~~  242 (390)
T KOG2785|consen  167 TDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMRD  242 (390)
T ss_pred             cceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHhh
Confidence            567777777777777777777776531                       27899998   999999999999865


No 56 
>PF12874 zf-met:  Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.64  E-value=0.008  Score=33.98  Aligned_cols=20  Identities=30%  Similarity=0.825  Sum_probs=9.2

Q ss_pred             ecCccCcccCChhHHHHHHh
Q psy2838         294 TCEVCSKSYNQVTNLLSHMT  313 (458)
Q Consensus       294 ~C~~C~~~f~~~~~L~~H~~  313 (458)
                      .|.+|++.|.+...|..|++
T Consensus         2 ~C~~C~~~f~s~~~~~~H~~   21 (25)
T PF12874_consen    2 YCDICNKSFSSENSLRQHLR   21 (25)
T ss_dssp             EETTTTEEESSHHHHHHHHT
T ss_pred             CCCCCCCCcCCHHHHHHHHC
Confidence            34444444444444444443


No 57 
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=95.63  E-value=0.002  Score=65.13  Aligned_cols=159  Identities=21%  Similarity=0.258  Sum_probs=114.4

Q ss_pred             CcccccccccccCCHHHHHHHHH--hhcCC--Cccccc--cCccccCChHHHHHHHHHhcCCceeecCc--cccccCCch
Q psy2838         179 KKFPCDQCGKSFTLKQNLTYHVN--SIHLG--VRIKCD--FCDKTFAHKPNLIAHMNSAHNGVRFKCDH--CVKSFNQKP  250 (458)
Q Consensus       179 k~~~C~~C~~~f~~~~~L~~H~~--~~h~~--~~~~C~--~C~~~f~~~~~L~~H~~~h~~~~~~~C~~--C~~~f~~~~  250 (458)
                      .++.|..|...|.....|..|.+  .|..+  .++.|+  .|++.|.....+..|...|.+..++.|..  |...+....
T Consensus       288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  367 (467)
T COG5048         288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLL  367 (467)
T ss_pred             cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCcccccccc
Confidence            57999999999999999999999  67777  789999  79999999999999999999988887765  444444433


Q ss_pred             hhhhccc-----hhccCcceecCc--ccccccCHHHHHHHHHHhhcCc--eeecCccCcccCChhHHHHHHhhhccCccc
Q psy2838         251 NLISHIK-----SVHEGIKYKCEE--CTKTFAQKQNLLSHVQAIHMGI--KYTCEVCSKSYNQVTNLLSHMTTVHQAQKY  321 (458)
Q Consensus       251 ~L~~H~~-----~~~~~~~~~C~~--C~~~f~~~~~L~~H~~~~h~~~--~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~  321 (458)
                      .-..+..     .......+.|..  |...+.....+..|...+-...  .+.+..|.+.|.....+..|++.|....++
T Consensus       368 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  447 (467)
T COG5048         368 NNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAPL  447 (467)
T ss_pred             CCCCccchhhccCccCCccccccccchhhhhccccccccccccccccCCcCCCCCcchhhccCcccccccccccccCCce
Confidence            3211111     112223444433  6677777777777764444433  378888999999999999999999887766


Q ss_pred             cCccccccCCCHHHHHH
Q psy2838         322 KCADCGRLFAQEHTLRT  338 (458)
Q Consensus       322 ~C~~C~~~f~~~~~L~~  338 (458)
                      .|..+. .|.....+..
T Consensus       448 ~~~~~~-~~~~~~~~~~  463 (467)
T COG5048         448 LCSILK-SFRRDLDLSN  463 (467)
T ss_pred             eecccc-ccchhhhhhc
Confidence            665544 3444444433


No 58 
>PF12874 zf-met:  Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.61  E-value=0.0058  Score=34.59  Aligned_cols=22  Identities=27%  Similarity=0.584  Sum_probs=13.0

Q ss_pred             cccCcCcccCCCHHHHHHHHHH
Q psy2838         349 LTCRTCGLFCDSKRALTAHVRE  370 (458)
Q Consensus       349 ~~C~~C~~~f~~~~~L~~H~~~  370 (458)
                      |.|.+|++.|.+...|..|++.
T Consensus         1 ~~C~~C~~~f~s~~~~~~H~~s   22 (25)
T PF12874_consen    1 FYCDICNKSFSSENSLRQHLRS   22 (25)
T ss_dssp             EEETTTTEEESSHHHHHHHHTT
T ss_pred             CCCCCCCCCcCCHHHHHHHHCc
Confidence            3466666666666666666543


No 59 
>PRK04860 hypothetical protein; Provisional
Probab=95.19  E-value=0.012  Score=50.38  Aligned_cols=37  Identities=24%  Similarity=0.662  Sum_probs=24.9

Q ss_pred             eeecCccCcccCChhHHHHHHhhhccCccccCccccccCCC
Q psy2838         292 KYTCEVCSKSYNQVTNLLSHMTTVHQAQKYKCADCGRLFAQ  332 (458)
Q Consensus       292 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~  332 (458)
                      +|.|. |+.   ....+.+|.++|.++++|.|..|+..|..
T Consensus       119 ~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~  155 (160)
T PRK04860        119 PYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVF  155 (160)
T ss_pred             EEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEE
Confidence            36776 765   45566777777777777777777766654


No 60 
>PF12171 zf-C2H2_jaz:  Zinc-finger double-stranded RNA-binding;  InterPro: IPR022755  This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation.   This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=94.97  E-value=0.0062  Score=35.27  Aligned_cols=21  Identities=24%  Similarity=0.697  Sum_probs=11.3

Q ss_pred             cccCcCcccCCCHHHHHHHHH
Q psy2838         349 LTCRTCGLFCDSKRALTAHVR  369 (458)
Q Consensus       349 ~~C~~C~~~f~~~~~L~~H~~  369 (458)
                      |+|..|++.|.+...|..|++
T Consensus         2 ~~C~~C~k~f~~~~~~~~H~~   22 (27)
T PF12171_consen    2 FYCDACDKYFSSENQLKQHMK   22 (27)
T ss_dssp             CBBTTTTBBBSSHHHHHCCTT
T ss_pred             CCcccCCCCcCCHHHHHHHHc
Confidence            445555555555555555543


No 61 
>PRK04860 hypothetical protein; Provisional
Probab=94.86  E-value=0.017  Score=49.48  Aligned_cols=26  Identities=12%  Similarity=0.371  Sum_probs=12.3

Q ss_pred             HHHHHHHHHhcCCceeecCccccccC
Q psy2838         222 PNLIAHMNSAHNGVRFKCDHCVKSFN  247 (458)
Q Consensus       222 ~~L~~H~~~h~~~~~~~C~~C~~~f~  247 (458)
                      ..+.+|.++|.++++|.|..|+..|.
T Consensus       129 ~~~rrH~ri~~g~~~YrC~~C~~~l~  154 (160)
T PRK04860        129 LTVRRHNRVVRGEAVYRCRRCGETLV  154 (160)
T ss_pred             CHHHHHHHHhcCCccEECCCCCceeE
Confidence            34444444444444444444444443


No 62 
>PF12171 zf-C2H2_jaz:  Zinc-finger double-stranded RNA-binding;  InterPro: IPR022755  This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation.   This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=94.38  E-value=0.017  Score=33.39  Aligned_cols=21  Identities=29%  Similarity=0.774  Sum_probs=11.2

Q ss_pred             eecCccCcccCChhHHHHHHh
Q psy2838         293 YTCEVCSKSYNQVTNLLSHMT  313 (458)
Q Consensus       293 ~~C~~C~~~f~~~~~L~~H~~  313 (458)
                      |.|..|++.|.+...|..|++
T Consensus         2 ~~C~~C~k~f~~~~~~~~H~~   22 (27)
T PF12171_consen    2 FYCDACDKYFSSENQLKQHMK   22 (27)
T ss_dssp             CBBTTTTBBBSSHHHHHCCTT
T ss_pred             CCcccCCCCcCCHHHHHHHHc
Confidence            445555555555555555554


No 63 
>KOG4173|consensus
Probab=94.16  E-value=0.021  Score=49.50  Aligned_cols=29  Identities=21%  Similarity=0.592  Sum_probs=20.1

Q ss_pred             cCccccC--ccccccCCCHHHHHHHHhhccC
Q psy2838         317 QAQKYKC--ADCGRLFAQEHTLRTHTLSVHS  345 (458)
Q Consensus       317 ~~~~~~C--~~C~~~f~~~~~L~~H~~~~h~  345 (458)
                      |..-|+|  ..|+..|.+...-+.|+...|.
T Consensus       141 G~dMy~ClvEgCt~KFkT~r~RkdH~I~~Hk  171 (253)
T KOG4173|consen  141 GQDMYQCLVEGCTEKFKTSRDRKDHMIRMHK  171 (253)
T ss_pred             CccHHHHHHHhhhhhhhhhhhhhhHHHHhcc
Confidence            3455777  4588888887777777766664


No 64 
>KOG2893|consensus
Probab=94.14  E-value=0.02  Score=50.75  Aligned_cols=49  Identities=35%  Similarity=0.682  Sum_probs=32.9

Q ss_pred             cccccCccccCChHHHHHHHHHhcCCceeecCccccccCCchhhhhccchhcc
Q psy2838         209 IKCDFCDKTFAHKPNLIAHMNSAHNGVRFKCDHCVKSFNQKPNLISHIKSVHE  261 (458)
Q Consensus       209 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~~~~  261 (458)
                      -.|.+|++.|....-|.+|++..|    |+|.+|.+...+--.|..|....|.
T Consensus        11 pwcwycnrefddekiliqhqkakh----fkchichkkl~sgpglsihcmqvhk   59 (341)
T KOG2893|consen   11 PWCWYCNREFDDEKILIQHQKAKH----FKCHICHKKLFSGPGLSIHCMQVHK   59 (341)
T ss_pred             ceeeecccccchhhhhhhhhhhcc----ceeeeehhhhccCCCceeehhhhhh
Confidence            357777777777777777776543    6777777776666667776654443


No 65 
>KOG4173|consensus
Probab=93.06  E-value=0.031  Score=48.49  Aligned_cols=52  Identities=29%  Similarity=0.606  Sum_probs=40.9

Q ss_pred             CCcccccc--cccccCCHHHHHHHHHhhcCCCccccccCccccCChHHHHHHHHHhc
Q psy2838         178 VKKFPCDQ--CGKSFTLKQNLTYHVNSIHLGVRIKCDFCDKTFAHKPNLIAHMNSAH  232 (458)
Q Consensus       178 ~k~~~C~~--C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f~~~~~L~~H~~~h~  232 (458)
                      ...|.|++  |...|.....+..|..+.|+.   .|.+|.+.|.+...|..|+...|
T Consensus        77 ~~~~~cqvagc~~~~d~lD~~E~hY~~~h~~---sCs~C~r~~Pt~hLLd~HI~E~H  130 (253)
T KOG4173|consen   77 VPAFACQVAGCCQVFDALDDYEHHYHTLHGN---SCSFCKRAFPTGHLLDAHILEWH  130 (253)
T ss_pred             cccccccccchHHHHhhhhhHHHhhhhcccc---hhHHHHHhCCchhhhhHHHHHHH
Confidence            45688886  888888888888887655553   59999999999999999987655


No 66 
>cd07765 KRAB_A-box KRAB (Kruppel-associated box) domain -A box. The KRAB domain is a transcription repression module, found in a subgroup of the zinc finger proteins (ZFPs) of the C2H2 family, KRAB-ZFPs. KRAB-ZFPs comprise the largest group of transcriptional regulators in mammals, and are only found in tetrapods. These proteins have been shown to play important roles in cell differentiation and organ development, and in regulating viral replication and transcription. A KRAB domain may consist of an A-box, or of an A-box plus either a B-box, a divergent B-box (b), or a C-box. Only the A-box is included in this model. The A-box is needed for repression, the B- and C- boxes are not. KRAB-ZFPs have one or two KRAB domains at their amino-terminal end, and multiple C2H2 zinc finger motifs at their C-termini. Some KRAB-ZFPs also contain a SCAN domain which mediates homo- and hetero-oligomerization. The KRAB domain is a protein-protein interaction module which represses transcription through 
Probab=93.06  E-value=0.0079  Score=36.21  Aligned_cols=29  Identities=7%  Similarity=-0.015  Sum_probs=26.4

Q ss_pred             ceeeecchhHHHHHHHhcCCCcCCCCCCC
Q psy2838          43 YNYYQLSSDEKFEILKSISASFTPGYGGF   71 (458)
Q Consensus        43 ~~~~~~~~~~~~~~~~~~~~~~~~~y~~~   71 (458)
                      ++|.||++.|+.++|.++.++++.+|+++
T Consensus         1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   29 (40)
T cd07765           1 VTFEDVAVYFSQEEWELLDPAQRDLYRDV   29 (40)
T ss_pred             CcceeeeeecCHHHHhcCCHHHHHHHHHH
Confidence            46899999999999999999999998876


No 67 
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=92.47  E-value=0.14  Score=40.57  Aligned_cols=23  Identities=22%  Similarity=0.576  Sum_probs=11.7

Q ss_pred             eecCccCcccCChhHHHHHHhhh
Q psy2838         293 YTCEVCSKSYNQVTNLLSHMTTV  315 (458)
Q Consensus       293 ~~C~~C~~~f~~~~~L~~H~~~h  315 (458)
                      |.|+.|...|-..-.+..|...|
T Consensus        82 y~C~~C~~~FC~dCD~fiHe~Lh  104 (112)
T TIGR00622        82 YVCAVCKNVFCVDCDVFVHESLH  104 (112)
T ss_pred             eeCCCCCCccccccchhhhhhcc
Confidence            55555555555544444554443


No 68 
>KOG2893|consensus
Probab=92.47  E-value=0.028  Score=49.85  Aligned_cols=46  Identities=24%  Similarity=0.584  Sum_probs=25.6

Q ss_pred             cCccCcccCChhHHHHHHhhhccCccccCccccccCCCHHHHHHHHhhcc
Q psy2838         295 CEVCSKSYNQVTNLLSHMTTVHQAQKYKCADCGRLFAQEHTLRTHTLSVH  344 (458)
Q Consensus       295 C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~~h  344 (458)
                      |-+|++.|.....|.+|++..    -|+|.+|.+...+--.|..|-..+|
T Consensus        13 cwycnrefddekiliqhqkak----hfkchichkkl~sgpglsihcmqvh   58 (341)
T KOG2893|consen   13 CWYCNREFDDEKILIQHQKAK----HFKCHICHKKLFSGPGLSIHCMQVH   58 (341)
T ss_pred             eeecccccchhhhhhhhhhhc----cceeeeehhhhccCCCceeehhhhh
Confidence            556666666666666665543    2666666665555545555543333


No 69 
>PF12013 DUF3505:  Protein of unknown function (DUF3505);  InterPro: IPR022698  This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains. 
Probab=91.78  E-value=0.26  Score=39.48  Aligned_cols=25  Identities=32%  Similarity=0.707  Sum_probs=19.9

Q ss_pred             ccc----CcCcccCCCHHHHHHHHHHhCC
Q psy2838         349 LTC----RTCGLFCDSKRALTAHVREAHP  373 (458)
Q Consensus       349 ~~C----~~C~~~f~~~~~L~~H~~~~h~  373 (458)
                      |.|    ..|++.+.+...|.+|++.+|+
T Consensus        81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg  109 (109)
T PF12013_consen   81 YRCQCDPPHCGYITRSKKTMRKHWRKEHG  109 (109)
T ss_pred             eeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence            668    8888888888888888888774


No 70 
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=91.15  E-value=0.17  Score=31.09  Aligned_cols=23  Identities=17%  Similarity=0.535  Sum_probs=15.5

Q ss_pred             ccccCcCcccCCCHHHHHHHHHH
Q psy2838         348 RLTCRTCGLFCDSKRALTAHVRE  370 (458)
Q Consensus       348 ~~~C~~C~~~f~~~~~L~~H~~~  370 (458)
                      +|.|.+|++.|.+...+..|++.
T Consensus         3 ~~~C~~C~~~~~~~~~~~~H~~g   25 (35)
T smart00451        3 GFYCKLCNVTFTDEISVEAHLKG   25 (35)
T ss_pred             CeEccccCCccCCHHHHHHHHCh
Confidence            45677777777777777777654


No 71 
>PF13913 zf-C2HC_2:  zinc-finger of a C2HC-type
Probab=90.90  E-value=0.21  Score=28.20  Aligned_cols=19  Identities=26%  Similarity=0.555  Sum_probs=10.5

Q ss_pred             ccCcCcccCCCHHHHHHHHH
Q psy2838         350 TCRTCGLFCDSKRALTAHVR  369 (458)
Q Consensus       350 ~C~~C~~~f~~~~~L~~H~~  369 (458)
                      .|+.||+.| ....|.+|++
T Consensus         4 ~C~~CgR~F-~~~~l~~H~~   22 (25)
T PF13913_consen    4 PCPICGRKF-NPDRLEKHEK   22 (25)
T ss_pred             cCCCCCCEE-CHHHHHHHHH
Confidence            356666666 4455555554


No 72 
>PF13913 zf-C2HC_2:  zinc-finger of a C2HC-type
Probab=90.37  E-value=0.25  Score=27.88  Aligned_cols=18  Identities=28%  Similarity=0.698  Sum_probs=8.2

Q ss_pred             ccccCccccCChHHHHHHH
Q psy2838         210 KCDFCDKTFAHKPNLIAHM  228 (458)
Q Consensus       210 ~C~~C~~~f~~~~~L~~H~  228 (458)
                      .|+.|++.| ....|.+|+
T Consensus         4 ~C~~CgR~F-~~~~l~~H~   21 (25)
T PF13913_consen    4 PCPICGRKF-NPDRLEKHE   21 (25)
T ss_pred             cCCCCCCEE-CHHHHHHHH
Confidence            344444444 334444444


No 73 
>PF12013 DUF3505:  Protein of unknown function (DUF3505);  InterPro: IPR022698  This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains. 
Probab=89.49  E-value=0.45  Score=38.09  Aligned_cols=24  Identities=17%  Similarity=0.505  Sum_probs=17.0

Q ss_pred             cccccCccccCChHHHHHHHHHhcC
Q psy2838         209 IKCDFCDKTFAHKPNLIAHMNSAHN  233 (458)
Q Consensus       209 ~~C~~C~~~f~~~~~L~~H~~~h~~  233 (458)
                      ..|..|+....- +++..|++..|.
T Consensus        12 lIC~~C~~av~~-~~v~~HL~~~H~   35 (109)
T PF12013_consen   12 LICRQCQYAVQP-SEVESHLRKRHH   35 (109)
T ss_pred             EEeCCCCcccCc-hHHHHHHHHhcc
Confidence            578888877654 788888885443


No 74 
>PF09538 FYDLN_acid:  Protein of unknown function (FYDLN_acid);  InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=89.29  E-value=0.74  Score=36.60  Aligned_cols=29  Identities=24%  Similarity=0.581  Sum_probs=15.2

Q ss_pred             ccCccccccCCCHHHHHHHHhhccCCCccccCcCcccCCC
Q psy2838         321 YKCADCGRLFAQEHTLRTHTLSVHSGVRLTCRTCGLFCDS  360 (458)
Q Consensus       321 ~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f~~  360 (458)
                      ..|+.||++|-...           ..+..|+.||..|.-
T Consensus        10 R~Cp~CG~kFYDLn-----------k~PivCP~CG~~~~~   38 (108)
T PF09538_consen   10 RTCPSCGAKFYDLN-----------KDPIVCPKCGTEFPP   38 (108)
T ss_pred             ccCCCCcchhccCC-----------CCCccCCCCCCccCc
Confidence            45666666554321           134556666666543


No 75 
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=88.01  E-value=0.43  Score=29.16  Aligned_cols=22  Identities=14%  Similarity=0.647  Sum_probs=13.3

Q ss_pred             eecCccCcccCChhHHHHHHhh
Q psy2838         293 YTCEVCSKSYNQVTNLLSHMTT  314 (458)
Q Consensus       293 ~~C~~C~~~f~~~~~L~~H~~~  314 (458)
                      |.|.+|++.|.+...+..|++.
T Consensus         4 ~~C~~C~~~~~~~~~~~~H~~g   25 (35)
T smart00451        4 FYCKLCNVTFTDEISVEAHLKG   25 (35)
T ss_pred             eEccccCCccCCHHHHHHHHCh
Confidence            5566666666666666666543


No 76 
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=87.64  E-value=0.25  Score=33.31  Aligned_cols=27  Identities=30%  Similarity=0.623  Sum_probs=19.7

Q ss_pred             CccccCcCcccCCCHHHHHHHHHHhCC
Q psy2838         347 VRLTCRTCGLFCDSKRALTAHVREAHP  373 (458)
Q Consensus       347 ~~~~C~~C~~~f~~~~~L~~H~~~~h~  373 (458)
                      .-+.|+-|+..|.......+|+.+.|+
T Consensus        16 ~~lrCPRC~~~FR~~K~Y~RHVNKaH~   42 (65)
T COG4049          16 EFLRCPRCGMVFRRRKDYIRHVNKAHG   42 (65)
T ss_pred             eeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence            345577777777777777777777775


No 77 
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer.  Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain.  Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=87.21  E-value=0.36  Score=29.35  Aligned_cols=11  Identities=45%  Similarity=1.216  Sum_probs=5.6

Q ss_pred             eecCccCcccC
Q psy2838         293 YTCEVCSKSYN  303 (458)
Q Consensus       293 ~~C~~C~~~f~  303 (458)
                      |+|.+||..+.
T Consensus         2 ~~C~~CGy~y~   12 (33)
T cd00350           2 YVCPVCGYIYD   12 (33)
T ss_pred             EECCCCCCEEC
Confidence            45555555443


No 78 
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=85.98  E-value=1.3  Score=35.22  Aligned_cols=45  Identities=20%  Similarity=0.478  Sum_probs=26.3

Q ss_pred             ecCccccccCCchhhhhccchhccCcceecCcccccccCHHHHHHHH
Q psy2838         238 KCDHCVKSFNQKPNLISHIKSVHEGIKYKCEECTKTFAQKQNLLSHV  284 (458)
Q Consensus       238 ~C~~C~~~f~~~~~L~~H~~~~~~~~~~~C~~C~~~f~~~~~L~~H~  284 (458)
                      .|--|...|........-  .......|.|+.|...|-..-++-.|.
T Consensus        57 ~C~~C~~~f~~~~~~~~~--~~~~~~~y~C~~C~~~FC~dCD~fiHe  101 (112)
T TIGR00622        57 FCFGCQGPFPKPPVSPFD--ELKDSHRYVCAVCKNVFCVDCDVFVHE  101 (112)
T ss_pred             cccCcCCCCCCccccccc--ccccccceeCCCCCCccccccchhhhh
Confidence            377777777654311100  012234678888888887777777775


No 79 
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=85.52  E-value=0.34  Score=32.66  Aligned_cols=30  Identities=20%  Similarity=0.513  Sum_probs=22.9

Q ss_pred             ccCccccCccccccCCCHHHHHHHHhhccC
Q psy2838         316 HQAQKYKCADCGRLFAQEHTLRTHTLSVHS  345 (458)
Q Consensus       316 ~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~  345 (458)
                      .++.-+.|+.|+..|....+..+|+.+.|+
T Consensus        13 DGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~   42 (65)
T COG4049          13 DGEEFLRCPRCGMVFRRRKDYIRHVNKAHG   42 (65)
T ss_pred             CCceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence            355667888888888888888888776664


No 80 
>KOG2186|consensus
Probab=85.01  E-value=0.36  Score=43.66  Aligned_cols=46  Identities=30%  Similarity=0.705  Sum_probs=26.5

Q ss_pred             cccccCccccCChHHHHHHHHHhcCCceeecCccccccCCchhhhhccc
Q psy2838         209 IKCDFCDKTFAHKPNLIAHMNSAHNGVRFKCDHCVKSFNQKPNLISHIK  257 (458)
Q Consensus       209 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~  257 (458)
                      |.|..||.... +..|.+|+-..++ ..|.|-.|++.|.. ..+..|..
T Consensus         4 FtCnvCgEsvK-Kp~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~k   49 (276)
T KOG2186|consen    4 FTCNVCGESVK-KPQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTK   49 (276)
T ss_pred             Eehhhhhhhcc-ccchHHHHHhccC-CeeEEeeccccccc-chhhhhhh
Confidence            55666666543 3455556665555 45666666666655 45555554


No 81 
>PF09538 FYDLN_acid:  Protein of unknown function (FYDLN_acid);  InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=84.21  E-value=0.57  Score=37.25  Aligned_cols=34  Identities=29%  Similarity=0.753  Sum_probs=27.4

Q ss_pred             cCceeecCccCcccCChhHHHHHHhhhccCccccCccccccCCCH
Q psy2838         289 MGIKYTCEVCSKSYNQVTNLLSHMTTVHQAQKYKCADCGRLFAQE  333 (458)
Q Consensus       289 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~  333 (458)
                      .+.+..|+.||+.|....           ..|..|++||..|.-.
T Consensus         6 lGtKR~Cp~CG~kFYDLn-----------k~PivCP~CG~~~~~~   39 (108)
T PF09538_consen    6 LGTKRTCPSCGAKFYDLN-----------KDPIVCPKCGTEFPPE   39 (108)
T ss_pred             cCCcccCCCCcchhccCC-----------CCCccCCCCCCccCcc
Confidence            355689999999997642           3788999999998776


No 82 
>KOG0943|consensus
Probab=81.96  E-value=1.5  Score=48.23  Aligned_cols=10  Identities=40%  Similarity=1.115  Sum_probs=4.9

Q ss_pred             CCcccccccc
Q psy2838         178 VKKFPCDQCG  187 (458)
Q Consensus       178 ~k~~~C~~C~  187 (458)
                      ..-|.|..||
T Consensus      1255 QDIfECkTCG 1264 (3015)
T KOG0943|consen 1255 QDIFECKTCG 1264 (3015)
T ss_pred             chhhhhcccc
Confidence            3345555554


No 83 
>PF02892 zf-BED:  BED zinc finger;  InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=80.78  E-value=1.1  Score=29.30  Aligned_cols=24  Identities=25%  Similarity=0.628  Sum_probs=12.3

Q ss_pred             cccCcCcccCCCH----HHHHHHHHHhC
Q psy2838         349 LTCRTCGLFCDSK----RALTAHVREAH  372 (458)
Q Consensus       349 ~~C~~C~~~f~~~----~~L~~H~~~~h  372 (458)
                      ..|.+|++.+...    +.|.+|++..|
T Consensus        17 a~C~~C~~~~~~~~~~ts~l~~HL~~~h   44 (45)
T PF02892_consen   17 AKCKYCGKVIKYSSGGTSNLKRHLKKKH   44 (45)
T ss_dssp             EEETTTTEE-----SSTHHHHHHHHHTT
T ss_pred             EEeCCCCeEEeeCCCcHHHHHHhhhhhC
Confidence            4466666666553    56666665544


No 84 
>KOG1832|consensus
Probab=80.73  E-value=0.72  Score=49.15  Aligned_cols=14  Identities=14%  Similarity=0.356  Sum_probs=8.2

Q ss_pred             CcCcccCCCHHHHH
Q psy2838         352 RTCGLFCDSKRALT  365 (458)
Q Consensus       352 ~~C~~~f~~~~~L~  365 (458)
                      ..|...|.+.....
T Consensus      1303 dqc~VtFNstG~Vm 1316 (1516)
T KOG1832|consen 1303 DQCAVTFNSTGDVM 1316 (1516)
T ss_pred             cceEEEeccCccch
Confidence            45667776655433


No 85 
>PF09986 DUF2225:  Uncharacterized protein conserved in bacteria (DUF2225);  InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=78.72  E-value=0.47  Score=43.05  Aligned_cols=13  Identities=38%  Similarity=0.915  Sum_probs=6.5

Q ss_pred             eecCccCcccCCh
Q psy2838         293 YTCEVCSKSYNQV  305 (458)
Q Consensus       293 ~~C~~C~~~f~~~  305 (458)
                      +.|++|++.|.+.
T Consensus         6 ~~CPvC~~~F~~~   18 (214)
T PF09986_consen    6 ITCPVCGKEFKTK   18 (214)
T ss_pred             eECCCCCCeeeee
Confidence            4455555555443


No 86 
>KOG2186|consensus
Probab=78.71  E-value=1  Score=40.78  Aligned_cols=55  Identities=20%  Similarity=0.545  Sum_probs=36.6

Q ss_pred             ceecCcccccccCHHHHHHHHHHhhcCceeecCccCcccCChhHHHHHHhhhccCccc
Q psy2838         264 KYKCEECTKTFAQKQNLLSHVQAIHMGIKYTCEVCSKSYNQVTNLLSHMTTVHQAQKY  321 (458)
Q Consensus       264 ~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~  321 (458)
                      -|.|..||....- ..+.+|+-+.++ .-|.|-.|++.|.. ..+..|..--+....|
T Consensus         3 ~FtCnvCgEsvKK-p~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~kCITEaQKY   57 (276)
T KOG2186|consen    3 FFTCNVCGESVKK-PQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTKCITEAQKY   57 (276)
T ss_pred             EEehhhhhhhccc-cchHHHHHhccC-CeeEEeeccccccc-chhhhhhhhcchHHHh
Confidence            3778888877653 456678755555 56888888888877 5667776655544444


No 87 
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=74.50  E-value=1.8  Score=34.97  Aligned_cols=14  Identities=7%  Similarity=-0.092  Sum_probs=7.8

Q ss_pred             ccccCcCcccCCCH
Q psy2838         348 RLTCRTCGLFCDSK  361 (458)
Q Consensus       348 ~~~C~~C~~~f~~~  361 (458)
                      +..|+.||..|.-.
T Consensus        26 p~vcP~cg~~~~~~   39 (129)
T TIGR02300        26 PAVSPYTGEQFPPE   39 (129)
T ss_pred             CccCCCcCCccCcc
Confidence            45566666665443


No 88 
>PF09986 DUF2225:  Uncharacterized protein conserved in bacteria (DUF2225);  InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=73.38  E-value=1.7  Score=39.34  Aligned_cols=23  Identities=26%  Similarity=0.499  Sum_probs=14.2

Q ss_pred             ceeecCccccccCCchhhhhccc
Q psy2838         235 VRFKCDHCVKSFNQKPNLISHIK  257 (458)
Q Consensus       235 ~~~~C~~C~~~f~~~~~L~~H~~  257 (458)
                      +.+.|++|+..|.++.-+....+
T Consensus         4 k~~~CPvC~~~F~~~~vrs~~~r   26 (214)
T PF09986_consen    4 KKITCPVCGKEFKTKKVRSGKIR   26 (214)
T ss_pred             CceECCCCCCeeeeeEEEcCCce
Confidence            45677777777776655444443


No 89 
>PF04959 ARS2:  Arsenite-resistance protein 2;  InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=73.16  E-value=1.5  Score=39.47  Aligned_cols=26  Identities=35%  Similarity=0.839  Sum_probs=13.2

Q ss_pred             cccCccccccCCCHHHHHHHHhhccC
Q psy2838         320 KYKCADCGRLFAQEHTLRTHTLSVHS  345 (458)
Q Consensus       320 ~~~C~~C~~~f~~~~~L~~H~~~~h~  345 (458)
                      .|.|+.|+|.|.-..-.++|+.+.|.
T Consensus        77 K~~C~lc~KlFkg~eFV~KHI~nKH~  102 (214)
T PF04959_consen   77 KWRCPLCGKLFKGPEFVRKHIFNKHP  102 (214)
T ss_dssp             EEEE-SSS-EESSHHHHHHHHHHH-H
T ss_pred             EECCCCCCcccCChHHHHHHHhhcCH
Confidence            35555555555555555555555554


No 90 
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and  believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=72.62  E-value=2.2  Score=26.12  Aligned_cols=10  Identities=30%  Similarity=0.810  Sum_probs=5.1

Q ss_pred             eecCccCccc
Q psy2838         293 YTCEVCSKSY  302 (458)
Q Consensus       293 ~~C~~C~~~f  302 (458)
                      |+|.+||..+
T Consensus         3 ~~C~~CG~i~   12 (34)
T cd00729           3 WVCPVCGYIH   12 (34)
T ss_pred             EECCCCCCEe
Confidence            4555555443


No 91 
>PF13717 zinc_ribbon_4:  zinc-ribbon domain
Probab=72.57  E-value=3.3  Score=25.69  Aligned_cols=10  Identities=20%  Similarity=0.926  Sum_probs=4.4

Q ss_pred             ecCccccccC
Q psy2838         238 KCDHCVKSFN  247 (458)
Q Consensus       238 ~C~~C~~~f~  247 (458)
                      .|+.|+..|.
T Consensus         4 ~Cp~C~~~y~   13 (36)
T PF13717_consen    4 TCPNCQAKYE   13 (36)
T ss_pred             ECCCCCCEEe
Confidence            3444444443


No 92 
>PF13719 zinc_ribbon_5:  zinc-ribbon domain
Probab=72.41  E-value=3.2  Score=25.87  Aligned_cols=9  Identities=33%  Similarity=1.080  Sum_probs=3.8

Q ss_pred             cCccccccC
Q psy2838         239 CDHCVKSFN  247 (458)
Q Consensus       239 C~~C~~~f~  247 (458)
                      |+.|+..|.
T Consensus         5 CP~C~~~f~   13 (37)
T PF13719_consen    5 CPNCQTRFR   13 (37)
T ss_pred             CCCCCceEE
Confidence            444444443


No 93 
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=72.38  E-value=7.5  Score=31.55  Aligned_cols=31  Identities=13%  Similarity=0.011  Sum_probs=22.7

Q ss_pred             CCccccCcCcccCCCHHHHHHHHHHhCCCCCcccCCCCCCCC
Q psy2838         346 GVRLTCRTCGLFCDSKRALTAHVREAHPGCEYKKRGRGRPKL  387 (458)
Q Consensus       346 ~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~~~~~~~~~~  387 (458)
                      +.+..|+.||++|...           ...|..|+.||....
T Consensus         7 GtKr~Cp~cg~kFYDL-----------nk~p~vcP~cg~~~~   37 (129)
T TIGR02300         7 GTKRICPNTGSKFYDL-----------NRRPAVSPYTGEQFP   37 (129)
T ss_pred             CccccCCCcCcccccc-----------CCCCccCCCcCCccC
Confidence            4466799999999753           235788999987543


No 94 
>PF10571 UPF0547:  Uncharacterised protein family UPF0547;  InterPro: IPR018886  This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases. 
Probab=72.28  E-value=2.5  Score=24.07  Aligned_cols=10  Identities=30%  Similarity=0.786  Sum_probs=5.9

Q ss_pred             ccCcCcccCC
Q psy2838         350 TCRTCGLFCD  359 (458)
Q Consensus       350 ~C~~C~~~f~  359 (458)
                      .|+.||+.|.
T Consensus        16 ~Cp~CG~~F~   25 (26)
T PF10571_consen   16 FCPHCGYDFE   25 (26)
T ss_pred             cCCCCCCCCc
Confidence            3666666653


No 95 
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=71.75  E-value=3.1  Score=28.72  Aligned_cols=9  Identities=33%  Similarity=1.099  Sum_probs=4.4

Q ss_pred             ccccCcCcc
Q psy2838         348 RLTCRTCGL  356 (458)
Q Consensus       348 ~~~C~~C~~  356 (458)
                      +|.|+.||+
T Consensus        50 ~Y~Cp~CGF   58 (61)
T COG2888          50 PYRCPKCGF   58 (61)
T ss_pred             ceECCCcCc
Confidence            455555543


No 96 
>KOG4124|consensus
Probab=69.80  E-value=0.74  Score=43.57  Aligned_cols=51  Identities=12%  Similarity=0.015  Sum_probs=33.5

Q ss_pred             CCcccccc--CccccCChHHHHHH-----HHHhcCCceeecCccccccCCchhhhhccc
Q psy2838         206 GVRIKCDF--CDKTFAHKPNLIAH-----MNSAHNGVRFKCDHCVKSFNQKPNLISHIK  257 (458)
Q Consensus       206 ~~~~~C~~--C~~~f~~~~~L~~H-----~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~  257 (458)
                      +.++.|..  |.+..........|     +-.-+..+||+|+ |++.+.++..|+.|-.
T Consensus       176 E~~~S~~vp~~~~~~~~~Ns~~~~S~~~~~T~~t~~~p~k~~-~~~~~~T~~~l~~HS~  233 (442)
T KOG4124|consen  176 EYRVSVVVPAAAAAAAAANSSDMSSDEASSTAETTGTPKKMP-ESLVMDTSSPLSDHSM  233 (442)
T ss_pred             cccccccCchhhhhhhccccccccccccccccccccCCccCc-ccccccccchhhhccc
Confidence            45577765  66665544444333     3333456788887 8899999888888865


No 97 
>PF02892 zf-BED:  BED zinc finger;  InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=69.42  E-value=3.3  Score=26.88  Aligned_cols=24  Identities=29%  Similarity=0.624  Sum_probs=11.2

Q ss_pred             ccccCccccccCCC----HHHHHHHHhh
Q psy2838         319 QKYKCADCGRLFAQ----EHTLRTHTLS  342 (458)
Q Consensus       319 ~~~~C~~C~~~f~~----~~~L~~H~~~  342 (458)
                      ...+|..|++.+..    .+.|.+|++.
T Consensus        15 ~~a~C~~C~~~~~~~~~~ts~l~~HL~~   42 (45)
T PF02892_consen   15 KKAKCKYCGKVIKYSSGGTSNLKRHLKK   42 (45)
T ss_dssp             S-EEETTTTEE-----SSTHHHHHHHHH
T ss_pred             CeEEeCCCCeEEeeCCCcHHHHHHhhhh
Confidence            34556666665544    2566666543


No 98 
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=69.28  E-value=5.2  Score=33.88  Aligned_cols=11  Identities=36%  Similarity=1.117  Sum_probs=5.4

Q ss_pred             eecCccCcccC
Q psy2838         293 YTCEVCSKSYN  303 (458)
Q Consensus       293 ~~C~~C~~~f~  303 (458)
                      |.|+.|+..|.
T Consensus       100 Y~Cp~C~~~y~  110 (147)
T smart00531      100 YKCPNCQSKYT  110 (147)
T ss_pred             EECcCCCCEee
Confidence            44555554444


No 99 
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=69.20  E-value=3.7  Score=25.60  Aligned_cols=9  Identities=44%  Similarity=1.206  Sum_probs=3.6

Q ss_pred             ecCcccccc
Q psy2838         238 KCDHCVKSF  246 (458)
Q Consensus       238 ~C~~C~~~f  246 (458)
                      .|+.|+..|
T Consensus         4 ~CP~C~~~~   12 (38)
T TIGR02098         4 QCPNCKTSF   12 (38)
T ss_pred             ECCCCCCEE
Confidence            344444433


No 100
>KOG1189|consensus
Probab=68.96  E-value=4.8  Score=42.44  Aligned_cols=20  Identities=20%  Similarity=0.233  Sum_probs=9.9

Q ss_pred             eeeecchhHHHHHHHhcCCC
Q psy2838          44 NYYQLSSDEKFEILKSISAS   63 (458)
Q Consensus        44 ~~~~~~~~~~~~~~~~~~~~   63 (458)
                      -++.||+..-.+-.+.|+.+
T Consensus       434 hQkeLa~qlnee~~~Rls~~  453 (960)
T KOG1189|consen  434 HQKELADQLNEEALRRLSNQ  453 (960)
T ss_pred             HHHHHHHHHHHHHHHHHhhc
Confidence            34445555555555555444


No 101
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=67.77  E-value=2.4  Score=36.13  Aligned_cols=16  Identities=25%  Similarity=0.453  Sum_probs=9.9

Q ss_pred             eecCccccccCCchhh
Q psy2838         237 FKCDHCVKSFNQKPNL  252 (458)
Q Consensus       237 ~~C~~C~~~f~~~~~L  252 (458)
                      ++|+.||++|.+...+
T Consensus        29 ~~c~~c~~~f~~~e~~   44 (154)
T PRK00464         29 RECLACGKRFTTFERV   44 (154)
T ss_pred             eeccccCCcceEeEec
Confidence            6677777776654443


No 102
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=67.48  E-value=1.2  Score=37.91  Aligned_cols=12  Identities=25%  Similarity=0.587  Sum_probs=6.7

Q ss_pred             cccCcCcccCCC
Q psy2838         349 LTCRTCGLFCDS  360 (458)
Q Consensus       349 ~~C~~C~~~f~~  360 (458)
                      ++|+.||++|.+
T Consensus        29 ~~c~~c~~~f~~   40 (154)
T PRK00464         29 RECLACGKRFTT   40 (154)
T ss_pred             eeccccCCcceE
Confidence            446666666554


No 103
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=66.83  E-value=3.7  Score=28.39  Aligned_cols=9  Identities=44%  Similarity=1.198  Sum_probs=4.3

Q ss_pred             ccccCcCcc
Q psy2838         348 RLTCRTCGL  356 (458)
Q Consensus       348 ~~~C~~C~~  356 (458)
                      .|.|+.||+
T Consensus        48 ~Y~CP~CGF   56 (59)
T PRK14890         48 PYTCPKCGF   56 (59)
T ss_pred             ceECCCCCC
Confidence            444555543


No 104
>PF02176 zf-TRAF:  TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=65.57  E-value=5.1  Score=27.82  Aligned_cols=6  Identities=17%  Similarity=0.334  Sum_probs=1.9

Q ss_pred             ceecCc
Q psy2838         264 KYKCEE  269 (458)
Q Consensus       264 ~~~C~~  269 (458)
                      +..|..
T Consensus        38 ~v~C~~   43 (60)
T PF02176_consen   38 PVPCPY   43 (60)
T ss_dssp             EEE-SS
T ss_pred             cEECCC
Confidence            333333


No 105
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=64.59  E-value=3.4  Score=26.93  Aligned_cols=11  Identities=27%  Similarity=0.918  Sum_probs=5.0

Q ss_pred             eecCccCcccC
Q psy2838         293 YTCEVCSKSYN  303 (458)
Q Consensus       293 ~~C~~C~~~f~  303 (458)
                      |.|..|+..|.
T Consensus         3 Y~C~~Cg~~~~   13 (44)
T smart00659        3 YICGECGRENE   13 (44)
T ss_pred             EECCCCCCEee
Confidence            44444444443


No 106
>PF12907 zf-met2:  Zinc-binding
Probab=64.22  E-value=5.2  Score=25.43  Aligned_cols=12  Identities=25%  Similarity=0.415  Sum_probs=5.1

Q ss_pred             HHHHHHHhhccC
Q psy2838         334 HTLRTHTLSVHS  345 (458)
Q Consensus       334 ~~L~~H~~~~h~  345 (458)
                      ..|..|....|.
T Consensus        18 ~~L~eH~enKHp   29 (40)
T PF12907_consen   18 PQLKEHAENKHP   29 (40)
T ss_pred             HHHHHHHHccCC
Confidence            334444444443


No 107
>KOG2807|consensus
Probab=64.18  E-value=5.5  Score=37.65  Aligned_cols=85  Identities=19%  Similarity=0.485  Sum_probs=48.9

Q ss_pred             eeecCccccccCCchhhhhccchhccCcceecCcccccccCHHHHHHHHHHhhcC--ce------------eecCccCcc
Q psy2838         236 RFKCDHCVKSFNQKPNLISHIKSVHEGIKYKCEECTKTFAQKQNLLSHVQAIHMG--IK------------YTCEVCSKS  301 (458)
Q Consensus       236 ~~~C~~C~~~f~~~~~L~~H~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~--~~------------~~C~~C~~~  301 (458)
                      -|.|+.|...              .-..|..|+.|+.+......|.+-.  ||.-  ++            -.|-.|+-.
T Consensus       276 Gy~CP~Ckak--------------vCsLP~eCpiC~ltLVss~hLARSy--hhL~PL~~F~Eip~~~~~~~~~Cf~C~~~  339 (378)
T KOG2807|consen  276 GYFCPQCKAK--------------VCSLPIECPICSLTLVSSPHLARSY--HHLFPLKPFVEIPETEYNGSRFCFACQGE  339 (378)
T ss_pred             ceeCCcccCe--------------eecCCccCCccceeEecchHHHHHH--HhhcCCcchhhccccccCCCcceeeeccc
Confidence            3778888643              2335778888888877777776543  2210  00            125555111


Q ss_pred             cCChhHHHHHHhhhccCccccCccccccCCCHHHHHHHHhhccCCCccccCcCc
Q psy2838         302 YNQVTNLLSHMTTVHQAQKYKCADCGRLFAQEHTLRTHTLSVHSGVRLTCRTCG  355 (458)
Q Consensus       302 f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~  355 (458)
                                   -.....|+|..|...|-.--+...|...|      .|+.|.
T Consensus       340 -------------~~~~~~y~C~~Ck~~FCldCDv~iHesLh------~CpgCe  374 (378)
T KOG2807|consen  340 -------------LLSSGRYRCESCKNVFCLDCDVFIHESLH------NCPGCE  374 (378)
T ss_pred             -------------cCCCCcEEchhccceeeccchHHHHhhhh------cCCCcC
Confidence                         11223488888888887777777774322      366664


No 108
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=64.04  E-value=6.7  Score=33.67  Aligned_cols=16  Identities=6%  Similarity=0.289  Sum_probs=8.5

Q ss_pred             eecCccCcccCChhHH
Q psy2838         293 YTCEVCSKSYNQVTNL  308 (458)
Q Consensus       293 ~~C~~C~~~f~~~~~L  308 (458)
                      |.|+.|+..|+....+
T Consensus       110 Y~Cp~c~~r~tf~eA~  125 (158)
T TIGR00373       110 FICPNMCVRFTFNEAM  125 (158)
T ss_pred             EECCCCCcEeeHHHHH
Confidence            5555555555544444


No 109
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=63.99  E-value=5.2  Score=26.79  Aligned_cols=25  Identities=32%  Similarity=0.688  Sum_probs=15.6

Q ss_pred             cccCcCcccCCCH-----HHHHHHHHHhCC
Q psy2838         349 LTCRTCGLFCDSK-----RALTAHVREAHP  373 (458)
Q Consensus       349 ~~C~~C~~~f~~~-----~~L~~H~~~~h~  373 (458)
                      -.|..|++.+...     +.|.+|++..|+
T Consensus        19 a~C~~C~~~l~~~~~~gTs~L~rHl~~~h~   48 (50)
T smart00614       19 AKCKYCGKKLSRSSKGGTSNLRRHLRRKHP   48 (50)
T ss_pred             EEecCCCCEeeeCCCCCcHHHHHHHHhHCc
Confidence            3477777766544     577777775443


No 110
>PF02176 zf-TRAF:  TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=61.85  E-value=2.9  Score=29.09  Aligned_cols=24  Identities=17%  Similarity=0.205  Sum_probs=10.1

Q ss_pred             HHHHHHhhccCCCccccCc----CcccC
Q psy2838         335 TLRTHTLSVHSGVRLTCRT----CGLFC  358 (458)
Q Consensus       335 ~L~~H~~~~h~~~~~~C~~----C~~~f  358 (458)
                      .|..|+...-......|++    |+..+
T Consensus        25 ~l~~H~~~~C~~~~v~C~~~~~GC~~~~   52 (60)
T PF02176_consen   25 ELDDHLENECPKRPVPCPYSPYGCKERV   52 (60)
T ss_dssp             CHHHHHHTTSTTSEEE-SS----S--EE
T ss_pred             HHHHHHHccCCCCcEECCCCCCCCCCcc
Confidence            4555544344444444555    55544


No 111
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=61.20  E-value=5  Score=34.52  Aligned_cols=23  Identities=26%  Similarity=0.670  Sum_probs=15.8

Q ss_pred             eeecCccCcccCChhHHHHHHhhhccCccccCcccc
Q psy2838         292 KYTCEVCSKSYNQVTNLLSHMTTVHQAQKYKCADCG  327 (458)
Q Consensus       292 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~  327 (458)
                      .|.|++||..+             -++.|-+||+||
T Consensus       134 ~~vC~vCGy~~-------------~ge~P~~CPiCg  156 (166)
T COG1592         134 VWVCPVCGYTH-------------EGEAPEVCPICG  156 (166)
T ss_pred             EEEcCCCCCcc-------------cCCCCCcCCCCC
Confidence            47788887643             346677788887


No 112
>PF15269 zf-C2H2_7:  Zinc-finger
Probab=60.01  E-value=6.9  Score=25.21  Aligned_cols=23  Identities=26%  Similarity=0.644  Sum_probs=18.0

Q ss_pred             cccCcCcccCCCHHHHHHHHHHh
Q psy2838         349 LTCRTCGLFCDSKRALTAHVREA  371 (458)
Q Consensus       349 ~~C~~C~~~f~~~~~L~~H~~~~  371 (458)
                      |+|-.|..++..++.|..||+-.
T Consensus        21 ykcfqcpftc~~kshl~nhmky~   43 (54)
T PF15269_consen   21 YKCFQCPFTCNEKSHLFNHMKYS   43 (54)
T ss_pred             ceeecCCcccchHHHHHHHHHHH
Confidence            55888888888888888888643


No 113
>PF09723 Zn-ribbon_8:  Zinc ribbon domain;  InterPro: IPR013429  This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=59.51  E-value=2.9  Score=26.90  Aligned_cols=11  Identities=36%  Similarity=1.195  Sum_probs=5.6

Q ss_pred             eecCccCcccC
Q psy2838         293 YTCEVCSKSYN  303 (458)
Q Consensus       293 ~~C~~C~~~f~  303 (458)
                      |+|..||..|.
T Consensus         6 y~C~~Cg~~fe   16 (42)
T PF09723_consen    6 YRCEECGHEFE   16 (42)
T ss_pred             EEeCCCCCEEE
Confidence            45555555544


No 114
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=58.92  E-value=8.7  Score=32.96  Aligned_cols=16  Identities=13%  Similarity=0.229  Sum_probs=7.4

Q ss_pred             cccccCccccCChHHH
Q psy2838         209 IKCDFCDKTFAHKPNL  224 (458)
Q Consensus       209 ~~C~~C~~~f~~~~~L  224 (458)
                      |.|+.|+..|....++
T Consensus       110 Y~Cp~c~~r~tf~eA~  125 (158)
T TIGR00373       110 FICPNMCVRFTFNEAM  125 (158)
T ss_pred             EECCCCCcEeeHHHHH
Confidence            4455555444444333


No 115
>KOG1832|consensus
Probab=58.50  E-value=3.5  Score=44.23  Aligned_cols=9  Identities=22%  Similarity=0.523  Sum_probs=4.0

Q ss_pred             ccccCCCHH
Q psy2838         326 CGRLFAQEH  334 (458)
Q Consensus       326 C~~~f~~~~  334 (458)
                      |.-+|.+..
T Consensus      1305 c~VtFNstG 1313 (1516)
T KOG1832|consen 1305 CAVTFNSTG 1313 (1516)
T ss_pred             eEEEeccCc
Confidence            444444443


No 116
>PHA00626 hypothetical protein
Probab=58.45  E-value=5.8  Score=27.01  Aligned_cols=10  Identities=30%  Similarity=0.720  Sum_probs=4.5

Q ss_pred             eecCcccccc
Q psy2838         265 YKCEECTKTF  274 (458)
Q Consensus       265 ~~C~~C~~~f  274 (458)
                      |+|+.|++.|
T Consensus        24 YkCkdCGY~f   33 (59)
T PHA00626         24 YVCCDCGYND   33 (59)
T ss_pred             eEcCCCCCee
Confidence            4444444444


No 117
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=58.36  E-value=7.9  Score=33.95  Aligned_cols=15  Identities=13%  Similarity=0.454  Sum_probs=7.6

Q ss_pred             eecCccCcccCChhH
Q psy2838         293 YTCEVCSKSYNQVTN  307 (458)
Q Consensus       293 ~~C~~C~~~f~~~~~  307 (458)
                      |.|+.|+..|+....
T Consensus       118 Y~Cp~C~~rytf~eA  132 (178)
T PRK06266        118 FFCPNCHIRFTFDEA  132 (178)
T ss_pred             EECCCCCcEEeHHHH
Confidence            555555555544443


No 118
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=58.05  E-value=6.8  Score=25.70  Aligned_cols=9  Identities=33%  Similarity=1.154  Sum_probs=3.8

Q ss_pred             eecCccccc
Q psy2838         237 FKCDHCVKS  245 (458)
Q Consensus       237 ~~C~~C~~~  245 (458)
                      |+|+.|+..
T Consensus         4 y~C~~CG~~   12 (46)
T PRK00398          4 YKCARCGRE   12 (46)
T ss_pred             EECCCCCCE
Confidence            444444443


No 119
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=56.28  E-value=5.5  Score=26.83  Aligned_cols=10  Identities=30%  Similarity=1.135  Sum_probs=4.1

Q ss_pred             eecCcccccc
Q psy2838         237 FKCDHCVKSF  246 (458)
Q Consensus       237 ~~C~~C~~~f  246 (458)
                      |+|..|+..|
T Consensus         6 y~C~~Cg~~f   15 (52)
T TIGR02605         6 YRCTACGHRF   15 (52)
T ss_pred             EEeCCCCCEe
Confidence            3444444433


No 120
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=55.47  E-value=5.8  Score=25.05  Aligned_cols=11  Identities=27%  Similarity=1.229  Sum_probs=5.2

Q ss_pred             eecCccccccC
Q psy2838         237 FKCDHCVKSFN  247 (458)
Q Consensus       237 ~~C~~C~~~f~  247 (458)
                      |+|..|+..|.
T Consensus         6 y~C~~Cg~~fe   16 (41)
T smart00834        6 YRCEDCGHTFE   16 (41)
T ss_pred             EEcCCCCCEEE
Confidence            44555554443


No 121
>PF15269 zf-C2H2_7:  Zinc-finger
Probab=55.31  E-value=8.8  Score=24.73  Aligned_cols=23  Identities=30%  Similarity=0.492  Sum_probs=20.3

Q ss_pred             cccccccccccCCHHHHHHHHHh
Q psy2838         180 KFPCDQCGKSFTLKQNLTYHVNS  202 (458)
Q Consensus       180 ~~~C~~C~~~f~~~~~L~~H~~~  202 (458)
                      .|+|-+|..+..-++.|-.||+.
T Consensus        20 ~ykcfqcpftc~~kshl~nhmky   42 (54)
T PF15269_consen   20 KYKCFQCPFTCNEKSHLFNHMKY   42 (54)
T ss_pred             cceeecCCcccchHHHHHHHHHH
Confidence            48899999999999999999973


No 122
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=55.10  E-value=3.8  Score=38.31  Aligned_cols=74  Identities=16%  Similarity=0.373  Sum_probs=41.5

Q ss_pred             cceecCcccccccCHHHHHHHHHHhhc-------C-------ceeecCccCcccCChhHHHHHHh-hhccCccccCcccc
Q psy2838         263 IKYKCEECTKTFAQKQNLLSHVQAIHM-------G-------IKYTCEVCSKSYNQVTNLLSHMT-TVHQAQKYKCADCG  327 (458)
Q Consensus       263 ~~~~C~~C~~~f~~~~~L~~H~~~~h~-------~-------~~~~C~~C~~~f~~~~~L~~H~~-~h~~~~~~~C~~C~  327 (458)
                      .|..|+.|.........|.+-.  ||.       +       +.-.|-.|.-.|.....   |-. .-+..-.|+|+.|.
T Consensus       321 LPi~CP~Csl~LilsthLarSy--hhL~PLk~f~E~p~~~~~ks~~Cf~CQ~~fp~~~~---~~~~~~~ss~rY~Ce~CK  395 (421)
T COG5151         321 LPISCPICSLQLILSTHLARSY--HHLYPLKPFVEKPEGTNPKSTHCFVCQGPFPKPPV---SPFDESTSSGRYQCELCK  395 (421)
T ss_pred             CCccCcchhHHHHHHHHHHHHH--HhhccCcccccccCCCCCCCccceeccCCCCCCCC---Ccccccccccceechhhh
Confidence            4677777776665555554332  121       0       11347777777764321   110 01123458888888


Q ss_pred             ccCCCHHHHHHHHh
Q psy2838         328 RLFAQEHTLRTHTL  341 (458)
Q Consensus       328 ~~f~~~~~L~~H~~  341 (458)
                      ..|-.--+...|..
T Consensus       396 ~~FC~dCdvfiHe~  409 (421)
T COG5151         396 STFCSDCDVFIHET  409 (421)
T ss_pred             hhhhhhhHHHHHHH
Confidence            88888888777744


No 123
>PRK04023 DNA polymerase II large subunit; Validated
Probab=54.89  E-value=10  Score=41.70  Aligned_cols=24  Identities=42%  Similarity=0.578  Sum_probs=13.0

Q ss_pred             CCCCCCCCCCCCcccccCCCCCCCccceeee
Q psy2838          67 GYGGFLPNTPPLGHHYLSPSGLPGKAEYLEI   97 (458)
Q Consensus        67 ~y~~~~~~~~~~~~~~~~~~~~p~~~~~~~~   97 (458)
                      .|+|||.+-.+|+     |.  ++.+.||.+
T Consensus       408 ~yGdFlENNhpL~-----Ps--~y~~EWW~q  431 (1121)
T PRK04023        408 NYGDFLENNHPLL-----PS--SYCEEWWIQ  431 (1121)
T ss_pred             ccchhhhcCCcCC-----Cc--cccHHHHHH
Confidence            4666665555554     21  255666655


No 124
>KOG1701|consensus
Probab=54.27  E-value=3.1  Score=40.81  Aligned_cols=41  Identities=22%  Similarity=0.522  Sum_probs=19.2

Q ss_pred             ccccCccccCChHHHHHHHHHhcCCceeecCccccccCCch
Q psy2838         210 KCDFCDKTFAHKPNLIAHMNSAHNGVRFKCDHCVKSFNQKP  250 (458)
Q Consensus       210 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~  250 (458)
                      .|..|+|...-...-.+=|..--...-|+|..|.+...-.+
T Consensus       276 iC~~C~K~V~g~~~ac~Am~~~fHv~CFtC~~C~r~L~Gq~  316 (468)
T KOG1701|consen  276 ICAFCHKTVSGQGLAVEAMDQLFHVQCFTCRTCRRQLAGQS  316 (468)
T ss_pred             hhhhcCCcccCcchHHHHhhhhhcccceehHhhhhhhcccc
Confidence            56666665433332222232222223567777766554433


No 125
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=54.16  E-value=12  Score=21.30  Aligned_cols=19  Identities=16%  Similarity=0.412  Sum_probs=13.5

Q ss_pred             ccCcCcccCCCHHHHHHHHH
Q psy2838         350 TCRTCGLFCDSKRALTAHVR  369 (458)
Q Consensus       350 ~C~~C~~~f~~~~~L~~H~~  369 (458)
                      .|++|++.+ ....+..|+.
T Consensus         3 ~CPiC~~~v-~~~~in~HLD   21 (26)
T smart00734        3 QCPVCFREV-PENLINSHLD   21 (26)
T ss_pred             cCCCCcCcc-cHHHHHHHHH
Confidence            478888777 5567777765


No 126
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=54.14  E-value=6.8  Score=26.11  Aligned_cols=10  Identities=30%  Similarity=1.036  Sum_probs=4.1

Q ss_pred             eecCccCccc
Q psy2838         293 YTCEVCSKSY  302 (458)
Q Consensus       293 ~~C~~C~~~f  302 (458)
                      |.|..|++.|
T Consensus         7 Y~C~~Cg~~~   16 (49)
T COG1996           7 YKCARCGREV   16 (49)
T ss_pred             EEhhhcCCee
Confidence            3444444433


No 127
>PRK04023 DNA polymerase II large subunit; Validated
Probab=54.09  E-value=11  Score=41.40  Aligned_cols=11  Identities=36%  Similarity=0.918  Sum_probs=6.8

Q ss_pred             Ccccccccccc
Q psy2838         179 KKFPCDQCGKS  189 (458)
Q Consensus       179 k~~~C~~C~~~  189 (458)
                      ....|+.||..
T Consensus       625 g~RfCpsCG~~  635 (1121)
T PRK04023        625 GRRKCPSCGKE  635 (1121)
T ss_pred             cCccCCCCCCc
Confidence            34567777765


No 128
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=53.83  E-value=16  Score=30.94  Aligned_cols=12  Identities=25%  Similarity=0.789  Sum_probs=6.6

Q ss_pred             cccccCccccCC
Q psy2838         209 IKCDFCDKTFAH  220 (458)
Q Consensus       209 ~~C~~C~~~f~~  220 (458)
                      |.|+.|+..|..
T Consensus       100 Y~Cp~C~~~y~~  111 (147)
T smart00531      100 YKCPNCQSKYTF  111 (147)
T ss_pred             EECcCCCCEeeH
Confidence            555555555543


No 129
>KOG1280|consensus
Probab=52.90  E-value=11  Score=36.12  Aligned_cols=26  Identities=19%  Similarity=0.368  Sum_probs=12.8

Q ss_pred             eecCccCcccCChhHHHHHHhhhccC
Q psy2838         293 YTCEVCSKSYNQVTNLLSHMTTVHQA  318 (458)
Q Consensus       293 ~~C~~C~~~f~~~~~L~~H~~~h~~~  318 (458)
                      |.|++|++.=.+...|..|+...|.+
T Consensus        80 ftCPyC~~~Gfte~~f~~Hv~s~Hpd  105 (381)
T KOG1280|consen   80 FTCPYCGIMGFTERQFGTHVLSQHPE  105 (381)
T ss_pred             ccCCcccccccchhHHHHHhhhcCcc
Confidence            55555555544445555555444433


No 130
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=52.12  E-value=12  Score=32.78  Aligned_cols=13  Identities=23%  Similarity=0.590  Sum_probs=5.7

Q ss_pred             cccccCccccCCh
Q psy2838         209 IKCDFCDKTFAHK  221 (458)
Q Consensus       209 ~~C~~C~~~f~~~  221 (458)
                      |.|+.|+..|...
T Consensus       118 Y~Cp~C~~rytf~  130 (178)
T PRK06266        118 FFCPNCHIRFTFD  130 (178)
T ss_pred             EECCCCCcEEeHH
Confidence            4444444444333


No 131
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=51.67  E-value=4.8  Score=30.23  Aligned_cols=10  Identities=50%  Similarity=1.368  Sum_probs=4.8

Q ss_pred             ccCccccccC
Q psy2838         321 YKCADCGRLF  330 (458)
Q Consensus       321 ~~C~~C~~~f  330 (458)
                      ++|..|+..|
T Consensus        54 W~C~kCg~~f   63 (89)
T COG1997          54 WKCRKCGAKF   63 (89)
T ss_pred             EEcCCCCCee
Confidence            4455554444


No 132
>KOG3408|consensus
Probab=51.34  E-value=8.5  Score=30.82  Aligned_cols=26  Identities=19%  Similarity=0.434  Sum_probs=21.8

Q ss_pred             CccccCcCcccCCCHHHHHHHHHHhC
Q psy2838         347 VRLTCRTCGLFCDSKRALTAHVREAH  372 (458)
Q Consensus       347 ~~~~C~~C~~~f~~~~~L~~H~~~~h  372 (458)
                      ..|+|-.|.+-|.+...|..|.+..-
T Consensus        56 GqfyCi~CaRyFi~~~~l~~H~ktK~   81 (129)
T KOG3408|consen   56 GQFYCIECARYFIDAKALKTHFKTKV   81 (129)
T ss_pred             ceeehhhhhhhhcchHHHHHHHhccH
Confidence            35889999999999999999988653


No 133
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=51.21  E-value=2.9  Score=30.46  Aligned_cols=42  Identities=12%  Similarity=0.383  Sum_probs=20.2

Q ss_pred             cccccCccccCChHHHHHHHHHhcCCceeecC--ccccccCCchhh
Q psy2838         209 IKCDFCDKTFAHKPNLIAHMNSAHNGVRFKCD--HCVKSFNQKPNL  252 (458)
Q Consensus       209 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~--~C~~~f~~~~~L  252 (458)
                      +.|+.|+.......+-..+..  ..+..+.|.  .|+.+|.....+
T Consensus         2 m~CP~Cg~~a~irtSr~~s~~--~~~~Y~qC~N~eCg~tF~t~es~   45 (72)
T PRK09678          2 FHCPLCQHAAHARTSRYITDT--TKERYHQCQNVNCSATFITYESV   45 (72)
T ss_pred             ccCCCCCCccEEEEChhcChh--hheeeeecCCCCCCCEEEEEEEE
Confidence            356666655432222222211  334556676  677777654443


No 134
>PF03604 DNA_RNApol_7kD:  DNA directed RNA polymerase, 7 kDa subunit;  InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=51.06  E-value=11  Score=22.70  Aligned_cols=7  Identities=43%  Similarity=1.198  Sum_probs=2.9

Q ss_pred             cccccCc
Q psy2838         209 IKCDFCD  215 (458)
Q Consensus       209 ~~C~~C~  215 (458)
                      .+|+.||
T Consensus        18 irC~~CG   24 (32)
T PF03604_consen   18 IRCPECG   24 (32)
T ss_dssp             SSBSSSS
T ss_pred             EECCcCC
Confidence            3444444


No 135
>KOG1701|consensus
Probab=50.72  E-value=1.8  Score=42.42  Aligned_cols=12  Identities=25%  Similarity=0.874  Sum_probs=6.3

Q ss_pred             cccccccccccC
Q psy2838         180 KFPCDQCGKSFT  191 (458)
Q Consensus       180 ~~~C~~C~~~f~  191 (458)
                      -|+|..|++...
T Consensus       302 CFtC~~C~r~L~  313 (468)
T KOG1701|consen  302 CFTCRTCRRQLA  313 (468)
T ss_pred             ceehHhhhhhhc
Confidence            455655655443


No 136
>PF05443 ROS_MUCR:  ROS/MUCR transcriptional regulator protein;  InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=47.67  E-value=10  Score=31.29  Aligned_cols=25  Identities=28%  Similarity=0.509  Sum_probs=15.9

Q ss_pred             ccccCcCcccCCCHHHHHHHHHHhCCCC
Q psy2838         348 RLTCRTCGLFCDSKRALTAHVREAHPGC  375 (458)
Q Consensus       348 ~~~C~~C~~~f~~~~~L~~H~~~~h~~~  375 (458)
                      ...|-.||+.|..   |.+|++.||+-.
T Consensus        72 ~i~clecGk~~k~---LkrHL~~~~glt   96 (132)
T PF05443_consen   72 YIICLECGKKFKT---LKRHLRTHHGLT   96 (132)
T ss_dssp             -EE-TBT--EESB---HHHHHHHTT-S-
T ss_pred             eeEEccCCcccch---HHHHHHHccCCC
Confidence            3569999999976   589999998753


No 137
>KOG2593|consensus
Probab=47.56  E-value=16  Score=36.09  Aligned_cols=33  Identities=27%  Similarity=0.786  Sum_probs=17.3

Q ss_pred             eecCccCcccCChhHHHHHHhhhccCccccCccccc
Q psy2838         293 YTCEVCSKSYNQVTNLLSHMTTVHQAQKYKCADCGR  328 (458)
Q Consensus       293 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~  328 (458)
                      |.|+.|.+.|+....++.   .-...-.|.|..|+-
T Consensus       129 Y~Cp~C~kkyt~Lea~~L---~~~~~~~F~C~~C~g  161 (436)
T KOG2593|consen  129 YVCPNCQKKYTSLEALQL---LDNETGEFHCENCGG  161 (436)
T ss_pred             ccCCccccchhhhHHHHh---hcccCceEEEecCCC
Confidence            667777666665544332   111223466666653


No 138
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=47.43  E-value=7.6  Score=28.33  Aligned_cols=39  Identities=18%  Similarity=0.429  Sum_probs=18.3

Q ss_pred             eecCccccccCCchhhhhccchhccCcceecC--cccccccCH
Q psy2838         237 FKCDHCVKSFNQKPNLISHIKSVHEGIKYKCE--ECTKTFAQK  277 (458)
Q Consensus       237 ~~C~~C~~~f~~~~~L~~H~~~~~~~~~~~C~--~C~~~f~~~  277 (458)
                      +.|+.|+.......+-..+..  ..+..+.|.  .||.+|...
T Consensus         2 m~CP~Cg~~a~irtSr~~s~~--~~~~Y~qC~N~eCg~tF~t~   42 (72)
T PRK09678          2 FHCPLCQHAAHARTSRYITDT--TKERYHQCQNVNCSATFITY   42 (72)
T ss_pred             ccCCCCCCccEEEEChhcChh--hheeeeecCCCCCCCEEEEE
Confidence            456677655433222222211  233455665  566666543


No 139
>PF09845 DUF2072:  Zn-ribbon containing protein (DUF2072);  InterPro: IPR018645  This archaeal Zinc-ribbon containing proteins have no known function. 
Probab=46.89  E-value=31  Score=28.33  Aligned_cols=12  Identities=25%  Similarity=0.797  Sum_probs=6.3

Q ss_pred             ccCcCcccCCCH
Q psy2838         350 TCRTCGLFCDSK  361 (458)
Q Consensus       350 ~C~~C~~~f~~~  361 (458)
                      +|..||+.|...
T Consensus         3 ~Ct~Cg~~f~dg   14 (131)
T PF09845_consen    3 QCTKCGRVFEDG   14 (131)
T ss_pred             ccCcCCCCcCCC
Confidence            455555555543


No 140
>PF04959 ARS2:  Arsenite-resistance protein 2;  InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=46.86  E-value=10  Score=34.13  Aligned_cols=28  Identities=21%  Similarity=0.535  Sum_probs=16.0

Q ss_pred             eeecCccCcccCChhHHHHHHhhhccCc
Q psy2838         292 KYTCEVCSKSYNQVTNLLSHMTTVHQAQ  319 (458)
Q Consensus       292 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~  319 (458)
                      +|.|..|+|.|.-..-.++|+...|.++
T Consensus        77 K~~C~lc~KlFkg~eFV~KHI~nKH~e~  104 (214)
T PF04959_consen   77 KWRCPLCGKLFKGPEFVRKHIFNKHPEK  104 (214)
T ss_dssp             EEEE-SSS-EESSHHHHHHHHHHH-HHH
T ss_pred             EECCCCCCcccCChHHHHHHHhhcCHHH
Confidence            4666666666666666666666666553


No 141
>KOG2807|consensus
Probab=46.10  E-value=25  Score=33.48  Aligned_cols=32  Identities=16%  Similarity=0.150  Sum_probs=25.0

Q ss_pred             CCccccCcCcccCCCHHHHHHHHHHhCCCCCcccCCCC
Q psy2838         346 GVRLTCRTCGLFCDSKRALTAHVREAHPGCEYKKRGRG  383 (458)
Q Consensus       346 ~~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~~~~~~  383 (458)
                      ...|.|..|...|----+...|-..|      .|.+|.
T Consensus       343 ~~~y~C~~Ck~~FCldCDv~iHesLh------~CpgCe  374 (378)
T KOG2807|consen  343 SGRYRCESCKNVFCLDCDVFIHESLH------NCPGCE  374 (378)
T ss_pred             CCcEEchhccceeeccchHHHHhhhh------cCCCcC
Confidence            34688999999999888888887776      355554


No 142
>KOG1044|consensus
Probab=45.67  E-value=3.9  Score=41.66  Aligned_cols=92  Identities=18%  Similarity=0.444  Sum_probs=45.6

Q ss_pred             ceecCcccccccCHHHHHHHHHHhhcCceeecCccCcccCChhHHHHH-----HhhhccCccccCccccccCCCHHHHHH
Q psy2838         264 KYKCEECTKTFAQKQNLLSHVQAIHMGIKYTCEVCSKSYNQVTNLLSH-----MTTVHQAQKYKCADCGRLFAQEHTLRT  338 (458)
Q Consensus       264 ~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f~~~~~L~~H-----~~~h~~~~~~~C~~C~~~f~~~~~L~~  338 (458)
                      +-.|.-|+.....-..|..--+.-|. .-|+|..|+....... +..|     .+--+..--.+|..|.+.+.-+. |. 
T Consensus       133 ps~cagc~~~lk~gq~llald~qwhv-~cfkc~~c~~vL~gey-~skdg~pyce~dy~~~fgvkc~~c~~fisgkv-Lq-  208 (670)
T KOG1044|consen  133 PSTCAGCGEELKNGQALLALDKQWHV-SCFKCKSCSAVLNGEY-MSKDGVPYCEKDYQAKFGVKCEECEKFISGKV-LQ-  208 (670)
T ss_pred             Cccccchhhhhhccceeeeeccceee-eeeehhhhccccccee-eccCCCcchhhhhhhhcCeehHHhhhhhhhhh-hh-
Confidence            45677777666555444322111111 1278888877654321 1100     00011112246888876555442 22 


Q ss_pred             HHhhccCCCcc-----ccCcCcccCCCHHHH
Q psy2838         339 HTLSVHSGVRL-----TCRTCGLFCDSKRAL  364 (458)
Q Consensus       339 H~~~~h~~~~~-----~C~~C~~~f~~~~~L  364 (458)
                           -+++.|     .|..|+..|..-..|
T Consensus       209 -----ag~kh~HPtCARCsRCgqmF~eGEEM  234 (670)
T KOG1044|consen  209 -----AGDKHFHPTCARCSRCGQMFGEGEEM  234 (670)
T ss_pred             -----ccCcccCcchhhhhhhccccccchhe
Confidence                 222223     488899999876554


No 143
>PF14353 CpXC:  CpXC protein
Probab=45.59  E-value=5.7  Score=32.65  Aligned_cols=21  Identities=29%  Similarity=0.572  Sum_probs=10.9

Q ss_pred             eecCccccccCCchhhhhccc
Q psy2838         237 FKCDHCVKSFNQKPNLISHIK  257 (458)
Q Consensus       237 ~~C~~C~~~f~~~~~L~~H~~  257 (458)
                      |.|+.||..|.-...+.-|-.
T Consensus        39 ~~CP~Cg~~~~~~~p~lY~D~   59 (128)
T PF14353_consen   39 FTCPSCGHKFRLEYPLLYHDP   59 (128)
T ss_pred             EECCCCCCceecCCCEEEEcC
Confidence            555555555554444444433


No 144
>TIGR01384 TFS_arch transcription factor S, archaeal. There has been an apparent duplication event in the Halobacteriaceae lineage (Haloarcula, Haloferax, Haloquadratum, Halobacterium and Natromonas). There appears to be a separate duplication in Methanosphaera stadtmanae.
Probab=44.63  E-value=4.4  Score=31.96  Aligned_cols=12  Identities=17%  Similarity=0.443  Sum_probs=6.2

Q ss_pred             eeecCccccccC
Q psy2838         236 RFKCDHCVKSFN  247 (458)
Q Consensus       236 ~~~C~~C~~~f~  247 (458)
                      .+.|+.|++.+.
T Consensus        16 ~~~C~~C~~~~~   27 (104)
T TIGR01384        16 VYVCPSCGYEKE   27 (104)
T ss_pred             eEECcCCCCccc
Confidence            355555555443


No 145
>KOG0943|consensus
Probab=44.03  E-value=17  Score=40.67  Aligned_cols=9  Identities=11%  Similarity=-0.090  Sum_probs=3.7

Q ss_pred             cccchhhhh
Q psy2838         149 LEFNRSQFE  157 (458)
Q Consensus       149 ~~~~c~~c~  157 (458)
                      ..|.|..|+
T Consensus      1256 DIfECkTCG 1264 (3015)
T KOG0943|consen 1256 DIFECKTCG 1264 (3015)
T ss_pred             hhhhhcccc
Confidence            334444433


No 146
>PF05443 ROS_MUCR:  ROS/MUCR transcriptional regulator protein;  InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=43.96  E-value=14  Score=30.47  Aligned_cols=23  Identities=30%  Similarity=0.598  Sum_probs=10.2

Q ss_pred             eecCccCcccCChhHHHHHHhhhccC
Q psy2838         293 YTCEVCSKSYNQVTNLLSHMTTVHQA  318 (458)
Q Consensus       293 ~~C~~C~~~f~~~~~L~~H~~~h~~~  318 (458)
                      ..|-.||+.|..   |++|++.|||-
T Consensus        73 i~clecGk~~k~---LkrHL~~~~gl   95 (132)
T PF05443_consen   73 IICLECGKKFKT---LKRHLRTHHGL   95 (132)
T ss_dssp             EE-TBT--EESB---HHHHHHHTT-S
T ss_pred             eEEccCCcccch---HHHHHHHccCC
Confidence            555556655554   35555555543


No 147
>KOG1280|consensus
Probab=43.58  E-value=18  Score=34.68  Aligned_cols=36  Identities=28%  Similarity=0.548  Sum_probs=23.0

Q ss_pred             cccccccccccCCHHHHHHHHHhhcCCCc--cccccCc
Q psy2838         180 KFPCDQCGKSFTLKQNLTYHVNSIHLGVR--IKCDFCD  215 (458)
Q Consensus       180 ~~~C~~C~~~f~~~~~L~~H~~~~h~~~~--~~C~~C~  215 (458)
                      -|.|++|++.=.+...|..|+...|.+..  ..|++|.
T Consensus        79 SftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c~  116 (381)
T KOG1280|consen   79 SFTCPYCGIMGFTERQFGTHVLSQHPEASTSVICPLCA  116 (381)
T ss_pred             cccCCcccccccchhHHHHHhhhcCcccCcceeeeccc
Confidence            56777777666666777777766665544  4555554


No 148
>KOG1834|consensus
Probab=43.13  E-value=15  Score=38.20  Aligned_cols=8  Identities=38%  Similarity=0.891  Sum_probs=3.9

Q ss_pred             EeeeeeCC
Q psy2838         103 YTWVKTEP  110 (458)
Q Consensus       103 ~~~~~~e~  110 (458)
                      ..|||+.|
T Consensus       371 SfwMkHg~  378 (952)
T KOG1834|consen  371 SFWMKHGP  378 (952)
T ss_pred             EEeeecCC
Confidence            34555554


No 149
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=42.73  E-value=15  Score=31.57  Aligned_cols=23  Identities=17%  Similarity=0.405  Sum_probs=14.1

Q ss_pred             eeecCccccccCCchhhhhccchhccCcceecCccc
Q psy2838         236 RFKCDHCVKSFNQKPNLISHIKSVHEGIKYKCEECT  271 (458)
Q Consensus       236 ~~~C~~C~~~f~~~~~L~~H~~~~~~~~~~~C~~C~  271 (458)
                      .|.|++||..+             -++-|-+|++|+
T Consensus       134 ~~vC~vCGy~~-------------~ge~P~~CPiCg  156 (166)
T COG1592         134 VWVCPVCGYTH-------------EGEAPEVCPICG  156 (166)
T ss_pred             EEEcCCCCCcc-------------cCCCCCcCCCCC
Confidence            57777776543             234566777776


No 150
>KOG4377|consensus
Probab=42.71  E-value=14  Score=36.08  Aligned_cols=21  Identities=19%  Similarity=0.246  Sum_probs=18.1

Q ss_pred             cCcccCCCHHHHHHHHHHhCC
Q psy2838         353 TCGLFCDSKRALTAHVREAHP  373 (458)
Q Consensus       353 ~C~~~f~~~~~L~~H~~~~h~  373 (458)
                      -|+..+.+.+.+..|.|+|-.
T Consensus       408 Gc~~tl~s~sqm~shkrkheR  428 (480)
T KOG4377|consen  408 GCEATLYSVSQMASHKRKHER  428 (480)
T ss_pred             CCceEEEehhhhhhhhhhhhh
Confidence            489999999999999988853


No 151
>KOG1189|consensus
Probab=42.40  E-value=18  Score=38.33  Aligned_cols=10  Identities=20%  Similarity=-0.187  Sum_probs=5.0

Q ss_pred             eeeecchhHH
Q psy2838          44 NYYQLSSDEK   53 (458)
Q Consensus        44 ~~~~~~~~~~   53 (458)
                      .+.||++.|-
T Consensus       381 ~~~dv~~~f~  390 (960)
T KOG1189|consen  381 AVKDVSYFFK  390 (960)
T ss_pred             hhcccceeec
Confidence            4455555554


No 152
>PF03344 Daxx:  Daxx Family;  InterPro: IPR005012  Daxx is a ubiquitously expressed protein that functions, in part, as a transcriptional co-repressor through its interaction with a growing number of nuclear, DNA-associated proteins. Human Daxx contains four structural domains commonly found in transcriptional regulatory proteins: two predicted paired amphipathic helices, an acid-rich domain and a Ser/Pro/Thr (SPT)-rich domain. The post-translational modification status of the SPT-domain of hDaxx regulates its association with transcription factors such as Pax3 and ETS-1, effectively bringing hDaxx to sites of active transcription. Through its presence at the site of active transcription, hDaxx could then be able to associate with acetylated histones present in the nucleosomes and Dek that is associated with chromatin. Through its association with the SPT-domain of hDaxx, histone deacetylases may also be brought to the site of active transcription. As a consequence, nucleosomes in the vicinity of the site of active transcription will have the histone tails deacetylated, allowing the deactylated tail to bind to DNA, thereby leading to an inactive chromatin structure and transcriptional repression [].  The Daxx protein (also known as the Fas-binding protein) is thought to play a role in apoptosis as a component of nuclear promyelocytic leukemia protein (PML) oncogenic domains (PODS). Daxx associates with PODs through a direct interaction with PML, a critical component of PODs. The interaction is a dynamic, cell cycle regulated event and is dependent on the post-translational modification of PML by the small ubiquitin-related modifier SUMO-1. ; PDB: 2KZS_A 2KZU_A.
Probab=41.91  E-value=8.5  Score=41.34  Aligned_cols=9  Identities=11%  Similarity=-0.034  Sum_probs=3.6

Q ss_pred             CcCCCCCCC
Q psy2838          63 SFTPGYGGF   71 (458)
Q Consensus        63 ~~~~~y~~~   71 (458)
                      .+.....++
T Consensus        81 ~~~~d~~ev   89 (713)
T PF03344_consen   81 AQTSDMPEV   89 (713)
T ss_dssp             HHTTT-TTH
T ss_pred             ccccCCHHH
Confidence            344444444


No 153
>COG5188 PRP9 Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=41.54  E-value=16  Score=35.04  Aligned_cols=47  Identities=15%  Similarity=0.528  Sum_probs=0.0

Q ss_pred             eecCccCcccCChhHHHHHHhhhccCcc----------------------------------------------------
Q psy2838         293 YTCEVCSKSYNQVTNLLSHMTTVHQAQK----------------------------------------------------  320 (458)
Q Consensus       293 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~----------------------------------------------------  320 (458)
                      +-|..|++.|....-+..|.......+.                                                    
T Consensus       239 ~YC~~C~r~f~~~~VFe~Hl~gK~H~k~~~~~~~~v~~Ey~l~r~~kyl~d~~s~trs~V~r~la~ta~ER~aei~~l~r  318 (470)
T COG5188         239 VYCVKCGREFSRSKVFEYHLEGKRHCKEGQGKEEFVYSEYVLHRYLKYLGDPVSETRSLVLRSLAITAKERKAEISLLSR  318 (470)
T ss_pred             eeeHhhhhHhhhhHHHHHHHhhhhhhhhhhhhhHHHHHHHHHHHHHHHhCChhHHHHHHHHHHHHHHHHHHHHHhHHHHH


Q ss_pred             --------------------------------------------------------ccCcccc-ccCCCHHHHHHH
Q psy2838         321 --------------------------------------------------------YKCADCG-RLFAQEHTLRTH  339 (458)
Q Consensus       321 --------------------------------------------------------~~C~~C~-~~f~~~~~L~~H  339 (458)
                                                                              |.|.+|| +++.-+..+.+|
T Consensus       319 ~~~~~at~S~e~EGaeq~d~eQ~DE~~~~k~fdmPLG~DG~PmP~WL~klhgLd~ef~CEICgNyvy~GR~~FdrH  394 (470)
T COG5188         319 RKKQPATKSSEKEGAEQVDGEQRDEHVSGKSFDMPLGPDGLPMPRWLCKLHGLDIEFECEICGNYVYYGRDRFDRH  394 (470)
T ss_pred             HhhccCCCchhhcccccccccccchhhccCcccCCCCCCCCCCchHHHHhcCCCcceeeeecccccccchHHHHhh


No 154
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=41.10  E-value=15  Score=30.12  Aligned_cols=23  Identities=26%  Similarity=0.267  Sum_probs=14.5

Q ss_pred             cccccCccccCChHHHHHHHHHhcCC
Q psy2838         209 IKCDFCDKTFAHKPNLIAHMNSAHNG  234 (458)
Q Consensus       209 ~~C~~C~~~f~~~~~L~~H~~~h~~~  234 (458)
                      ..|-++|+.|.   +|++|+.+|++-
T Consensus        77 IicLEDGkkfK---SLKRHL~t~~gm   99 (148)
T COG4957          77 IICLEDGKKFK---SLKRHLTTHYGL   99 (148)
T ss_pred             EEEeccCcchH---HHHHHHhcccCC
Confidence            35666666664   567777776653


No 155
>KOG4167|consensus
Probab=40.07  E-value=7.1  Score=41.01  Aligned_cols=25  Identities=28%  Similarity=0.487  Sum_probs=21.4

Q ss_pred             cccccCccccCChHHHHHHHHHhcC
Q psy2838         209 IKCDFCDKTFAHKPNLIAHMNSAHN  233 (458)
Q Consensus       209 ~~C~~C~~~f~~~~~L~~H~~~h~~  233 (458)
                      |.|.+|++.|....++..||++|.-
T Consensus       793 FpCreC~kvF~KiKSrNAHMK~Hr~  817 (907)
T KOG4167|consen  793 FPCRECGKVFFKIKSRNAHMKTHRQ  817 (907)
T ss_pred             eehHHHHHHHHHHhhhhHHHHHHHH
Confidence            8899999999888888899888863


No 156
>PF12760 Zn_Tnp_IS1595:  Transposase zinc-ribbon domain;  InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=39.88  E-value=47  Score=21.65  Aligned_cols=9  Identities=33%  Similarity=1.209  Sum_probs=4.3

Q ss_pred             cceecCccc
Q psy2838         263 IKYKCEECT  271 (458)
Q Consensus       263 ~~~~C~~C~  271 (458)
                      ..|+|..|+
T Consensus        36 ~~~~C~~C~   44 (46)
T PF12760_consen   36 GRYRCKACR   44 (46)
T ss_pred             CeEECCCCC
Confidence            345555544


No 157
>KOG3214|consensus
Probab=39.64  E-value=11  Score=28.97  Aligned_cols=40  Identities=20%  Similarity=0.344  Sum_probs=21.1

Q ss_pred             ccccCccccccCCCHHHHHHHHhhccCCCccccCcCcccCCCHH
Q psy2838         319 QKYKCADCGRLFAQEHTLRTHTLSVHSGVRLTCRTCGLFCDSKR  362 (458)
Q Consensus       319 ~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f~~~~  362 (458)
                      ..|.|+.|+---.-...|    .+.|......|.+|+..|.+.-
T Consensus        22 t~FnClfcnHek~v~~~~----Dk~~~iG~~sC~iC~esFqt~i   61 (109)
T KOG3214|consen   22 TQFNCLFCNHEKSVSCTL----DKKHNIGKASCRICEESFQTTI   61 (109)
T ss_pred             eeeccCccccccceeeee----hhhcCcceeeeeehhhhhccch
Confidence            347777776322111111    1223333456999999887653


No 158
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=38.75  E-value=28  Score=39.55  Aligned_cols=25  Identities=32%  Similarity=0.505  Sum_probs=13.8

Q ss_pred             CCCCCCCCCCCCcccccCCCCCCCccceeeee
Q psy2838          67 GYGGFLPNTPPLGHHYLSPSGLPGKAEYLEIH   98 (458)
Q Consensus        67 ~y~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~   98 (458)
                      .|++|+.+-.+|+     |.  .+.+.||.+.
T Consensus       428 ~yGdFlENNh~L~-----P~--~y~~EWw~~e  452 (1337)
T PRK14714        428 NYGEFVENNHPLL-----PA--SYCEEWWIQE  452 (1337)
T ss_pred             ccchhhhcCCcCC-----Cc--cchHHHHHHH
Confidence            4666666555554     22  1566777663


No 159
>KOG4377|consensus
Probab=37.53  E-value=19  Score=35.23  Aligned_cols=24  Identities=13%  Similarity=0.402  Sum_probs=17.7

Q ss_pred             ccC--ccccccCCCHHHHHHHHhhcc
Q psy2838         321 YKC--ADCGRLFAQEHTLRTHTLSVH  344 (458)
Q Consensus       321 ~~C--~~C~~~f~~~~~L~~H~~~~h  344 (458)
                      |.|  .-|+..+.+.+.+..|.++|-
T Consensus       402 fhc~r~Gc~~tl~s~sqm~shkrkhe  427 (480)
T KOG4377|consen  402 FHCDRLGCEATLYSVSQMASHKRKHE  427 (480)
T ss_pred             eeecccCCceEEEehhhhhhhhhhhh
Confidence            556  358888888888888866554


No 160
>PF08274 PhnA_Zn_Ribbon:  PhnA Zinc-Ribbon ;  InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=37.37  E-value=14  Score=21.92  Aligned_cols=8  Identities=50%  Similarity=1.265  Sum_probs=3.7

Q ss_pred             ccccCcCc
Q psy2838         348 RLTCRTCG  355 (458)
Q Consensus       348 ~~~C~~C~  355 (458)
                      .+.|+.|+
T Consensus        19 ~~vCp~C~   26 (30)
T PF08274_consen   19 LLVCPECG   26 (30)
T ss_dssp             SEEETTTT
T ss_pred             EEeCCccc
Confidence            34444444


No 161
>KOG4167|consensus
Probab=37.30  E-value=9.6  Score=40.10  Aligned_cols=27  Identities=30%  Similarity=0.623  Sum_probs=23.8

Q ss_pred             CcccccccccccCCHHHHHHHHHhhcC
Q psy2838         179 KKFPCDQCGKSFTLKQNLTYHVNSIHL  205 (458)
Q Consensus       179 k~~~C~~C~~~f~~~~~L~~H~~~~h~  205 (458)
                      .-|.|..|++.|.....+..||++|..
T Consensus       791 giFpCreC~kvF~KiKSrNAHMK~Hr~  817 (907)
T KOG4167|consen  791 GIFPCRECGKVFFKIKSRNAHMKTHRQ  817 (907)
T ss_pred             ceeehHHHHHHHHHHhhhhHHHHHHHH
Confidence            459999999999999999999997653


No 162
>KOG1924|consensus
Probab=36.80  E-value=33  Score=36.72  Aligned_cols=6  Identities=50%  Similarity=1.082  Sum_probs=2.9

Q ss_pred             eeeeeC
Q psy2838         104 TWVKTE  109 (458)
Q Consensus       104 ~~~~~e  109 (458)
                      =|++..
T Consensus       644 FWvkv~  649 (1102)
T KOG1924|consen  644 FWVKVN  649 (1102)
T ss_pred             eeeecc
Confidence            355544


No 163
>PF05285 SDA1:  SDA1;  InterPro: IPR007949 This domain consists of several SDA1 protein homologues. SDA1 is a Saccharomyces cerevisiae protein which is involved in the control of the actin cytoskeleton. The protein is essential for cell viability and is localised in the nucleus [].
Probab=36.68  E-value=49  Score=32.09  Aligned_cols=13  Identities=38%  Similarity=0.603  Sum_probs=7.3

Q ss_pred             HHHHHHHHHhCCC
Q psy2838         362 RALTAHVREAHPG  374 (458)
Q Consensus       362 ~~L~~H~~~~h~~  374 (458)
                      ..|..-.|...|.
T Consensus        25 rsli~l~Rev~P~   37 (324)
T PF05285_consen   25 RSLINLFREVNPE   37 (324)
T ss_pred             HHHHHHHHHHCHH
Confidence            3455556666554


No 164
>KOG0262|consensus
Probab=36.09  E-value=31  Score=38.88  Aligned_cols=22  Identities=23%  Similarity=0.167  Sum_probs=13.6

Q ss_pred             CCCCCCCCCCCCCCCCCCCCCC
Q psy2838          16 TSRRDSFPPPHPLGQNGFDPTG   37 (458)
Q Consensus        16 ~~~~~~~~~~~~~~~~~~~~~~   37 (458)
                      +.+|--.+||+-+.|.-.+-|-
T Consensus       676 ~~~rI~~lpPtIlkP~~LWTGK  697 (1640)
T KOG0262|consen  676 TFKRIKLLPPTILKPAPLWTGK  697 (1640)
T ss_pred             hccceeccCccccccccccccc
Confidence            5566677777777775444433


No 165
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=35.96  E-value=19  Score=28.27  Aligned_cols=11  Identities=9%  Similarity=-0.000  Sum_probs=6.4

Q ss_pred             ccccCcCcccC
Q psy2838         348 RLTCRTCGLFC  358 (458)
Q Consensus       348 ~~~C~~C~~~f  358 (458)
                      +..|++||++|
T Consensus        26 PiVsPytG~s~   36 (129)
T COG4530          26 PIVSPYTGKSY   36 (129)
T ss_pred             ccccCcccccc
Confidence            45566666665


No 166
>PF14283 DUF4366:  Domain of unknown function (DUF4366)
Probab=35.85  E-value=11  Score=34.19  Aligned_cols=6  Identities=33%  Similarity=0.512  Sum_probs=2.3

Q ss_pred             HHHHHH
Q psy2838         278 QNLLSH  283 (458)
Q Consensus       278 ~~L~~H  283 (458)
                      ..|..-
T Consensus       108 ~DLl~l  113 (218)
T PF14283_consen  108 ADLLAL  113 (218)
T ss_pred             HHHHHH
Confidence            344433


No 167
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=35.80  E-value=22  Score=38.36  Aligned_cols=14  Identities=36%  Similarity=0.714  Sum_probs=8.9

Q ss_pred             ccCcceecCccccc
Q psy2838         260 HEGIKYKCEECTKT  273 (458)
Q Consensus       260 ~~~~~~~C~~C~~~  273 (458)
                      +...|..|+.||..
T Consensus       471 ~~~~p~~Cp~Cgs~  484 (730)
T COG1198         471 QEPIPQSCPECGSE  484 (730)
T ss_pred             CCCCCCCCCCCCCC
Confidence            34467777777743


No 168
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=35.02  E-value=10  Score=40.82  Aligned_cols=46  Identities=22%  Similarity=0.574  Sum_probs=26.0

Q ss_pred             ccccccccccCCH---HHHHHHHHhhcCCCccccccCccccCChHHHHHHHHHhcCCceeecCccccc
Q psy2838         181 FPCDQCGKSFTLK---QNLTYHVNSIHLGVRIKCDFCDKTFAHKPNLIAHMNSAHNGVRFKCDHCVKS  245 (458)
Q Consensus       181 ~~C~~C~~~f~~~---~~L~~H~~~~h~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~  245 (458)
                      ..|..||..+...   ..|..|+.    .....|.+||..               ...|..|+.|+..
T Consensus       436 l~C~~Cg~v~~Cp~Cd~~lt~H~~----~~~L~CH~Cg~~---------------~~~p~~Cp~Cgs~  484 (730)
T COG1198         436 LLCRDCGYIAECPNCDSPLTLHKA----TGQLRCHYCGYQ---------------EPIPQSCPECGSE  484 (730)
T ss_pred             eecccCCCcccCCCCCcceEEecC----CCeeEeCCCCCC---------------CCCCCCCCCCCCC
Confidence            3466666555332   12333432    244667777653               3468899999865


No 169
>PF04889 Cwf_Cwc_15:  Cwf15/Cwc15 cell cycle control protein;  InterPro: IPR006973 This family represents Cwf15/Cwc15 (from Schizosaccharomyces pombe and Saccharomyces cerevisiae respectively) and their homologues. The function of these proteins is unknown, but they form part of the spliceosome and are thus thought to be involved in mRNA splicing [].; GO: 0000398 nuclear mRNA splicing, via spliceosome, 0005681 spliceosomal complex
Probab=34.99  E-value=28  Score=32.22  Aligned_cols=6  Identities=33%  Similarity=0.451  Sum_probs=2.3

Q ss_pred             HHHHHH
Q psy2838         363 ALTAHV  368 (458)
Q Consensus       363 ~L~~H~  368 (458)
                      .|+..+
T Consensus        58 dlr~eL   63 (244)
T PF04889_consen   58 DLRAEL   63 (244)
T ss_pred             HHHHHH
Confidence            343333


No 170
>PF07975 C1_4:  TFIIH C1-like domain;  InterPro: IPR004595 All proteins in this domain for which functions are known are components of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. It includes the yeast transcription factor Ssl1 (Suppressor of stem-loop protein 1) that is essential for translation initiation and affects UV resistance. The C-terminal region is essential for transcription activity. This regions binds three zinc atoms through two independent domain. The first contains a C4 zinc finger motif, whereas the second is characterised by a CX(2)CX(2-4)FCADCD motif. The solution structure of the second C-terminal domain revealed homology with the regulatory domain of protein kinase C [].; GO: 0006281 DNA repair, 0005634 nucleus; PDB: 1Z60_A.
Probab=34.75  E-value=16  Score=24.67  Aligned_cols=20  Identities=20%  Similarity=0.589  Sum_probs=6.6

Q ss_pred             eecCccCcccCChhHHHHHH
Q psy2838         293 YTCEVCSKSYNQVTNLLSHM  312 (458)
Q Consensus       293 ~~C~~C~~~f~~~~~L~~H~  312 (458)
                      |.|+.|...|--.-.+-.|.
T Consensus        22 y~C~~C~~~FC~dCD~fiHE   41 (51)
T PF07975_consen   22 YRCPKCKNHFCIDCDVFIHE   41 (51)
T ss_dssp             E--TTTT--B-HHHHHTTTT
T ss_pred             EECCCCCCccccCcChhhhc
Confidence            55555555444444433333


No 171
>PTZ00303 phosphatidylinositol kinase; Provisional
Probab=32.70  E-value=15  Score=38.96  Aligned_cols=11  Identities=36%  Similarity=1.138  Sum_probs=5.9

Q ss_pred             cccccccccCC
Q psy2838         182 PCDQCGKSFTL  192 (458)
Q Consensus       182 ~C~~C~~~f~~  192 (458)
                      .|..|++.|..
T Consensus       462 tC~~C~kkFfS  472 (1374)
T PTZ00303        462 SCPSCGRAFIS  472 (1374)
T ss_pred             cccCcCCcccc
Confidence            35555555543


No 172
>smart00440 ZnF_C2C2 C2C2 Zinc finger. Nucleic-acid-binding motif in transcriptional elongation factor TFIIS and RNA polymerases.
Probab=32.63  E-value=20  Score=22.76  Aligned_cols=11  Identities=27%  Similarity=0.902  Sum_probs=6.1

Q ss_pred             ceecCcccccc
Q psy2838         264 KYKCEECTKTF  274 (458)
Q Consensus       264 ~~~C~~C~~~f  274 (458)
                      -|.|..|+..+
T Consensus        28 fy~C~~C~~~w   38 (40)
T smart00440       28 FYVCTKCGHRW   38 (40)
T ss_pred             EEEeCCCCCEe
Confidence            45666665544


No 173
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=32.44  E-value=36  Score=38.66  Aligned_cols=11  Identities=18%  Similarity=0.628  Sum_probs=7.2

Q ss_pred             eeecCcccccc
Q psy2838         236 RFKCDHCVKSF  246 (458)
Q Consensus       236 ~~~C~~C~~~f  246 (458)
                      .|.|+.|+...
T Consensus       692 vy~CPsCGaev  702 (1337)
T PRK14714        692 VYVCPDCGAEV  702 (1337)
T ss_pred             ceeCccCCCcc
Confidence            46777777654


No 174
>PF05191 ADK_lid:  Adenylate kinase, active site lid;  InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=31.37  E-value=23  Score=21.96  Aligned_cols=9  Identities=22%  Similarity=0.870  Sum_probs=4.2

Q ss_pred             ecCcccccc
Q psy2838         238 KCDHCVKSF  246 (458)
Q Consensus       238 ~C~~C~~~f  246 (458)
                      .|+.|+..|
T Consensus         3 ~C~~Cg~~Y   11 (36)
T PF05191_consen    3 ICPKCGRIY   11 (36)
T ss_dssp             EETTTTEEE
T ss_pred             CcCCCCCcc
Confidence            344444444


No 175
>KOG3540|consensus
Probab=31.21  E-value=31  Score=34.60  Aligned_cols=9  Identities=33%  Similarity=0.682  Sum_probs=4.1

Q ss_pred             ccCcceecC
Q psy2838         260 HEGIKYKCE  268 (458)
Q Consensus       260 ~~~~~~~C~  268 (458)
                      |..+||.|.
T Consensus       112 h~Vrp~~Cl  120 (615)
T KOG3540|consen  112 HPVRPYRCL  120 (615)
T ss_pred             cccccceee
Confidence            334455553


No 176
>KOG2593|consensus
Probab=31.06  E-value=58  Score=32.40  Aligned_cols=34  Identities=21%  Similarity=0.478  Sum_probs=17.1

Q ss_pred             cccccCccccCChHHHHHHHHHhcCCceeecCccccc
Q psy2838         209 IKCDFCDKTFAHKPNLIAHMNSAHNGVRFKCDHCVKS  245 (458)
Q Consensus       209 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~  245 (458)
                      |.|+.|.+.|.....+.   ..-.....|.|..|+-.
T Consensus       129 Y~Cp~C~kkyt~Lea~~---L~~~~~~~F~C~~C~ge  162 (436)
T KOG2593|consen  129 YVCPNCQKKYTSLEALQ---LLDNETGEFHCENCGGE  162 (436)
T ss_pred             ccCCccccchhhhHHHH---hhcccCceEEEecCCCc
Confidence            66666666665443332   12222234666666543


No 177
>PF01096 TFIIS_C:  Transcription factor S-II (TFIIS);  InterPro: IPR001222 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents a zinc finger motif found in transcription factor IIs (TFIIS). In eukaryotes the initiation of transcription of protein encoding genes by polymerase II (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least eight different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, -IIH and -IIS []. During mRNA elongation, Pol II can encounter DNA sequences that cause reverse movement of the enzyme. Such backtracking involves extrusion of the RNA 3'-end into the pore, and can lead to transcriptional arrest. Escape from arrest requires cleavage of the extruded RNA with the help of TFIIS, which induces mRNA cleavage by enhancing the intrinsic nuclease activity of RNA polymerase (Pol) II, past template-encoded pause sites []. TFIIS extends from the polymerase surface via a pore to the internal active site. Two essential and invariant acidic residues in a TFIIS loop complement the Pol II active site and could position a metal ion and a water molecule for hydrolytic RNA cleavage. TFIIS also induces extensive structural changes in Pol II that would realign nucleic acids in the active centre.  TFIIS is a protein of about 300 amino acids. It contains three regions: a variable N-terminal domain not required for TFIIS activity; a conserved central domain required for Pol II binding; and a conserved C-terminal C4-type zinc finger essential for RNA cleavage. The zinc finger folds in a conformation termed a zinc ribbon [] characterised by a three-stranded antiparallel beta-sheet and two beta-hairpins. A backbone model for Pol II-TFIIS complex was obtained from X-ray analysis. It shows that a beta hairpin protrudes from the zinc finger and complements the pol II active site [].  Some viral proteins also contain the TFIIS zinc ribbon C-terminal domain. The Vaccinia virus protein, unlike its eukaryotic homologue, is an integral RNA polymerase subunit rather than a readily separable transcription factor []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding, 0006351 transcription, DNA-dependent; PDB: 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I 3I4M_I ....
Probab=30.78  E-value=13  Score=23.42  Aligned_cols=12  Identities=25%  Similarity=0.816  Sum_probs=7.5

Q ss_pred             cceecCcccccc
Q psy2838         263 IKYKCEECTKTF  274 (458)
Q Consensus       263 ~~~~C~~C~~~f  274 (458)
                      .-|.|..|+..|
T Consensus        27 ~fy~C~~C~~~w   38 (39)
T PF01096_consen   27 LFYVCCNCGHRW   38 (39)
T ss_dssp             EEEEESSSTEEE
T ss_pred             EEEEeCCCCCee
Confidence            346777777654


No 178
>smart00349 KRAB krueppel associated box.
Probab=30.58  E-value=6  Score=24.98  Aligned_cols=29  Identities=7%  Similarity=0.009  Sum_probs=24.6

Q ss_pred             ceeeecchhHHHHHHHhcCCCcCCCCCCC
Q psy2838          43 YNYYQLSSDEKFEILKSISASFTPGYGGF   71 (458)
Q Consensus        43 ~~~~~~~~~~~~~~~~~~~~~~~~~y~~~   71 (458)
                      +.|.++++.+..++|.++.+.++.++.+.
T Consensus         1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   29 (61)
T smart00349        1 VTFEDVAVYFSQEEWEQLDPAQRNLYRDV   29 (61)
T ss_pred             CeeecceeecCHHHHhccCHHHHHHHHHH
Confidence            36789999999999999999888877665


No 179
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=30.50  E-value=27  Score=26.45  Aligned_cols=12  Identities=25%  Similarity=0.711  Sum_probs=7.2

Q ss_pred             cccccCccccCC
Q psy2838         209 IKCDFCDKTFAH  220 (458)
Q Consensus       209 ~~C~~C~~~f~~  220 (458)
                      -.|..||+.|..
T Consensus        59 a~CkkCGfef~~   70 (97)
T COG3357          59 ARCKKCGFEFRD   70 (97)
T ss_pred             hhhcccCccccc
Confidence            456666666654


No 180
>PRK00432 30S ribosomal protein S27ae; Validated
Probab=30.50  E-value=18  Score=24.25  Aligned_cols=11  Identities=36%  Similarity=0.824  Sum_probs=6.1

Q ss_pred             ccccCcCcccC
Q psy2838         348 RLTCRTCGLFC  358 (458)
Q Consensus       348 ~~~C~~C~~~f  358 (458)
                      ++.|..|++.+
T Consensus        37 r~~C~~Cgyt~   47 (50)
T PRK00432         37 RWHCGKCGYTE   47 (50)
T ss_pred             cEECCCcCCEE
Confidence            45566665543


No 181
>PF07754 DUF1610:  Domain of unknown function (DUF1610);  InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=30.29  E-value=22  Score=19.84  Aligned_cols=8  Identities=38%  Similarity=1.497  Sum_probs=4.5

Q ss_pred             cccCcccc
Q psy2838         320 KYKCADCG  327 (458)
Q Consensus       320 ~~~C~~C~  327 (458)
                      .|.|+.||
T Consensus        16 ~f~CPnCG   23 (24)
T PF07754_consen   16 PFPCPNCG   23 (24)
T ss_pred             eEeCCCCC
Confidence            45555555


No 182
>PF06220 zf-U1:  U1 zinc finger;  InterPro: IPR013085 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  C2H2-type (classical) zinc fingers (Znf) were the first class to be characterised. They contain a short beta hairpin and an alpha helix (beta/beta/alpha structure), where a single zinc atom is held in place by Cys(2)His(2) (C2H2) residues in a tetrahedral array. C2H2 Znf's can be divided into three groups based on the number and pattern of fingers: triple-C2H2 (binds single ligand), multiple-adjacent-C2H2 (binds multiple ligands), and separated paired-C2H2 []. C2H2 Znf's are the most common DNA-binding motifs found in eukaryotic transcription factors, and have also been identified in prokaryotes []. Transcription factors usually contain several Znf's (each with a conserved beta/beta/alpha structure) capable of making multiple contacts along the DNA, where the C2H2 Znf motifs recognise DNA sequences by binding to the major groove of DNA via a short alpha-helix in the Znf, the Znf spanning 3-4 bases of the DNA []. C2H2 Znf's can also bind to RNA and protein targets []. This entry represents a C2H2-type zinc finger motif found in several U1 small nuclear ribonucleoprotein C (U1-C) proteins. Some proteins contain multiple copies of this motif. The U1 small nuclear ribonucleoprotein (U1 snRNP) binds to the pre-mRNA 5' splice site at early stages of spliceosome assembly. Recruitment of U1 to a class of weak 5' splice site is promoted by binding of the protein TIA-1 to uridine-rich sequences immediately downstream from the 5' splice site. Binding of TIA-1 in the vicinity of a 5' splice site helps to stabilise U1 snRNP recruitment, at least in part, via a direct interaction with U1-C, thus providing one molecular mechanism for the function of this splicing regulator []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 2VRD_A.
Probab=29.98  E-value=38  Score=21.23  Aligned_cols=12  Identities=25%  Similarity=0.667  Sum_probs=4.9

Q ss_pred             ccccCcCcccCC
Q psy2838         348 RLTCRTCGLFCD  359 (458)
Q Consensus       348 ~~~C~~C~~~f~  359 (458)
                      +|.|.+|...|.
T Consensus         3 ryyCdyC~~~~~   14 (38)
T PF06220_consen    3 RYYCDYCKKYLT   14 (38)
T ss_dssp             S-B-TTT--B-S
T ss_pred             Ceecccccceec
Confidence            567888888773


No 183
>COG1998 RPS31 Ribosomal protein S27AE [Translation, ribosomal structure and biogenesis]
Probab=29.85  E-value=30  Score=23.05  Aligned_cols=10  Identities=40%  Similarity=1.036  Sum_probs=5.3

Q ss_pred             ccccCcCccc
Q psy2838         348 RLTCRTCGLF  357 (458)
Q Consensus       348 ~~~C~~C~~~  357 (458)
                      ++.|..|+++
T Consensus        37 R~~CGkCgyT   46 (51)
T COG1998          37 RWACGKCGYT   46 (51)
T ss_pred             eeEeccccce
Confidence            4555555543


No 184
>smart00735 ZM ZASP-like motif. Short motif (26 amino acids) present in an alpha-actinin-binding protein, ZASP, and similar molecules.
Probab=29.54  E-value=27  Score=19.82  Aligned_cols=22  Identities=23%  Similarity=0.330  Sum_probs=16.8

Q ss_pred             cccccCCCCCCCCCCCCCCCCC
Q psy2838           2 VTYSYNTPSPAPSTTSRRDSFP   23 (458)
Q Consensus         2 ~~~~~~~~~~~~~~~~~~~~~~   23 (458)
                      |+-.||+|..-.|..+..++++
T Consensus         3 v~~qyn~P~glys~~n~~~~l~   24 (26)
T smart00735        3 VHKQYNSPIGLYSSENIAETLQ   24 (26)
T ss_pred             cccccCCCCCCCCcccHHHhhc
Confidence            4568999998888877776654


No 185
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=29.46  E-value=10  Score=40.01  Aligned_cols=81  Identities=20%  Similarity=0.340  Sum_probs=49.2

Q ss_pred             cchhhhhhhhcCCccccccccCCCCCCCCcccccccccccCCHHHHHHHHHhhcCCCccccccCccccCChHHHHHHHHH
Q psy2838         151 FNRSQFEKTFQNYSMTTTGHRGGKGPPVKKFPCDQCGKSFTLKQNLTYHVNSIHLGVRIKCDFCDKTFAHKPNLIAHMNS  230 (458)
Q Consensus       151 ~~c~~c~~~~~~~~~~~~~~~~~~~~~~k~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f~~~~~L~~H~~~  230 (458)
                      -.|..|-..+.+.+.-..+.        .-..|..||-.|+-...|-.-........--.|+.|.+.|.+..+-+-|.  
T Consensus       102 a~C~~Cl~Ei~dp~~rrY~Y--------PF~~CT~CGPRfTIi~alPYDR~nTsM~~F~lC~~C~~EY~dP~nRRfHA--  171 (750)
T COG0068         102 ATCEDCLEEIFDPNSRRYLY--------PFINCTNCGPRFTIIEALPYDRENTSMADFPLCPFCDKEYKDPLNRRFHA--  171 (750)
T ss_pred             hhhHHHHHHhcCCCCcceec--------cccccCCCCcceeeeccCCCCcccCccccCcCCHHHHHHhcCcccccccc--
Confidence            35888888777666544432        12359999998876655533222111112246899988887776644443  


Q ss_pred             hcCCceeecCccccc
Q psy2838         231 AHNGVRFKCDHCVKS  245 (458)
Q Consensus       231 h~~~~~~~C~~C~~~  245 (458)
                          .+..|+.||-.
T Consensus       172 ----Qp~aCp~CGP~  182 (750)
T COG0068         172 ----QPIACPKCGPH  182 (750)
T ss_pred             ----ccccCcccCCC
Confidence                46789999853


No 186
>KOG3555|consensus
Probab=29.21  E-value=45  Score=32.08  Aligned_cols=25  Identities=16%  Similarity=0.336  Sum_probs=10.4

Q ss_pred             ccccCChHHHHHHHHHhcCCceeec
Q psy2838         215 DKTFAHKPNLIAHMNSAHNGVRFKC  239 (458)
Q Consensus       215 ~~~f~~~~~L~~H~~~h~~~~~~~C  239 (458)
                      |.+|.....|..|.-.+.....+.|
T Consensus       155 ghtYss~ckLe~~aC~~sksiav~c  179 (434)
T KOG3555|consen  155 GHTYSSRCKLEYHACHVSKSIAVIC  179 (434)
T ss_pred             CCeehhhhhHHHHhhhhhhhhhhhh
Confidence            3344444444444443333333333


No 187
>KOG0717|consensus
Probab=28.98  E-value=32  Score=34.49  Aligned_cols=22  Identities=27%  Similarity=0.495  Sum_probs=14.2

Q ss_pred             cccCcCcccCCCHHHHHHHHHH
Q psy2838         349 LTCRTCGLFCDSKRALTAHVRE  370 (458)
Q Consensus       349 ~~C~~C~~~f~~~~~L~~H~~~  370 (458)
                      ++|.+|++.|.+...|..|...
T Consensus       293 lyC~vCnKsFKseKq~kNHEnS  314 (508)
T KOG0717|consen  293 LYCVVCNKSFKSEKQLKNHENS  314 (508)
T ss_pred             eEEeeccccccchHHHHhhHHH
Confidence            5666666666666666666543


No 188
>KOG2907|consensus
Probab=28.90  E-value=32  Score=27.24  Aligned_cols=8  Identities=25%  Similarity=0.949  Sum_probs=4.7

Q ss_pred             cccccccc
Q psy2838         183 CDQCGKSF  190 (458)
Q Consensus       183 C~~C~~~f  190 (458)
                      |..|...|
T Consensus        28 C~~Ck~~~   35 (116)
T KOG2907|consen   28 CIRCKIEY   35 (116)
T ss_pred             eccccccC
Confidence            66665554


No 189
>COG5406 Nucleosome binding factor SPN, SPT16 subunit [Transcription / DNA replication, recombination, and repair / Chromatin structure and dynamics]
Probab=28.45  E-value=42  Score=35.02  Aligned_cols=30  Identities=13%  Similarity=-0.045  Sum_probs=18.5

Q ss_pred             ccceeeecchhHHHHHHHhcCCCcCCCCCC
Q psy2838          41 SEYNYYQLSSDEKFEILKSISASFTPGYGG   70 (458)
Q Consensus        41 ~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~   70 (458)
                      ....++.|+--.-.+-+..+++++..+...
T Consensus       475 r~EhQK~L~~k~~~egL~rf~~a~~~gpds  504 (1001)
T COG5406         475 RVEHQKELLDKIIEEGLERFRNASDAGPDS  504 (1001)
T ss_pred             HHHHHHHHHHHHHHHHHHHHhhhcccCCcc
Confidence            334455566666666666666677666655


No 190
>KOG0782|consensus
Probab=28.38  E-value=21  Score=36.53  Aligned_cols=27  Identities=19%  Similarity=0.386  Sum_probs=11.7

Q ss_pred             HHHHHHHHhhcCceeecCccCcccCCh
Q psy2838         279 NLLSHVQAIHMGIKYTCEVCSKSYNQV  305 (458)
Q Consensus       279 ~L~~H~~~~h~~~~~~C~~C~~~f~~~  305 (458)
                      .+.+|-..|.....-+|..|+|.|.++
T Consensus       240 ~fvrHHWVHrrRqeGkC~~CgKgFQQK  266 (1004)
T KOG0782|consen  240 GFVRHHWVHRRRQEGKCNTCGKGFQQK  266 (1004)
T ss_pred             cchHHhHhhHhhhccccchhhhhhhhh
Confidence            344443333222234555555555443


No 191
>KOG3214|consensus
Probab=28.26  E-value=26  Score=27.03  Aligned_cols=7  Identities=29%  Similarity=1.023  Sum_probs=3.3

Q ss_pred             cccCcCc
Q psy2838         349 LTCRTCG  355 (458)
Q Consensus       349 ~~C~~C~  355 (458)
                      |.|++|+
T Consensus        24 FnClfcn   30 (109)
T KOG3214|consen   24 FNCLFCN   30 (109)
T ss_pred             eccCccc
Confidence            4455554


No 192
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=28.09  E-value=26  Score=29.48  Aligned_cols=8  Identities=38%  Similarity=1.103  Sum_probs=3.5

Q ss_pred             ccccCcCc
Q psy2838         348 RLTCRTCG  355 (458)
Q Consensus       348 ~~~C~~C~  355 (458)
                      .|.|..|+
T Consensus       140 ~YrC~~C~  147 (156)
T COG3091         140 VYRCGKCG  147 (156)
T ss_pred             eEEeccCC
Confidence            34444443


No 193
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=27.27  E-value=36  Score=28.32  Aligned_cols=10  Identities=30%  Similarity=1.012  Sum_probs=4.2

Q ss_pred             eecCcccccc
Q psy2838         237 FKCDHCVKSF  246 (458)
Q Consensus       237 ~~C~~C~~~f  246 (458)
                      +.|..|+..|
T Consensus        71 ~~C~~CG~~~   80 (135)
T PRK03824         71 LKCRNCGNEW   80 (135)
T ss_pred             EECCCCCCEE
Confidence            3444444333


No 194
>PF13878 zf-C2H2_3:  zinc-finger of acetyl-transferase ESCO
Probab=27.00  E-value=65  Score=20.54  Aligned_cols=10  Identities=30%  Similarity=1.069  Sum_probs=5.2

Q ss_pred             cccccccccC
Q psy2838         182 PCDQCGKSFT  191 (458)
Q Consensus       182 ~C~~C~~~f~  191 (458)
                      +|+.||..|.
T Consensus        15 ~C~~CgM~Y~   24 (41)
T PF13878_consen   15 TCPTCGMLYS   24 (41)
T ss_pred             CCCCCCCEEC
Confidence            4555555443


No 195
>PF05495 zf-CHY:  CHY zinc finger;  InterPro: IPR008913 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  Pirh2 is an eukaryotic ubiquitin protein ligase, which has been shown to promote p53 degradation in mammals. Pirh2 physically interacts with p53 and promotes ubiquitination of p53 independently of MDM2. Like MDM2, Pirh2 is thought to participate in an autoregulatory feedback loop that controls p53 function. Pirh2 proteins contain three distinct zinc fingers, the CHY-type, the CTCHY-type which is C-terminal to the CHY-type zinc finger and a RING finger. The CHY-type zinc finger has no currently known function []. As well as Pirh2, the CHY-type zinc finger is also found in the following proteins:   Yeast helper of Tim protein 13. Hot13 may have a role in the assembly and recycling of the small Tims, a complex of the mitochondrial intermembrane space that participates in the TIM22 import pathway for assembly of the inner membrane [] Several plant hypothetical proteins that also contain haemerythrin cation binding domains Several protozoan hypothetical proteins that also contain a Myb domain   The solution structure of this zinc finger has been solved and binds 3 zinc atoms as shown in the following schematic representation:   ++---------+-----+ || | | CXHYxxxxxxxxxCCxxxxxCxxCHxxxxxHxxxxxxxxxxxCxxCxxxxxxxxxCxxC | | | | | | | | +-+-----------------+--+ +--+---------+--+ 'C': conserved cysteine involved in the binding of one zinc atom. 'H': conserved histidine involved in the binding of one zinc atom.  More information about these proteins can be found at Protein of the Month: Zinc Fingers []; GO: 0008270 zinc ion binding; PDB: 2DKT_A 2K2C_A.
Probab=26.42  E-value=21  Score=25.94  Aligned_cols=11  Identities=27%  Similarity=0.670  Sum_probs=4.3

Q ss_pred             eeecCcccccc
Q psy2838         236 RFKCDHCVKSF  246 (458)
Q Consensus       236 ~~~C~~C~~~f  246 (458)
                      ...|..|...+
T Consensus        41 ~v~Cg~C~~~~   51 (71)
T PF05495_consen   41 RVICGKCRTEQ   51 (71)
T ss_dssp             EEEETTT--EE
T ss_pred             CeECCCCCCcc
Confidence            44455554443


No 196
>PF13821 DUF4187:  Domain of unknown function (DUF4187)
Probab=26.12  E-value=65  Score=22.12  Aligned_cols=18  Identities=28%  Similarity=0.422  Sum_probs=12.6

Q ss_pred             cccCcCcccCCCHHHHHH
Q psy2838         349 LTCRTCGLFCDSKRALTA  366 (458)
Q Consensus       349 ~~C~~C~~~f~~~~~L~~  366 (458)
                      ++|-.||..|.+...|..
T Consensus        28 ~YC~~Cg~~Y~d~~dL~~   45 (55)
T PF13821_consen   28 NYCFWCGTKYDDEEDLER   45 (55)
T ss_pred             ceeeeeCCccCCHHHHHh
Confidence            467777777777776654


No 197
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=25.78  E-value=43  Score=27.52  Aligned_cols=22  Identities=27%  Similarity=0.351  Sum_probs=11.1

Q ss_pred             eecCccCcccCChhHHHHHHhhhcc
Q psy2838         293 YTCEVCSKSYNQVTNLLSHMTTVHQ  317 (458)
Q Consensus       293 ~~C~~C~~~f~~~~~L~~H~~~h~~  317 (458)
                      ..|-.+|+.|.   +|++|+.+|++
T Consensus        77 IicLEDGkkfK---SLKRHL~t~~g   98 (148)
T COG4957          77 IICLEDGKKFK---SLKRHLTTHYG   98 (148)
T ss_pred             EEEeccCcchH---HHHHHHhcccC
Confidence            44555555554   34555555544


No 198
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=25.35  E-value=25  Score=24.09  Aligned_cols=7  Identities=43%  Similarity=1.061  Sum_probs=3.0

Q ss_pred             ecCcccc
Q psy2838         238 KCDHCVK  244 (458)
Q Consensus       238 ~C~~C~~  244 (458)
                      .|+.|+.
T Consensus        24 ~Cp~CGa   30 (54)
T TIGR01206        24 ICDECGA   30 (54)
T ss_pred             eCCCCCC
Confidence            4444443


No 199
>PRK03564 formate dehydrogenase accessory protein FdhE; Provisional
Probab=25.25  E-value=39  Score=32.38  Aligned_cols=49  Identities=20%  Similarity=0.572  Sum_probs=24.0

Q ss_pred             CcccccccccccCCHHHHHHHHHhhcCCCccccccCccccCChHHHHHHHHHhcCCceeecCcccc
Q psy2838         179 KKFPCDQCGKSFTLKQNLTYHVNSIHLGVRIKCDFCDKTFAHKPNLIAHMNSAHNGVRFKCDHCVK  244 (458)
Q Consensus       179 k~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~  244 (458)
                      ..-.|++||..=. .+.+  +.....+.+-..|..|+.....              ...+|+.|+.
T Consensus       186 ~~~~CPvCGs~P~-~s~v--~~~~~~G~RyL~CslC~teW~~--------------~R~~C~~Cg~  234 (309)
T PRK03564        186 QRQFCPVCGSMPV-SSVV--QIGTTQGLRYLHCNLCESEWHV--------------VRVKCSNCEQ  234 (309)
T ss_pred             CCCCCCCCCCcch-hhee--eccCCCCceEEEcCCCCCcccc--------------cCccCCCCCC
Confidence            3456788875421 1111  2221223333677777755421              2356777774


No 200
>COG5112 UFD2 U1-like Zn-finger-containing protein [General function prediction only]
Probab=25.24  E-value=29  Score=27.09  Aligned_cols=24  Identities=21%  Similarity=0.473  Sum_probs=20.3

Q ss_pred             ccccCcCcccCCCHHHHHHHHHHh
Q psy2838         348 RLTCRTCGLFCDSKRALTAHVREA  371 (458)
Q Consensus       348 ~~~C~~C~~~f~~~~~L~~H~~~~  371 (458)
                      .++|-.|.+.|.+...|..|.+..
T Consensus        55 qhYCieCaryf~t~~aL~~Hkkgk   78 (126)
T COG5112          55 QHYCIECARYFITEKALMEHKKGK   78 (126)
T ss_pred             eeeeehhHHHHHHHHHHHHHhccc
Confidence            478999999999999999998643


No 201
>PF13453 zf-TFIIB:  Transcription factor zinc-finger
Probab=24.87  E-value=41  Score=21.32  Aligned_cols=12  Identities=25%  Similarity=0.567  Sum_probs=4.8

Q ss_pred             cCcCcccCCCHH
Q psy2838         351 CRTCGLFCDSKR  362 (458)
Q Consensus       351 C~~C~~~f~~~~  362 (458)
                      |+.|+-.+-...
T Consensus        22 C~~C~G~W~d~~   33 (41)
T PF13453_consen   22 CPSCGGIWFDAG   33 (41)
T ss_pred             CCCCCeEEccHH
Confidence            444444333333


No 202
>PF08790 zf-LYAR:  LYAR-type C2HC zinc finger ;  InterPro: IPR014898 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This C2HC zinc finger domain is found in LYAR proteins such as Q08288 from SWISSPROT, which are involved in cell growth regulation.  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1WJV_A.
Probab=24.82  E-value=38  Score=19.69  Aligned_cols=9  Identities=44%  Similarity=1.088  Sum_probs=4.0

Q ss_pred             ecCcccccc
Q psy2838         238 KCDHCVKSF  246 (458)
Q Consensus       238 ~C~~C~~~f  246 (458)
                      .|-.|++.|
T Consensus         2 sCiDC~~~F   10 (28)
T PF08790_consen    2 SCIDCSKDF   10 (28)
T ss_dssp             EETTTTEEE
T ss_pred             eeecCCCCc
Confidence            344444444


No 203
>COG2879 Uncharacterized small protein [Function unknown]
Probab=24.74  E-value=69  Score=22.53  Aligned_cols=18  Identities=22%  Similarity=0.338  Sum_probs=15.0

Q ss_pred             CHHHHHHHHHHhCCCCCc
Q psy2838         360 SKRALTAHVREAHPGCEY  377 (458)
Q Consensus       360 ~~~~L~~H~~~~h~~~~~  377 (458)
                      +-.+...|||.+|+++|.
T Consensus        24 dYdnYVehmr~~hPd~p~   41 (65)
T COG2879          24 DYDNYVEHMRKKHPDKPP   41 (65)
T ss_pred             cHHHHHHHHHHhCcCCCc
Confidence            457889999999998764


No 204
>PF04780 DUF629:  Protein of unknown function (DUF629);  InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=24.40  E-value=52  Score=33.41  Aligned_cols=26  Identities=19%  Similarity=0.484  Sum_probs=23.0

Q ss_pred             ccCcCcccCCCHHHHHHHHHHhCCCC
Q psy2838         350 TCRTCGLFCDSKRALTAHVREAHPGC  375 (458)
Q Consensus       350 ~C~~C~~~f~~~~~L~~H~~~~h~~~  375 (458)
                      .|+.|.+.|.....+..|+...|.+.
T Consensus        59 iCp~CskkF~d~~~~~~H~~~eH~~~   84 (466)
T PF04780_consen   59 ICPRCSKKFSDAESCLSHMEQEHPAG   84 (466)
T ss_pred             eCCcccceeCCHHHHHHHHHHhhhhh
Confidence            49999999999999999999888763


No 205
>PF09416 UPF1_Zn_bind:  RNA helicase (UPF2 interacting domain);  InterPro: IPR018999 UPF1 (or regulator of nonsense transcripts 1 homologue) is an essential RNA helicase that detects mRNAs containing premature stop codons and triggers their degradation. This domain contains 3 zinc binding motifs and forms interactions with another protein (UPF2) that is also involved nonsense-mediated mRNA decay (NMD) []. ; GO: 0003677 DNA binding, 0004386 helicase activity, 0005524 ATP binding, 0008270 zinc ion binding, 0000184 nuclear-transcribed mRNA catabolic process, nonsense-mediated decay, 0005737 cytoplasm; PDB: 2IYK_B 2WJY_A 2WJV_A 2XZL_A.
Probab=23.71  E-value=78  Score=26.84  Aligned_cols=10  Identities=20%  Similarity=0.647  Sum_probs=2.5

Q ss_pred             ceeecCcccc
Q psy2838         235 VRFKCDHCVK  244 (458)
Q Consensus       235 ~~~~C~~C~~  244 (458)
                      ..+.|-.||.
T Consensus        59 t~leCy~Cg~   68 (152)
T PF09416_consen   59 TVLECYNCGS   68 (152)
T ss_dssp             -B---TTT--
T ss_pred             cEEEEEecCC
Confidence            3466766663


No 206
>PF14311 DUF4379:  Domain of unknown function (DUF4379)
Probab=23.03  E-value=47  Score=22.59  Aligned_cols=10  Identities=20%  Similarity=1.009  Sum_probs=4.1

Q ss_pred             eecCccCccc
Q psy2838         293 YTCEVCSKSY  302 (458)
Q Consensus       293 ~~C~~C~~~f  302 (458)
                      ++|..|+..|
T Consensus        29 W~C~~Cgh~w   38 (55)
T PF14311_consen   29 WKCPKCGHEW   38 (55)
T ss_pred             EECCCCCCee
Confidence            3444444333


No 207
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=22.58  E-value=88  Score=29.60  Aligned_cols=91  Identities=15%  Similarity=0.340  Sum_probs=53.8

Q ss_pred             cccccCccccCChHHHHHHHHHhcCCceeecCccccccCCchhhhhccchhccC------------cceecCcccccccC
Q psy2838         209 IKCDFCDKTFAHKPNLIAHMNSAHNGVRFKCDHCVKSFNQKPNLISHIKSVHEG------------IKYKCEECTKTFAQ  276 (458)
Q Consensus       209 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~~~~~------------~~~~C~~C~~~f~~  276 (458)
                      |.|+.|....-              ..|..|+.|........+|.+-..-.-.-            +.-.|-.|.-.|..
T Consensus       309 y~CP~CktkVC--------------sLPi~CP~Csl~LilsthLarSyhhL~PLk~f~E~p~~~~~ks~~Cf~CQ~~fp~  374 (421)
T COG5151         309 YECPVCKTKVC--------------SLPISCPICSLQLILSTHLARSYHHLYPLKPFVEKPEGTNPKSTHCFVCQGPFPK  374 (421)
T ss_pred             eeCCcccceee--------------cCCccCcchhHHHHHHHHHHHHHHhhccCcccccccCCCCCCCccceeccCCCCC
Confidence            77888864322              24677888887766666665433200011            22347777777765


Q ss_pred             HHHHHHHHHHhhcCceeecCccCcccCChhHHHHHHhhh
Q psy2838         277 KQNLLSHVQAIHMGIKYTCEVCSKSYNQVTNLLSHMTTV  315 (458)
Q Consensus       277 ~~~L~~H~~~~h~~~~~~C~~C~~~f~~~~~L~~H~~~h  315 (458)
                      ...-..-  ......+|+|+.|...|-.......|...|
T Consensus       375 ~~~~~~~--~~~ss~rY~Ce~CK~~FC~dCdvfiHe~Lh  411 (421)
T COG5151         375 PPVSPFD--ESTSSGRYQCELCKSTFCSDCDVFIHETLH  411 (421)
T ss_pred             CCCCccc--ccccccceechhhhhhhhhhhHHHHHHHHh
Confidence            4321111  112233599999999998888888887765


No 208
>PF04780 DUF629:  Protein of unknown function (DUF629);  InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=22.41  E-value=54  Score=33.32  Aligned_cols=25  Identities=20%  Similarity=0.561  Sum_probs=18.7

Q ss_pred             ccCccccccCCCHHHHHHHHhhccC
Q psy2838         321 YKCADCGRLFAQEHTLRTHTLSVHS  345 (458)
Q Consensus       321 ~~C~~C~~~f~~~~~L~~H~~~~h~  345 (458)
                      +.|+.|.+.|.....+..|+...|.
T Consensus        58 WiCp~CskkF~d~~~~~~H~~~eH~   82 (466)
T PF04780_consen   58 WICPRCSKKFSDAESCLSHMEQEHP   82 (466)
T ss_pred             eeCCcccceeCCHHHHHHHHHHhhh
Confidence            5677888888888888888776663


No 209
>COG3364 Zn-ribbon containing protein [General function prediction only]
Probab=22.37  E-value=49  Score=25.70  Aligned_cols=14  Identities=29%  Similarity=0.769  Sum_probs=7.5

Q ss_pred             cccccccccccCCH
Q psy2838         180 KFPCDQCGKSFTLK  193 (458)
Q Consensus       180 ~~~C~~C~~~f~~~  193 (458)
                      +++|..||..|.+-
T Consensus         2 pH~CtrCG~vf~~g   15 (112)
T COG3364           2 PHQCTRCGEVFDDG   15 (112)
T ss_pred             Cceecccccccccc
Confidence            34555566655543


No 210
>PF15135 UPF0515:  Uncharacterised protein UPF0515
Probab=22.29  E-value=78  Score=29.01  Aligned_cols=14  Identities=21%  Similarity=0.859  Sum_probs=7.9

Q ss_pred             cceecCcccccccC
Q psy2838         263 IKYKCEECTKTFAQ  276 (458)
Q Consensus       263 ~~~~C~~C~~~f~~  276 (458)
                      --|.|+.|+..|..
T Consensus       154 aef~C~~C~h~F~G  167 (278)
T PF15135_consen  154 AEFHCPKCRHNFRG  167 (278)
T ss_pred             eeeecccccccchh
Confidence            34666666666653


No 211
>PF03115 Astro_capsid:  Astrovirus capsid protein precursor;  InterPro: IPR004337 The astrovirus genome is apparently organised with nonstructural proteins encoded at the 5' end and structural proteins at the 3' end []. Proteins in this family are encoded by astrovirus ORF2, one of the three astrovirus ORFs (1a, 1b, 2). The proteins contain a viral RNA-dependent RNA polymerase motif []. The 87kDa precursor polyprotein undergoes an intracellular cleavage to form a 79kDa protein. Subsequently, extracellular trypsin cleavage yields the three proteins forming the infectious virion [].; PDB: 3QSQ_A 3TS3_D.
Probab=22.27  E-value=29  Score=37.69  Aligned_cols=54  Identities=13%  Similarity=-0.082  Sum_probs=0.0

Q ss_pred             CCCCCCCCCCCCCCCCCCCCCCCCCCcccccceeeecchhHHHHHHHhcCCCcCCCCCCC
Q psy2838          12 APSTTSRRDSFPPPHPLGQNGFDPTGQVKSEYNYYQLSSDEKFEILKSISASFTPGYGGF   71 (458)
Q Consensus        12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~   71 (458)
                      |-+-|--|-|.=+-+.  |+  ...|..+++..=.||.|.  +.-.=+|.+.+-.+.|+.
T Consensus       152 aV~Gtv~r~sln~~~~--p~--~tsws~lgARkH~dv~~G--~~~~~~l~~r~l~GPr~g  205 (787)
T PF03115_consen  152 AVSGTVVRVSLNLTGT--PS--STSWSGLGARKHVDVPVG--RPAVFKLTARDLAGPREG  205 (787)
T ss_dssp             ------------------------------------------------------------
T ss_pred             ccCceEEEEEecCCCC--CC--CCCccccCceeEEEEcCC--CceEEEECHHHcCCCcCc
Confidence            3344444555544333  22  234556666655555543  222224455555555554


No 212
>KOG4124|consensus
Probab=21.40  E-value=25  Score=33.69  Aligned_cols=21  Identities=10%  Similarity=-0.102  Sum_probs=14.2

Q ss_pred             CCcccccccccccCCHHHHHHH
Q psy2838         178 VKKFPCDQCGKSFTLKQNLTYH  199 (458)
Q Consensus       178 ~k~~~C~~C~~~f~~~~~L~~H  199 (458)
                      .++|+|. |++.+.++..|+.|
T Consensus       211 ~~p~k~~-~~~~~~T~~~l~~H  231 (442)
T KOG4124|consen  211 GTPKKMP-ESLVMDTSSPLSDH  231 (442)
T ss_pred             cCCccCc-ccccccccchhhhc
Confidence            4567775 77777777776666


No 213
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=21.33  E-value=45  Score=28.10  Aligned_cols=11  Identities=27%  Similarity=1.201  Sum_probs=5.3

Q ss_pred             ceecCccccccc
Q psy2838         264 KYKCEECTKTFA  275 (458)
Q Consensus       264 ~~~C~~C~~~f~  275 (458)
                      +|.|. |+..|.
T Consensus       117 ~Y~C~-C~q~~l  127 (156)
T COG3091         117 PYRCQ-CQQHYL  127 (156)
T ss_pred             eEEee-cCCccc
Confidence            45555 555443


No 214
>COG2331 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=21.11  E-value=41  Score=24.61  Aligned_cols=33  Identities=18%  Similarity=0.606  Sum_probs=18.7

Q ss_pred             CcccccccccccCCHHHHHHHHHhhcCCCccccccCcccc
Q psy2838         179 KKFPCDQCGKSFTLKQNLTYHVNSIHLGVRIKCDFCDKTF  218 (458)
Q Consensus       179 k~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~C~~C~~~f  218 (458)
                      -.|+|..|+..|    .+.+||.   ....-.|+.|+..+
T Consensus        11 Y~Y~c~~cg~~~----dvvq~~~---ddplt~ce~c~a~~   43 (82)
T COG2331          11 YSYECTECGNRF----DVVQAMT---DDPLTTCEECGARL   43 (82)
T ss_pred             eEEeecccchHH----HHHHhcc---cCccccChhhChHH
Confidence            457788887766    3444543   11224677777543


No 215
>PF01363 FYVE:  FYVE zinc finger;  InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=21.00  E-value=60  Score=23.08  Aligned_cols=9  Identities=44%  Similarity=1.191  Sum_probs=2.7

Q ss_pred             ccccCcccc
Q psy2838         210 KCDFCDKTF  218 (458)
Q Consensus       210 ~C~~C~~~f  218 (458)
                      .|..|++.|
T Consensus        11 ~C~~C~~~F   19 (69)
T PF01363_consen   11 NCMICGKKF   19 (69)
T ss_dssp             B-TTT--B-
T ss_pred             cCcCcCCcC
Confidence            455555555


No 216
>KOG0978|consensus
Probab=20.61  E-value=48  Score=35.39  Aligned_cols=18  Identities=28%  Similarity=0.549  Sum_probs=10.0

Q ss_pred             ecCccccccCCchhhhhc
Q psy2838         238 KCDHCVKSFNQKPNLISH  255 (458)
Q Consensus       238 ~C~~C~~~f~~~~~L~~H  255 (458)
                      +||.|+..|....-+..|
T Consensus       680 KCP~Cn~aFganDv~~I~  697 (698)
T KOG0978|consen  680 KCPKCNAAFGANDVHRIH  697 (698)
T ss_pred             CCCCCCCCCCcccccccC
Confidence            566666666655444433


No 217
>PF13451 zf-trcl:  Probable zinc-binding domain
Probab=20.49  E-value=45  Score=22.30  Aligned_cols=14  Identities=36%  Similarity=0.835  Sum_probs=6.5

Q ss_pred             cccCccccccCCCH
Q psy2838         320 KYKCADCGRLFAQE  333 (458)
Q Consensus       320 ~~~C~~C~~~f~~~  333 (458)
                      .+.|..||..|...
T Consensus         4 ~l~C~dCg~~FvfT   17 (49)
T PF13451_consen    4 TLTCKDCGAEFVFT   17 (49)
T ss_pred             eEEcccCCCeEEEe
Confidence            34455555444433


No 218
>PRK12380 hydrogenase nickel incorporation protein HybF; Provisional
Probab=20.48  E-value=49  Score=26.55  Aligned_cols=10  Identities=30%  Similarity=0.392  Sum_probs=4.9

Q ss_pred             eecCccCccc
Q psy2838         293 YTCEVCSKSY  302 (458)
Q Consensus       293 ~~C~~C~~~f  302 (458)
                      +.|..|+..|
T Consensus        71 ~~C~~Cg~~~   80 (113)
T PRK12380         71 AWCWDCSQVV   80 (113)
T ss_pred             EEcccCCCEE
Confidence            4455555444


No 219
>TIGR00100 hypA hydrogenase nickel insertion protein HypA. In Hpylori, hypA mutant abolished hydrogenase activity and decrease in urease activity. Nickel supplementation in media restored urease activity and partial hydrogenase activity. HypA probably involved in inserting Ni in enzymes.
Probab=20.20  E-value=47  Score=26.72  Aligned_cols=10  Identities=40%  Similarity=0.853  Sum_probs=4.5

Q ss_pred             eecCccCccc
Q psy2838         293 YTCEVCSKSY  302 (458)
Q Consensus       293 ~~C~~C~~~f  302 (458)
                      ..|..|+..|
T Consensus        71 ~~C~~Cg~~~   80 (115)
T TIGR00100        71 CECEDCSEEV   80 (115)
T ss_pred             EEcccCCCEE
Confidence            4444444444


No 220
>COG1655 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.18  E-value=44  Score=30.21  Aligned_cols=18  Identities=17%  Similarity=0.451  Sum_probs=9.4

Q ss_pred             ceeecCccccccCCchhh
Q psy2838         235 VRFKCDHCVKSFNQKPNL  252 (458)
Q Consensus       235 ~~~~C~~C~~~f~~~~~L  252 (458)
                      +.+.|++|+..|....-+
T Consensus        18 k~ieCPvC~tkFkkeev~   35 (267)
T COG1655          18 KTIECPVCNTKFKKEEVK   35 (267)
T ss_pred             ceeccCcccchhhhhhee
Confidence            345566666665544433


No 221
>TIGR00244 transcriptional regulator NrdR. Members of this almost entirely bacterial family contain an ATP cone domain (PFAM:PF03477). There is never more than one member per genome. Common gene symbols given include nrdR, ybaD, ribX and ytcG. The member from Streptomyces coelicolor is found upstream in the operon of the class II oxygen-independent ribonucleotide reductase gene nrdJ and was shown to repress nrdJ expression. Many members of this family are found near genes for riboflavin biosynthesis in Gram-negative bacteria, suggesting a role in that pathway. However, a phylogenetic profiling study associates members of this family with the presence of a palindromic signal with consensus acaCwAtATaTwGtgt, termed the NrdR-box, an upstream element for most operons for ribonucleotide reductase of all three classes in bacterial genomes.
Probab=20.11  E-value=49  Score=27.80  Aligned_cols=15  Identities=20%  Similarity=0.454  Sum_probs=8.2

Q ss_pred             eecCccccccCCchh
Q psy2838         237 FKCDHCVKSFNQKPN  251 (458)
Q Consensus       237 ~~C~~C~~~f~~~~~  251 (458)
                      -.|..|++.|++...
T Consensus        29 ReC~~C~~RFTTyEr   43 (147)
T TIGR00244        29 RECLECHERFTTFER   43 (147)
T ss_pred             ccCCccCCccceeee
Confidence            346666666655443


No 222
>PF01780 Ribosomal_L37ae:  Ribosomal L37ae protein family;  InterPro: IPR002674 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits.  Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This ribosomal protein is found in archaebacteria and eukaryotes []. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A1E_Y 4A17_Y 4A1C_Y 4A1A_Y 3O58_g 3IZS_m 3O5H_g 1S1I_9 3IZR_m 1YSH_D ....
Probab=20.08  E-value=35  Score=26.09  Aligned_cols=10  Identities=50%  Similarity=1.484  Sum_probs=5.6

Q ss_pred             ccCccccccC
Q psy2838         321 YKCADCGRLF  330 (458)
Q Consensus       321 ~~C~~C~~~f  330 (458)
                      |.|..|++.|
T Consensus        54 W~C~~C~~~~   63 (90)
T PF01780_consen   54 WKCKKCGKKF   63 (90)
T ss_dssp             EEETTTTEEE
T ss_pred             eecCCCCCEE
Confidence            5555555544


No 223
>KOG3408|consensus
Probab=20.03  E-value=67  Score=25.90  Aligned_cols=27  Identities=26%  Similarity=0.531  Sum_probs=22.4

Q ss_pred             CCCCcccccccccccCCHHHHHHHHHh
Q psy2838         176 PPVKKFPCDQCGKSFTLKQNLTYHVNS  202 (458)
Q Consensus       176 ~~~k~~~C~~C~~~f~~~~~L~~H~~~  202 (458)
                      ++...|-|-.|.+.|.+...|..|.++
T Consensus        53 PG~GqfyCi~CaRyFi~~~~l~~H~kt   79 (129)
T KOG3408|consen   53 PGGGQFYCIECARYFIDAKALKTHFKT   79 (129)
T ss_pred             CCCceeehhhhhhhhcchHHHHHHHhc
Confidence            445568899999999999999999874


Done!