Query         psy8050
Match_columns 311
No_of_seqs    388 out of 3471
Neff          9.5 
Searched_HMMs 46136
Date          Fri Aug 16 17:08:00 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy8050.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8050hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG2462|consensus              100.0 1.3E-28 2.7E-33  204.0   7.1  138  110-249   125-265 (279)
  2 KOG2462|consensus               99.9   4E-27 8.7E-32  195.1   4.6  134  141-283   128-264 (279)
  3 KOG3608|consensus               99.9 1.7E-25 3.8E-30  190.0   2.9  265   33-310    66-369 (467)
  4 KOG3608|consensus               99.9 7.3E-23 1.6E-27  174.1   5.2  235   35-287   133-377 (467)
  5 KOG1074|consensus               99.9 1.9E-23 4.1E-28  194.9   1.8   79  173-251   607-692 (958)
  6 KOG1074|consensus               99.8 1.2E-21 2.6E-26  182.9   4.1  106  200-308   606-724 (958)
  7 KOG3623|consensus               99.8 3.2E-21   7E-26  177.3   0.4  117   35-164   209-330 (1007)
  8 KOG3623|consensus               99.7 2.4E-19 5.2E-24  165.2   0.8  108   80-193   211-331 (1007)
  9 KOG3576|consensus               99.7 2.3E-17   5E-22  130.6   3.0  114  141-254   115-239 (267)
 10 KOG3576|consensus               99.6 2.6E-16 5.6E-21  124.7   5.5  118  110-227   112-240 (267)
 11 PLN03086 PRLI-interacting fact  99.1 8.4E-11 1.8E-15  109.7   7.7  143  114-282   406-560 (567)
 12 PLN03086 PRLI-interacting fact  99.1 9.9E-11 2.1E-15  109.3   6.6  118  115-249   433-562 (567)
 13 PHA00733 hypothetical protein   99.0 2.4E-10 5.2E-15   87.5   4.2   55  196-252    70-124 (128)
 14 PHA00733 hypothetical protein   98.9 7.5E-10 1.6E-14   84.8   3.6   83  140-224    37-124 (128)
 15 PHA02768 hypothetical protein;  98.7   4E-09 8.6E-14   66.8   1.9   42  200-243     6-47  (55)
 16 KOG3993|consensus               98.7 9.5E-10 2.1E-14   96.8  -1.5   80  144-223   268-380 (500)
 17 KOG3993|consensus               98.7 1.4E-09   3E-14   95.7  -1.2  164  114-284   266-480 (500)
 18 PHA02768 hypothetical protein;  98.6 2.4E-08 5.2E-13   63.2   2.6   42  144-187     6-47  (55)
 19 PF13465 zf-H2C2_2:  Zinc-finge  98.5 8.5E-08 1.8E-12   51.8   2.5   26  214-239     1-26  (26)
 20 PF13465 zf-H2C2_2:  Zinc-finge  98.3 4.2E-07 9.2E-12   49.1   2.7   24  131-154     2-25  (26)
 21 PHA00616 hypothetical protein   98.1   1E-06 2.3E-11   53.1   1.5   32  200-231     2-33  (44)
 22 PHA00732 hypothetical protein   98.1 2.9E-06 6.4E-11   58.9   2.7   46  199-250     1-47  (79)
 23 PHA00616 hypothetical protein   98.0 2.1E-06 4.6E-11   51.8   1.6   33  143-175     1-33  (44)
 24 PHA00732 hypothetical protein   98.0 5.5E-06 1.2E-10   57.6   2.8   48  227-287     1-49  (79)
 25 PF05605 zf-Di19:  Drought indu  97.8 4.3E-05 9.2E-10   49.2   4.4   47  200-249     3-51  (54)
 26 PF00096 zf-C2H2:  Zinc finger,  97.7 2.7E-05 5.9E-10   40.6   2.4   23  228-250     1-23  (23)
 27 PF00096 zf-C2H2:  Zinc finger,  97.5 7.4E-05 1.6E-09   38.9   2.3   22  144-165     1-22  (23)
 28 PF05605 zf-Di19:  Drought indu  97.5 0.00019 4.1E-09   46.2   4.5   49  116-167     3-53  (54)
 29 PF13894 zf-C2H2_4:  C2H2-type   97.5 0.00013 2.7E-09   38.2   2.6   23  228-250     1-23  (24)
 30 PF13912 zf-C2H2_6:  C2H2-type   97.4 9.7E-05 2.1E-09   40.1   1.5   25  227-251     1-25  (27)
 31 PF12756 zf-C2H2_2:  C2H2 type   97.4 0.00015 3.3E-09   52.9   3.0   24  227-250    50-73  (100)
 32 PF13894 zf-C2H2_4:  C2H2-type   97.2 0.00031 6.8E-09   36.6   2.5   23  144-166     1-23  (24)
 33 PF12756 zf-C2H2_2:  C2H2 type   97.2 0.00018   4E-09   52.5   2.0   72  146-222     2-73  (100)
 34 PF13912 zf-C2H2_6:  C2H2-type   97.0  0.0004 8.6E-09   37.6   1.6   23  144-166     2-24  (27)
 35 COG5189 SFP1 Putative transcri  96.8 0.00037 8.1E-09   59.9   0.6   20  264-283   400-419 (423)
 36 smart00355 ZnF_C2H2 zinc finge  96.6  0.0021 4.5E-08   33.9   2.5   23  228-250     1-23  (26)
 37 COG5189 SFP1 Putative transcri  96.6 0.00075 1.6E-08   58.1   0.8   51  197-247   347-418 (423)
 38 KOG2231|consensus               96.4  0.0052 1.1E-07   59.0   5.4  100  146-252   118-237 (669)
 39 KOG2231|consensus               96.4  0.0078 1.7E-07   57.9   6.4  122  118-250   118-262 (669)
 40 smart00355 ZnF_C2H2 zinc finge  96.1  0.0055 1.2E-07   32.2   2.3   24  200-223     1-24  (26)
 41 PF13909 zf-H2C2_5:  C2H2-type   96.0  0.0055 1.2E-07   32.0   1.9   23  228-251     1-23  (24)
 42 PF12874 zf-met:  Zinc-finger o  96.0  0.0041 8.8E-08   32.9   1.4   21  228-248     1-21  (25)
 43 COG5236 Uncharacterized conser  95.9   0.011 2.4E-07   51.7   4.5  129  116-254   152-309 (493)
 44 PF09237 GAGA:  GAGA factor;  I  95.9  0.0069 1.5E-07   37.5   2.3   27  226-252    23-49  (54)
 45 PRK04860 hypothetical protein;  95.7   0.005 1.1E-07   49.0   1.4   39  198-240   118-156 (160)
 46 PF09237 GAGA:  GAGA factor;  I  95.7   0.014 3.1E-07   36.1   3.1   29  142-170    23-51  (54)
 47 COG5048 FOG: Zn-finger [Genera  95.5  0.0075 1.6E-07   55.9   2.1  144  143-291   289-447 (467)
 48 PF12874 zf-met:  Zinc-finger o  95.4  0.0086 1.9E-07   31.5   1.3   23  200-222     1-23  (25)
 49 PF13909 zf-H2C2_5:  C2H2-type   95.4   0.017 3.6E-07   30.1   2.3   22  144-166     1-22  (24)
 50 KOG1146|consensus               95.0   0.013 2.7E-07   59.9   1.9   51  115-165   436-487 (1406)
 51 PRK04860 hypothetical protein;  94.9   0.023 4.9E-07   45.3   2.8   34  172-209   120-153 (160)
 52 KOG2482|consensus               94.9   0.061 1.3E-06   47.1   5.5   57  228-284   280-356 (423)
 53 COG5048 FOG: Zn-finger [Genera  94.5  0.0093   2E-07   55.3  -0.4  142  114-255   288-446 (467)
 54 PF12171 zf-C2H2_jaz:  Zinc-fin  94.4   0.018 3.9E-07   31.0   0.8   20  200-219     2-21  (27)
 55 KOG1146|consensus               94.2   0.028   6E-07   57.6   2.2   59  226-284   517-611 (1406)
 56 PF12171 zf-C2H2_jaz:  Zinc-fin  93.9   0.026 5.7E-07   30.4   0.9   22  172-193     2-23  (27)
 57 KOG2785|consensus               93.1     0.4 8.6E-06   42.9   7.2   23  142-164    67-89  (390)
 58 TIGR00622 ssl1 transcription f  93.0    0.17 3.8E-06   37.4   4.2   48  201-250    57-104 (112)
 59 KOG4173|consensus               92.7   0.037 8.1E-07   44.9   0.3   46  144-192    80-127 (253)
 60 PF13913 zf-C2HC_2:  zinc-finge  92.2    0.14 2.9E-06   27.1   2.0   20  228-248     3-22  (25)
 61 PF13913 zf-C2HC_2:  zinc-finge  92.0    0.14   3E-06   27.1   1.9   19  145-164     4-22  (25)
 62 COG5236 Uncharacterized conser  92.0    0.24 5.2E-06   43.6   4.4  133  143-284   151-303 (493)
 63 smart00451 ZnF_U1 U1-like zinc  90.7    0.23   5E-06   28.3   2.1   22  227-248     3-24  (35)
 64 KOG2482|consensus               89.9    0.37   8E-06   42.4   3.6   51  172-222   280-357 (423)
 65 KOG2893|consensus               89.2     0.1 2.2E-06   43.4  -0.3   49  202-254    13-62  (341)
 66 smart00451 ZnF_U1 U1-like zinc  88.8    0.37   8E-06   27.4   2.0   22  143-164     3-24  (35)
 67 TIGR00622 ssl1 transcription f  88.7     1.1 2.4E-05   33.2   4.9   85  142-234    14-110 (112)
 68 PF12013 DUF3505:  Protein of u  88.2     1.2 2.7E-05   32.9   5.0   25  172-196    81-109 (109)
 69 cd00350 rubredoxin_like Rubred  88.2    0.39 8.4E-06   27.2   1.8    8  173-180     3-10  (33)
 70 KOG2893|consensus               86.7    0.27 5.9E-06   41.0   0.8   41  118-162    13-53  (341)
 71 PF06524 NOA36:  NOA36 protein;  86.2    0.55 1.2E-05   39.8   2.3   26  225-250   207-232 (314)
 72 PF12013 DUF3505:  Protein of u  85.0     1.1 2.5E-05   33.1   3.4   24  142-166    10-33  (109)
 73 PF09986 DUF2225:  Uncharacteri  83.3     0.2 4.4E-06   42.1  -1.5   48  226-275     4-61  (214)
 74 PF09538 FYDLN_acid:  Protein o  82.7       1 2.2E-05   33.3   2.1   34  112-156     6-39  (108)
 75 KOG2186|consensus               82.4    0.83 1.8E-05   38.6   1.8   49  115-166     3-51  (276)
 76 COG4049 Uncharacterized protei  80.2    0.97 2.1E-05   28.6   1.1   29  139-167    13-41  (65)
 77 KOG4173|consensus               79.8    0.61 1.3E-05   38.1   0.1   77  171-250    79-170 (253)
 78 PF13717 zinc_ribbon_4:  zinc-r  79.6       2 4.3E-05   24.8   2.2   33  116-153     3-35  (36)
 79 cd00729 rubredoxin_SM Rubredox  76.9       2 4.3E-05   24.5   1.7    9  172-180     3-11  (34)
 80 PF09538 FYDLN_acid:  Protein o  76.1     1.8 3.9E-05   32.0   1.7   14  226-239    25-38  (108)
 81 PF09986 DUF2225:  Uncharacteri  75.6     1.4 3.1E-05   37.0   1.2   13  200-212    49-61  (214)
 82 COG4049 Uncharacterized protei  75.3     1.6 3.4E-05   27.7   1.0   25  224-248    14-38  (65)
 83 PF13719 zinc_ribbon_5:  zinc-r  75.0       3 6.5E-05   24.2   2.1   33  116-153     3-35  (37)
 84 TIGR00373 conserved hypothetic  74.4     3.6 7.9E-05   32.7   3.2   34  139-181   105-138 (158)
 85 TIGR02098 MJ0042_CXXC MJ0042 f  74.3     2.7 5.7E-05   24.4   1.8   10  144-153    26-35  (38)
 86 COG1592 Rubrerythrin [Energy p  71.6     2.8 6.1E-05   33.5   1.9   10  143-152   134-143 (166)
 87 KOG2186|consensus               70.5     2.8   6E-05   35.6   1.7   47  143-192     3-49  (276)
 88 COG1592 Rubrerythrin [Energy p  69.9     2.6 5.6E-05   33.7   1.4   10  225-234   147-156 (166)
 89 TIGR02300 FYDLN_acid conserved  68.6     3.9 8.4E-05   30.9   2.0   34  112-156     6-39  (129)
 90 smart00531 TFIIE Transcription  68.6     9.4  0.0002   29.9   4.3   38  140-181    96-133 (147)
 91 PRK06266 transcription initiat  67.0     5.8 0.00013   32.3   2.9   32  141-181   115-146 (178)
 92 KOG1701|consensus               66.2     2.7 5.8E-05   38.4   0.9   44  264-307   396-441 (468)
 93 TIGR02605 CxxC_CxxC_SSSS putat  65.8     3.7 8.1E-05   25.7   1.3   10  144-153     6-15  (52)
 94 COG2888 Predicted Zn-ribbon RN  64.3     8.5 0.00018   24.9   2.6   10  170-179    49-58  (61)
 95 TIGR01384 TFS_arch transcripti  64.0     3.2 6.9E-05   30.4   0.8   13   79-91     16-28  (104)
 96 KOG1280|consensus               63.8     7.4 0.00016   34.6   3.1   22  143-164    79-100 (381)
 97 smart00659 RPOLCX RNA polymera  63.0     6.5 0.00014   23.9   1.9   10  144-153     3-12  (44)
 98 smart00834 CxxC_CXXC_SSSS Puta  63.0     4.7  0.0001   23.6   1.3   10  172-181     6-15  (41)
 99 PRK00398 rpoP DNA-directed RNA  62.8       5 0.00011   24.5   1.4   10  144-153     4-13  (46)
100 KOG2879|consensus               62.3      11 0.00024   32.5   3.7   28   16-43    219-246 (298)
101 PRK00464 nrdR transcriptional   62.2       2 4.4E-05   33.9  -0.5   16  200-215    29-44  (154)
102 PRK14890 putative Zn-ribbon RN  60.9      14  0.0003   23.9   3.1   10  170-179    47-56  (59)
103 TIGR00373 conserved hypothetic  60.5       8 0.00017   30.8   2.6   32  168-208   106-137 (158)
104 KOG4752|consensus               60.1     8.1 0.00018   19.6   1.6   11   10-20      7-17  (26)
105 COG1198 PriA Primosomal protei  59.5       7 0.00015   39.0   2.5   12  197-208   473-484 (730)
106 TIGR02300 FYDLN_acid conserved  58.2     7.1 0.00015   29.5   1.7   13  226-238    25-37  (129)
107 smart00614 ZnF_BED BED zinc fi  58.1     9.4  0.0002   23.7   2.1   21  228-248    19-44  (50)
108 smart00734 ZnF_Rad18 Rad18-lik  57.9     9.3  0.0002   20.2   1.7   17  146-163     4-20  (26)
109 PF08188 Protamine_3:  Spermato  57.4     9.7 0.00021   22.2   1.8   23    2-24      8-30  (48)
110 smart00531 TFIIE Transcription  57.2      14 0.00029   29.0   3.3   39  112-154    96-134 (147)
111 KOG2785|consensus               55.9      14  0.0003   33.5   3.4   49  116-164   167-241 (390)
112 PRK06266 transcription initiat  54.1      10 0.00023   30.8   2.3   30  170-208   116-145 (178)
113 PF09723 Zn-ribbon_8:  Zinc rib  53.9     7.9 0.00017   23.1   1.2   11  172-182     6-16  (42)
114 PF02892 zf-BED:  BED zinc fing  53.6      16 0.00035   21.8   2.6   22  226-247    15-40  (45)
115 COG5151 SSL1 RNA polymerase II  53.6     6.1 0.00013   34.6   0.9   23  228-250   389-411 (421)
116 COG1198 PriA Primosomal protei  53.5      10 0.00022   38.0   2.4   40  172-236   445-484 (730)
117 PF10571 UPF0547:  Uncharacteri  52.9     7.8 0.00017   20.6   0.9    9  296-304    17-25  (26)
118 KOG2807|consensus               52.5      26 0.00056   31.1   4.5   25  226-250   344-368 (378)
119 PF05129 Elf1:  Transcription e  52.4     6.2 0.00014   27.5   0.6   10   35-44     21-30  (81)
120 KOG4377|consensus               52.0      11 0.00025   34.3   2.3  110  142-254   270-430 (480)
121 KOG4602|consensus               51.6      14 0.00031   31.4   2.7   10  143-152   268-277 (318)
122 PF05290 Baculo_IE-1:  Baculovi  51.5     8.1 0.00018   29.4   1.1   17  168-184    77-93  (140)
123 PF02176 zf-TRAF:  TRAF-type zi  51.3      11 0.00023   24.2   1.6   33  143-176     9-43  (60)
124 PF05443 ROS_MUCR:  ROS/MUCR tr  49.4      10 0.00022   29.1   1.4   26  113-141    70-95  (132)
125 PF14446 Prok-RING_1:  Prokaryo  49.2     9.3  0.0002   24.3   1.0   12  264-275     7-18  (54)
126 PRK04023 DNA polymerase II lar  48.7      18 0.00039   37.1   3.4    9  144-152   627-635 (1121)
127 PF08274 PhnA_Zn_Ribbon:  PhnA   48.6     8.6 0.00019   21.2   0.7   10  142-151    18-27  (30)
128 CHL00152 rpl32 ribosomal prote  48.5      10 0.00022   24.0   1.0   22    2-23      4-25  (53)
129 PF05191 ADK_lid:  Adenylate ki  47.8     9.1  0.0002   22.1   0.7   11  264-274     3-13  (36)
130 COG1655 Uncharacterized protei  47.8     3.6 7.7E-05   34.4  -1.3   46  226-271    18-71  (267)
131 KOG2807|consensus               47.3      21 0.00045   31.6   3.1   23  264-286   347-369 (378)
132 cd00730 rubredoxin Rubredoxin;  47.2      25 0.00055   22.0   2.7    9   37-45      2-10  (50)
133 COG1996 RPC10 DNA-directed RNA  47.1      14 0.00031   22.9   1.6   10  144-153     7-16  (49)
134 PF15269 zf-C2H2_7:  Zinc-finge  47.0      24 0.00052   21.3   2.4   22  115-136    20-41  (54)
135 PF03604 DNA_RNApol_7kD:  DNA d  44.9      20 0.00043   20.1   1.8    7  144-150    18-24  (32)
136 PRK14873 primosome assembly pr  44.8      16 0.00035   36.2   2.4    8  145-152   385-392 (665)
137 COG1997 RPL43A Ribosomal prote  44.2      20 0.00043   25.2   2.1   14  114-127    52-65  (89)
138 COG4530 Uncharacterized protei  44.0      18 0.00039   26.5   1.9   32  112-154     6-37  (129)
139 PF00301 Rubredoxin:  Rubredoxi  43.2      23  0.0005   21.8   2.1   13  172-184     2-14  (47)
140 PRK09678 DNA-binding transcrip  41.4     7.3 0.00016   26.5  -0.4   17  170-186    26-44  (72)
141 PF01783 Ribosomal_L32p:  Ribos  41.4       9 0.00019   24.6   0.0   31    2-43      3-33  (56)
142 KOG2593|consensus               39.9      36 0.00079   31.5   3.6   39  139-180   124-162 (436)
143 PRK04023 DNA polymerase II lar  37.8      30 0.00066   35.6   3.0    9  144-152   639-647 (1121)
144 TIGR00595 priA primosomal prot  37.7      25 0.00055   33.7   2.5   10  198-207   252-261 (505)
145 PF05443 ROS_MUCR:  ROS/MUCR tr  37.6      23 0.00049   27.3   1.7   25  227-254    72-96  (132)
146 COG5151 SSL1 RNA polymerase II  37.4      46 0.00099   29.4   3.7   79  142-222   321-411 (421)
147 KOG2071|consensus               36.8      31 0.00067   33.3   2.8   27  141-167   416-442 (579)
148 PF13878 zf-C2H2_3:  zinc-finge  36.6      38 0.00082   20.1   2.2   24  228-251    14-39  (41)
149 KOG4167|consensus               33.9      11 0.00025   36.9  -0.5   27  141-167   790-816 (907)
150 PRK03824 hypA hydrogenase nick  32.5      29 0.00062   26.8   1.6   11  144-154    71-81  (135)
151 COG3677 Transposase and inacti  31.1      64  0.0014   24.7   3.3   17  291-307    51-67  (129)
152 COG4957 Predicted transcriptio  30.5      56  0.0012   25.1   2.7   25  113-140    74-98  (148)
153 KOG4377|consensus               30.2      37 0.00081   31.1   2.1  100  119-223   277-427 (480)
154 KOG0782|consensus               29.6      13 0.00029   35.4  -0.8   53  212-276   238-290 (1004)
155 PHA00626 hypothetical protein   29.3      28 0.00061   22.2   0.9   15  226-240    22-36  (59)
156 PF14353 CpXC:  CpXC protein     29.2      16 0.00034   27.8  -0.3   17  143-159    38-54  (128)
157 KOG0847|consensus               29.2      22 0.00048   29.5   0.5   20   13-32    217-236 (288)
158 PF07754 DUF1610:  Domain of un  29.0      26 0.00056   18.2   0.5   10   34-43     14-23  (24)
159 PF04959 ARS2:  Arsenite-resist  29.0      29 0.00064   29.1   1.2   25  199-223    77-101 (214)
160 PF09845 DUF2072:  Zn-ribbon co  28.9      29 0.00063   26.5   1.0   14  144-157     2-15  (131)
161 PF14122 YokU:  YokU-like prote  28.8      52  0.0011   23.1   2.2   21  111-131    31-51  (87)
162 PF05495 zf-CHY:  CHY zinc fing  28.6      20 0.00044   24.2   0.2    9  145-153    12-20  (71)
163 PF15135 UPF0515:  Uncharacteri  28.2      53  0.0011   28.0   2.5   12  142-153   111-122 (278)
164 COG3357 Predicted transcriptio  27.8      37  0.0008   24.0   1.3   13  171-183    58-70  (97)
165 KOG2593|consensus               27.6      39 0.00084   31.3   1.8   36  114-152   127-162 (436)
166 KOG1280|consensus               27.3      52  0.0011   29.5   2.4   36  199-234    79-116 (381)
167 PTZ00255 60S ribosomal protein  27.2      29 0.00064   24.6   0.8   10  200-209    55-64  (90)
168 PF04959 ARS2:  Arsenite-resist  27.1      23 0.00051   29.7   0.3   30  169-198    75-104 (214)
169 KOG2906|consensus               27.1      12 0.00026   26.8  -1.2   13   78-90     20-32  (105)
170 PF01363 FYVE:  FYVE zinc finge  26.9      35 0.00076   22.5   1.1    9  264-272    11-19  (69)
171 COG1779 C4-type Zn-finger prot  26.7      20 0.00043   29.4  -0.2   11  295-305    45-55  (201)
172 PF01780 Ribosomal_L37ae:  Ribo  26.5      25 0.00054   25.0   0.3   10  200-209    54-63  (90)
173 TIGR01206 lysW lysine biosynth  26.4      41 0.00089   21.4   1.2    9  264-272     4-12  (54)
174 KOG4167|consensus               25.4      21 0.00046   35.2  -0.3   31  110-140   787-817 (907)
175 PF01096 TFIIS_C:  Transcriptio  25.2      15 0.00033   21.5  -0.8    6  174-179     3-8   (39)
176 PF13453 zf-TFIIB:  Transcripti  25.0      45 0.00097   19.6   1.2   17  144-160    20-36  (41)
177 PF10058 DUF2296:  Predicted in  24.6      77  0.0017   20.1   2.3    9  143-151    44-52  (54)
178 TIGR00280 L37a ribosomal prote  24.5      31 0.00068   24.5   0.5   11  200-210    54-64  (91)
179 COG0333 RpmF Ribosomal protein  23.8      45 0.00097   21.5   1.1   30    3-43      5-34  (57)
180 TIGR00595 priA primosomal prot  23.1      53  0.0012   31.5   1.9   30  192-236   233-262 (505)
181 PRK00432 30S ribosomal protein  22.9      65  0.0014   20.1   1.6   11  143-153    37-47  (50)
182 PRK14714 DNA polymerase II lar  22.8   1E+02  0.0022   32.9   3.9    8  144-151   668-675 (1337)
183 TIGR01562 FdhE formate dehydro  22.6      58  0.0013   29.0   1.9   12  168-179   207-218 (305)
184 PRK12286 rpmF 50S ribosomal pr  22.4      25 0.00054   22.6  -0.3    9   35-43     26-34  (57)
185 TIGR00686 phnA alkylphosphonat  22.1      54  0.0012   24.1   1.3   15  142-156    18-32  (109)
186 PRK12380 hydrogenase nickel in  22.0      56  0.0012   24.3   1.5   11  144-154    71-81  (113)
187 KOG0782|consensus               22.0      30 0.00065   33.1   0.0   53  240-307   238-290 (1004)
188 PRK03564 formate dehydrogenase  21.8      30 0.00066   30.8   0.0    6  201-206   228-233 (309)
189 PF12647 RNHCP:  RNHCP domain;   21.8 3.1E+02  0.0067   19.6   5.0    9   35-43      3-11  (92)
190 KOG3408|consensus               21.7      49  0.0011   24.8   1.1   26  196-221    54-79  (129)
191 TIGR00100 hypA hydrogenase nic  21.5      57  0.0012   24.3   1.4   11  144-154    71-81  (115)
192 PF10013 DUF2256:  Uncharacteri  21.5      40 0.00087   20.2   0.5   16  116-131     9-24  (42)
193 COG2331 Uncharacterized protei  21.4      30 0.00064   23.6  -0.1   10  228-237    13-22  (82)
194 KOG3014|consensus               21.4      42 0.00091   28.7   0.8   31   35-68     36-66  (257)
195 TIGR03829 YokU_near_AblA uncha  21.1      54  0.0012   23.3   1.1   16  115-130    35-50  (89)
196 PF09788 Tmemb_55A:  Transmembr  21.0 3.7E+02   0.008   23.2   6.2   41   36-81     65-107 (256)
197 PF07295 DUF1451:  Protein of u  20.9      65  0.0014   25.2   1.7   11  227-237   112-122 (146)
198 PF07649 C1_3:  C1-like domain;  20.6      56  0.0012   17.6   0.9    8  143-150    15-22  (30)
199 PF13451 zf-trcl:  Probable zin  20.5      90   0.002   19.4   1.9   18  226-243     3-20  (49)
200 PRK05978 hypothetical protein;  20.0      58  0.0013   25.6   1.2    9  265-273    55-63  (148)

No 1  
>KOG2462|consensus
Probab=99.95  E-value=1.3e-28  Score=204.03  Aligned_cols=138  Identities=36%  Similarity=0.654  Sum_probs=126.5

Q ss_pred             CCCcceeeccCCCCcCCChhHHHHHHhhhCC---CCCeecCcCccccCChhHHHHHHHHhcCCCceecCCCCccccCcch
Q psy8050         110 SGIRLILIQFIPGKTYARPSTLKTHLRTHSG---EKPYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQSSS  186 (311)
Q Consensus       110 ~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~---~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~  186 (311)
                      ......++|..||+.|.+.++|.+|.+.|-.   .+.+.|++|+++|.+...|..|+++|+  .+++|.+||+.|...+.
T Consensus       125 ~~~~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~--l~c~C~iCGKaFSRPWL  202 (279)
T KOG2462|consen  125 AAKHPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT--LPCECGICGKAFSRPWL  202 (279)
T ss_pred             cccCCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC--CCcccccccccccchHH
Confidence            3455678999999999999999999998864   467999999999999999999999996  57899999999999999


Q ss_pred             hhhhhhhccccccccccccccccCChHHHHHHhhhhcCCCCeecCccccccCChHHHHHHHHH
Q psy8050         187 VTTHMRTHSGERPYRCRLCKKAFSDSSTLTKHLRIHSGEKPYQCKLCLLRFSQSGNLNRHMRV  249 (311)
Q Consensus       187 l~~H~~~h~~~k~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~  249 (311)
                      |+-|+++|+||+||.|+.|+++|...++|+.|+++|.+.+.|+|..|++.|...+.|..|...
T Consensus       203 LQGHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES  265 (279)
T KOG2462|consen  203 LQGHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSES  265 (279)
T ss_pred             hhcccccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhh
Confidence            999999999999999999999999999999999999999999999999999999999999764


No 2  
>KOG2462|consensus
Probab=99.93  E-value=4e-27  Score=195.13  Aligned_cols=134  Identities=32%  Similarity=0.671  Sum_probs=125.5

Q ss_pred             CCCeecCcCccccCChhHHHHHHHHhcC---CCceecCCCCccccCcchhhhhhhhccccccccccccccccCChHHHHH
Q psy8050         141 EKPYRCEDCNKSFSQAANLTAHVRTHSG---EKPFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAFSDSSTLTK  217 (311)
Q Consensus       141 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~---~~~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~l~~  217 (311)
                      ...|.|..||+.+.+.++|.+|.++|..   .+.+.|.+||+.|.+...|..|+++|+  -+++|.+||+.|.....|+.
T Consensus       128 ~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~--l~c~C~iCGKaFSRPWLLQG  205 (279)
T KOG2462|consen  128 HPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT--LPCECGICGKAFSRPWLLQG  205 (279)
T ss_pred             CCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC--CCcccccccccccchHHhhc
Confidence            3459999999999999999999999864   467999999999999999999999997  57999999999999999999


Q ss_pred             HhhhhcCCCCeecCccccccCChHHHHHHHHHhCCCCCCCCcccccccccccccccccccchhhce
Q psy8050         218 HLRIHSGEKPYQCKLCLLRFSQSGNLNRHMRVHGSQYNMLPVIYMGVNQVCSNIVDQRRGIYVVEA  283 (311)
Q Consensus       218 H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~  283 (311)
                      |+++|+||+||.|+.|++.|..+++|+.||++|.+.+..       +|..|++.|.....|.+|..
T Consensus       206 HiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~-------qC~~C~KsFsl~SyLnKH~E  264 (279)
T KOG2462|consen  206 HIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKH-------QCPRCGKSFALKSYLNKHSE  264 (279)
T ss_pred             ccccccCCCCccCCcccchhcchHHHHHHHHhhcCCccc-------cCcchhhHHHHHHHHHHhhh
Confidence            999999999999999999999999999999999998853       89999999999999999964


No 3  
>KOG3608|consensus
Probab=99.91  E-value=1.7e-25  Score=189.96  Aligned_cols=265  Identities=22%  Similarity=0.348  Sum_probs=204.3

Q ss_pred             hccCCCCCCCCCCCCCCchhhHhhhccccccCC----------------CCCCCC--------CCCCCCCCCCCC--CCC
Q psy8050          33 IQDRSECPGSGMGKPPHDVADVLLSLKHAVVHP----------------GQMTPP--------PYGDPGAPYQVG--APQ   86 (311)
Q Consensus        33 ~~~~~~C~~c~~~~~~~~~~~~~~~~~H~~~h~----------------~~~~p~--------~~~~p~~~~~C~--~c~   86 (311)
                      -.|.++|.+=+|++...+  +...|.+|...|-                .+..+-        ..+.-|.-|.|+  .|+
T Consensus        66 h~de~qC~w~~C~f~~~~--~s~~l~RHvy~H~y~~~l~q~G~~al~~~~dig~c~~~f~~~~~ip~~g~~f~C~WedCe  143 (467)
T KOG3608|consen   66 HTDEHQCTWNSCDFRTEN--SSADLIRHVYFHCYHTKLKQQGKLALDLHPDIGACTAPFRLMEKIPALGQNFRCGWEDCE  143 (467)
T ss_pred             cccceeEEeccCCccccc--hHHHHHhhhhhhhhHHHHHHHHHHHHhcCCCcCcccCCcchhhccccchhhhccChhhcC
Confidence            445799998887765444  3344455655442                122111        113334456675  489


Q ss_pred             CCCCCCCCccccCCCCCCC-----CCCCCCCcceeeccC--CCCcCCChhHHHHHHhhhCCCCCeecCcCccccCChhHH
Q psy8050          87 QFPTASLSYTVHPHQSSRG-----RPLNSGIRLILIQFI--PGKTYARPSTLKTHLRTHSGEKPYRCEDCNKSFSQAANL  159 (311)
Q Consensus        87 ~~~~~~~~~~~h~~~~~~~-----~~~~~~~~~~~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L  159 (311)
                      ..|.+...+..|+..|.-.     ...+.+.++.+.|.+  |-+.+.++..|+.|++.|++++...|+.||..|.+...|
T Consensus       144 ~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~Cg~~F~~~tkl  223 (467)
T KOG3608|consen  144 REFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPHCGELFRTKTKL  223 (467)
T ss_pred             CcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeEEecchHHHHhccccHH
Confidence            9999999999887666443     222334445566654  999999999999999999999999999999999999999


Q ss_pred             HHHHHHhc--CCCceecCCCCccccCcchhhhhhhhccccccccccccccccCChHHHHHHhh-hhcCCCCeecCccccc
Q psy8050         160 TAHVRTHS--GEKPFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAFSDSSTLTKHLR-IHSGEKPYQCKLCLLR  236 (311)
Q Consensus       160 ~~H~~~h~--~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~l~~H~~-~h~~~~~~~C~~C~~~  236 (311)
                      -.|++..+  ...+|.|..|.+.|.+...|..|+..|.  .-|+|+.|+......++|..|++ .|...+||+|+.|+..
T Consensus       224 ~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHv--n~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~  301 (467)
T KOG3608|consen  224 FDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHV--NCYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTR  301 (467)
T ss_pred             HHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhh--hcccccccccCCCChHHHHHHHHhhhccCCCccccchhhh
Confidence            99986543  4568999999999999999999999885  45899999999999999999988 4667899999999999


Q ss_pred             cCChHHHHHHHHHhCCCCCCCCccccccccc--ccccccccccchhhceeee-ecccccCCCCCCCCccccccCCCC
Q psy8050         237 FSQSGNLNRHMRVHGSQYNMLPVIYMGVNQV--CSNIVDQRRGIYVVEAVWT-VQSTWIDGRSTMCGHSFIVKLPLS  310 (311)
Q Consensus       237 f~~~~~L~~H~~~H~~~~~~~~~~~~~~C~~--C~~~f~~~~~l~~H~~~h~-~~~~~~~~kC~~C~~~f~~~~~L~  310 (311)
                      |.+.++|.+|..+|....    +    .|..  |.+.|.+...|+.|++.++ +..+. .|+|-.|++-|++=.+|.
T Consensus       302 c~~esdL~kH~~~HS~~~----y----~C~h~~C~~s~r~~~q~~~H~~evhEg~np~-~Y~CH~Cdr~ft~G~~L~  369 (467)
T KOG3608|consen  302 CVRESDLAKHVQVHSKTV----Y----QCEHPDCHYSVRTYTQMRRHFLEVHEGNNPI-LYACHCCDRFFTSGKSLS  369 (467)
T ss_pred             hccHHHHHHHHHhccccc----e----ecCCCCCcHHHHHHHHHHHHHHHhccCCCCC-ceeeecchhhhccchhHH
Confidence            999999999999887322    1    6877  9999999999999987654 55543 357999999998876663


No 4  
>KOG3608|consensus
Probab=99.87  E-value=7.3e-23  Score=174.09  Aligned_cols=235  Identities=21%  Similarity=0.365  Sum_probs=189.9

Q ss_pred             cCCCCCCCCCCCCCCchhhHhhhccccccCCCCC---CCCCCCCCCCCCCCC--CCCCCCCCCCCccccCCCCCCCCCCC
Q psy8050          35 DRSECPGSGMGKPPHDVADVLLSLKHAVVHPGQM---TPPPYGDPGAPYQVG--APQQFPTASLSYTVHPHQSSRGRPLN  109 (311)
Q Consensus        35 ~~~~C~~c~~~~~~~~~~~~~~~~~H~~~h~~~~---~p~~~~~p~~~~~C~--~c~~~~~~~~~~~~h~~~~~~~~~~~  109 (311)
                      +-|-|.+-.|+.+|.+...+   ..|+..|..-.   ....+++-..-+.|.  .|-+.+.+...|..|.+.|++++.  
T Consensus       133 ~~f~C~WedCe~~F~s~~ef---~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKv--  207 (467)
T KOG3608|consen  133 QNFRCGWEDCEREFVSIVEF---QDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKV--  207 (467)
T ss_pred             hhhccChhhcCCcccCHHHH---HHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeE--
Confidence            55888887777776665544   45555443110   011223332345564  478888999999999999987764  


Q ss_pred             CCCcceeeccCCCCcCCChhHHHHHHhhhC--CCCCeecCcCccccCChhHHHHHHHHhcCCCceecCCCCccccCcchh
Q psy8050         110 SGIRLILIQFIPGKTYARPSTLKTHLRTHS--GEKPYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQSSSV  187 (311)
Q Consensus       110 ~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~--~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l  187 (311)
                            +.|+.||..|.+...|-.|++..+  ...+|.|..|.+.|.+...|..|+..|-  ..|+|+.|+.+....++|
T Consensus       208 ------vACp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHv--n~ykCplCdmtc~~~ssL  279 (467)
T KOG3608|consen  208 ------VACPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHV--NCYKCPLCDMTCSSASSL  279 (467)
T ss_pred             ------EecchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhh--hcccccccccCCCChHHH
Confidence                  568999999999999999986544  4568999999999999999999999885  459999999999999999


Q ss_pred             hhhhhh-ccccccccccccccccCChHHHHHHhhhhcCCCCeecCc--cccccCChHHHHHHHHHhCCCCCCCCcccccc
Q psy8050         188 TTHMRT-HSGERPYRCRLCKKAFSDSSTLTKHLRIHSGEKPYQCKL--CLLRFSQSGNLNRHMRVHGSQYNMLPVIYMGV  264 (311)
Q Consensus       188 ~~H~~~-h~~~k~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~--C~~~f~~~~~L~~H~~~H~~~~~~~~~~~~~~  264 (311)
                      ..|++. |...+||+|+.|++.|.+.+.|.+|...|. +..|.|+.  |..+|.+...|.+|++.++...+..++    .
T Consensus       280 ~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y----~  354 (467)
T KOG3608|consen  280 TTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQVHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILY----A  354 (467)
T ss_pred             HHHHHhhhccCCCccccchhhhhccHHHHHHHHHhcc-ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCce----e
Confidence            999985 777899999999999999999999999988 77799988  999999999999999877655554344    7


Q ss_pred             cccccccccccccchhhceeeee
Q psy8050         265 NQVCSNIVDQRRGIYVVEAVWTV  287 (311)
Q Consensus       265 C~~C~~~f~~~~~l~~H~~~h~~  287 (311)
                      |..|.+.|++-..|-.|++.-++
T Consensus       355 CH~Cdr~ft~G~~L~~HL~kkH~  377 (467)
T KOG3608|consen  355 CHCCDRFFTSGKSLSAHLMKKHG  377 (467)
T ss_pred             eecchhhhccchhHHHHHHHhhc
Confidence            99999999999999999855333


No 5  
>KOG1074|consensus
Probab=99.87  E-value=1.9e-23  Score=194.87  Aligned_cols=79  Identities=35%  Similarity=0.869  Sum_probs=60.8

Q ss_pred             ecCCCCccccCcchhhhhhhhccccccccccccccccCChHHHHHHhhhhcCCC----CeecC---ccccccCChHHHHH
Q psy8050         173 RCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAFSDSSTLTKHLRIHSGEK----PYQCK---LCLLRFSQSGNLNR  245 (311)
Q Consensus       173 ~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~l~~H~~~h~~~~----~~~C~---~C~~~f~~~~~L~~  245 (311)
                      +|.+|.++..-...|+.|.++|+||+||+|.+||++|+++.+|+.|+.+|...-    .+.|+   +|.+.|...-.|..
T Consensus       607 qCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~lpQ  686 (958)
T KOG1074|consen  607 QCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVTLPQ  686 (958)
T ss_pred             ceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhcccccccccccc
Confidence            677888888888888888888888888888888888888888888887775432    36677   77777777777777


Q ss_pred             HHHHhC
Q psy8050         246 HMRVHG  251 (311)
Q Consensus       246 H~~~H~  251 (311)
                      |+++|.
T Consensus       687 hIriH~  692 (958)
T KOG1074|consen  687 HIRIHL  692 (958)
T ss_pred             eEEeec
Confidence            777775


No 6  
>KOG1074|consensus
Probab=99.83  E-value=1.2e-21  Score=182.90  Aligned_cols=106  Identities=24%  Similarity=0.406  Sum_probs=89.5

Q ss_pred             ccccccccccCChHHHHHHhhhhcCCCCeecCccccccCChHHHHHHHHHhCCCCCCCCcccccccc---cccccccccc
Q psy8050         200 YRCRLCKKAFSDSSTLTKHLRIHSGEKPYQCKLCLLRFSQSGNLNRHMRVHGSQYNMLPVIYMGVNQ---VCSNIVDQRR  276 (311)
Q Consensus       200 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~~~~~~~~C~---~C~~~f~~~~  276 (311)
                      -.|.+|.++..-.+.|+.|+++|+||+||+|.+||+.|+++.+|..|+-+|...-.+.- .  +.|+   +|-+.|.+..
T Consensus       606 NqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~-q--~ScP~~~ic~~kftn~V  682 (958)
T KOG1074|consen  606 NQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARV-Q--FSCPSTFICQKKFTNAV  682 (958)
T ss_pred             cceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccc-c--ccCCchhhhcccccccc
Confidence            57999999999999999999999999999999999999999999999999976554431 1  2799   9999999999


Q ss_pred             cchhhceeeeeccc-cc---------CCCCCCCCccccccCC
Q psy8050         277 GIYVVEAVWTVQST-WI---------DGRSTMCGHSFIVKLP  308 (311)
Q Consensus       277 ~l~~H~~~h~~~~~-~~---------~~kC~~C~~~f~~~~~  308 (311)
                      .|.+|+++|.+... ..         ..+|..|.+.|..-.+
T Consensus       683 ~lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~  724 (958)
T KOG1074|consen  683 TLPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARS  724 (958)
T ss_pred             cccceEEeecCCCCCCCcccccccchhcccchhhhccccccc
Confidence            99999999984332 22         2389999988865443


No 7  
>KOG3623|consensus
Probab=99.81  E-value=3.2e-21  Score=177.35  Aligned_cols=117  Identities=21%  Similarity=0.297  Sum_probs=96.6

Q ss_pred             cCCCCCCCCCCCCCCchhhHhhhccccccCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccccCCCCCCCCCC-----C
Q psy8050          35 DRSECPGSGMGKPPHDVADVLLSLKHAVVHPGQMTPPPYGDPGAPYQVGAPQQFPTASLSYTVHPHQSSRGRPL-----N  109 (311)
Q Consensus        35 ~~~~C~~c~~~~~~~~~~~~~~~~~H~~~h~~~~~p~~~~~p~~~~~C~~c~~~~~~~~~~~~h~~~~~~~~~~-----~  109 (311)
                      ....|++|.     .....+-.+..|+..-+...+        .-|.|..|.+.|.....|..|+..|....+.     .
T Consensus       209 qlltcpycd-----rgykrltslkeHikyrhekne--------~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltq  275 (1007)
T KOG3623|consen  209 QLLTCPYCD-----RGYKRLTSLKEHIKYRHEKNE--------PNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQ  275 (1007)
T ss_pred             hhhcchhHH-----HHHHHHHHHHHHHHHHHhhCC--------CCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccc
Confidence            347899999     666777777888875443332        3567999999999999999999887655432     3


Q ss_pred             CCCcceeeccCCCCcCCChhHHHHHHhhhCCCCCeecCcCccccCChhHHHHHHH
Q psy8050         110 SGIRLILIQFIPGKTYARPSTLKTHLRTHSGEKPYRCEDCNKSFSQAANLTAHVR  164 (311)
Q Consensus       110 ~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~  164 (311)
                      .+....|.|.+||++|..+-.|+.|+++|.|++||.|+.|++.|.....+..||.
T Consensus       276 sa~lRKFKCtECgKAFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmS  330 (1007)
T KOG3623|consen  276 SALLRKFKCTECGKAFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMS  330 (1007)
T ss_pred             hhhhccccccccchhhhhHHHHHhhheeecCCCCcCCcccccccccCCccccccc
Confidence            4556679999999999999999999999999999999999999999999999974


No 8  
>KOG3623|consensus
Probab=99.74  E-value=2.4e-19  Score=165.17  Aligned_cols=108  Identities=29%  Similarity=0.498  Sum_probs=88.6

Q ss_pred             CCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCcceeeccCCCCcCCChhHHHHHHhhhCC-------------CCCeec
Q psy8050          80 YQVGAPQQFPTASLSYTVHPHQSSRGRPLNSGIRLILIQFIPGKTYARPSTLKTHLRTHSG-------------EKPYRC  146 (311)
Q Consensus        80 ~~C~~c~~~~~~~~~~~~h~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~-------------~~~~~C  146 (311)
                      ..|+.|++......+|..|+..-..      .....|-|.+|..+|..+..|.+||.+|..             .+-|+|
T Consensus       211 ltcpycdrgykrltslkeHikyrhe------kne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKC  284 (1007)
T KOG3623|consen  211 LTCPYCDRGYKRLTSLKEHIKYRHE------KNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKC  284 (1007)
T ss_pred             hcchhHHHHHHHHHHHHHHHHHHHh------hCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhccccc
Confidence            3478887777777777777542111      122346699999999999999999998853             244999


Q ss_pred             CcCccccCChhHHHHHHHHhcCCCceecCCCCccccCcchhhhhhhh
Q psy8050         147 EDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQSSSVTTHMRT  193 (311)
Q Consensus       147 ~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~  193 (311)
                      ..||++|..+..|..|+++|.|++||.|+.|++.|....++..||..
T Consensus       285 tECgKAFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmSS  331 (1007)
T KOG3623|consen  285 TECGKAFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMSS  331 (1007)
T ss_pred             cccchhhhhHHHHHhhheeecCCCCcCCcccccccccCCcccccccc
Confidence            99999999999999999999999999999999999999999999854


No 9  
>KOG3576|consensus
Probab=99.67  E-value=2.3e-17  Score=130.61  Aligned_cols=114  Identities=32%  Similarity=0.675  Sum_probs=91.8

Q ss_pred             CCCeecCcCccccCChhHHHHHHHHhcCCCceecCCCCccccCcchhhhhhhhccccccccccccccccCChHHHHHHhh
Q psy8050         141 EKPYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAFSDSSTLTKHLR  220 (311)
Q Consensus       141 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~l~~H~~  220 (311)
                      ...|.|.+|++.|.....|.+|+.-|...+.+.|..||+.|...-+|.+|+++|+|.+||+|..|+++|++.-.|..|.+
T Consensus       115 ~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl~  194 (267)
T KOG3576|consen  115 QDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHLK  194 (267)
T ss_pred             CCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHHH
Confidence            34578888888888888888888888777778888888888888888888888888888888888888888888888876


Q ss_pred             hhcC-----------CCCeecCccccccCChHHHHHHHHHhCCCC
Q psy8050         221 IHSG-----------EKPYQCKLCLLRFSQSGNLNRHMRVHGSQY  254 (311)
Q Consensus       221 ~h~~-----------~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~  254 (311)
                      .-+|           .+.|.|+.||.+-.....+..|+..|+...
T Consensus       195 kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~S  239 (267)
T KOG3576|consen  195 KVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFS  239 (267)
T ss_pred             HHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCCC
Confidence            5444           356899999998888888889988887655


No 10 
>KOG3576|consensus
Probab=99.63  E-value=2.6e-16  Score=124.72  Aligned_cols=118  Identities=29%  Similarity=0.583  Sum_probs=108.0

Q ss_pred             CCCcceeeccCCCCcCCChhHHHHHHhhhCCCCCeecCcCccccCChhHHHHHHHHhcCCCceecCCCCccccCcchhhh
Q psy8050         110 SGIRLILIQFIPGKTYARPSTLKTHLRTHSGEKPYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQSSSVTT  189 (311)
Q Consensus       110 ~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~  189 (311)
                      +.....+.|.+|++.|.-...|.+|++.|..-+.+.|..||+.|....+|.+|+++|+|.+||+|..|++.|...-+|..
T Consensus       112 ssd~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsles  191 (267)
T KOG3576|consen  112 SSDQDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLES  191 (267)
T ss_pred             CCCCCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHH
Confidence            44466899999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             hhhhccc-----------cccccccccccccCChHHHHHHhhhhcCCCC
Q psy8050         190 HMRTHSG-----------ERPYRCRLCKKAFSDSSTLTKHLRIHSGEKP  227 (311)
Q Consensus       190 H~~~h~~-----------~k~~~C~~C~~~f~~~~~l~~H~~~h~~~~~  227 (311)
                      |++.-+|           .+.|.|..||.+-.....+..|+..|+...|
T Consensus       192 hl~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~Sp  240 (267)
T KOG3576|consen  192 HLKKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFSP  240 (267)
T ss_pred             HHHHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCCCH
Confidence            9875444           4679999999999999999999999886543


No 11 
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.15  E-value=8.4e-11  Score=109.74  Aligned_cols=143  Identities=13%  Similarity=0.283  Sum_probs=94.8

Q ss_pred             ceeeccCCCCcCCChhHHHHHHhhhCCCCCeecCc--CccccCChhHHHHHHHHhcCCCceecCCCCccccCcchhhhhh
Q psy8050         114 LILIQFIPGKTYARPSTLKTHLRTHSGEKPYRCED--CNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQSSSVTTHM  191 (311)
Q Consensus       114 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~  191 (311)
                      ....|.-|..... ...|..|...- ....-.|+.  |+..|. ...+..|         +.|+.|+..|. ...|..|+
T Consensus       406 ~~V~C~NC~~~i~-l~~l~lHe~~C-~r~~V~Cp~~~Cg~v~~-r~el~~H---------~~C~~Cgk~f~-~s~LekH~  472 (567)
T PLN03086        406 DTVECRNCKHYIP-SRSIALHEAYC-SRHNVVCPHDGCGIVLR-VEEAKNH---------VHCEKCGQAFQ-QGEMEKHM  472 (567)
T ss_pred             CeEECCCCCCccc-hhHHHHHHhhC-CCcceeCCcccccceee-ccccccC---------ccCCCCCCccc-hHHHHHHH
Confidence            3456777776544 34455666332 223456764  888773 3334444         57888888875 56788888


Q ss_pred             hhccccccccccccccccCChHHHHHHhhhhcCCCCeecCccccccCC----------hHHHHHHHHHhCCCCCCCCccc
Q psy8050         192 RTHSGERPYRCRLCKKAFSDSSTLTKHLRIHSGEKPYQCKLCLLRFSQ----------SGNLNRHMRVHGSQYNMLPVIY  261 (311)
Q Consensus       192 ~~h~~~k~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~----------~~~L~~H~~~H~~~~~~~~~~~  261 (311)
                      .+++  .++.|+ |+..+ ....|..|+.+|.+++++.|..|+..|..          ...|..|..+. +..   ++  
T Consensus       473 ~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~r---t~--  542 (567)
T PLN03086        473 KVFH--EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC-GSR---TA--  542 (567)
T ss_pred             HhcC--CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhc-CCc---ce--
Confidence            8875  678888 88654 56788888888888888888888888742          34677777764 333   22  


Q ss_pred             ccccccccccccccccchhhc
Q psy8050         262 MGVNQVCSNIVDQRRGIYVVE  282 (311)
Q Consensus       262 ~~~C~~C~~~f~~~~~l~~H~  282 (311)
                        .|..||+.+..+ .+..|+
T Consensus       543 --~C~~Cgk~Vrlr-dm~~H~  560 (567)
T PLN03086        543 --PCDSCGRSVMLK-EMDIHQ  560 (567)
T ss_pred             --EccccCCeeeeh-hHHHHH
Confidence              688888888776 556664


No 12 
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.12  E-value=9.9e-11  Score=109.27  Aligned_cols=118  Identities=19%  Similarity=0.435  Sum_probs=95.9

Q ss_pred             eeeccC--CCCcCCChhHHHHHHhhhCCCCCeecCcCccccCChhHHHHHHHHhcCCCceecCCCCccccCcchhhhhhh
Q psy8050         115 ILIQFI--PGKTYARPSTLKTHLRTHSGEKPYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQSSSVTTHMR  192 (311)
Q Consensus       115 ~~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~  192 (311)
                      ...|+.  ||..|. +..+.         ..+.|+.|++.|. ...|..|+.+++  .++.|+ |+..+ ....|..|+.
T Consensus       433 ~V~Cp~~~Cg~v~~-r~el~---------~H~~C~~Cgk~f~-~s~LekH~~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~  497 (567)
T PLN03086        433 NVVCPHDGCGIVLR-VEEAK---------NHVHCEKCGQAFQ-QGEMEKHMKVFH--EPLQCP-CGVVL-EKEQMVQHQA  497 (567)
T ss_pred             ceeCCcccccceee-ccccc---------cCccCCCCCCccc-hHHHHHHHHhcC--CCccCC-CCCCc-chhHHHhhhh
Confidence            456774  999883 33333         3468999999996 688999999985  789999 99765 6689999999


Q ss_pred             hccccccccccccccccC----------ChHHHHHHhhhhcCCCCeecCccccccCChHHHHHHHHH
Q psy8050         193 THSGERPYRCRLCKKAFS----------DSSTLTKHLRIHSGEKPYQCKLCLLRFSQSGNLNRHMRV  249 (311)
Q Consensus       193 ~h~~~k~~~C~~C~~~f~----------~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~  249 (311)
                      .|.+.+++.|..|++.|.          ....|..|..+. |.+++.|..||+.+..+ +|..|+..
T Consensus       498 thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk~Vrlr-dm~~H~~~  562 (567)
T PLN03086        498 STCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDSCGRSVMLK-EMDIHQIA  562 (567)
T ss_pred             ccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhc-CCcceEccccCCeeeeh-hHHHHHHH
Confidence            999999999999999995          245899999885 89999999999998855 57788643


No 13 
>PHA00733 hypothetical protein
Probab=99.02  E-value=2.4e-10  Score=87.49  Aligned_cols=55  Identities=29%  Similarity=0.567  Sum_probs=39.4

Q ss_pred             ccccccccccccccCChHHHHHHhhhhcCCCCeecCccccccCChHHHHHHHHHhCC
Q psy8050         196 GERPYRCRLCKKAFSDSSTLTKHLRIHSGEKPYQCKLCLLRFSQSGNLNRHMRVHGS  252 (311)
Q Consensus       196 ~~k~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~  252 (311)
                      +.++|.|+.|++.|.+...|..|++.+  +.+|.|++|++.|.....|..|+...++
T Consensus        70 ~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL~~H~~~~h~  124 (128)
T PHA00733         70 AVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDSTLDHVCKKHN  124 (128)
T ss_pred             CCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHHHHHHHHhcC
Confidence            466777777777777777777777765  3457777777777777777777766544


No 14 
>PHA00733 hypothetical protein
Probab=98.93  E-value=7.5e-10  Score=84.78  Aligned_cols=83  Identities=23%  Similarity=0.404  Sum_probs=63.5

Q ss_pred             CCCCeecCcCccccCChhHHHHH--H---HHhcCCCceecCCCCccccCcchhhhhhhhccccccccccccccccCChHH
Q psy8050         140 GEKPYRCEDCNKSFSQAANLTAH--V---RTHSGEKPFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAFSDSST  214 (311)
Q Consensus       140 ~~~~~~C~~C~~~f~~~~~L~~H--~---~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~  214 (311)
                      ..+++.|.+|...|.....|..+  +   ..++++.+|.|+.|++.|.....|..|+..+  +.+|.|..|++.|.....
T Consensus        37 ~~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~s  114 (128)
T PHA00733         37 EQKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDS  114 (128)
T ss_pred             hhhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHH
Confidence            34668888888887776666555  1   1234577888999999999888888888876  356899999999999999


Q ss_pred             HHHHhhhhcC
Q psy8050         215 LTKHLRIHSG  224 (311)
Q Consensus       215 l~~H~~~h~~  224 (311)
                      |..|+...++
T Consensus       115 L~~H~~~~h~  124 (128)
T PHA00733        115 TLDHVCKKHN  124 (128)
T ss_pred             HHHHHHHhcC
Confidence            9999887664


No 15 
>PHA02768 hypothetical protein; Provisional
Probab=98.75  E-value=4e-09  Score=66.78  Aligned_cols=42  Identities=21%  Similarity=0.525  Sum_probs=24.2

Q ss_pred             ccccccccccCChHHHHHHhhhhcCCCCeecCccccccCChHHH
Q psy8050         200 YRCRLCKKAFSDSSTLTKHLRIHSGEKPYQCKLCLLRFSQSGNL  243 (311)
Q Consensus       200 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L  243 (311)
                      |.|+.||+.|...++|..|+++|+  ++|+|..|++.|...+.|
T Consensus         6 y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l   47 (55)
T PHA02768          6 YECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEY   47 (55)
T ss_pred             cCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceeccccee
Confidence            556666666666666666666655  455566666655554444


No 16 
>KOG3993|consensus
Probab=98.74  E-value=9.5e-10  Score=96.76  Aligned_cols=80  Identities=30%  Similarity=0.611  Sum_probs=49.9

Q ss_pred             eecCcCccccCChhHHHHHHHHhcCCCceecCCCCccccCcchhhhhhhhccc---------------------------
Q psy8050         144 YRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQSSSVTTHMRTHSG---------------------------  196 (311)
Q Consensus       144 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~---------------------------  196 (311)
                      |.|..|...|.....|.+|.-...-...|+|++|++.|.-..+|..|.+.|-.                           
T Consensus       268 yiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea~r  347 (500)
T KOG3993|consen  268 YICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEAER  347 (500)
T ss_pred             HHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhccc
Confidence            67777777777777777775322222347777777777777777777776631                           


Q ss_pred             ------cccccccccccccCChHHHHHHhhhhc
Q psy8050         197 ------ERPYRCRLCKKAFSDSSTLTKHLRIHS  223 (311)
Q Consensus       197 ------~k~~~C~~C~~~f~~~~~l~~H~~~h~  223 (311)
                            +..|.|..|++.|.+...|+.|+.+|+
T Consensus       348 sg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq  380 (500)
T KOG3993|consen  348 SGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQ  380 (500)
T ss_pred             cCCcccCceeecHHhhhhhHHHHHHHHhHHhhh
Confidence                  123666666666666666666655554


No 17 
>KOG3993|consensus
Probab=98.72  E-value=1.4e-09  Score=95.74  Aligned_cols=164  Identities=20%  Similarity=0.190  Sum_probs=108.9

Q ss_pred             ceeeccCCCCcCCChhHHHHHHhhhCCCCCeecCcCccccCChhHHHHHHHHhcC-------------------------
Q psy8050         114 LILIQFIPGKTYARPSTLKTHLRTHSGEKPYRCEDCNKSFSQAANLTAHVRTHSG-------------------------  168 (311)
Q Consensus       114 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~-------------------------  168 (311)
                      ..++|.+|...|.+.-.|.+|...-.....|.|+.|+++|....+|..|.+.|-.                         
T Consensus       266 GdyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea  345 (500)
T KOG3993|consen  266 GDYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEA  345 (500)
T ss_pred             HHHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhc
Confidence            3489999999999999999998654444569999999999999999999988721                         


Q ss_pred             --------CCceecCCCCccccCcchhhhhhhhccccc-----------------cccccccccccCChHHHHHHhhhhc
Q psy8050         169 --------EKPFRCPICDRRFSQSSSVTTHMRTHSGER-----------------PYRCRLCKKAFSDSSTLTKHLRIHS  223 (311)
Q Consensus       169 --------~~~~~C~~C~~~f~~~~~l~~H~~~h~~~k-----------------~~~C~~C~~~f~~~~~l~~H~~~h~  223 (311)
                              +..|.|.+|++.|.....|+.|+.+|+...                 .+-|..|+-.+...+.--.+...+.
T Consensus       346 ~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~vl~~a  425 (500)
T KOG3993|consen  346 ERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEVLYVA  425 (500)
T ss_pred             cccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccceeeee
Confidence                    123899999999999999999988875321                 0234444444443332222222221


Q ss_pred             C-CCCeecCccccccCChHHHHHHHHHhCCCCCCCCcccccccccccccccccccchhhcee
Q psy8050         224 G-EKPYQCKLCLLRFSQSGNLNRHMRVHGSQYNMLPVIYMGVNQVCSNIVDQRRGIYVVEAV  284 (311)
Q Consensus       224 ~-~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~  284 (311)
                      + .....|+.|+..+.++..--.+.+.-..++.       +.|.+|.-.|.+...|..|++.
T Consensus       426 ~sael~~pp~~~~ppsss~~sgg~~rlg~~~q~-------f~~ky~~atfyss~~ltrhin~  480 (500)
T KOG3993|consen  426 GSAELELPPYDGSPPSSSGSSGGYGRLGIAEQG-------FTCKYCPATFYSSPGLTRHINK  480 (500)
T ss_pred             ccccccCCCCCCCCcccCCCCCccccccchhhc-------cccccchHhhhcCcchHhHhhh
Confidence            1 1123467777666665443333332222211       1588888888888888888765


No 18 
>PHA02768 hypothetical protein; Provisional
Probab=98.62  E-value=2.4e-08  Score=63.24  Aligned_cols=42  Identities=19%  Similarity=0.521  Sum_probs=22.8

Q ss_pred             eecCcCccccCChhHHHHHHHHhcCCCceecCCCCccccCcchh
Q psy8050         144 YRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQSSSV  187 (311)
Q Consensus       144 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l  187 (311)
                      |.|+.||+.|...++|..|+++|+  ++++|..|++.|...+.|
T Consensus         6 y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l   47 (55)
T PHA02768          6 YECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEY   47 (55)
T ss_pred             cCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceeccccee
Confidence            455555555555555555555554  345555555555544443


No 19 
>PF13465 zf-H2C2_2:  Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.51  E-value=8.5e-08  Score=51.81  Aligned_cols=26  Identities=50%  Similarity=1.067  Sum_probs=19.9

Q ss_pred             HHHHHhhhhcCCCCeecCccccccCC
Q psy8050         214 TLTKHLRIHSGEKPYQCKLCLLRFSQ  239 (311)
Q Consensus       214 ~l~~H~~~h~~~~~~~C~~C~~~f~~  239 (311)
                      +|.+|+++|++++||.|+.|++.|.+
T Consensus         1 ~l~~H~~~H~~~k~~~C~~C~k~F~~   26 (26)
T PF13465_consen    1 NLRRHMRTHTGEKPYKCPYCGKSFSN   26 (26)
T ss_dssp             HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred             CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence            46778888888888888888887753


No 20 
>PF13465 zf-H2C2_2:  Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.34  E-value=4.2e-07  Score=49.06  Aligned_cols=24  Identities=67%  Similarity=1.334  Sum_probs=13.2

Q ss_pred             HHHHHhhhCCCCCeecCcCccccC
Q psy8050         131 LKTHLRTHSGEKPYRCEDCNKSFS  154 (311)
Q Consensus       131 L~~H~~~h~~~~~~~C~~C~~~f~  154 (311)
                      |..|++.|+++++|.|+.|++.|.
T Consensus         2 l~~H~~~H~~~k~~~C~~C~k~F~   25 (26)
T PF13465_consen    2 LRRHMRTHTGEKPYKCPYCGKSFS   25 (26)
T ss_dssp             HHHHHHHHSSSSSEEESSSSEEES
T ss_pred             HHHHhhhcCCCCCCCCCCCcCeeC
Confidence            455555555555555555555553


No 21 
>PHA00616 hypothetical protein
Probab=98.14  E-value=1e-06  Score=53.07  Aligned_cols=32  Identities=19%  Similarity=0.407  Sum_probs=16.3

Q ss_pred             ccccccccccCChHHHHHHhhhhcCCCCeecC
Q psy8050         200 YRCRLCKKAFSDSSTLTKHLRIHSGEKPYQCK  231 (311)
Q Consensus       200 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~  231 (311)
                      |.|..||+.|...+.|..|++.|++++++.|+
T Consensus         2 YqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~   33 (44)
T PHA00616          2 YQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLE   33 (44)
T ss_pred             CccchhhHHHhhHHHHHHHHHHhcCCCcccee
Confidence            44555555555555555555555555554444


No 22 
>PHA00732 hypothetical protein
Probab=98.06  E-value=2.9e-06  Score=58.95  Aligned_cols=46  Identities=24%  Similarity=0.551  Sum_probs=31.0

Q ss_pred             cccccccccccCChHHHHHHhhh-hcCCCCeecCccccccCChHHHHHHHHHh
Q psy8050         199 PYRCRLCKKAFSDSSTLTKHLRI-HSGEKPYQCKLCLLRFSQSGNLNRHMRVH  250 (311)
Q Consensus       199 ~~~C~~C~~~f~~~~~l~~H~~~-h~~~~~~~C~~C~~~f~~~~~L~~H~~~H  250 (311)
                      ||.|..|++.|.+...|..|++. |.   ++.|+.|++.|.   .|..|.+++
T Consensus         1 py~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~~~   47 (79)
T PHA00732          1 MFKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFYSQ   47 (79)
T ss_pred             CccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---Chhhhhccc
Confidence            46777777777777777777764 43   256777777776   366666444


No 23 
>PHA00616 hypothetical protein
Probab=98.05  E-value=2.1e-06  Score=51.76  Aligned_cols=33  Identities=15%  Similarity=0.399  Sum_probs=19.9

Q ss_pred             CeecCcCccccCChhHHHHHHHHhcCCCceecC
Q psy8050         143 PYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCP  175 (311)
Q Consensus       143 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~  175 (311)
                      ||+|+.||+.|...++|..|++.|+++.++.|+
T Consensus         1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~   33 (44)
T PHA00616          1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLE   33 (44)
T ss_pred             CCccchhhHHHhhHHHHHHHHHHhcCCCcccee
Confidence            355666666666666666666666666655554


No 24 
>PHA00732 hypothetical protein
Probab=97.97  E-value=5.5e-06  Score=57.61  Aligned_cols=48  Identities=17%  Similarity=0.163  Sum_probs=39.5

Q ss_pred             CeecCccccccCChHHHHHHHHH-hCCCCCCCCcccccccccccccccccccchhhceeeee
Q psy8050         227 PYQCKLCLLRFSQSGNLNRHMRV-HGSQYNMLPVIYMGVNQVCSNIVDQRRGIYVVEAVWTV  287 (311)
Q Consensus       227 ~~~C~~C~~~f~~~~~L~~H~~~-H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~  287 (311)
                      ||.|+.|++.|.+...|..|++. |.+ .         .|+.|++.|.   .|..|.....+
T Consensus         1 py~C~~Cgk~F~s~s~Lk~H~r~~H~~-~---------~C~~CgKsF~---~l~~H~~~~~~   49 (79)
T PHA00732          1 MFKCPICGFTTVTLFALKQHARRNHTL-T---------KCPVCNKSYR---RLNQHFYSQYD   49 (79)
T ss_pred             CccCCCCCCccCCHHHHHHHhhcccCC-C---------ccCCCCCEeC---ChhhhhcccCC
Confidence            58999999999999999999985 543 2         5999999998   48888855443


No 25 
>PF05605 zf-Di19:  Drought induced 19 protein (Di19), zinc-binding;  InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.78  E-value=4.3e-05  Score=49.23  Aligned_cols=47  Identities=32%  Similarity=0.564  Sum_probs=19.9

Q ss_pred             ccccccccccCChHHHHHHhhhhcC-C-CCeecCccccccCChHHHHHHHHH
Q psy8050         200 YRCRLCKKAFSDSSTLTKHLRIHSG-E-KPYQCKLCLLRFSQSGNLNRHMRV  249 (311)
Q Consensus       200 ~~C~~C~~~f~~~~~l~~H~~~h~~-~-~~~~C~~C~~~f~~~~~L~~H~~~  249 (311)
                      |.|++|++ ..+...|..|....+. + +.+.|++|...+.  .+|..|+..
T Consensus         3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~   51 (54)
T PF05605_consen    3 FTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNS   51 (54)
T ss_pred             cCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHH
Confidence            44555555 3334444455433222 1 2344555544332  245555443


No 26 
>PF00096 zf-C2H2:  Zinc finger, C2H2 type;  InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain.  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.74  E-value=2.7e-05  Score=40.58  Aligned_cols=23  Identities=52%  Similarity=1.013  Sum_probs=15.2

Q ss_pred             eecCccccccCChHHHHHHHHHh
Q psy8050         228 YQCKLCLLRFSQSGNLNRHMRVH  250 (311)
Q Consensus       228 ~~C~~C~~~f~~~~~L~~H~~~H  250 (311)
                      |.|+.|++.|.+...|..|++.|
T Consensus         1 y~C~~C~~~f~~~~~l~~H~~~H   23 (23)
T PF00096_consen    1 YKCPICGKSFSSKSNLKRHMRRH   23 (23)
T ss_dssp             EEETTTTEEESSHHHHHHHHHHH
T ss_pred             CCCCCCCCccCCHHHHHHHHhHC
Confidence            56666777777666666666654


No 27 
>PF00096 zf-C2H2:  Zinc finger, C2H2 type;  InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain.  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.53  E-value=7.4e-05  Score=38.89  Aligned_cols=22  Identities=50%  Similarity=0.976  Sum_probs=12.5

Q ss_pred             eecCcCccccCChhHHHHHHHH
Q psy8050         144 YRCEDCNKSFSQAANLTAHVRT  165 (311)
Q Consensus       144 ~~C~~C~~~f~~~~~L~~H~~~  165 (311)
                      |.|+.|++.|.+...|..|++.
T Consensus         1 y~C~~C~~~f~~~~~l~~H~~~   22 (23)
T PF00096_consen    1 YKCPICGKSFSSKSNLKRHMRR   22 (23)
T ss_dssp             EEETTTTEEESSHHHHHHHHHH
T ss_pred             CCCCCCCCccCCHHHHHHHHhH
Confidence            4555555555555555555554


No 28 
>PF05605 zf-Di19:  Drought induced 19 protein (Di19), zinc-binding;  InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.52  E-value=0.00019  Score=46.17  Aligned_cols=49  Identities=27%  Similarity=0.355  Sum_probs=25.0

Q ss_pred             eeccCCCCcCCChhHHHHHHhh-hCCC-CCeecCcCccccCChhHHHHHHHHhc
Q psy8050         116 LIQFIPGKTYARPSTLKTHLRT-HSGE-KPYRCEDCNKSFSQAANLTAHVRTHS  167 (311)
Q Consensus       116 ~~C~~C~~~f~~~~~L~~H~~~-h~~~-~~~~C~~C~~~f~~~~~L~~H~~~h~  167 (311)
                      |.|+.|++ ..+...|..|... |..+ +.+.|++|...+.  .+|..|+..++
T Consensus         3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H   53 (54)
T PF05605_consen    3 FTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH   53 (54)
T ss_pred             cCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence            45566666 3444556666533 3222 3466666665433  25566655443


No 29 
>PF13894 zf-C2H2_4:  C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.46  E-value=0.00013  Score=38.21  Aligned_cols=23  Identities=39%  Similarity=0.891  Sum_probs=14.0

Q ss_pred             eecCccccccCChHHHHHHHHHh
Q psy8050         228 YQCKLCLLRFSQSGNLNRHMRVH  250 (311)
Q Consensus       228 ~~C~~C~~~f~~~~~L~~H~~~H  250 (311)
                      |.|++|+..|.+...|..|+.+|
T Consensus         1 ~~C~~C~~~~~~~~~l~~H~~~~   23 (24)
T PF13894_consen    1 FQCPICGKSFRSKSELRQHMRTH   23 (24)
T ss_dssp             EE-SSTS-EESSHHHHHHHHHHH
T ss_pred             CCCcCCCCcCCcHHHHHHHHHhh
Confidence            45666777777666676666655


No 30 
>PF13912 zf-C2H2_6:  C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.37  E-value=9.7e-05  Score=40.11  Aligned_cols=25  Identities=36%  Similarity=0.704  Sum_probs=17.0

Q ss_pred             CeecCccccccCChHHHHHHHHHhC
Q psy8050         227 PYQCKLCLLRFSQSGNLNRHMRVHG  251 (311)
Q Consensus       227 ~~~C~~C~~~f~~~~~L~~H~~~H~  251 (311)
                      +|.|..|++.|.+...|..|++.|.
T Consensus         1 ~~~C~~C~~~F~~~~~l~~H~~~h~   25 (27)
T PF13912_consen    1 PFECDECGKTFSSLSALREHKRSHC   25 (27)
T ss_dssp             SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred             CCCCCccCCccCChhHHHHHhHHhc
Confidence            4667777777777777777766654


No 31 
>PF12756 zf-C2H2_2:  C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.36  E-value=0.00015  Score=52.92  Aligned_cols=24  Identities=29%  Similarity=0.688  Sum_probs=16.4

Q ss_pred             CeecCccccccCChHHHHHHHHHh
Q psy8050         227 PYQCKLCLLRFSQSGNLNRHMRVH  250 (311)
Q Consensus       227 ~~~C~~C~~~f~~~~~L~~H~~~H  250 (311)
                      .+.|..|+..|.+...|..|++.+
T Consensus        50 ~~~C~~C~~~f~s~~~l~~Hm~~~   73 (100)
T PF12756_consen   50 SFRCPYCNKTFRSREALQEHMRSK   73 (100)
T ss_dssp             SEEBSSSS-EESSHHHHHHHHHHT
T ss_pred             CCCCCccCCCCcCHHHHHHHHcCc
Confidence            477777777777777777777653


No 32 
>PF13894 zf-C2H2_4:  C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.22  E-value=0.00031  Score=36.63  Aligned_cols=23  Identities=43%  Similarity=0.928  Sum_probs=11.5

Q ss_pred             eecCcCccccCChhHHHHHHHHh
Q psy8050         144 YRCEDCNKSFSQAANLTAHVRTH  166 (311)
Q Consensus       144 ~~C~~C~~~f~~~~~L~~H~~~h  166 (311)
                      |.|++|+..|.+...|..|+..+
T Consensus         1 ~~C~~C~~~~~~~~~l~~H~~~~   23 (24)
T PF13894_consen    1 FQCPICGKSFRSKSELRQHMRTH   23 (24)
T ss_dssp             EE-SSTS-EESSHHHHHHHHHHH
T ss_pred             CCCcCCCCcCCcHHHHHHHHHhh
Confidence            34555555555555555555544


No 33 
>PF12756 zf-C2H2_2:  C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.22  E-value=0.00018  Score=52.47  Aligned_cols=72  Identities=25%  Similarity=0.582  Sum_probs=15.8

Q ss_pred             cCcCccccCChhHHHHHHHHhcCCCceecCCCCccccCcchhhhhhhhccccccccccccccccCChHHHHHHhhhh
Q psy8050         146 CEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAFSDSSTLTKHLRIH  222 (311)
Q Consensus       146 C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~l~~H~~~h  222 (311)
                      |..|+..|.+...|..|+...++-..   . ....+.....+..+..... ...+.|..|++.|.+...|..|++.+
T Consensus         2 C~~C~~~f~~~~~l~~H~~~~H~~~~---~-~~~~l~~~~~~~~~~~~~~-~~~~~C~~C~~~f~s~~~l~~Hm~~~   73 (100)
T PF12756_consen    2 CLFCDESFSSVDDLLQHMKKKHGFDI---P-DQKYLVDPNRLLNYLRKKV-KESFRCPYCNKTFRSREALQEHMRSK   73 (100)
T ss_dssp             ----------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred             cccccccccccccccccccccccccc---c-ccccccccccccccccccc-CCCCCCCccCCCCcCHHHHHHHHcCc
Confidence            55666666666666666644333211   0 0011112222222222111 12466777777777777777777654


No 34 
>PF13912 zf-C2H2_6:  C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.01  E-value=0.0004  Score=37.64  Aligned_cols=23  Identities=39%  Similarity=0.989  Sum_probs=13.2

Q ss_pred             eecCcCccccCChhHHHHHHHHh
Q psy8050         144 YRCEDCNKSFSQAANLTAHVRTH  166 (311)
Q Consensus       144 ~~C~~C~~~f~~~~~L~~H~~~h  166 (311)
                      |.|..|++.|.+...|..|++.|
T Consensus         2 ~~C~~C~~~F~~~~~l~~H~~~h   24 (27)
T PF13912_consen    2 FECDECGKTFSSLSALREHKRSH   24 (27)
T ss_dssp             EEETTTTEEESSHHHHHHHHCTT
T ss_pred             CCCCccCCccCChhHHHHHhHHh
Confidence            55555555555555555555444


No 35 
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=96.81  E-value=0.00037  Score=59.89  Aligned_cols=20  Identities=15%  Similarity=0.237  Sum_probs=16.1

Q ss_pred             ccccccccccccccchhhce
Q psy8050         264 VNQVCSNIVDQRRGIYVVEA  283 (311)
Q Consensus       264 ~C~~C~~~f~~~~~l~~H~~  283 (311)
                      .|++|++.|.+..+|..|..
T Consensus       400 rCevC~KRYKNlNGLKYHr~  419 (423)
T COG5189         400 RCEVCDKRYKNLNGLKYHRK  419 (423)
T ss_pred             eccccchhhccCccceeccc
Confidence            68888888888888888864


No 36 
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.60  E-value=0.0021  Score=33.93  Aligned_cols=23  Identities=39%  Similarity=0.784  Sum_probs=13.4

Q ss_pred             eecCccccccCChHHHHHHHHHh
Q psy8050         228 YQCKLCLLRFSQSGNLNRHMRVH  250 (311)
Q Consensus       228 ~~C~~C~~~f~~~~~L~~H~~~H  250 (311)
                      |.|..|++.|.+...|..|++.|
T Consensus         1 ~~C~~C~~~f~~~~~l~~H~~~H   23 (26)
T smart00355        1 YRCPECGKVFKSKSALKEHMRTH   23 (26)
T ss_pred             CCCCCCcchhCCHHHHHHHHHHh
Confidence            34566666666666666665544


No 37 
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=96.58  E-value=0.00075  Score=58.08  Aligned_cols=51  Identities=29%  Similarity=0.770  Sum_probs=36.9

Q ss_pred             cccccccc--cccccCChHHHHHHhhhhc-------------------CCCCeecCccccccCChHHHHHHH
Q psy8050         197 ERPYRCRL--CKKAFSDSSTLTKHLRIHS-------------------GEKPYQCKLCLLRFSQSGNLNRHM  247 (311)
Q Consensus       197 ~k~~~C~~--C~~~f~~~~~l~~H~~~h~-------------------~~~~~~C~~C~~~f~~~~~L~~H~  247 (311)
                      ++||+|++  |.+.|.+...|+.|+.--+                   ..+||.|++|++.+.....|.-|.
T Consensus       347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr  418 (423)
T COG5189         347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHR  418 (423)
T ss_pred             CceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceecc
Confidence            47777766  7777777777777764221                   247899999999998888887774


No 38 
>KOG2231|consensus
Probab=96.41  E-value=0.0052  Score=59.04  Aligned_cols=100  Identities=24%  Similarity=0.550  Sum_probs=61.5

Q ss_pred             cCcCccccCChhHHHHHHHHhcCCCceecCCCC---------ccccCcchhhhhhhhccc-cc----cccccccccccCC
Q psy8050         146 CEDCNKSFSQAANLTAHVRTHSGEKPFRCPICD---------RRFSQSSSVTTHMRTHSG-ER----PYRCRLCKKAFSD  211 (311)
Q Consensus       146 C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~---------~~f~~~~~l~~H~~~h~~-~k----~~~C~~C~~~f~~  211 (311)
                      |..| -.|.+...|..|+...|+  .+.|.+|-         ....+...|..|+..--. +.    .-.|..|...|..
T Consensus       118 ~~~c-~~~~s~~~Lk~H~~~~H~--~~~c~lC~~~~kif~~e~k~Yt~~el~~h~~~gd~d~~s~rGhp~C~~C~~~fld  194 (669)
T KOG2231|consen  118 CLHC-TEFKSVENLKNHMRDQHK--LHLCSLCLQNLKIFINERKLYTRAELNLHLMFGDPDDESCRGHPLCKFCHERFLD  194 (669)
T ss_pred             Cccc-cchhHHHHHHHHHHHhhh--hhccccccccceeeeeeeehehHHHHHHHHhcCCCccccccCCccchhhhhhhcc
Confidence            4455 445567788888843322  23444432         122344566666654211 11    2468888888888


Q ss_pred             hHHHHHHhhhhcCCCCeecCcc------ccccCChHHHHHHHHHhCC
Q psy8050         212 SSTLTKHLRIHSGEKPYQCKLC------LLRFSQSGNLNRHMRVHGS  252 (311)
Q Consensus       212 ~~~l~~H~~~h~~~~~~~C~~C------~~~f~~~~~L~~H~~~H~~  252 (311)
                      ...|.+|++.++    |.|..|      +.-|.....|..|.+.+|-
T Consensus       195 ~~el~rH~~~~h----~~chfC~~~~~~neyy~~~~dLe~HfR~~Hf  237 (669)
T KOG2231|consen  195 DDELYRHLRFDH----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGHF  237 (669)
T ss_pred             HHHHHHhhccce----eheeecCcccccchhcccchHHHHHhhhcCc
Confidence            888888888765    556666      3457778888888876543


No 39 
>KOG2231|consensus
Probab=96.40  E-value=0.0078  Score=57.90  Aligned_cols=122  Identities=22%  Similarity=0.473  Sum_probs=79.5

Q ss_pred             ccCCCCcCCChhHHHHHHhhhCCCCCeecCcCc---------cccCChhHHHHHHHHhcC-CC----ceecCCCCccccC
Q psy8050         118 QFIPGKTYARPSTLKTHLRTHSGEKPYRCEDCN---------KSFSQAANLTAHVRTHSG-EK----PFRCPICDRRFSQ  183 (311)
Q Consensus       118 C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~---------~~f~~~~~L~~H~~~h~~-~~----~~~C~~C~~~f~~  183 (311)
                      |..| ..|.+...|+.|+...+  +.+.|.+|-         ....+...|..|+..-.. +.    .-.|..|...|..
T Consensus       118 ~~~c-~~~~s~~~Lk~H~~~~H--~~~~c~lC~~~~kif~~e~k~Yt~~el~~h~~~gd~d~~s~rGhp~C~~C~~~fld  194 (669)
T KOG2231|consen  118 CLHC-TEFKSVENLKNHMRDQH--KLHLCSLCLQNLKIFINERKLYTRAELNLHLMFGDPDDESCRGHPLCKFCHERFLD  194 (669)
T ss_pred             Cccc-cchhHHHHHHHHHHHhh--hhhccccccccceeeeeeeehehHHHHHHHHhcCCCccccccCCccchhhhhhhcc
Confidence            4445 45557889999995433  234454442         223346778888764321 21    2469999999999


Q ss_pred             cchhhhhhhhcccccccccccc------ccccCChHHHHHHhhhhcCCCCeecC--ccc-cccCChHHHHHHHHHh
Q psy8050         184 SSSVTTHMRTHSGERPYRCRLC------KKAFSDSSTLTKHLRIHSGEKPYQCK--LCL-LRFSQSGNLNRHMRVH  250 (311)
Q Consensus       184 ~~~l~~H~~~h~~~k~~~C~~C------~~~f~~~~~l~~H~~~h~~~~~~~C~--~C~-~~f~~~~~L~~H~~~H  250 (311)
                      ...|.+|++.++    |.|..|      +.-|.....|..|.+.+|    |.|+  .|. ..|.....+..+++.|
T Consensus       195 ~~el~rH~~~~h----~~chfC~~~~~~neyy~~~~dLe~HfR~~H----flCE~~~C~~~~f~~~~~~ei~lk~~  262 (669)
T KOG2231|consen  195 DDELYRHLRFDH----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH----FLCEEEFCRTKKFYVAFELEIELKAH  262 (669)
T ss_pred             HHHHHHhhccce----eheeecCcccccchhcccchHHHHHhhhcC----ccccccccccceeeehhHHHHHHHhh
Confidence            999999998765    556666      456788899999998877    7787  564 3344443455555433


No 40 
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.08  E-value=0.0055  Score=32.23  Aligned_cols=24  Identities=46%  Similarity=0.962  Sum_probs=17.8

Q ss_pred             ccccccccccCChHHHHHHhhhhc
Q psy8050         200 YRCRLCKKAFSDSSTLTKHLRIHS  223 (311)
Q Consensus       200 ~~C~~C~~~f~~~~~l~~H~~~h~  223 (311)
                      |.|..|++.|.....|..|++.|.
T Consensus         1 ~~C~~C~~~f~~~~~l~~H~~~H~   24 (26)
T smart00355        1 YRCPECGKVFKSKSALKEHMRTHX   24 (26)
T ss_pred             CCCCCCcchhCCHHHHHHHHHHhc
Confidence            467778888888888888777553


No 41 
>PF13909 zf-H2C2_5:  C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.00  E-value=0.0055  Score=32.01  Aligned_cols=23  Identities=39%  Similarity=0.803  Sum_probs=12.9

Q ss_pred             eecCccccccCChHHHHHHHHHhC
Q psy8050         228 YQCKLCLLRFSQSGNLNRHMRVHG  251 (311)
Q Consensus       228 ~~C~~C~~~f~~~~~L~~H~~~H~  251 (311)
                      |.|+.|+.... ...|..|++.|+
T Consensus         1 y~C~~C~y~t~-~~~l~~H~~~~H   23 (24)
T PF13909_consen    1 YKCPHCSYSTS-KSNLKRHLKRHH   23 (24)
T ss_dssp             EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred             CCCCCCCCcCC-HHHHHHHHHhhC
Confidence            45666666665 666666666543


No 42 
>PF12874 zf-met:  Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.98  E-value=0.0041  Score=32.85  Aligned_cols=21  Identities=33%  Similarity=0.866  Sum_probs=13.2

Q ss_pred             eecCccccccCChHHHHHHHH
Q psy8050         228 YQCKLCLLRFSQSGNLNRHMR  248 (311)
Q Consensus       228 ~~C~~C~~~f~~~~~L~~H~~  248 (311)
                      |.|.+|+..|.+...|..|++
T Consensus         1 ~~C~~C~~~f~s~~~~~~H~~   21 (25)
T PF12874_consen    1 FYCDICNKSFSSENSLRQHLR   21 (25)
T ss_dssp             EEETTTTEEESSHHHHHHHHT
T ss_pred             CCCCCCCCCcCCHHHHHHHHC
Confidence            456666666666666666654


No 43 
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=95.93  E-value=0.011  Score=51.67  Aligned_cols=129  Identities=24%  Similarity=0.434  Sum_probs=68.2

Q ss_pred             eeccC--CCCcCCChhHHHHHHhhhCCCCCeecCcCc---cccC------ChhHHHHHHHHhcCCCc----eecCCCCcc
Q psy8050         116 LIQFI--PGKTYARPSTLKTHLRTHSGEKPYRCEDCN---KSFS------QAANLTAHVRTHSGEKP----FRCPICDRR  180 (311)
Q Consensus       116 ~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~---~~f~------~~~~L~~H~~~h~~~~~----~~C~~C~~~  180 (311)
                      |.|+.  |..+...-..|..|.+..++.  +.|.+|-   +.|.      +...|..|...-..+..    -.|..|...
T Consensus       152 F~CP~skc~~~C~~~k~lk~H~K~~H~~--~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~~  229 (493)
T COG5236         152 FKCPKSKCHRRCGSLKELKKHYKAQHGF--VLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCKIY  229 (493)
T ss_pred             hcCCchhhhhhhhhHHHHHHHHHhhcCc--EEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhhccce
Confidence            44443  445555556666676654432  4455542   2232      23445555433222211    247777777


Q ss_pred             ccCcchhhhhhhhcccccccccccccc-------ccCChHHHHHHhhhhcCCCCeecCc--cc----cccCChHHHHHHH
Q psy8050         181 FSQSSSVTTHMRTHSGERPYRCRLCKK-------AFSDSSTLTKHLRIHSGEKPYQCKL--CL----LRFSQSGNLNRHM  247 (311)
Q Consensus       181 f~~~~~l~~H~~~h~~~k~~~C~~C~~-------~f~~~~~l~~H~~~h~~~~~~~C~~--C~----~~f~~~~~L~~H~  247 (311)
                      |....+|.+|++..+ +   .|-+|++       -|.+...|..|.+.-+    |.|.+  |.    ..|.+...|..|+
T Consensus       230 FYdDDEL~~HcR~~H-E---~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h----y~ct~qtc~~~k~~vf~~~~el~~h~  301 (493)
T COG5236         230 FYDDDELRRHCRLRH-E---ACHICDMVGPIRYQYFKSYEDLEAHFRNAH----YCCTFQTCRVGKCYVFPYHTELLEHL  301 (493)
T ss_pred             ecChHHHHHHHHhhh-h---hhhhhhccCccchhhhhCHHHHHHHhhcCc----eEEEEEEEecCcEEEeccHHHHHHHH
Confidence            777777777777544 2   3444443       2666667777665332    65643  42    3577777777775


Q ss_pred             -HHhCCCC
Q psy8050         248 -RVHGSQY  254 (311)
Q Consensus       248 -~~H~~~~  254 (311)
                       +.|....
T Consensus       302 ~~~h~~~~  309 (493)
T COG5236         302 TRFHKVNA  309 (493)
T ss_pred             HHHhhccc
Confidence             3454333


No 44 
>PF09237 GAGA:  GAGA factor;  InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=95.91  E-value=0.0069  Score=37.48  Aligned_cols=27  Identities=33%  Similarity=0.626  Sum_probs=11.8

Q ss_pred             CCeecCccccccCChHHHHHHHHHhCC
Q psy8050         226 KPYQCKLCLLRFSQSGNLNRHMRVHGS  252 (311)
Q Consensus       226 ~~~~C~~C~~~f~~~~~L~~H~~~H~~  252 (311)
                      .|-.|++|+..+.+..+|.+|+.++++
T Consensus        23 ~PatCP~C~a~~~~srnLrRHle~~H~   49 (54)
T PF09237_consen   23 QPATCPICGAVIRQSRNLRRHLEIRHF   49 (54)
T ss_dssp             --EE-TTT--EESSHHHHHHHHHHHTT
T ss_pred             CCCCCCcchhhccchhhHHHHHHHHhc
Confidence            445555555555555555555554443


No 45 
>PRK04860 hypothetical protein; Provisional
Probab=95.70  E-value=0.005  Score=49.00  Aligned_cols=39  Identities=33%  Similarity=0.778  Sum_probs=32.3

Q ss_pred             ccccccccccccCChHHHHHHhhhhcCCCCeecCccccccCCh
Q psy8050         198 RPYRCRLCKKAFSDSSTLTKHLRIHSGEKPYQCKLCLLRFSQS  240 (311)
Q Consensus       198 k~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~  240 (311)
                      -+|.|. |+.   ....+.+|.++|.++++|.|..|+..|...
T Consensus       118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~  156 (160)
T PRK04860        118 FPYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFK  156 (160)
T ss_pred             EEEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEe
Confidence            468897 887   667788999999999999999999887654


No 46 
>PF09237 GAGA:  GAGA factor;  InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=95.69  E-value=0.014  Score=36.11  Aligned_cols=29  Identities=28%  Similarity=0.490  Sum_probs=14.9

Q ss_pred             CCeecCcCccccCChhHHHHHHHHhcCCC
Q psy8050         142 KPYRCEDCNKSFSQAANLTAHVRTHSGEK  170 (311)
Q Consensus       142 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~  170 (311)
                      .|-.|++|+..+....+|.+|+...++.+
T Consensus        23 ~PatCP~C~a~~~~srnLrRHle~~H~~k   51 (54)
T PF09237_consen   23 QPATCPICGAVIRQSRNLRRHLEIRHFKK   51 (54)
T ss_dssp             --EE-TTT--EESSHHHHHHHHHHHTTTS
T ss_pred             CCCCCCcchhhccchhhHHHHHHHHhccc
Confidence            45666666666666666666666555544


No 47 
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=95.50  E-value=0.0075  Score=55.86  Aligned_cols=144  Identities=28%  Similarity=0.494  Sum_probs=88.4

Q ss_pred             CeecCcCccccCChhHHHHHHH--HhcCC--CceecC--CCCccccCcchhhhhhhhccccccccccc--cccccCChHH
Q psy8050         143 PYRCEDCNKSFSQAANLTAHVR--THSGE--KPFRCP--ICDRRFSQSSSVTTHMRTHSGERPYRCRL--CKKAFSDSST  214 (311)
Q Consensus       143 ~~~C~~C~~~f~~~~~L~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~l~~H~~~h~~~k~~~C~~--C~~~f~~~~~  214 (311)
                      ++.|..|...|.....|..|..  .|.++  .++.|+  .|++.|.....+..|...|.+..++.+..  +...+.....
T Consensus       289 ~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  368 (467)
T COG5048         289 PIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLLN  368 (467)
T ss_pred             CCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCccccccccC
Confidence            5677777777777777777777  67777  777777  67777777777777777777766655544  3333332222


Q ss_pred             HH-----HHhhhhcCCCCeecC--ccccccCChHHHHHHHHHhCCCCCCCCcccccccccccccccccccchhhceeeee
Q psy8050         215 LT-----KHLRIHSGEKPYQCK--LCLLRFSQSGNLNRHMRVHGSQYNMLPVIYMGVNQVCSNIVDQRRGIYVVEAVWTV  287 (311)
Q Consensus       215 l~-----~H~~~h~~~~~~~C~--~C~~~f~~~~~L~~H~~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~  287 (311)
                      -.     .....-.....+.+.  .|...+.....+..|...|........     .+..|...+.....|..|++.|.-
T Consensus       369 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~  443 (467)
T COG5048         369 NEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNC-----KNPPCSKSFNRHYNLIPHKKIHTN  443 (467)
T ss_pred             CCCccchhhccCccCCccccccccchhhhhccccccccccccccccCCcCC-----CCCcchhhccCccccccccccccc
Confidence            11     111111112223332  255556666666666666655443211     578888888888888888887776


Q ss_pred             cccc
Q psy8050         288 QSTW  291 (311)
Q Consensus       288 ~~~~  291 (311)
                      ..+.
T Consensus       444 ~~~~  447 (467)
T COG5048         444 HAPL  447 (467)
T ss_pred             CCce
Confidence            6665


No 48 
>PF12874 zf-met:  Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.40  E-value=0.0086  Score=31.55  Aligned_cols=23  Identities=39%  Similarity=0.941  Sum_probs=16.0

Q ss_pred             ccccccccccCChHHHHHHhhhh
Q psy8050         200 YRCRLCKKAFSDSSTLTKHLRIH  222 (311)
Q Consensus       200 ~~C~~C~~~f~~~~~l~~H~~~h  222 (311)
                      |.|.+|+..|.+...|..|++.+
T Consensus         1 ~~C~~C~~~f~s~~~~~~H~~s~   23 (25)
T PF12874_consen    1 FYCDICNKSFSSENSLRQHLRSK   23 (25)
T ss_dssp             EEETTTTEEESSHHHHHHHHTTH
T ss_pred             CCCCCCCCCcCCHHHHHHHHCcC
Confidence            46777777777777777776643


No 49 
>PF13909 zf-H2C2_5:  C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=95.36  E-value=0.017  Score=30.12  Aligned_cols=22  Identities=41%  Similarity=0.889  Sum_probs=10.7

Q ss_pred             eecCcCccccCChhHHHHHHHHh
Q psy8050         144 YRCEDCNKSFSQAANLTAHVRTH  166 (311)
Q Consensus       144 ~~C~~C~~~f~~~~~L~~H~~~h  166 (311)
                      |.|+.|+.... ...|..|++.+
T Consensus         1 y~C~~C~y~t~-~~~l~~H~~~~   22 (24)
T PF13909_consen    1 YKCPHCSYSTS-KSNLKRHLKRH   22 (24)
T ss_dssp             EE-SSSS-EES-HHHHHHHHHHH
T ss_pred             CCCCCCCCcCC-HHHHHHHHHhh
Confidence            44555555554 55555555544


No 50 
>KOG1146|consensus
Probab=94.99  E-value=0.013  Score=59.94  Aligned_cols=51  Identities=22%  Similarity=0.303  Sum_probs=30.7

Q ss_pred             eeeccCCCCcCCChhHHHHHHh-hhCCCCCeecCcCccccCChhHHHHHHHH
Q psy8050         115 ILIQFIPGKTYARPSTLKTHLR-THSGEKPYRCEDCNKSFSQAANLTAHVRT  165 (311)
Q Consensus       115 ~~~C~~C~~~f~~~~~L~~H~~-~h~~~~~~~C~~C~~~f~~~~~L~~H~~~  165 (311)
                      ...|.-|+..+.....+..|+. .+...+.+.|+.|+..|.....|..||+.
T Consensus       436 l~e~~~~e~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRs  487 (1406)
T KOG1146|consen  436 LVELTKAEPLLESKRSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRS  487 (1406)
T ss_pred             cccccchhhhhhhhcccccceeeeecccccccCCccchhhhhHHHhhhcccc
Confidence            3344444545555444444442 34444667788888877777777777765


No 51 
>PRK04860 hypothetical protein; Provisional
Probab=94.86  E-value=0.023  Score=45.26  Aligned_cols=34  Identities=38%  Similarity=0.902  Sum_probs=16.1

Q ss_pred             eecCCCCccccCcchhhhhhhhcccccccccccccccc
Q psy8050         172 FRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAF  209 (311)
Q Consensus       172 ~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f  209 (311)
                      |.|. |+.   ....+.+|.++|.++++|.|..|+..|
T Consensus       120 Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l  153 (160)
T PRK04860        120 YRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETL  153 (160)
T ss_pred             EEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCcee
Confidence            4444 443   334444455555544445555554444


No 52 
>KOG2482|consensus
Probab=94.85  E-value=0.061  Score=47.12  Aligned_cols=57  Identities=18%  Similarity=0.255  Sum_probs=37.0

Q ss_pred             eecCccccccCChHHHHHHHHH-hCC-------CC--CCC----------Ccccccccccccccccccccchhhcee
Q psy8050         228 YQCKLCLLRFSQSGNLNRHMRV-HGS-------QY--NML----------PVIYMGVNQVCSNIVDQRRGIYVVEAV  284 (311)
Q Consensus       228 ~~C~~C~~~f~~~~~L~~H~~~-H~~-------~~--~~~----------~~~~~~~C~~C~~~f~~~~~l~~H~~~  284 (311)
                      ..|-.|.....+...|..||.+ |.-       +.  +..          ...+...|-.|.-.|....+|..|+..
T Consensus       280 v~CLfC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~e  356 (423)
T KOG2482|consen  280 VVCLFCTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMVE  356 (423)
T ss_pred             eEEEeeccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhccc
Confidence            4788888887778888888763 421       10  000          001223688888899999999999854


No 53 
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=94.45  E-value=0.0093  Score=55.26  Aligned_cols=142  Identities=25%  Similarity=0.409  Sum_probs=105.1

Q ss_pred             ceeeccCCCCcCCChhHHHHHHh--hhCCC--CCeecC--cCccccCChhHHHHHHHHhcCCCceecCC--CCccccCcc
Q psy8050         114 LILIQFIPGKTYARPSTLKTHLR--THSGE--KPYRCE--DCNKSFSQAANLTAHVRTHSGEKPFRCPI--CDRRFSQSS  185 (311)
Q Consensus       114 ~~~~C~~C~~~f~~~~~L~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~L~~H~~~h~~~~~~~C~~--C~~~f~~~~  185 (311)
                      ....|..|...|.....|..|..  .|.++  +++.|+  .|++.|.....+..|...|.+..++.+..  +...+....
T Consensus       288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  367 (467)
T COG5048         288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLL  367 (467)
T ss_pred             cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCcccccccc
Confidence            46788999999999999999999  89999  999999  79999999999999999999988777755  444433333


Q ss_pred             hhhhhhh-----hccccccccccc--cccccCChHHHHHHhhhhcCCC--CeecCccccccCChHHHHHHHHHhCCCCC
Q psy8050         186 SVTTHMR-----THSGERPYRCRL--CKKAFSDSSTLTKHLRIHSGEK--PYQCKLCLLRFSQSGNLNRHMRVHGSQYN  255 (311)
Q Consensus       186 ~l~~H~~-----~h~~~k~~~C~~--C~~~f~~~~~l~~H~~~h~~~~--~~~C~~C~~~f~~~~~L~~H~~~H~~~~~  255 (311)
                      .-..+..     .-.....+.+..  |...+.....+..|...|....  .+.+..|...|.....+..|++.|.....
T Consensus       368 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  446 (467)
T COG5048         368 NNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAP  446 (467)
T ss_pred             CCCCccchhhccCccCCccccccccchhhhhccccccccccccccccCCcCCCCCcchhhccCcccccccccccccCCc
Confidence            2111111     112233344433  6667777777777777776655  35678899999999999999888866653


No 54 
>PF12171 zf-C2H2_jaz:  Zinc-finger double-stranded RNA-binding;  InterPro: IPR022755  This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation.   This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=94.37  E-value=0.018  Score=31.03  Aligned_cols=20  Identities=35%  Similarity=0.858  Sum_probs=9.1

Q ss_pred             ccccccccccCChHHHHHHh
Q psy8050         200 YRCRLCKKAFSDSSTLTKHL  219 (311)
Q Consensus       200 ~~C~~C~~~f~~~~~l~~H~  219 (311)
                      |.|..|++.|.+...|..|+
T Consensus         2 ~~C~~C~k~f~~~~~~~~H~   21 (27)
T PF12171_consen    2 FYCDACDKYFSSENQLKQHM   21 (27)
T ss_dssp             CBBTTTTBBBSSHHHHHCCT
T ss_pred             CCcccCCCCcCCHHHHHHHH
Confidence            34444444444444444444


No 55 
>KOG1146|consensus
Probab=94.18  E-value=0.028  Score=57.58  Aligned_cols=59  Identities=27%  Similarity=0.499  Sum_probs=41.2

Q ss_pred             CCeecCccccccCChHHHHHHHH--HhCCC-----CC--------------CCC---------------ccccccccccc
Q psy8050         226 KPYQCKLCLLRFSQSGNLNRHMR--VHGSQ-----YN--------------MLP---------------VIYMGVNQVCS  269 (311)
Q Consensus       226 ~~~~C~~C~~~f~~~~~L~~H~~--~H~~~-----~~--------------~~~---------------~~~~~~C~~C~  269 (311)
                      ++|.|..|..+++...+|..|+.  .|..+     ++              ..|               ....|.|.+|+
T Consensus       517 ~p~~C~~C~~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~~~C~vc~  596 (1406)
T KOG1146|consen  517 KPYPCRACNYSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPSWRCEVCS  596 (1406)
T ss_pred             CcccceeeeeeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCCcchhhhc
Confidence            57888899999998888888875  23221     00              000               01236899999


Q ss_pred             ccccccccchhhcee
Q psy8050         270 NIVDQRRGIYVVEAV  284 (311)
Q Consensus       270 ~~f~~~~~l~~H~~~  284 (311)
                      +.-.-...|+.|+..
T Consensus       597 yetniarnlrihmts  611 (1406)
T KOG1146|consen  597 YETNIARNLRIHMTA  611 (1406)
T ss_pred             chhhhhhcccccccc
Confidence            999999999999864


No 56 
>PF12171 zf-C2H2_jaz:  Zinc-finger double-stranded RNA-binding;  InterPro: IPR022755  This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation.   This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=93.93  E-value=0.026  Score=30.37  Aligned_cols=22  Identities=36%  Similarity=0.899  Sum_probs=15.9

Q ss_pred             eecCCCCccccCcchhhhhhhh
Q psy8050         172 FRCPICDRRFSQSSSVTTHMRT  193 (311)
Q Consensus       172 ~~C~~C~~~f~~~~~l~~H~~~  193 (311)
                      |.|..|++.|.+...|..|+..
T Consensus         2 ~~C~~C~k~f~~~~~~~~H~~s   23 (27)
T PF12171_consen    2 FYCDACDKYFSSENQLKQHMKS   23 (27)
T ss_dssp             CBBTTTTBBBSSHHHHHCCTTS
T ss_pred             CCcccCCCCcCCHHHHHHHHcc
Confidence            5677777777777777777654


No 57 
>KOG2785|consensus
Probab=93.08  E-value=0.4  Score=42.94  Aligned_cols=23  Identities=35%  Similarity=0.681  Sum_probs=17.3

Q ss_pred             CCeecCcCccccCChhHHHHHHH
Q psy8050         142 KPYRCEDCNKSFSQAANLTAHVR  164 (311)
Q Consensus       142 ~~~~C~~C~~~f~~~~~L~~H~~  164 (311)
                      .++.|.+|.+.|.+......|+.
T Consensus        67 ~~~~c~~c~k~~~s~~a~~~hl~   89 (390)
T KOG2785|consen   67 SVVYCEACNKSFASPKAHENHLK   89 (390)
T ss_pred             cceehHHhhccccChhhHHHHHH
Confidence            45778888888888777777764


No 58 
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=93.02  E-value=0.17  Score=37.35  Aligned_cols=48  Identities=19%  Similarity=0.351  Sum_probs=27.3

Q ss_pred             cccccccccCChHHHHHHhhhhcCCCCeecCccccccCChHHHHHHHHHh
Q psy8050         201 RCRLCKKAFSDSSTLTKHLRIHSGEKPYQCKLCLLRFSQSGNLNRHMRVH  250 (311)
Q Consensus       201 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~H  250 (311)
                      .|-.|...|........-  .-.....|.|+.|...|-..-++..|...|
T Consensus        57 ~C~~C~~~f~~~~~~~~~--~~~~~~~y~C~~C~~~FC~dCD~fiHe~Lh  104 (112)
T TIGR00622        57 FCFGCQGPFPKPPVSPFD--ELKDSHRYVCAVCKNVFCVDCDVFVHESLH  104 (112)
T ss_pred             cccCcCCCCCCccccccc--ccccccceeCCCCCCccccccchhhhhhcc
Confidence            377777777653211100  011234577888888877777777776655


No 59 
>KOG4173|consensus
Probab=92.72  E-value=0.037  Score=44.90  Aligned_cols=46  Identities=24%  Similarity=0.642  Sum_probs=28.3

Q ss_pred             eecCc--CccccCChhHHHHHHHHhcCCCceecCCCCccccCcchhhhhhh
Q psy8050         144 YRCED--CNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQSSSVTTHMR  192 (311)
Q Consensus       144 ~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~  192 (311)
                      |.|++  |.+.|.+......|..+-++.   .|..|.+.|.+...|..|+.
T Consensus        80 ~~cqvagc~~~~d~lD~~E~hY~~~h~~---sCs~C~r~~Pt~hLLd~HI~  127 (253)
T KOG4173|consen   80 FACQVAGCCQVFDALDDYEHHYHTLHGN---SCSFCKRAFPTGHLLDAHIL  127 (253)
T ss_pred             ccccccchHHHHhhhhhHHHhhhhcccc---hhHHHHHhCCchhhhhHHHH
Confidence            56654  666666666666665544443   46666666666666666654


No 60 
>PF13913 zf-C2HC_2:  zinc-finger of a C2HC-type
Probab=92.18  E-value=0.14  Score=27.07  Aligned_cols=20  Identities=25%  Similarity=0.713  Sum_probs=11.8

Q ss_pred             eecCccccccCChHHHHHHHH
Q psy8050         228 YQCKLCLLRFSQSGNLNRHMR  248 (311)
Q Consensus       228 ~~C~~C~~~f~~~~~L~~H~~  248 (311)
                      ..|+.||+.| ..+.|..|+.
T Consensus         3 ~~C~~CgR~F-~~~~l~~H~~   22 (25)
T PF13913_consen    3 VPCPICGRKF-NPDRLEKHEK   22 (25)
T ss_pred             CcCCCCCCEE-CHHHHHHHHH
Confidence            3566666666 4555666654


No 61 
>PF13913 zf-C2HC_2:  zinc-finger of a C2HC-type
Probab=92.05  E-value=0.14  Score=27.05  Aligned_cols=19  Identities=26%  Similarity=0.690  Sum_probs=9.5

Q ss_pred             ecCcCccccCChhHHHHHHH
Q psy8050         145 RCEDCNKSFSQAANLTAHVR  164 (311)
Q Consensus       145 ~C~~C~~~f~~~~~L~~H~~  164 (311)
                      .|+.||+.| ....|..|+.
T Consensus         4 ~C~~CgR~F-~~~~l~~H~~   22 (25)
T PF13913_consen    4 PCPICGRKF-NPDRLEKHEK   22 (25)
T ss_pred             cCCCCCCEE-CHHHHHHHHH
Confidence            355555555 3445555543


No 62 
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=92.02  E-value=0.24  Score=43.58  Aligned_cols=133  Identities=20%  Similarity=0.347  Sum_probs=83.7

Q ss_pred             CeecCc--CccccCChhHHHHHHHHhcCCCceecCCCC---cccc------Ccchhhhhhhhccccc----ccccccccc
Q psy8050         143 PYRCED--CNKSFSQAANLTAHVRTHSGEKPFRCPICD---RRFS------QSSSVTTHMRTHSGER----PYRCRLCKK  207 (311)
Q Consensus       143 ~~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~---~~f~------~~~~l~~H~~~h~~~k----~~~C~~C~~  207 (311)
                      .|.|+.  |..+...-..|..|....++.  +.|.+|-   +.|.      +...|..|...-..+.    .-.|..|..
T Consensus       151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~~--~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~  228 (493)
T COG5236         151 SFKCPKSKCHRRCGSLKELKKHYKAQHGF--VLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCKI  228 (493)
T ss_pred             HhcCCchhhhhhhhhHHHHHHHHHhhcCc--EEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhhccc
Confidence            377864  666666677899998876654  6787763   2343      3456666665433332    236999999


Q ss_pred             ccCChHHHHHHhhhhcCCCCeecCccc----cccCChHHHHHHHHHhCCCCCCCCcccccccccc-cccccccccchhhc
Q psy8050         208 AFSDSSTLTKHLRIHSGEKPYQCKLCL----LRFSQSGNLNRHMRVHGSQYNMLPVIYMGVNQVC-SNIVDQRRGIYVVE  282 (311)
Q Consensus       208 ~f~~~~~l~~H~~~h~~~~~~~C~~C~----~~f~~~~~L~~H~~~H~~~~~~~~~~~~~~C~~C-~~~f~~~~~l~~H~  282 (311)
                      .|.....|..|++..+ ++-|.|+.-+    +-|.+..+|..|.+.-+---   .++   .|..= -.+|.....|..|+
T Consensus       229 ~FYdDDEL~~HcR~~H-E~ChICD~v~p~~~QYFK~Y~~Le~HF~~~hy~c---t~q---tc~~~k~~vf~~~~el~~h~  301 (493)
T COG5236         229 YFYDDDELRRHCRLRH-EACHICDMVGPIRYQYFKSYEDLEAHFRNAHYCC---TFQ---TCRVGKCYVFPYHTELLEHL  301 (493)
T ss_pred             eecChHHHHHHHHhhh-hhhhhhhccCccchhhhhCHHHHHHHhhcCceEE---EEE---EEecCcEEEeccHHHHHHHH
Confidence            9999999999999755 3323343322    34888899999976321110   010   12211 24688888899998


Q ss_pred             ee
Q psy8050         283 AV  284 (311)
Q Consensus       283 ~~  284 (311)
                      ..
T Consensus       302 ~~  303 (493)
T COG5236         302 TR  303 (493)
T ss_pred             HH
Confidence            54


No 63 
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=90.68  E-value=0.23  Score=28.28  Aligned_cols=22  Identities=27%  Similarity=0.760  Sum_probs=18.0

Q ss_pred             CeecCccccccCChHHHHHHHH
Q psy8050         227 PYQCKLCLLRFSQSGNLNRHMR  248 (311)
Q Consensus       227 ~~~C~~C~~~f~~~~~L~~H~~  248 (311)
                      +|.|++|+..|.+...+..|+.
T Consensus         3 ~~~C~~C~~~~~~~~~~~~H~~   24 (35)
T smart00451        3 GFYCKLCNVTFTDEISVEAHLK   24 (35)
T ss_pred             CeEccccCCccCCHHHHHHHHC
Confidence            4778888888888888888874


No 64 
>KOG2482|consensus
Probab=89.95  E-value=0.37  Score=42.43  Aligned_cols=51  Identities=20%  Similarity=0.409  Sum_probs=38.9

Q ss_pred             eecCCCCccccCcchhhhhhhhccc---------------------------cccccccccccccCChHHHHHHhhhh
Q psy8050         172 FRCPICDRRFSQSSSVTTHMRTHSG---------------------------ERPYRCRLCKKAFSDSSTLTKHLRIH  222 (311)
Q Consensus       172 ~~C~~C~~~f~~~~~l~~H~~~h~~---------------------------~k~~~C~~C~~~f~~~~~l~~H~~~h  222 (311)
                      ..|-.|.....+...|..||..-+.                           ...-.|..|...|-....|..|+..+
T Consensus       280 v~CLfC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~e~  357 (423)
T KOG2482|consen  280 VVCLFCTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMVED  357 (423)
T ss_pred             eEEEeeccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhcccc
Confidence            5788888888888888888875431                           12246888999999999999998643


No 65 
>KOG2893|consensus
Probab=89.19  E-value=0.1  Score=43.39  Aligned_cols=49  Identities=27%  Similarity=0.686  Sum_probs=30.3

Q ss_pred             ccccccccCChHHHHHHhhhhcCCCCeecCccccccCChHHHHHH-HHHhCCCC
Q psy8050         202 CRLCKKAFSDSSTLTKHLRIHSGEKPYQCKLCLLRFSQSGNLNRH-MRVHGSQY  254 (311)
Q Consensus       202 C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H-~~~H~~~~  254 (311)
                      |=+|++.|.....|.+|++..+    |+|.+|.+..-+-..|..| +++|....
T Consensus        13 cwycnrefddekiliqhqkakh----fkchichkkl~sgpglsihcmqvhketi   62 (341)
T KOG2893|consen   13 CWYCNREFDDEKILIQHQKAKH----FKCHICHKKLFSGPGLSIHCMQVHKETI   62 (341)
T ss_pred             eeecccccchhhhhhhhhhhcc----ceeeeehhhhccCCCceeehhhhhhhhh
Confidence            5567777777777777665433    6777777665555556666 45564443


No 66 
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=88.79  E-value=0.37  Score=27.38  Aligned_cols=22  Identities=27%  Similarity=0.751  Sum_probs=15.4

Q ss_pred             CeecCcCccccCChhHHHHHHH
Q psy8050         143 PYRCEDCNKSFSQAANLTAHVR  164 (311)
Q Consensus       143 ~~~C~~C~~~f~~~~~L~~H~~  164 (311)
                      +|.|++|+..|.+...+..|+.
T Consensus         3 ~~~C~~C~~~~~~~~~~~~H~~   24 (35)
T smart00451        3 GFYCKLCNVTFTDEISVEAHLK   24 (35)
T ss_pred             CeEccccCCccCCHHHHHHHHC
Confidence            3667777777777777777763


No 67 
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=88.73  E-value=1.1  Score=33.16  Aligned_cols=85  Identities=20%  Similarity=0.396  Sum_probs=47.2

Q ss_pred             CCeecCcCccccCChhHHHHHHHHhcCCC------------ceecCCCCccccCcchhhhhhhhcccccccccccccccc
Q psy8050         142 KPYRCEDCNKSFSQAANLTAHVRTHSGEK------------PFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAF  209 (311)
Q Consensus       142 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~------------~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f  209 (311)
                      .|..|++||-+......|.+....--.-.            .-.|--|...|........-  .-.....|.|+.|...|
T Consensus        14 LP~~CpiCgLtLVss~HLARSyHHLfPl~~f~ev~~~~~~~~~~C~~C~~~f~~~~~~~~~--~~~~~~~y~C~~C~~~F   91 (112)
T TIGR00622        14 LPVECPICGLTLILSTHLARSYHHLFPLKAFQEIPLEEYNGSRFCFGCQGPFPKPPVSPFD--ELKDSHRYVCAVCKNVF   91 (112)
T ss_pred             CCCcCCcCCCEEeccchHHHhhhccCCCcccccccccccCCCCcccCcCCCCCCccccccc--ccccccceeCCCCCCcc
Confidence            45566677666666666655321111111            12377788877654321100  01123468888888888


Q ss_pred             CChHHHHHHhhhhcCCCCeecCccc
Q psy8050         210 SDSSTLTKHLRIHSGEKPYQCKLCL  234 (311)
Q Consensus       210 ~~~~~l~~H~~~h~~~~~~~C~~C~  234 (311)
                      -..-+.-.|...|      .|+.|.
T Consensus        92 C~dCD~fiHe~Lh------~CPGC~  110 (112)
T TIGR00622        92 CVDCDVFVHESLH------CCPGCI  110 (112)
T ss_pred             ccccchhhhhhcc------CCcCCC
Confidence            8777777776655      366664


No 68 
>PF12013 DUF3505:  Protein of unknown function (DUF3505);  InterPro: IPR022698  This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains. 
Probab=88.19  E-value=1.2  Score=32.90  Aligned_cols=25  Identities=28%  Similarity=0.689  Sum_probs=21.2

Q ss_pred             eec----CCCCccccCcchhhhhhhhccc
Q psy8050         172 FRC----PICDRRFSQSSSVTTHMRTHSG  196 (311)
Q Consensus       172 ~~C----~~C~~~f~~~~~l~~H~~~h~~  196 (311)
                      |.|    ..|++.+.+...+..|++.++|
T Consensus        81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg  109 (109)
T PF12013_consen   81 YRCQCDPPHCGYITRSKKTMRKHWRKEHG  109 (109)
T ss_pred             eeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence            788    8899999999999999887664


No 69 
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer.  Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain.  Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=88.17  E-value=0.39  Score=27.18  Aligned_cols=8  Identities=38%  Similarity=1.148  Sum_probs=3.3

Q ss_pred             ecCCCCcc
Q psy8050         173 RCPICDRR  180 (311)
Q Consensus       173 ~C~~C~~~  180 (311)
                      .|..||..
T Consensus         3 ~C~~CGy~   10 (33)
T cd00350           3 VCPVCGYI   10 (33)
T ss_pred             ECCCCCCE
Confidence            34444433


No 70 
>KOG2893|consensus
Probab=86.74  E-value=0.27  Score=40.95  Aligned_cols=41  Identities=17%  Similarity=0.458  Sum_probs=21.0

Q ss_pred             ccCCCCcCCChhHHHHHHhhhCCCCCeecCcCccccCChhHHHHH
Q psy8050         118 QFIPGKTYARPSTLKTHLRTHSGEKPYRCEDCNKSFSQAANLTAH  162 (311)
Q Consensus       118 C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H  162 (311)
                      |..|++.|....-|.+|++..    -|.|.+|.+...+--.|..|
T Consensus        13 cwycnrefddekiliqhqkak----hfkchichkkl~sgpglsih   53 (341)
T KOG2893|consen   13 CWYCNREFDDEKILIQHQKAK----HFKCHICHKKLFSGPGLSIH   53 (341)
T ss_pred             eeecccccchhhhhhhhhhhc----cceeeeehhhhccCCCceee
Confidence            555555555555555555432    25555555554444444444


No 71 
>PF06524 NOA36:  NOA36 protein;  InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=86.20  E-value=0.55  Score=39.78  Aligned_cols=26  Identities=27%  Similarity=0.511  Sum_probs=17.1

Q ss_pred             CCCeecCccccccCChHHHHHHHHHh
Q psy8050         225 EKPYQCKLCLLRFSQSGNLNRHMRVH  250 (311)
Q Consensus       225 ~~~~~C~~C~~~f~~~~~L~~H~~~H  250 (311)
                      .+++.|+.||........|..-.|+|
T Consensus       207 ~k~~PCPKCg~et~eTkdLSmStR~h  232 (314)
T PF06524_consen  207 GKPIPCPKCGYETQETKDLSMSTRSH  232 (314)
T ss_pred             CCCCCCCCCCCcccccccceeeeecc
Confidence            35677888887776666665555555


No 72 
>PF12013 DUF3505:  Protein of unknown function (DUF3505);  InterPro: IPR022698  This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains. 
Probab=85.00  E-value=1.1  Score=33.08  Aligned_cols=24  Identities=17%  Similarity=0.564  Sum_probs=17.0

Q ss_pred             CCeecCcCccccCChhHHHHHHHHh
Q psy8050         142 KPYRCEDCNKSFSQAANLTAHVRTH  166 (311)
Q Consensus       142 ~~~~C~~C~~~f~~~~~L~~H~~~h  166 (311)
                      +...|..|+.+..- +++..|++..
T Consensus        10 ~vlIC~~C~~av~~-~~v~~HL~~~   33 (109)
T PF12013_consen   10 RVLICRQCQYAVQP-SEVESHLRKR   33 (109)
T ss_pred             CEEEeCCCCcccCc-hHHHHHHHHh
Confidence            45678888877655 7788887743


No 73 
>PF09986 DUF2225:  Uncharacterized protein conserved in bacteria (DUF2225);  InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=83.32  E-value=0.2  Score=42.07  Aligned_cols=48  Identities=19%  Similarity=0.274  Sum_probs=26.3

Q ss_pred             CCeecCccccccCChHHHHHHHHH---------hCCCCCCCCcc-ccccccccccccccc
Q psy8050         226 KPYQCKLCLLRFSQSGNLNRHMRV---------HGSQYNMLPVI-YMGVNQVCSNIVDQR  275 (311)
Q Consensus       226 ~~~~C~~C~~~f~~~~~L~~H~~~---------H~~~~~~~~~~-~~~~C~~C~~~f~~~  275 (311)
                      +.+.|++|+..|.+..-+....+.         +-++  +.|.. ..++|+.||+.+...
T Consensus         4 k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~--vnP~~Y~V~vCP~CgyA~~~~   61 (214)
T PF09986_consen    4 KKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKG--VNPLFYEVWVCPHCGYAAFEE   61 (214)
T ss_pred             CceECCCCCCeeeeeEEEcCCceEeeecCCCccccCC--CCCeeeeEEECCCCCCccccc
Confidence            446777777777665444333322         1111  22322 235799998888754


No 74 
>PF09538 FYDLN_acid:  Protein of unknown function (FYDLN_acid);  InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=82.69  E-value=1  Score=33.35  Aligned_cols=34  Identities=15%  Similarity=0.286  Sum_probs=18.9

Q ss_pred             CcceeeccCCCCcCCChhHHHHHHhhhCCCCCeecCcCccccCCh
Q psy8050         112 IRLILIQFIPGKTYARPSTLKTHLRTHSGEKPYRCEDCNKSFSQA  156 (311)
Q Consensus       112 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~  156 (311)
                      ++...+|+.||..|...           +..|..|+.||..|.-.
T Consensus         6 lGtKR~Cp~CG~kFYDL-----------nk~PivCP~CG~~~~~~   39 (108)
T PF09538_consen    6 LGTKRTCPSCGAKFYDL-----------NKDPIVCPKCGTEFPPE   39 (108)
T ss_pred             cCCcccCCCCcchhccC-----------CCCCccCCCCCCccCcc
Confidence            34445566666666432           23556666666666543


No 75 
>KOG2186|consensus
Probab=82.43  E-value=0.83  Score=38.61  Aligned_cols=49  Identities=18%  Similarity=0.493  Sum_probs=27.6

Q ss_pred             eeeccCCCCcCCChhHHHHHHhhhCCCCCeecCcCccccCChhHHHHHHHHh
Q psy8050         115 ILIQFIPGKTYARPSTLKTHLRTHSGEKPYRCEDCNKSFSQAANLTAHVRTH  166 (311)
Q Consensus       115 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h  166 (311)
                      .|.|..||....-. .|-.|+..-++ .-|.|-.|+..|.. .+...|..--
T Consensus         3 ~FtCnvCgEsvKKp-~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~kCI   51 (276)
T KOG2186|consen    3 FFTCNVCGESVKKP-QVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTKCI   51 (276)
T ss_pred             EEehhhhhhhcccc-chHHHHHhccC-CeeEEeeccccccc-chhhhhhhhc
Confidence            46666776665443 33346644444 44666667766665 5555665433


No 76 
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=80.20  E-value=0.97  Score=28.60  Aligned_cols=29  Identities=28%  Similarity=0.576  Sum_probs=19.9

Q ss_pred             CCCCCeecCcCccccCChhHHHHHHHHhc
Q psy8050         139 SGEKPYRCEDCNKSFSQAANLTAHVRTHS  167 (311)
Q Consensus       139 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~  167 (311)
                      -|+..+.|+-|+..|....+..+|....+
T Consensus        13 DGE~~lrCPRC~~~FR~~K~Y~RHVNKaH   41 (65)
T COG4049          13 DGEEFLRCPRCGMVFRRRKDYIRHVNKAH   41 (65)
T ss_pred             CCceeeeCCchhHHHHHhHHHHHHhhHHh
Confidence            34556777777777777777777775443


No 77 
>KOG4173|consensus
Probab=79.78  E-value=0.61  Score=38.08  Aligned_cols=77  Identities=32%  Similarity=0.680  Sum_probs=52.7

Q ss_pred             ceecCC--CCccccCcchhhhhhhhccccccccccccccccCChHHHHHHhhhh----------cCCCCeecC--ccccc
Q psy8050         171 PFRCPI--CDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAFSDSSTLTKHLRIH----------SGEKPYQCK--LCLLR  236 (311)
Q Consensus       171 ~~~C~~--C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~l~~H~~~h----------~~~~~~~C~--~C~~~  236 (311)
                      .|.|++  |...|.+...+..|...-++.   .|..|.+.|.+...|..|+...          .|...|+|-  .|+..
T Consensus        79 ~~~cqvagc~~~~d~lD~~E~hY~~~h~~---sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~K  155 (253)
T KOG4173|consen   79 AFACQVAGCCQVFDALDDYEHHYHTLHGN---SCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTEK  155 (253)
T ss_pred             cccccccchHHHHhhhhhHHHhhhhcccc---hhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhhh
Confidence            466765  667777777777776655543   6888888888887777776532          245567774  48888


Q ss_pred             cCChHHHHHHH-HHh
Q psy8050         237 FSQSGNLNRHM-RVH  250 (311)
Q Consensus       237 f~~~~~L~~H~-~~H  250 (311)
                      |.+...-..|+ ++|
T Consensus       156 FkT~r~RkdH~I~~H  170 (253)
T KOG4173|consen  156 FKTSRDRKDHMIRMH  170 (253)
T ss_pred             hhhhhhhhhHHHHhc
Confidence            88887777775 455


No 78 
>PF13717 zinc_ribbon_4:  zinc-ribbon domain
Probab=79.59  E-value=2  Score=24.84  Aligned_cols=33  Identities=15%  Similarity=0.254  Sum_probs=15.3

Q ss_pred             eeccCCCCcCCChhHHHHHHhhhCCCCCeecCcCcccc
Q psy8050         116 LIQFIPGKTYARPSTLKTHLRTHSGEKPYRCEDCNKSF  153 (311)
Q Consensus       116 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f  153 (311)
                      +.|+.|+..|.-....     +.......+|+.|+..|
T Consensus         3 i~Cp~C~~~y~i~d~~-----ip~~g~~v~C~~C~~~f   35 (36)
T PF13717_consen    3 ITCPNCQAKYEIDDEK-----IPPKGRKVRCSKCGHVF   35 (36)
T ss_pred             EECCCCCCEEeCCHHH-----CCCCCcEEECCCCCCEe
Confidence            3455555555444332     11222335566665554


No 79 
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and  believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=76.93  E-value=2  Score=24.47  Aligned_cols=9  Identities=33%  Similarity=1.154  Sum_probs=3.7

Q ss_pred             eecCCCCcc
Q psy8050         172 FRCPICDRR  180 (311)
Q Consensus       172 ~~C~~C~~~  180 (311)
                      |.|..||..
T Consensus         3 ~~C~~CG~i   11 (34)
T cd00729           3 WVCPVCGYI   11 (34)
T ss_pred             EECCCCCCE
Confidence            344444433


No 80 
>PF09538 FYDLN_acid:  Protein of unknown function (FYDLN_acid);  InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=76.15  E-value=1.8  Score=32.01  Aligned_cols=14  Identities=29%  Similarity=0.669  Sum_probs=7.4

Q ss_pred             CCeecCccccccCC
Q psy8050         226 KPYQCKLCLLRFSQ  239 (311)
Q Consensus       226 ~~~~C~~C~~~f~~  239 (311)
                      .|..|+.||..|.-
T Consensus        25 ~PivCP~CG~~~~~   38 (108)
T PF09538_consen   25 DPIVCPKCGTEFPP   38 (108)
T ss_pred             CCccCCCCCCccCc
Confidence            44555555555543


No 81 
>PF09986 DUF2225:  Uncharacterized protein conserved in bacteria (DUF2225);  InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=75.62  E-value=1.4  Score=36.96  Aligned_cols=13  Identities=23%  Similarity=0.496  Sum_probs=7.7

Q ss_pred             ccccccccccCCh
Q psy8050         200 YRCRLCKKAFSDS  212 (311)
Q Consensus       200 ~~C~~C~~~f~~~  212 (311)
                      ..|+.||.++...
T Consensus        49 ~vCP~CgyA~~~~   61 (214)
T PF09986_consen   49 WVCPHCGYAAFEE   61 (214)
T ss_pred             EECCCCCCccccc
Confidence            4577777665533


No 82 
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=75.30  E-value=1.6  Score=27.68  Aligned_cols=25  Identities=28%  Similarity=0.596  Sum_probs=14.7

Q ss_pred             CCCCeecCccccccCChHHHHHHHH
Q psy8050         224 GEKPYQCKLCLLRFSQSGNLNRHMR  248 (311)
Q Consensus       224 ~~~~~~C~~C~~~f~~~~~L~~H~~  248 (311)
                      ||.-+.|+.||..|....+..+|..
T Consensus        14 GE~~lrCPRC~~~FR~~K~Y~RHVN   38 (65)
T COG4049          14 GEEFLRCPRCGMVFRRRKDYIRHVN   38 (65)
T ss_pred             CceeeeCCchhHHHHHhHHHHHHhh
Confidence            4444566666666666666666654


No 83 
>PF13719 zinc_ribbon_5:  zinc-ribbon domain
Probab=74.96  E-value=3  Score=24.22  Aligned_cols=33  Identities=18%  Similarity=0.382  Sum_probs=15.0

Q ss_pred             eeccCCCCcCCChhHHHHHHhhhCCCCCeecCcCcccc
Q psy8050         116 LIQFIPGKTYARPSTLKTHLRTHSGEKPYRCEDCNKSF  153 (311)
Q Consensus       116 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f  153 (311)
                      ..|+.|+..|.-...-     ...+.....|+.|+..|
T Consensus         3 i~CP~C~~~f~v~~~~-----l~~~~~~vrC~~C~~~f   35 (37)
T PF13719_consen    3 ITCPNCQTRFRVPDDK-----LPAGGRKVRCPKCGHVF   35 (37)
T ss_pred             EECCCCCceEEcCHHH-----cccCCcEEECCCCCcEe
Confidence            3455555555443321     11122345566665555


No 84 
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=74.43  E-value=3.6  Score=32.73  Aligned_cols=34  Identities=18%  Similarity=0.478  Sum_probs=22.3

Q ss_pred             CCCCCeecCcCccccCChhHHHHHHHHhcCCCceecCCCCccc
Q psy8050         139 SGEKPYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRF  181 (311)
Q Consensus       139 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f  181 (311)
                      ....-|.|+.|+..|.....+.         ..|.|+.||...
T Consensus       105 ~~~~~Y~Cp~c~~r~tf~eA~~---------~~F~Cp~Cg~~L  138 (158)
T TIGR00373       105 TNNMFFICPNMCVRFTFNEAME---------LNFTCPRCGAML  138 (158)
T ss_pred             cCCCeEECCCCCcEeeHHHHHH---------cCCcCCCCCCEe
Confidence            3445577877877777666664         147777777653


No 85 
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=74.29  E-value=2.7  Score=24.41  Aligned_cols=10  Identities=30%  Similarity=0.996  Sum_probs=4.8

Q ss_pred             eecCcCcccc
Q psy8050         144 YRCEDCNKSF  153 (311)
Q Consensus       144 ~~C~~C~~~f  153 (311)
                      ..|+.|+..|
T Consensus        26 v~C~~C~~~~   35 (38)
T TIGR02098        26 VRCGKCGHVW   35 (38)
T ss_pred             EECCCCCCEE
Confidence            4455554444


No 86 
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=71.59  E-value=2.8  Score=33.50  Aligned_cols=10  Identities=20%  Similarity=0.657  Sum_probs=5.2

Q ss_pred             CeecCcCccc
Q psy8050         143 PYRCEDCNKS  152 (311)
Q Consensus       143 ~~~C~~C~~~  152 (311)
                      .|.|++||.+
T Consensus       134 ~~vC~vCGy~  143 (166)
T COG1592         134 VWVCPVCGYT  143 (166)
T ss_pred             EEEcCCCCCc
Confidence            3555555544


No 87 
>KOG2186|consensus
Probab=70.49  E-value=2.8  Score=35.56  Aligned_cols=47  Identities=21%  Similarity=0.526  Sum_probs=35.8

Q ss_pred             CeecCcCccccCChhHHHHHHHHhcCCCceecCCCCccccCcchhhhhhh
Q psy8050         143 PYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRFSQSSSVTTHMR  192 (311)
Q Consensus       143 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~  192 (311)
                      .|.|..||.... +..+.+|+...++ ..|.|-.|+..|.. .++..|..
T Consensus         3 ~FtCnvCgEsvK-Kp~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~k   49 (276)
T KOG2186|consen    3 FFTCNVCGESVK-KPQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTK   49 (276)
T ss_pred             EEehhhhhhhcc-ccchHHHHHhccC-CeeEEeeccccccc-chhhhhhh
Confidence            378888888866 4567778877777 56888889998887 66777754


No 88 
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=69.90  E-value=2.6  Score=33.69  Aligned_cols=10  Identities=40%  Similarity=0.913  Sum_probs=4.8

Q ss_pred             CCCeecCccc
Q psy8050         225 EKPYQCKLCL  234 (311)
Q Consensus       225 ~~~~~C~~C~  234 (311)
                      +-|-.|++||
T Consensus       147 e~P~~CPiCg  156 (166)
T COG1592         147 EAPEVCPICG  156 (166)
T ss_pred             CCCCcCCCCC
Confidence            4444455554


No 89 
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=68.62  E-value=3.9  Score=30.93  Aligned_cols=34  Identities=12%  Similarity=0.108  Sum_probs=19.4

Q ss_pred             CcceeeccCCCCcCCChhHHHHHHhhhCCCCCeecCcCccccCCh
Q psy8050         112 IRLILIQFIPGKTYARPSTLKTHLRTHSGEKPYRCEDCNKSFSQA  156 (311)
Q Consensus       112 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~  156 (311)
                      ......|+.||+.|..           -+..|..|+.||..|.-.
T Consensus         6 lGtKr~Cp~cg~kFYD-----------Lnk~p~vcP~cg~~~~~~   39 (129)
T TIGR02300         6 LGTKRICPNTGSKFYD-----------LNRRPAVSPYTGEQFPPE   39 (129)
T ss_pred             hCccccCCCcCccccc-----------cCCCCccCCCcCCccCcc
Confidence            3444566677766643           233566677777666443


No 90 
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=68.62  E-value=9.4  Score=29.93  Aligned_cols=38  Identities=18%  Similarity=0.592  Sum_probs=22.6

Q ss_pred             CCCCeecCcCccccCChhHHHHHHHHhcCCCceecCCCCccc
Q psy8050         140 GEKPYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRF  181 (311)
Q Consensus       140 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f  181 (311)
                      ....|.|+.|+..|.....+..   .. ....|.|+.||...
T Consensus        96 ~~~~Y~Cp~C~~~y~~~ea~~~---~d-~~~~f~Cp~Cg~~l  133 (147)
T smart00531       96 NNAYYKCPNCQSKYTFLEANQL---LD-MDGTFTCPRCGEEL  133 (147)
T ss_pred             CCcEEECcCCCCEeeHHHHHHh---cC-CCCcEECCCCCCEE
Confidence            3456888888888775443322   01 12338888887754


No 91 
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=67.04  E-value=5.8  Score=32.26  Aligned_cols=32  Identities=25%  Similarity=0.695  Sum_probs=20.0

Q ss_pred             CCCeecCcCccccCChhHHHHHHHHhcCCCceecCCCCccc
Q psy8050         141 EKPYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRRF  181 (311)
Q Consensus       141 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f  181 (311)
                      ..-|.|+.|+..|.....+.         ..|.|+.||...
T Consensus       115 ~~~Y~Cp~C~~rytf~eA~~---------~~F~Cp~Cg~~L  146 (178)
T PRK06266        115 NMFFFCPNCHIRFTFDEAME---------YGFRCPQCGEML  146 (178)
T ss_pred             CCEEECCCCCcEEeHHHHhh---------cCCcCCCCCCCC
Confidence            34577777777776665542         247777777654


No 92 
>KOG1701|consensus
Probab=66.15  E-value=2.7  Score=38.40  Aligned_cols=44  Identities=18%  Similarity=0.177  Sum_probs=28.8

Q ss_pred             ccccccccccccccchhhceeeeecccccC--CCCCCCCccccccC
Q psy8050         264 VNQVCSNIVDQRRGIYVVEAVWTVQSTWID--GRSTMCGHSFIVKL  307 (311)
Q Consensus       264 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~--~kC~~C~~~f~~~~  307 (311)
                      +|.+|+...-=...-..=.++...++.|-.  |+|+.|+-...++.
T Consensus       396 rCs~C~~PI~P~~G~~etvRvvamdr~fHv~CY~CEDCg~~LS~e~  441 (468)
T KOG1701|consen  396 RCSVCGNPILPRDGKDETVRVVAMDRDFHVNCYKCEDCGLLLSSEE  441 (468)
T ss_pred             chhhccCCccCCCCCcceEEEEEccccccccceehhhcCccccccC
Confidence            799999887666555555565555554322  47889987766543


No 93 
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=65.76  E-value=3.7  Score=25.67  Aligned_cols=10  Identities=50%  Similarity=1.428  Sum_probs=4.0

Q ss_pred             eecCcCcccc
Q psy8050         144 YRCEDCNKSF  153 (311)
Q Consensus       144 ~~C~~C~~~f  153 (311)
                      |.|..||..|
T Consensus         6 y~C~~Cg~~f   15 (52)
T TIGR02605         6 YRCTACGHRF   15 (52)
T ss_pred             EEeCCCCCEe
Confidence            3344444433


No 94 
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=64.28  E-value=8.5  Score=24.85  Aligned_cols=10  Identities=50%  Similarity=1.451  Sum_probs=5.6

Q ss_pred             CceecCCCCc
Q psy8050         170 KPFRCPICDR  179 (311)
Q Consensus       170 ~~~~C~~C~~  179 (311)
                      .+|.|+.||.
T Consensus        49 ~~Y~Cp~CGF   58 (61)
T COG2888          49 NPYRCPKCGF   58 (61)
T ss_pred             CceECCCcCc
Confidence            3466666654


No 95 
>TIGR01384 TFS_arch transcription factor S, archaeal. There has been an apparent duplication event in the Halobacteriaceae lineage (Haloarcula, Haloferax, Haloquadratum, Halobacterium and Natromonas). There appears to be a separate duplication in Methanosphaera stadtmanae.
Probab=63.99  E-value=3.2  Score=30.35  Aligned_cols=13  Identities=8%  Similarity=-0.171  Sum_probs=9.4

Q ss_pred             CCCCCCCCCCCCC
Q psy8050          79 PYQVGAPQQFPTA   91 (311)
Q Consensus        79 ~~~C~~c~~~~~~   91 (311)
                      .+.|..|+.....
T Consensus        16 ~~~C~~C~~~~~~   28 (104)
T TIGR01384        16 VYVCPSCGYEKEK   28 (104)
T ss_pred             eEECcCCCCcccc
Confidence            5789999876543


No 96 
>KOG1280|consensus
Probab=63.83  E-value=7.4  Score=34.65  Aligned_cols=22  Identities=18%  Similarity=0.345  Sum_probs=10.9

Q ss_pred             CeecCcCccccCChhHHHHHHH
Q psy8050         143 PYRCEDCNKSFSQAANLTAHVR  164 (311)
Q Consensus       143 ~~~C~~C~~~f~~~~~L~~H~~  164 (311)
                      .|.|++|+..-.+...|..|..
T Consensus        79 SftCPyC~~~Gfte~~f~~Hv~  100 (381)
T KOG1280|consen   79 SFTCPYCGIMGFTERQFGTHVL  100 (381)
T ss_pred             cccCCcccccccchhHHHHHhh
Confidence            3555555555444455555543


No 97 
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=63.03  E-value=6.5  Score=23.87  Aligned_cols=10  Identities=30%  Similarity=1.016  Sum_probs=4.4

Q ss_pred             eecCcCcccc
Q psy8050         144 YRCEDCNKSF  153 (311)
Q Consensus       144 ~~C~~C~~~f  153 (311)
                      |.|..||..|
T Consensus         3 Y~C~~Cg~~~   12 (44)
T smart00659        3 YICGECGREN   12 (44)
T ss_pred             EECCCCCCEe
Confidence            4444444443


No 98 
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=62.99  E-value=4.7  Score=23.63  Aligned_cols=10  Identities=40%  Similarity=1.198  Sum_probs=4.2

Q ss_pred             eecCCCCccc
Q psy8050         172 FRCPICDRRF  181 (311)
Q Consensus       172 ~~C~~C~~~f  181 (311)
                      |.|..||..|
T Consensus         6 y~C~~Cg~~f   15 (41)
T smart00834        6 YRCEDCGHTF   15 (41)
T ss_pred             EEcCCCCCEE
Confidence            3444444433


No 99 
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=62.77  E-value=5  Score=24.48  Aligned_cols=10  Identities=30%  Similarity=1.218  Sum_probs=4.3

Q ss_pred             eecCcCcccc
Q psy8050         144 YRCEDCNKSF  153 (311)
Q Consensus       144 ~~C~~C~~~f  153 (311)
                      |.|+.||..|
T Consensus         4 y~C~~CG~~~   13 (46)
T PRK00398          4 YKCARCGREV   13 (46)
T ss_pred             EECCCCCCEE
Confidence            3444444443


No 100
>KOG2879|consensus
Probab=62.26  E-value=11  Score=32.54  Aligned_cols=28  Identities=29%  Similarity=0.211  Sum_probs=22.2

Q ss_pred             HhhhhcCccccCcchhhhccCCCCCCCC
Q psy8050          16 KKRKKKGILANSSSKEKIQDRSECPGSG   43 (311)
Q Consensus        16 ~~~~~~~~~~~~~~~~~~~~~~~C~~c~   43 (311)
                      +.|..+-+.++..+++-.+..-+|+.||
T Consensus       219 ~sw~~~l~~ap~~sss~~t~~~~C~~Cg  246 (298)
T KOG2879|consen  219 KSWKLDLDRAPKFSSSTGTSDTECPVCG  246 (298)
T ss_pred             HhhcccccCCCCcccccccCCceeeccC
Confidence            4677776667777788888889999999


No 101
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=62.22  E-value=2  Score=33.92  Aligned_cols=16  Identities=25%  Similarity=0.567  Sum_probs=9.3

Q ss_pred             ccccccccccCChHHH
Q psy8050         200 YRCRLCKKAFSDSSTL  215 (311)
Q Consensus       200 ~~C~~C~~~f~~~~~l  215 (311)
                      +.|+.||+.|.+...+
T Consensus        29 ~~c~~c~~~f~~~e~~   44 (154)
T PRK00464         29 RECLACGKRFTTFERV   44 (154)
T ss_pred             eeccccCCcceEeEec
Confidence            5666666666655443


No 102
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=60.86  E-value=14  Score=23.95  Aligned_cols=10  Identities=40%  Similarity=1.288  Sum_probs=5.8

Q ss_pred             CceecCCCCc
Q psy8050         170 KPFRCPICDR  179 (311)
Q Consensus       170 ~~~~C~~C~~  179 (311)
                      .+|.|+.||.
T Consensus        47 ~~Y~CP~CGF   56 (59)
T PRK14890         47 NPYTCPKCGF   56 (59)
T ss_pred             CceECCCCCC
Confidence            3466666654


No 103
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=60.50  E-value=8  Score=30.77  Aligned_cols=32  Identities=22%  Similarity=0.466  Sum_probs=18.6

Q ss_pred             CCCceecCCCCccccCcchhhhhhhhccccccccccccccc
Q psy8050         168 GEKPFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKA  208 (311)
Q Consensus       168 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~  208 (311)
                      +..-|.|+.|+..|.....+.         ..|.|+.||..
T Consensus       106 ~~~~Y~Cp~c~~r~tf~eA~~---------~~F~Cp~Cg~~  137 (158)
T TIGR00373       106 NNMFFICPNMCVRFTFNEAME---------LNFTCPRCGAM  137 (158)
T ss_pred             CCCeEECCCCCcEeeHHHHHH---------cCCcCCCCCCE
Confidence            344566777776666555543         24666666654


No 104
>KOG4752|consensus
Probab=60.11  E-value=8.1  Score=19.63  Aligned_cols=11  Identities=45%  Similarity=0.751  Sum_probs=4.8

Q ss_pred             HHHHHHHhhhh
Q psy8050          10 KKKKKKKKRKK   20 (311)
Q Consensus        10 ~~~~~~~~~~~   20 (311)
                      +|+||+.||++
T Consensus         7 kkrmrrlkrkr   17 (26)
T KOG4752|consen    7 KKRMRRLKRKR   17 (26)
T ss_pred             HHHHHHHHHHH
Confidence            34444444443


No 105
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=59.49  E-value=7  Score=39.02  Aligned_cols=12  Identities=25%  Similarity=0.587  Sum_probs=7.1

Q ss_pred             cccccccccccc
Q psy8050         197 ERPYRCRLCKKA  208 (311)
Q Consensus       197 ~k~~~C~~C~~~  208 (311)
                      ..|..|+.||-.
T Consensus       473 ~~p~~Cp~Cgs~  484 (730)
T COG1198         473 PIPQSCPECGSE  484 (730)
T ss_pred             CCCCCCCCCCCC
Confidence            455666666644


No 106
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=58.18  E-value=7.1  Score=29.54  Aligned_cols=13  Identities=15%  Similarity=0.189  Sum_probs=6.9

Q ss_pred             CCeecCccccccC
Q psy8050         226 KPYQCKLCLLRFS  238 (311)
Q Consensus       226 ~~~~C~~C~~~f~  238 (311)
                      .|..|+.||..|.
T Consensus        25 ~p~vcP~cg~~~~   37 (129)
T TIGR02300        25 RPAVSPYTGEQFP   37 (129)
T ss_pred             CCccCCCcCCccC
Confidence            4455555555554


No 107
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=58.10  E-value=9.4  Score=23.69  Aligned_cols=21  Identities=48%  Similarity=0.893  Sum_probs=11.8

Q ss_pred             eecCccccccCCh-----HHHHHHHH
Q psy8050         228 YQCKLCLLRFSQS-----GNLNRHMR  248 (311)
Q Consensus       228 ~~C~~C~~~f~~~-----~~L~~H~~  248 (311)
                      -.|..|++.+...     +.|.+|++
T Consensus        19 a~C~~C~~~l~~~~~~gTs~L~rHl~   44 (50)
T smart00614       19 AKCKYCGKKLSRSSKGGTSNLRRHLR   44 (50)
T ss_pred             EEecCCCCEeeeCCCCCcHHHHHHHH
Confidence            3466666555433     46666665


No 108
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=57.88  E-value=9.3  Score=20.22  Aligned_cols=17  Identities=18%  Similarity=0.554  Sum_probs=7.2

Q ss_pred             cCcCccccCChhHHHHHH
Q psy8050         146 CEDCNKSFSQAANLTAHV  163 (311)
Q Consensus       146 C~~C~~~f~~~~~L~~H~  163 (311)
                      |++|++.+ ....+..|+
T Consensus         4 CPiC~~~v-~~~~in~HL   20 (26)
T smart00734        4 CPVCFREV-PENLINSHL   20 (26)
T ss_pred             CCCCcCcc-cHHHHHHHH
Confidence            44444444 233444443


No 109
>PF08188 Protamine_3:  Spermatozal protamine family;  InterPro: IPR012601 This entry consists of the spermatozal protamines. Spermatozal protamines play an important role in remodelling of the sperm chromatin during mammalian spermiogenesis. Nuclear elongation and chromatin condensation are concomitant with modifications in the basic protein complement associated with DNA. Somatic histones are initially replaced by testis-specific histone variants, then by transitional proteins, and ultimately by protamines [].; GO: 0003677 DNA binding, 0035092 sperm chromatin condensation, 0000228 nuclear chromosome
Probab=57.41  E-value=9.7  Score=22.17  Aligned_cols=23  Identities=43%  Similarity=0.723  Sum_probs=12.3

Q ss_pred             hhhHHHHHHHHHHHHhhhhcCcc
Q psy8050           2 KKKKKKKKKKKKKKKKRKKKGIL   24 (311)
Q Consensus         2 ~~~~~~~~~~~~~~~~~~~~~~~   24 (311)
                      |+|.|+-+.+|-|+--|++|.++
T Consensus         8 kkkrksvrrrktrknqrkrknsl   30 (48)
T PF08188_consen    8 KKKRKSVRRRKTRKNQRKRKNSL   30 (48)
T ss_pred             HHHHHHHHHHHHHhhHHHhhhhh
Confidence            34455555555555556555443


No 110
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=57.21  E-value=14  Score=29.01  Aligned_cols=39  Identities=8%  Similarity=0.087  Sum_probs=26.5

Q ss_pred             CcceeeccCCCCcCCChhHHHHHHhhhCCCCCeecCcCccccC
Q psy8050         112 IRLILIQFIPGKTYARPSTLKTHLRTHSGEKPYRCEDCNKSFS  154 (311)
Q Consensus       112 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~  154 (311)
                      ....+.|+.|+..|+....+..   .. ....|.|+.||....
T Consensus        96 ~~~~Y~Cp~C~~~y~~~ea~~~---~d-~~~~f~Cp~Cg~~l~  134 (147)
T smart00531       96 NNAYYKCPNCQSKYTFLEANQL---LD-MDGTFTCPRCGEELE  134 (147)
T ss_pred             CCcEEECcCCCCEeeHHHHHHh---cC-CCCcEECCCCCCEEE
Confidence            3557899999999886544332   01 134499999998764


No 111
>KOG2785|consensus
Probab=55.91  E-value=14  Score=33.48  Aligned_cols=49  Identities=22%  Similarity=0.269  Sum_probs=37.3

Q ss_pred             eeccCCCCcCCChhHHHHHHhhhCCCC-----------------------CeecCcCc---cccCChhHHHHHHH
Q psy8050         116 LIQFIPGKTYARPSTLKTHLRTHSGEK-----------------------PYRCEDCN---KSFSQAANLTAHVR  164 (311)
Q Consensus       116 ~~C~~C~~~f~~~~~L~~H~~~h~~~~-----------------------~~~C~~C~---~~f~~~~~L~~H~~  164 (311)
                      ..|..|+..+.+-..-..||..+++-.                       -|.|-.|+   ..|.+....+.||.
T Consensus       167 t~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~  241 (390)
T KOG2785|consen  167 TDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMR  241 (390)
T ss_pred             cceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHh
Confidence            789999999999999999998877621                       15566666   66666677777764


No 112
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=54.05  E-value=10  Score=30.78  Aligned_cols=30  Identities=30%  Similarity=0.718  Sum_probs=16.6

Q ss_pred             CceecCCCCccccCcchhhhhhhhccccccccccccccc
Q psy8050         170 KPFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKA  208 (311)
Q Consensus       170 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~  208 (311)
                      .-|.|+.|+..|.....+.         ..|.|+.||..
T Consensus       116 ~~Y~Cp~C~~rytf~eA~~---------~~F~Cp~Cg~~  145 (178)
T PRK06266        116 MFFFCPNCHIRFTFDEAME---------YGFRCPQCGEM  145 (178)
T ss_pred             CEEECCCCCcEEeHHHHhh---------cCCcCCCCCCC
Confidence            4466666666665554432         24666666654


No 113
>PF09723 Zn-ribbon_8:  Zinc ribbon domain;  InterPro: IPR013429  This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=53.88  E-value=7.9  Score=23.12  Aligned_cols=11  Identities=36%  Similarity=1.147  Sum_probs=4.7

Q ss_pred             eecCCCCcccc
Q psy8050         172 FRCPICDRRFS  182 (311)
Q Consensus       172 ~~C~~C~~~f~  182 (311)
                      |.|..||..|.
T Consensus         6 y~C~~Cg~~fe   16 (42)
T PF09723_consen    6 YRCEECGHEFE   16 (42)
T ss_pred             EEeCCCCCEEE
Confidence            34444444443


No 114
>PF02892 zf-BED:  BED zinc finger;  InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=53.65  E-value=16  Score=21.79  Aligned_cols=22  Identities=36%  Similarity=0.652  Sum_probs=10.3

Q ss_pred             CCeecCccccccCC----hHHHHHHH
Q psy8050         226 KPYQCKLCLLRFSQ----SGNLNRHM  247 (311)
Q Consensus       226 ~~~~C~~C~~~f~~----~~~L~~H~  247 (311)
                      ....|..|++.+..    .+.|..|+
T Consensus        15 ~~a~C~~C~~~~~~~~~~ts~l~~HL   40 (45)
T PF02892_consen   15 KKAKCKYCGKVIKYSSGGTSNLKRHL   40 (45)
T ss_dssp             S-EEETTTTEE-----SSTHHHHHHH
T ss_pred             CeEEeCCCCeEEeeCCCcHHHHHHhh
Confidence            33556666665544    25566665


No 115
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=53.64  E-value=6.1  Score=34.58  Aligned_cols=23  Identities=35%  Similarity=0.728  Sum_probs=11.6

Q ss_pred             eecCccccccCChHHHHHHHHHh
Q psy8050         228 YQCKLCLLRFSQSGNLNRHMRVH  250 (311)
Q Consensus       228 ~~C~~C~~~f~~~~~L~~H~~~H  250 (311)
                      |.|+.|...|-..-+.-.|...|
T Consensus       389 Y~Ce~CK~~FC~dCdvfiHe~Lh  411 (421)
T COG5151         389 YQCELCKSTFCSDCDVFIHETLH  411 (421)
T ss_pred             eechhhhhhhhhhhHHHHHHHHh
Confidence            55555555555544444444444


No 116
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=53.52  E-value=10  Score=37.96  Aligned_cols=40  Identities=30%  Similarity=0.658  Sum_probs=27.3

Q ss_pred             eecCCCCccccCcchhhhhhhhccccccccccccccccCChHHHHHHhhhhcCCCCeecCccccc
Q psy8050         172 FRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAFSDSSTLTKHLRIHSGEKPYQCKLCLLR  236 (311)
Q Consensus       172 ~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~  236 (311)
                      +.|+.|+-.      |.    .|.......|.+||..               ...|..|+.||..
T Consensus       445 ~~Cp~Cd~~------lt----~H~~~~~L~CH~Cg~~---------------~~~p~~Cp~Cgs~  484 (730)
T COG1198         445 AECPNCDSP------LT----LHKATGQLRCHYCGYQ---------------EPIPQSCPECGSE  484 (730)
T ss_pred             ccCCCCCcc------eE----EecCCCeeEeCCCCCC---------------CCCCCCCCCCCCC
Confidence            358888754      22    3334466889999863               3577899999876


No 117
>PF10571 UPF0547:  Uncharacterised protein family UPF0547;  InterPro: IPR018886  This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases. 
Probab=52.90  E-value=7.8  Score=20.57  Aligned_cols=9  Identities=33%  Similarity=0.726  Sum_probs=6.0

Q ss_pred             CCCCCcccc
Q psy8050         296 STMCGHSFI  304 (311)
Q Consensus       296 C~~C~~~f~  304 (311)
                      |.+||+.|.
T Consensus        17 Cp~CG~~F~   25 (26)
T PF10571_consen   17 CPHCGYDFE   25 (26)
T ss_pred             CCCCCCCCc
Confidence            677777664


No 118
>KOG2807|consensus
Probab=52.47  E-value=26  Score=31.10  Aligned_cols=25  Identities=24%  Similarity=0.486  Sum_probs=19.9

Q ss_pred             CCeecCccccccCChHHHHHHHHHh
Q psy8050         226 KPYQCKLCLLRFSQSGNLNRHMRVH  250 (311)
Q Consensus       226 ~~~~C~~C~~~f~~~~~L~~H~~~H  250 (311)
                      ..|.|+.|...|-..-+...|...|
T Consensus       344 ~~y~C~~Ck~~FCldCDv~iHesLh  368 (378)
T KOG2807|consen  344 GRYRCESCKNVFCLDCDVFIHESLH  368 (378)
T ss_pred             CcEEchhccceeeccchHHHHhhhh
Confidence            4588888888888877777787776


No 119
>PF05129 Elf1:  Transcription elongation factor Elf1 like;  InterPro: IPR007808 This family of uncharacterised, mostly short, proteins contain a putative zinc binding domain with four conserved cysteines.; PDB: 1WII_A.
Probab=52.38  E-value=6.2  Score=27.50  Aligned_cols=10  Identities=20%  Similarity=0.384  Sum_probs=3.4

Q ss_pred             cCCCCCCCCC
Q psy8050          35 DRSECPGSGM   44 (311)
Q Consensus        35 ~~~~C~~c~~   44 (311)
                      ..|.||.|+.
T Consensus        21 ~~F~CPfC~~   30 (81)
T PF05129_consen   21 KVFDCPFCNH   30 (81)
T ss_dssp             S----TTT--
T ss_pred             ceEcCCcCCC
Confidence            4599999993


No 120
>KOG4377|consensus
Probab=52.00  E-value=11  Score=34.30  Aligned_cols=110  Identities=25%  Similarity=0.522  Sum_probs=66.0

Q ss_pred             CCeec--CcCccccCChhHHHHHHHHhcCC------------CceecC--CCCccccCcchhhhhhhhcccc-------c
Q psy8050         142 KPYRC--EDCNKSFSQAANLTAHVRTHSGE------------KPFRCP--ICDRRFSQSSSVTTHMRTHSGE-------R  198 (311)
Q Consensus       142 ~~~~C--~~C~~~f~~~~~L~~H~~~h~~~------------~~~~C~--~C~~~f~~~~~l~~H~~~h~~~-------k  198 (311)
                      ..|.|  +.|+..+..+..+.+|..+|-..            ..|.|.  .|++   +.++...|-..|+..       .
T Consensus       270 Ehyhcl~e~C~ykr~~k~DvirH~~~hkkrdnsL~dgf~rfs~syhC~~~~C~k---sTsdV~~h~nFht~~~n~Gfrrt  346 (480)
T KOG4377|consen  270 EHYHCLNEYCFYKRGQKNDVIRHVEIHKKRDNSLIDGFHRFSNSYHCTGQICEK---STSDVLLHDNFHTDKRNNGFRRT  346 (480)
T ss_pred             hhhcccCccccccccchhhhHHHHHHHhhcccccccchhhcCccchhhhcccCc---ccccccccCccccccccCceecc
Confidence            44777  56988888888999998877432            125563  4777   344555666655421       2


Q ss_pred             cccccccc--cccCChHHHHHHhhhhcCCC------------------------CeecC--ccccccCChHHHHHHHHHh
Q psy8050         199 PYRCRLCK--KAFSDSSTLTKHLRIHSGEK------------------------PYQCK--LCLLRFSQSGNLNRHMRVH  250 (311)
Q Consensus       199 ~~~C~~C~--~~f~~~~~l~~H~~~h~~~~------------------------~~~C~--~C~~~f~~~~~L~~H~~~H  250 (311)
                      .|.|..|+  ..|.....-..|.+-+.++.                        .+.|.  -|+..+.+.+.+..|.+.|
T Consensus       347 hfhC~r~gCTdtfK~~khk~yh~kdda~~~dGfkkf~k~e~cay~gCkys~~cnhfhc~r~Gc~~tl~s~sqm~shkrkh  426 (480)
T KOG4377|consen  347 HFHCQRIGCTDTFKDSKHKPYHYKDDAGEIDGFKKFFKDENCAYTGCKYSGICNHFHCDRLGCEATLYSVSQMASHKRKH  426 (480)
T ss_pred             eeEEeccCCccccccccccccccCcchhhhhhhhhhhccccCCccCcccccceeeeeecccCCceEEEehhhhhhhhhhh
Confidence            36777766  55553333333333332211                        14453  3788888888888888877


Q ss_pred             CCCC
Q psy8050         251 GSQY  254 (311)
Q Consensus       251 ~~~~  254 (311)
                      ....
T Consensus       427 eRqe  430 (480)
T KOG4377|consen  427 ERQE  430 (480)
T ss_pred             hhhh
Confidence            5443


No 121
>KOG4602|consensus
Probab=51.58  E-value=14  Score=31.43  Aligned_cols=10  Identities=30%  Similarity=0.880  Sum_probs=6.4

Q ss_pred             CeecCcCccc
Q psy8050         143 PYRCEDCNKS  152 (311)
Q Consensus       143 ~~~C~~C~~~  152 (311)
                      .|.|++||.+
T Consensus       268 ~YVCPiCGAT  277 (318)
T KOG4602|consen  268 SYVCPICGAT  277 (318)
T ss_pred             hhcCcccccc
Confidence            3667777665


No 122
>PF05290 Baculo_IE-1:  Baculovirus immediate-early protein (IE-0);  InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=51.49  E-value=8.1  Score=29.45  Aligned_cols=17  Identities=24%  Similarity=0.647  Sum_probs=9.7

Q ss_pred             CCCceecCCCCccccCc
Q psy8050         168 GEKPFRCPICDRRFSQS  184 (311)
Q Consensus       168 ~~~~~~C~~C~~~f~~~  184 (311)
                      +...|+|.+|..+....
T Consensus        77 d~~lYeCnIC~etS~ee   93 (140)
T PF05290_consen   77 DPKLYECNICKETSAEE   93 (140)
T ss_pred             CCCceeccCcccccchh
Confidence            34557777776654433


No 123
>PF02176 zf-TRAF:  TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=51.33  E-value=11  Score=24.23  Aligned_cols=33  Identities=27%  Similarity=0.764  Sum_probs=14.2

Q ss_pred             CeecCc--CccccCChhHHHHHHHHhcCCCceecCC
Q psy8050         143 PYRCED--CNKSFSQAANLTAHVRTHSGEKPFRCPI  176 (311)
Q Consensus       143 ~~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~  176 (311)
                      +..|+.  |...+. ...|..|+...-...+..|..
T Consensus         9 ~v~C~~~cc~~~i~-r~~l~~H~~~~C~~~~v~C~~   43 (60)
T PF02176_consen    9 PVPCPNGCCNEMIP-RKELDDHLENECPKRPVPCPY   43 (60)
T ss_dssp             EEE-TT--S-BEEE-CCCHHHHHHTTSTTSEEE-SS
T ss_pred             EeeCCCCCccccee-HHHHHHHHHccCCCCcEECCC
Confidence            345554  333233 345555555444444555555


No 124
>PF05443 ROS_MUCR:  ROS/MUCR transcriptional regulator protein;  InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=49.45  E-value=10  Score=29.13  Aligned_cols=26  Identities=42%  Similarity=0.554  Sum_probs=15.6

Q ss_pred             cceeeccCCCCcCCChhHHHHHHhhhCCC
Q psy8050         113 RLILIQFIPGKTYARPSTLKTHLRTHSGE  141 (311)
Q Consensus       113 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~~  141 (311)
                      ...++|.+||+.|..   |++|++.|+|-
T Consensus        70 ~d~i~clecGk~~k~---LkrHL~~~~gl   95 (132)
T PF05443_consen   70 PDYIICLECGKKFKT---LKRHLRTHHGL   95 (132)
T ss_dssp             SS-EE-TBT--EESB---HHHHHHHTT-S
T ss_pred             cCeeEEccCCcccch---HHHHHHHccCC
Confidence            346789999998865   48888888663


No 125
>PF14446 Prok-RING_1:  Prokaryotic RING finger family 1
Probab=49.19  E-value=9.3  Score=24.29  Aligned_cols=12  Identities=17%  Similarity=0.302  Sum_probs=6.2

Q ss_pred             cccccccccccc
Q psy8050         264 VNQVCSNIVDQR  275 (311)
Q Consensus       264 ~C~~C~~~f~~~  275 (311)
                      .|+.|++.|...
T Consensus         7 ~C~~Cg~~~~~~   18 (54)
T PF14446_consen    7 KCPVCGKKFKDG   18 (54)
T ss_pred             cChhhCCcccCC
Confidence            355555555443


No 126
>PRK04023 DNA polymerase II large subunit; Validated
Probab=48.69  E-value=18  Score=37.14  Aligned_cols=9  Identities=33%  Similarity=1.025  Sum_probs=4.6

Q ss_pred             eecCcCccc
Q psy8050         144 YRCEDCNKS  152 (311)
Q Consensus       144 ~~C~~C~~~  152 (311)
                      ..|+.||..
T Consensus       627 RfCpsCG~~  635 (1121)
T PRK04023        627 RKCPSCGKE  635 (1121)
T ss_pred             ccCCCCCCc
Confidence            445555544


No 127
>PF08274 PhnA_Zn_Ribbon:  PhnA Zinc-Ribbon ;  InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=48.63  E-value=8.6  Score=21.20  Aligned_cols=10  Identities=20%  Similarity=0.770  Sum_probs=5.0

Q ss_pred             CCeecCcCcc
Q psy8050         142 KPYRCEDCNK  151 (311)
Q Consensus       142 ~~~~C~~C~~  151 (311)
                      ..|.|+.|+.
T Consensus        18 ~~~vCp~C~~   27 (30)
T PF08274_consen   18 ELLVCPECGH   27 (30)
T ss_dssp             SSEEETTTTE
T ss_pred             CEEeCCcccc
Confidence            3455555543


No 128
>CHL00152 rpl32 ribosomal protein L32; Validated
Probab=48.51  E-value=10  Score=23.97  Aligned_cols=22  Identities=36%  Similarity=0.469  Sum_probs=18.3

Q ss_pred             hhhHHHHHHHHHHHHhhhhcCc
Q psy8050           2 KKKKKKKKKKKKKKKKRKKKGI   23 (311)
Q Consensus         2 ~~~~~~~~~~~~~~~~~~~~~~   23 (311)
                      ++|+.|+-+++.|+.-|++|-.
T Consensus         4 PKKRtSksKk~~Rk~~Wk~Ka~   25 (53)
T CHL00152          4 PKKRTSKSKKRIRKNIWKRKAY   25 (53)
T ss_pred             CCCccchHHHHHHHHHHHHHHH
Confidence            4677788889999999999854


No 129
>PF05191 ADK_lid:  Adenylate kinase, active site lid;  InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=47.79  E-value=9.1  Score=22.08  Aligned_cols=11  Identities=18%  Similarity=0.395  Sum_probs=6.1

Q ss_pred             ccccccccccc
Q psy8050         264 VNQVCSNIVDQ  274 (311)
Q Consensus       264 ~C~~C~~~f~~  274 (311)
                      .|+.||..|..
T Consensus         3 ~C~~Cg~~Yh~   13 (36)
T PF05191_consen    3 ICPKCGRIYHI   13 (36)
T ss_dssp             EETTTTEEEET
T ss_pred             CcCCCCCcccc
Confidence            36666666553


No 130
>COG1655 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=47.76  E-value=3.6  Score=34.43  Aligned_cols=46  Identities=20%  Similarity=0.421  Sum_probs=23.7

Q ss_pred             CCeecCccccccCChHHHHHHHHHhCCCC-------CCCCcc-ccccccccccc
Q psy8050         226 KPYQCKLCLLRFSQSGNLNRHMRVHGSQY-------NMLPVI-YMGVNQVCSNI  271 (311)
Q Consensus       226 ~~~~C~~C~~~f~~~~~L~~H~~~H~~~~-------~~~~~~-~~~~C~~C~~~  271 (311)
                      +.+.|++|+-.|....-+..-.|+-.++.       ++.|.+ -.++|++|.+.
T Consensus        18 k~ieCPvC~tkFkkeev~tgsiRiiagDld~~lkygninP~fY~VvvCP~C~yA   71 (267)
T COG1655          18 KTIECPVCNTKFKKEEVKTGSIRIIAGDLDFFLKYGNINPYFYDVVVCPICYYA   71 (267)
T ss_pred             ceeccCcccchhhhhheeccceeEecccccceeeccccCCceeEEEEcchhhHH
Confidence            34666666666654433333333333322       233433 33679999865


No 131
>KOG2807|consensus
Probab=47.32  E-value=21  Score=31.65  Aligned_cols=23  Identities=13%  Similarity=0.346  Sum_probs=18.4

Q ss_pred             ccccccccccccccchhhceeee
Q psy8050         264 VNQVCSNIVDQRRGIYVVEAVWT  286 (311)
Q Consensus       264 ~C~~C~~~f~~~~~l~~H~~~h~  286 (311)
                      .|+.|...|-.-=+.-.|...|.
T Consensus       347 ~C~~Ck~~FCldCDv~iHesLh~  369 (378)
T KOG2807|consen  347 RCESCKNVFCLDCDVFIHESLHN  369 (378)
T ss_pred             EchhccceeeccchHHHHhhhhc
Confidence            68888888887777788877775


No 132
>cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer.
Probab=47.25  E-value=25  Score=21.95  Aligned_cols=9  Identities=33%  Similarity=0.364  Sum_probs=7.4

Q ss_pred             CCCCCCCCC
Q psy8050          37 SECPGSGMG   45 (311)
Q Consensus        37 ~~C~~c~~~   45 (311)
                      |+|..||+-
T Consensus         2 y~C~~Cgyi   10 (50)
T cd00730           2 YECRICGYI   10 (50)
T ss_pred             cCCCCCCeE
Confidence            899999954


No 133
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=47.07  E-value=14  Score=22.91  Aligned_cols=10  Identities=30%  Similarity=1.218  Sum_probs=5.1

Q ss_pred             eecCcCcccc
Q psy8050         144 YRCEDCNKSF  153 (311)
Q Consensus       144 ~~C~~C~~~f  153 (311)
                      |.|..||+.|
T Consensus         7 Y~C~~Cg~~~   16 (49)
T COG1996           7 YKCARCGREV   16 (49)
T ss_pred             EEhhhcCCee
Confidence            4555555554


No 134
>PF15269 zf-C2H2_7:  Zinc-finger
Probab=47.00  E-value=24  Score=21.31  Aligned_cols=22  Identities=23%  Similarity=0.102  Sum_probs=14.5

Q ss_pred             eeeccCCCCcCCChhHHHHHHh
Q psy8050         115 ILIQFIPGKTYARPSTLKTHLR  136 (311)
Q Consensus       115 ~~~C~~C~~~f~~~~~L~~H~~  136 (311)
                      .++|..|..+...++.|-.||.
T Consensus        20 ~ykcfqcpftc~~kshl~nhmk   41 (54)
T PF15269_consen   20 KYKCFQCPFTCNEKSHLFNHMK   41 (54)
T ss_pred             cceeecCCcccchHHHHHHHHH
Confidence            4566666666666666666664


No 135
>PF03604 DNA_RNApol_7kD:  DNA directed RNA polymerase, 7 kDa subunit;  InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=44.89  E-value=20  Score=20.10  Aligned_cols=7  Identities=43%  Similarity=1.407  Sum_probs=3.1

Q ss_pred             eecCcCc
Q psy8050         144 YRCEDCN  150 (311)
Q Consensus       144 ~~C~~C~  150 (311)
                      ..|..||
T Consensus        18 irC~~CG   24 (32)
T PF03604_consen   18 IRCPECG   24 (32)
T ss_dssp             SSBSSSS
T ss_pred             EECCcCC
Confidence            3444444


No 136
>PRK14873 primosome assembly protein PriA; Provisional
Probab=44.81  E-value=16  Score=36.24  Aligned_cols=8  Identities=25%  Similarity=0.953  Sum_probs=4.6

Q ss_pred             ecCcCccc
Q psy8050         145 RCEDCNKS  152 (311)
Q Consensus       145 ~C~~C~~~  152 (311)
                      .|..||..
T Consensus       385 ~C~~Cg~~  392 (665)
T PRK14873        385 ACARCRTP  392 (665)
T ss_pred             EhhhCcCe
Confidence            46666654


No 137
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=44.22  E-value=20  Score=25.20  Aligned_cols=14  Identities=21%  Similarity=-0.026  Sum_probs=8.4

Q ss_pred             ceeeccCCCCcCCC
Q psy8050         114 LILIQFIPGKTYAR  127 (311)
Q Consensus       114 ~~~~C~~C~~~f~~  127 (311)
                      ..++|..||..|.-
T Consensus        52 GIW~C~kCg~~fAG   65 (89)
T COG1997          52 GIWKCRKCGAKFAG   65 (89)
T ss_pred             CeEEcCCCCCeecc
Confidence            35666666666643


No 138
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=43.96  E-value=18  Score=26.50  Aligned_cols=32  Identities=19%  Similarity=0.199  Sum_probs=22.1

Q ss_pred             CcceeeccCCCCcCCChhHHHHHHhhhCCCCCeecCcCccccC
Q psy8050         112 IRLILIQFIPGKTYARPSTLKTHLRTHSGEKPYRCEDCNKSFS  154 (311)
Q Consensus       112 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~  154 (311)
                      ....-+|+.||+.|..           -...|..|+.||++|.
T Consensus         6 LGtKridPetg~KFYD-----------LNrdPiVsPytG~s~P   37 (129)
T COG4530           6 LGTKRIDPETGKKFYD-----------LNRDPIVSPYTGKSYP   37 (129)
T ss_pred             ccccccCccccchhhc-----------cCCCccccCcccccch
Confidence            3445577888888743           2446788888888884


No 139
>PF00301 Rubredoxin:  Rubredoxin;  InterPro: IPR004039 Rubredoxin is a low molecular weight iron-containing bacterial protein involved in electron transfer [, ], sometimes replacing ferredoxin as an electron carrier []. The 3-D structures of a number of rubredoxins have been solved [, ]. The fold belongs to the alpha+beta class, with 2 alpha-helices and 2-3 beta-strands. Its active site contains an iron ion which is co-ordinated by the sulphurs of four conserved cysteine residues forming an almost regular tetrahedron. The conserved cysteines reside on two loops, which are the most conserved regions of the protein. In addition, a ring of acidic residues in the proximity of the [Fe(Cys)4] centre is also well-conserved []. ; GO: 0009055 electron carrier activity, 0046872 metal ion binding; PDB: 2RDV_C 1RDV_A 1S24_A 1T9O_B 1B2J_A 1SMW_A 2PVE_B 1BFY_A 1T9P_C 1C09_C ....
Probab=43.19  E-value=23  Score=21.80  Aligned_cols=13  Identities=23%  Similarity=1.081  Sum_probs=7.2

Q ss_pred             eecCCCCccccCc
Q psy8050         172 FRCPICDRRFSQS  184 (311)
Q Consensus       172 ~~C~~C~~~f~~~  184 (311)
                      |+|..|+..+...
T Consensus         2 y~C~~CgyvYd~~   14 (47)
T PF00301_consen    2 YQCPVCGYVYDPE   14 (47)
T ss_dssp             EEETTTSBEEETT
T ss_pred             cCCCCCCEEEcCC
Confidence            4566666555443


No 140
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=41.42  E-value=7.3  Score=26.46  Aligned_cols=17  Identities=24%  Similarity=0.518  Sum_probs=8.5

Q ss_pred             CceecC--CCCccccCcch
Q psy8050         170 KPFRCP--ICDRRFSQSSS  186 (311)
Q Consensus       170 ~~~~C~--~C~~~f~~~~~  186 (311)
                      .-++|.  .||.+|.....
T Consensus        26 ~Y~qC~N~eCg~tF~t~es   44 (72)
T PRK09678         26 RYHQCQNVNCSATFITYES   44 (72)
T ss_pred             eeeecCCCCCCCEEEEEEE
Confidence            335554  55555554443


No 141
>PF01783 Ribosomal_L32p:  Ribosomal L32p protein family;  InterPro: IPR002677 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits.  Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. Ribosomal protein L32p is part of the 50S ribosomal subunit. This family is found in both prokaryotes and eukaryotes. Ribosomal protein L32 of yeast binds to and regulates the splicing and the translation of the transcript of its own gene [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0015934 large ribosomal subunit; PDB: 3PYT_2 3F1F_5 3PYV_2 3D5B_5 3MRZ_2 3D5D_5 3F1H_5 1VSP_Y 3PYR_2 3MS1_2 ....
Probab=41.37  E-value=9  Score=24.56  Aligned_cols=31  Identities=29%  Similarity=0.563  Sum_probs=17.6

Q ss_pred             hhhHHHHHHHHHHHHhhhhcCccccCcchhhhccCCCCCCCC
Q psy8050           2 KKKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSG   43 (311)
Q Consensus         2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~c~   43 (311)
                      +|+|.|+-.+++|+.-|+.+-.           +...|+.||
T Consensus         3 PKrk~Sksr~~~Rrs~~~l~~~-----------~l~~c~~cg   33 (56)
T PF01783_consen    3 PKRKTSKSRKRMRRSHWKLKAP-----------NLVKCPNCG   33 (56)
T ss_dssp             -SS-SCHHHHHHHTTTTS--TT-----------SEEESSSSS
T ss_pred             CCCcCChhHccchhcccccccc-----------ceeeeccCC
Confidence            3566677777777766642221           347788888


No 142
>KOG2593|consensus
Probab=39.87  E-value=36  Score=31.45  Aligned_cols=39  Identities=21%  Similarity=0.534  Sum_probs=24.5

Q ss_pred             CCCCCeecCcCccccCChhHHHHHHHHhcCCCceecCCCCcc
Q psy8050         139 SGEKPYRCEDCNKSFSQAANLTAHVRTHSGEKPFRCPICDRR  180 (311)
Q Consensus       139 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~  180 (311)
                      +....|.|+.|.+.|.....+   +..-...-.|.|..|+-.
T Consensus       124 t~~~~Y~Cp~C~kkyt~Lea~---~L~~~~~~~F~C~~C~ge  162 (436)
T KOG2593|consen  124 TNVAGYVCPNCQKKYTSLEAL---QLLDNETGEFHCENCGGE  162 (436)
T ss_pred             cccccccCCccccchhhhHHH---HhhcccCceEEEecCCCc
Confidence            444568888888888654443   333323345888888754


No 143
>PRK04023 DNA polymerase II large subunit; Validated
Probab=37.81  E-value=30  Score=35.63  Aligned_cols=9  Identities=33%  Similarity=0.785  Sum_probs=5.5

Q ss_pred             eecCcCccc
Q psy8050         144 YRCEDCNKS  152 (311)
Q Consensus       144 ~~C~~C~~~  152 (311)
                      +.|+.||..
T Consensus       639 frCP~CG~~  647 (1121)
T PRK04023        639 RRCPFCGTH  647 (1121)
T ss_pred             ccCCCCCCC
Confidence            566666654


No 144
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=37.73  E-value=25  Score=33.71  Aligned_cols=10  Identities=30%  Similarity=0.747  Sum_probs=5.9

Q ss_pred             cccccccccc
Q psy8050         198 RPYRCRLCKK  207 (311)
Q Consensus       198 k~~~C~~C~~  207 (311)
                      .+..|+.|+.
T Consensus       252 ~~~~Cp~C~s  261 (505)
T TIGR00595       252 IPKTCPQCGS  261 (505)
T ss_pred             CCCCCCCCCC
Confidence            3456666664


No 145
>PF05443 ROS_MUCR:  ROS/MUCR transcriptional regulator protein;  InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=37.57  E-value=23  Score=27.27  Aligned_cols=25  Identities=32%  Similarity=0.537  Sum_probs=14.6

Q ss_pred             CeecCccccccCChHHHHHHHHHhCCCC
Q psy8050         227 PYQCKLCLLRFSQSGNLNRHMRVHGSQY  254 (311)
Q Consensus       227 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~  254 (311)
                      -..|-+||+.|..   |.+|++.|+|-.
T Consensus        72 ~i~clecGk~~k~---LkrHL~~~~glt   96 (132)
T PF05443_consen   72 YIICLECGKKFKT---LKRHLRTHHGLT   96 (132)
T ss_dssp             -EE-TBT--EESB---HHHHHHHTT-S-
T ss_pred             eeEEccCCcccch---HHHHHHHccCCC
Confidence            3678888888875   488888886643


No 146
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=37.45  E-value=46  Score=29.37  Aligned_cols=79  Identities=20%  Similarity=0.459  Sum_probs=46.3

Q ss_pred             CCeecCcCccccCChhHHHHHHHHhcC----------C--CceecCCCCccccCcchhhhhhhhcccccccccccccccc
Q psy8050         142 KPYRCEDCNKSFSQAANLTAHVRTHSG----------E--KPFRCPICDRRFSQSSSVTTHMRTHSGERPYRCRLCKKAF  209 (311)
Q Consensus       142 ~~~~C~~C~~~f~~~~~L~~H~~~h~~----------~--~~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f  209 (311)
                      .|..|+.|.-.......|.+-...--.          +  +.--|-.|.-.|.....-..-  .-+....|.|+.|...|
T Consensus       321 LPi~CP~Csl~LilsthLarSyhhL~PLk~f~E~p~~~~~ks~~Cf~CQ~~fp~~~~~~~~--~~~ss~rY~Ce~CK~~F  398 (421)
T COG5151         321 LPISCPICSLQLILSTHLARSYHHLYPLKPFVEKPEGTNPKSTHCFVCQGPFPKPPVSPFD--ESTSSGRYQCELCKSTF  398 (421)
T ss_pred             CCccCcchhHHHHHHHHHHHHHHhhccCcccccccCCCCCCCccceeccCCCCCCCCCccc--ccccccceechhhhhhh
Confidence            467788888776555555443221111          1  112366676666544221111  11234569999999999


Q ss_pred             CChHHHHHHhhhh
Q psy8050         210 SDSSTLTKHLRIH  222 (311)
Q Consensus       210 ~~~~~l~~H~~~h  222 (311)
                      -..-..-.|...|
T Consensus       399 C~dCdvfiHe~Lh  411 (421)
T COG5151         399 CSDCDVFIHETLH  411 (421)
T ss_pred             hhhhHHHHHHHHh
Confidence            9988888887766


No 147
>KOG2071|consensus
Probab=36.80  E-value=31  Score=33.26  Aligned_cols=27  Identities=22%  Similarity=0.689  Sum_probs=21.4

Q ss_pred             CCCeecCcCccccCChhHHHHHHHHhc
Q psy8050         141 EKPYRCEDCNKSFSQAANLTAHVRTHS  167 (311)
Q Consensus       141 ~~~~~C~~C~~~f~~~~~L~~H~~~h~  167 (311)
                      ..+-.|..||..|........||..|.
T Consensus       416 ~~pnqC~~CG~R~~~~ee~sk~md~H~  442 (579)
T KOG2071|consen  416 DSPNQCKSCGLRFDDSEERSKHMDIHD  442 (579)
T ss_pred             CCcchhcccccccccchhhhhHhhhhh
Confidence            456789999999998888877777663


No 148
>PF13878 zf-C2H2_3:  zinc-finger of acetyl-transferase ESCO
Probab=36.61  E-value=38  Score=20.08  Aligned_cols=24  Identities=21%  Similarity=0.456  Sum_probs=14.9

Q ss_pred             eecCccccccC--ChHHHHHHHHHhC
Q psy8050         228 YQCKLCLLRFS--QSGNLNRHMRVHG  251 (311)
Q Consensus       228 ~~C~~C~~~f~--~~~~L~~H~~~H~  251 (311)
                      -.|+.||..|.  ...+-..|.+.|.
T Consensus        14 ~~C~~CgM~Y~~~~~eD~~~H~~yH~   39 (41)
T PF13878_consen   14 TTCPTCGMLYSPGSPEDEKLHKKYHD   39 (41)
T ss_pred             cCCCCCCCEECCCCHHHHHHHHHHHh
Confidence            46777776665  3555666766653


No 149
>KOG4167|consensus
Probab=33.88  E-value=11  Score=36.94  Aligned_cols=27  Identities=30%  Similarity=0.664  Sum_probs=23.9

Q ss_pred             CCCeecCcCccccCChhHHHHHHHHhc
Q psy8050         141 EKPYRCEDCNKSFSQAANLTAHVRTHS  167 (311)
Q Consensus       141 ~~~~~C~~C~~~f~~~~~L~~H~~~h~  167 (311)
                      ...|.|..|++.|....++..||..|.
T Consensus       790 ~giFpCreC~kvF~KiKSrNAHMK~Hr  816 (907)
T KOG4167|consen  790 TGIFPCRECGKVFFKIKSRNAHMKTHR  816 (907)
T ss_pred             CceeehHHHHHHHHHHhhhhHHHHHHH
Confidence            356999999999999999999998884


No 150
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=32.45  E-value=29  Score=26.80  Aligned_cols=11  Identities=27%  Similarity=1.075  Sum_probs=5.1

Q ss_pred             eecCcCccccC
Q psy8050         144 YRCEDCNKSFS  154 (311)
Q Consensus       144 ~~C~~C~~~f~  154 (311)
                      +.|..||..|.
T Consensus        71 ~~C~~CG~~~~   81 (135)
T PRK03824         71 LKCRNCGNEWS   81 (135)
T ss_pred             EECCCCCCEEe
Confidence            44444544443


No 151
>COG3677 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Probab=31.12  E-value=64  Score=24.67  Aligned_cols=17  Identities=24%  Similarity=0.464  Sum_probs=12.1

Q ss_pred             ccCCCCCCCCccccccC
Q psy8050         291 WIDGRSTMCGHSFIVKL  307 (311)
Q Consensus       291 ~~~~kC~~C~~~f~~~~  307 (311)
                      ++.|+|..|++.|....
T Consensus        51 ~qRyrC~~C~~tf~~~~   67 (129)
T COG3677          51 HQRYKCKSCGSTFTVET   67 (129)
T ss_pred             ccccccCCcCcceeeec
Confidence            44557888888887643


No 152
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=30.52  E-value=56  Score=25.05  Aligned_cols=25  Identities=40%  Similarity=0.581  Sum_probs=16.8

Q ss_pred             cceeeccCCCCcCCChhHHHHHHhhhCC
Q psy8050         113 RLILIQFIPGKTYARPSTLKTHLRTHSG  140 (311)
Q Consensus       113 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~  140 (311)
                      ...++|.++|+.|.   +|++|+.+|.+
T Consensus        74 pD~IicLEDGkkfK---SLKRHL~t~~g   98 (148)
T COG4957          74 PDYIICLEDGKKFK---SLKRHLTTHYG   98 (148)
T ss_pred             CCeEEEeccCcchH---HHHHHHhcccC
Confidence            34567777777774   56777777665


No 153
>KOG4377|consensus
Probab=30.16  E-value=37  Score=31.11  Aligned_cols=100  Identities=20%  Similarity=0.376  Sum_probs=62.2

Q ss_pred             cCCCCcCCChhHHHHHHhhhCCC------------CCeec--CcCccccCChhHHHHHHHHhcCC-------CceecCCC
Q psy8050         119 FIPGKTYARPSTLKTHLRTHSGE------------KPYRC--EDCNKSFSQAANLTAHVRTHSGE-------KPFRCPIC  177 (311)
Q Consensus       119 ~~C~~~f~~~~~L~~H~~~h~~~------------~~~~C--~~C~~~f~~~~~L~~H~~~h~~~-------~~~~C~~C  177 (311)
                      ..|+..+.++..+..|..+|-..            ..|.|  ..|.+   +-++...|-..|+..       ..|.|.-|
T Consensus       277 e~C~ykr~~k~DvirH~~~hkkrdnsL~dgf~rfs~syhC~~~~C~k---sTsdV~~h~nFht~~~n~GfrrthfhC~r~  353 (480)
T KOG4377|consen  277 EYCFYKRGQKNDVIRHVEIHKKRDNSLIDGFHRFSNSYHCTGQICEK---STSDVLLHDNFHTDKRNNGFRRTHFHCQRI  353 (480)
T ss_pred             ccccccccchhhhHHHHHHHhhcccccccchhhcCccchhhhcccCc---ccccccccCccccccccCceecceeEEecc
Confidence            45888888888999998877432            22567  46877   455666666655432       23778777


Q ss_pred             CccccCcchhhhhhhhcc----cc------------------------ccccccc--cccccCChHHHHHHhhhhc
Q psy8050         178 DRRFSQSSSVTTHMRTHS----GE------------------------RPYRCRL--CKKAFSDSSTLTKHLRIHS  223 (311)
Q Consensus       178 ~~~f~~~~~l~~H~~~h~----~~------------------------k~~~C~~--C~~~f~~~~~l~~H~~~h~  223 (311)
                      +-++..+  ...|...|.    ++                        ..|-|..  |+..+.+.+.+..|.+.|.
T Consensus       354 gCTdtfK--~~khk~yh~kdda~~~dGfkkf~k~e~cay~gCkys~~cnhfhc~r~Gc~~tl~s~sqm~shkrkhe  427 (480)
T KOG4377|consen  354 GCTDTFK--DSKHKPYHYKDDAGEIDGFKKFFKDENCAYTGCKYSGICNHFHCDRLGCEATLYSVSQMASHKRKHE  427 (480)
T ss_pred             CCccccc--cccccccccCcchhhhhhhhhhhccccCCccCcccccceeeeeecccCCceEEEehhhhhhhhhhhh
Confidence            7333333  334444432    11                        1133433  8888999999998888774


No 154
>KOG0782|consensus
Probab=29.63  E-value=13  Score=35.44  Aligned_cols=53  Identities=17%  Similarity=0.253  Sum_probs=29.1

Q ss_pred             hHHHHHHhhhhcCCCCeecCccccccCChHHHHHHHHHhCCCCCCCCcccccccccccccccccc
Q psy8050         212 SSTLTKHLRIHSGEKPYQCKLCLLRFSQSGNLNRHMRVHGSQYNMLPVIYMGVNQVCSNIVDQRR  276 (311)
Q Consensus       212 ~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~~~~~~~~C~~C~~~f~~~~  276 (311)
                      .+.|.+|-..|.....=+|..||+.|..+-.+      |..+.      ..+.|.+|...|..+.
T Consensus       238 E~~fvrHHWVHrrRqeGkC~~CgKgFQQKf~F------hsKEi------vAisCSWCKqayH~Kv  290 (1004)
T KOG0782|consen  238 ESGFVRHHWVHRRRQEGKCNTCGKGFQQKFFF------HSKEI------VAISCSWCKQAYHLKV  290 (1004)
T ss_pred             cccchHHhHhhHhhhccccchhhhhhhhheee------ccccE------EEEEehHHHHHhhcch
Confidence            34566666555443334677777777654332      33321      2235777777776665


No 155
>PHA00626 hypothetical protein
Probab=29.31  E-value=28  Score=22.21  Aligned_cols=15  Identities=27%  Similarity=0.399  Sum_probs=12.1

Q ss_pred             CCeecCccccccCCh
Q psy8050         226 KPYQCKLCLLRFSQS  240 (311)
Q Consensus       226 ~~~~C~~C~~~f~~~  240 (311)
                      ..|.|+.||..|+..
T Consensus        22 nrYkCkdCGY~ft~~   36 (59)
T PHA00626         22 DDYVCCDCGYNDSKD   36 (59)
T ss_pred             cceEcCCCCCeechh
Confidence            459999999999743


No 156
>PF14353 CpXC:  CpXC protein
Probab=29.17  E-value=16  Score=27.75  Aligned_cols=17  Identities=24%  Similarity=0.696  Sum_probs=10.8

Q ss_pred             CeecCcCccccCChhHH
Q psy8050         143 PYRCEDCNKSFSQAANL  159 (311)
Q Consensus       143 ~~~C~~C~~~f~~~~~L  159 (311)
                      .|.|+.||..|.-...+
T Consensus        38 ~~~CP~Cg~~~~~~~p~   54 (128)
T PF14353_consen   38 SFTCPSCGHKFRLEYPL   54 (128)
T ss_pred             EEECCCCCCceecCCCE
Confidence            47777777777554443


No 157
>KOG0847|consensus
Probab=29.16  E-value=22  Score=29.54  Aligned_cols=20  Identities=30%  Similarity=0.391  Sum_probs=15.1

Q ss_pred             HHHHhhhhcCccccCcchhh
Q psy8050          13 KKKKKRKKKGILANSSSKEK   32 (311)
Q Consensus        13 ~~~~~~~~~~~~~~~~~~~~   32 (311)
                      -||.||||||-.+..++..+
T Consensus       217 NRRTKWRKkhAaEmasakkk  236 (288)
T KOG0847|consen  217 NRRTKWRKKHAAEMASAKKK  236 (288)
T ss_pred             cchhhhhhhhccchhhcccc
Confidence            48899999998766665444


No 158
>PF07754 DUF1610:  Domain of unknown function (DUF1610);  InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=29.01  E-value=26  Score=18.24  Aligned_cols=10  Identities=30%  Similarity=0.630  Sum_probs=8.6

Q ss_pred             ccCCCCCCCC
Q psy8050          34 QDRSECPGSG   43 (311)
Q Consensus        34 ~~~~~C~~c~   43 (311)
                      ..+|.||.||
T Consensus        14 ~v~f~CPnCG   23 (24)
T PF07754_consen   14 AVPFPCPNCG   23 (24)
T ss_pred             CceEeCCCCC
Confidence            5679999998


No 159
>PF04959 ARS2:  Arsenite-resistance protein 2;  InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=28.97  E-value=29  Score=29.07  Aligned_cols=25  Identities=32%  Similarity=0.651  Sum_probs=12.2

Q ss_pred             cccccccccccCChHHHHHHhhhhc
Q psy8050         199 PYRCRLCKKAFSDSSTLTKHLRIHS  223 (311)
Q Consensus       199 ~~~C~~C~~~f~~~~~l~~H~~~h~  223 (311)
                      .|.|.+|++.|....-...|+..-|
T Consensus        77 K~~C~lc~KlFkg~eFV~KHI~nKH  101 (214)
T PF04959_consen   77 KWRCPLCGKLFKGPEFVRKHIFNKH  101 (214)
T ss_dssp             EEEE-SSS-EESSHHHHHHHHHHH-
T ss_pred             EECCCCCCcccCChHHHHHHHhhcC
Confidence            3555555555555555555555433


No 160
>PF09845 DUF2072:  Zn-ribbon containing protein (DUF2072);  InterPro: IPR018645  This archaeal Zinc-ribbon containing proteins have no known function. 
Probab=28.86  E-value=29  Score=26.48  Aligned_cols=14  Identities=21%  Similarity=0.821  Sum_probs=7.7

Q ss_pred             eecCcCccccCChh
Q psy8050         144 YRCEDCNKSFSQAA  157 (311)
Q Consensus       144 ~~C~~C~~~f~~~~  157 (311)
                      ++|..||+.|..-+
T Consensus         2 H~Ct~Cg~~f~dgs   15 (131)
T PF09845_consen    2 HQCTKCGRVFEDGS   15 (131)
T ss_pred             cccCcCCCCcCCCc
Confidence            45666666665443


No 161
>PF14122 YokU:  YokU-like protein
Probab=28.83  E-value=52  Score=23.07  Aligned_cols=21  Identities=14%  Similarity=0.039  Sum_probs=11.4

Q ss_pred             CCcceeeccCCCCcCCChhHH
Q psy8050         111 GIRLILIQFIPGKTYARPSTL  131 (311)
Q Consensus       111 ~~~~~~~C~~C~~~f~~~~~L  131 (311)
                      ...+...|..||.+|-....+
T Consensus        31 ~~tP~i~C~~CgmvYq~d~vi   51 (87)
T PF14122_consen   31 TDTPAIICSNCGMVYQDDEVI   51 (87)
T ss_pred             cCCceeeecCCCcEEehhHHH
Confidence            334455666666666554443


No 162
>PF05495 zf-CHY:  CHY zinc finger;  InterPro: IPR008913 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  Pirh2 is an eukaryotic ubiquitin protein ligase, which has been shown to promote p53 degradation in mammals. Pirh2 physically interacts with p53 and promotes ubiquitination of p53 independently of MDM2. Like MDM2, Pirh2 is thought to participate in an autoregulatory feedback loop that controls p53 function. Pirh2 proteins contain three distinct zinc fingers, the CHY-type, the CTCHY-type which is C-terminal to the CHY-type zinc finger and a RING finger. The CHY-type zinc finger has no currently known function []. As well as Pirh2, the CHY-type zinc finger is also found in the following proteins:   Yeast helper of Tim protein 13. Hot13 may have a role in the assembly and recycling of the small Tims, a complex of the mitochondrial intermembrane space that participates in the TIM22 import pathway for assembly of the inner membrane [] Several plant hypothetical proteins that also contain haemerythrin cation binding domains Several protozoan hypothetical proteins that also contain a Myb domain   The solution structure of this zinc finger has been solved and binds 3 zinc atoms as shown in the following schematic representation:   ++---------+-----+ || | | CXHYxxxxxxxxxCCxxxxxCxxCHxxxxxHxxxxxxxxxxxCxxCxxxxxxxxxCxxC | | | | | | | | +-+-----------------+--+ +--+---------+--+ 'C': conserved cysteine involved in the binding of one zinc atom. 'H': conserved histidine involved in the binding of one zinc atom.  More information about these proteins can be found at Protein of the Month: Zinc Fingers []; GO: 0008270 zinc ion binding; PDB: 2DKT_A 2K2C_A.
Probab=28.61  E-value=20  Score=24.16  Aligned_cols=9  Identities=33%  Similarity=0.685  Sum_probs=3.6

Q ss_pred             ecCcCcccc
Q psy8050         145 RCEDCNKSF  153 (311)
Q Consensus       145 ~C~~C~~~f  153 (311)
                      +++-|++.|
T Consensus        12 ~~~cC~~~y   20 (71)
T PF05495_consen   12 RFPCCGKYY   20 (71)
T ss_dssp             EETTTTEEE
T ss_pred             ECCcccCee
Confidence            334444433


No 163
>PF15135 UPF0515:  Uncharacterised protein UPF0515
Probab=28.18  E-value=53  Score=28.04  Aligned_cols=12  Identities=17%  Similarity=0.991  Sum_probs=7.8

Q ss_pred             CCeecCcCcccc
Q psy8050         142 KPYRCEDCNKSF  153 (311)
Q Consensus       142 ~~~~C~~C~~~f  153 (311)
                      ..|.|..|+..+
T Consensus       111 rqFaC~~Cd~~W  122 (278)
T PF15135_consen  111 RQFACSSCDHMW  122 (278)
T ss_pred             eeeeccccchHH
Confidence            557777776543


No 164
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=27.76  E-value=37  Score=24.05  Aligned_cols=13  Identities=38%  Similarity=0.904  Sum_probs=6.1

Q ss_pred             ceecCCCCccccC
Q psy8050         171 PFRCPICDRRFSQ  183 (311)
Q Consensus       171 ~~~C~~C~~~f~~  183 (311)
                      |-.|..||+.|..
T Consensus        58 Pa~CkkCGfef~~   70 (97)
T COG3357          58 PARCKKCGFEFRD   70 (97)
T ss_pred             ChhhcccCccccc
Confidence            3445555555443


No 165
>KOG2593|consensus
Probab=27.56  E-value=39  Score=31.27  Aligned_cols=36  Identities=17%  Similarity=0.279  Sum_probs=19.6

Q ss_pred             ceeeccCCCCcCCChhHHHHHHhhhCCCCCeecCcCccc
Q psy8050         114 LILIQFIPGKTYARPSTLKTHLRTHSGEKPYRCEDCNKS  152 (311)
Q Consensus       114 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~  152 (311)
                      ..|.|+.|++.|+....++.   .-.....|.|..|+-.
T Consensus       127 ~~Y~Cp~C~kkyt~Lea~~L---~~~~~~~F~C~~C~ge  162 (436)
T KOG2593|consen  127 AGYVCPNCQKKYTSLEALQL---LDNETGEFHCENCGGE  162 (436)
T ss_pred             ccccCCccccchhhhHHHHh---hcccCceEEEecCCCc
Confidence            44566666666666554442   2222345777777643


No 166
>KOG1280|consensus
Probab=27.27  E-value=52  Score=29.52  Aligned_cols=36  Identities=19%  Similarity=0.422  Sum_probs=19.9

Q ss_pred             cccccccccccCChHHHHHHhhhhcCCCC--eecCccc
Q psy8050         199 PYRCRLCKKAFSDSSTLTKHLRIHSGEKP--YQCKLCL  234 (311)
Q Consensus       199 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~--~~C~~C~  234 (311)
                      .|.|++|+..-.+...|..|....+.+-+  ..|++|+
T Consensus        79 SftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c~  116 (381)
T KOG1280|consen   79 SFTCPYCGIMGFTERQFGTHVLSQHPEASTSVICPLCA  116 (381)
T ss_pred             cccCCcccccccchhHHHHHhhhcCcccCcceeeeccc
Confidence            46666666666666666666555444433  2355554


No 167
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=27.19  E-value=29  Score=24.64  Aligned_cols=10  Identities=50%  Similarity=1.238  Sum_probs=4.6

Q ss_pred             cccccccccc
Q psy8050         200 YRCRLCKKAF  209 (311)
Q Consensus       200 ~~C~~C~~~f  209 (311)
                      |.|..|++.|
T Consensus        55 W~C~~C~~~~   64 (90)
T PTZ00255         55 WRCKGCKKTV   64 (90)
T ss_pred             EEcCCCCCEE
Confidence            4444444444


No 168
>PF04959 ARS2:  Arsenite-resistance protein 2;  InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=27.15  E-value=23  Score=29.65  Aligned_cols=30  Identities=30%  Similarity=0.710  Sum_probs=21.1

Q ss_pred             CCceecCCCCccccCcchhhhhhhhccccc
Q psy8050         169 EKPFRCPICDRRFSQSSSVTTHMRTHSGER  198 (311)
Q Consensus       169 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~k  198 (311)
                      +..|.|..|++.|.-.....+|+..-|.++
T Consensus        75 ~~K~~C~lc~KlFkg~eFV~KHI~nKH~e~  104 (214)
T PF04959_consen   75 EDKWRCPLCGKLFKGPEFVRKHIFNKHPEK  104 (214)
T ss_dssp             SEEEEE-SSS-EESSHHHHHHHHHHH-HHH
T ss_pred             CCEECCCCCCcccCChHHHHHHHhhcCHHH
Confidence            345888888888888888888888777664


No 169
>KOG2906|consensus
Probab=27.09  E-value=12  Score=26.83  Aligned_cols=13  Identities=0%  Similarity=-0.123  Sum_probs=7.4

Q ss_pred             CCCCCCCCCCCCC
Q psy8050          78 APYQVGAPQQFPT   90 (311)
Q Consensus        78 ~~~~C~~c~~~~~   90 (311)
                      ..|.|..|++.+.
T Consensus        20 ~rf~C~tCpY~~~   32 (105)
T KOG2906|consen   20 NRFSCRTCPYVFP   32 (105)
T ss_pred             eeEEcCCCCceee
Confidence            3456666665543


No 170
>PF01363 FYVE:  FYVE zinc finger;  InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=26.85  E-value=35  Score=22.52  Aligned_cols=9  Identities=11%  Similarity=0.327  Sum_probs=3.3

Q ss_pred             ccccccccc
Q psy8050         264 VNQVCSNIV  272 (311)
Q Consensus       264 ~C~~C~~~f  272 (311)
                      .|.+|+..|
T Consensus        11 ~C~~C~~~F   19 (69)
T PF01363_consen   11 NCMICGKKF   19 (69)
T ss_dssp             B-TTT--B-
T ss_pred             cCcCcCCcC
Confidence            466777776


No 171
>COG1779 C4-type Zn-finger protein [General function prediction only]
Probab=26.69  E-value=20  Score=29.45  Aligned_cols=11  Identities=18%  Similarity=0.175  Sum_probs=5.9

Q ss_pred             CCCCCCccccc
Q psy8050         295 RSTMCGHSFIV  305 (311)
Q Consensus       295 kC~~C~~~f~~  305 (311)
                      .|..||+++..
T Consensus        45 ~C~~CgYR~~D   55 (201)
T COG1779          45 VCERCGYRSTD   55 (201)
T ss_pred             EccccCCcccc
Confidence            45555555543


No 172
>PF01780 Ribosomal_L37ae:  Ribosomal L37ae protein family;  InterPro: IPR002674 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits.  Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This ribosomal protein is found in archaebacteria and eukaryotes []. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A1E_Y 4A17_Y 4A1C_Y 4A1A_Y 3O58_g 3IZS_m 3O5H_g 1S1I_9 3IZR_m 1YSH_D ....
Probab=26.51  E-value=25  Score=25.01  Aligned_cols=10  Identities=40%  Similarity=1.301  Sum_probs=5.3

Q ss_pred             cccccccccc
Q psy8050         200 YRCRLCKKAF  209 (311)
Q Consensus       200 ~~C~~C~~~f  209 (311)
                      |.|..|++.|
T Consensus        54 W~C~~C~~~~   63 (90)
T PF01780_consen   54 WKCKKCGKKF   63 (90)
T ss_dssp             EEETTTTEEE
T ss_pred             eecCCCCCEE
Confidence            5555555544


No 173
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=26.42  E-value=41  Score=21.40  Aligned_cols=9  Identities=11%  Similarity=0.195  Sum_probs=4.1

Q ss_pred             ccccccccc
Q psy8050         264 VNQVCSNIV  272 (311)
Q Consensus       264 ~C~~C~~~f  272 (311)
                      .|+.|+..+
T Consensus         4 ~CP~CG~~i   12 (54)
T TIGR01206         4 ECPDCGAEI   12 (54)
T ss_pred             CCCCCCCEE
Confidence            345554443


No 174
>KOG4167|consensus
Probab=25.43  E-value=21  Score=35.19  Aligned_cols=31  Identities=19%  Similarity=0.203  Sum_probs=26.9

Q ss_pred             CCCcceeeccCCCCcCCChhHHHHHHhhhCC
Q psy8050         110 SGIRLILIQFIPGKTYARPSTLKTHLRTHSG  140 (311)
Q Consensus       110 ~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~  140 (311)
                      .+....|-|.+|++.|....++..||+.|..
T Consensus       787 ~~~~giFpCreC~kvF~KiKSrNAHMK~Hr~  817 (907)
T KOG4167|consen  787 TDPTGIFPCRECGKVFFKIKSRNAHMKTHRQ  817 (907)
T ss_pred             CCCCceeehHHHHHHHHHHhhhhHHHHHHHH
Confidence            3456789999999999999999999998854


No 175
>PF01096 TFIIS_C:  Transcription factor S-II (TFIIS);  InterPro: IPR001222 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents a zinc finger motif found in transcription factor IIs (TFIIS). In eukaryotes the initiation of transcription of protein encoding genes by polymerase II (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least eight different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, -IIH and -IIS []. During mRNA elongation, Pol II can encounter DNA sequences that cause reverse movement of the enzyme. Such backtracking involves extrusion of the RNA 3'-end into the pore, and can lead to transcriptional arrest. Escape from arrest requires cleavage of the extruded RNA with the help of TFIIS, which induces mRNA cleavage by enhancing the intrinsic nuclease activity of RNA polymerase (Pol) II, past template-encoded pause sites []. TFIIS extends from the polymerase surface via a pore to the internal active site. Two essential and invariant acidic residues in a TFIIS loop complement the Pol II active site and could position a metal ion and a water molecule for hydrolytic RNA cleavage. TFIIS also induces extensive structural changes in Pol II that would realign nucleic acids in the active centre.  TFIIS is a protein of about 300 amino acids. It contains three regions: a variable N-terminal domain not required for TFIIS activity; a conserved central domain required for Pol II binding; and a conserved C-terminal C4-type zinc finger essential for RNA cleavage. The zinc finger folds in a conformation termed a zinc ribbon [] characterised by a three-stranded antiparallel beta-sheet and two beta-hairpins. A backbone model for Pol II-TFIIS complex was obtained from X-ray analysis. It shows that a beta hairpin protrudes from the zinc finger and complements the pol II active site [].  Some viral proteins also contain the TFIIS zinc ribbon C-terminal domain. The Vaccinia virus protein, unlike its eukaryotic homologue, is an integral RNA polymerase subunit rather than a readily separable transcription factor []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding, 0006351 transcription, DNA-dependent; PDB: 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I 3I4M_I ....
Probab=25.18  E-value=15  Score=21.50  Aligned_cols=6  Identities=50%  Similarity=1.619  Sum_probs=1.8

Q ss_pred             cCCCCc
Q psy8050         174 CPICDR  179 (311)
Q Consensus       174 C~~C~~  179 (311)
                      |+.||.
T Consensus         3 Cp~Cg~    8 (39)
T PF01096_consen    3 CPKCGH    8 (39)
T ss_dssp             -SSS-S
T ss_pred             CcCCCC
Confidence            444443


No 176
>PF13453 zf-TFIIB:  Transcription factor zinc-finger
Probab=25.02  E-value=45  Score=19.60  Aligned_cols=17  Identities=24%  Similarity=0.493  Sum_probs=9.0

Q ss_pred             eecCcCccccCChhHHH
Q psy8050         144 YRCEDCNKSFSQAANLT  160 (311)
Q Consensus       144 ~~C~~C~~~f~~~~~L~  160 (311)
                      ..|+.|+-.|.....|.
T Consensus        20 d~C~~C~G~W~d~~el~   36 (41)
T PF13453_consen   20 DVCPSCGGIWFDAGELE   36 (41)
T ss_pred             EECCCCCeEEccHHHHH
Confidence            45555555555555443


No 177
>PF10058 DUF2296:  Predicted integral membrane metal-binding protein (DUF2296);  InterPro: IPR019273  This domain, found mainly in the eukaryotic lunapark proteins, has no known function []. 
Probab=24.60  E-value=77  Score=20.12  Aligned_cols=9  Identities=44%  Similarity=1.184  Sum_probs=5.5

Q ss_pred             CeecCcCcc
Q psy8050         143 PYRCEDCNK  151 (311)
Q Consensus       143 ~~~C~~C~~  151 (311)
                      .|.|..|+.
T Consensus        44 ~y~C~~Cg~   52 (54)
T PF10058_consen   44 QYRCPYCGA   52 (54)
T ss_pred             EEEcCCCCC
Confidence            466666653


No 178
>TIGR00280 L37a ribosomal protein L37a. This model finds eukaryotic ribosomal protein L37a and its archaeal orthologs. The nomeclature is tricky because eukaryotes have proteins called both L37 and L37a.
Probab=24.54  E-value=31  Score=24.53  Aligned_cols=11  Identities=36%  Similarity=1.081  Sum_probs=5.1

Q ss_pred             ccccccccccC
Q psy8050         200 YRCRLCKKAFS  210 (311)
Q Consensus       200 ~~C~~C~~~f~  210 (311)
                      |.|..|++.|.
T Consensus        54 W~C~~C~~~~A   64 (91)
T TIGR00280        54 WTCRKCGAKFA   64 (91)
T ss_pred             EEcCCCCCEEe
Confidence            44444444443


No 179
>COG0333 RpmF Ribosomal protein L32 [Translation, ribosomal structure and biogenesis]
Probab=23.83  E-value=45  Score=21.49  Aligned_cols=30  Identities=27%  Similarity=0.534  Sum_probs=18.5

Q ss_pred             hhHHHHHHHHHHHHhhhhcCccccCcchhhhccCCCCCCCC
Q psy8050           3 KKKKKKKKKKKKKKKRKKKGILANSSSKEKIQDRSECPGSG   43 (311)
Q Consensus         3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~c~   43 (311)
                      ++|.||.+|.+|+.-++.+-.           ..-.|+.||
T Consensus         5 krktSksrr~~RRsh~~l~~~-----------~~~~c~~cG   34 (57)
T COG0333           5 KRKTSKSRRRMRRSHDALKAP-----------TLSVCPNCG   34 (57)
T ss_pred             cccCChhhhhhhhhhHhhhCc-----------cceeccCCC
Confidence            456677777777766554322           136788888


No 180
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=23.09  E-value=53  Score=31.52  Aligned_cols=30  Identities=27%  Similarity=0.484  Sum_probs=20.5

Q ss_pred             hhccccccccccccccccCChHHHHHHhhhhcCCCCeecCccccc
Q psy8050         192 RTHSGERPYRCRLCKKAFSDSSTLTKHLRIHSGEKPYQCKLCLLR  236 (311)
Q Consensus       192 ~~h~~~k~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~  236 (311)
                      ..|.......|..||...               ..|..|+.|+..
T Consensus       233 ~~h~~~~~l~Ch~Cg~~~---------------~~~~~Cp~C~s~  262 (505)
T TIGR00595       233 TYHKKEGKLRCHYCGYQE---------------PIPKTCPQCGSE  262 (505)
T ss_pred             EEecCCCeEEcCCCcCcC---------------CCCCCCCCCCCC
Confidence            344445667899998644               356789999864


No 181
>PRK00432 30S ribosomal protein S27ae; Validated
Probab=22.85  E-value=65  Score=20.09  Aligned_cols=11  Identities=18%  Similarity=0.679  Sum_probs=5.8

Q ss_pred             CeecCcCcccc
Q psy8050         143 PYRCEDCNKSF  153 (311)
Q Consensus       143 ~~~C~~C~~~f  153 (311)
                      .+.|..||.++
T Consensus        37 r~~C~~Cgyt~   47 (50)
T PRK00432         37 RWHCGKCGYTE   47 (50)
T ss_pred             cEECCCcCCEE
Confidence            35566565543


No 182
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=22.75  E-value=1e+02  Score=32.91  Aligned_cols=8  Identities=38%  Similarity=1.099  Sum_probs=4.8

Q ss_pred             eecCcCcc
Q psy8050         144 YRCEDCNK  151 (311)
Q Consensus       144 ~~C~~C~~  151 (311)
                      +.|+.||.
T Consensus       668 rkCPkCG~  675 (1337)
T PRK14714        668 RRCPSCGT  675 (1337)
T ss_pred             EECCCCCC
Confidence            55666664


No 183
>TIGR01562 FdhE formate dehydrogenase accessory protein FdhE. The only sequence scoring between trusted and noise is that from Aquifex aeolicus, which shows certain structural differences from the proteobacterial forms in the alignment. However it is notable that A. aeolicus also has a sequence scoring above trusted to the alpha subunit of formate dehydrogenase (TIGR01553).
Probab=22.59  E-value=58  Score=28.95  Aligned_cols=12  Identities=25%  Similarity=0.858  Sum_probs=5.7

Q ss_pred             CCCceecCCCCc
Q psy8050         168 GEKPFRCPICDR  179 (311)
Q Consensus       168 ~~~~~~C~~C~~  179 (311)
                      |.+-..|..|+.
T Consensus       207 G~RyL~CslC~t  218 (305)
T TIGR01562       207 GLRYLSCSLCAT  218 (305)
T ss_pred             CceEEEcCCCCC
Confidence            334445555543


No 184
>PRK12286 rpmF 50S ribosomal protein L32; Reviewed
Probab=22.43  E-value=25  Score=22.65  Aligned_cols=9  Identities=44%  Similarity=0.988  Sum_probs=7.1

Q ss_pred             cCCCCCCCC
Q psy8050          35 DRSECPGSG   43 (311)
Q Consensus        35 ~~~~C~~c~   43 (311)
                      ....|+.||
T Consensus        26 ~l~~C~~CG   34 (57)
T PRK12286         26 GLVECPNCG   34 (57)
T ss_pred             cceECCCCC
Confidence            347899998


No 185
>TIGR00686 phnA alkylphosphonate utilization operon protein PhnA. The protein family includes an uncharacterized member designated phnA in Escherichia coli, part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage. This protein is not related to the characterized phosphonoacetate hydrolase designated PhnA by Kulakova, et al. (2001, 1997).
Probab=22.08  E-value=54  Score=24.12  Aligned_cols=15  Identities=13%  Similarity=0.479  Sum_probs=8.5

Q ss_pred             CCeecCcCccccCCh
Q psy8050         142 KPYRCEDCNKSFSQA  156 (311)
Q Consensus       142 ~~~~C~~C~~~f~~~  156 (311)
                      ..|.|+.|+..+...
T Consensus        18 ~~~iCpeC~~EW~~~   32 (109)
T TIGR00686        18 TQLICPSCLYEWNEN   32 (109)
T ss_pred             CeeECcccccccccc
Confidence            346666666665433


No 186
>PRK12380 hydrogenase nickel incorporation protein HybF; Provisional
Probab=22.01  E-value=56  Score=24.29  Aligned_cols=11  Identities=27%  Similarity=0.646  Sum_probs=5.5

Q ss_pred             eecCcCccccC
Q psy8050         144 YRCEDCNKSFS  154 (311)
Q Consensus       144 ~~C~~C~~~f~  154 (311)
                      +.|..|+..|.
T Consensus        71 ~~C~~Cg~~~~   81 (113)
T PRK12380         71 AWCWDCSQVVE   81 (113)
T ss_pred             EEcccCCCEEe
Confidence            45555554443


No 187
>KOG0782|consensus
Probab=21.99  E-value=30  Score=33.15  Aligned_cols=53  Identities=17%  Similarity=0.248  Sum_probs=36.6

Q ss_pred             hHHHHHHHHHhCCCCCCCCcccccccccccccccccccchhhceeeeecccccCCCCCCCCccccccC
Q psy8050         240 SGNLNRHMRVHGSQYNMLPVIYMGVNQVCSNIVDQRRGIYVVEAVWTVQSTWIDGRSTMCGHSFIVKL  307 (311)
Q Consensus       240 ~~~L~~H~~~H~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~kC~~C~~~f~~~~  307 (311)
                      .+.|.+|--+|......       +|..|++.|.++..+.      ..+..-+  -|+-|...|..+.
T Consensus       238 E~~fvrHHWVHrrRqeG-------kC~~CgKgFQQKf~Fh------sKEivAi--sCSWCKqayH~Kv  290 (1004)
T KOG0782|consen  238 ESGFVRHHWVHRRRQEG-------KCNTCGKGFQQKFFFH------SKEIVAI--SCSWCKQAYHLKV  290 (1004)
T ss_pred             cccchHHhHhhHhhhcc-------ccchhhhhhhhheeec------cccEEEE--EehHHHHHhhcch
Confidence            44788887777655543       7999999999885554      3333322  3888988887765


No 188
>PRK03564 formate dehydrogenase accessory protein FdhE; Provisional
Probab=21.79  E-value=30  Score=30.75  Aligned_cols=6  Identities=33%  Similarity=1.315  Sum_probs=3.0

Q ss_pred             cccccc
Q psy8050         201 RCRLCK  206 (311)
Q Consensus       201 ~C~~C~  206 (311)
                      .|+.||
T Consensus       228 ~C~~Cg  233 (309)
T PRK03564        228 KCSNCE  233 (309)
T ss_pred             cCCCCC
Confidence            455555


No 189
>PF12647 RNHCP:  RNHCP domain;  InterPro: IPR024439 This domain is found in uncharacterised bacterial proteins. It is typically between 94 and 143 amino acids in length and has a conserved RNHCP sequence motif.
Probab=21.76  E-value=3.1e+02  Score=19.62  Aligned_cols=9  Identities=22%  Similarity=0.541  Sum_probs=7.7

Q ss_pred             cCCCCCCCC
Q psy8050          35 DRSECPGSG   43 (311)
Q Consensus        35 ~~~~C~~c~   43 (311)
                      +.|.|..||
T Consensus         3 ~~F~C~~CG   11 (92)
T PF12647_consen    3 ESFTCVHCG   11 (92)
T ss_pred             cccCccccC
Confidence            459999999


No 190
>KOG3408|consensus
Probab=21.69  E-value=49  Score=24.83  Aligned_cols=26  Identities=27%  Similarity=0.664  Sum_probs=20.1

Q ss_pred             ccccccccccccccCChHHHHHHhhh
Q psy8050         196 GERPYRCRLCKKAFSDSSTLTKHLRI  221 (311)
Q Consensus       196 ~~k~~~C~~C~~~f~~~~~l~~H~~~  221 (311)
                      |...|.|-.|.+-|.+...|..|.++
T Consensus        54 G~GqfyCi~CaRyFi~~~~l~~H~kt   79 (129)
T KOG3408|consen   54 GGGQFYCIECARYFIDAKALKTHFKT   79 (129)
T ss_pred             CCceeehhhhhhhhcchHHHHHHHhc
Confidence            34457888888888888888888764


No 191
>TIGR00100 hypA hydrogenase nickel insertion protein HypA. In Hpylori, hypA mutant abolished hydrogenase activity and decrease in urease activity. Nickel supplementation in media restored urease activity and partial hydrogenase activity. HypA probably involved in inserting Ni in enzymes.
Probab=21.52  E-value=57  Score=24.35  Aligned_cols=11  Identities=45%  Similarity=1.084  Sum_probs=5.8

Q ss_pred             eecCcCccccC
Q psy8050         144 YRCEDCNKSFS  154 (311)
Q Consensus       144 ~~C~~C~~~f~  154 (311)
                      +.|..|+..|.
T Consensus        71 ~~C~~Cg~~~~   81 (115)
T TIGR00100        71 CECEDCSEEVS   81 (115)
T ss_pred             EEcccCCCEEe
Confidence            45555555543


No 192
>PF10013 DUF2256:  Uncharacterized protein conserved in bacteria (DUF2256);  InterPro: IPR017136 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=21.52  E-value=40  Score=20.17  Aligned_cols=16  Identities=13%  Similarity=0.165  Sum_probs=11.9

Q ss_pred             eeccCCCCcCCChhHH
Q psy8050         116 LIQFIPGKTYARPSTL  131 (311)
Q Consensus       116 ~~C~~C~~~f~~~~~L  131 (311)
                      -+|..||+.|+.+..-
T Consensus         9 K~C~~C~rpf~WRKKW   24 (42)
T PF10013_consen    9 KICPVCGRPFTWRKKW   24 (42)
T ss_pred             CcCcccCCcchHHHHH
Confidence            4688899988876544


No 193
>COG2331 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=21.45  E-value=30  Score=23.63  Aligned_cols=10  Identities=50%  Similarity=1.135  Sum_probs=5.7

Q ss_pred             eecCcccccc
Q psy8050         228 YQCKLCLLRF  237 (311)
Q Consensus       228 ~~C~~C~~~f  237 (311)
                      |.|..|+..|
T Consensus        13 Y~c~~cg~~~   22 (82)
T COG2331          13 YECTECGNRF   22 (82)
T ss_pred             EeecccchHH
Confidence            5566665544


No 194
>KOG3014|consensus
Probab=21.42  E-value=42  Score=28.66  Aligned_cols=31  Identities=19%  Similarity=0.244  Sum_probs=22.9

Q ss_pred             cCCCCCCCCCCCCCCchhhHhhhccccccCCCCC
Q psy8050          35 DRSECPGSGMGKPPHDVADVLLSLKHAVVHPGQM   68 (311)
Q Consensus        35 ~~~~C~~c~~~~~~~~~~~~~~~~~H~~~h~~~~   68 (311)
                      ...+|..||+.+++.+-.+.   +.|...|..-+
T Consensus        36 g~~~C~~Cgm~Yt~~s~EDe---~~H~~fH~~~l   66 (257)
T KOG3014|consen   36 GAVKCKECGMKYTVTSPEDE---ALHEKFHNRFL   66 (257)
T ss_pred             CceehhhcCceecCCCHHHH---HHHHHHHHhhc
Confidence            45799999998877776666   56777777443


No 195
>TIGR03829 YokU_near_AblA uncharacterized protein, YokU family. Members of this protein family occur in various species of the genus Bacillus, always next to the gene (kamA or ablA) for lysine 2,3-aminomutase. Members have a pair of CXXC motifs, and share homology to the amino-terminal region of a family of putative transcription factors for which the C-terminal is modeled by pfam01381, a helix-turn-helix domain model. This family, however, is shorter and lacks the helix-turn-helix region. The function of this protein family is unknown, but a regulatory role in compatible solute biosynthesis is suggested by local genome context.
Probab=21.08  E-value=54  Score=23.26  Aligned_cols=16  Identities=19%  Similarity=0.094  Sum_probs=7.5

Q ss_pred             eeeccCCCCcCCChhH
Q psy8050         115 ILIQFIPGKTYARPST  130 (311)
Q Consensus       115 ~~~C~~C~~~f~~~~~  130 (311)
                      ...|..||..|-+...
T Consensus        35 a~~C~~CGe~y~~dev   50 (89)
T TIGR03829        35 SISCSHCGMEYQDDTT   50 (89)
T ss_pred             cccccCCCcEeecHHH
Confidence            3445555555444433


No 196
>PF09788 Tmemb_55A:  Transmembrane protein 55A;  InterPro: IPR019178  Members of this family catalyse the hydrolysis of the 4-position phosphate of phosphatidylinositol 4,5-bisphosphate, in the reaction:  1-phosphatidyl-myo-inositol 4,5-bisphosphate + H(2)O = 1-phosphatidyl-1D-myo-inositol 5-phosphate + phosphate.  
Probab=20.97  E-value=3.7e+02  Score=23.20  Aligned_cols=41  Identities=20%  Similarity=0.136  Sum_probs=22.8

Q ss_pred             CCCCCCCCCCCCCCchhhHhhhcccccc--CCCCCCCCCCCCCCCCCC
Q psy8050          36 RSECPGSGMGKPPHDVADVLLSLKHAVV--HPGQMTPPPYGDPGAPYQ   81 (311)
Q Consensus        36 ~~~C~~c~~~~~~~~~~~~~~~~~H~~~--h~~~~~p~~~~~p~~~~~   81 (311)
                      .-.|..|+     ....-.....+|.+.  +-.+.+|--..+||+-|.
T Consensus        65 ~v~CrVCq-----~~I~i~gk~~QhVVkC~~CnEATPIr~aPpGKKYV  107 (256)
T PF09788_consen   65 VVTCRVCQ-----SLIDIEGKMHQHVVKCSVCNEATPIRNAPPGKKYV  107 (256)
T ss_pred             eEEeecCC-----ceecccCccceeeEECCCCCccccccCCCCCCeeE
Confidence            35688887     322222333445442  345666666667778777


No 197
>PF07295 DUF1451:  Protein of unknown function (DUF1451);  InterPro: IPR009912 This family consists of several hypothetical bacterial proteins of around 160 residues in length. Members of this family contain four highly conserved cysteine resides toward the C-terminal region of the protein. The function of this family is unknown.
Probab=20.92  E-value=65  Score=25.23  Aligned_cols=11  Identities=18%  Similarity=0.425  Sum_probs=7.2

Q ss_pred             CeecCcccccc
Q psy8050         227 PYQCKLCLLRF  237 (311)
Q Consensus       227 ~~~C~~C~~~f  237 (311)
                      .|.|..||...
T Consensus       112 ~l~C~~Cg~~~  122 (146)
T PF07295_consen  112 TLVCENCGHEV  122 (146)
T ss_pred             eEecccCCCEE
Confidence            37777777554


No 198
>PF07649 C1_3:  C1-like domain;  InterPro: IPR011424 This short domain is rich in cysteines and histidines. The pattern of conservation is similar to that found in IPR002219 from INTERPRO. C1 domains are protein kinase C-like zinc finger structures. Diacylglycerol (DAG) kinases (DGKs) have a two or three commonly conserved cysteine-rich C1 domains []. DGKs modulate the balance between the two signaling lipids, DAG and phosphatidic acid (PA), by phosphorylating DAG to yield PA []. The PKD (protein kinase D) family are novel DAG receptors. They have twin C1 domains, designated C1a and C1b, which bind DAG or phorbol esters. Individual C1 domains differ in ligand-binding activity and selectivity []. ; GO: 0047134 protein-disulfide reductase activity, 0055114 oxidation-reduction process; PDB: 1V5N_A.
Probab=20.57  E-value=56  Score=17.62  Aligned_cols=8  Identities=50%  Similarity=1.531  Sum_probs=2.9

Q ss_pred             CeecCcCc
Q psy8050         143 PYRCEDCN  150 (311)
Q Consensus       143 ~~~C~~C~  150 (311)
                      .|.|..|+
T Consensus        15 ~Y~C~~Cd   22 (30)
T PF07649_consen   15 FYRCSECD   22 (30)
T ss_dssp             EEE-TTT-
T ss_pred             eEECccCC
Confidence            34555554


No 199
>PF13451 zf-trcl:  Probable zinc-binding domain
Probab=20.45  E-value=90  Score=19.42  Aligned_cols=18  Identities=28%  Similarity=0.433  Sum_probs=10.3

Q ss_pred             CCeecCccccccCChHHH
Q psy8050         226 KPYQCKLCLLRFSQSGNL  243 (311)
Q Consensus       226 ~~~~C~~C~~~f~~~~~L  243 (311)
                      +.+.|..||..|.....=
T Consensus         3 k~l~C~dCg~~FvfTa~E   20 (49)
T PF13451_consen    3 KTLTCKDCGAEFVFTAGE   20 (49)
T ss_pred             eeEEcccCCCeEEEehhH
Confidence            446666777666544433


No 200
>PRK05978 hypothetical protein; Provisional
Probab=20.02  E-value=58  Score=25.56  Aligned_cols=9  Identities=11%  Similarity=0.268  Sum_probs=4.8

Q ss_pred             ccccccccc
Q psy8050         265 NQVCSNIVD  273 (311)
Q Consensus       265 C~~C~~~f~  273 (311)
                      |+.||..|.
T Consensus        55 C~~CG~~~~   63 (148)
T PRK05978         55 CAACGEDFT   63 (148)
T ss_pred             ccccCCccc
Confidence            555555543


Done!