Query         psy5202
Match_columns 343
No_of_seqs    512 out of 2379
Neff          10.7
Searched_HMMs 46136
Date          Fri Aug 16 19:57:14 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy5202.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5202hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG2462|consensus              100.0 4.9E-34 1.1E-38  226.2  11.3  137   87-265   126-265 (279)
  2 KOG2462|consensus               99.9 1.5E-26 3.1E-31  183.9   6.0  126  116-280   127-252 (279)
  3 KOG1074|consensus               99.9 9.6E-23 2.1E-27  183.6   8.2   90  184-273   604-698 (958)
  4 KOG1074|consensus               99.8 3.6E-22 7.7E-27  179.9   2.7  177   89-272   603-936 (958)
  5 KOG3608|consensus               99.8 3.6E-21 7.8E-26  157.2   5.6  166   91-271   177-349 (467)
  6 KOG3608|consensus               99.8 2.7E-20 5.9E-25  152.1   7.7  174   85-270   201-379 (467)
  7 KOG3576|consensus               99.8 8.7E-20 1.9E-24  137.9   8.2   87  186-272   146-241 (267)
  8 KOG3576|consensus               99.8   1E-20 2.2E-25  142.9   3.0  110   87-237   113-236 (267)
  9 KOG3623|consensus               99.7 1.1E-17 2.4E-22  148.9   4.4  122   89-235   208-331 (1007)
 10 KOG3623|consensus               99.7 1.2E-17 2.5E-22  148.8   2.2  107  120-264   211-330 (1007)
 11 PLN03086 PRLI-interacting fact  99.5 3.2E-14 6.9E-19  127.8   7.2  129   92-268   408-565 (567)
 12 KOG3993|consensus               99.5 7.8E-14 1.7E-18  117.4   5.8  174   91-271   267-486 (500)
 13 PHA00733 hypothetical protein   99.3 2.6E-12 5.6E-17   94.4   4.6   82  183-268    38-124 (128)
 14 PLN03086 PRLI-interacting fact  99.2 2.4E-11 5.2E-16  109.5   8.5  104   89-237   451-564 (567)
 15 PHA00733 hypothetical protein   99.2 3.1E-11 6.8E-16   88.7   5.3   52  184-237    72-123 (128)
 16 PHA02768 hypothetical protein;  99.0 1.2E-10 2.6E-15   70.0   2.2   40  186-227     6-45  (55)
 17 PHA02768 hypothetical protein;  99.0 1.7E-10 3.6E-15   69.4   1.9   44   91-136     5-48  (55)
 18 KOG3993|consensus               98.9 4.3E-10 9.4E-15   95.2  -0.1   89  184-272   266-385 (500)
 19 PF13465 zf-H2C2_2:  Zinc-finge  98.8 2.8E-09 6.2E-14   54.8   2.7   25  200-224     1-25  (26)
 20 PF13465 zf-H2C2_2:  Zinc-finge  98.7 1.6E-08 3.5E-13   51.9   2.2   26  106-131     1-26  (26)
 21 PF05605 zf-Di19:  Drought indu  98.6 7.1E-08 1.5E-12   59.6   4.4   51  214-267     3-53  (54)
 22 PHA00732 hypothetical protein   98.6 3.2E-08 6.9E-13   65.8   2.6   45  186-236     2-47  (79)
 23 PHA00616 hypothetical protein   98.6 2.9E-08 6.3E-13   56.7   1.5   32  186-217     2-33  (44)
 24 PHA00616 hypothetical protein   98.5 3.1E-08 6.7E-13   56.6   1.5   34   91-124     1-34  (44)
 25 PHA00732 hypothetical protein   98.4   2E-07 4.4E-12   62.0   2.7   47  213-266     1-47  (79)
 26 PF05605 zf-Di19:  Drought indu  98.3 1.1E-06 2.4E-11   54.3   4.8   23  120-143     3-25  (54)
 27 PF00096 zf-C2H2:  Zinc finger,  98.2 8.7E-07 1.9E-11   44.2   1.3   22  214-235     1-22  (23)
 28 PF00096 zf-C2H2:  Zinc finger,  98.2 1.8E-06   4E-11   43.0   2.4   22  244-265     1-22  (23)
 29 COG5189 SFP1 Putative transcri  98.0 1.2E-06 2.6E-11   71.9   0.5   55  211-265   347-420 (423)
 30 PF13894 zf-C2H2_4:  C2H2-type   98.0 2.6E-06 5.7E-11   42.9   1.6   23  244-266     1-23  (24)
 31 KOG2231|consensus               98.0 1.5E-05 3.3E-10   73.5   7.1   71  185-265   182-261 (669)
 32 PF12756 zf-C2H2_2:  C2H2 type   98.0 3.9E-06 8.4E-11   59.7   1.9   74  187-267     1-74  (100)
 33 PF13894 zf-C2H2_4:  C2H2-type   97.9 9.7E-06 2.1E-10   40.8   2.6   24  214-237     1-24  (24)
 34 PF12756 zf-C2H2_2:  C2H2 type   97.9 8.2E-06 1.8E-10   58.0   2.9   24  121-144     1-24  (100)
 35 PF13912 zf-C2H2_6:  C2H2-type   97.8 8.2E-06 1.8E-10   42.4   1.1   24  244-267     2-25  (27)
 36 COG5189 SFP1 Putative transcri  97.8 7.2E-06 1.6E-10   67.5   0.6   53  182-234   346-419 (423)
 37 PF13912 zf-C2H2_6:  C2H2-type   97.7 2.3E-05 4.9E-10   40.7   1.7   25   91-115     1-25  (27)
 38 KOG2231|consensus               97.7  0.0001 2.2E-09   68.3   6.9  103  121-271   117-240 (669)
 39 COG5236 Uncharacterized conser  97.5 0.00012 2.6E-09   61.2   3.9  135   91-271   151-309 (493)
 40 KOG1146|consensus               97.4 0.00028 6.1E-09   69.3   5.5  160   93-266   438-641 (1406)
 41 PF13909 zf-H2C2_5:  C2H2-type   97.3 0.00014   3E-09   36.5   1.4   22  214-236     1-22  (24)
 42 smart00355 ZnF_C2H2 zinc finge  97.3 0.00025 5.4E-09   36.1   2.3   23  244-266     1-23  (26)
 43 PF09237 GAGA:  GAGA factor;  I  97.3 0.00036 7.7E-09   40.9   3.0   31   88-118    21-51  (54)
 44 KOG2785|consensus               97.2 0.00091   2E-08   57.2   6.5   60  214-273   167-250 (390)
 45 smart00355 ZnF_C2H2 zinc finge  97.2 0.00029 6.3E-09   35.9   2.1   24  214-237     1-24  (26)
 46 COG5048 FOG: Zn-finger [Genera  97.1 0.00043 9.4E-09   63.4   4.0  145   90-237   288-442 (467)
 47 COG5236 Uncharacterized conser  97.1 0.00034 7.4E-09   58.5   2.9  107  120-272   152-280 (493)
 48 PF09237 GAGA:  GAGA factor;  I  97.1 0.00062 1.3E-08   39.9   3.1   26  212-237    23-48  (54)
 49 KOG1146|consensus               97.1   7E-05 1.5E-09   73.3  -1.3  150   87-236   461-641 (1406)
 50 PF13909 zf-H2C2_5:  C2H2-type   97.1 0.00055 1.2E-08   34.2   2.2   23  120-143     1-23  (24)
 51 PRK04860 hypothetical protein;  96.9  0.0005 1.1E-08   52.6   1.7   36  185-224   119-154 (160)
 52 PRK04860 hypothetical protein;  96.9 0.00051 1.1E-08   52.6   1.7   40  212-257   118-157 (160)
 53 KOG2785|consensus               96.9  0.0041 8.9E-08   53.3   7.1  146   91-236     3-243 (390)
 54 PF12874 zf-met:  Zinc-finger o  96.8 0.00089 1.9E-08   33.8   1.8   20  215-234     2-21  (25)
 55 PF12874 zf-met:  Zinc-finger o  96.8 0.00056 1.2E-08   34.6   1.1   23  244-266     1-23  (25)
 56 KOG2482|consensus               96.6  0.0094   2E-07   50.2   7.4  132  118-265   143-356 (423)
 57 PF12171 zf-C2H2_jaz:  Zinc-fin  96.5 0.00084 1.8E-08   34.7   0.4   22  244-265     2-23  (27)
 58 PF13913 zf-C2HC_2:  zinc-finge  96.5  0.0023   5E-08   32.2   1.9   22  244-266     3-24  (25)
 59 PF12171 zf-C2H2_jaz:  Zinc-fin  96.0  0.0019   4E-08   33.3   0.2   22  214-235     2-23  (27)
 60 KOG2482|consensus               95.8   0.016 3.5E-07   48.8   4.8   54   91-144   195-304 (423)
 61 KOG4173|consensus               95.4    0.01 2.2E-07   46.2   2.2   82  184-268    78-171 (253)
 62 PF13913 zf-C2HC_2:  zinc-finge  95.4   0.014 3.1E-07   29.3   2.0   19  215-234     4-22  (25)
 63 TIGR00622 ssl1 transcription f  94.9   0.072 1.6E-06   37.8   4.9  109   92-250     2-110 (112)
 64 smart00451 ZnF_U1 U1-like zinc  94.7   0.027 5.9E-07   30.9   2.0   23  243-265     3-25  (35)
 65 cd00350 rubredoxin_like Rubred  94.4    0.02 4.4E-07   31.0   1.0   12  186-197     2-13  (33)
 66 KOG4173|consensus               94.2   0.021 4.5E-07   44.5   1.2   26  210-235   141-168 (253)
 67 KOG2893|consensus               93.8   0.015 3.2E-07   46.4  -0.3   47   93-143    12-58  (341)
 68 COG4049 Uncharacterized protei  93.8   0.027 5.9E-07   33.7   0.8   28  240-267    14-41  (65)
 69 smart00451 ZnF_U1 U1-like zinc  93.7   0.053 1.2E-06   29.7   1.9   23  213-235     3-25  (35)
 70 PF09538 FYDLN_acid:  Protein o  93.4   0.048   1E-06   38.8   1.7   30  214-256    10-39  (108)
 71 COG5048 FOG: Zn-finger [Genera  93.4   0.017 3.6E-07   52.9  -0.9   54  184-237   288-347 (467)
 72 PF12013 DUF3505:  Protein of u  93.3    0.18   4E-06   36.1   4.6   25  244-268    81-109 (109)
 73 COG4049 Uncharacterized protei  93.2    0.04 8.7E-07   33.0   0.8   29   86-114    12-40  (65)
 74 KOG2893|consensus               93.0   0.023   5E-07   45.3  -0.5   41  188-232    13-53  (341)
 75 cd00729 rubredoxin_SM Rubredox  92.9   0.079 1.7E-06   28.9   1.7   11  214-224     3-13  (34)
 76 PF12013 DUF3505:  Protein of u  92.7    0.25 5.4E-06   35.5   4.6   54   90-144    10-109 (109)
 77 TIGR00622 ssl1 transcription f  90.8    0.51 1.1E-05   33.6   4.3   77  184-266    14-104 (112)
 78 PF09538 FYDLN_acid:  Protein o  90.8    0.18 3.8E-06   35.9   2.0   32  184-226     8-39  (108)
 79 TIGR02300 FYDLN_acid conserved  90.4    0.18 3.8E-06   36.4   1.7   33  214-259    10-42  (129)
 80 COG1592 Rubrerythrin [Energy p  90.4    0.18   4E-06   38.7   1.9   23  213-250   134-156 (166)
 81 TIGR02098 MJ0042_CXXC MJ0042 f  90.3    0.13 2.9E-06   28.8   0.8   14  214-227     3-16  (38)
 82 PF09986 DUF2225:  Uncharacteri  90.2   0.067 1.4E-06   43.6  -0.7   45  212-256     4-61  (214)
 83 KOG2807|consensus               90.0    0.65 1.4E-05   39.3   4.9   31  212-250   344-374 (378)
 84 PF02892 zf-BED:  BED zinc fing  90.0    0.18   4E-06   29.4   1.3   25  241-265    14-42  (45)
 85 PF09986 DUF2225:  Uncharacteri  89.2   0.064 1.4E-06   43.7  -1.5   44   89-132     3-61  (214)
 86 PF13717 zinc_ribbon_4:  zinc-r  88.8    0.29 6.2E-06   27.1   1.4   14  215-228     4-17  (36)
 87 PF13719 zinc_ribbon_5:  zinc-r  88.7    0.28 6.2E-06   27.3   1.4   11  243-253    25-35  (37)
 88 PHA00626 hypothetical protein   88.5    0.18 3.9E-06   30.4   0.5   14  242-255    22-35  (59)
 89 PRK14890 putative Zn-ribbon RN  87.9    0.54 1.2E-05   29.0   2.3   10  242-251    47-56  (59)
 90 KOG2186|consensus               87.5    0.39 8.5E-06   39.1   2.0   47  185-234     3-49  (276)
 91 PRK00398 rpoP DNA-directed RNA  87.0    0.29 6.4E-06   28.8   0.8   11  214-224     4-14  (46)
 92 COG2888 Predicted Zn-ribbon RN  87.0    0.73 1.6E-05   28.3   2.5   10  242-251    49-58  (61)
 93 smart00734 ZnF_Rad18 Rad18-lik  86.8    0.74 1.6E-05   23.3   2.1   21  245-266     3-23  (26)
 94 KOG2186|consensus               86.8    0.44 9.5E-06   38.8   1.9   51   92-145     4-54  (276)
 95 COG5151 SSL1 RNA polymerase II  86.8     0.6 1.3E-05   39.2   2.7   55  186-250   363-417 (421)
 96 PF03604 DNA_RNApol_7kD:  DNA d  86.5    0.59 1.3E-05   25.0   1.7   11  214-224     1-11  (32)
 97 COG1996 RPC10 DNA-directed RNA  86.3     0.4 8.8E-06   28.4   1.1   11  185-195     6-16  (49)
 98 COG1198 PriA Primosomal protei  86.3    0.68 1.5E-05   44.8   3.2   40  186-252   445-484 (730)
 99 TIGR00373 conserved hypothetic  86.2    0.95 2.1E-05   34.9   3.5   34  209-253   105-138 (158)
100 TIGR02300 FYDLN_acid conserved  85.8    0.55 1.2E-05   34.0   1.8   35  184-229     8-42  (129)
101 COG1997 RPL43A Ribosomal prote  85.4     0.4 8.7E-06   32.0   0.9   13  242-254    52-64  (89)
102 smart00531 TFIIE Transcription  85.2     1.1 2.4E-05   34.1   3.4   38  210-253    96-133 (147)
103 PRK00464 nrdR transcriptional   85.1    0.16 3.5E-06   38.7  -1.2   44   92-138     1-47  (154)
104 COG1592 Rubrerythrin [Energy p  84.8    0.54 1.2E-05   36.2   1.5   25  184-221   133-157 (166)
105 smart00659 RPOLCX RNA polymera  84.8    0.61 1.3E-05   27.1   1.4   27   91-128     2-28  (44)
106 PRK04023 DNA polymerase II lar  84.6     1.1 2.4E-05   44.2   3.7   13  243-255   663-675 (1121)
107 smart00614 ZnF_BED BED zinc fi  84.4     0.7 1.5E-05   27.7   1.6   21  244-264    19-44  (50)
108 PRK00464 nrdR transcriptional   84.1    0.18 3.9E-06   38.5  -1.3   15  244-258    29-43  (154)
109 PRK06266 transcription initiat  83.8     1.2 2.6E-05   35.1   3.1   35  210-255   114-148 (178)
110 TIGR02605 CxxC_CxxC_SSSS putat  83.0    0.27 5.9E-06   29.8  -0.6   29   92-127     6-34  (52)
111 PF15269 zf-C2H2_7:  Zinc-finge  82.4       1 2.2E-05   25.7   1.5   22  214-235    21-42  (54)
112 smart00531 TFIIE Transcription  82.2     1.3 2.9E-05   33.7   2.7   38   88-129    96-133 (147)
113 KOG4167|consensus               81.6     3.8 8.3E-05   38.9   5.8   29   87-115   788-816 (907)
114 PRK06266 transcription initiat  81.4    0.75 1.6E-05   36.2   1.1   32  184-224   116-147 (178)
115 KOG4377|consensus               81.4     7.7 0.00017   34.3   7.2   27  119-145   271-299 (480)
116 COG5188 PRP9 Splicing factor 3  80.9     1.4 3.1E-05   37.7   2.6   23  184-206   237-259 (470)
117 smart00834 CxxC_CXXC_SSSS Puta  80.9    0.34 7.4E-06   27.5  -0.7   31   91-128     5-35  (41)
118 TIGR00373 conserved hypothetic  80.8     2.1 4.7E-05   33.0   3.5   20  117-136   107-126 (158)
119 COG4530 Uncharacterized protei  79.9     1.1 2.3E-05   31.4   1.3   27  215-254    11-37  (129)
120 PRK09678 DNA-binding transcrip  79.7    0.51 1.1E-05   30.7  -0.2   17  241-257    25-43  (72)
121 PRK09678 DNA-binding transcrip  79.3    0.41 8.9E-06   31.1  -0.8   41  186-228     2-44  (72)
122 PF09723 Zn-ribbon_8:  Zinc rib  79.2    0.81 1.8E-05   26.3   0.5   11  214-224     6-16  (42)
123 TIGR00595 priA primosomal prot  79.2     1.8 3.8E-05   40.5   3.0   41  185-252   222-262 (505)
124 PF04959 ARS2:  Arsenite-resist  79.1     1.3 2.7E-05   35.9   1.7   31  240-270    74-104 (214)
125 KOG2071|consensus               78.9     1.7 3.8E-05   40.2   2.8   26  184-209   417-442 (579)
126 PF05443 ROS_MUCR:  ROS/MUCR tr  77.6     1.2 2.6E-05   33.0   1.1   27   89-118    70-96  (132)
127 PF06524 NOA36:  NOA36 protein;  77.4     1.1 2.4E-05   36.6   1.0   32  240-271   206-237 (314)
128 PF06524 NOA36:  NOA36 protein;  77.1    0.96 2.1E-05   37.0   0.6   49  184-237   181-233 (314)
129 PRK14714 DNA polymerase II lar  77.1       3 6.6E-05   42.4   4.0   42  186-255   680-721 (1337)
130 PRK14873 primosome assembly pr  77.0     2.1 4.5E-05   41.4   2.8   40  185-252   392-431 (665)
131 PF02176 zf-TRAF:  TRAF-type zi  76.9     1.3 2.9E-05   27.6   1.1   21  226-248    23-43  (60)
132 PRK04023 DNA polymerase II lar  76.9     5.6 0.00012   39.6   5.6   25   88-128   623-647 (1121)
133 KOG2071|consensus               76.8     8.2 0.00018   36.0   6.4   30   86-115   413-442 (579)
134 PF10571 UPF0547:  Uncharacteri  76.0     1.6 3.4E-05   22.1   1.0   10  245-254    16-25  (26)
135 PF05443 ROS_MUCR:  ROS/MUCR tr  75.6     1.3 2.7E-05   32.8   0.8   26  242-270    71-96  (132)
136 PF15269 zf-C2H2_7:  Zinc-finge  74.8     3.7 8.1E-05   23.5   2.3   25   89-113    18-42  (54)
137 PF13878 zf-C2H2_3:  zinc-finge  73.8     3.1 6.7E-05   23.7   1.9   24   92-115    14-39  (41)
138 PF08274 PhnA_Zn_Ribbon:  PhnA   72.9     1.8 3.9E-05   22.8   0.7    9  188-196     5-13  (30)
139 PF14353 CpXC:  CpXC protein     72.7       3 6.4E-05   30.9   2.2   23  243-265    38-60  (128)
140 KOG4377|consensus               72.4      13 0.00029   32.9   6.2   52   88-142   268-335 (480)
141 TIGR01384 TFS_arch transcripti  71.5    0.21 4.5E-06   35.5  -4.1   27   93-131     2-28  (104)
142 KOG2593|consensus               70.7     4.6  0.0001   36.0   3.2   38  210-252   125-162 (436)
143 PF04959 ARS2:  Arsenite-resist  70.0     3.5 7.6E-05   33.4   2.2   26  212-237    76-101 (214)
144 PRK05580 primosome assembly pr  69.7       4 8.6E-05   39.8   2.9   41  185-252   390-430 (679)
145 KOG1280|consensus               68.1     4.5 9.7E-05   34.8   2.4   23  213-235    79-101 (381)
146 COG3357 Predicted transcriptio  67.8     2.7 5.8E-05   28.4   0.9   14  212-225    57-70  (97)
147 PF12760 Zn_Tnp_IS1595:  Transp  66.8     5.3 0.00011   23.4   2.0   10  242-251    36-45  (46)
148 PRK03824 hypA hydrogenase nick  66.7     2.4 5.2E-05   31.7   0.6    8  244-251   108-115 (135)
149 KOG2593|consensus               66.4     4.9 0.00011   35.8   2.5   22  116-137   125-146 (436)
150 TIGR01206 lysW lysine biosynth  66.3     2.3 5.1E-05   25.9   0.4   10  214-223     3-12  (54)
151 PF14446 Prok-RING_1:  Prokaryo  65.6     4.6  0.0001   24.5   1.5    9  187-195     7-15  (54)
152 KOG3408|consensus               65.3     3.5 7.6E-05   29.6   1.1   27  240-266    54-80  (129)
153 PRK12380 hydrogenase nickel in  65.2       4 8.7E-05   29.4   1.5   11  214-224    71-81  (113)
154 PF05191 ADK_lid:  Adenylate ki  64.9     1.5 3.3E-05   24.2  -0.6   10  187-196     3-12  (36)
155 PF07975 C1_4:  TFIIH C1-like d  64.9     1.7 3.7E-05   26.1  -0.4   23  213-235    21-43  (51)
156 PF07754 DUF1610:  Domain of un  64.8     3.3 7.3E-05   20.4   0.7    8  243-250    16-23  (24)
157 KOG2807|consensus               64.0     9.3  0.0002   32.7   3.5  100   90-220   275-374 (378)
158 KOG4167|consensus               63.2     1.2 2.6E-05   42.1  -1.9   29  242-270   791-819 (907)
159 PF08209 Sgf11:  Sgf11 (transcr  62.7     4.6  0.0001   21.8   1.0   25  243-268     4-28  (33)
160 PF09845 DUF2072:  Zn-ribbon co  62.0     4.5 9.8E-05   29.7   1.2   15  213-227     1-15  (131)
161 COG4957 Predicted transcriptio  61.8       4 8.8E-05   29.8   0.9   25   92-119    77-101 (148)
162 KOG4124|consensus               61.6     2.8 6.2E-05   36.0   0.2   55   87-141   345-420 (442)
163 COG1571 Predicted DNA-binding   61.1       5 0.00011   36.0   1.6   30  215-257   352-381 (421)
164 PF05290 Baculo_IE-1:  Baculovi  61.0       8 0.00017   28.4   2.3   55  184-258    79-136 (140)
165 TIGR00100 hypA hydrogenase nic  59.9     5.8 0.00013   28.7   1.5   12  214-225    71-82  (115)
166 PTZ00255 60S ribosomal protein  57.9     4.2 9.1E-05   27.7   0.5   13  242-254    53-65  (90)
167 PF01155 HypA:  Hydrogenase exp  57.7     3.4 7.4E-05   29.8   0.0   12  214-225    71-82  (113)
168 COG1655 Uncharacterized protei  57.5     1.8 3.9E-05   34.8  -1.5   39  184-222    18-71  (267)
169 PRK00564 hypA hydrogenase nick  57.4     6.6 0.00014   28.5   1.5   11  214-224    72-82  (117)
170 PF01780 Ribosomal_L37ae:  Ribo  57.1     3.1 6.7E-05   28.3  -0.3   11  214-224    54-64  (90)
171 PRK14873 primosome assembly pr  56.9     3.6 7.7E-05   39.8   0.0   33   92-128   384-419 (665)
172 PF09963 DUF2197:  Uncharacteri  56.2     6.9 0.00015   24.0   1.1   36   92-128     3-40  (56)
173 TIGR00280 L37a ribosomal prote  55.4     4.1 8.9E-05   27.8   0.1   13  242-254    52-64  (91)
174 PF13453 zf-TFIIB:  Transcripti  55.4     6.5 0.00014   22.3   0.9   16  214-229    20-35  (41)
175 PF01363 FYVE:  FYVE zinc finge  55.3     5.5 0.00012   25.6   0.7    9  187-195    11-19  (69)
176 COG3091 SprT Zn-dependent meta  55.1     5.8 0.00013   29.9   0.8   11  185-196   117-127 (156)
177 COG3091 SprT Zn-dependent meta  53.8     6.4 0.00014   29.7   0.9   36   88-128   114-149 (156)
178 PF13451 zf-trcl:  Probable zin  53.8       8 0.00017   23.0   1.1   10  186-195     5-14  (49)
179 PF07282 OrfB_Zn_ribbon:  Putat  53.7     9.5 0.00021   24.5   1.6   27  187-223    30-56  (69)
180 PRK00432 30S ribosomal protein  53.3     7.3 0.00016   23.4   0.9   12  212-223    36-47  (50)
181 COG5151 SSL1 RNA polymerase II  52.9     6.8 0.00015   33.2   1.0   48  215-266   364-411 (421)
182 PF15135 UPF0515:  Uncharacteri  52.1     6.5 0.00014   32.2   0.7   14  212-225   154-167 (278)
183 PRK03681 hypA hydrogenase nick  52.1     9.1  0.0002   27.7   1.4   10  214-223    71-80  (114)
184 COG4957 Predicted transcriptio  51.6     7.8 0.00017   28.4   1.0   26  185-213    76-101 (148)
185 TIGR00595 priA primosomal prot  51.2     5.8 0.00013   37.1   0.4   35   93-128   215-249 (505)
186 COG4896 Uncharacterized protei  51.2      12 0.00025   23.2   1.5   40   92-131     3-43  (68)
187 KOG3408|consensus               50.9     9.9 0.00022   27.3   1.4   26  210-235    54-79  (129)
188 COG1675 TFA1 Transcription ini  50.5      24 0.00051   27.7   3.6   34  208-252   108-141 (176)
189 cd00065 FYVE FYVE domain; Zinc  50.0      12 0.00026   22.8   1.6    9  245-253    20-28  (57)
190 smart00440 ZnF_C2C2 C2C2 Zinc   49.9     1.7 3.7E-05   24.6  -2.0   10  244-253    29-38  (40)
191 KOG1842|consensus               49.4      12 0.00026   33.6   2.0   22   92-113    16-37  (505)
192 PF13821 DUF4187:  Domain of un  48.9      20 0.00043   22.0   2.3   18  244-261    28-45  (55)
193 KOG1280|consensus               48.7      19 0.00042   31.1   3.0   28  118-145    78-105 (381)
194 PF06220 zf-U1:  U1 zinc finger  48.4      16 0.00034   20.5   1.7   22  243-264     3-26  (38)
195 KOG2907|consensus               48.1     9.2  0.0002   27.1   0.9   11  214-224   103-113 (116)
196 COG4888 Uncharacterized Zn rib  48.0     5.6 0.00012   27.5  -0.1   13  242-254    45-57  (104)
197 KOG4124|consensus               47.9       2 4.4E-05   36.8  -2.7   53   87-140   174-233 (442)
198 COG1571 Predicted DNA-binding   47.7      10 0.00022   34.0   1.4   34  185-229   350-383 (421)
199 PRK03976 rpl37ae 50S ribosomal  47.6     6.1 0.00013   27.0  -0.0   13  242-254    53-65  (90)
200 smart00661 RPOL9 RNA polymeras  47.6      12 0.00026   22.3   1.3   10  243-252    20-29  (52)
201 PF08271 TF_Zn_Ribbon:  TFIIB z  47.3     9.5 0.00021   21.9   0.8    8  244-251    20-27  (43)
202 PRK12496 hypothetical protein;  47.1      14  0.0003   28.8   1.8   11  214-224   128-138 (164)
203 COG1773 Rubredoxin [Energy pro  46.9     9.4  0.0002   23.3   0.7   40  212-251     2-44  (55)
204 COG1998 RPS31 Ribosomal protei  46.9     8.9 0.00019   22.7   0.6   10  213-222    37-46  (51)
205 KOG2272|consensus               46.1      23 0.00049   29.2   2.9   44   87-132   133-176 (332)
206 PF04780 DUF629:  Protein of un  45.4      15 0.00032   33.7   2.0   28  243-270    57-84  (466)
207 PF13824 zf-Mss51:  Zinc-finger  45.2      16 0.00035   22.3   1.5   12  242-253    13-24  (55)
208 smart00154 ZnF_AN1 AN1-like Zi  45.1      11 0.00024   21.2   0.8   14  243-256    12-25  (39)
209 PF14311 DUF4379:  Domain of un  45.0      14  0.0003   22.5   1.3   14  185-198    28-41  (55)
210 COG5216 Uncharacterized conser  44.3      12 0.00026   22.9   0.9   31  185-221    22-52  (67)
211 KOG2636|consensus               44.3      17 0.00037   32.7   2.1   29  206-234   394-423 (497)
212 PF03833 PolC_DP2:  DNA polymer  43.9     7.6 0.00016   38.0   0.0   13  245-257   694-706 (900)
213 PLN03238 probable histone acet  43.7      19 0.00041   30.6   2.2   27  241-267    46-72  (290)
214 COG1656 Uncharacterized conser  43.3      23 0.00049   27.3   2.4   14  244-257   131-144 (165)
215 PRK04351 hypothetical protein;  42.7      11 0.00024   28.7   0.7   12  242-253   131-142 (149)
216 COG1779 C4-type Zn-finger prot  42.3     3.1 6.7E-05   32.8  -2.3    8  186-193    15-22  (201)
217 KOG3507|consensus               42.3      13 0.00028   22.9   0.8   12  212-223    36-47  (62)
218 smart00064 FYVE Protein presen  41.9      18 0.00039   23.1   1.5   10  187-196    12-21  (68)
219 COG1594 RPB9 DNA-directed RNA   41.7     6.1 0.00013   28.5  -0.8   40  185-224    72-111 (113)
220 COG2331 Uncharacterized protei  40.7     5.7 0.00012   25.8  -0.9   10  214-223    13-22  (82)
221 TIGR01562 FdhE formate dehydro  40.7      19 0.00041   31.1   1.9   36  185-222   184-219 (305)
222 KOG2636|consensus               40.6      20 0.00043   32.3   2.0   29  234-264   394-423 (497)
223 PF10013 DUF2256:  Uncharacteri  40.2      21 0.00045   20.4   1.3   16  245-260    10-25  (42)
224 COG1198 PriA Primosomal protei  40.1     7.1 0.00015   38.0  -0.8   23  106-128   462-484 (730)
225 PRK03564 formate dehydrogenase  40.1      19  0.0004   31.2   1.7   36  184-222   186-221 (309)
226 smart00504 Ubox Modified RING   39.8      57  0.0012   20.1   3.6   45  186-254     2-46  (63)
227 COG0068 HypF Hydrogenase matur  39.7     4.6 9.9E-05   38.6  -2.1   57  187-252   125-182 (750)
228 PF10263 SprT-like:  SprT-like   39.4     8.7 0.00019   29.4  -0.3   11  185-195   123-133 (157)
229 PF14205 Cys_rich_KTR:  Cystein  39.0      17 0.00037   22.0   1.0   11  241-251    26-36  (55)
230 TIGR00686 phnA alkylphosphonat  38.3      17 0.00038   25.6   1.1   29  215-256     4-32  (109)
231 PF12907 zf-met2:  Zinc-binding  38.2      19  0.0004   20.5   1.0   21  214-234     2-25  (40)
232 PRK14892 putative transcriptio  38.1     8.5 0.00018   26.9  -0.5   12  244-255    43-54  (99)
233 COG1326 Uncharacterized archae  37.2      31 0.00067   27.3   2.4   14  241-254    28-41  (201)
234 KOG2923|consensus               36.8      25 0.00055   22.0   1.5   13  209-221    40-52  (67)
235 PRK10220 hypothetical protein;  36.8      24 0.00052   24.9   1.6   30  215-257     5-34  (111)
236 smart00731 SprT SprT homologue  36.5      12 0.00026   28.4   0.1   11  213-223   112-122 (146)
237 PRK14559 putative protein seri  36.3      38 0.00082   32.8   3.3   12  245-256    43-54  (645)
238 PF00301 Rubredoxin:  Rubredoxi  35.8      18 0.00038   21.4   0.7   39  214-252     2-43  (47)
239 PRK00762 hypA hydrogenase nick  35.7      14 0.00031   27.1   0.4   10  185-195    70-79  (124)
240 KOG1994|consensus               35.4      18 0.00039   29.3   0.8   22  242-263   238-259 (268)
241 PF10276 zf-CHCC:  Zinc-finger   35.3      18  0.0004   20.5   0.7   11  213-223    29-39  (40)
242 CHL00174 accD acetyl-CoA carbo  35.2      19 0.00041   30.9   1.1   33  185-226    38-70  (296)
243 COG5188 PRP9 Splicing factor 3  34.9      40 0.00087   29.3   2.9   29  113-141   368-397 (470)
244 PF04810 zf-Sec23_Sec24:  Sec23  34.9     7.3 0.00016   22.0  -1.0   10  212-221    23-32  (40)
245 PF07295 DUF1451:  Protein of u  34.7      14 0.00031   28.0   0.2   28  213-252   112-139 (146)
246 COG5112 UFD2 U1-like Zn-finger  34.1      14  0.0003   25.8   0.0   26  240-265    52-77  (126)
247 PF07649 C1_3:  C1-like domain;  33.8      23 0.00051   18.3   0.9   11  242-252    14-24  (30)
248 PF04780 DUF629:  Protein of un  33.7      34 0.00073   31.5   2.4   23  185-207    57-79  (466)
249 COG4306 Uncharacterized protei  33.5      22 0.00047   25.7   1.0   58  199-257    24-82  (160)
250 COG0068 HypF Hydrogenase matur  33.4     5.3 0.00011   38.2  -2.7   45   93-138   125-170 (750)
251 COG4640 Predicted membrane pro  33.4      29 0.00063   30.7   1.9   18  244-261    16-33  (465)
252 PTZ00064 histone acetyltransfe  33.3      26 0.00057   32.2   1.6   29  240-268   277-305 (552)
253 KOG0978|consensus               33.1      15 0.00032   35.4   0.1   20  243-262   678-697 (698)
254 cd00924 Cyt_c_Oxidase_Vb Cytoc  33.1      23  0.0005   24.7   1.0   18  207-225    74-91  (97)
255 TIGR00515 accD acetyl-CoA carb  33.0      22 0.00047   30.5   1.1   33  185-226    26-58  (285)
256 PF05495 zf-CHY:  CHY zinc fing  32.9     6.9 0.00015   25.5  -1.5   59  185-253    10-71  (71)
257 cd00730 rubredoxin Rubredoxin;  32.8      19  0.0004   21.6   0.4   37  214-251     2-42  (50)
258 PF01428 zf-AN1:  AN1-like Zinc  32.4      18 0.00039   20.8   0.3   14  212-225    12-25  (43)
259 COG1327 Predicted transcriptio  32.2      10 0.00022   28.6  -0.9   14  120-133    29-42  (156)
260 PLN00104 MYST -like histone ac  32.0      25 0.00055   32.0   1.4   28  240-267   195-222 (450)
261 KOG0717|consensus               31.8      26 0.00055   31.8   1.3   22  244-265   293-314 (508)
262 KOG0978|consensus               31.8      17 0.00037   35.0   0.3   49   90-138   642-697 (698)
263 PF04423 Rad50_zn_hook:  Rad50   31.6      14  0.0003   22.5  -0.2   13  245-257    22-34  (54)
264 PF11672 DUF3268:  Protein of u  31.2      18 0.00039   25.4   0.3    7  244-250    32-38  (102)
265 PRK05654 acetyl-CoA carboxylas  31.0      23 0.00049   30.5   0.9   34  184-226    26-59  (292)
266 PHA02998 RNA polymerase subuni  30.6     8.2 0.00018   29.9  -1.6   12  214-225   172-183 (195)
267 PLN02294 cytochrome c oxidase   30.5      26 0.00057   27.1   1.0   15  211-225   139-153 (174)
268 PF04216 FdhE:  Protein involve  30.5      11 0.00025   32.3  -1.0    9  186-194   173-181 (290)
269 PRK00420 hypothetical protein;  30.4      25 0.00054   25.2   0.9   14  184-197    39-52  (112)
270 KOG2747|consensus               30.2      21 0.00045   31.8   0.5   29  241-269   156-184 (396)
271 TIGR00310 ZPR1_znf ZPR1 zinc f  29.9       6 0.00013   31.6  -2.6   11  244-254    31-41  (192)
272 KOG3362|consensus               29.9      20 0.00043   26.7   0.3   24  242-265   128-151 (156)
273 KOG0717|consensus               29.8      32  0.0007   31.2   1.6   22  214-235   293-314 (508)
274 PF10537 WAC_Acf1_DNA_bd:  ATP-  29.8      80  0.0017   22.3   3.3   36  213-251     3-38  (102)
275 PF11931 DUF3449:  Domain of un  29.7      18 0.00038   28.9   0.0   27  208-234    96-123 (196)
276 PF08790 zf-LYAR:  LYAR-type C2  29.7      11 0.00024   19.4  -0.7   19   92-111     1-19  (28)
277 PF08882 Acetone_carb_G:  Aceto  29.5      17 0.00037   25.8  -0.1   37   91-128    24-83  (112)
278 KOG1842|consensus               29.4      28 0.00061   31.4   1.2   29  243-271    15-43  (505)
279 KOG0320|consensus               29.1      34 0.00074   26.7   1.4   15  184-198   166-180 (187)
280 PF03145 Sina:  Seven in absent  28.4      35 0.00077   27.4   1.6   46  219-269    24-73  (198)
281 PF11789 zf-Nse:  Zinc-finger o  28.3      51  0.0011   20.3   1.9   32   89-124    22-53  (57)
282 KOG4727|consensus               27.9      30 0.00066   26.6   1.0   22  243-264    75-96  (193)
283 COG3677 Transposase and inacti  27.8      29 0.00064   25.7   0.9   47  201-256    20-66  (129)
284 PF06397 Desulfoferrod_N:  Desu  27.2      23 0.00049   19.6   0.2   13   90-102     5-17  (36)
285 PF10122 Mu-like_Com:  Mu-like   27.2      22 0.00047   21.3   0.1    9  244-252    25-33  (51)
286 PTZ00448 hypothetical protein;  27.2      45 0.00097   29.5   2.0   25  243-267   314-338 (373)
287 PF03811 Zn_Tnp_IS1:  InsA N-te  26.9      14  0.0003   20.4  -0.7   13  207-219    23-35  (36)
288 COG3364 Zn-ribbon containing p  26.3      38 0.00082   23.6   1.1   27   91-127     2-28  (112)
289 KOG0782|consensus               26.1      11 0.00024   34.9  -1.9   53  198-257   238-290 (1004)
290 TIGR00627 tfb4 transcription f  25.7      52  0.0011   28.1   2.1   25  213-253   255-279 (279)
291 PF07800 DUF1644:  Protein of u  25.5 1.7E+02  0.0038   22.5   4.5   55   90-145    79-134 (162)
292 PF14369 zf-RING_3:  zinc-finge  25.2      51  0.0011   18.0   1.3   10  245-254    23-32  (35)
293 PF11023 DUF2614:  Protein of u  25.1      43 0.00094   23.9   1.3   12  212-223    68-79  (114)
294 TIGR00416 sms DNA repair prote  24.7      42 0.00092   31.0   1.5   25  184-222     6-30  (454)
295 KOG3002|consensus               24.6      86  0.0019   27.1   3.2   79  184-269    79-165 (299)
296 PRK06260 threonine synthase; V  24.4      51  0.0011   29.9   2.0   10  186-195     4-13  (397)
297 PF09332 Mcm10:  Mcm10 replicat  24.4      18  0.0004   31.7  -0.8   12  213-224   285-296 (344)
298 PRK11823 DNA repair protein Ra  24.3      43 0.00093   30.9   1.5   25  184-222     6-30  (446)
299 PF02748 PyrI_C:  Aspartate car  24.0      32  0.0007   20.8   0.4   19   86-104    30-48  (52)
300 PF02591 DUF164:  Putative zinc  24.0      62  0.0013   19.7   1.7   31  187-221    24-54  (56)
301 KOG4317|consensus               23.7      21 0.00045   30.5  -0.6   21  184-204    18-38  (383)
302 PF13639 zf-RING_2:  Ring finge  23.7      88  0.0019   17.7   2.3   15  211-226    14-28  (44)
303 PF03145 Sina:  Seven in absent  23.5      56  0.0012   26.2   1.9   52   91-145    14-73  (198)
304 KOG1994|consensus               23.5      44 0.00096   27.1   1.2   22  212-233   238-259 (268)
305 KOG4727|consensus               23.3      61  0.0013   25.1   1.8   25   88-112    72-96  (193)
306 PF01286 XPA_N:  XPA protein N-  23.3      24 0.00052   19.2  -0.2    9  216-224     6-14  (34)
307 PRK05978 hypothetical protein;  23.3      32 0.00069   26.2   0.3   10  245-254    54-63  (148)
308 COG4338 Uncharacterized protei  22.9      35 0.00076   20.0   0.4   16  245-260    14-29  (54)
309 PF04641 Rtf2:  Rtf2 RING-finge  22.8      74  0.0016   26.9   2.5   51  184-256   112-163 (260)
310 PF14447 Prok-RING_4:  Prokaryo  22.5      48  0.0011   20.3   1.0   12  244-255    40-51  (55)
311 PF11781 RRN7:  RNA polymerase   22.5      41  0.0009   18.5   0.6    7  188-194    11-17  (36)
312 COG4391 Uncharacterized protei  22.1      42 0.00091   21.0   0.6   15   88-102    45-59  (62)
313 PLN03238 probable histone acet  22.0      80  0.0017   26.9   2.5   26  117-142    46-71  (290)
314 PF14255 Cys_rich_CPXG:  Cystei  21.9      40 0.00087   20.4   0.5   10  245-254     2-11  (52)
315 KOG3214|consensus               21.7      30 0.00066   23.9  -0.0   16  242-257    46-61  (109)
316 PTZ00448 hypothetical protein;  21.6      62  0.0014   28.6   1.8   23  213-235   314-336 (373)
317 KOG4602|consensus               21.5      52  0.0011   27.2   1.3   66  184-266   232-304 (318)
318 COG5027 SAS2 Histone acetyltra  21.3      35 0.00077   29.7   0.3   30  240-269   155-184 (395)
319 PLN03239 histone acetyltransfe  21.2      82  0.0018   27.7   2.4   21  213-233   106-126 (351)
320 cd01121 Sms Sms (bacterial rad  21.0      55  0.0012   29.4   1.4    8  185-192    14-21  (372)
321 PRK12860 transcriptional activ  20.7      67  0.0015   25.5   1.7   48  192-250   114-161 (189)
322 PF09082 DUF1922:  Domain of un  20.6      23 0.00049   22.8  -0.7   13  240-253    17-29  (68)
323 PF14787 zf-CCHC_5:  GAG-polypr  20.3      54  0.0012   18.0   0.8   14  245-258     4-17  (36)
324 KOG1729|consensus               20.2      39 0.00085   28.9   0.3    7  187-193   170-176 (288)

No 1  
>KOG2462|consensus
Probab=100.00  E-value=4.9e-34  Score=226.21  Aligned_cols=137  Identities=34%  Similarity=0.645  Sum_probs=129.3

Q ss_pred             CCCCcccccccccccCChHHHHHHHHhccC---CCCeecCCCCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHHH
Q psy5202          87 QEEKNKECEFCGKKFKYENTLQAHRRNHTG---EKPFKCTVCDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAA  163 (343)
Q Consensus        87 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~---~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~  163 (343)
                      .....|+|..||+.|.+..+|.+|.++|..   .+.+.|++|++.|.+...|+.|+++|.                    
T Consensus       126 ~~~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~--------------------  185 (279)
T KOG2462|consen  126 AKHPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT--------------------  185 (279)
T ss_pred             ccCCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC--------------------
Confidence            355679999999999999999999999965   577999999999999999999999996                    


Q ss_pred             HHHHHhhhcCCCCcccccCCCCcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcc
Q psy5202         164 AAAAALNLCLPNSHMKKESRRNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDV  243 (343)
Q Consensus       164 ~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~  243 (343)
                                          -++.|.+||+.|...+.|+.|+|+|+|||||.|+.|+++|..+++|+.||++|.  +.++
T Consensus       186 --------------------l~c~C~iCGKaFSRPWLLQGHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS--~~K~  243 (279)
T KOG2462|consen  186 --------------------LPCECGICGKAFSRPWLLQGHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHS--DVKK  243 (279)
T ss_pred             --------------------CCcccccccccccchHHhhcccccccCCCCccCCcccchhcchHHHHHHHHhhc--CCcc
Confidence                                578999999999999999999999999999999999999999999999999997  7789


Q ss_pred             cccCCCCCCCCChHHHHHHHhh
Q psy5202         244 YRCRFCEMPFSVPSTLEKHMRK  265 (343)
Q Consensus       244 ~~C~~C~~~f~~~~~l~~H~~~  265 (343)
                      |+|..|+|+|..++.|.+|...
T Consensus       244 ~qC~~C~KsFsl~SyLnKH~ES  265 (279)
T KOG2462|consen  244 HQCPRCGKSFALKSYLNKHSES  265 (279)
T ss_pred             ccCcchhhHHHHHHHHHHhhhh
Confidence            9999999999999999999874


No 2  
>KOG2462|consensus
Probab=99.93  E-value=1.5e-26  Score=183.86  Aligned_cols=126  Identities=28%  Similarity=0.575  Sum_probs=114.9

Q ss_pred             CCCCeecCCCCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHHHHHHHHhhhcCCCCcccccCCCCcccccCcccc
Q psy5202         116 GEKPFKCTVCDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAAAAAAALNLCLPNSHMKKESRRNDTCEFCGKVF  195 (343)
Q Consensus       116 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f  195 (343)
                      ....|+|+.|++.+.+..+|.+|.++|....                                   ..+.+.|.+|+++|
T Consensus       127 ~~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~-----------------------------------s~ka~~C~~C~K~Y  171 (279)
T KOG2462|consen  127 KHPRYKCPECGKSYSTSSNLSRHKQTHRSLD-----------------------------------SKKAFSCKYCGKVY  171 (279)
T ss_pred             cCCceeccccccccccccccchhhccccccc-----------------------------------ccccccCCCCCcee
Confidence            3456999999999999999999999997532                                   12789999999999


Q ss_pred             cCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChHHHHHHHhhhcCccccccc
Q psy5202         196 KNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPSTLEKHMRKCVVNQNMKME  275 (343)
Q Consensus       196 ~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~  275 (343)
                      .....|..|+++|+  -+++|.+||+.|.+.+.|+.|+|+|+  ||+||.|+.|+|.|..+++|+.||++|...+.+.|.
T Consensus       172 vSmpALkMHirTH~--l~c~C~iCGKaFSRPWLLQGHiRTHT--GEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~  247 (279)
T KOG2462|consen  172 VSMPALKMHIRTHT--LPCECGICGKAFSRPWLLQGHIRTHT--GEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCP  247 (279)
T ss_pred             eehHHHhhHhhccC--CCcccccccccccchHHhhccccccc--CCCCccCCcccchhcchHHHHHHHHhhcCCccccCc
Confidence            99999999999997  68999999999999999999999997  999999999999999999999999999999998887


Q ss_pred             chhHH
Q psy5202         276 SHNIM  280 (343)
Q Consensus       276 ~~~~~  280 (343)
                      .+...
T Consensus       248 ~C~Ks  252 (279)
T KOG2462|consen  248 RCGKS  252 (279)
T ss_pred             chhhH
Confidence            66544


No 3  
>KOG1074|consensus
Probab=99.88  E-value=9.6e-23  Score=183.56  Aligned_cols=90  Identities=31%  Similarity=0.626  Sum_probs=79.1

Q ss_pred             CCcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcC--CCCcccccC---CCCCCCCChHH
Q psy5202         184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGR--LGKDVYRCR---FCEMPFSVPST  258 (343)
Q Consensus       184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~--~~~~~~~C~---~C~~~f~~~~~  258 (343)
                      .+-.|-+|-++...++.|+-|.++|+||+||+|.+||+.|.++.+|+.||-+|..  .-.-.|.|+   +|-+.|...-.
T Consensus       604 dPNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~  683 (958)
T KOG1074|consen  604 DPNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVT  683 (958)
T ss_pred             CccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhccccccccc
Confidence            4667999999999999999999999999999999999999999999999999851  011348899   99999999999


Q ss_pred             HHHHHhhhcCccccc
Q psy5202         259 LEKHMRKCVVNQNMK  273 (343)
Q Consensus       259 l~~H~~~~~~~~~~~  273 (343)
                      |..|+|.|-+.....
T Consensus       684 lpQhIriH~~~~~s~  698 (958)
T KOG1074|consen  684 LPQHIRIHLGGQISN  698 (958)
T ss_pred             ccceEEeecCCCCCC
Confidence            999999988655433


No 4  
>KOG1074|consensus
Probab=99.84  E-value=3.6e-22  Score=179.92  Aligned_cols=177  Identities=27%  Similarity=0.510  Sum_probs=129.6

Q ss_pred             CCcccccccccccCChHHHHHHHHhccCCCCeecCCCCccCCCHHHHHHHHHHhccCCCCccccCCC-CcchHHHHHHHH
Q psy5202          89 EKNKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHACSQSAKLKKHMKIHDEDGDKEDREEGP-SASSLAAAAAAA  167 (343)
Q Consensus        89 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~-~~~~~~~~~~~~  167 (343)
                      -.+-+|-+|-++...+..|+.|.++|+|++||+|.+||+.|.++.+|+.||-.|....+.......+ .+..........
T Consensus       603 TdPNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V  682 (958)
T KOG1074|consen  603 TDPNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAV  682 (958)
T ss_pred             CCccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhcccccccc
Confidence            4568999999999999999999999999999999999999999999999999987654332111110 000000000000


Q ss_pred             HhhhcCCCCcccc--------------cCCCCcccccCcccccCHHHHHHHHHHccCC----------------------
Q psy5202         168 ALNLCLPNSHMKK--------------ESRRNDTCEFCGKVFKNCSNLTVHRRSHTGE----------------------  211 (343)
Q Consensus       168 ~~~~~~~~~~~~~--------------~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~----------------------  211 (343)
                           ....+++.              .....-+|..|.+.|.....+..++..|.+.                      
T Consensus       683 -----~lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~~~tp~  757 (958)
T KOG1074|consen  683 -----TLPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEELDVTPP  757 (958)
T ss_pred             -----cccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCcccccccccccccccCCC
Confidence                 01111111              1112345777888887777777776655210                      


Q ss_pred             --------------------------------------------------------------------------------
Q psy5202         212 --------------------------------------------------------------------------------  211 (343)
Q Consensus       212 --------------------------------------------------------------------------------  211 (343)
                                                                                                      
T Consensus       758 ~~e~~~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l  837 (958)
T KOG1074|consen  758 PPENSCGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSML  837 (958)
T ss_pred             ccccccccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhccccccc
Confidence                                                                                            


Q ss_pred             ----------------------------------------CceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCC
Q psy5202         212 ----------------------------------------KPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEM  251 (343)
Q Consensus       212 ----------------------------------------~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~  251 (343)
                                                              ....|.+||+.|...+.|+.|+++|.  ++++|.|.+|++
T Consensus       838 ~eg~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHt--g~KPF~C~fC~~  915 (958)
T KOG1074|consen  838 NEGLATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHT--GPKPFFCHFCEE  915 (958)
T ss_pred             ccccccccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCC--CCCCccchhhhh
Confidence                                                    12579999999999999999999997  899999999999


Q ss_pred             CCCChHHHHHHHhhhcCcccc
Q psy5202         252 PFSVPSTLEKHMRKCVVNQNM  272 (343)
Q Consensus       252 ~f~~~~~l~~H~~~~~~~~~~  272 (343)
                      .|..+.+|+.||.+|+..+..
T Consensus       916 aFttrgnLKvHMgtH~w~q~~  936 (958)
T KOG1074|consen  916 AFTTRGNLKVHMGTHMWVQPP  936 (958)
T ss_pred             hhhhhhhhhhhhccccccCCC
Confidence            999999999999999876654


No 5  
>KOG3608|consensus
Probab=99.83  E-value=3.6e-21  Score=157.24  Aligned_cols=166  Identities=25%  Similarity=0.499  Sum_probs=105.5

Q ss_pred             cccccc--cccccCChHHHHHHHHhccCCCCeecCCCCccCCCHHHHHHHHHHhccCC--CCccccCCCCcchHHHHHHH
Q psy5202          91 NKECEF--CGKKFKYENTLQAHRRNHTGEKPFKCTVCDHACSQSAKLKKHMKIHDEDG--DKEDREEGPSASSLAAAAAA  166 (343)
Q Consensus        91 ~~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~--~~~~~~~~~~~~~~~~~~~~  166 (343)
                      .+.|..  |-+.|.++..|++|++.|.+++...|+.||..|.++..|..|++......  .+.+..+...+.........
T Consensus       177 v~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~H  256 (467)
T KOG3608|consen  177 VTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSH  256 (467)
T ss_pred             eeeccchhhhhhhccHHHHHHHHHhcCCCeEEecchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHH
Confidence            356654  88999999999999999999999999999999999999999987665322  22232222222221111111


Q ss_pred             HHhhhcCCCCcccccCCCCcccccCcccccCHHHHHHHHHH-ccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccc
Q psy5202         167 AALNLCLPNSHMKKESRRNDTCEFCGKVFKNCSNLTVHRRS-HTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYR  245 (343)
Q Consensus       167 ~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~-h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~  245 (343)
                      ..       +|     -..|+|++|+.+....+.|..|++. |...+||+|..|.+.|.+.+.|.+|..+|.   +..|+
T Consensus       257 v~-------rH-----vn~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~~HS---~~~y~  321 (467)
T KOG3608|consen  257 VV-------RH-----VNCYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQVHS---KTVYQ  321 (467)
T ss_pred             HH-------Hh-----hhcccccccccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHHHHHHHhcc---cccee
Confidence            10       11     1456677777777777777777664 566667777777777777777777766663   45566


Q ss_pred             cCC--CCCCCCChHHHHHHHhhhcCccc
Q psy5202         246 CRF--CEMPFSVPSTLEKHMRKCVVNQN  271 (343)
Q Consensus       246 C~~--C~~~f~~~~~l~~H~~~~~~~~~  271 (343)
                      |..  |.+.|.+...|++|++.+|.+.+
T Consensus       322 C~h~~C~~s~r~~~q~~~H~~evhEg~n  349 (467)
T KOG3608|consen  322 CEHPDCHYSVRTYTQMRRHFLEVHEGNN  349 (467)
T ss_pred             cCCCCCcHHHHHHHHHHHHHHHhccCCC
Confidence            655  66666666666666665554433


No 6  
>KOG3608|consensus
Probab=99.82  E-value=2.7e-20  Score=152.12  Aligned_cols=174  Identities=24%  Similarity=0.404  Sum_probs=131.9

Q ss_pred             CCCCCCcccccccccccCChHHHHHHHHhcc--CCCCeecCCCCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHH
Q psy5202          85 TSQEEKNKECEFCGKKFKYENTLQAHRRNHT--GEKPFKCTVCDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAA  162 (343)
Q Consensus        85 ~~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~--~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~  162 (343)
                      .+.++|...|+.||..|.++..|-.|++.-+  ...+|+|..|.+.|.++..|..|+..|-..  +.+..+.-.......
T Consensus       201 ~Hs~eKvvACp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHvn~--ykCplCdmtc~~~ss  278 (467)
T KOG3608|consen  201 THSNEKVVACPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHVNC--YKCPLCDMTCSSASS  278 (467)
T ss_pred             hcCCCeEEecchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhhhc--ccccccccCCCChHH
Confidence            4567788888888888888888888876533  356788888888888888888888877532  333333322222211


Q ss_pred             HHHHHHhhhcCCCCcccccCCCCcccccCcccccCHHHHHHHHHHccCCCceecCc--CcccccChHHHHHHHHHhc-CC
Q psy5202         163 AAAAAALNLCLPNSHMKKESRRNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCEL--CSYACAQSSKLTRHMKTHG-RL  239 (343)
Q Consensus       163 ~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~--C~~~f~~~~~L~~H~~~h~-~~  239 (343)
                      ...      .+.-+|   ...++|+|+.|++.|.+.+.|.+|+.+|. +..|+|..  |.+.|.+...|++|++.++ +.
T Consensus       279 L~~------H~r~rH---s~dkpfKCd~Cd~~c~~esdL~kH~~~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~  348 (467)
T KOG3608|consen  279 LTT------HIRYRH---SKDKPFKCDECDTRCVRESDLAKHVQVHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGN  348 (467)
T ss_pred             HHH------HHHhhh---ccCCCccccchhhhhccHHHHHHHHHhcc-ccceecCCCCCcHHHHHHHHHHHHHHHhccCC
Confidence            111      111111   12489999999999999999999999988 77899988  9999999999999998766 55


Q ss_pred             CCcccccCCCCCCCCChHHHHHHHhhhcCcc
Q psy5202         240 GKDVYRCRFCEMPFSVPSTLEKHMRKCVVNQ  270 (343)
Q Consensus       240 ~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~  270 (343)
                      ...+|.|..|++.|.+-.+|..|+++.|+-+
T Consensus       349 np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f~  379 (467)
T KOG3608|consen  349 NPILYACHCCDRFFTSGKSLSAHLMKKHGFR  379 (467)
T ss_pred             CCCceeeecchhhhccchhHHHHHHHhhccc
Confidence            6678999999999999999999998877533


No 7  
>KOG3576|consensus
Probab=99.81  E-value=8.7e-20  Score=137.89  Aligned_cols=87  Identities=32%  Similarity=0.601  Sum_probs=57.2

Q ss_pred             cccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhc---------CCCCcccccCCCCCCCCCh
Q psy5202         186 DTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHG---------RLGKDVYRCRFCEMPFSVP  256 (343)
Q Consensus       186 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~---------~~~~~~~~C~~C~~~f~~~  256 (343)
                      +.|..||+.|.+...|++|+++|+|.+||+|..|++.|..+-.|..|++..|         +...+.|.|+.||++-...
T Consensus       146 ~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl~kvhgv~~~yaykerr~kl~vcedcg~t~~~~  225 (267)
T KOG3576|consen  146 HLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHLKKVHGVQHQYAYKERRAKLYVCEDCGYTSERP  225 (267)
T ss_pred             HHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHHHHHcCchHHHHHHHhhhheeeecccCCCCCCh
Confidence            3455555555555555555555555566666666666665555555554322         1245778999999999999


Q ss_pred             HHHHHHHhhhcCcccc
Q psy5202         257 STLEKHMRKCVVNQNM  272 (343)
Q Consensus       257 ~~l~~H~~~~~~~~~~  272 (343)
                      ..+..|++.||.....
T Consensus       226 e~~~~h~~~~hp~Spa  241 (267)
T KOG3576|consen  226 EVYYLHLKLHHPFSPA  241 (267)
T ss_pred             hHHHHHHHhcCCCCHH
Confidence            9999999998877644


No 8  
>KOG3576|consensus
Probab=99.81  E-value=1e-20  Score=142.87  Aligned_cols=110  Identities=31%  Similarity=0.645  Sum_probs=98.7

Q ss_pred             CCCCcccccccccccCChHHHHHHHHhccCCCCee---cCCCCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHHH
Q psy5202          87 QEEKNKECEFCGKKFKYENTLQAHRRNHTGEKPFK---CTVCDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAA  163 (343)
Q Consensus        87 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~---C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~  163 (343)
                      .+...|.|.+|++.|.-...|.+|++-|..   .+   |..||+.|..-..|++|+++|++.                  
T Consensus       113 sd~d~ftCrvCgK~F~lQRmlnrh~kch~~---vkr~lct~cgkgfndtfdlkrh~rthtgv------------------  171 (267)
T KOG3576|consen  113 SDQDSFTCRVCGKKFGLQRMLNRHLKCHSD---VKRHLCTFCGKGFNDTFDLKRHTRTHTGV------------------  171 (267)
T ss_pred             CCCCeeeeehhhhhhhHHHHHHHHhhhccH---HHHHHHhhccCcccchhhhhhhhccccCc------------------
Confidence            345679999999999999999999999853   44   899999999999999999999987                  


Q ss_pred             HHHHHhhhcCCCCcccccCCCCcccccCcccccCHHHHHHHHHHccC-----------CCceecCcCcccccChHHHHHH
Q psy5202         164 AAAAALNLCLPNSHMKKESRRNDTCEFCGKVFKNCSNLTVHRRSHTG-----------EKPYKCELCSYACAQSSKLTRH  232 (343)
Q Consensus       164 ~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~-----------~~~~~C~~C~~~f~~~~~L~~H  232 (343)
                                          ++|+|..|++.|..+..|..|++.-+|           .+.|.|..||++-.....+..|
T Consensus       172 --------------------rpykc~~c~kaftqrcsleshl~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h  231 (267)
T KOG3576|consen  172 --------------------RPYKCSLCEKAFTQRCSLESHLKKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLH  231 (267)
T ss_pred             --------------------cccchhhhhHHHHhhccHHHHHHHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHH
Confidence                                899999999999999999999876444           3569999999999999999999


Q ss_pred             HHHhc
Q psy5202         233 MKTHG  237 (343)
Q Consensus       233 ~~~h~  237 (343)
                      ++.|+
T Consensus       232 ~~~~h  236 (267)
T KOG3576|consen  232 LKLHH  236 (267)
T ss_pred             HHhcC
Confidence            99987


No 9  
>KOG3623|consensus
Probab=99.69  E-value=1.1e-17  Score=148.89  Aligned_cols=122  Identities=29%  Similarity=0.601  Sum_probs=99.0

Q ss_pred             CCcccccccccccCChHHHHHHHHhccC--CCCeecCCCCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHHHHHH
Q psy5202          89 EKNKECEFCGKKFKYENTLQAHRRNHTG--EKPFKCTVCDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAAAAA  166 (343)
Q Consensus        89 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~--~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~  166 (343)
                      .....|++|.+.+.....|+.|++.-+.  +..|.|..|.++|..+..|.+||..|.........               
T Consensus       208 sqlltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~s---------------  272 (1007)
T KOG3623|consen  208 SQLLTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAIS---------------  272 (1007)
T ss_pred             hhhhcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCccccc---------------
Confidence            4457899999999999999999885332  45699999999999999999999999754322100               


Q ss_pred             HHhhhcCCCCcccccCCCCcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHH
Q psy5202         167 AALNLCLPNSHMKKESRRNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKT  235 (343)
Q Consensus       167 ~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~  235 (343)
                                .......+.|+|..||+.|+.+-.|+.|+|+|.|+|||.|+.|+++|.....+-.||..
T Consensus       273 ----------ltqsa~lRKFKCtECgKAFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmSS  331 (1007)
T KOG3623|consen  273 ----------LTQSALLRKFKCTECGKAFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMSS  331 (1007)
T ss_pred             ----------ccchhhhccccccccchhhhhHHHHHhhheeecCCCCcCCcccccccccCCcccccccc
Confidence                      00111237899999999999999999999999999999999999999999999888754


No 10 
>KOG3623|consensus
Probab=99.67  E-value=1.2e-17  Score=148.77  Aligned_cols=107  Identities=27%  Similarity=0.587  Sum_probs=98.3

Q ss_pred             eecCCCCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHHHHHHHHhhhcCCCCcccccCCCCcccccCcccccCHH
Q psy5202         120 FKCTVCDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAAAAAAALNLCLPNSHMKKESRRNDTCEFCGKVFKNCS  199 (343)
Q Consensus       120 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~  199 (343)
                      ..|++|++.+.....|+.|++..|...                                    +..|.|.+|..+|..+.
T Consensus       211 ltcpycdrgykrltslkeHikyrhekn------------------------------------e~nfsC~lCsytFAyRt  254 (1007)
T KOG3623|consen  211 LTCPYCDRGYKRLTSLKEHIKYRHEKN------------------------------------EPNFSCMLCSYTFAYRT  254 (1007)
T ss_pred             hcchhHHHHHHHHHHHHHHHHHHHhhC------------------------------------CCCCcchhhhhhhhhHH
Confidence            789999999999999999999877542                                    26789999999999999


Q ss_pred             HHHHHHHHccC-------------CCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChHHHHHHHh
Q psy5202         200 NLTVHRRSHTG-------------EKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPSTLEKHMR  264 (343)
Q Consensus       200 ~l~~H~~~h~~-------------~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~  264 (343)
                      .|.+|+.+|..             .+.|+|..||++|..+..|+.|+|+|.  |++||.|+.|+|+|+....+-.||-
T Consensus       255 QLErhm~~hkpg~dqa~sltqsa~lRKFKCtECgKAFKfKHHLKEHlRIHS--GEKPfeCpnCkKRFSHSGSySSHmS  330 (1007)
T KOG3623|consen  255 QLERHMQLHKPGGDQAISLTQSALLRKFKCTECGKAFKFKHHLKEHLRIHS--GEKPFECPNCKKRFSHSGSYSSHMS  330 (1007)
T ss_pred             HHHHHHHhhcCCCcccccccchhhhccccccccchhhhhHHHHHhhheeec--CCCCcCCcccccccccCCccccccc
Confidence            99999999853             356999999999999999999999997  9999999999999999999999995


No 11 
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.50  E-value=3.2e-14  Score=127.83  Aligned_cols=129  Identities=19%  Similarity=0.418  Sum_probs=98.4

Q ss_pred             ccccccccccCChHHHHHHHHhccC-------------------CCCeecCCCCccCCCHHHHHHHHHHhccCCCCcccc
Q psy5202          92 KECEFCGKKFKYENTLQAHRRNHTG-------------------EKPFKCTVCDHACSQSAKLKKHMKIHDEDGDKEDRE  152 (343)
Q Consensus        92 ~~C~~C~~~f~~~~~L~~H~~~h~~-------------------~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~  152 (343)
                      -.|+.|.+... ...|..|......                   ...+.|+.|++.|. ...|..|++.+|         
T Consensus       408 V~C~NC~~~i~-l~~l~lHe~~C~r~~V~Cp~~~Cg~v~~r~el~~H~~C~~Cgk~f~-~s~LekH~~~~H---------  476 (567)
T PLN03086        408 VECRNCKHYIP-SRSIALHEAYCSRHNVVCPHDGCGIVLRVEEAKNHVHCEKCGQAFQ-QGEMEKHMKVFH---------  476 (567)
T ss_pred             EECCCCCCccc-hhHHHHHHhhCCCcceeCCcccccceeeccccccCccCCCCCCccc-hHHHHHHHHhcC---------
Confidence            46888888766 3456677654321                   24467888888885 677888888875         


Q ss_pred             CCCCcchHHHHHHHHHhhhcCCCCcccccCCCCcccccCcccccCHHHHHHHHHHccCCCceecCcCccccc--------
Q psy5202         153 EGPSASSLAAAAAAAALNLCLPNSHMKKESRRNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACA--------  224 (343)
Q Consensus       153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~--------  224 (343)
                                                     .++.|+ ||+.+ .+..|..|+..|..++++.|.+|++.|.        
T Consensus       477 -------------------------------kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~  523 (567)
T PLN03086        477 -------------------------------EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDV  523 (567)
T ss_pred             -------------------------------CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccch
Confidence                                           568898 99755 6688999999999999999999999885        


Q ss_pred             --ChHHHHHHHHHhcCCCCcccccCCCCCCCCChHHHHHHHhhhcC
Q psy5202         225 --QSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPSTLEKHMRKCVV  268 (343)
Q Consensus       225 --~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~  268 (343)
                        ....|..|...+   |.+++.|..||+.|..+ .|..|+..+|.
T Consensus       524 ~d~~s~Lt~HE~~C---G~rt~~C~~Cgk~Vrlr-dm~~H~~~~h~  565 (567)
T PLN03086        524 RDRLRGMSEHESIC---GSRTAPCDSCGRSVMLK-EMDIHQIAVHQ  565 (567)
T ss_pred             hhhhhhHHHHHHhc---CCcceEccccCCeeeeh-hHHHHHHHhhc
Confidence              245788998886   78889999999887554 67888887774


No 12 
>KOG3993|consensus
Probab=99.46  E-value=7.8e-14  Score=117.36  Aligned_cols=174  Identities=19%  Similarity=0.328  Sum_probs=116.7

Q ss_pred             cccccccccccCChHHHHHHHHhccCCCCeecCCCCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHHHHHHHHhh
Q psy5202          91 NKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAAAAAAALN  170 (343)
Q Consensus        91 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~  170 (343)
                      -|.|..|...|.+...|.+|...-.-...|+|+.|++.|....+|..|+++|............+.........  ....
T Consensus       267 dyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~ra--e~~e  344 (500)
T KOG3993|consen  267 DYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRA--EVQE  344 (500)
T ss_pred             HHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhh--hhhh
Confidence            49999999999999999999765444456999999999999999999999997654332111111000000000  0000


Q ss_pred             hcCCCCcccccCCCCcccccCcccccCHHHHHHHHHHccCCC--------------------------------------
Q psy5202         171 LCLPNSHMKKESRRNDTCEFCGKVFKNCSNLTVHRRSHTGEK--------------------------------------  212 (343)
Q Consensus       171 ~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~--------------------------------------  212 (343)
                      ..  . .-......-|.|.+|++.|.....|+.|+.+|+...                                      
T Consensus       345 a~--r-sg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~v  421 (500)
T KOG3993|consen  345 AE--R-SGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEV  421 (500)
T ss_pred             cc--c-cCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccce
Confidence            00  0 000112357999999999999999999988875310                                      


Q ss_pred             --------ceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChHHHHHHHhhhcCccc
Q psy5202         213 --------PYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPSTLEKHMRKCVVNQN  271 (343)
Q Consensus       213 --------~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~  271 (343)
                              -..|++|+--+.++..--.|.+.-.  .+..|.|.+|.-.|.+...|.+|+.++|....
T Consensus       422 l~~a~sael~~pp~~~~ppsss~~sgg~~rlg~--~~q~f~~ky~~atfyss~~ltrhin~~Hpse~  486 (500)
T KOG3993|consen  422 LYVAGSAELELPPYDGSPPSSSGSSGGYGRLGI--AEQGFTCKYCPATFYSSPGLTRHINKCHPSEL  486 (500)
T ss_pred             eeeeccccccCCCCCCCCcccCCCCCccccccc--hhhccccccchHhhhcCcchHhHhhhcChHHh
Confidence                    1235556655555444444433322  35569999999999999999999999996553


No 13 
>PHA00733 hypothetical protein
Probab=99.30  E-value=2.6e-12  Score=94.45  Aligned_cols=82  Identities=18%  Similarity=0.273  Sum_probs=70.6

Q ss_pred             CCCcccccCcccccCHHHHHHH--HH---HccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChH
Q psy5202         183 RRNDTCEFCGKVFKNCSNLTVH--RR---SHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPS  257 (343)
Q Consensus       183 ~~~~~C~~C~~~f~~~~~l~~H--~~---~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~  257 (343)
                      ++.+.|.+|.+.|.....|..|  ++   .+++.++|.|+.|++.|.....|..|++.|.    .+|.|..|++.|....
T Consensus        38 ~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h~----~~~~C~~CgK~F~~~~  113 (128)
T PHA00733         38 QKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYTE----HSKVCPVCGKEFRNTD  113 (128)
T ss_pred             hhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcCC----cCccCCCCCCccCCHH
Confidence            3789999999999888777766  22   2456889999999999999999999999762    4699999999999999


Q ss_pred             HHHHHHhhhcC
Q psy5202         258 TLEKHMRKCVV  268 (343)
Q Consensus       258 ~l~~H~~~~~~  268 (343)
                      .|..|+++.|.
T Consensus       114 sL~~H~~~~h~  124 (128)
T PHA00733        114 STLDHVCKKHN  124 (128)
T ss_pred             HHHHHHHHhcC
Confidence            99999998874


No 14 
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.24  E-value=2.4e-11  Score=109.54  Aligned_cols=104  Identities=20%  Similarity=0.513  Sum_probs=86.9

Q ss_pred             CCcccccccccccCChHHHHHHHHhccCCCCeecCCCCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHHHHHHHH
Q psy5202          89 EKNKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAAAAAAA  168 (343)
Q Consensus        89 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~  168 (343)
                      ++.+.|+.|++.|. ...|..|+..|+  +++.|+ |++.+ .+..|..|+..|.+.                       
T Consensus       451 ~~H~~C~~Cgk~f~-~s~LekH~~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~thCp~-----------------------  502 (567)
T PLN03086        451 KNHVHCEKCGQAFQ-QGEMEKHMKVFH--EPLQCP-CGVVL-EKEQMVQHQASTCPL-----------------------  502 (567)
T ss_pred             ccCccCCCCCCccc-hHHHHHHHHhcC--CCccCC-CCCCc-chhHHHhhhhccCCC-----------------------
Confidence            45689999999996 688999999986  789999 99755 678999999888766                       


Q ss_pred             hhhcCCCCcccccCCCCcccccCccccc----------CHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhc
Q psy5202         169 LNLCLPNSHMKKESRRNDTCEFCGKVFK----------NCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHG  237 (343)
Q Consensus       169 ~~~~~~~~~~~~~~~~~~~C~~C~~~f~----------~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~  237 (343)
                                     +++.|.+|++.|.          ....|..|...+ |.+++.|..||+.|.. ..|..|+...|
T Consensus       503 ---------------Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk~Vrl-rdm~~H~~~~h  564 (567)
T PLN03086        503 ---------------RLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDSCGRSVML-KEMDIHQIAVH  564 (567)
T ss_pred             ---------------CceeCCCCCCccccCccccchhhhhhhHHHHHHhc-CCcceEccccCCeeee-hhHHHHHHHhh
Confidence                           8899999999995          245899998885 8999999999988876 46778876543


No 15 
>PHA00733 hypothetical protein
Probab=99.18  E-value=3.1e-11  Score=88.74  Aligned_cols=52  Identities=19%  Similarity=0.358  Sum_probs=40.7

Q ss_pred             CCcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhc
Q psy5202         184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHG  237 (343)
Q Consensus       184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~  237 (343)
                      ++|.|+.|++.|.....|..|++.|  +.+|.|..|++.|.....|.+|+...|
T Consensus        72 kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL~~H~~~~h  123 (128)
T PHA00733         72 SPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDSTLDHVCKKH  123 (128)
T ss_pred             CCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHHHHHHHHhc
Confidence            6778888888888888888887765  346888888888888888888877655


No 16 
>PHA02768 hypothetical protein; Provisional
Probab=99.03  E-value=1.2e-10  Score=69.99  Aligned_cols=40  Identities=20%  Similarity=0.507  Sum_probs=19.4

Q ss_pred             cccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChH
Q psy5202         186 DTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSS  227 (343)
Q Consensus       186 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~  227 (343)
                      |.|+.||+.|.....|..|+++|+  ++|+|..|++.|...+
T Consensus         6 y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s   45 (55)
T PHA02768          6 YECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTG   45 (55)
T ss_pred             cCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccc
Confidence            445555555555555555555544  3444555555444433


No 17 
>PHA02768 hypothetical protein; Provisional
Probab=99.00  E-value=1.7e-10  Score=69.41  Aligned_cols=44  Identities=20%  Similarity=0.505  Sum_probs=39.9

Q ss_pred             cccccccccccCChHHHHHHHHhccCCCCeecCCCCccCCCHHHHH
Q psy5202          91 NKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHACSQSAKLK  136 (343)
Q Consensus        91 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~  136 (343)
                      -|+|+.||+.|.....|..|+++|.  ++|+|..|++.|...+.|.
T Consensus         5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~   48 (55)
T PHA02768          5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYI   48 (55)
T ss_pred             ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeE
Confidence            3899999999999999999999998  6899999999998777653


No 18 
>KOG3993|consensus
Probab=98.85  E-value=4.3e-10  Score=95.18  Aligned_cols=89  Identities=19%  Similarity=0.402  Sum_probs=74.8

Q ss_pred             CCcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcC-------------------------
Q psy5202         184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGR-------------------------  238 (343)
Q Consensus       184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~-------------------------  238 (343)
                      ..|.|.+|...|.+...|.+|....--..-|+|+.|++.|.-..+|..|.|.|.-                         
T Consensus       266 GdyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea  345 (500)
T KOG3993|consen  266 GDYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEA  345 (500)
T ss_pred             HHHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhc
Confidence            4689999999999999999997543333459999999999999999999999961                         


Q ss_pred             ------CCCcccccCCCCCCCCChHHHHHHHhhhcCcccc
Q psy5202         239 ------LGKDVYRCRFCEMPFSVPSTLEKHMRKCVVNQNM  272 (343)
Q Consensus       239 ------~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~  272 (343)
                            .....|.|.+|++.|.+...|++|+-+|+.....
T Consensus       346 ~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~  385 (500)
T KOG3993|consen  346 ERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLA  385 (500)
T ss_pred             cccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccc
Confidence                  1223599999999999999999999999865543


No 19 
>PF13465 zf-H2C2_2:  Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.84  E-value=2.8e-09  Score=54.77  Aligned_cols=25  Identities=56%  Similarity=1.050  Sum_probs=18.0

Q ss_pred             HHHHHHHHccCCCceecCcCccccc
Q psy5202         200 NLTVHRRSHTGEKPYKCELCSYACA  224 (343)
Q Consensus       200 ~l~~H~~~h~~~~~~~C~~C~~~f~  224 (343)
                      +|.+|+++|++++||.|++|++.|.
T Consensus         1 ~l~~H~~~H~~~k~~~C~~C~k~F~   25 (26)
T PF13465_consen    1 NLRRHMRTHTGEKPYKCPYCGKSFS   25 (26)
T ss_dssp             HHHHHHHHHSSSSSEEESSSSEEES
T ss_pred             CHHHHhhhcCCCCCCCCCCCcCeeC
Confidence            3667777777777777777777775


No 20 
>PF13465 zf-H2C2_2:  Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.66  E-value=1.6e-08  Score=51.94  Aligned_cols=26  Identities=50%  Similarity=1.007  Sum_probs=24.0

Q ss_pred             HHHHHHHhccCCCCeecCCCCccCCC
Q psy5202         106 TLQAHRRNHTGEKPFKCTVCDHACSQ  131 (343)
Q Consensus       106 ~L~~H~~~h~~~~~~~C~~C~~~f~~  131 (343)
                      +|.+|+++|.++++|.|+.|++.|.+
T Consensus         1 ~l~~H~~~H~~~k~~~C~~C~k~F~~   26 (26)
T PF13465_consen    1 NLRRHMRTHTGEKPYKCPYCGKSFSN   26 (26)
T ss_dssp             HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred             CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence            58999999999999999999999964


No 21 
>PF05605 zf-Di19:  Drought induced 19 protein (Di19), zinc-binding;  InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.60  E-value=7.1e-08  Score=59.61  Aligned_cols=51  Identities=20%  Similarity=0.417  Sum_probs=27.7

Q ss_pred             eecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChHHHHHHHhhhc
Q psy5202         214 YKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPSTLEKHMRKCV  267 (343)
Q Consensus       214 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~~  267 (343)
                      |.|++|++ ..+...|..|....|....+.+.|++|...+.  .+|..|+...|
T Consensus         3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H   53 (54)
T PF05605_consen    3 FTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH   53 (54)
T ss_pred             cCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence            55666666 34445566665554433344566666665433  25666665554


No 22 
>PHA00732 hypothetical protein
Probab=98.58  E-value=3.2e-08  Score=65.81  Aligned_cols=45  Identities=29%  Similarity=0.552  Sum_probs=24.4

Q ss_pred             cccccCcccccCHHHHHHHHHH-ccCCCceecCcCcccccChHHHHHHHHHh
Q psy5202         186 DTCEFCGKVFKNCSNLTVHRRS-HTGEKPYKCELCSYACAQSSKLTRHMKTH  236 (343)
Q Consensus       186 ~~C~~C~~~f~~~~~l~~H~~~-h~~~~~~~C~~C~~~f~~~~~L~~H~~~h  236 (343)
                      |.|..|++.|.....|..|++. |.   ++.|+.|++.|.   .|..|++.+
T Consensus         2 y~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~~~   47 (79)
T PHA00732          2 FKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFYSQ   47 (79)
T ss_pred             ccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---Chhhhhccc
Confidence            4566666666666666666553 33   235666666555   355555443


No 23 
>PHA00616 hypothetical protein
Probab=98.55  E-value=2.9e-08  Score=56.74  Aligned_cols=32  Identities=22%  Similarity=0.369  Sum_probs=16.2

Q ss_pred             cccccCcccccCHHHHHHHHHHccCCCceecC
Q psy5202         186 DTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCE  217 (343)
Q Consensus       186 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~  217 (343)
                      |+|+.||+.|.....|..|++.|+|++++.|+
T Consensus         2 YqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~   33 (44)
T PHA00616          2 YQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLE   33 (44)
T ss_pred             CccchhhHHHhhHHHHHHHHHHhcCCCcccee
Confidence            44555555555555555555555555554444


No 24 
>PHA00616 hypothetical protein
Probab=98.55  E-value=3.1e-08  Score=56.64  Aligned_cols=34  Identities=15%  Similarity=0.272  Sum_probs=29.8

Q ss_pred             cccccccccccCChHHHHHHHHhccCCCCeecCC
Q psy5202          91 NKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTV  124 (343)
Q Consensus        91 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~  124 (343)
                      +|+|..||+.|.....|.+|++.|++++++.|++
T Consensus         1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~   34 (44)
T PHA00616          1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEY   34 (44)
T ss_pred             CCccchhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence            5889999999999999999999999988888765


No 25 
>PHA00732 hypothetical protein
Probab=98.40  E-value=2e-07  Score=61.97  Aligned_cols=47  Identities=19%  Similarity=0.589  Sum_probs=39.6

Q ss_pred             ceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChHHHHHHHhhh
Q psy5202         213 PYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPSTLEKHMRKC  266 (343)
Q Consensus       213 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~  266 (343)
                      ||.|..|++.|.....|+.|++.++  .  ++.|+.||+.|.   .|..|+++.
T Consensus         1 py~C~~Cgk~F~s~s~Lk~H~r~~H--~--~~~C~~CgKsF~---~l~~H~~~~   47 (79)
T PHA00732          1 MFKCPICGFTTVTLFALKQHARRNH--T--LTKCPVCNKSYR---RLNQHFYSQ   47 (79)
T ss_pred             CccCCCCCCccCCHHHHHHHhhccc--C--CCccCCCCCEeC---Chhhhhccc
Confidence            6899999999999999999998643  1  358999999998   588898654


No 26 
>PF05605 zf-Di19:  Drought induced 19 protein (Di19), zinc-binding;  InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.34  E-value=1.1e-06  Score=54.30  Aligned_cols=23  Identities=22%  Similarity=0.485  Sum_probs=10.0

Q ss_pred             eecCCCCccCCCHHHHHHHHHHhc
Q psy5202         120 FKCTVCDHACSQSAKLKKHMKIHD  143 (343)
Q Consensus       120 ~~C~~C~~~f~~~~~L~~H~~~h~  143 (343)
                      |.|++|++ ..+...|..|....|
T Consensus         3 f~CP~C~~-~~~~~~L~~H~~~~H   25 (54)
T PF05605_consen    3 FTCPYCGK-GFSESSLVEHCEDEH   25 (54)
T ss_pred             cCCCCCCC-ccCHHHHHHHHHhHC
Confidence            44444444 233344444444433


No 27 
>PF00096 zf-C2H2:  Zinc finger, C2H2 type;  InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain.  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=98.16  E-value=8.7e-07  Score=44.21  Aligned_cols=22  Identities=41%  Similarity=0.922  Sum_probs=11.0

Q ss_pred             eecCcCcccccChHHHHHHHHH
Q psy5202         214 YKCELCSYACAQSSKLTRHMKT  235 (343)
Q Consensus       214 ~~C~~C~~~f~~~~~L~~H~~~  235 (343)
                      |+|+.|++.|.+...|..|++.
T Consensus         1 y~C~~C~~~f~~~~~l~~H~~~   22 (23)
T PF00096_consen    1 YKCPICGKSFSSKSNLKRHMRR   22 (23)
T ss_dssp             EEETTTTEEESSHHHHHHHHHH
T ss_pred             CCCCCCCCccCCHHHHHHHHhH
Confidence            3455555555555555555444


No 28 
>PF00096 zf-C2H2:  Zinc finger, C2H2 type;  InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain.  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=98.15  E-value=1.8e-06  Score=42.99  Aligned_cols=22  Identities=45%  Similarity=1.025  Sum_probs=14.5

Q ss_pred             cccCCCCCCCCChHHHHHHHhh
Q psy5202         244 YRCRFCEMPFSVPSTLEKHMRK  265 (343)
Q Consensus       244 ~~C~~C~~~f~~~~~l~~H~~~  265 (343)
                      |.|+.|++.|.++..|..|++.
T Consensus         1 y~C~~C~~~f~~~~~l~~H~~~   22 (23)
T PF00096_consen    1 YKCPICGKSFSSKSNLKRHMRR   22 (23)
T ss_dssp             EEETTTTEEESSHHHHHHHHHH
T ss_pred             CCCCCCCCccCCHHHHHHHHhH
Confidence            4566666666666666666665


No 29 
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=98.04  E-value=1.2e-06  Score=71.94  Aligned_cols=55  Identities=27%  Similarity=0.577  Sum_probs=45.4

Q ss_pred             CCceecCc--CcccccChHHHHHHHHHhc-----------------CCCCcccccCCCCCCCCChHHHHHHHhh
Q psy5202         211 EKPYKCEL--CSYACAQSSKLTRHMKTHG-----------------RLGKDVYRCRFCEMPFSVPSTLEKHMRK  265 (343)
Q Consensus       211 ~~~~~C~~--C~~~f~~~~~L~~H~~~h~-----------------~~~~~~~~C~~C~~~f~~~~~l~~H~~~  265 (343)
                      ++||+|++  |.+.+.....|+.|+..-|                 ....+||+|++|+|+|+....|+.|+..
T Consensus       347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr~H  420 (423)
T COG5189         347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHRKH  420 (423)
T ss_pred             CceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceecccc
Confidence            58999976  9999999999999976422                 1345899999999999999999999753


No 30 
>PF13894 zf-C2H2_4:  C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=98.03  E-value=2.6e-06  Score=42.91  Aligned_cols=23  Identities=35%  Similarity=0.795  Sum_probs=10.8

Q ss_pred             cccCCCCCCCCChHHHHHHHhhh
Q psy5202         244 YRCRFCEMPFSVPSTLEKHMRKC  266 (343)
Q Consensus       244 ~~C~~C~~~f~~~~~l~~H~~~~  266 (343)
                      |.|++|++.|.+...|..|+++|
T Consensus         1 ~~C~~C~~~~~~~~~l~~H~~~~   23 (24)
T PF13894_consen    1 FQCPICGKSFRSKSELRQHMRTH   23 (24)
T ss_dssp             EE-SSTS-EESSHHHHHHHHHHH
T ss_pred             CCCcCCCCcCCcHHHHHHHHHhh
Confidence            34555555555555555555544


No 31 
>KOG2231|consensus
Probab=98.01  E-value=1.5e-05  Score=73.52  Aligned_cols=71  Identities=25%  Similarity=0.538  Sum_probs=52.7

Q ss_pred             CcccccCcccccCHHHHHHHHHHccCCCceecCcC------cccccChHHHHHHHHHhcCCCCcccccC--CCC-CCCCC
Q psy5202         185 NDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELC------SYACAQSSKLTRHMKTHGRLGKDVYRCR--FCE-MPFSV  255 (343)
Q Consensus       185 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C------~~~f~~~~~L~~H~~~h~~~~~~~~~C~--~C~-~~f~~  255 (343)
                      .-.|..|...|.....|.+|++.++    |.|..|      +..|.....|..|.+.+|      |.|.  .|- +.|..
T Consensus       182 hp~C~~C~~~fld~~el~rH~~~~h----~~chfC~~~~~~neyy~~~~dLe~HfR~~H------flCE~~~C~~~~f~~  251 (669)
T KOG2231|consen  182 HPLCKFCHERFLDDDELYRHLRFDH----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH------FLCEEEFCRTKKFYV  251 (669)
T ss_pred             CccchhhhhhhccHHHHHHhhccce----eheeecCcccccchhcccchHHHHHhhhcC------ccccccccccceeee
Confidence            4579999999999999999998766    566666      345778899999998877      7787  565 34444


Q ss_pred             hHHHHHHHhh
Q psy5202         256 PSTLEKHMRK  265 (343)
Q Consensus       256 ~~~l~~H~~~  265 (343)
                      .-.+..|++.
T Consensus       252 ~~~~ei~lk~  261 (669)
T KOG2231|consen  252 AFELEIELKA  261 (669)
T ss_pred             hhHHHHHHHh
Confidence            4455555553


No 32 
>PF12756 zf-C2H2_2:  C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.95  E-value=3.9e-06  Score=59.68  Aligned_cols=74  Identities=23%  Similarity=0.502  Sum_probs=22.3

Q ss_pred             ccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChHHHHHHHhhh
Q psy5202         187 TCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPSTLEKHMRKC  266 (343)
Q Consensus       187 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~  266 (343)
                      .|.+|+..|.+...|..|+...++...   + ....+.....+..+++..   -...+.|.+|++.|.+...|..||+.+
T Consensus         1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~---~-~~~~l~~~~~~~~~~~~~---~~~~~~C~~C~~~f~s~~~l~~Hm~~~   73 (100)
T PF12756_consen    1 QCLFCDESFSSVDDLLQHMKKKHGFDI---P-DQKYLVDPNRLLNYLRKK---VKESFRCPYCNKTFRSREALQEHMRSK   73 (100)
T ss_dssp             -------------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred             Ccccccccccccccccccccccccccc---c-cccccccccccccccccc---cCCCCCCCccCCCCcCHHHHHHHHcCc
Confidence            489999999999999999977555321   1 222233445555555433   123699999999999999999999975


Q ss_pred             c
Q psy5202         267 V  267 (343)
Q Consensus       267 ~  267 (343)
                      +
T Consensus        74 ~   74 (100)
T PF12756_consen   74 H   74 (100)
T ss_dssp             T
T ss_pred             c
Confidence            3


No 33 
>PF13894 zf-C2H2_4:  C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.93  E-value=9.7e-06  Score=40.76  Aligned_cols=24  Identities=33%  Similarity=0.819  Sum_probs=15.0

Q ss_pred             eecCcCcccccChHHHHHHHHHhc
Q psy5202         214 YKCELCSYACAQSSKLTRHMKTHG  237 (343)
Q Consensus       214 ~~C~~C~~~f~~~~~L~~H~~~h~  237 (343)
                      |.|++|++.|.+...|+.|++.||
T Consensus         1 ~~C~~C~~~~~~~~~l~~H~~~~H   24 (24)
T PF13894_consen    1 FQCPICGKSFRSKSELRQHMRTHH   24 (24)
T ss_dssp             EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred             CCCcCCCCcCCcHHHHHHHHHhhC
Confidence            567777777777777777776653


No 34 
>PF12756 zf-C2H2_2:  C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.91  E-value=8.2e-06  Score=57.99  Aligned_cols=24  Identities=33%  Similarity=0.630  Sum_probs=0.0

Q ss_pred             ecCCCCccCCCHHHHHHHHHHhcc
Q psy5202         121 KCTVCDHACSQSAKLKKHMKIHDE  144 (343)
Q Consensus       121 ~C~~C~~~f~~~~~L~~H~~~h~~  144 (343)
                      +|.+|+..|.+...|..|+...|+
T Consensus         1 ~C~~C~~~f~~~~~l~~H~~~~H~   24 (100)
T PF12756_consen    1 QCLFCDESFSSVDDLLQHMKKKHG   24 (100)
T ss_dssp             ------------------------
T ss_pred             Cccccccccccccccccccccccc
Confidence            488888888888888888877665


No 35 
>PF13912 zf-C2H2_6:  C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.81  E-value=8.2e-06  Score=42.40  Aligned_cols=24  Identities=33%  Similarity=0.620  Sum_probs=13.1

Q ss_pred             cccCCCCCCCCChHHHHHHHhhhc
Q psy5202         244 YRCRFCEMPFSVPSTLEKHMRKCV  267 (343)
Q Consensus       244 ~~C~~C~~~f~~~~~l~~H~~~~~  267 (343)
                      |.|..|++.|.+...|..|++.|+
T Consensus         2 ~~C~~C~~~F~~~~~l~~H~~~h~   25 (27)
T PF13912_consen    2 FECDECGKTFSSLSALREHKRSHC   25 (27)
T ss_dssp             EEETTTTEEESSHHHHHHHHCTTT
T ss_pred             CCCCccCCccCChhHHHHHhHHhc
Confidence            455555555555555555555443


No 36 
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.76  E-value=7.2e-06  Score=67.48  Aligned_cols=53  Identities=32%  Similarity=0.638  Sum_probs=45.9

Q ss_pred             CCCCccccc--CcccccCHHHHHHHHHH-cc------------------CCCceecCcCcccccChHHHHHHHH
Q psy5202         182 SRRNDTCEF--CGKVFKNCSNLTVHRRS-HT------------------GEKPYKCELCSYACAQSSKLTRHMK  234 (343)
Q Consensus       182 ~~~~~~C~~--C~~~f~~~~~l~~H~~~-h~------------------~~~~~~C~~C~~~f~~~~~L~~H~~  234 (343)
                      ++++|+|++  |.++|.....|+.|+.- |.                  ..|||+|++|+++++....|+.|..
T Consensus       346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr~  419 (423)
T COG5189         346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHRK  419 (423)
T ss_pred             cCceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceeccc
Confidence            459999998  99999999999999874 31                  2489999999999999999998864


No 37 
>PF13912 zf-C2H2_6:  C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.70  E-value=2.3e-05  Score=40.71  Aligned_cols=25  Identities=40%  Similarity=0.804  Sum_probs=16.6

Q ss_pred             cccccccccccCChHHHHHHHHhcc
Q psy5202          91 NKECEFCGKKFKYENTLQAHRRNHT  115 (343)
Q Consensus        91 ~~~C~~C~~~f~~~~~L~~H~~~h~  115 (343)
                      +|+|..|++.|.+...|..|++.|.
T Consensus         1 ~~~C~~C~~~F~~~~~l~~H~~~h~   25 (27)
T PF13912_consen    1 PFECDECGKTFSSLSALREHKRSHC   25 (27)
T ss_dssp             SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred             CCCCCccCCccCChhHHHHHhHHhc
Confidence            4667777777777777777766654


No 38 
>KOG2231|consensus
Probab=97.69  E-value=0.0001  Score=68.27  Aligned_cols=103  Identities=22%  Similarity=0.445  Sum_probs=74.2

Q ss_pred             ecCCCCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHHHHHHHHhhhcCCCCcccccCCCCccccc----------
Q psy5202         121 KCTVCDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAAAAAAALNLCLPNSHMKKESRRNDTCEF----------  190 (343)
Q Consensus       121 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~----------  190 (343)
                      .|..| ..|.+...|+.|+...|.                                        .+.|.+          
T Consensus       117 ~~~~c-~~~~s~~~Lk~H~~~~H~----------------------------------------~~~c~lC~~~~kif~~  155 (669)
T KOG2231|consen  117 ECLHC-TEFKSVENLKNHMRDQHK----------------------------------------LHLCSLCLQNLKIFIN  155 (669)
T ss_pred             CCccc-cchhHHHHHHHHHHHhhh----------------------------------------hhccccccccceeeee
Confidence            46666 666688999999976663                                        233333          


Q ss_pred             CcccccCHHHHHHHHHHc-cCCCc----eecCcCcccccChHHHHHHHHHhcCCCCcccccCCCC------CCCCChHHH
Q psy5202         191 CGKVFKNCSNLTVHRRSH-TGEKP----YKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCE------MPFSVPSTL  259 (343)
Q Consensus       191 C~~~f~~~~~l~~H~~~h-~~~~~----~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~------~~f~~~~~l  259 (343)
                      +.+.| ....|..|++.- .+++.    -.|..|...|.....|.+|++.+|      |.|.+|.      ..|.....|
T Consensus       156 e~k~Y-t~~el~~h~~~gd~d~~s~rGhp~C~~C~~~fld~~el~rH~~~~h------~~chfC~~~~~~neyy~~~~dL  228 (669)
T KOG2231|consen  156 ERKLY-TRAELNLHLMFGDPDDESCRGHPLCKFCHERFLDDDELYRHLRFDH------EFCHFCDYKTGQNEYYNDYDDL  228 (669)
T ss_pred             eeehe-hHHHHHHHHhcCCCccccccCCccchhhhhhhccHHHHHHhhccce------eheeecCcccccchhcccchHH
Confidence            33444 566788887652 11221    369999999999999999999876      6677774      568888999


Q ss_pred             HHHHhhhcCccc
Q psy5202         260 EKHMRKCVVNQN  271 (343)
Q Consensus       260 ~~H~~~~~~~~~  271 (343)
                      ..|.|..|..-.
T Consensus       229 e~HfR~~HflCE  240 (669)
T KOG2231|consen  229 EEHFRKGHFLCE  240 (669)
T ss_pred             HHHhhhcCcccc
Confidence            999999886554


No 39 
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=97.48  E-value=0.00012  Score=61.22  Aligned_cols=135  Identities=21%  Similarity=0.438  Sum_probs=90.6

Q ss_pred             cccccc--cccccCChHHHHHHHHhccCCCCeecCCCC---ccCCCH------HHHHHHHHHhccCCCCccccCCCCcch
Q psy5202          91 NKECEF--CGKKFKYENTLQAHRRNHTGEKPFKCTVCD---HACSQS------AKLKKHMKIHDEDGDKEDREEGPSASS  159 (343)
Q Consensus        91 ~~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~---~~f~~~------~~L~~H~~~h~~~~~~~~~~~~~~~~~  159 (343)
                      .|.|+.  |..+......|..|.+..++  .+.|.+|-   +.|...      ..|..|...-..+.       +     
T Consensus       151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~--~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~-------G-----  216 (493)
T COG5236         151 SFKCPKSKCHRRCGSLKELKKHYKAQHG--FVLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEE-------G-----  216 (493)
T ss_pred             HhcCCchhhhhhhhhHHHHHHHHHhhcC--cEEhHhhhcCcccCccceeeeecccccccccCCcccc-------C-----
Confidence            377876  77777778889999887544  37788874   234333      33444332111100       0     


Q ss_pred             HHHHHHHHHhhhcCCCCcccccCCCCcccccCcccccCHHHHHHHHHHccCCCceecCcCcc-------cccChHHHHHH
Q psy5202         160 LAAAAAAAALNLCLPNSHMKKESRRNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSY-------ACAQSSKLTRH  232 (343)
Q Consensus       160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~-------~f~~~~~L~~H  232 (343)
                                            -+..-.|..|...|-+-..|.+|++..+.    +|.+|++       -|..-..|.+|
T Consensus       217 ----------------------FKGHP~C~FC~~~FYdDDEL~~HcR~~HE----~ChICD~v~p~~~QYFK~Y~~Le~H  270 (493)
T COG5236         217 ----------------------FKGHPLCIFCKIYFYDDDELRRHCRLRHE----ACHICDMVGPIRYQYFKSYEDLEAH  270 (493)
T ss_pred             ----------------------cCCCchhhhccceecChHHHHHHHHhhhh----hhhhhhccCccchhhhhCHHHHHHH
Confidence                                  01344699999999999999999997553    4666654       37888899999


Q ss_pred             HHHhcCCCCcccccCC--CC----CCCCChHHHHHHHhhhcCccc
Q psy5202         233 MKTHGRLGKDVYRCRF--CE----MPFSVPSTLEKHMRKCVVNQN  271 (343)
Q Consensus       233 ~~~h~~~~~~~~~C~~--C~----~~f~~~~~l~~H~~~~~~~~~  271 (343)
                      .+.-|      |.|.+  |-    +.|.....|..|+...|+...
T Consensus       271 F~~~h------y~ct~qtc~~~k~~vf~~~~el~~h~~~~h~~~~  309 (493)
T COG5236         271 FRNAH------YCCTFQTCRVGKCYVFPYHTELLEHLTRFHKVNA  309 (493)
T ss_pred             hhcCc------eEEEEEEEecCcEEEeccHHHHHHHHHHHhhccc
Confidence            86543      66654  53    579999999999988886553


No 40 
>KOG1146|consensus
Probab=97.37  E-value=0.00028  Score=69.33  Aligned_cols=160  Identities=20%  Similarity=0.297  Sum_probs=98.7

Q ss_pred             cccccccccCChHHHHHHHH-hccCCCCeecCCCCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHHHHHHHHhhh
Q psy5202          93 ECEFCGKKFKYENTLQAHRR-NHTGEKPFKCTVCDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAAAAAAALNL  171 (343)
Q Consensus        93 ~C~~C~~~f~~~~~L~~H~~-~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  171 (343)
                      .|..|+..+.....+.-|+. .|...+.|+|+.|+..|.....|..||+..|.+........+..             ..
T Consensus       438 e~~~~e~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~-------------~~  504 (1406)
T KOG1146|consen  438 ELTKAEPLLESKRSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQN-------------HP  504 (1406)
T ss_pred             cccchhhhhhhhcccccceeeeecccccccCCccchhhhhHHHhhhcccccccccchhHhHhccc-------------cc
Confidence            46777888887777777765 36667889999999999999999999998765532211000000             00


Q ss_pred             cCCCCcccccCCCCcccccCcccccCHHHHHHHHHH--ccCC--------------------------------------
Q psy5202         172 CLPNSHMKKESRRNDTCEFCGKVFKNCSNLTVHRRS--HTGE--------------------------------------  211 (343)
Q Consensus       172 ~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~--h~~~--------------------------------------  211 (343)
                      .........-+.++|.|..|...+....+|.+|++.  |..+                                      
T Consensus       505 ~~arg~~~~~~~~p~~C~~C~~stttng~LsihlqS~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~p  584 (1406)
T KOG1146|consen  505 RLARGEVYRCPGKPYPCRACNYSTTTNGNLSIHLQSDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGP  584 (1406)
T ss_pred             cccccccccCCCCcccceeeeeeeecchHHHHHHHHHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCC
Confidence            000001112234789999999999999999999874  2211                                      


Q ss_pred             ---CceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChHHHHHHHhhh
Q psy5202         212 ---KPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPSTLEKHMRKC  266 (343)
Q Consensus       212 ---~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~  266 (343)
                         -.|.|.+|++...-...|+-||..-.. -..+.-|-.|+-.+.....+..|.+-+
T Consensus       585 ktkP~~~C~vc~yetniarnlrihmtss~~-s~~p~~~Lq~~it~~l~~~~~~~~~lp  641 (1406)
T KOG1146|consen  585 KTKPSWRCEVCSYETNIARNLRIHMTASPS-SSPPSLVLQQNITSSLASLLGGQGRLP  641 (1406)
T ss_pred             CCCCCcchhhhcchhhhhhccccccccCCC-CCChHHHhhhcchhhccccccCcCCCC
Confidence               125666666666666666666654221 223355555666655555555555444


No 41 
>PF13909 zf-H2C2_5:  C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=97.29  E-value=0.00014  Score=36.48  Aligned_cols=22  Identities=55%  Similarity=1.105  Sum_probs=10.3

Q ss_pred             eecCcCcccccChHHHHHHHHHh
Q psy5202         214 YKCELCSYACAQSSKLTRHMKTH  236 (343)
Q Consensus       214 ~~C~~C~~~f~~~~~L~~H~~~h  236 (343)
                      |+|+.|++... ...|.+|++.|
T Consensus         1 y~C~~C~y~t~-~~~l~~H~~~~   22 (24)
T PF13909_consen    1 YKCPHCSYSTS-KSNLKRHLKRH   22 (24)
T ss_dssp             EE-SSSS-EES-HHHHHHHHHHH
T ss_pred             CCCCCCCCcCC-HHHHHHHHHhh
Confidence            44555555554 55555555544


No 42 
>smart00355 ZnF_C2H2 zinc finger.
Probab=97.26  E-value=0.00025  Score=36.12  Aligned_cols=23  Identities=43%  Similarity=0.800  Sum_probs=17.4

Q ss_pred             cccCCCCCCCCChHHHHHHHhhh
Q psy5202         244 YRCRFCEMPFSVPSTLEKHMRKC  266 (343)
Q Consensus       244 ~~C~~C~~~f~~~~~l~~H~~~~  266 (343)
                      |.|..|++.|.....|..|++.|
T Consensus         1 ~~C~~C~~~f~~~~~l~~H~~~H   23 (26)
T smart00355        1 YRCPECGKVFKSKSALKEHMRTH   23 (26)
T ss_pred             CCCCCCcchhCCHHHHHHHHHHh
Confidence            56778888888888888887754


No 43 
>PF09237 GAGA:  GAGA factor;  InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=97.26  E-value=0.00036  Score=40.92  Aligned_cols=31  Identities=23%  Similarity=0.439  Sum_probs=14.2

Q ss_pred             CCCcccccccccccCChHHHHHHHHhccCCC
Q psy5202          88 EEKNKECEFCGKKFKYENTLQAHRRNHTGEK  118 (343)
Q Consensus        88 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~  118 (343)
                      .+.|-.|++|+..+....+|++|+..+++.+
T Consensus        21 S~~PatCP~C~a~~~~srnLrRHle~~H~~k   51 (54)
T PF09237_consen   21 SEQPATCPICGAVIRQSRNLRRHLEIRHFKK   51 (54)
T ss_dssp             TS--EE-TTT--EESSHHHHHHHHHHHTTTS
T ss_pred             cCCCCCCCcchhhccchhhHHHHHHHHhccc
Confidence            3445556666666666666666655544433


No 44 
>KOG2785|consensus
Probab=97.24  E-value=0.00091  Score=57.15  Aligned_cols=60  Identities=23%  Similarity=0.375  Sum_probs=46.9

Q ss_pred             eecCcCcccccChHHHHHHHHHhcCC------------------C---CcccccCCCC---CCCCChHHHHHHHhhhcCc
Q psy5202         214 YKCELCSYACAQSSKLTRHMKTHGRL------------------G---KDVYRCRFCE---MPFSVPSTLEKHMRKCVVN  269 (343)
Q Consensus       214 ~~C~~C~~~f~~~~~L~~H~~~h~~~------------------~---~~~~~C~~C~---~~f~~~~~l~~H~~~~~~~  269 (343)
                      -.|-.|++.|.....-..||..+|+.                  |   ..-|.|-.|.   +.|.+....+.||+....+
T Consensus       167 t~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~~K~HC  246 (390)
T KOG2785|consen  167 TDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMRDKGHC  246 (390)
T ss_pred             cceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHhhccCc
Confidence            46999999999999999999988731                  0   1127788898   9999999999999865444


Q ss_pred             cccc
Q psy5202         270 QNMK  273 (343)
Q Consensus       270 ~~~~  273 (343)
                      +-..
T Consensus       247 kl~y  250 (390)
T KOG2785|consen  247 KLPY  250 (390)
T ss_pred             ccCC
Confidence            4433


No 45 
>smart00355 ZnF_C2H2 zinc finger.
Probab=97.20  E-value=0.00029  Score=35.86  Aligned_cols=24  Identities=38%  Similarity=0.760  Sum_probs=19.2

Q ss_pred             eecCcCcccccChHHHHHHHHHhc
Q psy5202         214 YKCELCSYACAQSSKLTRHMKTHG  237 (343)
Q Consensus       214 ~~C~~C~~~f~~~~~L~~H~~~h~  237 (343)
                      |+|..|++.|.....|..|++.|.
T Consensus         1 ~~C~~C~~~f~~~~~l~~H~~~H~   24 (26)
T smart00355        1 YRCPECGKVFKSKSALKEHMRTHX   24 (26)
T ss_pred             CCCCCCcchhCCHHHHHHHHHHhc
Confidence            578888888888888888887663


No 46 
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=97.13  E-value=0.00043  Score=63.37  Aligned_cols=145  Identities=24%  Similarity=0.351  Sum_probs=92.6

Q ss_pred             CcccccccccccCChHHHHHHHH--hccCC--CCeecC--CCCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHHH
Q psy5202          90 KNKECEFCGKKFKYENTLQAHRR--NHTGE--KPFKCT--VCDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAA  163 (343)
Q Consensus        90 ~~~~C~~C~~~f~~~~~L~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~  163 (343)
                      .++.|..|...|.....|.+|.+  .|.++  +++.|+  .|++.|.....+.+|...|.+.....+.............
T Consensus       288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  367 (467)
T COG5048         288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLL  367 (467)
T ss_pred             cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCcccccccc
Confidence            47999999999999999999999  89999  999999  7999999999999999999876544332221111100000


Q ss_pred             HHHHHhhhcCCCCcccccCCCCccccc--CcccccCHHHHHHHHHHccCCC--ceecCcCcccccChHHHHHHHHHhc
Q psy5202         164 AAAAALNLCLPNSHMKKESRRNDTCEF--CGKVFKNCSNLTVHRRSHTGEK--PYKCELCSYACAQSSKLTRHMKTHG  237 (343)
Q Consensus       164 ~~~~~~~~~~~~~~~~~~~~~~~~C~~--C~~~f~~~~~l~~H~~~h~~~~--~~~C~~C~~~f~~~~~L~~H~~~h~  237 (343)
                      ...   ................+.|..  |-..+.....+..|...|....  .+.|..|...|.....+..|++.|.
T Consensus       368 ~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  442 (467)
T COG5048         368 NNE---PPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHT  442 (467)
T ss_pred             CCC---CccchhhccCccCCccccccccchhhhhccccccccccccccccCCcCCCCCcchhhccCcccccccccccc
Confidence            000   000000000011112333322  5556666666666666665555  3556778888888888888877774


No 47 
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=97.12  E-value=0.00034  Score=58.54  Aligned_cols=107  Identities=27%  Similarity=0.504  Sum_probs=76.5

Q ss_pred             eecCC--CCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHHHHHHHHhhhcCCCCcccccCCCCcccccCcc----
Q psy5202         120 FKCTV--CDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAAAAAAALNLCLPNSHMKKESRRNDTCEFCGK----  193 (343)
Q Consensus       120 ~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~----  193 (343)
                      |.|+.  |.........|..|.+..|+.                                        +.|.+|-.    
T Consensus       152 F~CP~skc~~~C~~~k~lk~H~K~~H~~----------------------------------------~~C~~C~~nKk~  191 (493)
T COG5236         152 FKCPKSKCHRRCGSLKELKKHYKAQHGF----------------------------------------VLCSECIGNKKD  191 (493)
T ss_pred             hcCCchhhhhhhhhHHHHHHHHHhhcCc----------------------------------------EEhHhhhcCccc
Confidence            88986  777777788999999987743                                        45555532    


Q ss_pred             -----cccCHHHHHHHHHHccCCCce----ecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCC-------CCCChH
Q psy5202         194 -----VFKNCSNLTVHRRSHTGEKPY----KCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEM-------PFSVPS  257 (343)
Q Consensus       194 -----~f~~~~~l~~H~~~h~~~~~~----~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~-------~f~~~~  257 (343)
                           ..-....|..|...-..+.-|    .|..|...|..-..|.+|+|..|   +   +|-+|++       -|....
T Consensus       192 F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~~FYdDDEL~~HcR~~H---E---~ChICD~v~p~~~QYFK~Y~  265 (493)
T COG5236         192 FWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCKIYFYDDDELRRHCRLRH---E---ACHICDMVGPIRYQYFKSYE  265 (493)
T ss_pred             CccceeeeecccccccccCCccccCcCCCchhhhccceecChHHHHHHHHhhh---h---hhhhhhccCccchhhhhCHH
Confidence                 233455677776543323223    59999999999999999999866   3   4777764       489999


Q ss_pred             HHHHHHhhhcCcccc
Q psy5202         258 TLEKHMRKCVVNQNM  272 (343)
Q Consensus       258 ~l~~H~~~~~~~~~~  272 (343)
                      .|..|.+.-|..-.+
T Consensus       266 ~Le~HF~~~hy~ct~  280 (493)
T COG5236         266 DLEAHFRNAHYCCTF  280 (493)
T ss_pred             HHHHHhhcCceEEEE
Confidence            999999876654433


No 48 
>PF09237 GAGA:  GAGA factor;  InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=97.12  E-value=0.00062  Score=39.93  Aligned_cols=26  Identities=31%  Similarity=0.661  Sum_probs=10.2

Q ss_pred             CceecCcCcccccChHHHHHHHHHhc
Q psy5202         212 KPYKCELCSYACAQSSKLTRHMKTHG  237 (343)
Q Consensus       212 ~~~~C~~C~~~f~~~~~L~~H~~~h~  237 (343)
                      .|-.|++|+..+....+|++|+.++|
T Consensus        23 ~PatCP~C~a~~~~srnLrRHle~~H   48 (54)
T PF09237_consen   23 QPATCPICGAVIRQSRNLRRHLEIRH   48 (54)
T ss_dssp             --EE-TTT--EESSHHHHHHHHHHHT
T ss_pred             CCCCCCcchhhccchhhHHHHHHHHh
Confidence            34445555555555555555544433


No 49 
>KOG1146|consensus
Probab=97.12  E-value=7e-05  Score=73.33  Aligned_cols=150  Identities=20%  Similarity=0.309  Sum_probs=96.1

Q ss_pred             CCCCcccccccccccCChHHHHHHHHhcc-------------------------CCCCeecCCCCccCCCHHHHHHHHHH
Q psy5202          87 QEEKNKECEFCGKKFKYENTLQAHRRNHT-------------------------GEKPFKCTVCDHACSQSAKLKKHMKI  141 (343)
Q Consensus        87 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~-------------------------~~~~~~C~~C~~~f~~~~~L~~H~~~  141 (343)
                      .-.+.|+|+.|++.|+....|..|||..+                         +.++|.|..|...+..+.+|.+|++.
T Consensus       461 S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~p~~C~~C~~stttng~LsihlqS  540 (1406)
T KOG1146|consen  461 SFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGKPYPCRACNYSTTTNGNLSIHLQS  540 (1406)
T ss_pred             cccccccCCccchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCCcccceeeeeeeecchHHHHHHHH
Confidence            34578999999999999999999999721                         12679999999999999999999975


Q ss_pred             hccCCCCccccC--CCCcchHHHHHHH--HHhhhcCCCCccc-ccCCCCcccccCcccccCHHHHHHHHHH-ccCCCcee
Q psy5202         142 HDEDGDKEDREE--GPSASSLAAAAAA--AALNLCLPNSHMK-KESRRNDTCEFCGKVFKNCSNLTVHRRS-HTGEKPYK  215 (343)
Q Consensus       142 h~~~~~~~~~~~--~~~~~~~~~~~~~--~~~~~~~~~~~~~-~~~~~~~~C~~C~~~f~~~~~l~~H~~~-h~~~~~~~  215 (343)
                      --.....+..+.  +............  ............. ......|.|.+|+....-..+|+.|+.. ++-..|.-
T Consensus       541 ~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~~~C~vc~yetniarnlrihmtss~~s~~p~~  620 (1406)
T KOG1146|consen  541 DLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPSWRCEVCSYETNIARNLRIHMTASPSSSPPSL  620 (1406)
T ss_pred             HhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCCcchhhhcchhhhhhccccccccCCCCCChHH
Confidence            321111111000  0000000000000  0000001111111 2234679999999999999999999876 44445577


Q ss_pred             cCcCcccccChHHHHHHHHHh
Q psy5202         216 CELCSYACAQSSKLTRHMKTH  236 (343)
Q Consensus       216 C~~C~~~f~~~~~L~~H~~~h  236 (343)
                      |..|+..+.....+..|.+.+
T Consensus       621 ~Lq~~it~~l~~~~~~~~~lp  641 (1406)
T KOG1146|consen  621 VLQQNITSSLASLLGGQGRLP  641 (1406)
T ss_pred             HhhhcchhhccccccCcCCCC
Confidence            888888888887777777766


No 50 
>PF13909 zf-H2C2_5:  C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=97.06  E-value=0.00055  Score=34.23  Aligned_cols=23  Identities=39%  Similarity=0.846  Sum_probs=12.8

Q ss_pred             eecCCCCccCCCHHHHHHHHHHhc
Q psy5202         120 FKCTVCDHACSQSAKLKKHMKIHD  143 (343)
Q Consensus       120 ~~C~~C~~~f~~~~~L~~H~~~h~  143 (343)
                      |+|+.|++... ...|.+|++.+|
T Consensus         1 y~C~~C~y~t~-~~~l~~H~~~~H   23 (24)
T PF13909_consen    1 YKCPHCSYSTS-KSNLKRHLKRHH   23 (24)
T ss_dssp             EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred             CCCCCCCCcCC-HHHHHHHHHhhC
Confidence            45666666665 666666666554


No 51 
>PRK04860 hypothetical protein; Provisional
Probab=96.87  E-value=0.0005  Score=52.64  Aligned_cols=36  Identities=25%  Similarity=0.572  Sum_probs=17.5

Q ss_pred             CcccccCcccccCHHHHHHHHHHccCCCceecCcCccccc
Q psy5202         185 NDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACA  224 (343)
Q Consensus       185 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~  224 (343)
                      +|.|. |+.   ....+.+|.++|.++++|+|..|+..|.
T Consensus       119 ~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~  154 (160)
T PRK04860        119 PYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLV  154 (160)
T ss_pred             EEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeE
Confidence            34554 544   3334445555555555555555554443


No 52 
>PRK04860 hypothetical protein; Provisional
Probab=96.87  E-value=0.00051  Score=52.59  Aligned_cols=40  Identities=33%  Similarity=0.765  Sum_probs=34.7

Q ss_pred             CceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChH
Q psy5202         212 KPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPS  257 (343)
Q Consensus       212 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~  257 (343)
                      -+|.|. |+.   ....+++|.++|.  ++++|.|..|+..|....
T Consensus       118 ~~Y~C~-C~~---~~~~~rrH~ri~~--g~~~YrC~~C~~~l~~~~  157 (160)
T PRK04860        118 FPYRCK-CQE---HQLTVRRHNRVVR--GEAVYRCRRCGETLVFKG  157 (160)
T ss_pred             EEEEcC-CCC---eeCHHHHHHHHhc--CCccEECCCCCceeEEec
Confidence            479998 997   7788999999997  889999999999987543


No 53 
>KOG2785|consensus
Probab=96.86  E-value=0.0041  Score=53.27  Aligned_cols=146  Identities=19%  Similarity=0.345  Sum_probs=92.0

Q ss_pred             cccccccccccCChHHHHHHHHh--cc-----------------------------------CCCCeecCCCCccCCCHH
Q psy5202          91 NKECEFCGKKFKYENTLQAHRRN--HT-----------------------------------GEKPFKCTVCDHACSQSA  133 (343)
Q Consensus        91 ~~~C~~C~~~f~~~~~L~~H~~~--h~-----------------------------------~~~~~~C~~C~~~f~~~~  133 (343)
                      -|.|.-|...|.+...-+.|+++  |.                                   ..-++.|..|.+.|.+..
T Consensus         3 ~ftC~tC~v~F~~ad~Qr~HyKSdWHRYNLKRkVA~lPPItaE~F~~k~~s~~~~~~~~~e~~~~~~~c~~c~k~~~s~~   82 (390)
T KOG2785|consen    3 GFTCNTCNVEFDDADEQRAHYKSDWHRYNLKRKVASLPPITAEEFNEKVLSDDSEKEENLEEAESVVYCEACNKSFASPK   82 (390)
T ss_pred             cceeeceeeeeccHHHHHHHhhhhHHHhhHHhHhhcCCCcCHHHHhHHHhhhhhhhhhhhhhcccceehHHhhccccChh
Confidence            38999999999999888889875  31                                   113589999999999999


Q ss_pred             HHHHHHHHhccCCCCccc---cCCCCcch----------HHH----HHHHHHhhhc---------------CCCCccccc
Q psy5202         134 KLKKHMKIHDEDGDKEDR---EEGPSASS----------LAA----AAAAAALNLC---------------LPNSHMKKE  181 (343)
Q Consensus       134 ~L~~H~~~h~~~~~~~~~---~~~~~~~~----------~~~----~~~~~~~~~~---------------~~~~~~~~~  181 (343)
                      ....|++...........   ..+.....          +..    ..........               +........
T Consensus        83 a~~~hl~Sk~h~~~~~~~~r~~e~d~a~~~q~~~~~p~~l~~~~e~e~~~~E~~~~~d~~~e~~~dd~~Edi~~d~~~e~  162 (390)
T KOG2785|consen   83 AHENHLKSKKHVENLSNHQRSEEGDSAKISQLPSRRPSNLQNKGESELKWYEVDSDEDSSEEEEEDDEEEDIEEDGDDED  162 (390)
T ss_pred             hHHHHHHHhhcchhhhhhhccccccchhhhhccccCccccccCCCcccchhhcccccccchhhccCcchhhhhhccchhc
Confidence            999998754321111000   00000000          000    0000000000               000000111


Q ss_pred             CCCCcccccCcccccCHHHHHHHHHHccCC-----------------------CceecCcCc---ccccChHHHHHHHHH
Q psy5202         182 SRRNDTCEFCGKVFKNCSNLTVHRRSHTGE-----------------------KPYKCELCS---YACAQSSKLTRHMKT  235 (343)
Q Consensus       182 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~-----------------------~~~~C~~C~---~~f~~~~~L~~H~~~  235 (343)
                      ...|--|-+|++.+.....-..||..+||-                       .-|.|-.|.   +.|.+....+.||..
T Consensus       163 e~~Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~~  242 (390)
T KOG2785|consen  163 ELIPTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMRD  242 (390)
T ss_pred             ccCCcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHhh
Confidence            123467999999999999999999998873                       247899999   999999999999975


Q ss_pred             h
Q psy5202         236 H  236 (343)
Q Consensus       236 h  236 (343)
                      -
T Consensus       243 K  243 (390)
T KOG2785|consen  243 K  243 (390)
T ss_pred             c
Confidence            3


No 54 
>PF12874 zf-met:  Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.82  E-value=0.00089  Score=33.81  Aligned_cols=20  Identities=20%  Similarity=0.747  Sum_probs=9.6

Q ss_pred             ecCcCcccccChHHHHHHHH
Q psy5202         215 KCELCSYACAQSSKLTRHMK  234 (343)
Q Consensus       215 ~C~~C~~~f~~~~~L~~H~~  234 (343)
                      .|.+|++.|.+...|+.|++
T Consensus         2 ~C~~C~~~f~s~~~~~~H~~   21 (25)
T PF12874_consen    2 YCDICNKSFSSENSLRQHLR   21 (25)
T ss_dssp             EETTTTEEESSHHHHHHHHT
T ss_pred             CCCCCCCCcCCHHHHHHHHC
Confidence            44444444444444444443


No 55 
>PF12874 zf-met:  Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.82  E-value=0.00056  Score=34.56  Aligned_cols=23  Identities=30%  Similarity=0.778  Sum_probs=21.1

Q ss_pred             cccCCCCCCCCChHHHHHHHhhh
Q psy5202         244 YRCRFCEMPFSVPSTLEKHMRKC  266 (343)
Q Consensus       244 ~~C~~C~~~f~~~~~l~~H~~~~  266 (343)
                      |.|.+|++.|.+...|..|++.+
T Consensus         1 ~~C~~C~~~f~s~~~~~~H~~s~   23 (25)
T PF12874_consen    1 FYCDICNKSFSSENSLRQHLRSK   23 (25)
T ss_dssp             EEETTTTEEESSHHHHHHHHTTH
T ss_pred             CCCCCCCCCcCCHHHHHHHHCcC
Confidence            68999999999999999999854


No 56 
>KOG2482|consensus
Probab=96.62  E-value=0.0094  Score=50.18  Aligned_cols=132  Identities=19%  Similarity=0.296  Sum_probs=85.2

Q ss_pred             CCeecCCCCccC-CCHHHHHHHHHHhccCCCCccccCCCCcchHHHHHHHHHhhhcCCCCcccccCCCCcccccCccccc
Q psy5202         118 KPFKCTVCDHAC-SQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAAAAAAALNLCLPNSHMKKESRRNDTCEFCGKVFK  196 (343)
Q Consensus       118 ~~~~C~~C~~~f-~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~  196 (343)
                      ...+|-+|...+ ..++....|+-..|+-.......  ......-......              .-..+.|-.|.+.|.
T Consensus       143 fslqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDn--iVyvnelLehLke--------------kL~r~~CLyCekifr  206 (423)
T KOG2482|consen  143 FSLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDN--IVYVNELLEHLKE--------------KLERLRCLYCEKIFR  206 (423)
T ss_pred             eeeEEEEecchhcccHHHHHHHHHHHhhhccCCCcc--eeeHHHHHHHHHH--------------HHhhheeeeeccccC
Confidence            346899998766 45667777876655431110000  0000000000000              114678999999999


Q ss_pred             CHHHHHHHHHH--ccCC----------------------------------------------------Cc--eecCcCc
Q psy5202         197 NCSNLTVHRRS--HTGE----------------------------------------------------KP--YKCELCS  220 (343)
Q Consensus       197 ~~~~l~~H~~~--h~~~----------------------------------------------------~~--~~C~~C~  220 (343)
                      ++..|+.||+.  |..-                                                    .+  .+|-.|.
T Consensus       207 dkntLkeHMrkK~HrrinPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~ed~a~a~~v~CLfC~  286 (423)
T KOG2482|consen  207 DKNTLKEHMRKKRHRRINPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNEDDAEALSVVCLFCT  286 (423)
T ss_pred             CcHHHHHHHHhccCcccCCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhcCCCCccceEEEeec
Confidence            99999999985  3210                                                    11  4799999


Q ss_pred             ccccChHHHHHHHHHhcC-------------------------CCCcccccCCCCCCCCChHHHHHHHhh
Q psy5202         221 YACAQSSKLTRHMKTHGR-------------------------LGKDVYRCRFCEMPFSVPSTLEKHMRK  265 (343)
Q Consensus       221 ~~f~~~~~L~~H~~~h~~-------------------------~~~~~~~C~~C~~~f~~~~~l~~H~~~  265 (343)
                      ...-+...|..||+.-|.                         .....-.|-.|...|.....|..||.-
T Consensus       287 ~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~e  356 (423)
T KOG2482|consen  287 NFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMVE  356 (423)
T ss_pred             cchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhccc
Confidence            988889999999987551                         112234688999999999999999964


No 57 
>PF12171 zf-C2H2_jaz:  Zinc-finger double-stranded RNA-binding;  InterPro: IPR022755  This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation.   This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=96.49  E-value=0.00084  Score=34.65  Aligned_cols=22  Identities=32%  Similarity=0.762  Sum_probs=16.0

Q ss_pred             cccCCCCCCCCChHHHHHHHhh
Q psy5202         244 YRCRFCEMPFSVPSTLEKHMRK  265 (343)
Q Consensus       244 ~~C~~C~~~f~~~~~l~~H~~~  265 (343)
                      |.|..|++.|.+...|..|++.
T Consensus         2 ~~C~~C~k~f~~~~~~~~H~~s   23 (27)
T PF12171_consen    2 FYCDACDKYFSSENQLKQHMKS   23 (27)
T ss_dssp             CBBTTTTBBBSSHHHHHCCTTS
T ss_pred             CCcccCCCCcCCHHHHHHHHcc
Confidence            5677777777777777777754


No 58 
>PF13913 zf-C2HC_2:  zinc-finger of a C2HC-type
Probab=96.47  E-value=0.0023  Score=32.20  Aligned_cols=22  Identities=41%  Similarity=0.872  Sum_probs=14.6

Q ss_pred             cccCCCCCCCCChHHHHHHHhhh
Q psy5202         244 YRCRFCEMPFSVPSTLEKHMRKC  266 (343)
Q Consensus       244 ~~C~~C~~~f~~~~~l~~H~~~~  266 (343)
                      ..|+.||+.| ....|.+|++.|
T Consensus         3 ~~C~~CgR~F-~~~~l~~H~~~C   24 (25)
T PF13913_consen    3 VPCPICGRKF-NPDRLEKHEKIC   24 (25)
T ss_pred             CcCCCCCCEE-CHHHHHHHHHhc
Confidence            4577777777 666677776543


No 59 
>PF12171 zf-C2H2_jaz:  Zinc-finger double-stranded RNA-binding;  InterPro: IPR022755  This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation.   This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=96.02  E-value=0.0019  Score=33.33  Aligned_cols=22  Identities=27%  Similarity=0.718  Sum_probs=15.4

Q ss_pred             eecCcCcccccChHHHHHHHHH
Q psy5202         214 YKCELCSYACAQSSKLTRHMKT  235 (343)
Q Consensus       214 ~~C~~C~~~f~~~~~L~~H~~~  235 (343)
                      |.|..|++.|.+...|..|++.
T Consensus         2 ~~C~~C~k~f~~~~~~~~H~~s   23 (27)
T PF12171_consen    2 FYCDACDKYFSSENQLKQHMKS   23 (27)
T ss_dssp             CBBTTTTBBBSSHHHHHCCTTS
T ss_pred             CCcccCCCCcCCHHHHHHHHcc
Confidence            5677777777777777777653


No 60 
>KOG2482|consensus
Probab=95.80  E-value=0.016  Score=48.81  Aligned_cols=54  Identities=33%  Similarity=0.591  Sum_probs=43.1

Q ss_pred             cccccccccccCChHHHHHHHHh--ccCC---------------------------------------------------
Q psy5202          91 NKECEFCGKKFKYENTLQAHRRN--HTGE---------------------------------------------------  117 (343)
Q Consensus        91 ~~~C~~C~~~f~~~~~L~~H~~~--h~~~---------------------------------------------------  117 (343)
                      .+.|-.|.+.|..+..|+.||+.  |...                                                   
T Consensus       195 r~~CLyCekifrdkntLkeHMrkK~HrrinPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~ed~  274 (423)
T KOG2482|consen  195 RLRCLYCEKIFRDKNTLKEHMRKKRHRRINPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNEDD  274 (423)
T ss_pred             hheeeeeccccCCcHHHHHHHHhccCcccCCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhcCC
Confidence            58899999999999999999985  4221                                                   


Q ss_pred             -C--CeecCCCCccCCCHHHHHHHHHHhcc
Q psy5202         118 -K--PFKCTVCDHACSQSAKLKKHMKIHDE  144 (343)
Q Consensus       118 -~--~~~C~~C~~~f~~~~~L~~H~~~h~~  144 (343)
                       .  ...|-+|....-+...|..||+..|.
T Consensus       275 a~a~~v~CLfC~~~~en~~~l~eHmk~vHe  304 (423)
T KOG2482|consen  275 AEALSVVCLFCTNFYENPVFLFEHMKIVHE  304 (423)
T ss_pred             CCccceEEEeeccchhhHHHHHHHHHHHHH
Confidence             1  15888888888888888889887764


No 61 
>KOG4173|consensus
Probab=95.44  E-value=0.01  Score=46.21  Aligned_cols=82  Identities=23%  Similarity=0.498  Sum_probs=63.2

Q ss_pred             CCccccc--CcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhc--------CCCCcccccCC--CCC
Q psy5202         184 RNDTCEF--CGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHG--------RLGKDVYRCRF--CEM  251 (343)
Q Consensus       184 ~~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~--------~~~~~~~~C~~--C~~  251 (343)
                      ..|.|++  |...|.....+..|-..-++   -.|..|.+.|.+...|-.|+.--|        ..|...|.|-+  |+.
T Consensus        78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~---~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~  154 (253)
T KOG4173|consen   78 PAFACQVAGCCQVFDALDDYEHHYHTLHG---NSCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTE  154 (253)
T ss_pred             ccccccccchHHHHhhhhhHHHhhhhccc---chhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhh
Confidence            5677877  77888888777777655443   369999999999999999975432        24677899954  999


Q ss_pred             CCCChHHHHHHHhhhcC
Q psy5202         252 PFSVPSTLEKHMRKCVV  268 (343)
Q Consensus       252 ~f~~~~~l~~H~~~~~~  268 (343)
                      .|.+...-..|+-..|.
T Consensus       155 KFkT~r~RkdH~I~~Hk  171 (253)
T KOG4173|consen  155 KFKTSRDRKDHMIRMHK  171 (253)
T ss_pred             hhhhhhhhhhHHHHhcc
Confidence            99999999999876554


No 62 
>PF13913 zf-C2HC_2:  zinc-finger of a C2HC-type
Probab=95.39  E-value=0.014  Score=29.29  Aligned_cols=19  Identities=26%  Similarity=0.705  Sum_probs=10.1

Q ss_pred             ecCcCcccccChHHHHHHHH
Q psy5202         215 KCELCSYACAQSSKLTRHMK  234 (343)
Q Consensus       215 ~C~~C~~~f~~~~~L~~H~~  234 (343)
                      .|+.||+.| ....|.+|+.
T Consensus         4 ~C~~CgR~F-~~~~l~~H~~   22 (25)
T PF13913_consen    4 PCPICGRKF-NPDRLEKHEK   22 (25)
T ss_pred             cCCCCCCEE-CHHHHHHHHH
Confidence            455555555 4455555544


No 63 
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=94.88  E-value=0.072  Score=37.78  Aligned_cols=109  Identities=22%  Similarity=0.339  Sum_probs=62.7

Q ss_pred             ccccccccccCChHHHHHHHHhccCCCCeecCCCCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHHHHHHHHhhh
Q psy5202          92 KECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAAAAAAALNL  171 (343)
Q Consensus        92 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  171 (343)
                      |.|+.|+-..              =+-|..|+.|+-.......|.|-.  ||--......+. +.               
T Consensus         2 Y~CPrC~skv--------------C~LP~~CpiCgLtLVss~HLARSy--HHLfPl~~f~ev-~~---------------   49 (112)
T TIGR00622         2 YFCPQCRAKV--------------CELPVECPICGLTLILSTHLARSY--HHLFPLKAFQEI-PL---------------   49 (112)
T ss_pred             ccCCCCCCCc--------------cCCCCcCCcCCCEEeccchHHHhh--hccCCCcccccc-cc---------------
Confidence            6777776544              235678999998888877777652  331100000000 00               


Q ss_pred             cCCCCcccccCCCCcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCC
Q psy5202         172 CLPNSHMKKESRRNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCE  250 (343)
Q Consensus       172 ~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~  250 (343)
                              ......-.|--|...|........  ..-.....|+|+.|...|-..-++..|...|.        |+.|.
T Consensus        50 --------~~~~~~~~C~~C~~~f~~~~~~~~--~~~~~~~~y~C~~C~~~FC~dCD~fiHe~Lh~--------CPGC~  110 (112)
T TIGR00622        50 --------EEYNGSRFCFGCQGPFPKPPVSPF--DELKDSHRYVCAVCKNVFCVDCDVFVHESLHC--------CPGCI  110 (112)
T ss_pred             --------cccCCCCcccCcCCCCCCcccccc--cccccccceeCCCCCCccccccchhhhhhccC--------CcCCC
Confidence                    000022348888888876431110  00122346889999999888888888877773        77775


No 64 
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=94.66  E-value=0.027  Score=30.89  Aligned_cols=23  Identities=22%  Similarity=0.732  Sum_probs=19.9

Q ss_pred             ccccCCCCCCCCChHHHHHHHhh
Q psy5202         243 VYRCRFCEMPFSVPSTLEKHMRK  265 (343)
Q Consensus       243 ~~~C~~C~~~f~~~~~l~~H~~~  265 (343)
                      +|.|.+|++.|.+...+..|++.
T Consensus         3 ~~~C~~C~~~~~~~~~~~~H~~g   25 (35)
T smart00451        3 GFYCKLCNVTFTDEISVEAHLKG   25 (35)
T ss_pred             CeEccccCCccCCHHHHHHHHCh
Confidence            57899999999999999999864


No 65 
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer.  Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain.  Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=94.36  E-value=0.02  Score=31.00  Aligned_cols=12  Identities=25%  Similarity=0.927  Sum_probs=6.7

Q ss_pred             cccccCcccccC
Q psy5202         186 DTCEFCGKVFKN  197 (343)
Q Consensus       186 ~~C~~C~~~f~~  197 (343)
                      |+|..||..+..
T Consensus         2 ~~C~~CGy~y~~   13 (33)
T cd00350           2 YVCPVCGYIYDG   13 (33)
T ss_pred             EECCCCCCEECC
Confidence            456666655543


No 66 
>KOG4173|consensus
Probab=94.24  E-value=0.021  Score=44.50  Aligned_cols=26  Identities=27%  Similarity=0.473  Sum_probs=18.7

Q ss_pred             CCCceecC--cCcccccChHHHHHHHHH
Q psy5202         210 GEKPYKCE--LCSYACAQSSKLTRHMKT  235 (343)
Q Consensus       210 ~~~~~~C~--~C~~~f~~~~~L~~H~~~  235 (343)
                      |...|+|-  .|+..|.+...-..|+-.
T Consensus       141 G~dMy~ClvEgCt~KFkT~r~RkdH~I~  168 (253)
T KOG4173|consen  141 GQDMYQCLVEGCTEKFKTSRDRKDHMIR  168 (253)
T ss_pred             CccHHHHHHHhhhhhhhhhhhhhhHHHH
Confidence            45567884  488888888888888643


No 67 
>KOG2893|consensus
Probab=93.84  E-value=0.015  Score=46.42  Aligned_cols=47  Identities=28%  Similarity=0.572  Sum_probs=38.4

Q ss_pred             cccccccccCChHHHHHHHHhccCCCCeecCCCCccCCCHHHHHHHHHHhc
Q psy5202          93 ECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHACSQSAKLKKHMKIHD  143 (343)
Q Consensus        93 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~  143 (343)
                      -|=+|++.|.+..-|..|++.    +.|+|.+|.+...+--.|..|....|
T Consensus        12 wcwycnrefddekiliqhqka----khfkchichkkl~sgpglsihcmqvh   58 (341)
T KOG2893|consen   12 WCWYCNREFDDEKILIQHQKA----KHFKCHICHKKLFSGPGLSIHCMQVH   58 (341)
T ss_pred             eeeecccccchhhhhhhhhhh----ccceeeeehhhhccCCCceeehhhhh
Confidence            488999999999999999876    34999999998887777777754443


No 68 
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=93.76  E-value=0.027  Score=33.66  Aligned_cols=28  Identities=29%  Similarity=0.590  Sum_probs=16.7

Q ss_pred             CCcccccCCCCCCCCChHHHHHHHhhhc
Q psy5202         240 GKDVYRCRFCEMPFSVPSTLEKHMRKCV  267 (343)
Q Consensus       240 ~~~~~~C~~C~~~f~~~~~l~~H~~~~~  267 (343)
                      |+..++|+.||..|....++.+|+.+-|
T Consensus        14 GE~~lrCPRC~~~FR~~K~Y~RHVNKaH   41 (65)
T COG4049          14 GEEFLRCPRCGMVFRRRKDYIRHVNKAH   41 (65)
T ss_pred             CceeeeCCchhHHHHHhHHHHHHhhHHh
Confidence            4555666666666666666666665544


No 69 
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=93.72  E-value=0.053  Score=29.71  Aligned_cols=23  Identities=22%  Similarity=0.598  Sum_probs=18.4

Q ss_pred             ceecCcCcccccChHHHHHHHHH
Q psy5202         213 PYKCELCSYACAQSSKLTRHMKT  235 (343)
Q Consensus       213 ~~~C~~C~~~f~~~~~L~~H~~~  235 (343)
                      +|.|.+|+..|.....+..|++.
T Consensus         3 ~~~C~~C~~~~~~~~~~~~H~~g   25 (35)
T smart00451        3 GFYCKLCNVTFTDEISVEAHLKG   25 (35)
T ss_pred             CeEccccCCccCCHHHHHHHHCh
Confidence            57788888888888888888754


No 70 
>PF09538 FYDLN_acid:  Protein of unknown function (FYDLN_acid);  InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=93.42  E-value=0.048  Score=38.77  Aligned_cols=30  Identities=23%  Similarity=0.424  Sum_probs=24.0

Q ss_pred             eecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCCh
Q psy5202         214 YKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVP  256 (343)
Q Consensus       214 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~  256 (343)
                      ..|+.||.+|...             ++.|..|++||..|.-.
T Consensus        10 R~Cp~CG~kFYDL-------------nk~PivCP~CG~~~~~~   39 (108)
T PF09538_consen   10 RTCPSCGAKFYDL-------------NKDPIVCPKCGTEFPPE   39 (108)
T ss_pred             ccCCCCcchhccC-------------CCCCccCCCCCCccCcc
Confidence            4689999888762             56788899999988766


No 71 
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=93.36  E-value=0.017  Score=52.90  Aligned_cols=54  Identities=35%  Similarity=0.632  Sum_probs=49.6

Q ss_pred             CCcccccCcccccCHHHHHHHHH--HccCC--CceecC--cCcccccChHHHHHHHHHhc
Q psy5202         184 RNDTCEFCGKVFKNCSNLTVHRR--SHTGE--KPYKCE--LCSYACAQSSKLTRHMKTHG  237 (343)
Q Consensus       184 ~~~~C~~C~~~f~~~~~l~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~L~~H~~~h~  237 (343)
                      ..+.|..|...|.....|..|.+  .|.++  +++.|+  .|++.|.+...+..|...|.
T Consensus       288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~  347 (467)
T COG5048         288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHT  347 (467)
T ss_pred             cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCccccc
Confidence            36889999999999999999999  89999  999999  79999999999999988886


No 72 
>PF12013 DUF3505:  Protein of unknown function (DUF3505);  InterPro: IPR022698  This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains. 
Probab=93.28  E-value=0.18  Score=36.15  Aligned_cols=25  Identities=36%  Similarity=0.541  Sum_probs=22.5

Q ss_pred             ccc----CCCCCCCCChHHHHHHHhhhcC
Q psy5202         244 YRC----RFCEMPFSVPSTLEKHMRKCVV  268 (343)
Q Consensus       244 ~~C----~~C~~~f~~~~~l~~H~~~~~~  268 (343)
                      |.|    ..|++.+.+...+..|++.+|+
T Consensus        81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg  109 (109)
T PF12013_consen   81 YRCQCDPPHCGYITRSKKTMRKHWRKEHG  109 (109)
T ss_pred             eeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence            789    9999999999999999998875


No 73 
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=93.20  E-value=0.04  Score=32.97  Aligned_cols=29  Identities=21%  Similarity=0.387  Sum_probs=22.3

Q ss_pred             CCCCCcccccccccccCChHHHHHHHHhc
Q psy5202          86 SQEEKNKECEFCGKKFKYENTLQAHRRNH  114 (343)
Q Consensus        86 ~~~~~~~~C~~C~~~f~~~~~L~~H~~~h  114 (343)
                      -.++.-+.|+.|+..|.....+.+|+..-
T Consensus        12 RDGE~~lrCPRC~~~FR~~K~Y~RHVNKa   40 (65)
T COG4049          12 RDGEEFLRCPRCGMVFRRRKDYIRHVNKA   40 (65)
T ss_pred             cCCceeeeCCchhHHHHHhHHHHHHhhHH
Confidence            34666788888888888888888887643


No 74 
>KOG2893|consensus
Probab=92.98  E-value=0.023  Score=45.34  Aligned_cols=41  Identities=27%  Similarity=0.661  Sum_probs=24.5

Q ss_pred             cccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHH
Q psy5202         188 CEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRH  232 (343)
Q Consensus       188 C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H  232 (343)
                      |-+|++.|.+..-|..|++.    +-|+|.+|.+...+--.|..|
T Consensus        13 cwycnrefddekiliqhqka----khfkchichkkl~sgpglsih   53 (341)
T KOG2893|consen   13 CWYCNREFDDEKILIQHQKA----KHFKCHICHKKLFSGPGLSIH   53 (341)
T ss_pred             eeecccccchhhhhhhhhhh----ccceeeeehhhhccCCCceee
Confidence            66666666666666666554    236666666655555555555


No 75 
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and  believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=92.89  E-value=0.079  Score=28.89  Aligned_cols=11  Identities=27%  Similarity=0.993  Sum_probs=6.8

Q ss_pred             eecCcCccccc
Q psy5202         214 YKCELCSYACA  224 (343)
Q Consensus       214 ~~C~~C~~~f~  224 (343)
                      |+|..||+.+.
T Consensus         3 ~~C~~CG~i~~   13 (34)
T cd00729           3 WVCPVCGYIHE   13 (34)
T ss_pred             EECCCCCCEeE
Confidence            66777775543


No 76 
>PF12013 DUF3505:  Protein of unknown function (DUF3505);  InterPro: IPR022698  This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains. 
Probab=92.69  E-value=0.25  Score=35.45  Aligned_cols=54  Identities=17%  Similarity=0.391  Sum_probs=43.5

Q ss_pred             CcccccccccccCChHHHHHHHHh-ccCC-----------------------------------------CCeec----C
Q psy5202          90 KNKECEFCGKKFKYENTLQAHRRN-HTGE-----------------------------------------KPFKC----T  123 (343)
Q Consensus        90 ~~~~C~~C~~~f~~~~~L~~H~~~-h~~~-----------------------------------------~~~~C----~  123 (343)
                      +-..|..|+....- ..+..|++. |+..                                         .-|.|    .
T Consensus        10 ~vlIC~~C~~av~~-~~v~~HL~~~H~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~p~~~~~Pi~gLp~~~G~~C~~~~~   88 (109)
T PF12013_consen   10 RVLICRQCQYAVQP-SEVESHLRKRHHILKSQERQRIVEAIRQWPDLLPDPDDLQIPPDPSPPIPGLPVYDGYRCQCDPP   88 (109)
T ss_pred             CEEEeCCCCcccCc-hHHHHHHHHhcccccHHHHHHHHHHHHhhhhcccCccccCCCCCCCCcCCCCCCCCCeeeecCCC
Confidence            45689999988775 888999884 4221                                         12899    9


Q ss_pred             CCCccCCCHHHHHHHHHHhcc
Q psy5202         124 VCDHACSQSAKLKKHMKIHDE  144 (343)
Q Consensus       124 ~C~~~f~~~~~L~~H~~~h~~  144 (343)
                      .|++.+.+...|.+|++.+|+
T Consensus        89 ~C~y~~~~~~~m~~H~~~~Hg  109 (109)
T PF12013_consen   89 HCGYITRSKKTMRKHWRKEHG  109 (109)
T ss_pred             CCCcEeccHHHHHHHHHHhcC
Confidence            999999999999999998875


No 77 
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=90.80  E-value=0.51  Score=33.55  Aligned_cols=77  Identities=18%  Similarity=0.246  Sum_probs=52.1

Q ss_pred             CCcccccCcccccCHHHHHHHHHHcc-------CCC-------ceecCcCcccccChHHHHHHHHHhcCCCCcccccCCC
Q psy5202         184 RNDTCEFCGKVFKNCSNLTVHRRSHT-------GEK-------PYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFC  249 (343)
Q Consensus       184 ~~~~C~~C~~~f~~~~~l~~H~~~h~-------~~~-------~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C  249 (343)
                      -+..|+.||.+......|.+--  |+       .+.       ...|--|...|........=  .  ......|+|+.|
T Consensus        14 LP~~CpiCgLtLVss~HLARSy--HHLfPl~~f~ev~~~~~~~~~~C~~C~~~f~~~~~~~~~--~--~~~~~~y~C~~C   87 (112)
T TIGR00622        14 LPVECPICGLTLILSTHLARSY--HHLFPLKAFQEIPLEEYNGSRFCFGCQGPFPKPPVSPFD--E--LKDSHRYVCAVC   87 (112)
T ss_pred             CCCcCCcCCCEEeccchHHHhh--hccCCCcccccccccccCCCCcccCcCCCCCCccccccc--c--cccccceeCCCC
Confidence            5678999999998887776542  22       011       12399999999764311100  0  013457999999


Q ss_pred             CCCCCChHHHHHHHhhh
Q psy5202         250 EMPFSVPSTLEKHMRKC  266 (343)
Q Consensus       250 ~~~f~~~~~l~~H~~~~  266 (343)
                      ...|--.-++..|..-|
T Consensus        88 ~~~FC~dCD~fiHe~Lh  104 (112)
T TIGR00622        88 KNVFCVDCDVFVHESLH  104 (112)
T ss_pred             CCccccccchhhhhhcc
Confidence            99999999999997654


No 78 
>PF09538 FYDLN_acid:  Protein of unknown function (FYDLN_acid);  InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=90.78  E-value=0.18  Score=35.92  Aligned_cols=32  Identities=25%  Similarity=0.608  Sum_probs=26.1

Q ss_pred             CCcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccCh
Q psy5202         184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQS  226 (343)
Q Consensus       184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~  226 (343)
                      ....|+.||+.|-..           +..|-.|++||..|.-.
T Consensus         8 tKR~Cp~CG~kFYDL-----------nk~PivCP~CG~~~~~~   39 (108)
T PF09538_consen    8 TKRTCPSCGAKFYDL-----------NKDPIVCPKCGTEFPPE   39 (108)
T ss_pred             CcccCCCCcchhccC-----------CCCCccCCCCCCccCcc
Confidence            456899999999653           34788999999999766


No 79 
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=90.37  E-value=0.18  Score=36.44  Aligned_cols=33  Identities=9%  Similarity=0.053  Sum_probs=24.0

Q ss_pred             eecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChHHH
Q psy5202         214 YKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPSTL  259 (343)
Q Consensus       214 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~~l  259 (343)
                      ..|+.||++|...             ++.|..|++||..|.-...+
T Consensus        10 r~Cp~cg~kFYDL-------------nk~p~vcP~cg~~~~~~~~~   42 (129)
T TIGR02300        10 RICPNTGSKFYDL-------------NRRPAVSPYTGEQFPPEEAL   42 (129)
T ss_pred             ccCCCcCcccccc-------------CCCCccCCCcCCccCcchhh
Confidence            5688888888652             56788899999887655333


No 80 
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=90.36  E-value=0.18  Score=38.71  Aligned_cols=23  Identities=26%  Similarity=0.807  Sum_probs=17.6

Q ss_pred             ceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCC
Q psy5202         213 PYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCE  250 (343)
Q Consensus       213 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~  250 (343)
                      .|.|++||+.+.               |+.|-+||+||
T Consensus       134 ~~vC~vCGy~~~---------------ge~P~~CPiCg  156 (166)
T COG1592         134 VWVCPVCGYTHE---------------GEAPEVCPICG  156 (166)
T ss_pred             EEEcCCCCCccc---------------CCCCCcCCCCC
Confidence            588888886542               57788888888


No 81 
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=90.29  E-value=0.13  Score=28.80  Aligned_cols=14  Identities=14%  Similarity=0.387  Sum_probs=7.1

Q ss_pred             eecCcCcccccChH
Q psy5202         214 YKCELCSYACAQSS  227 (343)
Q Consensus       214 ~~C~~C~~~f~~~~  227 (343)
                      +.|+.|+..|.-..
T Consensus         3 ~~CP~C~~~~~v~~   16 (38)
T TIGR02098         3 IQCPNCKTSFRVVD   16 (38)
T ss_pred             EECCCCCCEEEeCH
Confidence            34555555554443


No 82 
>PF09986 DUF2225:  Uncharacterized protein conserved in bacteria (DUF2225);  InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=90.17  E-value=0.067  Score=43.58  Aligned_cols=45  Identities=16%  Similarity=0.348  Sum_probs=24.4

Q ss_pred             CceecCcCcccccChHHHHHHHHHhc-C-------CCCc-----ccccCCCCCCCCCh
Q psy5202         212 KPYKCELCSYACAQSSKLTRHMKTHG-R-------LGKD-----VYRCRFCEMPFSVP  256 (343)
Q Consensus       212 ~~~~C~~C~~~f~~~~~L~~H~~~h~-~-------~~~~-----~~~C~~C~~~f~~~  256 (343)
                      +.+.||+|+..|....-+....+.-. .       .+..     +..|+.||++|...
T Consensus         4 k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~~   61 (214)
T PF09986_consen    4 KKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFEE   61 (214)
T ss_pred             CceECCCCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCccccc
Confidence            44566666666665544444433211 0       0122     25799999987654


No 83 
>KOG2807|consensus
Probab=89.98  E-value=0.65  Score=39.32  Aligned_cols=31  Identities=29%  Similarity=0.623  Sum_probs=21.6

Q ss_pred             CceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCC
Q psy5202         212 KPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCE  250 (343)
Q Consensus       212 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~  250 (343)
                      ..|+|..|...|-.--+...|...|.        |+.|.
T Consensus       344 ~~y~C~~Ck~~FCldCDv~iHesLh~--------CpgCe  374 (378)
T KOG2807|consen  344 GRYRCESCKNVFCLDCDVFIHESLHN--------CPGCE  374 (378)
T ss_pred             CcEEchhccceeeccchHHHHhhhhc--------CCCcC
Confidence            45778777777777777777766663        66665


No 84 
>PF02892 zf-BED:  BED zinc finger;  InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=89.96  E-value=0.18  Score=29.45  Aligned_cols=25  Identities=28%  Similarity=0.733  Sum_probs=12.0

Q ss_pred             CcccccCCCCCCCCCh----HHHHHHHhh
Q psy5202         241 KDVYRCRFCEMPFSVP----STLEKHMRK  265 (343)
Q Consensus       241 ~~~~~C~~C~~~f~~~----~~l~~H~~~  265 (343)
                      ....+|.+|++.+...    ..|.+|++.
T Consensus        14 ~~~a~C~~C~~~~~~~~~~ts~l~~HL~~   42 (45)
T PF02892_consen   14 KKKAKCKYCGKVIKYSSGGTSNLKRHLKK   42 (45)
T ss_dssp             SS-EEETTTTEE-----SSTHHHHHHHHH
T ss_pred             cCeEEeCCCCeEEeeCCCcHHHHHHhhhh
Confidence            3445666666655543    566666643


No 85 
>PF09986 DUF2225:  Uncharacterized protein conserved in bacteria (DUF2225);  InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=89.19  E-value=0.064  Score=43.69  Aligned_cols=44  Identities=27%  Similarity=0.557  Sum_probs=30.0

Q ss_pred             CCcccccccccccCChHHHHHHHHhc---c-------CCC-----CeecCCCCccCCCH
Q psy5202          89 EKNKECEFCGKKFKYENTLQAHRRNH---T-------GEK-----PFKCTVCDHACSQS  132 (343)
Q Consensus        89 ~~~~~C~~C~~~f~~~~~L~~H~~~h---~-------~~~-----~~~C~~C~~~f~~~  132 (343)
                      ++.+.|++|+..|.+..-.....+.-   .       +..     ...|+.||+.|...
T Consensus         3 ~k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~~   61 (214)
T PF09986_consen    3 DKKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFEE   61 (214)
T ss_pred             CCceECCCCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCccccc
Confidence            46789999999999876555444321   1       112     26899999987644


No 86 
>PF13717 zinc_ribbon_4:  zinc-ribbon domain
Probab=88.81  E-value=0.29  Score=27.07  Aligned_cols=14  Identities=14%  Similarity=0.406  Sum_probs=6.8

Q ss_pred             ecCcCcccccChHH
Q psy5202         215 KCELCSYACAQSSK  228 (343)
Q Consensus       215 ~C~~C~~~f~~~~~  228 (343)
                      .|+.|+..|.-...
T Consensus         4 ~Cp~C~~~y~i~d~   17 (36)
T PF13717_consen    4 TCPNCQAKYEIDDE   17 (36)
T ss_pred             ECCCCCCEEeCCHH
Confidence            45555555544433


No 87 
>PF13719 zinc_ribbon_5:  zinc-ribbon domain
Probab=88.74  E-value=0.28  Score=27.28  Aligned_cols=11  Identities=36%  Similarity=0.721  Sum_probs=5.4

Q ss_pred             ccccCCCCCCC
Q psy5202         243 VYRCRFCEMPF  253 (343)
Q Consensus       243 ~~~C~~C~~~f  253 (343)
                      ..+|+.|+..|
T Consensus        25 ~vrC~~C~~~f   35 (37)
T PF13719_consen   25 KVRCPKCGHVF   35 (37)
T ss_pred             EEECCCCCcEe
Confidence            44555555444


No 88 
>PHA00626 hypothetical protein
Probab=88.49  E-value=0.18  Score=30.36  Aligned_cols=14  Identities=36%  Similarity=0.429  Sum_probs=8.8

Q ss_pred             cccccCCCCCCCCC
Q psy5202         242 DVYRCRFCEMPFSV  255 (343)
Q Consensus       242 ~~~~C~~C~~~f~~  255 (343)
                      ..|.|+.||+.|+.
T Consensus        22 nrYkCkdCGY~ft~   35 (59)
T PHA00626         22 DDYVCCDCGYNDSK   35 (59)
T ss_pred             cceEcCCCCCeech
Confidence            45677777766643


No 89 
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=87.92  E-value=0.54  Score=28.96  Aligned_cols=10  Identities=30%  Similarity=0.919  Sum_probs=6.2

Q ss_pred             cccccCCCCC
Q psy5202         242 DVYRCRFCEM  251 (343)
Q Consensus       242 ~~~~C~~C~~  251 (343)
                      .+|.|+.||.
T Consensus        47 ~~Y~CP~CGF   56 (59)
T PRK14890         47 NPYTCPKCGF   56 (59)
T ss_pred             CceECCCCCC
Confidence            4566666664


No 90 
>KOG2186|consensus
Probab=87.47  E-value=0.39  Score=39.08  Aligned_cols=47  Identities=21%  Similarity=0.398  Sum_probs=37.9

Q ss_pred             CcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHH
Q psy5202         185 NDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMK  234 (343)
Q Consensus       185 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~  234 (343)
                      .|.|..||....-. .+.+|+-..++ ..|.|--|++.|.. ..+..|..
T Consensus         3 ~FtCnvCgEsvKKp-~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~k   49 (276)
T KOG2186|consen    3 FFTCNVCGESVKKP-QVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTK   49 (276)
T ss_pred             EEehhhhhhhcccc-chHHHHHhccC-CeeEEeeccccccc-chhhhhhh
Confidence            48899999887654 46779888777 66999999999988 77888854


No 91 
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=87.02  E-value=0.29  Score=28.78  Aligned_cols=11  Identities=36%  Similarity=0.920  Sum_probs=5.9

Q ss_pred             eecCcCccccc
Q psy5202         214 YKCELCSYACA  224 (343)
Q Consensus       214 ~~C~~C~~~f~  224 (343)
                      |+|+.||..|.
T Consensus         4 y~C~~CG~~~~   14 (46)
T PRK00398          4 YKCARCGREVE   14 (46)
T ss_pred             EECCCCCCEEE
Confidence            55555555544


No 92 
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=86.95  E-value=0.73  Score=28.29  Aligned_cols=10  Identities=40%  Similarity=1.082  Sum_probs=5.8

Q ss_pred             cccccCCCCC
Q psy5202         242 DVYRCRFCEM  251 (343)
Q Consensus       242 ~~~~C~~C~~  251 (343)
                      .+|+|+.||.
T Consensus        49 ~~Y~Cp~CGF   58 (61)
T COG2888          49 NPYRCPKCGF   58 (61)
T ss_pred             CceECCCcCc
Confidence            4566666654


No 93 
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=86.85  E-value=0.74  Score=23.29  Aligned_cols=21  Identities=19%  Similarity=0.618  Sum_probs=14.6

Q ss_pred             ccCCCCCCCCChHHHHHHHhhh
Q psy5202         245 RCRFCEMPFSVPSTLEKHMRKC  266 (343)
Q Consensus       245 ~C~~C~~~f~~~~~l~~H~~~~  266 (343)
                      .|++|++.+ ....+..|+..+
T Consensus         3 ~CPiC~~~v-~~~~in~HLD~C   23 (26)
T smart00734        3 QCPVCFREV-PENLINSHLDSC   23 (26)
T ss_pred             cCCCCcCcc-cHHHHHHHHHHh
Confidence            577888777 556777777644


No 94 
>KOG2186|consensus
Probab=86.76  E-value=0.44  Score=38.80  Aligned_cols=51  Identities=24%  Similarity=0.480  Sum_probs=42.1

Q ss_pred             ccccccccccCChHHHHHHHHhccCCCCeecCCCCccCCCHHHHHHHHHHhccC
Q psy5202          92 KECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHACSQSAKLKKHMKIHDED  145 (343)
Q Consensus        92 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~  145 (343)
                      |.|..||..... ..+.+|+...++ .-|.|--|++.|.. ..+..|..-.+..
T Consensus         4 FtCnvCgEsvKK-p~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~kCITEa   54 (276)
T KOG2186|consen    4 FTCNVCGESVKK-PQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTKCITEA   54 (276)
T ss_pred             Eehhhhhhhccc-cchHHHHHhccC-CeeEEeeccccccc-chhhhhhhhcchH
Confidence            789999999874 557889988777 67999999999998 7888898765543


No 95 
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=86.76  E-value=0.6  Score=39.16  Aligned_cols=55  Identities=25%  Similarity=0.464  Sum_probs=35.0

Q ss_pred             cccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCC
Q psy5202         186 DTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCE  250 (343)
Q Consensus       186 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~  250 (343)
                      -.|-.|.-.|.....-..-  .-+....|+|+.|...|-.--+...|...|.        |..|.
T Consensus       363 ~~Cf~CQ~~fp~~~~~~~~--~~~ss~rY~Ce~CK~~FC~dCdvfiHe~Lh~--------C~gCe  417 (421)
T COG5151         363 THCFVCQGPFPKPPVSPFD--ESTSSGRYQCELCKSTFCSDCDVFIHETLHF--------CIGCE  417 (421)
T ss_pred             ccceeccCCCCCCCCCccc--ccccccceechhhhhhhhhhhHHHHHHHHhh--------CCCCc
Confidence            3477777777654221000  1122346899999999988888888877773        66665


No 96 
>PF03604 DNA_RNApol_7kD:  DNA directed RNA polymerase, 7 kDa subunit;  InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=86.51  E-value=0.59  Score=25.00  Aligned_cols=11  Identities=27%  Similarity=0.706  Sum_probs=5.8

Q ss_pred             eecCcCccccc
Q psy5202         214 YKCELCSYACA  224 (343)
Q Consensus       214 ~~C~~C~~~f~  224 (343)
                      |.|..|+..+.
T Consensus         1 Y~C~~Cg~~~~   11 (32)
T PF03604_consen    1 YICGECGAEVE   11 (32)
T ss_dssp             EBESSSSSSE-
T ss_pred             CCCCcCCCeeE
Confidence            45666665554


No 97 
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=86.29  E-value=0.4  Score=28.37  Aligned_cols=11  Identities=27%  Similarity=0.773  Sum_probs=5.8

Q ss_pred             CcccccCcccc
Q psy5202         185 NDTCEFCGKVF  195 (343)
Q Consensus       185 ~~~C~~C~~~f  195 (343)
                      .|.|..||+.|
T Consensus         6 ~Y~C~~Cg~~~   16 (49)
T COG1996           6 EYKCARCGREV   16 (49)
T ss_pred             EEEhhhcCCee
Confidence            35555555555


No 98 
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=86.27  E-value=0.68  Score=44.75  Aligned_cols=40  Identities=20%  Similarity=0.528  Sum_probs=28.7

Q ss_pred             cccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCC
Q psy5202         186 DTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMP  252 (343)
Q Consensus       186 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~  252 (343)
                      +.|+.|+..+.          .|......+|.+||+.                 ...|..|+.||-.
T Consensus       445 ~~Cp~Cd~~lt----------~H~~~~~L~CH~Cg~~-----------------~~~p~~Cp~Cgs~  484 (730)
T COG1198         445 AECPNCDSPLT----------LHKATGQLRCHYCGYQ-----------------EPIPQSCPECGSE  484 (730)
T ss_pred             ccCCCCCcceE----------EecCCCeeEeCCCCCC-----------------CCCCCCCCCCCCC
Confidence            56888876643          3555577899999932                 3578899999965


No 99 
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=86.15  E-value=0.95  Score=34.89  Aligned_cols=34  Identities=12%  Similarity=0.231  Sum_probs=25.6

Q ss_pred             cCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCC
Q psy5202         209 TGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPF  253 (343)
Q Consensus       209 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f  253 (343)
                      .+..-|.|+.|+..|+....+.           .-|.|+.||...
T Consensus       105 ~~~~~Y~Cp~c~~r~tf~eA~~-----------~~F~Cp~Cg~~L  138 (158)
T TIGR00373       105 TNNMFFICPNMCVRFTFNEAME-----------LNFTCPRCGAML  138 (158)
T ss_pred             cCCCeEECCCCCcEeeHHHHHH-----------cCCcCCCCCCEe
Confidence            3456688999999888887774           138899999653


No 100
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=85.76  E-value=0.55  Score=33.98  Aligned_cols=35  Identities=14%  Similarity=0.238  Sum_probs=27.0

Q ss_pred             CCcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHH
Q psy5202         184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKL  229 (343)
Q Consensus       184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L  229 (343)
                      ....|+.||++|...           +..|..|++||..|.....+
T Consensus         8 tKr~Cp~cg~kFYDL-----------nk~p~vcP~cg~~~~~~~~~   42 (129)
T TIGR02300         8 TKRICPNTGSKFYDL-----------NRRPAVSPYTGEQFPPEEAL   42 (129)
T ss_pred             ccccCCCcCcccccc-----------CCCCccCCCcCCccCcchhh
Confidence            457899999999653           45789999999998665333


No 101
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=85.39  E-value=0.4  Score=32.01  Aligned_cols=13  Identities=31%  Similarity=1.035  Sum_probs=6.9

Q ss_pred             cccccCCCCCCCC
Q psy5202         242 DVYRCRFCEMPFS  254 (343)
Q Consensus       242 ~~~~C~~C~~~f~  254 (343)
                      ..+.|..||..|.
T Consensus        52 GIW~C~kCg~~fA   64 (89)
T COG1997          52 GIWKCRKCGAKFA   64 (89)
T ss_pred             CeEEcCCCCCeec
Confidence            4455555555554


No 102
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=85.17  E-value=1.1  Score=34.07  Aligned_cols=38  Identities=16%  Similarity=0.300  Sum_probs=24.1

Q ss_pred             CCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCC
Q psy5202         210 GEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPF  253 (343)
Q Consensus       210 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f  253 (343)
                      +...|.|+.|+..|.....+..    ..  ....|.|+.||...
T Consensus        96 ~~~~Y~Cp~C~~~y~~~ea~~~----~d--~~~~f~Cp~Cg~~l  133 (147)
T smart00531       96 NNAYYKCPNCQSKYTFLEANQL----LD--MDGTFTCPRCGEEL  133 (147)
T ss_pred             CCcEEECcCCCCEeeHHHHHHh----cC--CCCcEECCCCCCEE
Confidence            3456889999988886544432    01  12338899998754


No 103
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=85.12  E-value=0.16  Score=38.69  Aligned_cols=44  Identities=16%  Similarity=0.355  Sum_probs=24.4

Q ss_pred             ccccccccccCCh---HHHHHHHHhccCCCCeecCCCCccCCCHHHHHHH
Q psy5202          92 KECEFCGKKFKYE---NTLQAHRRNHTGEKPFKCTVCDHACSQSAKLKKH  138 (343)
Q Consensus        92 ~~C~~C~~~f~~~---~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H  138 (343)
                      ++|++||..+...   ..+..  ..+ -.+.++|+.||+.|.+...+..-
T Consensus         1 m~cp~c~~~~~~~~~s~~~~~--~~~-~~~~~~c~~c~~~f~~~e~~~~~   47 (154)
T PRK00464          1 MRCPFCGHPDTRVIDSRPAED--GNA-IRRRRECLACGKRFTTFERVELV   47 (154)
T ss_pred             CcCCCCCCCCCEeEeccccCC--CCc-eeeeeeccccCCcceEeEeccCc
Confidence            4799999665211   11111  001 11238899999999876654433


No 104
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=84.83  E-value=0.54  Score=36.20  Aligned_cols=25  Identities=32%  Similarity=0.783  Sum_probs=20.6

Q ss_pred             CCcccccCcccccCHHHHHHHHHHccCCCceecCcCcc
Q psy5202         184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSY  221 (343)
Q Consensus       184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~  221 (343)
                      +.|+|++||..+             -|+.|-+||+||-
T Consensus       133 ~~~vC~vCGy~~-------------~ge~P~~CPiCga  157 (166)
T COG1592         133 KVWVCPVCGYTH-------------EGEAPEVCPICGA  157 (166)
T ss_pred             CEEEcCCCCCcc-------------cCCCCCcCCCCCC
Confidence            379999999764             4578999999993


No 105
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=84.77  E-value=0.61  Score=27.10  Aligned_cols=27  Identities=19%  Similarity=0.508  Sum_probs=20.8

Q ss_pred             cccccccccccCChHHHHHHHHhccCCCCeecCCCCcc
Q psy5202          91 NKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHA  128 (343)
Q Consensus        91 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~  128 (343)
                      .|.|..||..|...           ...+..|+.||..
T Consensus         2 ~Y~C~~Cg~~~~~~-----------~~~~irC~~CG~r   28 (44)
T smart00659        2 IYICGECGRENEIK-----------SKDVVRCRECGYR   28 (44)
T ss_pred             EEECCCCCCEeecC-----------CCCceECCCCCce
Confidence            48999999988743           3456899999864


No 106
>PRK04023 DNA polymerase II large subunit; Validated
Probab=84.58  E-value=1.1  Score=44.21  Aligned_cols=13  Identities=15%  Similarity=0.225  Sum_probs=8.6

Q ss_pred             ccccCCCCCCCCC
Q psy5202         243 VYRCRFCEMPFSV  255 (343)
Q Consensus       243 ~~~C~~C~~~f~~  255 (343)
                      ++.|+.||..-..
T Consensus       663 ~y~CPKCG~El~~  675 (1121)
T PRK04023        663 EDECEKCGREPTP  675 (1121)
T ss_pred             CCcCCCCCCCCCc
Confidence            4678888865443


No 107
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=84.36  E-value=0.7  Score=27.73  Aligned_cols=21  Identities=33%  Similarity=0.863  Sum_probs=11.8

Q ss_pred             cccCCCCCCCCCh-----HHHHHHHh
Q psy5202         244 YRCRFCEMPFSVP-----STLEKHMR  264 (343)
Q Consensus       244 ~~C~~C~~~f~~~-----~~l~~H~~  264 (343)
                      -.|.+|++.+...     +.|.+|++
T Consensus        19 a~C~~C~~~l~~~~~~gTs~L~rHl~   44 (50)
T smart00614       19 AKCKYCGKKLSRSSKGGTSNLRRHLR   44 (50)
T ss_pred             EEecCCCCEeeeCCCCCcHHHHHHHH
Confidence            4466666555443     45666665


No 108
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=84.14  E-value=0.18  Score=38.45  Aligned_cols=15  Identities=20%  Similarity=0.416  Sum_probs=8.0

Q ss_pred             cccCCCCCCCCChHH
Q psy5202         244 YRCRFCEMPFSVPST  258 (343)
Q Consensus       244 ~~C~~C~~~f~~~~~  258 (343)
                      ++|+.||++|.+...
T Consensus        29 ~~c~~c~~~f~~~e~   43 (154)
T PRK00464         29 RECLACGKRFTTFER   43 (154)
T ss_pred             eeccccCCcceEeEe
Confidence            555555555555443


No 109
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=83.85  E-value=1.2  Score=35.09  Aligned_cols=35  Identities=14%  Similarity=0.329  Sum_probs=26.3

Q ss_pred             CCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCC
Q psy5202         210 GEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSV  255 (343)
Q Consensus       210 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~  255 (343)
                      ...-|.|+.|+..|+....+.           .-|.|+.||.....
T Consensus       114 ~~~~Y~Cp~C~~rytf~eA~~-----------~~F~Cp~Cg~~L~~  148 (178)
T PRK06266        114 NNMFFFCPNCHIRFTFDEAME-----------YGFRCPQCGEMLEE  148 (178)
T ss_pred             CCCEEECCCCCcEEeHHHHhh-----------cCCcCCCCCCCCee
Confidence            345699999999998887752           24899999976543


No 110
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=82.98  E-value=0.27  Score=29.79  Aligned_cols=29  Identities=21%  Similarity=0.627  Sum_probs=20.3

Q ss_pred             ccccccccccCChHHHHHHHHhccCCCCeecCCCCc
Q psy5202          92 KECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDH  127 (343)
Q Consensus        92 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~  127 (343)
                      |+|..|+..|.....+.       ......|+.|+.
T Consensus         6 y~C~~Cg~~fe~~~~~~-------~~~~~~CP~Cg~   34 (52)
T TIGR02605         6 YRCTACGHRFEVLQKMS-------DDPLATCPECGG   34 (52)
T ss_pred             EEeCCCCCEeEEEEecC-------CCCCCCCCCCCC
Confidence            89999999997543221       134567999986


No 111
>PF15269 zf-C2H2_7:  Zinc-finger
Probab=82.40  E-value=1  Score=25.74  Aligned_cols=22  Identities=45%  Similarity=0.970  Sum_probs=16.7

Q ss_pred             eecCcCcccccChHHHHHHHHH
Q psy5202         214 YKCELCSYACAQSSKLTRHMKT  235 (343)
Q Consensus       214 ~~C~~C~~~f~~~~~L~~H~~~  235 (343)
                      |+|-.|..+..-++.|..||+.
T Consensus        21 ykcfqcpftc~~kshl~nhmky   42 (54)
T PF15269_consen   21 YKCFQCPFTCNEKSHLFNHMKY   42 (54)
T ss_pred             ceeecCCcccchHHHHHHHHHH
Confidence            6777777777777777777764


No 112
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=82.17  E-value=1.3  Score=33.67  Aligned_cols=38  Identities=18%  Similarity=0.449  Sum_probs=25.5

Q ss_pred             CCCcccccccccccCChHHHHHHHHhccCCCCeecCCCCccC
Q psy5202          88 EEKNKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHAC  129 (343)
Q Consensus        88 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f  129 (343)
                      ...-|.|+.|+..|.....+..   .. ....|.|+.||...
T Consensus        96 ~~~~Y~Cp~C~~~y~~~ea~~~---~d-~~~~f~Cp~Cg~~l  133 (147)
T smart00531       96 NNAYYKCPNCQSKYTFLEANQL---LD-MDGTFTCPRCGEEL  133 (147)
T ss_pred             CCcEEECcCCCCEeeHHHHHHh---cC-CCCcEECCCCCCEE
Confidence            3457999999999985443322   11 23449999999764


No 113
>KOG4167|consensus
Probab=81.62  E-value=3.8  Score=38.95  Aligned_cols=29  Identities=28%  Similarity=0.410  Sum_probs=24.8

Q ss_pred             CCCCcccccccccccCChHHHHHHHHhcc
Q psy5202          87 QEEKNKECEFCGKKFKYENTLQAHRRNHT  115 (343)
Q Consensus        87 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~  115 (343)
                      ....-|.|..|+++|....++..||++|.
T Consensus       788 ~~~giFpCreC~kvF~KiKSrNAHMK~Hr  816 (907)
T KOG4167|consen  788 DPTGIFPCRECGKVFFKIKSRNAHMKTHR  816 (907)
T ss_pred             CCCceeehHHHHHHHHHHhhhhHHHHHHH
Confidence            34567999999999999999999999984


No 114
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=81.42  E-value=0.75  Score=36.23  Aligned_cols=32  Identities=16%  Similarity=0.275  Sum_probs=26.2

Q ss_pred             CCcccccCcccccCHHHHHHHHHHccCCCceecCcCccccc
Q psy5202         184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACA  224 (343)
Q Consensus       184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~  224 (343)
                      ..|.|+.|+..|..-..+.         .-|.|+.||....
T Consensus       116 ~~Y~Cp~C~~rytf~eA~~---------~~F~Cp~Cg~~L~  147 (178)
T PRK06266        116 MFFFCPNCHIRFTFDEAME---------YGFRCPQCGEMLE  147 (178)
T ss_pred             CEEECCCCCcEEeHHHHhh---------cCCcCCCCCCCCe
Confidence            6799999999998877653         3699999997654


No 115
>KOG4377|consensus
Probab=81.36  E-value=7.7  Score=34.29  Aligned_cols=27  Identities=22%  Similarity=0.533  Sum_probs=21.2

Q ss_pred             Ceec--CCCCccCCCHHHHHHHHHHhccC
Q psy5202         119 PFKC--TVCDHACSQSAKLKKHMKIHDED  145 (343)
Q Consensus       119 ~~~C--~~C~~~f~~~~~L~~H~~~h~~~  145 (343)
                      .|.|  +.|++.+..+..+.+|.++|...
T Consensus       271 hyhcl~e~C~ykr~~k~DvirH~~~hkkr  299 (480)
T KOG4377|consen  271 HYHCLNEYCFYKRGQKNDVIRHVEIHKKR  299 (480)
T ss_pred             hhcccCccccccccchhhhHHHHHHHhhc
Confidence            3566  45998888899999999988643


No 116
>COG5188 PRP9 Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=80.92  E-value=1.4  Score=37.72  Aligned_cols=23  Identities=22%  Similarity=0.446  Sum_probs=20.5

Q ss_pred             CCcccccCcccccCHHHHHHHHH
Q psy5202         184 RNDTCEFCGKVFKNCSNLTVHRR  206 (343)
Q Consensus       184 ~~~~C~~C~~~f~~~~~l~~H~~  206 (343)
                      ..+-|..|++.|....-+..|+.
T Consensus       237 ~~~YC~~C~r~f~~~~VFe~Hl~  259 (470)
T COG5188         237 PKVYCVKCGREFSRSKVFEYHLE  259 (470)
T ss_pred             cceeeHhhhhHhhhhHHHHHHHh
Confidence            56889999999999999999975


No 117
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=80.88  E-value=0.34  Score=27.55  Aligned_cols=31  Identities=23%  Similarity=0.581  Sum_probs=21.3

Q ss_pred             cccccccccccCChHHHHHHHHhccCCCCeecCCCCcc
Q psy5202          91 NKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHA  128 (343)
Q Consensus        91 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~  128 (343)
                      .|+|..||..|.......       ......|+.||..
T Consensus         5 ~y~C~~Cg~~fe~~~~~~-------~~~~~~CP~Cg~~   35 (41)
T smart00834        5 EYRCEDCGHTFEVLQKIS-------DDPLATCPECGGD   35 (41)
T ss_pred             EEEcCCCCCEEEEEEecC-------CCCCCCCCCCCCc
Confidence            389999999987443321       1355679999873


No 118
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=80.80  E-value=2.1  Score=32.96  Aligned_cols=20  Identities=10%  Similarity=0.100  Sum_probs=14.1

Q ss_pred             CCCeecCCCCccCCCHHHHH
Q psy5202         117 EKPFKCTVCDHACSQSAKLK  136 (343)
Q Consensus       117 ~~~~~C~~C~~~f~~~~~L~  136 (343)
                      ..-|.|+.|+..|.....+.
T Consensus       107 ~~~Y~Cp~c~~r~tf~eA~~  126 (158)
T TIGR00373       107 NMFFICPNMCVRFTFNEAME  126 (158)
T ss_pred             CCeEECCCCCcEeeHHHHHH
Confidence            45578888888777766653


No 119
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=79.87  E-value=1.1  Score=31.36  Aligned_cols=27  Identities=4%  Similarity=-0.054  Sum_probs=20.7

Q ss_pred             ecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCC
Q psy5202         215 KCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFS  254 (343)
Q Consensus       215 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~  254 (343)
                      .|+.|++.|...             ...|..|++||++|.
T Consensus        11 idPetg~KFYDL-------------NrdPiVsPytG~s~P   37 (129)
T COG4530          11 IDPETGKKFYDL-------------NRDPIVSPYTGKSYP   37 (129)
T ss_pred             cCccccchhhcc-------------CCCccccCcccccch
Confidence            578888888642             567888999999883


No 120
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=79.73  E-value=0.51  Score=30.72  Aligned_cols=17  Identities=18%  Similarity=0.440  Sum_probs=9.6

Q ss_pred             CcccccC--CCCCCCCChH
Q psy5202         241 KDVYRCR--FCEMPFSVPS  257 (343)
Q Consensus       241 ~~~~~C~--~C~~~f~~~~  257 (343)
                      +.-++|.  .||.+|....
T Consensus        25 ~~Y~qC~N~eCg~tF~t~e   43 (72)
T PRK09678         25 ERYHQCQNVNCSATFITYE   43 (72)
T ss_pred             eeeeecCCCCCCCEEEEEE
Confidence            4455665  5666665543


No 121
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=79.29  E-value=0.41  Score=31.13  Aligned_cols=41  Identities=17%  Similarity=0.316  Sum_probs=26.8

Q ss_pred             cccccCcccccCHHHHHHHHHHccCCCceecC--cCcccccChHH
Q psy5202         186 DTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCE--LCSYACAQSSK  228 (343)
Q Consensus       186 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~--~C~~~f~~~~~  228 (343)
                      +.|+.||..-.....-..+.  ...+..++|.  .||.+|.....
T Consensus         2 m~CP~Cg~~a~irtSr~~s~--~~~~~Y~qC~N~eCg~tF~t~es   44 (72)
T PRK09678          2 FHCPLCQHAAHARTSRYITD--TTKERYHQCQNVNCSATFITYES   44 (72)
T ss_pred             ccCCCCCCccEEEEChhcCh--hhheeeeecCCCCCCCEEEEEEE
Confidence            67999997764333322222  2456778998  89999986544


No 122
>PF09723 Zn-ribbon_8:  Zinc ribbon domain;  InterPro: IPR013429  This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=79.22  E-value=0.81  Score=26.29  Aligned_cols=11  Identities=36%  Similarity=0.960  Sum_probs=5.6

Q ss_pred             eecCcCccccc
Q psy5202         214 YKCELCSYACA  224 (343)
Q Consensus       214 ~~C~~C~~~f~  224 (343)
                      |+|..||..|.
T Consensus         6 y~C~~Cg~~fe   16 (42)
T PF09723_consen    6 YRCEECGHEFE   16 (42)
T ss_pred             EEeCCCCCEEE
Confidence            45555555444


No 123
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=79.18  E-value=1.8  Score=40.51  Aligned_cols=41  Identities=22%  Similarity=0.454  Sum_probs=26.6

Q ss_pred             CcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCC
Q psy5202         185 NDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMP  252 (343)
Q Consensus       185 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~  252 (343)
                      ...|+.|+-..          ..|.......|.+||+..                 ..+..|+.||-.
T Consensus       222 ~~~C~~C~~~l----------~~h~~~~~l~Ch~Cg~~~-----------------~~~~~Cp~C~s~  262 (505)
T TIGR00595       222 ILCCPNCDVSL----------TYHKKEGKLRCHYCGYQE-----------------PIPKTCPQCGSE  262 (505)
T ss_pred             ccCCCCCCCce----------EEecCCCeEEcCCCcCcC-----------------CCCCCCCCCCCC
Confidence            35677776443          234556678899999443                 346789999864


No 124
>PF04959 ARS2:  Arsenite-resistance protein 2;  InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=79.09  E-value=1.3  Score=35.92  Aligned_cols=31  Identities=26%  Similarity=0.532  Sum_probs=20.1

Q ss_pred             CCcccccCCCCCCCCChHHHHHHHhhhcCcc
Q psy5202         240 GKDVYRCRFCEMPFSVPSTLEKHMRKCVVNQ  270 (343)
Q Consensus       240 ~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~  270 (343)
                      .+..|.|+.|+|.|.-..-+.+|+...|...
T Consensus        74 ~~~K~~C~lc~KlFkg~eFV~KHI~nKH~e~  104 (214)
T PF04959_consen   74 DEDKWRCPLCGKLFKGPEFVRKHIFNKHPEK  104 (214)
T ss_dssp             SSEEEEE-SSS-EESSHHHHHHHHHHH-HHH
T ss_pred             cCCEECCCCCCcccCChHHHHHHHhhcCHHH
Confidence            4556778888888888888888887766543


No 125
>KOG2071|consensus
Probab=78.88  E-value=1.7  Score=40.19  Aligned_cols=26  Identities=23%  Similarity=0.535  Sum_probs=21.7

Q ss_pred             CCcccccCcccccCHHHHHHHHHHcc
Q psy5202         184 RNDTCEFCGKVFKNCSNLTVHRRSHT  209 (343)
Q Consensus       184 ~~~~C~~C~~~f~~~~~l~~H~~~h~  209 (343)
                      .+..|..||.+|.+......||..|.
T Consensus       417 ~pnqC~~CG~R~~~~ee~sk~md~H~  442 (579)
T KOG2071|consen  417 SPNQCKSCGLRFDDSEERSKHMDIHD  442 (579)
T ss_pred             CcchhcccccccccchhhhhHhhhhh
Confidence            56789999999999888888877664


No 126
>PF05443 ROS_MUCR:  ROS/MUCR transcriptional regulator protein;  InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=77.62  E-value=1.2  Score=32.98  Aligned_cols=27  Identities=44%  Similarity=0.678  Sum_probs=17.3

Q ss_pred             CCcccccccccccCChHHHHHHHHhccCCC
Q psy5202          89 EKNKECEFCGKKFKYENTLQAHRRNHTGEK  118 (343)
Q Consensus        89 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~  118 (343)
                      +.-..|-+||+.|..   |.+|++.|+|..
T Consensus        70 ~d~i~clecGk~~k~---LkrHL~~~~glt   96 (132)
T PF05443_consen   70 PDYIICLECGKKFKT---LKRHLRTHHGLT   96 (132)
T ss_dssp             SS-EE-TBT--EESB---HHHHHHHTT-S-
T ss_pred             cCeeEEccCCcccch---HHHHHHHccCCC
Confidence            344689999999975   699999998753


No 127
>PF06524 NOA36:  NOA36 protein;  InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=77.44  E-value=1.1  Score=36.64  Aligned_cols=32  Identities=13%  Similarity=0.146  Sum_probs=24.5

Q ss_pred             CCcccccCCCCCCCCChHHHHHHHhhhcCccc
Q psy5202         240 GKDVYRCRFCEMPFSVPSTLEKHMRKCVVNQN  271 (343)
Q Consensus       240 ~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~  271 (343)
                      ..+++.||.||+.......|-.-.|+|...+.
T Consensus       206 k~k~~PCPKCg~et~eTkdLSmStR~hkyGRQ  237 (314)
T PF06524_consen  206 KGKPIPCPKCGYETQETKDLSMSTRSHKYGRQ  237 (314)
T ss_pred             cCCCCCCCCCCCcccccccceeeeecchhccc
Confidence            45789999999998888888777777655443


No 128
>PF06524 NOA36:  NOA36 protein;  InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=77.14  E-value=0.96  Score=37.02  Aligned_cols=49  Identities=24%  Similarity=0.518  Sum_probs=30.5

Q ss_pred             CCcccccCcccccCHHHHHHHHHHc----cCCCceecCcCcccccChHHHHHHHHHhc
Q psy5202         184 RNDTCEFCGKVFKNCSNLTVHRRSH----TGEKPYKCELCSYACAQSSKLTRHMKTHG  237 (343)
Q Consensus       184 ~~~~C~~C~~~f~~~~~l~~H~~~h----~~~~~~~C~~C~~~f~~~~~L~~H~~~h~  237 (343)
                      ..|.|-.|..-|-.     .|++..    ...+++.|+.|++-......|..-.|.|.
T Consensus       181 Gq~sCLRCK~cfCd-----dHvrrKg~ky~k~k~~PCPKCg~et~eTkdLSmStR~hk  233 (314)
T PF06524_consen  181 GQYSCLRCKICFCD-----DHVRRKGFKYEKGKPIPCPKCGYETQETKDLSMSTRSHK  233 (314)
T ss_pred             cchhhhheeeeehh-----hhhhhcccccccCCCCCCCCCCCcccccccceeeeecch
Confidence            34555555555533     355432    23478999999998887777765555553


No 129
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=77.11  E-value=3  Score=42.44  Aligned_cols=42  Identities=24%  Similarity=0.584  Sum_probs=25.8

Q ss_pred             cccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCC
Q psy5202         186 DTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSV  255 (343)
Q Consensus       186 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~  255 (343)
                      ..|+.||...               +..|.|+.||.......           .+  ...|+.||.....
T Consensus       680 ~fCP~CGs~t---------------e~vy~CPsCGaev~~de-----------s~--a~~CP~CGtplv~  721 (1337)
T PRK14714        680 NRCPDCGTHT---------------EPVYVCPDCGAEVPPDE-----------SG--RVECPRCDVELTP  721 (1337)
T ss_pred             ccCcccCCcC---------------CCceeCccCCCccCCCc-----------cc--cccCCCCCCcccc
Confidence            3677787664               23578999997543310           12  4469999965443


No 130
>PRK14873 primosome assembly protein PriA; Provisional
Probab=76.99  E-value=2.1  Score=41.39  Aligned_cols=40  Identities=20%  Similarity=0.492  Sum_probs=26.6

Q ss_pred             CcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCC
Q psy5202         185 NDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMP  252 (343)
Q Consensus       185 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~  252 (343)
                      ...|+.|+-....          |.......|.+||+.                  ..++.|+.||..
T Consensus       392 ~~~C~~C~~~L~~----------h~~~~~l~Ch~CG~~------------------~~p~~Cp~Cgs~  431 (665)
T PRK14873        392 PARCRHCTGPLGL----------PSAGGTPRCRWCGRA------------------APDWRCPRCGSD  431 (665)
T ss_pred             eeECCCCCCceeE----------ecCCCeeECCCCcCC------------------CcCccCCCCcCC
Confidence            4567777755432          444567889999942                  236789999865


No 131
>PF02176 zf-TRAF:  TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=76.95  E-value=1.3  Score=27.55  Aligned_cols=21  Identities=19%  Similarity=0.428  Sum_probs=8.8

Q ss_pred             hHHHHHHHHHhcCCCCcccccCC
Q psy5202         226 SSKLTRHMKTHGRLGKDVYRCRF  248 (343)
Q Consensus       226 ~~~L~~H~~~h~~~~~~~~~C~~  248 (343)
                      +..|..|+...-  ..++..|++
T Consensus        23 r~~l~~H~~~~C--~~~~v~C~~   43 (60)
T PF02176_consen   23 RKELDDHLENEC--PKRPVPCPY   43 (60)
T ss_dssp             CCCHHHHHHTTS--TTSEEE-SS
T ss_pred             HHHHHHHHHccC--CCCcEECCC
Confidence            345555554222  234445555


No 132
>PRK04023 DNA polymerase II large subunit; Validated
Probab=76.88  E-value=5.6  Score=39.60  Aligned_cols=25  Identities=28%  Similarity=0.638  Sum_probs=18.2

Q ss_pred             CCCcccccccccccCChHHHHHHHHhccCCCCeecCCCCcc
Q psy5202          88 EEKNKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHA  128 (343)
Q Consensus        88 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~  128 (343)
                      ......|+.||...                ..+.|+.||..
T Consensus       623 EVg~RfCpsCG~~t----------------~~frCP~CG~~  647 (1121)
T PRK04023        623 EIGRRKCPSCGKET----------------FYRRCPFCGTH  647 (1121)
T ss_pred             cccCccCCCCCCcC----------------CcccCCCCCCC
Confidence            44567899999873                23788888865


No 133
>KOG2071|consensus
Probab=76.85  E-value=8.2  Score=36.00  Aligned_cols=30  Identities=20%  Similarity=0.455  Sum_probs=24.3

Q ss_pred             CCCCCcccccccccccCChHHHHHHHHhcc
Q psy5202          86 SQEEKNKECEFCGKKFKYENTLQAHRRNHT  115 (343)
Q Consensus        86 ~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~  115 (343)
                      .....+.+|..||.+|........||..|.
T Consensus       413 Ly~~~pnqC~~CG~R~~~~ee~sk~md~H~  442 (579)
T KOG2071|consen  413 LYKDSPNQCKSCGLRFDDSEERSKHMDIHD  442 (579)
T ss_pred             hccCCcchhcccccccccchhhhhHhhhhh
Confidence            345677899999999999888888777763


No 134
>PF10571 UPF0547:  Uncharacterised protein family UPF0547;  InterPro: IPR018886  This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases. 
Probab=76.03  E-value=1.6  Score=22.10  Aligned_cols=10  Identities=30%  Similarity=0.810  Sum_probs=5.9

Q ss_pred             ccCCCCCCCC
Q psy5202         245 RCRFCEMPFS  254 (343)
Q Consensus       245 ~C~~C~~~f~  254 (343)
                      .|+.||+.|.
T Consensus        16 ~Cp~CG~~F~   25 (26)
T PF10571_consen   16 FCPHCGYDFE   25 (26)
T ss_pred             cCCCCCCCCc
Confidence            4666666653


No 135
>PF05443 ROS_MUCR:  ROS/MUCR transcriptional regulator protein;  InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=75.58  E-value=1.3  Score=32.85  Aligned_cols=26  Identities=23%  Similarity=0.351  Sum_probs=15.4

Q ss_pred             cccccCCCCCCCCChHHHHHHHhhhcCcc
Q psy5202         242 DVYRCRFCEMPFSVPSTLEKHMRKCVVNQ  270 (343)
Q Consensus       242 ~~~~C~~C~~~f~~~~~l~~H~~~~~~~~  270 (343)
                      .-..|-+||+.|...   .+|+++||+-.
T Consensus        71 d~i~clecGk~~k~L---krHL~~~~glt   96 (132)
T PF05443_consen   71 DYIICLECGKKFKTL---KRHLRTHHGLT   96 (132)
T ss_dssp             S-EE-TBT--EESBH---HHHHHHTT-S-
T ss_pred             CeeEEccCCcccchH---HHHHHHccCCC
Confidence            346788899888765   88999887533


No 136
>PF15269 zf-C2H2_7:  Zinc-finger
Probab=74.79  E-value=3.7  Score=23.48  Aligned_cols=25  Identities=16%  Similarity=0.183  Sum_probs=19.3

Q ss_pred             CCcccccccccccCChHHHHHHHHh
Q psy5202          89 EKNKECEFCGKKFKYENTLQAHRRN  113 (343)
Q Consensus        89 ~~~~~C~~C~~~f~~~~~L~~H~~~  113 (343)
                      .-.|+|-+|......++.|-.||+.
T Consensus        18 p~~ykcfqcpftc~~kshl~nhmky   42 (54)
T PF15269_consen   18 PFKYKCFQCPFTCNEKSHLFNHMKY   42 (54)
T ss_pred             CccceeecCCcccchHHHHHHHHHH
Confidence            3457888888888888888888764


No 137
>PF13878 zf-C2H2_3:  zinc-finger of acetyl-transferase ESCO
Probab=73.75  E-value=3.1  Score=23.73  Aligned_cols=24  Identities=21%  Similarity=0.564  Sum_probs=18.9

Q ss_pred             ccccccccccC--ChHHHHHHHHhcc
Q psy5202          92 KECEFCGKKFK--YENTLQAHRRNHT  115 (343)
Q Consensus        92 ~~C~~C~~~f~--~~~~L~~H~~~h~  115 (343)
                      -.|+.||..|.  ....-..|.+.|.
T Consensus        14 ~~C~~CgM~Y~~~~~eD~~~H~~yH~   39 (41)
T PF13878_consen   14 TTCPTCGMLYSPGSPEDEKLHKKYHD   39 (41)
T ss_pred             cCCCCCCCEECCCCHHHHHHHHHHHh
Confidence            58999999987  5666778887774


No 138
>PF08274 PhnA_Zn_Ribbon:  PhnA Zinc-Ribbon ;  InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=72.87  E-value=1.8  Score=22.75  Aligned_cols=9  Identities=33%  Similarity=1.276  Sum_probs=1.6

Q ss_pred             cccCccccc
Q psy5202         188 CEFCGKVFK  196 (343)
Q Consensus       188 C~~C~~~f~  196 (343)
                      |+.|+..+.
T Consensus         5 Cp~C~se~~   13 (30)
T PF08274_consen    5 CPLCGSEYT   13 (30)
T ss_dssp             -TTT-----
T ss_pred             CCCCCCcce
Confidence            555554443


No 139
>PF14353 CpXC:  CpXC protein
Probab=72.70  E-value=3  Score=30.86  Aligned_cols=23  Identities=22%  Similarity=0.424  Sum_probs=13.2

Q ss_pred             ccccCCCCCCCCChHHHHHHHhh
Q psy5202         243 VYRCRFCEMPFSVPSTLEKHMRK  265 (343)
Q Consensus       243 ~~~C~~C~~~f~~~~~l~~H~~~  265 (343)
                      .|.|+.||..|.-...+..|-..
T Consensus        38 ~~~CP~Cg~~~~~~~p~lY~D~~   60 (128)
T PF14353_consen   38 SFTCPSCGHKFRLEYPLLYHDPE   60 (128)
T ss_pred             EEECCCCCCceecCCCEEEEcCC
Confidence            45666666666555555555443


No 140
>KOG4377|consensus
Probab=72.38  E-value=13  Score=32.89  Aligned_cols=52  Identities=21%  Similarity=0.496  Sum_probs=34.1

Q ss_pred             CCCccccc--ccccccCChHHHHHHHHhccCC------------CCeecC--CCCccCCCHHHHHHHHHHh
Q psy5202          88 EEKNKECE--FCGKKFKYENTLQAHRRNHTGE------------KPFKCT--VCDHACSQSAKLKKHMKIH  142 (343)
Q Consensus        88 ~~~~~~C~--~C~~~f~~~~~L~~H~~~h~~~------------~~~~C~--~C~~~f~~~~~L~~H~~~h  142 (343)
                      +..-|.|-  .|+..+-.+..+.+|...|...            ..|.|.  .|.+   +-++...|-..|
T Consensus       268 ~rEhyhcl~e~C~ykr~~k~DvirH~~~hkkrdnsL~dgf~rfs~syhC~~~~C~k---sTsdV~~h~nFh  335 (480)
T KOG4377|consen  268 GREHYHCLNEYCFYKRGQKNDVIRHVEIHKKRDNSLIDGFHRFSNSYHCTGQICEK---STSDVLLHDNFH  335 (480)
T ss_pred             cchhhcccCccccccccchhhhHHHHHHHhhcccccccchhhcCccchhhhcccCc---ccccccccCccc
Confidence            45568884  5988877799999999988543            125564  3766   334455555555


No 141
>TIGR01384 TFS_arch transcription factor S, archaeal. There has been an apparent duplication event in the Halobacteriaceae lineage (Haloarcula, Haloferax, Haloquadratum, Halobacterium and Natromonas). There appears to be a separate duplication in Methanosphaera stadtmanae.
Probab=71.46  E-value=0.21  Score=35.55  Aligned_cols=27  Identities=19%  Similarity=0.495  Sum_probs=18.6

Q ss_pred             cccccccccCChHHHHHHHHhccCCCCeecCCCCccCCC
Q psy5202          93 ECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHACSQ  131 (343)
Q Consensus        93 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~  131 (343)
                      -|+.||..+...            ...+.|+.|++.+..
T Consensus         2 fC~~Cg~~l~~~------------~~~~~C~~C~~~~~~   28 (104)
T TIGR01384         2 FCPKCGSLMTPK------------NGVYVCPSCGYEKEK   28 (104)
T ss_pred             CCcccCcccccC------------CCeEECcCCCCcccc
Confidence            388888877421            135889999987654


No 142
>KOG2593|consensus
Probab=70.71  E-value=4.6  Score=35.97  Aligned_cols=38  Identities=16%  Similarity=0.298  Sum_probs=27.5

Q ss_pred             CCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCC
Q psy5202         210 GEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMP  252 (343)
Q Consensus       210 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~  252 (343)
                      ...-|.|+.|.+.|.....++-=   -  .....|.|..|+-.
T Consensus       125 ~~~~Y~Cp~C~kkyt~Lea~~L~---~--~~~~~F~C~~C~ge  162 (436)
T KOG2593|consen  125 NVAGYVCPNCQKKYTSLEALQLL---D--NETGEFHCENCGGE  162 (436)
T ss_pred             ccccccCCccccchhhhHHHHhh---c--ccCceEEEecCCCc
Confidence            34569999999999988777542   1  12456999999854


No 143
>PF04959 ARS2:  Arsenite-resistance protein 2;  InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=70.04  E-value=3.5  Score=33.42  Aligned_cols=26  Identities=15%  Similarity=0.445  Sum_probs=15.0

Q ss_pred             CceecCcCcccccChHHHHHHHHHhc
Q psy5202         212 KPYKCELCSYACAQSSKLTRHMKTHG  237 (343)
Q Consensus       212 ~~~~C~~C~~~f~~~~~L~~H~~~h~  237 (343)
                      ..|.|+.|+|.|.-..-+++|+..-|
T Consensus        76 ~K~~C~lc~KlFkg~eFV~KHI~nKH  101 (214)
T PF04959_consen   76 DKWRCPLCGKLFKGPEFVRKHIFNKH  101 (214)
T ss_dssp             EEEEE-SSS-EESSHHHHHHHHHHH-
T ss_pred             CEECCCCCCcccCChHHHHHHHhhcC
Confidence            34667777777777777777765533


No 144
>PRK05580 primosome assembly protein PriA; Validated
Probab=69.69  E-value=4  Score=39.80  Aligned_cols=41  Identities=20%  Similarity=0.480  Sum_probs=24.9

Q ss_pred             CcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCC
Q psy5202         185 NDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMP  252 (343)
Q Consensus       185 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~  252 (343)
                      ...|+.|+-.+.          .|.......|.+||+..                 ..+..|+.||..
T Consensus       390 ~~~C~~C~~~l~----------~h~~~~~l~Ch~Cg~~~-----------------~~~~~Cp~Cg~~  430 (679)
T PRK05580        390 VAECPHCDASLT----------LHRFQRRLRCHHCGYQE-----------------PIPKACPECGST  430 (679)
T ss_pred             ccCCCCCCCcee----------EECCCCeEECCCCcCCC-----------------CCCCCCCCCcCC
Confidence            356777765432          23445667788888433                 345678888754


No 145
>KOG1280|consensus
Probab=68.06  E-value=4.5  Score=34.84  Aligned_cols=23  Identities=13%  Similarity=0.465  Sum_probs=12.1

Q ss_pred             ceecCcCcccccChHHHHHHHHH
Q psy5202         213 PYKCELCSYACAQSSKLTRHMKT  235 (343)
Q Consensus       213 ~~~C~~C~~~f~~~~~L~~H~~~  235 (343)
                      -|.|++|++.-.+...|..|+..
T Consensus        79 SftCPyC~~~Gfte~~f~~Hv~s  101 (381)
T KOG1280|consen   79 SFTCPYCGIMGFTERQFGTHVLS  101 (381)
T ss_pred             cccCCcccccccchhHHHHHhhh
Confidence            35555555555555555555443


No 146
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=67.79  E-value=2.7  Score=28.36  Aligned_cols=14  Identities=21%  Similarity=0.902  Sum_probs=8.8

Q ss_pred             CceecCcCcccccC
Q psy5202         212 KPYKCELCSYACAQ  225 (343)
Q Consensus       212 ~~~~C~~C~~~f~~  225 (343)
                      +|-+|..||+.|..
T Consensus        57 ~Pa~CkkCGfef~~   70 (97)
T COG3357          57 RPARCKKCGFEFRD   70 (97)
T ss_pred             cChhhcccCccccc
Confidence            35567777766654


No 147
>PF12760 Zn_Tnp_IS1595:  Transposase zinc-ribbon domain;  InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=66.80  E-value=5.3  Score=23.40  Aligned_cols=10  Identities=40%  Similarity=1.099  Sum_probs=6.1

Q ss_pred             cccccCCCCC
Q psy5202         242 DVYRCRFCEM  251 (343)
Q Consensus       242 ~~~~C~~C~~  251 (343)
                      ..|+|..|++
T Consensus        36 ~~~~C~~C~~   45 (46)
T PF12760_consen   36 GRYRCKACRK   45 (46)
T ss_pred             CeEECCCCCC
Confidence            4566666654


No 148
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=66.73  E-value=2.4  Score=31.71  Aligned_cols=8  Identities=25%  Similarity=0.804  Sum_probs=4.4

Q ss_pred             cccCCCCC
Q psy5202         244 YRCRFCEM  251 (343)
Q Consensus       244 ~~C~~C~~  251 (343)
                      +.||.||.
T Consensus       108 ~~CP~Cgs  115 (135)
T PRK03824        108 LKCPKCGS  115 (135)
T ss_pred             cCCcCCCC
Confidence            45666653


No 149
>KOG2593|consensus
Probab=66.37  E-value=4.9  Score=35.82  Aligned_cols=22  Identities=14%  Similarity=0.365  Sum_probs=16.9

Q ss_pred             CCCCeecCCCCccCCCHHHHHH
Q psy5202         116 GEKPFKCTVCDHACSQSAKLKK  137 (343)
Q Consensus       116 ~~~~~~C~~C~~~f~~~~~L~~  137 (343)
                      ....|.|+.|.+.|.....++.
T Consensus       125 ~~~~Y~Cp~C~kkyt~Lea~~L  146 (436)
T KOG2593|consen  125 NVAGYVCPNCQKKYTSLEALQL  146 (436)
T ss_pred             ccccccCCccccchhhhHHHHh
Confidence            3466999999999988776553


No 150
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=66.26  E-value=2.3  Score=25.92  Aligned_cols=10  Identities=20%  Similarity=0.730  Sum_probs=4.6

Q ss_pred             eecCcCcccc
Q psy5202         214 YKCELCSYAC  223 (343)
Q Consensus       214 ~~C~~C~~~f  223 (343)
                      |.|+.||..+
T Consensus         3 ~~CP~CG~~i   12 (54)
T TIGR01206         3 FECPDCGAEI   12 (54)
T ss_pred             cCCCCCCCEE
Confidence            3455555443


No 151
>PF14446 Prok-RING_1:  Prokaryotic RING finger family 1
Probab=65.59  E-value=4.6  Score=24.53  Aligned_cols=9  Identities=56%  Similarity=1.398  Sum_probs=4.3

Q ss_pred             ccccCcccc
Q psy5202         187 TCEFCGKVF  195 (343)
Q Consensus       187 ~C~~C~~~f  195 (343)
                      +|..||+.|
T Consensus         7 ~C~~Cg~~~   15 (54)
T PF14446_consen    7 KCPVCGKKF   15 (54)
T ss_pred             cChhhCCcc
Confidence            444555444


No 152
>KOG3408|consensus
Probab=65.30  E-value=3.5  Score=29.55  Aligned_cols=27  Identities=22%  Similarity=0.439  Sum_probs=22.4

Q ss_pred             CCcccccCCCCCCCCChHHHHHHHhhh
Q psy5202         240 GKDVYRCRFCEMPFSVPSTLEKHMRKC  266 (343)
Q Consensus       240 ~~~~~~C~~C~~~f~~~~~l~~H~~~~  266 (343)
                      |...|.|-.|.+-|.+...|..|.++.
T Consensus        54 G~GqfyCi~CaRyFi~~~~l~~H~ktK   80 (129)
T KOG3408|consen   54 GGGQFYCIECARYFIDAKALKTHFKTK   80 (129)
T ss_pred             CCceeehhhhhhhhcchHHHHHHHhcc
Confidence            455688999999999999999998853


No 153
>PRK12380 hydrogenase nickel incorporation protein HybF; Provisional
Probab=65.22  E-value=4  Score=29.43  Aligned_cols=11  Identities=27%  Similarity=0.293  Sum_probs=6.1

Q ss_pred             eecCcCccccc
Q psy5202         214 YKCELCSYACA  224 (343)
Q Consensus       214 ~~C~~C~~~f~  224 (343)
                      ++|..|+..|.
T Consensus        71 ~~C~~Cg~~~~   81 (113)
T PRK12380         71 AWCWDCSQVVE   81 (113)
T ss_pred             EEcccCCCEEe
Confidence            55666665554


No 154
>PF05191 ADK_lid:  Adenylate kinase, active site lid;  InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=64.91  E-value=1.5  Score=24.16  Aligned_cols=10  Identities=30%  Similarity=1.222  Sum_probs=5.4

Q ss_pred             ccccCccccc
Q psy5202         187 TCEFCGKVFK  196 (343)
Q Consensus       187 ~C~~C~~~f~  196 (343)
                      .|+.||..|.
T Consensus         3 ~C~~Cg~~Yh   12 (36)
T PF05191_consen    3 ICPKCGRIYH   12 (36)
T ss_dssp             EETTTTEEEE
T ss_pred             CcCCCCCccc
Confidence            4555555553


No 155
>PF07975 C1_4:  TFIIH C1-like domain;  InterPro: IPR004595 All proteins in this domain for which functions are known are components of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. It includes the yeast transcription factor Ssl1 (Suppressor of stem-loop protein 1) that is essential for translation initiation and affects UV resistance. The C-terminal region is essential for transcription activity. This regions binds three zinc atoms through two independent domain. The first contains a C4 zinc finger motif, whereas the second is characterised by a CX(2)CX(2-4)FCADCD motif. The solution structure of the second C-terminal domain revealed homology with the regulatory domain of protein kinase C [].; GO: 0006281 DNA repair, 0005634 nucleus; PDB: 1Z60_A.
Probab=64.91  E-value=1.7  Score=26.09  Aligned_cols=23  Identities=17%  Similarity=0.390  Sum_probs=8.2

Q ss_pred             ceecCcCcccccChHHHHHHHHH
Q psy5202         213 PYKCELCSYACAQSSKLTRHMKT  235 (343)
Q Consensus       213 ~~~C~~C~~~f~~~~~L~~H~~~  235 (343)
                      .|+|+.|...|=..-++..|...
T Consensus        21 ~y~C~~C~~~FC~dCD~fiHE~L   43 (51)
T PF07975_consen   21 RYRCPKCKNHFCIDCDVFIHETL   43 (51)
T ss_dssp             EE--TTTT--B-HHHHHTTTTTS
T ss_pred             eEECCCCCCccccCcChhhhccc
Confidence            35555555555444444444333


No 156
>PF07754 DUF1610:  Domain of unknown function (DUF1610);  InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=64.76  E-value=3.3  Score=20.45  Aligned_cols=8  Identities=25%  Similarity=0.858  Sum_probs=4.4

Q ss_pred             ccccCCCC
Q psy5202         243 VYRCRFCE  250 (343)
Q Consensus       243 ~~~C~~C~  250 (343)
                      .|.|+.||
T Consensus        16 ~f~CPnCG   23 (24)
T PF07754_consen   16 PFPCPNCG   23 (24)
T ss_pred             eEeCCCCC
Confidence            45555555


No 157
>KOG2807|consensus
Probab=63.98  E-value=9.3  Score=32.70  Aligned_cols=100  Identities=22%  Similarity=0.313  Sum_probs=58.8

Q ss_pred             CcccccccccccCChHHHHHHHHhccCCCCeecCCCCccCCCHHHHHHHHHHhccCCCCccccCCCCcchHHHHHHHHHh
Q psy5202          90 KNKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHACSQSAKLKKHMKIHDEDGDKEDREEGPSASSLAAAAAAAAL  169 (343)
Q Consensus        90 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~  169 (343)
                      .-|.|+.|+...-              .-|..|++|+-.......|.+-...--.-+++................     
T Consensus       275 ~Gy~CP~CkakvC--------------sLP~eCpiC~ltLVss~hLARSyhhL~PL~~F~Eip~~~~~~~~~Cf~-----  335 (378)
T KOG2807|consen  275 GGYFCPQCKAKVC--------------SLPIECPICSLTLVSSPHLARSYHHLFPLKPFVEIPETEYNGSRFCFA-----  335 (378)
T ss_pred             CceeCCcccCeee--------------cCCccCCccceeEecchHHHHHHHhhcCCcchhhccccccCCCcceee-----
Confidence            3499999986542              357899999999888877766443322222221111111000000000     


Q ss_pred             hhcCCCCcccccCCCCcccccCcccccCHHHHHHHHHHccCCCceecCcCc
Q psy5202         170 NLCLPNSHMKKESRRNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCS  220 (343)
Q Consensus       170 ~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~  220 (343)
                        +    .-.......|+|..|...|-..-+...|...|.      |+.|.
T Consensus       336 --C----~~~~~~~~~y~C~~Ck~~FCldCDv~iHesLh~------CpgCe  374 (378)
T KOG2807|consen  336 --C----QGELLSSGRYRCESCKNVFCLDCDVFIHESLHN------CPGCE  374 (378)
T ss_pred             --e----ccccCCCCcEEchhccceeeccchHHHHhhhhc------CCCcC
Confidence              0    000112367999999999999999999987763      77775


No 158
>KOG4167|consensus
Probab=63.21  E-value=1.2  Score=42.11  Aligned_cols=29  Identities=21%  Similarity=0.434  Sum_probs=25.6

Q ss_pred             cccccCCCCCCCCChHHHHHHHhhhcCcc
Q psy5202         242 DVYRCRFCEMPFSVPSTLEKHMRKCVVNQ  270 (343)
Q Consensus       242 ~~~~C~~C~~~f~~~~~l~~H~~~~~~~~  270 (343)
                      ..|.|..|+|.|-...++..||++|....
T Consensus       791 giFpCreC~kvF~KiKSrNAHMK~Hr~q~  819 (907)
T KOG4167|consen  791 GIFPCRECGKVFFKIKSRNAHMKTHRQQE  819 (907)
T ss_pred             ceeehHHHHHHHHHHhhhhHHHHHHHHHH
Confidence            45999999999999999999999997544


No 159
>PF08209 Sgf11:  Sgf11 (transcriptional regulation protein);  InterPro: IPR013246 The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae (Baker's yeast). The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation [].; PDB: 3M99_B 2LO2_A 3MHH_C 3MHS_C.
Probab=62.65  E-value=4.6  Score=21.77  Aligned_cols=25  Identities=28%  Similarity=0.646  Sum_probs=17.0

Q ss_pred             ccccCCCCCCCCChHHHHHHHhhhcC
Q psy5202         243 VYRCRFCEMPFSVPSTLEKHMRKCVV  268 (343)
Q Consensus       243 ~~~C~~C~~~f~~~~~l~~H~~~~~~  268 (343)
                      .|.|+.|++.+ ....+..|+.+|-+
T Consensus         4 ~~~C~nC~R~v-~a~RfA~HLekCmg   28 (33)
T PF08209_consen    4 YVECPNCGRPV-AASRFAPHLEKCMG   28 (33)
T ss_dssp             EEE-TTTSSEE-EGGGHHHHHHHHTC
T ss_pred             eEECCCCcCCc-chhhhHHHHHHHHc
Confidence            46788888765 45567888887764


No 160
>PF09845 DUF2072:  Zn-ribbon containing protein (DUF2072);  InterPro: IPR018645  This archaeal Zinc-ribbon containing proteins have no known function. 
Probab=62.04  E-value=4.5  Score=29.66  Aligned_cols=15  Identities=27%  Similarity=0.647  Sum_probs=11.1

Q ss_pred             ceecCcCcccccChH
Q psy5202         213 PYKCELCSYACAQSS  227 (343)
Q Consensus       213 ~~~C~~C~~~f~~~~  227 (343)
                      |++|..||+.|..-+
T Consensus         1 PH~Ct~Cg~~f~dgs   15 (131)
T PF09845_consen    1 PHQCTKCGRVFEDGS   15 (131)
T ss_pred             CcccCcCCCCcCCCc
Confidence            567888888887644


No 161
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=61.78  E-value=4  Score=29.84  Aligned_cols=25  Identities=44%  Similarity=0.615  Sum_probs=21.0

Q ss_pred             ccccccccccCChHHHHHHHHhccCCCC
Q psy5202          92 KECEFCGKKFKYENTLQAHRRNHTGEKP  119 (343)
Q Consensus        92 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~  119 (343)
                      ..|-+||+.|.   +|++|+.+|.+..|
T Consensus        77 IicLEDGkkfK---SLKRHL~t~~gmTP  101 (148)
T COG4957          77 IICLEDGKKFK---SLKRHLTTHYGLTP  101 (148)
T ss_pred             EEEeccCcchH---HHHHHHhcccCCCH
Confidence            57999999996   69999999988644


No 162
>KOG4124|consensus
Probab=61.62  E-value=2.8  Score=35.97  Aligned_cols=55  Identities=31%  Similarity=0.628  Sum_probs=41.0

Q ss_pred             CCCCcccccc--cccccCChHHHHHHHHh-cc------------------CCCCeecCCCCccCCCHHHHHHHHHH
Q psy5202          87 QEEKNKECEF--CGKKFKYENTLQAHRRN-HT------------------GEKPFKCTVCDHACSQSAKLKKHMKI  141 (343)
Q Consensus        87 ~~~~~~~C~~--C~~~f~~~~~L~~H~~~-h~------------------~~~~~~C~~C~~~f~~~~~L~~H~~~  141 (343)
                      ...++|+|++  |++.+.....|..|... |.                  ..|+|+|++|.+.++....|.-|+..
T Consensus       345 ~~~~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i~~~s~~~~ph~~~~~~nk~~r~~i~~~~~k~~~~l~~~~~~  420 (442)
T KOG4124|consen  345 VVDKPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPITTPTPAPIPHQGFVVENKPYRCEVCSKRYKNLNGLKYHRTH  420 (442)
T ss_pred             EecCCCCCCCCcchhhcccCcceeeccccCcCCCCCCCCCCCCCcceeeeccCcccChhhhhhhccCCCCCceeeh
Confidence            3567899976  99999988888888543 32                  13789999999988877666666543


No 163
>COG1571 Predicted DNA-binding protein containing a Zn-ribbon domain [General function prediction only]
Probab=61.13  E-value=5  Score=35.96  Aligned_cols=30  Identities=20%  Similarity=0.513  Sum_probs=20.8

Q ss_pred             ecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChH
Q psy5202         215 KCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPS  257 (343)
Q Consensus       215 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~  257 (343)
                      .|+.||.+..+.             |..-|+|+.||+++....
T Consensus       352 ~Cp~Cg~~m~S~-------------G~~g~rC~kCg~~~~~~~  381 (421)
T COG1571         352 VCPRCGGRMKSA-------------GRNGFRCKKCGTRARETL  381 (421)
T ss_pred             CCCccCCchhhc-------------CCCCcccccccccCCccc
Confidence            588888655442             344788999998876653


No 164
>PF05290 Baculo_IE-1:  Baculovirus immediate-early protein (IE-0);  InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=61.03  E-value=8  Score=28.38  Aligned_cols=55  Identities=22%  Similarity=0.500  Sum_probs=35.0

Q ss_pred             CCcccccCcccccCHHHHHHHHHHccCCCceec---CcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChHH
Q psy5202         184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKC---ELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPST  258 (343)
Q Consensus       184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C---~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~~  258 (343)
                      +.|.|.+|..+.....-|          +|-.|   .+|+.-   -..|+.|-.+|.       .||.|.-+|++...
T Consensus        79 ~lYeCnIC~etS~ee~FL----------KPneCCgY~iCn~C---ya~LWK~~~~yp-------vCPvCkTSFKss~~  136 (140)
T PF05290_consen   79 KLYECNICKETSAEERFL----------KPNECCGYSICNAC---YANLWKFCNLYP-------VCPVCKTSFKSSSS  136 (140)
T ss_pred             CceeccCcccccchhhcC----------CcccccchHHHHHH---HHHHHHHcccCC-------CCCccccccccccc
Confidence            678888888877654322          44444   123322   256777766663       69999999987643


No 165
>TIGR00100 hypA hydrogenase nickel insertion protein HypA. In Hpylori, hypA mutant abolished hydrogenase activity and decrease in urease activity. Nickel supplementation in media restored urease activity and partial hydrogenase activity. HypA probably involved in inserting Ni in enzymes.
Probab=59.94  E-value=5.8  Score=28.71  Aligned_cols=12  Identities=33%  Similarity=0.659  Sum_probs=6.6

Q ss_pred             eecCcCcccccC
Q psy5202         214 YKCELCSYACAQ  225 (343)
Q Consensus       214 ~~C~~C~~~f~~  225 (343)
                      +.|..|+..|..
T Consensus        71 ~~C~~Cg~~~~~   82 (115)
T TIGR00100        71 CECEDCSEEVSP   82 (115)
T ss_pred             EEcccCCCEEec
Confidence            556666655543


No 166
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=57.93  E-value=4.2  Score=27.72  Aligned_cols=13  Identities=23%  Similarity=0.861  Sum_probs=6.9

Q ss_pred             cccccCCCCCCCC
Q psy5202         242 DVYRCRFCEMPFS  254 (343)
Q Consensus       242 ~~~~C~~C~~~f~  254 (343)
                      ..+.|..|++.|.
T Consensus        53 GIW~C~~C~~~~A   65 (90)
T PTZ00255         53 GIWRCKGCKKTVA   65 (90)
T ss_pred             EEEEcCCCCCEEe
Confidence            3455555555553


No 167
>PF01155 HypA:  Hydrogenase expression/synthesis hypA family;  InterPro: IPR000688 Bacterial membrane-bound nickel-dependent hydrogenases requires a number of accessory proteins which are involved in their maturation. The exact role of these proteins is not yet clear, but some seem to be required for the incorporation of the nickel ions []. One of these proteins is generally known as hypA. It is a protein of about 12 to 14 kDa that contains, in its C-terminal region, four conserved cysteines that form a zinc-finger like motif. Escherichia coli has two proteins that belong to this family, hypA and hybF. A homologue, MJ0214, has also been found in a number of archaeal species, including the genome of Methanocaldococcus jannaschii (Methanococcus jannaschii).; GO: 0016151 nickel ion binding, 0006464 protein modification process; PDB: 2KDX_A 3A44_D 3A43_B.
Probab=57.75  E-value=3.4  Score=29.82  Aligned_cols=12  Identities=17%  Similarity=0.476  Sum_probs=7.3

Q ss_pred             eecCcCcccccC
Q psy5202         214 YKCELCSYACAQ  225 (343)
Q Consensus       214 ~~C~~C~~~f~~  225 (343)
                      ++|..|++.|..
T Consensus        71 ~~C~~Cg~~~~~   82 (113)
T PF01155_consen   71 ARCRDCGHEFEP   82 (113)
T ss_dssp             EEETTTS-EEEC
T ss_pred             EECCCCCCEEec
Confidence            567777766654


No 168
>COG1655 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=57.50  E-value=1.8  Score=34.84  Aligned_cols=39  Identities=28%  Similarity=0.503  Sum_probs=25.8

Q ss_pred             CCcccccCcccccCHHHHHHHHHHccCCC---------------ceecCcCccc
Q psy5202         184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEK---------------PYKCELCSYA  222 (343)
Q Consensus       184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~---------------~~~C~~C~~~  222 (343)
                      +.+.|++|+..|....-+..-.|+-.|+-               ...||.|+++
T Consensus        18 k~ieCPvC~tkFkkeev~tgsiRiiagDld~~lkygninP~fY~VvvCP~C~yA   71 (267)
T COG1655          18 KTIECPVCNTKFKKEEVKTGSIRIIAGDLDFFLKYGNINPYFYDVVVCPICYYA   71 (267)
T ss_pred             ceeccCcccchhhhhheeccceeEecccccceeeccccCCceeEEEEcchhhHH
Confidence            67889999999987655444444433321               2579999864


No 169
>PRK00564 hypA hydrogenase nickel incorporation protein; Provisional
Probab=57.44  E-value=6.6  Score=28.51  Aligned_cols=11  Identities=27%  Similarity=0.615  Sum_probs=5.7

Q ss_pred             eecCcCccccc
Q psy5202         214 YKCELCSYACA  224 (343)
Q Consensus       214 ~~C~~C~~~f~  224 (343)
                      +.|..|+..|.
T Consensus        72 ~~C~~Cg~~~~   82 (117)
T PRK00564         72 LECKDCSHVFK   82 (117)
T ss_pred             EEhhhCCCccc
Confidence            45555554443


No 170
>PF01780 Ribosomal_L37ae:  Ribosomal L37ae protein family;  InterPro: IPR002674 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits.  Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This ribosomal protein is found in archaebacteria and eukaryotes []. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A1E_Y 4A17_Y 4A1C_Y 4A1A_Y 3O58_g 3IZS_m 3O5H_g 1S1I_9 3IZR_m 1YSH_D ....
Probab=57.11  E-value=3.1  Score=28.34  Aligned_cols=11  Identities=36%  Similarity=0.875  Sum_probs=5.7

Q ss_pred             eecCcCccccc
Q psy5202         214 YKCELCSYACA  224 (343)
Q Consensus       214 ~~C~~C~~~f~  224 (343)
                      |.|..|++.|.
T Consensus        54 W~C~~C~~~~A   64 (90)
T PF01780_consen   54 WKCKKCGKKFA   64 (90)
T ss_dssp             EEETTTTEEEE
T ss_pred             eecCCCCCEEe
Confidence            55555555543


No 171
>PRK14873 primosome assembly protein PriA; Provisional
Probab=56.92  E-value=3.6  Score=39.83  Aligned_cols=33  Identities=18%  Similarity=0.406  Sum_probs=19.2

Q ss_pred             ccccccccccCCh---HHHHHHHHhccCCCCeecCCCCcc
Q psy5202          92 KECEFCGKKFKYE---NTLQAHRRNHTGEKPFKCTVCDHA  128 (343)
Q Consensus        92 ~~C~~C~~~f~~~---~~L~~H~~~h~~~~~~~C~~C~~~  128 (343)
                      ..|..||..+...   ..|..    |.......|.+||+.
T Consensus       384 l~C~~Cg~~~~C~~C~~~L~~----h~~~~~l~Ch~CG~~  419 (665)
T PRK14873        384 LACARCRTPARCRHCTGPLGL----PSAGGTPRCRWCGRA  419 (665)
T ss_pred             eEhhhCcCeeECCCCCCceeE----ecCCCeeECCCCcCC
Confidence            3677777766532   22333    334456788888863


No 172
>PF09963 DUF2197:  Uncharacterized protein conserved in bacteria (DUF2197);  InterPro: IPR019241  This family represents various hypothetical bacterial proteins with no known function. 
Probab=56.19  E-value=6.9  Score=24.00  Aligned_cols=36  Identities=31%  Similarity=0.566  Sum_probs=22.4

Q ss_pred             ccccccccccC-ChHHHHH-HHHhccCCCCeecCCCCcc
Q psy5202          92 KECEFCGKKFK-YENTLQA-HRRNHTGEKPFKCTVCDHA  128 (343)
Q Consensus        92 ~~C~~C~~~f~-~~~~L~~-H~~~h~~~~~~~C~~C~~~  128 (343)
                      -+|-.|++.+. ....+.. -++. .....|.|+.|...
T Consensus         3 vkC~lCdk~~~Id~~~~~aKrLrn-rPi~tYmC~eC~~R   40 (56)
T PF09963_consen    3 VKCILCDKKEEIDEDTPEAKRLRN-RPIHTYMCDECKER   40 (56)
T ss_pred             eEEEecCCEEEeccCCHHHHHhhc-CCCcceeChhHHHH
Confidence            47999998876 2333333 2333 35567999999643


No 173
>TIGR00280 L37a ribosomal protein L37a. This model finds eukaryotic ribosomal protein L37a and its archaeal orthologs. The nomeclature is tricky because eukaryotes have proteins called both L37 and L37a.
Probab=55.45  E-value=4.1  Score=27.81  Aligned_cols=13  Identities=31%  Similarity=0.961  Sum_probs=7.1

Q ss_pred             cccccCCCCCCCC
Q psy5202         242 DVYRCRFCEMPFS  254 (343)
Q Consensus       242 ~~~~C~~C~~~f~  254 (343)
                      ..+.|..|++.|.
T Consensus        52 GIW~C~~C~~~~A   64 (91)
T TIGR00280        52 GIWTCRKCGAKFA   64 (91)
T ss_pred             EEEEcCCCCCEEe
Confidence            3455666655553


No 174
>PF13453 zf-TFIIB:  Transcription factor zinc-finger
Probab=55.41  E-value=6.5  Score=22.32  Aligned_cols=16  Identities=19%  Similarity=0.330  Sum_probs=7.2

Q ss_pred             eecCcCcccccChHHH
Q psy5202         214 YKCELCSYACAQSSKL  229 (343)
Q Consensus       214 ~~C~~C~~~f~~~~~L  229 (343)
                      +.|+.|+-.+-....|
T Consensus        20 d~C~~C~G~W~d~~el   35 (41)
T PF13453_consen   20 DVCPSCGGIWFDAGEL   35 (41)
T ss_pred             EECCCCCeEEccHHHH
Confidence            4444444444444433


No 175
>PF01363 FYVE:  FYVE zinc finger;  InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=55.33  E-value=5.5  Score=25.60  Aligned_cols=9  Identities=56%  Similarity=1.390  Sum_probs=2.6

Q ss_pred             ccccCcccc
Q psy5202         187 TCEFCGKVF  195 (343)
Q Consensus       187 ~C~~C~~~f  195 (343)
                      .|..|++.|
T Consensus        11 ~C~~C~~~F   19 (69)
T PF01363_consen   11 NCMICGKKF   19 (69)
T ss_dssp             B-TTT--B-
T ss_pred             cCcCcCCcC
Confidence            455555555


No 176
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=55.09  E-value=5.8  Score=29.88  Aligned_cols=11  Identities=18%  Similarity=0.676  Sum_probs=5.2

Q ss_pred             CcccccCccccc
Q psy5202         185 NDTCEFCGKVFK  196 (343)
Q Consensus       185 ~~~C~~C~~~f~  196 (343)
                      .|.|. |+..|.
T Consensus       117 ~Y~C~-C~q~~l  127 (156)
T COG3091         117 PYRCQ-CQQHYL  127 (156)
T ss_pred             eEEee-cCCccc
Confidence            44455 554443


No 177
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=53.82  E-value=6.4  Score=29.65  Aligned_cols=36  Identities=19%  Similarity=0.600  Sum_probs=25.7

Q ss_pred             CCCcccccccccccCChHHHHHHHHhccCCCCeecCCCCcc
Q psy5202          88 EEKNKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHA  128 (343)
Q Consensus        88 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~  128 (343)
                      ..-+|.|. |+..|.+   .++|-..-.|+ .|.|..|+..
T Consensus       114 ~~~~Y~C~-C~q~~l~---~RRhn~~~~g~-~YrC~~C~gk  149 (156)
T COG3091         114 TTYPYRCQ-CQQHYLR---IRRHNTVRRGE-VYRCGKCGGK  149 (156)
T ss_pred             cceeEEee-cCCccch---hhhcccccccc-eEEeccCCce
Confidence            34579999 9999765   34554444455 7999999864


No 178
>PF13451 zf-trcl:  Probable zinc-binding domain
Probab=53.76  E-value=8  Score=23.03  Aligned_cols=10  Identities=50%  Similarity=0.982  Sum_probs=4.3

Q ss_pred             cccccCcccc
Q psy5202         186 DTCEFCGKVF  195 (343)
Q Consensus       186 ~~C~~C~~~f  195 (343)
                      +.|..||..|
T Consensus         5 l~C~dCg~~F   14 (49)
T PF13451_consen    5 LTCKDCGAEF   14 (49)
T ss_pred             EEcccCCCeE
Confidence            3444444443


No 179
>PF07282 OrfB_Zn_ribbon:  Putative transposase DNA-binding domain;  InterPro: IPR010095 This entry represents a region of a sequence similarity between a family of putative transposases of Thermoanaerobacter tengcongensis, smaller related proteins from Bacillus anthracis, putative transposes described by IPR001959 from INTERPRO, and other proteins. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=53.72  E-value=9.5  Score=24.50  Aligned_cols=27  Identities=26%  Similarity=0.717  Sum_probs=14.2

Q ss_pred             ccccCcccccCHHHHHHHHHHccCCCceecCcCcccc
Q psy5202         187 TCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYAC  223 (343)
Q Consensus       187 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f  223 (343)
                      .|+.||.....          -...+.|.|+.||..+
T Consensus        30 ~C~~CG~~~~~----------~~~~r~~~C~~Cg~~~   56 (69)
T PF07282_consen   30 TCPRCGHRNKK----------RRSGRVFTCPNCGFEM   56 (69)
T ss_pred             CccCccccccc----------ccccceEEcCCCCCEE
Confidence            46666655443          1223446666666554


No 180
>PRK00432 30S ribosomal protein S27ae; Validated
Probab=53.32  E-value=7.3  Score=23.36  Aligned_cols=12  Identities=25%  Similarity=0.839  Sum_probs=8.1

Q ss_pred             CceecCcCcccc
Q psy5202         212 KPYKCELCSYAC  223 (343)
Q Consensus       212 ~~~~C~~C~~~f  223 (343)
                      ..|.|..||+.+
T Consensus        36 ~r~~C~~Cgyt~   47 (50)
T PRK00432         36 DRWHCGKCGYTE   47 (50)
T ss_pred             CcEECCCcCCEE
Confidence            457777777654


No 181
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=52.94  E-value=6.8  Score=33.16  Aligned_cols=48  Identities=15%  Similarity=0.325  Sum_probs=31.4

Q ss_pred             ecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChHHHHHHHhhh
Q psy5202         215 KCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPSTLEKHMRKC  266 (343)
Q Consensus       215 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~  266 (343)
                      .|-.|.-.|.....   |-.-.. +....|+|+.|...|-...+...|..-|
T Consensus       364 ~Cf~CQ~~fp~~~~---~~~~~~-~ss~rY~Ce~CK~~FC~dCdvfiHe~Lh  411 (421)
T COG5151         364 HCFVCQGPFPKPPV---SPFDES-TSSGRYQCELCKSTFCSDCDVFIHETLH  411 (421)
T ss_pred             cceeccCCCCCCCC---Cccccc-ccccceechhhhhhhhhhhHHHHHHHHh
Confidence            37777776654321   110000 1245699999999999999999997654


No 182
>PF15135 UPF0515:  Uncharacterised protein UPF0515
Probab=52.15  E-value=6.5  Score=32.20  Aligned_cols=14  Identities=14%  Similarity=0.479  Sum_probs=8.2

Q ss_pred             CceecCcCcccccC
Q psy5202         212 KPYKCELCSYACAQ  225 (343)
Q Consensus       212 ~~~~C~~C~~~f~~  225 (343)
                      ..|.|+.|+..|..
T Consensus       154 aef~C~~C~h~F~G  167 (278)
T PF15135_consen  154 AEFHCPKCRHNFRG  167 (278)
T ss_pred             eeeecccccccchh
Confidence            34666666666653


No 183
>PRK03681 hypA hydrogenase nickel incorporation protein; Validated
Probab=52.07  E-value=9.1  Score=27.66  Aligned_cols=10  Identities=30%  Similarity=0.587  Sum_probs=5.0

Q ss_pred             eecCcCcccc
Q psy5202         214 YKCELCSYAC  223 (343)
Q Consensus       214 ~~C~~C~~~f  223 (343)
                      +.|..|+..|
T Consensus        71 ~~C~~Cg~~~   80 (114)
T PRK03681         71 CWCETCQQYV   80 (114)
T ss_pred             EEcccCCCee
Confidence            4555555433


No 184
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=51.62  E-value=7.8  Score=28.43  Aligned_cols=26  Identities=38%  Similarity=0.485  Sum_probs=19.0

Q ss_pred             CcccccCcccccCHHHHHHHHHHccCCCc
Q psy5202         185 NDTCEFCGKVFKNCSNLTVHRRSHTGEKP  213 (343)
Q Consensus       185 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~  213 (343)
                      -..|-.+|+.|.+   |++|+.+|.|-.|
T Consensus        76 ~IicLEDGkkfKS---LKRHL~t~~gmTP  101 (148)
T COG4957          76 YIICLEDGKKFKS---LKRHLTTHYGLTP  101 (148)
T ss_pred             eEEEeccCcchHH---HHHHHhcccCCCH
Confidence            4568888888864   7888888877543


No 185
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=51.21  E-value=5.8  Score=37.15  Aligned_cols=35  Identities=23%  Similarity=0.464  Sum_probs=19.3

Q ss_pred             cccccccccCChHHHHHHHHhccCCCCeecCCCCcc
Q psy5202          93 ECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHA  128 (343)
Q Consensus        93 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~  128 (343)
                      .|..||....... -...+..|.......|.+||+.
T Consensus       215 ~C~~Cg~~~~C~~-C~~~l~~h~~~~~l~Ch~Cg~~  249 (505)
T TIGR00595       215 LCRSCGYILCCPN-CDVSLTYHKKEGKLRCHYCGYQ  249 (505)
T ss_pred             EhhhCcCccCCCC-CCCceEEecCCCeEEcCCCcCc
Confidence            5777776655321 1112334445556788888864


No 186
>COG4896 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=51.16  E-value=12  Score=23.24  Aligned_cols=40  Identities=18%  Similarity=0.470  Sum_probs=25.6

Q ss_pred             ccccccccc-cCChHHHHHHHHhccCCCCeecCCCCccCCC
Q psy5202          92 KECEFCGKK-FKYENTLQAHRRNHTGEKPFKCTVCDHACSQ  131 (343)
Q Consensus        92 ~~C~~C~~~-f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~  131 (343)
                      .+|-+|++. |-....+..-...-...+.|.|+.|......
T Consensus         3 vkCiiCd~v~~iD~rt~~tKrLrN~PIrtymC~eC~~Rva~   43 (68)
T COG4896           3 VKCIICDRVDEIDNRTFKTKRLRNKPIRTYMCPECEHRVAI   43 (68)
T ss_pred             ceEEEecceeeecchhHHHHHhhCCCceeEechhhHhhhch
Confidence            478999866 4455555554444445567999999755433


No 187
>KOG3408|consensus
Probab=50.90  E-value=9.9  Score=27.33  Aligned_cols=26  Identities=27%  Similarity=0.506  Sum_probs=21.0

Q ss_pred             CCCceecCcCcccccChHHHHHHHHH
Q psy5202         210 GEKPYKCELCSYACAQSSKLTRHMKT  235 (343)
Q Consensus       210 ~~~~~~C~~C~~~f~~~~~L~~H~~~  235 (343)
                      |...|.|-.|.+-|.+...|..|.+.
T Consensus        54 G~GqfyCi~CaRyFi~~~~l~~H~kt   79 (129)
T KOG3408|consen   54 GGGQFYCIECARYFIDAKALKTHFKT   79 (129)
T ss_pred             CCceeehhhhhhhhcchHHHHHHHhc
Confidence            44558898999999999999888765


No 188
>COG1675 TFA1 Transcription initiation factor IIE, alpha subunit [Transcription]
Probab=50.49  E-value=24  Score=27.72  Aligned_cols=34  Identities=18%  Similarity=0.379  Sum_probs=22.1

Q ss_pred             ccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCC
Q psy5202         208 HTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMP  252 (343)
Q Consensus       208 h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~  252 (343)
                      -....-|.|+.|...|+.-.++.           .-|.|+.||-.
T Consensus       108 ~~~~~~y~C~~~~~r~sfdeA~~-----------~~F~Cp~Cg~~  141 (176)
T COG1675         108 ETENNYYVCPNCHVKYSFDEAME-----------LGFTCPKCGED  141 (176)
T ss_pred             hccCCceeCCCCCCcccHHHHHH-----------hCCCCCCCCch
Confidence            34556688888887776654432           12888888854


No 189
>cd00065 FYVE FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;
Probab=50.04  E-value=12  Score=22.82  Aligned_cols=9  Identities=44%  Similarity=1.003  Sum_probs=4.0

Q ss_pred             ccCCCCCCC
Q psy5202         245 RCRFCEMPF  253 (343)
Q Consensus       245 ~C~~C~~~f  253 (343)
                      .|..||+.|
T Consensus        20 ~Cr~Cg~~~   28 (57)
T cd00065          20 HCRNCGRIF   28 (57)
T ss_pred             ccCcCcCCc
Confidence            344444444


No 190
>smart00440 ZnF_C2C2 C2C2 Zinc finger. Nucleic-acid-binding motif in transcriptional elongation factor TFIIS and RNA polymerases.
Probab=49.93  E-value=1.7  Score=24.62  Aligned_cols=10  Identities=30%  Similarity=0.863  Sum_probs=4.8

Q ss_pred             cccCCCCCCC
Q psy5202         244 YRCRFCEMPF  253 (343)
Q Consensus       244 ~~C~~C~~~f  253 (343)
                      |.|..|+..|
T Consensus        29 y~C~~C~~~w   38 (40)
T smart00440       29 YVCTKCGHRW   38 (40)
T ss_pred             EEeCCCCCEe
Confidence            4455555443


No 191
>KOG1842|consensus
Probab=49.37  E-value=12  Score=33.57  Aligned_cols=22  Identities=18%  Similarity=0.241  Sum_probs=12.1

Q ss_pred             ccccccccccCChHHHHHHHHh
Q psy5202          92 KECEFCGKKFKYENTLQAHRRN  113 (343)
Q Consensus        92 ~~C~~C~~~f~~~~~L~~H~~~  113 (343)
                      |.|++|...|.....|..|+..
T Consensus        16 flCPiC~~dl~~~~~L~~H~d~   37 (505)
T KOG1842|consen   16 FLCPICLLDLPNLSALNDHLDV   37 (505)
T ss_pred             ccCchHhhhhhhHHHHHHHHhh
Confidence            5555555555555555555543


No 192
>PF13821 DUF4187:  Domain of unknown function (DUF4187)
Probab=48.88  E-value=20  Score=22.02  Aligned_cols=18  Identities=22%  Similarity=0.573  Sum_probs=11.0

Q ss_pred             cccCCCCCCCCChHHHHH
Q psy5202         244 YRCRFCEMPFSVPSTLEK  261 (343)
Q Consensus       244 ~~C~~C~~~f~~~~~l~~  261 (343)
                      +.|-.||..|.+...|..
T Consensus        28 ~YC~~Cg~~Y~d~~dL~~   45 (55)
T PF13821_consen   28 NYCFWCGTKYDDEEDLER   45 (55)
T ss_pred             ceeeeeCCccCCHHHHHh
Confidence            456666666666666553


No 193
>KOG1280|consensus
Probab=48.65  E-value=19  Score=31.14  Aligned_cols=28  Identities=14%  Similarity=0.394  Sum_probs=22.4

Q ss_pred             CCeecCCCCccCCCHHHHHHHHHHhccC
Q psy5202         118 KPFKCTVCDHACSQSAKLKKHMKIHDED  145 (343)
Q Consensus       118 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~  145 (343)
                      ..|.|++|+..-.+...|..|....|.+
T Consensus        78 qSftCPyC~~~Gfte~~f~~Hv~s~Hpd  105 (381)
T KOG1280|consen   78 QSFTCPYCGIMGFTERQFGTHVLSQHPE  105 (381)
T ss_pred             ccccCCcccccccchhHHHHHhhhcCcc
Confidence            3588888888888888888888877765


No 194
>PF06220 zf-U1:  U1 zinc finger;  InterPro: IPR013085 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  C2H2-type (classical) zinc fingers (Znf) were the first class to be characterised. They contain a short beta hairpin and an alpha helix (beta/beta/alpha structure), where a single zinc atom is held in place by Cys(2)His(2) (C2H2) residues in a tetrahedral array. C2H2 Znf's can be divided into three groups based on the number and pattern of fingers: triple-C2H2 (binds single ligand), multiple-adjacent-C2H2 (binds multiple ligands), and separated paired-C2H2 []. C2H2 Znf's are the most common DNA-binding motifs found in eukaryotic transcription factors, and have also been identified in prokaryotes []. Transcription factors usually contain several Znf's (each with a conserved beta/beta/alpha structure) capable of making multiple contacts along the DNA, where the C2H2 Znf motifs recognise DNA sequences by binding to the major groove of DNA via a short alpha-helix in the Znf, the Znf spanning 3-4 bases of the DNA []. C2H2 Znf's can also bind to RNA and protein targets []. This entry represents a C2H2-type zinc finger motif found in several U1 small nuclear ribonucleoprotein C (U1-C) proteins. Some proteins contain multiple copies of this motif. The U1 small nuclear ribonucleoprotein (U1 snRNP) binds to the pre-mRNA 5' splice site at early stages of spliceosome assembly. Recruitment of U1 to a class of weak 5' splice site is promoted by binding of the protein TIA-1 to uridine-rich sequences immediately downstream from the 5' splice site. Binding of TIA-1 in the vicinity of a 5' splice site helps to stabilise U1 snRNP recruitment, at least in part, via a direct interaction with U1-C, thus providing one molecular mechanism for the function of this splicing regulator []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 2VRD_A.
Probab=48.40  E-value=16  Score=20.46  Aligned_cols=22  Identities=27%  Similarity=0.611  Sum_probs=8.9

Q ss_pred             ccccCCCCCCCC-Ch-HHHHHHHh
Q psy5202         243 VYRCRFCEMPFS-VP-STLEKHMR  264 (343)
Q Consensus       243 ~~~C~~C~~~f~-~~-~~l~~H~~  264 (343)
                      .|-|.+|...|. .. ..-+.|.+
T Consensus         3 ryyCdyC~~~~~~d~~~~Rk~H~~   26 (38)
T PF06220_consen    3 RYYCDYCKKYLTHDSPSIRKQHER   26 (38)
T ss_dssp             S-B-TTT--B-S--SHHHHHHHT-
T ss_pred             CeecccccceecCCChHHHHHhhc
Confidence            477888888883 33 33356654


No 195
>KOG2907|consensus
Probab=48.13  E-value=9.2  Score=27.11  Aligned_cols=11  Identities=36%  Similarity=1.077  Sum_probs=7.4

Q ss_pred             eecCcCccccc
Q psy5202         214 YKCELCSYACA  224 (343)
Q Consensus       214 ~~C~~C~~~f~  224 (343)
                      |.|+.|++.|.
T Consensus       103 YTC~kC~~k~~  113 (116)
T KOG2907|consen  103 YTCPKCKYKFT  113 (116)
T ss_pred             EEcCccceeee
Confidence            66777776664


No 196
>COG4888 Uncharacterized Zn ribbon-containing protein [General function prediction only]
Probab=48.02  E-value=5.6  Score=27.49  Aligned_cols=13  Identities=23%  Similarity=0.626  Sum_probs=8.0

Q ss_pred             cccccCCCCCCCC
Q psy5202         242 DVYRCRFCEMPFS  254 (343)
Q Consensus       242 ~~~~C~~C~~~f~  254 (343)
                      ....|..||.+|.
T Consensus        45 g~~~Cg~CGls~e   57 (104)
T COG4888          45 GTAVCGNCGLSFE   57 (104)
T ss_pred             eEEEcccCcceEE
Confidence            3456777776664


No 197
>KOG4124|consensus
Probab=47.91  E-value=2  Score=36.79  Aligned_cols=53  Identities=17%  Similarity=0.086  Sum_probs=35.0

Q ss_pred             CCCCcccccc--cccccCChHHHHHH-----HHhccCCCCeecCCCCccCCCHHHHHHHHH
Q psy5202          87 QEEKNKECEF--CGKKFKYENTLQAH-----RRNHTGEKPFKCTVCDHACSQSAKLKKHMK  140 (343)
Q Consensus        87 ~~~~~~~C~~--C~~~f~~~~~L~~H-----~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~  140 (343)
                      .+|+++.|..  |.+.+........|     +..-...++|+|+ |++.+.++..|+.|-.
T Consensus       174 ~EE~~~S~~vp~~~~~~~~~Ns~~~~S~~~~~T~~t~~~p~k~~-~~~~~~T~~~l~~HS~  233 (442)
T KOG4124|consen  174 PEEYRVSVVVPAAAAAAAAANSSDMSSDEASSTAETTGTPKKMP-ESLVMDTSSPLSDHSM  233 (442)
T ss_pred             CccccccccCchhhhhhhccccccccccccccccccccCCccCc-ccccccccchhhhccc
Confidence            3778888876  76666544444333     2233345788885 8888888888888853


No 198
>COG1571 Predicted DNA-binding protein containing a Zn-ribbon domain [General function prediction only]
Probab=47.73  E-value=10  Score=34.03  Aligned_cols=34  Identities=24%  Similarity=0.617  Sum_probs=25.0

Q ss_pred             CcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHH
Q psy5202         185 NDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKL  229 (343)
Q Consensus       185 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L  229 (343)
                      .-.|+.||....+.           |..-|+|+.||+.+......
T Consensus       350 ~p~Cp~Cg~~m~S~-----------G~~g~rC~kCg~~~~~~~~~  383 (421)
T COG1571         350 NPVCPRCGGRMKSA-----------GRNGFRCKKCGTRARETLIK  383 (421)
T ss_pred             CCCCCccCCchhhc-----------CCCCcccccccccCCccccc
Confidence            34799999876543           34479999999998765443


No 199
>PRK03976 rpl37ae 50S ribosomal protein L37Ae; Reviewed
Probab=47.60  E-value=6.1  Score=26.96  Aligned_cols=13  Identities=31%  Similarity=0.976  Sum_probs=6.9

Q ss_pred             cccccCCCCCCCC
Q psy5202         242 DVYRCRFCEMPFS  254 (343)
Q Consensus       242 ~~~~C~~C~~~f~  254 (343)
                      ..+.|..|++.|.
T Consensus        53 GIW~C~~C~~~~A   65 (90)
T PRK03976         53 GIWECRKCGAKFA   65 (90)
T ss_pred             EEEEcCCCCCEEe
Confidence            3455555555553


No 200
>smart00661 RPOL9 RNA polymerase subunit 9.
Probab=47.59  E-value=12  Score=22.34  Aligned_cols=10  Identities=30%  Similarity=0.800  Sum_probs=5.1

Q ss_pred             ccccCCCCCC
Q psy5202         243 VYRCRFCEMP  252 (343)
Q Consensus       243 ~~~C~~C~~~  252 (343)
                      .|.|+.||+.
T Consensus        20 ~~vC~~Cg~~   29 (52)
T smart00661       20 RFVCRKCGYE   29 (52)
T ss_pred             EEECCcCCCe
Confidence            4555555543


No 201
>PF08271 TF_Zn_Ribbon:  TFIIB zinc-binding;  InterPro: IPR013137 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents a zinc finger motif found in transcription factor IIB (TFIIB). In eukaryotes the initiation of transcription of protein encoding genes by the polymerase II complexe (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least seven different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, and -IIH [].  TFIIB and TFIID are responsible for promoter recognition and interaction with pol II; together with Pol II, they form a minimal initiation complex capable of transcription under certain conditions. The TATA box of a Pol II promoter is bound in the initiation complex by the TBP subunit of TFIID, which bends the DNA around the C-terminal domain of TFIIB whereas the N-terminal zinc finger of TFIIB interacts with Pol II [, ]. The TFIIB zinc finger adopts a zinc ribbon fold characterised by two beta-hairpins forming two structurally similar zinc-binding sub-sites []. The zinc finger contacts the rbp1 subunit of Pol II through its dock domain, a conserved region of about 70 amino acids located close to the polymerase active site []. In the Pol II complex this surface is located near the RNA exit groove. Interestingly this sequence is best conserved in the three polymerases that utilise a TFIIB-like general transcription factor (Pol II, Pol III, and archaeal RNA polymerase) but not in Pol I [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1VD4_A 1PFT_A 3K1F_M 3K7A_M 1RO4_A 1RLY_A 1DL6_A.
Probab=47.31  E-value=9.5  Score=21.89  Aligned_cols=8  Identities=25%  Similarity=0.791  Sum_probs=3.3

Q ss_pred             cccCCCCC
Q psy5202         244 YRCRFCEM  251 (343)
Q Consensus       244 ~~C~~C~~  251 (343)
                      +.|..||.
T Consensus        20 ~vC~~CG~   27 (43)
T PF08271_consen   20 LVCPNCGL   27 (43)
T ss_dssp             EEETTT-B
T ss_pred             EECCCCCC
Confidence            44555543


No 202
>PRK12496 hypothetical protein; Provisional
Probab=47.13  E-value=14  Score=28.76  Aligned_cols=11  Identities=18%  Similarity=0.407  Sum_probs=6.9

Q ss_pred             eecCcCccccc
Q psy5202         214 YKCELCSYACA  224 (343)
Q Consensus       214 ~~C~~C~~~f~  224 (343)
                      |+|.-|++.|.
T Consensus       128 ~~C~gC~~~~~  138 (164)
T PRK12496        128 KVCKGCKKKYP  138 (164)
T ss_pred             EECCCCCcccc
Confidence            56666666663


No 203
>COG1773 Rubredoxin [Energy production and conversion]
Probab=46.91  E-value=9.4  Score=23.31  Aligned_cols=40  Identities=20%  Similarity=0.395  Sum_probs=21.2

Q ss_pred             CceecCcCcccccChHHHHHHHHHh-c--CCCCcccccCCCCC
Q psy5202         212 KPYKCELCSYACAQSSKLTRHMKTH-G--RLGKDVYRCRFCEM  251 (343)
Q Consensus       212 ~~~~C~~C~~~f~~~~~L~~H~~~h-~--~~~~~~~~C~~C~~  251 (343)
                      +.|+|..||+.|.-...--++-..- +  ..-..-+.|+.||.
T Consensus         2 ~~~~C~~CG~vYd~e~Gdp~~gi~pgT~fedlPd~w~CP~Cg~   44 (55)
T COG1773           2 KRWRCSVCGYVYDPEKGDPRCGIAPGTPFEDLPDDWVCPECGV   44 (55)
T ss_pred             CceEecCCceEeccccCCccCCCCCCCchhhCCCccCCCCCCC
Confidence            3589999999986543222210000 0  00011368999985


No 204
>COG1998 RPS31 Ribosomal protein S27AE [Translation, ribosomal structure and biogenesis]
Probab=46.88  E-value=8.9  Score=22.67  Aligned_cols=10  Identities=30%  Similarity=1.056  Sum_probs=6.8

Q ss_pred             ceecCcCccc
Q psy5202         213 PYKCELCSYA  222 (343)
Q Consensus       213 ~~~C~~C~~~  222 (343)
                      .|.|..||++
T Consensus        37 R~~CGkCgyT   46 (51)
T COG1998          37 RWACGKCGYT   46 (51)
T ss_pred             eeEeccccce
Confidence            4677777764


No 205
>KOG2272|consensus
Probab=46.12  E-value=23  Score=29.19  Aligned_cols=44  Identities=18%  Similarity=0.358  Sum_probs=24.4

Q ss_pred             CCCCcccccccccccCChHHHHHHHHhccCCCCeecCCCCccCCCH
Q psy5202          87 QEEKNKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHACSQS  132 (343)
Q Consensus        87 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~  132 (343)
                      .+...|.|..|...... ..|.---...+ .-.|+|..|++...+-
T Consensus       133 ~~~g~YvC~KCh~~iD~-~~l~fr~d~yH-~yHFkCt~C~keL~sd  176 (332)
T KOG2272|consen  133 KGRGRYVCQKCHAHIDE-QPLTFRGDPYH-PYHFKCTTCGKELTSD  176 (332)
T ss_pred             cccceeehhhhhhhccc-ccccccCCCCC-ccceecccccccccch
Confidence            34457889988876654 22221111111 2347888888876554


No 206
>PF04780 DUF629:  Protein of unknown function (DUF629);  InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=45.39  E-value=15  Score=33.71  Aligned_cols=28  Identities=21%  Similarity=0.324  Sum_probs=19.2

Q ss_pred             ccccCCCCCCCCChHHHHHHHhhhcCcc
Q psy5202         243 VYRCRFCEMPFSVPSTLEKHMRKCVVNQ  270 (343)
Q Consensus       243 ~~~C~~C~~~f~~~~~l~~H~~~~~~~~  270 (343)
                      -+.|+.|.+.|.....+..|+...|...
T Consensus        57 FWiCp~CskkF~d~~~~~~H~~~eH~~~   84 (466)
T PF04780_consen   57 FWICPRCSKKFSDAESCLSHMEQEHPAG   84 (466)
T ss_pred             EeeCCcccceeCCHHHHHHHHHHhhhhh
Confidence            3567777777777777777777555443


No 207
>PF13824 zf-Mss51:  Zinc-finger of mitochondrial splicing suppressor 51
Probab=45.23  E-value=16  Score=22.33  Aligned_cols=12  Identities=25%  Similarity=0.697  Sum_probs=6.4

Q ss_pred             cccccCCCCCCC
Q psy5202         242 DVYRCRFCEMPF  253 (343)
Q Consensus       242 ~~~~C~~C~~~f  253 (343)
                      ..|.|+.||..+
T Consensus        13 v~~~Cp~cGipt   24 (55)
T PF13824_consen   13 VNFECPDCGIPT   24 (55)
T ss_pred             cCCcCCCCCCcC
Confidence            345566666543


No 208
>smart00154 ZnF_AN1 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis.
Probab=45.11  E-value=11  Score=21.18  Aligned_cols=14  Identities=29%  Similarity=0.717  Sum_probs=10.9

Q ss_pred             ccccCCCCCCCCCh
Q psy5202         243 VYRCRFCEMPFSVP  256 (343)
Q Consensus       243 ~~~C~~C~~~f~~~  256 (343)
                      +|.|..|++.|-..
T Consensus        12 ~f~C~~C~~~FC~~   25 (39)
T smart00154       12 GFKCRHCGNLFCGE   25 (39)
T ss_pred             CeECCccCCccccc
Confidence            78899999888543


No 209
>PF14311 DUF4379:  Domain of unknown function (DUF4379)
Probab=44.98  E-value=14  Score=22.52  Aligned_cols=14  Identities=29%  Similarity=0.534  Sum_probs=8.1

Q ss_pred             CcccccCcccccCH
Q psy5202         185 NDTCEFCGKVFKNC  198 (343)
Q Consensus       185 ~~~C~~C~~~f~~~  198 (343)
                      -++|+.||..|...
T Consensus        28 ~W~C~~Cgh~w~~~   41 (55)
T PF14311_consen   28 WWKCPKCGHEWKAS   41 (55)
T ss_pred             EEECCCCCCeeEcc
Confidence            35666666655544


No 210
>COG5216 Uncharacterized conserved protein [Function unknown]
Probab=44.32  E-value=12  Score=22.89  Aligned_cols=31  Identities=29%  Similarity=0.615  Sum_probs=15.8

Q ss_pred             CcccccCcccccCHHHHHHHHHHccCCCceecCcCcc
Q psy5202         185 NDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSY  221 (343)
Q Consensus       185 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~  221 (343)
                      .|.|+ ||.+|.-.-  .   -...|+..-+|+-|..
T Consensus        22 tyPCP-CGDRFeIsL--e---Dl~~GE~VArCPSCSL   52 (67)
T COG5216          22 TYPCP-CGDRFEISL--E---DLRNGEVVARCPSCSL   52 (67)
T ss_pred             EecCC-CCCEeEEEH--H---HhhCCceEEEcCCceE
Confidence            45565 666664321  1   1234566666666654


No 211
>KOG2636|consensus
Probab=44.30  E-value=17  Score=32.68  Aligned_cols=29  Identities=28%  Similarity=0.692  Sum_probs=23.5

Q ss_pred             HHccCCCceecCcCc-ccccChHHHHHHHH
Q psy5202         206 RSHTGEKPYKCELCS-YACAQSSKLTRHMK  234 (343)
Q Consensus       206 ~~h~~~~~~~C~~C~-~~f~~~~~L~~H~~  234 (343)
                      +.|.-...|.|.+|| +++.-+..+.+|..
T Consensus       394 KLHGL~~ey~CEICGNy~Y~GrkaF~RHF~  423 (497)
T KOG2636|consen  394 KLHGLDIEYNCEICGNYVYKGRKAFDRHFN  423 (497)
T ss_pred             hhcCCCcccceeeccCccccCcHHHHHHhH
Confidence            456667789999999 88899999999953


No 212
>PF03833 PolC_DP2:  DNA polymerase II large subunit DP2;  InterPro: IPR016033 DP2 is the large subunit of a two-subunit novel archaebacterial replicative DNA polymerase first characterised for Pyrococcus furiosus. The structure of DP2 appears to be organised as a ~950 residue component separated from a ~300 residue component by a ~150 residue intein. The other subunit, DP1, has sequence similarity to the eukaryotic DNA polymerase delta small subunit. This entry represents the N-terminal ~950 residue component of DP2.; GO: 0003887 DNA-directed DNA polymerase activity; PDB: 3O59_X.
Probab=43.92  E-value=7.6  Score=37.96  Aligned_cols=13  Identities=23%  Similarity=0.350  Sum_probs=0.0

Q ss_pred             ccCCCCCCCCChH
Q psy5202         245 RCRFCEMPFSVPS  257 (343)
Q Consensus       245 ~C~~C~~~f~~~~  257 (343)
                      .|+.|+.......
T Consensus       694 ~C~~C~~~~~~~~  706 (900)
T PF03833_consen  694 ECPKCGRETTSYS  706 (900)
T ss_dssp             -------------
T ss_pred             ccccccccCcccc
Confidence            5777776554443


No 213
>PLN03238 probable histone acetyltransferase MYST; Provisional
Probab=43.68  E-value=19  Score=30.58  Aligned_cols=27  Identities=26%  Similarity=0.662  Sum_probs=20.2

Q ss_pred             CcccccCCCCCCCCChHHHHHHHhhhc
Q psy5202         241 KDVYRCRFCEMPFSVPSTLEKHMRKCV  267 (343)
Q Consensus       241 ~~~~~C~~C~~~f~~~~~l~~H~~~~~  267 (343)
                      ...|.|+.|-+-|.+...|.+|+.+|.
T Consensus        46 ~~lyiCe~Clky~~~~~~l~~H~~~C~   72 (290)
T PLN03238         46 TKLYICEYCLKYMRKKKSLLRHLAKCD   72 (290)
T ss_pred             CeEEEcCCCcchhCCHHHHHHHHHhCC
Confidence            456778888888888888888877544


No 214
>COG1656 Uncharacterized conserved protein [Function unknown]
Probab=43.28  E-value=23  Score=27.31  Aligned_cols=14  Identities=36%  Similarity=0.648  Sum_probs=7.4

Q ss_pred             cccCCCCCCCCChH
Q psy5202         244 YRCRFCEMPFSVPS  257 (343)
Q Consensus       244 ~~C~~C~~~f~~~~  257 (343)
                      |.|+.||+.|---+
T Consensus       131 ~~C~~CgkiYW~Gs  144 (165)
T COG1656         131 YRCPKCGKIYWKGS  144 (165)
T ss_pred             eECCCCcccccCch
Confidence            45666665554443


No 215
>PRK04351 hypothetical protein; Provisional
Probab=42.68  E-value=11  Score=28.72  Aligned_cols=12  Identities=33%  Similarity=0.777  Sum_probs=6.5

Q ss_pred             cccccCCCCCCC
Q psy5202         242 DVYRCRFCEMPF  253 (343)
Q Consensus       242 ~~~~C~~C~~~f  253 (343)
                      ..|.|..|+-.+
T Consensus       131 ~~yrCg~C~g~L  142 (149)
T PRK04351        131 KRYRCGKCRGKL  142 (149)
T ss_pred             CcEEeCCCCcEe
Confidence            445666665444


No 216
>COG1779 C4-type Zn-finger protein [General function prediction only]
Probab=42.27  E-value=3.1  Score=32.85  Aligned_cols=8  Identities=38%  Similarity=1.012  Sum_probs=4.3

Q ss_pred             cccccCcc
Q psy5202         186 DTCEFCGK  193 (343)
Q Consensus       186 ~~C~~C~~  193 (343)
                      ..|+.||.
T Consensus        15 ~~CPvCg~   22 (201)
T COG1779          15 IDCPVCGG   22 (201)
T ss_pred             ecCCcccc
Confidence            34555555


No 217
>KOG3507|consensus
Probab=42.26  E-value=13  Score=22.88  Aligned_cols=12  Identities=25%  Similarity=0.772  Sum_probs=6.4

Q ss_pred             CceecCcCcccc
Q psy5202         212 KPYKCELCSYAC  223 (343)
Q Consensus       212 ~~~~C~~C~~~f  223 (343)
                      ..++|..||+..
T Consensus        36 D~irCReCG~RI   47 (62)
T KOG3507|consen   36 DVIRCRECGYRI   47 (62)
T ss_pred             CcEehhhcchHH
Confidence            345666666543


No 218
>smart00064 FYVE Protein present in Fab1, YOTB, Vac1, and EEA1. The FYVE zinc finger is named after four proteins where it was first found: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn2+ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. The FYVE finger is structurally related to the COG1594 RPB9 DNA-directed RNA polymerase, subunit M/Transcription elongation factor TFIIS [Transcription]
Probab=41.65  E-value=6.1  Score=28.48  Aligned_cols=40  Identities=23%  Similarity=0.502  Sum_probs=20.2

Q ss_pred             CcccccCcccccCHHHHHHHHHHccCCCceecCcCccccc
Q psy5202         185 NDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACA  224 (343)
Q Consensus       185 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~  224 (343)
                      .+.|+.||..=.....|+.-..--....-|.|..||+.|.
T Consensus        72 ~~~CpkCg~~ea~y~~~QtRsaDEp~T~Fy~C~~Cg~~wr  111 (113)
T COG1594          72 KEKCPKCGNKEAYYWQLQTRSADEPETRFYKCTRCGYRWR  111 (113)
T ss_pred             cccCCCCCCceeEEEeeehhccCCCceEEEEecccCCEee
Confidence            3678888855433322221111111123388888887765


No 220
>COG2331 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=40.70  E-value=5.7  Score=25.82  Aligned_cols=10  Identities=30%  Similarity=0.889  Sum_probs=5.0

Q ss_pred             eecCcCcccc
Q psy5202         214 YKCELCSYAC  223 (343)
Q Consensus       214 ~~C~~C~~~f  223 (343)
                      |+|..|+..|
T Consensus        13 Y~c~~cg~~~   22 (82)
T COG2331          13 YECTECGNRF   22 (82)
T ss_pred             EeecccchHH
Confidence            4555555443


No 221
>TIGR01562 FdhE formate dehydrogenase accessory protein FdhE. The only sequence scoring between trusted and noise is that from Aquifex aeolicus, which shows certain structural differences from the proteobacterial forms in the alignment. However it is notable that A. aeolicus also has a sequence scoring above trusted to the alpha subunit of formate dehydrogenase (TIGR01553).
Probab=40.69  E-value=19  Score=31.13  Aligned_cols=36  Identities=22%  Similarity=0.483  Sum_probs=19.7

Q ss_pred             CcccccCcccccCHHHHHHHHHHccCCCceecCcCccc
Q psy5202         185 NDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYA  222 (343)
Q Consensus       185 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~  222 (343)
                      .-.|++||..-.-  .+.+......|.+...|..|+..
T Consensus       184 ~~~CPvCGs~P~~--s~~~~~~~~~G~RyL~CslC~te  219 (305)
T TIGR01562       184 RTLCPACGSPPVA--SMVRQGGKETGLRYLSCSLCATE  219 (305)
T ss_pred             CCcCCCCCChhhh--hhhcccCCCCCceEEEcCCCCCc
Confidence            4479999965421  11111112456667778888743


No 222
>KOG2636|consensus
Probab=40.58  E-value=20  Score=32.26  Aligned_cols=29  Identities=24%  Similarity=0.594  Sum_probs=24.2

Q ss_pred             HHhcCCCCcccccCCCC-CCCCChHHHHHHHh
Q psy5202         234 KTHGRLGKDVYRCRFCE-MPFSVPSTLEKHMR  264 (343)
Q Consensus       234 ~~h~~~~~~~~~C~~C~-~~f~~~~~l~~H~~  264 (343)
                      +.|.  -..-|.|.+|| +++.-+..+.+|..
T Consensus       394 KLHG--L~~ey~CEICGNy~Y~GrkaF~RHF~  423 (497)
T KOG2636|consen  394 KLHG--LDIEYNCEICGNYVYKGRKAFDRHFN  423 (497)
T ss_pred             hhcC--CCcccceeeccCccccCcHHHHHHhH
Confidence            3554  45679999999 99999999999975


No 223
>PF10013 DUF2256:  Uncharacterized protein conserved in bacteria (DUF2256);  InterPro: IPR017136 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=40.17  E-value=21  Score=20.41  Aligned_cols=16  Identities=25%  Similarity=0.613  Sum_probs=11.3

Q ss_pred             ccCCCCCCCCChHHHH
Q psy5202         245 RCRFCEMPFSVPSTLE  260 (343)
Q Consensus       245 ~C~~C~~~f~~~~~l~  260 (343)
                      .|..||+.|.......
T Consensus        10 ~C~~C~rpf~WRKKW~   25 (42)
T PF10013_consen   10 ICPVCGRPFTWRKKWA   25 (42)
T ss_pred             cCcccCCcchHHHHHH
Confidence            5888888887765443


No 224
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=40.10  E-value=7.1  Score=38.04  Aligned_cols=23  Identities=22%  Similarity=0.421  Sum_probs=14.2

Q ss_pred             HHHHHHHhccCCCCeecCCCCcc
Q psy5202         106 TLQAHRRNHTGEKPFKCTVCDHA  128 (343)
Q Consensus       106 ~L~~H~~~h~~~~~~~C~~C~~~  128 (343)
                      .|.-|...|....|..|+.||..
T Consensus       462 ~L~CH~Cg~~~~~p~~Cp~Cgs~  484 (730)
T COG1198         462 QLRCHYCGYQEPIPQSCPECGSE  484 (730)
T ss_pred             eeEeCCCCCCCCCCCCCCCCCCC
Confidence            34444444555677888888754


No 225
>PRK03564 formate dehydrogenase accessory protein FdhE; Provisional
Probab=40.05  E-value=19  Score=31.20  Aligned_cols=36  Identities=22%  Similarity=0.556  Sum_probs=21.2

Q ss_pred             CCcccccCcccccCHHHHHHHHHHccCCCceecCcCccc
Q psy5202         184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYA  222 (343)
Q Consensus       184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~  222 (343)
                      ..-.|++||..-.- + +. +.....|.+...|..|+..
T Consensus       186 ~~~~CPvCGs~P~~-s-~v-~~~~~~G~RyL~CslC~te  221 (309)
T PRK03564        186 QRQFCPVCGSMPVS-S-VV-QIGTTQGLRYLHCNLCESE  221 (309)
T ss_pred             CCCCCCCCCCcchh-h-ee-eccCCCCceEEEcCCCCCc
Confidence            45679999976431 1 11 2233456677778888743


No 226
>smart00504 Ubox Modified RING finger domain. Modified RING finger domain, without the full complement of Zn2+-binding ligands. Probable involvement in E2-dependent ubiquitination.
Probab=39.82  E-value=57  Score=20.11  Aligned_cols=45  Identities=11%  Similarity=0.110  Sum_probs=24.9

Q ss_pred             cccccCcccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCC
Q psy5202         186 DTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFS  254 (343)
Q Consensus       186 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~  254 (343)
                      |.|++|+..+...               . ...||..|- +..+..+++.+       ..|+.|+..+.
T Consensus         2 ~~Cpi~~~~~~~P---------------v-~~~~G~v~~-~~~i~~~~~~~-------~~cP~~~~~~~   46 (63)
T smart00504        2 FLCPISLEVMKDP---------------V-ILPSGQTYE-RRAIEKWLLSH-------GTDPVTGQPLT   46 (63)
T ss_pred             cCCcCCCCcCCCC---------------E-ECCCCCEEe-HHHHHHHHHHC-------CCCCCCcCCCC
Confidence            4677777776442               1 234565553 34444454432       35888887764


No 227
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=39.67  E-value=4.6  Score=38.57  Aligned_cols=57  Identities=21%  Similarity=0.435  Sum_probs=38.0

Q ss_pred             ccccCcccccCHHHHHHHHHHccCCCce-ecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCC
Q psy5202         187 TCEFCGKVFKNCSNLTVHRRSHTGEKPY-KCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMP  252 (343)
Q Consensus       187 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~-~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~  252 (343)
                      .|..||-+|+-...|-.-+. ++.-+.| .|+.|.+.|....+-+-|        ..+..|+.||-.
T Consensus       125 ~CT~CGPRfTIi~alPYDR~-nTsM~~F~lC~~C~~EY~dP~nRRfH--------AQp~aCp~CGP~  182 (750)
T COG0068         125 NCTNCGPRFTIIEALPYDRE-NTSMADFPLCPFCDKEYKDPLNRRFH--------AQPIACPKCGPH  182 (750)
T ss_pred             ccCCCCcceeeeccCCCCcc-cCccccCcCCHHHHHHhcCccccccc--------cccccCcccCCC
Confidence            49999999987655543322 2222333 599999988887765444        356789999953


No 228
>PF10263 SprT-like:  SprT-like family;  InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases. 
Probab=39.40  E-value=8.7  Score=29.43  Aligned_cols=11  Identities=27%  Similarity=0.818  Sum_probs=5.9

Q ss_pred             CcccccCcccc
Q psy5202         185 NDTCEFCGKVF  195 (343)
Q Consensus       185 ~~~C~~C~~~f  195 (343)
                      .|.|..|+..+
T Consensus       123 ~~~C~~C~~~~  133 (157)
T PF10263_consen  123 VYRCPSCGREY  133 (157)
T ss_pred             EEEcCCCCCEe
Confidence            45555555554


No 229
>PF14205 Cys_rich_KTR:  Cysteine-rich KTR
Probab=39.00  E-value=17  Score=22.04  Aligned_cols=11  Identities=18%  Similarity=0.356  Sum_probs=6.0

Q ss_pred             CcccccCCCCC
Q psy5202         241 KDVYRCRFCEM  251 (343)
Q Consensus       241 ~~~~~C~~C~~  251 (343)
                      ..|..|+.|.+
T Consensus        26 NfPlyCpKCK~   36 (55)
T PF14205_consen   26 NFPLYCPKCKQ   36 (55)
T ss_pred             cccccCCCCCc
Confidence            44555666654


No 230
>TIGR00686 phnA alkylphosphonate utilization operon protein PhnA. The protein family includes an uncharacterized member designated phnA in Escherichia coli, part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage. This protein is not related to the characterized phosphonoacetate hydrolase designated PhnA by Kulakova, et al. (2001, 1997).
Probab=38.30  E-value=17  Score=25.62  Aligned_cols=29  Identities=14%  Similarity=0.341  Sum_probs=19.8

Q ss_pred             ecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCCh
Q psy5202         215 KCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVP  256 (343)
Q Consensus       215 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~  256 (343)
                      .|+.|+..|.+.             ....|.|+.|+..+...
T Consensus         4 ~CP~C~seytY~-------------dg~~~iCpeC~~EW~~~   32 (109)
T TIGR00686         4 PCPKCNSEYTYH-------------DGTQLICPSCLYEWNEN   32 (109)
T ss_pred             cCCcCCCcceEe-------------cCCeeECcccccccccc
Confidence            488888776653             23458899998877654


No 231
>PF12907 zf-met2:  Zinc-binding
Probab=38.17  E-value=19  Score=20.45  Aligned_cols=21  Identities=19%  Similarity=0.580  Sum_probs=9.8

Q ss_pred             eecCcCcccc---cChHHHHHHHH
Q psy5202         214 YKCELCSYAC---AQSSKLTRHMK  234 (343)
Q Consensus       214 ~~C~~C~~~f---~~~~~L~~H~~  234 (343)
                      +.|.+|...|   .....|..|..
T Consensus         2 i~C~iC~qtF~~t~~~~~L~eH~e   25 (40)
T PF12907_consen    2 IICKICRQTFMQTTNEPQLKEHAE   25 (40)
T ss_pred             cCcHHhhHHHHhcCCHHHHHHHHH
Confidence            3455555333   23344555544


No 232
>PRK14892 putative transcription elongation factor Elf1; Provisional
Probab=38.07  E-value=8.5  Score=26.93  Aligned_cols=12  Identities=17%  Similarity=0.418  Sum_probs=6.8

Q ss_pred             cccCCCCCCCCC
Q psy5202         244 YRCRFCEMPFSV  255 (343)
Q Consensus       244 ~~C~~C~~~f~~  255 (343)
                      ..|+.||..|..
T Consensus        43 ~~C~~CG~y~~~   54 (99)
T PRK14892         43 ITCGNCGLYTEF   54 (99)
T ss_pred             EECCCCCCccCE
Confidence            456666665543


No 233
>COG1326 Uncharacterized archaeal Zn-finger protein [General function prediction only]
Probab=37.18  E-value=31  Score=27.30  Aligned_cols=14  Identities=21%  Similarity=0.536  Sum_probs=9.1

Q ss_pred             CcccccCCCCCCCC
Q psy5202         241 KDVYRCRFCEMPFS  254 (343)
Q Consensus       241 ~~~~~C~~C~~~f~  254 (343)
                      ...+.|..||..+.
T Consensus        28 ~~lvrC~eCG~V~~   41 (201)
T COG1326          28 EPLVRCEECGTVHP   41 (201)
T ss_pred             ceEEEccCCCcEee
Confidence            34567777777663


No 234
>KOG2923|consensus
Probab=36.85  E-value=25  Score=22.04  Aligned_cols=13  Identities=38%  Similarity=0.833  Sum_probs=7.0

Q ss_pred             cCCCceecCcCcc
Q psy5202         209 TGEKPYKCELCSY  221 (343)
Q Consensus       209 ~~~~~~~C~~C~~  221 (343)
                      .|+..-.|+-|..
T Consensus        40 ~ge~Va~CpsCSL   52 (67)
T KOG2923|consen   40 NGEDVARCPSCSL   52 (67)
T ss_pred             CCCeeecCCCceE
Confidence            3455555666654


No 235
>PRK10220 hypothetical protein; Provisional
Probab=36.76  E-value=24  Score=24.95  Aligned_cols=30  Identities=17%  Similarity=0.502  Sum_probs=20.2

Q ss_pred             ecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChH
Q psy5202         215 KCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPS  257 (343)
Q Consensus       215 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~  257 (343)
                      .|+.|+..|.+.             ....|.|+.|+..+....
T Consensus         5 ~CP~C~seytY~-------------d~~~~vCpeC~hEW~~~~   34 (111)
T PRK10220          5 HCPKCNSEYTYE-------------DNGMYICPECAHEWNDAE   34 (111)
T ss_pred             cCCCCCCcceEc-------------CCCeEECCcccCcCCccc
Confidence            488888766552             234588999988776553


No 236
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=36.52  E-value=12  Score=28.37  Aligned_cols=11  Identities=36%  Similarity=1.035  Sum_probs=5.3

Q ss_pred             ceecCcCcccc
Q psy5202         213 PYKCELCSYAC  223 (343)
Q Consensus       213 ~~~C~~C~~~f  223 (343)
                      .|+|..|+..+
T Consensus       112 ~y~C~~C~~~~  122 (146)
T smart00731      112 PYRCTGCGQRY  122 (146)
T ss_pred             EEECCCCCCCC
Confidence            35555555443


No 237
>PRK14559 putative protein serine/threonine phosphatase; Provisional
Probab=36.30  E-value=38  Score=32.83  Aligned_cols=12  Identities=17%  Similarity=0.368  Sum_probs=5.6

Q ss_pred             ccCCCCCCCCCh
Q psy5202         245 RCRFCEMPFSVP  256 (343)
Q Consensus       245 ~C~~C~~~f~~~  256 (343)
                      .|+.||......
T Consensus        43 fC~~CG~~~~~~   54 (645)
T PRK14559         43 HCPNCGAETGTI   54 (645)
T ss_pred             cccccCCcccch
Confidence            355555544433


No 238
>PF00301 Rubredoxin:  Rubredoxin;  InterPro: IPR004039 Rubredoxin is a low molecular weight iron-containing bacterial protein involved in electron transfer [, ], sometimes replacing ferredoxin as an electron carrier []. The 3-D structures of a number of rubredoxins have been solved [, ]. The fold belongs to the alpha+beta class, with 2 alpha-helices and 2-3 beta-strands. Its active site contains an iron ion which is co-ordinated by the sulphurs of four conserved cysteine residues forming an almost regular tetrahedron. The conserved cysteines reside on two loops, which are the most conserved regions of the protein. In addition, a ring of acidic residues in the proximity of the [Fe(Cys)4] centre is also well-conserved []. ; GO: 0009055 electron carrier activity, 0046872 metal ion binding; PDB: 2RDV_C 1RDV_A 1S24_A 1T9O_B 1B2J_A 1SMW_A 2PVE_B 1BFY_A 1T9P_C 1C09_C ....
Probab=35.78  E-value=18  Score=21.38  Aligned_cols=39  Identities=21%  Similarity=0.470  Sum_probs=18.8

Q ss_pred             eecCcCcccccChHHHHHHH-HHhc--CCCCcccccCCCCCC
Q psy5202         214 YKCELCSYACAQSSKLTRHM-KTHG--RLGKDVYRCRFCEMP  252 (343)
Q Consensus       214 ~~C~~C~~~f~~~~~L~~H~-~~h~--~~~~~~~~C~~C~~~  252 (343)
                      |+|..|++.+.-...-..+- .--+  ..-..-|.|+.|+..
T Consensus         2 y~C~~CgyvYd~~~Gd~~~~i~pGt~F~~Lp~~w~CP~C~a~   43 (47)
T PF00301_consen    2 YQCPVCGYVYDPEKGDPENGIPPGTPFEDLPDDWVCPVCGAP   43 (47)
T ss_dssp             EEETTTSBEEETTTBBGGGTB-TT--GGGS-TT-B-TTTSSB
T ss_pred             cCCCCCCEEEcCCcCCcccCcCCCCCHHHCCCCCcCcCCCCc
Confidence            89999998886543211110 0000  001234899999853


No 239
>PRK00762 hypA hydrogenase nickel incorporation protein; Provisional
Probab=35.75  E-value=14  Score=27.11  Aligned_cols=10  Identities=30%  Similarity=0.451  Sum_probs=4.6

Q ss_pred             CcccccCcccc
Q psy5202         185 NDTCEFCGKVF  195 (343)
Q Consensus       185 ~~~C~~C~~~f  195 (343)
                      .+.| .||..|
T Consensus        70 ~~~C-~Cg~~~   79 (124)
T PRK00762         70 EIEC-ECGYEG   79 (124)
T ss_pred             eEEe-eCcCcc
Confidence            3445 455444


No 240
>KOG1994|consensus
Probab=35.35  E-value=18  Score=29.25  Aligned_cols=22  Identities=27%  Similarity=0.670  Sum_probs=18.4

Q ss_pred             cccccCCCCCCCCChHHHHHHH
Q psy5202         242 DVYRCRFCEMPFSVPSTLEKHM  263 (343)
Q Consensus       242 ~~~~C~~C~~~f~~~~~l~~H~  263 (343)
                      ..|.|-+||..|.+...|..|.
T Consensus       238 eh~YC~fCG~~y~~~edl~ehC  259 (268)
T KOG1994|consen  238 EHYYCFFCGIKYKDEEDLYEHC  259 (268)
T ss_pred             cceEEEEeccccCCHHHHHHhC
Confidence            3488999999999999998884


No 241
>PF10276 zf-CHCC:  Zinc-finger domain;  InterPro: IPR019401 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.   This entry represents a short conserved zinc-finger domain. It contains the sequence motif Cx8Hx14Cx2C. ; PDB: 2JVM_A 2JRR_A 2JZ8_A.
Probab=35.34  E-value=18  Score=20.47  Aligned_cols=11  Identities=27%  Similarity=0.748  Sum_probs=5.7

Q ss_pred             ceecCcCcccc
Q psy5202         213 PYKCELCSYAC  223 (343)
Q Consensus       213 ~~~C~~C~~~f  223 (343)
                      +-.|++|+..|
T Consensus        29 ~~~CpYCg~~y   39 (40)
T PF10276_consen   29 PVVCPYCGTRY   39 (40)
T ss_dssp             EEEETTTTEEE
T ss_pred             eEECCCCCCEE
Confidence            34555555544


No 242
>CHL00174 accD acetyl-CoA carboxylase beta subunit; Reviewed
Probab=35.24  E-value=19  Score=30.88  Aligned_cols=33  Identities=30%  Similarity=0.375  Sum_probs=20.2

Q ss_pred             CcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccCh
Q psy5202         185 NDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQS  226 (343)
Q Consensus       185 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~  226 (343)
                      ..+|+.|+..+..+. |.        +.-+.|+.|++.|.-.
T Consensus        38 w~kc~~C~~~~~~~~-l~--------~~~~vcp~c~~h~rlt   70 (296)
T CHL00174         38 WVQCENCYGLNYKKF-LK--------SKMNICEQCGYHLKMS   70 (296)
T ss_pred             eeECCCccchhhHHH-HH--------HcCCCCCCCCCCcCCC
Confidence            457888887764332 11        2336788888877643


No 243
>COG5188 PRP9 Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=34.89  E-value=40  Score=29.33  Aligned_cols=29  Identities=17%  Similarity=0.460  Sum_probs=17.0

Q ss_pred             hccCCCCeecCCCC-ccCCCHHHHHHHHHH
Q psy5202         113 NHTGEKPFKCTVCD-HACSQSAKLKKHMKI  141 (343)
Q Consensus       113 ~h~~~~~~~C~~C~-~~f~~~~~L~~H~~~  141 (343)
                      .|.-.+.|.|.+|| +.+..+..+.+|...
T Consensus       368 lhgLd~ef~CEICgNyvy~GR~~FdrHF~E  397 (470)
T COG5188         368 LHGLDIEFECEICGNYVYYGRDRFDRHFEE  397 (470)
T ss_pred             hcCCCcceeeeecccccccchHHHHhhhhh
Confidence            34445566666666 566666666666543


No 244
>PF04810 zf-Sec23_Sec24:  Sec23/Sec24 zinc finger;  InterPro: IPR006895 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  COPII (coat protein complex II)-coated vesicles carry proteins from the endoplasmic reticulum (ER) to the Golgi complex []. COPII-coated vesicles form on the ER by the stepwise recruitment of three cytosolic components: Sar1-GTP to initiate coat formation, Sec23/24 heterodimer to select SNARE and cargo molecules, and Sec13/31 to induce coat polymerisation and membrane deformation [].  Sec23 p and Sec24p are structurally related, folding into five distinct domains: a beta-barrel, a zinc-finger, an alpha/beta trunk domain (IPR006896 from INTERPRO), an all-helical region (IPR006900 from INTERPRO), and a C-terminal gelsolin-like domain (IPR007123 from INTERPRO). This entry describes an approximately 55-residue Sec23/24 zinc-binding domain, which lies against the beta-barrel at the periphery of the complex. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006886 intracellular protein transport, 0006888 ER to Golgi vesicle-mediated transport, 0030127 COPII vesicle coat; PDB: 3EFO_B 3EG9_B 3EGD_A 2YRC_A 2NUP_A 2YRD_A 3EGX_A 2NUT_A 3EH1_A 1PD0_A ....
Probab=34.88  E-value=7.3  Score=22.01  Aligned_cols=10  Identities=30%  Similarity=0.989  Sum_probs=3.8

Q ss_pred             CceecCcCcc
Q psy5202         212 KPYKCELCSY  221 (343)
Q Consensus       212 ~~~~C~~C~~  221 (343)
                      +.|.|.+|+.
T Consensus        23 ~~w~C~~C~~   32 (40)
T PF04810_consen   23 KTWICNFCGT   32 (40)
T ss_dssp             TEEEETTT--
T ss_pred             CEEECcCCCC
Confidence            3444544443


No 245
>PF07295 DUF1451:  Protein of unknown function (DUF1451);  InterPro: IPR009912 This family consists of several hypothetical bacterial proteins of around 160 residues in length. Members of this family contain four highly conserved cysteine resides toward the C-terminal region of the protein. The function of this family is unknown.
Probab=34.73  E-value=14  Score=27.96  Aligned_cols=28  Identities=25%  Similarity=0.489  Sum_probs=17.4

Q ss_pred             ceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCC
Q psy5202         213 PYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMP  252 (343)
Q Consensus       213 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~  252 (343)
                      .|.|..||.....         .|   ....-.|+.||..
T Consensus       112 ~l~C~~Cg~~~~~---------~~---~~~l~~Cp~C~~~  139 (146)
T PF07295_consen  112 TLVCENCGHEVEL---------TH---PERLPPCPKCGHT  139 (146)
T ss_pred             eEecccCCCEEEe---------cC---CCcCCCCCCCCCC
Confidence            4889999865432         12   2344579999864


No 246
>COG5112 UFD2 U1-like Zn-finger-containing protein [General function prediction only]
Probab=34.11  E-value=14  Score=25.82  Aligned_cols=26  Identities=23%  Similarity=0.332  Sum_probs=20.6

Q ss_pred             CCcccccCCCCCCCCChHHHHHHHhh
Q psy5202         240 GKDVYRCRFCEMPFSVPSTLEKHMRK  265 (343)
Q Consensus       240 ~~~~~~C~~C~~~f~~~~~l~~H~~~  265 (343)
                      |-..|.|-.|.+-|.+...|..|.+-
T Consensus        52 GlGqhYCieCaryf~t~~aL~~Hkkg   77 (126)
T COG5112          52 GLGQHYCIECARYFITEKALMEHKKG   77 (126)
T ss_pred             CCceeeeehhHHHHHHHHHHHHHhcc
Confidence            33457888899999999999998873


No 247
>PF07649 C1_3:  C1-like domain;  InterPro: IPR011424 This short domain is rich in cysteines and histidines. The pattern of conservation is similar to that found in IPR002219 from INTERPRO. C1 domains are protein kinase C-like zinc finger structures. Diacylglycerol (DAG) kinases (DGKs) have a two or three commonly conserved cysteine-rich C1 domains []. DGKs modulate the balance between the two signaling lipids, DAG and phosphatidic acid (PA), by phosphorylating DAG to yield PA []. The PKD (protein kinase D) family are novel DAG receptors. They have twin C1 domains, designated C1a and C1b, which bind DAG or phorbol esters. Individual C1 domains differ in ligand-binding activity and selectivity []. ; GO: 0047134 protein-disulfide reductase activity, 0055114 oxidation-reduction process; PDB: 1V5N_A.
Probab=33.75  E-value=23  Score=18.32  Aligned_cols=11  Identities=36%  Similarity=0.906  Sum_probs=3.5

Q ss_pred             cccccCCCCCC
Q psy5202         242 DVYRCRFCEMP  252 (343)
Q Consensus       242 ~~~~C~~C~~~  252 (343)
                      ..|.|..|+..
T Consensus        14 ~~Y~C~~Cdf~   24 (30)
T PF07649_consen   14 WFYRCSECDFD   24 (30)
T ss_dssp             -EEE-TTT---
T ss_pred             ceEECccCCCc
Confidence            34566666543


No 248
>PF04780 DUF629:  Protein of unknown function (DUF629);  InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=33.67  E-value=34  Score=31.47  Aligned_cols=23  Identities=22%  Similarity=0.354  Sum_probs=13.9

Q ss_pred             CcccccCcccccCHHHHHHHHHH
Q psy5202         185 NDTCEFCGKVFKNCSNLTVHRRS  207 (343)
Q Consensus       185 ~~~C~~C~~~f~~~~~l~~H~~~  207 (343)
                      -+.|+.|.+.|.....+..|+..
T Consensus        57 FWiCp~CskkF~d~~~~~~H~~~   79 (466)
T PF04780_consen   57 FWICPRCSKKFSDAESCLSHMEQ   79 (466)
T ss_pred             EeeCCcccceeCCHHHHHHHHHH
Confidence            45566666666666666666553


No 249
>COG4306 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=33.54  E-value=22  Score=25.70  Aligned_cols=58  Identities=19%  Similarity=0.222  Sum_probs=33.8

Q ss_pred             HHHHHHHHHccCCC-ceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChH
Q psy5202         199 SNLTVHRRSHTGEK-PYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPS  257 (343)
Q Consensus       199 ~~l~~H~~~h~~~~-~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~  257 (343)
                      ..|..-...+.|+. ..+|+.|.....-...+..=+-.-+ .-+.|-.|..||+.|.+..
T Consensus        24 pel~eafcskcgeati~qcp~csasirgd~~vegvlglg~-dye~psfchncgs~fpwte   82 (160)
T COG4306          24 PELMEAFCSKCGEATITQCPICSASIRGDYYVEGVLGLGG-DYEPPSFCHNCGSRFPWTE   82 (160)
T ss_pred             HHHHHHHHhhhchHHHhcCCccCCcccccceeeeeeccCC-CCCCcchhhcCCCCCCcHH
Confidence            34444444555543 3579999876655433322222221 2355677999999998764


No 250
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=33.44  E-value=5.3  Score=38.16  Aligned_cols=45  Identities=27%  Similarity=0.595  Sum_probs=22.6

Q ss_pred             cccccccccCChHHHHHHHHhccCCCCe-ecCCCCccCCCHHHHHHH
Q psy5202          93 ECEFCGKKFKYENTLQAHRRNHTGEKPF-KCTVCDHACSQSAKLKKH  138 (343)
Q Consensus        93 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~-~C~~C~~~f~~~~~L~~H  138 (343)
                      .|..||-+|+-...|-.- |.++....| .|+.|.+.|....+-+-|
T Consensus       125 ~CT~CGPRfTIi~alPYD-R~nTsM~~F~lC~~C~~EY~dP~nRRfH  170 (750)
T COG0068         125 NCTNCGPRFTIIEALPYD-RENTSMADFPLCPFCDKEYKDPLNRRFH  170 (750)
T ss_pred             ccCCCCcceeeeccCCCC-cccCccccCcCCHHHHHHhcCccccccc
Confidence            377777777654443211 111222222 477777766666554444


No 251
>COG4640 Predicted membrane protein [Function unknown]
Probab=33.41  E-value=29  Score=30.73  Aligned_cols=18  Identities=17%  Similarity=0.471  Sum_probs=9.7

Q ss_pred             cccCCCCCCCCChHHHHH
Q psy5202         244 YRCRFCEMPFSVPSTLEK  261 (343)
Q Consensus       244 ~~C~~C~~~f~~~~~l~~  261 (343)
                      ++|+.||..|.....+..
T Consensus        16 ~qC~qCG~~~t~~~sqan   33 (465)
T COG4640          16 VQCTQCGHKFTSRQSQAN   33 (465)
T ss_pred             ccccccCCcCCchhhhhh
Confidence            336666666655554443


No 252
>PTZ00064 histone acetyltransferase; Provisional
Probab=33.33  E-value=26  Score=32.21  Aligned_cols=29  Identities=21%  Similarity=0.537  Sum_probs=24.6

Q ss_pred             CCcccccCCCCCCCCChHHHHHHHhhhcC
Q psy5202         240 GKDVYRCRFCEMPFSVPSTLEKHMRKCVV  268 (343)
Q Consensus       240 ~~~~~~C~~C~~~f~~~~~l~~H~~~~~~  268 (343)
                      -...|.|++|-+-|.....|.+|+.+|..
T Consensus       277 ~d~LYICEfCLkY~~s~~~l~rH~~~C~~  305 (552)
T PTZ00064        277 VDTLHFCEYCLDFFCFEDELIRHLSRCQL  305 (552)
T ss_pred             CCeEEEccchhhhhCCHHHHHHHHhcCCC
Confidence            35679999999999999999999986543


No 253
>KOG0978|consensus
Probab=33.09  E-value=15  Score=35.41  Aligned_cols=20  Identities=20%  Similarity=0.471  Sum_probs=13.2

Q ss_pred             ccccCCCCCCCCChHHHHHH
Q psy5202         243 VYRCRFCEMPFSVPSTLEKH  262 (343)
Q Consensus       243 ~~~C~~C~~~f~~~~~l~~H  262 (343)
                      .-+||.|+..|.-..-+..|
T Consensus       678 qRKCP~Cn~aFganDv~~I~  697 (698)
T KOG0978|consen  678 QRKCPKCNAAFGANDVHRIH  697 (698)
T ss_pred             cCCCCCCCCCCCcccccccC
Confidence            34688888888766554444


No 254
>cd00924 Cyt_c_Oxidase_Vb Cytochrome c oxidase subunit Vb.  Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes.  It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane.  The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome.  Found only in eukaryotes, subunit Vb is one of three mammalian subunits that lacks a transmembrane region.  Subunit Vb is located on the matrix side of the membrane and binds the regulatory subunit of protein kinase A.  The abnormally extended conformation is stable only in the CcO assembly.
Probab=33.09  E-value=23  Score=24.72  Aligned_cols=18  Identities=28%  Similarity=0.586  Sum_probs=13.8

Q ss_pred             HccCCCceecCcCcccccC
Q psy5202         207 SHTGEKPYKCELCSYACAQ  225 (343)
Q Consensus       207 ~h~~~~~~~C~~C~~~f~~  225 (343)
                      .+.+ ++++|..||..|.-
T Consensus        74 l~~g-~~~rC~eCG~~fkL   91 (97)
T cd00924          74 LEKG-KPKRCPECGHVFKL   91 (97)
T ss_pred             EeCC-CceeCCCCCcEEEE
Confidence            3555 78999999988863


No 255
>TIGR00515 accD acetyl-CoA carboxylase, carboxyl transferase, beta subunit. The enzyme acetyl-CoA carboxylase contains a biotin carboxyl carrier protein or domain, a biotin carboxylase, and a carboxyl transferase. This model represents the beta chain of the carboxyl transferase for cases in which the architecture of the protein is as in E. coli, in which the carboxyltransferase portion consists of two non-identical subnits, alpha and beta.
Probab=32.99  E-value=22  Score=30.49  Aligned_cols=33  Identities=21%  Similarity=0.398  Sum_probs=20.1

Q ss_pred             CcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccCh
Q psy5202         185 NDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQS  226 (343)
Q Consensus       185 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~  226 (343)
                      ..+|+.|+..+..+. |.        ..-+.|+.|++.|.-.
T Consensus        26 ~~~c~~c~~~~~~~~-l~--------~~~~vc~~c~~h~rl~   58 (285)
T TIGR00515        26 WTKCPKCGQVLYTKE-LE--------RNLEVCPKCDHHMRMD   58 (285)
T ss_pred             eeECCCCcchhhHHH-HH--------hhCCCCCCCCCcCcCC
Confidence            567888887764332 21        1236788888877643


No 256
>PF05495 zf-CHY:  CHY zinc finger;  InterPro: IPR008913 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  Pirh2 is an eukaryotic ubiquitin protein ligase, which has been shown to promote p53 degradation in mammals. Pirh2 physically interacts with p53 and promotes ubiquitination of p53 independently of MDM2. Like MDM2, Pirh2 is thought to participate in an autoregulatory feedback loop that controls p53 function. Pirh2 proteins contain three distinct zinc fingers, the CHY-type, the CTCHY-type which is C-terminal to the CHY-type zinc finger and a RING finger. The CHY-type zinc finger has no currently known function []. As well as Pirh2, the CHY-type zinc finger is also found in the following proteins:   Yeast helper of Tim protein 13. Hot13 may have a role in the assembly and recycling of the small Tims, a complex of the mitochondrial intermembrane space that participates in the TIM22 import pathway for assembly of the inner membrane [] Several plant hypothetical proteins that also contain haemerythrin cation binding domains Several protozoan hypothetical proteins that also contain a Myb domain   The solution structure of this zinc finger has been solved and binds 3 zinc atoms as shown in the following schematic representation:   ++---------+-----+ || | | CXHYxxxxxxxxxCCxxxxxCxxCHxxxxxHxxxxxxxxxxxCxxCxxxxxxxxxCxxC | | | | | | | | +-+-----------------+--+ +--+---------+--+ 'C': conserved cysteine involved in the binding of one zinc atom. 'H': conserved histidine involved in the binding of one zinc atom.  More information about these proteins can be found at Protein of the Month: Zinc Fingers []; GO: 0008270 zinc ion binding; PDB: 2DKT_A 2K2C_A.
Probab=32.90  E-value=6.9  Score=25.49  Aligned_cols=59  Identities=22%  Similarity=0.317  Sum_probs=28.8

Q ss_pred             CcccccCcccccCHH---HHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCC
Q psy5202         185 NDTCEFCGKVFKNCS---NLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPF  253 (343)
Q Consensus       185 ~~~C~~C~~~f~~~~---~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f  253 (343)
                      .++|+-|++.|.-..   .+..|...........|..|+..+.....       ..  + ..|.|+.|+..|
T Consensus        10 ~~~~~cC~~~y~C~~CHde~~~H~~~~~~~~~v~Cg~C~~~~~~~~~-------~c--~-~~~~C~~C~~~~   71 (71)
T PF05495_consen   10 AIRFPCCGKYYPCRFCHDELEDHPFDRWPVKRVICGKCRTEQPIDEY-------SC--G-ADYFCPICGLYF   71 (71)
T ss_dssp             EEEETTTTEEESSHHHHHHCSSS---TTT--EEEETTT--EEES-SB-------TT-----SEEETTTTEEE
T ss_pred             EEECCcccCeecHHHHHHHhccCccccccccCeECCCCCCccChhhh-------hc--C-CCccCcCcCCCC
Confidence            356667777776541   11122222233346789999887765444       11  2 458898887543


No 257
>cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer.
Probab=32.79  E-value=19  Score=21.63  Aligned_cols=37  Identities=19%  Similarity=0.414  Sum_probs=20.2

Q ss_pred             eecCcCcccccChHHHHHHHHHhcC----CCCcccccCCCCC
Q psy5202         214 YKCELCSYACAQSSKLTRHMKTHGR----LGKDVYRCRFCEM  251 (343)
Q Consensus       214 ~~C~~C~~~f~~~~~L~~H~~~h~~----~~~~~~~C~~C~~  251 (343)
                      |+|..||+.+.-...-..+- +-.+    .-...|.|+.|+.
T Consensus         2 y~C~~CgyiYd~~~Gd~~~~-i~pGt~f~~Lp~~w~CP~C~a   42 (50)
T cd00730           2 YECRICGYIYDPAEGDPDEG-IPPGTPFEDLPDDWVCPVCGA   42 (50)
T ss_pred             cCCCCCCeEECCCCCCcccC-cCCCCCHhHCCCCCCCCCCCC
Confidence            89999999887532111100 0000    0123489999985


No 258
>PF01428 zf-AN1:  AN1-like Zinc finger;  InterPro: IPR000058 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents the AN1-type zinc finger domain, which has a dimetal (zinc)-bound alpha/beta fold. This domain was first identified as a zinc finger at the C terminus of AN1 Q91889 from SWISSPROT, a ubiquitin-like protein in Xenopus laevis []. The AN1-type zinc finger contains six conserved cysteines and two histidines that could potentially coordinate 2 zinc atoms. Certain stress-associated proteins (SAP) contain AN1 domain, often in combination with A20 zinc finger domains (SAP8) or C2H2 domains (SAP16) []. For example, the human protein Znf216 has an A20 zinc-finger at the N terminus and an AN1 zinc-finger at the C terminus, acting to negatively regulate the NFkappaB activation pathway and to interact with components of the immune response like RIP, IKKgamma and TRAF6. The interact of Znf216 with IKK-gamma and RIP is mediated by the A20 zinc-finger domain, while its interaction with TRAF6 is mediated by the AN1 zinc-finger domain; therefore, both zinc-finger domains are involved in regulating the immune response []. The AN1 zinc finger domain is also found in proteins containing a ubiquitin-like domain, which are involved in the ubiquitination pathway []. Proteins containing an AN1-type zinc finger include:   Ascidian posterior end mark 6 (pem-6) protein []. Human AWP1 protein (associated with PRK1), which is expressed during early embryogenesis []. Human immunoglobulin mu binding protein 2 (SMUBP-2), mutations in which cause muscular atrophy with respiratory distress type 1 [].   More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 1WFP_A 1WYS_A 1WG2_A 1WFH_A 1X4W_A 1WFE_A 1WFL_A 1X4V_A.
Probab=32.39  E-value=18  Score=20.75  Aligned_cols=14  Identities=29%  Similarity=0.800  Sum_probs=8.0

Q ss_pred             CceecCcCcccccC
Q psy5202         212 KPYKCELCSYACAQ  225 (343)
Q Consensus       212 ~~~~C~~C~~~f~~  225 (343)
                      .+|.|..|+..|=.
T Consensus        12 ~~~~C~~C~~~FC~   25 (43)
T PF01428_consen   12 LPFKCKHCGKSFCL   25 (43)
T ss_dssp             SHEE-TTTS-EE-T
T ss_pred             CCeECCCCCcccCc
Confidence            46788888887754


No 259
>COG1327 Predicted transcriptional regulator, consists of a Zn-ribbon and ATP-cone domains [Transcription]
Probab=32.24  E-value=10  Score=28.63  Aligned_cols=14  Identities=14%  Similarity=0.373  Sum_probs=7.4

Q ss_pred             eecCCCCccCCCHH
Q psy5202         120 FKCTVCDHACSQSA  133 (343)
Q Consensus       120 ~~C~~C~~~f~~~~  133 (343)
                      -.|..|+..|.+..
T Consensus        29 ReC~~C~~RFTTfE   42 (156)
T COG1327          29 RECLECGERFTTFE   42 (156)
T ss_pred             hcccccccccchhh
Confidence            34555555555543


No 260
>PLN00104 MYST -like histone acetyltransferase; Provisional
Probab=32.04  E-value=25  Score=32.00  Aligned_cols=28  Identities=32%  Similarity=0.713  Sum_probs=22.7

Q ss_pred             CCcccccCCCCCCCCChHHHHHHHhhhc
Q psy5202         240 GKDVYRCRFCEMPFSVPSTLEKHMRKCV  267 (343)
Q Consensus       240 ~~~~~~C~~C~~~f~~~~~l~~H~~~~~  267 (343)
                      ....|.|++|-+-|.....|.+|+.+|.
T Consensus       195 ~~~lyiCe~Cl~y~~~~~~~~~H~~~C~  222 (450)
T PLN00104        195 CSKLYFCEFCLKFMKRKEQLQRHMKKCD  222 (450)
T ss_pred             CCeEEEchhhhhhhcCHHHHHHHHhcCC
Confidence            3467889999999999999999987554


No 261
>KOG0717|consensus
Probab=31.82  E-value=26  Score=31.85  Aligned_cols=22  Identities=23%  Similarity=0.536  Sum_probs=20.0

Q ss_pred             cccCCCCCCCCChHHHHHHHhh
Q psy5202         244 YRCRFCEMPFSVPSTLEKHMRK  265 (343)
Q Consensus       244 ~~C~~C~~~f~~~~~l~~H~~~  265 (343)
                      +.|.+|.+.|.+...|..|...
T Consensus       293 lyC~vCnKsFKseKq~kNHEnS  314 (508)
T KOG0717|consen  293 LYCVVCNKSFKSEKQLKNHENS  314 (508)
T ss_pred             eEEeeccccccchHHHHhhHHH
Confidence            7899999999999999999863


No 262
>KOG0978|consensus
Probab=31.77  E-value=17  Score=35.04  Aligned_cols=49  Identities=18%  Similarity=0.384  Sum_probs=29.8

Q ss_pred             CcccccccccccCChHHH-HHHH------HhccCCCCeecCCCCccCCCHHHHHHH
Q psy5202          90 KNKECEFCGKKFKYENTL-QAHR------RNHTGEKPFKCTVCDHACSQSAKLKKH  138 (343)
Q Consensus        90 ~~~~C~~C~~~f~~~~~L-~~H~------~~h~~~~~~~C~~C~~~f~~~~~L~~H  138 (343)
                      ...+|+.|+..+...--. --|.      +.-...+.-+||.|+..|....-+..|
T Consensus       642 ~~LkCs~Cn~R~Kd~vI~kC~H~FC~~Cvq~r~etRqRKCP~Cn~aFganDv~~I~  697 (698)
T KOG0978|consen  642 ELLKCSVCNTRWKDAVITKCGHVFCEECVQTRYETRQRKCPKCNAAFGANDVHRIH  697 (698)
T ss_pred             hceeCCCccCchhhHHHHhcchHHHHHHHHHHHHHhcCCCCCCCCCCCcccccccC
Confidence            347899999666543221 1231      111233556899999999887655544


No 263
>PF04423 Rad50_zn_hook:  Rad50 zinc hook motif;  InterPro: IPR007517 The Mre11 complex (Mre11 Rad50 Nbs1) is central to chromosomal maintenance and functions in homologous recombination, telomere maintenance and sister chromatid association. The Rad50 coiled-coil region contains a dimer interface at the apex of the coiled coils in which pairs of conserved Cys-X-X-Cys motifs form interlocking hooks that bind one Zn ion. This alignment includes the zinc hook motif and a short stretch of coiled-coil on either side.; GO: 0004518 nuclease activity, 0005524 ATP binding, 0008270 zinc ion binding, 0006281 DNA repair; PDB: 1L8D_B.
Probab=31.65  E-value=14  Score=22.46  Aligned_cols=13  Identities=23%  Similarity=0.690  Sum_probs=7.0

Q ss_pred             ccCCCCCCCCChH
Q psy5202         245 RCRFCEMPFSVPS  257 (343)
Q Consensus       245 ~C~~C~~~f~~~~  257 (343)
                      .||+|++.|....
T Consensus        22 ~CPlC~r~l~~e~   34 (54)
T PF04423_consen   22 CCPLCGRPLDEEH   34 (54)
T ss_dssp             E-TTT--EE-HHH
T ss_pred             cCCCCCCCCCHHH
Confidence            8999999987643


No 264
>PF11672 DUF3268:  Protein of unknown function (DUF3268);  InterPro: IPR021686 This entry is represented by Listeria phage P100, Gp150. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=31.21  E-value=18  Score=25.44  Aligned_cols=7  Identities=43%  Similarity=1.307  Sum_probs=4.2

Q ss_pred             cccCCCC
Q psy5202         244 YRCRFCE  250 (343)
Q Consensus       244 ~~C~~C~  250 (343)
                      |.|.-|+
T Consensus        32 y~C~~C~   38 (102)
T PF11672_consen   32 YVCTPCD   38 (102)
T ss_pred             EECCCCC
Confidence            5666665


No 265
>PRK05654 acetyl-CoA carboxylase subunit beta; Validated
Probab=30.97  E-value=23  Score=30.48  Aligned_cols=34  Identities=24%  Similarity=0.384  Sum_probs=21.5

Q ss_pred             CCcccccCcccccCHHHHHHHHHHccCCCceecCcCcccccCh
Q psy5202         184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQS  226 (343)
Q Consensus       184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~  226 (343)
                      -..+|+.|+..+..+. |.        ..-+.|+.|++.|.-.
T Consensus        26 ~~~~c~~c~~~~~~~~-l~--------~~~~vc~~c~~h~rl~   59 (292)
T PRK05654         26 LWTKCPSCGQVLYRKE-LE--------ANLNVCPKCGHHMRIS   59 (292)
T ss_pred             CeeECCCccchhhHHH-HH--------hcCCCCCCCCCCeeCC
Confidence            3567999987764332 21        1236899999888643


No 266
>PHA02998 RNA polymerase subunit; Provisional
Probab=30.57  E-value=8.2  Score=29.85  Aligned_cols=12  Identities=17%  Similarity=0.351  Sum_probs=7.8

Q ss_pred             eecCcCcccccC
Q psy5202         214 YKCELCSYACAQ  225 (343)
Q Consensus       214 ~~C~~C~~~f~~  225 (343)
                      |.|..||+.|.-
T Consensus       172 YkC~~CG~~wkp  183 (195)
T PHA02998        172 HACRDCKKHFKP  183 (195)
T ss_pred             EEcCCCCCccCC
Confidence            677777766643


No 267
>PLN02294 cytochrome c oxidase subunit Vb
Probab=30.53  E-value=26  Score=27.06  Aligned_cols=15  Identities=20%  Similarity=0.636  Sum_probs=12.1

Q ss_pred             CCceecCcCcccccC
Q psy5202         211 EKPYKCELCSYACAQ  225 (343)
Q Consensus       211 ~~~~~C~~C~~~f~~  225 (343)
                      .++++|+.||..|.-
T Consensus       139 Gkp~RCpeCG~~fkL  153 (174)
T PLN02294        139 GKSFECPVCTQYFEL  153 (174)
T ss_pred             CCceeCCCCCCEEEE
Confidence            468899999988864


No 268
>PF04216 FdhE:  Protein involved in formate dehydrogenase formation;  InterPro: IPR006452 This family of sequences describe an accessory protein required for the assembly of formate dehydrogenase of certain proteobacteria although not present in the final complex []. The exact nature of the function of FdhE in the assembly of the complex is unknown, but considering the presence of selenocysteine, molybdopterin, iron-sulphur clusters and cytochrome b556, it is likely to be involved in the insertion of cofactors. ; GO: 0005737 cytoplasm; PDB: 2FIY_B.
Probab=30.52  E-value=11  Score=32.32  Aligned_cols=9  Identities=33%  Similarity=0.951  Sum_probs=3.2

Q ss_pred             cccccCccc
Q psy5202         186 DTCEFCGKV  194 (343)
Q Consensus       186 ~~C~~C~~~  194 (343)
                      -.|++||..
T Consensus       173 g~CPvCGs~  181 (290)
T PF04216_consen  173 GYCPVCGSP  181 (290)
T ss_dssp             SS-TTT---
T ss_pred             CcCCCCCCc
Confidence            367777754


No 269
>PRK00420 hypothetical protein; Validated
Probab=30.38  E-value=25  Score=25.25  Aligned_cols=14  Identities=29%  Similarity=0.496  Sum_probs=10.3

Q ss_pred             CCcccccCcccccC
Q psy5202         184 RNDTCEFCGKVFKN  197 (343)
Q Consensus       184 ~~~~C~~C~~~f~~  197 (343)
                      ....|+.||..+.-
T Consensus        39 g~~~Cp~Cg~~~~v   52 (112)
T PRK00420         39 GEVVCPVHGKVYIV   52 (112)
T ss_pred             CceECCCCCCeeee
Confidence            55789999986654


No 270
>KOG2747|consensus
Probab=30.22  E-value=21  Score=31.82  Aligned_cols=29  Identities=28%  Similarity=0.794  Sum_probs=23.2

Q ss_pred             CcccccCCCCCCCCChHHHHHHHhhhcCc
Q psy5202         241 KDVYRCRFCEMPFSVPSTLEKHMRKCVVN  269 (343)
Q Consensus       241 ~~~~~C~~C~~~f~~~~~l~~H~~~~~~~  269 (343)
                      ...|.|.+|-+-+.+...|.+|+++|...
T Consensus       156 ~~lYiCEfCLkY~~s~~~l~rH~~kC~~r  184 (396)
T KOG2747|consen  156 DKLYICEFCLKYMKSRTSLQRHLKKCKLR  184 (396)
T ss_pred             CeEEEehHHHhHhchHHHHHHHHHhcCCC
Confidence            46688888888888888888898876544


No 271
>TIGR00310 ZPR1_znf ZPR1 zinc finger domain.
Probab=29.92  E-value=6  Score=31.57  Aligned_cols=11  Identities=18%  Similarity=0.461  Sum_probs=5.2

Q ss_pred             cccCCCCCCCC
Q psy5202         244 YRCRFCEMPFS  254 (343)
Q Consensus       244 ~~C~~C~~~f~  254 (343)
                      |.|+.||++..
T Consensus        31 f~C~~CGyr~~   41 (192)
T TIGR00310        31 TICEHCGYRSN   41 (192)
T ss_pred             EECCCCCCccc
Confidence            44555554443


No 272
>KOG3362|consensus
Probab=29.92  E-value=20  Score=26.72  Aligned_cols=24  Identities=21%  Similarity=0.468  Sum_probs=18.9

Q ss_pred             cccccCCCCCCCCChHHHHHHHhh
Q psy5202         242 DVYRCRFCEMPFSVPSTLEKHMRK  265 (343)
Q Consensus       242 ~~~~C~~C~~~f~~~~~l~~H~~~  265 (343)
                      .+|.|.-||-.+-+..-|..|..+
T Consensus       128 S~ysC~~CG~kyCsv~C~~~HneT  151 (156)
T KOG3362|consen  128 SKYSCVNCGTKYCSVRCLKTHNET  151 (156)
T ss_pred             chhHHHhcCCceeechhhhhcccc
Confidence            357888888888888888888754


No 273
>KOG0717|consensus
Probab=29.83  E-value=32  Score=31.24  Aligned_cols=22  Identities=18%  Similarity=0.489  Sum_probs=20.1

Q ss_pred             eecCcCcccccChHHHHHHHHH
Q psy5202         214 YKCELCSYACAQSSKLTRHMKT  235 (343)
Q Consensus       214 ~~C~~C~~~f~~~~~L~~H~~~  235 (343)
                      +.|.+|.+.|.+...|..|...
T Consensus       293 lyC~vCnKsFKseKq~kNHEnS  314 (508)
T KOG0717|consen  293 LYCVVCNKSFKSEKQLKNHENS  314 (508)
T ss_pred             eEEeeccccccchHHHHhhHHH
Confidence            7799999999999999999754


No 274
>PF10537 WAC_Acf1_DNA_bd:  ATP-utilising chromatin assembly and remodelling N-terminal;  InterPro: IPR013136 ACF (for ATP-utilising chromatin assembly and remodeling factor) is a chromatin-remodeling complex that catalyzes the ATP-dependent assembly of periodic nucleosome arrays. This reaction utilises the energy of ATP hydrolysis by ISWI, the smaller of the two subunits of ACF. Acf1, the large subunit of ACF, is essential for the full activity of the complex. The WAC (WSTF/Acf1/cbp146) domain is an ~110-residue module present at the N-termini of Acf1-related proteins in a variety of organisms. It is found in association with other domains such as the bromodomain, the PHD-type zinc finger, DDT or WAKS. The DNA-binding region of Acf1 includes the WAC domain, which is necessary for the efficient binding of ACF complex to DNA. It seems probable that the WAC domain will be involved in DNA binding in other related factors [, ].   Some proteins known to contain a WAC domain are the Drosophila melanogaster (Fruit fly) ATP-dependent chromatin assembly factor large subunit Acf1, human WSTF (Williams syndrome transcription factor), mouse cbp146, yeast imitation switch two complex protein 1 (ITC1 or YGL133w), and yeast protein YPL216w. 
Probab=29.81  E-value=80  Score=22.30  Aligned_cols=36  Identities=8%  Similarity=0.157  Sum_probs=22.0

Q ss_pred             ceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCC
Q psy5202         213 PYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEM  251 (343)
Q Consensus       213 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~  251 (343)
                      .|.|+.=|..|.+-..+...+...   ..+.|.|...|+
T Consensus         3 Vf~i~~T~EiF~dYe~Y~~R~~~y---~~~vwtC~~TGk   38 (102)
T PF10537_consen    3 VFYIPFTGEIFRDYEEYLKRMILY---NQRVWTCEITGK   38 (102)
T ss_pred             eEEeCCCCcccCCHHHHHHHHHHH---hCCeeEEecCCC
Confidence            455666666666666666665555   355666666665


No 275
>PF11931 DUF3449:  Domain of unknown function (DUF3449);  InterPro: IPR024598 This presumed domain is functionally uncharacterised. It has two conserved sequence motifs: PIP and CEICG and contains a zinc-finger of the C2H2-type.; PDB: 4DGW_A.
Probab=29.74  E-value=18  Score=28.93  Aligned_cols=27  Identities=26%  Similarity=0.609  Sum_probs=0.0

Q ss_pred             ccCCCceecCcCcc-cccChHHHHHHHH
Q psy5202         208 HTGEKPYKCELCSY-ACAQSSKLTRHMK  234 (343)
Q Consensus       208 h~~~~~~~C~~C~~-~f~~~~~L~~H~~  234 (343)
                      |.-...|.|.+||- +|.-+..+.+|..
T Consensus        96 hGL~~ey~CEICGN~~Y~GrkaFekHF~  123 (196)
T PF11931_consen   96 HGLGVEYKCEICGNQSYKGRKAFEKHFQ  123 (196)
T ss_dssp             ----------------------------
T ss_pred             hCCCCeeeeEeCCCcceecHHHHHHhcC
Confidence            33345577777774 4556667777765


No 276
>PF08790 zf-LYAR:  LYAR-type C2HC zinc finger ;  InterPro: IPR014898 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This C2HC zinc finger domain is found in LYAR proteins such as Q08288 from SWISSPROT, which are involved in cell growth regulation.  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1WJV_A.
Probab=29.68  E-value=11  Score=19.43  Aligned_cols=19  Identities=26%  Similarity=0.567  Sum_probs=10.4

Q ss_pred             ccccccccccCChHHHHHHH
Q psy5202          92 KECEFCGKKFKYENTLQAHR  111 (343)
Q Consensus        92 ~~C~~C~~~f~~~~~L~~H~  111 (343)
                      |.|-.|++.| .....+.|.
T Consensus         1 ~sCiDC~~~F-~~~~y~~Ht   19 (28)
T PF08790_consen    1 FSCIDCSKDF-DGDSYKSHT   19 (28)
T ss_dssp             EEETTTTEEE-EGGGTTT--
T ss_pred             CeeecCCCCc-CcCCcCCCC
Confidence            4577777777 344455553


No 277
>PF08882 Acetone_carb_G:  Acetone carboxylase gamma subunit;  InterPro: IPR014979 Acetone carboxylase is the key enzyme of bacterial acetone metabolism, catalysing the condensation of acetone and CO2 to form acetoacetate [] according to the following reaction:  CH3COCH3 + CO2 + ATP = CH3COCH2COO- + AMP + 2P(i) + H+   It has the subunit composition: (alpha(2)beta(2)gamma(2) multimers of 85kDa, 78kDa, and 20kDa subunits). It is expressed to high levels (17 to 25% of soluble protein) in cells grown with acetone as the carbon source but are not present at detectable levels in cells grown with other carbon sources []. Acetone carboxylase may enable Helicobacter pylori to survive off acetone in the stomach of humans and other mammals where it is the etiological agent of peptic ulcer disease []. This entry represents the family of gamma subunit-related acetone carboxylase proteins.
Probab=29.49  E-value=17  Score=25.80  Aligned_cols=37  Identities=16%  Similarity=0.404  Sum_probs=23.4

Q ss_pred             cccccccccccCC-hHHHHHHHHhccCC----------------------CCeecCCCCcc
Q psy5202          91 NKECEFCGKKFKY-ENTLQAHRRNHTGE----------------------KPFKCTVCDHA  128 (343)
Q Consensus        91 ~~~C~~C~~~f~~-~~~L~~H~~~h~~~----------------------~~~~C~~C~~~  128 (343)
                      .-+| .||..|-. +.+.+.|...+-.+                      +.|-|+.||..
T Consensus        24 ~vkc-~CGh~f~d~r~NwK~~alv~vRd~~E~~~~iYp~~~aPdp~w~~irEyyCP~Cgt~   83 (112)
T PF08882_consen   24 VVKC-DCGHEFCDARENWKLGALVYVRDPEEIHPEIYPFTMAPDPEWQVIREYYCPGCGTQ   83 (112)
T ss_pred             eeec-cCCCeecChhcChhhCcEEEecChHHhhhhhcccccCCCCCcEEEEEEECCCCcce
Confidence            3445 49999874 45566665544221                      33889999864


No 278
>KOG1842|consensus
Probab=29.42  E-value=28  Score=31.35  Aligned_cols=29  Identities=17%  Similarity=0.401  Sum_probs=21.3

Q ss_pred             ccccCCCCCCCCChHHHHHHHhhhcCccc
Q psy5202         243 VYRCRFCEMPFSVPSTLEKHMRKCVVNQN  271 (343)
Q Consensus       243 ~~~C~~C~~~f~~~~~l~~H~~~~~~~~~  271 (343)
                      -|.||+|.+.|...+.|..|+..-|....
T Consensus        15 gflCPiC~~dl~~~~~L~~H~d~eH~~ed   43 (505)
T KOG1842|consen   15 GFLCPICLLDLPNLSALNDHLDVEHFEED   43 (505)
T ss_pred             cccCchHhhhhhhHHHHHHHHhhhccccc
Confidence            47788888888888888888776666554


No 279
>KOG0320|consensus
Probab=29.09  E-value=34  Score=26.71  Aligned_cols=15  Identities=20%  Similarity=0.521  Sum_probs=9.7

Q ss_pred             CCcccccCcccccCH
Q psy5202         184 RNDTCEFCGKVFKNC  198 (343)
Q Consensus       184 ~~~~C~~C~~~f~~~  198 (343)
                      ...+|+.|++....+
T Consensus       166 ~~~~CP~C~kkIt~k  180 (187)
T KOG0320|consen  166 NTNKCPTCRKKITHK  180 (187)
T ss_pred             hCCCCCCcccccchh
Confidence            445788888755443


No 280
>PF03145 Sina:  Seven in absentia protein family;  InterPro: IPR018121 The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results in the transformation of the R7 precursor cell to a non-neuronal cell type. The Sina protein contains an N-terminal RING finger domain C3HC4-type. Through this domain, Sina binds E2 ubiquitin-conjugating enzymes (UbcD1) Sina also interacts with Tramtrack (TTK88) via PHYL. Tramtrack is a transcriptional repressor that blocks photoreceptor determination, while PHYL down-regulates the activity of TTK88. In turn, the activity of PHYL requires the activation of the Sevenless receptor tyrosine kinase, a process essential for R7 determination. It is thought that Sina targets TTK88 for degradation, therefore promoting the R7 pathway. Murine and human homologues of Sina have also been identified. The human homologue Siah-1 [] also binds E2 enzymes (UbcH5) and through a series of physical interactions, targets beta-catenin for ubiquitin degradation. Siah-1 expression is enhanced by p53, itself promoted by DNA damage. Thus this pathway links DNA damage to beta-catenin degradation [, ]. Sina proteins, therefore, physically interact with a variety of proteins. The N-terminal RING finger domain that binds ubiquitin conjugating enzymes is a C3HC4-type, and does not form part of the alignment for this family. The remainder C-terminal part is involved in interactions with other proteins, and is included in this alignment. In addition to the Drosophila protein and mammalian homologues, whose similarity was noted previously, this family also includes putative homologues from Caenorhabditis elegans, Arabidopsis thaliana.; GO: 0006511 ubiquitin-dependent protein catabolic process, 0007275 multicellular organismal development, 0005634 nucleus; PDB: 2A25_A 1K2F_B 2AN6_C.
Probab=28.45  E-value=35  Score=27.36  Aligned_cols=46  Identities=20%  Similarity=0.247  Sum_probs=17.3

Q ss_pred             CcccccChHHHHHHHHHhcCCCCcccccCC----CCCCCCChHHHHHHHhhhcCc
Q psy5202         219 CSYACAQSSKLTRHMKTHGRLGKDVYRCRF----CEMPFSVPSTLEKHMRKCVVN  269 (343)
Q Consensus       219 C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~----C~~~f~~~~~l~~H~~~~~~~  269 (343)
                      |...+.. .....|.+..   .-+|+.|+.    |+..= ....|..|++..|..
T Consensus        24 C~~~~~~-~~~~~HE~~C---~~~p~~CP~~~~~C~~~G-~~~~l~~Hl~~~H~~   73 (198)
T PF03145_consen   24 CTETFPY-SEKREHEEEC---PFRPCSCPFPGSGCDWQG-SYKELLDHLRDKHSW   73 (198)
T ss_dssp             ---EE-G-GGHHHHHHT----TTSEEE-SSSSTT---EE-ECCCHHHHHHHHTTT
T ss_pred             Ccccccc-cChhhHhccC---CCcCCcCCCCCCCccccC-CHHHHHHHHHHHCCC
Confidence            4444322 3444555554   245555555    43321 223455666555543


No 281
>PF11789 zf-Nse:  Zinc-finger of the MIZ type in Nse subunit; PDB: 2YU4_A 3HTK_C.
Probab=28.28  E-value=51  Score=20.34  Aligned_cols=32  Identities=19%  Similarity=0.388  Sum_probs=17.9

Q ss_pred             CCcccccccccccCChHHHHHHHHhccCCCCeecCC
Q psy5202          89 EKNKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTV  124 (343)
Q Consensus        89 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~  124 (343)
                      +.|++...|+-.|.. ..+...+   ...+...|++
T Consensus        22 ~~PV~s~~C~H~fek-~aI~~~i---~~~~~~~CPv   53 (57)
T PF11789_consen   22 EDPVKSKKCGHTFEK-EAILQYI---QRNGSKRCPV   53 (57)
T ss_dssp             SSEEEESSS--EEEH-HHHHHHC---TTTS-EE-SC
T ss_pred             hCCcCcCCCCCeecH-HHHHHHH---HhcCCCCCCC
Confidence            457777888888874 4444444   2446688887


No 282
>KOG4727|consensus
Probab=27.87  E-value=30  Score=26.63  Aligned_cols=22  Identities=18%  Similarity=0.461  Sum_probs=19.7

Q ss_pred             ccccCCCCCCCCChHHHHHHHh
Q psy5202         243 VYRCRFCEMPFSVPSTLEKHMR  264 (343)
Q Consensus       243 ~~~C~~C~~~f~~~~~l~~H~~  264 (343)
                      -|.|.+|+-.|....++.-|+.
T Consensus        75 GyyCdVCdcvvKDSinflDHiN   96 (193)
T KOG4727|consen   75 GYYCDVCDCVVKDSINFLDHIN   96 (193)
T ss_pred             ceeeeecceeehhhHHHHHHhc
Confidence            4899999999999999999984


No 283
>COG3677 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Probab=27.83  E-value=29  Score=25.68  Aligned_cols=47  Identities=19%  Similarity=0.361  Sum_probs=25.7

Q ss_pred             HHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCCh
Q psy5202         201 LTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVP  256 (343)
Q Consensus       201 l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~  256 (343)
                      ...++++..+.  -.|+.|+.....+...  ..     .+...|+|..|++.|...
T Consensus        20 ~~~~~~~~~~~--~~cP~C~s~~~~k~g~--~~-----~~~qRyrC~~C~~tf~~~   66 (129)
T COG3677          20 AAYAIRMQITK--VNCPRCKSSNVVKIGG--IR-----RGHQRYKCKSCGSTFTVE   66 (129)
T ss_pred             HHHHHhhhccc--CcCCCCCccceeeECC--cc-----ccccccccCCcCcceeee
Confidence            34445554432  3588887443111110  00     135669999999998764


No 284
>PF06397 Desulfoferrod_N:  Desulfoferrodoxin, N-terminal domain;  InterPro: IPR004462 This domain is found as essentially the full length of desulforedoxin, a 37-residue homodimeric non-haem iron protein. It is also found as the N-terminal domain of desulfoferrodoxin (rbo), a homodimeric non-haem iron protein with 2 Fe atoms per monomer in different oxidation states. This domain binds the ferric rather than the ferrous Fe of desulfoferrodoxin. Neelaredoxin, a monomeric blue non-haem iron protein, lacks this domain.; GO: 0005506 iron ion binding; PDB: 1DFX_A 1VZI_B 2JI2_D 1VZH_B 2JI3_C 2JI1_C 1VZG_A 1CFW_A 2LK5_B 1DHG_B ....
Probab=27.22  E-value=23  Score=19.56  Aligned_cols=13  Identities=31%  Similarity=0.777  Sum_probs=7.2

Q ss_pred             CcccccccccccC
Q psy5202          90 KNKECEFCGKKFK  102 (343)
Q Consensus        90 ~~~~C~~C~~~f~  102 (343)
                      +-|+|..||....
T Consensus         5 ~~YkC~~CGniVe   17 (36)
T PF06397_consen    5 EFYKCEHCGNIVE   17 (36)
T ss_dssp             EEEE-TTT--EEE
T ss_pred             cEEEccCCCCEEE
Confidence            4689999987653


No 285
>PF10122 Mu-like_Com:  Mu-like prophage protein Com;  InterPro: IPR019294  Members of this entry belong to the Com family of proteins that act as translational regulators of mom [, ]. 
Probab=27.18  E-value=22  Score=21.30  Aligned_cols=9  Identities=22%  Similarity=0.700  Sum_probs=4.8

Q ss_pred             cccCCCCCC
Q psy5202         244 YRCRFCEMP  252 (343)
Q Consensus       244 ~~C~~C~~~  252 (343)
                      .+|+.|+..
T Consensus        25 IKCpRC~ti   33 (51)
T PF10122_consen   25 IKCPRCKTI   33 (51)
T ss_pred             EECCCCCcc
Confidence            456666543


No 286
>PTZ00448 hypothetical protein; Provisional
Probab=27.17  E-value=45  Score=29.46  Aligned_cols=25  Identities=20%  Similarity=0.465  Sum_probs=21.9

Q ss_pred             ccccCCCCCCCCChHHHHHHHhhhc
Q psy5202         243 VYRCRFCEMPFSVPSTLEKHMRKCV  267 (343)
Q Consensus       243 ~~~C~~C~~~f~~~~~l~~H~~~~~  267 (343)
                      .|.|..|+..|.+....+.|+|+-+
T Consensus       314 ~~tC~~C~v~F~~~~~qR~H~KSDw  338 (373)
T PTZ00448        314 MLLCRKCNIQLMDHNAFKQHYRSEW  338 (373)
T ss_pred             CccccccccccCCHHHHHHHhhhhH
Confidence            6889999999999999999998633


No 287
>PF03811 Zn_Tnp_IS1:  InsA N-terminal domain;  InterPro: IPR003220 Insertion elements are mobile elements in DNA, usually encoding proteins required for transposition, for example transposases. Protein InsA is absolutely required for transposition of insertion element 1. This entry represents a short zinc binding domain found in IS1 InsA family protein. It is found at the N terminus of the protein and may be a DNA-binding domain.; GO: 0006313 transposition, DNA-mediated
Probab=26.94  E-value=14  Score=20.42  Aligned_cols=13  Identities=38%  Similarity=1.048  Sum_probs=6.1

Q ss_pred             HccCCCceecCcC
Q psy5202         207 SHTGEKPYKCELC  219 (343)
Q Consensus       207 ~h~~~~~~~C~~C  219 (343)
                      ...|...|+|..|
T Consensus        23 ~~~G~qryrC~~C   35 (36)
T PF03811_consen   23 SPSGHQRYRCKDC   35 (36)
T ss_pred             CCCCCEeEecCcC
Confidence            3334444555554


No 288
>COG3364 Zn-ribbon containing protein [General function prediction only]
Probab=26.33  E-value=38  Score=23.60  Aligned_cols=27  Identities=22%  Similarity=0.523  Sum_probs=18.3

Q ss_pred             cccccccccccCChHHHHHHHHhccCCCCeecCCCCc
Q psy5202          91 NKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDH  127 (343)
Q Consensus        91 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~  127 (343)
                      +++|..||..|.+-.....          --|+.||.
T Consensus         2 pH~CtrCG~vf~~g~~~il----------~GCp~CG~   28 (112)
T COG3364           2 PHQCTRCGEVFDDGSEEIL----------SGCPKCGC   28 (112)
T ss_pred             CceecccccccccccHHHH----------ccCccccc
Confidence            6789999999987543321          24777875


No 289
>KOG0782|consensus
Probab=26.12  E-value=11  Score=34.89  Aligned_cols=53  Identities=19%  Similarity=0.365  Sum_probs=35.8

Q ss_pred             HHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCChH
Q psy5202         198 CSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPS  257 (343)
Q Consensus       198 ~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~~  257 (343)
                      ...|.+|--+|.....-+|..|++.|..+..+      |. ...-...|..|...|-.+-
T Consensus       238 E~~fvrHHWVHrrRqeGkC~~CgKgFQQKf~F------hs-KEivAisCSWCKqayH~Kv  290 (1004)
T KOG0782|consen  238 ESGFVRHHWVHRRRQEGKCNTCGKGFQQKFFF------HS-KEIVAISCSWCKQAYHLKV  290 (1004)
T ss_pred             cccchHHhHhhHhhhccccchhhhhhhhheee------cc-ccEEEEEehHHHHHhhcch
Confidence            34677777666655556899999998765443      22 1223467999988887765


No 290
>TIGR00627 tfb4 transcription factor tfb4. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=25.73  E-value=52  Score=28.10  Aligned_cols=25  Identities=24%  Similarity=0.636  Sum_probs=18.0

Q ss_pred             ceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCC
Q psy5202         213 PYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPF  253 (343)
Q Consensus       213 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f  253 (343)
                      -|.|+.|.-.|-.                .+-.|+.||-.|
T Consensus       255 GyvCs~Clsi~C~----------------~p~~C~~Cgt~f  279 (279)
T TIGR00627       255 GFVCSVCLSVLCQ----------------YTPICKTCKTAF  279 (279)
T ss_pred             eEECCCccCCcCC----------------CCCCCCCCCCCC
Confidence            3889998876642                234799998776


No 291
>PF07800 DUF1644:  Protein of unknown function (DUF1644);  InterPro: IPR012866 This family consists of sequences found in a number of hypothetical plant proteins of unknown function. The region of interest contains nine highly conserved cysteine residues and is approximately 160 amino acids in length, which probably represent a zinc-binding domain. 
Probab=25.47  E-value=1.7e+02  Score=22.47  Aligned_cols=55  Identities=16%  Similarity=0.284  Sum_probs=33.8

Q ss_pred             CcccccccccccCChHHHHHHHHhccCCCCeecCCCCccC-CCHHHHHHHHHHhccC
Q psy5202          90 KNKECEFCGKKFKYENTLQAHRRNHTGEKPFKCTVCDHAC-SQSAKLKKHMKIHDED  145 (343)
Q Consensus        90 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f-~~~~~L~~H~~~h~~~  145 (343)
                      ....|+.|.....-..-. .-.+.+...|+-.|..=+-.| .+...|++|.+..|..
T Consensus        79 ~~L~CPLCRG~V~GWtvv-e~AR~~LN~K~RsC~~e~C~F~GtY~eLrKHar~~HP~  134 (162)
T PF07800_consen   79 PELACPLCRGEVKGWTVV-EPARRFLNAKKRSCSQESCSFSGTYSELRKHARSEHPS  134 (162)
T ss_pred             ccccCccccCceeceEEc-hHHHHHhccCCccCcccccccccCHHHHHHHHHhhCCC
Confidence            457899995443322222 224555566777776633333 4677899999987754


No 292
>PF14369 zf-RING_3:  zinc-finger
Probab=25.23  E-value=51  Score=17.99  Aligned_cols=10  Identities=30%  Similarity=0.653  Sum_probs=6.4

Q ss_pred             ccCCCCCCCC
Q psy5202         245 RCRFCEMPFS  254 (343)
Q Consensus       245 ~C~~C~~~f~  254 (343)
                      .|+.|+-.|.
T Consensus        23 ~CP~C~~gFv   32 (35)
T PF14369_consen   23 ACPRCHGGFV   32 (35)
T ss_pred             CCcCCCCcEe
Confidence            4777776663


No 293
>PF11023 DUF2614:  Protein of unknown function (DUF2614);  InterPro: IPR020912 This entry describes proteins of unknown function, which are thought to be membrane proteins.; GO: 0005887 integral to plasma membrane
Probab=25.08  E-value=43  Score=23.88  Aligned_cols=12  Identities=17%  Similarity=0.426  Sum_probs=6.7

Q ss_pred             CceecCcCcccc
Q psy5202         212 KPYKCELCSYAC  223 (343)
Q Consensus       212 ~~~~C~~C~~~f  223 (343)
                      ...+|+.|++..
T Consensus        68 v~V~CP~C~K~T   79 (114)
T PF11023_consen   68 VQVECPNCGKQT   79 (114)
T ss_pred             eeeECCCCCChH
Confidence            345666666543


No 294
>TIGR00416 sms DNA repair protein RadA. The gene protuct codes for a probable ATP-dependent protease involved in both DNA repair and degradation of proteins, peptides, glycopeptides. Also known as sms. Residues 11-28 of the SEED alignment contain a putative Zn binding domain. Residues 110-117 of the seed contain a putative ATP binding site both documented in Haemophilus and in Listeria monocytogenes. for E.coli see ( J. BACTERIOL. 178:5045-5048(1996)).
Probab=24.71  E-value=42  Score=31.01  Aligned_cols=25  Identities=24%  Similarity=0.463  Sum_probs=17.7

Q ss_pred             CCcccccCcccccCHHHHHHHHHHccCCCceecCcCccc
Q psy5202         184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYA  222 (343)
Q Consensus       184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~  222 (343)
                      ..|.|..||..+..+              .++|+.|+.+
T Consensus         6 ~~y~C~~Cg~~~~~~--------------~g~Cp~C~~w   30 (454)
T TIGR00416         6 SKFVCQHCGADSPKW--------------QGKCPACHAW   30 (454)
T ss_pred             CeEECCcCCCCCccc--------------cEECcCCCCc
Confidence            468899998877553              2578888754


No 295
>KOG3002|consensus
Probab=24.64  E-value=86  Score=27.14  Aligned_cols=79  Identities=15%  Similarity=0.249  Sum_probs=53.9

Q ss_pred             CCcccccCcccccCHHHHHHHHHHccCCCceecCc----CcccccChHHHHHHHHHhcCCCCcccccCC----CCCCCCC
Q psy5202         184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCEL----CSYACAQSSKLTRHMKTHGRLGKDVYRCRF----CEMPFSV  255 (343)
Q Consensus       184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~----C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~----C~~~f~~  255 (343)
                      ....|+.|...+.....+  +|..-.....+.|+.    |.+.|..... ..|.+...   -++|.|+.    |.+. ..
T Consensus        79 ~~~~CP~Cr~~~g~~R~~--amEkV~e~~~vpC~~~~~GC~~~~~Y~~~-~~HE~~C~---f~~~~CP~p~~~C~~~-G~  151 (299)
T KOG3002|consen   79 VSNKCPTCRLPIGNIRCR--AMEKVAEAVLVPCKNAKLGCTKSFPYGEK-SKHEKVCE---FRPCSCPVPGAECKYT-GS  151 (299)
T ss_pred             hcccCCccccccccHHHH--HHHHHHHhceecccccccCCceeeccccc-cccccccc---cCCcCCCCCcccCCcc-Cc
Confidence            567899999999866444  444334445677864    9999988777 67777663   36777765    5544 45


Q ss_pred             hHHHHHHHhhhcCc
Q psy5202         256 PSTLEKHMRKCVVN  269 (343)
Q Consensus       256 ~~~l~~H~~~~~~~  269 (343)
                      ...|..|.+.-|..
T Consensus       152 ~~~l~~H~~~~hk~  165 (299)
T KOG3002|consen  152 YKDLYAHLNDTHKS  165 (299)
T ss_pred             HHHHHHHHHhhChh
Confidence            66788898866544


No 296
>PRK06260 threonine synthase; Validated
Probab=24.38  E-value=51  Score=29.87  Aligned_cols=10  Identities=40%  Similarity=0.849  Sum_probs=5.6

Q ss_pred             cccccCcccc
Q psy5202         186 DTCEFCGKVF  195 (343)
Q Consensus       186 ~~C~~C~~~f  195 (343)
                      ++|..||+.|
T Consensus         4 ~~C~~cg~~~   13 (397)
T PRK06260          4 LKCIECGKEY   13 (397)
T ss_pred             EEECCCCCCC
Confidence            4555555555


No 297
>PF09332 Mcm10:  Mcm10 replication factor;  InterPro: IPR015411 Mcm10 is a eukaryotic DNA replication factor that regulates the stability and chromatin association of DNA polymerase alpha []. ; PDB: 2KWQ_A.
Probab=24.36  E-value=18  Score=31.72  Aligned_cols=12  Identities=25%  Similarity=0.606  Sum_probs=4.5

Q ss_pred             ceecCcCccccc
Q psy5202         213 PYKCELCSYACA  224 (343)
Q Consensus       213 ~~~C~~C~~~f~  224 (343)
                      .|+|..|++.+.
T Consensus       285 FFkC~~C~~Rt~  296 (344)
T PF09332_consen  285 FFKCKDCGNRTI  296 (344)
T ss_dssp             EEE-T-TS-EEE
T ss_pred             eEECCCCCCeee
Confidence            355555555443


No 298
>PRK11823 DNA repair protein RadA; Provisional
Probab=24.26  E-value=43  Score=30.89  Aligned_cols=25  Identities=20%  Similarity=0.386  Sum_probs=17.8

Q ss_pred             CCcccccCcccccCHHHHHHHHHHccCCCceecCcCccc
Q psy5202         184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYA  222 (343)
Q Consensus       184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~  222 (343)
                      ..|.|..||..+..+              .++|+.|+.+
T Consensus         6 ~~y~C~~Cg~~~~~~--------------~g~Cp~C~~w   30 (446)
T PRK11823          6 TAYVCQECGAESPKW--------------LGRCPECGAW   30 (446)
T ss_pred             CeEECCcCCCCCccc--------------CeeCcCCCCc
Confidence            568899998877554              2578888754


No 299
>PF02748 PyrI_C:  Aspartate carbamoyltransferase regulatory chain, metal binding domain;  InterPro: IPR020542 Aspartate carbamoyltransferase (aspartate transcarbamylase, ATCase) 2.1.3.2 from EC is an allosteric enzyme that plays a central role in the regulation of the pyrimidine pathway in bacteria. The holoenzyme is a dodecamer composed of six catalytic chains, each with an active site, and six regulatory chains lacking catalytic activity []. The catalytic subunits exist as a dimer of catalytic trimers, (c3)2, while the regulatory subunits exist as a trimer of regulatory dimers, (r2)3, therefore the complete holoenzyme can be represented as (c3)2(r2)3. The association of the catalytic subunits c3 with the regulatory subunits r2 is responsible for the establishment of positive co-operativity between catalytic sites for the binding of aspartate and it dictates the pattern of allosteric response toward nucleotide effectors. ATCase from Escherichia coli is the most extensively studied allosteric enzyme []. The crystal structure of the T-state, the T-state with CTP bound, the R-state with N-phosphonacetyl-L-aspartate (PALA) bound, and the R-state with phosphonoacetamide plus malonate bound have been used in interpreting kinetic and mutational studies. A high-resolution structure of E. coli ATCase in the presence of PALA (a bisubstrate analog) allows a detailed description of the binding at the active site of the enzyme and allows a detailed model of the tetrahedral intermediate to be constructed. The entire regulatory chain has been traced showing that the N-terminal regions of the regulatory chains R1 and R6 are located in close proximity to each other and to the regulatory site. This portion of the molecule may be involved in the observed asymmetry between the regulatory binding sites as well as in the heterotropic response of the enzyme []. The C-terminal domain of the regulatory chains have a rubredoxin-like zinc-bound fold.  ATCase from Enterobacter agglomerans (Erwinia herbicola) (Pantoea agglomerans) differs from the other investigated enterobacterial ATCases by its absence of homotropic co-operativity toward the substrate aspartate and its lack of response to ATP which is an allosteric effector (activator) of this family of enzymes. Nevertheless, the E. herbicola ATCase has the same quaternary structure, two trimers of catalytic chains with three dimers of regulatory chains, (c3)2(r2)3, as other enterobacterial ATCases and shows extensive primary structure conservation [].  This entry represents the C-terminal domain.; PDB: 2YWW_B 1SKU_D 1Q95_L 8ATC_B 3AT1_D 1RAI_D 4E2F_D 1NBE_B 6AT1_B 2FZC_D ....
Probab=23.97  E-value=32  Score=20.78  Aligned_cols=19  Identities=16%  Similarity=0.553  Sum_probs=12.4

Q ss_pred             CCCCCcccccccccccCCh
Q psy5202          86 SQEEKNKECEFCGKKFKYE  104 (343)
Q Consensus        86 ~~~~~~~~C~~C~~~f~~~  104 (343)
                      ......|+|.+|++.|...
T Consensus        30 ~~~~~~~rC~YCe~~~~~~   48 (52)
T PF02748_consen   30 DKEPIKLRCHYCERIITED   48 (52)
T ss_dssp             ETTTCEEEETTT--EEEHH
T ss_pred             eCCCCEEEeeCCCCEeccc
Confidence            3455679999999988643


No 300
>PF02591 DUF164:  Putative zinc ribbon domain;  InterPro: IPR003743 This entry describes proteins of unknown function.
Probab=23.96  E-value=62  Score=19.72  Aligned_cols=31  Identities=19%  Similarity=0.313  Sum_probs=15.3

Q ss_pred             ccccCcccccCHHHHHHHHHHccCCCceecCcCcc
Q psy5202         187 TCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSY  221 (343)
Q Consensus       187 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~  221 (343)
                      .|.-|+.........    .+..++....|+.||+
T Consensus        24 ~C~gC~~~l~~~~~~----~i~~~~~i~~Cp~CgR   54 (56)
T PF02591_consen   24 TCSGCHMELPPQELN----EIRKGDEIVFCPNCGR   54 (56)
T ss_pred             ccCCCCEEcCHHHHH----HHHcCCCeEECcCCCc
Confidence            566666665443221    1122345556666664


No 301
>KOG4317|consensus
Probab=23.75  E-value=21  Score=30.47  Aligned_cols=21  Identities=24%  Similarity=0.450  Sum_probs=12.5

Q ss_pred             CCcccccCcccccCHHHHHHH
Q psy5202         184 RNDTCEFCGKVFKNCSNLTVH  204 (343)
Q Consensus       184 ~~~~C~~C~~~f~~~~~l~~H  204 (343)
                      +.|+|+.|+..|-+..-++.|
T Consensus        18 ~~YtCPRCn~~YCsl~CYr~h   38 (383)
T KOG4317|consen   18 REYTCPRCNLLYCSLKCYRNH   38 (383)
T ss_pred             ccccCCCCCccceeeeeecCC
Confidence            447777777776554443333


No 302
>PF13639 zf-RING_2:  Ring finger domain; PDB: 2KIZ_A 4EPO_C 1IYM_A 2EP4_A 2ECT_A 2JRJ_A 2ECN_A 2ECM_A 3NG2_A 2EA6_A ....
Probab=23.68  E-value=88  Score=17.69  Aligned_cols=15  Identities=33%  Similarity=0.379  Sum_probs=7.8

Q ss_pred             CCceecCcCcccccCh
Q psy5202         211 EKPYKCELCSYACAQS  226 (343)
Q Consensus       211 ~~~~~C~~C~~~f~~~  226 (343)
                      +..+..+ |+..|-..
T Consensus        14 ~~~~~l~-C~H~fh~~   28 (44)
T PF13639_consen   14 EKVVKLP-CGHVFHRS   28 (44)
T ss_dssp             SCEEEET-TSEEEEHH
T ss_pred             CeEEEcc-CCCeeCHH
Confidence            3444444 66666543


No 303
>PF03145 Sina:  Seven in absentia protein family;  InterPro: IPR018121 The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results in the transformation of the R7 precursor cell to a non-neuronal cell type. The Sina protein contains an N-terminal RING finger domain C3HC4-type. Through this domain, Sina binds E2 ubiquitin-conjugating enzymes (UbcD1) Sina also interacts with Tramtrack (TTK88) via PHYL. Tramtrack is a transcriptional repressor that blocks photoreceptor determination, while PHYL down-regulates the activity of TTK88. In turn, the activity of PHYL requires the activation of the Sevenless receptor tyrosine kinase, a process essential for R7 determination. It is thought that Sina targets TTK88 for degradation, therefore promoting the R7 pathway. Murine and human homologues of Sina have also been identified. The human homologue Siah-1 [] also binds E2 enzymes (UbcH5) and through a series of physical interactions, targets beta-catenin for ubiquitin degradation. Siah-1 expression is enhanced by p53, itself promoted by DNA damage. Thus this pathway links DNA damage to beta-catenin degradation [, ]. Sina proteins, therefore, physically interact with a variety of proteins. The N-terminal RING finger domain that binds ubiquitin conjugating enzymes is a C3HC4-type, and does not form part of the alignment for this family. The remainder C-terminal part is involved in interactions with other proteins, and is included in this alignment. In addition to the Drosophila protein and mammalian homologues, whose similarity was noted previously, this family also includes putative homologues from Caenorhabditis elegans, Arabidopsis thaliana.; GO: 0006511 ubiquitin-dependent protein catabolic process, 0007275 multicellular organismal development, 0005634 nucleus; PDB: 2A25_A 1K2F_B 2AN6_C.
Probab=23.53  E-value=56  Score=26.19  Aligned_cols=52  Identities=19%  Similarity=0.404  Sum_probs=30.8

Q ss_pred             cccccc----cccccCChHHHHHHHHhccCCCCeecCC----CCccCCCHHHHHHHHHHhccC
Q psy5202          91 NKECEF----CGKKFKYENTLQAHRRNHTGEKPFKCTV----CDHACSQSAKLKKHMKIHDED  145 (343)
Q Consensus        91 ~~~C~~----C~~~f~~~~~L~~H~~~h~~~~~~~C~~----C~~~f~~~~~L~~H~~~h~~~  145 (343)
                      .|.|..    |...|. ......|.... .-+|+.|+.    |+..- ....|..|+...|..
T Consensus        14 ~~pC~~~~~GC~~~~~-~~~~~~HE~~C-~~~p~~CP~~~~~C~~~G-~~~~l~~Hl~~~H~~   73 (198)
T PF03145_consen   14 KFPCKNAKYGCTETFP-YSEKREHEEEC-PFRPCSCPFPGSGCDWQG-SYKELLDHLRDKHSW   73 (198)
T ss_dssp             -EE-CCGGGT---EE--GGGHHHHHHT--TTSEEE-SSSSTT---EE-ECCCHHHHHHHHTTT
T ss_pred             eecCCCCCCCCccccc-ccChhhHhccC-CCcCCcCCCCCCCccccC-CHHHHHHHHHHHCCC
Confidence            477887    887765 45678888765 568899998    76544 345899999988865


No 304
>KOG1994|consensus
Probab=23.48  E-value=44  Score=27.11  Aligned_cols=22  Identities=23%  Similarity=0.526  Sum_probs=18.2

Q ss_pred             CceecCcCcccccChHHHHHHH
Q psy5202         212 KPYKCELCSYACAQSSKLTRHM  233 (343)
Q Consensus       212 ~~~~C~~C~~~f~~~~~L~~H~  233 (343)
                      .-|-|-.||..|.+...|..|.
T Consensus       238 eh~YC~fCG~~y~~~edl~ehC  259 (268)
T KOG1994|consen  238 EHYYCFFCGIKYKDEEDLYEHC  259 (268)
T ss_pred             cceEEEEeccccCCHHHHHHhC
Confidence            3488999999999999998873


No 305
>KOG4727|consensus
Probab=23.32  E-value=61  Score=25.07  Aligned_cols=25  Identities=16%  Similarity=0.284  Sum_probs=20.3

Q ss_pred             CCCcccccccccccCChHHHHHHHH
Q psy5202          88 EEKNKECEFCGKKFKYENTLQAHRR  112 (343)
Q Consensus        88 ~~~~~~C~~C~~~f~~~~~L~~H~~  112 (343)
                      ...-|-|.+|+-+|....++..|+.
T Consensus        72 q~~GyyCdVCdcvvKDSinflDHiN   96 (193)
T KOG4727|consen   72 QKGGYYCDVCDCVVKDSINFLDHIN   96 (193)
T ss_pred             ccCceeeeecceeehhhHHHHHHhc
Confidence            3455889999999998888888874


No 306
>PF01286 XPA_N:  XPA protein N-terminal;  InterPro: IPR022652 Xeroderma pigmentosum (XP) [] is a human autosomal recessive disease, characterised by a high incidence of sunlight-induced skin cancer. Skin cells of individual's with this condition are hypersensitive to ultraviolet light, due to defects in the incision step of DNA excision repair. There are a minimum of seven genetic complementation groups involved in this pathway: XP-A to XP-G. XP-A is the most severe form of the disease and is due to defects in a 30 kDa nuclear protein called XPA (or XPAC) []. The sequence of the XPA protein is conserved from higher eukaryotes [] to yeast (gene RAD14) []. XPA is a hydrophilic protein of 247 to 296 amino-acid residues which has a C4-type zinc finger motif in its central section. This entry contains the zinc-finger containing region in the XPA protein. It is found N-terminal to PF05181 from PFAM ; PDB: 1D4U_A 1XPA_A.
Probab=23.31  E-value=24  Score=19.18  Aligned_cols=9  Identities=22%  Similarity=0.741  Sum_probs=2.7

Q ss_pred             cCcCccccc
Q psy5202         216 CELCSYACA  224 (343)
Q Consensus       216 C~~C~~~f~  224 (343)
                      |..|++.|.
T Consensus         6 C~eC~~~f~   14 (34)
T PF01286_consen    6 CDECGKPFM   14 (34)
T ss_dssp             -TTT--EES
T ss_pred             HhHhCCHHH
Confidence            444444443


No 307
>PRK05978 hypothetical protein; Provisional
Probab=23.30  E-value=32  Score=26.16  Aligned_cols=10  Identities=30%  Similarity=0.856  Sum_probs=5.5

Q ss_pred             ccCCCCCCCC
Q psy5202         245 RCRFCEMPFS  254 (343)
Q Consensus       245 ~C~~C~~~f~  254 (343)
                      .|+.||..|.
T Consensus        54 ~C~~CG~~~~   63 (148)
T PRK05978         54 HCAACGEDFT   63 (148)
T ss_pred             CccccCCccc
Confidence            3666665553


No 308
>COG4338 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=22.89  E-value=35  Score=20.01  Aligned_cols=16  Identities=31%  Similarity=0.679  Sum_probs=11.9

Q ss_pred             ccCCCCCCCCChHHHH
Q psy5202         245 RCRFCEMPFSVPSTLE  260 (343)
Q Consensus       245 ~C~~C~~~f~~~~~l~  260 (343)
                      .|+.|++-|.......
T Consensus        14 ICpvCqRPFsWRkKW~   29 (54)
T COG4338          14 ICPVCQRPFSWRKKWA   29 (54)
T ss_pred             hhhhhcCchHHHHHHH
Confidence            5999999998765443


No 309
>PF04641 Rtf2:  Rtf2 RING-finger
Probab=22.83  E-value=74  Score=26.86  Aligned_cols=51  Identities=20%  Similarity=0.400  Sum_probs=31.8

Q ss_pred             CCcccccCcccccCHHHHHHHHHHccCCCceecC-cCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCCh
Q psy5202         184 RNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCE-LCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVP  256 (343)
Q Consensus       184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~-~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~  256 (343)
                      ..|.|++.++.|...             ..|.+- -||-.|.....=..  .       .-..|+.||..|...
T Consensus       112 ~~~~CPvt~~~~~~~-------------~~fv~l~~cG~V~s~~alke~--k-------~~~~Cp~c~~~f~~~  163 (260)
T PF04641_consen  112 GRFICPVTGKEFNGK-------------HKFVYLRPCGCVFSEKALKEL--K-------KSKKCPVCGKPFTEE  163 (260)
T ss_pred             ceeECCCCCcccCCc-------------eeEEEEcCCCCEeeHHHHHhh--c-------ccccccccCCccccC
Confidence            679999998888443             234443 48877765432211  1       123499999999754


No 310
>PF14447 Prok-RING_4:  Prokaryotic RING finger family 4
Probab=22.55  E-value=48  Score=20.27  Aligned_cols=12  Identities=42%  Similarity=1.115  Sum_probs=7.2

Q ss_pred             cccCCCCCCCCC
Q psy5202         244 YRCRFCEMPFSV  255 (343)
Q Consensus       244 ~~C~~C~~~f~~  255 (343)
                      --|++|++.|..
T Consensus        40 ngCPfC~~~~~~   51 (55)
T PF14447_consen   40 NGCPFCGTPFEF   51 (55)
T ss_pred             cCCCCCCCcccC
Confidence            347777766643


No 311
>PF11781 RRN7:  RNA polymerase I-specific transcription initiation factor Rrn7;  InterPro: IPR021752  Rrn7 is a transcription binding factor that associates strongly with both Rrn6 and Rrn11 to form a complex which itself binds the TATA-binding protein and is required for transcription by the core domain of the RNA PolI promoter [],[]. 
Probab=22.53  E-value=41  Score=18.49  Aligned_cols=7  Identities=43%  Similarity=1.298  Sum_probs=3.4

Q ss_pred             cccCccc
Q psy5202         188 CEFCGKV  194 (343)
Q Consensus       188 C~~C~~~  194 (343)
                      |..|+..
T Consensus        11 C~~C~~~   17 (36)
T PF11781_consen   11 CPVCGSR   17 (36)
T ss_pred             CCCCCCe
Confidence            5555544


No 312
>COG4391 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=22.10  E-value=42  Score=20.98  Aligned_cols=15  Identities=20%  Similarity=0.913  Sum_probs=11.6

Q ss_pred             CCCcccccccccccC
Q psy5202          88 EEKNKECEFCGKKFK  102 (343)
Q Consensus        88 ~~~~~~C~~C~~~f~  102 (343)
                      .++.-.|++|+..|.
T Consensus        45 ~~gev~CPYC~t~y~   59 (62)
T COG4391          45 DEGEVVCPYCSTRYR   59 (62)
T ss_pred             CCCcEecCccccEEE
Confidence            345568999999886


No 313
>PLN03238 probable histone acetyltransferase MYST; Provisional
Probab=21.97  E-value=80  Score=26.94  Aligned_cols=26  Identities=19%  Similarity=0.383  Sum_probs=20.7

Q ss_pred             CCCeecCCCCccCCCHHHHHHHHHHh
Q psy5202         117 EKPFKCTVCDHACSQSAKLKKHMKIH  142 (343)
Q Consensus       117 ~~~~~C~~C~~~f~~~~~L~~H~~~h  142 (343)
                      ...|.|+.|-+-|.+...|.+|+...
T Consensus        46 ~~lyiCe~Clky~~~~~~l~~H~~~C   71 (290)
T PLN03238         46 TKLYICEYCLKYMRKKKSLLRHLAKC   71 (290)
T ss_pred             CeEEEcCCCcchhCCHHHHHHHHHhC
Confidence            45688888888888888888888743


No 314
>PF14255 Cys_rich_CPXG:  Cysteine-rich CPXCG
Probab=21.88  E-value=40  Score=20.40  Aligned_cols=10  Identities=30%  Similarity=1.155  Sum_probs=6.6

Q ss_pred             ccCCCCCCCC
Q psy5202         245 RCRFCEMPFS  254 (343)
Q Consensus       245 ~C~~C~~~f~  254 (343)
                      .|++||..+.
T Consensus         2 ~CPyCge~~~   11 (52)
T PF14255_consen    2 QCPYCGEPIE   11 (52)
T ss_pred             CCCCCCCeeE
Confidence            5777776653


No 315
>KOG3214|consensus
Probab=21.68  E-value=30  Score=23.93  Aligned_cols=16  Identities=31%  Similarity=0.671  Sum_probs=10.9

Q ss_pred             cccccCCCCCCCCChH
Q psy5202         242 DVYRCRFCEMPFSVPS  257 (343)
Q Consensus       242 ~~~~C~~C~~~f~~~~  257 (343)
                      ....|.+|+..|....
T Consensus        46 G~~sC~iC~esFqt~i   61 (109)
T KOG3214|consen   46 GKASCRICEESFQTTI   61 (109)
T ss_pred             ceeeeeehhhhhccch
Confidence            3456888888776654


No 316
>PTZ00448 hypothetical protein; Provisional
Probab=21.62  E-value=62  Score=28.61  Aligned_cols=23  Identities=13%  Similarity=0.427  Sum_probs=20.9

Q ss_pred             ceecCcCcccccChHHHHHHHHH
Q psy5202         213 PYKCELCSYACAQSSKLTRHMKT  235 (343)
Q Consensus       213 ~~~C~~C~~~f~~~~~L~~H~~~  235 (343)
                      .|.|..|+..|.+....+.|++.
T Consensus       314 ~~tC~~C~v~F~~~~~qR~H~KS  336 (373)
T PTZ00448        314 MLLCRKCNIQLMDHNAFKQHYRS  336 (373)
T ss_pred             CccccccccccCCHHHHHHHhhh
Confidence            58899999999999999999886


No 317
>KOG4602|consensus
Probab=21.53  E-value=52  Score=27.24  Aligned_cols=66  Identities=17%  Similarity=0.440  Sum_probs=35.2

Q ss_pred             CCcccccCcccccCHHHHHHHHHHccC-------CCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCCCCCCCh
Q psy5202         184 RNDTCEFCGKVFKNCSNLTVHRRSHTG-------EKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVP  256 (343)
Q Consensus       184 ~~~~C~~C~~~f~~~~~l~~H~~~h~~-------~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~~~f~~~  256 (343)
                      ++..|.+|.-.-.....+..|...-..       -+.|.|++||.+-                 +..+.=.+|.+.|...
T Consensus       232 r~l~CvFC~nN~E~~A~y~tH~lkd~dgRVLCPkLR~YVCPiCGATg-----------------DnAHTiKyCPl~~~~~  294 (318)
T KOG4602|consen  232 RPLCCVFCFNNAEEFARYHTHPLKDKDGRVLCPKLRSYVCPICGATG-----------------DNAHTIKYCPLAFGDD  294 (318)
T ss_pred             CceeEEeecCCCchhhheecccccCCCCcEechhHhhhcCccccccC-----------------CcccceecccccCCCC
Confidence            455677776555555555555332111       2457788887432                 2233334566666666


Q ss_pred             HHHHHHHhhh
Q psy5202         257 STLEKHMRKC  266 (343)
Q Consensus       257 ~~l~~H~~~~  266 (343)
                      .....|-..+
T Consensus       295 ~s~~~~~e~~  304 (318)
T KOG4602|consen  295 TSVYSHTERN  304 (318)
T ss_pred             ccccchhhhc
Confidence            6666665443


No 318
>COG5027 SAS2 Histone acetyltransferase (MYST family) [Chromatin structure and dynamics]
Probab=21.29  E-value=35  Score=29.73  Aligned_cols=30  Identities=30%  Similarity=0.661  Sum_probs=24.1

Q ss_pred             CCcccccCCCCCCCCChHHHHHHHhhhcCc
Q psy5202         240 GKDVYRCRFCEMPFSVPSTLEKHMRKCVVN  269 (343)
Q Consensus       240 ~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~  269 (343)
                      ..-.|.|..|-+-|.+...|.+|.++|.-.
T Consensus       155 ~d~vyICefClkY~~s~~~~~rH~~kC~~~  184 (395)
T COG5027         155 LDIVYICEFCLKYYGSQTSLVRHRKKCSLQ  184 (395)
T ss_pred             cceEEEhhhhHHHhcchhHHHHHHhcCcCc
Confidence            345688999999999999999998877544


No 319
>PLN03239 histone acetyltransferase; Provisional
Probab=21.18  E-value=82  Score=27.74  Aligned_cols=21  Identities=33%  Similarity=0.395  Sum_probs=9.9

Q ss_pred             ceecCcCcccccChHHHHHHH
Q psy5202         213 PYKCELCSYACAQSSKLTRHM  233 (343)
Q Consensus       213 ~~~C~~C~~~f~~~~~L~~H~  233 (343)
                      -|.|..|-+-|.+...|.+|+
T Consensus       106 lYiCE~Clky~~~~~~l~~H~  126 (351)
T PLN03239        106 LYVCEFSFGFFARKSELLRFQ  126 (351)
T ss_pred             EEEeccchhhhcCHHHHHHHH
Confidence            344444444444444444443


No 320
>cd01121 Sms Sms (bacterial radA) DNA repair protein. This protein is not related to archael radA any more than is to other RecA-like NTPases. Sms has a role in recombination and recombinational repair and is responsible for the stabilization or processing of branched DNA molecules.
Probab=20.96  E-value=55  Score=29.38  Aligned_cols=8  Identities=38%  Similarity=0.933  Sum_probs=3.9

Q ss_pred             CcccccCc
Q psy5202         185 NDTCEFCG  192 (343)
Q Consensus       185 ~~~C~~C~  192 (343)
                      ..+|+.|+
T Consensus        14 ~g~cp~c~   21 (372)
T cd01121          14 LGKCPECG   21 (372)
T ss_pred             cEECcCCC
Confidence            34455554


No 321
>PRK12860 transcriptional activator FlhC; Provisional
Probab=20.74  E-value=67  Score=25.55  Aligned_cols=48  Identities=10%  Similarity=0.135  Sum_probs=30.2

Q ss_pred             cccccCHHHHHHHHHHccCCCceecCcCcccccChHHHHHHHHHhcCCCCcccccCCCC
Q psy5202         192 GKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCE  250 (343)
Q Consensus       192 ~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~C~~C~  250 (343)
                      --.|.....|.+.+... .-..-.|..||-.|.....          .....|.|+.|.
T Consensus       114 ~Ls~tRAw~LvRf~~s~-~L~l~~C~~Cgg~fv~~~~----------e~~~~f~CplC~  161 (189)
T PRK12860        114 VLDLTRAWTLVRFFDAG-MLQLARCCRCGGKFVTHAH----------DLRHNFVCGLCQ  161 (189)
T ss_pred             eecHHHHHHHHHHhcCC-CeeeccCCCCCCCeecccc----------ccCCCCcCCCCC
Confidence            34566677777776643 2344678888888764321          133468899886


No 322
>PF09082 DUF1922:  Domain of unknown function (DUF1922);  InterPro: IPR015166 Members of this family consist of a beta-sheet region followed by an alpha-helix and an unstructured C terminus. The beta-sheet region contains a CXCX...XCXC sequence with Cys residues located in two proximal loops and pointing towards each other. This precise function of this set of bacterial proteins is, as yet, unknown []. ; PDB: 1GH9_A.
Probab=20.64  E-value=23  Score=22.75  Aligned_cols=13  Identities=23%  Similarity=0.777  Sum_probs=7.0

Q ss_pred             CCcccccCCCCCCC
Q psy5202         240 GKDVYRCRFCEMPF  253 (343)
Q Consensus       240 ~~~~~~C~~C~~~f  253 (343)
                      +.+.-+| .||+..
T Consensus        17 ~~kTkkC-~CG~~l   29 (68)
T PF09082_consen   17 GAKTKKC-VCGKTL   29 (68)
T ss_dssp             T-SEEEE-TTTEEE
T ss_pred             CcceeEe-cCCCee
Confidence            3445567 777653


No 323
>PF14787 zf-CCHC_5:  GAG-polyprotein viral zinc-finger; PDB: 1CL4_A 1DSV_A.
Probab=20.28  E-value=54  Score=18.02  Aligned_cols=14  Identities=29%  Similarity=0.465  Sum_probs=8.5

Q ss_pred             ccCCCCCCCCChHH
Q psy5202         245 RCRFCEMPFSVPST  258 (343)
Q Consensus       245 ~C~~C~~~f~~~~~  258 (343)
                      .|+.|++.|-....
T Consensus         4 ~CprC~kg~Hwa~~   17 (36)
T PF14787_consen    4 LCPRCGKGFHWASE   17 (36)
T ss_dssp             C-TTTSSSCS-TTT
T ss_pred             cCcccCCCcchhhh
Confidence            58888888866544


No 324
>KOG1729|consensus
Probab=20.17  E-value=39  Score=28.93  Aligned_cols=7  Identities=43%  Similarity=1.180  Sum_probs=4.1

Q ss_pred             ccccCcc
Q psy5202         187 TCEFCGK  193 (343)
Q Consensus       187 ~C~~C~~  193 (343)
                      .|.+|++
T Consensus       170 ~C~~C~~  176 (288)
T KOG1729|consen  170 ECMVCGC  176 (288)
T ss_pred             ecccCCC
Confidence            4666665


Done!