Query         psy679
Match_columns 510
No_of_seqs    338 out of 1804
Neff          5.1 
Searched_HMMs 46136
Date          Fri Aug 16 18:25:51 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy679.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/679hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG2462|consensus               99.8 3.8E-19 8.2E-24  177.1   7.3  110  371-484   159-268 (279)
  2 KOG2462|consensus               99.7 1.1E-17 2.3E-22  166.8   2.6   83  369-453   183-265 (279)
  3 KOG3576|consensus               99.5 4.9E-15 1.1E-19  142.5   4.8   96  370-467   114-220 (267)
  4 KOG3623|consensus               99.5 1.6E-14 3.4E-19  157.8   2.3   90  371-462   892-982 (1007)
  5 KOG3576|consensus               99.3 3.3E-12 7.2E-17  123.2   4.8  101  358-460   130-241 (267)
  6 KOG3623|consensus               99.2 3.4E-12 7.3E-17  140.0   2.0   80  371-452   238-330 (1007)
  7 KOG1074|consensus               99.0 2.9E-10 6.3E-15  126.8   6.1   53  373-427   353-405 (958)
  8 KOG1074|consensus               98.9 1.4E-10 3.1E-15  129.2   0.1   84  371-456   603-693 (958)
  9 PHA00733 hypothetical protein   98.8 2.9E-09 6.2E-14   96.9   3.6   85  368-456    35-124 (128)
 10 KOG3608|consensus               98.7 2.6E-09 5.6E-14  110.1   1.0  101  363-467   197-300 (467)
 11 KOG3608|consensus               98.7 5.7E-09 1.2E-13  107.7   2.9  102  369-475   233-337 (467)
 12 PLN03086 PRLI-interacting fact  98.6 3.2E-08 6.9E-13  108.9   5.4   95  370-472   450-554 (567)
 13 PHA02768 hypothetical protein;  98.5 2.5E-08 5.5E-13   78.2   0.7   42  374-419     6-47  (55)
 14 PHA02768 hypothetical protein;  98.3 2.7E-07 5.9E-12   72.5   2.6   44  403-448     5-48  (55)
 15 PF13465 zf-H2C2_2:  Zinc-finge  98.1 7.1E-07 1.5E-11   59.7   1.1   24  391-414     2-25  (26)
 16 PF13465 zf-H2C2_2:  Zinc-finge  98.1 2.1E-06 4.5E-11   57.5   2.5   26  418-443     1-26  (26)
 17 PLN03086 PRLI-interacting fact  98.1 4.1E-06 8.8E-11   92.6   5.7   78  372-455   477-564 (567)
 18 PHA00733 hypothetical protein   98.0 6.6E-06 1.4E-10   75.0   4.0   56  369-428    69-124 (128)
 19 KOG3993|consensus               97.9 2.3E-06 4.9E-11   90.5   0.2   87  373-461   267-386 (500)
 20 COG5189 SFP1 Putative transcri  97.9 6.2E-06 1.3E-10   84.6   2.4   73  370-452   346-419 (423)
 21 PHA00732 hypothetical protein   97.7 1.8E-05   4E-10   66.5   1.9   47  373-427     1-48  (79)
 22 PHA00616 hypothetical protein   97.6 1.9E-05 4.2E-10   59.5   1.2   31  431-461     1-31  (44)
 23 KOG3993|consensus               97.4 4.6E-05   1E-09   80.8   1.1   53  373-427   295-380 (500)
 24 PHA00616 hypothetical protein   97.3 9.9E-05 2.1E-09   55.7   1.6   34  373-408     1-34  (44)
 25 PHA00732 hypothetical protein   97.3 0.00016 3.4E-09   61.0   2.9   48  403-456     1-49  (79)
 26 PF00096 zf-C2H2:  Zinc finger,  97.2  0.0003 6.6E-09   44.9   2.3   23  432-454     1-23  (23)
 27 PF05605 zf-Di19:  Drought indu  97.0 0.00059 1.3E-08   53.0   3.0   49  404-455     3-53  (54)
 28 PF13894 zf-C2H2_4:  C2H2-type   96.7  0.0012 2.7E-08   41.5   2.5   24  432-455     1-24  (24)
 29 PF12756 zf-C2H2_2:  C2H2 type   96.7  0.0013 2.8E-08   55.0   3.1   74  375-455     1-74  (100)
 30 PF05605 zf-Di19:  Drought indu  96.5  0.0028   6E-08   49.2   3.3   51  373-428     2-54  (54)
 31 PF13912 zf-C2H2_6:  C2H2-type   96.4  0.0015 3.2E-08   43.3   1.1   26  431-456     1-26  (27)
 32 PF00096 zf-C2H2:  Zinc finger,  96.1  0.0036 7.8E-08   39.9   1.9   23  374-398     1-23  (23)
 33 PRK04860 hypothetical protein;  95.8  0.0038 8.2E-08   59.3   1.2   37  403-443   119-155 (160)
 34 smart00355 ZnF_C2H2 zinc finge  95.7  0.0099 2.1E-07   37.6   2.5   24  432-455     1-24  (26)
 35 PF09237 GAGA:  GAGA factor;  I  95.3   0.019   4E-07   44.9   3.3   33  427-459    20-52  (54)
 36 COG5048 FOG: Zn-finger [Genera  94.8  0.0089 1.9E-07   60.7   0.3   73  372-446   288-368 (467)
 37 PF13894 zf-C2H2_4:  C2H2-type   94.7   0.026 5.7E-07   35.3   2.3   23  374-398     1-23  (24)
 38 COG5048 FOG: Zn-finger [Genera  94.3   0.022 4.7E-07   57.9   1.8   92  367-458   313-415 (467)
 39 PRK04860 hypothetical protein;  94.2   0.031 6.7E-07   53.1   2.4   40  372-417   118-157 (160)
 40 PF13912 zf-C2H2_6:  C2H2-type   93.9   0.028   6E-07   37.1   1.2   25  373-399     1-25  (27)
 41 PF12874 zf-met:  Zinc-finger o  93.0   0.052 1.1E-06   35.2   1.3   23  432-454     1-23  (25)
 42 KOG1146|consensus               93.0    0.23 4.9E-06   59.9   7.3   80  368-453   460-540 (1406)
 43 PF13909 zf-H2C2_5:  C2H2-type   92.7    0.12 2.5E-06   33.4   2.6   24  432-456     1-24  (24)
 44 smart00355 ZnF_C2H2 zinc finge  92.5    0.13 2.9E-06   32.2   2.7   24  374-399     1-24  (26)
 45 KOG4173|consensus               92.4   0.039 8.3E-07   54.4   0.1   82  371-455    77-171 (253)
 46 PF09237 GAGA:  GAGA factor;  I  90.5    0.35 7.5E-06   38.0   3.5   32  370-403    21-52  (54)
 47 PF12171 zf-C2H2_jaz:  Zinc-fin  89.0     0.1 2.2E-06   34.8  -0.4   22  432-453     2-23  (27)
 48 PF12874 zf-met:  Zinc-finger o  87.2    0.33 7.2E-06   31.3   1.2   23  374-398     1-23  (25)
 49 COG5189 SFP1 Putative transcri  86.5    0.26 5.6E-06   51.6   0.7   40  429-468   347-407 (423)
 50 COG5236 Uncharacterized conser  86.5    0.58 1.3E-05   49.5   3.2   80  373-454   151-243 (493)
 51 KOG2186|consensus               86.4    0.44 9.5E-06   48.4   2.2   50  404-456     4-53  (276)
 52 KOG4124|consensus               84.5    0.61 1.3E-05   49.4   2.2   72  370-451   346-418 (442)
 53 KOG2231|consensus               84.5    0.76 1.7E-05   52.5   3.2   18  381-398   188-205 (669)
 54 PF12171 zf-C2H2_jaz:  Zinc-fin  83.9    0.51 1.1E-05   31.4   0.9   22  404-425     2-23  (27)
 55 smart00451 ZnF_U1 U1-like zinc  83.5    0.95 2.1E-05   31.3   2.2   23  431-453     3-25  (35)
 56 PF13913 zf-C2HC_2:  zinc-finge  82.9     1.1 2.3E-05   29.8   2.2   21  432-453     3-23  (25)
 57 KOG2231|consensus               80.3     1.3 2.9E-05   50.6   3.1   13  386-398   125-138 (669)
 58 PF13909 zf-H2C2_5:  C2H2-type   80.2     1.5 3.3E-05   28.1   2.2   23  374-399     1-23  (24)
 59 PF09986 DUF2225:  Uncharacteri  76.9     1.2 2.7E-05   43.9   1.4   42  402-443     4-60  (214)
 60 KOG2893|consensus               75.6     1.5 3.3E-05   44.4   1.6   46  371-423     9-54  (341)
 61 KOG1146|consensus               72.1     1.6 3.4E-05   53.1   0.9   78  373-457  1260-1354(1406)
 62 PF09538 FYDLN_acid:  Protein o  71.5     2.5 5.4E-05   37.9   1.8   30  404-444    10-39  (108)
 63 KOG2893|consensus               70.0     1.1 2.3E-05   45.5  -0.9   44  402-450    10-53  (341)
 64 KOG4167|consensus               67.7     2.6 5.7E-05   48.5   1.4   26  431-456   792-817 (907)
 65 PF12013 DUF3505:  Protein of u  67.2     7.3 0.00016   34.1   3.9   25  432-456    81-109 (109)
 66 COG5236 Uncharacterized conser  65.2       3 6.4E-05   44.4   1.1   68  381-453   226-303 (493)
 67 PRK00464 nrdR transcriptional   64.1     3.4 7.4E-05   39.2   1.2   14  404-417    29-42  (154)
 68 cd00350 rubredoxin_like Rubred  64.0     4.6  0.0001   28.4   1.6   10  430-439    16-25  (33)
 69 COG4049 Uncharacterized protei  63.2       4 8.6E-05   32.8   1.2   29  425-453    11-39  (65)
 70 smart00451 ZnF_U1 U1-like zinc  62.7     6.3 0.00014   27.1   2.1   23  373-397     3-25  (35)
 71 COG1997 RPL43A Ribosomal prote  62.5     4.5 9.7E-05   35.1   1.5   32  402-443    34-65  (89)
 72 PF12756 zf-C2H2_2:  C2H2 type   62.4     6.2 0.00014   32.6   2.4   24  373-398    50-73  (100)
 73 TIGR00622 ssl1 transcription f  61.2      12 0.00026   33.9   4.0   80  372-455    14-105 (112)
 74 TIGR02300 FYDLN_acid conserved  57.8     6.9 0.00015   36.2   2.0   33  404-447    10-42  (129)
 75 PF02892 zf-BED:  BED zinc fing  56.7     7.4 0.00016   28.5   1.7   26  428-453    13-42  (45)
 76 smart00614 ZnF_BED BED zinc fi  54.3       9  0.0002   29.1   1.8   24  432-455    19-48  (50)
 77 KOG2482|consensus               53.8     8.3 0.00018   41.1   2.0   81  373-455   195-303 (423)
 78 smart00531 TFIIE Transcription  53.3      13 0.00028   34.6   3.1   38  401-442    97-134 (147)
 79 TIGR02098 MJ0042_CXXC MJ0042 f  52.0      11 0.00024   26.8   1.9   34  404-442     3-36  (38)
 80 PF05443 ROS_MUCR:  ROS/MUCR tr  52.0     8.2 0.00018   35.8   1.5   27  429-458    70-96  (132)
 81 PF13719 zinc_ribbon_5:  zinc-r  47.9      15 0.00032   26.6   2.0   34  404-442     3-36  (37)
 82 TIGR00373 conserved hypothetic  46.9      15 0.00033   34.7   2.6   34  400-442   106-139 (158)
 83 PRK09678 DNA-binding transcrip  46.0     9.7 0.00021   31.8   0.9   14  429-442    25-40  (72)
 84 PRK13185 chlL protochlorophyll  45.8     3.4 7.3E-05   41.2  -2.2   18   29-46      7-24  (270)
 85 PF04959 ARS2:  Arsenite-resist  44.7      20 0.00042   35.9   3.0   33  428-460    74-106 (214)
 86 PF09723 Zn-ribbon_8:  Zinc rib  43.3      13 0.00029   27.5   1.2   29  404-439     6-34  (42)
 87 PF13717 zinc_ribbon_4:  zinc-r  43.1      22 0.00047   25.6   2.2   33  404-441     3-35  (36)
 88 smart00834 CxxC_CXXC_SSSS Puta  43.1      12 0.00026   26.7   0.9   30  404-440     6-35  (41)
 89 KOG2186|consensus               42.8      11 0.00025   38.5   1.0   47  374-425     4-50  (276)
 90 KOG2785|consensus               42.0      18 0.00039   39.1   2.4   73  374-453   167-242 (390)
 91 cd00729 rubredoxin_SM Rubredox  41.5      17 0.00038   25.8   1.5   25  403-439     2-26  (34)
 92 COG1592 Rubrerythrin [Energy p  41.2      17 0.00038   35.0   2.0   25  402-439   133-157 (166)
 93 KOG0978|consensus               40.9     6.6 0.00014   45.4  -1.1   21  430-450   677-697 (698)
 94 cd02032 Bchl_like This family   40.8     4.4 9.6E-05   40.3  -2.2   18   29-46      5-22  (267)
 95 TIGR02605 CxxC_CxxC_SSSS putat  40.3      14  0.0003   28.0   1.0   11  404-414     6-16  (52)
 96 KOG2593|consensus               40.3      22 0.00047   39.0   2.7   38  400-440   125-162 (436)
 97 PRK13231 nitrogenase reductase  40.2       6 0.00013   39.3  -1.4   18   29-46      7-24  (264)
 98 COG4957 Predicted transcriptio  40.1      15 0.00033   34.4   1.3   25  432-459    77-101 (148)
 99 PRK00398 rpoP DNA-directed RNA  40.0      18 0.00039   27.1   1.5   12  403-414     3-14  (46)
100 PF14353 CpXC:  CpXC protein     38.3     8.1 0.00018   34.7  -0.7   19  404-422    39-57  (128)
101 PRK06266 transcription initiat  37.7      23 0.00049   34.3   2.2   34  401-443   115-148 (178)
102 PRK13235 nifH nitrogenase redu  37.6     5.2 0.00011   40.1  -2.3   38   29-85      6-43  (274)
103 KOG2461|consensus               37.3 1.8E+02  0.0039   31.8   9.1   55  384-438   340-394 (396)
104 COG4049 Uncharacterized protei  36.9     9.6 0.00021   30.7  -0.4   27  369-397    13-39  (65)
105 COG4957 Predicted transcriptio  36.2      37 0.00081   31.9   3.2   26  372-402    75-100 (148)
106 cd02117 NifH_like This family   36.0     5.6 0.00012   38.3  -2.3   18   29-46      5-22  (212)
107 PF01780 Ribosomal_L37ae:  Ribo  36.0      15 0.00032   32.1   0.5   31  402-442    34-64  (90)
108 KOG2071|consensus               35.1      32 0.00068   39.1   3.0   27  371-399   416-442 (579)
109 cd02040 NifH NifH gene encodes  34.9       6 0.00013   39.0  -2.3   17   30-46      7-23  (270)
110 smart00659 RPOLCX RNA polymera  34.3      32 0.00068   26.0   2.0   11  404-414     3-13  (44)
111 PF09986 DUF2225:  Uncharacteri  33.3     8.8 0.00019   38.0  -1.4   44  371-416     3-61  (214)
112 COG1192 Soj ATPases involved i  32.8     6.9 0.00015   38.7  -2.3   36   29-82      8-43  (259)
113 COG4530 Uncharacterized protei  32.6      25 0.00055   31.9   1.5   28  405-443    11-38  (129)
114 KOG2482|consensus               32.4      45 0.00098   35.8   3.5   23  431-453   195-217 (423)
115 PRK13230 nitrogenase reductase  32.3     7.3 0.00016   39.2  -2.2   18   29-46      6-23  (279)
116 PRK13233 nifH nitrogenase redu  32.1     7.9 0.00017   38.7  -2.0   17   30-46      8-24  (275)
117 PF12760 Zn_Tnp_IS1595:  Transp  31.8      73  0.0016   23.8   3.6   38  393-439     8-45  (46)
118 TIGR01287 nifH nitrogenase iro  31.8     9.7 0.00021   38.1  -1.5   18   29-46      5-22  (275)
119 COG1198 PriA Primosomal protei  31.1      32 0.00069   40.3   2.4   13  428-440   472-484 (730)
120 KOG2785|consensus               30.5      38 0.00083   36.6   2.6   24  402-425   216-242 (390)
121 CHL00072 chlL photochlorophyll  29.9     8.9 0.00019   39.4  -2.1   18   29-46      5-22  (290)
122 PF04959 ARS2:  Arsenite-resist  29.5      11 0.00024   37.6  -1.4   28  402-429    76-103 (214)
123 COG1996 RPC10 DNA-directed RNA  29.3      34 0.00073   26.7   1.5   11  403-413     6-16  (49)
124 COG5151 SSL1 RNA polymerase II  29.0      52  0.0011   34.9   3.2   49  405-455   364-412 (421)
125 PF09538 FYDLN_acid:  Protein o  28.7      50  0.0011   29.6   2.7   30  374-416    10-39  (108)
126 TIGR03371 cellulose_yhjQ cellu  28.4     9.3  0.0002   37.1  -2.2   18   29-46      7-24  (246)
127 PF01656 CbiA:  CobQ/CobB/MinD/  28.4      11 0.00024   34.7  -1.5   18   29-46      4-21  (195)
128 PRK10037 cell division protein  28.3     9.6 0.00021   37.7  -2.1   18   29-46      7-24  (250)
129 PF05443 ROS_MUCR:  ROS/MUCR tr  28.2      39 0.00084   31.4   2.0   30  368-402    67-96  (132)
130 KOG0782|consensus               27.5      20 0.00044   40.6  -0.0   54  387-445   237-290 (1004)
131 COG3677 Transposase and inacti  27.0      62  0.0013   29.8   3.1   47  391-444    20-66  (129)
132 KOG3598|consensus               26.1      19  0.0004   44.8  -0.6   20   33-52   1846-1865(2220)
133 PHA02518 ParA-like protein; Pr  25.7      12 0.00025   35.3  -2.0   18   29-46      6-23  (211)
134 PHA00626 hypothetical protein   25.4      43 0.00093   27.0   1.5   14  430-443    22-35  (59)
135 PF09818 ABC_ATPase:  Predicted  25.0      35 0.00077   37.7   1.3   80   25-113   245-330 (448)
136 KOG4377|consensus               24.7      34 0.00074   37.4   1.1   81  374-457   314-429 (480)
137 COG1327 Predicted transcriptio  24.4      36 0.00078   32.5   1.0   29  444-472    63-91  (156)
138 cd02037 MRP-like MRP (Multiple  24.3      18 0.00039   33.4  -1.0   18   29-46      5-22  (169)
139 COG3091 SprT Zn-dependent meta  24.1      43 0.00094   32.0   1.5   31  403-438   117-147 (156)
140 TIGR00373 conserved hypothetic  24.1      40 0.00086   31.9   1.3   33  371-414   107-139 (158)
141 PF10263 SprT-like:  SprT-like   23.9      35 0.00076   31.2   0.9   32  403-442   123-154 (157)
142 KOG2071|consensus               23.6      46   0.001   37.8   1.9   31  428-458   415-445 (579)
143 smart00440 ZnF_C2C2 C2C2 Zinc   23.4      36 0.00077   25.1   0.6   12  431-442    28-39  (40)
144 PTZ00255 60S ribosomal protein  23.2      41 0.00089   29.4   1.1   32  402-443    35-66  (90)
145 PRK14890 putative Zn-ribbon RN  23.2      60  0.0013   26.3   1.9   31  403-439    25-56  (59)
146 TIGR01281 DPOR_bchL light-inde  23.1      17 0.00037   36.1  -1.5   17   30-46      6-22  (268)
147 PF01927 Mut7-C:  Mut7-C RNAse   23.1      64  0.0014   29.9   2.4   47  404-451    92-144 (147)
148 PF15135 UPF0515:  Uncharacteri  22.8      62  0.0013   33.3   2.4   62  366-442   105-166 (278)
149 cd02036 MinD Bacterial cell di  22.7      18 0.00039   32.9  -1.3   18   29-46      5-22  (179)
150 TIGR01969 minD_arch cell divis  22.6      19  0.0004   34.9  -1.3   18   29-46      6-23  (251)
151 smart00734 ZnF_Rad18 Rad18-lik  22.5      76  0.0016   21.3   2.0   20  433-453     3-22  (26)
152 PRK06266 transcription initiat  22.5      44 0.00096   32.3   1.3   33  372-415   116-148 (178)
153 TIGR00244 transcriptional regu  22.4      43 0.00093   31.8   1.1   15  402-416    27-41  (147)
154 COG3357 Predicted transcriptio  22.3      45 0.00098   29.3   1.1   27  402-438    57-83  (97)
155 TIGR00280 L37a ribosomal prote  22.1      41 0.00088   29.5   0.8   32  402-443    34-65  (91)
156 COG0489 Mrp ATPases involved i  21.6      20 0.00043   36.5  -1.3   18   28-45     62-79  (265)
157 cd03110 Fer4_NifH_child This p  21.3      20 0.00043   33.2  -1.3   17   29-45      5-21  (179)
158 PF06524 NOA36:  NOA36 protein;  21.0      38 0.00083   35.0   0.5   84  368-455   137-233 (314)
159 PRK14873 primosome assembly pr  20.7      29 0.00062   40.2  -0.5   27  398-440   405-431 (665)
160 PRK03824 hypA hydrogenase nick  20.7      55  0.0012   30.2   1.5   13  403-415    70-82  (135)
161 PF05191 ADK_lid:  Adenylate ki  20.6      40 0.00086   24.4   0.4   31  404-442     2-32  (36)
162 COG1655 Uncharacterized protei  20.6      38 0.00082   34.5   0.4   40  401-440    17-71  (267)
163 PRK13234 nifH nitrogenase redu  20.2      17 0.00037   37.3  -2.2   37   30-85     10-46  (295)
164 PF01363 FYVE:  FYVE zinc finge  20.0      56  0.0012   26.0   1.2   13  402-414    24-36  (69)
165 PRK13849 putative crown gall t  20.0      17 0.00037   36.2  -2.2   18   29-46      7-24  (231)

No 1  
>KOG2462|consensus
Probab=99.77  E-value=3.8e-19  Score=177.07  Aligned_cols=110  Identities=32%  Similarity=0.639  Sum_probs=100.6

Q ss_pred             CCCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChHHHHHH
Q psy679          371 ERPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSDEKKRH  450 (510)
Q Consensus       371 EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrsssLkrH  450 (510)
                      .+-|.|+  .|+|.|.+-..|+.|+|+|+  -+++|.+|||.|.+.--|..|+|+|||||||.|..|+|+|+.+++|+.|
T Consensus       159 ~ka~~C~--~C~K~YvSmpALkMHirTH~--l~c~C~iCGKaFSRPWLLQGHiRTHTGEKPF~C~hC~kAFADRSNLRAH  234 (279)
T KOG2462|consen  159 KKAFSCK--YCGKVYVSMPALKMHIRTHT--LPCECGICGKAFSRPWLLQGHIRTHTGEKPFSCPHCGKAFADRSNLRAH  234 (279)
T ss_pred             cccccCC--CCCceeeehHHHhhHhhccC--CCcccccccccccchHHhhcccccccCCCCccCCcccchhcchHHHHHH
Confidence            5678895  59999999999999999997  5899999999999999999999999999999999999999999999999


Q ss_pred             HHHhhcccchhHHHHHHHHHHHHHHHHHHHhhhh
Q psy679          451 AKVHLKQRSKKESKMAVMMSQQQAQQQQQQNHVH  484 (510)
Q Consensus       451 lRtHtgeK~~k~~~~~~~~~QQQ~qQQqQQN~~~  484 (510)
                      |++|.+.|.|+|..|.+.|+..--...+......
T Consensus       235 mQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES~C~  268 (279)
T KOG2462|consen  235 MQTHSDVKKHQCPRCGKSFALKSYLNKHSESACL  268 (279)
T ss_pred             HHhhcCCccccCcchhhHHHHHHHHHHhhhhccc
Confidence            9999999999999999999988766666554433


No 2  
>KOG2462|consensus
Probab=99.68  E-value=1.1e-17  Score=166.79  Aligned_cols=83  Identities=37%  Similarity=0.779  Sum_probs=77.7

Q ss_pred             CCCCCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChHHHH
Q psy679          369 VHERPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSDEKK  448 (510)
Q Consensus       369 ~~EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrsssLk  448 (510)
                      ++.-+++|.+  |||.|.+..-|+.|+|+|+|||||.|..|+|+|..+.+|+.|+++|.+.|+|.|..|+|.|++.+.|.
T Consensus       183 TH~l~c~C~i--CGKaFSRPWLLQGHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLn  260 (279)
T KOG2462|consen  183 THTLPCECGI--CGKAFSRPWLLQGHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLN  260 (279)
T ss_pred             ccCCCccccc--ccccccchHHhhcccccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHH
Confidence            3446778855  99999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             HHHHH
Q psy679          449 RHAKV  453 (510)
Q Consensus       449 rHlRt  453 (510)
                      +|...
T Consensus       261 KH~ES  265 (279)
T KOG2462|consen  261 KHSES  265 (279)
T ss_pred             Hhhhh
Confidence            99754


No 3  
>KOG3576|consensus
Probab=99.53  E-value=4.9e-15  Score=142.50  Aligned_cols=96  Identities=26%  Similarity=0.582  Sum_probs=85.8

Q ss_pred             CCCCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChHHHHH
Q psy679          370 HERPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSDEKKR  449 (510)
Q Consensus       370 ~EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrsssLkr  449 (510)
                      +...|.|.+  |+|.|.-..-|.+|++.|...|.|-|..|+|.|....+|++|+|+|+|.+||+|..|+|+|.++..|..
T Consensus       114 d~d~ftCrv--CgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsles  191 (267)
T KOG3576|consen  114 DQDSFTCRV--CGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLES  191 (267)
T ss_pred             CCCeeeeeh--hhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHH
Confidence            456799977  999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             HHHHhhc-----------ccchhHHHHHH
Q psy679          450 HAKVHLK-----------QRSKKESKMAV  467 (510)
Q Consensus       450 HlRtHtg-----------eK~~k~~~~~~  467 (510)
                      |++.-++           .|.|.|++|..
T Consensus       192 hl~kvhgv~~~yaykerr~kl~vcedcg~  220 (267)
T KOG3576|consen  192 HLKKVHGVQHQYAYKERRAKLYVCEDCGY  220 (267)
T ss_pred             HHHHHcCchHHHHHHHhhhheeeecccCC
Confidence            9975443           45566777643


No 4  
>KOG3623|consensus
Probab=99.46  E-value=1.6e-14  Score=157.84  Aligned_cols=90  Identities=40%  Similarity=0.844  Sum_probs=81.4

Q ss_pred             CCCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChHHHHHH
Q psy679          371 ERPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSDEKKRH  450 (510)
Q Consensus       371 EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrsssLkrH  450 (510)
                      +..|.|  +.|.|.|...+.|.||.--|+|.|||+|.+|.|+|+.+.+|..|+|.|.|||||.|++|+|+|.....+..|
T Consensus       892 ~gmyaC--DqCDK~FqKqSSLaRHKYEHsGqRPyqC~iCkKAFKHKHHLtEHkRLHSGEKPfQCdKClKRFSHSGSYSQH  969 (1007)
T KOG3623|consen  892 DGMYAC--DQCDKAFQKQSSLARHKYEHSGQRPYQCIICKKAFKHKHHLTEHKRLHSGEKPFQCDKCLKRFSHSGSYSQH  969 (1007)
T ss_pred             cccchH--HHHHHHHHhhHHHHHhhhhhcCCCCcccchhhHhhhhhhhhhhhhhhccCCCcchhhhhhhhcccccchHhh
Confidence            467999  679999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             HH-HhhcccchhH
Q psy679          451 AK-VHLKQRSKKE  462 (510)
Q Consensus       451 lR-tHtgeK~~k~  462 (510)
                      |. ...--|+|++
T Consensus       970 MNHRYSYCKpyrE  982 (1007)
T KOG3623|consen  970 MNHRYSYCKPYRE  982 (1007)
T ss_pred             hccchhcccchhh
Confidence            84 3334455543


No 5  
>KOG3576|consensus
Probab=99.27  E-value=3.3e-12  Score=123.18  Aligned_cols=101  Identities=28%  Similarity=0.597  Sum_probs=90.4

Q ss_pred             CCCCCCCCCCCCCCCCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccC---------
Q psy679          358 RKYPNRPSKTPVHERPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTG---------  428 (510)
Q Consensus       358 ~~~p~~pskth~~EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtG---------  428 (510)
                      .++.+++.+-|...|.|-|.+  |||.|...-+|++|+|+|+|.+||+|..|+|+|..+..|..|.+.-+|         
T Consensus       130 QRmlnrh~kch~~vkr~lct~--cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl~kvhgv~~~yayke  207 (267)
T KOG3576|consen  130 QRMLNRHLKCHSDVKRHLCTF--CGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHLKKVHGVQHQYAYKE  207 (267)
T ss_pred             HHHHHHHhhhccHHHHHHHhh--ccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHHHHHcCchHHHHHHH
Confidence            346688889999999999977  999999999999999999999999999999999999999999875333         


Q ss_pred             --CCceecCcCCCccCChHHHHHHHHHhhcccch
Q psy679          429 --EKPFSCDVCGRKFARSDEKKRHAKVHLKQRSK  460 (510)
Q Consensus       429 --eKPfkC~~CgKaFsrsssLkrHlRtHtgeK~~  460 (510)
                        +|-|.|+.||..-.+...+..|++.|+..-+.
T Consensus       208 rr~kl~vcedcg~t~~~~e~~~~h~~~~hp~Spa  241 (267)
T KOG3576|consen  208 RRAKLYVCEDCGYTSERPEVYYLHLKLHHPFSPA  241 (267)
T ss_pred             hhhheeeecccCCCCCChhHHHHHHHhcCCCCHH
Confidence              57899999999999999999999999876554


No 6  
>KOG3623|consensus
Probab=99.21  E-value=3.4e-12  Score=139.97  Aligned_cols=80  Identities=34%  Similarity=0.807  Sum_probs=72.9

Q ss_pred             CCCcccCCCCCCCccCChHHHHHHHHHhcC-------------CCceecccCCCccCCchhhhhhhhcccCCCceecCcC
Q psy679          371 ERPYACPVENCDRRFSRSDELTRHIRIHTG-------------QKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVC  437 (510)
Q Consensus       371 EKPy~C~~e~CgKsFssss~LkrH~rtHtg-------------eKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~C  437 (510)
                      +..|.|..  |..+|..+..|.+||.+|..             -|.|+|..|+|+|+.+.+|+.|+|+|.|||||.|+.|
T Consensus       238 e~nfsC~l--CsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKCtECgKAFKfKHHLKEHlRIHSGEKPfeCpnC  315 (1007)
T KOG3623|consen  238 EPNFSCML--CSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKCTECGKAFKFKHHLKEHLRIHSGEKPFECPNC  315 (1007)
T ss_pred             CCCCcchh--hhhhhhhHHHHHHHHHhhcCCCcccccccchhhhccccccccchhhhhHHHHHhhheeecCCCCcCCccc
Confidence            34577954  99999999999999999863             3679999999999999999999999999999999999


Q ss_pred             CCccCChHHHHHHHH
Q psy679          438 GRKFARSDEKKRHAK  452 (510)
Q Consensus       438 gKaFsrsssLkrHlR  452 (510)
                      +|+|.....+..||.
T Consensus       316 kKRFSHSGSySSHmS  330 (1007)
T KOG3623|consen  316 KKRFSHSGSYSSHMS  330 (1007)
T ss_pred             ccccccCCccccccc
Confidence            999999999999984


No 7  
>KOG1074|consensus
Probab=99.01  E-value=2.9e-10  Score=126.80  Aligned_cols=53  Identities=36%  Similarity=0.813  Sum_probs=42.8

Q ss_pred             CcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhccc
Q psy679          373 PYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHT  427 (510)
Q Consensus       373 Py~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHt  427 (510)
                      +++|.+  |.|.|...+.|+.|.|.|+|+|||+|.+|+.+|..+.+|+.|...|+
T Consensus       353 khkCr~--CakvfgS~SaLqiHlRSHTGERPfqCnvCG~~FSTkGNLKvH~~rH~  405 (958)
T KOG1074|consen  353 KHKCRF--CAKVFGSDSALQIHLRSHTGERPFQCNVCGNRFSTKGNLKVHFQRHR  405 (958)
T ss_pred             cchhhh--hHhhcCchhhhhhhhhccCCCCCeeecccccccccccceeeeeeecc
Confidence            356755  88888888888888888888888888888888888888888865543


No 8  
>KOG1074|consensus
Probab=98.94  E-value=1.4e-10  Score=129.17  Aligned_cols=84  Identities=36%  Similarity=0.805  Sum_probs=77.2

Q ss_pred             CCCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccCC----CceecC---cCCCccCC
Q psy679          371 ERPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGE----KPFSCD---VCGRKFAR  443 (510)
Q Consensus       371 EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGe----KPfkC~---~CgKaFsr  443 (510)
                      ..|..|.+  |-|....++.|+.|.|+|+|+|||+|.+|+++|..+.+|+.|+.+|...    -.|.|.   +|-+.|..
T Consensus       603 TdPNqCii--C~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn  680 (958)
T KOG1074|consen  603 TDPNQCII--CLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTN  680 (958)
T ss_pred             CCccceee--eeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhcccccc
Confidence            35788977  9999999999999999999999999999999999999999999998543    357899   99999999


Q ss_pred             hHHHHHHHHHhhc
Q psy679          444 SDEKKRHAKVHLK  456 (510)
Q Consensus       444 sssLkrHlRtHtg  456 (510)
                      .-.|..|+|+|.+
T Consensus       681 ~V~lpQhIriH~~  693 (958)
T KOG1074|consen  681 AVTLPQHIRIHLG  693 (958)
T ss_pred             cccccceEEeecC
Confidence            9999999999994


No 9  
>PHA00733 hypothetical protein
Probab=98.80  E-value=2.9e-09  Score=96.87  Aligned_cols=85  Identities=24%  Similarity=0.445  Sum_probs=70.5

Q ss_pred             CCCCCCcccCCCCCCCccCChHHHHHH--HH---HhcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccC
Q psy679          368 PVHERPYACPVENCDRRFSRSDELTRH--IR---IHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFA  442 (510)
Q Consensus       368 h~~EKPy~C~~e~CgKsFssss~LkrH--~r---tHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFs  442 (510)
                      ....+++.|.+  |.+.|.....|..|  ++   .+.+.++|.|..|++.|.....|..|++.|  +++|.|..|++.|.
T Consensus        35 ~~~~~~~~~~~--~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~  110 (128)
T PHA00733         35 TPEQKRLIRAV--VKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFR  110 (128)
T ss_pred             ChhhhhHHHHH--HhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccC
Confidence            34567899965  99998888777665  22   234588999999999999999999999987  46799999999999


Q ss_pred             ChHHHHHHHHHhhc
Q psy679          443 RSDEKKRHAKVHLK  456 (510)
Q Consensus       443 rsssLkrHlRtHtg  456 (510)
                      ....|.+|++..++
T Consensus       111 ~~~sL~~H~~~~h~  124 (128)
T PHA00733        111 NTDSTLDHVCKKHN  124 (128)
T ss_pred             CHHHHHHHHHHhcC
Confidence            99999999977654


No 10 
>KOG3608|consensus
Probab=98.73  E-value=2.6e-09  Score=110.15  Aligned_cols=101  Identities=25%  Similarity=0.523  Sum_probs=49.0

Q ss_pred             CCCCCCCCCCCcccCCCCCCCccCChHHHHHHHHHhcC--CCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCc
Q psy679          363 RPSKTPVHERPYACPVENCDRRFSRSDELTRHIRIHTG--QKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRK  440 (510)
Q Consensus       363 ~pskth~~EKPy~C~~e~CgKsFssss~LkrH~rtHtg--eKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKa  440 (510)
                      ++.++|++||...|+.  ||..|+++..|-.|+|..+.  ..+|.|..|-|.|.....|..|++.|.  .-|+|+.|...
T Consensus       197 eH~r~Hs~eKvvACp~--Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHv--n~ykCplCdmt  272 (467)
T KOG3608|consen  197 EHIRTHSNEKVVACPH--CGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHV--NCYKCPLCDMT  272 (467)
T ss_pred             HHHHhcCCCeEEecch--HHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhh--hcccccccccC
Confidence            4456666666666633  66666666666666654332  234444444444444444444443332  12444444444


Q ss_pred             cCChHHHHHHHH-HhhcccchhHHHHHH
Q psy679          441 FARSDEKKRHAK-VHLKQRSKKESKMAV  467 (510)
Q Consensus       441 FsrsssLkrHlR-tHtgeK~~k~~~~~~  467 (510)
                      ....+.|.+|+| .|.+.|++||..|..
T Consensus       273 c~~~ssL~~H~r~rHs~dkpfKCd~Cd~  300 (467)
T KOG3608|consen  273 CSSASSLTTHIRYRHSKDKPFKCDECDT  300 (467)
T ss_pred             CCChHHHHHHHHhhhccCCCccccchhh
Confidence            444444444443 344444444444444


No 11 
>KOG3608|consensus
Probab=98.71  E-value=5.7e-09  Score=107.68  Aligned_cols=102  Identities=26%  Similarity=0.573  Sum_probs=80.2

Q ss_pred             CCCCCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhc-ccCCCceecCcCCCccCChHHH
Q psy679          369 VHERPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRT-HTGEKPFSCDVCGRKFARSDEK  447 (510)
Q Consensus       369 ~~EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rt-HtGeKPfkC~~CgKaFsrsssL  447 (510)
                      ....+|.|  ..|-|+|.....|+.|++.|..  -|+|+.|.......+.|.+|++. |..+|||+|+.|.+.|.+.+.|
T Consensus       233 l~~n~fqC--~~C~KrFaTeklL~~Hv~rHvn--~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL  308 (467)
T KOG3608|consen  233 LNTNSFQC--AQCFKRFATEKLLKSHVVRHVN--CYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDL  308 (467)
T ss_pred             hcCCchHH--HHHHHHHhHHHHHHHHHHHhhh--cccccccccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHH
Confidence            34568999  5599999999999999988863  58899998888888889988874 7778899999999999998899


Q ss_pred             HHHHHHhhcccchhHHH--HHHHHHHHHHH
Q psy679          448 KRHAKVHLKQRSKKESK--MAVMMSQQQAQ  475 (510)
Q Consensus       448 krHlRtHtgeK~~k~~~--~~~~~~QQQ~q  475 (510)
                      .+|+.+|. +-.|.|+.  |...+....+.
T Consensus       309 ~kH~~~HS-~~~y~C~h~~C~~s~r~~~q~  337 (467)
T KOG3608|consen  309 AKHVQVHS-KTVYQCEHPDCHYSVRTYTQM  337 (467)
T ss_pred             HHHHHhcc-ccceecCCCCCcHHHHHHHHH
Confidence            99988887 66677766  55444433333


No 12 
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.62  E-value=3.2e-08  Score=108.87  Aligned_cols=95  Identities=16%  Similarity=0.431  Sum_probs=79.5

Q ss_pred             CCCCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCC------
Q psy679          370 HERPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFAR------  443 (510)
Q Consensus       370 ~EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsr------  443 (510)
                      .++.+.|.  .|++.|. ...|..|++.|+  +++.|+ |++.+ .+..|..|+++|..+|++.|..|++.|..      
T Consensus       450 l~~H~~C~--~Cgk~f~-~s~LekH~~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d  522 (567)
T PLN03086        450 AKNHVHCE--KCGQAFQ-QGEMEKHMKVFH--EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMD  522 (567)
T ss_pred             cccCccCC--CCCCccc-hHHHHHHHHhcC--CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccc
Confidence            35567894  5999996 678999999985  899999 99765 66899999999999999999999999952      


Q ss_pred             ----hHHHHHHHHHhhcccchhHHHHHHHHHHH
Q psy679          444 ----SDEKKRHAKVHLKQRSKKESKMAVMMSQQ  472 (510)
Q Consensus       444 ----sssLkrHlRtHtgeK~~k~~~~~~~~~QQ  472 (510)
                          ...|..|++++ +.+++.|..|.+.....
T Consensus       523 ~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk~Vrlr  554 (567)
T PLN03086        523 VRDRLRGMSEHESIC-GSRTAPCDSCGRSVMLK  554 (567)
T ss_pred             hhhhhhhHHHHHHhc-CCcceEccccCCeeeeh
Confidence                45899999986 99999999997654433


No 13 
>PHA02768 hypothetical protein; Provisional
Probab=98.51  E-value=2.5e-08  Score=78.23  Aligned_cols=42  Identities=21%  Similarity=0.517  Sum_probs=25.2

Q ss_pred             cccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhh
Q psy679          374 YACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHL  419 (510)
Q Consensus       374 y~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L  419 (510)
                      |.|  +.||+.|.+..+|.+|+++|+  ++|+|..|++.|.+...|
T Consensus         6 y~C--~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l   47 (55)
T PHA02768          6 YEC--PICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEY   47 (55)
T ss_pred             cCc--chhCCeeccHHHHHHHHHhcC--CcccCCcccceeccccee
Confidence            556  346666666666666666665  456666666666555444


No 14 
>PHA02768 hypothetical protein; Provisional
Probab=98.34  E-value=2.7e-07  Score=72.46  Aligned_cols=44  Identities=25%  Similarity=0.532  Sum_probs=40.8

Q ss_pred             ceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChHHHH
Q psy679          403 PFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSDEKK  448 (510)
Q Consensus       403 PfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrsssLk  448 (510)
                      -|.|+.|++.|.+..+|.+|+++|+  ++|+|..|+|.|.+.+.|.
T Consensus         5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~   48 (55)
T PHA02768          5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYI   48 (55)
T ss_pred             ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeE
Confidence            4899999999999999999999998  7999999999999887764


No 15 
>PF13465 zf-H2C2_2:  Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.15  E-value=7.1e-07  Score=59.70  Aligned_cols=24  Identities=58%  Similarity=1.211  Sum_probs=13.3

Q ss_pred             HHHHHHHhcCCCceecccCCCccC
Q psy679          391 LTRHIRIHTGQKPFQCRICMRSFS  414 (510)
Q Consensus       391 LkrH~rtHtgeKPfkC~~CgKsF~  414 (510)
                      |.+|+++|+++|||.|+.|++.|.
T Consensus         2 l~~H~~~H~~~k~~~C~~C~k~F~   25 (26)
T PF13465_consen    2 LRRHMRTHTGEKPYKCPYCGKSFS   25 (26)
T ss_dssp             HHHHHHHHSSSSSEEESSSSEEES
T ss_pred             HHHHhhhcCCCCCCCCCCCcCeeC
Confidence            455555555555555555555553


No 16 
>PF13465 zf-H2C2_2:  Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.10  E-value=2.1e-06  Score=57.46  Aligned_cols=26  Identities=54%  Similarity=1.206  Sum_probs=24.0

Q ss_pred             hhhhhhhcccCCCceecCcCCCccCC
Q psy679          418 HLTTHIRTHTGEKPFSCDVCGRKFAR  443 (510)
Q Consensus       418 ~L~rH~rtHtGeKPfkC~~CgKaFsr  443 (510)
                      +|.+|+++|+|+|||.|+.|++.|.+
T Consensus         1 ~l~~H~~~H~~~k~~~C~~C~k~F~~   26 (26)
T PF13465_consen    1 NLRRHMRTHTGEKPYKCPYCGKSFSN   26 (26)
T ss_dssp             HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred             CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence            47899999999999999999999974


No 17 
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.07  E-value=4.1e-06  Score=92.55  Aligned_cols=78  Identities=21%  Similarity=0.449  Sum_probs=65.7

Q ss_pred             CCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCC----------chhhhhhhhcccCCCceecCcCCCcc
Q psy679          372 RPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSR----------SDHLTTHIRTHTGEKPFSCDVCGRKF  441 (510)
Q Consensus       372 KPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~----------ks~L~rH~rtHtGeKPfkC~~CgKaF  441 (510)
                      +++.|  . |++.| .+..|..|+++|...|++.|.+|++.|..          ...|..|...+ |.+++.|..||+.|
T Consensus       477 kpv~C--p-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk~V  551 (567)
T PLN03086        477 EPLQC--P-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDSCGRSV  551 (567)
T ss_pred             CCccC--C-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhc-CCcceEccccCCee
Confidence            78999  4 99765 66899999999999999999999999852          34789999986 89999999999998


Q ss_pred             CChHHHHHHHHHhh
Q psy679          442 ARSDEKKRHAKVHL  455 (510)
Q Consensus       442 srsssLkrHlRtHt  455 (510)
                      ..+ .+..|+...+
T Consensus       552 rlr-dm~~H~~~~h  564 (567)
T PLN03086        552 MLK-EMDIHQIAVH  564 (567)
T ss_pred             eeh-hHHHHHHHhh
Confidence            875 5677875544


No 18 
>PHA00733 hypothetical protein
Probab=97.97  E-value=6.6e-06  Score=74.99  Aligned_cols=56  Identities=29%  Similarity=0.550  Sum_probs=50.0

Q ss_pred             CCCCCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccC
Q psy679          369 VHERPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTG  428 (510)
Q Consensus       369 ~~EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtG  428 (510)
                      ..+++|.|  +.|++.|.....|..|++.|  .++|.|..|++.|.....|.+|+..+++
T Consensus        69 ~~~kPy~C--~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL~~H~~~~h~  124 (128)
T PHA00733         69 KAVSPYVC--PLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDSTLDHVCKKHN  124 (128)
T ss_pred             CCCCCccC--CCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHHHHHHHHhcC
Confidence            34789999  55999999999999999987  4579999999999999999999987765


No 19 
>KOG3993|consensus
Probab=97.92  E-value=2.3e-06  Score=90.45  Aligned_cols=87  Identities=29%  Similarity=0.604  Sum_probs=70.1

Q ss_pred             CcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhccc--------CCC--------------
Q psy679          373 PYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHT--------GEK--------------  430 (510)
Q Consensus       373 Py~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHt--------GeK--------------  430 (510)
                      -|.|..  |...|.+.-.|..|.-...-.-.|+|+.|+|.|.-..+|..|.|=|.        +.+              
T Consensus       267 dyiCqL--CK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~e  344 (500)
T KOG3993|consen  267 DYICQL--CKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQE  344 (500)
T ss_pred             HHHHHH--HHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhh
Confidence            388955  99999999999999754333346999999999999999999987663        111              


Q ss_pred             -----------ceecCcCCCccCChHHHHHHHHHhhcccchh
Q psy679          431 -----------PFSCDVCGRKFARSDEKKRHAKVHLKQRSKK  461 (510)
Q Consensus       431 -----------PfkC~~CgKaFsrsssLkrHlRtHtgeK~~k  461 (510)
                                 .|.|.+|+|.|++...|+.|+-+|.....-+
T Consensus       345 a~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k  386 (500)
T KOG3993|consen  345 AERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAK  386 (500)
T ss_pred             ccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccch
Confidence                       3899999999999999999998887654433


No 20 
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.87  E-value=6.2e-06  Score=84.65  Aligned_cols=73  Identities=27%  Similarity=0.626  Sum_probs=53.9

Q ss_pred             CCCCcccCCCCCCCccCChHHHHHHHHH-hcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChHHHH
Q psy679          370 HERPYACPVENCDRRFSRSDELTRHIRI-HTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSDEKK  448 (510)
Q Consensus       370 ~EKPy~C~~e~CgKsFssss~LkrH~rt-HtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrsssLk  448 (510)
                      ++|||+|++++|.|.|+....|+.|+.- |...|-..-+.          -..|...-...|||+|++|+|+|....-|+
T Consensus       346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~----------p~~~~~F~~~~KPYrCevC~KRYKNlNGLK  415 (423)
T COG5189         346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPS----------PEKMNIFSAKDKPYRCEVCDKRYKNLNGLK  415 (423)
T ss_pred             cCceecCCCCCchhhhccccchhhhhhccccCcccCCCCC----------ccccccccccCCceeccccchhhccCccce
Confidence            3699999999999999999999999874 65544333221          111111133579999999999999999999


Q ss_pred             HHHH
Q psy679          449 RHAK  452 (510)
Q Consensus       449 rHlR  452 (510)
                      .|++
T Consensus       416 YHr~  419 (423)
T COG5189         416 YHRK  419 (423)
T ss_pred             eccc
Confidence            9864


No 21 
>PHA00732 hypothetical protein
Probab=97.68  E-value=1.8e-05  Score=66.54  Aligned_cols=47  Identities=26%  Similarity=0.506  Sum_probs=37.0

Q ss_pred             CcccCCCCCCCccCChHHHHHHHHH-hcCCCceecccCCCccCCchhhhhhhhccc
Q psy679          373 PYACPVENCDRRFSRSDELTRHIRI-HTGQKPFQCRICMRSFSRSDHLTTHIRTHT  427 (510)
Q Consensus       373 Py~C~~e~CgKsFssss~LkrH~rt-HtgeKPfkC~~CgKsF~~ks~L~rH~rtHt  427 (510)
                      ||.|  +.|++.|.+...|+.|++. |.   ++.|..|++.|.   .+..|.+++.
T Consensus         1 py~C--~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~~~~   48 (79)
T PHA00732          1 MFKC--PICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFYSQY   48 (79)
T ss_pred             CccC--CCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---ChhhhhcccC
Confidence            5788  5599999999999999984 65   357999999987   4778886654


No 22 
>PHA00616 hypothetical protein
Probab=97.64  E-value=1.9e-05  Score=59.47  Aligned_cols=31  Identities=26%  Similarity=0.382  Sum_probs=16.5

Q ss_pred             ceecCcCCCccCChHHHHHHHHHhhcccchh
Q psy679          431 PFSCDVCGRKFARSDEKKRHAKVHLKQRSKK  461 (510)
Q Consensus       431 PfkC~~CgKaFsrsssLkrHlRtHtgeK~~k  461 (510)
                      ||.|..||+.|....+|.+|++.|++++++.
T Consensus         1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~   31 (44)
T PHA00616          1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLT   31 (44)
T ss_pred             CCccchhhHHHhhHHHHHHHHHHhcCCCccc
Confidence            3455555555555555555555555555443


No 23 
>KOG3993|consensus
Probab=97.42  E-value=4.6e-05  Score=80.82  Aligned_cols=53  Identities=36%  Similarity=0.690  Sum_probs=45.6

Q ss_pred             CcccCCCCCCCccCChHHHHHHHHHhcCC---------------------------------CceecccCCCccCCchhh
Q psy679          373 PYACPVENCDRRFSRSDELTRHIRIHTGQ---------------------------------KPFQCRICMRSFSRSDHL  419 (510)
Q Consensus       373 Py~C~~e~CgKsFssss~LkrH~rtHtge---------------------------------KPfkC~~CgKsF~~ks~L  419 (510)
                      .|+|  .+|+|.|....||..|+|+|.-.                                 --|.|.+|+|.|.+...|
T Consensus       295 EYrC--PEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea~rsg~dss~gi~~C~~C~KkFrRqAYL  372 (500)
T KOG3993|consen  295 EYRC--PECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEAERSGDDSSSGIFSCHTCGKKFRRQAYL  372 (500)
T ss_pred             eecC--CcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhccccCCcccCceeecHHhhhhhHHHHHH
Confidence            4889  77999999999999999999521                                 138999999999999999


Q ss_pred             hhhhhccc
Q psy679          420 TTHIRTHT  427 (510)
Q Consensus       420 ~rH~rtHt  427 (510)
                      +.|..+|.
T Consensus       373 rKHqlthq  380 (500)
T KOG3993|consen  373 RKHQLTHQ  380 (500)
T ss_pred             HHhHHhhh
Confidence            99976654


No 24 
>PHA00616 hypothetical protein
Probab=97.33  E-value=9.9e-05  Score=55.69  Aligned_cols=34  Identities=21%  Similarity=0.415  Sum_probs=26.0

Q ss_pred             CcccCCCCCCCccCChHHHHHHHHHhcCCCceeccc
Q psy679          373 PYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRI  408 (510)
Q Consensus       373 Py~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~  408 (510)
                      ||+|  ..||+.|..+.+|.+|++.|++++++.|+.
T Consensus         1 pYqC--~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~   34 (44)
T PHA00616          1 MYQC--LRCGGIFRKKKEVIEHLLSVHKQNKLTLEY   34 (44)
T ss_pred             CCcc--chhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence            4677  458888888888888888888888877765


No 25 
>PHA00732 hypothetical protein
Probab=97.31  E-value=0.00016  Score=60.98  Aligned_cols=48  Identities=27%  Similarity=0.475  Sum_probs=40.8

Q ss_pred             ceecccCCCccCCchhhhhhhhc-ccCCCceecCcCCCccCChHHHHHHHHHhhc
Q psy679          403 PFQCRICMRSFSRSDHLTTHIRT-HTGEKPFSCDVCGRKFARSDEKKRHAKVHLK  456 (510)
Q Consensus       403 PfkC~~CgKsF~~ks~L~rH~rt-HtGeKPfkC~~CgKaFsrsssLkrHlRtHtg  456 (510)
                      ||.|..|++.|.....|..|++. |.+   +.|+.|++.|.   .|..|.+++..
T Consensus         1 py~C~~Cgk~F~s~s~Lk~H~r~~H~~---~~C~~CgKsF~---~l~~H~~~~~~   49 (79)
T PHA00732          1 MFKCPICGFTTVTLFALKQHARRNHTL---TKCPVCNKSYR---RLNQHFYSQYD   49 (79)
T ss_pred             CccCCCCCCccCCHHHHHHHhhcccCC---CccCCCCCEeC---ChhhhhcccCC
Confidence            58999999999999999999985 653   68999999998   47888866543


No 26 
>PF00096 zf-C2H2:  Zinc finger, C2H2 type;  InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain.  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.15  E-value=0.0003  Score=44.91  Aligned_cols=23  Identities=35%  Similarity=0.973  Sum_probs=20.5

Q ss_pred             eecCcCCCccCChHHHHHHHHHh
Q psy679          432 FSCDVCGRKFARSDEKKRHAKVH  454 (510)
Q Consensus       432 fkC~~CgKaFsrsssLkrHlRtH  454 (510)
                      |+|..|++.|.+...|++|++.|
T Consensus         1 y~C~~C~~~f~~~~~l~~H~~~H   23 (23)
T PF00096_consen    1 YKCPICGKSFSSKSNLKRHMRRH   23 (23)
T ss_dssp             EEETTTTEEESSHHHHHHHHHHH
T ss_pred             CCCCCCCCccCCHHHHHHHHhHC
Confidence            78999999999999999999875


No 27 
>PF05605 zf-Di19:  Drought induced 19 protein (Di19), zinc-binding;  InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=96.99  E-value=0.00059  Score=52.98  Aligned_cols=49  Identities=24%  Similarity=0.517  Sum_probs=23.3

Q ss_pred             eecccCCCccCCchhhhhhhhc-ccC-CCceecCcCCCccCChHHHHHHHHHhh
Q psy679          404 FQCRICMRSFSRSDHLTTHIRT-HTG-EKPFSCDVCGRKFARSDEKKRHAKVHL  455 (510)
Q Consensus       404 fkC~~CgKsF~~ks~L~rH~rt-HtG-eKPfkC~~CgKaFsrsssLkrHlRtHt  455 (510)
                      |.|++|++ ......|..|... |.. .+.+.|++|...+.  .+|.+|++.++
T Consensus         3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H   53 (54)
T PF05605_consen    3 FTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH   53 (54)
T ss_pred             cCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence            45555555 2333445555433 222 23455666655433  25666665544


No 28 
>PF13894 zf-C2H2_4:  C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=96.75  E-value=0.0012  Score=41.51  Aligned_cols=24  Identities=33%  Similarity=0.924  Sum_probs=19.2

Q ss_pred             eecCcCCCccCChHHHHHHHHHhh
Q psy679          432 FSCDVCGRKFARSDEKKRHAKVHL  455 (510)
Q Consensus       432 fkC~~CgKaFsrsssLkrHlRtHt  455 (510)
                      |.|+.|++.|.....|+.|+++|+
T Consensus         1 ~~C~~C~~~~~~~~~l~~H~~~~H   24 (24)
T PF13894_consen    1 FQCPICGKSFRSKSELRQHMRTHH   24 (24)
T ss_dssp             EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred             CCCcCCCCcCCcHHHHHHHHHhhC
Confidence            689999999999999999998874


No 29 
>PF12756 zf-C2H2_2:  C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=96.69  E-value=0.0013  Score=54.98  Aligned_cols=74  Identities=22%  Similarity=0.512  Sum_probs=22.5

Q ss_pred             ccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChHHHHHHHHHh
Q psy679          375 ACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSDEKKRHAKVH  454 (510)
Q Consensus       375 ~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrsssLkrHlRtH  454 (510)
                      .|.+  |+..|.....|..|+....+...   . ....+.....+..+.+... ...+.|..|++.|.....|..|++.+
T Consensus         1 ~C~~--C~~~f~~~~~l~~H~~~~H~~~~---~-~~~~l~~~~~~~~~~~~~~-~~~~~C~~C~~~f~s~~~l~~Hm~~~   73 (100)
T PF12756_consen    1 QCLF--CDESFSSVDDLLQHMKKKHGFDI---P-DQKYLVDPNRLLNYLRKKV-KESFRCPYCNKTFRSREALQEHMRSK   73 (100)
T ss_dssp             -------------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred             Cccc--ccccccccccccccccccccccc---c-ccccccccccccccccccc-CCCCCCCccCCCCcCHHHHHHHHcCc
Confidence            3755  99999999999999976443211   1 1122223344444444322 23699999999999999999999975


Q ss_pred             h
Q psy679          455 L  455 (510)
Q Consensus       455 t  455 (510)
                      .
T Consensus        74 ~   74 (100)
T PF12756_consen   74 H   74 (100)
T ss_dssp             T
T ss_pred             c
Confidence            4


No 30 
>PF05605 zf-Di19:  Drought induced 19 protein (Di19), zinc-binding;  InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=96.47  E-value=0.0028  Score=49.24  Aligned_cols=51  Identities=25%  Similarity=0.597  Sum_probs=39.6

Q ss_pred             CcccCCCCCCCccCChHHHHHHHHH-hcCC-CceecccCCCccCCchhhhhhhhcccC
Q psy679          373 PYACPVENCDRRFSRSDELTRHIRI-HTGQ-KPFQCRICMRSFSRSDHLTTHIRTHTG  428 (510)
Q Consensus       373 Py~C~~e~CgKsFssss~LkrH~rt-Htge-KPfkC~~CgKsF~~ks~L~rH~rtHtG  428 (510)
                      .|.|++  |++ ..+...|..|... |..+ +.+.|++|...+.  .+|.+|++.+++
T Consensus         2 ~f~CP~--C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H~   54 (54)
T PF05605_consen    2 SFTCPY--CGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQHR   54 (54)
T ss_pred             CcCCCC--CCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhcC
Confidence            488966  999 5567889999875 6654 5799999998755  489999987653


No 31 
>PF13912 zf-C2H2_6:  C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.37  E-value=0.0015  Score=43.32  Aligned_cols=26  Identities=35%  Similarity=0.887  Sum_probs=21.0

Q ss_pred             ceecCcCCCccCChHHHHHHHHHhhc
Q psy679          431 PFSCDVCGRKFARSDEKKRHAKVHLK  456 (510)
Q Consensus       431 PfkC~~CgKaFsrsssLkrHlRtHtg  456 (510)
                      +|.|..|++.|.....|..|++.|.+
T Consensus         1 ~~~C~~C~~~F~~~~~l~~H~~~h~~   26 (27)
T PF13912_consen    1 PFECDECGKTFSSLSALREHKRSHCS   26 (27)
T ss_dssp             SEEETTTTEEESSHHHHHHHHCTTTT
T ss_pred             CCCCCccCCccCChhHHHHHhHHhcC
Confidence            57888888888888888888877753


No 32 
>PF00096 zf-C2H2:  Zinc finger, C2H2 type;  InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain.  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=96.11  E-value=0.0036  Score=39.88  Aligned_cols=23  Identities=43%  Similarity=0.953  Sum_probs=17.5

Q ss_pred             cccCCCCCCCccCChHHHHHHHHHh
Q psy679          374 YACPVENCDRRFSRSDELTRHIRIH  398 (510)
Q Consensus       374 y~C~~e~CgKsFssss~LkrH~rtH  398 (510)
                      |.|  +.|++.|.+...|.+|++.|
T Consensus         1 y~C--~~C~~~f~~~~~l~~H~~~H   23 (23)
T PF00096_consen    1 YKC--PICGKSFSSKSNLKRHMRRH   23 (23)
T ss_dssp             EEE--TTTTEEESSHHHHHHHHHHH
T ss_pred             CCC--CCCCCccCCHHHHHHHHhHC
Confidence            567  44888888888888888765


No 33 
>PRK04860 hypothetical protein; Provisional
Probab=95.78  E-value=0.0038  Score=59.25  Aligned_cols=37  Identities=27%  Similarity=0.743  Sum_probs=19.2

Q ss_pred             ceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCC
Q psy679          403 PFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFAR  443 (510)
Q Consensus       403 PfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsr  443 (510)
                      +|.|. |++   ....+.+|.++|+++++|+|..|++.|..
T Consensus       119 ~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~  155 (160)
T PRK04860        119 PYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVF  155 (160)
T ss_pred             EEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEE
Confidence            35554 554   34445555555555555555555555543


No 34 
>smart00355 ZnF_C2H2 zinc finger.
Probab=95.66  E-value=0.0099  Score=37.59  Aligned_cols=24  Identities=29%  Similarity=0.764  Sum_probs=20.6

Q ss_pred             eecCcCCCccCChHHHHHHHHHhh
Q psy679          432 FSCDVCGRKFARSDEKKRHAKVHL  455 (510)
Q Consensus       432 fkC~~CgKaFsrsssLkrHlRtHt  455 (510)
                      |.|..|++.|.....|..|++.|.
T Consensus         1 ~~C~~C~~~f~~~~~l~~H~~~H~   24 (26)
T smart00355        1 YRCPECGKVFKSKSALKEHMRTHX   24 (26)
T ss_pred             CCCCCCcchhCCHHHHHHHHHHhc
Confidence            578899999999999999988765


No 35 
>PF09237 GAGA:  GAGA factor;  InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=95.32  E-value=0.019  Score=44.90  Aligned_cols=33  Identities=24%  Similarity=0.618  Sum_probs=23.4

Q ss_pred             cCCCceecCcCCCccCChHHHHHHHHHhhcccc
Q psy679          427 TGEKPFSCDVCGRKFARSDEKKRHAKVHLKQRS  459 (510)
Q Consensus       427 tGeKPfkC~~CgKaFsrsssLkrHlRtHtgeK~  459 (510)
                      ..+.|..|++|+..+.+..+|++|+..+++.|+
T Consensus        20 ~S~~PatCP~C~a~~~~srnLrRHle~~H~~k~   52 (54)
T PF09237_consen   20 QSEQPATCPICGAVIRQSRNLRRHLEIRHFKKP   52 (54)
T ss_dssp             TTS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred             ccCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence            346889999999999999999999998888765


No 36 
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=94.79  E-value=0.0089  Score=60.75  Aligned_cols=73  Identities=41%  Similarity=0.716  Sum_probs=61.4

Q ss_pred             CCcccCCCCCCCccCChHHHHHHHH--HhcCC--Cceecc--cCCCccCCchhhhhhhhcccCCCceecCc--CCCccCC
Q psy679          372 RPYACPVENCDRRFSRSDELTRHIR--IHTGQ--KPFQCR--ICMRSFSRSDHLTTHIRTHTGEKPFSCDV--CGRKFAR  443 (510)
Q Consensus       372 KPy~C~~e~CgKsFssss~LkrH~r--tHtge--KPfkC~--~CgKsF~~ks~L~rH~rtHtGeKPfkC~~--CgKaFsr  443 (510)
                      .++.|  ..|...|.+...|.+|.+  .|.++  +++.|+  .|++.|.+...+.+|..+|++.++++|..  |.+.+..
T Consensus       288 ~~~~~--~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  365 (467)
T COG5048         288 LPIKS--KQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSP  365 (467)
T ss_pred             cCCCC--ccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCcccccc
Confidence            46777  569999999999999999  79999  999999  79999999999999999999988888864  6665554


Q ss_pred             hHH
Q psy679          444 SDE  446 (510)
Q Consensus       444 sss  446 (510)
                      ...
T Consensus       366 ~~~  368 (467)
T COG5048         366 LLN  368 (467)
T ss_pred             ccC
Confidence            433


No 37 
>PF13894 zf-C2H2_4:  C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=94.73  E-value=0.026  Score=35.31  Aligned_cols=23  Identities=39%  Similarity=1.099  Sum_probs=15.2

Q ss_pred             cccCCCCCCCccCChHHHHHHHHHh
Q psy679          374 YACPVENCDRRFSRSDELTRHIRIH  398 (510)
Q Consensus       374 y~C~~e~CgKsFssss~LkrH~rtH  398 (510)
                      |.|.+  |++.|.+...|..|+++|
T Consensus         1 ~~C~~--C~~~~~~~~~l~~H~~~~   23 (24)
T PF13894_consen    1 FQCPI--CGKSFRSKSELRQHMRTH   23 (24)
T ss_dssp             EE-SS--TS-EESSHHHHHHHHHHH
T ss_pred             CCCcC--CCCcCCcHHHHHHHHHhh
Confidence            56744  888888888888777765


No 38 
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=94.30  E-value=0.022  Score=57.95  Aligned_cols=92  Identities=29%  Similarity=0.465  Sum_probs=55.4

Q ss_pred             CCCCC--CCcccCCCCCCCccCChHHHHHHHHHhcCCCceeccc--CCCccCCchhhhhhhh-----cccCCCceecCc-
Q psy679          367 TPVHE--RPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRI--CMRSFSRSDHLTTHIR-----THTGEKPFSCDV-  436 (510)
Q Consensus       367 th~~E--KPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~--CgKsF~~ks~L~rH~r-----tHtGeKPfkC~~-  436 (510)
                      .|..+  +++.|.+..|++.|.+...+.+|...|.+.+++.|..  |.+.+.....-..+..     .....+.+.|.. 
T Consensus       313 ~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  392 (467)
T COG5048         313 NHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLLNNEPPQSLQQYKDLKNDKKSETLSN  392 (467)
T ss_pred             ccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCccccccccCCCCccchhhccCccCCcccccccc
Confidence            57777  7777763357777777777777777777777766654  4444433333111111     122334555543 


Q ss_pred             -CCCccCChHHHHHHHHHhhccc
Q psy679          437 -CGRKFARSDEKKRHAKVHLKQR  458 (510)
Q Consensus       437 -CgKaFsrsssLkrHlRtHtgeK  458 (510)
                       |.+.+.+...+..|...|...+
T Consensus       393 ~~~~~~~~~~~~~~~~~~~~~~~  415 (467)
T COG5048         393 SCIRNFKRDSNLSLHIITHLSFR  415 (467)
T ss_pred             chhhhhccccccccccccccccC
Confidence             6677777777777777766655


No 39 
>PRK04860 hypothetical protein; Provisional
Probab=94.15  E-value=0.031  Score=53.12  Aligned_cols=40  Identities=28%  Similarity=0.694  Sum_probs=33.8

Q ss_pred             CCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCch
Q psy679          372 RPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSD  417 (510)
Q Consensus       372 KPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks  417 (510)
                      -+|.|  . |++   ....+.+|.++|+++++|.|..|++.|....
T Consensus       118 ~~Y~C--~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~~  157 (160)
T PRK04860        118 FPYRC--K-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFKG  157 (160)
T ss_pred             EEEEc--C-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEec
Confidence            36899  4 887   6677999999999999999999999887543


No 40 
>PF13912 zf-C2H2_6:  C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=93.94  E-value=0.028  Score=37.12  Aligned_cols=25  Identities=36%  Similarity=0.877  Sum_probs=15.5

Q ss_pred             CcccCCCCCCCccCChHHHHHHHHHhc
Q psy679          373 PYACPVENCDRRFSRSDELTRHIRIHT  399 (510)
Q Consensus       373 Py~C~~e~CgKsFssss~LkrH~rtHt  399 (510)
                      +|.|  ..|++.|.+...|..|++.|.
T Consensus         1 ~~~C--~~C~~~F~~~~~l~~H~~~h~   25 (27)
T PF13912_consen    1 PFEC--DECGKTFSSLSALREHKRSHC   25 (27)
T ss_dssp             SEEE--TTTTEEESSHHHHHHHHCTTT
T ss_pred             CCCC--CccCCccCChhHHHHHhHHhc
Confidence            3566  336666666666666666653


No 41 
>PF12874 zf-met:  Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=92.99  E-value=0.052  Score=35.15  Aligned_cols=23  Identities=26%  Similarity=0.852  Sum_probs=18.8

Q ss_pred             eecCcCCCccCChHHHHHHHHHh
Q psy679          432 FSCDVCGRKFARSDEKKRHAKVH  454 (510)
Q Consensus       432 fkC~~CgKaFsrsssLkrHlRtH  454 (510)
                      |.|..|++.|.....|+.|++.+
T Consensus         1 ~~C~~C~~~f~s~~~~~~H~~s~   23 (25)
T PF12874_consen    1 FYCDICNKSFSSENSLRQHLRSK   23 (25)
T ss_dssp             EEETTTTEEESSHHHHHHHHTTH
T ss_pred             CCCCCCCCCcCCHHHHHHHHCcC
Confidence            67888888888888888888654


No 42 
>KOG1146|consensus
Probab=92.97  E-value=0.23  Score=59.86  Aligned_cols=80  Identities=18%  Similarity=0.398  Sum_probs=52.7

Q ss_pred             CCCCCCcccCCCCCCCccCChHHHHHHHHH-hcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChHH
Q psy679          368 PVHERPYACPVENCDRRFSRSDELTRHIRI-HTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSDE  446 (510)
Q Consensus       368 h~~EKPy~C~~e~CgKsFssss~LkrH~rt-HtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrsss  446 (510)
                      +...|.|+|+  .|+..|+....|..|||. |...+-   .+|... .....+.+-..---+.++|.|..|...|..+.+
T Consensus       460 ~S~~kt~~cp--kc~~~yk~a~~L~vhmRskhp~~~~---~~c~~g-q~~~~~arg~~~~~~~~p~~C~~C~~stttng~  533 (1406)
T KOG1146|consen  460 HSFFKTLKCP--KCNWHYKLAQTLGVHMRSKHPESQS---AYCKAG-QNHPRLARGEVYRCPGKPYPCRACNYSTTTNGN  533 (1406)
T ss_pred             ecccccccCC--ccchhhhhHHHhhhcccccccccch---hHhHhc-cccccccccccccCCCCcccceeeeeeeecchH
Confidence            4445789995  599999999999999997 433222   333211 111111111111234689999999999999999


Q ss_pred             HHHHHHH
Q psy679          447 KKRHAKV  453 (510)
Q Consensus       447 LkrHlRt  453 (510)
                      |..|+..
T Consensus       534 LsihlqS  540 (1406)
T KOG1146|consen  534 LSIHLQS  540 (1406)
T ss_pred             HHHHHHH
Confidence            9999864


No 43 
>PF13909 zf-H2C2_5:  C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=92.70  E-value=0.12  Score=33.39  Aligned_cols=24  Identities=29%  Similarity=0.555  Sum_probs=17.1

Q ss_pred             eecCcCCCccCChHHHHHHHHHhhc
Q psy679          432 FSCDVCGRKFARSDEKKRHAKVHLK  456 (510)
Q Consensus       432 fkC~~CgKaFsrsssLkrHlRtHtg  456 (510)
                      |+|..|..... ...|.+|+++|++
T Consensus         1 y~C~~C~y~t~-~~~l~~H~~~~H~   24 (24)
T PF13909_consen    1 YKCPHCSYSTS-KSNLKRHLKRHHP   24 (24)
T ss_dssp             EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred             CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence            67888887777 7788888887653


No 44 
>smart00355 ZnF_C2H2 zinc finger.
Probab=92.45  E-value=0.13  Score=32.22  Aligned_cols=24  Identities=33%  Similarity=0.811  Sum_probs=17.7

Q ss_pred             cccCCCCCCCccCChHHHHHHHHHhc
Q psy679          374 YACPVENCDRRFSRSDELTRHIRIHT  399 (510)
Q Consensus       374 y~C~~e~CgKsFssss~LkrH~rtHt  399 (510)
                      |.|  ..|++.|.....|..|++.|.
T Consensus         1 ~~C--~~C~~~f~~~~~l~~H~~~H~   24 (26)
T smart00355        1 YRC--PECGKVFKSKSALKEHMRTHX   24 (26)
T ss_pred             CCC--CCCcchhCCHHHHHHHHHHhc
Confidence            457  348888888888888887664


No 45 
>KOG4173|consensus
Probab=92.42  E-value=0.039  Score=54.39  Aligned_cols=82  Identities=29%  Similarity=0.622  Sum_probs=66.4

Q ss_pred             CCCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhccc----------CCCceecC--cCC
Q psy679          371 ERPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHT----------GEKPFSCD--VCG  438 (510)
Q Consensus       371 EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHt----------GeKPfkC~--~Cg  438 (510)
                      .+.|.|.+.+|...|....+...|..+-++   -.|.+|.+.|....-|..|+...+          |...|.|-  .|+
T Consensus        77 ~~~~~cqvagc~~~~d~lD~~E~hY~~~h~---~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt  153 (253)
T KOG4173|consen   77 VPAFACQVAGCCQVFDALDDYEHHYHTLHG---NSCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCT  153 (253)
T ss_pred             cccccccccchHHHHhhhhhHHHhhhhccc---chhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhh
Confidence            456889999999999998888888865333   269999999999999999986533          45579994  599


Q ss_pred             CccCChHHHHHHH-HHhh
Q psy679          439 RKFARSDEKKRHA-KVHL  455 (510)
Q Consensus       439 KaFsrsssLkrHl-RtHt  455 (510)
                      -.|......+.|+ ++|.
T Consensus       154 ~KFkT~r~RkdH~I~~Hk  171 (253)
T KOG4173|consen  154 EKFKTSRDRKDHMIRMHK  171 (253)
T ss_pred             hhhhhhhhhhhHHHHhcc
Confidence            9999999999998 5664


No 46 
>PF09237 GAGA:  GAGA factor;  InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=90.49  E-value=0.35  Score=37.99  Aligned_cols=32  Identities=38%  Similarity=0.795  Sum_probs=18.3

Q ss_pred             CCCCcccCCCCCCCccCChHHHHHHHHHhcCCCc
Q psy679          370 HERPYACPVENCDRRFSRSDELTRHIRIHTGQKP  403 (510)
Q Consensus       370 ~EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKP  403 (510)
                      .+.|-.|++  |+..+....+|++|+..+++.||
T Consensus        21 S~~PatCP~--C~a~~~~srnLrRHle~~H~~k~   52 (54)
T PF09237_consen   21 SEQPATCPI--CGAVIRQSRNLRRHLEIRHFKKP   52 (54)
T ss_dssp             TS--EE-TT--T--EESSHHHHHHHHHHHTTTS-
T ss_pred             cCCCCCCCc--chhhccchhhHHHHHHHHhcccC
Confidence            356677744  88888888888888776666654


No 47 
>PF12171 zf-C2H2_jaz:  Zinc-finger double-stranded RNA-binding;  InterPro: IPR022755  This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation.   This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=88.96  E-value=0.1  Score=34.83  Aligned_cols=22  Identities=36%  Similarity=0.844  Sum_probs=15.8

Q ss_pred             eecCcCCCccCChHHHHHHHHH
Q psy679          432 FSCDVCGRKFARSDEKKRHAKV  453 (510)
Q Consensus       432 fkC~~CgKaFsrsssLkrHlRt  453 (510)
                      |.|..|++.|.....|..|++.
T Consensus         2 ~~C~~C~k~f~~~~~~~~H~~s   23 (27)
T PF12171_consen    2 FYCDACDKYFSSENQLKQHMKS   23 (27)
T ss_dssp             CBBTTTTBBBSSHHHHHCCTTS
T ss_pred             CCcccCCCCcCCHHHHHHHHcc
Confidence            5677777777777777777643


No 48 
>PF12874 zf-met:  Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=87.16  E-value=0.33  Score=31.30  Aligned_cols=23  Identities=30%  Similarity=0.901  Sum_probs=15.5

Q ss_pred             cccCCCCCCCccCChHHHHHHHHHh
Q psy679          374 YACPVENCDRRFSRSDELTRHIRIH  398 (510)
Q Consensus       374 y~C~~e~CgKsFssss~LkrH~rtH  398 (510)
                      |.|.+  |++.|.+...|..|++.+
T Consensus         1 ~~C~~--C~~~f~s~~~~~~H~~s~   23 (25)
T PF12874_consen    1 FYCDI--CNKSFSSENSLRQHLRSK   23 (25)
T ss_dssp             EEETT--TTEEESSHHHHHHHHTTH
T ss_pred             CCCCC--CCCCcCCHHHHHHHHCcC
Confidence            45644  777777777777777654


No 49 
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=86.51  E-value=0.26  Score=51.56  Aligned_cols=40  Identities=23%  Similarity=0.415  Sum_probs=31.8

Q ss_pred             CCceecCc--CCCccCChHHHHHHHHHhh-------------------cccchhHHHHHHH
Q psy679          429 EKPFSCDV--CGRKFARSDEKKRHAKVHL-------------------KQRSKKESKMAVM  468 (510)
Q Consensus       429 eKPfkC~~--CgKaFsrsssLkrHlRtHt-------------------geK~~k~~~~~~~  468 (510)
                      +|||+|++  |.|.|....-|+.|+.--+                   +.|+|+|+.|.+-
T Consensus       347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KR  407 (423)
T COG5189         347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKR  407 (423)
T ss_pred             CceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchh
Confidence            58999986  9999999999999975422                   3589999998764


No 50 
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=86.45  E-value=0.58  Score=49.49  Aligned_cols=80  Identities=31%  Similarity=0.630  Sum_probs=55.5

Q ss_pred             CcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCC---cc------CCchhhhhhhhcccCCCce----ecCcCCC
Q psy679          373 PYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMR---SF------SRSDHLTTHIRTHTGEKPF----SCDVCGR  439 (510)
Q Consensus       373 Py~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgK---sF------~~ks~L~rH~rtHtGeKPf----kC~~CgK  439 (510)
                      .|.|+...|..+......|+.|.+..++  .+.|.+|-+   .|      .++..|+.|...-..+.-|    .|..|.+
T Consensus       151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~--~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~  228 (493)
T COG5236         151 SFKCPKSKCHRRCGSLKELKKHYKAQHG--FVLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCKI  228 (493)
T ss_pred             HhcCCchhhhhhhhhHHHHHHHHHhhcC--cEEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhhccc
Confidence            3889888888887778889999987443  367777642   33      3455566665432222122    5999999


Q ss_pred             ccCChHHHHHHHHHh
Q psy679          440 KFARSDEKKRHAKVH  454 (510)
Q Consensus       440 aFsrsssLkrHlRtH  454 (510)
                      .|-..+.|.+|+|..
T Consensus       229 ~FYdDDEL~~HcR~~  243 (493)
T COG5236         229 YFYDDDELRRHCRLR  243 (493)
T ss_pred             eecChHHHHHHHHhh
Confidence            999999999988753


No 51 
>KOG2186|consensus
Probab=86.37  E-value=0.44  Score=48.42  Aligned_cols=50  Identities=30%  Similarity=0.610  Sum_probs=36.3

Q ss_pred             eecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChHHHHHHHHHhhc
Q psy679          404 FQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSDEKKRHAKVHLK  456 (510)
Q Consensus       404 fkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrsssLkrHlRtHtg  456 (510)
                      |.|..|+.... +..+.+|+...++ ..|.|-.|++.|.+ ..++.|.+.-+.
T Consensus         4 FtCnvCgEsvK-Kp~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~kCITE   53 (276)
T KOG2186|consen    4 FTCNVCGESVK-KPQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTKCITE   53 (276)
T ss_pred             Eehhhhhhhcc-ccchHHHHHhccC-CeeEEeeccccccc-chhhhhhhhcch
Confidence            67888887664 4556678877766 66888888888887 677788765443


No 52 
>KOG4124|consensus
Probab=84.52  E-value=0.61  Score=49.39  Aligned_cols=72  Identities=26%  Similarity=0.599  Sum_probs=48.1

Q ss_pred             CCCCcccCCCCCCCccCChHHHHHHHHH-hcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChHHHH
Q psy679          370 HERPYACPVENCDRRFSRSDELTRHIRI-HTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSDEKK  448 (510)
Q Consensus       370 ~EKPy~C~~e~CgKsFssss~LkrH~rt-HtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrsssLk  448 (510)
                      ..++|+|+++.|.+.++....|+.|... |....          -.....-.-|.....-.|+|+|++|.+++.....|+
T Consensus       346 ~~~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i----------~~~s~~~~ph~~~~~~nk~~r~~i~~~~~k~~~~l~  415 (442)
T KOG4124|consen  346 VDKPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPI----------TTPTPAPIPHQGFVVENKPYRCEVCSKRYKNLNGLK  415 (442)
T ss_pred             ecCCCCCCCCcchhhcccCcceeeccccCcCCCC----------CCCCCCCCCcceeeeccCcccChhhhhhhccCCCCC
Confidence            3678999999999999998888877653 32211          111111223444444568999999999888777776


Q ss_pred             HHH
Q psy679          449 RHA  451 (510)
Q Consensus       449 rHl  451 (510)
                      .|+
T Consensus       416 ~~~  418 (442)
T KOG4124|consen  416 YHR  418 (442)
T ss_pred             cee
Confidence            664


No 53 
>KOG2231|consensus
Probab=84.48  E-value=0.76  Score=52.48  Aligned_cols=18  Identities=50%  Similarity=0.964  Sum_probs=10.3

Q ss_pred             CCCccCChHHHHHHHHHh
Q psy679          381 CDRRFSRSDELTRHIRIH  398 (510)
Q Consensus       381 CgKsFssss~LkrH~rtH  398 (510)
                      |...|.....|.+|++.+
T Consensus       188 C~~~fld~~el~rH~~~~  205 (669)
T KOG2231|consen  188 CHERFLDDDELYRHLRFD  205 (669)
T ss_pred             hhhhhccHHHHHHhhccc
Confidence            555555555555555543


No 54 
>PF12171 zf-C2H2_jaz:  Zinc-finger double-stranded RNA-binding;  InterPro: IPR022755  This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation.   This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=83.87  E-value=0.51  Score=31.37  Aligned_cols=22  Identities=32%  Similarity=0.824  Sum_probs=13.4

Q ss_pred             eecccCCCccCCchhhhhhhhc
Q psy679          404 FQCRICMRSFSRSDHLTTHIRT  425 (510)
Q Consensus       404 fkC~~CgKsF~~ks~L~rH~rt  425 (510)
                      |.|..|++.|.....+..|++.
T Consensus         2 ~~C~~C~k~f~~~~~~~~H~~s   23 (27)
T PF12171_consen    2 FYCDACDKYFSSENQLKQHMKS   23 (27)
T ss_dssp             CBBTTTTBBBSSHHHHHCCTTS
T ss_pred             CCcccCCCCcCCHHHHHHHHcc
Confidence            4566666666666666666553


No 55 
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=83.46  E-value=0.95  Score=31.30  Aligned_cols=23  Identities=26%  Similarity=0.505  Sum_probs=20.0

Q ss_pred             ceecCcCCCccCChHHHHHHHHH
Q psy679          431 PFSCDVCGRKFARSDEKKRHAKV  453 (510)
Q Consensus       431 PfkC~~CgKaFsrsssLkrHlRt  453 (510)
                      +|.|+.|++.|.....+..|++.
T Consensus         3 ~~~C~~C~~~~~~~~~~~~H~~g   25 (35)
T smart00451        3 GFYCKLCNVTFTDEISVEAHLKG   25 (35)
T ss_pred             CeEccccCCccCCHHHHHHHHCh
Confidence            58899999999999999999854


No 56 
>PF13913 zf-C2HC_2:  zinc-finger of a C2HC-type
Probab=82.93  E-value=1.1  Score=29.84  Aligned_cols=21  Identities=43%  Similarity=1.067  Sum_probs=16.0

Q ss_pred             eecCcCCCccCChHHHHHHHHH
Q psy679          432 FSCDVCGRKFARSDEKKRHAKV  453 (510)
Q Consensus       432 fkC~~CgKaFsrsssLkrHlRt  453 (510)
                      ..|..||+.| ..+.|.+|++.
T Consensus         3 ~~C~~CgR~F-~~~~l~~H~~~   23 (25)
T PF13913_consen    3 VPCPICGRKF-NPDRLEKHEKI   23 (25)
T ss_pred             CcCCCCCCEE-CHHHHHHHHHh
Confidence            3688888888 66788888764


No 57 
>KOG2231|consensus
Probab=80.28  E-value=1.3  Score=50.59  Aligned_cols=13  Identities=31%  Similarity=0.621  Sum_probs=9.2

Q ss_pred             CChHHHHHHHH-Hh
Q psy679          386 SRSDELTRHIR-IH  398 (510)
Q Consensus       386 ssss~LkrH~r-tH  398 (510)
                      .....|+.|++ .|
T Consensus       125 ~s~~~Lk~H~~~~H  138 (669)
T KOG2231|consen  125 KSVENLKNHMRDQH  138 (669)
T ss_pred             hHHHHHHHHHHHhh
Confidence            36778888885 35


No 58 
>PF13909 zf-H2C2_5:  C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=80.24  E-value=1.5  Score=28.07  Aligned_cols=23  Identities=39%  Similarity=0.909  Sum_probs=13.2

Q ss_pred             cccCCCCCCCccCChHHHHHHHHHhc
Q psy679          374 YACPVENCDRRFSRSDELTRHIRIHT  399 (510)
Q Consensus       374 y~C~~e~CgKsFssss~LkrH~rtHt  399 (510)
                      |.|.  .|..... ...|.+|++.|.
T Consensus         1 y~C~--~C~y~t~-~~~l~~H~~~~H   23 (24)
T PF13909_consen    1 YKCP--HCSYSTS-KSNLKRHLKRHH   23 (24)
T ss_dssp             EE-S--SSS-EES-HHHHHHHHHHHH
T ss_pred             CCCC--CCCCcCC-HHHHHHHHHhhC
Confidence            4563  3776666 667777776643


No 59 
>PF09986 DUF2225:  Uncharacterized protein conserved in bacteria (DUF2225);  InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=76.91  E-value=1.2  Score=43.94  Aligned_cols=42  Identities=26%  Similarity=0.598  Sum_probs=25.4

Q ss_pred             CceecccCCCccCCchhhhhhhhc----------ccCCCc-----eecCcCCCccCC
Q psy679          402 KPFQCRICMRSFSRSDHLTTHIRT----------HTGEKP-----FSCDVCGRKFAR  443 (510)
Q Consensus       402 KPfkC~~CgKsF~~ks~L~rH~rt----------HtGeKP-----fkC~~CgKaFsr  443 (510)
                      |.+.|++|++.|..+.-+....+.          ..+..|     ..|+.||.+|..
T Consensus         4 k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~   60 (214)
T PF09986_consen    4 KKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFE   60 (214)
T ss_pred             CceECCCCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCcccc
Confidence            456677777777666555544432          112333     479999988764


No 60 
>KOG2893|consensus
Probab=75.57  E-value=1.5  Score=44.44  Aligned_cols=46  Identities=30%  Similarity=0.753  Sum_probs=35.3

Q ss_pred             CCCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhh
Q psy679          371 ERPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHI  423 (510)
Q Consensus       371 EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~  423 (510)
                      .|++ |-  .|++-|....-|..|.+.    |.|+|-+|.|....-..|..|.
T Consensus         9 ~kpw-cw--ycnrefddekiliqhqka----khfkchichkkl~sgpglsihc   54 (341)
T KOG2893|consen    9 DKPW-CW--YCNREFDDEKILIQHQKA----KHFKCHICHKKLFSGPGLSIHC   54 (341)
T ss_pred             CCce-ee--ecccccchhhhhhhhhhh----ccceeeeehhhhccCCCceeeh
Confidence            4555 53  399999998888888764    5699999998887777777774


No 61 
>KOG1146|consensus
Probab=72.07  E-value=1.6  Score=53.06  Aligned_cols=78  Identities=18%  Similarity=0.351  Sum_probs=58.0

Q ss_pred             CcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhc-----------------ccCCCceecC
Q psy679          373 PYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRT-----------------HTGEKPFSCD  435 (510)
Q Consensus       373 Py~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rt-----------------HtGeKPfkC~  435 (510)
                      .+.|  ..|.+.|...-.+. |+-.   ..+|.|..|...|.....|..|.+.                 |...++| |.
T Consensus      1260 e~~c--~~~~~~~~~~~~~~-~l~~---~~~~~~~~~~~~~~~~~~l~~~~~k~~~~~~~~~~~~~~~l~~~d~~~~-c~ 1332 (1406)
T KOG1146|consen 1260 EGEC--GAVDELLTPSFGIS-TLDV---THRYLCRQCKMAFDGEAPLTAHQRKFCFAGRGSGGSMPPPLRVPDCTYH-CL 1332 (1406)
T ss_pred             cchh--hhccccccCcccee-eccc---chhHHHHHHHhhhcchhHHHHHHHHHHhccCccccCCCCcccCcccccc-ch
Confidence            3678  44888888776666 5543   3468899999888888888888642                 2223456 99


Q ss_pred             cCCCccCChHHHHHHHHHhhcc
Q psy679          436 VCGRKFARSDEKKRHAKVHLKQ  457 (510)
Q Consensus       436 ~CgKaFsrsssLkrHlRtHtge  457 (510)
                      .|...|.....|..|+|.-+++
T Consensus      1333 ~c~~~~~~~~alqihm~~~~~~ 1354 (1406)
T KOG1146|consen 1333 ACEVLLSGREALQIHMRSSAHR 1354 (1406)
T ss_pred             HHHhhcchhHHHHHHHHHhhhc
Confidence            9999999999999999864443


No 62 
>PF09538 FYDLN_acid:  Protein of unknown function (FYDLN_acid);  InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=71.48  E-value=2.5  Score=37.86  Aligned_cols=30  Identities=30%  Similarity=0.738  Sum_probs=18.7

Q ss_pred             eecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCCh
Q psy679          404 FQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARS  444 (510)
Q Consensus       404 fkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrs  444 (510)
                      ..|+.||++|+.   |        +..|..|++||..|.-.
T Consensus        10 R~Cp~CG~kFYD---L--------nk~PivCP~CG~~~~~~   39 (108)
T PF09538_consen   10 RTCPSCGAKFYD---L--------NKDPIVCPKCGTEFPPE   39 (108)
T ss_pred             ccCCCCcchhcc---C--------CCCCccCCCCCCccCcc
Confidence            467777777753   2        12566777777777654


No 63 
>KOG2893|consensus
Probab=69.99  E-value=1.1  Score=45.47  Aligned_cols=44  Identities=32%  Similarity=0.668  Sum_probs=34.6

Q ss_pred             CceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChHHHHHH
Q psy679          402 KPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSDEKKRH  450 (510)
Q Consensus       402 KPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrsssLkrH  450 (510)
                      |+ -|-+|++-|....-|..|++.    |-|+|.+|.|.....--|..|
T Consensus        10 kp-wcwycnrefddekiliqhqka----khfkchichkkl~sgpglsih   53 (341)
T KOG2893|consen   10 KP-WCWYCNREFDDEKILIQHQKA----KHFKCHICHKKLFSGPGLSIH   53 (341)
T ss_pred             Cc-eeeecccccchhhhhhhhhhh----ccceeeeehhhhccCCCceee
Confidence            44 388899999999999988864    569999999876665556555


No 64 
>KOG4167|consensus
Probab=67.74  E-value=2.6  Score=48.47  Aligned_cols=26  Identities=31%  Similarity=0.721  Sum_probs=24.1

Q ss_pred             ceecCcCCCccCChHHHHHHHHHhhc
Q psy679          431 PFSCDVCGRKFARSDEKKRHAKVHLK  456 (510)
Q Consensus       431 PfkC~~CgKaFsrsssLkrHlRtHtg  456 (510)
                      -|.|..|+|.|.....+..|||+|.-
T Consensus       792 iFpCreC~kvF~KiKSrNAHMK~Hr~  817 (907)
T KOG4167|consen  792 IFPCRECGKVFFKIKSRNAHMKTHRQ  817 (907)
T ss_pred             eeehHHHHHHHHHHhhhhHHHHHHHH
Confidence            48999999999999999999999964


No 65 
>PF12013 DUF3505:  Protein of unknown function (DUF3505);  InterPro: IPR022698  This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains. 
Probab=67.19  E-value=7.3  Score=34.13  Aligned_cols=25  Identities=16%  Similarity=0.404  Sum_probs=23.1

Q ss_pred             eec----CcCCCccCChHHHHHHHHHhhc
Q psy679          432 FSC----DVCGRKFARSDEKKRHAKVHLK  456 (510)
Q Consensus       432 fkC----~~CgKaFsrsssLkrHlRtHtg  456 (510)
                      |.|    ..|+..+.....+++|.+.++|
T Consensus        81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg  109 (109)
T PF12013_consen   81 YRCQCDPPHCGYITRSKKTMRKHWRKEHG  109 (109)
T ss_pred             eeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence            899    8999999999999999998775


No 66 
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=65.15  E-value=3  Score=44.38  Aligned_cols=68  Identities=28%  Similarity=0.534  Sum_probs=33.6

Q ss_pred             CCCccCChHHHHHHHHHhcCCCceecccCCC----ccCCchhhhhhhhcccCCCceecCc--C--C--CccCChHHHHHH
Q psy679          381 CDRRFSRSDELTRHIRIHTGQKPFQCRICMR----SFSRSDHLTTHIRTHTGEKPFSCDV--C--G--RKFARSDEKKRH  450 (510)
Q Consensus       381 CgKsFssss~LkrH~rtHtgeKPfkC~~CgK----sF~~ks~L~rH~rtHtGeKPfkC~~--C--g--KaFsrsssLkrH  450 (510)
                      |.+.|-....|.+|+|..+ ++-|.|..-+.    -|..-..|.+|.+.    .-|.|.+  |  |  ..|.....|..|
T Consensus       226 C~~~FYdDDEL~~HcR~~H-E~ChICD~v~p~~~QYFK~Y~~Le~HF~~----~hy~ct~qtc~~~k~~vf~~~~el~~h  300 (493)
T COG5236         226 CKIYFYDDDELRRHCRLRH-EACHICDMVGPIRYQYFKSYEDLEAHFRN----AHYCCTFQTCRVGKCYVFPYHTELLEH  300 (493)
T ss_pred             ccceecChHHHHHHHHhhh-hhhhhhhccCccchhhhhCHHHHHHHhhc----CceEEEEEEEecCcEEEeccHHHHHHH
Confidence            6666766667766666532 22233322211    24444555555442    2255543  3  2  246666666666


Q ss_pred             HHH
Q psy679          451 AKV  453 (510)
Q Consensus       451 lRt  453 (510)
                      +-.
T Consensus       301 ~~~  303 (493)
T COG5236         301 LTR  303 (493)
T ss_pred             HHH
Confidence            643


No 67 
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=64.09  E-value=3.4  Score=39.20  Aligned_cols=14  Identities=21%  Similarity=0.612  Sum_probs=9.3

Q ss_pred             eecccCCCccCCch
Q psy679          404 FQCRICMRSFSRSD  417 (510)
Q Consensus       404 fkC~~CgKsF~~ks  417 (510)
                      +.|+.|++.|....
T Consensus        29 ~~c~~c~~~f~~~e   42 (154)
T PRK00464         29 RECLACGKRFTTFE   42 (154)
T ss_pred             eeccccCCcceEeE
Confidence            67777777776543


No 68 
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer.  Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain.  Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=64.05  E-value=4.6  Score=28.41  Aligned_cols=10  Identities=50%  Similarity=1.381  Sum_probs=6.3

Q ss_pred             CceecCcCCC
Q psy679          430 KPFSCDVCGR  439 (510)
Q Consensus       430 KPfkC~~CgK  439 (510)
                      .++.|+.||.
T Consensus        16 ~~~~CP~Cg~   25 (33)
T cd00350          16 APWVCPVCGA   25 (33)
T ss_pred             CCCcCcCCCC
Confidence            5566777764


No 69 
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=63.22  E-value=4  Score=32.80  Aligned_cols=29  Identities=31%  Similarity=0.613  Sum_probs=23.7

Q ss_pred             cccCCCceecCcCCCccCChHHHHHHHHH
Q psy679          425 THTGEKPFSCDVCGRKFARSDEKKRHAKV  453 (510)
Q Consensus       425 tHtGeKPfkC~~CgKaFsrsssLkrHlRt  453 (510)
                      .-.||--++|+-|++.|.....+.+|...
T Consensus        11 ~RDGE~~lrCPRC~~~FR~~K~Y~RHVNK   39 (65)
T COG4049          11 DRDGEEFLRCPRCGMVFRRRKDYIRHVNK   39 (65)
T ss_pred             ccCCceeeeCCchhHHHHHhHHHHHHhhH
Confidence            34577778999999999999999999854


No 70 
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=62.71  E-value=6.3  Score=27.10  Aligned_cols=23  Identities=17%  Similarity=0.557  Sum_probs=17.6

Q ss_pred             CcccCCCCCCCccCChHHHHHHHHH
Q psy679          373 PYACPVENCDRRFSRSDELTRHIRI  397 (510)
Q Consensus       373 Py~C~~e~CgKsFssss~LkrH~rt  397 (510)
                      +|.|.+  |++.|.+...+..|++.
T Consensus         3 ~~~C~~--C~~~~~~~~~~~~H~~g   25 (35)
T smart00451        3 GFYCKL--CNVTFTDEISVEAHLKG   25 (35)
T ss_pred             CeEccc--cCCccCCHHHHHHHHCh
Confidence            467844  88888888888888764


No 71 
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=62.47  E-value=4.5  Score=35.11  Aligned_cols=32  Identities=38%  Similarity=0.688  Sum_probs=23.1

Q ss_pred             CceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCC
Q psy679          402 KPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFAR  443 (510)
Q Consensus       402 KPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsr  443 (510)
                      ..|.|+.|++.        .+.|+-+  .-+.|..||..|+-
T Consensus        34 ~~~~Cp~C~~~--------~VkR~a~--GIW~C~kCg~~fAG   65 (89)
T COG1997          34 AKHVCPFCGRT--------TVKRIAT--GIWKCRKCGAKFAG   65 (89)
T ss_pred             cCCcCCCCCCc--------ceeeecc--CeEEcCCCCCeecc
Confidence            46899999875        2444444  45899999998864


No 72 
>PF12756 zf-C2H2_2:  C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=62.39  E-value=6.2  Score=32.60  Aligned_cols=24  Identities=29%  Similarity=0.897  Sum_probs=20.3

Q ss_pred             CcccCCCCCCCccCChHHHHHHHHHh
Q psy679          373 PYACPVENCDRRFSRSDELTRHIRIH  398 (510)
Q Consensus       373 Py~C~~e~CgKsFssss~LkrH~rtH  398 (510)
                      .+.|.+  |++.|.+...|..||+.+
T Consensus        50 ~~~C~~--C~~~f~s~~~l~~Hm~~~   73 (100)
T PF12756_consen   50 SFRCPY--CNKTFRSREALQEHMRSK   73 (100)
T ss_dssp             SEEBSS--SS-EESSHHHHHHHHHHT
T ss_pred             CCCCCc--cCCCCcCHHHHHHHHcCc
Confidence            688966  999999999999999975


No 73 
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=61.23  E-value=12  Score=33.93  Aligned_cols=80  Identities=21%  Similarity=0.357  Sum_probs=50.7

Q ss_pred             CCcccCCCCCCCccCChHHHHHHHHHhcCCCc------------eecccCCCccCCchhhhhhhhcccCCCceecCcCCC
Q psy679          372 RPYACPVENCDRRFSRSDELTRHIRIHTGQKP------------FQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGR  439 (510)
Q Consensus       372 KPy~C~~e~CgKsFssss~LkrH~rtHtgeKP------------fkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgK  439 (510)
                      -|-.|++  |+-..-.+.+|.|...---.-++            ..|--|.+.|........-.  -.....|+|..|..
T Consensus        14 LP~~Cpi--CgLtLVss~HLARSyHHLfPl~~f~ev~~~~~~~~~~C~~C~~~f~~~~~~~~~~--~~~~~~y~C~~C~~   89 (112)
T TIGR00622        14 LPVECPI--CGLTLILSTHLARSYHHLFPLKAFQEIPLEEYNGSRFCFGCQGPFPKPPVSPFDE--LKDSHRYVCAVCKN   89 (112)
T ss_pred             CCCcCCc--CCCEEeccchHHHhhhccCCCcccccccccccCCCCcccCcCCCCCCcccccccc--cccccceeCCCCCC
Confidence            3556866  99998888888764211001111            23888888887543211100  12235799999999


Q ss_pred             ccCChHHHHHHHHHhh
Q psy679          440 KFARSDEKKRHAKVHL  455 (510)
Q Consensus       440 aFsrsssLkrHlRtHt  455 (510)
                      .|-..-....|...|.
T Consensus        90 ~FC~dCD~fiHe~Lh~  105 (112)
T TIGR00622        90 VFCVDCDVFVHESLHC  105 (112)
T ss_pred             ccccccchhhhhhccC
Confidence            9998888888887775


No 74 
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=57.80  E-value=6.9  Score=36.17  Aligned_cols=33  Identities=18%  Similarity=0.291  Sum_probs=21.5

Q ss_pred             eecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChHHH
Q psy679          404 FQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSDEK  447 (510)
Q Consensus       404 fkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrsssL  447 (510)
                      ..|+.|+++|+.   |        +..|..|++||..|.....+
T Consensus        10 r~Cp~cg~kFYD---L--------nk~p~vcP~cg~~~~~~~~~   42 (129)
T TIGR02300        10 RICPNTGSKFYD---L--------NRRPAVSPYTGEQFPPEEAL   42 (129)
T ss_pred             ccCCCcCccccc---c--------CCCCccCCCcCCccCcchhh
Confidence            468888887754   2        23577788888777655333


No 75 
>PF02892 zf-BED:  BED zinc finger;  InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=56.71  E-value=7.4  Score=28.50  Aligned_cols=26  Identities=35%  Similarity=0.593  Sum_probs=15.8

Q ss_pred             CCCceecCcCCCccCCh----HHHHHHHHH
Q psy679          428 GEKPFSCDVCGRKFARS----DEKKRHAKV  453 (510)
Q Consensus       428 GeKPfkC~~CgKaFsrs----ssLkrHlRt  453 (510)
                      ++...+|..|++.+...    +.|.+|++.
T Consensus        13 ~~~~a~C~~C~~~~~~~~~~ts~l~~HL~~   42 (45)
T PF02892_consen   13 DKKKAKCKYCGKVIKYSSGGTSNLKRHLKK   42 (45)
T ss_dssp             CSS-EEETTTTEE-----SSTHHHHHHHHH
T ss_pred             CcCeEEeCCCCeEEeeCCCcHHHHHHhhhh
Confidence            45567888898887764    778888843


No 76 
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=54.28  E-value=9  Score=29.13  Aligned_cols=24  Identities=38%  Similarity=0.818  Sum_probs=17.3

Q ss_pred             eecCcCCCccCCh-----HHHHHHHH-Hhh
Q psy679          432 FSCDVCGRKFARS-----DEKKRHAK-VHL  455 (510)
Q Consensus       432 fkC~~CgKaFsrs-----ssLkrHlR-tHt  455 (510)
                      -.|..|++.+...     ++|.+|++ +|.
T Consensus        19 a~C~~C~~~l~~~~~~gTs~L~rHl~~~h~   48 (50)
T smart00614       19 AKCKYCGKKLSRSSKGGTSNLRRHLRRKHP   48 (50)
T ss_pred             EEecCCCCEeeeCCCCCcHHHHHHHHhHCc
Confidence            4588888887665     58888887 453


No 77 
>KOG2482|consensus
Probab=53.80  E-value=8.3  Score=41.10  Aligned_cols=81  Identities=16%  Similarity=0.315  Sum_probs=46.2

Q ss_pred             CcccCCCCCCCccCChHHHHHHHHH--hcCCCc--------eeccc--CCCccCC-chhhhhhhh----cc---------
Q psy679          373 PYACPVENCDRRFSRSDELTRHIRI--HTGQKP--------FQCRI--CMRSFSR-SDHLTTHIR----TH---------  426 (510)
Q Consensus       373 Py~C~~e~CgKsFssss~LkrH~rt--HtgeKP--------fkC~~--CgKsF~~-ks~L~rH~r----tH---------  426 (510)
                      .+.|-+  |.|.|+.+..|+.|||.  |....|        |.=..  -+|+... ...+.+-..    .+         
T Consensus       195 r~~CLy--CekifrdkntLkeHMrkK~HrrinPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~e  272 (423)
T KOG2482|consen  195 RLRCLY--CEKIFRDKNTLKEHMRKKRHRRINPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNE  272 (423)
T ss_pred             hheeee--eccccCCcHHHHHHHHhccCcccCCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhc
Confidence            467966  99999999999999985  544322        11110  2222111 011100000    00         


Q ss_pred             cCCCc--eecCcCCCccCChHHHHHHHHHhh
Q psy679          427 TGEKP--FSCDVCGRKFARSDEKKRHAKVHL  455 (510)
Q Consensus       427 tGeKP--fkC~~CgKaFsrsssLkrHlRtHt  455 (510)
                      .+..+  .+|-.|.+..-....|..||++.+
T Consensus       273 d~a~a~~v~CLfC~~~~en~~~l~eHmk~vH  303 (423)
T KOG2482|consen  273 DDAEALSVVCLFCTNFYENPVFLFEHMKIVH  303 (423)
T ss_pred             CCCCccceEEEeeccchhhHHHHHHHHHHHH
Confidence            01122  478889888888888999998644


No 78 
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=53.27  E-value=13  Score=34.56  Aligned_cols=38  Identities=16%  Similarity=0.564  Sum_probs=25.9

Q ss_pred             CCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccC
Q psy679          401 QKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFA  442 (510)
Q Consensus       401 eKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFs  442 (510)
                      ..-|.|+.|+..|.....+.. .  +. +..|.|+.||....
T Consensus        97 ~~~Y~Cp~C~~~y~~~ea~~~-~--d~-~~~f~Cp~Cg~~l~  134 (147)
T smart00531       97 NAYYKCPNCQSKYTFLEANQL-L--DM-DGTFTCPRCGEELE  134 (147)
T ss_pred             CcEEECcCCCCEeeHHHHHHh-c--CC-CCcEECCCCCCEEE
Confidence            456999999999886544332 0  11 33499999998754


No 79 
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=52.03  E-value=11  Score=26.83  Aligned_cols=34  Identities=26%  Similarity=0.536  Sum_probs=19.0

Q ss_pred             eecccCCCccCCchhhhhhhhcccCCCceecCcCCCccC
Q psy679          404 FQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFA  442 (510)
Q Consensus       404 fkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFs  442 (510)
                      +.|+.|+..|.-......     .......|..|+..|.
T Consensus         3 ~~CP~C~~~~~v~~~~~~-----~~~~~v~C~~C~~~~~   36 (38)
T TIGR02098         3 IQCPNCKTSFRVVDSQLG-----ANGGKVRCGKCGHVWY   36 (38)
T ss_pred             EECCCCCCEEEeCHHHcC-----CCCCEEECCCCCCEEE
Confidence            467777777655443221     1112467777777664


No 80 
>PF05443 ROS_MUCR:  ROS/MUCR transcriptional regulator protein;  InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=51.98  E-value=8.2  Score=35.85  Aligned_cols=27  Identities=33%  Similarity=0.576  Sum_probs=16.4

Q ss_pred             CCceecCcCCCccCChHHHHHHHHHhhccc
Q psy679          429 EKPFSCDVCGRKFARSDEKKRHAKVHLKQR  458 (510)
Q Consensus       429 eKPfkC~~CgKaFsrsssLkrHlRtHtgeK  458 (510)
                      +.-..|-+|||.|..   |++|++.|+|-.
T Consensus        70 ~d~i~clecGk~~k~---LkrHL~~~~glt   96 (132)
T PF05443_consen   70 PDYIICLECGKKFKT---LKRHLRTHHGLT   96 (132)
T ss_dssp             SS-EE-TBT--EESB---HHHHHHHTT-S-
T ss_pred             cCeeEEccCCcccch---HHHHHHHccCCC
Confidence            344678899998875   589999997743


No 81 
>PF13719 zinc_ribbon_5:  zinc-ribbon domain
Probab=47.88  E-value=15  Score=26.57  Aligned_cols=34  Identities=26%  Similarity=0.643  Sum_probs=19.7

Q ss_pred             eecccCCCccCCchhhhhhhhcccCCCceecCcCCCccC
Q psy679          404 FQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFA  442 (510)
Q Consensus       404 fkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFs  442 (510)
                      ..|+.|+..|.-..+-.     ..+.+..+|..|+..|.
T Consensus         3 i~CP~C~~~f~v~~~~l-----~~~~~~vrC~~C~~~f~   36 (37)
T PF13719_consen    3 ITCPNCQTRFRVPDDKL-----PAGGRKVRCPKCGHVFR   36 (37)
T ss_pred             EECCCCCceEEcCHHHc-----ccCCcEEECCCCCcEee
Confidence            35777777776554321     12334567777777664


No 82 
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=46.89  E-value=15  Score=34.66  Aligned_cols=34  Identities=21%  Similarity=0.468  Sum_probs=26.3

Q ss_pred             CCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccC
Q psy679          400 GQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFA  442 (510)
Q Consensus       400 geKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFs  442 (510)
                      ...-|.|+.|+..|.....+.         .-|.|+.||....
T Consensus       106 ~~~~Y~Cp~c~~r~tf~eA~~---------~~F~Cp~Cg~~L~  139 (158)
T TIGR00373       106 NNMFFICPNMCVRFTFNEAME---------LNFTCPRCGAMLD  139 (158)
T ss_pred             CCCeEECCCCCcEeeHHHHHH---------cCCcCCCCCCEee
Confidence            456699999999888777664         2599999997654


No 83 
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=46.01  E-value=9.7  Score=31.85  Aligned_cols=14  Identities=29%  Similarity=0.740  Sum_probs=6.8

Q ss_pred             CCceecC--cCCCccC
Q psy679          429 EKPFSCD--VCGRKFA  442 (510)
Q Consensus       429 eKPfkC~--~CgKaFs  442 (510)
                      ++.+.|.  .||..|.
T Consensus        25 ~~Y~qC~N~eCg~tF~   40 (72)
T PRK09678         25 ERYHQCQNVNCSATFI   40 (72)
T ss_pred             eeeeecCCCCCCCEEE
Confidence            3444554  4555554


No 84 
>PRK13185 chlL protochlorophyllide reductase iron-sulfur ATP-binding protein; Provisional
Probab=45.76  E-value=3.4  Score=41.20  Aligned_cols=18  Identities=44%  Similarity=0.590  Sum_probs=15.8

Q ss_pred             cccCCCCCceeeeeecCC
Q psy679           29 ISNSGGDGKTTSFVNLSH   46 (510)
Q Consensus        29 ~~~~~~~~~~~~~~~~~~   46 (510)
                      |.+-||-||||+-+||.+
T Consensus         7 v~~KGGVGKTT~~~nLA~   24 (270)
T PRK13185          7 VYGKGGIGKSTTSSNLSA   24 (270)
T ss_pred             EECCCCCCHHHHHHHHHH
Confidence            347899999999999986


No 85 
>PF04959 ARS2:  Arsenite-resistance protein 2;  InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=44.73  E-value=20  Score=35.94  Aligned_cols=33  Identities=18%  Similarity=0.498  Sum_probs=24.7

Q ss_pred             CCCceecCcCCCccCChHHHHHHHHHhhcccch
Q psy679          428 GEKPFSCDVCGRKFARSDEKKRHAKVHLKQRSK  460 (510)
Q Consensus       428 GeKPfkC~~CgKaFsrsssLkrHlRtHtgeK~~  460 (510)
                      .+..|.|..|+|.|.-..-.++|+...|.++..
T Consensus        74 ~~~K~~C~lc~KlFkg~eFV~KHI~nKH~e~ve  106 (214)
T PF04959_consen   74 DEDKWRCPLCGKLFKGPEFVRKHIFNKHPEKVE  106 (214)
T ss_dssp             SSEEEEE-SSS-EESSHHHHHHHHHHH-HHHHH
T ss_pred             cCCEECCCCCCcccCChHHHHHHHhhcCHHHHH
Confidence            455699999999999999999999877766543


No 86 
>PF09723 Zn-ribbon_8:  Zinc ribbon domain;  InterPro: IPR013429  This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=43.29  E-value=13  Score=27.47  Aligned_cols=29  Identities=24%  Similarity=0.754  Sum_probs=17.3

Q ss_pred             eecccCCCccCCchhhhhhhhcccCCCceecCcCCC
Q psy679          404 FQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGR  439 (510)
Q Consensus       404 fkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgK  439 (510)
                      |+|..|+..|.....      ... ...-.|+.||.
T Consensus         6 y~C~~Cg~~fe~~~~------~~~-~~~~~CP~Cg~   34 (42)
T PF09723_consen    6 YRCEECGHEFEVLQS------ISE-DDPVPCPECGS   34 (42)
T ss_pred             EEeCCCCCEEEEEEE------cCC-CCCCcCCCCCC
Confidence            777778777753222      222 35567877875


No 87 
>PF13717 zinc_ribbon_4:  zinc-ribbon domain
Probab=43.12  E-value=22  Score=25.64  Aligned_cols=33  Identities=21%  Similarity=0.598  Sum_probs=20.0

Q ss_pred             eecccCCCccCCchhhhhhhhcccCCCceecCcCCCcc
Q psy679          404 FQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKF  441 (510)
Q Consensus       404 fkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaF  441 (510)
                      ..|+.|...|.-.+....     ...+..+|..|+..|
T Consensus         3 i~Cp~C~~~y~i~d~~ip-----~~g~~v~C~~C~~~f   35 (36)
T PF13717_consen    3 ITCPNCQAKYEIDDEKIP-----PKGRKVRCSKCGHVF   35 (36)
T ss_pred             EECCCCCCEEeCCHHHCC-----CCCcEEECCCCCCEe
Confidence            457777777766554321     223456788887766


No 88 
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=43.11  E-value=12  Score=26.70  Aligned_cols=30  Identities=20%  Similarity=0.639  Sum_probs=16.2

Q ss_pred             eecccCCCccCCchhhhhhhhcccCCCceecCcCCCc
Q psy679          404 FQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRK  440 (510)
Q Consensus       404 fkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKa  440 (510)
                      |+|..|++.|......      .. ...-.|+.||..
T Consensus         6 y~C~~Cg~~fe~~~~~------~~-~~~~~CP~Cg~~   35 (41)
T smart00834        6 YRCEDCGHTFEVLQKI------SD-DPLATCPECGGD   35 (41)
T ss_pred             EEcCCCCCEEEEEEec------CC-CCCCCCCCCCCc
Confidence            6777777766432221      11 234567777753


No 89 
>KOG2186|consensus
Probab=42.75  E-value=11  Score=38.54  Aligned_cols=47  Identities=21%  Similarity=0.590  Sum_probs=36.5

Q ss_pred             cccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhc
Q psy679          374 YACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRT  425 (510)
Q Consensus       374 y~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rt  425 (510)
                      |.|.+  ||....... +.+|+..-++ .-|.|..|++.|.+ .....|..-
T Consensus         4 FtCnv--CgEsvKKp~-vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~kC   50 (276)
T KOG2186|consen    4 FTCNV--CGESVKKPQ-VEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTKC   50 (276)
T ss_pred             Eehhh--hhhhccccc-hHHHHHhccC-CeeEEeeccccccc-chhhhhhhh
Confidence            77954  998877654 6779987666 56999999999988 667778653


No 90 
>KOG2785|consensus
Probab=42.03  E-value=18  Score=39.06  Aligned_cols=73  Identities=21%  Similarity=0.387  Sum_probs=47.8

Q ss_pred             cccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCC---CccCChHHHHHH
Q psy679          374 YACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCG---RKFARSDEKKRH  450 (510)
Q Consensus       374 y~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~Cg---KaFsrsssLkrH  450 (510)
                      -.|-+  |++.|+.-.....||..|+|-  |.=..  .-......|...+....+ .-|.|-.|+   +.|......+.|
T Consensus       167 t~CLf--C~~~~k~~e~~~~HM~~~Hgf--fIPdr--eYL~D~~GLl~YLgeKV~-~~~~CL~CN~~~~~f~sleavr~H  239 (390)
T KOG2785|consen  167 TDCLF--CDKKSKSLEENLKHMFKEHGF--FIPDR--EYLTDEKGLLKYLGEKVG-IGFICLFCNELGRPFSSLEAVRAH  239 (390)
T ss_pred             cceee--cCCCcccHHHHHHHHhhccCC--cCCch--HhhhchhHHHHHHHHHhc-cCceEEEeccccCcccccHHHHHH
Confidence            34855  999999999999999987762  11100  001223334444433332 348899998   999999999999


Q ss_pred             HHH
Q psy679          451 AKV  453 (510)
Q Consensus       451 lRt  453 (510)
                      |..
T Consensus       240 M~~  242 (390)
T KOG2785|consen  240 MRD  242 (390)
T ss_pred             Hhh
Confidence            963


No 91 
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and  believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=41.49  E-value=17  Score=25.84  Aligned_cols=25  Identities=28%  Similarity=0.748  Sum_probs=13.9

Q ss_pred             ceecccCCCccCCchhhhhhhhcccCCCceecCcCCC
Q psy679          403 PFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGR  439 (510)
Q Consensus       403 PfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgK  439 (510)
                      .|+|..||..+...            +.|.+|+.||.
T Consensus         2 ~~~C~~CG~i~~g~------------~~p~~CP~Cg~   26 (34)
T cd00729           2 VWVCPVCGYIHEGE------------EAPEKCPICGA   26 (34)
T ss_pred             eEECCCCCCEeECC------------cCCCcCcCCCC
Confidence            36677776554221            14557777764


No 92 
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=41.22  E-value=17  Score=34.98  Aligned_cols=25  Identities=40%  Similarity=0.995  Sum_probs=20.0

Q ss_pred             CceecccCCCccCCchhhhhhhhcccCCCceecCcCCC
Q psy679          402 KPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGR  439 (510)
Q Consensus       402 KPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgK  439 (510)
                      +-|.|.+||..             |.|+-|-+|++||.
T Consensus       133 ~~~vC~vCGy~-------------~~ge~P~~CPiCga  157 (166)
T COG1592         133 KVWVCPVCGYT-------------HEGEAPEVCPICGA  157 (166)
T ss_pred             CEEEcCCCCCc-------------ccCCCCCcCCCCCC
Confidence            35999999865             44688999999993


No 93 
>KOG0978|consensus
Probab=40.92  E-value=6.6  Score=45.37  Aligned_cols=21  Identities=24%  Similarity=0.586  Sum_probs=15.9

Q ss_pred             CceecCcCCCccCChHHHHHH
Q psy679          430 KPFSCDVCGRKFARSDEKKRH  450 (510)
Q Consensus       430 KPfkC~~CgKaFsrsssLkrH  450 (510)
                      +.-+|+.|+.+|...+-+..|
T Consensus       677 RqRKCP~Cn~aFganDv~~I~  697 (698)
T KOG0978|consen  677 RQRKCPKCNAAFGANDVHRIH  697 (698)
T ss_pred             hcCCCCCCCCCCCcccccccC
Confidence            445799999999888776555


No 94 
>cd02032 Bchl_like This family of proteins contains bchL and chlL. Protochlorophyllide reductase catalyzes the reductive formation of chlorophyllide from protochlorophyllide during biosynthesis of chlorophylls and bacteriochlorophylls. Three genes, bchL, bchN and bchB, are involved in light-independent protochlorophyllide reduction in bacteriochlorophyll biosynthesis. In cyanobacteria, algae, and gymnosperms, three similar genes, chlL, chlN and chlB are involved in protochlorophyllide reduction during chlorophylls biosynthesis. BchL/chlL, bchN/chlN and bchB/chlB exhibit significant sequence similarity to the nifH, nifD and nifK subunits of nitrogenase, respectively. Nitrogenase catalyzes the reductive formation of ammonia from dinitrogen.
Probab=40.76  E-value=4.4  Score=40.34  Aligned_cols=18  Identities=44%  Similarity=0.567  Sum_probs=16.0

Q ss_pred             cccCCCCCceeeeeecCC
Q psy679           29 ISNSGGDGKTTSFVNLSH   46 (510)
Q Consensus        29 ~~~~~~~~~~~~~~~~~~   46 (510)
                      +++-||-||||+-+||.+
T Consensus         5 v~gKGGvGKTT~a~nLA~   22 (267)
T cd02032           5 VYGKGGIGKSTTSSNLSV   22 (267)
T ss_pred             EecCCCCCHHHHHHHHHH
Confidence            457899999999999986


No 95 
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=40.35  E-value=14  Score=28.01  Aligned_cols=11  Identities=27%  Similarity=0.990  Sum_probs=6.2

Q ss_pred             eecccCCCccC
Q psy679          404 FQCRICMRSFS  414 (510)
Q Consensus       404 fkC~~CgKsF~  414 (510)
                      |+|..|+..|.
T Consensus         6 y~C~~Cg~~fe   16 (52)
T TIGR02605         6 YRCTACGHRFE   16 (52)
T ss_pred             EEeCCCCCEeE
Confidence            55666665554


No 96 
>KOG2593|consensus
Probab=40.25  E-value=22  Score=39.03  Aligned_cols=38  Identities=18%  Similarity=0.568  Sum_probs=26.5

Q ss_pred             CCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCc
Q psy679          400 GQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRK  440 (510)
Q Consensus       400 geKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKa  440 (510)
                      ...-|.|+.|.+.|.....+.-   .-...-.|.|..|+-.
T Consensus       125 ~~~~Y~Cp~C~kkyt~Lea~~L---~~~~~~~F~C~~C~ge  162 (436)
T KOG2593|consen  125 NVAGYVCPNCQKKYTSLEALQL---LDNETGEFHCENCGGE  162 (436)
T ss_pred             ccccccCCccccchhhhHHHHh---hcccCceEEEecCCCc
Confidence            4567999999999987666542   2222346999999744


No 97 
>PRK13231 nitrogenase reductase-like protein; Reviewed
Probab=40.19  E-value=6  Score=39.31  Aligned_cols=18  Identities=39%  Similarity=0.617  Sum_probs=15.8

Q ss_pred             cccCCCCCceeeeeecCC
Q psy679           29 ISNSGGDGKTTSFVNLSH   46 (510)
Q Consensus        29 ~~~~~~~~~~~~~~~~~~   46 (510)
                      |.+-||-||||+-+||.+
T Consensus         7 v~~KGGvGKTT~a~nLA~   24 (264)
T PRK13231          7 IYGKGGIGKSTTVSNMAA   24 (264)
T ss_pred             EECCCCCcHHHHHHHHhc
Confidence            347899999999999987


No 98 
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=40.08  E-value=15  Score=34.38  Aligned_cols=25  Identities=32%  Similarity=0.469  Sum_probs=19.1

Q ss_pred             eecCcCCCccCChHHHHHHHHHhhcccc
Q psy679          432 FSCDVCGRKFARSDEKKRHAKVHLKQRS  459 (510)
Q Consensus       432 fkC~~CgKaFsrsssLkrHlRtHtgeK~  459 (510)
                      ..|-+|||.|.   .|++|+.+|.+--+
T Consensus        77 IicLEDGkkfK---SLKRHL~t~~gmTP  101 (148)
T COG4957          77 IICLEDGKKFK---SLKRHLTTHYGLTP  101 (148)
T ss_pred             EEEeccCcchH---HHHHHHhcccCCCH
Confidence            56888888886   58899988887433


No 99 
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=39.98  E-value=18  Score=27.06  Aligned_cols=12  Identities=25%  Similarity=0.772  Sum_probs=7.2

Q ss_pred             ceecccCCCccC
Q psy679          403 PFQCRICMRSFS  414 (510)
Q Consensus       403 PfkC~~CgKsF~  414 (510)
                      .|+|..|+..|.
T Consensus         3 ~y~C~~CG~~~~   14 (46)
T PRK00398          3 EYKCARCGREVE   14 (46)
T ss_pred             EEECCCCCCEEE
Confidence            466666666553


No 100
>PF14353 CpXC:  CpXC protein
Probab=38.33  E-value=8.1  Score=34.72  Aligned_cols=19  Identities=32%  Similarity=0.565  Sum_probs=9.0

Q ss_pred             eecccCCCccCCchhhhhh
Q psy679          404 FQCRICMRSFSRSDHLTTH  422 (510)
Q Consensus       404 fkC~~CgKsF~~ks~L~rH  422 (510)
                      |.|+.||+.|.-...+.-|
T Consensus        39 ~~CP~Cg~~~~~~~p~lY~   57 (128)
T PF14353_consen   39 FTCPSCGHKFRLEYPLLYH   57 (128)
T ss_pred             EECCCCCCceecCCCEEEE
Confidence            4455555555444444333


No 101
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=37.66  E-value=23  Score=34.26  Aligned_cols=34  Identities=24%  Similarity=0.565  Sum_probs=25.8

Q ss_pred             CCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCC
Q psy679          401 QKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFAR  443 (510)
Q Consensus       401 eKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsr  443 (510)
                      ..-|.|+.|++.|.....+.         .-|.|+.||.....
T Consensus       115 ~~~Y~Cp~C~~rytf~eA~~---------~~F~Cp~Cg~~L~~  148 (178)
T PRK06266        115 NMFFFCPNCHIRFTFDEAME---------YGFRCPQCGEMLEE  148 (178)
T ss_pred             CCEEECCCCCcEEeHHHHhh---------cCCcCCCCCCCCee
Confidence            35699999999887766653         25999999977554


No 102
>PRK13235 nifH nitrogenase reductase; Reviewed
Probab=37.63  E-value=5.2  Score=40.11  Aligned_cols=38  Identities=34%  Similarity=0.401  Sum_probs=26.1

Q ss_pred             cccCCCCCceeeeeecCCCCCCCccceeeeeecCceeeecCCCCceeeeeccchhhc
Q psy679           29 ISNSGGDGKTTSFVNLSHPPTRSIPEYLILVEMPKLRVLKRPKGCSCIVDISPYLES   85 (510)
Q Consensus        29 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   85 (510)
                      |.+-||-||||+-|||.+--          -++ ..||        ++||..|=...
T Consensus         6 v~~KGGVGKTT~~~nLA~~L----------a~~-G~rV--------LlID~Dpq~~~   43 (274)
T PRK13235          6 IYGKGGIGKSTTTQNTVAGL----------AEM-GKKV--------MVVGCDPKADS   43 (274)
T ss_pred             EeCCCCccHHHHHHHHHHHH----------HHC-CCcE--------EEEecCCcccc
Confidence            34789999999999998632          222 2244        47888886543


No 103
>KOG2461|consensus
Probab=37.30  E-value=1.8e+02  Score=31.75  Aligned_cols=55  Identities=7%  Similarity=-0.116  Sum_probs=23.5

Q ss_pred             ccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCC
Q psy679          384 RFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCG  438 (510)
Q Consensus       384 sFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~Cg  438 (510)
                      .+.....+..|...|.++..+.|..+...+.....+..+..+|+..+.+.|..|+
T Consensus       340 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  394 (396)
T KOG2461|consen  340 TPAGQLIYTQSHSMEVAEPTDMAPNQIWKIYHTGVLGFLIITTDESECNNMSFVC  394 (396)
T ss_pred             ccccccchhhhhhcccCCCCcccccccccceeccccceeeeeccccccccccccC
Confidence            3333333444444444444444444333333333334444444444445444443


No 104
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=36.92  E-value=9.6  Score=30.66  Aligned_cols=27  Identities=30%  Similarity=0.599  Sum_probs=14.6

Q ss_pred             CCCCCcccCCCCCCCccCChHHHHHHHHH
Q psy679          369 VHERPYACPVENCDRRFSRSDELTRHIRI  397 (510)
Q Consensus       369 ~~EKPy~C~~e~CgKsFssss~LkrH~rt  397 (510)
                      .+|.-+.|+  -|++.|....+..+|...
T Consensus        13 DGE~~lrCP--RC~~~FR~~K~Y~RHVNK   39 (65)
T COG4049          13 DGEEFLRCP--RCGMVFRRRKDYIRHVNK   39 (65)
T ss_pred             CCceeeeCC--chhHHHHHhHHHHHHhhH
Confidence            344555553  366666655555555543


No 105
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=36.20  E-value=37  Score=31.86  Aligned_cols=26  Identities=27%  Similarity=0.512  Sum_probs=19.7

Q ss_pred             CCcccCCCCCCCccCChHHHHHHHHHhcCCC
Q psy679          372 RPYACPVENCDRRFSRSDELTRHIRIHTGQK  402 (510)
Q Consensus       372 KPy~C~~e~CgKsFssss~LkrH~rtHtgeK  402 (510)
                      .--.|  -+|||.|++   |+||.++|.+--
T Consensus        75 D~Iic--LEDGkkfKS---LKRHL~t~~gmT  100 (148)
T COG4957          75 DYIIC--LEDGKKFKS---LKRHLTTHYGLT  100 (148)
T ss_pred             CeEEE--eccCcchHH---HHHHHhcccCCC
Confidence            33467  579999976   999999987743


No 106
>cd02117 NifH_like This family contains the NifH (iron protein) of nitrogenase, L subunit (BchL/ChlL) of the  protochlorophyllide reductase and the BchX subunit of the Chlorophyllide reductase. Members of this family use energey from ATP hydrolysis and transfer electrons through a Fe4-S4 cluster to other subunit for reduction of substrate.
Probab=36.03  E-value=5.6  Score=38.32  Aligned_cols=18  Identities=50%  Similarity=0.597  Sum_probs=15.6

Q ss_pred             cccCCCCCceeeeeecCC
Q psy679           29 ISNSGGDGKTTSFVNLSH   46 (510)
Q Consensus        29 ~~~~~~~~~~~~~~~~~~   46 (510)
                      |.+-||.||||+-+||.+
T Consensus         5 v~gKGGvGKTt~~~nLA~   22 (212)
T cd02117           5 IYGKGGIGKSTTSQNLSA   22 (212)
T ss_pred             EECCCcCcHHHHHHHHHH
Confidence            346899999999999976


No 107
>PF01780 Ribosomal_L37ae:  Ribosomal L37ae protein family;  InterPro: IPR002674 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits.  Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This ribosomal protein is found in archaebacteria and eukaryotes []. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A1E_Y 4A17_Y 4A1C_Y 4A1A_Y 3O58_g 3IZS_m 3O5H_g 1S1I_9 3IZR_m 1YSH_D ....
Probab=36.02  E-value=15  Score=32.14  Aligned_cols=31  Identities=35%  Similarity=0.888  Sum_probs=21.4

Q ss_pred             CceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccC
Q psy679          402 KPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFA  442 (510)
Q Consensus       402 KPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFs  442 (510)
                      ..|.|++|++.-     +   .|.-.|  -+.|..|++.|+
T Consensus        34 ~ky~Cp~Cgk~~-----v---kR~a~G--IW~C~~C~~~~A   64 (90)
T PF01780_consen   34 AKYTCPFCGKTS-----V---KRVATG--IWKCKKCGKKFA   64 (90)
T ss_dssp             S-BEESSSSSSE-----E---EEEETT--EEEETTTTEEEE
T ss_pred             CCCcCCCCCCce-----e---EEeeeE--EeecCCCCCEEe
Confidence            469999999753     1   233334  499999999885


No 108
>KOG2071|consensus
Probab=35.09  E-value=32  Score=39.14  Aligned_cols=27  Identities=37%  Similarity=0.844  Sum_probs=21.2

Q ss_pred             CCCcccCCCCCCCccCChHHHHHHHHHhc
Q psy679          371 ERPYACPVENCDRRFSRSDELTRHIRIHT  399 (510)
Q Consensus       371 EKPy~C~~e~CgKsFssss~LkrH~rtHt  399 (510)
                      .++..|..  ||++|.+.....+||..|.
T Consensus       416 ~~pnqC~~--CG~R~~~~ee~sk~md~H~  442 (579)
T KOG2071|consen  416 DSPNQCKS--CGLRFDDSEERSKHMDIHD  442 (579)
T ss_pred             CCcchhcc--cccccccchhhhhHhhhhh
Confidence            56788955  9999999988877776653


No 109
>cd02040 NifH NifH gene encodes component II (iron protein) of nitrogenase. Nitrogenase is responsible for the biological nitrogen fixation, i.e. reduction of molecular nitrogen to ammonia. NifH consists of two oxygen-sensitive metallosulfur proteins: the mollybdenum-iron (alternatively, vanadium-iron or iron-iron) protein (commonly referred to as component 1), and the iron protein (commonly referred to as component 2). The iron protein is a homodimer, with an Fe4S4 cluster bound between the subunits and two ATP-binding domains. It supplies energy by ATP hydrolysis, and transfers electrons from reduced ferredoxin or flavodoxin to component 1 for the reduction of molecular nitrogen to ammonia.
Probab=34.87  E-value=6  Score=39.05  Aligned_cols=17  Identities=47%  Similarity=0.565  Sum_probs=15.3

Q ss_pred             ccCCCCCceeeeeecCC
Q psy679           30 SNSGGDGKTTSFVNLSH   46 (510)
Q Consensus        30 ~~~~~~~~~~~~~~~~~   46 (510)
                      .+-||-||||+-+||.+
T Consensus         7 ~~KGGvGKTT~~~nLA~   23 (270)
T cd02040           7 YGKGGIGKSTTTQNLSA   23 (270)
T ss_pred             EeCCcCCHHHHHHHHHH
Confidence            46899999999999986


No 110
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=34.26  E-value=32  Score=26.04  Aligned_cols=11  Identities=27%  Similarity=0.637  Sum_probs=6.3

Q ss_pred             eecccCCCccC
Q psy679          404 FQCRICMRSFS  414 (510)
Q Consensus       404 fkC~~CgKsF~  414 (510)
                      |.|..|+..|.
T Consensus         3 Y~C~~Cg~~~~   13 (44)
T smart00659        3 YICGECGRENE   13 (44)
T ss_pred             EECCCCCCEee
Confidence            55666665554


No 111
>PF09986 DUF2225:  Uncharacterized protein conserved in bacteria (DUF2225);  InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=33.31  E-value=8.8  Score=38.00  Aligned_cols=44  Identities=25%  Similarity=0.648  Sum_probs=29.1

Q ss_pred             CCCcccCCCCCCCccCChHHHHHHHHH---h-------cCCCc-----eecccCCCccCCc
Q psy679          371 ERPYACPVENCDRRFSRSDELTRHIRI---H-------TGQKP-----FQCRICMRSFSRS  416 (510)
Q Consensus       371 EKPy~C~~e~CgKsFssss~LkrH~rt---H-------tgeKP-----fkC~~CgKsF~~k  416 (510)
                      +|.+.|++  |++.|....-+....|.   .       .+..|     ..|+.|+.+|...
T Consensus         3 ~k~~~CPv--C~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~~   61 (214)
T PF09986_consen    3 DKKITCPV--CGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFEE   61 (214)
T ss_pred             CCceECCC--CCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCccccc
Confidence            46678977  99999887655444432   1       12223     6799999987654


No 112
>COG1192 Soj ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]
Probab=32.79  E-value=6.9  Score=38.65  Aligned_cols=36  Identities=36%  Similarity=0.558  Sum_probs=25.4

Q ss_pred             cccCCCCCceeeeeecCCCCCCCccceeeeeecCceeeecCCCCceeeeeccch
Q psy679           29 ISNSGGDGKTTSFVNLSHPPTRSIPEYLILVEMPKLRVLKRPKGCSCIVDISPY   82 (510)
Q Consensus        29 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   82 (510)
                      .+..||-||||+-+||.+-          |-..-+.||        .+||+.|-
T Consensus         8 ~n~KGGvGKTT~a~nLa~~----------La~~~~~kV--------LliDlDpQ   43 (259)
T COG1192           8 ANQKGGVGKTTTAVNLAAA----------LAKRGGKKV--------LLIDLDPQ   43 (259)
T ss_pred             EecCCCccHHHHHHHHHHH----------HHHhcCCcE--------EEEeCCCc
Confidence            4678999999999999862          222222455        47999985


No 113
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=32.56  E-value=25  Score=31.91  Aligned_cols=28  Identities=18%  Similarity=0.300  Sum_probs=21.4

Q ss_pred             ecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCC
Q psy679          405 QCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFAR  443 (510)
Q Consensus       405 kC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsr  443 (510)
                      .|+.|++.|..   |        +..|..|++||+.|.+
T Consensus        11 idPetg~KFYD---L--------NrdPiVsPytG~s~P~   38 (129)
T COG4530          11 IDPETGKKFYD---L--------NRDPIVSPYTGKSYPR   38 (129)
T ss_pred             cCccccchhhc---c--------CCCccccCcccccchH
Confidence            68889988864   2        2478899999999943


No 114
>KOG2482|consensus
Probab=32.38  E-value=45  Score=35.76  Aligned_cols=23  Identities=22%  Similarity=0.504  Sum_probs=21.1

Q ss_pred             ceecCcCCCccCChHHHHHHHHH
Q psy679          431 PFSCDVCGRKFARSDEKKRHAKV  453 (510)
Q Consensus       431 PfkC~~CgKaFsrsssLkrHlRt  453 (510)
                      .+.|-.|.|.|+.+..|+.|||.
T Consensus       195 r~~CLyCekifrdkntLkeHMrk  217 (423)
T KOG2482|consen  195 RLRCLYCEKIFRDKNTLKEHMRK  217 (423)
T ss_pred             hheeeeeccccCCcHHHHHHHHh
Confidence            47899999999999999999975


No 115
>PRK13230 nitrogenase reductase-like protein; Reviewed
Probab=32.25  E-value=7.3  Score=39.17  Aligned_cols=18  Identities=33%  Similarity=0.580  Sum_probs=15.8

Q ss_pred             cccCCCCCceeeeeecCC
Q psy679           29 ISNSGGDGKTTSFVNLSH   46 (510)
Q Consensus        29 ~~~~~~~~~~~~~~~~~~   46 (510)
                      |.+-||-||||+-+||.+
T Consensus         6 ~~gKGGVGKTT~a~nLA~   23 (279)
T PRK13230          6 FYGKGGIGKSTTVCNIAA   23 (279)
T ss_pred             EECCCCCcHHHHHHHHHH
Confidence            446899999999999986


No 116
>PRK13233 nifH nitrogenase reductase; Reviewed
Probab=32.12  E-value=7.9  Score=38.71  Aligned_cols=17  Identities=35%  Similarity=0.440  Sum_probs=15.1

Q ss_pred             ccCCCCCceeeeeecCC
Q psy679           30 SNSGGDGKTTSFVNLSH   46 (510)
Q Consensus        30 ~~~~~~~~~~~~~~~~~   46 (510)
                      .+-||-||||+-|||.+
T Consensus         8 ~~KGGVGKTT~a~nLA~   24 (275)
T PRK13233          8 YGKGGIGKSTTTQNTAA   24 (275)
T ss_pred             EcCCCCcHHHHHHHHHH
Confidence            37899999999999975


No 117
>PF12760 Zn_Tnp_IS1595:  Transposase zinc-ribbon domain;  InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=31.76  E-value=73  Score=23.83  Aligned_cols=38  Identities=16%  Similarity=0.406  Sum_probs=21.3

Q ss_pred             HHHHHhcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCC
Q psy679          393 RHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGR  439 (510)
Q Consensus       393 rH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgK  439 (510)
                      .++....-..-+.|+.|+..  ....+.       +.+.|+|..|++
T Consensus         8 ~~l~~~RW~~g~~CP~Cg~~--~~~~~~-------~~~~~~C~~C~~   45 (46)
T PF12760_consen    8 EYLEEIRWPDGFVCPHCGST--KHYRLK-------TRGRYRCKACRK   45 (46)
T ss_pred             HHHHHhcCCCCCCCCCCCCe--eeEEeC-------CCCeEECCCCCC
Confidence            34433333334789999864  111111       156799999975


No 118
>TIGR01287 nifH nitrogenase iron protein. This model describes nitrogenase (EC 1.18.6.1) iron protein, also called nitrogenase reductase or nitrogenase component II. This model includes molybdenum-iron nitrogenase reductase (nifH), vanadium-iron nitrogenase reductase (vnfH), and iron-iron nitrogenase reductase (anfH). The model excludes the homologous protein from the light-independent protochlorophyllide reductase.
Probab=31.75  E-value=9.7  Score=38.10  Aligned_cols=18  Identities=39%  Similarity=0.566  Sum_probs=15.8

Q ss_pred             cccCCCCCceeeeeecCC
Q psy679           29 ISNSGGDGKTTSFVNLSH   46 (510)
Q Consensus        29 ~~~~~~~~~~~~~~~~~~   46 (510)
                      |.+-||-||||+-+||.+
T Consensus         5 ~~gKGGVGKTT~a~nLA~   22 (275)
T TIGR01287         5 IYGKGGIGKSTTTQNIAA   22 (275)
T ss_pred             EeCCCcCcHHHHHHHHHH
Confidence            467899999999999974


No 119
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=31.09  E-value=32  Score=40.28  Aligned_cols=13  Identities=38%  Similarity=0.815  Sum_probs=10.3

Q ss_pred             CCCceecCcCCCc
Q psy679          428 GEKPFSCDVCGRK  440 (510)
Q Consensus       428 GeKPfkC~~CgKa  440 (510)
                      ...|+.|+.||..
T Consensus       472 ~~~p~~Cp~Cgs~  484 (730)
T COG1198         472 EPIPQSCPECGSE  484 (730)
T ss_pred             CCCCCCCCCCCCC
Confidence            4578999999865


No 120
>KOG2785|consensus
Probab=30.49  E-value=38  Score=36.63  Aligned_cols=24  Identities=33%  Similarity=0.699  Sum_probs=17.8

Q ss_pred             CceecccCC---CccCCchhhhhhhhc
Q psy679          402 KPFQCRICM---RSFSRSDHLTTHIRT  425 (510)
Q Consensus       402 KPfkC~~Cg---KsF~~ks~L~rH~rt  425 (510)
                      +-|.|-.|+   +.|.+....+.||.-
T Consensus       216 ~~~~CL~CN~~~~~f~sleavr~HM~~  242 (390)
T KOG2785|consen  216 IGFICLFCNELGRPFSSLEAVRAHMRD  242 (390)
T ss_pred             cCceEEEeccccCcccccHHHHHHHhh
Confidence            347788887   788888888888753


No 121
>CHL00072 chlL photochlorophyllide reductase subunit L
Probab=29.90  E-value=8.9  Score=39.39  Aligned_cols=18  Identities=39%  Similarity=0.560  Sum_probs=16.2

Q ss_pred             cccCCCCCceeeeeecCC
Q psy679           29 ISNSGGDGKTTSFVNLSH   46 (510)
Q Consensus        29 ~~~~~~~~~~~~~~~~~~   46 (510)
                      |.+-||-||||+-+||.+
T Consensus         5 ~~gKGGVGKTTta~nLA~   22 (290)
T CHL00072          5 VYGKGGIGKSTTSCNISI   22 (290)
T ss_pred             EECCCCCcHHHHHHHHHH
Confidence            578899999999999986


No 122
>PF04959 ARS2:  Arsenite-resistance protein 2;  InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=29.52  E-value=11  Score=37.60  Aligned_cols=28  Identities=21%  Similarity=0.565  Sum_probs=17.7

Q ss_pred             CceecccCCCccCCchhhhhhhhcccCC
Q psy679          402 KPFQCRICMRSFSRSDHLTTHIRTHTGE  429 (510)
Q Consensus       402 KPfkC~~CgKsF~~ks~L~rH~rtHtGe  429 (510)
                      ..|.|..|+|.|....-..+|+...|.+
T Consensus        76 ~K~~C~lc~KlFkg~eFV~KHI~nKH~e  103 (214)
T PF04959_consen   76 DKWRCPLCGKLFKGPEFVRKHIFNKHPE  103 (214)
T ss_dssp             EEEEE-SSS-EESSHHHHHHHHHHH-HH
T ss_pred             CEECCCCCCcccCChHHHHHHHhhcCHH
Confidence            4577878888887777777777665543


No 123
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=29.29  E-value=34  Score=26.75  Aligned_cols=11  Identities=27%  Similarity=0.836  Sum_probs=6.1

Q ss_pred             ceecccCCCcc
Q psy679          403 PFQCRICMRSF  413 (510)
Q Consensus       403 PfkC~~CgKsF  413 (510)
                      -|+|..|++.|
T Consensus         6 ~Y~C~~Cg~~~   16 (49)
T COG1996           6 EYKCARCGREV   16 (49)
T ss_pred             EEEhhhcCCee
Confidence            35555555555


No 124
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=29.03  E-value=52  Score=34.93  Aligned_cols=49  Identities=18%  Similarity=0.491  Sum_probs=31.9

Q ss_pred             ecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChHHHHHHHHHhh
Q psy679          405 QCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSDEKKRHAKVHL  455 (510)
Q Consensus       405 kC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrsssLkrHlRtHt  455 (510)
                      .|-.|...|.....-..-..  +..-.|.|+.|...|-..-..-.|...|.
T Consensus       364 ~Cf~CQ~~fp~~~~~~~~~~--~ss~rY~Ce~CK~~FC~dCdvfiHe~Lh~  412 (421)
T COG5151         364 HCFVCQGPFPKPPVSPFDES--TSSGRYQCELCKSTFCSDCDVFIHETLHF  412 (421)
T ss_pred             cceeccCCCCCCCCCccccc--ccccceechhhhhhhhhhhHHHHHHHHhh
Confidence            37777777754332221111  22346999999999988888888877764


No 125
>PF09538 FYDLN_acid:  Protein of unknown function (FYDLN_acid);  InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=28.73  E-value=50  Score=29.62  Aligned_cols=30  Identities=27%  Similarity=0.732  Sum_probs=23.1

Q ss_pred             cccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCc
Q psy679          374 YACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRS  416 (510)
Q Consensus       374 y~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~k  416 (510)
                      ..|  ..||++|-.   |        +..|..|+.||..|...
T Consensus        10 R~C--p~CG~kFYD---L--------nk~PivCP~CG~~~~~~   39 (108)
T PF09538_consen   10 RTC--PSCGAKFYD---L--------NKDPIVCPKCGTEFPPE   39 (108)
T ss_pred             ccC--CCCcchhcc---C--------CCCCccCCCCCCccCcc
Confidence            458  569999954   2        34688999999999766


No 126
>TIGR03371 cellulose_yhjQ cellulose synthase operon protein YhjQ. Members of this family are the YhjQ protein, found immediately upsteam of bacterial cellulose synthase (bcs) genes in a broad range of bacteria, including both copies of the bcs locus in Klebsiella pneumoniae. In several species it is seen clearly as part of the bcs operon. It is identified as a probable component of the bacterial cellulose metabolic process not only by gene location, but also by partial phylogenetic profiling, or Haft-Selengut algorithm (PubMed:16930487), based on a bacterial cellulose biosynthesis genome property profile. Cellulose plays an important role in biofilm formation and structural integrity in some bacteria. Mutants in yhjQ in Escherichia coli, show altered morphology an growth, but the function of YhjQ has not yet been determined.
Probab=28.41  E-value=9.3  Score=37.06  Aligned_cols=18  Identities=44%  Similarity=0.575  Sum_probs=15.6

Q ss_pred             cccCCCCCceeeeeecCC
Q psy679           29 ISNSGGDGKTTSFVNLSH   46 (510)
Q Consensus        29 ~~~~~~~~~~~~~~~~~~   46 (510)
                      .+..||.||||+-+||.+
T Consensus         7 ~s~KGGvGKTt~a~nla~   24 (246)
T TIGR03371         7 VGVKGGVGKTTLTANLAS   24 (246)
T ss_pred             EeCCCCccHHHHHHHHHH
Confidence            356899999999999975


No 127
>PF01656 CbiA:  CobQ/CobB/MinD/ParA nucleotide binding domain;  InterPro: IPR002586 This entry consists of various cobyrinic acid a,c-diamide synthases. These include CbiA and CbiP from Salmonella typhimurium []., and CobQ from Rhodobacter capsulatus []. These amidases catalyse amidations to various side chains of hydrogenobyrinic acid or cobyrinic acid a,c-diamide in the biosynthesis of cobalamin (vitamin B12) from uroporphyrinogen III. Vitamin B12 is an important cofactor and an essential nutrient for many plants and animals and is primarily produced by bacteria [].; PDB: 3K9G_A 3K9H_B 3EZ9_B 3EZF_A 3EZ2_B 3EZ6_A 3EZ7_A 1G3Q_A 1G3R_A 1DTS_A ....
Probab=28.40  E-value=11  Score=34.72  Aligned_cols=18  Identities=50%  Similarity=0.617  Sum_probs=15.1

Q ss_pred             cccCCCCCceeeeeecCC
Q psy679           29 ISNSGGDGKTTSFVNLSH   46 (510)
Q Consensus        29 ~~~~~~~~~~~~~~~~~~   46 (510)
                      .+..||-||||+-+||..
T Consensus         4 ~~~kGG~GKTt~a~~la~   21 (195)
T PF01656_consen    4 TSGKGGVGKTTIAANLAQ   21 (195)
T ss_dssp             EESSTTSSHHHHHHHHHH
T ss_pred             EcCCCCccHHHHHHHHHh
Confidence            467899999999998863


No 128
>PRK10037 cell division protein; Provisional
Probab=28.28  E-value=9.6  Score=37.75  Aligned_cols=18  Identities=28%  Similarity=0.320  Sum_probs=15.7

Q ss_pred             cccCCCCCceeeeeecCC
Q psy679           29 ISNSGGDGKTTSFVNLSH   46 (510)
Q Consensus        29 ~~~~~~~~~~~~~~~~~~   46 (510)
                      .++-||-||||+-+||.+
T Consensus         7 ~n~KGGvGKTT~a~nLA~   24 (250)
T PRK10037          7 QGVRGGVGTTSITAALAW   24 (250)
T ss_pred             ecCCCCccHHHHHHHHHH
Confidence            467899999999999975


No 129
>PF05443 ROS_MUCR:  ROS/MUCR transcriptional regulator protein;  InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=28.23  E-value=39  Score=31.42  Aligned_cols=30  Identities=30%  Similarity=0.709  Sum_probs=17.3

Q ss_pred             CCCCCCcccCCCCCCCccCChHHHHHHHHHhcCCC
Q psy679          368 PVHERPYACPVENCDRRFSRSDELTRHIRIHTGQK  402 (510)
Q Consensus       368 h~~EKPy~C~~e~CgKsFssss~LkrH~rtHtgeK  402 (510)
                      ...+.--.|  -.|||.|+.   |++|.+.|+|--
T Consensus        67 SI~~d~i~c--lecGk~~k~---LkrHL~~~~glt   96 (132)
T PF05443_consen   67 SITPDYIIC--LECGKKFKT---LKRHLRTHHGLT   96 (132)
T ss_dssp             TB-SS-EE---TBT--EESB---HHHHHHHTT-S-
T ss_pred             ccccCeeEE--ccCCcccch---HHHHHHHccCCC
Confidence            334455678  459999988   799999997753


No 130
>KOG0782|consensus
Probab=27.53  E-value=20  Score=40.64  Aligned_cols=54  Identities=19%  Similarity=0.415  Sum_probs=34.5

Q ss_pred             ChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCChH
Q psy679          387 RSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARSD  445 (510)
Q Consensus       387 sss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrss  445 (510)
                      +...+.+|--+|.....=+|..|+|.|..+..+..-.     .....|.-|..+|..+.
T Consensus       237 rE~~fvrHHWVHrrRqeGkC~~CgKgFQQKf~FhsKE-----ivAisCSWCKqayH~Kv  290 (1004)
T KOG0782|consen  237 RESGFVRHHWVHRRRQEGKCNTCGKGFQQKFFFHSKE-----IVAISCSWCKQAYHLKV  290 (1004)
T ss_pred             ccccchHHhHhhHhhhccccchhhhhhhhheeecccc-----EEEEEehHHHHHhhcch
Confidence            3446777777776655668999999997765543211     12356877877775543


No 131
>COG3677 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Probab=27.00  E-value=62  Score=29.78  Aligned_cols=47  Identities=21%  Similarity=0.487  Sum_probs=27.3

Q ss_pred             HHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCCh
Q psy679          391 LTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFARS  444 (510)
Q Consensus       391 LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsrs  444 (510)
                      ...+++.+...  -.|+.|+...     ..++-..-.|...|+|..|++.|...
T Consensus        20 ~~~~~~~~~~~--~~cP~C~s~~-----~~k~g~~~~~~qRyrC~~C~~tf~~~   66 (129)
T COG3677          20 AAYAIRMQITK--VNCPRCKSSN-----VVKIGGIRRGHQRYKCKSCGSTFTVE   66 (129)
T ss_pred             HHHHHhhhccc--CcCCCCCccc-----eeeECCccccccccccCCcCcceeee
Confidence            34566665442  3688886543     11222222235679999999999753


No 132
>KOG3598|consensus
Probab=26.15  E-value=19  Score=44.82  Aligned_cols=20  Identities=25%  Similarity=0.212  Sum_probs=11.0

Q ss_pred             CCCCceeeeeecCCCCCCCc
Q psy679           33 GGDGKTTSFVNLSHPPTRSI   52 (510)
Q Consensus        33 ~~~~~~~~~~~~~~~~~~~~   52 (510)
                      +-.|-+|+--|+-||-..++
T Consensus      1846 ~p~~~~~s~~~~hhp~~~~~ 1865 (2220)
T KOG3598|consen 1846 SPKDDVTSEKNEHHPRASDA 1865 (2220)
T ss_pred             CCCCCCCChHhhcCCCchhh
Confidence            33444556666666655555


No 133
>PHA02518 ParA-like protein; Provisional
Probab=25.73  E-value=12  Score=35.34  Aligned_cols=18  Identities=44%  Similarity=0.697  Sum_probs=15.3

Q ss_pred             cccCCCCCceeeeeecCC
Q psy679           29 ISNSGGDGKTTSFVNLSH   46 (510)
Q Consensus        29 ~~~~~~~~~~~~~~~~~~   46 (510)
                      .+.-||-||||+-+||.+
T Consensus         6 ~~~KGGvGKTT~a~~la~   23 (211)
T PHA02518          6 LNQKGGAGKTTVATNLAS   23 (211)
T ss_pred             EcCCCCCCHHHHHHHHHH
Confidence            356799999999999875


No 134
>PHA00626 hypothetical protein
Probab=25.41  E-value=43  Score=27.00  Aligned_cols=14  Identities=21%  Similarity=0.612  Sum_probs=9.1

Q ss_pred             CceecCcCCCccCC
Q psy679          430 KPFSCDVCGRKFAR  443 (510)
Q Consensus       430 KPfkC~~CgKaFsr  443 (510)
                      ..|+|..||..|+.
T Consensus        22 nrYkCkdCGY~ft~   35 (59)
T PHA00626         22 DDYVCCDCGYNDSK   35 (59)
T ss_pred             cceEcCCCCCeech
Confidence            45777777766654


No 135
>PF09818 ABC_ATPase:  Predicted ATPase of the ABC class;  InterPro: IPR019195 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energise diverse biological systems. ABC transporters minimally consist of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain [].  The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyse ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarise the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site [, , ]. The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis [, , , , , ]. The ATP-Binding Cassette (ABC) superfamily forms one of the largest of all protein families with a diversity of physiological functions []. Several studies have shown that there is a correlation between the functional characterisation and the phylogenetic classification of the ABC cassette [, ]. More than 50 subfamilies have been described based on a phylogenetic and functional classification [, , ]; (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This entry consists of various predicted ABC transporter class ATPases. 
Probab=24.97  E-value=35  Score=37.68  Aligned_cols=80  Identities=31%  Similarity=0.420  Sum_probs=50.7

Q ss_pred             CccccccCCCCCceeeeeecCCCCCCCcc----ceeeeeec-CceeeecCCCC-ceeeeeccchhhcccCCccccCCCcc
Q psy679           25 GADQISNSGGDGKTTSFVNLSHPPTRSIP----EYLILVEM-PKLRVLKRPKG-CSCIVDISPYLESLTGTRGSLSGNSF   98 (510)
Q Consensus        25 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~-~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~   98 (510)
                      |.--|...|--||||+.--|..---.-||    ||+|-+.- -|+|-   -.| .---||||||+..|-+-      .+=
T Consensus       245 GvTlIvGGGyHGKSTLL~Ale~GVYnHipGDGRE~VVT~~~avkirA---EDGR~V~~vDISpFI~~LP~g------~dT  315 (448)
T PF09818_consen  245 GVTLIVGGGYHGKSTLLEALERGVYNHIPGDGREFVVTDPDAVKIRA---EDGRSVEGVDISPFINNLPGG------KDT  315 (448)
T ss_pred             cEEEEECCCCccHHHHHHHHHhcccCCCCCCCceEEEECCCceEEEe---cCCceEeCccchHHHhhCCCC------CCC
Confidence            44557777889999997666544444444    78877642 23433   122 22348999999999753      222


Q ss_pred             CCCCCCCCCCCcccc
Q psy679           99 ERPSTKDDSGELCQP  113 (510)
Q Consensus        99 ~~~~~~~~~~~~~~~  113 (510)
                      ..=+|.|-||---|-
T Consensus       316 ~~FsT~~ASGSTSqA  330 (448)
T PF09818_consen  316 TCFSTENASGSTSQA  330 (448)
T ss_pred             CcccccCCCchHHHH
Confidence            345899999976553


No 136
>KOG4377|consensus
Probab=24.68  E-value=34  Score=37.40  Aligned_cols=81  Identities=26%  Similarity=0.568  Sum_probs=0.0

Q ss_pred             cccCCCCCCCccCChHHHHHHHHHhcCCCc-------eecccCC--CccCCchhhhhhhhcccCCCc-------------
Q psy679          374 YACPVENCDRRFSRSDELTRHIRIHTGQKP-------FQCRICM--RSFSRSDHLTTHIRTHTGEKP-------------  431 (510)
Q Consensus       374 y~C~~e~CgKsFssss~LkrH~rtHtgeKP-------fkC~~Cg--KsF~~ks~L~rH~rtHtGeKP-------------  431 (510)
                      |.|....|.|   +-++...|...|+..+.       |.|..|+  ..|....+-..|.+-+.+++-             
T Consensus       314 yhC~~~~C~k---sTsdV~~h~nFht~~~n~GfrrthfhC~r~gCTdtfK~~khk~yh~kdda~~~dGfkkf~k~e~cay  390 (480)
T KOG4377|consen  314 YHCTGQICEK---STSDVLLHDNFHTDKRNNGFRRTHFHCQRIGCTDTFKDSKHKPYHYKDDAGEIDGFKKFFKDENCAY  390 (480)
T ss_pred             chhhhcccCc---ccccccccCccccccccCceecceeEEeccCCccccccccccccccCcchhhhhhhhhhhccccCCc


Q ss_pred             -----------eecCc--CCCccCChHHHHHHHHHhhcc
Q psy679          432 -----------FSCDV--CGRKFARSDEKKRHAKVHLKQ  457 (510)
Q Consensus       432 -----------fkC~~--CgKaFsrsssLkrHlRtHtge  457 (510)
                                 |.|..  |+..|...+.+..|.|.|.++
T Consensus       391 ~gCkys~~cnhfhc~r~Gc~~tl~s~sqm~shkrkheRq  429 (480)
T KOG4377|consen  391 TGCKYSGICNHFHCDRLGCEATLYSVSQMASHKRKHERQ  429 (480)
T ss_pred             cCcccccceeeeeecccCCceEEEehhhhhhhhhhhhhh


No 137
>COG1327 Predicted transcriptional regulator, consists of a Zn-ribbon and ATP-cone domains [Transcription]
Probab=24.37  E-value=36  Score=32.49  Aligned_cols=29  Identities=10%  Similarity=0.130  Sum_probs=16.4

Q ss_pred             hHHHHHHHHHhhcccchhHHHHHHHHHHH
Q psy679          444 SDEKKRHAKVHLKQRSKKESKMAVMMSQQ  472 (510)
Q Consensus       444 sssLkrHlRtHtgeK~~k~~~~~~~~~QQ  472 (510)
                      .+.|.+-+..-...++........+....
T Consensus        63 r~Kl~~gl~~A~~KRpVs~e~ie~~v~~i   91 (156)
T COG1327          63 REKLRRGLIRACEKRPVSSEQIEEAVSHI   91 (156)
T ss_pred             HHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Confidence            45566666666666666555555444443


No 138
>cd02037 MRP-like MRP (Multiple Resistance and pH adaptation) is a homologue of the Fer4_NifH superfamily. Like the other members of the superfamily, MRP contains a ATP-binding domain at the N-termini. It is found in bacteria as a membrane-spanning protein and functions as a Na+/H+ antiporter.
Probab=24.29  E-value=18  Score=33.44  Aligned_cols=18  Identities=50%  Similarity=0.674  Sum_probs=15.3

Q ss_pred             cccCCCCCceeeeeecCC
Q psy679           29 ISNSGGDGKTTSFVNLSH   46 (510)
Q Consensus        29 ~~~~~~~~~~~~~~~~~~   46 (510)
                      .+..||.||||+-+||..
T Consensus         5 ~s~kgG~GKTt~a~~LA~   22 (169)
T cd02037           5 MSGKGGVGKSTVAVNLAL   22 (169)
T ss_pred             ecCCCcCChhHHHHHHHH
Confidence            467899999999999874


No 139
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=24.12  E-value=43  Score=31.99  Aligned_cols=31  Identities=32%  Similarity=1.008  Sum_probs=13.6

Q ss_pred             ceecccCCCccCCchhhhhhhhcccCCCceecCcCC
Q psy679          403 PFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCG  438 (510)
Q Consensus       403 PfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~Cg  438 (510)
                      +|.|. |+..|.+   .++|-.+-.|+ .|.|..|+
T Consensus       117 ~Y~C~-C~q~~l~---~RRhn~~~~g~-~YrC~~C~  147 (156)
T COG3091         117 PYRCQ-CQQHYLR---IRRHNTVRRGE-VYRCGKCG  147 (156)
T ss_pred             eEEee-cCCccch---hhhcccccccc-eEEeccCC
Confidence            45555 5554432   12233333333 45555554


No 140
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=24.06  E-value=40  Score=31.91  Aligned_cols=33  Identities=21%  Similarity=0.459  Sum_probs=25.8

Q ss_pred             CCCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccC
Q psy679          371 ERPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFS  414 (510)
Q Consensus       371 EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~  414 (510)
                      ..-|.|  +.|+..|+....+.         .-|.|+.||....
T Consensus       107 ~~~Y~C--p~c~~r~tf~eA~~---------~~F~Cp~Cg~~L~  139 (158)
T TIGR00373       107 NMFFIC--PNMCVRFTFNEAME---------LNFTCPRCGAMLD  139 (158)
T ss_pred             CCeEEC--CCCCcEeeHHHHHH---------cCCcCCCCCCEee
Confidence            456899  56999998888774         2699999998653


No 141
>PF10263 SprT-like:  SprT-like family;  InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases. 
Probab=23.91  E-value=35  Score=31.21  Aligned_cols=32  Identities=28%  Similarity=0.791  Sum_probs=17.5

Q ss_pred             ceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccC
Q psy679          403 PFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFA  442 (510)
Q Consensus       403 PfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFs  442 (510)
                      .|.|..|+..+.      +|.+.  ..+.|.|..|+..|.
T Consensus       123 ~~~C~~C~~~~~------r~~~~--~~~~~~C~~C~~~l~  154 (157)
T PF10263_consen  123 VYRCPSCGREYK------RHRRS--KRKRYRCGRCGGPLV  154 (157)
T ss_pred             EEEcCCCCCEee------eeccc--chhhEECCCCCCEEE
Confidence            366777766552      23333  223477777776554


No 142
>KOG2071|consensus
Probab=23.61  E-value=46  Score=37.85  Aligned_cols=31  Identities=29%  Similarity=0.707  Sum_probs=24.8

Q ss_pred             CCCceecCcCCCccCChHHHHHHHHHhhccc
Q psy679          428 GEKPFSCDVCGRKFARSDEKKRHAKVHLKQR  458 (510)
Q Consensus       428 GeKPfkC~~CgKaFsrsssLkrHlRtHtgeK  458 (510)
                      ..+|..|..||++|.......+||-.|...+
T Consensus       415 ~~~pnqC~~CG~R~~~~ee~sk~md~H~dwh  445 (579)
T KOG2071|consen  415 KDSPNQCKSCGLRFDDSEERSKHMDIHDDWH  445 (579)
T ss_pred             cCCcchhcccccccccchhhhhHhhhhhhhh
Confidence            3567889999999999888888888776543


No 143
>smart00440 ZnF_C2C2 C2C2 Zinc finger. Nucleic-acid-binding motif in transcriptional elongation factor TFIIS and RNA polymerases.
Probab=23.38  E-value=36  Score=25.07  Aligned_cols=12  Identities=25%  Similarity=0.958  Sum_probs=9.0

Q ss_pred             ceecCcCCCccC
Q psy679          431 PFSCDVCGRKFA  442 (510)
Q Consensus       431 PfkC~~CgKaFs  442 (510)
                      -|.|..|++.|.
T Consensus        28 fy~C~~C~~~w~   39 (40)
T smart00440       28 FYVCTKCGHRWR   39 (40)
T ss_pred             EEEeCCCCCEeC
Confidence            478888887764


No 144
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=23.25  E-value=41  Score=29.43  Aligned_cols=32  Identities=19%  Similarity=0.517  Sum_probs=21.9

Q ss_pred             CceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCC
Q psy679          402 KPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFAR  443 (510)
Q Consensus       402 KPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsr  443 (510)
                      ..|.|++|++.-     +.   |.  +...+.|..|++.|+-
T Consensus        35 a~y~CpfCgk~~-----vk---R~--a~GIW~C~~C~~~~AG   66 (90)
T PTZ00255         35 AKYFCPFCGKHA-----VK---RQ--AVGIWRCKGCKKTVAG   66 (90)
T ss_pred             CCccCCCCCCCc-----ee---ee--eeEEEEcCCCCCEEeC
Confidence            469999998642     11   12  2356899999998864


No 145
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=23.24  E-value=60  Score=26.35  Aligned_cols=31  Identities=29%  Similarity=0.739  Sum_probs=16.8

Q ss_pred             ceecccCCCc-cCCchhhhhhhhcccCCCceecCcCCC
Q psy679          403 PFQCRICMRS-FSRSDHLTTHIRTHTGEKPFSCDVCGR  439 (510)
Q Consensus       403 PfkC~~CgKs-F~~ks~L~rH~rtHtGeKPfkC~~CgK  439 (510)
                      .|.|+.|++. -.+-..-    |.  -..+|+|+.||.
T Consensus        25 ~F~CPnCG~~~I~RC~~C----Rk--~~~~Y~CP~CGF   56 (59)
T PRK14890         25 KFLCPNCGEVIIYRCEKC----RK--QSNPYTCPKCGF   56 (59)
T ss_pred             EeeCCCCCCeeEeechhH----Hh--cCCceECCCCCC
Confidence            4777777765 2221111    11  125788888873


No 146
>TIGR01281 DPOR_bchL light-independent protochlorophyllide reductase, iron-sulfur ATP-binding protein. The BchL peptide (ChlL in chloroplast and cyanobacteria) is an ATP-binding iron-sulfur protein of the dark form protochlorophyllide reductase, an enzyme similar to nitrogenase. This subunit resembles the nitrogenase NifH subunit.
Probab=23.07  E-value=17  Score=36.14  Aligned_cols=17  Identities=47%  Similarity=0.540  Sum_probs=14.7

Q ss_pred             ccCCCCCceeeeeecCC
Q psy679           30 SNSGGDGKTTSFVNLSH   46 (510)
Q Consensus        30 ~~~~~~~~~~~~~~~~~   46 (510)
                      ..-||-||||+-+||.+
T Consensus         6 ~gKGGVGKTT~~~nLA~   22 (268)
T TIGR01281         6 YGKGGIGKSTTSSNLSV   22 (268)
T ss_pred             EcCCcCcHHHHHHHHHH
Confidence            35799999999999875


No 147
>PF01927 Mut7-C:  Mut7-C RNAse domain;  InterPro: IPR002782 This prokaryotic family of proteins have no known function. The proteins contain four conserved cysteines that may be involved in metal binding or disulphide bridges.
Probab=23.05  E-value=64  Score=29.90  Aligned_cols=47  Identities=15%  Similarity=0.369  Sum_probs=27.9

Q ss_pred             eecccCCCccCCchh--hh----hhhhcccCCCceecCcCCCccCChHHHHHHH
Q psy679          404 FQCRICMRSFSRSDH--LT----THIRTHTGEKPFSCDVCGRKFARSDEKKRHA  451 (510)
Q Consensus       404 fkC~~CgKsF~~ks~--L~----rH~rtHtGeKPfkC~~CgKaFsrsssLkrHl  451 (510)
                      -.|..|+..+..-..  ..    ..... ..+.-+.|..|||.|-..+++++-.
T Consensus        92 sRC~~CN~~L~~v~~~~v~~~vp~~v~~-~~~~f~~C~~C~kiyW~GsH~~~~~  144 (147)
T PF01927_consen   92 SRCPKCNGPLRPVSKEEVKDRVPPYVYE-TYDEFWRCPGCGKIYWEGSHWRRME  144 (147)
T ss_pred             CccCCCCcEeeechhhccccccCccccc-cCCeEEECCCCCCEecccccHHHHH
Confidence            469999875533211  10    01111 1234578999999998887776644


No 148
>PF15135 UPF0515:  Uncharacterised protein UPF0515
Probab=22.80  E-value=62  Score=33.33  Aligned_cols=62  Identities=24%  Similarity=0.594  Sum_probs=34.2

Q ss_pred             CCCCCCCCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccC
Q psy679          366 KTPVHERPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFA  442 (510)
Q Consensus       366 kth~~EKPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFs  442 (510)
                      .++...+-|.|  ..|...+        ..|.-....--+|..|.+.|---..     -.--|.--|.|..|++.|.
T Consensus       105 ~ip~~drqFaC--~~Cd~~W--------wRrvp~rKeVSRCr~C~~rYDPVP~-----dkmwG~aef~C~~C~h~F~  166 (278)
T PF15135_consen  105 LIPSVDRQFAC--SSCDHMW--------WRRVPQRKEVSRCRKCRKRYDPVPC-----DKMWGIAEFHCPKCRHNFR  166 (278)
T ss_pred             cccccceeeec--cccchHH--------HhccCcccccccccccccccCCCcc-----ccccceeeeecccccccch
Confidence            44556688999  5586443        1122222334567778776643221     0112344588888887775


No 149
>cd02036 MinD Bacterial cell division requires the formation of a septum at mid-cell. The site is determined by the min operon products MinC, MinD and MinE. MinC is a nonspecific inhibitor of the septum protein FtsZ. MinE is the supressor of MinC. MinD plays a pivotal role, selecting the mid-cell over other sites through the activation and regulation of MinC and MinE. MinD is a membrane-associated ATPase, related to nitrogenase iron protein. More distantly related proteins include flagellar biosynthesis proteins and ParA chromosome partitioning proteins. MinD is a monomer.
Probab=22.66  E-value=18  Score=32.93  Aligned_cols=18  Identities=50%  Similarity=0.639  Sum_probs=15.1

Q ss_pred             cccCCCCCceeeeeecCC
Q psy679           29 ISNSGGDGKTTSFVNLSH   46 (510)
Q Consensus        29 ~~~~~~~~~~~~~~~~~~   46 (510)
                      .+..||-||||+-+||..
T Consensus         5 ~~~kgG~GKtt~a~~la~   22 (179)
T cd02036           5 TSGKGGVGKTTTTANLGT   22 (179)
T ss_pred             eeCCCCCCHHHHHHHHHH
Confidence            356899999999999864


No 150
>TIGR01969 minD_arch cell division ATPase MinD, archaeal. This model represents the archaeal branch of the MinD family. MinD, a weak ATPase, works in bacteria with MinC as a generalized cell division inhibitor and, through interaction with MinE, prevents septum placement inappropriate sites. Often several members of this family are found in archaeal genomes, and the function is uncharacterized. More distantly related proteins include flagellar biosynthesis proteins and ParA chromosome partitioning proteins. The exact roles of the various archaeal MinD homologs are unknown.
Probab=22.59  E-value=19  Score=34.93  Aligned_cols=18  Identities=50%  Similarity=0.597  Sum_probs=15.7

Q ss_pred             cccCCCCCceeeeeecCC
Q psy679           29 ISNSGGDGKTTSFVNLSH   46 (510)
Q Consensus        29 ~~~~~~~~~~~~~~~~~~   46 (510)
                      ++..||-||||+-+||..
T Consensus         6 ~~~KGGvGKTt~a~~LA~   23 (251)
T TIGR01969         6 ASGKGGTGKTTITANLGV   23 (251)
T ss_pred             EcCCCCCcHHHHHHHHHH
Confidence            467899999999999975


No 151
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=22.51  E-value=76  Score=21.30  Aligned_cols=20  Identities=25%  Similarity=0.471  Sum_probs=14.8

Q ss_pred             ecCcCCCccCChHHHHHHHHH
Q psy679          433 SCDVCGRKFARSDEKKRHAKV  453 (510)
Q Consensus       433 kC~~CgKaFsrsssLkrHlRt  453 (510)
                      .|++|++.+ ....+.+|+..
T Consensus         3 ~CPiC~~~v-~~~~in~HLD~   22 (26)
T smart00734        3 QCPVCFREV-PENLINSHLDS   22 (26)
T ss_pred             cCCCCcCcc-cHHHHHHHHHH
Confidence            588888887 55677788753


No 152
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=22.47  E-value=44  Score=32.30  Aligned_cols=33  Identities=24%  Similarity=0.612  Sum_probs=25.4

Q ss_pred             CCcccCCCCCCCccCChHHHHHHHHHhcCCCceecccCCCccCC
Q psy679          372 RPYACPVENCDRRFSRSDELTRHIRIHTGQKPFQCRICMRSFSR  415 (510)
Q Consensus       372 KPy~C~~e~CgKsFssss~LkrH~rtHtgeKPfkC~~CgKsF~~  415 (510)
                      .-|.|  +.|++.|+....+.         .-|.|+.||.....
T Consensus       116 ~~Y~C--p~C~~rytf~eA~~---------~~F~Cp~Cg~~L~~  148 (178)
T PRK06266        116 MFFFC--PNCHIRFTFDEAME---------YGFRCPQCGEMLEE  148 (178)
T ss_pred             CEEEC--CCCCcEEeHHHHhh---------cCCcCCCCCCCCee
Confidence            46899  55999998877653         36999999986543


No 153
>TIGR00244 transcriptional regulator NrdR. Members of this almost entirely bacterial family contain an ATP cone domain (PFAM:PF03477). There is never more than one member per genome. Common gene symbols given include nrdR, ybaD, ribX and ytcG. The member from Streptomyces coelicolor is found upstream in the operon of the class II oxygen-independent ribonucleotide reductase gene nrdJ and was shown to repress nrdJ expression. Many members of this family are found near genes for riboflavin biosynthesis in Gram-negative bacteria, suggesting a role in that pathway. However, a phylogenetic profiling study associates members of this family with the presence of a palindromic signal with consensus acaCwAtATaTwGtgt, termed the NrdR-box, an upstream element for most operons for ribonucleotide reductase of all three classes in bacterial genomes.
Probab=22.45  E-value=43  Score=31.79  Aligned_cols=15  Identities=20%  Similarity=0.507  Sum_probs=9.5

Q ss_pred             CceecccCCCccCCc
Q psy679          402 KPFQCRICMRSFSRS  416 (510)
Q Consensus       402 KPfkC~~CgKsF~~k  416 (510)
                      +.-.|..|++.|..-
T Consensus        27 RRReC~~C~~RFTTy   41 (147)
T TIGR00244        27 RRRECLECHERFTTF   41 (147)
T ss_pred             ecccCCccCCcccee
Confidence            334677777777653


No 154
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=22.27  E-value=45  Score=29.33  Aligned_cols=27  Identities=33%  Similarity=0.860  Sum_probs=18.9

Q ss_pred             CceecccCCCccCCchhhhhhhhcccCCCceecCcCC
Q psy679          402 KPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCG  438 (510)
Q Consensus       402 KPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~Cg  438 (510)
                      +|-.|..||..|.. ..+         .+|-+|+.|.
T Consensus        57 ~Pa~CkkCGfef~~-~~i---------k~pSRCP~CK   83 (97)
T COG3357          57 RPARCKKCGFEFRD-DKI---------KKPSRCPKCK   83 (97)
T ss_pred             cChhhcccCccccc-ccc---------CCcccCCcch
Confidence            57788899888865 111         2577888885


No 155
>TIGR00280 L37a ribosomal protein L37a. This model finds eukaryotic ribosomal protein L37a and its archaeal orthologs. The nomeclature is tricky because eukaryotes have proteins called both L37 and L37a.
Probab=22.11  E-value=41  Score=29.51  Aligned_cols=32  Identities=28%  Similarity=0.656  Sum_probs=21.8

Q ss_pred             CceecccCCCccCCchhhhhhhhcccCCCceecCcCCCccCC
Q psy679          402 KPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFAR  443 (510)
Q Consensus       402 KPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFsr  443 (510)
                      ..|.|++|++.-.        .|.-  ...+.|..|++.|+-
T Consensus        34 a~y~CpfCgk~~v--------kR~a--~GIW~C~~C~~~~AG   65 (91)
T TIGR00280        34 AKYVCPFCGKKTV--------KRGS--TGIWTCRKCGAKFAG   65 (91)
T ss_pred             cCccCCCCCCCce--------EEEe--eEEEEcCCCCCEEeC
Confidence            4699999986421        1222  346899999998863


No 156
>COG0489 Mrp ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]
Probab=21.58  E-value=20  Score=36.49  Aligned_cols=18  Identities=50%  Similarity=0.625  Sum_probs=15.6

Q ss_pred             ccccCCCCCceeeeeecC
Q psy679           28 QISNSGGDGKTTSFVNLS   45 (510)
Q Consensus        28 ~~~~~~~~~~~~~~~~~~   45 (510)
                      -.|..||+||||..+||.
T Consensus        62 V~S~kgGvGKStva~nLA   79 (265)
T COG0489          62 VTSGKGGVGKSTVAVNLA   79 (265)
T ss_pred             EEeCCCCCcHHHHHHHHH
Confidence            357889999999999985


No 157
>cd03110 Fer4_NifH_child This protein family's function is unkown. It contains nucleotide binding site. It uses NTP as energy source to transfer electron or ion.
Probab=21.30  E-value=20  Score=33.23  Aligned_cols=17  Identities=53%  Similarity=0.714  Sum_probs=14.9

Q ss_pred             cccCCCCCceeeeeecC
Q psy679           29 ISNSGGDGKTTSFVNLS   45 (510)
Q Consensus        29 ~~~~~~~~~~~~~~~~~   45 (510)
                      .|..||-||||.-+||.
T Consensus         5 ~s~kgG~GKSt~a~nLA   21 (179)
T cd03110           5 ISGKGGTGKTTVTAALA   21 (179)
T ss_pred             EcCCCCCCHHHHHHHHH
Confidence            46789999999999995


No 158
>PF06524 NOA36:  NOA36 protein;  InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=21.02  E-value=38  Score=35.05  Aligned_cols=84  Identities=21%  Similarity=0.455  Sum_probs=43.9

Q ss_pred             CCCCCCcccCCCCCCCccCChHHHHHHHHHh--cCCCceecccCCCccCCchh-------hhhhhhc----ccCCCceec
Q psy679          368 PVHERPYACPVENCDRRFSRSDELTRHIRIH--TGQKPFQCRICMRSFSRSDH-------LTTHIRT----HTGEKPFSC  434 (510)
Q Consensus       368 h~~EKPy~C~~e~CgKsFssss~LkrH~rtH--tgeKPfkC~~CgKsF~~ks~-------L~rH~rt----HtGeKPfkC  434 (510)
                      ..+.+.|.|.+  |.........+ .|+..-  .....|+|.-|++.-. -+-       ...|+|.    ....+++.|
T Consensus       137 ~hGGrif~Csf--C~~flCEDDQF-EHQAsCQvLe~E~~KC~SCNrlGq-~sCLRCK~cfCddHvrrKg~ky~k~k~~PC  212 (314)
T PF06524_consen  137 DHGGRIFKCSF--CDNFLCEDDQF-EHQASCQVLESETFKCQSCNRLGQ-YSCLRCKICFCDDHVRRKGFKYEKGKPIPC  212 (314)
T ss_pred             cCCCeEEEeec--CCCeeeccchh-hhhhhhhhhhcccccccccccccc-hhhhheeeeehhhhhhhcccccccCCCCCC
Confidence            34567788855  87544433333 354321  1223577766664211 111       1123332    233478889


Q ss_pred             CcCCCccCChHHHHHHHHHhh
Q psy679          435 DVCGRKFARSDEKKRHAKVHL  455 (510)
Q Consensus       435 ~~CgKaFsrsssLkrHlRtHt  455 (510)
                      +.||..-.....|..-.|+|.
T Consensus       213 PKCg~et~eTkdLSmStR~hk  233 (314)
T PF06524_consen  213 PKCGYETQETKDLSMSTRSHK  233 (314)
T ss_pred             CCCCCcccccccceeeeecch
Confidence            889877666666655555554


No 159
>PRK14873 primosome assembly protein PriA; Provisional
Probab=20.69  E-value=29  Score=40.19  Aligned_cols=27  Identities=26%  Similarity=0.668  Sum_probs=17.2

Q ss_pred             hcCCCceecccCCCccCCchhhhhhhhcccCCCceecCcCCCc
Q psy679          398 HTGQKPFQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRK  440 (510)
Q Consensus       398 HtgeKPfkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKa  440 (510)
                      |...+...|.+|+..                ..++.|..||..
T Consensus       405 h~~~~~l~Ch~CG~~----------------~~p~~Cp~Cgs~  431 (665)
T PRK14873        405 PSAGGTPRCRWCGRA----------------APDWRCPRCGSD  431 (665)
T ss_pred             ecCCCeeECCCCcCC----------------CcCccCCCCcCC
Confidence            444455667777632                246789999865


No 160
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=20.67  E-value=55  Score=30.20  Aligned_cols=13  Identities=31%  Similarity=0.697  Sum_probs=7.2

Q ss_pred             ceecccCCCccCC
Q psy679          403 PFQCRICMRSFSR  415 (510)
Q Consensus       403 PfkC~~CgKsF~~  415 (510)
                      .+.|..|+..|..
T Consensus        70 ~~~C~~CG~~~~~   82 (135)
T PRK03824         70 VLKCRNCGNEWSL   82 (135)
T ss_pred             EEECCCCCCEEec
Confidence            3566666655543


No 161
>PF05191 ADK_lid:  Adenylate kinase, active site lid;  InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=20.64  E-value=40  Score=24.44  Aligned_cols=31  Identities=23%  Similarity=0.451  Sum_probs=16.6

Q ss_pred             eecccCCCccCCchhhhhhhhcccCCCceecCcCCCccC
Q psy679          404 FQCRICMRSFSRSDHLTTHIRTHTGEKPFSCDVCGRKFA  442 (510)
Q Consensus       404 fkC~~CgKsF~~ks~L~rH~rtHtGeKPfkC~~CgKaFs  442 (510)
                      +.|+.|++.|..        ........=+|+.||-.+.
T Consensus         2 r~C~~Cg~~Yh~--------~~~pP~~~~~Cd~cg~~L~   32 (36)
T PF05191_consen    2 RICPKCGRIYHI--------EFNPPKVEGVCDNCGGELV   32 (36)
T ss_dssp             EEETTTTEEEET--------TTB--SSTTBCTTTTEBEB
T ss_pred             cCcCCCCCcccc--------ccCCCCCCCccCCCCCeeE
Confidence            457777777632        2222233456777876544


No 162
>COG1655 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.58  E-value=38  Score=34.55  Aligned_cols=40  Identities=28%  Similarity=0.665  Sum_probs=25.3

Q ss_pred             CCceecccCCCccCCchhhhhhhhcccCCC----------c-----eecCcCCCc
Q psy679          401 QKPFQCRICMRSFSRSDHLTTHIRTHTGEK----------P-----FSCDVCGRK  440 (510)
Q Consensus       401 eKPfkC~~CgKsF~~ks~L~rH~rtHtGeK----------P-----fkC~~CgKa  440 (510)
                      ++.+.|++|+..|....-+..-.|+-.|+-          |     ..|+.|..+
T Consensus        17 kk~ieCPvC~tkFkkeev~tgsiRiiagDld~~lkygninP~fY~VvvCP~C~yA   71 (267)
T COG1655          17 KKTIECPVCNTKFKKEEVKTGSIRIIAGDLDFFLKYGNINPYFYDVVVCPICYYA   71 (267)
T ss_pred             hceeccCcccchhhhhheeccceeEecccccceeeccccCCceeEEEEcchhhHH
Confidence            456778888877776665555555544421          2     369999753


No 163
>PRK13234 nifH nitrogenase reductase; Reviewed
Probab=20.17  E-value=17  Score=37.29  Aligned_cols=37  Identities=38%  Similarity=0.425  Sum_probs=25.4

Q ss_pred             ccCCCCCceeeeeecCCCCCCCccceeeeeecCceeeecCCCCceeeeeccchhhc
Q psy679           30 SNSGGDGKTTSFVNLSHPPTRSIPEYLILVEMPKLRVLKRPKGCSCIVDISPYLES   85 (510)
Q Consensus        30 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   85 (510)
                      +.-||-||||+-+||.+-          |.++ ..|        .+|||..|-..+
T Consensus        10 ~~KGGvGKTt~~~nLa~~----------la~~-g~k--------VLliD~D~q~~~   46 (295)
T PRK13234         10 YGKGGIGKSTTSQNTLAA----------LVEM-GQK--------ILIVGCDPKADS   46 (295)
T ss_pred             ECCCCccHHHHHHHHHHH----------HHHC-CCe--------EEEEeccccccc
Confidence            468999999999999752          2232 223        468898886433


No 164
>PF01363 FYVE:  FYVE zinc finger;  InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=20.05  E-value=56  Score=25.97  Aligned_cols=13  Identities=31%  Similarity=0.723  Sum_probs=5.3

Q ss_pred             CceecccCCCccC
Q psy679          402 KPFQCRICMRSFS  414 (510)
Q Consensus       402 KPfkC~~CgKsF~  414 (510)
                      +.+.|..||+.|-
T Consensus        24 rrhhCr~CG~~vC   36 (69)
T PF01363_consen   24 RRHHCRNCGRVVC   36 (69)
T ss_dssp             -EEE-TTT--EEE
T ss_pred             eeEccCCCCCEEC
Confidence            4456666666554


No 165
>PRK13849 putative crown gall tumor protein VirC1; Provisional
Probab=20.01  E-value=17  Score=36.16  Aligned_cols=18  Identities=44%  Similarity=0.634  Sum_probs=15.6

Q ss_pred             cccCCCCCceeeeeecCC
Q psy679           29 ISNSGGDGKTTSFVNLSH   46 (510)
Q Consensus        29 ~~~~~~~~~~~~~~~~~~   46 (510)
                      .+.-||-||||+-+||.+
T Consensus         7 ~n~KGGvGKTT~a~nLA~   24 (231)
T PRK13849          7 CSFKGGAGKTTALMGLCA   24 (231)
T ss_pred             ECCCCCccHHHHHHHHHH
Confidence            466799999999999875


Done!