Query         psy8702
Match_columns 194
No_of_seqs    190 out of 2086
Neff          8.7 
Searched_HMMs 46136
Date          Fri Aug 16 17:04:57 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy8702.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8702hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG2462|consensus               99.9 2.9E-24 6.2E-29  169.1   3.3  125   16-167   126-254 (279)
  2 KOG2462|consensus               99.9 7.4E-23 1.6E-27  161.2   3.2  110   18-157   159-268 (279)
  3 KOG1074|consensus               99.6 6.9E-16 1.5E-20  136.6   5.7   57   16-80    601-657 (958)
  4 KOG1074|consensus               99.6 5.4E-16 1.2E-20  137.2   3.0   56   17-80    350-405 (958)
  5 KOG3576|consensus               99.5   6E-16 1.3E-20  117.3  -1.3  116   15-155   112-239 (267)
  6 KOG3623|consensus               99.5 3.6E-14 7.7E-19  124.3   3.7  122   17-153   207-330 (1007)
  7 KOG3623|consensus               99.4 9.4E-14   2E-18  121.7   2.9   83   15-121   889-971 (1007)
  8 KOG3608|consensus               99.3 1.2E-13 2.5E-18  112.5  -1.3  128   24-161   183-324 (467)
  9 KOG3608|consensus               99.3 2.8E-13   6E-18  110.4  -0.6  142   14-175   231-395 (467)
 10 KOG3576|consensus               99.2 1.3E-12 2.9E-17   99.2  -2.0   84   53-153   114-198 (267)
 11 PLN03086 PRLI-interacting fact  98.8 2.9E-09 6.4E-14   93.4   4.5   91   19-140   452-552 (567)
 12 PHA02768 hypothetical protein;  98.8 3.8E-09 8.3E-14   64.3   1.9   43   21-73      6-48  (55)
 13 PHA00733 hypothetical protein   98.6   2E-08 4.4E-13   72.6   3.2   89   16-125    36-124 (128)
 14 PHA00733 hypothetical protein   98.5 1.9E-08 4.1E-13   72.8   0.2   79   53-153    37-120 (128)
 15 PF13465 zf-H2C2_2:  Zinc-finge  98.4 8.6E-08 1.9E-12   49.8   1.5   26   43-68      1-26  (26)
 16 PHA02768 hypothetical protein;  98.4 9.5E-08 2.1E-12   58.2   1.8   41  102-148     7-47  (55)
 17 PF13465 zf-H2C2_2:  Zinc-finge  98.4 2.9E-08 6.4E-13   51.6  -0.9   25  115-140     1-25  (26)
 18 PLN03086 PRLI-interacting fact  98.4 3.9E-07 8.4E-12   80.3   5.1   89   54-165   451-550 (567)
 19 KOG3993|consensus               98.0   1E-06 2.2E-11   73.9  -0.8   53   20-80    267-319 (500)
 20 PHA00616 hypothetical protein   97.9 4.1E-06 8.8E-11   48.7   1.3   32   21-60      2-33  (44)
 21 PHA00616 hypothetical protein   97.9 3.9E-06 8.3E-11   48.7   0.9   34   56-90      1-34  (44)
 22 KOG3993|consensus               97.7 1.7E-05 3.7E-10   66.7   2.1  124   18-153   293-479 (500)
 23 PHA00732 hypothetical protein   97.6 3.3E-05 7.1E-10   51.1   2.3   46   21-80      2-48  (79)
 24 PHA00732 hypothetical protein   97.5 0.00013 2.8E-09   48.2   3.2   47   56-124     1-48  (79)
 25 PF00096 zf-C2H2:  Zinc finger,  97.2 0.00012 2.7E-09   36.4   0.8   20   58-77      2-21  (23)
 26 PF00096 zf-C2H2:  Zinc finger,  97.2 0.00013 2.8E-09   36.3   0.7   23   21-51      1-23  (23)
 27 PF05605 zf-Di19:  Drought indu  97.1 0.00082 1.8E-08   41.0   3.5   51   19-80      1-53  (54)
 28 COG5189 SFP1 Putative transcri  96.8 0.00072 1.6E-08   55.3   2.1   55   82-153   346-419 (423)
 29 PF13894 zf-C2H2_4:  C2H2-type   96.7 0.00092   2E-08   33.1   1.3   23   57-79      1-23  (24)
 30 PF13912 zf-C2H2_6:  C2H2-type   96.6 0.00092   2E-08   34.5   0.9   25   56-80      1-25  (27)
 31 PF13894 zf-C2H2_4:  C2H2-type   96.5  0.0012 2.7E-08   32.6   1.0   23   21-51      1-23  (24)
 32 PF05605 zf-Di19:  Drought indu  96.4  0.0044 9.5E-08   37.7   3.4   51   56-124     2-53  (54)
 33 PF12756 zf-C2H2_2:  C2H2 type   96.0  0.0039 8.5E-08   42.2   1.8   57   22-79      1-73  (100)
 34 PF13912 zf-C2H2_6:  C2H2-type   95.9  0.0036 7.9E-08   32.2   0.9   24   21-52      2-25  (27)
 35 COG5048 FOG: Zn-finger [Genera  95.9  0.0074 1.6E-07   51.0   3.2  139   19-169   288-455 (467)
 36 smart00355 ZnF_C2H2 zinc finge  95.8  0.0068 1.5E-07   30.2   1.7   24   57-80      1-24  (26)
 37 smart00355 ZnF_C2H2 zinc finge  95.6  0.0084 1.8E-07   29.8   1.6   24   21-52      1-24  (26)
 38 PF09237 GAGA:  GAGA factor;  I  95.5    0.01 2.2E-07   35.4   1.9   33   52-85     20-52  (54)
 39 PRK04860 hypothetical protein;  95.4   0.011 2.4E-07   44.4   2.2   39   19-69    118-156 (160)
 40 COG5189 SFP1 Putative transcri  95.2  0.0084 1.8E-07   49.2   1.3   52   17-76    346-418 (423)
 41 PF12756 zf-C2H2_2:  C2H2 type   94.9  0.0062 1.3E-07   41.2  -0.3   65   58-123     1-73  (100)
 42 PF09237 GAGA:  GAGA factor;  I  94.8   0.026 5.5E-07   33.7   2.1   33   16-56     20-52  (54)
 43 PF12874 zf-met:  Zinc-finger o  94.4   0.017 3.6E-07   29.0   0.6   21   57-77      1-21  (25)
 44 PF12874 zf-met:  Zinc-finger o  94.2   0.017 3.6E-07   29.0   0.4   22   21-50      1-22  (25)
 45 PF12171 zf-C2H2_jaz:  Zinc-fin  93.7   0.012 2.6E-07   30.4  -0.7   19   58-76      3-21  (27)
 46 PF12171 zf-C2H2_jaz:  Zinc-fin  93.1   0.041 8.9E-07   28.3   0.8   21   20-48      1-21  (27)
 47 PRK04860 hypothetical protein;  92.3   0.086 1.9E-06   39.6   1.8   32   56-92    119-150 (160)
 48 KOG2893|consensus               90.9     0.1 2.2E-06   41.3   1.0   53   16-80      6-59  (341)
 49 PF13909 zf-H2C2_5:  C2H2-type   90.5    0.12 2.6E-06   25.6   0.7   22   57-79      1-22  (24)
 50 PF09538 FYDLN_acid:  Protein o  89.4    0.29 6.4E-06   34.2   2.2   34   17-69      6-39  (108)
 51 KOG2231|consensus               89.1     0.4 8.7E-06   43.6   3.4   41  108-152   220-263 (669)
 52 PF13913 zf-C2HC_2:  zinc-finge  88.6    0.34 7.4E-06   24.5   1.6   18   58-76      4-21  (25)
 53 KOG2893|consensus               87.6    0.24 5.2E-06   39.3   0.8   45  102-151    12-58  (341)
 54 smart00451 ZnF_U1 U1-like zinc  86.5    0.49 1.1E-05   25.5   1.5   22   56-77      3-24  (35)
 55 KOG2231|consensus               82.7    0.69 1.5E-05   42.2   1.5   56  100-160   182-240 (669)
 56 COG5048 FOG: Zn-finger [Genera  82.5    0.76 1.6E-05   38.7   1.7   70   55-138   288-362 (467)
 57 TIGR02300 FYDLN_acid conserved  82.4     1.1 2.4E-05   32.1   2.2   38   17-73      6-43  (129)
 58 cd00350 rubredoxin_like Rubred  82.3    0.89 1.9E-05   24.5   1.3   13   21-33      2-14  (33)
 59 PF09986 DUF2225:  Uncharacteri  78.9    0.71 1.5E-05   36.3   0.3   21   55-75      4-24  (214)
 60 KOG2186|consensus               77.2     1.6 3.6E-05   34.9   1.8   46   21-77      4-49  (276)
 61 PF10571 UPF0547:  Uncharacteri  73.8     3.1 6.7E-05   21.2   1.8   10  101-110    15-24  (26)
 62 KOG1146|consensus               72.3     1.6 3.5E-05   42.6   0.8   98   23-152   439-538 (1406)
 63 KOG2482|consensus               68.0     3.8 8.2E-05   34.4   2.0   66   57-122   280-356 (423)
 64 PRK00464 nrdR transcriptional   66.7     1.6 3.6E-05   32.5  -0.3   19   55-73     27-45  (154)
 65 TIGR02098 MJ0042_CXXC MJ0042 f  65.4     4.9 0.00011   22.1   1.6   34   21-67      3-36  (38)
 66 PF06524 NOA36:  NOA36 protein;  64.6     8.7 0.00019   31.1   3.4   95   53-157   139-234 (314)
 67 PF09723 Zn-ribbon_8:  Zinc rib  63.3     3.2   7E-05   23.6   0.6   13   21-33      6-18  (42)
 68 cd00729 rubredoxin_SM Rubredox  62.7     5.1 0.00011   21.7   1.3   12   21-32      3-14  (34)
 69 PF13719 zinc_ribbon_5:  zinc-r  61.8     7.2 0.00016   21.5   1.8   33   22-67      4-36  (37)
 70 PF13717 zinc_ribbon_4:  zinc-r  61.4     7.3 0.00016   21.4   1.8   32   22-66      4-35  (36)
 71 PRK00398 rpoP DNA-directed RNA  60.1     6.6 0.00014   22.7   1.5   14   20-33      3-16  (46)
 72 COG1997 RPL43A Ribosomal prote  59.4     6.5 0.00014   26.3   1.5   14   55-68     52-65  (89)
 73 TIGR02605 CxxC_CxxC_SSSS putat  59.3     3.8 8.2E-05   24.2   0.4   13   21-33      6-18  (52)
 74 PRK06266 transcription initiat  58.1     5.2 0.00011   30.5   1.1   17   55-71    116-132 (178)
 75 COG1592 Rubrerythrin [Energy p  57.5     7.4 0.00016   29.4   1.8   24   55-92    133-156 (166)
 76 COG5236 Uncharacterized conser  57.3      13 0.00027   31.5   3.2   96    4-122   203-303 (493)
 77 smart00834 CxxC_CXXC_SSSS Puta  56.7     6.5 0.00014   21.7   1.1   13   21-33      6-18  (41)
 78 TIGR00622 ssl1 transcription f  56.3      28 0.00061   24.5   4.4   77   55-139    14-91  (112)
 79 COG2888 Predicted Zn-ribbon RN  55.9     8.9 0.00019   23.7   1.6    9   84-92     49-57  (61)
 80 PF15135 UPF0515:  Uncharacteri  55.7      10 0.00022   30.5   2.3   11   55-65    111-121 (278)
 81 KOG1146|consensus               54.7     6.9 0.00015   38.5   1.5   51   19-77    464-539 (1406)
 82 smart00614 ZnF_BED BED zinc fi  53.7      12 0.00027   21.9   2.0   29   20-51     18-47  (50)
 83 smart00531 TFIIE Transcription  53.6      10 0.00022   27.9   1.9   37   52-93     95-131 (147)
 84 PF05443 ROS_MUCR:  ROS/MUCR tr  53.0     9.7 0.00021   27.6   1.7   29   17-56     69-97  (132)
 85 COG4049 Uncharacterized protei  47.8     7.1 0.00015   23.8   0.3   23   54-76     15-37  (65)
 86 COG4530 Uncharacterized protei  47.3      12 0.00025   26.2   1.3   32   18-68      7-38  (129)
 87 PF03604 DNA_RNApol_7kD:  DNA d  46.5      16 0.00035   19.6   1.5   13   21-33      1-13  (32)
 88 COG1571 Predicted DNA-binding   45.7      15 0.00032   31.9   2.0   37   16-71    346-382 (421)
 89 PRK09678 DNA-binding transcrip  45.2     8.7 0.00019   24.8   0.4   42   21-72      2-45  (72)
 90 smart00659 RPOLCX RNA polymera  43.1      17 0.00036   21.0   1.3   12   21-32      3-14  (44)
 91 PF13248 zf-ribbon_3:  zinc-rib  42.6      19 0.00042   18.0   1.4    7  101-107    17-23  (26)
 92 PRK14890 putative Zn-ribbon RN  41.9      19 0.00041   22.2   1.5    9   57-65     26-34  (59)
 93 smart00734 ZnF_Rad18 Rad18-lik  41.9      22 0.00047   18.0   1.5   20   57-77      2-21  (26)
 94 KOG4167|consensus               40.4      11 0.00024   34.9   0.4   28   18-53    790-817 (907)
 95 TIGR00686 phnA alkylphosphonat  40.3      19 0.00041   25.1   1.5   30   21-69      3-32  (109)
 96 TIGR00373 conserved hypothetic  39.8      28 0.00062   25.9   2.5   34   49-92    102-135 (158)
 97 KOG2186|consensus               39.8      12 0.00026   30.1   0.6   56   56-130     3-58  (276)
 98 PTZ00255 60S ribosomal protein  38.4      15 0.00033   24.7   0.8   14   55-68     53-66  (90)
 99 PHA00626 hypothetical protein   37.7      16 0.00034   22.3   0.7   16   54-69     21-36  (59)
100 PF07754 DUF1610:  Domain of un  37.6      11 0.00025   18.8   0.1   10  127-137    15-24  (24)
101 PF02892 zf-BED:  BED zinc fing  37.4      21 0.00046   20.1   1.2   12   56-67     16-27  (45)
102 KOG2593|consensus               36.9      23 0.00049   30.8   1.8   42   47-92    119-160 (436)
103 COG3677 Transposase and inacti  36.7      18 0.00038   26.1   1.0   16   54-69     51-66  (129)
104 COG4896 Uncharacterized protei  34.7      17 0.00037   22.6   0.5   30  108-138    11-40  (68)
105 PRK10220 hypothetical protein;  34.4      29 0.00063   24.2   1.7   31   21-70      4-34  (111)
106 PF04959 ARS2:  Arsenite-resist  34.1      19 0.00042   28.3   0.9   18   16-33     73-90  (214)
107 COG5236 Uncharacterized conser  33.2      13 0.00028   31.4  -0.2   58   58-132   222-279 (493)
108 KOG3583|consensus               33.1      23 0.00049   28.0   1.1   24  171-194   254-277 (279)
109 TIGR00280 L37a ribosomal prote  32.8      18 0.00039   24.4   0.4   12   56-67     53-64  (91)
110 PF01780 Ribosomal_L37ae:  Ribo  32.7      12 0.00026   25.3  -0.4   12   56-67     53-64  (90)
111 PRK03564 formate dehydrogenase  32.2      26 0.00056   29.3   1.3   49   55-108   186-234 (309)
112 COG1996 RPC10 DNA-directed RNA  31.8      25 0.00054   20.9   0.9   11   20-30      6-16  (49)
113 PF08274 PhnA_Zn_Ribbon:  PhnA   30.9      19 0.00042   19.0   0.3   12   21-32      3-14  (30)
114 PF14446 Prok-RING_1:  Prokaryo  30.5      39 0.00084   20.5   1.6   14   20-33      5-18  (54)
115 PF13451 zf-trcl:  Probable zin  29.8      43 0.00093   19.9   1.6   17   55-71      3-19  (49)
116 PF05191 ADK_lid:  Adenylate ki  29.4      36 0.00079   18.6   1.2   12   22-33      3-14  (36)
117 PRK14559 putative protein seri  29.2      83  0.0018   29.1   4.2    8  130-138    43-50  (645)
118 PF01286 XPA_N:  XPA protein N-  29.1      25 0.00054   19.1   0.5   14   20-33      3-16  (34)
119 KOG2593|consensus               29.0      39 0.00084   29.4   1.9   39   16-65    124-162 (436)
120 COG1198 PriA Primosomal protei  28.3      26 0.00056   32.8   0.8    9   23-31    438-446 (730)
121 PF01363 FYVE:  FYVE zinc finge  27.9      31 0.00068   21.4   0.9   16   17-32      6-21  (69)
122 COG4957 Predicted transcriptio  27.0      39 0.00084   24.6   1.3   25   21-56     77-101 (148)
123 PRK03976 rpl37ae 50S ribosomal  26.9      25 0.00054   23.7   0.4   12   56-67     54-65  (90)
124 COG1655 Uncharacterized protei  26.3      16 0.00034   29.1  -0.8   30   54-84     17-46  (267)
125 PF06906 DUF1272:  Protein of u  25.9      31 0.00067   21.1   0.6   27   85-111    26-52  (57)
126 COG3357 Predicted transcriptio  25.7      46   0.001   22.5   1.4   28   54-92     56-83  (97)
127 PF15269 zf-C2H2_7:  Zinc-finge  25.5 1.1E+02  0.0024   17.8   2.8   24   18-49     18-41  (54)
128 PF13878 zf-C2H2_3:  zinc-finge  25.3      76  0.0017   17.8   2.1   13   21-33     14-26  (41)
129 KOG4124|consensus               25.2      14  0.0003   31.3  -1.4   58   18-75    347-417 (442)
130 PF04959 ARS2:  Arsenite-resist  24.6      37 0.00081   26.7   1.0   28   53-80     74-101 (214)
131 TIGR00244 transcriptional regu  24.6      44 0.00096   24.7   1.3   11  129-140    29-39  (147)
132 KOG1813|consensus               24.4      48   0.001   27.5   1.6   45   55-109   240-284 (313)
133 PF12760 Zn_Tnp_IS1595:  Transp  24.3      36 0.00077   19.5   0.6    9   55-63     36-44  (46)
134 PF13824 zf-Mss51:  Zinc-finger  24.1      64  0.0014   19.6   1.7   14  124-138    10-23  (55)
135 PF12013 DUF3505:  Protein of u  23.3      53  0.0011   22.5   1.5   24   57-80     81-108 (109)
136 COG1327 Predicted transcriptio  23.1      52  0.0011   24.4   1.4   11  129-140    29-39  (156)
137 PRK12496 hypothetical protein;  23.0      46   0.001   24.9   1.2   14   19-32    126-139 (164)
138 PF12773 DZR:  Double zinc ribb  22.9      92   0.002   17.8   2.3   27  100-138    12-38  (50)
139 COG1198 PriA Primosomal protei  22.1      49  0.0011   31.0   1.4   23  115-138   462-484 (730)
140 PRK00432 30S ribosomal protein  20.8      68  0.0015   19.0   1.4   12   55-66     36-47  (50)

No 1  
>KOG2462|consensus
Probab=99.89  E-value=2.9e-24  Score=169.14  Aligned_cols=125  Identities=22%  Similarity=0.271  Sum_probs=82.3

Q ss_pred             CCCCCcccccccccccccccccccccchHHHHHhHcCC---CCCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccc
Q psy8702          16 KDSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTD---HKPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAE   92 (194)
Q Consensus        16 ~~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~---~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~   92 (194)
                      .....+.|++|||.|.+.        ++|.+|.++|-.   .+.+.|++|+|.|.....|.+|+++|.   .+.+|.+| 
T Consensus       126 ~~~~r~~c~eCgk~ysT~--------snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~---l~c~C~iC-  193 (279)
T KOG2462|consen  126 AKHPRYKCPECGKSYSTS--------SNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT---LPCECGIC-  193 (279)
T ss_pred             ccCCceeccccccccccc--------cccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC---CCcccccc-
Confidence            355678999999999999        666666666533   445666666666666666666666665   55555544 


Q ss_pred             cccccccCCCCCCCCCCCCCChhhHhhhccCCCCCCCCChhhhCCCcc-CCCCCccccccCCCCcccccCCCCCcc
Q psy8702          93 SIGSLKRDNLPPAPSPTGSQSSEDIEIERTPSEERSLEPSDMLGRRSR-SRTPSPPLRSHSPISIFRRHQSPSPIG  167 (194)
Q Consensus        93 ~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~~ekp~~C~~~Cgk~f~-~~~~~~hl~~H~~~~~~~~~~~~~~~~  167 (194)
                                    ||.|.+.=.|.-|+++|+|||||.|+. |+|.|. ..+|..||++|++++.|.+..+...|.
T Consensus       194 --------------GKaFSRPWLLQGHiRTHTGEKPF~C~h-C~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFs  254 (279)
T KOG2462|consen  194 --------------GKAFSRPWLLQGHIRTHTGEKPFSCPH-CGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFA  254 (279)
T ss_pred             --------------cccccchHHhhcccccccCCCCccCCc-ccchhcchHHHHHHHHhhcCCccccCcchhhHHH
Confidence                          466666666666666666666666664 666665 666666666666666666666555554


No 2  
>KOG2462|consensus
Probab=99.86  E-value=7.4e-23  Score=161.20  Aligned_cols=110  Identities=25%  Similarity=0.301  Sum_probs=100.8

Q ss_pred             CCCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCcccccccc
Q psy8702          18 SVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAESIGSL   97 (194)
Q Consensus        18 ~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~~~~~~   97 (194)
                      .....|++|||.|..-        ..|..|+++|+  -+++|.+|||.|....-|+.|+|+|+ |||||.|..|      
T Consensus       159 ~ka~~C~~C~K~YvSm--------pALkMHirTH~--l~c~C~iCGKaFSRPWLLQGHiRTHT-GEKPF~C~hC------  221 (279)
T KOG2462|consen  159 KKAFSCKYCGKVYVSM--------PALKMHIRTHT--LPCECGICGKAFSRPWLLQGHIRTHT-GEKPFSCPHC------  221 (279)
T ss_pred             cccccCCCCCceeeeh--------HHHhhHhhccC--CCcccccccccccchHHhhccccccc-CCCCccCCcc------
Confidence            4557799999999999        99999999997  47999999999999999999999999 9999999887      


Q ss_pred             ccCCCCCCCCCCCCCChhhHhhhccCCCCCCCCChhhhCCCccCCCCCccccccCCCCcc
Q psy8702          98 KRDNLPPAPSPTGSQSSEDIEIERTPSEERSLEPSDMLGRRSRSRTPSPPLRSHSPISIF  157 (194)
Q Consensus        98 ~~~~~~~~C~k~F~~~~~L~~H~~~H~~ekp~~C~~~Cgk~f~~~~~~~hl~~H~~~~~~  157 (194)
                               +|.|...++|+.||+||.+.|+|+|.. |||.   |.+.+.|.+|....-.
T Consensus       222 ---------~kAFADRSNLRAHmQTHS~~K~~qC~~-C~Ks---Fsl~SyLnKH~ES~C~  268 (279)
T KOG2462|consen  222 ---------GKAFADRSNLRAHMQTHSDVKKHQCPR-CGKS---FALKSYLNKHSESACL  268 (279)
T ss_pred             ---------cchhcchHHHHHHHHhhcCCccccCcc-hhhH---HHHHHHHHHhhhhccc
Confidence                     589999999999999999999999996 9999   8888889998765444


No 3  
>KOG1074|consensus
Probab=99.61  E-value=6.9e-16  Score=136.59  Aligned_cols=57  Identities=26%  Similarity=0.542  Sum_probs=51.8

Q ss_pred             CCCCCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchHHHHhhhhcC
Q psy8702          16 KDSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLRRHALTHA   80 (194)
Q Consensus        16 ~~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~h~   80 (194)
                      +....-+|-+|-++...+        +.|+.|.++|+||+||+|.+||+.|.++.+|+.|+.+|.
T Consensus       601 ~~TdPNqCiiC~rVlSC~--------saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHk  657 (958)
T KOG1074|consen  601 KRTDPNQCIICLRVLSCP--------SALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHK  657 (958)
T ss_pred             ccCCccceeeeeecccch--------hhhhhhhhcccCcCccccccccchhccccchhhcccccc
Confidence            344455799999999999        999999999999999999999999999999999998776


No 4  
>KOG1074|consensus
Probab=99.58  E-value=5.4e-16  Score=137.25  Aligned_cols=56  Identities=30%  Similarity=0.560  Sum_probs=50.9

Q ss_pred             CCCCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchHHHHhhhhcC
Q psy8702          17 DSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLRRHALTHA   80 (194)
Q Consensus        17 ~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~h~   80 (194)
                      ...+..|..|+|.|...        +.|+.|.+.|++++||+|.+||..|.++.+|+.|...|.
T Consensus       350 ~~~khkCr~CakvfgS~--------SaLqiHlRSHTGERPfqCnvCG~~FSTkGNLKvH~~rH~  405 (958)
T KOG1074|consen  350 PFFKHKCRFCAKVFGSD--------SALQIHLRSHTGERPFQCNVCGNRFSTKGNLKVHFQRHR  405 (958)
T ss_pred             ccccchhhhhHhhcCch--------hhhhhhhhccCCCCCeeecccccccccccceeeeeeecc
Confidence            33456799999999999        999999999999999999999999999999999987665


No 5  
>KOG3576|consensus
Probab=99.53  E-value=6e-16  Score=117.25  Aligned_cols=116  Identities=28%  Similarity=0.331  Sum_probs=96.2

Q ss_pred             CCCCCCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccccc
Q psy8702          15 AKDSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAESI   94 (194)
Q Consensus        15 ~~~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~~~   94 (194)
                      ..+...+.|.+|||.|...        ..|.+|++.|.+.+.+-|..||+.|...-.|++|+++|+ |.+||+|..|+  
T Consensus       112 ssd~d~ftCrvCgK~F~lQ--------Rmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rtht-gvrpykc~~c~--  180 (267)
T KOG3576|consen  112 SSDQDSFTCRVCGKKFGLQ--------RMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHT-GVRPYKCSLCE--  180 (267)
T ss_pred             CCCCCeeeeehhhhhhhHH--------HHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhcccc-Cccccchhhhh--
Confidence            3456778899999999999        999999999999999999999999999999999999999 99999999985  


Q ss_pred             cccccCCCCCCCCCCCCCChhhHhhhccCC-----------CCCCCCChhhhCCCcc-CCCCCccccccCCCC
Q psy8702          95 GSLKRDNLPPAPSPTGSQSSEDIEIERTPS-----------EERSLEPSDMLGRRSR-SRTPSPPLRSHSPIS  155 (194)
Q Consensus        95 ~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~-----------~ekp~~C~~~Cgk~f~-~~~~~~hl~~H~~~~  155 (194)
                                   |.|...-.|..|.+.-+           .+|.|.|++ ||..-. --....|++.|....
T Consensus       181 -------------kaftqrcsleshl~kvhgv~~~yaykerr~kl~vced-cg~t~~~~e~~~~h~~~~hp~S  239 (267)
T KOG3576|consen  181 -------------KAFTQRCSLESHLKKVHGVQHQYAYKERRAKLYVCED-CGYTSERPEVYYLHLKLHHPFS  239 (267)
T ss_pred             -------------HHHHhhccHHHHHHHHcCchHHHHHHHhhhheeeecc-cCCCCCChhHHHHHHHhcCCCC
Confidence                         79999999999975433           367899998 998822 222334566665433


No 6  
>KOG3623|consensus
Probab=99.45  E-value=3.6e-14  Score=124.31  Aligned_cols=122  Identities=20%  Similarity=0.165  Sum_probs=90.1

Q ss_pred             CCCCcccccccccccccccccccccchHHHHHhHcC--CCCCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccccc
Q psy8702          17 DSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHT--DHKPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAESI   94 (194)
Q Consensus        17 ~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~--~~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~~~   94 (194)
                      -+.-..|++|.+.+.+-        ..|+.|+..-+  .+-.|.|..|..+|....+|.+|+.+|..+.....--  . .
T Consensus       207 fsqlltcpycdrgykrl--------tslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sl--t-q  275 (1007)
T KOG3623|consen  207 FSQLLTCPYCDRGYKRL--------TSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISL--T-Q  275 (1007)
T ss_pred             hhhhhcchhHHHHHHHH--------HHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccc--c-c
Confidence            33445789999999998        88999987543  3556899999999999999999999987432221000  0 0


Q ss_pred             cccccCCCCCCCCCCCCCChhhHhhhccCCCCCCCCChhhhCCCccCCCCCccccccCC
Q psy8702          95 GSLKRDNLPPAPSPTGSQSSEDIEIERTPSEERSLEPSDMLGRRSRSRTPSPPLRSHSP  153 (194)
Q Consensus        95 ~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~~ekp~~C~~~Cgk~f~~~~~~~hl~~H~~  153 (194)
                      ....+...|.+|||.|..+..|++|+++|.|||||+|+. |+|+   |+.+.....|+.
T Consensus       276 sa~lRKFKCtECgKAFKfKHHLKEHlRIHSGEKPfeCpn-CkKR---FSHSGSySSHmS  330 (1007)
T KOG3623|consen  276 SALLRKFKCTECGKAFKFKHHLKEHLRIHSGEKPFECPN-CKKR---FSHSGSYSSHMS  330 (1007)
T ss_pred             hhhhccccccccchhhhhHHHHHhhheeecCCCCcCCcc-cccc---cccCCccccccc
Confidence            111223345666899999999999999999999999996 9999   555555555543


No 7  
>KOG3623|consensus
Probab=99.40  E-value=9.4e-14  Score=121.73  Aligned_cols=83  Identities=23%  Similarity=0.353  Sum_probs=77.7

Q ss_pred             CCCCCCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccccc
Q psy8702          15 AKDSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAESI   94 (194)
Q Consensus        15 ~~~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~~~   94 (194)
                      .-....|.|+.|+|+|...        ++|.+|.-.|+|.+||+|.+|.|.|..+..|..|+|+|. |||||.|+.|   
T Consensus       889 kte~gmyaCDqCDK~FqKq--------SSLaRHKYEHsGqRPyqC~iCkKAFKHKHHLtEHkRLHS-GEKPfQCdKC---  956 (1007)
T KOG3623|consen  889 KTEDGMYACDQCDKAFQKQ--------SSLARHKYEHSGQRPYQCIICKKAFKHKHHLTEHKRLHS-GEKPFQCDKC---  956 (1007)
T ss_pred             cCccccchHHHHHHHHHhh--------HHHHHhhhhhcCCCCcccchhhHhhhhhhhhhhhhhhcc-CCCcchhhhh---
Confidence            3455678999999999999        999999999999999999999999999999999999999 9999999988   


Q ss_pred             cccccCCCCCCCCCCCCCChhhHhhhc
Q psy8702          95 GSLKRDNLPPAPSPTGSQSSEDIEIER  121 (194)
Q Consensus        95 ~~~~~~~~~~~C~k~F~~~~~L~~H~~  121 (194)
                                  +|.|.....+.+||.
T Consensus       957 ------------lKRFSHSGSYSQHMN  971 (1007)
T KOG3623|consen  957 ------------LKRFSHSGSYSQHMN  971 (1007)
T ss_pred             ------------hhhcccccchHhhhc
Confidence                        589999999999985


No 8  
>KOG3608|consensus
Probab=99.33  E-value=1.2e-13  Score=112.52  Aligned_cols=128  Identities=20%  Similarity=0.257  Sum_probs=106.3

Q ss_pred             ccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchHHHHhhhhcC-CCCCCCCCCcccccccccc---
Q psy8702          24 SLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLRRHALTHA-VGDVPPESSLAESIGSLKR---   99 (194)
Q Consensus        24 ~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~h~-~~~k~~~c~~c~~~~~~~~---   99 (194)
                      ..|.+.+.++        +.|++|++.|+++|...|+.||..|.++..|-.|.+..+ ....+|.|..|.+.....+   
T Consensus       183 ~~Ct~~~~~k--------~~LreH~r~Hs~eKvvACp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~  254 (467)
T KOG3608|consen  183 AMCTKHMGNK--------YRLREHIRTHSNEKVVACPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLK  254 (467)
T ss_pred             hhhhhhhccH--------HHHHHHHHhcCCCeEEecchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHH
Confidence            8999999999        999999999999999999999999999999999977443 1456899998864333322   


Q ss_pred             --------CCCCCCCCCCCCCChhhHhhhcc-CCCCCCCCChhhhCCC-ccCCCCCccccccCCCCcccccC
Q psy8702         100 --------DNLPPAPSPTGSQSSEDIEIERT-PSEERSLEPSDMLGRR-SRSRTPSPPLRSHSPISIFRRHQ  161 (194)
Q Consensus       100 --------~~~~~~C~k~F~~~~~L~~H~~~-H~~ekp~~C~~~Cgk~-f~~~~~~~hl~~H~~~~~~~~~~  161 (194)
                              .-.|+.|+-+....++|..|+++ |..+|||+|.+ |+++ |+...|..|+.+|+ +..|++..
T Consensus       255 ~Hv~rHvn~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~-Cd~~c~~esdL~kH~~~HS-~~~y~C~h  324 (467)
T KOG3608|consen  255 SHVVRHVNCYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDE-CDTRCVRESDLAKHVQVHS-KTVYQCEH  324 (467)
T ss_pred             HHHHHhhhcccccccccCCCChHHHHHHHHhhhccCCCccccc-hhhhhccHHHHHHHHHhcc-ccceecCC
Confidence                    22388999999999999999874 88899999987 9999 55888889999998 45555544


No 9  
>KOG3608|consensus
Probab=99.30  E-value=2.8e-13  Score=110.37  Aligned_cols=142  Identities=17%  Similarity=0.135  Sum_probs=102.1

Q ss_pred             CCCCCCCccccccccccccccccccc------------------ccchHHHHHhH-cCCCCCccCCcCCCCcccchHHHH
Q psy8702          14 VAKDSVNPGLSLYTKSFTWKNLLTNV------------------LTSNLKTHLLT-HTDHKPYECNSCGKVFRRNCDLRR   74 (194)
Q Consensus        14 ~~~~~~~~~c~~Cgk~f~~~~~L~~H------------------~~s~L~~H~~~-h~~~k~~~C~~C~k~f~~~~~L~~   74 (194)
                      .....+.|.|..|.|.|.++..|.-|                  ..+.|.+|++. |...+||+|+.|++.+.+...|.+
T Consensus       231 t~l~~n~fqC~~C~KrFaTeklL~~Hv~rHvn~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~k  310 (467)
T KOG3608|consen  231 TELNTNSFQCAQCFKRFATEKLLKSHVVRHVNCYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAK  310 (467)
T ss_pred             hhhcCCchHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHHHH
Confidence            34445678888999999888777665                  35667777765 666777777777777777777777


Q ss_pred             hhhhcCCCCCCCCCCccccccccccCCCCCCCCCCCCCChhhHhhhccCC-CC--CCCCChhhhCCCcc-CCCCCccccc
Q psy8702          75 HALTHAVGDVPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPS-EE--RSLEPSDMLGRRSR-SRTPSPPLRS  150 (194)
Q Consensus        75 H~~~h~~~~k~~~c~~c~~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~-~e--kp~~C~~~Cgk~f~-~~~~~~hl~~  150 (194)
                      |..+|.  +-.|.|+.-             .|...|.....++.|++.+. |.  -+|.|. .|++.|. -++|..||.+
T Consensus       311 H~~~HS--~~~y~C~h~-------------~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH-~Cdr~ft~G~~L~~HL~k  374 (467)
T KOG3608|consen  311 HVQVHS--KTVYQCEHP-------------DCHYSVRTYTQMRRHFLEVHEGNNPILYACH-CCDRFFTSGKSLSAHLMK  374 (467)
T ss_pred             HHHhcc--ccceecCCC-------------CCcHHHHHHHHHHHHHHHhccCCCCCceeee-cchhhhccchhHHHHHHH
Confidence            777664  666777652             24568888999999986544 54  469998 5999955 4555555544


Q ss_pred             cCCCCcccccCCCCCccccccccCC
Q psy8702         151 HSPISIFRRHQSPSPIGIYRRHSQS  175 (194)
Q Consensus       151 H~~~~~~~~~~~~~~~~~~~~~~~~  175 (194)
                      -    --.+++++.+...|+.+-+.
T Consensus       375 k----H~f~~PsGh~RFtYk~~edG  395 (467)
T KOG3608|consen  375 K----HGFRLPSGHKRFTYKVDEDG  395 (467)
T ss_pred             h----hcccCCCCCCceeeeeccCc
Confidence            3    34678999999999988653


No 10 
>KOG3576|consensus
Probab=99.18  E-value=1.3e-12  Score=99.20  Aligned_cols=84  Identities=18%  Similarity=0.210  Sum_probs=74.6

Q ss_pred             CCCCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccccccccccCCCCCCCCCCCCCChhhHhhhccCCCCCCCCCh
Q psy8702          53 DHKPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPSEERSLEPS  132 (194)
Q Consensus        53 ~~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~~ekp~~C~  132 (194)
                      +...|.|.+|+|.|....-|++|+.-|. .-+.+-|..|               ||.|...-+|.+|+++|+|-+||+|.
T Consensus       114 d~d~ftCrvCgK~F~lQRmlnrh~kch~-~vkr~lct~c---------------gkgfndtfdlkrh~rthtgvrpykc~  177 (267)
T KOG3576|consen  114 DQDSFTCRVCGKKFGLQRMLNRHLKCHS-DVKRHLCTFC---------------GKGFNDTFDLKRHTRTHTGVRPYKCS  177 (267)
T ss_pred             CCCeeeeehhhhhhhHHHHHHHHhhhcc-HHHHHHHhhc---------------cCcccchhhhhhhhccccCccccchh
Confidence            4567999999999999999999999998 8888989877               68999999999999999999999998


Q ss_pred             hhhCCCcc-CCCCCccccccCC
Q psy8702         133 DMLGRRSR-SRTPSPPLRSHSP  153 (194)
Q Consensus       133 ~~Cgk~f~-~~~~~~hl~~H~~  153 (194)
                       .|+|.|. ..+|..||+.=.|
T Consensus       178 -~c~kaftqrcsleshl~kvhg  198 (267)
T KOG3576|consen  178 -LCEKAFTQRCSLESHLKKVHG  198 (267)
T ss_pred             -hhhHHHHhhccHHHHHHHHcC
Confidence             4999987 8888888865443


No 11 
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.83  E-value=2.9e-09  Score=93.40  Aligned_cols=91  Identities=14%  Similarity=0.193  Sum_probs=73.9

Q ss_pred             CCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccccccccc
Q psy8702          19 VNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAESIGSLK   98 (194)
Q Consensus        19 ~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~~~~~~~   98 (194)
                      ..+.|+.||+.|. .        ..|..|+.+++  +++.|+ ||+.+ ....|..|+.+|. +++++.|.+|+      
T Consensus       452 ~H~~C~~Cgk~f~-~--------s~LekH~~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~thC-p~Kpi~C~fC~------  511 (567)
T PLN03086        452 NHVHCEKCGQAFQ-Q--------GEMEKHMKVFH--EPLQCP-CGVVL-EKEQMVQHQASTC-PLRLITCRFCG------  511 (567)
T ss_pred             cCccCCCCCCccc-h--------HHHHHHHHhcC--CCccCC-CCCCc-chhHHHhhhhccC-CCCceeCCCCC------
Confidence            3457899999886 4        67899999875  789999 99755 6689999999999 99999999885      


Q ss_pred             cCCCCCCCCCCCC----------CChhhHhhhccCCCCCCCCChhhhCCCcc
Q psy8702          99 RDNLPPAPSPTGS----------QSSEDIEIERTPSEERSLEPSDMLGRRSR  140 (194)
Q Consensus        99 ~~~~~~~C~k~F~----------~~~~L~~H~~~H~~ekp~~C~~~Cgk~f~  140 (194)
                               +.|.          ..+.|..|+.+. |.+++.|.. ||+.+.
T Consensus       512 ---------~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~-Cgk~Vr  552 (567)
T PLN03086        512 ---------DMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDS-CGRSVM  552 (567)
T ss_pred             ---------CccccCccccchhhhhhhHHHHHHhc-CCcceEccc-cCCeee
Confidence                     4553          235789999885 999999975 999944


No 12 
>PHA02768 hypothetical protein; Provisional
Probab=98.76  E-value=3.8e-09  Score=64.28  Aligned_cols=43  Identities=14%  Similarity=0.260  Sum_probs=38.4

Q ss_pred             cccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchHHH
Q psy8702          21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLR   73 (194)
Q Consensus        21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~   73 (194)
                      |.|+.||+.|...        ++|..|+++|+  ++++|..|++.|.....|.
T Consensus         6 y~C~~CGK~Fs~~--------~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~   48 (55)
T PHA02768          6 YECPICGEIYIKR--------KSMITHLRKHN--TNLKLSNCKRISLRTGEYI   48 (55)
T ss_pred             cCcchhCCeeccH--------HHHHHHHHhcC--CcccCCcccceecccceeE
Confidence            6899999999999        99999999998  6899999999999776653


No 13 
>PHA00733 hypothetical protein
Probab=98.64  E-value=2e-08  Score=72.63  Aligned_cols=89  Identities=13%  Similarity=0.141  Sum_probs=69.4

Q ss_pred             CCCCCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCcccccc
Q psy8702          16 KDSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAESIG   95 (194)
Q Consensus        16 ~~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~~~~   95 (194)
                      .......|.+|++.|.....|..+  +.|..|+..+ ++++|.|+.|++.|.....|..|++.+.   .++.|..|    
T Consensus        36 ~~~~~~~~~~~~~~~~~~~~l~~~--~~l~~~~~~~-~~kPy~C~~Cgk~Fss~s~L~~H~r~h~---~~~~C~~C----  105 (128)
T PHA00733         36 PEQKRLIRAVVKTLIYNPQLLDES--SYLYKLLTSK-AVSPYVCPLCLMPFSSSVSLKQHIRYTE---HSKVCPVC----  105 (128)
T ss_pred             hhhhhHHHHHHhhhccChhhhcch--HHHHhhcccC-CCCCccCCCCCCcCCCHHHHHHHHhcCC---cCccCCCC----
Confidence            344556789999999998666642  2255555444 4789999999999999999999998764   46788766    


Q ss_pred             ccccCCCCCCCCCCCCCChhhHhhhccCCC
Q psy8702          96 SLKRDNLPPAPSPTGSQSSEDIEIERTPSE  125 (194)
Q Consensus        96 ~~~~~~~~~~C~k~F~~~~~L~~H~~~H~~  125 (194)
                                 ++.|.....|..|+...++
T Consensus       106 -----------gK~F~~~~sL~~H~~~~h~  124 (128)
T PHA00733        106 -----------GKEFRNTDSTLDHVCKKHN  124 (128)
T ss_pred             -----------CCccCCHHHHHHHHHHhcC
Confidence                       5899999999999876655


No 14 
>PHA00733 hypothetical protein
Probab=98.52  E-value=1.9e-08  Score=72.76  Aligned_cols=79  Identities=15%  Similarity=-0.038  Sum_probs=62.3

Q ss_pred             CCCCccCCcCCCCcccchHHHHh--hh---hcCCCCCCCCCCccccccccccCCCCCCCCCCCCCChhhHhhhccCCCCC
Q psy8702          53 DHKPYECNSCGKVFRRNCDLRRH--AL---THAVGDVPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPSEER  127 (194)
Q Consensus        53 ~~k~~~C~~C~k~f~~~~~L~~H--~~---~h~~~~k~~~c~~c~~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~~ek  127 (194)
                      ..+++.|.+|.+.|.....|..+  +.   .+. +++||.|+.|               |+.|.....|..|++.+  +.
T Consensus        37 ~~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~-~~kPy~C~~C---------------gk~Fss~s~L~~H~r~h--~~   98 (128)
T PHA00733         37 EQKRLIRAVVKTLIYNPQLLDESSYLYKLLTSK-AVSPYVCPLC---------------LMPFSSSVSLKQHIRYT--EH   98 (128)
T ss_pred             hhhhHHHHHHhhhccChhhhcchHHHHhhcccC-CCCCccCCCC---------------CCcCCCHHHHHHHHhcC--Cc
Confidence            35789999999988887766665  21   223 5789999877               58999999999999987  46


Q ss_pred             CCCChhhhCCCccCCCCCccccccCC
Q psy8702         128 SLEPSDMLGRRSRSRTPSPPLRSHSP  153 (194)
Q Consensus       128 p~~C~~~Cgk~f~~~~~~~hl~~H~~  153 (194)
                      +|.|.. |++.   |....+|..|..
T Consensus        99 ~~~C~~-CgK~---F~~~~sL~~H~~  120 (128)
T PHA00733         99 SKVCPV-CGKE---FRNTDSTLDHVC  120 (128)
T ss_pred             CccCCC-CCCc---cCCHHHHHHHHH
Confidence            899995 9999   777777777754


No 15 
>PF13465 zf-H2C2_2:  Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.45  E-value=8.6e-08  Score=49.83  Aligned_cols=26  Identities=54%  Similarity=1.080  Sum_probs=23.4

Q ss_pred             hHHHHHhHcCCCCCccCCcCCCCccc
Q psy8702          43 NLKTHLLTHTDHKPYECNSCGKVFRR   68 (194)
Q Consensus        43 ~L~~H~~~h~~~k~~~C~~C~k~f~~   68 (194)
                      +|.+|+++|++++||.|+.|++.|.+
T Consensus         1 ~l~~H~~~H~~~k~~~C~~C~k~F~~   26 (26)
T PF13465_consen    1 NLRRHMRTHTGEKPYKCPYCGKSFSN   26 (26)
T ss_dssp             HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred             CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence            47899999999999999999999863


No 16 
>PHA02768 hypothetical protein; Provisional
Probab=98.44  E-value=9.5e-08  Score=58.19  Aligned_cols=41  Identities=15%  Similarity=-0.013  Sum_probs=33.5

Q ss_pred             CCCCCCCCCCCChhhHhhhccCCCCCCCCChhhhCCCccCCCCCccc
Q psy8702         102 LPPAPSPTGSQSSEDIEIERTPSEERSLEPSDMLGRRSRSRTPSPPL  148 (194)
Q Consensus       102 ~~~~C~k~F~~~~~L~~H~~~H~~ekp~~C~~~Cgk~f~~~~~~~hl  148 (194)
                      .|+.||+.|...+.|..|+++|+  ++|.|.. |++.   |...+.|
T Consensus         7 ~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~-C~k~---f~~~s~l   47 (55)
T PHA02768          7 ECPICGEIYIKRKSMITHLRKHN--TNLKLSN-CKRI---SLRTGEY   47 (55)
T ss_pred             CcchhCCeeccHHHHHHHHHhcC--CcccCCc-ccce---eccccee
Confidence            35566899999999999999999  7999996 9999   5444444


No 17 
>PF13465 zf-H2C2_2:  Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.41  E-value=2.9e-08  Score=51.61  Aligned_cols=25  Identities=16%  Similarity=0.062  Sum_probs=22.4

Q ss_pred             hhHhhhccCCCCCCCCChhhhCCCcc
Q psy8702         115 EDIEIERTPSEERSLEPSDMLGRRSR  140 (194)
Q Consensus       115 ~L~~H~~~H~~ekp~~C~~~Cgk~f~  140 (194)
                      +|..|+++|+|++||.|+. |+++|.
T Consensus         1 ~l~~H~~~H~~~k~~~C~~-C~k~F~   25 (26)
T PF13465_consen    1 NLRRHMRTHTGEKPYKCPY-CGKSFS   25 (26)
T ss_dssp             HHHHHHHHHSSSSSEEESS-SSEEES
T ss_pred             CHHHHhhhcCCCCCCCCCC-CcCeeC
Confidence            5889999999999999996 999953


No 18 
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.39  E-value=3.9e-07  Score=80.26  Aligned_cols=89  Identities=11%  Similarity=0.150  Sum_probs=67.9

Q ss_pred             CCCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccccccccccCCCCCCCCCCCCCChhhHhhhccCCCCCCCCChh
Q psy8702          54 HKPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPSEERSLEPSD  133 (194)
Q Consensus        54 ~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~~ekp~~C~~  133 (194)
                      ++.+.|+.|++.|. ...|..|+.+++   +++.|+ |+               +.+ ....|..|+.+|.+++++.|+.
T Consensus       451 ~~H~~C~~Cgk~f~-~s~LekH~~~~H---kpv~Cp-Cg---------------~~~-~R~~L~~H~~thCp~Kpi~C~f  509 (567)
T PLN03086        451 KNHVHCEKCGQAFQ-QGEMEKHMKVFH---EPLQCP-CG---------------VVL-EKEQMVQHQASTCPLRLITCRF  509 (567)
T ss_pred             ccCccCCCCCCccc-hHHHHHHHHhcC---CCccCC-CC---------------CCc-chhHHHhhhhccCCCCceeCCC
Confidence            45678999999996 678999999876   789998 74               444 6799999999999999999995


Q ss_pred             hhCCCccC-----------CCCCccccccCCCCcccccCCCCC
Q psy8702         134 MLGRRSRS-----------RTPSPPLRSHSPISIFRRHQSPSP  165 (194)
Q Consensus       134 ~Cgk~f~~-----------~~~~~hl~~H~~~~~~~~~~~~~~  165 (194)
                       |++.|..           ..+..|.... |.+.+.+..++..
T Consensus       510 -C~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk~  550 (567)
T PLN03086        510 -CGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDSCGRS  550 (567)
T ss_pred             -CCCccccCccccchhhhhhhHHHHHHhc-CCcceEccccCCe
Confidence             9999531           1344455553 6777777776543


No 19 
>KOG3993|consensus
Probab=97.98  E-value=1e-06  Score=73.93  Aligned_cols=53  Identities=25%  Similarity=0.338  Sum_probs=46.1

Q ss_pred             CcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchHHHHhhhhcC
Q psy8702          20 NPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLRRHALTHA   80 (194)
Q Consensus        20 ~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~h~   80 (194)
                      .|.|..|...|.+.        ..|.+|...-.....|+|++|+|.|+-..+|..|.++|.
T Consensus       267 dyiCqLCK~kYeD~--------F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHK  319 (500)
T KOG3993|consen  267 DYICQLCKEKYEDA--------FALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHK  319 (500)
T ss_pred             HHHHHHHHHhhhhH--------HHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccC
Confidence            47789999999999        999999854444455999999999999999999999997


No 20 
>PHA00616 hypothetical protein
Probab=97.93  E-value=4.1e-06  Score=48.66  Aligned_cols=32  Identities=16%  Similarity=0.076  Sum_probs=16.4

Q ss_pred             cccccccccccccccccccccchHHHHHhHcCCCCCccCC
Q psy8702          21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECN   60 (194)
Q Consensus        21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~   60 (194)
                      |.|+.||+.|..+        +.|..|++.|++++++.|+
T Consensus         2 YqC~~CG~~F~~~--------s~l~~H~r~~hg~~~~~~~   33 (44)
T PHA00616          2 YQCLRCGGIFRKK--------KEVIEHLLSVHKQNKLTLE   33 (44)
T ss_pred             CccchhhHHHhhH--------HHHHHHHHHhcCCCcccee
Confidence            3455555555555        5555555555555554443


No 21 
>PHA00616 hypothetical protein
Probab=97.91  E-value=3.9e-06  Score=48.75  Aligned_cols=34  Identities=24%  Similarity=0.497  Sum_probs=32.0

Q ss_pred             CccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCc
Q psy8702          56 PYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSL   90 (194)
Q Consensus        56 ~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~   90 (194)
                      ||+|..||+.|.....|..|++.|+ |++++.|+.
T Consensus         1 pYqC~~CG~~F~~~s~l~~H~r~~h-g~~~~~~~~   34 (44)
T PHA00616          1 MYQCLRCGGIFRKKKEVIEHLLSVH-KQNKLTLEY   34 (44)
T ss_pred             CCccchhhHHHhhHHHHHHHHHHhc-CCCccceeE
Confidence            6899999999999999999999999 999999864


No 22 
>KOG3993|consensus
Probab=97.71  E-value=1.7e-05  Score=66.73  Aligned_cols=124  Identities=26%  Similarity=0.320  Sum_probs=80.9

Q ss_pred             CCCcccccccccccccccccccccchHHHHHhHc---------------------------------CCCCCccCCcCCC
Q psy8702          18 SVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTH---------------------------------TDHKPYECNSCGK   64 (194)
Q Consensus        18 ~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h---------------------------------~~~k~~~C~~C~k   64 (194)
                      ...|.|++|+|.|.-.        .+|..|.|-|                                 ..+..|.|..|+|
T Consensus       293 ~vEYrCPEC~KVFsCP--------ANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea~rsg~dss~gi~~C~~C~K  364 (500)
T KOG3993|consen  293 HVEYRCPECDKVFSCP--------ANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEAERSGDDSSSGIFSCHTCGK  364 (500)
T ss_pred             EeeecCCcccccccCc--------hhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhccccCCcccCceeecHHhhh
Confidence            3568899999999999        5555554432                                 1123589999999


Q ss_pred             CcccchHHHHhhhhcCCCCCCC----------------CCCcccccccc--------------ccCCCCCCCCCCCCCCh
Q psy8702          65 VFRRNCDLRRHALTHAVGDVPP----------------ESSLAESIGSL--------------KRDNLPPAPSPTGSQSS  114 (194)
Q Consensus        65 ~f~~~~~L~~H~~~h~~~~k~~----------------~c~~c~~~~~~--------------~~~~~~~~C~k~F~~~~  114 (194)
                      .|.+...|+.|+.+|......-                -+..+..-...              ....+|+.|+..+..+.
T Consensus       365 kFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~vl~~a~sael~~pp~~~~ppsss~  444 (500)
T KOG3993|consen  365 KFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEVLYVAGSAELELPPYDGSPPSSSG  444 (500)
T ss_pred             hhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccceeeeeccccccCCCCCCCCcccCC
Confidence            9999999999988876222111                11111100000              01112567777777776


Q ss_pred             hhHhhhccCCCCCCCCChhhhCCCccCCCCCccccccCC
Q psy8702         115 EDIEIERTPSEERSLEPSDMLGRRSRSRTPSPPLRSHSP  153 (194)
Q Consensus       115 ~L~~H~~~H~~ekp~~C~~~Cgk~f~~~~~~~hl~~H~~  153 (194)
                      .---+.+.-..++-|.|. +|.-.   |..+.+|.+|..
T Consensus       445 ~sgg~~rlg~~~q~f~~k-y~~at---fyss~~ltrhin  479 (500)
T KOG3993|consen  445 SSGGYGRLGIAEQGFTCK-YCPAT---FYSSPGLTRHIN  479 (500)
T ss_pred             CCCccccccchhhccccc-cchHh---hhcCcchHhHhh
Confidence            666666666667789998 49888   777778887754


No 23 
>PHA00732 hypothetical protein
Probab=97.65  E-value=3.3e-05  Score=51.09  Aligned_cols=46  Identities=24%  Similarity=0.285  Sum_probs=37.8

Q ss_pred             cccccccccccccccccccccchHHHHHhH-cCCCCCccCCcCCCCcccchHHHHhhhhcC
Q psy8702          21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLT-HTDHKPYECNSCGKVFRRNCDLRRHALTHA   80 (194)
Q Consensus        21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~-h~~~k~~~C~~C~k~f~~~~~L~~H~~~h~   80 (194)
                      +.|+.||+.|...        ..|..|++. |.+   +.|+.|++.|.   .|..|.....
T Consensus         2 y~C~~Cgk~F~s~--------s~Lk~H~r~~H~~---~~C~~CgKsF~---~l~~H~~~~~   48 (79)
T PHA00732          2 FKCPICGFTTVTL--------FALKQHARRNHTL---TKCPVCNKSYR---RLNQHFYSQY   48 (79)
T ss_pred             ccCCCCCCccCCH--------HHHHHHhhcccCC---CccCCCCCEeC---ChhhhhcccC
Confidence            5799999999999        999999984 654   58999999998   4777875543


No 24 
>PHA00732 hypothetical protein
Probab=97.45  E-value=0.00013  Score=48.20  Aligned_cols=47  Identities=15%  Similarity=0.171  Sum_probs=36.7

Q ss_pred             CccCCcCCCCcccchHHHHhhhh-cCCCCCCCCCCccccccccccCCCCCCCCCCCCCChhhHhhhccCC
Q psy8702          56 PYECNSCGKVFRRNCDLRRHALT-HAVGDVPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPS  124 (194)
Q Consensus        56 ~~~C~~C~k~f~~~~~L~~H~~~-h~~~~k~~~c~~c~~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~  124 (194)
                      ||.|..|++.|.....|..|++. |. +   +.|+.|               |+.|.   .|..|..+..
T Consensus         1 py~C~~Cgk~F~s~s~Lk~H~r~~H~-~---~~C~~C---------------gKsF~---~l~~H~~~~~   48 (79)
T PHA00732          1 MFKCPICGFTTVTLFALKQHARRNHT-L---TKCPVC---------------NKSYR---RLNQHFYSQY   48 (79)
T ss_pred             CccCCCCCCccCCHHHHHHHhhcccC-C---CccCCC---------------CCEeC---ChhhhhcccC
Confidence            58899999999999999999985 54 3   456555               67887   5888885543


No 25 
>PF00096 zf-C2H2:  Zinc finger, C2H2 type;  InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain.  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.22  E-value=0.00012  Score=36.39  Aligned_cols=20  Identities=40%  Similarity=0.908  Sum_probs=9.1

Q ss_pred             cCCcCCCCcccchHHHHhhh
Q psy8702          58 ECNSCGKVFRRNCDLRRHAL   77 (194)
Q Consensus        58 ~C~~C~k~f~~~~~L~~H~~   77 (194)
                      .|+.|++.|.....|..|++
T Consensus         2 ~C~~C~~~f~~~~~l~~H~~   21 (23)
T PF00096_consen    2 KCPICGKSFSSKSNLKRHMR   21 (23)
T ss_dssp             EETTTTEEESSHHHHHHHHH
T ss_pred             CCCCCCCccCCHHHHHHHHh
Confidence            34444444444444444443


No 26 
>PF00096 zf-C2H2:  Zinc finger, C2H2 type;  InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain.  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.19  E-value=0.00013  Score=36.33  Aligned_cols=23  Identities=43%  Similarity=0.509  Sum_probs=20.4

Q ss_pred             cccccccccccccccccccccchHHHHHhHc
Q psy8702          21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLTH   51 (194)
Q Consensus        21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h   51 (194)
                      |.|+.|++.|..+        ..|..|++.|
T Consensus         1 y~C~~C~~~f~~~--------~~l~~H~~~H   23 (23)
T PF00096_consen    1 YKCPICGKSFSSK--------SNLKRHMRRH   23 (23)
T ss_dssp             EEETTTTEEESSH--------HHHHHHHHHH
T ss_pred             CCCCCCCCccCCH--------HHHHHHHhHC
Confidence            5799999999999        9999998764


No 27 
>PF05605 zf-Di19:  Drought induced 19 protein (Di19), zinc-binding;  InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.06  E-value=0.00082  Score=41.03  Aligned_cols=51  Identities=24%  Similarity=0.252  Sum_probs=38.4

Q ss_pred             CCcccccccccccccccccccccchHHHHHhH-cCC-CCCccCCcCCCCcccchHHHHhhhhcC
Q psy8702          19 VNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLT-HTD-HKPYECNSCGKVFRRNCDLRRHALTHA   80 (194)
Q Consensus        19 ~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~-h~~-~k~~~C~~C~k~f~~~~~L~~H~~~h~   80 (194)
                      +.|.|+.||+.|+.         ..|..|... |.. .+.+.|++|...+.  .+|..|+..++
T Consensus         1 ~~f~CP~C~~~~~~---------~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H   53 (54)
T PF05605_consen    1 DSFTCPYCGKGFSE---------SSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH   53 (54)
T ss_pred             CCcCCCCCCCccCH---------HHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence            35789999996554         789999765 554 35799999998644  48888987654


No 28 
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=96.80  E-value=0.00072  Score=55.31  Aligned_cols=55  Identities=7%  Similarity=-0.001  Sum_probs=41.0

Q ss_pred             CCCCCCCCccccccccccCCCCCCCCCCCCCChhhHhhhcc-C------------------CCCCCCCChhhhCCCccCC
Q psy8702          82 GDVPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERT-P------------------SEERSLEPSDMLGRRSRSR  142 (194)
Q Consensus        82 ~~k~~~c~~c~~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~-H------------------~~ekp~~C~~~Cgk~f~~~  142 (194)
                      +++||+|++-+             |.|.+.+..-|+.|+.- |                  ..+|||.|+ +|+|+   +
T Consensus       346 d~KpykCpV~g-------------C~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCe-vC~KR---Y  408 (423)
T COG5189         346 DGKPYKCPVEG-------------CNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCE-VCDKR---Y  408 (423)
T ss_pred             cCceecCCCCC-------------chhhhccccchhhhhhccccCcccCCCCCccccccccccCCceecc-ccchh---h
Confidence            56999998754             57899999999988742 2                  246999998 59999   6


Q ss_pred             CCCccccccCC
Q psy8702         143 TPSPPLRSHSP  153 (194)
Q Consensus       143 ~~~~hl~~H~~  153 (194)
                      .-..-|+-|.+
T Consensus       409 KNlNGLKYHr~  419 (423)
T COG5189         409 KNLNGLKYHRK  419 (423)
T ss_pred             ccCccceeccc
Confidence            55556666643


No 29 
>PF13894 zf-C2H2_4:  C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=96.68  E-value=0.00092  Score=33.08  Aligned_cols=23  Identities=48%  Similarity=1.054  Sum_probs=11.2

Q ss_pred             ccCCcCCCCcccchHHHHhhhhc
Q psy8702          57 YECNSCGKVFRRNCDLRRHALTH   79 (194)
Q Consensus        57 ~~C~~C~k~f~~~~~L~~H~~~h   79 (194)
                      |.|++|++.|.+...|..|+..+
T Consensus         1 ~~C~~C~~~~~~~~~l~~H~~~~   23 (24)
T PF13894_consen    1 FQCPICGKSFRSKSELRQHMRTH   23 (24)
T ss_dssp             EE-SSTS-EESSHHHHHHHHHHH
T ss_pred             CCCcCCCCcCCcHHHHHHHHHhh
Confidence            34555555555555555555443


No 30 
>PF13912 zf-C2H2_6:  C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.60  E-value=0.00092  Score=34.53  Aligned_cols=25  Identities=44%  Similarity=1.004  Sum_probs=19.2

Q ss_pred             CccCCcCCCCcccchHHHHhhhhcC
Q psy8702          56 PYECNSCGKVFRRNCDLRRHALTHA   80 (194)
Q Consensus        56 ~~~C~~C~k~f~~~~~L~~H~~~h~   80 (194)
                      +|.|..|++.|.+...|..|++.|.
T Consensus         1 ~~~C~~C~~~F~~~~~l~~H~~~h~   25 (27)
T PF13912_consen    1 PFECDECGKTFSSLSALREHKRSHC   25 (27)
T ss_dssp             SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred             CCCCCccCCccCChhHHHHHhHHhc
Confidence            4678888888888888888877664


No 31 
>PF13894 zf-C2H2_4:  C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=96.48  E-value=0.0012  Score=32.58  Aligned_cols=23  Identities=39%  Similarity=0.469  Sum_probs=18.6

Q ss_pred             cccccccccccccccccccccchHHHHHhHc
Q psy8702          21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLTH   51 (194)
Q Consensus        21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h   51 (194)
                      |.|+.|++.|.+.        ..|..|+.++
T Consensus         1 ~~C~~C~~~~~~~--------~~l~~H~~~~   23 (24)
T PF13894_consen    1 FQCPICGKSFRSK--------SELRQHMRTH   23 (24)
T ss_dssp             EE-SSTS-EESSH--------HHHHHHHHHH
T ss_pred             CCCcCCCCcCCcH--------HHHHHHHHhh
Confidence            4699999999999        9999999875


No 32 
>PF05605 zf-Di19:  Drought induced 19 protein (Di19), zinc-binding;  InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=96.45  E-value=0.0044  Score=37.75  Aligned_cols=51  Identities=16%  Similarity=0.216  Sum_probs=36.6

Q ss_pred             CccCCcCCCCcccchHHHHhhhh-cCCCCCCCCCCccccccccccCCCCCCCCCCCCCChhhHhhhccCC
Q psy8702          56 PYECNSCGKVFRRNCDLRRHALT-HAVGDVPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPS  124 (194)
Q Consensus        56 ~~~C~~C~k~f~~~~~L~~H~~~-h~~~~k~~~c~~c~~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~  124 (194)
                      .|.|+.|++ ......|..|... |..+.+.+.|++|.               ..+  ..+|..|+..++
T Consensus         2 ~f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~---------------~~~--~~~l~~Hl~~~H   53 (54)
T PF05605_consen    2 SFTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICS---------------SRV--TDNLIRHLNSQH   53 (54)
T ss_pred             CcCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCch---------------hhh--hhHHHHHHHHhc
Confidence            489999999 5556899999765 55234578888885               333  358888987654


No 33 
>PF12756 zf-C2H2_2:  C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=96.00  E-value=0.0039  Score=42.20  Aligned_cols=57  Identities=21%  Similarity=0.340  Sum_probs=21.4

Q ss_pred             cccccccccccccccccccc----------------chHHHHHhHcCCCCCccCCcCCCCcccchHHHHhhhhc
Q psy8702          22 GLSLYTKSFTWKNLLTNVLT----------------SNLKTHLLTHTDHKPYECNSCGKVFRRNCDLRRHALTH   79 (194)
Q Consensus        22 ~c~~Cgk~f~~~~~L~~H~~----------------s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~h   79 (194)
                      +|..|+..|.....|..|..                ..+....+. .....+.|..|++.|.+...|..|++.+
T Consensus         1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~C~~C~~~f~s~~~l~~Hm~~~   73 (100)
T PF12756_consen    1 QCLFCDESFSSVDDLLQHMKKKHGFDIPDQKYLVDPNRLLNYLRK-KVKESFRCPYCNKTFRSREALQEHMRSK   73 (100)
T ss_dssp             -------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred             Ccccccccccccccccccccccccccccccccccccccccccccc-ccCCCCCCCccCCCCcCHHHHHHHHcCc
Confidence            48899999999977777621                111111111 0113589999999999999999999864


No 34 
>PF13912 zf-C2H2_6:  C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=95.89  E-value=0.0036  Score=32.20  Aligned_cols=24  Identities=25%  Similarity=0.210  Sum_probs=21.4

Q ss_pred             cccccccccccccccccccccchHHHHHhHcC
Q psy8702          21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHT   52 (194)
Q Consensus        21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~   52 (194)
                      +.|+.|++.|...        ..|..|++.|.
T Consensus         2 ~~C~~C~~~F~~~--------~~l~~H~~~h~   25 (27)
T PF13912_consen    2 FECDECGKTFSSL--------SALREHKRSHC   25 (27)
T ss_dssp             EEETTTTEEESSH--------HHHHHHHCTTT
T ss_pred             CCCCccCCccCCh--------hHHHHHhHHhc
Confidence            6799999999999        99999998764


No 35 
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=95.86  E-value=0.0074  Score=50.99  Aligned_cols=139  Identities=22%  Similarity=0.204  Sum_probs=92.6

Q ss_pred             CCcccccccccccccccccccccchHHHHHh--HcCCC--CCccCC--cCCCCcccchHHHHhhhhcCCCCCCCCCCccc
Q psy8702          19 VNPGLSLYTKSFTWKNLLTNVLTSNLKTHLL--THTDH--KPYECN--SCGKVFRRNCDLRRHALTHAVGDVPPESSLAE   92 (194)
Q Consensus        19 ~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~--~h~~~--k~~~C~--~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~   92 (194)
                      ..+.|..|...|...        ..|..|.+  .|.++  +++.|.  .|++.|.+...+..|..+|. +..++.+....
T Consensus       288 ~~~~~~~~~~~~s~~--------~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~  358 (467)
T COG5048         288 LPIKSKQCNISFSRS--------SPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHT-SISPAKEKLLN  358 (467)
T ss_pred             cCCCCccccCCcccc--------ccccccccccccccccCCceeeeccCCCccccccccccCCccccc-CCCcccccccc
Confidence            356789999999999        88888888  79999  999999  79999999999999999998 76666665332


Q ss_pred             cccccccCCC----------------------CCCCCCCCCCChhhHhhhccCCCCCCCCChh-hhCCCccCCCCCcccc
Q psy8702          93 SIGSLKRDNL----------------------PPAPSPTGSQSSEDIEIERTPSEERSLEPSD-MLGRRSRSRTPSPPLR  149 (194)
Q Consensus        93 ~~~~~~~~~~----------------------~~~C~k~F~~~~~L~~H~~~H~~ekp~~C~~-~Cgk~f~~~~~~~hl~  149 (194)
                      ..........                      -..|-..+.....+..|...+....++.|.. .|++.   +.....+.
T Consensus       359 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~  435 (467)
T COG5048         359 SSSKFSPLLNNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKS---FNRHYNLI  435 (467)
T ss_pred             CccccccccCCCCccchhhccCccCCccccccccchhhhhccccccccccccccccCCcCCCCCcchhh---ccCccccc
Confidence            1111111000                      1146677777788888888888777554432 47777   44444444


Q ss_pred             ccCCCCcccccCCCCCcccc
Q psy8702         150 SHSPISIFRRHQSPSPIGIY  169 (194)
Q Consensus       150 ~H~~~~~~~~~~~~~~~~~~  169 (194)
                      .|+....-.........+.|
T Consensus       436 ~~~~~~~~~~~~~~~~~~~~  455 (467)
T COG5048         436 PHKKIHTNHAPLLCSILKSF  455 (467)
T ss_pred             ccccccccCCceeecccccc
Confidence            44444433333333333333


No 36 
>smart00355 ZnF_C2H2 zinc finger.
Probab=95.76  E-value=0.0068  Score=30.20  Aligned_cols=24  Identities=46%  Similarity=0.998  Sum_probs=16.2

Q ss_pred             ccCCcCCCCcccchHHHHhhhhcC
Q psy8702          57 YECNSCGKVFRRNCDLRRHALTHA   80 (194)
Q Consensus        57 ~~C~~C~k~f~~~~~L~~H~~~h~   80 (194)
                      |.|..|++.|.....|..|++.|.
T Consensus         1 ~~C~~C~~~f~~~~~l~~H~~~H~   24 (26)
T smart00355        1 YRCPECGKVFKSKSALKEHMRTHX   24 (26)
T ss_pred             CCCCCCcchhCCHHHHHHHHHHhc
Confidence            456777777777777777766553


No 37 
>smart00355 ZnF_C2H2 zinc finger.
Probab=95.58  E-value=0.0084  Score=29.84  Aligned_cols=24  Identities=38%  Similarity=0.281  Sum_probs=20.8

Q ss_pred             cccccccccccccccccccccchHHHHHhHcC
Q psy8702          21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHT   52 (194)
Q Consensus        21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~   52 (194)
                      +.|+.|++.|...        ..|..|++.|.
T Consensus         1 ~~C~~C~~~f~~~--------~~l~~H~~~H~   24 (26)
T smart00355        1 YRCPECGKVFKSK--------SALKEHMRTHX   24 (26)
T ss_pred             CCCCCCcchhCCH--------HHHHHHHHHhc
Confidence            4699999999999        99999988664


No 38 
>PF09237 GAGA:  GAGA factor;  InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=95.50  E-value=0.01  Score=35.43  Aligned_cols=33  Identities=33%  Similarity=0.622  Sum_probs=20.4

Q ss_pred             CCCCCccCCcCCCCcccchHHHHhhhhcCCCCCC
Q psy8702          52 TDHKPYECNSCGKVFRRNCDLRRHALTHAVGDVP   85 (194)
Q Consensus        52 ~~~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~   85 (194)
                      ..+.|..|++|+..+....+|++|+.+.+ +.+|
T Consensus        20 ~S~~PatCP~C~a~~~~srnLrRHle~~H-~~k~   52 (54)
T PF09237_consen   20 QSEQPATCPICGAVIRQSRNLRRHLEIRH-FKKP   52 (54)
T ss_dssp             TTS--EE-TTT--EESSHHHHHHHHHHHT-TTS-
T ss_pred             ccCCCCCCCcchhhccchhhHHHHHHHHh-cccC
Confidence            34678888888888888888888887666 5554


No 39 
>PRK04860 hypothetical protein; Provisional
Probab=95.38  E-value=0.011  Score=44.37  Aligned_cols=39  Identities=13%  Similarity=0.270  Sum_probs=28.5

Q ss_pred             CCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccc
Q psy8702          19 VNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRN   69 (194)
Q Consensus        19 ~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~   69 (194)
                      -.|.|. |++   ..        ..+.+|.++++++++|.|..|+..|...
T Consensus       118 ~~Y~C~-C~~---~~--------~~~rrH~ri~~g~~~YrC~~C~~~l~~~  156 (160)
T PRK04860        118 FPYRCK-CQE---HQ--------LTVRRHNRVVRGEAVYRCRRCGETLVFK  156 (160)
T ss_pred             EEEEcC-CCC---ee--------CHHHHHHHHhcCCccEECCCCCceeEEe
Confidence            456776 777   34        5678888888888888888888776643


No 40 
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=95.23  E-value=0.0084  Score=49.21  Aligned_cols=52  Identities=29%  Similarity=0.530  Sum_probs=36.1

Q ss_pred             CCCCccc--ccccccccccccccccccchHHHHHhH-c------------------CCCCCccCCcCCCCcccchHHHHh
Q psy8702          17 DSVNPGL--SLYTKSFTWKNLLTNVLTSNLKTHLLT-H------------------TDHKPYECNSCGKVFRRNCDLRRH   75 (194)
Q Consensus        17 ~~~~~~c--~~Cgk~f~~~~~L~~H~~s~L~~H~~~-h------------------~~~k~~~C~~C~k~f~~~~~L~~H   75 (194)
                      .+..|.|  +-|.|.|+..        ..|+.|+.- |                  ...|||.|++|+|.|...-.|+.|
T Consensus       346 d~KpykCpV~gC~K~Yknq--------nGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYH  417 (423)
T COG5189         346 DGKPYKCPVEGCNKKYKNQ--------NGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYH  417 (423)
T ss_pred             cCceecCCCCCchhhhccc--------cchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceec
Confidence            3466778  7899999999        444444431 1                  224788888888888888887776


Q ss_pred             h
Q psy8702          76 A   76 (194)
Q Consensus        76 ~   76 (194)
                      .
T Consensus       418 r  418 (423)
T COG5189         418 R  418 (423)
T ss_pred             c
Confidence            4


No 41 
>PF12756 zf-C2H2_2:  C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=94.91  E-value=0.0062  Score=41.23  Aligned_cols=65  Identities=18%  Similarity=0.124  Sum_probs=15.0

Q ss_pred             cCCcCCCCcccchHHHHhhhhcCCCCCCCCCC----c---cccccccc-cCCCCCCCCCCCCCChhhHhhhccC
Q psy8702          58 ECNSCGKVFRRNCDLRRHALTHAVGDVPPESS----L---AESIGSLK-RDNLPPAPSPTGSQSSEDIEIERTP  123 (194)
Q Consensus        58 ~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~----~---c~~~~~~~-~~~~~~~C~k~F~~~~~L~~H~~~H  123 (194)
                      +|..|+..|.....|..|+...+ +-..-...    .   ........ ....|..|++.|.....|..|++.+
T Consensus         1 ~C~~C~~~f~~~~~l~~H~~~~H-~~~~~~~~~l~~~~~~~~~~~~~~~~~~~C~~C~~~f~s~~~l~~Hm~~~   73 (100)
T PF12756_consen    1 QCLFCDESFSSVDDLLQHMKKKH-GFDIPDQKYLVDPNRLLNYLRKKVKESFRCPYCNKTFRSREALQEHMRSK   73 (100)
T ss_dssp             -------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred             Ccccccccccccccccccccccc-ccccccccccccccccccccccccCCCCCCCccCCCCcCHHHHHHHHcCc
Confidence            58899999999999999987544 32111110    0   00000000 1234667777777777777777643


No 42 
>PF09237 GAGA:  GAGA factor;  InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=94.76  E-value=0.026  Score=33.74  Aligned_cols=33  Identities=18%  Similarity=0.202  Sum_probs=22.6

Q ss_pred             CCCCCcccccccccccccccccccccchHHHHHhHcCCCCC
Q psy8702          16 KDSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKP   56 (194)
Q Consensus        16 ~~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~   56 (194)
                      .+....-|++|+..+...        .+|++|+.+.++.+|
T Consensus        20 ~S~~PatCP~C~a~~~~s--------rnLrRHle~~H~~k~   52 (54)
T PF09237_consen   20 QSEQPATCPICGAVIRQS--------RNLRRHLEIRHFKKP   52 (54)
T ss_dssp             TTS--EE-TTT--EESSH--------HHHHHHHHHHTTTS-
T ss_pred             ccCCCCCCCcchhhccch--------hhHHHHHHHHhcccC
Confidence            345556799999999999        999999988777665


No 43 
>PF12874 zf-met:  Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=94.37  E-value=0.017  Score=29.04  Aligned_cols=21  Identities=33%  Similarity=0.752  Sum_probs=12.3

Q ss_pred             ccCCcCCCCcccchHHHHhhh
Q psy8702          57 YECNSCGKVFRRNCDLRRHAL   77 (194)
Q Consensus        57 ~~C~~C~k~f~~~~~L~~H~~   77 (194)
                      |.|.+|++.|.+...|..|++
T Consensus         1 ~~C~~C~~~f~s~~~~~~H~~   21 (25)
T PF12874_consen    1 FYCDICNKSFSSENSLRQHLR   21 (25)
T ss_dssp             EEETTTTEEESSHHHHHHHHT
T ss_pred             CCCCCCCCCcCCHHHHHHHHC
Confidence            345666666666666666654


No 44 
>PF12874 zf-met:  Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=94.24  E-value=0.017  Score=29.05  Aligned_cols=22  Identities=27%  Similarity=0.359  Sum_probs=19.0

Q ss_pred             cccccccccccccccccccccchHHHHHhH
Q psy8702          21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLT   50 (194)
Q Consensus        21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~   50 (194)
                      |.|+.|++.|...        ..|..|++.
T Consensus         1 ~~C~~C~~~f~s~--------~~~~~H~~s   22 (25)
T PF12874_consen    1 FYCDICNKSFSSE--------NSLRQHLRS   22 (25)
T ss_dssp             EEETTTTEEESSH--------HHHHHHHTT
T ss_pred             CCCCCCCCCcCCH--------HHHHHHHCc
Confidence            5799999999999        888888764


No 45 
>PF12171 zf-C2H2_jaz:  Zinc-finger double-stranded RNA-binding;  InterPro: IPR022755  This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation.   This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=93.69  E-value=0.012  Score=30.36  Aligned_cols=19  Identities=32%  Similarity=0.805  Sum_probs=9.0

Q ss_pred             cCCcCCCCcccchHHHHhh
Q psy8702          58 ECNSCGKVFRRNCDLRRHA   76 (194)
Q Consensus        58 ~C~~C~k~f~~~~~L~~H~   76 (194)
                      -|..|++.|.+...|..|+
T Consensus         3 ~C~~C~k~f~~~~~~~~H~   21 (27)
T PF12171_consen    3 YCDACDKYFSSENQLKQHM   21 (27)
T ss_dssp             BBTTTTBBBSSHHHHHCCT
T ss_pred             CcccCCCCcCCHHHHHHHH
Confidence            3444444444444444443


No 46 
>PF12171 zf-C2H2_jaz:  Zinc-finger double-stranded RNA-binding;  InterPro: IPR022755  This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation.   This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=93.11  E-value=0.041  Score=28.28  Aligned_cols=21  Identities=24%  Similarity=0.248  Sum_probs=16.3

Q ss_pred             CcccccccccccccccccccccchHHHHH
Q psy8702          20 NPGLSLYTKSFTWKNLLTNVLTSNLKTHL   48 (194)
Q Consensus        20 ~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~   48 (194)
                      .++|+.|++.|.+.        ..|..|+
T Consensus         1 q~~C~~C~k~f~~~--------~~~~~H~   21 (27)
T PF12171_consen    1 QFYCDACDKYFSSE--------NQLKQHM   21 (27)
T ss_dssp             -CBBTTTTBBBSSH--------HHHHCCT
T ss_pred             CCCcccCCCCcCCH--------HHHHHHH
Confidence            47899999999999        5555554


No 47 
>PRK04860 hypothetical protein; Provisional
Probab=92.30  E-value=0.086  Score=39.58  Aligned_cols=32  Identities=25%  Similarity=0.563  Sum_probs=28.7

Q ss_pred             CccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccc
Q psy8702          56 PYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAE   92 (194)
Q Consensus        56 ~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~   92 (194)
                      +|.|. |+.   ....+..|.++|. ++++|.|..|+
T Consensus       119 ~Y~C~-C~~---~~~~~rrH~ri~~-g~~~YrC~~C~  150 (160)
T PRK04860        119 PYRCK-CQE---HQLTVRRHNRVVR-GEAVYRCRRCG  150 (160)
T ss_pred             EEEcC-CCC---eeCHHHHHHHHhc-CCccEECCCCC
Confidence            59998 987   7788899999999 99999999886


No 48 
>KOG2893|consensus
Probab=90.92  E-value=0.1  Score=41.33  Aligned_cols=53  Identities=23%  Similarity=0.285  Sum_probs=41.0

Q ss_pred             CCCCCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchHHHHh-hhhcC
Q psy8702          16 KDSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLRRH-ALTHA   80 (194)
Q Consensus        16 ~~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H-~~~h~   80 (194)
                      +.-.++-|-+|++.|...        .-|..|++.    +.|+|-+|-|...+-..|..| +.+|.
T Consensus         6 kk~~kpwcwycnrefdde--------kiliqhqka----khfkchichkkl~sgpglsihcmqvhk   59 (341)
T KOG2893|consen    6 KKVDKPWCWYCNREFDDE--------KILIQHQKA----KHFKCHICHKKLFSGPGLSIHCMQVHK   59 (341)
T ss_pred             cccCCceeeecccccchh--------hhhhhhhhh----ccceeeeehhhhccCCCceeehhhhhh
Confidence            444567899999999999        888888764    669999999887777777777 44443


No 49 
>PF13909 zf-H2C2_5:  C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=90.52  E-value=0.12  Score=25.62  Aligned_cols=22  Identities=32%  Similarity=0.739  Sum_probs=10.9

Q ss_pred             ccCCcCCCCcccchHHHHhhhhc
Q psy8702          57 YECNSCGKVFRRNCDLRRHALTH   79 (194)
Q Consensus        57 ~~C~~C~k~f~~~~~L~~H~~~h   79 (194)
                      |.|+.|+.... ...|..|++.+
T Consensus         1 y~C~~C~y~t~-~~~l~~H~~~~   22 (24)
T PF13909_consen    1 YKCPHCSYSTS-KSNLKRHLKRH   22 (24)
T ss_dssp             EE-SSSS-EES-HHHHHHHHHHH
T ss_pred             CCCCCCCCcCC-HHHHHHHHHhh
Confidence            35566665554 55566665543


No 50 
>PF09538 FYDLN_acid:  Protein of unknown function (FYDLN_acid);  InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=89.42  E-value=0.29  Score=34.22  Aligned_cols=34  Identities=18%  Similarity=0.289  Sum_probs=27.9

Q ss_pred             CCCCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccc
Q psy8702          17 DSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRN   69 (194)
Q Consensus        17 ~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~   69 (194)
                      -+.|..|+.||+.|..-                   +..|..|+.||..|.-.
T Consensus         6 lGtKR~Cp~CG~kFYDL-------------------nk~PivCP~CG~~~~~~   39 (108)
T PF09538_consen    6 LGTKRTCPSCGAKFYDL-------------------NKDPIVCPKCGTEFPPE   39 (108)
T ss_pred             cCCcccCCCCcchhccC-------------------CCCCccCCCCCCccCcc
Confidence            45677899999999775                   34788999999998876


No 51 
>KOG2231|consensus
Probab=89.12  E-value=0.4  Score=43.62  Aligned_cols=41  Identities=10%  Similarity=-0.043  Sum_probs=22.9

Q ss_pred             CCCCCChhhHhhhccCCCCCCCCChh-hhCCC-cc-CCCCCccccccC
Q psy8702         108 PTGSQSSEDIEIERTPSEERSLEPSD-MLGRR-SR-SRTPSPPLRSHS  152 (194)
Q Consensus       108 k~F~~~~~L~~H~~~H~~ekp~~C~~-~Cgk~-f~-~~~~~~hl~~H~  152 (194)
                      .-|..-..|..|-+.    .-|.|++ .|--. |. .+.+..+|+.|.
T Consensus       220 eyy~~~~dLe~HfR~----~HflCE~~~C~~~~f~~~~~~ei~lk~~~  263 (669)
T KOG2231|consen  220 EYYNDYDDLEEHFRK----GHFLCEEEFCRTKKFYVAFELEIELKAHN  263 (669)
T ss_pred             hhcccchHHHHHhhh----cCccccccccccceeeehhHHHHHHHhhc
Confidence            445555666666443    3366653 35433 44 557777788665


No 52 
>PF13913 zf-C2HC_2:  zinc-finger of a C2HC-type
Probab=88.63  E-value=0.34  Score=24.48  Aligned_cols=18  Identities=33%  Similarity=0.803  Sum_probs=8.9

Q ss_pred             cCCcCCCCcccchHHHHhh
Q psy8702          58 ECNSCGKVFRRNCDLRRHA   76 (194)
Q Consensus        58 ~C~~C~k~f~~~~~L~~H~   76 (194)
                      .|+.||+.| ....|..|+
T Consensus         4 ~C~~CgR~F-~~~~l~~H~   21 (25)
T PF13913_consen    4 PCPICGRKF-NPDRLEKHE   21 (25)
T ss_pred             cCCCCCCEE-CHHHHHHHH
Confidence            455555555 334444444


No 53 
>KOG2893|consensus
Probab=87.57  E-value=0.24  Score=39.33  Aligned_cols=45  Identities=7%  Similarity=-0.058  Sum_probs=35.0

Q ss_pred             CCCCCCCCCCCChhhHhhhccCCCCCCCCChhhhCCC-ccCCCCCcc-cccc
Q psy8702         102 LPPAPSPTGSQSSEDIEIERTPSEERSLEPSDMLGRR-SRSRTPSPP-LRSH  151 (194)
Q Consensus       102 ~~~~C~k~F~~~~~L~~H~~~H~~ekp~~C~~~Cgk~-f~~~~~~~h-l~~H  151 (194)
                      -|+.|.+.|....-|.+|++    .|-|+|. ||.|. |.--.|..| +..|
T Consensus        12 wcwycnrefddekiliqhqk----akhfkch-ichkkl~sgpglsihcmqvh   58 (341)
T KOG2893|consen   12 WCWYCNREFDDEKILIQHQK----AKHFKCH-ICHKKLFSGPGLSIHCMQVH   58 (341)
T ss_pred             eeeecccccchhhhhhhhhh----hccceee-eehhhhccCCCceeehhhhh
Confidence            37788999999999999986    4679997 79999 665555555 5555


No 54 
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=86.51  E-value=0.49  Score=25.47  Aligned_cols=22  Identities=18%  Similarity=0.490  Sum_probs=16.2

Q ss_pred             CccCCcCCCCcccchHHHHhhh
Q psy8702          56 PYECNSCGKVFRRNCDLRRHAL   77 (194)
Q Consensus        56 ~~~C~~C~k~f~~~~~L~~H~~   77 (194)
                      +|.|++|++.|.....+..|+.
T Consensus         3 ~~~C~~C~~~~~~~~~~~~H~~   24 (35)
T smart00451        3 GFYCKLCNVTFTDEISVEAHLK   24 (35)
T ss_pred             CeEccccCCccCCHHHHHHHHC
Confidence            4678888888887777777764


No 55 
>KOG2231|consensus
Probab=82.72  E-value=0.69  Score=42.17  Aligned_cols=56  Identities=11%  Similarity=-0.145  Sum_probs=33.2

Q ss_pred             CCCCCCCCCCCCCChhhHhhhccCCCCCCCCChhhhCCC---ccCCCCCccccccCCCCccccc
Q psy8702         100 DNLPPAPSPTGSQSSEDIEIERTPSEERSLEPSDMLGRR---SRSRTPSPPLRSHSPISIFRRH  160 (194)
Q Consensus       100 ~~~~~~C~k~F~~~~~L~~H~~~H~~ekp~~C~~~Cgk~---f~~~~~~~hl~~H~~~~~~~~~  160 (194)
                      +.+|..|...|.....|..|++.++    |.|. +|.+.   .-.|.....|..|-+..-|.+.
T Consensus       182 hp~C~~C~~~fld~~el~rH~~~~h----~~ch-fC~~~~~~neyy~~~~dLe~HfR~~HflCE  240 (669)
T KOG2231|consen  182 HPLCKFCHERFLDDDELYRHLRFDH----EFCH-FCDYKTGQNEYYNDYDDLEEHFRKGHFLCE  240 (669)
T ss_pred             CccchhhhhhhccHHHHHHhhccce----ehee-ecCcccccchhcccchHHHHHhhhcCcccc
Confidence            3456677777777777777776543    4554 36432   1145555667777666666554


No 56 
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=82.54  E-value=0.76  Score=38.66  Aligned_cols=70  Identities=16%  Similarity=0.154  Sum_probs=57.4

Q ss_pred             CCccCCcCCCCcccchHHHHhhh--hcCCCC--CCCCCCccccccccccCCCCCCCCCCCCCChhhHhhhccCCCCCCCC
Q psy8702          55 KPYECNSCGKVFRRNCDLRRHAL--THAVGD--VPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPSEERSLE  130 (194)
Q Consensus        55 k~~~C~~C~k~f~~~~~L~~H~~--~h~~~~--k~~~c~~c~~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~~ekp~~  130 (194)
                      .++.|..|...|.....|..|.+  .|. ++  +++.|+.-             .|++.|.....+..|..+|.+..++.
T Consensus       288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~-~~~~~~~~~p~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~  353 (467)
T COG5048         288 LPIKSKQCNISFSRSSPLTRHLRSVNHS-GESLKPFSCPYS-------------LCGKLFSRNDALKRHILLHTSISPAK  353 (467)
T ss_pred             cCCCCccccCCccccccccccccccccc-cccCCceeeecc-------------CCCccccccccccCCcccccCCCccc
Confidence            46889999999999999999999  788 88  89999731             23689999999999999999988877


Q ss_pred             Chh-hhCCC
Q psy8702         131 PSD-MLGRR  138 (194)
Q Consensus       131 C~~-~Cgk~  138 (194)
                      +.. .|.+.
T Consensus       354 ~~~~~~~~~  362 (467)
T COG5048         354 EKLLNSSSK  362 (467)
T ss_pred             cccccCccc
Confidence            743 34444


No 57 
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=82.42  E-value=1.1  Score=32.12  Aligned_cols=38  Identities=13%  Similarity=0.040  Sum_probs=29.1

Q ss_pred             CCCCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchHHH
Q psy8702          17 DSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLR   73 (194)
Q Consensus        17 ~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~   73 (194)
                      -+.|..|+.||+.|...                   +..|..|+.||..|.-...++
T Consensus         6 lGtKr~Cp~cg~kFYDL-------------------nk~p~vcP~cg~~~~~~~~~~   43 (129)
T TIGR02300         6 LGTKRICPNTGSKFYDL-------------------NRRPAVSPYTGEQFPPEEALK   43 (129)
T ss_pred             hCccccCCCcCcccccc-------------------CCCCccCCCcCCccCcchhhc
Confidence            35677899999999774                   347899999999887664443


No 58 
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer.  Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain.  Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=82.35  E-value=0.89  Score=24.53  Aligned_cols=13  Identities=0%  Similarity=-0.225  Sum_probs=8.6

Q ss_pred             ccccccccccccc
Q psy8702          21 PGLSLYTKSFTWK   33 (194)
Q Consensus        21 ~~c~~Cgk~f~~~   33 (194)
                      +.|..||..+...
T Consensus         2 ~~C~~CGy~y~~~   14 (33)
T cd00350           2 YVCPVCGYIYDGE   14 (33)
T ss_pred             EECCCCCCEECCC
Confidence            5677777776554


No 59 
>PF09986 DUF2225:  Uncharacterized protein conserved in bacteria (DUF2225);  InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=78.88  E-value=0.71  Score=36.29  Aligned_cols=21  Identities=29%  Similarity=0.518  Sum_probs=13.3

Q ss_pred             CCccCCcCCCCcccchHHHHh
Q psy8702          55 KPYECNSCGKVFRRNCDLRRH   75 (194)
Q Consensus        55 k~~~C~~C~k~f~~~~~L~~H   75 (194)
                      +...|++|++.|.++.-+...
T Consensus         4 k~~~CPvC~~~F~~~~vrs~~   24 (214)
T PF09986_consen    4 KKITCPVCGKEFKTKKVRSGK   24 (214)
T ss_pred             CceECCCCCCeeeeeEEEcCC
Confidence            456777777777776544433


No 60 
>KOG2186|consensus
Probab=77.15  E-value=1.6  Score=34.94  Aligned_cols=46  Identities=20%  Similarity=0.426  Sum_probs=37.7

Q ss_pred             cccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchHHHHhhh
Q psy8702          21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLRRHAL   77 (194)
Q Consensus        21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~   77 (194)
                      +-|..||-....         ..+..|+....+ .-|.|..|++.|.. ..+..|..
T Consensus         4 FtCnvCgEsvKK---------p~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~k   49 (276)
T KOG2186|consen    4 FTCNVCGESVKK---------PQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTK   49 (276)
T ss_pred             Eehhhhhhhccc---------cchHHHHHhccC-CeeEEeeccccccc-chhhhhhh
Confidence            568999988877         467779988777 67999999999998 67777865


No 61 
>PF10571 UPF0547:  Uncharacterised protein family UPF0547;  InterPro: IPR018886  This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases. 
Probab=73.82  E-value=3.1  Score=21.22  Aligned_cols=10  Identities=10%  Similarity=-0.147  Sum_probs=5.0

Q ss_pred             CCCCCCCCCC
Q psy8702         101 NLPPAPSPTG  110 (194)
Q Consensus       101 ~~~~~C~k~F  110 (194)
                      ..|+.||..|
T Consensus        15 ~~Cp~CG~~F   24 (26)
T PF10571_consen   15 KFCPHCGYDF   24 (26)
T ss_pred             CcCCCCCCCC
Confidence            3345555555


No 62 
>KOG1146|consensus
Probab=72.27  E-value=1.6  Score=42.61  Aligned_cols=98  Identities=15%  Similarity=0.161  Sum_probs=55.4

Q ss_pred             cccccccccccccccccccchHHHHHhH-cCCCCCccCCcCCCCcccchHHHHhhhh-cCCCCCCCCCCccccccccccC
Q psy8702          23 LSLYTKSFTWKNLLTNVLTSNLKTHLLT-HTDHKPYECNSCGKVFRRNCDLRRHALT-HAVGDVPPESSLAESIGSLKRD  100 (194)
Q Consensus        23 c~~Cgk~f~~~~~L~~H~~s~L~~H~~~-h~~~k~~~C~~C~k~f~~~~~L~~H~~~-h~~~~k~~~c~~c~~~~~~~~~  100 (194)
                      |..|+..+..+        ..+..++.. +.-.+.+.|+.|+..|.....|..|++. |.+++.    .+|.        
T Consensus       439 ~~~~e~~~~s~--------r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~----~~c~--------  498 (1406)
T KOG1146|consen  439 LTKAEPLLESK--------RSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQS----AYCK--------  498 (1406)
T ss_pred             ccchhhhhhhh--------cccccceeeeecccccccCCccchhhhhHHHhhhcccccccccch----hHhH--------
Confidence            34445455554        334444432 3445789999999999999999999986 432333    3332        


Q ss_pred             CCCCCCCCCCCCChhhHhhhccCCCCCCCCChhhhCCCccCCCCCccccccC
Q psy8702         101 NLPPAPSPTGSQSSEDIEIERTPSEERSLEPSDMLGRRSRSRTPSPPLRSHS  152 (194)
Q Consensus       101 ~~~~~C~k~F~~~~~L~~H~~~H~~ekp~~C~~~Cgk~f~~~~~~~hl~~H~  152 (194)
                              .+.....+.+-.-.--+.++|.|. .|..+   ++....|-.|+
T Consensus       499 --------~gq~~~~~arg~~~~~~~~p~~C~-~C~~s---tttng~Lsihl  538 (1406)
T KOG1146|consen  499 --------AGQNHPRLARGEVYRCPGKPYPCR-ACNYS---TTTNGNLSIHL  538 (1406)
T ss_pred             --------hccccccccccccccCCCCcccce-eeeee---eecchHHHHHH
Confidence                    222222222212222345788997 49888   44444444443


No 63 
>KOG2482|consensus
Probab=68.01  E-value=3.8  Score=34.41  Aligned_cols=66  Identities=11%  Similarity=0.013  Sum_probs=34.2

Q ss_pred             ccCCcCCCCcccchHHHHhhhh-cCCC----CCCCCCCccc------cccccccCCCCCCCCCCCCCChhhHhhhcc
Q psy8702          57 YECNSCGKVFRRNCDLRRHALT-HAVG----DVPPESSLAE------SIGSLKRDNLPPAPSPTGSQSSEDIEIERT  122 (194)
Q Consensus        57 ~~C~~C~k~f~~~~~L~~H~~~-h~~~----~k~~~c~~c~------~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~  122 (194)
                      ..|-.|....-+...|..|+.+ |.+.    ...|.-.+.+      ..........|..|.-.|.....|..||.-
T Consensus       280 v~CLfC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~e  356 (423)
T KOG2482|consen  280 VVCLFCTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMVE  356 (423)
T ss_pred             eEEEeeccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhccc
Confidence            4666677666666677777664 2211    1111111110      111111223367788888888888888753


No 64 
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=66.71  E-value=1.6  Score=32.47  Aligned_cols=19  Identities=32%  Similarity=0.613  Sum_probs=14.0

Q ss_pred             CCccCCcCCCCcccchHHH
Q psy8702          55 KPYECNSCGKVFRRNCDLR   73 (194)
Q Consensus        55 k~~~C~~C~k~f~~~~~L~   73 (194)
                      +.++|+.||+.|.+...+.
T Consensus        27 ~~~~c~~c~~~f~~~e~~~   45 (154)
T PRK00464         27 RRRECLACGKRFTTFERVE   45 (154)
T ss_pred             eeeeccccCCcceEeEecc
Confidence            3488999999998765443


No 65 
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=65.37  E-value=4.9  Score=22.05  Aligned_cols=34  Identities=18%  Similarity=0.283  Sum_probs=19.7

Q ss_pred             cccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcc
Q psy8702          21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFR   67 (194)
Q Consensus        21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~   67 (194)
                      ..|+.|+..|.-.        ...     .........|+.|+..|.
T Consensus         3 ~~CP~C~~~~~v~--------~~~-----~~~~~~~v~C~~C~~~~~   36 (38)
T TIGR02098         3 IQCPNCKTSFRVV--------DSQ-----LGANGGKVRCGKCGHVWY   36 (38)
T ss_pred             EECCCCCCEEEeC--------HHH-----cCCCCCEEECCCCCCEEE
Confidence            3578888877665        221     111223477888887664


No 66 
>PF06524 NOA36:  NOA36 protein;  InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=64.64  E-value=8.7  Score=31.11  Aligned_cols=95  Identities=6%  Similarity=0.076  Sum_probs=44.9

Q ss_pred             CCCCccCCcCCCCcccchHHHHhhhh-cCCCCCCCCCCccccccccccCCCCCCCCCCCCCChhhHhhhccCCCCCCCCC
Q psy8702          53 DHKPYECNSCGKVFRRNCDLRRHALT-HAVGDVPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPSEERSLEP  131 (194)
Q Consensus        53 ~~k~~~C~~C~k~f~~~~~L~~H~~~-h~~~~k~~~c~~c~~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~~ekp~~C  131 (194)
                      |.+.|.|..|..-.--..++. |+.. .......|+|..|+..+. ..   |.-|.-.|-.....+.-. .....+++.|
T Consensus       139 GGrif~CsfC~~flCEDDQFE-HQAsCQvLe~E~~KC~SCNrlGq-~s---CLRCK~cfCddHvrrKg~-ky~k~k~~PC  212 (314)
T PF06524_consen  139 GGRIFKCSFCDNFLCEDDQFE-HQASCQVLESETFKCQSCNRLGQ-YS---CLRCKICFCDDHVRRKGF-KYEKGKPIPC  212 (314)
T ss_pred             CCeEEEeecCCCeeeccchhh-hhhhhhhhhcccccccccccccc-hh---hhheeeeehhhhhhhccc-ccccCCCCCC
Confidence            446677777765443444433 3322 222445677777764331 11   222222222211111111 1234589999


Q ss_pred             hhhhCCCccCCCCCccccccCCCCcc
Q psy8702         132 SDMLGRRSRSRTPSPPLRSHSPISIF  157 (194)
Q Consensus       132 ~~~Cgk~f~~~~~~~hl~~H~~~~~~  157 (194)
                      +. ||..   ......|..-++...|
T Consensus       213 PK-Cg~e---t~eTkdLSmStR~hky  234 (314)
T PF06524_consen  213 PK-CGYE---TQETKDLSMSTRSHKY  234 (314)
T ss_pred             CC-CCCc---ccccccceeeeecchh
Confidence            97 9999   4444444444443344


No 67 
>PF09723 Zn-ribbon_8:  Zinc ribbon domain;  InterPro: IPR013429  This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=63.28  E-value=3.2  Score=23.61  Aligned_cols=13  Identities=8%  Similarity=-0.171  Sum_probs=8.8

Q ss_pred             ccccccccccccc
Q psy8702          21 PGLSLYTKSFTWK   33 (194)
Q Consensus        21 ~~c~~Cgk~f~~~   33 (194)
                      |.|..||..|...
T Consensus         6 y~C~~Cg~~fe~~   18 (42)
T PF09723_consen    6 YRCEECGHEFEVL   18 (42)
T ss_pred             EEeCCCCCEEEEE
Confidence            5677777776654


No 68 
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and  believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=62.74  E-value=5.1  Score=21.74  Aligned_cols=12  Identities=0%  Similarity=-0.476  Sum_probs=7.2

Q ss_pred             cccccccccccc
Q psy8702          21 PGLSLYTKSFTW   32 (194)
Q Consensus        21 ~~c~~Cgk~f~~   32 (194)
                      |.|..||..+..
T Consensus         3 ~~C~~CG~i~~g   14 (34)
T cd00729           3 WVCPVCGYIHEG   14 (34)
T ss_pred             EECCCCCCEeEC
Confidence            556677666544


No 69 
>PF13719 zinc_ribbon_5:  zinc-ribbon domain
Probab=61.76  E-value=7.2  Score=21.51  Aligned_cols=33  Identities=21%  Similarity=0.417  Sum_probs=19.8

Q ss_pred             ccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcc
Q psy8702          22 GLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFR   67 (194)
Q Consensus        22 ~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~   67 (194)
                      .|+.|+..|.-.        ..     ++-.+.....|..|+..|.
T Consensus         4 ~CP~C~~~f~v~--------~~-----~l~~~~~~vrC~~C~~~f~   36 (37)
T PF13719_consen    4 TCPNCQTRFRVP--------DD-----KLPAGGRKVRCPKCGHVFR   36 (37)
T ss_pred             ECCCCCceEEcC--------HH-----HcccCCcEEECCCCCcEee
Confidence            477788777665        11     1223345677888877663


No 70 
>PF13717 zinc_ribbon_4:  zinc-ribbon domain
Probab=61.44  E-value=7.3  Score=21.40  Aligned_cols=32  Identities=16%  Similarity=0.392  Sum_probs=19.3

Q ss_pred             ccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCc
Q psy8702          22 GLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVF   66 (194)
Q Consensus        22 ~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f   66 (194)
                      .|+.|+..|.-.        ..     ++-......+|..|+..|
T Consensus         4 ~Cp~C~~~y~i~--------d~-----~ip~~g~~v~C~~C~~~f   35 (36)
T PF13717_consen    4 TCPNCQAKYEID--------DE-----KIPPKGRKVRCSKCGHVF   35 (36)
T ss_pred             ECCCCCCEEeCC--------HH-----HCCCCCcEEECCCCCCEe
Confidence            577788777665        11     122333457788887765


No 71 
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=60.07  E-value=6.6  Score=22.67  Aligned_cols=14  Identities=0%  Similarity=-0.111  Sum_probs=9.4

Q ss_pred             Cccccccccccccc
Q psy8702          20 NPGLSLYTKSFTWK   33 (194)
Q Consensus        20 ~~~c~~Cgk~f~~~   33 (194)
                      .|.|+.||..|...
T Consensus         3 ~y~C~~CG~~~~~~   16 (46)
T PRK00398          3 EYKCARCGREVELD   16 (46)
T ss_pred             EEECCCCCCEEEEC
Confidence            35677787776554


No 72 
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=59.38  E-value=6.5  Score=26.28  Aligned_cols=14  Identities=29%  Similarity=0.831  Sum_probs=9.9

Q ss_pred             CCccCCcCCCCccc
Q psy8702          55 KPYECNSCGKVFRR   68 (194)
Q Consensus        55 k~~~C~~C~k~f~~   68 (194)
                      ..+.|..||..|..
T Consensus        52 GIW~C~kCg~~fAG   65 (89)
T COG1997          52 GIWKCRKCGAKFAG   65 (89)
T ss_pred             CeEEcCCCCCeecc
Confidence            45778888877764


No 73 
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=59.32  E-value=3.8  Score=24.23  Aligned_cols=13  Identities=8%  Similarity=-0.107  Sum_probs=8.4

Q ss_pred             ccccccccccccc
Q psy8702          21 PGLSLYTKSFTWK   33 (194)
Q Consensus        21 ~~c~~Cgk~f~~~   33 (194)
                      |.|..||..|...
T Consensus         6 y~C~~Cg~~fe~~   18 (52)
T TIGR02605         6 YRCTACGHRFEVL   18 (52)
T ss_pred             EEeCCCCCEeEEE
Confidence            5677777766554


No 74 
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=58.07  E-value=5.2  Score=30.52  Aligned_cols=17  Identities=18%  Similarity=0.315  Sum_probs=7.7

Q ss_pred             CCccCCcCCCCcccchH
Q psy8702          55 KPYECNSCGKVFRRNCD   71 (194)
Q Consensus        55 k~~~C~~C~k~f~~~~~   71 (194)
                      .-|.|+.|+..|+....
T Consensus       116 ~~Y~Cp~C~~rytf~eA  132 (178)
T PRK06266        116 MFFFCPNCHIRFTFDEA  132 (178)
T ss_pred             CEEECCCCCcEEeHHHH
Confidence            33445445444444433


No 75 
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=57.54  E-value=7.4  Score=29.38  Aligned_cols=24  Identities=33%  Similarity=0.768  Sum_probs=18.3

Q ss_pred             CCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccc
Q psy8702          55 KPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAE   92 (194)
Q Consensus        55 k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~   92 (194)
                      +.|.|.+||.             ++. |+-|-.|+.|+
T Consensus       133 ~~~vC~vCGy-------------~~~-ge~P~~CPiCg  156 (166)
T COG1592         133 KVWVCPVCGY-------------THE-GEAPEVCPICG  156 (166)
T ss_pred             CEEEcCCCCC-------------ccc-CCCCCcCCCCC
Confidence            3688999985             345 78888888886


No 76 
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=57.33  E-value=13  Score=31.49  Aligned_cols=96  Identities=16%  Similarity=0.084  Sum_probs=56.1

Q ss_pred             CCCCCCCCCCCC-CCCCCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCC----CcccchHHHHhhhh
Q psy8702           4 PRKNHSNVGNVA-KDSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGK----VFRRNCDLRRHALT   78 (194)
Q Consensus         4 ~~r~~~~~~~~~-~~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k----~f~~~~~L~~H~~~   78 (194)
                      +.|+|.+.+... -=..-|.|..|.+-|...        ..|..|++.-+. +-+.|+..+.    -|..-.+|..|.  
T Consensus       203 ~Lr~H~~~G~~e~GFKGHP~C~FC~~~FYdD--------DEL~~HcR~~HE-~ChICD~v~p~~~QYFK~Y~~Le~HF--  271 (493)
T COG5236         203 TLRDHKNGGLEEEGFKGHPLCIFCKIYFYDD--------DELRRHCRLRHE-ACHICDMVGPIRYQYFKSYEDLEAHF--  271 (493)
T ss_pred             cccccccCCccccCcCCCchhhhccceecCh--------HHHHHHHHhhhh-hhhhhhccCccchhhhhCHHHHHHHh--
Confidence            456777665554 234558899999999998        888999875433 3344433331    233344454443  


Q ss_pred             cCCCCCCCCCCccccccccccCCCCCCCCCCCCCChhhHhhhcc
Q psy8702          79 HAVGDVPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERT  122 (194)
Q Consensus        79 h~~~~k~~~c~~c~~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~  122 (194)
                         ...-|.|.+-.+..         .--..|.....|..|+..
T Consensus       272 ---~~~hy~ct~qtc~~---------~k~~vf~~~~el~~h~~~  303 (493)
T COG5236         272 ---RNAHYCCTFQTCRV---------GKCYVFPYHTELLEHLTR  303 (493)
T ss_pred             ---hcCceEEEEEEEec---------CcEEEeccHHHHHHHHHH
Confidence               33446676422111         001368888999999743


No 77 
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=56.67  E-value=6.5  Score=21.75  Aligned_cols=13  Identities=8%  Similarity=-0.169  Sum_probs=8.7

Q ss_pred             ccccccccccccc
Q psy8702          21 PGLSLYTKSFTWK   33 (194)
Q Consensus        21 ~~c~~Cgk~f~~~   33 (194)
                      |.|..||..|...
T Consensus         6 y~C~~Cg~~fe~~   18 (41)
T smart00834        6 YRCEDCGHTFEVL   18 (41)
T ss_pred             EEcCCCCCEEEEE
Confidence            5677777776654


No 78 
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=56.34  E-value=28  Score=24.47  Aligned_cols=77  Identities=10%  Similarity=-0.016  Sum_probs=41.3

Q ss_pred             CCccCCcCCCCcccchHHHHhhhhcC-CCCCCCCCCccccccccccCCCCCCCCCCCCCChhhHhhhccCCCCCCCCChh
Q psy8702          55 KPYECNSCGKVFRRNCDLRRHALTHA-VGDVPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPSEERSLEPSD  133 (194)
Q Consensus        55 k~~~C~~C~k~f~~~~~L~~H~~~h~-~~~k~~~c~~c~~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~~ekp~~C~~  133 (194)
                      -|..|++||-+......|.+.  .|+ -.-++|+--.-   ........|..|.+.|........  ..-.....|.|+.
T Consensus        14 LP~~CpiCgLtLVss~HLARS--yHHLfPl~~f~ev~~---~~~~~~~~C~~C~~~f~~~~~~~~--~~~~~~~~y~C~~   86 (112)
T TIGR00622        14 LPVECPICGLTLILSTHLARS--YHHLFPLKAFQEIPL---EEYNGSRFCFGCQGPFPKPPVSPF--DELKDSHRYVCAV   86 (112)
T ss_pred             CCCcCCcCCCEEeccchHHHh--hhccCCCcccccccc---cccCCCCcccCcCCCCCCcccccc--cccccccceeCCC
Confidence            467899999888888887763  222 01222321100   011123347778888875432110  0122345788986


Q ss_pred             hhCCCc
Q psy8702         134 MLGRRS  139 (194)
Q Consensus       134 ~Cgk~f  139 (194)
                       |...|
T Consensus        87 -C~~~F   91 (112)
T TIGR00622        87 -CKNVF   91 (112)
T ss_pred             -CCCcc
Confidence             98883


No 79 
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=55.91  E-value=8.9  Score=23.71  Aligned_cols=9  Identities=11%  Similarity=0.032  Sum_probs=4.9

Q ss_pred             CCCCCCccc
Q psy8702          84 VPPESSLAE   92 (194)
Q Consensus        84 k~~~c~~c~   92 (194)
                      .+|.|+.|+
T Consensus        49 ~~Y~Cp~CG   57 (61)
T COG2888          49 NPYRCPKCG   57 (61)
T ss_pred             CceECCCcC
Confidence            455555554


No 80 
>PF15135 UPF0515:  Uncharacterised protein UPF0515
Probab=55.66  E-value=10  Score=30.46  Aligned_cols=11  Identities=27%  Similarity=1.090  Sum_probs=5.8

Q ss_pred             CCccCCcCCCC
Q psy8702          55 KPYECNSCGKV   65 (194)
Q Consensus        55 k~~~C~~C~k~   65 (194)
                      +.|.|..|+..
T Consensus       111 rqFaC~~Cd~~  121 (278)
T PF15135_consen  111 RQFACSSCDHM  121 (278)
T ss_pred             eeeeccccchH
Confidence            44556666433


No 81 
>KOG1146|consensus
Probab=54.74  E-value=6.9  Score=38.50  Aligned_cols=51  Identities=20%  Similarity=0.316  Sum_probs=41.3

Q ss_pred             CCcccccccccccccccccccccchHHHHHhHc-------------------------CCCCCccCCcCCCCcccchHHH
Q psy8702          19 VNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTH-------------------------TDHKPYECNSCGKVFRRNCDLR   73 (194)
Q Consensus        19 ~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h-------------------------~~~k~~~C~~C~k~f~~~~~L~   73 (194)
                      ..+.|+.|+..|+..        ..|..|||.-                         -+.++|.|..|...+..+.+|.
T Consensus       464 kt~~cpkc~~~yk~a--------~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~p~~C~~C~~stttng~Ls  535 (1406)
T KOG1146|consen  464 KTLKCPKCNWHYKLA--------QTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGKPYPCRACNYSTTTNGNLS  535 (1406)
T ss_pred             ccccCCccchhhhhH--------HHhhhcccccccccchhHhHhccccccccccccccCCCCcccceeeeeeeecchHHH
Confidence            667899999999999        6666666651                         1236799999999999999999


Q ss_pred             Hhhh
Q psy8702          74 RHAL   77 (194)
Q Consensus        74 ~H~~   77 (194)
                      +|+.
T Consensus       536 ihlq  539 (1406)
T KOG1146|consen  536 IHLQ  539 (1406)
T ss_pred             HHHH
Confidence            9976


No 82 
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=53.75  E-value=12  Score=21.90  Aligned_cols=29  Identities=28%  Similarity=0.252  Sum_probs=19.8

Q ss_pred             CcccccccccccccccccccccchHHHHHh-Hc
Q psy8702          20 NPGLSLYTKSFTWKNLLTNVLTSNLKTHLL-TH   51 (194)
Q Consensus        20 ~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~-~h   51 (194)
                      .=.|..|++.+...+.   ..+++|.+|+. .|
T Consensus        18 ~a~C~~C~~~l~~~~~---~gTs~L~rHl~~~h   47 (50)
T smart00614       18 RAKCKYCGKKLSRSSK---GGTSNLRRHLRRKH   47 (50)
T ss_pred             EEEecCCCCEeeeCCC---CCcHHHHHHHHhHC
Confidence            3468999999876532   34468888887 44


No 83 
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=53.61  E-value=10  Score=27.89  Aligned_cols=37  Identities=14%  Similarity=0.125  Sum_probs=20.8

Q ss_pred             CCCCCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCcccc
Q psy8702          52 TDHKPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAES   93 (194)
Q Consensus        52 ~~~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~~   93 (194)
                      .+..-|.|+.|+..|.....+..    .. ....|.|+.|+.
T Consensus        95 ~~~~~Y~Cp~C~~~y~~~ea~~~----~d-~~~~f~Cp~Cg~  131 (147)
T smart00531       95 TNNAYYKCPNCQSKYTFLEANQL----LD-MDGTFTCPRCGE  131 (147)
T ss_pred             cCCcEEECcCCCCEeeHHHHHHh----cC-CCCcEECCCCCC
Confidence            34456778888877775544322    01 122377777763


No 84 
>PF05443 ROS_MUCR:  ROS/MUCR transcriptional regulator protein;  InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=53.00  E-value=9.7  Score=27.62  Aligned_cols=29  Identities=31%  Similarity=0.195  Sum_probs=16.9

Q ss_pred             CCCCcccccccccccccccccccccchHHHHHhHcCCCCC
Q psy8702          17 DSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKP   56 (194)
Q Consensus        17 ~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~   56 (194)
                      ......|-+|||.|..           |++|++.|.|-.|
T Consensus        69 ~~d~i~clecGk~~k~-----------LkrHL~~~~gltp   97 (132)
T PF05443_consen   69 TPDYIICLECGKKFKT-----------LKRHLRTHHGLTP   97 (132)
T ss_dssp             -SS-EE-TBT--EESB-----------HHHHHHHTT-S-H
T ss_pred             ccCeeEEccCCcccch-----------HHHHHHHccCCCH
Confidence            3444578999999855           7999999977443


No 85 
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=47.78  E-value=7.1  Score=23.82  Aligned_cols=23  Identities=43%  Similarity=0.807  Sum_probs=10.5

Q ss_pred             CCCccCCcCCCCcccchHHHHhh
Q psy8702          54 HKPYECNSCGKVFRRNCDLRRHA   76 (194)
Q Consensus        54 ~k~~~C~~C~k~f~~~~~L~~H~   76 (194)
                      |.-+.|+-||..|.....+..|+
T Consensus        15 E~~lrCPRC~~~FR~~K~Y~RHV   37 (65)
T COG4049          15 EEFLRCPRCGMVFRRRKDYIRHV   37 (65)
T ss_pred             ceeeeCCchhHHHHHhHHHHHHh
Confidence            33344444444444444444443


No 86 
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=47.29  E-value=12  Score=26.23  Aligned_cols=32  Identities=19%  Similarity=0.110  Sum_probs=24.4

Q ss_pred             CCCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCccc
Q psy8702          18 SVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRR   68 (194)
Q Consensus        18 ~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~   68 (194)
                      ..+--|+.||+.|...                   +..|..|+.||++|..
T Consensus         7 GtKridPetg~KFYDL-------------------NrdPiVsPytG~s~P~   38 (129)
T COG4530           7 GTKRIDPETGKKFYDL-------------------NRDPIVSPYTGKSYPR   38 (129)
T ss_pred             cccccCccccchhhcc-------------------CCCccccCcccccchH
Confidence            3445689999998764                   3478899999999854


No 87 
>PF03604 DNA_RNApol_7kD:  DNA directed RNA polymerase, 7 kDa subunit;  InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=46.52  E-value=16  Score=19.58  Aligned_cols=13  Identities=8%  Similarity=-0.273  Sum_probs=6.6

Q ss_pred             ccccccccccccc
Q psy8702          21 PGLSLYTKSFTWK   33 (194)
Q Consensus        21 ~~c~~Cgk~f~~~   33 (194)
                      |.|..||..+..+
T Consensus         1 Y~C~~Cg~~~~~~   13 (32)
T PF03604_consen    1 YICGECGAEVELK   13 (32)
T ss_dssp             EBESSSSSSE-BS
T ss_pred             CCCCcCCCeeEcC
Confidence            3466666665543


No 88 
>COG1571 Predicted DNA-binding protein containing a Zn-ribbon domain [General function prediction only]
Probab=45.70  E-value=15  Score=31.95  Aligned_cols=37  Identities=19%  Similarity=0.323  Sum_probs=27.8

Q ss_pred             CCCCCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchH
Q psy8702          16 KDSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCD   71 (194)
Q Consensus        16 ~~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~   71 (194)
                      ....+|.|+.||.+....                   |.+-|.|+.||..+.....
T Consensus       346 ~~~~~p~Cp~Cg~~m~S~-------------------G~~g~rC~kCg~~~~~~~~  382 (421)
T COG1571         346 YERVNPVCPRCGGRMKSA-------------------GRNGFRCKKCGTRARETLI  382 (421)
T ss_pred             eEEcCCCCCccCCchhhc-------------------CCCCcccccccccCCcccc
Confidence            456778999999886654                   4447999999988776543


No 89 
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=45.21  E-value=8.7  Score=24.79  Aligned_cols=42  Identities=12%  Similarity=0.174  Sum_probs=25.8

Q ss_pred             cccccccccccccccccccccchHHHHHhHcCCCCCccCC--cCCCCcccchHH
Q psy8702          21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECN--SCGKVFRRNCDL   72 (194)
Q Consensus        21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~--~C~k~f~~~~~L   72 (194)
                      ..|+.||..-.-.        .+-...  .-.-+.-++|.  .||..|.....+
T Consensus         2 m~CP~Cg~~a~ir--------tSr~~s--~~~~~~Y~qC~N~eCg~tF~t~es~   45 (72)
T PRK09678          2 FHCPLCQHAAHAR--------TSRYIT--DTTKERYHQCQNVNCSATFITYESV   45 (72)
T ss_pred             ccCCCCCCccEEE--------EChhcC--hhhheeeeecCCCCCCCEEEEEEEE
Confidence            4689998765333        221111  11345678898  899999886554


No 90 
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=43.07  E-value=17  Score=21.00  Aligned_cols=12  Identities=0%  Similarity=-0.418  Sum_probs=6.9

Q ss_pred             cccccccccccc
Q psy8702          21 PGLSLYTKSFTW   32 (194)
Q Consensus        21 ~~c~~Cgk~f~~   32 (194)
                      |.|..||..|.-
T Consensus         3 Y~C~~Cg~~~~~   14 (44)
T smart00659        3 YICGECGRENEI   14 (44)
T ss_pred             EECCCCCCEeec
Confidence            456666665543


No 91 
>PF13248 zf-ribbon_3:  zinc-ribbon domain
Probab=42.55  E-value=19  Score=18.02  Aligned_cols=7  Identities=14%  Similarity=0.206  Sum_probs=3.0

Q ss_pred             CCCCCCC
Q psy8702         101 NLPPAPS  107 (194)
Q Consensus       101 ~~~~~C~  107 (194)
                      ..|+.||
T Consensus        17 ~fC~~CG   23 (26)
T PF13248_consen   17 KFCPNCG   23 (26)
T ss_pred             ccChhhC
Confidence            3344444


No 92 
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=41.91  E-value=19  Score=22.23  Aligned_cols=9  Identities=44%  Similarity=1.350  Sum_probs=4.4

Q ss_pred             ccCCcCCCC
Q psy8702          57 YECNSCGKV   65 (194)
Q Consensus        57 ~~C~~C~k~   65 (194)
                      |.|+.||..
T Consensus        26 F~CPnCG~~   34 (59)
T PRK14890         26 FLCPNCGEV   34 (59)
T ss_pred             eeCCCCCCe
Confidence            445555544


No 93 
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=41.86  E-value=22  Score=17.96  Aligned_cols=20  Identities=15%  Similarity=0.257  Sum_probs=13.1

Q ss_pred             ccCCcCCCCcccchHHHHhhh
Q psy8702          57 YECNSCGKVFRRNCDLRRHAL   77 (194)
Q Consensus        57 ~~C~~C~k~f~~~~~L~~H~~   77 (194)
                      ..|++|++.+ ....++.|..
T Consensus         2 v~CPiC~~~v-~~~~in~HLD   21 (26)
T smart00734        2 VQCPVCFREV-PENLINSHLD   21 (26)
T ss_pred             CcCCCCcCcc-cHHHHHHHHH
Confidence            3578888776 4456666654


No 94 
>KOG4167|consensus
Probab=40.36  E-value=11  Score=34.92  Aligned_cols=28  Identities=18%  Similarity=0.019  Sum_probs=22.0

Q ss_pred             CCCcccccccccccccccccccccchHHHHHhHcCC
Q psy8702          18 SVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTD   53 (194)
Q Consensus        18 ~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~   53 (194)
                      ...|.|.+|+|.|...        .++..||++|.-
T Consensus       790 ~giFpCreC~kvF~Ki--------KSrNAHMK~Hr~  817 (907)
T KOG4167|consen  790 TGIFPCRECGKVFFKI--------KSRNAHMKTHRQ  817 (907)
T ss_pred             CceeehHHHHHHHHHH--------hhhhHHHHHHHH
Confidence            4568899999999998        777777777643


No 95 
>TIGR00686 phnA alkylphosphonate utilization operon protein PhnA. The protein family includes an uncharacterized member designated phnA in Escherichia coli, part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage. This protein is not related to the characterized phosphonoacetate hydrolase designated PhnA by Kulakova, et al. (2001, 1997).
Probab=40.25  E-value=19  Score=25.12  Aligned_cols=30  Identities=23%  Similarity=0.444  Sum_probs=23.2

Q ss_pred             cccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccc
Q psy8702          21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRN   69 (194)
Q Consensus        21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~   69 (194)
                      |.|+.|+-.|++.                   ....+.|++|+..|...
T Consensus         3 p~CP~C~seytY~-------------------dg~~~iCpeC~~EW~~~   32 (109)
T TIGR00686         3 PPCPKCNSEYTYH-------------------DGTQLICPSCLYEWNEN   32 (109)
T ss_pred             CcCCcCCCcceEe-------------------cCCeeECcccccccccc
Confidence            6799999888876                   23468999999877654


No 96 
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=39.78  E-value=28  Score=25.88  Aligned_cols=34  Identities=12%  Similarity=0.058  Sum_probs=24.3

Q ss_pred             hHcCCCCCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccc
Q psy8702          49 LTHTDHKPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAE   92 (194)
Q Consensus        49 ~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~   92 (194)
                      ....+..-|.|+.|+..|+....+.          ..|.|+.|+
T Consensus       102 ~~e~~~~~Y~Cp~c~~r~tf~eA~~----------~~F~Cp~Cg  135 (158)
T TIGR00373       102 EFETNNMFFICPNMCVRFTFNEAME----------LNFTCPRCG  135 (158)
T ss_pred             hhccCCCeEECCCCCcEeeHHHHHH----------cCCcCCCCC
Confidence            3345556788999998888777763          258888876


No 97 
>KOG2186|consensus
Probab=39.77  E-value=12  Score=30.13  Aligned_cols=56  Identities=13%  Similarity=0.163  Sum_probs=41.0

Q ss_pred             CccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccccccccccCCCCCCCCCCCCCChhhHhhhccCCCCCCCC
Q psy8702          56 PYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPSEERSLE  130 (194)
Q Consensus        56 ~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~~ekp~~  130 (194)
                      -|.|.+||.... +..+..|+.... + .-|.|.-|               +++|.. .+...|...-++...|.
T Consensus         3 ~FtCnvCgEsvK-Kp~vekH~srCr-n-~~fSCIDC---------------~k~F~~-~sYknH~kCITEaQKYg   58 (276)
T KOG2186|consen    3 FFTCNVCGESVK-KPQVEKHMSRCR-N-AYFSCIDC---------------GKTFER-VSYKNHTKCITEAQKYG   58 (276)
T ss_pred             EEehhhhhhhcc-ccchHHHHHhcc-C-CeeEEeec---------------cccccc-chhhhhhhhcchHHHhh
Confidence            378999998644 567777888776 4 77888766               579987 78888887766655553


No 98 
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=38.40  E-value=15  Score=24.75  Aligned_cols=14  Identities=21%  Similarity=0.610  Sum_probs=8.3

Q ss_pred             CCccCCcCCCCccc
Q psy8702          55 KPYECNSCGKVFRR   68 (194)
Q Consensus        55 k~~~C~~C~k~f~~   68 (194)
                      ..+.|..|++.|..
T Consensus        53 GIW~C~~C~~~~AG   66 (90)
T PTZ00255         53 GIWRCKGCKKTVAG   66 (90)
T ss_pred             EEEEcCCCCCEEeC
Confidence            34666666665553


No 99 
>PHA00626 hypothetical protein
Probab=37.72  E-value=16  Score=22.34  Aligned_cols=16  Identities=25%  Similarity=0.619  Sum_probs=12.1

Q ss_pred             CCCccCCcCCCCcccc
Q psy8702          54 HKPYECNSCGKVFRRN   69 (194)
Q Consensus        54 ~k~~~C~~C~k~f~~~   69 (194)
                      ...|.|..||..|+..
T Consensus        21 snrYkCkdCGY~ft~~   36 (59)
T PHA00626         21 SDDYVCCDCGYNDSKD   36 (59)
T ss_pred             CcceEcCCCCCeechh
Confidence            3568898898877764


No 100
>PF07754 DUF1610:  Domain of unknown function (DUF1610);  InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=37.62  E-value=11  Score=18.80  Aligned_cols=10  Identities=10%  Similarity=0.046  Sum_probs=6.9

Q ss_pred             CCCCChhhhCC
Q psy8702         127 RSLEPSDMLGR  137 (194)
Q Consensus       127 kp~~C~~~Cgk  137 (194)
                      -+|.|+. ||+
T Consensus        15 v~f~CPn-CG~   24 (24)
T PF07754_consen   15 VPFPCPN-CGF   24 (24)
T ss_pred             ceEeCCC-CCC
Confidence            3578886 874


No 101
>PF02892 zf-BED:  BED zinc finger;  InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=37.42  E-value=21  Score=20.09  Aligned_cols=12  Identities=42%  Similarity=1.032  Sum_probs=4.2

Q ss_pred             CccCCcCCCCcc
Q psy8702          56 PYECNSCGKVFR   67 (194)
Q Consensus        56 ~~~C~~C~k~f~   67 (194)
                      ...|..|++.+.
T Consensus        16 ~a~C~~C~~~~~   27 (45)
T PF02892_consen   16 KAKCKYCGKVIK   27 (45)
T ss_dssp             -EEETTTTEE--
T ss_pred             eEEeCCCCeEEe
Confidence            344444544433


No 102
>KOG2593|consensus
Probab=36.93  E-value=23  Score=30.80  Aligned_cols=42  Identities=17%  Similarity=0.180  Sum_probs=29.3

Q ss_pred             HHhHcCCCCCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccc
Q psy8702          47 HLLTHTDHKPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAE   92 (194)
Q Consensus        47 H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~   92 (194)
                      -++--+....|.|+.|.+.|+....++.   +-. ..-.|.|..|+
T Consensus       119 ~~~d~t~~~~Y~Cp~C~kkyt~Lea~~L---~~~-~~~~F~C~~C~  160 (436)
T KOG2593|consen  119 RLRDDTNVAGYVCPNCQKKYTSLEALQL---LDN-ETGEFHCENCG  160 (436)
T ss_pred             HhhhccccccccCCccccchhhhHHHHh---hcc-cCceEEEecCC
Confidence            3344456678999999999998766543   222 45678888886


No 103
>COG3677 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Probab=36.73  E-value=18  Score=26.09  Aligned_cols=16  Identities=44%  Similarity=1.124  Sum_probs=12.0

Q ss_pred             CCCccCCcCCCCcccc
Q psy8702          54 HKPYECNSCGKVFRRN   69 (194)
Q Consensus        54 ~k~~~C~~C~k~f~~~   69 (194)
                      ...|.|..|++.|...
T Consensus        51 ~qRyrC~~C~~tf~~~   66 (129)
T COG3677          51 HQRYKCKSCGSTFTVE   66 (129)
T ss_pred             ccccccCCcCcceeee
Confidence            4668888888887754


No 104
>COG4896 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=34.71  E-value=17  Score=22.60  Aligned_cols=30  Identities=7%  Similarity=-0.114  Sum_probs=20.6

Q ss_pred             CCCCCChhhHhhhccCCCCCCCCChhhhCCC
Q psy8702         108 PTGSQSSEDIEIERTPSEERSLEPSDMLGRR  138 (194)
Q Consensus       108 k~F~~~~~L~~H~~~H~~ekp~~C~~~Cgk~  138 (194)
                      ..+.....+..-+.....-+.|.|++ |.-+
T Consensus        11 v~~iD~rt~~tKrLrN~PIrtymC~e-C~~R   40 (68)
T COG4896          11 VDEIDNRTFKTKRLRNKPIRTYMCPE-CEHR   40 (68)
T ss_pred             eeeecchhHHHHHhhCCCceeEechh-hHhh
Confidence            45666666665555555556799998 9877


No 105
>PRK10220 hypothetical protein; Provisional
Probab=34.42  E-value=29  Score=24.23  Aligned_cols=31  Identities=19%  Similarity=0.490  Sum_probs=23.6

Q ss_pred             cccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccch
Q psy8702          21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNC   70 (194)
Q Consensus        21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~   70 (194)
                      |.|+.|+-.|++.                   ....|.|++|+..|....
T Consensus         4 P~CP~C~seytY~-------------------d~~~~vCpeC~hEW~~~~   34 (111)
T PRK10220          4 PHCPKCNSEYTYE-------------------DNGMYICPECAHEWNDAE   34 (111)
T ss_pred             CcCCCCCCcceEc-------------------CCCeEECCcccCcCCccc
Confidence            6799999888876                   234689999998776553


No 106
>PF04959 ARS2:  Arsenite-resistance protein 2;  InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=34.07  E-value=19  Score=28.32  Aligned_cols=18  Identities=22%  Similarity=0.056  Sum_probs=11.6

Q ss_pred             CCCCCccccccccccccc
Q psy8702          16 KDSVNPGLSLYTKSFTWK   33 (194)
Q Consensus        16 ~~~~~~~c~~Cgk~f~~~   33 (194)
                      ....+|.|+.|+|.|.-.
T Consensus        73 ~~~~K~~C~lc~KlFkg~   90 (214)
T PF04959_consen   73 EDEDKWRCPLCGKLFKGP   90 (214)
T ss_dssp             SSSEEEEE-SSS-EESSH
T ss_pred             HcCCEECCCCCCcccCCh
Confidence            455677788888888777


No 107
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=33.20  E-value=13  Score=31.42  Aligned_cols=58  Identities=16%  Similarity=0.100  Sum_probs=35.6

Q ss_pred             cCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccccccccccCCCCCCCCCCCCCChhhHhhhccCCCCCCCCCh
Q psy8702          58 ECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPSEERSLEPS  132 (194)
Q Consensus        58 ~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~~ekp~~C~  132 (194)
                      .|..|..-|.....|..|++..+  |+-|.|+.-+           +.=.+-|..-.+|..|-+    ..-|-|.
T Consensus       222 ~C~FC~~~FYdDDEL~~HcR~~H--E~ChICD~v~-----------p~~~QYFK~Y~~Le~HF~----~~hy~ct  279 (493)
T COG5236         222 LCIFCKIYFYDDDELRRHCRLRH--EACHICDMVG-----------PIRYQYFKSYEDLEAHFR----NAHYCCT  279 (493)
T ss_pred             hhhhccceecChHHHHHHHHhhh--hhhhhhhccC-----------ccchhhhhCHHHHHHHhh----cCceEEE
Confidence            47888888888888888887655  5555554321           011234666677777754    2335564


No 108
>KOG3583|consensus
Probab=33.13  E-value=23  Score=27.97  Aligned_cols=24  Identities=38%  Similarity=0.654  Sum_probs=20.0

Q ss_pred             cccCCCCCCCCCCCCcccCCCCCC
Q psy8702         171 RHSQSPQPAHSSSIPFRHHSPIKF  194 (194)
Q Consensus       171 ~~~~~~~~~~~~~~~~~~~~~~~~  194 (194)
                      .|+|-|+-.+--.||.|||+|.+|
T Consensus       254 s~hq~P~~s~~~~~~~~~~~~~~~  277 (279)
T KOG3583|consen  254 SHHQQPQYSHQQPMNPQHHSPMLL  277 (279)
T ss_pred             hhccCccccccCCCCcccccchhh
Confidence            455778888889999999999875


No 109
>TIGR00280 L37a ribosomal protein L37a. This model finds eukaryotic ribosomal protein L37a and its archaeal orthologs. The nomeclature is tricky because eukaryotes have proteins called both L37 and L37a.
Probab=32.83  E-value=18  Score=24.45  Aligned_cols=12  Identities=33%  Similarity=0.991  Sum_probs=7.4

Q ss_pred             CccCCcCCCCcc
Q psy8702          56 PYECNSCGKVFR   67 (194)
Q Consensus        56 ~~~C~~C~k~f~   67 (194)
                      .+.|..|++.|.
T Consensus        53 IW~C~~C~~~~A   64 (91)
T TIGR00280        53 IWTCRKCGAKFA   64 (91)
T ss_pred             EEEcCCCCCEEe
Confidence            466666666554


No 110
>PF01780 Ribosomal_L37ae:  Ribosomal L37ae protein family;  InterPro: IPR002674 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits.  Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This ribosomal protein is found in archaebacteria and eukaryotes []. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A1E_Y 4A17_Y 4A1C_Y 4A1A_Y 3O58_g 3IZS_m 3O5H_g 1S1I_9 3IZR_m 1YSH_D ....
Probab=32.70  E-value=12  Score=25.26  Aligned_cols=12  Identities=42%  Similarity=1.140  Sum_probs=7.0

Q ss_pred             CccCCcCCCCcc
Q psy8702          56 PYECNSCGKVFR   67 (194)
Q Consensus        56 ~~~C~~C~k~f~   67 (194)
                      .|.|..|++.|.
T Consensus        53 IW~C~~C~~~~A   64 (90)
T PF01780_consen   53 IWKCKKCGKKFA   64 (90)
T ss_dssp             EEEETTTTEEEE
T ss_pred             EeecCCCCCEEe
Confidence            366666665554


No 111
>PRK03564 formate dehydrogenase accessory protein FdhE; Provisional
Probab=32.16  E-value=26  Score=29.25  Aligned_cols=49  Identities=16%  Similarity=0.021  Sum_probs=26.8

Q ss_pred             CCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccccccccccCCCCCCCCC
Q psy8702          55 KPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAESIGSLKRDNLPPAPSP  108 (194)
Q Consensus        55 k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~~~~~~~~~~~~~~C~k  108 (194)
                      ..-.|++||..-... -+  +..... |.+-..|..|+. .=......|+.||.
T Consensus       186 ~~~~CPvCGs~P~~s-~v--~~~~~~-G~RyL~CslC~t-eW~~~R~~C~~Cg~  234 (309)
T PRK03564        186 QRQFCPVCGSMPVSS-VV--QIGTTQ-GLRYLHCNLCES-EWHVVRVKCSNCEQ  234 (309)
T ss_pred             CCCCCCCCCCcchhh-ee--eccCCC-CceEEEcCCCCC-cccccCccCCCCCC
Confidence            446799998642211 11  112233 666678887752 11223556889985


No 112
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=31.76  E-value=25  Score=20.89  Aligned_cols=11  Identities=0%  Similarity=-0.133  Sum_probs=6.9

Q ss_pred             Ccccccccccc
Q psy8702          20 NPGLSLYTKSF   30 (194)
Q Consensus        20 ~~~c~~Cgk~f   30 (194)
                      .|.|..||+.|
T Consensus         6 ~Y~C~~Cg~~~   16 (49)
T COG1996           6 EYKCARCGREV   16 (49)
T ss_pred             EEEhhhcCCee
Confidence            35566666666


No 113
>PF08274 PhnA_Zn_Ribbon:  PhnA Zinc-Ribbon ;  InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=30.87  E-value=19  Score=18.99  Aligned_cols=12  Identities=25%  Similarity=0.551  Sum_probs=2.6

Q ss_pred             cccccccccccc
Q psy8702          21 PGLSLYTKSFTW   32 (194)
Q Consensus        21 ~~c~~Cgk~f~~   32 (194)
                      +.|+.|+-.+..
T Consensus         3 p~Cp~C~se~~y   14 (30)
T PF08274_consen    3 PKCPLCGSEYTY   14 (30)
T ss_dssp             ---TTT-----E
T ss_pred             CCCCCCCCccee
Confidence            457777766544


No 114
>PF14446 Prok-RING_1:  Prokaryotic RING finger family 1
Probab=30.49  E-value=39  Score=20.49  Aligned_cols=14  Identities=14%  Similarity=-0.012  Sum_probs=9.7

Q ss_pred             Cccccccccccccc
Q psy8702          20 NPGLSLYTKSFTWK   33 (194)
Q Consensus        20 ~~~c~~Cgk~f~~~   33 (194)
                      ...|+.||+.|...
T Consensus         5 ~~~C~~Cg~~~~~~   18 (54)
T PF14446_consen    5 GCKCPVCGKKFKDG   18 (54)
T ss_pred             CccChhhCCcccCC
Confidence            34578888887654


No 115
>PF13451 zf-trcl:  Probable zinc-binding domain
Probab=29.77  E-value=43  Score=19.87  Aligned_cols=17  Identities=29%  Similarity=0.681  Sum_probs=11.1

Q ss_pred             CCccCCcCCCCcccchH
Q psy8702          55 KPYECNSCGKVFRRNCD   71 (194)
Q Consensus        55 k~~~C~~C~k~f~~~~~   71 (194)
                      +.+.|..||..|.....
T Consensus         3 k~l~C~dCg~~FvfTa~   19 (49)
T PF13451_consen    3 KTLTCKDCGAEFVFTAG   19 (49)
T ss_pred             eeEEcccCCCeEEEehh
Confidence            45667777777766544


No 116
>PF05191 ADK_lid:  Adenylate kinase, active site lid;  InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=29.39  E-value=36  Score=18.64  Aligned_cols=12  Identities=0%  Similarity=-0.114  Sum_probs=8.0

Q ss_pred             cccccccccccc
Q psy8702          22 GLSLYTKSFTWK   33 (194)
Q Consensus        22 ~c~~Cgk~f~~~   33 (194)
                      .|+.||+.|...
T Consensus         3 ~C~~Cg~~Yh~~   14 (36)
T PF05191_consen    3 ICPKCGRIYHIE   14 (36)
T ss_dssp             EETTTTEEEETT
T ss_pred             CcCCCCCccccc
Confidence            467777777654


No 117
>PRK14559 putative protein serine/threonine phosphatase; Provisional
Probab=29.19  E-value=83  Score=29.13  Aligned_cols=8  Identities=13%  Similarity=0.202  Sum_probs=3.9

Q ss_pred             CChhhhCCC
Q psy8702         130 EPSDMLGRR  138 (194)
Q Consensus       130 ~C~~~Cgk~  138 (194)
                      -|++ ||..
T Consensus        43 fC~~-CG~~   50 (645)
T PRK14559         43 HCPN-CGAE   50 (645)
T ss_pred             cccc-cCCc
Confidence            4554 5544


No 118
>PF01286 XPA_N:  XPA protein N-terminal;  InterPro: IPR022652 Xeroderma pigmentosum (XP) [] is a human autosomal recessive disease, characterised by a high incidence of sunlight-induced skin cancer. Skin cells of individual's with this condition are hypersensitive to ultraviolet light, due to defects in the incision step of DNA excision repair. There are a minimum of seven genetic complementation groups involved in this pathway: XP-A to XP-G. XP-A is the most severe form of the disease and is due to defects in a 30 kDa nuclear protein called XPA (or XPAC) []. The sequence of the XPA protein is conserved from higher eukaryotes [] to yeast (gene RAD14) []. XPA is a hydrophilic protein of 247 to 296 amino-acid residues which has a C4-type zinc finger motif in its central section. This entry contains the zinc-finger containing region in the XPA protein. It is found N-terminal to PF05181 from PFAM ; PDB: 1D4U_A 1XPA_A.
Probab=29.05  E-value=25  Score=19.14  Aligned_cols=14  Identities=21%  Similarity=0.112  Sum_probs=7.4

Q ss_pred             Cccccccccccccc
Q psy8702          20 NPGLSLYTKSFTWK   33 (194)
Q Consensus        20 ~~~c~~Cgk~f~~~   33 (194)
                      .+.|.+||+.|..+
T Consensus         3 ~~~C~eC~~~f~dS   16 (34)
T PF01286_consen    3 YPKCDECGKPFMDS   16 (34)
T ss_dssp             -EE-TTT--EES-S
T ss_pred             CchHhHhCCHHHHH
Confidence            46789999999885


No 119
>KOG2593|consensus
Probab=28.96  E-value=39  Score=29.41  Aligned_cols=39  Identities=15%  Similarity=0.283  Sum_probs=24.7

Q ss_pred             CCCCCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCC
Q psy8702          16 KDSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKV   65 (194)
Q Consensus        16 ~~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~   65 (194)
                      -....|.|+.|.+.|+.-           -.-+..-...-.|.|..|+-.
T Consensus       124 t~~~~Y~Cp~C~kkyt~L-----------ea~~L~~~~~~~F~C~~C~ge  162 (436)
T KOG2593|consen  124 TNVAGYVCPNCQKKYTSL-----------EALQLLDNETGEFHCENCGGE  162 (436)
T ss_pred             cccccccCCccccchhhh-----------HHHHhhcccCceEEEecCCCc
Confidence            345668899999888663           333333333456888888753


No 120
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=28.32  E-value=26  Score=32.80  Aligned_cols=9  Identities=0%  Similarity=-0.500  Sum_probs=4.0

Q ss_pred             ccccccccc
Q psy8702          23 LSLYTKSFT   31 (194)
Q Consensus        23 c~~Cgk~f~   31 (194)
                      |..||..+.
T Consensus       438 C~~Cg~v~~  446 (730)
T COG1198         438 CRDCGYIAE  446 (730)
T ss_pred             cccCCCccc
Confidence            444444433


No 121
>PF01363 FYVE:  FYVE zinc finger;  InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=27.85  E-value=31  Score=21.38  Aligned_cols=16  Identities=25%  Similarity=0.312  Sum_probs=7.0

Q ss_pred             CCCCcccccccccccc
Q psy8702          17 DSVNPGLSLYTKSFTW   32 (194)
Q Consensus        17 ~~~~~~c~~Cgk~f~~   32 (194)
                      +.+...|..|++.|..
T Consensus         6 d~~~~~C~~C~~~F~~   21 (69)
T PF01363_consen    6 DSEASNCMICGKKFSL   21 (69)
T ss_dssp             GGG-SB-TTT--B-BS
T ss_pred             CCCCCcCcCcCCcCCC
Confidence            3455678888888843


No 122
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=26.98  E-value=39  Score=24.62  Aligned_cols=25  Identities=36%  Similarity=0.296  Sum_probs=19.6

Q ss_pred             cccccccccccccccccccccchHHHHHhHcCCCCC
Q psy8702          21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKP   56 (194)
Q Consensus        21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~   56 (194)
                      ..|-++||.|.           +|++|+.+|.+--|
T Consensus        77 IicLEDGkkfK-----------SLKRHL~t~~gmTP  101 (148)
T COG4957          77 IICLEDGKKFK-----------SLKRHLTTHYGLTP  101 (148)
T ss_pred             EEEeccCcchH-----------HHHHHHhcccCCCH
Confidence            46889999994           48999999877543


No 123
>PRK03976 rpl37ae 50S ribosomal protein L37Ae; Reviewed
Probab=26.86  E-value=25  Score=23.70  Aligned_cols=12  Identities=42%  Similarity=1.093  Sum_probs=6.8

Q ss_pred             CccCCcCCCCcc
Q psy8702          56 PYECNSCGKVFR   67 (194)
Q Consensus        56 ~~~C~~C~k~f~   67 (194)
                      .+.|..|++.|.
T Consensus        54 IW~C~~C~~~~A   65 (90)
T PRK03976         54 IWECRKCGAKFA   65 (90)
T ss_pred             EEEcCCCCCEEe
Confidence            456666665554


No 124
>COG1655 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=26.25  E-value=16  Score=29.13  Aligned_cols=30  Identities=27%  Similarity=0.449  Sum_probs=18.5

Q ss_pred             CCCccCCcCCCCcccchHHHHhhhhcCCCCC
Q psy8702          54 HKPYECNSCGKVFRRNCDLRRHALTHAVGDV   84 (194)
Q Consensus        54 ~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k   84 (194)
                      ++...|++|+-.|....-+..-.++-+ |+-
T Consensus        17 kk~ieCPvC~tkFkkeev~tgsiRiia-gDl   46 (267)
T COG1655          17 KKTIECPVCNTKFKKEEVKTGSIRIIA-GDL   46 (267)
T ss_pred             hceeccCcccchhhhhheeccceeEec-ccc
Confidence            456778888877776655555555544 443


No 125
>PF06906 DUF1272:  Protein of unknown function (DUF1272);  InterPro: IPR010696 This family consists of several hypothetical bacterial proteins of around 80 residues in length. This family contains a number of conserved cysteine residues and its function is unknown.
Probab=25.94  E-value=31  Score=21.09  Aligned_cols=27  Identities=7%  Similarity=-0.018  Sum_probs=17.9

Q ss_pred             CCCCCccccccccccCCCCCCCCCCCC
Q psy8702          85 PPESSLAESIGSLKRDNLPPAPSPTGS  111 (194)
Q Consensus        85 ~~~c~~c~~~~~~~~~~~~~~C~k~F~  111 (194)
                      .|+|.+|..+........|++||=.|.
T Consensus        26 SfECTFC~~C~e~~l~~~CPNCgGelv   52 (57)
T PF06906_consen   26 SFECTFCADCAETMLNGVCPNCGGELV   52 (57)
T ss_pred             eEeCcccHHHHHHHhcCcCcCCCCccc
Confidence            478888876555555677777776554


No 126
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=25.69  E-value=46  Score=22.47  Aligned_cols=28  Identities=25%  Similarity=0.541  Sum_probs=19.8

Q ss_pred             CCCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccc
Q psy8702          54 HKPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAE   92 (194)
Q Consensus        54 ~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~   92 (194)
                      .+|-.|..||..|....           -.+|..|+.|.
T Consensus        56 v~Pa~CkkCGfef~~~~-----------ik~pSRCP~CK   83 (97)
T COG3357          56 VRPARCKKCGFEFRDDK-----------IKKPSRCPKCK   83 (97)
T ss_pred             ecChhhcccCccccccc-----------cCCcccCCcch
Confidence            36788999998888731           24567777775


No 127
>PF15269 zf-C2H2_7:  Zinc-finger
Probab=25.45  E-value=1.1e+02  Score=17.77  Aligned_cols=24  Identities=17%  Similarity=-0.013  Sum_probs=16.0

Q ss_pred             CCCcccccccccccccccccccccchHHHHHh
Q psy8702          18 SVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLL   49 (194)
Q Consensus        18 ~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~   49 (194)
                      ..+|.|=.|.-+...+        +.|-.||+
T Consensus        18 p~~ykcfqcpftc~~k--------shl~nhmk   41 (54)
T PF15269_consen   18 PFKYKCFQCPFTCNEK--------SHLFNHMK   41 (54)
T ss_pred             CccceeecCCcccchH--------HHHHHHHH
Confidence            3456777888887788        55555554


No 128
>PF13878 zf-C2H2_3:  zinc-finger of acetyl-transferase ESCO
Probab=25.32  E-value=76  Score=17.81  Aligned_cols=13  Identities=0%  Similarity=-0.202  Sum_probs=8.3

Q ss_pred             ccccccccccccc
Q psy8702          21 PGLSLYTKSFTWK   33 (194)
Q Consensus        21 ~~c~~Cgk~f~~~   33 (194)
                      -.|+.||..|...
T Consensus        14 ~~C~~CgM~Y~~~   26 (41)
T PF13878_consen   14 TTCPTCGMLYSPG   26 (41)
T ss_pred             cCCCCCCCEECCC
Confidence            3577777776554


No 129
>KOG4124|consensus
Probab=25.20  E-value=14  Score=31.28  Aligned_cols=58  Identities=22%  Similarity=0.291  Sum_probs=33.8

Q ss_pred             CCCccc--ccccccccccccccccc-----------cchHHHHHhHcCCCCCccCCcCCCCcccchHHHHh
Q psy8702          18 SVNPGL--SLYTKSFTWKNLLTNVL-----------TSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLRRH   75 (194)
Q Consensus        18 ~~~~~c--~~Cgk~f~~~~~L~~H~-----------~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H   75 (194)
                      ...|.|  +.|.+.+...+.|..|.           ...-+-|+-.....|+|.|++|.+.+.....|..|
T Consensus       347 ~~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i~~~s~~~~ph~~~~~~nk~~r~~i~~~~~k~~~~l~~~  417 (442)
T KOG4124|consen  347 DKPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPITTPTPAPIPHQGFVVENKPYRCEVCSKRYKNLNGLKYH  417 (442)
T ss_pred             cCCCCCCCCcchhhcccCcceeeccccCcCCCCCCCCCCCCCcceeeeccCcccChhhhhhhccCCCCCce
Confidence            455666  89999999997776651           01111122222335677777777766665554444


No 130
>PF04959 ARS2:  Arsenite-resistance protein 2;  InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=24.63  E-value=37  Score=26.73  Aligned_cols=28  Identities=29%  Similarity=0.692  Sum_probs=20.9

Q ss_pred             CCCCccCCcCCCCcccchHHHHhhhhcC
Q psy8702          53 DHKPYECNSCGKVFRRNCDLRRHALTHA   80 (194)
Q Consensus        53 ~~k~~~C~~C~k~f~~~~~L~~H~~~h~   80 (194)
                      ++.-|.|..|+|.|.-...+.+|+..-+
T Consensus        74 ~~~K~~C~lc~KlFkg~eFV~KHI~nKH  101 (214)
T PF04959_consen   74 DEDKWRCPLCGKLFKGPEFVRKHIFNKH  101 (214)
T ss_dssp             SSEEEEE-SSS-EESSHHHHHHHHHHH-
T ss_pred             cCCEECCCCCCcccCChHHHHHHHhhcC
Confidence            3446999999999999999999987544


No 131
>TIGR00244 transcriptional regulator NrdR. Members of this almost entirely bacterial family contain an ATP cone domain (PFAM:PF03477). There is never more than one member per genome. Common gene symbols given include nrdR, ybaD, ribX and ytcG. The member from Streptomyces coelicolor is found upstream in the operon of the class II oxygen-independent ribonucleotide reductase gene nrdJ and was shown to repress nrdJ expression. Many members of this family are found near genes for riboflavin biosynthesis in Gram-negative bacteria, suggesting a role in that pathway. However, a phylogenetic profiling study associates members of this family with the presence of a palindromic signal with consensus acaCwAtATaTwGtgt, termed the NrdR-box, an upstream element for most operons for ribonucleotide reductase of all three classes in bacterial genomes.
Probab=24.57  E-value=44  Score=24.68  Aligned_cols=11  Identities=18%  Similarity=-0.055  Sum_probs=6.1

Q ss_pred             CCChhhhCCCcc
Q psy8702         129 LEPSDMLGRRSR  140 (194)
Q Consensus       129 ~~C~~~Cgk~f~  140 (194)
                      =+|.. ||++|+
T Consensus        29 ReC~~-C~~RFT   39 (147)
T TIGR00244        29 RECLE-CHERFT   39 (147)
T ss_pred             ccCCc-cCCccc
Confidence            45654 666644


No 132
>KOG1813|consensus
Probab=24.41  E-value=48  Score=27.45  Aligned_cols=45  Identities=16%  Similarity=0.140  Sum_probs=23.0

Q ss_pred             CCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccccccccccCCCCCCCCCC
Q psy8702          55 KPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAESIGSLKRDNLPPAPSPT  109 (194)
Q Consensus        55 k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~~~~~~~~~~~~~~C~k~  109 (194)
                      -||.|.+|.+.|...        +-+ .-.-|-|..|.. ...++...|.+|++.
T Consensus       240 ~Pf~c~icr~~f~~p--------Vvt-~c~h~fc~~ca~-~~~qk~~~c~vC~~~  284 (313)
T KOG1813|consen  240 LPFKCFICRKYFYRP--------VVT-KCGHYFCEVCAL-KPYQKGEKCYVCSQQ  284 (313)
T ss_pred             CCccccccccccccc--------hhh-cCCceeehhhhc-cccccCCcceecccc
Confidence            467777777766643        112 234455655542 333444555555543


No 133
>PF12760 Zn_Tnp_IS1595:  Transposase zinc-ribbon domain;  InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=24.27  E-value=36  Score=19.54  Aligned_cols=9  Identities=33%  Similarity=1.114  Sum_probs=4.3

Q ss_pred             CCccCCcCC
Q psy8702          55 KPYECNSCG   63 (194)
Q Consensus        55 k~~~C~~C~   63 (194)
                      ..|.|..|+
T Consensus        36 ~~~~C~~C~   44 (46)
T PF12760_consen   36 GRYRCKACR   44 (46)
T ss_pred             CeEECCCCC
Confidence            345555544


No 134
>PF13824 zf-Mss51:  Zinc-finger of mitochondrial splicing suppressor 51
Probab=24.09  E-value=64  Score=19.64  Aligned_cols=14  Identities=21%  Similarity=0.218  Sum_probs=9.4

Q ss_pred             CCCCCCCChhhhCCC
Q psy8702         124 SEERSLEPSDMLGRR  138 (194)
Q Consensus       124 ~~ekp~~C~~~Cgk~  138 (194)
                      .+.-.|.|+. ||-.
T Consensus        10 ~~~v~~~Cp~-cGip   23 (55)
T PF13824_consen   10 PAHVNFECPD-CGIP   23 (55)
T ss_pred             ccccCCcCCC-CCCc
Confidence            3455688886 8866


No 135
>PF12013 DUF3505:  Protein of unknown function (DUF3505);  InterPro: IPR022698  This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains. 
Probab=23.33  E-value=53  Score=22.53  Aligned_cols=24  Identities=29%  Similarity=0.668  Sum_probs=21.4

Q ss_pred             ccC----CcCCCCcccchHHHHhhhhcC
Q psy8702          57 YEC----NSCGKVFRRNCDLRRHALTHA   80 (194)
Q Consensus        57 ~~C----~~C~k~f~~~~~L~~H~~~h~   80 (194)
                      |.|    ..|+....+...+..|.+.++
T Consensus        81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~H  108 (109)
T PF12013_consen   81 YRCQCDPPHCGYITRSKKTMRKHWRKEH  108 (109)
T ss_pred             eeeecCCCCCCcEeccHHHHHHHHHHhc
Confidence            899    899999999999999988765


No 136
>COG1327 Predicted transcriptional regulator, consists of a Zn-ribbon and ATP-cone domains [Transcription]
Probab=23.14  E-value=52  Score=24.45  Aligned_cols=11  Identities=27%  Similarity=0.187  Sum_probs=6.2

Q ss_pred             CCChhhhCCCcc
Q psy8702         129 LEPSDMLGRRSR  140 (194)
Q Consensus       129 ~~C~~~Cgk~f~  140 (194)
                      -+|+. ||++|+
T Consensus        29 ReC~~-C~~RFT   39 (156)
T COG1327          29 RECLE-CGERFT   39 (156)
T ss_pred             hcccc-cccccc
Confidence            45654 666644


No 137
>PRK12496 hypothetical protein; Provisional
Probab=23.02  E-value=46  Score=24.94  Aligned_cols=14  Identities=7%  Similarity=-0.192  Sum_probs=10.8

Q ss_pred             CCcccccccccccc
Q psy8702          19 VNPGLSLYTKSFTW   32 (194)
Q Consensus        19 ~~~~c~~Cgk~f~~   32 (194)
                      ..+.|..||+.|..
T Consensus       126 w~~~C~gC~~~~~~  139 (164)
T PRK12496        126 WRKVCKGCKKKYPE  139 (164)
T ss_pred             eeEECCCCCccccC
Confidence            44778999988865


No 138
>PF12773 DZR:  Double zinc ribbon
Probab=22.87  E-value=92  Score=17.82  Aligned_cols=27  Identities=11%  Similarity=0.145  Sum_probs=14.5

Q ss_pred             CCCCCCCCCCCCCChhhHhhhccCCCCCCCCChhhhCCC
Q psy8702         100 DNLPPAPSPTGSQSSEDIEIERTPSEERSLEPSDMLGRR  138 (194)
Q Consensus       100 ~~~~~~C~k~F~~~~~L~~H~~~H~~ekp~~C~~~Cgk~  138 (194)
                      ...|+.||....           -.......|+. ||..
T Consensus        12 ~~fC~~CG~~l~-----------~~~~~~~~C~~-Cg~~   38 (50)
T PF12773_consen   12 AKFCPHCGTPLP-----------PPDQSKKICPN-CGAE   38 (50)
T ss_pred             ccCChhhcCChh-----------hccCCCCCCcC-CcCC
Confidence            444566665554           12233466775 7765


No 139
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=22.08  E-value=49  Score=31.04  Aligned_cols=23  Identities=9%  Similarity=-0.197  Sum_probs=14.0

Q ss_pred             hhHhhhccCCCCCCCCChhhhCCC
Q psy8702         115 EDIEIERTPSEERSLEPSDMLGRR  138 (194)
Q Consensus       115 ~L~~H~~~H~~ekp~~C~~~Cgk~  138 (194)
                      .|.-|.--+....|..|++ ||-.
T Consensus       462 ~L~CH~Cg~~~~~p~~Cp~-Cgs~  484 (730)
T COG1198         462 QLRCHYCGYQEPIPQSCPE-CGSE  484 (730)
T ss_pred             eeEeCCCCCCCCCCCCCCC-CCCC
Confidence            3334443445567888887 8855


No 140
>PRK00432 30S ribosomal protein S27ae; Validated
Probab=20.84  E-value=68  Score=18.97  Aligned_cols=12  Identities=25%  Similarity=0.783  Sum_probs=9.0

Q ss_pred             CCccCCcCCCCc
Q psy8702          55 KPYECNSCGKVF   66 (194)
Q Consensus        55 k~~~C~~C~k~f   66 (194)
                      ..+.|..||..+
T Consensus        36 ~r~~C~~Cgyt~   47 (50)
T PRK00432         36 DRWHCGKCGYTE   47 (50)
T ss_pred             CcEECCCcCCEE
Confidence            468898888754


Done!