Query         psy838
Match_columns 152
No_of_seqs    228 out of 1476
Neff          7.7 
Searched_HMMs 46136
Date          Fri Aug 16 17:09:56 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy838.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/838hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG0402|consensus               99.9 3.2E-26 6.8E-31  148.7   2.3   45    1-45      1-45  (92)
  2 PTZ00255 60S ribosomal protein  99.9 3.7E-25 8.1E-30  147.4   3.5   45    1-45      1-45  (90)
  3 PRK03976 rpl37ae 50S ribosomal  99.9 9.5E-25   2E-29  145.4   3.5   45    1-45      1-45  (90)
  4 TIGR00280 L37a ribosomal prote  99.9 4.3E-24 9.2E-29  142.4   3.4   44    1-45      1-44  (91)
  5 KOG2488|consensus               99.9 1.9E-22   4E-27  150.8  10.4   98   54-152    53-150 (202)
  6 PF01780 Ribosomal_L37ae:  Ribo  99.9 7.8E-24 1.7E-28  141.1   2.4   44    2-45      1-44  (90)
  7 COG1997 RPL43A Ribosomal prote  99.8 9.9E-22 2.2E-26  129.2   3.0   44    1-45      1-44  (89)
  8 PRK10146 aminoalkylphosphonic   99.3 3.2E-12   7E-17   91.1   7.0   94   49-151     6-105 (144)
  9 KOG3216|consensus               99.3   2E-11 4.3E-16   88.9   9.5  102   49-152     6-114 (163)
 10 PHA00673 acetyltransferase dom  99.2 1.1E-10 2.4E-15   85.9  10.8  101   49-151     9-114 (154)
 11 PF13673 Acetyltransf_10:  Acet  99.2 1.8E-10 3.9E-15   79.1  10.7   87   56-151     1-93  (117)
 12 TIGR03827 GNAT_ablB putative b  99.2 1.9E-10   4E-15   91.3  10.2   97   47-151   116-212 (266)
 13 PRK10314 putative acyltransfer  99.1 6.7E-10 1.4E-14   81.4  10.5   97   46-151     6-103 (153)
 14 TIGR01575 rimI ribosomal-prote  99.1   8E-10 1.7E-14   76.7   8.9   83   56-151     1-83  (131)
 15 PHA01807 hypothetical protein   99.1 7.1E-10 1.5E-14   81.6   8.5   98   49-152     6-111 (153)
 16 PTZ00330 acetyltransferase; Pr  99.1 3.2E-10   7E-15   80.9   6.4   95   49-151     9-111 (147)
 17 PRK07757 acetyltransferase; Pr  99.0 2.1E-09 4.5E-14   77.5   9.3   91   48-151     3-94  (152)
 18 TIGR03448 mycothiol_MshD mycot  99.0   3E-09 6.4E-14   84.6  10.9   95   48-151     2-99  (292)
 19 PRK09831 putative acyltransfer  99.0 2.6E-09 5.6E-14   77.1   9.6   89   49-151     3-101 (147)
 20 TIGR02406 ectoine_EctA L-2,4-d  99.0 1.9E-09 4.1E-14   79.1   8.9   93   50-151     2-95  (157)
 21 PRK03624 putative acetyltransf  99.0 2.4E-09 5.1E-14   75.0   8.3   91   49-151     5-97  (140)
 22 PF00583 Acetyltransf_1:  Acety  99.0 2.3E-09   5E-14   69.3   7.6   49  103-151     4-54  (83)
 23 PRK09491 rimI ribosomal-protei  99.0   5E-09 1.1E-13   75.1   9.6   90   48-151     3-92  (146)
 24 TIGR01890 N-Ac-Glu-synth amino  99.0 3.6E-09 7.7E-14   89.3   9.9   93   46-151   282-376 (429)
 25 PF13508 Acetyltransf_7:  Acety  99.0 4.7E-09   1E-13   68.0   7.9   53   93-151     3-55  (79)
 26 PLN02706 glucosamine 6-phospha  98.9 8.4E-09 1.8E-13   74.2   9.4   98   48-151     8-114 (150)
 27 PRK07922 N-acetylglutamate syn  98.9 6.3E-09 1.4E-13   77.3   8.8   93   48-151     7-99  (169)
 28 PF13527 Acetyltransf_9:  Acety  98.9 1.9E-09 4.1E-14   75.3   5.6   92   50-151     3-101 (127)
 29 PRK05279 N-acetylglutamate syn  98.9 6.4E-09 1.4E-13   88.0   9.8   94   46-151   294-388 (441)
 30 PRK10140 putative acetyltransf  98.9 5.6E-09 1.2E-13   75.4   8.1   98   49-151     6-107 (162)
 31 PLN02825 amino-acid N-acetyltr  98.9 7.1E-09 1.5E-13   89.3   9.9   94   46-151   367-461 (515)
 32 TIGR03103 trio_acet_GNAT GNAT-  98.9   1E-08 2.2E-13   89.2  10.8   92   53-151    90-184 (547)
 33 PRK10514 putative acetyltransf  98.9 1.2E-08 2.6E-13   72.8   9.3   94   48-151     3-98  (145)
 34 COG0456 RimI Acetyltransferase  98.9 1.5E-08 3.2E-13   74.3   8.2   98   49-151    14-120 (177)
 35 PRK12308 bifunctional arginino  98.8 3.6E-08 7.8E-13   86.7   9.4   92   48-151   465-556 (614)
 36 TIGR01686 FkbH FkbH-like domai  98.8 7.8E-08 1.7E-12   78.3  10.6   97   48-151   188-286 (320)
 37 COG1247 Sortase and related ac  98.7 9.5E-08 2.1E-12   71.3   9.2  101   49-152     4-111 (169)
 38 TIGR02382 wecD_rffC TDP-D-fuco  98.7 1.2E-07 2.7E-12   71.4  10.0   99   46-151    43-152 (191)
 39 PRK10975 TDP-fucosamine acetyl  98.7 2.1E-07 4.7E-12   70.1  10.5   97   48-151    48-155 (194)
 40 COG3153 Predicted acetyltransf  98.7   1E-07 2.2E-12   71.3   8.2   93   49-151     6-104 (171)
 41 PRK01346 hypothetical protein;  98.7   2E-07 4.3E-12   77.9  10.6   94   48-151     8-108 (411)
 42 cd04301 NAT_SF N-Acyltransfera  98.7 9.9E-08 2.1E-12   56.9   6.3   48  103-151     7-54  (65)
 43 PRK13688 hypothetical protein;  98.7 3.6E-07 7.7E-12   67.4  10.1   56   92-149    44-106 (156)
 44 PF13523 Acetyltransf_8:  Acety  98.6 1.4E-07   3E-12   68.0   7.1   97   51-151     3-107 (152)
 45 PRK10562 putative acetyltransf  98.6   5E-07 1.1E-11   64.7   9.8   88   49-150     2-96  (145)
 46 KOG3139|consensus               98.6 6.4E-07 1.4E-11   66.0   9.5   98   46-151    16-113 (165)
 47 COG1246 ArgA N-acetylglutamate  98.5 1.5E-07 3.4E-12   68.9   5.5   93   49-152     3-95  (153)
 48 PF13420 Acetyltransf_4:  Acety  98.5 3.5E-07 7.6E-12   65.7   7.4   95   50-151     2-105 (155)
 49 TIGR03448 mycothiol_MshD mycot  98.5 7.8E-07 1.7E-11   70.7  10.0   91   56-151   160-255 (292)
 50 TIGR03585 PseH pseudaminic aci  98.5 2.6E-06 5.7E-11   61.1  10.3   99   48-151     2-104 (156)
 51 PRK15130 spermidine N1-acetylt  98.4 2.7E-06 5.9E-11   63.3   9.8  100   47-151     7-111 (186)
 52 PRK10809 ribosomal-protein-S5-  98.4 6.7E-06 1.4E-10   61.6  10.7   99   47-151    18-132 (194)
 53 PF13302 Acetyltransf_3:  Acety  98.2 2.4E-06 5.2E-11   60.1   5.6  101   48-151     3-112 (142)
 54 PF14542 Acetyltransf_CG:  GCN5  98.2 7.4E-06 1.6E-10   53.6   7.4   45  103-151     7-51  (78)
 55 PRK10151 ribosomal-protein-L7/  98.2   3E-05 6.6E-10   57.2  11.5  101   47-151    11-121 (179)
 56 cd02169 Citrate_lyase_ligase C  98.1 4.5E-06 9.7E-11   67.6   5.9   41  103-151    14-54  (297)
 57 KOG3396|consensus               98.0 4.5E-05 9.8E-10   54.9   8.5   73   79-151    34-114 (150)
 58 COG2153 ElaA Predicted acyltra  98.0 1.1E-05 2.5E-10   58.6   5.2   98   46-151     7-105 (155)
 59 PF13718 GNAT_acetyltr_2:  GNAT  97.9 8.4E-05 1.8E-09   56.8   8.5   70   80-151    14-119 (196)
 60 COG2388 Predicted acetyltransf  97.9 3.9E-05 8.4E-10   52.5   5.9   56   92-152    14-69  (99)
 61 KOG3397|consensus               97.9   8E-05 1.7E-09   55.9   8.0   57   95-152    57-113 (225)
 62 TIGR00124 cit_ly_ligase [citra  97.9 3.6E-05 7.9E-10   63.3   6.5   49   93-151    31-79  (332)
 63 PF12568 DUF3749:  Acetyltransf  97.8 0.00059 1.3E-08   48.6  10.5   86   47-151     5-90  (128)
 64 KOG3235|consensus               97.8 0.00011 2.3E-09   54.4   6.8   51   97-148    45-97  (193)
 65 TIGR01211 ELP3 histone acetylt  97.7 0.00012 2.6E-09   63.5   6.7   59   93-151   411-486 (522)
 66 PF08445 FR47:  FR47-like prote  97.6 0.00027 5.8E-09   46.9   6.4   42  105-150     8-49  (86)
 67 KOG3234|consensus               97.5 0.00016 3.5E-09   53.3   5.0   51  101-151    48-98  (173)
 68 COG1444 Predicted P-loop ATPas  97.4   0.001 2.3E-08   59.8   9.1   71   80-151   457-560 (758)
 69 PF13480 Acetyltransf_6:  Acety  97.3   0.011 2.5E-07   41.0  11.9  114   20-151     5-123 (142)
 70 COG0454 WecD Histone acetyltra  97.2 0.00019 4.2E-09   45.8   2.0   24  128-151    87-110 (156)
 71 PF05301 Mec-17:  Touch recepto  97.1  0.0026 5.6E-08   44.9   6.7   47  104-150    18-74  (120)
 72 PF01233 NMT:  Myristoyl-CoA:pr  96.9   0.014 3.1E-07   43.2   9.3  102   47-151    27-139 (162)
 73 PF01853 MOZ_SAS:  MOZ/SAS fami  96.8  0.0034 7.3E-08   47.7   5.7   57   91-150    51-108 (188)
 74 COG3981 Predicted acetyltransf  96.7  0.0038 8.3E-08   46.6   5.4   54   92-152    68-127 (174)
 75 PF13880 Acetyltransf_13:  ESCO  96.5  0.0021 4.6E-08   41.2   2.4   27  123-149     6-32  (70)
 76 PLN03238 probable histone acet  96.4  0.0099 2.1E-07   47.9   6.2   55   93-150   128-183 (290)
 77 KOG3138|consensus               96.4  0.0035 7.6E-08   47.6   3.4   95   47-151    17-118 (187)
 78 PLN03239 histone acetyltransfe  96.0   0.015 3.2E-07   48.0   5.6   55   93-150   186-241 (351)
 79 COG1670 RimL Acetyltransferase  96.0   0.047   1E-06   39.3   7.6   47  104-151    77-124 (187)
 80 COG3393 Predicted acetyltransf  96.0   0.015 3.3E-07   46.3   5.2   52   92-148   176-227 (268)
 81 PF12746 GNAT_acetyltran:  GNAT  95.9   0.026 5.6E-07   45.2   6.2   53   92-151   165-217 (265)
 82 PF06852 DUF1248:  Protein of u  95.9   0.085 1.8E-06   39.9   8.6   63   85-148    37-104 (181)
 83 PLN00104 MYST -like histone ac  95.8   0.013 2.8E-07   50.0   4.4   54   93-149   279-333 (450)
 84 PTZ00064 histone acetyltransfe  95.8   0.017 3.8E-07   49.7   5.2   56   92-150   356-412 (552)
 85 KOG4601|consensus               95.6   0.091   2E-06   41.2   8.0   45  104-149    81-135 (264)
 86 TIGR03694 exosort_acyl putativ  95.3    0.34 7.3E-06   38.1  10.5   83   50-137    12-126 (241)
 87 PRK13834 putative autoinducer   95.2    0.42 9.1E-06   36.8  10.5   96   50-151    11-135 (207)
 88 COG5628 Predicted acetyltransf  95.0   0.076 1.7E-06   37.7   5.4   55   93-150    38-93  (143)
 89 PF00765 Autoind_synth:  Autoin  94.8    0.64 1.4E-05   35.1  10.3   93   51-151     4-125 (182)
 90 KOG2747|consensus               94.8   0.037 8.1E-07   46.4   3.9   54   91-148   233-286 (396)
 91 cd04264 DUF619-NAGS DUF619 dom  94.4    0.21 4.6E-06   34.1   6.3   44  105-149    18-61  (99)
 92 COG5630 ARG2 Acetylglutamate s  94.4    0.33 7.2E-06   40.8   8.5   99   37-149   329-428 (495)
 93 TIGR03243 arg_catab_AOST argin  93.4    0.27 5.9E-06   40.6   6.2   93   50-145     3-140 (335)
 94 KOG2779|consensus               93.3    0.28   6E-06   40.9   6.2  100   52-151    89-196 (421)
 95 TIGR03244 arg_catab_AstA argin  93.3    0.27 5.9E-06   40.6   6.2   93   50-145     3-140 (336)
 96 PF13444 Acetyltransf_5:  Acety  93.1    0.59 1.3E-05   31.5   6.7   60   84-144    21-100 (101)
 97 PRK10456 arginine succinyltran  93.0    0.31 6.7E-06   40.4   6.1   93   50-145     5-142 (344)
 98 PF08444 Gly_acyl_tr_C:  Aralky  92.2    0.18 3.8E-06   33.8   3.1   41  103-150     7-47  (89)
 99 PF04958 AstA:  Arginine N-succ  91.6    0.89 1.9E-05   37.7   7.2   93   50-145     5-144 (342)
100 TIGR03245 arg_AOST_alph argini  91.5    0.77 1.7E-05   38.0   6.6   93   50-145     3-141 (336)
101 cd04265 DUF619-NAGS-U DUF619 d  91.4    0.86 1.9E-05   31.1   5.8   29  121-149    33-61  (99)
102 COG3053 CitC Citrate lyase syn  91.3    0.65 1.4E-05   37.9   5.8   51   93-151    35-85  (352)
103 COG3375 Uncharacterized conser  91.0     1.7 3.8E-05   34.2   7.8   88   52-145     9-97  (266)
104 COG5027 SAS2 Histone acetyltra  91.0    0.14   3E-06   42.5   1.8   51   93-146   235-286 (395)
105 KOG4144|consensus               91.0     0.2 4.2E-06   37.2   2.4   43  104-147    71-126 (190)
106 PF13465 zf-H2C2_2:  Zinc-finge  90.9    0.19   4E-06   25.7   1.7   23   22-46      2-24  (26)
107 KOG2696|consensus               89.8    0.95 2.1E-05   37.9   5.8   24  125-148   220-243 (403)
108 PF04768 DUF619:  Protein of un  89.7     3.5 7.5E-05   30.8   8.3   92   46-149    22-115 (170)
109 PF09924 DUF2156:  Uncharacteri  89.1     2.8 6.2E-05   33.5   8.0  114   20-151   117-234 (299)
110 COG3818 Predicted acetyltransf  88.7    0.59 1.3E-05   33.8   3.3   32  120-151    82-113 (167)
111 KOG2036|consensus               88.1    0.39 8.5E-06   43.5   2.5   28  123-150   615-642 (1011)
112 COG4888 Uncharacterized Zn rib  86.4     0.2 4.3E-06   34.2  -0.1   23   23-45      9-31  (104)
113 COG5092 NMT1 N-myristoyl trans  84.5     4.2 9.2E-05   33.6   6.5  101   51-151    86-194 (451)
114 PF07754 DUF1610:  Domain of un  84.2     0.5 1.1E-05   23.9   0.7   12   33-44     13-24  (24)
115 KOG4135|consensus               83.9     1.4 3.1E-05   32.5   3.3   32  120-151   105-136 (185)
116 PF00096 zf-C2H2:  Zinc finger,  83.9    0.43 9.2E-06   23.1   0.4   14   37-50      1-14  (23)
117 PHA00616 hypothetical protein   83.1    0.14 3.1E-06   29.8  -1.7   15   36-50      1-15  (44)
118 COG3138 AstA Arginine/ornithin  82.7     3.7   8E-05   33.4   5.5   94   50-144     5-141 (336)
119 KOG3014|consensus               79.8      12 0.00025   29.9   7.2   27  122-148   183-209 (257)
120 PF13894 zf-C2H2_4:  C2H2-type   79.8    0.82 1.8E-05   21.7   0.6   13   37-49      1-13  (24)
121 PF09237 GAGA:  GAGA factor;  I  78.2    0.22 4.7E-06   30.0  -2.1   20   30-49     18-37  (54)
122 PF09538 FYDLN_acid:  Protein o  76.3     1.4 2.9E-05   30.6   1.1   14   35-48      8-21  (108)
123 KOG2270|consensus               76.2     7.8 0.00017   33.3   5.7   65   10-74    190-283 (520)
124 PF02945 Endonuclease_7:  Recom  75.8     1.5 3.2E-05   28.8   1.1   30   13-45      2-31  (81)
125 COG1243 ELP3 Histone acetyltra  74.9     4.1 8.9E-05   35.3   3.8   50  103-152   415-480 (515)
126 COG4552 Eis Predicted acetyltr  73.8     3.2   7E-05   34.7   2.8   54   95-150    41-98  (389)
127 cd04266 DUF619-NAGS-FABP DUF61  73.6      27 0.00058   24.2   7.0   30  120-149    37-67  (108)
128 PF14354 Lar_restr_allev:  Rest  72.4     1.7 3.6E-05   26.4   0.7    9   37-45      4-12  (61)
129 PF09151 DUF1936:  Domain of un  72.3     1.7 3.6E-05   23.4   0.6   10   36-45      1-10  (36)
130 PF04438 zf-HIT:  HIT zinc fing  72.2     1.5 3.3E-05   23.2   0.4   12   34-45     11-22  (30)
131 PHA02768 hypothetical protein;  71.4    0.92   2E-05   27.7  -0.6   18   35-52      4-21  (55)
132 PF04339 DUF482:  Protein of un  71.4      21 0.00045   30.0   7.2   66   46-115   204-271 (370)
133 PF14447 Prok-RING_4:  Prokaryo  71.2     2.4 5.1E-05   25.8   1.1   12   39-50     42-53  (55)
134 PF04606 Ogr_Delta:  Ogr/Delta-  69.9       2 4.2E-05   25.1   0.5   15   38-52      1-17  (47)
135 TIGR02300 FYDLN_acid conserved  68.4     2.7 5.9E-05   30.0   1.1   11   35-45      8-18  (129)
136 PF14787 zf-CCHC_5:  GAG-polypr  68.0     3.1 6.7E-05   23.1   1.1   15   37-51      3-17  (36)
137 KOG2593|consensus               67.3     2.7 5.9E-05   35.8   1.1   24   22-45    112-137 (436)
138 PRK09710 lar restriction allev  63.2     3.3 7.2E-05   26.0   0.6    8   38-45      8-15  (64)
139 PF06689 zf-C4_ClpX:  ClpX C4-t  63.0     3.3 7.2E-05   23.4   0.6    9   37-45      2-10  (41)
140 PF13912 zf-C2H2_6:  C2H2-type   62.9     3.2 6.9E-05   20.7   0.5   14   36-49      1-14  (27)
141 COG4068 Uncharacterized protei  62.4     9.5 0.00021   23.6   2.5   29   38-66     10-38  (64)
142 TIGR03655 anti_R_Lar restricti  62.2     3.4 7.3E-05   24.6   0.5    8   38-45      3-10  (53)
143 PF05605 zf-Di19:  Drought indu  62.2     6.1 0.00013   23.4   1.7   25   20-44     15-39  (54)
144 COG2401 ABC-type ATPase fused   60.7     7.4 0.00016   33.8   2.5   28  123-150   242-269 (593)
145 cd00350 rubredoxin_like Rubred  60.3     3.8 8.3E-05   22.0   0.5   12   34-45     15-26  (33)
146 PF12171 zf-C2H2_jaz:  Zinc-fin  59.6       6 0.00013   19.9   1.2   13   36-48      1-13  (27)
147 PF14353 CpXC:  CpXC protein     59.5       6 0.00013   27.7   1.5   20   26-45     28-47  (128)
148 COG5189 SFP1 Putative transcri  59.1     4.1 8.9E-05   33.7   0.7   17   35-51    397-413 (423)
149 PF09986 DUF2225:  Uncharacteri  58.6     9.4  0.0002   29.4   2.6   62    5-66     17-80  (214)
150 PF04423 Rad50_zn_hook:  Rad50   58.6      27 0.00058   20.6   4.1   34   22-64      9-42  (54)
151 PF08271 TF_Zn_Ribbon:  TFIIB z  58.5     4.9 0.00011   22.7   0.8    9   37-45      1-9   (43)
152 PRK14890 putative Zn-ribbon RN  58.1     4.8  0.0001   24.9   0.7   12   33-44     45-56  (59)
153 COG2888 Predicted Zn-ribbon RN  57.9     4.8  0.0001   24.9   0.7   12   33-44     47-58  (61)
154 PF09889 DUF2116:  Uncharacteri  57.8      17 0.00036   22.4   3.1   24   38-61      5-28  (59)
155 PRK09678 DNA-binding transcrip  57.7     6.2 0.00013   25.4   1.2   19   37-55      2-22  (72)
156 PF12294 DUF3626:  Protein of u  57.6      12 0.00027   30.4   3.1   22  125-146   191-212 (297)
157 PRK00398 rpoP DNA-directed RNA  57.5     3.4 7.5E-05   23.7   0.0   10   36-45     21-30  (46)
158 TIGR03019 pepcterm_femAB FemAB  57.1   1E+02  0.0022   24.8  10.4  112   16-151   134-248 (330)
159 smart00355 ZnF_C2H2 zinc finge  57.1     5.5 0.00012   18.8   0.7   12   37-48      1-12  (26)
160 PF03119 DNA_ligase_ZBD:  NAD-d  57.0     5.6 0.00012   20.7   0.8   10   38-47      1-10  (28)
161 PRK14892 putative transcriptio  56.0     3.6 7.9E-05   28.1  -0.1   14   32-45     17-30  (99)
162 PF09862 DUF2089:  Protein of u  55.7      14  0.0003   25.9   2.8   31   36-66     12-48  (113)
163 PRK15312 antimicrobial resista  55.4      35 0.00076   27.9   5.4  116    7-144   127-252 (298)
164 COG1675 TFA1 Transcription ini  55.0     9.7 0.00021   28.7   2.0   13   33-45    129-141 (176)
165 COG3916 LasI N-acyl-L-homoseri  54.6   1E+02  0.0022   24.0  10.3   96   50-151    10-133 (209)
166 PF13913 zf-C2HC_2:  zinc-finge  54.0     5.9 0.00013   19.9   0.5   11   37-47      3-13  (25)
167 cd00729 rubredoxin_SM Rubredox  53.4     5.6 0.00012   21.5   0.4   12   34-45     16-27  (34)
168 smart00531 TFIIE Transcription  52.5      11 0.00023   27.3   1.9   15   33-47     96-110 (147)
169 COG2075 RPL24A Ribosomal prote  52.2       7 0.00015   24.7   0.8   14   36-49      3-16  (66)
170 COG2260 Predicted Zn-ribbon RN  52.2     7.8 0.00017   23.9   0.9   20   37-56     18-39  (59)
171 PF13240 zinc_ribbon_2:  zinc-r  52.0     9.4  0.0002   18.8   1.1    6   39-44     16-21  (23)
172 PRK13130 H/ACA RNA-protein com  50.8      10 0.00022   23.1   1.3   21   36-56     17-39  (56)
173 COG3677 Transposase and inacti  50.8      22 0.00047   25.3   3.2   21   34-54     51-73  (129)
174 PF04377 ATE_C:  Arginine-tRNA-  50.7      88  0.0019   22.2   7.8   45  103-151    47-91  (128)
175 PF05741 zf-nanos:  Nanos RNA b  50.6     4.9 0.00011   24.5  -0.1   23   23-45     20-42  (55)
176 PF13824 zf-Mss51:  Zinc-finger  50.2     6.7 0.00015   23.9   0.5   25   32-56     10-34  (55)
177 PF05129 Elf1:  Transcription e  50.1     6.6 0.00014   25.7   0.4   14   31-44     17-30  (81)
178 COG1592 Rubrerythrin [Energy p  49.7      10 0.00023   28.3   1.5   25   20-46    120-144 (166)
179 PF10058 DUF2296:  Predicted in  49.5     7.3 0.00016   23.5   0.5   15   30-44     38-52  (54)
180 PRK10144 formate-dependent nit  48.9      21 0.00046   25.4   2.9   54   35-97     39-92  (126)
181 PF07395 Mig-14:  Mig-14;  Inte  48.5      57  0.0012   26.2   5.5  115    7-144    97-222 (264)
182 PF09390 DUF1999:  Protein of u  47.8      65  0.0014   23.7   5.3   56   92-149    54-109 (161)
183 PF03604 DNA_RNApol_7kD:  DNA d  47.7     7.2 0.00016   21.0   0.3   17   29-45     10-26  (32)
184 PF09943 DUF2175:  Uncharacteri  47.5     5.4 0.00012   27.3  -0.3   13   35-47      1-13  (101)
185 COG4049 Uncharacterized protei  47.2     5.9 0.00013   24.3  -0.1   14   36-49     17-30  (65)
186 PF10571 UPF0547:  Uncharacteri  46.4      11 0.00024   19.2   0.9    8   37-44     15-22  (26)
187 smart00504 Ubox Modified RING   46.3      29 0.00063   20.5   3.0   25   15-46     20-45  (63)
188 TIGR03147 cyt_nit_nrfF cytochr  46.2      24 0.00051   25.2   2.8   54   35-97     39-92  (126)
189 PF03884 DUF329:  Domain of unk  46.2       8 0.00017   23.7   0.3   13   36-48      2-14  (57)
190 PHA00626 hypothetical protein   45.7     9.9 0.00021   23.3   0.7   15   32-46     19-33  (59)
191 PF06559 DCD:  2'-deoxycytidine  45.7      18 0.00039   30.2   2.4   34  103-139   323-356 (364)
192 PF02891 zf-MIZ:  MIZ/SP-RING z  45.1       9  0.0002   22.6   0.4   11   35-45     40-50  (50)
193 PRK00464 nrdR transcriptional   44.9      10 0.00023   27.9   0.8    7   38-44      2-8   (154)
194 PF13530 SCP2_2:  Sterol carrie  44.6 1.2E+02  0.0026   23.0   6.8   52   92-149    23-76  (218)
195 smart00659 RPOLCX RNA polymera  43.9     7.9 0.00017   22.3   0.1   17   29-45     12-28  (44)
196 COG3024 Uncharacterized protei  43.7      14 0.00031   23.1   1.2   16   33-48      4-19  (65)
197 PF04959 ARS2:  Arsenite-resist  42.3     4.7  0.0001   31.3  -1.4   15   33-47     74-88  (214)
198 PF10013 DUF2256:  Uncharacteri  41.8      12 0.00025   21.5   0.5   15   36-50      8-22  (42)
199 PF09855 DUF2082:  Nucleic-acid  41.5      12 0.00026   23.4   0.6    9   37-45      1-9   (64)
200 COG4391 Uncharacterized protei  41.2      12 0.00025   23.3   0.5   15   33-47     45-59  (62)
201 PRK00114 hslO Hsp33-like chape  41.1      19 0.00041   29.1   1.9   26   33-62    265-290 (293)
202 PF13248 zf-ribbon_3:  zinc-rib  41.0      13 0.00027   18.7   0.6   10   37-46      3-12  (26)
203 PF10588 NADH-G_4Fe-4S_3:  NADH  40.8      21 0.00046   20.0   1.6   22   23-45      1-22  (41)
204 PF14255 Cys_rich_CPXG:  Cystei  40.5     6.7 0.00015   23.6  -0.6    9   37-45      1-9   (52)
205 KOG2907|consensus               40.5     8.9 0.00019   26.8  -0.1   11   35-45     73-83  (116)
206 PF13909 zf-H2C2_5:  C2H2-type   40.4      13 0.00027   17.9   0.5    7   37-43      1-7   (24)
207 PRK06266 transcription initiat  40.3      21 0.00046   26.8   1.9   13   33-45    114-126 (178)
208 PF13945 NST1:  Salt tolerance   40.2      26 0.00057   26.7   2.4   34   32-69    137-170 (190)
209 KOG2535|consensus               40.1      16 0.00035   30.8   1.3   21  132-152   497-517 (554)
210 PF08772 NOB1_Zn_bind:  Nin one  39.8      13 0.00027   24.0   0.5   13   33-45     21-33  (73)
211 PF10122 Mu-like_Com:  Mu-like   39.4      18 0.00039   21.7   1.1   22   24-45     12-33  (51)
212 KOG4317|consensus               38.8      14  0.0003   30.5   0.8   14   32-45     15-28  (383)
213 KOG3965|consensus               38.8      12 0.00026   27.6   0.3   26    8-33    117-142 (160)
214 PF11432 DUF3197:  Protein of u  38.5      19 0.00041   25.0   1.3   49   10-68     46-97  (113)
215 PF05191 ADK_lid:  Adenylate ki  38.4      12 0.00027   20.5   0.3   12   37-48      2-13  (36)
216 COG1998 RPS31 Ribosomal protei  37.9      16 0.00034   21.8   0.7   12   33-44     16-27  (51)
217 cd00730 rubredoxin Rubredoxin;  37.6      17 0.00037   21.5   0.9   13   33-45     31-43  (50)
218 smart00834 CxxC_CXXC_SSSS Puta  37.2      15 0.00033   20.0   0.6   10   36-45      5-14  (41)
219 cd00246 RabGEF Nucleotide exch  37.2      18 0.00039   24.9   1.0   13   36-48      2-14  (103)
220 PRK02983 lysS lysyl-tRNA synth  37.1   4E+02  0.0087   25.9  10.7  114   20-151   357-473 (1094)
221 PF10892 DUF2688:  Protein of u  36.2     5.7 0.00012   24.3  -1.4   30   27-59      4-35  (60)
222 PF13878 zf-C2H2_3:  zinc-finge  36.1      20 0.00044   20.2   1.0   16   36-51     13-28  (41)
223 TIGR00373 conserved hypothetic  35.8      16 0.00036   26.7   0.7   12   34-45    107-118 (158)
224 cd03173 DUF619-like DUF619 dom  35.8 1.1E+02  0.0023   20.8   4.6   30  120-149    31-60  (98)
225 TIGR02098 MJ0042_CXXC MJ0042 f  35.8      17 0.00038   19.6   0.6   10   36-45      2-11  (38)
226 TIGR01206 lysW lysine biosynth  35.6      18 0.00039   21.8   0.8   10   36-45      2-11  (54)
227 PF12874 zf-met:  Zinc-finger o  35.5      16 0.00034   17.6   0.4   12   37-48      1-12  (25)
228 COG4311 SoxD Sarcosine oxidase  35.3      24 0.00052   23.9   1.3    9   37-45      4-12  (97)
229 PF12760 Zn_Tnp_IS1595:  Transp  35.2      19  0.0004   20.6   0.7    9   37-45     19-27  (46)
230 PF08792 A2L_zn_ribbon:  A2L zi  34.9      18 0.00038   19.5   0.6   14   32-45     17-30  (33)
231 PF02591 DUF164:  Putative zinc  34.8      25 0.00055   20.9   1.3   12   34-45     44-55  (56)
232 PRK12380 hydrogenase nickel in  34.7      18  0.0004   25.0   0.8   11   35-45     85-95  (113)
233 PF14206 Cys_rich_CPCC:  Cystei  34.7      18 0.00038   23.6   0.6   10   36-45      1-10  (78)
234 COG1656 Uncharacterized conser  34.1      18 0.00038   27.0   0.6   30   32-66    126-155 (165)
235 COG5595 Zn-ribbon-containing,   33.6      28  0.0006   27.0   1.6   29   17-45    184-227 (256)
236 COG1693 Repressor of nif and g  33.5      49  0.0011   27.1   3.1   60    5-67     56-130 (325)
237 PF10071 DUF2310:  Zn-ribbon-co  33.4      31 0.00066   27.6   1.9   29   18-46    187-230 (258)
238 PRK01343 zinc-binding protein;  33.1      21 0.00045   21.9   0.7   12   35-46      8-19  (57)
239 KOG2462|consensus               33.0      22 0.00048   28.7   1.1   24   32-55    126-149 (279)
240 PF14690 zf-ISL3:  zinc-finger   32.9      22 0.00047   20.0   0.8   10   36-45      2-11  (47)
241 TIGR00100 hypA hydrogenase nic  32.8      20 0.00043   24.9   0.7   10   36-45     86-95  (115)
242 PF00571 CBS:  CBS domain CBS d  32.8      39 0.00085   19.2   1.9   32   80-112    17-48  (57)
243 PRK00418 DNA gyrase inhibitor;  32.7      23  0.0005   22.1   0.9   13   35-47      5-17  (62)
244 COG3877 Uncharacterized protei  32.6      39 0.00084   23.5   2.0   16   46-61     36-51  (122)
245 TIGR00244 transcriptional regu  32.5      21 0.00045   26.2   0.7    7   39-45     31-37  (147)
246 PF08209 Sgf11:  Sgf11 (transcr  32.4      20 0.00044   19.4   0.5   12   36-47      4-15  (33)
247 COG4469 CoiA Competence protei  32.3      20 0.00044   29.7   0.8   10   36-45     25-34  (342)
248 PRK03681 hypA hydrogenase nick  32.2      19 0.00041   25.0   0.5   10   36-45     87-96  (114)
249 smart00440 ZnF_C2C2 C2C2 Zinc   32.1      23 0.00049   19.8   0.7    9   37-45      1-9   (40)
250 smart00451 ZnF_U1 U1-like zinc  31.8      22 0.00049   18.5   0.7   16   35-50      2-17  (35)
251 PF00301 Rubredoxin:  Rubredoxi  31.7      13 0.00029   21.8  -0.3   16   36-51      1-16  (47)
252 PHA00732 hypothetical protein   31.7      23 0.00051   22.9   0.8   14   34-47     25-38  (79)
253 PF06221 zf-C2HC5:  Putative zi  31.7      51  0.0011   20.1   2.3   11   36-46     35-45  (57)
254 COG4847 Uncharacterized protei  30.9      15 0.00032   25.0  -0.2   17   33-49      3-19  (103)
255 PF01155 HypA:  Hydrogenase exp  30.8      19 0.00041   24.9   0.3    9   37-45     87-95  (113)
256 COG4338 Uncharacterized protei  30.5      15 0.00032   21.8  -0.2   16   36-51     12-27  (54)
257 TIGR01374 soxD sarcosine oxida  30.5      22 0.00049   23.5   0.6    8   38-45      3-10  (84)
258 KOG2879|consensus               30.5      34 0.00073   27.8   1.7   13   33-45    236-248 (298)
259 PF14952 zf-tcix:  Putative tre  30.4      24 0.00052   20.4   0.6   11   35-45     10-20  (44)
260 PRK00807 50S ribosomal protein  29.9      29 0.00064   20.6   1.0   15   37-51      2-16  (52)
261 PF04810 zf-Sec23_Sec24:  Sec23  29.8      21 0.00045   19.9   0.3   12   34-45     22-33  (40)
262 COG4416 Com Mu-like prophage p  29.4      38 0.00081   20.6   1.4   25   21-45      9-33  (60)
263 PF04135 Nop10p:  Nucleolar RNA  29.0      38 0.00083   20.4   1.4   20   37-56     18-39  (53)
264 TIGR02666 moaA molybdenum cofa  28.9      46   0.001   26.9   2.3   46   17-64      4-55  (334)
265 PRK00564 hypA hydrogenase nick  28.9      22 0.00048   24.7   0.4    8   38-45     90-97  (117)
266 PF04475 DUF555:  Protein of un  28.8      26 0.00056   23.9   0.7   10   36-45     47-56  (102)
267 PF10276 zf-CHCC:  Zinc-finger   28.6      22 0.00048   20.1   0.3   11   36-46     29-39  (40)
268 COG2761 FrnE Predicted dithiol  28.6      12 0.00027   29.3  -1.0   17   26-42      3-19  (225)
269 PRK03922 hypothetical protein;  28.6      26 0.00056   24.4   0.6   10   36-45     49-58  (113)
270 PF01096 TFIIS_C:  Transcriptio  28.1      27 0.00059   19.4   0.6    8   38-45      2-9   (39)
271 TIGR03032 conserved hypothetic  27.9      99  0.0021   25.7   4.0   37   93-136   282-318 (335)
272 PRK00420 hypothetical protein;  27.9      73  0.0016   22.2   2.8    9   37-45     24-32  (112)
273 smart00507 HNHc HNH nucleases.  27.6      34 0.00073   18.8   1.0   14   33-47      8-21  (52)
274 PF14205 Cys_rich_KTR:  Cystein  27.6      30 0.00066   21.0   0.8    9   37-45      5-13  (55)
275 PF14446 Prok-RING_1:  Prokaryo  27.6      27 0.00059   21.1   0.6   12   36-47     21-32  (54)
276 COG3478 Predicted nucleic-acid  27.6      29 0.00063   21.9   0.7   10   36-45      4-13  (68)
277 smart00116 CBS Domain in cysta  27.5      64  0.0014   16.5   2.1   27   85-112    15-41  (49)
278 KOG3993|consensus               27.4      26 0.00056   30.1   0.6   13   33-45    292-304 (500)
279 COG0419 SbcC ATPase involved i  27.3      54  0.0012   30.7   2.7   15   36-50    457-471 (908)
280 PF11039 DUF2824:  Protein of u  27.3 2.4E+02  0.0052   20.5   5.4   41   90-136    35-75  (151)
281 PRK00762 hypA hydrogenase nick  27.3      28 0.00061   24.5   0.7    9   37-45     93-101 (124)
282 PF01246 Ribosomal_L24e:  Ribos  27.3      31 0.00066   22.1   0.8   14   36-49      3-16  (71)
283 COG1327 Predicted transcriptio  27.2      29 0.00062   25.6   0.7    8   38-45      2-9   (156)
284 cd00085 HNHc HNH nucleases; HN  27.1      46   0.001   18.6   1.6   10   37-46     12-21  (57)
285 PF04267 SoxD:  Sarcosine oxida  27.0      20 0.00043   23.7  -0.1    8   38-45      3-10  (84)
286 TIGR03826 YvyF flagellar opero  26.9      33 0.00073   24.7   1.0   11   37-47      4-14  (137)
287 PRK03824 hypA hydrogenase nick  26.9      29 0.00064   24.8   0.7   12   34-45    105-116 (135)
288 smart00734 ZnF_Rad18 Rad18-lik  26.8      30 0.00064   17.5   0.6    9   38-46      3-11  (26)
289 KOG2779|consensus               26.5 2.2E+02  0.0049   24.1   5.8   65   48-115   262-328 (421)
290 PF06827 zf-FPG_IleRS:  Zinc fi  26.4      13 0.00029   19.1  -0.8   19   26-44     11-29  (30)
291 PF01927 Mut7-C:  Mut7-C RNAse   26.2      29 0.00064   24.9   0.6   11   36-46    124-134 (147)
292 COG1996 RPC10 DNA-directed RNA  26.1      27 0.00058   20.7   0.3   12   34-45     22-33  (49)
293 PF12387 Peptidase_C74:  Pestiv  26.0      29 0.00062   26.2   0.6   17   29-45    168-184 (200)
294 PF03811 Zn_Tnp_IS1:  InsA N-te  25.7      34 0.00073   18.8   0.7    9   37-45      6-14  (36)
295 TIGR03830 CxxCG_CxxCG_HTH puta  25.6 1.3E+02  0.0028   20.4   3.8   14   35-48     30-43  (127)
296 smart00714 LITAF Possible memb  25.6      34 0.00073   21.1   0.8   15   31-45     47-61  (67)
297 COG1885 Uncharacterized protei  25.5      33 0.00071   23.7   0.7   10   36-45     49-58  (115)
298 PRK12496 hypothetical protein;  25.3      31 0.00068   25.5   0.7   11   37-47    144-154 (164)
299 COG1645 Uncharacterized Zn-fin  25.2      30 0.00066   24.8   0.5   10   36-45     28-37  (131)
300 PRK13361 molybdenum cofactor b  25.2      58  0.0013   26.4   2.3   45   17-63      8-56  (329)
301 PF07315 DUF1462:  Protein of u  25.0      94   0.002   20.9   2.8   62   36-108     6-79  (93)
302 PF14803 Nudix_N_2:  Nudix N-te  25.0      34 0.00073   18.6   0.6    8   38-45      2-9   (34)
303 COG2093 DNA-directed RNA polym  25.0      35 0.00075   21.3   0.7    9   37-45     19-27  (64)
304 PF10609 ParA:  ParA/MinD ATPas  25.0      28 0.00062   22.8   0.3   13   33-45     62-74  (81)
305 PF10915 DUF2709:  Protein of u  24.9      64  0.0014   24.9   2.3   32   31-62     82-115 (238)
306 COG1029 FwdB Formylmethanofura  24.8      31 0.00067   29.2   0.6   11   35-45      4-14  (429)
307 KOG1491|consensus               24.6      64  0.0014   27.2   2.4   24  125-148    88-113 (391)
308 PF06397 Desulfoferrod_N:  Desu  24.6      30 0.00065   19.1   0.3   12   34-45      4-15  (36)
309 PF04564 U-box:  U-box domain;   24.4      99  0.0022   19.3   2.8   27   35-61     38-64  (73)
310 PF09723 Zn-ribbon_8:  Zinc rib  24.4      26 0.00056   19.7   0.0   10   36-45      5-14  (42)
311 PRK14852 hypothetical protein;  24.3 6.5E+02   0.014   24.3   9.5   88   52-147    35-146 (989)
312 PF12756 zf-C2H2_2:  C2H2 type   24.1      38 0.00082   21.6   0.8   13   35-47     49-61  (100)
313 COG5319 Uncharacterized protei  24.1      34 0.00073   24.6   0.6   13   33-45     29-41  (142)
314 PF08348 PAS_6:  YheO-like PAS   23.9 1.3E+02  0.0029   20.8   3.6   23   94-117    85-107 (118)
315 PF09706 Cas_CXXC_CXXC:  CRISPR  23.9      38 0.00081   21.3   0.7   13   33-45      2-14  (69)
316 COG4694 Uncharacterized protei  23.9      31 0.00067   30.9   0.5   10   36-45    278-287 (758)
317 COG0375 HybF Zn finger protein  23.8      39 0.00083   23.7   0.8   10   36-45     86-95  (115)
318 COG5270 PUA domain (predicted   23.7   1E+02  0.0022   23.7   3.1   33   94-139   164-196 (202)
319 PF03982 DAGAT:  Diacylglycerol  23.7 1.1E+02  0.0024   24.8   3.6   35   42-76    247-283 (297)
320 PRK14891 50S ribosomal protein  23.6      41 0.00089   24.1   0.9   14   36-49      4-17  (131)
321 PF13719 zinc_ribbon_5:  zinc-r  23.6      38 0.00082   18.5   0.6    9   37-45      3-11  (37)
322 COG1579 Zn-ribbon protein, pos  23.5      31 0.00067   27.3   0.3   39    7-46    189-231 (239)
323 KOG4602|consensus               23.3      33 0.00071   27.6   0.5   13   33-45    265-277 (318)
324 PF01396 zf-C4_Topoisom:  Topoi  23.3      39 0.00085   18.7   0.7    8   38-45      3-10  (39)
325 PF11023 DUF2614:  Protein of u  23.1      49  0.0011   23.1   1.2    9   37-45     70-78  (114)
326 PF12677 DUF3797:  Domain of un  23.1      39 0.00085   20.0   0.6    9   38-46     15-23  (49)
327 PF06170 DUF983:  Protein of un  23.0      47   0.001   21.9   1.1   23   36-61      8-30  (86)
328 PLN02294 cytochrome c oxidase   23.0      38 0.00082   25.5   0.7   13   35-47    140-152 (174)
329 PF04376 ATE_N:  Arginine-tRNA-  22.9 1.1E+02  0.0025   19.6   2.9   29   34-62      5-41  (80)
330 PF10815 ComZ:  ComZ;  InterPro  22.8 1.3E+02  0.0028   18.2   2.8   26   49-74     25-50  (56)
331 TIGR02668 moaA_archaeal probab  22.6      71  0.0015   25.4   2.3   46   17-64      4-52  (302)
332 PF14824 Sirohm_synth_M:  Siroh  22.6      52  0.0011   17.4   1.0   19   10-28     10-28  (30)
333 PF06677 Auto_anti-p27:  Sjogre  22.1      46   0.001   18.8   0.8   13   36-48     17-29  (41)
334 PF10601 zf-LITAF-like:  LITAF-  22.0      43 0.00093   21.0   0.7   14   32-45     54-67  (73)
335 PF02799 NMT_C:  Myristoyl-CoA:  21.9 1.2E+02  0.0027   23.1   3.3   66   47-115    29-96  (190)
336 smart00746 TRASH metallochaper  21.7      44 0.00096   16.3   0.7    7   39-45      1-7   (39)
337 PF03376 Adeno_E3B:  Adenovirus  21.7      40 0.00087   21.2   0.5   14  130-143    52-65  (67)
338 PRK00758 GMP synthase subunit   21.7 1.1E+02  0.0024   22.4   3.0   27  125-151   155-181 (184)
339 TIGR03823 FliZ flagellar regul  21.7      60  0.0013   24.1   1.5   35   32-66     18-58  (168)
340 PRK05978 hypothetical protein;  21.6      41  0.0009   24.6   0.7   11   35-45     32-42  (148)
341 cd00472 Ribosomal_L24e_L24 Rib  21.5      46   0.001   20.1   0.8   14   36-49      3-16  (54)
342 TIGR03831 YgiT_finger YgiT-typ  21.3      50  0.0011   18.2   0.9   18   31-48     27-44  (46)
343 PF06054 CoiA:  Competence prot  21.1      45 0.00099   27.9   0.9   12   34-45     28-39  (375)
344 smart00661 RPOL9 RNA polymeras  21.0      44 0.00095   19.1   0.6   11   35-45     19-29  (52)
345 PF10083 DUF2321:  Uncharacteri  21.0      72  0.0016   23.6   1.8   29   33-65     65-93  (158)
346 PF05265 DUF723:  Protein of un  20.7      46   0.001   20.6   0.6    9   35-43     52-60  (60)
347 TIGR02491 NrdG anaerobic ribon  20.7 1.3E+02  0.0029   21.5   3.3   33   34-66     24-60  (154)
348 PHA00733 hypothetical protein   20.6      58  0.0013   23.0   1.2   22   31-52     68-89  (128)
349 COG3058 FdhE Uncharacterized p  20.6      48   0.001   27.0   0.9   18   28-45    173-194 (308)
350 PRK00432 30S ribosomal protein  20.6      46 0.00099   19.6   0.6    9   36-44     20-28  (50)
351 PF08273 Prim_Zn_Ribbon:  Zinc-  20.6      44 0.00095   18.9   0.5    9   37-45      4-12  (40)
352 PF07282 OrfB_Zn_ribbon:  Putat  20.4      45 0.00097   20.4   0.6   15   31-45     41-55  (69)
353 cd04641 CBS_pair_28 The CBS do  20.3 1.6E+02  0.0035   19.1   3.4   27   85-112    88-114 (120)
354 KOG3029|consensus               20.2      21 0.00046   29.2  -1.1   16   30-45     90-105 (370)
355 PF02150 RNA_POL_M_15KD:  RNA p  20.0      46   0.001   18.0   0.5    8   38-45      3-10  (35)
356 PHA01733 hypothetical protein   20.0 3.7E+02   0.008   19.8   5.9   59   50-114     6-66  (153)
357 TIGR02605 CxxC_CxxC_SSSS putat  20.0      46   0.001   19.2   0.6   10   36-45      5-14  (52)

No 1  
>KOG0402|consensus
Probab=99.92  E-value=3.2e-26  Score=148.73  Aligned_cols=45  Identities=91%  Similarity=1.418  Sum_probs=44.6

Q ss_pred             CCCccceeeeccccccccchhHhHhHHhhhhhcccccccCCCCcc
Q psy838            1 MAKRTKKVGITGKYGTRYGASLRKMVKKMEITQHAKYTCSFCGKE   45 (152)
Q Consensus         1 m~~~~~~~~~~~~~g~ryg~~~rk~~~~~~~~~~~~~~c~~c~~~   45 (152)
                      |++|||||||+|+||+|||++|||++++||++||++|+|+||||.
T Consensus         1 m~krtKKVgI~gkyGtrYGaSLrk~vKkiei~Qhaky~CsfCGK~   45 (92)
T KOG0402|consen    1 MAKRTKKVGIVGKYGTRYGASLRKMVKKIEIQQHAKYTCSFCGKK   45 (92)
T ss_pred             CCcccceeeeeecccchhhHHHHHHHHHHHHHHhhhhhhhhcchh
Confidence            999999999999999999999999999999999999999999998


No 2  
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=99.91  E-value=3.7e-25  Score=147.37  Aligned_cols=45  Identities=84%  Similarity=1.353  Sum_probs=44.3

Q ss_pred             CCCccceeeeccccccccchhHhHhHHhhhhhcccccccCCCCcc
Q psy838            1 MAKRTKKVGITGKYGTRYGASLRKMVKKMEITQHAKYTCSFCGKE   45 (152)
Q Consensus         1 m~~~~~~~~~~~~~g~ryg~~~rk~~~~~~~~~~~~~~c~~c~~~   45 (152)
                      |++|||||||+||||+|||++|||+|++||++||++|+|||||+.
T Consensus         1 MakrtkkvG~~GrfG~RYG~slRK~v~kie~~q~a~y~CpfCgk~   45 (90)
T PTZ00255          1 MAKRTKKVGITGKYGTRYGASLRKQIKKIEISQHAKYFCPFCGKH   45 (90)
T ss_pred             CCCcCceeeecCCCcCccCHHHHHHHHHHHHHHhCCccCCCCCCC
Confidence            899999999999999999999999999999999999999999987


No 3  
>PRK03976 rpl37ae 50S ribosomal protein L37Ae; Reviewed
Probab=99.90  E-value=9.5e-25  Score=145.44  Aligned_cols=45  Identities=47%  Similarity=0.871  Sum_probs=44.1

Q ss_pred             CCCccceeeeccccccccchhHhHhHHhhhhhcccccccCCCCcc
Q psy838            1 MAKRTKKVGITGKYGTRYGASLRKMVKKMEITQHAKYTCSFCGKE   45 (152)
Q Consensus         1 m~~~~~~~~~~~~~g~ryg~~~rk~~~~~~~~~~~~~~c~~c~~~   45 (152)
                      |++|||||||+||||+|||++|||+|++||++||++|+|||||+.
T Consensus         1 m~~rtkkvGi~Gr~G~RYG~slRK~v~kie~~q~a~y~CpfCgk~   45 (90)
T PRK03976          1 MMSRTKKVGSAGRFGARYGRKIRKRVADIEEKMRAKHVCPVCGRP   45 (90)
T ss_pred             CCCcCceEeecCCCcCccCHHHHHHHHHHHHHHhcCccCCCCCCC
Confidence            888999999999999999999999999999999999999999987


No 4  
>TIGR00280 L37a ribosomal protein L37a. This model finds eukaryotic ribosomal protein L37a and its archaeal orthologs. The nomeclature is tricky because eukaryotes have proteins called both L37 and L37a.
Probab=99.89  E-value=4.3e-24  Score=142.43  Aligned_cols=44  Identities=68%  Similarity=1.222  Sum_probs=42.7

Q ss_pred             CCCccceeeeccccccccchhHhHhHHhhhhhcccccccCCCCcc
Q psy838            1 MAKRTKKVGITGKYGTRYGASLRKMVKKMEITQHAKYTCSFCGKE   45 (152)
Q Consensus         1 m~~~~~~~~~~~~~g~ryg~~~rk~~~~~~~~~~~~~~c~~c~~~   45 (152)
                      |+ |||||||+||||||||++|||+|++||++||++|+|||||+.
T Consensus         1 ma-rtkkvG~~GrfG~RYG~slRK~v~kie~~q~a~y~CpfCgk~   44 (91)
T TIGR00280         1 MK-RTKKVGITGRFGPRYGLKLRRQVKKIEIQQKAKYVCPFCGKK   44 (91)
T ss_pred             CC-CCceeeecCCCcCccCHHHHHHHHHHHHHHhcCccCCCCCCC
Confidence            67 899999999999999999999999999999999999999987


No 5  
>KOG2488|consensus
Probab=99.88  E-value=1.9e-22  Score=150.76  Aligned_cols=98  Identities=39%  Similarity=0.732  Sum_probs=92.8

Q ss_pred             HHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccc
Q psy838           54 AETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQ  133 (152)
Q Consensus        54 ~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~  133 (152)
                      .++++|+|+|.+.||.++|.++.|||++..+..||.+...+|+++.++ .+++|||++|+++.|.+.|++|+++++|.++
T Consensus        53 ~~~ldw~f~L~k~nm~~~Y~qs~~Gw~~~~K~~El~~~~~~Yi~a~~~-~~~~vgf~~Frf~vd~g~~vlYcyEvqv~~~  131 (202)
T KOG2488|consen   53 YEDLDWCFSLFKKNMGAMYRQSSWGWDDNSKAKELRNRKLRYICAWNN-KSKLVGFTMFRFTVDTGDPVLYCYEVQVASA  131 (202)
T ss_pred             HHHHHHHHHHHHhhhHHHhhhcccccCchhHHHHHhhccceEEEEEcC-CCceeeEEEEEEEcccCCeEEEEEEEeehhh
Confidence            389999999999999999999999999999999999999999999977 3499999999999999999999999999999


Q ss_pred             cccCCHHHHHHHHHHhhcC
Q psy838          134 VQRKGLGKFMMQAFYTYID  152 (152)
Q Consensus       134 ~qgkGiG~~Lm~~l~~~a~  152 (152)
                      |||+|||++||+.++..|+
T Consensus       132 yR~kGiGk~LL~~l~~~a~  150 (202)
T KOG2488|consen  132 YRGKGIGKFLLDTLEKLAD  150 (202)
T ss_pred             hhccChHHHHHHHHHHHHH
Confidence            9999999999999998764


No 6  
>PF01780 Ribosomal_L37ae:  Ribosomal L37ae protein family;  InterPro: IPR002674 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits.  Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This ribosomal protein is found in archaebacteria and eukaryotes []. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A1E_Y 4A17_Y 4A1C_Y 4A1A_Y 3O58_g 3IZS_m 3O5H_g 1S1I_9 3IZR_m 1YSH_D ....
Probab=99.88  E-value=7.8e-24  Score=141.06  Aligned_cols=44  Identities=86%  Similarity=1.365  Sum_probs=39.6

Q ss_pred             CCccceeeeccccccccchhHhHhHHhhhhhcccccccCCCCcc
Q psy838            2 AKRTKKVGITGKYGTRYGASLRKMVKKMEITQHAKYTCSFCGKE   45 (152)
Q Consensus         2 ~~~~~~~~~~~~~g~ryg~~~rk~~~~~~~~~~~~~~c~~c~~~   45 (152)
                      |+|||||||+||||+|||++|||+|++||++||++|+|||||+.
T Consensus         1 a~rtkkvgi~GkyG~RYG~slRK~vkkie~~q~~ky~Cp~Cgk~   44 (90)
T PF01780_consen    1 AKRTKKVGIAGKYGTRYGASLRKRVKKIEISQHAKYTCPFCGKT   44 (90)
T ss_dssp             -S-SSS-TTGGGGTTSSTHHHHHHHHHHHHHHHS-BEESSSSSS
T ss_pred             CCCCceeeecCcCcCCcCHHHHHHHHHHHHHHhCCCcCCCCCCc
Confidence            57999999999999999999999999999999999999999998


No 7  
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=99.84  E-value=9.9e-22  Score=129.15  Aligned_cols=44  Identities=59%  Similarity=1.089  Sum_probs=42.9

Q ss_pred             CCCccceeeeccccccccchhHhHhHHhhhhhcccccccCCCCcc
Q psy838            1 MAKRTKKVGITGKYGTRYGASLRKMVKKMEITQHAKYTCSFCGKE   45 (152)
Q Consensus         1 m~~~~~~~~~~~~~g~ryg~~~rk~~~~~~~~~~~~~~c~~c~~~   45 (152)
                      |++ |||||++||||||||+++|++|++||..|+++|.||+||+.
T Consensus         1 M~~-TkkvG~aGrfGpRYG~~~Rrrv~~ie~~~~~~~~Cp~C~~~   44 (89)
T COG1997           1 MAK-TKKVGIAGRFGPRYGSKLRRRVKEIEAQQRAKHVCPFCGRT   44 (89)
T ss_pred             CCc-cceeccCcccccccchHHHHHHHHHHHHHhcCCcCCCCCCc
Confidence            777 99999999999999999999999999999999999999998


No 8  
>PRK10146 aminoalkylphosphonic acid N-acetyltransferase; Provisional
Probab=99.34  E-value=3.2e-12  Score=91.06  Aligned_cols=94  Identities=13%  Similarity=0.029  Sum_probs=66.7

Q ss_pred             CCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCCh----hhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeecc--CCcce
Q psy838           49 STDLGAETKKWVWELEEKNMKHSYEVCDIGWDP----QGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVD--FGEPV  122 (152)
Q Consensus        49 ~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~----~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~--~~~~~  122 (152)
                      ..+++++|++.+.++..+....       .|+.    +.+...+.++...++++..+  +++|||+.+.....  .....
T Consensus         6 ir~a~~~D~~~l~~l~~~~~~~-------~~~~~~~~~~~~~~l~~~~~~~~v~~~~--~~ivG~~~~~~~~~~~~~~~~   76 (144)
T PRK10146          6 LRPATQYDTDAVYALICELKQA-------EFDHQAFRVGFNANLRDPNMRYHLALLD--GEVVGMIGLHLQFHLHHVNWI   76 (144)
T ss_pred             EeeCcHhhHHHHHHHHHHHhcc-------cCCHHHHHHHHHHHhcCCCceEEEEEEC--CEEEEEEEEEecccccccchh
Confidence            4567899999999987653221       1222    22334454556666666655  89999999875422  12223


Q ss_pred             EEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          123 LYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       123 ~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      .+|.+|+|+|++||+|+|+.||+.+++.|
T Consensus        77 ~~i~~l~v~p~~rg~GiG~~Ll~~~~~~a  105 (144)
T PRK10146         77 GEIQELVVMPQARGLNVGSKLLAWAEEEA  105 (144)
T ss_pred             heeheeEECHHHcCCCHHHHHHHHHHHHH
Confidence            56899999999999999999999999876


No 9  
>KOG3216|consensus
Probab=99.31  E-value=2e-11  Score=88.87  Aligned_cols=102  Identities=17%  Similarity=0.219  Sum_probs=76.2

Q ss_pred             CCCCCHHHHHHHHHHHHHhHHHHHHhCC--CCCChhhhhhhhc-C-CCceEEEEEe-CCCCCeEEEEEEEeecc--CCcc
Q psy838           49 STDLGAETKKWVWELEEKNMKHSYEVCD--IGWDPQGKHSEMF-D-DRACYLVAKN-GSSSTPVAFSHFRFDVD--FGEP  121 (152)
Q Consensus        49 ~~~l~~~d~~~i~~l~~~~m~~~y~~~~--~~w~~~~~~~el~-~-~~~~~~v~~~-~~~~~ivGf~~~~~~~~--~~~~  121 (152)
                      ...+++.|.+.+..|.++ | +-|+...  .--+++.+..+++ + +..+..++.- ..++.++||+.|.+.++  .+..
T Consensus         6 IR~at~~D~~~i~rLike-l-a~Fek~~~~v~~te~~l~~~~F~d~~~~~~~v~~ie~~~~~~aGf~~yf~~ystW~~k~   83 (163)
T KOG3216|consen    6 IRLATPKDCEDILRLIKE-L-AEFEKLEDQVEATEENLARDGFIDPPFKHWLVAAIETSGEVVAGFALYFNNYSTWLGKQ   83 (163)
T ss_pred             EEecCcccHHHHHHHHHH-H-HHHHHhccchhhchhhhhhhhccCCCccEEEEEEEecCCCceeEEeeeecccccccccc
Confidence            456888999999999887 2 3455433  2356777777643 2 3344444433 22589999999988765  5667


Q ss_pred             eEEEeeeeeccccccCCHHHHHHHHHHhhcC
Q psy838          122 VLYCYELQLEKQVQRKGLGKFMMQAFYTYID  152 (152)
Q Consensus       122 ~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a~  152 (152)
                      .+||.+|+|.|+|||+|+|+.|+..+-+.|+
T Consensus        84 ~iYleDlyV~e~yR~kG~Gs~Ll~~va~~A~  114 (163)
T KOG3216|consen   84 GIYLEDLYVREQYRGKGIGSKLLKFVAEEAD  114 (163)
T ss_pred             eEEEEeeEecchhcccChHHHHHHHHHHHHH
Confidence            8999999999999999999999999988774


No 10 
>PHA00673 acetyltransferase domain containing protein
Probab=99.25  E-value=1.1e-10  Score=85.90  Aligned_cols=101  Identities=11%  Similarity=-0.058  Sum_probs=68.9

Q ss_pred             CCCCCHHHHHHHHHHHHHhHHHHHHhCC-CCCCh-hhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeecc---CCcceE
Q psy838           49 STDLGAETKKWVWELEEKNMKHSYEVCD-IGWDP-QGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVD---FGEPVL  123 (152)
Q Consensus        49 ~~~l~~~d~~~i~~l~~~~m~~~y~~~~-~~w~~-~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~---~~~~~~  123 (152)
                      ..-++.+|++.|.+|..+.......... .+... +.+..-..+++..++++..+  |++|||+.+.+...   .+.+..
T Consensus         9 ~~~A~~~D~paI~~LLadd~l~~~r~d~~~~~~y~~af~ai~~dp~~~llVa~~~--g~vVG~~~l~~~p~l~~~~~~~~   86 (154)
T PHA00673          9 VAFAELADAPTFASLCAEYAHESANADLAGRAPDHHAYAGMEAAGVAHFLGVFRG--EELVGFACLLVTPVPHFKGQLIG   86 (154)
T ss_pred             HhhccHhhHHHHHHHHHhcccccccccccccchhHHHHHHHHhCCCcEEEEEEEC--CEEEEEEEEEEecCCccCCccEE
Confidence            4457899999999997662111100000 00000 11223345677777777665  89999999877643   245677


Q ss_pred             EEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          124 YCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       124 ~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      +|.+|+|+|++||+|||+.||+.++++|
T Consensus        87 ~Ie~l~V~~~~RGqGIG~~Ll~~A~~~A  114 (154)
T PHA00673         87 TTESIFVAAAHRPGGAGMALLRATEALA  114 (154)
T ss_pred             EEEEEEEChhccCCCHHHHHHHHHHHHH
Confidence            8999999999999999999999999886


No 11 
>PF13673 Acetyltransf_10:  Acetyltransferase (GNAT) domain; PDB: 2FIW_A 1BOB_A 3FNC_B 3EXN_A.
Probab=99.23  E-value=1.8e-10  Score=79.09  Aligned_cols=87  Identities=16%  Similarity=0.206  Sum_probs=59.9

Q ss_pred             HHHHHHHHHHHhHHHHHHhCC------CCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeee
Q psy838           56 TKKWVWELEEKNMKHSYEVCD------IGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQ  129 (152)
Q Consensus        56 d~~~i~~l~~~~m~~~y~~~~------~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~  129 (152)
                      |++.+.+|..+.....+....      +.++++.....+.++...++++..+  +++|||+.+.   .++    .|..++
T Consensus         1 D~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~--~~ivG~~~~~---~~~----~i~~l~   71 (117)
T PF13673_consen    1 DIPAIAELYREAWQENYWDYGPEQIDAWRYSPEDLEEYLEEGSHTIFVAEEG--GEIVGFAWLE---PDG----EISHLY   71 (117)
T ss_dssp             GHHHHHHHHHHHHHHHTTTTSHHHHHHHHSSHHHHHHHHCTCCCEEEEEEET--TEEEEEEEEE---TCE----EEEEEE
T ss_pred             CHHHHHHHHHHHHHHhccCCCHHHHHHHhcCHHHHHHHHHhcCCEEEEEEEC--CEEEEEEEEc---CCC----eEEEEE
Confidence            667777777765544331110      0144555555555555667777766  9999999985   222    288899


Q ss_pred             eccccccCCHHHHHHHHHHhhc
Q psy838          130 LEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       130 V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      |+|+|||+|+|+.||+.+++.+
T Consensus        72 v~p~~r~~Gig~~Ll~~~~~~~   93 (117)
T PF13673_consen   72 VLPEYRGRGIGRALLDAAEKEA   93 (117)
T ss_dssp             E-GGGTTSSHHHHHHHHHHHHH
T ss_pred             EChhhcCCcHHHHHHHHHHHHH
Confidence            9999999999999999999875


No 12 
>TIGR03827 GNAT_ablB putative beta-lysine N-acetyltransferase. Members of this protein family are GNAT family acetyltransferases, based on a seed alignment in which every member is associated with a lysine 2,3-aminomutase family protein, usually as the adjacent gene. This family includes AblB, the enzyme beta-lysine acetyltransferase that completes the two-step synthesis of the osmolyte (compatible solute) N-epsilon-acetyl-beta-lysine; all members of the family may have this function. Note that N-epsilon-acetyl-beta-lysine has been observed only in methanogenic archaea (e.g. Methanosarcina) but that this model, paired with TIGR03820, suggests a much broader distribution.
Probab=99.18  E-value=1.9e-10  Score=91.28  Aligned_cols=97  Identities=12%  Similarity=0.106  Sum_probs=69.8

Q ss_pred             ccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEe
Q psy838           47 KKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCY  126 (152)
Q Consensus        47 ~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~  126 (152)
                      .....++++|++.+.+|..+.    |....++|.+.....+...+...++++..+  +++||++.+....  ....++|.
T Consensus       116 ~~IR~a~~~D~~~l~~L~~~v----~~~~~~~~~~~~~l~~~~~~~~~~~v~~~~--g~iVG~~~~~~~~--~~~~~eI~  187 (266)
T TIGR03827       116 FTLRIATEDDADAMAALYRKV----FPTYPFPIHDPAYLLETMKSNVVYFGVEDG--GKIIALASAEMDP--ENGNAEMT  187 (266)
T ss_pred             eEEEECCHHHHHHHHHHHHHH----hccCCCCccCHHHHHHHhcCCcEEEEEEEC--CEEEEEEEEecCC--CCCcEEEE
Confidence            345678999999999998763    322234444433333334455566666655  8999999875432  23357899


Q ss_pred             eeeeccccccCCHHHHHHHHHHhhc
Q psy838          127 ELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       127 ~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      +++|+|+|||+|+|+.||+.+++.|
T Consensus       188 ~i~V~P~yRG~GiG~~Ll~~l~~~a  212 (266)
T TIGR03827       188 DFATLPEYRGKGLAKILLAAMEKEM  212 (266)
T ss_pred             EEEECHHHcCCCHHHHHHHHHHHHH
Confidence            9999999999999999999999876


No 13 
>PRK10314 putative acyltransferase; Provisional
Probab=99.14  E-value=6.7e-10  Score=81.43  Aligned_cols=97  Identities=7%  Similarity=0.023  Sum_probs=66.6

Q ss_pred             cccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhh-cCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEE
Q psy838           46 YKKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEM-FDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLY  124 (152)
Q Consensus        46 ~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el-~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~  124 (152)
                      +.+..+++.+++..++.|-.+.+   +.....++. +  .++. ..+.+..+++..+  +++|||+.+....+ .....+
T Consensus         6 ~~~~~~l~~~~~~~~~~lR~~VF---~~eq~~~~~-e--~D~~d~~~~~~h~~~~~~--~~~vg~~r~~~~~~-~~~~~~   76 (153)
T PRK10314          6 DLHHSELSVSQLYALLQLRCAVF---VVEQNCPYQ-D--IDGDDLTGDNRHILGWKN--DELVAYARILKSDD-DLEPVV   76 (153)
T ss_pred             ecchhhCCHHHHHHHHHHHHHHh---hhhcCCCcc-c--cCCCCCCCCcEEEEEEEC--CEEEEEEEEecCCC-CCCCEE
Confidence            34568999999999999987643   222233332 1  1111 1123444556655  89999999876422 223478


Q ss_pred             EeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          125 CYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       125 I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      |.+|+|+|+|||+|+|++||+.+++.+
T Consensus        77 i~rv~V~~~~rG~GiG~~Lm~~~~~~~  103 (153)
T PRK10314         77 IGRVIVSEALRGEKVGQQLMSKTLESC  103 (153)
T ss_pred             EEEEEECHHHhCCCHHHHHHHHHHHHH
Confidence            999999999999999999999988764


No 14 
>TIGR01575 rimI ribosomal-protein-alanine acetyltransferase. Members of this model belong to the GCN5-related N-acetyltransferase (GNAT) superfamily. This model covers prokarotes and the archaea. The seed contains a characterized accession for Gram negative E. coli. An untraceable characterized accession (PIR|S66013) for Gram positive B. subtilis scores well (205.0) in the full alignment. Characterized members are lacking in the archaea. Noise cutoff (72.4) was set to exclude M. loti paralog of rimI. Trusted cutoff (80.0) was set at next highest scoring member in the mini-database.
Probab=99.09  E-value=8e-10  Score=76.70  Aligned_cols=83  Identities=19%  Similarity=0.398  Sum_probs=61.6

Q ss_pred             HHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccc
Q psy838           56 TKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQ  135 (152)
Q Consensus        56 d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~q  135 (152)
                      |++.++++..+    .|   ..+|+.+.+...+......++++..+  +++|||+.+....+    ..+|..++|+|++|
T Consensus         1 d~~~i~~~~~~----~~---~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~vg~~~~~~~~~----~~~i~~~~v~~~~r   67 (131)
T TIGR01575         1 DLKAVLEIEAA----AF---AFPWTEAQFAEELANYHLCYLLARIG--GKVVGYAGVQIVLD----EAHILNIAVKPEYQ   67 (131)
T ss_pred             CHHHHHHHHHh----hC---CCCCCHHHHHHHhcCCCceEEEEecC--CeEEEEEEEEecCC----CeEEEEEEECHHHc
Confidence            45667777544    34   34788888777776555555555544  89999999876432    34688999999999


Q ss_pred             cCCHHHHHHHHHHhhc
Q psy838          136 RKGLGKFMMQAFYTYI  151 (152)
Q Consensus       136 gkGiG~~Lm~~l~~~a  151 (152)
                      |+|+|+.||+.+++.+
T Consensus        68 g~G~g~~ll~~~~~~~   83 (131)
T TIGR01575        68 GQGIGRALLRELIDEA   83 (131)
T ss_pred             CCCHHHHHHHHHHHHH
Confidence            9999999999998875


No 15 
>PHA01807 hypothetical protein
Probab=99.08  E-value=7.1e-10  Score=81.61  Aligned_cols=98  Identities=9%  Similarity=-0.008  Sum_probs=62.9

Q ss_pred             CCCCCHHHHHHHHHHHHHhHHHHHHhCCC--CCC--hhh---hhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccC-Cc
Q psy838           49 STDLGAETKKWVWELEEKNMKHSYEVCDI--GWD--PQG---KHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDF-GE  120 (152)
Q Consensus        49 ~~~l~~~d~~~i~~l~~~~m~~~y~~~~~--~w~--~~~---~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~-~~  120 (152)
                      ...++.+|++....|..+.    +...+.  +|.  ++.   ...++.++....+++..+  +++|||+.+.+.... ..
T Consensus         6 ~~~~~~~d~~~~~~l~l~~----l~e~p~~~~w~s~ee~~~~~~~~~~~~~~~~lva~~d--g~lvG~~~l~~~~~~~~~   79 (153)
T PHA01807          6 RVHAKAGTPSELQGLCWLA----IQELEEFTLFRSKEEALERILDSTESNDRTELLVFRD--GKLAGIAVLVFEDDPHVG   79 (153)
T ss_pred             hhhhhhCCHHHHHHHHHHH----HHhCccCCCCCChHHHHHHHHHHhhCCCceEEEEEEC--CEEEEEEEEEcCCCccee
Confidence            3446677888888886553    333222  144  221   222234455555666655  899999998764322 11


Q ss_pred             ceEEEeeeeeccccccCCHHHHHHHHHHhhcC
Q psy838          121 PVLYCYELQLEKQVQRKGLGKFMMQAFYTYID  152 (152)
Q Consensus       121 ~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a~  152 (152)
                      ..+.|..|+|+|+|||+|+|++||+.+++.|+
T Consensus        80 ~i~~l~~lYV~pe~RG~GiG~~Ll~~~~~~Ar  111 (153)
T PHA01807         80 PCLGVQWQYVLPEYRNAGVAREFLRELIRLAG  111 (153)
T ss_pred             eeccceeEEECHHHcCCCHHHHHHHHHHHHHH
Confidence            23345668999999999999999999998763


No 16 
>PTZ00330 acetyltransferase; Provisional
Probab=99.08  E-value=3.2e-10  Score=80.94  Aligned_cols=95  Identities=12%  Similarity=0.087  Sum_probs=62.5

Q ss_pred             CCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcC----CCc-eEEEEEeCCCCCeEEEEEEEeecc---CCc
Q psy838           49 STDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFD----DRA-CYLVAKNGSSSTPVAFSHFRFDVD---FGE  120 (152)
Q Consensus        49 ~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~----~~~-~~~v~~~~~~~~ivGf~~~~~~~~---~~~  120 (152)
                      ..+++++|++.+.++...    ...  ...++.+... .+.+    ++. ..+++..+ +|++|||+.+.....   .+.
T Consensus         9 ir~~~~~D~~~i~~l~~~----~~~--~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~-~~~~vG~~~~~~~~~~~~~~~   80 (147)
T PTZ00330          9 LRDLEEGDLGSVLELLSH----LTS--APALSQEELE-QIAARRRLAGVVTRVFVHSP-TQRIVGTASLFVEPKFTRGGK   80 (147)
T ss_pred             EEEcccccHHHHHHHHHH----hcC--CCccchhHHH-HHHHHHhcCCCceEEEEEeC-CCEEEEEEEEEeccccccCCC
Confidence            467889999999988654    111  1123443322 2211    121 22333333 489999999864322   233


Q ss_pred             ceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          121 PVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       121 ~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      ...+|..++|+|+|||+|+|+.||+.+++.|
T Consensus        81 ~~~~i~~~~V~~~~rg~Gig~~l~~~~~~~a  111 (147)
T PTZ00330         81 CVGHIEDVVVDPSYRGQGLGRALISDLCEIA  111 (147)
T ss_pred             ceEEEEEEEECHHHcCCCHHHHHHHHHHHHH
Confidence            4678999999999999999999999999876


No 17 
>PRK07757 acetyltransferase; Provisional
Probab=99.04  E-value=2.1e-09  Score=77.54  Aligned_cols=91  Identities=10%  Similarity=0.080  Sum_probs=63.0

Q ss_pred             cCCCCCHHHHHHHHHHHHHhHHHHHHhC-CCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEe
Q psy838           48 KSTDLGAETKKWVWELEEKNMKHSYEVC-DIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCY  126 (152)
Q Consensus        48 ~~~~l~~~d~~~i~~l~~~~m~~~y~~~-~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~  126 (152)
                      ...+++++|++.+.++..+.    .... ..+++.+.....+    ..++++..+  +++|||+.+....+   ...+|.
T Consensus         3 ~ir~~~~~D~~~l~~l~~~~----~~~~~~~~~~~~~~~~~~----~~~~i~~~~--~~lvG~~~l~~~~~---~~~~i~   69 (152)
T PRK07757          3 EIRKARLSDVKAIHALINVY----AKKGLMLPRSLDELYENI----RDFYVAEEE--GEIVGCCALHILWE---DLAEIR   69 (152)
T ss_pred             eEeeCCcccHHHHHHHHHHH----HhcCCccCCCHHHHHhcc----CcEEEEEEC--CEEEEEEEEEeccC---CceEEE
Confidence            45678899999999987541    2110 1234443332222    245555554  89999999875322   245788


Q ss_pred             eeeeccccccCCHHHHHHHHHHhhc
Q psy838          127 ELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       127 ~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      .++|+|+|||+|+|+.||+.+++.|
T Consensus        70 ~v~V~p~~rg~Glg~~Ll~~l~~~a   94 (152)
T PRK07757         70 SLAVSEDYRGQGIGRMLVEACLEEA   94 (152)
T ss_pred             EEEECHHHcCCCHHHHHHHHHHHHH
Confidence            9999999999999999999999876


No 18 
>TIGR03448 mycothiol_MshD mycothiol biosynthesis acetyltransferase. Members of this family are MshD, the acetyltransferase that catalyzes the final step of mycothiol biosynthesis in various members of the Actinomyctes, Mycothiol replaces glutathione in these species.
Probab=99.03  E-value=3e-09  Score=84.62  Aligned_cols=95  Identities=17%  Similarity=0.120  Sum_probs=68.7

Q ss_pred             cCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCC---CceEEEEEeCCCCCeEEEEEEEeeccCCcceEE
Q psy838           48 KSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDD---RACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLY  124 (152)
Q Consensus        48 ~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~---~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~  124 (152)
                      +..+++++|++.+.+|....-  .+.. ..+|+++. ..++.++   ....+++..+  +++|||+.+....+.   ..+
T Consensus         2 ~~~~l~~~d~~~v~~L~~~~~--~~~~-~~~~~~~~-~~~~~~~~~~~~~~~~~~~~--~~~vG~~~~~~~~~~---~~~   72 (292)
T TIGR03448         2 WRAALDADLRRDVRELLAAAT--AVDG-VAPVSEQV-LRGLREPGAGHTRHLVAVDS--DPIVGYANLVPARGT---DPA   72 (292)
T ss_pred             ccccCCHHHHHHHHHHHHHHH--hcCC-CCCCCHHH-HhhccccCCCCceEEEEEEC--CEEEEEEEEEcCCCC---cce
Confidence            457899999999999987632  2222 34688765 4555432   2345555554  899999998754222   235


Q ss_pred             EeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          125 CYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       125 I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      |.+|+|+|++||+|||+.||+.+++.+
T Consensus        73 ~~~l~V~p~~rg~GiG~~Ll~~~~~~~   99 (292)
T TIGR03448        73 MAELVVHPAHRRRGIGRALIRALLAKG   99 (292)
T ss_pred             EEEEEECHhhcCCCHHHHHHHHHHHhc
Confidence            889999999999999999999998764


No 19 
>PRK09831 putative acyltransferase; Provisional
Probab=99.03  E-value=2.6e-09  Score=77.05  Aligned_cols=89  Identities=13%  Similarity=0.151  Sum_probs=61.5

Q ss_pred             CCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhh----------hhhcCCCceEEEEEeCCCCCeEEEEEEEeeccC
Q psy838           49 STDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKH----------SEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDF  118 (152)
Q Consensus        49 ~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~----------~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~  118 (152)
                      ..+++++|++.+.++..+.....+   ...++++...          ..... ...++++..+  |+++||+.+..    
T Consensus         3 ir~a~~~D~~~l~~l~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~~~~--~~iiG~~~~~~----   72 (147)
T PRK09831          3 IRNYQPGDFQQLCAIFIRAVTMTA---SQHYSPQQIAAWAQIDESRWKEKLA-KSQVRVAVIN--AQPVGFITCIE----   72 (147)
T ss_pred             cccCChhhHHHHHHHHHHHHHHhh---hhcCCHHHHHhccCCCHHHHHHHHh-cCceEEEEEC--CEEEEEEEehh----
Confidence            467889999999998877654433   1223332211          11112 3346666655  99999988731    


Q ss_pred             CcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          119 GEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       119 ~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                          .+|..++|+|++||+|+|+.||+.+++.|
T Consensus        73 ----~~i~~~~v~p~~~g~GiG~~Ll~~~~~~~  101 (147)
T PRK09831         73 ----HYIDMLFVDPEYTRRGVASALLKPLIKSE  101 (147)
T ss_pred             ----ceeeeEEECHHHcCCCHHHHHHHHHHHHh
Confidence                35788999999999999999999998865


No 20 
>TIGR02406 ectoine_EctA L-2,4-diaminobutyric acid acetyltransferase. This enzyme family is the EctA of ectoine biosynthesis. Ectoine is a compatible solute, analagous to trehalose, betaines, etc., found often in halotolerant organisms. EctA is L-2,4-diaminobutyric acid acetyltransferase, also called DABA acetyltransferase.
Probab=99.03  E-value=1.9e-09  Score=79.09  Aligned_cols=93  Identities=16%  Similarity=0.237  Sum_probs=60.7

Q ss_pred             CCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcC-CCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeee
Q psy838           50 TDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFD-DRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYEL  128 (152)
Q Consensus        50 ~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~-~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i  128 (152)
                      .+++++|++.+.+|..+.    +   ...++.........+ .....+++..+ ++++|||+.+..... ....++++.|
T Consensus         2 R~~~~~D~~~i~~L~~~~----~---~~~~~~~~~~~~~~~~~~~~~~v~~~~-~~~ivG~~~~~~~~~-~~~~~~i~~l   72 (157)
T TIGR02406         2 RPPRIEDGAGIWELVKDC----P---PLDLNSSYAYLLLCTDFADTSIVAESE-GGEIVGFVSGYLRPD-RPDVLFVWQV   72 (157)
T ss_pred             CCCccccHHHHHHHHHhC----C---CCCcccceehhhhhhhcCCcEEEEEcC-CCeEEEEEEEEecCC-CCCeEEEEEE
Confidence            467889999999998763    1   111111111111111 22334454433 479999997654332 2345789999


Q ss_pred             eeccccccCCHHHHHHHHHHhhc
Q psy838          129 QLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       129 ~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      +|+|++||+|+|++||+.+++.|
T Consensus        73 ~V~p~~rg~GiG~~L~~~l~~~a   95 (157)
T TIGR02406        73 AVDPRARGKGLARRLLEALLERV   95 (157)
T ss_pred             EEChHhccCcHHHHHHHHHHHHH
Confidence            99999999999999999999875


No 21 
>PRK03624 putative acetyltransferase; Provisional
Probab=99.00  E-value=2.4e-09  Score=75.00  Aligned_cols=91  Identities=16%  Similarity=0.211  Sum_probs=62.0

Q ss_pred             CCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhh--hhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEe
Q psy838           49 STDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQG--KHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCY  126 (152)
Q Consensus        49 ~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~--~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~  126 (152)
                      ..+++++|++.+.++....      ....+|.+..  ....+..+...++++..+  +++||++.+.....    ..++.
T Consensus         5 ir~~~~~d~~~i~~l~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~--~~~vG~~~~~~~~~----~~~i~   72 (140)
T PRK03624          5 IRVFRQADFEAVIALWERC------DLTRPWNDPEMDIERKLNHDPSLFLVAEVG--GEVVGTVMGGYDGH----RGWAY   72 (140)
T ss_pred             EEEcccccHHHHHHHHHhc------CCCcchhhHHHHHHHHhcCCCceEEEEEcC--CcEEEEEEeeccCC----CceEE
Confidence            4567889999999887652      1123454332  222233444555666544  89999998764321    24578


Q ss_pred             eeeeccccccCCHHHHHHHHHHhhc
Q psy838          127 ELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       127 ~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      .++|+|+|||+|+|+.||+.++..|
T Consensus        73 ~i~v~p~~rg~Gig~~ll~~~~~~~   97 (140)
T PRK03624         73 YLAVHPDFRGRGIGRALVARLEKKL   97 (140)
T ss_pred             EEEECHHHhCCCHHHHHHHHHHHHH
Confidence            8999999999999999999998865


No 22 
>PF00583 Acetyltransf_1:  Acetyltransferase (GNAT) family;  InterPro: IPR000182 The N-acetyltransferases (NAT) (EC 2.3.1.-) are enzymes that use acetyl coenzyme A (CoA) to transfer an acetyl group to a substrate, a reaction implicated in various functions from bacterial antibiotic resistance to mammalian circadian rhythm and chromatin remodeling. The Gcn5-related N-acetyltransferases (GNAT) catalyze the transfer of the acetyl from the CoA donor to a primary amine of the acceptor. The GNAT proteins share a domain composed of four conserved sequence motifs A-D [, ]. This GNAT domain is named after yeast GCN5 (from General Control Nonrepressed) and related histone acetyltransferases (HATs) like Hat1 and PCAF. HATs acetylate lysine residues of amino terminal histone tails, resulting in transcription activation. Another category of GNAT, the aminoglycoside N-acetyltransferases, confer antibiotic resistance by catalyzing the acetylation of amino groups in aminoglycoside antibiotics []. GNAT proteins can also have anabolic and catabolic functions in both prokaryotes and eukaryotes [, , , , ]. The acetyltransferase/GNAT domain forms a structurally conserved fold of 6 to 7 beta strands (B) and 4 helices (H) in the topology B1-H1-H2-B2-B3-B4-H3-B5-H4-B6, followed by a C-terminal strand which may be from the same monomer or contributed by another [, ]. Motifs D (B2-B3), A (B4-H3) and B (B5-H4) are collectively called the HAT core [, , ], while the N-terminal motif C (B1-H1) is less conserved. Some proteins known to contain a GNAT domain:   Yeast GCN5 and Hat1, which are histone acetyltransferases (EC 2.3.1.48). Human PCAF, a histone acetyltransferase. Mammalian serotonin N-acetyltransferase (SNAT) or arylalkylamine NAT (AANAT), which acetylates serotonin into a circadian neurohormone that may participate in light-dark rhythms, and human mood and behavior. Mammalian glucosamine 6-phosphate N-acetyltransferase (GNA1) (EC 2.3.1.4). Escherichia coli rimI and rimJ, which acetylate the N-terminal alanine of ribosomal proteins S18 and S5, respectively (EC 2.3.1.128). Mycobacterium tuberculosis aminoglycoside 2'-N-acetyltransferase (aac), which acetylates the 2' hydroxyl or amino group of a broad spectrum of aminoglycoside antibiotics. Bacillus subtilis bltD and paiA, which acetylate spermine and spermidine.  This entry represents the entire GNAT domain.; GO: 0008080 N-acetyltransferase activity, 0008152 metabolic process; PDB: 3T9Y_A 2R7H_B 2OZH_A 1Y9W_B 1VKC_B 2OH1_C 3R9E_B 3R9G_B 3R9F_A 3R96_A ....
Probab=99.00  E-value=2.3e-09  Score=69.28  Aligned_cols=49  Identities=18%  Similarity=0.274  Sum_probs=43.1

Q ss_pred             CCCeEEEEEEEeeccC--CcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          103 SSTPVAFSHFRFDVDF--GEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       103 ~~~ivGf~~~~~~~~~--~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      +|++||++.+.+..+.  ....++|..|+|+|+|||+|+|+.||+.+++.|
T Consensus         4 ~~~ivg~~~~~~~~~~~~~~~~~~i~~~~v~~~~r~~Gig~~L~~~~~~~~   54 (83)
T PF00583_consen    4 DGQIVGFASLRPPPEPFDHGNHAYIHRLAVDPEYRGQGIGSKLLQAAEEWA   54 (83)
T ss_dssp             TTEEEEEEEEEEEETTTTTTTEEEEEEEEECGGGTTSSHHHHHHHHHHHHH
T ss_pred             CCEEEEEEEEEECCCccccCCEEEEEEEEEcHHHhhCCCchhhhhhhhhhH
Confidence            4999999999886543  246899999999999999999999999999876


No 23 
>PRK09491 rimI ribosomal-protein-alanine N-acetyltransferase; Provisional
Probab=98.98  E-value=5e-09  Score=75.11  Aligned_cols=90  Identities=16%  Similarity=0.278  Sum_probs=63.5

Q ss_pred             cCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEee
Q psy838           48 KSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYE  127 (152)
Q Consensus        48 ~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~  127 (152)
                      ...+++++|++.+.++...    .+   ..+|+.+.+.... .+....+.+..+  +++|||+.+....+    ...+..
T Consensus         3 ~iR~~~~~D~~~l~~l~~~----~~---~~~~~~~~~~~~~-~~~~~~~~~~~~--~~~vG~~~~~~~~~----~~~~~~   68 (146)
T PRK09491          3 TISSLTPADLPAAYHIEQR----AH---AFPWSEKTFASNQ-GERYLNLKLTVN--GQMAAFAITQVVLD----EATLFN   68 (146)
T ss_pred             chhcCChhhhHHHHHHHHh----cC---CCCCCHHHHHHHH-hcCceEEEEEEC--CeEEEEEEEEeecC----ceEEEE
Confidence            4567899999999998543    11   4678776554332 222222233444  89999998865432    245788


Q ss_pred             eeeccccccCCHHHHHHHHHHhhc
Q psy838          128 LQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       128 i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      |+|+|++||+|+|+.||+.+++.+
T Consensus        69 i~v~~~~rg~G~g~~ll~~~~~~~   92 (146)
T PRK09491         69 IAVDPDYQRQGLGRALLEHLIDEL   92 (146)
T ss_pred             EEECHHHccCCHHHHHHHHHHHHH
Confidence            999999999999999999998865


No 24 
>TIGR01890 N-Ac-Glu-synth amino-acid N-acetyltransferase. This model represents a clade of amino-acid N-acetyltransferases acting mainly on glutamate in the first step of the "acetylated" ornithine biosynthesis pathway. For this reason it is also called N-acetylglutamate synthase. The enzyme may also act on aspartate.
Probab=98.98  E-value=3.6e-09  Score=89.33  Aligned_cols=93  Identities=8%  Similarity=-0.015  Sum_probs=66.8

Q ss_pred             cccCCCCCHHHHHHHHHHHHHhHHHHHHhCCC--CCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceE
Q psy838           46 YKKSTDLGAETKKWVWELEEKNMKHSYEVCDI--GWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVL  123 (152)
Q Consensus        46 ~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~--~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~  123 (152)
                      +....+++++|++.+++|...    ..+. .+  +|+.+.+.    ++...++++..+  ++++||+.+.+..++  ...
T Consensus       282 y~~IR~at~~Dl~~I~~L~~~----~~~~-~~~~~~~~~~l~----~~~~~~~V~~~d--g~iVG~~~~~~~~~~--~~~  348 (429)
T TIGR01890       282 FESIRQATIDDIGGIAALIRP----LEEQ-GILVRRSREYLE----REISEFSIIEHD--GNIIGCAALYPYAEE--DCG  348 (429)
T ss_pred             hhheEECCHHHHHHHHHHHHH----HHHc-CCchhhhHHHHH----hhcCcEEEEEEC--CEEEEEEEEEecCCC--CeE
Confidence            445677899999999999753    2222 22  34444433    333345566555  899999998764332  356


Q ss_pred             EEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          124 YCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       124 ~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      +|..|+|+|+|||+|+|++||+.++++|
T Consensus       349 ~I~~l~V~p~~Rg~GiG~~Ll~~l~~~A  376 (429)
T TIGR01890       349 EMACLAVSPEYQDGGRGERLLAHIEDRA  376 (429)
T ss_pred             EEEEEEECHHHcCCCHHHHHHHHHHHHH
Confidence            7999999999999999999999999886


No 25 
>PF13508 Acetyltransf_7:  Acetyltransferase (GNAT) domain; PDB: 3EY5_A 3FRM_B 3D8P_B 3GY9_A 3GYA_A 3S6F_A 2Q7B_A 1CM0_B 1XEB_B 1Y7R_A ....
Probab=98.96  E-value=4.7e-09  Score=68.05  Aligned_cols=53  Identities=19%  Similarity=0.316  Sum_probs=42.2

Q ss_pred             ceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838           93 ACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus        93 ~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      +.++++.++  +++||++.+....+    ..+|..|+|+|++||+|+|+.||+.+.+.+
T Consensus         3 ~~~~~~~~~--~~ivG~~~~~~~~~----~~~i~~~~v~~~~rg~Gig~~ll~~~~~~~   55 (79)
T PF13508_consen    3 ERFFVAEDD--GEIVGFIRLWPNED----FAYIGYLAVDPEYRGKGIGSKLLNYLLEKA   55 (79)
T ss_dssp             EEEEEEEET--TEEEEEEEEEETTT----EEEEEEEEE-GGGTTSSHHHHHHHHHHHHH
T ss_pred             cEEEEEEEC--CEEEEEEEEEEcCC----EEEEEEEEECHHHcCCCHHHHHHHHHHHHc
Confidence            345566655  99999999965422    678999999999999999999999998764


No 26 
>PLN02706 glucosamine 6-phosphate N-acetyltransferase
Probab=98.94  E-value=8.4e-09  Score=74.17  Aligned_cols=98  Identities=16%  Similarity=0.102  Sum_probs=63.7

Q ss_pred             cCCCCCHHHHH-HHHHHHHHhHHHHHHhCCCCCChhhhhhhhc---C-CCceEE-EEEeCCCCCeEEEEEEEeec---cC
Q psy838           48 KSTDLGAETKK-WVWELEEKNMKHSYEVCDIGWDPQGKHSEMF---D-DRACYL-VAKNGSSSTPVAFSHFRFDV---DF  118 (152)
Q Consensus        48 ~~~~l~~~d~~-~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~---~-~~~~~~-v~~~~~~~~ivGf~~~~~~~---~~  118 (152)
                      +..+++++|++ .+..+..+    .. . ..+|+.+.+.+.+.   + +...++ ++..+.++++||++.+.+..   ..
T Consensus         8 ~ir~~~~~D~~~~~~~~~~~----~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ivG~~~~~~~~~~~~~   81 (150)
T PLN02706          8 KVRRLEISDKSKGFLELLQQ----LT-V-VGDVTEEEFEARFQELASLGDDHLICVIEDAASGRIIATGSVFVERKFIRN   81 (150)
T ss_pred             EEeEhhhcccchHHHHHHHh----cc-C-CCCCCHHHHHHHHHHHHhCCCcEEEEEEEeCCCCcEEEEEEEEEEeecccC
Confidence            35667788877 36665543    11 1 23677666655543   2 233333 33332248999999875321   12


Q ss_pred             CcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          119 GEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       119 ~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      .....+|..|+|+|+|||+|+|+.||+.++++|
T Consensus        82 ~~~~~~i~~i~V~~~~rg~GiG~~ll~~~~~~a  114 (150)
T PLN02706         82 CGKVGHIEDVVVDSAARGKGLGKKIIEALTEHA  114 (150)
T ss_pred             CCcEEEEEEEEECHHHcCCCHHHHHHHHHHHHH
Confidence            234567899999999999999999999999886


No 27 
>PRK07922 N-acetylglutamate synthase; Validated
Probab=98.93  E-value=6.3e-09  Score=77.33  Aligned_cols=93  Identities=8%  Similarity=0.089  Sum_probs=62.1

Q ss_pred             cCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEee
Q psy838           48 KSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYE  127 (152)
Q Consensus        48 ~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~  127 (152)
                      ...+++++|.+.+.+|..+..     ........ . ......+...++++..+ +++++||+.+....+   ....|..
T Consensus         7 ~iR~a~~~D~~~i~~L~~~~~-----~~~~~~~~-~-~~~~~~~~~~~~va~~~-~~~iiG~~~~~~~~~---~~~~i~~   75 (169)
T PRK07922          7 TVRRARTSDVPAIKRLVDPYA-----QGRILLEK-N-LVTLYEAVQEFWVAEHL-DGEVVGCGALHVMWE---DLAEIRT   75 (169)
T ss_pred             eeecCCHhhHHHHHHHHHHHh-----hcCccccc-h-HHHHHhhcCcEEEEEec-CCcEEEEEEEeecCC---CceEEEE
Confidence            456789999999999876421     11111111 1 11122233445566522 489999998865432   2466889


Q ss_pred             eeeccccccCCHHHHHHHHHHhhc
Q psy838          128 LQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       128 i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      ++|+|++||+|+|++||+.+++.|
T Consensus        76 l~V~p~~rgkGiG~~Ll~~~~~~a   99 (169)
T PRK07922         76 VAVDPAARGRGVGHAIVERLLDVA   99 (169)
T ss_pred             EEECHHHhCCCHHHHHHHHHHHHH
Confidence            999999999999999999999876


No 28 
>PF13527 Acetyltransf_9:  Acetyltransferase (GNAT) domain; PDB: 3SXN_C 2I00_D 1M4D_B 1M44_A 1M4G_B 1M4I_A 2OZG_A 2HV2_F 3N7Z_A 3RYO_B ....
Probab=98.93  E-value=1.9e-09  Score=75.32  Aligned_cols=92  Identities=15%  Similarity=0.196  Sum_probs=62.0

Q ss_pred             CCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChh---hhhhhhcCCCceEEEEEeCCCCCeEEEEEEEee--ccCC--cce
Q psy838           50 TDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQ---GKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFD--VDFG--EPV  122 (152)
Q Consensus        50 ~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~---~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~--~~~~--~~~  122 (152)
                      ..++++|++.+.+|...    .|...   .+..   .......++ +..+++.++  +++||.+.+.+.  ...+  -+.
T Consensus         3 R~~~~~d~~~i~~l~~~----~F~~~---~~~~~~~~~~~~~~~~-~~~~~~~~~--~~ivg~~~~~~~~~~~~g~~~~~   72 (127)
T PF13527_consen    3 RPLTESDFEQIIELFNE----AFGDS---ESPPEIWEYFRNLYGP-GRCVVAEDD--GKIVGHVGLIPRRLSVGGKKFKA   72 (127)
T ss_dssp             EEE-GGGHHHHHHHHHH----HTTT----CHHHHHHHHHHHHHHT-TEEEEEEET--TEEEEEEEEEEEEEEETTEEEEE
T ss_pred             eECCHHHHHHHHHHHHH----HCCCC---CCchhhhhhhhcccCc-CcEEEEEEC--CEEEEEEEEEEEEEEECCEEEEE
Confidence            45789999999999876    34321   1111   112223333 356666665  999999988653  1122  346


Q ss_pred             EEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          123 LYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       123 ~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      .+|..++|+|+|||+|+|++||+.+++.+
T Consensus        73 ~~i~~v~v~p~~R~~Gl~~~L~~~~~~~~  101 (127)
T PF13527_consen   73 AYIGDVAVDPEYRGRGLGRQLMRALLERA  101 (127)
T ss_dssp             EEEEEEEE-GGGTTSSHHHHHHHHHHHHH
T ss_pred             EEEEEEEECHHHcCCCHHHHHHHHHHHHH
Confidence            88999999999999999999999998765


No 29 
>PRK05279 N-acetylglutamate synthase; Validated
Probab=98.93  E-value=6.4e-09  Score=88.02  Aligned_cols=94  Identities=9%  Similarity=-0.026  Sum_probs=66.8

Q ss_pred             cccCCCCCHHHHHHHHHHHHHhHHHHHHhC-CCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEE
Q psy838           46 YKKSTDLGAETKKWVWELEEKNMKHSYEVC-DIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLY  124 (152)
Q Consensus        46 ~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~-~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~  124 (152)
                      +....+++++|++.+.+|...    ..... ...|+.+.+    .+....++++..+  ++++||+.+....+  ....+
T Consensus       294 y~~IR~at~~D~~~I~~L~~~----~~~~~~~~~~~~~~l----~~~~~~~~va~~d--g~iVG~~~~~~~~~--~~~~~  361 (441)
T PRK05279        294 LEQLRRATIDDVGGILELIRP----LEEQGILVRRSREQL----EREIDKFTVIERD--GLIIGCAALYPFPE--EKMGE  361 (441)
T ss_pred             hHHeEeCCHHHHHHHHHHHHH----HHHcCCccccCHHHH----hcccCcEEEEEEC--CEEEEEEEEEEcCC--CCeEE
Confidence            556678899999999999754    11110 123554433    3333345666655  89999998765322  23567


Q ss_pred             EeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          125 CYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       125 I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      |..|+|+|+|||+|+|++||+.++++|
T Consensus       362 I~~l~V~p~~Rg~GiG~~Ll~~l~~~a  388 (441)
T PRK05279        362 MACLAVHPDYRGSGRGERLLKRIEQRA  388 (441)
T ss_pred             EEEEEECHHHcCCCHHHHHHHHHHHHH
Confidence            999999999999999999999999876


No 30 
>PRK10140 putative acetyltransferase YhhY; Provisional
Probab=98.93  E-value=5.6e-09  Score=75.41  Aligned_cols=98  Identities=9%  Similarity=0.070  Sum_probs=60.0

Q ss_pred             CCCCCHHHHHHHHHHHHHhHHHHHHhC-CCC-CChhhhhhhhcC-CCceEEEEEeCCCCCeEEEEEEEeecc-CCcceEE
Q psy838           49 STDLGAETKKWVWELEEKNMKHSYEVC-DIG-WDPQGKHSEMFD-DRACYLVAKNGSSSTPVAFSHFRFDVD-FGEPVLY  124 (152)
Q Consensus        49 ~~~l~~~d~~~i~~l~~~~m~~~y~~~-~~~-w~~~~~~~el~~-~~~~~~v~~~~~~~~ivGf~~~~~~~~-~~~~~~~  124 (152)
                      ..+++++|++.+.++..+..  .+... ..+ .+++.....+.+ +....+++..+  +++||++++..... .......
T Consensus         6 lr~~~~~D~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~--~~~vG~~~~~~~~~~~~~~~~~   81 (162)
T PRK10140          6 IRHAETRDYEAIRQIHAQPE--VYHNTLQVPHPSDHMWQERLADRPGIKQLVACID--GDVVGHLTIDVQQRPRRSHVAD   81 (162)
T ss_pred             EEecchhhHHHHHHHHhCcc--cccccccCCCcCHHHHHHHhhcCCCcEEEEEEEC--CEEEEEEEEecccccccceEEE
Confidence            46678899999999875421  11110 111 233333333433 23334455444  89999999875321 1111222


Q ss_pred             EeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          125 CYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       125 I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                       ..++|+|+|||+|+|+.||+.++++|
T Consensus        82 -~~~~v~p~~rg~Gig~~ll~~l~~~~  107 (162)
T PRK10140         82 -FGICVDSRWKNRGVASALMREMIEMC  107 (162)
T ss_pred             -EEEEECHHHcCCCHHHHHHHHHHHHH
Confidence             35899999999999999999998875


No 31 
>PLN02825 amino-acid N-acetyltransferase
Probab=98.92  E-value=7.1e-09  Score=89.29  Aligned_cols=94  Identities=9%  Similarity=0.049  Sum_probs=69.5

Q ss_pred             cccCCCCCHHHHHHHHHHHHHhHHHHHHhC-CCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEE
Q psy838           46 YKKSTDLGAETKKWVWELEEKNMKHSYEVC-DIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLY  124 (152)
Q Consensus        46 ~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~-~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~  124 (152)
                      +-...+++++|++.|.+|.+..    .+.. ...|+.+.+..++    ..++++..+  +++||++.+.+..+  ....+
T Consensus       367 ~e~IR~At~eDi~~I~~Li~~l----ee~g~lv~rs~e~le~ei----~~f~V~e~D--g~IVG~aal~~~~~--~~~aE  434 (515)
T PLN02825        367 YEGTRMARVEDLAGIRQIIRPL----EESGILVRRTDEELLRAL----DSFVVVERE--GSIIACAALFPFFE--EKCGE  434 (515)
T ss_pred             HhhheeCCHHHHHHHHHHHHHH----HHcCCCcCCCHHHHHhcC----CcEEEEEEC--CEEEEEEEEEeecC--CCcEE
Confidence            5556789999999999998752    2221 1245555554443    346666655  99999998765433  23567


Q ss_pred             EeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          125 CYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       125 I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      |..|+|+|+|||+|+|++||+.+++.|
T Consensus       435 I~~laV~P~yRGkGiG~~LL~~le~~A  461 (515)
T PLN02825        435 VAAIAVSPECRGQGQGDKLLDYIEKKA  461 (515)
T ss_pred             EEEEEECHHHcCCCHHHHHHHHHHHHH
Confidence            999999999999999999999999886


No 32 
>TIGR03103 trio_acet_GNAT GNAT-family acetyltransferase TIGR03103. Members of this protein family belong to the GNAT family of acetyltransferases. Each is part of a conserved three-gene cassette sparsely distributed across at least twenty different species known so far, including alpha, beta, and gamma Proteobacteria, Mycobacterium, and Prosthecochloris, which is a member of the Chlorobi. The other two members of the cassette are a probable protease and an asparagine synthetase family protein.
Probab=98.92  E-value=1e-08  Score=89.16  Aligned_cols=92  Identities=14%  Similarity=0.054  Sum_probs=68.5

Q ss_pred             CHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEee---ccCCcceEEEeeee
Q psy838           53 GAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFD---VDFGEPVLYCYELQ  129 (152)
Q Consensus        53 ~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~---~~~~~~~~~I~~i~  129 (152)
                      +++|++++.+|..+.    .   ..+|+.+....++..+...++++..+.+|++|||+.+...   ..+.....+|..|+
T Consensus        90 ~~~D~~~I~~L~~~~----~---~~p~~~~~~~~~~~~~~~~~~vA~~~~~g~IVG~~~~~~~~~~~~d~~~~~~i~~l~  162 (547)
T TIGR03103        90 GPADVDAINRLYAAR----G---MVPVRVDFVLDHRHSRAITYLVAEDEASGAIIGTVMGVDHRKAFNDPEHGSSLWCLA  162 (547)
T ss_pred             ChhHHHHHHHHHHhc----C---CCCCCHHHHHHHhcCCCceEEEEEECCCCeEEEEEEEEeccccccCCCCCeEEEEEE
Confidence            678999999987652    1   2467777776666666667777765335899999975321   22333456799999


Q ss_pred             eccccccCCHHHHHHHHHHhhc
Q psy838          130 LEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       130 V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      |+|+|||+|+|++||+.+++.+
T Consensus       163 V~P~~Rg~GIG~~Ll~~l~e~a  184 (547)
T TIGR03103       163 VDPQAAHPGVGEALVRALAEHF  184 (547)
T ss_pred             ECHHHcCCCHHHHHHHHHHHHH
Confidence            9999999999999999999875


No 33 
>PRK10514 putative acetyltransferase; Provisional
Probab=98.91  E-value=1.2e-08  Score=72.75  Aligned_cols=94  Identities=13%  Similarity=0.069  Sum_probs=60.7

Q ss_pred             cCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhc--CCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEE
Q psy838           48 KSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMF--DDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYC  125 (152)
Q Consensus        48 ~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~--~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I  125 (152)
                      ...+++++|.+.++++..++....... ..+++.+.....+.  .+...++++..+ ++++|||+.+..        .++
T Consensus         3 ~ir~~~~~D~~~l~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~iG~~~~~~--------~~~   72 (145)
T PRK10514          3 SIRRSRHEEGERLVAIWRRSVDATHDF-LSAEDRAEIEELVRSFLPEAPLWVAVDE-RDQPVGFMLLSG--------GHM   72 (145)
T ss_pred             eeeecchhhHHHHHHHHHHHHHHhCcc-cCchhHHHHHHHHHHHhccCceEEEEec-CCcEEEEEEEec--------CcE
Confidence            346788999999999887654322110 11334433333222  123334444332 489999998742        125


Q ss_pred             eeeeeccccccCCHHHHHHHHHHhhc
Q psy838          126 YELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       126 ~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      ..++|+|+|||+|+|+.||+.+++.+
T Consensus        73 ~~~~v~p~~rgkGig~~Ll~~~~~~~   98 (145)
T PRK10514         73 EALFVDPDVRGCGVGRMLVEHALSLH   98 (145)
T ss_pred             eEEEECHHhccCCHHHHHHHHHHHhc
Confidence            67999999999999999999998754


No 34 
>COG0456 RimI Acetyltransferases [General function prediction only]
Probab=98.85  E-value=1.5e-08  Score=74.29  Aligned_cols=98  Identities=14%  Similarity=0.243  Sum_probs=68.5

Q ss_pred             CCCCCHHHHH--HHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCC-CC----CeEEEEEEEeeccCC--
Q psy838           49 STDLGAETKK--WVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGS-SS----TPVAFSHFRFDVDFG--  119 (152)
Q Consensus        49 ~~~l~~~d~~--~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~-~~----~ivGf~~~~~~~~~~--  119 (152)
                      ...+...|+.  .++++...    .|... .+|+...+...+.+....++++..+. ++    +++||+..+...+..  
T Consensus        14 ir~~~~~d~~~~~~~~~~~~----~~~~~-~~~~~~~~~~~l~~~~~~~~v~~~~~~~~~~~~~~~G~~~~~~~~~~~~~   88 (177)
T COG0456          14 IREAINKDLLDVALAALEAR----TFDIR-LPWSREYFEKDLTQAPELLLVAETGGLDGLLDGKVVGFLLVRVVDGRPSA   88 (177)
T ss_pred             hhhhhhcccchHHHHHHhhh----cCCCC-CcchHHHHHHHHhhCcceeEEEEecccCCCcccceeEEEEEEEecCCccc
Confidence            3455666666  66666554    23110 47999998888888777777766420 12    599999997443321  


Q ss_pred             cceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          120 EPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       120 ~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      ....+|..|+|+|+|||+|||+.||+.+++.+
T Consensus        89 ~~~~~i~~iaV~p~~r~~Gig~~Ll~~~~~~~  120 (177)
T COG0456          89 DHEGHIYNLAVDPEYRGRGIGRALLDEALERL  120 (177)
T ss_pred             cCccEEEEEEEChHhhcCCHHHHHHHHHHHHH
Confidence            12467999999999999999999999887654


No 35 
>PRK12308 bifunctional argininosuccinate lyase/N-acetylglutamate synthase; Provisional
Probab=98.78  E-value=3.6e-08  Score=86.73  Aligned_cols=92  Identities=12%  Similarity=0.020  Sum_probs=64.4

Q ss_pred             cCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEee
Q psy838           48 KSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYE  127 (152)
Q Consensus        48 ~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~  127 (152)
                      ...+++++|++.+.+|...- ...  ....+|+.+.    +..+...++++..+  +++|||+.+... +.  ...+|..
T Consensus       465 ~IR~a~~~D~~~I~~L~~~~-~~~--~~~~~~~~~~----l~~~~~~~~Va~~~--g~IVG~~~l~~~-~~--~~~~I~~  532 (614)
T PRK12308        465 KVRPARLTDIDAIEGMVAYW-AGL--GENLPRSRNE----LVRDIGSFAVAEHH--GEVTGCASLYIY-DS--GLAEIRS  532 (614)
T ss_pred             EEEECCHHHHHHHHHHHHHH-Hhh--hcccccCHHH----HhcccCcEEEEEEC--CEEEEEEEEEEc-CC--CeEEEEE
Confidence            35678899999999987541 111  1124455443    33333455666555  899999988653 22  2467999


Q ss_pred             eeeccccccCCHHHHHHHHHHhhc
Q psy838          128 LQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       128 i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      |+|+|+|||+|||+.||+.+++.|
T Consensus       533 i~V~P~~rGkGIGk~Ll~~l~~~a  556 (614)
T PRK12308        533 LGVEAGWQVQGQGSALVQYLVEKA  556 (614)
T ss_pred             EEECHHHcCCCHHHHHHHHHHHHH
Confidence            999999999999999999999876


No 36 
>TIGR01686 FkbH FkbH-like domain. The C-terminal portion of this domain is unique to this family (by BLAST).
Probab=98.77  E-value=7.8e-08  Score=78.25  Aligned_cols=97  Identities=11%  Similarity=0.131  Sum_probs=71.9

Q ss_pred             cCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeC--CCCCeEEEEEEEeeccCCcceEEE
Q psy838           48 KSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNG--SSSTPVAFSHFRFDVDFGEPVLYC  125 (152)
Q Consensus        48 ~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~--~~~~ivGf~~~~~~~~~~~~~~~I  125 (152)
                      +...++++|++.+.+|....  ..|......|+++.+...+.++ ..+.+...+  .++.+|||+.+....    ..++|
T Consensus       188 ~Ir~a~~~Dl~ri~~L~~~t--nqfn~~~~~~s~~~i~~~l~~~-~~~~~~~~d~~gd~givG~~~~~~~~----~~~~I  260 (320)
T TIGR01686       188 NISKNDEQNVQRVEELLGRT--NQFNATYTRLNQEDVAQHMQKE-EIVTVSMSDRFGDSGIIGIFVFEKKE----GNLFI  260 (320)
T ss_pred             EEEECChhhhHHHHHHHHhH--HhhhccCccCCHHHHHHHhcCC-CEEEEEEEecCCCCceEEEEEEEecC----CcEEE
Confidence            46788999999999998653  2343334578988887777665 333333321  146899999886532    25679


Q ss_pred             eeeeeccccccCCHHHHHHHHHHhhc
Q psy838          126 YELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       126 ~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      ..|+|+|++||+|+|+.||+.+++.|
T Consensus       261 ~~l~vs~r~~grGig~~Ll~~l~~~a  286 (320)
T TIGR01686       261 DDLCMSCRALGRGVETRMLRWLFEQA  286 (320)
T ss_pred             EEEEEcHhHhcCcHHHHHHHHHHHHH
Confidence            99999999999999999999999876


No 37 
>COG1247 Sortase and related acyltransferases [Cell envelope biogenesis, outer membrane]
Probab=98.74  E-value=9.5e-08  Score=71.35  Aligned_cols=101  Identities=14%  Similarity=0.084  Sum_probs=62.5

Q ss_pred             CCCCCHHHHHHHHHHHHHhHHH---HHHhCCCCCChhhhhhhh--cCCCc-eEEEEEeCCCCCeEEEEEEEeecc-CCcc
Q psy838           49 STDLGAETKKWVWELEEKNMKH---SYEVCDIGWDPQGKHSEM--FDDRA-CYLVAKNGSSSTPVAFSHFRFDVD-FGEP  121 (152)
Q Consensus        49 ~~~l~~~d~~~i~~l~~~~m~~---~y~~~~~~w~~~~~~~el--~~~~~-~~~v~~~~~~~~ivGf~~~~~~~~-~~~~  121 (152)
                      ..+++..|++.|.++.......   .|+.  ++=+.+....-+  ...+. .++++..+ +|+++||+.+..-.+ .+=.
T Consensus         4 ir~~~~~Dl~~I~~IY~~~v~~~~a~~e~--~~~~~~~~~~~~~~~~~~g~p~~V~~~~-~g~v~G~a~~~~fr~r~ay~   80 (169)
T COG1247           4 IRPATAADLEAILEIYNGAVENTAATFEE--DPVSLEERAAWFSGRTRDGYPVVVAEEE-DGKVLGYASAGPFRERPAYR   80 (169)
T ss_pred             EecChHHhHHHHHHHHHHhhhcceEEEec--cCCCHHHHHHHHHhcccCCceEEEEEcC-CCeEEEEEEeeeccCccccc
Confidence            3567889999999987763211   1111  111222222112  22333 44444433 589999999875433 2222


Q ss_pred             eEEEeeeeeccccccCCHHHHHHHHHHhhcC
Q psy838          122 VLYCYELQLEKQVQRKGLGKFMMQAFYTYID  152 (152)
Q Consensus       122 ~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a~  152 (152)
                      ...-+-|||+|+.||+|+|++||+.+++.|.
T Consensus        81 ~tve~SiYv~~~~~g~GiG~~Ll~~Li~~~~  111 (169)
T COG1247          81 HTVELSIYLDPAARGKGLGKKLLQALITEAR  111 (169)
T ss_pred             eEEEEEEEECcccccccHHHHHHHHHHHHHH
Confidence            3345679999999999999999999998763


No 38 
>TIGR02382 wecD_rffC TDP-D-fucosamine acetyltransferase. This model represents the WecD protein (Formerly RffC) for the biosynthesis of enterobacterial common antigen (ECA), an outer leaflet, outer membrane glycolipid with a trisaccharide repeat unit. WecD is a member of the GNAT family of acetytransferases (pfam00583).
Probab=98.73  E-value=1.2e-07  Score=71.38  Aligned_cols=99  Identities=13%  Similarity=0.132  Sum_probs=64.1

Q ss_pred             cccCCCCCHHHHHHHHHHHHHhHHH-HHHhCCCCCChhhh--------hhhhc-CC-CceEEEEEeCCCCCeEEEEEEEe
Q psy838           46 YKKSTDLGAETKKWVWELEEKNMKH-SYEVCDIGWDPQGK--------HSEMF-DD-RACYLVAKNGSSSTPVAFSHFRF  114 (152)
Q Consensus        46 ~~~~~~l~~~d~~~i~~l~~~~m~~-~y~~~~~~w~~~~~--------~~el~-~~-~~~~~v~~~~~~~~ivGf~~~~~  114 (152)
                      -....+++++|++.+.++..+.... .|   ..+|.+...        ..+.. +. ....+++..+ ++++|||+.+..
T Consensus        43 ~~~lR~~~~~D~~~l~~l~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~g~iiG~i~l~~  118 (191)
T TIGR02382        43 DPGARVATETDIPALRQLASAAFALSRF---RAPWYAPDDSGRFYAQWVENAVRGTFDHQCLILRDA-SGDPRGYVTLRE  118 (191)
T ss_pred             CCcceeCChhhHHHHHHHHHHHhhcccc---CCCCcCHHHHHHHHHHHHHHHhcCCCCCeEEEEEcc-CCeEEEEEEEEe
Confidence            3467889999999999997764211 11   224432110        01121 22 2222333333 489999999865


Q ss_pred             eccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          115 DVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       115 ~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      ..+   +..+|..++|+|++||+|+|+.||+.++++|
T Consensus       119 ~~~---~~~~i~~l~V~p~~rGkG~G~~ll~~~~~~a  152 (191)
T TIGR02382       119 LND---TDARIGLLAVFPGAQSRGIGAELMQTALNWC  152 (191)
T ss_pred             cCC---CceEEEEEEECHHHcCCCHHHHHHHHHHHHH
Confidence            322   2356889999999999999999999999886


No 39 
>PRK10975 TDP-fucosamine acetyltransferase; Provisional
Probab=98.70  E-value=2.1e-07  Score=70.07  Aligned_cols=97  Identities=14%  Similarity=0.126  Sum_probs=62.3

Q ss_pred             cCCCCCHHHHHHHHHHHHHhHHH-HHHhCCCCCChhh----hhhh-----hcCC-CceEEEEEeCCCCCeEEEEEEEeec
Q psy838           48 KSTDLGAETKKWVWELEEKNMKH-SYEVCDIGWDPQG----KHSE-----MFDD-RACYLVAKNGSSSTPVAFSHFRFDV  116 (152)
Q Consensus        48 ~~~~l~~~d~~~i~~l~~~~m~~-~y~~~~~~w~~~~----~~~e-----l~~~-~~~~~v~~~~~~~~ivGf~~~~~~~  116 (152)
                      ...+++++|++.+.++..+.... .|   ..+|....    ...+     +.+. ...++++..+ ++++|||+.+....
T Consensus        48 ~iR~a~~~D~~~i~~l~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~-~g~~vG~~~l~~~~  123 (194)
T PRK10975         48 GARVATETDIPALRQLAAQAFAQSRF---RAPWYAPDDSGRFYAQWIENAVRGTFDHQCLLLRDA-SGQIQGFVTLRELN  123 (194)
T ss_pred             CcccCCcccHHHHHHHHHHHhhhccc---cCccCChhHHHHHHHHHHHHhhccccCCcEEEEEcC-CCCEEEEEEEEecC
Confidence            34578889999999987653211 12   12342221    1111     1111 2244455433 58999999986532


Q ss_pred             cCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          117 DFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       117 ~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      +   ...+|..++|+|+|||+|+|+.||+.++++|
T Consensus       124 ~---~~~~i~~~~V~p~~rg~Gig~~Ll~~~~~~a  155 (194)
T PRK10975        124 D---TDARIGLLAVFPGAQGRGIGARLMQAALNWC  155 (194)
T ss_pred             C---CceEEEEEEEChhhcCCCHHHHHHHHHHHHH
Confidence            2   2467888999999999999999999999876


No 40 
>COG3153 Predicted acetyltransferase [General function prediction only]
Probab=98.69  E-value=1e-07  Score=71.29  Aligned_cols=93  Identities=12%  Similarity=0.030  Sum_probs=67.4

Q ss_pred             CCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCC----ceEEEEEeCCCCCeEEEEEEEeec-c-CCcce
Q psy838           49 STDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDR----ACYLVAKNGSSSTPVAFSHFRFDV-D-FGEPV  122 (152)
Q Consensus        49 ~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~----~~~~v~~~~~~~~ivGf~~~~~~~-~-~~~~~  122 (152)
                      ....++.|++.+.++.++.    |.    ++.+......+....    ..-+++.++  |++||.+.|+... . .....
T Consensus         6 ir~e~~~d~~~i~~~~~~a----F~----~~~e~~~v~~lR~~~~~~~~LslVA~d~--g~vvG~Il~s~v~~~g~~~~~   75 (171)
T COG3153           6 IRTETPADIPAIEALTREA----FG----PGREAKLVDKLREGGRPDLTLSLVAEDD--GEVVGHILFSPVTVGGEELGW   75 (171)
T ss_pred             EEecChhhHHHHHHHHHHH----hh----cchHHHHHHHHHhcCCcccceeEEEeeC--CEEEEEEEEeEEEecCcccce
Confidence            4567889999999998874    42    455555555555432    334455544  9999999998643 2 23445


Q ss_pred             EEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          123 LYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       123 ~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      +-|--|+|+|++||||||+.||...++.+
T Consensus        76 ~~LaPLaV~p~~qg~GIG~~Lvr~~le~a  104 (171)
T COG3153          76 LGLAPLAVDPEYQGQGIGSALVREGLEAL  104 (171)
T ss_pred             EEEEeEEEchhhcCCcHHHHHHHHHHHHH
Confidence            66888999999999999999999988765


No 41 
>PRK01346 hypothetical protein; Provisional
Probab=98.68  E-value=2e-07  Score=77.88  Aligned_cols=94  Identities=13%  Similarity=-0.003  Sum_probs=63.9

Q ss_pred             cCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhh--hcCCCceEEEEEeCCCCCeEEEEEEEeec---cCC--c
Q psy838           48 KSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSE--MFDDRACYLVAKNGSSSTPVAFSHFRFDV---DFG--E  120 (152)
Q Consensus        48 ~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~e--l~~~~~~~~v~~~~~~~~ivGf~~~~~~~---~~~--~  120 (152)
                      +..+++++|++.+.+|...    .|..   .++++.....  ...+ ...+++..+  +++||++.+....   ..+  .
T Consensus         8 ~iR~~~~~D~~~i~~L~~~----~f~~---~~~~~~~~~~~~~~~~-~~~~va~~~--~~lvg~~~~~~~~~~~~~~~~~   77 (411)
T PRK01346          8 TIRTATEEDWPAWFRAAAT----GFGD---SPSDEELEAWRALVEP-DRTLGAFDG--DEVVGTAGAFDLRLTVPGGAVL   77 (411)
T ss_pred             eeecCCHHHHHHHHHHHHH----HcCC---CCChHHHHHHHHhcCc-CCeEEEEEC--CEEEEEEEEeccccccCCCCcc
Confidence            3567789999999999754    3422   2233222111  1222 334556654  8999999876421   112  3


Q ss_pred             ceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          121 PVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       121 ~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      +..+|..++|+|+|||+|+|+.||+.+++.+
T Consensus        78 ~~~~i~~v~V~P~~RgrGig~~Ll~~~l~~a  108 (411)
T PRK01346         78 PAAGVTAVTVAPTHRRRGLLTALMREQLRRI  108 (411)
T ss_pred             ceeEEEEEEEChhhcCCCHHHHHHHHHHHHH
Confidence            6788999999999999999999999998865


No 42 
>cd04301 NAT_SF N-Acyltransferase superfamily: Various enzymes that characteristically catalyze the transfer of an acyl group to a substrate. NAT (N-Acyltransferase) is a large superfamily of enzymes that mostly catalyze the transfer of an acyl group to a substrate and are implicated in a variety of functions, ranging from bacterial antibiotic resistance to circadian rhythms in mammals. Members include GCN5-related N-Acetyltransferases (GNAT) such as Aminoglycoside N-acetyltransferases, Histone N-acetyltransferase (HAT) enzymes, and Serotonin N-acetyltransferase, which catalyze the transfer of an acetyl group to a substrate. The kinetic mechanism of most GNATs involves the ordered formation of a ternary complex: the reaction begins with Acetyl Coenzyme A (AcCoA) binding, followed by binding of substrate, then direct transfer of the acetyl group from AcCoA to the substrate, followed by product and subsequent CoA release. Other family members include Arginine/ornithine N-succinyltransfera
Probab=98.67  E-value=9.9e-08  Score=56.93  Aligned_cols=48  Identities=21%  Similarity=0.284  Sum_probs=40.9

Q ss_pred             CCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          103 SSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       103 ~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      +++++|++.+..... ....++|..+.|+|++||+|+|+.+|+.+++++
T Consensus         7 ~~~~ig~~~~~~~~~-~~~~~~l~~~~v~~~~~~~g~~~~~~~~~~~~~   54 (65)
T cd04301           7 DGEIVGFASLSPDGS-GGDTAYIGDLAVLPEYRGKGIGSALLEAAEEEA   54 (65)
T ss_pred             CCEEEEEEEEEecCC-CCccEEEEEEEECHHHcCcCHHHHHHHHHHHHH
Confidence            389999999987543 235688999999999999999999999998865


No 43 
>PRK13688 hypothetical protein; Provisional
Probab=98.65  E-value=3.6e-07  Score=67.43  Aligned_cols=56  Identities=14%  Similarity=0.155  Sum_probs=41.3

Q ss_pred             CceEEEEEeCCCCCeEEEEEEEeec-------cCCcceEEEeeeeeccccccCCHHHHHHHHHHh
Q psy838           92 RACYLVAKNGSSSTPVAFSHFRFDV-------DFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYT  149 (152)
Q Consensus        92 ~~~~~v~~~~~~~~ivGf~~~~~~~-------~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~  149 (152)
                      ...++++..+  +++||++.+....       ......++|.+|+|+|+|||+|+|++||+.+.+
T Consensus        44 ~~~~~~~~~~--~~~VG~~~l~~~dg~~~~~~~~~~~~~~L~~l~V~p~~rgkGiG~~Ll~~a~~  106 (156)
T PRK13688         44 ESPFYGIYYG--DSLVARMSLYKKGGVEEPYFEDTQDYLELWKLEVLPKYQNRGYGEMLVDFAKS  106 (156)
T ss_pred             CCCEEEEEEC--CEEEEEEEEEecCCcccccccCCCCeEEEEEEEECHHHcCCCHHHHHHHHHHH
Confidence            3444556655  8999998775321       112345789999999999999999999997764


No 44 
>PF13523 Acetyltransf_8:  Acetyltransferase (GNAT) domain; PDB: 2VQY_A 2BUE_A 1V0C_A 1YK3_D 2PR8_A 2QIR_A 2PRB_A 2QML_A 2PC1_A.
Probab=98.62  E-value=1.4e-07  Score=68.05  Aligned_cols=97  Identities=11%  Similarity=0.151  Sum_probs=63.5

Q ss_pred             CCC-HHHHHHHHHHHHHhHHHHHHhCCCCCCh---hhhhhhhc-CCCceEEEEEeCCCCCeEEEEEEEeecc---CCcce
Q psy838           51 DLG-AETKKWVWELEEKNMKHSYEVCDIGWDP---QGKHSEMF-DDRACYLVAKNGSSSTPVAFSHFRFDVD---FGEPV  122 (152)
Q Consensus        51 ~l~-~~d~~~i~~l~~~~m~~~y~~~~~~w~~---~~~~~el~-~~~~~~~v~~~~~~~~ivGf~~~~~~~~---~~~~~  122 (152)
                      +++ .+|++.|.++..+.--..+-.  ..|++   +.....+. ++....+|+..+  |+++||+.+....+   .....
T Consensus         3 ~a~~~~Dl~~i~~w~~~~~~~~~~~--~~~~~~~~~~~~~~l~~~~~~~~~v~~~d--g~~~g~~~~~~~~~~~~~~~~~   78 (152)
T PF13523_consen    3 PATTPDDLPLILQWLNQPHVREFWD--QDPSQEWVEEYPEQLEADPGHHPYVAEDD--GEPIGYFEIYWPDEDYDADDGD   78 (152)
T ss_dssp             E---GGGHHHHHHHHTSHHHHCCH---CCCTHHHHHHHHHHHCHTTTEEEEEEEET--TEEEEEEEEEEGGGSS---TTE
T ss_pred             eCccHHHHHHHHHHHHhHHHHHHcc--CCCCHHHHHHHHhhhcccCCceEEEEEEC--CEEEEEEEEecccccccCCCCE
Confidence            466 899999999876642122211  12332   22333453 566677777776  99999998754211   12345


Q ss_pred             EEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          123 LYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       123 ~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      ..++.+.++|++||+|+|+.+|..+++.+
T Consensus        79 ~~~~~~~~~~~~rg~G~g~~~~~~~~~~~  107 (152)
T PF13523_consen   79 RGIHRLIVDPEYRGQGLGKAMLRALIEFL  107 (152)
T ss_dssp             EEEEEEESTGGGTTSSHHHHHHHHHHHHH
T ss_pred             EEEeeeeechhhcCCCHHHHHHHHHHHHH
Confidence            66888899999999999999999988764


No 45 
>PRK10562 putative acetyltransferase; Provisional
Probab=98.61  E-value=5e-07  Score=64.73  Aligned_cols=88  Identities=13%  Similarity=0.104  Sum_probs=56.2

Q ss_pred             CCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhh-------hhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcc
Q psy838           49 STDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHS-------EMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEP  121 (152)
Q Consensus        49 ~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~-------el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~  121 (152)
                      ..+++.+|++.+++|..+.-  ..   ..++.+.....       +...+....+++..+  +++|||+.+...      
T Consensus         2 ir~~~~~D~~~i~~l~~~~~--~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~--~~~iG~~~~~~~------   68 (145)
T PRK10562          2 IREYQPSDLPAILQLWLEST--IW---AHPFIKEQYWRESAPLVRDVYLPAAQTWVWEED--GKLLGFVSVLEG------   68 (145)
T ss_pred             cccccchhhHHHHHHHHHhc--cc---cCCCCCHHHHHHhHHHhhhhhcCcccEEEEEEC--CEEEEEEEEeec------
Confidence            45788999999999865421  11   11222211111       111233334455554  899999987432      


Q ss_pred             eEEEeeeeeccccccCCHHHHHHHHHHhh
Q psy838          122 VLYCYELQLEKQVQRKGLGKFMMQAFYTY  150 (152)
Q Consensus       122 ~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~  150 (152)
                       .+|..++|+|+|||+|+|+.||+.+++.
T Consensus        69 -~~i~~~~v~~~~rg~G~g~~ll~~~~~~   96 (145)
T PRK10562         69 -RFVGALFVAPKAVRRGIGKALMQHVQQR   96 (145)
T ss_pred             -cEEEEEEECHHHcCCCHHHHHHHHHHhh
Confidence             1367899999999999999999998764


No 46 
>KOG3139|consensus
Probab=98.58  E-value=6.4e-07  Score=65.96  Aligned_cols=98  Identities=13%  Similarity=0.162  Sum_probs=66.5

Q ss_pred             cccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEE
Q psy838           46 YKKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYC  125 (152)
Q Consensus        46 ~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I  125 (152)
                      +....-.....++.+..|++.++.+-|       +.-. ..-+..+...+.++.-+.++..||.+......+.+...+||
T Consensus        16 i~~~~~~~~~~l~~im~Li~k~lsepy-------S~~t-yrYf~~~wp~~~~~a~d~~~~~VGai~ck~~~~r~~~rgyi   87 (165)
T KOG3139|consen   16 IRPSLYPAEEYLADIMRLIDKDLSEPY-------SIYT-YRYFVPNWPCFCFLALDEKGDTVGAIVCKLDTHRNTLRGYI   87 (165)
T ss_pred             eeeecchHHHHHHHHHHHHhhhcCchh-------HHHH-HHhcccCCceEEEEEEcCCCceEEEEEEeccccCCcceEEE
Confidence            444455566777778888887664444       2222 23344455555555443123369999887665544456899


Q ss_pred             eeeeeccccccCCHHHHHHHHHHhhc
Q psy838          126 YELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       126 ~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      ..|+|+++|||+|||+.|...+++.+
T Consensus        88 ~mLaV~~e~Rg~GIg~aLvr~aId~m  113 (165)
T KOG3139|consen   88 AMLAVDSEYRGQGIGKALVRKAIDAM  113 (165)
T ss_pred             EEEEechhhccccHHHHHHHHHHHHH
Confidence            99999999999999999999998765


No 47 
>COG1246 ArgA N-acetylglutamate synthase and related acetyltransferases [Amino acid transport and metabolism]
Probab=98.54  E-value=1.5e-07  Score=68.86  Aligned_cols=93  Identities=8%  Similarity=0.065  Sum_probs=64.4

Q ss_pred             CCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeee
Q psy838           49 STDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYEL  128 (152)
Q Consensus        49 ~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i  128 (152)
                      ...++.+|+.+|.+|...     |+..+  --....+..+...-..|.++..+  |.++|-+.+.+..+  .....+..|
T Consensus         3 iR~A~~~Di~~I~~Li~~-----~~~~g--il~~rs~~~le~~i~dF~i~E~~--g~viGC~aL~~~~~--~~~gE~~~l   71 (153)
T COG1246           3 IRKARISDIPAILELIRP-----LELQG--ILLRRSREQLEEEIDDFTIIERD--GKVIGCAALHPVLE--EDLGELRSL   71 (153)
T ss_pred             eeeccccchHHHHHHHHH-----Hhhcc--ccchhhHHHHHHHHhhheeeeeC--CcEEEEEeecccCc--cCeeeEEEE
Confidence            356788999999999864     32211  00111122333333345677665  99999999985322  235679999


Q ss_pred             eeccccccCCHHHHHHHHHHhhcC
Q psy838          129 QLEKQVQRKGLGKFMMQAFYTYID  152 (152)
Q Consensus       129 ~V~p~~qgkGiG~~Lm~~l~~~a~  152 (152)
                      +|+|++||+|+|..|++.++..|+
T Consensus        72 aV~pd~r~~G~G~~Ll~~~~~~Ar   95 (153)
T COG1246          72 AVHPDYRGSGRGERLLERLLADAR   95 (153)
T ss_pred             EECHHhcCCCcHHHHHHHHHHHHH
Confidence            999999999999999999998774


No 48 
>PF13420 Acetyltransf_4:  Acetyltransferase (GNAT) domain; PDB: 3DR8_A 3DR6_A 2AE6_B 2JLM_C 2J8R_A 1YVO_B 2J8M_A 2J8N_A 2BL1_A 3IWG_A ....
Probab=98.54  E-value=3.5e-07  Score=65.75  Aligned_cols=95  Identities=7%  Similarity=0.071  Sum_probs=60.5

Q ss_pred             CCCCHHHHHHHHHHHHHhHHHHHHhCCCCCCh-----hhhhhhhc----CCCceEEEEEeCCCCCeEEEEEEEeeccCCc
Q psy838           50 TDLGAETKKWVWELEEKNMKHSYEVCDIGWDP-----QGKHSEMF----DDRACYLVAKNGSSSTPVAFSHFRFDVDFGE  120 (152)
Q Consensus        50 ~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~-----~~~~~el~----~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~  120 (152)
                      .+++++|++.++.+..+.    .....+.|..     +....-+.    ++....+++... +|++||++.+..... ..
T Consensus         2 R~~~~~D~~~i~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~-~g~iiG~~~~~~~~~-~~   75 (155)
T PF13420_consen    2 RPATEEDLEEILKLYNEP----RHEYFFTFEYPEDSEESFERWIESIIDSSKQRLFLVAEE-DGKIIGYVSLRDIDP-YN   75 (155)
T ss_dssp             EE--GGGHHHHHHHHHHH----HHHTSSSSCSSHS-HHHHHHHHHHHHHHHTTEEEEEEEC-TTEEEEEEEEEESSS-GT
T ss_pred             CCCcHHHHHHHHHHHhhh----hhcceeEecCCCCCHHHHHHHHHHhcccCCCcEEEEEEc-CCcEEEEEEEEeeec-cC
Confidence            357889999999988653    2111233331     11111121    234555555552 499999999986433 33


Q ss_pred             ceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          121 PVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       121 ~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      ..+.+. ++|.|++|++|+|+.|++.+++.|
T Consensus        76 ~~~~~~-~~v~~~~~~~gig~~l~~~l~~~a  105 (155)
T PF13420_consen   76 HTAELS-IYVSPDYRGKGIGRKLLDELIEYA  105 (155)
T ss_dssp             TEEEEE-EEEEGGGTTSSHHHHHHHHHHHHH
T ss_pred             CEEEEe-eEEChhHCCCcHHHHHHHHHHHHh
Confidence            344444 889999999999999999999887


No 49 
>TIGR03448 mycothiol_MshD mycothiol biosynthesis acetyltransferase. Members of this family are MshD, the acetyltransferase that catalyzes the final step of mycothiol biosynthesis in various members of the Actinomyctes, Mycothiol replaces glutathione in these species.
Probab=98.54  E-value=7.8e-07  Score=70.69  Aligned_cols=91  Identities=13%  Similarity=0.172  Sum_probs=56.8

Q ss_pred             HHHHHHHHHHHhHHHHHHhC--CCCCChhhhhhhhcC---CCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeee
Q psy838           56 TKKWVWELEEKNMKHSYEVC--DIGWDPQGKHSEMFD---DRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQL  130 (152)
Q Consensus        56 d~~~i~~l~~~~m~~~y~~~--~~~w~~~~~~~el~~---~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V  130 (152)
                      |.+.+.++..+.    |..+  ..+|+.+.+......   ....++++..+.++++|||+.+....++ ....+|+.+.|
T Consensus       160 d~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~vG~~~~~~~~~~-~~~~~i~~~~V  234 (292)
T TIGR03448       160 DDAEWLRVNNAA----FAWHPEQGGWTRADLAERRAEPWFDPAGLFLAFDDAPGELLGFHWTKVHPDE-PALGEVYVVGV  234 (292)
T ss_pred             chHHHHHHHHHH----hhCCCccCCcCHHHHHHHhhCcCCCcCceEEEEECCCCcEEEEEEEEecCCC-CceeEEEEEEE
Confidence            555666664442    3221  146877665443221   1223455555224899999765543222 22456788899


Q ss_pred             ccccccCCHHHHHHHHHHhhc
Q psy838          131 EKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       131 ~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      +|+|||+|+|+.||..+++.+
T Consensus       235 ~p~~rg~GiG~~ll~~~~~~~  255 (292)
T TIGR03448       235 DPAAQGRGLGDALTLIGLHHL  255 (292)
T ss_pred             CHHHcCCCHHHHHHHHHHHHH
Confidence            999999999999999988765


No 50 
>TIGR03585 PseH pseudaminic acid biosynthesis N-acetyl transferase. Sequences in this family are members of the pfam00583 (GNAT) superfamily of acetyltransferases and are proposed to perform a N-acetylation step in the process of pseudaminic acid biosynthesis in Campylobacter species. This gene is commonly observed in apparent operons with other genes responsible for the biosynthesis of pseudaminic acid and as a component of flagellar and exopolysaccharide biosynthesis loci. Significantly, many genomes containing other components of this pathway lack this gene, indicating that some other N-acetyl transferases may be incolved and/or the step is optional, resulting in a non-acetylated pseudaminic acid variant sugar.
Probab=98.47  E-value=2.6e-06  Score=61.11  Aligned_cols=99  Identities=16%  Similarity=0.106  Sum_probs=60.4

Q ss_pred             cCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhh---hhhhhc-CCCceEEEEEeCCCCCeEEEEEEEeeccCCcceE
Q psy838           48 KSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQG---KHSEMF-DDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVL  123 (152)
Q Consensus        48 ~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~---~~~el~-~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~  123 (152)
                      ...+++++|++.++++..+.-...|..+...-+.+.   ....+. ++...++++..+  |++||++.+..... .....
T Consensus         2 ~lr~~~~~D~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--g~~vG~~~~~~~~~-~~~~~   78 (156)
T TIGR03585         2 NFTPLNSEELELVLEWRNHPDVRANMYSDHLIDWEEHLHFIEALKQDPNRRYWIVCQE--SRPIGVISFTDINL-VHKSA   78 (156)
T ss_pred             CcccCCHHHHHHHHHhhCCHHHHhhccCcCCCCHHHHHHHHHHhhcCCCceEEEEEEC--CEEEEEEEEEecCh-hhCeE
Confidence            467899999999999764422122311111112111   112222 344455566554  99999999875432 12233


Q ss_pred             EEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          124 YCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       124 ~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      .+ .+++.|++| +|+|+.+|..++++|
T Consensus        79 ~~-g~~~~~~~~-~G~g~~~~~~~~~~a  104 (156)
T TIGR03585        79 FW-GIYANPFCK-PGVGSVLEEAALEYA  104 (156)
T ss_pred             EE-EEEeChhhh-cCchHHHHHHHHHHH
Confidence            33 466999999 999999999999875


No 51 
>PRK15130 spermidine N1-acetyltransferase; Provisional
Probab=98.43  E-value=2.7e-06  Score=63.27  Aligned_cols=100  Identities=13%  Similarity=0.107  Sum_probs=59.9

Q ss_pred             ccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChh-h----hhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcc
Q psy838           47 KKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQ-G----KHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEP  121 (152)
Q Consensus        47 ~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~-~----~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~  121 (152)
                      ....+++++|++.++++..+.....+-. ..+++.. .    +...+.++....+++..+  +++||++.+....... .
T Consensus         7 l~lR~~~~~D~~~l~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~--g~~iG~~~~~~~~~~~-~   82 (186)
T PRK15130          7 VKLRPLEREDLRFVHQLDNNASVMRYWF-EEPYEAFVELSDLYDKHIHDQSERRFVVECD--GEKAGLVELVEINHVH-R   82 (186)
T ss_pred             eEEecCCHHHHHHHHHHhcChHHHhhcC-CcccccHHHHHHHHHHhhhcccCcEEEEEEC--CEEEEEEEEEeecCCC-C
Confidence            3466788999999999865421011100 1111110 0    111122334444555544  8999999986543222 1


Q ss_pred             eEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          122 VLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       122 ~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      ...+ .++|+|+|||+|+|+.+++.+++++
T Consensus        83 ~~~~-~~~v~~~~~g~G~g~~l~~~l~~~~  111 (186)
T PRK15130         83 RAEF-QIIISPEYQGKGLATRAAKLAMDYG  111 (186)
T ss_pred             eEEE-EEEECHHHcCCCHHHHHHHHHHHHH
Confidence            2333 5999999999999999999998764


No 52 
>PRK10809 ribosomal-protein-S5-alanine N-acetyltransferase; Provisional
Probab=98.36  E-value=6.7e-06  Score=61.63  Aligned_cols=99  Identities=16%  Similarity=0.134  Sum_probs=60.7

Q ss_pred             ccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChh---------hh------hhhh-cCCCceEEEEEeCCCCCeEEEE
Q psy838           47 KKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQ---------GK------HSEM-FDDRACYLVAKNGSSSTPVAFS  110 (152)
Q Consensus        47 ~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~---------~~------~~el-~~~~~~~~v~~~~~~~~ivGf~  110 (152)
                      ....+++++|.+.++++..++-  .+-   .+|.+.         ..      .... .++....+++....++++||++
T Consensus        18 l~LR~~~~~Da~~~~~l~~~~~--~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~iG~i   92 (194)
T PRK10809         18 LVVRLVHERDAWRLADYYAENR--HFL---KPWEPVRDESHCYPSGWQARLGMINEFHKQGSAFYFALLDPDEKEIIGVA   92 (194)
T ss_pred             EEEEeCCHHHHHHHHHHHHhCH--Hhc---cCCCCCCcccccCHHHHHHHHHHHHHHHhcCcEEEEEEEECCCCeEEEEE
Confidence            4467789999999999876542  121   122211         00      1111 2233333444333257999999


Q ss_pred             EEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          111 HFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       111 ~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      .+........... .-.+.|.|+|||+|+|+.++..++++|
T Consensus        93 ~l~~~~~~~~~~~-eig~~i~~~~~G~G~~~ea~~~ll~~~  132 (194)
T PRK10809         93 NFSNVVRGSFHAC-YLGYSLGQKWQGQGLMFEALQAAIRYM  132 (194)
T ss_pred             EEEeecCCCeeeE-EEEEEECHHHcCCCHHHHHHHHHHHHH
Confidence            9875432222222 234779999999999999999999876


No 53 
>PF13302 Acetyltransf_3:  Acetyltransferase (GNAT) domain; PDB: 3TTH_C 3JUW_A 2ZXV_A 2Z0Z_A 2VI7_B 3EG7_F 1YRE_C 3IGR_B 3FBU_A 2FCK_A ....
Probab=98.25  E-value=2.4e-06  Score=60.11  Aligned_cols=101  Identities=9%  Similarity=0.072  Sum_probs=60.7

Q ss_pred             cCCCCCHHHHHHHHHHHHH-hHHHHHHhCCCC-CChhhhhhhhcC-----C--CceEEEEEeCCCCCeEEEEEEEeeccC
Q psy838           48 KSTDLGAETKKWVWELEEK-NMKHSYEVCDIG-WDPQGKHSEMFD-----D--RACYLVAKNGSSSTPVAFSHFRFDVDF  118 (152)
Q Consensus        48 ~~~~l~~~d~~~i~~l~~~-~m~~~y~~~~~~-w~~~~~~~el~~-----~--~~~~~v~~~~~~~~ivGf~~~~~~~~~  118 (152)
                      ...+++++|++.++++..+ .+. .|.....+ ++.+....-+.+     +  ...++++...+++++||++.+..... 
T Consensus         3 ~lr~~~~~D~~~i~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~iG~i~~~~~~~-   80 (142)
T PF13302_consen    3 TLRPLTPEDADAIYEWRSDPEIR-RYLPWGPPWPTLEEAEEWIQSRQDSWENHGYYYFAIEDKDDGEIIGFIGLYNIDK-   80 (142)
T ss_dssp             EEEE-HGGGHHHHHHHHTTTTHC-TTSSTTTSSSSHHHHHHHHHHHHHCHHEETEEEEEEEETTTTEEEEEEEEEEEET-
T ss_pred             EEEcCCHHHHHHHHHHhcCHHHH-HhcCCCCCCCCHHHHHHHHHHhhhhhhcccceEEEEEeccCCceEEEeeeeeccc-
Confidence            4567889999999988742 111 11100112 243433333321     1  13344555543468999999954322 


Q ss_pred             CcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          119 GEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       119 ~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      ......|- +.|.|++||+|+|+.++..+.+++
T Consensus        81 ~~~~~eig-~~i~~~~~g~G~~~~~~~~~~~~~  112 (142)
T PF13302_consen   81 NNNWAEIG-YWIGPDYRGKGYGTEALKLLLDWA  112 (142)
T ss_dssp             TTTEEEEE-EEEEGGGTTSSHHHHHHHHHHHHH
T ss_pred             CCCccccc-cchhHHHHhhhHHHHHHHHHHHHH
Confidence            23345555 889999999999999999998875


No 54 
>PF14542 Acetyltransf_CG:  GCN5-related N-acetyl-transferase; PDB: 2H5M_A 2Q44_A 1XMT_A 2Q4Y_A 2IL4_A 2EVN_A 1R57_A.
Probab=98.24  E-value=7.4e-06  Score=53.63  Aligned_cols=45  Identities=18%  Similarity=0.296  Sum_probs=38.2

Q ss_pred             CCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          103 SSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       103 ~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      +|+.+|++.|..  +  ..++.|....|.|++||+|+|+.||+.+.++|
T Consensus         7 ~g~~~a~l~Y~~--~--~~~~~i~hT~V~~~~rGqGia~~L~~~~l~~a   51 (78)
T PF14542_consen    7 DGEEIAELTYRE--D--GGVIVITHTEVPPELRGQGIAKKLVEAALDYA   51 (78)
T ss_dssp             STTEEEEEEEEE--S--SSEEEEEEEEE-CSSSTTTHHHHHHHHHHHHH
T ss_pred             CCEEEEEEEEEe--C--CCEEEEEEEEECccccCCcHHHHHHHHHHHHH
Confidence            388999999965  2  34778999999999999999999999999886


No 55 
>PRK10151 ribosomal-protein-L7/L12-serine acetyltransferase; Provisional
Probab=98.23  E-value=3e-05  Score=57.23  Aligned_cols=101  Identities=11%  Similarity=0.034  Sum_probs=60.5

Q ss_pred             ccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCC---CChhhhhhhhc------CCC-ceEEEEEeCCCCCeEEEEEEEeec
Q psy838           47 KKSTDLGAETKKWVWELEEKNMKHSYEVCDIG---WDPQGKHSEMF------DDR-ACYLVAKNGSSSTPVAFSHFRFDV  116 (152)
Q Consensus        47 ~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~---w~~~~~~~el~------~~~-~~~~v~~~~~~~~ivGf~~~~~~~  116 (152)
                      ....+++++|++.++++..++-........++   -+.+.....+.      ... ..++++..+  +++||++.+....
T Consensus        11 l~Lr~~~~~D~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~i~~~--~~~iG~~~l~~~~   88 (179)
T PRK10151         11 LELHAVDESHVTPLHQLVCKNKTWLQQSLNWPQFVQSEEDTRKTVQGNVMLHQRGYAKMFMIFKE--DELIGVLSFNRIE   88 (179)
T ss_pred             EEEEeCCHHHHHHHHHHHHHhHHHHHhcCCCcCccCCHHHHHHHHHHHHHHHhcCCcEEEEEEEC--CEEEEEEEEEeec
Confidence            34678999999999998743211111100111   12232222222      112 223444444  8999999987643


Q ss_pred             cCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          117 DFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       117 ~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      +... ...| .+.|+|+|||+|+|+.++..++++|
T Consensus        89 ~~~~-~~~i-g~~i~~~~~g~G~~tea~~~l~~~~  121 (179)
T PRK10151         89 PLNK-TAYI-GYWLDESHQGQGIISQALQALIHHY  121 (179)
T ss_pred             cCCC-ceEE-EEEEChhhcCCcHHHHHHHHHHHHH
Confidence            3222 2333 4679999999999999999988765


No 56 
>cd02169 Citrate_lyase_ligase Citrate lyase ligase. Citrate lyase ligase, also known as [Citrate (pro-3S)-lyase] ligase, is responsible for acetylation of the (2-(5''-phosphoribosyl)-3'-dephosphocoenzyme-A) prosthetic group of the gamma subunit of citrate lyase, converting the inactive thiol form of this enzyme to the active form. The acetylation of 1 molecule of deacetyl-citrate lyase to enzymatically active citrate lyase requires 6 molecules of ATP. The Adenylylyltranferase activity of the enzyme involves the formation of AMP and and pyrophosphate in the acetylation reaction.
Probab=98.15  E-value=4.5e-06  Score=67.62  Aligned_cols=41  Identities=10%  Similarity=-0.005  Sum_probs=35.6

Q ss_pred             CCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          103 SSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       103 ~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      ++++||++.+..        .+|..|+|+|+|||+|+|++||+.+++.|
T Consensus        14 ~~~iVG~~~l~~--------~~I~~vaV~p~~Rg~GiG~~Ll~~l~~~a   54 (297)
T cd02169          14 AGELIATGSIAG--------NVLKCVAVCPKYQGEGLALKIVSELINKA   54 (297)
T ss_pred             CCEEEEEEEecc--------CEEEEEEECHHHcCCCHHHHHHHHHHHHH
Confidence            489999988742        13889999999999999999999999876


No 57 
>KOG3396|consensus
Probab=98.03  E-value=4.5e-05  Score=54.88  Aligned_cols=73  Identities=10%  Similarity=0.118  Sum_probs=50.1

Q ss_pred             CChhhhhhhh---cCCCc-eEE-EEEeCCCCCeEEEEEEEeec---cCCcceEEEeeeeeccccccCCHHHHHHHHHHhh
Q psy838           79 WDPQGKHSEM---FDDRA-CYL-VAKNGSSSTPVAFSHFRFDV---DFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTY  150 (152)
Q Consensus        79 w~~~~~~~el---~~~~~-~~~-v~~~~~~~~ivGf~~~~~~~---~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~  150 (152)
                      -+++++...+   ....+ .++ ++++.+.++++|-+.+....   .......+|.++.|++++||++||+.|+..|.++
T Consensus        34 vt~e~F~krf~~mk~~~~~Y~i~Vied~~s~~vigtatL~IE~KfIh~~g~rGhiEDVVV~~~~rgk~LGkllv~~Lv~l  113 (150)
T KOG3396|consen   34 VTREQFEKRFEAMKKSGDWYYIVVIEDKESEKVIGTATLFIERKFIHGCGSRGHIEDVVVDSEYRGKQLGKLLVETLVDL  113 (150)
T ss_pred             cCHHHHHHHHHHHHhcCCcEEEEEEEeCCcCeEEEEEEEEEehhhhhcccccCceeEEEeChhhhhhHHhHHHHHHHHHH
Confidence            4555555543   33333 333 34444467899999876532   1223345699999999999999999999999987


Q ss_pred             c
Q psy838          151 I  151 (152)
Q Consensus       151 a  151 (152)
                      |
T Consensus       114 ~  114 (150)
T KOG3396|consen  114 A  114 (150)
T ss_pred             H
Confidence            6


No 58 
>COG2153 ElaA Predicted acyltransferase [General function prediction only]
Probab=98.01  E-value=1.1e-05  Score=58.55  Aligned_cols=98  Identities=14%  Similarity=0.116  Sum_probs=64.2

Q ss_pred             cccCCCCCHHHHHHHHHHHHHhHHHHHHh-CCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEE
Q psy838           46 YKKSTDLGAETKKWVWELEEKNMKHSYEV-CDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLY  124 (152)
Q Consensus        46 ~~~~~~l~~~d~~~i~~l~~~~m~~~y~~-~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~  124 (152)
                      +....+++...+-+++.|..+.    |-. +.=|+.+-. -.++... .+.++...+ +|++++|+.+.+....... .-
T Consensus         7 ~~~~~~Lt~~ely~LlkLRv~V----FVVEQ~CPY~E~D-g~Dl~~~-~~Hl~~~~~-~g~LvAyaRLl~~~~~~~~-~~   78 (155)
T COG2153           7 CKHFNDLTVRELYELLKLRVDV----FVVEQNCPYPELD-GKDLLGD-TRHLLGWTP-DGELVAYARLLPPGAEYEE-VS   78 (155)
T ss_pred             hhhhhhcCHHHHHHHHHhheeE----EEEecCCCCcCcC-Ccccccc-cceEEEEcC-CCeEEEEEecCCCCCCcCc-ee
Confidence            4556788888888888886652    211 122333211 1122223 334555653 5999999998764333232 55


Q ss_pred             EeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          125 CYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       125 I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      |-++.|.|+.||+|+|++||....+.|
T Consensus        79 iGRV~v~~~~RG~glG~~Lm~~AL~~~  105 (155)
T COG2153          79 IGRVIVSPAARGQGLGQQLMEKALETA  105 (155)
T ss_pred             eeeEEECHhhhccchhHHHHHHHHHHH
Confidence            999999999999999999999887764


No 59 
>PF13718 GNAT_acetyltr_2:  GNAT acetyltransferase 2; PDB: 2ZPA_B.
Probab=97.90  E-value=8.4e-05  Score=56.82  Aligned_cols=70  Identities=11%  Similarity=0.121  Sum_probs=48.4

Q ss_pred             ChhhhhhhhcCCCceEEEEEeCCCC--CeEEEEEEEeecc----------------C------------------CcceE
Q psy838           80 DPQGKHSEMFDDRACYLVAKNGSSS--TPVAFSHFRFDVD----------------F------------------GEPVL  123 (152)
Q Consensus        80 ~~~~~~~el~~~~~~~~v~~~~~~~--~ivGf~~~~~~~~----------------~------------------~~~~~  123 (152)
                      +++.++.-+-+|+.+.+++..+  +  +++|-+.+...-.                .                  ....+
T Consensus        14 sPnDL~~LlDaP~h~l~~l~~~--~~p~il~~~~v~~EG~l~~~l~~~i~~g~rRp~G~LiP~~L~~~~~~~~f~~l~g~   91 (196)
T PF13718_consen   14 SPNDLQLLLDAPNHRLFVLLQP--GDPDILGVAQVALEGGLSKELIEAILSGGRRPKGHLIPQTLAQHFGDPEFAQLSGA   91 (196)
T ss_dssp             -HHHHHHHHH-TTEEEEEEE-S--S--SEEEEEEEEEEE---HHHHHHHHTTS---SS-HHHHHHHHHSS-TTGGGSEEE
T ss_pred             CHHHHHHHhcCCcceeehhccC--CCceEEEEEEEEecCCCCHHHHHHHHhCCCCCCCCCHHHHHHHHhCCHHHHhhcce
Confidence            6666555555677777777766  6  9999988754311                0                  13457


Q ss_pred             EEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          124 YCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       124 ~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      -|.+|+|+|++|++|+|+++++.+++++
T Consensus        92 RIvRIAvhP~~q~~G~Gs~lL~~l~~~~  119 (196)
T PF13718_consen   92 RIVRIAVHPDLQRMGYGSRLLQQLEQYA  119 (196)
T ss_dssp             EEEEEEE-CCC-SSSHHHHHHHHHHHT-
T ss_pred             eEEEEEEChhhhcCCHHHHHHHHHHHHH
Confidence            7999999999999999999999999886


No 60 
>COG2388 Predicted acetyltransferase [General function prediction only]
Probab=97.90  E-value=3.9e-05  Score=52.53  Aligned_cols=56  Identities=7%  Similarity=0.116  Sum_probs=43.3

Q ss_pred             CceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhcC
Q psy838           92 RACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYID  152 (152)
Q Consensus        92 ~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a~  152 (152)
                      ....+++..+  |+.+|++.+...   +...+.|..-+|.+++||||+|+.|+....+.|+
T Consensus        14 ~~~~y~~~~~--G~~~~e~~y~~~---~~~~i~i~HT~V~d~lrGqGia~~L~~~al~~ar   69 (99)
T COG2388          14 ENGRYVLTDE--GEVIGEATYYDR---GENLIIIDHTYVPDELRGQGIAQKLVEKALEEAR   69 (99)
T ss_pred             CceEEEEecC--CcEEEEEEEecC---CCCEEEEecCcCCHHHcCCcHHHHHHHHHHHHHH
Confidence            3333444444  889999998754   2346779999999999999999999999988764


No 61 
>KOG3397|consensus
Probab=97.90  E-value=8e-05  Score=55.87  Aligned_cols=57  Identities=18%  Similarity=0.302  Sum_probs=45.4

Q ss_pred             EEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhcC
Q psy838           95 YLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYID  152 (152)
Q Consensus        95 ~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a~  152 (152)
                      .+++..+...++||-+.+...... ...+++..+.|+.+.||+|.|+.||..+|++|+
T Consensus        57 sL~Ll~E~~~~VigH~rLS~i~n~-~~al~VEsVVV~k~~RG~GFGk~lMk~~E~~~R  113 (225)
T KOG3397|consen   57 SLLLLNEENDEVLGHSRLSHLPNR-DHALWVESVVVKKDQRGLGFGKFLMKSTEKWMR  113 (225)
T ss_pred             eeeeecccccceeeeeccccCCCC-CceeEEEEEEEehhhccccHHHHHHHHHHHHHH
Confidence            344444436799999998875443 346889999999999999999999999999874


No 62 
>TIGR00124 cit_ly_ligase [citrate (pro-3S)-lyase] ligase. ATP is cleaved to AMP and pyrophosphate during the reaction. The carboxyl end is homologous to a number of cytidyltransferases that also release pyrophosphate.
Probab=97.87  E-value=3.6e-05  Score=63.27  Aligned_cols=49  Identities=6%  Similarity=0.128  Sum_probs=39.9

Q ss_pred             ceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838           93 ACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus        93 ~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      ..++++..+  +++||++++.-     .   .|..|+|+|+|||+|+|++||..+++.|
T Consensus        31 d~~vv~~~~--~~lVg~g~l~g-----~---~ik~vaV~~~~rG~Glg~~L~~~L~~~a   79 (332)
T TIGR00124        31 EIFIAVYED--EEIIGCGGIAG-----N---VIKCVAIDESLRGEGLALQLMTELENLA   79 (332)
T ss_pred             CEEEEEEEC--CEEEEEEEEec-----C---EEEEEEEcHHHcCCCHHHHHHHHHHHHH
Confidence            455556555  89999998731     1   3789999999999999999999999876


No 63 
>PF12568 DUF3749:  Acetyltransferase (GNAT) domain;  InterPro: IPR024612 This domain is found in uncharacterised proteins from Gammaproteobacteria, and is approximately 40 amino acids in length. It contains two completely conserved residues (D and I) that may be functionally important. Proteins having this domain are frequently annotated as acetyltransferases of the GNAT family; however there is little accompanying annotation to confirm this.; PDB: 2K5T_A.
Probab=97.78  E-value=0.00059  Score=48.65  Aligned_cols=86  Identities=12%  Similarity=0.228  Sum_probs=51.6

Q ss_pred             ccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEe
Q psy838           47 KKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCY  126 (152)
Q Consensus        47 ~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~  126 (152)
                      .+...++++|..++-.|        |..    .+++.. ..-.+....++.+..+  +.++|-+.+.....    ...|.
T Consensus         5 ~rl~~ls~Qd~iDL~KI--------wp~----~~~~~l-~~~l~~~~~l~aArFN--dRlLgAv~v~~~~~----~~~L~   65 (128)
T PF12568_consen    5 ERLTTLSEQDRIDLAKI--------WPQ----QDPEQL-EQWLDEGHRLFAARFN--DRLLGAVKVTISGQ----QAELS   65 (128)
T ss_dssp             EE-SS--HHHHHHHHHH---------TT----S-----------SSEEEEEEEET--TEEEEEEEEEEETT----EEEEE
T ss_pred             EEcCCCCHHHHHHHHHh--------CCC----CCHHHH-HHHhccCCeEEEEEec--hheeeeEEEEEcCc----ceEEe
Confidence            34556777776665544        311    122332 2233556666677776  99999999876522    46699


Q ss_pred             eeeeccccccCCHHHHHHHHHHhhc
Q psy838          127 ELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       127 ~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      +|.|+|.-||+|+|+.|++.+...+
T Consensus        66 ~l~VRevTRrRGVG~yLlee~~rq~   90 (128)
T PF12568_consen   66 DLCVREVTRRRGVGLYLLEEVLRQL   90 (128)
T ss_dssp             EEEE-TT-SSSSHHHHHHHHHHHHS
T ss_pred             eEEEeeccccccHHHHHHHHHHHHC
Confidence            9999999999999999999887653


No 64 
>KOG3235|consensus
Probab=97.77  E-value=0.00011  Score=54.43  Aligned_cols=51  Identities=20%  Similarity=0.234  Sum_probs=40.4

Q ss_pred             EEEeCCCCCeEEEEEEEeec--cCCcceEEEeeeeeccccccCCHHHHHHHHHH
Q psy838           97 VAKNGSSSTPVAFSHFRFDV--DFGEPVLYCYELQLEKQVQRKGLGKFMMQAFY  148 (152)
Q Consensus        97 v~~~~~~~~ivGf~~~~~~~--~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~  148 (152)
                      |+. +.+|+||||+......  ++..+..+|.-|.|...||+.|||+.||++..
T Consensus        45 VA~-D~~gkiVGYvlAkmee~p~~~~~hGhItSlaV~rs~RrlGla~kLm~qa~   97 (193)
T KOG3235|consen   45 VAE-DENGKIVGYVLAKMEEDPDDEPPHGHITSLAVKRSYRRLGLAQKLMNQAS   97 (193)
T ss_pred             EEE-cCCCcEEEEeeeehhhcccCCCCCCeeEEeeehhhHHHhhHHHHHHHHHH
Confidence            444 3479999999876432  34556788999999999999999999998754


No 65 
>TIGR01211 ELP3 histone acetyltransferase, ELP3 family. The Saccharomyces cerevisiae member YPL086C has been characterized in vitro as an N-terminal acetyltransferase for all four core histones. It is a component of the RNA polymerase II holoenzyme, designated Elp3p for Elongator Protein 3. Members of this family are found in eukaryotes and archaea. These proteins are part of the larger set of GNAT acetyltransferases.
Probab=97.66  E-value=0.00012  Score=63.54  Aligned_cols=59  Identities=14%  Similarity=0.267  Sum_probs=43.5

Q ss_pred             ceEEEEEe-CCCCCeEEEEEEEeeccC-----CcceEEEeeeeec-----------cccccCCHHHHHHHHHHhhc
Q psy838           93 ACYLVAKN-GSSSTPVAFSHFRFDVDF-----GEPVLYCYELQLE-----------KQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus        93 ~~~~v~~~-~~~~~ivGf~~~~~~~~~-----~~~~~~I~~i~V~-----------p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      ...++.+. ..++.++||+.++...+.     .....+|.+|+|.           |+|||+|+|++||+.++++|
T Consensus       411 ~e~F~~y~~~~~~~l~G~lrlr~~~~~~~~~~~~~~a~IrelhV~G~~~~~~~~~~~~~rg~GiG~~Ll~~ae~~A  486 (522)
T TIGR01211       411 TEFFLSYEDPKNDILIGFLRLRFPSEPAHRKEVDATALVRELHVYGSEVPIGERGDDEWQHRGYGRRLLEEAERIA  486 (522)
T ss_pred             CeEEEEEEcCCCCeEEEEEEEecCcccccccccCCCceEEEEEEeeeeccccccCChhHhCcCHHHHHHHHHHHHH
Confidence            34455554 224689999999875431     1224678888865           99999999999999999876


No 66 
>PF08445 FR47:  FR47-like protein;  InterPro: IPR013653 Proteins in this entry have a conserved region similar to the C-terminal region of the Drosophila melanogaster (Fruit fly) hypothetical protein FR47 (Q9VR51 from SWISSPROT). This protein has been found to consist of two N-acyltransferase-like domains swapped with the C-terminal strands. ; GO: 0016747 transferase activity, transferring acyl groups other than amino-acyl groups; PDB: 1SQH_A 3EC4_B.
Probab=97.61  E-value=0.00027  Score=46.87  Aligned_cols=42  Identities=21%  Similarity=0.202  Sum_probs=29.5

Q ss_pred             CeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhh
Q psy838          105 TPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTY  150 (152)
Q Consensus       105 ~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~  150 (152)
                      +.+..++.....+.+    .|..|+++|++||+|+|+.|+..+.+.
T Consensus         8 ~~~~l~~~~~~~~~g----~i~~v~t~p~~RrrGlg~~lv~~l~~~   49 (86)
T PF08445_consen    8 ELVALVAWIIRSDDG----EIGGVYTLPEHRRRGLGSALVAALARE   49 (86)
T ss_dssp             CCEEEEEEEEESCTC----CEEEEEE-GGGTTSSHHHHHHHHHHHH
T ss_pred             CccceeeEeeeCCCc----EEEEEEECHHHcCCCHHHHHHHHHHHH
Confidence            344444444443444    388999999999999999999888654


No 67 
>KOG3234|consensus
Probab=97.55  E-value=0.00016  Score=53.25  Aligned_cols=51  Identities=12%  Similarity=0.040  Sum_probs=42.8

Q ss_pred             CCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          101 GSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       101 ~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      +.++++.||+.-...-..+.+..++..|.|.|+||+.|+|+.||+.+++..
T Consensus        48 ~p~~~imgyimgk~Eg~~~~wh~HvTAltVap~~Rrl~la~~lm~~led~~   98 (173)
T KOG3234|consen   48 APTGEIMGYIMGKVEGKDTEWHGHVTALTVAPDYRRLGLAAKLMDTLEDVS   98 (173)
T ss_pred             CCCCceEEEEeeeccccCcceeeEEEEEEechhHHHHHHHHHHHHHHHHHH
Confidence            336899999988654445667788999999999999999999999999864


No 68 
>COG1444 Predicted P-loop ATPase fused to an acetyltransferase [General function prediction only]
Probab=97.40  E-value=0.001  Score=59.76  Aligned_cols=71  Identities=11%  Similarity=0.122  Sum_probs=49.7

Q ss_pred             ChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccC---------------------------------CcceEEEe
Q psy838           80 DPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDF---------------------------------GEPVLYCY  126 (152)
Q Consensus        80 ~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~---------------------------------~~~~~~I~  126 (152)
                      +++....-+-.|+++++++..+ ++.+|+-+.+...-..                                 .-..+-|.
T Consensus       457 sP~DL~~L~DaP~h~~~al~~~-~~~~va~~qva~EG~l~~~~i~~~~~g~r~~GnlIp~~l~~~~~~~~fa~l~G~RIv  535 (758)
T COG1444         457 SPNDLRRLLDAPHHHIFALRAP-EGKPVAVWQVAEEGGLSDELIDIWLGGRRPRGNLIPDLLAKHHRDPEFAKLVGWRIV  535 (758)
T ss_pred             CHHHHHHHhcCCCCeeEEEEcC-CCceEEEEEeeccCCCcHHHHHHHhcCCCCCCcccHHHHHHhhcchhhcccceeeEE
Confidence            5666555555677777777665 4577766655332110                                 01235699


Q ss_pred             eeeeccccccCCHHHHHHHHHHhhc
Q psy838          127 ELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       127 ~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      +|+|+|++|++|||++|++.+.++|
T Consensus       536 RIAvhPe~q~~GiGsrlL~~l~~~a  560 (758)
T COG1444         536 RIAVHPELQRMGIGSRLLALLIEEA  560 (758)
T ss_pred             EEEeCHHHHhcCHHHHHHHHHHHHH
Confidence            9999999999999999999999887


No 69 
>PF13480 Acetyltransf_6:  Acetyltransferase (GNAT) domain
Probab=97.29  E-value=0.011  Score=41.00  Aligned_cols=114  Identities=12%  Similarity=0.151  Sum_probs=70.3

Q ss_pred             hhHhHhHHhhhhhcccccccCCCCcccccCCCCCHHHHHHHHHHHHHhHHHHHHhC-CCCCChhhhhhhhcC---C-Cce
Q psy838           20 ASLRKMVKKMEITQHAKYTCSFCGKEYKKSTDLGAETKKWVWELEEKNMKHSYEVC-DIGWDPQGKHSEMFD---D-RAC   94 (152)
Q Consensus        20 ~~~rk~~~~~~~~~~~~~~c~~c~~~~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~-~~~w~~~~~~~el~~---~-~~~   94 (152)
                      ..+|+.+++.+..-.-.         +..  .-+++++++++++..+++... ... ..+++.+.+..-+..   + ...
T Consensus         5 ~~~rr~~r~~~~~g~~~---------~~~--~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~l~~~~~~~~~~~   72 (142)
T PF13480_consen    5 KKIRRAIRRAEKLGGVR---------FEV--ATDPADLEAFYELYRESWARR-HGGFAPPFSRDFFRDLLRSLAESGRLR   72 (142)
T ss_pred             HHHHHHHHHHHhcCCEE---------EEE--eCCHHHHHHHHHHHHHHHhhh-hCCCCCcchHHHHHHHHHhhccCCCEE
Confidence            45677777776542111         111  237888999999987765443 010 123444433322221   2 344


Q ss_pred             EEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838           95 YLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus        95 ~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      ++++..+  |++||+...-..  .  ..++.+-...+|+++..++|..|+..+++.|
T Consensus        73 l~~~~~~--g~~va~~~~~~~--~--~~~~~~~~g~~~~~~~~~~~~~l~~~~i~~a  123 (142)
T PF13480_consen   73 LFVLYDG--GEPVAFALGFRH--G--GTLYYWYGGYDPEYRKYSPGRLLLWEAIRWA  123 (142)
T ss_pred             EEEEEEC--CEEEEEEEEEEE--C--CEEEEEEEEECHhhHhCCHHHHHHHHHHHHH
Confidence            4455554  999999865432  1  2566778889999999999999999998875


No 70 
>COG0454 WecD Histone acetyltransferase HPA2 and related acetyltransferases [Transcription / General function prediction only]
Probab=97.23  E-value=0.00019  Score=45.77  Aligned_cols=24  Identities=21%  Similarity=0.409  Sum_probs=22.6

Q ss_pred             eeeccccccCCHHHHHHHHHHhhc
Q psy838          128 LQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       128 i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      ++|+|++||+|+|+.|++.+++.|
T Consensus        87 l~v~~~~rg~Gig~~Ll~~~~~~~  110 (156)
T COG0454          87 LYVLPEYRGKGIGSALLEAALEWA  110 (156)
T ss_pred             EEecchhhccchHHHHHHHHHHHH
Confidence            999999999999999999988876


No 71 
>PF05301 Mec-17:  Touch receptor neuron protein Mec-17;  InterPro: IPR007965 Mec-17 is the protein product of one of the 18 genes required for the development and function of the touch receptor neuron for gentle touch. Mec-17 is specifically required for maintaining the differentiation of the touch receptor []. This family is conserved to higher eukaryotes.; GO: 0019799 tubulin N-acetyltransferase activity
Probab=97.10  E-value=0.0026  Score=44.86  Aligned_cols=47  Identities=23%  Similarity=0.470  Sum_probs=35.0

Q ss_pred             CCeEEEEEEEe----ecc-CC-----cceEEEeeeeeccccccCCHHHHHHHHHHhh
Q psy838          104 STPVAFSHFRF----DVD-FG-----EPVLYCYELQLEKQVQRKGLGKFMMQAFYTY  150 (152)
Q Consensus       104 ~~ivGf~~~~~----~~~-~~-----~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~  150 (152)
                      +.++||+.+-.    ..+ .+     +|..-|.++||+++.|++|+|+.|++.+...
T Consensus        18 g~viG~LKVG~K~Lfl~d~~g~~~e~~~~~cvLDFyVhes~QR~G~Gk~LF~~ML~~   74 (120)
T PF05301_consen   18 GAVIGFLKVGYKKLFLLDERGQHREIEPLLCVLDFYVHESRQRRGYGKRLFDHMLQE   74 (120)
T ss_pred             ceEEEEEEEeeeeEEEEcCCCCEEEecccceeeeEEEEeceeccCchHHHHHHHHHH
Confidence            67999997642    112 11     3443578999999999999999999988653


No 72 
>PF01233 NMT:  Myristoyl-CoA:protein N-myristoyltransferase, N-terminal domain;  InterPro: IPR022676 Myristoyl-CoA:protein N-myristoyltransferase (2.3.1.97 from EC) (Nmt) [] is the enzyme responsible for transferring a myristate group on the N-terminal glycine of a number of cellular eukaryotics and viral proteins. Nmt is a monomeric protein of about 50 to 60kDa whose sequence appears to be well conserved.  The N and C-terminal domains of NMT are structurally similar, each adopting an acyl-CoA N-acyltransferase-like fold. This entry represents the N-terminal region. ; GO: 0004379 glycylpeptide N-tetradecanoyltransferase activity; PDB: 2P6G_B 2P6F_F 2P6E_A 1IIC_A 1IID_A 2NMT_A 4A33_A 3H5Z_A 4A2Z_A 2WSA_A ....
Probab=96.89  E-value=0.014  Score=43.22  Aligned_cols=102  Identities=15%  Similarity=0.010  Sum_probs=68.0

Q ss_pred             ccCCCCCHHHHHHHHHHHHHhHHHHHHhC----CCCCChhhhhhhhcCCCc---eEEEEEeCCCCCeEEEEEEEee----
Q psy838           47 KKSTDLGAETKKWVWELEEKNMKHSYEVC----DIGWDPQGKHSEMFDDRA---CYLVAKNGSSSTPVAFSHFRFD----  115 (152)
Q Consensus        47 ~~~~~l~~~d~~~i~~l~~~~m~~~y~~~----~~~w~~~~~~~el~~~~~---~~~v~~~~~~~~ivGf~~~~~~----  115 (152)
                      .+...-++++++++++|-.+|   +.+..    .+.++.+.+.=.|..|+.   +.+-+....++++|||++..+.    
T Consensus        27 ~~~dl~d~~~l~ely~lL~~n---YVEDdd~~fRf~YS~efL~WaL~pPg~~~~whiGVR~~~~~kLvgfIsaip~~irv  103 (162)
T PF01233_consen   27 STLDLNDDEELKELYELLNEN---YVEDDDNMFRFDYSKEFLKWALKPPGWKKEWHIGVRVKSSKKLVGFISAIPATIRV  103 (162)
T ss_dssp             EE--TTSHHHHHHHHHHHHHH---SSBTTTSSEEE---HHHHHHHHTSTT--GGGEEEEEETTTTEEEEEEEEEEEEEEE
T ss_pred             EecCCCCHHHHHHHHHHHHhc---CccCCcceEEeeCCHHHHhheeeCcCCccceEEEEEECCCCEEEEEEccceEEEEE
Confidence            334555788899999987765   22221    246888888878887643   2334444336899999997653    


Q ss_pred             ccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          116 VDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       116 ~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      .+......+|.-|.|++..|.++++-.|+.++-..+
T Consensus       104 ~~~~~~~~eINFLCVhKklRskrlAPvLIkEItRRv  139 (162)
T PF01233_consen  104 RDKVIKMVEINFLCVHKKLRSKRLAPVLIKEITRRV  139 (162)
T ss_dssp             TTEEEEEEEEEEEEE-GGGTTSSHHHHHHHHHHHHH
T ss_pred             eeeEeeeeeEEEEeecHhHhhcCCcHHHHHHHHHHh
Confidence            223345678999999999999999999999887643


No 73 
>PF01853 MOZ_SAS:  MOZ/SAS family;  InterPro: IPR002717 Moz is a monocytic leukemia Zn_finger protein and the SAS protein from Saccharomyces cerevisiae (Baker's yeast) is involved in silencing the Hmr locus. These proteins were reported to be homologous to acetyltransferases [] but this similarity is not supported by standard sequence analysis.; GO: 0016747 transferase activity, transferring acyl groups other than amino-acyl groups, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 3TO6_A 1MJA_A 1MJ9_A 3TO7_A 3TO9_A 1MJB_A 1FY7_A 2OZU_A 2RC4_A 2OU2_A ....
Probab=96.82  E-value=0.0034  Score=47.67  Aligned_cols=57  Identities=16%  Similarity=0.137  Sum_probs=37.1

Q ss_pred             CCceEEEEEeCCCC-CeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhh
Q psy838           91 DRACYLVAKNGSSS-TPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTY  150 (152)
Q Consensus        91 ~~~~~~v~~~~~~~-~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~  150 (152)
                      +...|++...+++| .+|||.+=.-...+   ...|..|.|.|.||++|+|+.||+.--.+
T Consensus        51 ~F~FYVl~e~d~~g~h~vGyFSKEk~s~~---~~NLsCIl~lP~yQrkGyG~~LI~fSY~L  108 (188)
T PF01853_consen   51 PFLFYVLTEKDDDGFHIVGYFSKEKESWD---NNNLSCILTLPPYQRKGYGRFLIDFSYEL  108 (188)
T ss_dssp             TEEEEEEEEEETTEEEEEEEEEEESS-TT----EEESEEEE-GGGTTSSHHHHHHHHHHHH
T ss_pred             ceEEEEEEEecCccceeEEEEEEEecccC---CeeEeehhhcchhhhcchhhhhhhhHHHH
Confidence            34455555544222 58899875432222   34588899999999999999999875444


No 74 
>COG3981 Predicted acetyltransferase [General function prediction only]
Probab=96.74  E-value=0.0038  Score=46.64  Aligned_cols=54  Identities=19%  Similarity=0.273  Sum_probs=39.5

Q ss_pred             CceEEEEEeCCCCCeEEEEEEEeeccC------CcceEEEeeeeeccccccCCHHHHHHHHHHhhcC
Q psy838           92 RACYLVAKNGSSSTPVAFSHFRFDVDF------GEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYID  152 (152)
Q Consensus        92 ~~~~~v~~~~~~~~ivGf~~~~~~~~~------~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a~  152 (152)
                      .+.|+.+. + ++++||++.++....+      |+.     .=-|+|+.||+|+|++++....+.|+
T Consensus        68 ~~~y~~v~-~-d~~ivG~i~lRh~Ln~~ll~~gGHI-----GY~VrPseR~KGYA~emLkl~L~~ar  127 (174)
T COG3981          68 ASTYWAVD-E-DGQIVGFINLRHQLNDFLLEEGGHI-----GYSVRPSERRKGYAKEMLKLALEKAR  127 (174)
T ss_pred             ceeEEEEe-c-CCcEEEEEEeeeecchHHHhcCCcc-----cceeChhhhccCHHHHHHHHHHHHHH
Confidence            34454444 4 5999999999976542      221     13499999999999999999887763


No 75 
>PF13880 Acetyltransf_13:  ESCO1/2 acetyl-transferase
Probab=96.51  E-value=0.0021  Score=41.23  Aligned_cols=27  Identities=15%  Similarity=0.156  Sum_probs=23.9

Q ss_pred             EEEeeeeeccccccCCHHHHHHHHHHh
Q psy838          123 LYCYELQLEKQVQRKGLGKFMMQAFYT  149 (152)
Q Consensus       123 ~~I~~i~V~p~~qgkGiG~~Lm~~l~~  149 (152)
                      +=|.+|.|+|.+|++||++.||+.+-.
T Consensus         6 ~GI~RIWV~~~~RR~GIAt~Lld~ar~   32 (70)
T PF13880_consen    6 CGISRIWVSPSHRRKGIATRLLDAARE   32 (70)
T ss_pred             EEeEEEEeChhhhhhhHHHHHHHHHHH
Confidence            448899999999999999999998754


No 76 
>PLN03238 probable histone acetyltransferase MYST; Provisional
Probab=96.42  E-value=0.0099  Score=47.86  Aligned_cols=55  Identities=20%  Similarity=0.187  Sum_probs=37.4

Q ss_pred             ceEEEEEeCC-CCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhh
Q psy838           93 ACYLVAKNGS-SSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTY  150 (152)
Q Consensus        93 ~~~~v~~~~~-~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~  150 (152)
                      ..|++...++ +..+|||.+=.....++   ..|-.|-+.|.||++|+|+.||+.--.+
T Consensus       128 lFYVl~e~d~~g~h~vGYFSKEK~s~~~---nNLaCIltLPpyQrkGyG~~LI~fSYeL  183 (290)
T PLN03238        128 LFYVMTEVDDHGSHIVGYFSKEKVSAED---YNLACILTLPPYQRKGYGKFLISFAYEL  183 (290)
T ss_pred             EEEEEEEecCCCcEEEEEeceeccccCC---CcEEEEEecChhhhccHhHhHHHHHhHH
Confidence            3455544432 34699997754332222   3478899999999999999999876544


No 77 
>KOG3138|consensus
Probab=96.40  E-value=0.0035  Score=47.61  Aligned_cols=95  Identities=15%  Similarity=0.143  Sum_probs=59.6

Q ss_pred             ccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeecc-CC------
Q psy838           47 KKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVD-FG------  119 (152)
Q Consensus        47 ~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~-~~------  119 (152)
                      .....+++.++...-.|.+......|       ... +..+..+.....-++..+  +..+|-++...... .+      
T Consensus        17 ~~l~~it~~nl~~~~~l~~~~fP~~y-------~~k-fy~~~~~~~~~~~~A~~~--~~~v~a~~~k~~~~~~~~~r~~~   86 (187)
T KOG3138|consen   17 IELRLITPNNLKQLKQLNEDIFPISY-------VDK-FYPDVLSNGDLTQLAYYN--EIAVGAVACKLIKFVQNAKRLFG   86 (187)
T ss_pred             eeeccCCcchHHHHHHHhccccCcch-------HHH-HHHHHHhcCCHHHhhhhc--cccccceeeeehhhhhhhhhhhc
Confidence            44567788888877777665433333       222 334443332222233333  56666666654321 11      


Q ss_pred             cceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          120 EPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       120 ~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      ..++||.-|-|.|.||.+|||+.||+.+.+.+
T Consensus        87 ~~~~yi~~Lgvl~~yR~~gIGs~Ll~~~~~~~  118 (187)
T KOG3138|consen   87 NRVIYILSLGVLPRYRNKGIGSKLLEFVKKYC  118 (187)
T ss_pred             cceeEEEeecccHHHHhcchHHHHHHHHHHHH
Confidence            11588999999999999999999999998875


No 78 
>PLN03239 histone acetyltransferase; Provisional
Probab=96.05  E-value=0.015  Score=48.05  Aligned_cols=55  Identities=22%  Similarity=0.297  Sum_probs=36.5

Q ss_pred             ceEEEEEeCC-CCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhh
Q psy838           93 ACYLVAKNGS-SSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTY  150 (152)
Q Consensus        93 ~~~~v~~~~~-~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~  150 (152)
                      ..|++...++ +-.+|||.+=.....++   ..|-.|-+.|.||++|+|+.||+.--.+
T Consensus       186 lFYVl~e~D~~g~h~vGYFSKEK~s~~~---~NLaCIltLPpyQrkGyG~lLI~fSYeL  241 (351)
T PLN03239        186 LFYVLCEVDERGFHPVGYYSKEKYSDVG---YNLACILTFPAHQRKGYGRFLIAFSYEL  241 (351)
T ss_pred             EEEEEEEecCCceEEEEEeeecccCCCC---CceEEEEecChhhhcchhhhhHhhhhHh
Confidence            3444444332 23689997743322222   3488899999999999999999875444


No 79 
>COG1670 RimL Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]
Probab=95.99  E-value=0.047  Score=39.28  Aligned_cols=47  Identities=11%  Similarity=0.026  Sum_probs=34.3

Q ss_pred             CCeEEEEEEEeecc-CCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          104 STPVAFSHFRFDVD-FGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       104 ~~ivGf~~~~~~~~-~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      +++||.+.+....+ ...... .....+.|+++|+|+|+..+..+.++|
T Consensus        77 ~~~iG~~~~~~~~~~~~~~~~-~ig~~l~~~~~g~G~~tea~~~~l~~~  124 (187)
T COG1670          77 GELIGVIGLSDIDRAANGDLA-EIGYWLDPEYWGKGYATEALRALLDYA  124 (187)
T ss_pred             CeEEEEEEEEEeccccccceE-EEEEEEChHHhcCchHHHHHHHHHHHh
Confidence            58999999986542 111112 334557999999999999999998875


No 80 
>COG3393 Predicted acetyltransferase [General function prediction only]
Probab=95.97  E-value=0.015  Score=46.25  Aligned_cols=52  Identities=10%  Similarity=0.030  Sum_probs=37.1

Q ss_pred             CceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHH
Q psy838           92 RACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFY  148 (152)
Q Consensus        92 ~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~  148 (152)
                      .++.++..++  |++|.-+.-.-..+   ....|..++++|+|||+|+|..|+..|-
T Consensus       176 ~~~~~f~~~d--~~iVa~A~t~a~~~---~~~~I~gV~T~peyR~kGyAt~lva~L~  227 (268)
T COG3393         176 RSRTYFLEGD--GKIVAKAETAAENP---AYAQINGVYTHPEYRGKGYATALVATLA  227 (268)
T ss_pred             ceeEEEEccC--CcEEEeeeccccCC---cceEEEEEEcCHHHccccHHHHHHHHHH
Confidence            3444444443  69988877543322   2345899999999999999999998874


No 81 
>PF12746 GNAT_acetyltran:  GNAT acetyltransferase; PDB: 3G3S_B.
Probab=95.90  E-value=0.026  Score=45.16  Aligned_cols=53  Identities=15%  Similarity=0.160  Sum_probs=34.8

Q ss_pred             CceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838           92 RACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus        92 ~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      +--|.++ .+  +++|.-+.-.+.++.+   +.| +|..+|+|||+|+|+.+-..++..|
T Consensus       165 G~Gf~i~-~~--~~iVs~~~s~~~~~~~---~EI-~I~T~~~yR~kGLA~~~aa~~I~~C  217 (265)
T PF12746_consen  165 GFGFCIL-HD--GEIVSGCSSYFVYENG---IEI-DIETHPEYRGKGLATAVAAAFILEC  217 (265)
T ss_dssp             --EEEEE-ET--TEEEEEEEEEEEETTE---EEE-EEEE-CCCTTSSHHHHHHHHHHHHH
T ss_pred             CcEEEEE-EC--CEEEEEEEEEEEECCE---EEE-EEEECHHhhcCCHHHHHHHHHHHHH
Confidence            3344444 44  7877665544544433   223 5889999999999999999998765


No 82 
>PF06852 DUF1248:  Protein of unknown function (DUF1248);  InterPro: IPR009658 This entry represents a conserved region within a number of proteins of unknown function that seem to be specific to Caenorhabditis elegans. Note that some proteins in the entry contain more than one copy of this region.
Probab=95.86  E-value=0.085  Score=39.93  Aligned_cols=63  Identities=22%  Similarity=0.288  Sum_probs=39.4

Q ss_pred             hhhhcCC-CceEEEEEeCCCCCeEEEEEEE-eec---cCCcceEEEeeeeeccccccCCHHHHHHHHHH
Q psy838           85 HSEMFDD-RACYLVAKNGSSSTPVAFSHFR-FDV---DFGEPVLYCYELQLEKQVQRKGLGKFMMQAFY  148 (152)
Q Consensus        85 ~~el~~~-~~~~~v~~~~~~~~ivGf~~~~-~~~---~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~  148 (152)
                      ..+-+.. -..++++..+ .+++|+-+++- +..   ....|..++--.+++|+|||+|+++.+-+.+.
T Consensus        37 wk~sf~~~Y~l~~~~~Kg-T~~via~~~~~~~~~l~~~~d~pl~~~G~~w~~p~yRg~~~~kl~~~~~~  104 (181)
T PF06852_consen   37 WKESFDDDYWLVLTCLKG-TDRVIATVHLIRFDPLNPSPDKPLQFIGFFWIDPEYRGKGIMKLQDDICM  104 (181)
T ss_pred             HHHhhccCeEEEEEEEcC-CCcEEEEEEEEEeccCCCCCCCCeEEEeeeeeCCcccCcchHHHHHHHHH
Confidence            3333444 3344445544 56777776653 221   12367888999999999999999974444433


No 83 
>PLN00104 MYST -like histone acetyltransferase; Provisional
Probab=95.83  E-value=0.013  Score=49.96  Aligned_cols=54  Identities=19%  Similarity=0.215  Sum_probs=36.7

Q ss_pred             ceEEEEEeCC-CCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHh
Q psy838           93 ACYLVAKNGS-SSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYT  149 (152)
Q Consensus        93 ~~~~v~~~~~-~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~  149 (152)
                      ..|++...++ +-.+|||.+=.....++   ..|-.|-+.|.||++|+|+.||+.--.
T Consensus       279 lFYvl~e~d~~g~h~vGyFSKEk~s~~~---~NLaCIltlP~yQrkGyG~~LI~~SYe  333 (450)
T PLN00104        279 LFYVLCECDDRGCHMVGYFSKEKHSEED---YNLACILTLPPYQRKGYGKFLIAFSYE  333 (450)
T ss_pred             EEEEEEEecCCCcEEEEEecccccCcCC---CceEEEEecchhhhcchhheehhheeh
Confidence            4455544332 34799997744332222   348899999999999999999986443


No 84 
>PTZ00064 histone acetyltransferase; Provisional
Probab=95.81  E-value=0.017  Score=49.69  Aligned_cols=56  Identities=18%  Similarity=0.129  Sum_probs=37.6

Q ss_pred             CceEEEEEeCC-CCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhh
Q psy838           92 RACYLVAKNGS-SSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTY  150 (152)
Q Consensus        92 ~~~~~v~~~~~-~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~  150 (152)
                      ...|++...++ +-.+|||.+=.....+   ...|-.|-+.|.||++|+|+.||+.--.+
T Consensus       356 FlFYVLtE~D~~G~HiVGYFSKEK~S~~---~nNLACILtLPpyQRKGYGklLIdfSYeL  412 (552)
T PTZ00064        356 FLFYIVTEVDEEGCHIVGYFSKEKVSLL---HYNLACILTLPCYQRKGYGKLLVDLSYKL  412 (552)
T ss_pred             eEEEEEEEecCCCcEEEEEecccccCcc---cCceEEEEecchhhhcchhhhhhhhhhhh
Confidence            34455544432 3479999774332222   23488999999999999999999875444


No 85 
>KOG4601|consensus
Probab=95.63  E-value=0.091  Score=41.18  Aligned_cols=45  Identities=18%  Similarity=0.191  Sum_probs=33.7

Q ss_pred             CCeEEEEEEEe----ecc------CCcceEEEeeeeeccccccCCHHHHHHHHHHh
Q psy838          104 STPVAFSHFRF----DVD------FGEPVLYCYELQLEKQVQRKGLGKFMMQAFYT  149 (152)
Q Consensus       104 ~~ivGf~~~~~----~~~------~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~  149 (152)
                      +.+.||+.+-+    .+|      +++ .+-|.++||+++.|++|.|+.|++.+.+
T Consensus        81 s~l~GllKVG~KkLfl~D~~~~~ye~e-~lcILDFyVheS~QR~G~G~~lfdyMl~  135 (264)
T KOG4601|consen   81 SILKGLLKVGYKKLFLTDNEQNQYEEE-ALCILDFYVHESEQRSGNGFKLFDYMLK  135 (264)
T ss_pred             hheeeeehccceeEEEeccHhhhhccC-CceEEEEEeehhhhhcCchHHHHHHHHH
Confidence            46888886532    223      233 3448999999999999999999998765


No 86 
>TIGR03694 exosort_acyl putative PEP-CTERM/exosortase system-associated acyltransferase. Members of this protein family are restricted to bacterial species with the PEP-CTERM/exosortase system predicted to act in exopolysaccharide-associated protein targeting. PSI-BLAST and CDD reveal relationships to the acyltransferase family that includes N-acyl-L-homoserine lactone synthetase. Several members of this family may be found in a single genome. These proteins likely contribute to chemical modifications in exopolysaccharide and biofilm structural material production.
Probab=95.29  E-value=0.34  Score=38.09  Aligned_cols=83  Identities=14%  Similarity=0.255  Sum_probs=52.4

Q ss_pred             CCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChh-----hhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeec--------
Q psy838           50 TDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQ-----GKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDV--------  116 (152)
Q Consensus        50 ~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~-----~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~--------  116 (152)
                      ...++++++.++.|-.+    .|-. ..+|...     ....+-.+..+.++++...++|++||.+.+.+..        
T Consensus        12 ~a~~~~~~~~~~~lR~~----VFv~-e~gw~~~~~~~~~~E~D~~D~~~~h~l~~~~~~g~vvG~~RLl~t~~~~p~~~~   86 (241)
T TIGR03694        12 PAVTPELLEEAFRLRYQ----VYCE-ELGFEPPSDYPDGLETDEYDAHSVHSLLRHRRTGTFVGCVRLVLPNSSDPDQPF   86 (241)
T ss_pred             EcCCHHHHHHHHHHHHH----HHHH-hcCCCCCCCCCCCCcCCCCCCCCcEEEEEECCCCCEEEEEEEeccccccccccc
Confidence            34566778999999876    4433 4567632     2222333444555555543258999999987531        


Q ss_pred             c-------------------CCcceEEEeeeeeccccccC
Q psy838          117 D-------------------FGEPVLYCYELQLEKQVQRK  137 (152)
Q Consensus       117 ~-------------------~~~~~~~I~~i~V~p~~qgk  137 (152)
                      .                   ....++++.++.|+|++|+.
T Consensus        87 p~e~~~~~~~~~~~~~~~~~~~~~i~E~SRf~V~~~~r~r  126 (241)
T TIGR03694        87 PFEKHCSHSLDGLFLDPRRLPRSRIAEVSRLAVSKDFRRR  126 (241)
T ss_pred             cHHHHhccccchhhcCccccCCCceEEeehheECHhHhCC
Confidence            0                   01347889999999999974


No 87 
>PRK13834 putative autoinducer synthesis protein; Provisional
Probab=95.20  E-value=0.42  Score=36.75  Aligned_cols=96  Identities=13%  Similarity=0.194  Sum_probs=61.6

Q ss_pred             CCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhh---h-hhhcCCCceEEEEEeCCCCCeEEEEEEEeecc--------
Q psy838           50 TDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGK---H-SEMFDDRACYLVAKNGSSSTPVAFSHFRFDVD--------  117 (152)
Q Consensus        50 ~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~---~-~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~--------  117 (152)
                      ....+++++.++.|-.+    .|.. ..+|+....   . ++.-++..+|+++..+ +|+++|.+.+.+...        
T Consensus        11 ~~~~~~~l~~~~rLR~~----VF~~-elgW~~~~~~g~E~D~yD~~~~~yll~~~~-~g~vvG~~RLlptt~p~ml~~~f   84 (207)
T PRK13834         11 YEREASLLKQMHRLRAR----VFGG-RLGWDVSITDGEERDQFDDLKPTYILAISD-SGRVAGCARLLPAIGPTMLAQVF   84 (207)
T ss_pred             hhcCHHHHHHHHHHHHH----Hhcc-ccCCCCCCCCCcCccCCCCCCCEEEEEEeC-CCeEEEEEecccCCCcchhhhhc
Confidence            45667889999999866    4533 467865432   1 1122345677777654 589999998754311        


Q ss_pred             ----------CCcceEEEeeeeeccccccC---C----HHHHHHHHHHhhc
Q psy838          118 ----------FGEPVLYCYELQLEKQVQRK---G----LGKFMMQAFYTYI  151 (152)
Q Consensus       118 ----------~~~~~~~I~~i~V~p~~qgk---G----iG~~Lm~~l~~~a  151 (152)
                                ....++++.++.|+|+++..   +    +...|+..+.++|
T Consensus        85 p~l~~~~~~~~~~~v~E~SRf~V~~~~~~~~~~~~~~~~~~~L~~~~~~~a  135 (207)
T PRK13834         85 PQLLPAGRLNAHPAMIESSRFCVDTALAEGRGGGQLHEATLTMFAGIIEWS  135 (207)
T ss_pred             HHhcCCCCCCCCCCEEEEeeeEEcccccccccccccCHHHHHHHHHHHHHH
Confidence                      12457889999999986322   2    5566777776654


No 88 
>COG5628 Predicted acetyltransferase [General function prediction only]
Probab=95.04  E-value=0.076  Score=37.70  Aligned_cols=55  Identities=18%  Similarity=0.219  Sum_probs=39.1

Q ss_pred             ceEEEEEeCCCCCeEEEEEEEee-ccCCcceEEEeeeeeccccccCCHHHHHHHHHHhh
Q psy838           93 ACYLVAKNGSSSTPVAFSHFRFD-VDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTY  150 (152)
Q Consensus        93 ~~~~v~~~~~~~~ivGf~~~~~~-~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~  150 (152)
                      +.|++.. +  |.+|||+.+--. .........+.++.|...|||+|+||+...++-..
T Consensus        38 ~~~~~~~-~--~~~igf~l~L~~~~~~~~iD~~~~efFIi~k~~~~GvGR~aaK~If~~   93 (143)
T COG5628          38 EAWLFRI-G--GLPVGFALVLDLAHSPTPIDRAVAEFFIVRKHRRRGVGRAAAKAIFGS   93 (143)
T ss_pred             ceeEEEE-C--CceeeeeeeecccCCCCcccccchheEeeehhhccchhHHHHHHHHHH
Confidence            4455544 3  899999976422 12223345688999999999999999999887543


No 89 
>PF00765 Autoind_synth:  Autoinducer synthetase;  InterPro: IPR001690 Bacterial species have many methods of controlling gene expression and cell growth. Regulation of gene expression in response to changes in cell density is termed quorum sensing [, ]. Quorum-sensing bacteria produce, release and respond to hormone-like molecules (autoinducers) that accumulate in the external environment as the cell population grows. Once a threshold of these molecules is reached, a signal transduction cascade is triggered that ultimately leads to behavioural changes in the bacterium []. Autoinducers are thus clearly important mediators of molecular communication. Conjugal transfer of Agrobacterium octopine-type Ti plasmids is activated by octopine, a metabolite released from plant tumours []. Octopine causes conjugal donors to secrete a pheromone, Agrobacterium autoinducer (AAI), and exogenous AAI further stimulates conjugation. The putative AAI synthase and an AAI-responsive transcriptional regulator have been found to be encoded by the Ti plasmid traI and traR genes, respectively. TraR and TraI are similar to the LuxR and LuxI regulatory proteins of Vibrio fischeri, and AAI is similar in structure to the diffusable V. fischeri autoinducer, the inducing ligand of LuxR. TraR activates target genes in the presence of AAI and also activates traR and traI themselves, creating two positive-feedback loops. TraR-AAI-mediated activation in wild-type Agrobacterium strains is enhanced by culturing on solid media, suggesting a possible role in cell density sensing []. Production of light by the marine bacterium V. fischeri and by recombinant hosts containing cloned lux genes is controlled by the density of the culture []. Density-dependent regulation of lux gene expression has been shown to require a locus consisting of the luxR and luxI genes. In these and other Gram-negative bacteria, N-(3-oxohexanoyl)-L-homoserine lactone (OHHL) acts as the autoinducer by binding to transcriptional regulatory proteins and activating them []. OHHL and related molecules, such as N-butanoyl- (BHL), N-hexanoyl- (HHL) and N-oxododecanoyl- (PAI) homoserine lactones, are produced by a family of proteins that share a high level of sequence similarity. Proteins which currently members of this family include:  luxI from V. fischeri. ahyI and asaI from Aeromonas species, which synthesize BHL and whose targets are ahyR and asaR respectively. carI from Erwinia carotovora. The target of OHHL is carR which activates genes involved in the biosynthesis of carbapenem antibiotics. eagI from Enterobacter agglomerans. The target of OHHL is not yet known. esaI from Erwinia stewartii.  expI from Erwinia carotovora.  lasI from Pseudomonas aeruginosa, which synthesizes PAI and whose target is lasR which activates the transcription of the elastase gene. rhlI (or vsmI) from P. aeruginosa, which synthesizes BHL and HHL and whose target is rhlR. swrI from Serratia liquefaciens, which synthesizes BHL. yenI from Yersinia enterocolitica.  ; GO: 0007165 signal transduction; PDB: 3P2H_A 3P2F_A 1KZF_A 1K4J_A 1RO5_A.
Probab=94.79  E-value=0.64  Score=35.07  Aligned_cols=93  Identities=16%  Similarity=0.236  Sum_probs=63.9

Q ss_pred             CCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhh-----cCCCceEEEEEeCCCCCeEEEEEEEeecc--------
Q psy838           51 DLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEM-----FDDRACYLVAKNGSSSTPVAFSHFRFDVD--------  117 (152)
Q Consensus        51 ~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el-----~~~~~~~~v~~~~~~~~ivGf~~~~~~~~--------  117 (152)
                      .++.++++.++.+-.+    .|.. ..+|+.... ..+     -+++..|+++..+  |+++|.+.+.++..        
T Consensus         4 ~~~~~~l~~~~rlR~~----vFv~-rlgW~v~~~-dg~E~DqyD~~~~~ylv~~~~--g~v~g~~RLlptt~p~ML~~~F   75 (182)
T PF00765_consen    4 ELSRRLLEEMFRLRHR----VFVD-RLGWDVPCE-DGMEIDQYDDPDAVYLVALDD--GRVVGCARLLPTTGPYMLSDVF   75 (182)
T ss_dssp             HHHHHHHHHHHHHHHH----HHTT-CSCCCHHCC-TSEE--TTGCTT-EEEEEEET--TEEEEEEEEEETTS--HHHHCT
T ss_pred             ccCHHHHHHHHHHHHH----HHHH-hhCCCCcCC-CCcEeeecCCCCCeEEEEEEC--CEEEEEeeeccCCCcchhhhHH
Confidence            3456788899988765    5654 578987543 222     2356778888876  99999999986432        


Q ss_pred             ----------CCcceEEEeeeeecccccc------CCHHHHHHHHHHhhc
Q psy838          118 ----------FGEPVLYCYELQLEKQVQR------KGLGKFMMQAFYTYI  151 (152)
Q Consensus       118 ----------~~~~~~~I~~i~V~p~~qg------kGiG~~Lm~~l~~~a  151 (152)
                                .+..++.+.++.|+|+..+      .-+...|+..+.++|
T Consensus        76 ~~ll~~~~~p~~~~vwE~SRf~v~~~~~~~~~~~~~~~~~~L~~~~~e~a  125 (182)
T PF00765_consen   76 PHLLPDGPAPRSPDVWELSRFCVDPDRRRSRAGSRSPVTMELLLGMVEFA  125 (182)
T ss_dssp             GGGHTTS---SSTTEEEEEEEEE-HCCCHHCHSCC-THHHHHHHHHHHHH
T ss_pred             HHHhCCCCCCCCCcceeeeEEEEcccccccccccccHHHHHHHHHHHHHH
Confidence                      1246799999999998532      246778888887765


No 90 
>KOG2747|consensus
Probab=94.76  E-value=0.037  Score=46.43  Aligned_cols=54  Identities=19%  Similarity=0.178  Sum_probs=33.8

Q ss_pred             CCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHH
Q psy838           91 DRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFY  148 (152)
Q Consensus        91 ~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~  148 (152)
                      +...|++...++.| .|||.+=.-.   .....+|..|-|.|-||++|.|+.||+.--
T Consensus       233 pFlFYVlte~d~~G-~VGYFSKEK~---s~~~yNlaCILtLPpyQRkGYGklLIdFSY  286 (396)
T KOG2747|consen  233 PFLFYVLTECDSYG-CVGYFSKEKE---SSENYNLACILTLPPYQRKGYGKLLIDFSY  286 (396)
T ss_pred             ceEEEEEEecCCcc-eeeeeccccc---cccccceeeeeecChhhhcccchhhhhhhh
Confidence            33445554444233 4666542211   112345888999999999999999998643


No 91 
>cd04264 DUF619-NAGS DUF619 domain of various N-acetylglutamate Synthases of the fungal arginine-biosynthetic pathway and urea cycle found in humans and fish. DUF619-NAGS: This family includes the DUF619 domain of various N-acetylglutamate synthases (NAGS) of the urea cycle found in humans and fish, the DUF619 domain of the NAGS of the fungal arginine-biosynthetic pathway (FABP), as well as the DUF619 domain present in C-terminal of a NAG kinase-like domain in a limited number of predicted NAGSs found in bacteria and Dictyostelium. Ureogenic NAGS is a mitochondrial enzyme catalyzing the formation of NAG from acetylcoenzyme A and L-glutamate. NAGS is an essential allosteric activator of carbamylphosphate synthase I, the first and rate limiting enzyme of the urea cycle. Domain architecture of ureogenic and fungal NAGS consists of an N-terminal NAG kinase-like domain and a C-terminal DUF619 domain. The DUF619 domain function has yet to be characterized.
Probab=94.39  E-value=0.21  Score=34.08  Aligned_cols=44  Identities=11%  Similarity=0.126  Sum_probs=32.7

Q ss_pred             CeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHh
Q psy838          105 TPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYT  149 (152)
Q Consensus       105 ~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~  149 (152)
                      ...|.+.+.... .+..+.||-.++|.|+.||+|+|..|++.+.+
T Consensus        18 ~y~~~aIvt~~~-~~~~~~yLdKfaV~~~~~g~gvad~vf~~i~~   61 (99)
T cd04264          18 GYNAAAIVTYEG-VNNGVPYLDKFAVSSSAQGEGTSDALWRRLRR   61 (99)
T ss_pred             CceEEEEEeccC-CCCCceEEEEEEEchhhhhcChHHHHHHHHHh
Confidence            344554443321 12457899999999999999999999999875


No 92 
>COG5630 ARG2 Acetylglutamate synthase [Amino acid transport and metabolism]
Probab=94.38  E-value=0.33  Score=40.79  Aligned_cols=99  Identities=13%  Similarity=0.114  Sum_probs=66.8

Q ss_pred             cccCCCCcccccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeec
Q psy838           37 YTCSFCGKEYKKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDV  116 (152)
Q Consensus        37 ~~c~~c~~~~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~  116 (152)
                      +.-|.||.+  .+.....=|++.+-.|+.+    .|..   ..+++.....+..+-...+|+.     .--|.+.+.+.-
T Consensus       329 f~~p~~~~~--~sttw~~Ldl~r~q~LI~~----SFkR---TLd~h~y~~r~~~~La~~iVsg-----dY~g~aIlTyeg  394 (495)
T COG5630         329 FHVPEKGAK--PSTTWKDLDLPRLQHLIQS----SFKR---TLDPHYYETRINTPLARAIVSG-----DYRGAAILTYEG  394 (495)
T ss_pred             Eeccccccc--CCCChhhcCcHHHHHHHHH----HHhh---ccCHHHHHHhccCcceeEEeec-----cceeeEEEEeec
Confidence            444666643  3444455566777777765    5543   4578888888888877666542     234555554433


Q ss_pred             cCCcceEEEeeeeecccccc-CCHHHHHHHHHHh
Q psy838          117 DFGEPVLYCYELQLEKQVQR-KGLGKFMMQAFYT  149 (152)
Q Consensus       117 ~~~~~~~~I~~i~V~p~~qg-kGiG~~Lm~~l~~  149 (152)
                      ..++.+-||..++|.++.|| -|||..+.+-+-+
T Consensus       395 s~~~~vpYLDKfAVl~~aQGs~gisd~vfniM~e  428 (495)
T COG5630         395 SGENNVPYLDKFAVLDDAQGSEGISDAVFNIMRE  428 (495)
T ss_pred             cCCCCCcceeeeeccccccccchHHHHHHHHHHH
Confidence            34456889999999999999 9999999887654


No 93 
>TIGR03243 arg_catab_AOST arginine and ornithine succinyltransferase subunits. In many bacteria, the sole member of this protein family is arginine N-succinyltransferase (EC 2.3.1.109), the AstA protein of the arginine succinyltransferase (ast) pathway. However, in Pseudomonas aeruginosa and several other species, a tandem gene pair encodes alpha and beta subunits of a heterodimer that is designated arginine and ornithine succinyltransferase (AOST).
Probab=93.35  E-value=0.27  Score=40.59  Aligned_cols=93  Identities=12%  Similarity=0.085  Sum_probs=54.5

Q ss_pred             CCCCHHHHHHHHHHHHHhHHHHHHhCCCC-----------CChhhhhhhhcCCCceEEEEEeC-CCCCeEEEEEEEee--
Q psy838           50 TDLGAETKKWVWELEEKNMKHSYEVCDIG-----------WDPQGKHSEMFDDRACYLVAKNG-SSSTPVAFSHFRFD--  115 (152)
Q Consensus        50 ~~l~~~d~~~i~~l~~~~m~~~y~~~~~~-----------w~~~~~~~el~~~~~~~~v~~~~-~~~~ivGf~~~~~~--  115 (152)
                      .++..+|++.+++|..+.= .=+  .+.|           ++...+..+....+..|++|.++ ++|++||...+.-.  
T Consensus         3 Rpv~~~Dl~aL~~LA~~sg-~G~--TsLP~d~~~L~~rI~~S~~sF~~~~~~~~~~YlFVLED~~tg~vvGts~I~a~vG   79 (335)
T TIGR03243         3 RPVRTSDLDALMQLARESG-IGL--TSLPADRAALGSRIARSEKSFAGESTRGEEGYLFVLEDTETGTVAGVSAIEAAVG   79 (335)
T ss_pred             ccCccccHHHHHHHHHHcC-CCc--ccCCCCHHHHHHHHHHHHHHHhcccCCCCccEEEEEEeCCCCeEEEEEeEEeccc
Confidence            4677888999988876520 000  0122           34444433322235567776653 35899998874211  


Q ss_pred             ---------cc----------------------CCcceEEEeeeeeccccccCCHHHHHHH
Q psy838          116 ---------VD----------------------FGEPVLYCYELQLEKQVQRKGLGKFMMQ  145 (152)
Q Consensus       116 ---------~~----------------------~~~~~~~I~~i~V~p~~qgkGiG~~Lm~  145 (152)
                               .+                      +-.-.-.|-.|.|+|+||+-|.|+.|-.
T Consensus        80 ~~~PfY~yrv~~~vhaS~~L~v~~~~~~L~l~nd~tG~sElctLfL~p~~R~~~~G~LLSr  140 (335)
T TIGR03243        80 LDEPFYNYRVGTLVHASRELGVYNKIPTLTLSNDLTGSSELCTLFLDPDYRKGGNGRLLSR  140 (335)
T ss_pred             CCCCCEEEEcCceeecCcccCCccceeeEEeeccCCCCeeeEEEEECHHHcCCCchhhHHH
Confidence                     00                      0011234778999999999999998743


No 94 
>KOG2779|consensus
Probab=93.34  E-value=0.28  Score=40.85  Aligned_cols=100  Identities=13%  Similarity=0.063  Sum_probs=65.9

Q ss_pred             CCHHHHHHHHHHHHHhHH-HHHHhCCCCCChhhhhhhhcCCC---ceEEEEEeCCCCCeEEEEEEEee----ccCCcceE
Q psy838           52 LGAETKKWVWELEEKNMK-HSYEVCDIGWDPQGKHSEMFDDR---ACYLVAKNGSSSTPVAFSHFRFD----VDFGEPVL  123 (152)
Q Consensus        52 l~~~d~~~i~~l~~~~m~-~~y~~~~~~w~~~~~~~el~~~~---~~~~v~~~~~~~~ivGf~~~~~~----~~~~~~~~  123 (152)
                      -...|++++++|..+|.- +.+...-+.++.+.+.=.+..|+   .+.+.+....+.++|||++..+.    .+.-...+
T Consensus        89 ~~~~~l~el~~lL~enyVEd~~~m~rf~Ys~eFl~Wal~~pg~~~~WHiGVRv~~s~kLVaFIsaiP~~irvrdk~vk~v  168 (421)
T KOG2779|consen   89 SDFKDLEELYNLLNENYVEDDDSMFRFDYSPEFLKWALQPPGWKKEWHIGVRVKSSKKLVAFISAIPATIRVRDKVVKMV  168 (421)
T ss_pred             ccHhHHHHHHhhcccCCCCccccchhhhccHHHHHhhhcCCCCccceEEEEEEecCCceEEEEeccccEEEEccceeeee
Confidence            345667777776665411 11111124678887777776654   23334443335799999987642    23445578


Q ss_pred             EEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          124 YCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       124 ~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      .|+-|.|++..|+++++--|+.++-..+
T Consensus       169 eINFLCVHKkLRSKRlaPvLIrEITRRv  196 (421)
T KOG2779|consen  169 EINFLCVHKKLRSKRLAPVLIREITRRV  196 (421)
T ss_pred             eEEEEEEehhhhccccccHHHHHHHHHh
Confidence            8999999999999999999999886554


No 95 
>TIGR03244 arg_catab_AstA arginine N-succinyltransferase. In many bacteria, the arginine succinyltransferase (ast) pathway operon consists of five genes, including this protein, arginine N-succinyltransferase (EC 2.3.1.109). In a few species, such as Pseudomonas aeruginosa, the member of this family is encoded adjacent to a paralog, and the two polypeptides form a heterodimeric enzyme, active on both arginine and ornithine. In such species, this polypeptide may be treated as the beta subunit of an enzyme that may be named either arginine N-succinyltransferase (AST) or arginine and orthithine N-succinyltransferase (AOST).
Probab=93.31  E-value=0.27  Score=40.62  Aligned_cols=93  Identities=12%  Similarity=0.109  Sum_probs=53.7

Q ss_pred             CCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChh-----------hhhhhhcCCCceEEEEEeC-CCCCeEEEEEEEee--
Q psy838           50 TDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQ-----------GKHSEMFDDRACYLVAKNG-SSSTPVAFSHFRFD--  115 (152)
Q Consensus        50 ~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~-----------~~~~el~~~~~~~~v~~~~-~~~~ivGf~~~~~~--  115 (152)
                      .++...|++.+++|..+.= .=+  .+.|-+++           .+..+....+..|++|.++ ++|++||...+.-.  
T Consensus         3 RPv~~~Dl~aL~~LA~~sg-~G~--TsLP~d~~~L~~rI~~S~~sF~~~~~~~~~~YlFVLEDt~tg~vvGts~I~a~vG   79 (336)
T TIGR03244         3 RPVETSDLDALYQLAQSTG-IGL--TSLPANEDLLSARIERAEKTFSGELTRAEQGYLFVLEDTETGTVAGVSAIEAAVG   79 (336)
T ss_pred             ccCccccHHHHHHHHHHcC-CCc--ccCCCCHHHHHHHHHHHHHHhcCcCCCCCccEEEEEEeCCCCeEEEEEeEEeccc
Confidence            4677888999998876520 000  01233333           3322222234567777654 35899998874211  


Q ss_pred             ---------cc----------------------CCcceEEEeeeeeccccccCCHHHHHHH
Q psy838          116 ---------VD----------------------FGEPVLYCYELQLEKQVQRKGLGKFMMQ  145 (152)
Q Consensus       116 ---------~~----------------------~~~~~~~I~~i~V~p~~qgkGiG~~Lm~  145 (152)
                               .+                      +-.-.-.|-.|.++|+||+-|.|+.|-.
T Consensus        80 ~~~PfY~yr~~~~vhaS~~L~v~~~~~~L~l~nd~tG~SElctLfL~p~~R~~~~G~LLSr  140 (336)
T TIGR03244        80 LEEPFYNYRVGTVVHASKELGIYKALETLFLSNDLTGYSELCTLFLDPDYRKGGNGRLLSK  140 (336)
T ss_pred             CCCCCEEEEcCceeecCcccCCceeeeeEEeeccCCCCeeeEEEEECHHHcCCcchhhHHH
Confidence                     00                      0011234677899999999999988743


No 96 
>PF13444 Acetyltransf_5:  Acetyltransferase (GNAT) domain
Probab=93.07  E-value=0.59  Score=31.51  Aligned_cols=60  Identities=12%  Similarity=0.200  Sum_probs=42.2

Q ss_pred             hhhhhcCCCceEEEEEeCCCCCeEEEEEEEeecc--------------------CCcceEEEeeeeeccccccCCHHHHH
Q psy838           84 KHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVD--------------------FGEPVLYCYELQLEKQVQRKGLGKFM  143 (152)
Q Consensus        84 ~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~--------------------~~~~~~~I~~i~V~p~~qgkGiG~~L  143 (152)
                      ...+-.++.+..+++.++. .++||.+.+.....                    ....++.|-++.|+|+||+...-..|
T Consensus        21 ~e~D~fD~~~~h~lv~~~~-~~~VGt~Rl~~~~~~~~~~~~~~~~~f~l~~~~~~~~~~~EisRl~V~~~~R~~~~~~~L   99 (101)
T PF13444_consen   21 RERDEFDEHSVHLLVRDKN-TEVVGTVRLILPSPAGPLEGFYSESEFDLDPLLPLPRRVAEISRLCVHPEYRRRKVLLLL   99 (101)
T ss_pred             ccccCCCCCccEEEEEECC-CCEEEEEEeeccccccccccCCchhhcCcchhhccCCcEEEeehheECHhHCCChHHHHH
Confidence            4455566777777776652 36999998754211                    12367889999999999999887766


Q ss_pred             H
Q psy838          144 M  144 (152)
Q Consensus       144 m  144 (152)
                      .
T Consensus       100 ~  100 (101)
T PF13444_consen  100 W  100 (101)
T ss_pred             h
Confidence            4


No 97 
>PRK10456 arginine succinyltransferase; Provisional
Probab=93.00  E-value=0.31  Score=40.41  Aligned_cols=93  Identities=10%  Similarity=0.071  Sum_probs=53.9

Q ss_pred             CCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhh-----------hcCCCceEEEEEeC-CCCCeEEEEEEEee--
Q psy838           50 TDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSE-----------MFDDRACYLVAKNG-SSSTPVAFSHFRFD--  115 (152)
Q Consensus        50 ~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~e-----------l~~~~~~~~v~~~~-~~~~ivGf~~~~~~--  115 (152)
                      .++..+|++.+++|..+.= .=+  .+.|-+++.+...           ....+..|++|.++ ++|++||...+.-.  
T Consensus         5 Rpv~~~Dl~aL~~LA~~sG-~G~--TsLP~d~~~L~~rI~~S~~sF~~~~~~~~~~YlFVLED~~tg~vvGts~I~a~vG   81 (344)
T PRK10456          5 RPVERSDLAALMQLAGKTG-GGL--TSLPANEATLAARIERALKTWQGELPKSEQGYVFVLEDSETGTVAGICAIEVAVG   81 (344)
T ss_pred             ecCccccHHHHHHHHHHcC-CCc--ccCCCCHHHHHHHHHHHHHHhcCcCCCCCccEEEEEEeCCCCcEEEEEeEEeccc
Confidence            4677889999998876520 000  0133333332222           22234566777654 35899998874211  


Q ss_pred             ---------cc----------------------CCcceEEEeeeeeccccccCCHHHHHHH
Q psy838          116 ---------VD----------------------FGEPVLYCYELQLEKQVQRKGLGKFMMQ  145 (152)
Q Consensus       116 ---------~~----------------------~~~~~~~I~~i~V~p~~qgkGiG~~Lm~  145 (152)
                               .+                      +-.-.-.|-.|.|+|+||+-|.|+.|-.
T Consensus        82 ~~~PfY~yr~~~~vhaS~~L~v~~~~~~L~l~nd~tG~sElctLfl~p~~R~~~~G~LLSr  142 (344)
T PRK10456         82 LNDPWYNYRVGTLVHASKELNVYNALPTLFLSNDHTGSSELCTLFLDPDWRKEGNGYLLSK  142 (344)
T ss_pred             CCCCCEEEEcCceeecCcccCCceeeeeEEeeccCCCCceeEEEEECHHHcCCCchhHHHH
Confidence                     00                      0011234677899999999999988743


No 98 
>PF08444 Gly_acyl_tr_C:  Aralkyl acyl-CoA:amino acid N-acyltransferase, C-terminal region;  InterPro: IPR013652 This entry represents mammalian-specific glycine N-acyltransferase (also called aralkyl acyl-CoA:amino acid N-acyltransferase; 2.3.1.13 from EC). Mitochondrial acyltransferases catalyse the transfer of an acyl group from acyl-CoA to the N terminus of glycine to produce N-acylglycine. These enzymes can conjugate a multitude of substrates to form a variety of N-acylglycines. The CoA derivatives of a number of aliphatic and aromatic acids, but not phenylacetyl-CoA or (indol-3-yl)acetyl-CoA, can act as donor [, ].
Probab=92.21  E-value=0.18  Score=33.85  Aligned_cols=41  Identities=17%  Similarity=0.095  Sum_probs=30.3

Q ss_pred             CCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhh
Q psy838          103 SSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTY  150 (152)
Q Consensus       103 ~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~  150 (152)
                      +|.+|-.+.....       ..+.--+..|+|||||+.+.++..+.+.
T Consensus         7 eG~PVSW~lmdqt-------ge~rmgyTlPeyR~~G~~~~v~~~~~~~   47 (89)
T PF08444_consen    7 EGNPVSWSLMDQT-------GEMRMGYTLPEYRGQGLMSQVMYHLAQY   47 (89)
T ss_pred             CCCEeEEEEeccc-------ccccccccCHhHhcCCHHHHHHHHHHHH
Confidence            5778766554221       2356678999999999999999887764


No 99 
>PF04958 AstA:  Arginine N-succinyltransferase beta subunit;  InterPro: IPR007041 Arginine N-succinyltransferase catalyses the transfer of succinyl-CoA to arginine to produce succinylarginine. This is the first step in arginine catabolism via the arginine succinyltransferase pathway. Six major L-arginine-degrading pathways have been described for prokaryotes []. Many bacteria arginine succinyltransferase 2.3.1.109 from EC, which is the AstA protein of the succinyltransferase (ast) pathway operon consists of five genes. In a few species, such as Pseudomonas aeruginosa, a tandem gene pair encodes alpha and beta subunits of a heterodimer that is designated arginine and ornithine succinyltransferase (AOST).  This entry represents the family of proteins that make up the beta subunit of the heterodimer of Ast and AOST.; GO: 0008791 arginine N-succinyltransferase activity, 0006527 arginine catabolic process; PDB: 1YLE_A.
Probab=91.62  E-value=0.89  Score=37.73  Aligned_cols=93  Identities=12%  Similarity=0.079  Sum_probs=48.1

Q ss_pred             CCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhh-----------hc-CC-CceEEEEEeC-CCCCeEEEEEEEee
Q psy838           50 TDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSE-----------MF-DD-RACYLVAKNG-SSSTPVAFSHFRFD  115 (152)
Q Consensus        50 ~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~e-----------l~-~~-~~~~~v~~~~-~~~~ivGf~~~~~~  115 (152)
                      .++..+|++.+++|....= .-+  .+.|-+++.+...           .. .+ +..|++|.++ ++|+|||.+.+.-.
T Consensus         5 Rp~~~~Dl~aL~~LA~~sg-~G~--TsLP~d~~~L~~rI~~S~~sFa~~~~~~~~~~~YlfVLED~~tg~vvGts~I~a~   81 (342)
T PF04958_consen    5 RPARPSDLDALYALARESG-PGF--TSLPPDREALAERIERSERSFAGRDVDFPGDEGYLFVLEDTETGEVVGTSAIEAA   81 (342)
T ss_dssp             EE--GGGHHHHHHHHHHS--TT---TTS-S-HHHHHHHHHHHHHHHH-TT----S--EEEEEEEETTT--EEEEEEEESS
T ss_pred             ecCchhhHHHHHHHHHHcC-CCc--ccCCCCHHHHHHHHHHHHHHhhccccCCCCccceEEEEEecCCCcEEEEEeEEec
Confidence            4677889999999876521 111  0234444433222           21 12 3457777653 36899998874210


Q ss_pred             c-------------------c--------------CCcceEEEeeeeeccccccCCHHHHHHH
Q psy838          116 V-------------------D--------------FGEPVLYCYELQLEKQVQRKGLGKFMMQ  145 (152)
Q Consensus       116 ~-------------------~--------------~~~~~~~I~~i~V~p~~qgkGiG~~Lm~  145 (152)
                      .                   +              +-.-.-.|-.|+|+|+||+-|.|+.|-.
T Consensus        82 vG~~~PfY~yr~~~~vh~S~~L~v~~~~~~L~L~~d~tG~sEl~tLfL~p~~R~~~~G~lLSr  144 (342)
T PF04958_consen   82 VGLDEPFYSYRVSTLVHASRELGVRNRHETLTLSNDYTGCSELCTLFLDPDYRGGGNGRLLSR  144 (342)
T ss_dssp             TTSSS---EEEEEEEEEEETTTTEEEEEEEEEEE-TTTTSEEEEEEEE-GGGTTSHHHHHHHH
T ss_pred             cCCCCCcEEEEcCceeEcCcccCCccceeeEeeecCCCCCeeeEEEEECHHHcCCchHHHHHH
Confidence            0                   0              0111245778999999999999998753


No 100
>TIGR03245 arg_AOST_alph arginine/ornithine succinyltransferase, alpha subunit. In some bacteria, including Pseudomonas aeruginosa, the astB gene (arginine N-succinyltransferase) is replaced by tandem paralogs that form a heterodimer. This heterodimer from P. aeruginosa is characterized as arginine and ornithine N-2 succinyltransferase (AOST). Members of this protein family represent the less widespread paralog, designated AruI, or arginine/ornithine succinyltransferase, alpha subunit.
Probab=91.51  E-value=0.77  Score=38.00  Aligned_cols=93  Identities=6%  Similarity=0.000  Sum_probs=53.9

Q ss_pred             CCCCHHHHHHHHHHHHHhHHHHHHhCCCC-----------CChhhhhhhhc-CCCceEEEEEeC-CCCCeEEEEEEEee-
Q psy838           50 TDLGAETKKWVWELEEKNMKHSYEVCDIG-----------WDPQGKHSEMF-DDRACYLVAKNG-SSSTPVAFSHFRFD-  115 (152)
Q Consensus        50 ~~l~~~d~~~i~~l~~~~m~~~y~~~~~~-----------w~~~~~~~el~-~~~~~~~v~~~~-~~~~ivGf~~~~~~-  115 (152)
                      .++...|++.+++|..+.= .=+  .+.|           ++...+..+.. ..+..|++|.++ ++|++||...+.-. 
T Consensus         3 Rpv~~~Dl~aL~~LA~~sG-~G~--TsLP~d~~~L~~rI~~S~~sF~~~~~~~~~~~YlFVLEDt~tg~vvGts~I~a~v   79 (336)
T TIGR03245         3 RPSRFADLPAIERLANESA-IGV--TSLPADRAKLGEKIAQSERSFAAEVSFVGEERYLFVLEDTETGKLLGTSSIVASA   79 (336)
T ss_pred             ccCccccHHHHHHHHHHcC-CCc--ccCCCCHHHHHHHHHHHHHHHHhhcCCCCCccEEEEEEeCCCCcEEEEEeEEecc
Confidence            4677888999998876520 000  0123           33334422222 234556776653 35899998874211 


Q ss_pred             ----------cc----------------------CCcceEEEeeeeeccccccCCHHHHHHH
Q psy838          116 ----------VD----------------------FGEPVLYCYELQLEKQVQRKGLGKFMMQ  145 (152)
Q Consensus       116 ----------~~----------------------~~~~~~~I~~i~V~p~~qgkGiG~~Lm~  145 (152)
                                .+                      +-.-.-.|-.|+++|+||+-|.|+.|-.
T Consensus        80 G~~~PfY~yr~~~~vhaS~~L~v~~~~~~L~l~nd~tG~sElctLfL~p~~R~~~~G~lLSr  141 (336)
T TIGR03245        80 GYGEPFYSYRNDTLIHASRELKVNNKIHVLYMCHELTGSSLLCSFYVDPRLRKTEAAELLSR  141 (336)
T ss_pred             cCCCCCEEEEcCceeecCcccCCccceeeEEeeccCCCCeeeEEEEECHHHcCCCchhHHHH
Confidence                      00                      0011234778999999999999988743


No 101
>cd04265 DUF619-NAGS-U DUF619 domain of various N-acetylglutamate Synthases (NAGS) of the urea (U) cycle of humans and fish. This family includes the DUF619 domain of various N-acetylglutamate synthases (NAGS) of the urea cycle found in humans and fish, the DUF619 domain of the NAGS of the fungal arginine-biosynthetic pathway (FABP), as well as the DUF619 domain present in C-terminal of a NAG kinase-like domain in a limited number of predicted NAGSs found in bacteria and Dictyostelium. Ureogenic NAGS is a mitochondrial enzyme catalyzing the formation of NAG from acetylcoenzyme A and L-glutamate. NAGS is an essential allosteric activator of carbamylphosphate synthase I, the first and rate limiting enzyme of the urea cycle. Domain architecture of ureogenic and fungal NAGS consists of an N-terminal NAG kinase-like domain and a C-terminal DUF619 domain. The DUF619 domain function has yet to be characterized.
Probab=91.43  E-value=0.86  Score=31.09  Aligned_cols=29  Identities=17%  Similarity=0.287  Sum_probs=26.7

Q ss_pred             ceEEEeeeeeccccccCCHHHHHHHHHHh
Q psy838          121 PVLYCYELQLEKQVQRKGLGKFMMQAFYT  149 (152)
Q Consensus       121 ~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~  149 (152)
                      .+.||-.++|.|+.||+|+|..|++.+.+
T Consensus        33 ~~~yLdKfaV~~~~~g~gv~d~vf~~i~~   61 (99)
T cd04265          33 GVPYLDKFAVSSSAQGEGTGEALWRRLRR   61 (99)
T ss_pred             CceEEEEEEEchhhhhcChHHHHHHHHHh
Confidence            46789999999999999999999999875


No 102
>COG3053 CitC Citrate lyase synthetase [Energy production and conversion]
Probab=91.28  E-value=0.65  Score=37.91  Aligned_cols=51  Identities=14%  Similarity=0.152  Sum_probs=38.5

Q ss_pred             ceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838           93 ACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus        93 ~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      -.+.++..++++++|+..++.-.        -|-.|+|.|.+||-|+.-+|+..|+++|
T Consensus        35 ve~~v~~~~~~~~iiacGsiaGn--------vikcvAvs~s~qGeGl~lkl~TeLin~a   85 (352)
T COG3053          35 VEYFVAIYRDNEEIIACGSIAGN--------VIKCVAVSESLQGEGLALKLVTELINLA   85 (352)
T ss_pred             ceEEEEEEcCCCcEEEecccccc--------eeEEEEechhcccccHHHHHHHHHHHHH
Confidence            34455544436899988775311        1678999999999999999999998875


No 103
>COG3375 Uncharacterized conserved protein [Function unknown]
Probab=91.04  E-value=1.7  Score=34.15  Aligned_cols=88  Identities=16%  Similarity=0.057  Sum_probs=52.8

Q ss_pred             CCHHHHHHHHHHHHHhHHHHHHhCC-CCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeee
Q psy838           52 LGAETKKWVWELEEKNMKHSYEVCD-IGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQL  130 (152)
Q Consensus        52 l~~~d~~~i~~l~~~~m~~~y~~~~-~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V  130 (152)
                      -++.++++..++...    .+..+. ...-.+.++ .|.....-.+-++.+ ++++||...-.+..-.+....|=|-+.|
T Consensus         9 ~dp~el~~~~dV~~~----aWg~~d~~~~~~d~i~-al~~~GGlvlgAf~~-dg~lVGls~G~pg~r~g~~y~ySH~~gV   82 (266)
T COG3375           9 TDPAELDEAEDVQAS----AWGSEDRDGAPADTIR-ALRYHGGLVLGAFSA-DGRLVGLSYGYPGGRGGSLYLYSHMLGV   82 (266)
T ss_pred             CCHHHHHHHHHHHHH----HhCccccccchHHHHH-HHHhcCCeEEEEEcC-CCcEEEEEeccCCcCCCceeeeeeehhc
Confidence            356677777766544    221111 011122222 344445555555655 4699998766554434555677888999


Q ss_pred             ccccccCCHHHHHHH
Q psy838          131 EKQVQRKGLGKFMMQ  145 (152)
Q Consensus       131 ~p~~qgkGiG~~Lm~  145 (152)
                      .|++++.|+|-.|=.
T Consensus        83 ~e~~k~sglg~aLK~   97 (266)
T COG3375          83 REEVKGSGLGVALKM   97 (266)
T ss_pred             cccccccchhhhhHH
Confidence            999999999987743


No 104
>COG5027 SAS2 Histone acetyltransferase (MYST family) [Chromatin structure and dynamics]
Probab=91.00  E-value=0.14  Score=42.46  Aligned_cols=51  Identities=16%  Similarity=0.253  Sum_probs=33.2

Q ss_pred             ceEEEEEeCCCC-CeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHH
Q psy838           93 ACYLVAKNGSSS-TPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQA  146 (152)
Q Consensus        93 ~~~~v~~~~~~~-~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~  146 (152)
                      ..|++...++.+ .+|||.+=....+   ....|..|-+.|-||++|.|+.||+.
T Consensus       235 lFYvl~~~~~~~~h~vGyFSKEK~S~---~~yNLaCILtLP~yQRrGYG~lLIdF  286 (395)
T COG5027         235 LFYVLTERGDTGCHLVGYFSKEKESE---QDYNLACILTLPPYQRRGYGKLLIDF  286 (395)
T ss_pred             EEEEEEEcCCcceeeeeeechhhccc---ccCceEEEEecChhHhcccceEeeee
Confidence            344444444222 4888876432222   23457889999999999999999864


No 105
>KOG4144|consensus
Probab=90.98  E-value=0.2  Score=37.22  Aligned_cols=43  Identities=9%  Similarity=0.087  Sum_probs=33.0

Q ss_pred             CCeEEEEEEEeecc-------------CCcceEEEeeeeeccccccCCHHHHHHHHH
Q psy838          104 STPVAFSHFRFDVD-------------FGEPVLYCYELQLEKQVQRKGLGKFMMQAF  147 (152)
Q Consensus       104 ~~ivGf~~~~~~~~-------------~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l  147 (152)
                      +.+||++.-.....             .++ ...|+-+.|+|+||.+|.|..|+...
T Consensus        71 ~tLIghIigs~~~~E~lt~ESm~kh~s~g~-ni~iHsl~Ihpa~rk~g~a~~Ll~~y  126 (190)
T KOG4144|consen   71 GTLIGHIIGSLWDKERLTQESMTKHRSGGH-NIHIHSLAIHPAFRKQGRAPILLWRY  126 (190)
T ss_pred             ccceehhhcccCcchhhhHHHHhhhhcCCc-ceeEEEEEecHHHHhcCcchhHHHHH
Confidence            78889886543211             233 47899999999999999999999873


No 106
>PF13465 zf-H2C2_2:  Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=90.91  E-value=0.19  Score=25.65  Aligned_cols=23  Identities=35%  Similarity=0.964  Sum_probs=14.9

Q ss_pred             HhHhHHhhhhhcccccccCCCCccc
Q psy838           22 LRKMVKKMEITQHAKYTCSFCGKEY   46 (152)
Q Consensus        22 ~rk~~~~~~~~~~~~~~c~~c~~~~   46 (152)
                      |++.++ +. ....+|.||.|++.+
T Consensus         2 l~~H~~-~H-~~~k~~~C~~C~k~F   24 (26)
T PF13465_consen    2 LRRHMR-TH-TGEKPYKCPYCGKSF   24 (26)
T ss_dssp             HHHHHH-HH-SSSSSEEESSSSEEE
T ss_pred             HHHHhh-hc-CCCCCCCCCCCcCee
Confidence            344444 33 344679999999864


No 107
>KOG2696|consensus
Probab=89.85  E-value=0.95  Score=37.88  Aligned_cols=24  Identities=25%  Similarity=0.339  Sum_probs=22.1

Q ss_pred             EeeeeeccccccCCHHHHHHHHHH
Q psy838          125 CYELQLEKQVQRKGLGKFMMQAFY  148 (152)
Q Consensus       125 I~~i~V~p~~qgkGiG~~Lm~~l~  148 (152)
                      |..+-|.|.||++|+|.+||+.+.
T Consensus       220 iSQmlilpPfq~~Glgs~l~E~i~  243 (403)
T KOG2696|consen  220 ISQMLILPPFQGKGLGSQLYEAIA  243 (403)
T ss_pred             hheeEEeccccCCchHHHHHHHHH
Confidence            677889999999999999999987


No 108
>PF04768 DUF619:  Protein of unknown function (DUF619);  InterPro: IPR006855 This region of unknown function is found at the C terminus of Neurospora crassa acetylglutamate synthase (2.7.2.8 from EC). It is also found C-terminal to the amino acid kinase region in some fungal acetylglutamate kinase enzymes (IPR001048 from INTERPRO). These enzymes play a role in arginine biosynthesis.; PDB: 3S6K_A 4AB7_F 3ZZF_B 3ZZI_D 3ZZH_A 3ZZG_A 3S6G_Y 3S6H_A 3S7Y_A.
Probab=89.68  E-value=3.5  Score=30.82  Aligned_cols=92  Identities=16%  Similarity=0.106  Sum_probs=56.6

Q ss_pred             cccCCCCCHH-HHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEEEeec-cCCcceE
Q psy838           46 YKKSTDLGAE-TKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHFRFDV-DFGEPVL  123 (152)
Q Consensus        46 ~~~~~~l~~~-d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~-~~~~~~~  123 (152)
                      +.....++.- |++.+.+|..+    .|   .-.-+.+.....+.+.....++  .   +..-|.+.+.+.. +...++.
T Consensus        22 i~~~~s~~~~~d~~kL~~ll~~----sf---~~~~~v~~yl~~l~~~~~~iy~--d---~~y~~~AIVt~e~~~~~~~v~   89 (170)
T PF04768_consen   22 ILKHSSLSEFVDLDKLRALLER----SF---GGKLDVDHYLDRLNNRLFKIYV--D---EDYEGAAIVTPEGPDSNGPVP   89 (170)
T ss_dssp             EEEESSCCCSS-HHHHHHHHHH----HS---TSSSBHTTHHHHHHTS-SEEEE--E---TTSSEEEEEEEE-SCTCTSEE
T ss_pred             eEEecCccccCCHHHHHHHHHh----cc---cccccHHHHHHHhhccceEEEE--e---CCceEEEEEEecCCCCCCCCe
Confidence            3334444444 78888887655    34   2234566666777655443333  2   3344555554422 3345689


Q ss_pred             EEeeeeeccccccCCHHHHHHHHHHh
Q psy838          124 YCYELQLEKQVQRKGLGKFMMQAFYT  149 (152)
Q Consensus       124 ~I~~i~V~p~~qgkGiG~~Lm~~l~~  149 (152)
                      ||..++|.|..||.|++-.+.+.+.+
T Consensus        90 yLdKFav~~~~~g~gv~D~vf~~i~~  115 (170)
T PF04768_consen   90 YLDKFAVSKSAQGSGVADNVFNAIRK  115 (170)
T ss_dssp             EEEEEEE-HHHHHTTHHHHHHHHHHH
T ss_pred             EEEEEEecchhhhcCHHHHHHHHHHH
Confidence            99999999999999999999998865


No 109
>PF09924 DUF2156:  Uncharacterized conserved protein (DUF2156);  InterPro: IPR024320 This domain of unknown function is found in uncharacterised proteins and in Lysylphosphatidylglycerol synthetase, which catalyses the transfer of a lysyl group from L-lysyl-tRNA(Lys) to membrane-bound phosphatidylglycerol [].; PDB: 2HQY_A.
Probab=89.07  E-value=2.8  Score=33.47  Aligned_cols=114  Identities=14%  Similarity=0.107  Sum_probs=57.8

Q ss_pred             hhHhHhHHhhhhhcccccccCCCCcccccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhh-hc---CCCceE
Q psy838           20 ASLRKMVKKMEITQHAKYTCSFCGKEYKKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSE-MF---DDRACY   95 (152)
Q Consensus        20 ~~~rk~~~~~~~~~~~~~~c~~c~~~~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~e-l~---~~~~~~   95 (152)
                      .++|.++++.+..-+. +.       +.+...+++++.+.|.++.++=....     . -.+..+... +.   ..+...
T Consensus       117 ~~~Rn~in~~~k~G~~-~~-------~~~~~~~~~~~~~el~~i~~~W~~~~-----~-~~e~~~~~~~~~~~~~~~~~~  182 (299)
T PF09924_consen  117 RKKRNHINRFEKEGYT-FE-------VVPIPELDPELRDELLEISDEWLKEK-----E-RPERGFIMGALEHFDELGLRG  182 (299)
T ss_dssp             HHHHHHHHHHHHH--T--E-------EEE-----GGGHHHHHHHHHHHHHHC-----T-HHHHHHHHHHHHTHHHHT-EE
T ss_pred             hhHHHHHHHHhcCceE-EE-------EEECCCCCHHHHHHHHHHHHHHHhcC-----c-hhHHHHHhccccchhhcCceE
Confidence            5677777777765222 11       44455558888899988876521111     0 001111111 11   125566


Q ss_pred             EEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838           96 LVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus        96 ~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      +++... +|+++||+.+.+.....   .++.+|.-...--=+|+=..|+..+...+
T Consensus       183 ~~~~~~-dgki~af~~~~~~~~~~---~~~~~~~k~~~~a~~G~~e~l~~~~~~~~  234 (299)
T PF09924_consen  183 FVARVA-DGKIVAFAIGSPLGGRD---GWSIDFEKADPDAPKGIYEFLNVEFAEHL  234 (299)
T ss_dssp             EEEEE--TTEEEEEEEEEEEE-TT---EEEEEEEEE-TT-STTHHHHHHHHHHHHS
T ss_pred             EEEEEC-CCcEEEEEEEEEccCCc---cEEEEEEecCCCCCCcHHHHHHHHHHHhh
Confidence            777772 39999999998775221   23333433322235788888888776653


No 110
>COG3818 Predicted acetyltransferase, GNAT superfamily [General function prediction only]
Probab=88.68  E-value=0.59  Score=33.80  Aligned_cols=32  Identities=13%  Similarity=0.461  Sum_probs=29.1

Q ss_pred             cceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          120 EPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       120 ~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      +.-+||.++.|....||.|+|+.|.+-+.+.|
T Consensus        82 e~F~YvDRvVVA~~aRGrG~aRalY~Dlf~~A  113 (167)
T COG3818          82 ENFFYVDRVVVASRARGRGVARALYADLFSYA  113 (167)
T ss_pred             CceEEEEEEEEEecccccchHHHHHHHHHHHH
Confidence            45689999999999999999999999988776


No 111
>KOG2036|consensus
Probab=88.07  E-value=0.39  Score=43.46  Aligned_cols=28  Identities=18%  Similarity=0.292  Sum_probs=25.3

Q ss_pred             EEEeeeeeccccccCCHHHHHHHHHHhh
Q psy838          123 LYCYELQLEKQVQRKGLGKFMMQAFYTY  150 (152)
Q Consensus       123 ~~I~~i~V~p~~qgkGiG~~Lm~~l~~~  150 (152)
                      ..|.+|+|+|+||+.|.|+.-++.+.++
T Consensus       615 aRIVRIAvhP~y~~MGYGsrAvqLL~~y  642 (1011)
T KOG2036|consen  615 ARIVRIAVHPEYQKMGYGSRAVQLLTDY  642 (1011)
T ss_pred             ceEEEEEeccchhccCccHHHHHHHHHH
Confidence            4589999999999999999999988765


No 112
>COG4888 Uncharacterized Zn ribbon-containing protein [General function prediction only]
Probab=86.45  E-value=0.2  Score=34.22  Aligned_cols=23  Identities=35%  Similarity=0.738  Sum_probs=16.6

Q ss_pred             hHhHHhhhhhcccccccCCCCcc
Q psy838           23 RKMVKKMEITQHAKYTCSFCGKE   45 (152)
Q Consensus        23 rk~~~~~~~~~~~~~~c~~c~~~   45 (152)
                      |+.++..-......|+||+||..
T Consensus         9 r~~ik~~~~~L~k~FtCp~Cghe   31 (104)
T COG4888           9 RKIIKRRPQVLPKTFTCPRCGHE   31 (104)
T ss_pred             cccCcccCccCCceEecCccCCe
Confidence            44555555556678999999987


No 113
>COG5092 NMT1 N-myristoyl transferase [Lipid metabolism]
Probab=84.45  E-value=4.2  Score=33.63  Aligned_cols=101  Identities=14%  Similarity=0.076  Sum_probs=66.4

Q ss_pred             CCCHHHHHHHHHHHHHhH-HHHHHhCCCCCChhhhhhhhcCCCc---eEEEEEeCCCCCeEEEEEEEeec----cCCcce
Q psy838           51 DLGAETKKWVWELEEKNM-KHSYEVCDIGWDPQGKHSEMFDDRA---CYLVAKNGSSSTPVAFSHFRFDV----DFGEPV  122 (152)
Q Consensus        51 ~l~~~d~~~i~~l~~~~m-~~~y~~~~~~w~~~~~~~el~~~~~---~~~v~~~~~~~~ivGf~~~~~~~----~~~~~~  122 (152)
                      .....++++++.|..+|. .+.|..+.+.++.+.++=.+..++.   +.+.+.-....++|||++..+..    +--.++
T Consensus        86 v~N~~ql~dv~~lL~eNYVED~~ag~rf~Y~~EFl~Wal~~pg~kK~whigvRvk~t~klVaFIsa~p~~v~vRgK~~~~  165 (451)
T COG5092          86 VANKKQLEDVFVLLEENYVEDIYAGHRFRYSVEFLQWALDGPGGKKRWHIGVRVKGTQKLVAFISAKPHLVSVRGKRSSV  165 (451)
T ss_pred             ccccchhHHHHHHHHhhhhhhhhhhhHHHHHHHHHHHhhcCCCCceeeEEEEEEcccceeEEEEecceeEEEEccccccc
Confidence            445667888888776663 2334333345666666666666542   22333333246999999875432    122356


Q ss_pred             EEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          123 LYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       123 ~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      +.+.-|.|+.+.|++-+.--|+..+-..|
T Consensus       166 ~evNFLCihk~lRsKRltPvLIkEiTRR~  194 (451)
T COG5092         166 LEVNFLCIHKELRSKRLTPVLIKEITRRA  194 (451)
T ss_pred             ceEEEEEEehhhhhCccchHHHHHHHHhh
Confidence            78999999999999999999999887665


No 114
>PF07754 DUF1610:  Domain of unknown function (DUF1610);  InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=84.16  E-value=0.5  Score=23.90  Aligned_cols=12  Identities=33%  Similarity=0.838  Sum_probs=9.8

Q ss_pred             cccccccCCCCc
Q psy838           33 QHAKYTCSFCGK   44 (152)
Q Consensus        33 ~~~~~~c~~c~~   44 (152)
                      +-..|.||.||+
T Consensus        13 ~~v~f~CPnCG~   24 (24)
T PF07754_consen   13 QAVPFPCPNCGF   24 (24)
T ss_pred             cCceEeCCCCCC
Confidence            456899999995


No 115
>KOG4135|consensus
Probab=83.90  E-value=1.4  Score=32.52  Aligned_cols=32  Identities=19%  Similarity=0.154  Sum_probs=25.9

Q ss_pred             cceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          120 EPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       120 ~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      .+...+.-..-.|.-||+|+|+..+.++..+|
T Consensus       105 ~~~gE~EvMIAEP~~RgKG~G~eav~~ml~y~  136 (185)
T KOG4135|consen  105 VITGEVEVMIAEPRGRGKGIGTEAVRAMLAYA  136 (185)
T ss_pred             eeeeeEEEEEecccccCCCccHHHHHHHHHHH
Confidence            34555666667899999999999999998875


No 116
>PF00096 zf-C2H2:  Zinc finger, C2H2 type;  InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain.  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=83.86  E-value=0.43  Score=23.07  Aligned_cols=14  Identities=36%  Similarity=1.170  Sum_probs=10.8

Q ss_pred             cccCCCCcccccCC
Q psy838           37 YTCSFCGKEYKKST   50 (152)
Q Consensus        37 ~~c~~c~~~~~~~~   50 (152)
                      |.||.||+.+.+..
T Consensus         1 y~C~~C~~~f~~~~   14 (23)
T PF00096_consen    1 YKCPICGKSFSSKS   14 (23)
T ss_dssp             EEETTTTEEESSHH
T ss_pred             CCCCCCCCccCCHH
Confidence            68999999866543


No 117
>PHA00616 hypothetical protein
Probab=83.14  E-value=0.14  Score=29.79  Aligned_cols=15  Identities=33%  Similarity=0.846  Sum_probs=11.3

Q ss_pred             ccccCCCCcccccCC
Q psy838           36 KYTCSFCGKEYKKST   50 (152)
Q Consensus        36 ~~~c~~c~~~~~~~~   50 (152)
                      .|.||.||+.++..+
T Consensus         1 pYqC~~CG~~F~~~s   15 (44)
T PHA00616          1 MYQCLRCGGIFRKKK   15 (44)
T ss_pred             CCccchhhHHHhhHH
Confidence            489999998765544


No 118
>COG3138 AstA Arginine/ornithine N-succinyltransferase beta subunit [Amino acid transport and metabolism]
Probab=82.70  E-value=3.7  Score=33.44  Aligned_cols=94  Identities=18%  Similarity=0.210  Sum_probs=54.4

Q ss_pred             CCCCHHHHHHHHHHHHHh---HHH------HHHhCCCCCChhhhhhhhcCCCceEEEEEeC-CCCCeEEEEEEEeec---
Q psy838           50 TDLGAETKKWVWELEEKN---MKH------SYEVCDIGWDPQGKHSEMFDDRACYLVAKNG-SSSTPVAFSHFRFDV---  116 (152)
Q Consensus        50 ~~l~~~d~~~i~~l~~~~---m~~------~y~~~~~~w~~~~~~~el~~~~~~~~v~~~~-~~~~ivGf~~~~~~~---  116 (152)
                      .+++..|++.+++|....   |..      .... ...|++..+..+....+..|+++.++ +.|+++|...+.-..   
T Consensus         5 RP~~~aDl~al~~LA~~sg~G~TsLP~de~~L~~-Ri~~se~sf~~~~~~ge~~Y~fVLEDsetG~VvG~saI~a~vGl~   83 (336)
T COG3138           5 RPVERADLEALMELAVKTGVGLTSLPADEATLRA-RIERSEKSFQGELPPGEAGYLFVLEDSETGTVVGISAIEAAVGLN   83 (336)
T ss_pred             ccccccCHHHHHHHHHhcCCCcccCCCCHHHHHH-HHHHHHHHHhcccCCCCccEEEEEEecCCceEEeEEEEEEeeccC
Confidence            467788999999886542   000      0000 01355555566565566667776653 468999987642100   


Q ss_pred             --------c---C---------Ccce----------EEEeeeeeccccccCCHHHHHH
Q psy838          117 --------D---F---------GEPV----------LYCYELQLEKQVQRKGLGKFMM  144 (152)
Q Consensus       117 --------~---~---------~~~~----------~~I~~i~V~p~~qgkGiG~~Lm  144 (152)
                              .   +         ..++          -.|-.|.++|++|.-|-|+.|-
T Consensus        84 ~PfYsyRv~tlvhaS~~L~v~~~i~~L~L~Nd~TG~SEl~sLFl~pd~Rkg~nG~Lls  141 (336)
T COG3138          84 DPFYSYRVGTLVHASPELNVYNEIPTLFLSNDLTGNSELCTLFLDPDWRKGGNGRLLS  141 (336)
T ss_pred             CccceeeeeeeeecCccccccccceeEEEeccCcCchhhhheeecHHHhcccchhhhh
Confidence                    0   0         0111          2245688999999888887653


No 119
>KOG3014|consensus
Probab=79.84  E-value=12  Score=29.88  Aligned_cols=27  Identities=11%  Similarity=0.218  Sum_probs=24.3

Q ss_pred             eEEEeeeeeccccccCCHHHHHHHHHH
Q psy838          122 VLYCYELQLEKQVQRKGLGKFMMQAFY  148 (152)
Q Consensus       122 ~~~I~~i~V~p~~qgkGiG~~Lm~~l~  148 (152)
                      .+-|.+|.|.+-.|++||++.|++.+.
T Consensus       183 ~~GIsRIWV~s~~Rr~gIAs~lldva~  209 (257)
T KOG3014|consen  183 ICGISRIWVSSLRRRKGIASLLLDVAR  209 (257)
T ss_pred             EeeeEEEEeehhhhhhhhHHHHHHHHH
Confidence            466999999999999999999998764


No 120
>PF13894 zf-C2H2_4:  C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=79.79  E-value=0.82  Score=21.71  Aligned_cols=13  Identities=31%  Similarity=1.216  Sum_probs=8.2

Q ss_pred             cccCCCCcccccC
Q psy838           37 YTCSFCGKEYKKS   49 (152)
Q Consensus        37 ~~c~~c~~~~~~~   49 (152)
                      |.||.|++.+.+.
T Consensus         1 ~~C~~C~~~~~~~   13 (24)
T PF13894_consen    1 FQCPICGKSFRSK   13 (24)
T ss_dssp             EE-SSTS-EESSH
T ss_pred             CCCcCCCCcCCcH
Confidence            6799999876554


No 121
>PF09237 GAGA:  GAGA factor;  InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=78.17  E-value=0.22  Score=29.98  Aligned_cols=20  Identities=25%  Similarity=0.566  Sum_probs=10.6

Q ss_pred             hhhcccccccCCCCcccccC
Q psy838           30 EITQHAKYTCSFCGKEYKKS   49 (152)
Q Consensus        30 ~~~~~~~~~c~~c~~~~~~~   49 (152)
                      ......+.+||.|+..+.+.
T Consensus        18 ~~~S~~PatCP~C~a~~~~s   37 (54)
T PF09237_consen   18 KSQSEQPATCPICGAVIRQS   37 (54)
T ss_dssp             CCTTS--EE-TTT--EESSH
T ss_pred             hhccCCCCCCCcchhhccch
Confidence            44556689999999876654


No 122
>PF09538 FYDLN_acid:  Protein of unknown function (FYDLN_acid);  InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=76.28  E-value=1.4  Score=30.62  Aligned_cols=14  Identities=36%  Similarity=0.857  Sum_probs=8.6

Q ss_pred             cccccCCCCccccc
Q psy838           35 AKYTCSFCGKEYKK   48 (152)
Q Consensus        35 ~~~~c~~c~~~~~~   48 (152)
                      .|++||.||+.|..
T Consensus         8 tKR~Cp~CG~kFYD   21 (108)
T PF09538_consen    8 TKRTCPSCGAKFYD   21 (108)
T ss_pred             CcccCCCCcchhcc
Confidence            36677777776333


No 123
>KOG2270|consensus
Probab=76.15  E-value=7.8  Score=33.28  Aligned_cols=65  Identities=23%  Similarity=0.432  Sum_probs=47.2

Q ss_pred             eccccccccc---hhHhHhHHhhhhhcccc--------cccC-------------CCCcc-----cccCCCCCHHHHHHH
Q psy838           10 ITGKYGTRYG---ASLRKMVKKMEITQHAK--------YTCS-------------FCGKE-----YKKSTDLGAETKKWV   60 (152)
Q Consensus        10 ~~~~~g~ryg---~~~rk~~~~~~~~~~~~--------~~c~-------------~c~~~-----~~~~~~l~~~d~~~i   60 (152)
                      +.|-|.-|+|   .--||+|+.+-+.--.+        -.||             |||++     .+...+++.+++..+
T Consensus       190 V~GEfRFRhgyck~NPRKMVk~WAEKE~RNLkRl~~aGIP~PePIlLk~hVLVM~FlGrdgw~aPkLKd~~ls~~ka~~~  269 (520)
T KOG2270|consen  190 VSGEFRFRHGYCKHNPRKMVKTWAEKEMRNLKRLNNAGIPCPEPILLKNHVLVMEFLGRDGWAAPKLKDASLSTSKAREL  269 (520)
T ss_pred             ccceeeeecccccCCcHHHHHHHHHHHHHHHHHHHhcCCCCCCceeeecceEeeeeccCCCCcCcccccccCChHHHHHH
Confidence            4677888888   66788888874332212        2355             67776     455567788899999


Q ss_pred             HHHHHHhHHHHHHh
Q psy838           61 WELEEKNMKHSYEV   74 (152)
Q Consensus        61 ~~l~~~~m~~~y~~   74 (152)
                      +......|+.+|..
T Consensus       270 Y~~~v~~MR~lY~~  283 (520)
T KOG2270|consen  270 YQQCVRIMRRLYQK  283 (520)
T ss_pred             HHHHHHHHHHHHHH
Confidence            99888899999966


No 124
>PF02945 Endonuclease_7:  Recombination endonuclease VII;  InterPro: IPR004211 This family of proteins which includes Bacteriophage T4 endonuclease VII, Mycobacteriophage D29 gene 59, and other as yet uncharacterised proteins. The T4 endonuclease VII (Endo VII) recognises a broad spectrum of DNA substrates ranging from branched DNAs to single base mismatches. The structure of this enzyme has been resolved and it was found that the monomers form an elongated, intertwined molecular dimer that exibits extreme domain swapping. Two pairs of antiparallel helices which form a novel 'four-helix cross' motif are the major dimerisation elements [].; PDB: 3GOX_A 3FC3_A 1EN7_B 1E7L_B 2QNF_A 2QNC_A 1E7D_A.
Probab=75.79  E-value=1.5  Score=28.85  Aligned_cols=30  Identities=27%  Similarity=0.542  Sum_probs=15.5

Q ss_pred             ccccccchhHhHhHHhhhhhcccccccCCCCcc
Q psy838           13 KYGTRYGASLRKMVKKMEITQHAKYTCSFCGKE   45 (152)
Q Consensus        13 ~~g~ryg~~~rk~~~~~~~~~~~~~~c~~c~~~   45 (152)
                      +.+.|||.+....-+-.+.+   ...|+.||++
T Consensus         2 ~~~~~Ygit~~~~~~l~~~q---~~~C~iC~~~   31 (81)
T PF02945_consen    2 RLKRRYGITPEEYEALLEEQ---GGRCAICGKP   31 (81)
T ss_dssp             -----H-HHHHHHHCCHHHT---TTE-TTT-SE
T ss_pred             CcccccCCCHHHHHHHHHHh---CCcCcCCCCC
Confidence            45789999888766444444   5599999984


No 125
>COG1243 ELP3 Histone acetyltransferase [Transcription / Chromatin structure and dynamics]
Probab=74.94  E-value=4.1  Score=35.29  Aligned_cols=50  Identities=16%  Similarity=0.250  Sum_probs=33.0

Q ss_pred             CCCeEEEEEEEeeccCC-----c-ceEEEeeeee----------ccccccCCHHHHHHHHHHhhcC
Q psy838          103 SSTPVAFSHFRFDVDFG-----E-PVLYCYELQL----------EKQVQRKGLGKFMMQAFYTYID  152 (152)
Q Consensus       103 ~~~ivGf~~~~~~~~~~-----~-~~~~I~~i~V----------~p~~qgkGiG~~Lm~~l~~~a~  152 (152)
                      ++.++||+.++...+..     . +..-+.+|+|          ...+|.+|.|+.||+..+.+|.
T Consensus       415 ~d~lig~lrlR~p~e~~~r~e~~~~~aivrelhvyg~~vpig~~~~~~QH~G~G~~L~~~AE~ia~  480 (515)
T COG1243         415 NDILIGFLRLREPSEGAHREEIDDKTAIVRELHVYGSEVPIGKREDEWQHRGYGRELLEEAERIAR  480 (515)
T ss_pred             hhhhhheeeecccccchhhhhcccchhhhhhhhccccccccccCcchhhcccHHHHHHHHHHHHHH
Confidence            47799999988643310     0 0111222222          4679999999999999998873


No 126
>COG4552 Eis Predicted acetyltransferase involved in intracellular survival and related acetyltransferases [General function prediction only]
Probab=73.78  E-value=3.2  Score=34.69  Aligned_cols=54  Identities=9%  Similarity=-0.071  Sum_probs=36.6

Q ss_pred             EEEEEeCCCCCeEEEEEEEe-ec---cCCcceEEEeeeeeccccccCCHHHHHHHHHHhh
Q psy838           95 YLVAKNGSSSTPVAFSHFRF-DV---DFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTY  150 (152)
Q Consensus        95 ~~v~~~~~~~~ivGf~~~~~-~~---~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~  150 (152)
                      +.+++.+  .++++-++..+ ..   ...-+...|--++++|+|||+|.-++||......
T Consensus        41 ~~vi~~n--qkl~s~L~i~~f~~~f~~q~l~t~GIa~Vas~P~~R~~G~~~~Ll~~sLre   98 (389)
T COG4552          41 SYVIYMN--QKLASRLHIPPFIFWFGNQVLPTAGIAGVASAPTYRRRGALRALLAHSLRE   98 (389)
T ss_pred             ceEEeeh--hhhhhcccccchheeeCCeeeeccceEEEEechhhccCcHHHHHHHHHHHH
Confidence            3455554  77777666431 11   1112345588999999999999999999887653


No 127
>cd04266 DUF619-NAGS-FABP DUF619 domain of N-acetylglutamate Synthase of the fungal arginine-biosynthetic pathway. DUF619-NAGS-FABP: This family includes the DUF619 domain of N-acetylglutamate synthase (NAGS) of the fungal arginine-biosynthetic pathway (FABP). This NAGS (also known as arginine-requiring protein 2 or ARG2) consists of an N-terminal NAG kinase-like domain and a C-terminal DUF619 domain. NAGS catalyzes the formation of NAG from acetylcoenzyme A and L-glutamate. The DUF619 domain, yet to be characterized, is predicted to function in NAGS association in fungi.
Probab=73.60  E-value=27  Score=24.19  Aligned_cols=30  Identities=17%  Similarity=0.380  Sum_probs=26.9

Q ss_pred             cceEEEeeeeecccccc-CCHHHHHHHHHHh
Q psy838          120 EPVLYCYELQLEKQVQR-KGLGKFMMQAFYT  149 (152)
Q Consensus       120 ~~~~~I~~i~V~p~~qg-kGiG~~Lm~~l~~  149 (152)
                      ..+-||..+.|.+..|| .|++..+.+++.+
T Consensus        37 ~~v~yLdKFav~~~~~gl~gv~D~vf~~m~~   67 (108)
T cd04266          37 EKIAYLDKFAVLPKAQGSDGIADILFNAMLD   67 (108)
T ss_pred             CCceEEEEEEEccccccccchHHHHHHHHHH
Confidence            56789999999999997 8999999998874


No 128
>PF14354 Lar_restr_allev:  Restriction alleviation protein Lar
Probab=72.43  E-value=1.7  Score=26.40  Aligned_cols=9  Identities=44%  Similarity=1.298  Sum_probs=7.5

Q ss_pred             cccCCCCcc
Q psy838           37 YTCSFCGKE   45 (152)
Q Consensus        37 ~~c~~c~~~   45 (152)
                      .-|||||..
T Consensus         4 kPCPFCG~~   12 (61)
T PF14354_consen    4 KPCPFCGSA   12 (61)
T ss_pred             cCCCCCCCc
Confidence            469999977


No 129
>PF09151 DUF1936:  Domain of unknown function (DUF1936);  InterPro: IPR015234 This domain is found in a set of hypothetical archaeal proteins. Its exact function has not, as yet, been defined. ; PDB: 2QH1_B 1PVM_B.
Probab=72.35  E-value=1.7  Score=23.36  Aligned_cols=10  Identities=30%  Similarity=0.740  Sum_probs=5.6

Q ss_pred             ccccCCCCcc
Q psy838           36 KYTCSFCGKE   45 (152)
Q Consensus        36 ~~~c~~c~~~   45 (152)
                      +|.||.||.-
T Consensus         1 ~hlcpkcgvg   10 (36)
T PF09151_consen    1 QHLCPKCGVG   10 (36)
T ss_dssp             --B-TTTSSS
T ss_pred             CccCCccCce
Confidence            4789999865


No 130
>PF04438 zf-HIT:  HIT zinc finger;  InterPro: IPR007529 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents the HIT-type zinc finger, which contains 7 conserved cysteines and one histidine that can potentially coordinate two zinc atoms. It has been named after the first protein that originally defined the domain: the yeast HIT1 protein (P46973 from SWISSPROT) []. The HIT-type zinc finger displays some sequence similarities to the MYND-type zinc finger. The function of this domain is unknown but it is mainly found in nuclear proteins involved in gene regulation and chromatin remodeling. This domain is also found in the thyroid receptor interacting protein 3 (TRIP-3) Q15649 from SWISSPROT, that specifically interacts with the ligand binding domain of the thyroid receptor. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 2YQP_A 2YQQ_A 1X4S_A.
Probab=72.21  E-value=1.5  Score=23.23  Aligned_cols=12  Identities=50%  Similarity=1.079  Sum_probs=8.0

Q ss_pred             ccccccCCCCcc
Q psy838           34 HAKYTCSFCGKE   45 (152)
Q Consensus        34 ~~~~~c~~c~~~   45 (152)
                      .++|+||.|+..
T Consensus        11 ~~kY~Cp~C~~~   22 (30)
T PF04438_consen   11 PAKYRCPRCGAR   22 (30)
T ss_dssp             EESEE-TTT--E
T ss_pred             CCEEECCCcCCc
Confidence            689999999976


No 131
>PHA02768 hypothetical protein; Provisional
Probab=71.41  E-value=0.92  Score=27.65  Aligned_cols=18  Identities=33%  Similarity=0.870  Sum_probs=13.6

Q ss_pred             cccccCCCCcccccCCCC
Q psy838           35 AKYTCSFCGKEYKKSTDL   52 (152)
Q Consensus        35 ~~~~c~~c~~~~~~~~~l   52 (152)
                      .-|.||.||+.+...+.+
T Consensus         4 ~~y~C~~CGK~Fs~~~~L   21 (55)
T PHA02768          4 LGYECPICGEIYIKRKSM   21 (55)
T ss_pred             cccCcchhCCeeccHHHH
Confidence            358999999997766544


No 132
>PF04339 DUF482:  Protein of unknown function, DUF482;  InterPro: IPR007434 This family contains several proteins of uncharacterised function.
Probab=71.41  E-value=21  Score=30.01  Aligned_cols=66  Identities=12%  Similarity=0.155  Sum_probs=40.4

Q ss_pred             cccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcC--CCceEEEEEeCCCCCeEEEEEEEee
Q psy838           46 YKKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFD--DRACYLVAKNGSSSTPVAFSHFRFD  115 (152)
Q Consensus        46 ~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~--~~~~~~v~~~~~~~~ivGf~~~~~~  115 (152)
                      ..+...+++++++.++.+..++.....   +.++-...+...+.+  ++...+++... +|++||++.+-..
T Consensus       204 ~l~G~~i~~~~~~~f~~~Y~~Ty~k~~---~~~yLt~~FF~~l~~~m~~~~~l~~A~~-~g~~Va~aL~l~~  271 (370)
T PF04339_consen  204 TLTGDEITDEDWDRFYRLYQNTYAKRW---GRPYLTREFFEQLAETMPEQVVLVVARR-DGQPVAFALCLRG  271 (370)
T ss_pred             EEeCCCCCHHHHHHHHHHHHHHHHhhC---CChhhcHHHHHHHHHhCcCCEEEEEEEE-CCeEEEEEEEEEe
Confidence            667889999999999999887543332   234433333444433  33333333332 4999999877543


No 133
>PF14447 Prok-RING_4:  Prokaryotic RING finger family 4
Probab=71.22  E-value=2.4  Score=25.85  Aligned_cols=12  Identities=33%  Similarity=1.112  Sum_probs=9.3

Q ss_pred             cCCCCcccccCC
Q psy838           39 CSFCGKEYKKST   50 (152)
Q Consensus        39 c~~c~~~~~~~~   50 (152)
                      ||+||+++-.++
T Consensus        42 CPfC~~~~~~~~   53 (55)
T PF14447_consen   42 CPFCGTPFEFDD   53 (55)
T ss_pred             CCCCCCcccCCC
Confidence            999999865543


No 134
>PF04606 Ogr_Delta:  Ogr/Delta-like zinc finger;  InterPro: IPR007684 This entry is represented by Bacteriophage P2, Ogr. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This is a viral family of phage zinc-binding transcriptional activators, which also contains cryptic members in some bacterial genomes []. The P4 phage delta protein contains two such domains attached covalently, while the P2 phage Ogr proteins possess one domain but function as dimers. All the members of this family have the following consensus sequence: C-X(2)-C-X(3)-A-(X)2-R-X(15)-C-X(4)-C-X(3)-F [].; GO: 0006355 regulation of transcription, DNA-dependent
Probab=69.90  E-value=2  Score=25.07  Aligned_cols=15  Identities=33%  Similarity=0.888  Sum_probs=10.0

Q ss_pred             ccCCCCcc--cccCCCC
Q psy838           38 TCSFCGKE--YKKSTDL   52 (152)
Q Consensus        38 ~c~~c~~~--~~~~~~l   52 (152)
                      .||.||..  ++++..+
T Consensus         1 ~CP~Cg~~a~ir~S~~~   17 (47)
T PF04606_consen    1 RCPHCGSKARIRTSRQL   17 (47)
T ss_pred             CcCCCCCeeEEEEchhh
Confidence            59999988  4444443


No 135
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=68.40  E-value=2.7  Score=30.00  Aligned_cols=11  Identities=27%  Similarity=0.534  Sum_probs=6.4

Q ss_pred             cccccCCCCcc
Q psy838           35 AKYTCSFCGKE   45 (152)
Q Consensus        35 ~~~~c~~c~~~   45 (152)
                      .|++||.||+.
T Consensus         8 tKr~Cp~cg~k   18 (129)
T TIGR02300         8 TKRICPNTGSK   18 (129)
T ss_pred             ccccCCCcCcc
Confidence            35566666665


No 136
>PF14787 zf-CCHC_5:  GAG-polyprotein viral zinc-finger; PDB: 1CL4_A 1DSV_A.
Probab=68.01  E-value=3.1  Score=23.07  Aligned_cols=15  Identities=27%  Similarity=0.698  Sum_probs=9.7

Q ss_pred             cccCCCCcccccCCC
Q psy838           37 YTCSFCGKEYKKSTD   51 (152)
Q Consensus        37 ~~c~~c~~~~~~~~~   51 (152)
                      -.||.|||-.+++++
T Consensus         3 ~~CprC~kg~Hwa~~   17 (36)
T PF14787_consen    3 GLCPRCGKGFHWASE   17 (36)
T ss_dssp             -C-TTTSSSCS-TTT
T ss_pred             ccCcccCCCcchhhh
Confidence            369999999777654


No 137
>KOG2593|consensus
Probab=67.25  E-value=2.7  Score=35.79  Aligned_cols=24  Identities=33%  Similarity=0.531  Sum_probs=16.4

Q ss_pred             HhHhHHhhh--hhcccccccCCCCcc
Q psy838           22 LRKMVKKME--ITQHAKYTCSFCGKE   45 (152)
Q Consensus        22 ~rk~~~~~~--~~~~~~~~c~~c~~~   45 (152)
                      +||+++.-+  ....+-|.||+|.+.
T Consensus       112 m~krled~~~d~t~~~~Y~Cp~C~kk  137 (436)
T KOG2593|consen  112 MRKRLEDRLRDDTNVAGYVCPNCQKK  137 (436)
T ss_pred             HHHHHHHHhhhccccccccCCccccc
Confidence            455555442  346778999999988


No 138
>PRK09710 lar restriction alleviation and modification protein; Reviewed
Probab=63.15  E-value=3.3  Score=25.98  Aligned_cols=8  Identities=50%  Similarity=1.406  Sum_probs=7.1

Q ss_pred             ccCCCCcc
Q psy838           38 TCSFCGKE   45 (152)
Q Consensus        38 ~c~~c~~~   45 (152)
                      -|||||..
T Consensus         8 PCPFCG~~   15 (64)
T PRK09710          8 PCPFCGCP   15 (64)
T ss_pred             CCCCCCCc
Confidence            49999988


No 139
>PF06689 zf-C4_ClpX:  ClpX C4-type zinc finger;  InterPro: IPR010603 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The ClpX heat shock protein of Escherichia coli is a member of the universally conserved Hsp100 family of proteins, and possesses a putative zinc finger motif of the C4 type []. This presumed zinc binding domain (ZBD) is found at the N terminus of the ClpX protein. ClpX is an ATPase which functions both as a substrate specificity component of the ClpXP protease and as a molecular chaperone. ZBD is a member of the treble clef zinc finger family, a motif known to facilitate protein-ligand, protein-DNA, and protein-protein interactions and forms a constitutive dimer that is essential for the degradation of some, but not all, ClpX substrates []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0016887 ATPase activity, 0046983 protein dimerization activity, 0006200 ATP catabolic process, 0019538 protein metabolic process; PDB: 2DS8_B 2DS6_B 2DS5_A 1OVX_A 2DS7_A.
Probab=62.96  E-value=3.3  Score=23.44  Aligned_cols=9  Identities=56%  Similarity=1.431  Sum_probs=4.5

Q ss_pred             cccCCCCcc
Q psy838           37 YTCSFCGKE   45 (152)
Q Consensus        37 ~~c~~c~~~   45 (152)
                      ..|.|||+.
T Consensus         2 ~~CSFCgr~   10 (41)
T PF06689_consen    2 KRCSFCGRP   10 (41)
T ss_dssp             -B-TTT--B
T ss_pred             CCccCCCCC
Confidence            579999998


No 140
>PF13912 zf-C2H2_6:  C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=62.90  E-value=3.2  Score=20.67  Aligned_cols=14  Identities=29%  Similarity=0.878  Sum_probs=10.6

Q ss_pred             ccccCCCCcccccC
Q psy838           36 KYTCSFCGKEYKKS   49 (152)
Q Consensus        36 ~~~c~~c~~~~~~~   49 (152)
                      .|.|+.|++.+.+.
T Consensus         1 ~~~C~~C~~~F~~~   14 (27)
T PF13912_consen    1 PFECDECGKTFSSL   14 (27)
T ss_dssp             SEEETTTTEEESSH
T ss_pred             CCCCCccCCccCCh
Confidence            47899999886543


No 141
>COG4068 Uncharacterized protein containing a Zn-ribbon [Function unknown]
Probab=62.43  E-value=9.5  Score=23.57  Aligned_cols=29  Identities=24%  Similarity=0.411  Sum_probs=23.1

Q ss_pred             ccCCCCcccccCCCCCHHHHHHHHHHHHH
Q psy838           38 TCSFCGKEYKKSTDLGAETKKWVWELEEK   66 (152)
Q Consensus        38 ~c~~c~~~~~~~~~l~~~d~~~i~~l~~~   66 (152)
                      -|+.||+.+...+..-.+.-.+|++-++.
T Consensus        10 HC~VCg~aIp~de~~CSe~C~eil~ker~   38 (64)
T COG4068          10 HCVVCGKAIPPDEQVCSEECGEILNKERK   38 (64)
T ss_pred             cccccCCcCCCccchHHHHHHHHHHHHHH
Confidence            49999999988888888877777765543


No 142
>TIGR03655 anti_R_Lar restriction alleviation protein, Lar family. Restriction alleviation proteins provide a countermeasure to host cell restriction enzyme defense against foreign DNA such as phage or plasmids. This family consists of homologs to the phage antirestriction protein Lar, and most members belong to phage genomes or prophage regions of bacterial genomes.
Probab=62.25  E-value=3.4  Score=24.62  Aligned_cols=8  Identities=50%  Similarity=1.497  Sum_probs=6.5

Q ss_pred             ccCCCCcc
Q psy838           38 TCSFCGKE   45 (152)
Q Consensus        38 ~c~~c~~~   45 (152)
                      -|||||..
T Consensus         3 PCPfCGg~   10 (53)
T TIGR03655         3 PCPFCGGA   10 (53)
T ss_pred             CCCCCCCc
Confidence            59999976


No 143
>PF05605 zf-Di19:  Drought induced 19 protein (Di19), zinc-binding;  InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=62.18  E-value=6.1  Score=23.44  Aligned_cols=25  Identities=16%  Similarity=0.402  Sum_probs=20.9

Q ss_pred             hhHhHhHHhhhhhcccccccCCCCc
Q psy838           20 ASLRKMVKKMEITQHAKYTCSFCGK   44 (152)
Q Consensus        20 ~~~rk~~~~~~~~~~~~~~c~~c~~   44 (152)
                      ..|...+.+-.......+.||.|.+
T Consensus        15 ~~L~~H~~~~H~~~~~~v~CPiC~~   39 (54)
T PF05605_consen   15 SSLVEHCEDEHRSESKNVVCPICSS   39 (54)
T ss_pred             HHHHHHHHhHCcCCCCCccCCCchh
Confidence            4688888888888878999999964


No 144
>COG2401 ABC-type ATPase fused to a predicted acetyltransferase domain [General function prediction only]
Probab=60.73  E-value=7.4  Score=33.76  Aligned_cols=28  Identities=14%  Similarity=0.243  Sum_probs=24.0

Q ss_pred             EEEeeeeeccccccCCHHHHHHHHHHhh
Q psy838          123 LYCYELQLEKQVQRKGLGKFMMQAFYTY  150 (152)
Q Consensus       123 ~~I~~i~V~p~~qgkGiG~~Lm~~l~~~  150 (152)
                      ..|-++.|||+||+-|+|..-|....++
T Consensus       242 ariarvvvhpdyr~dglg~~sv~~a~ew  269 (593)
T COG2401         242 ARIARVVVHPDYRADGLGQLSVIAALEW  269 (593)
T ss_pred             hheeEEEeccccccCccchhHHHHHHHH
Confidence            4488999999999999999988877665


No 145
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer.  Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain.  Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=60.25  E-value=3.8  Score=21.96  Aligned_cols=12  Identities=33%  Similarity=0.985  Sum_probs=9.0

Q ss_pred             ccccccCCCCcc
Q psy838           34 HAKYTCSFCGKE   45 (152)
Q Consensus        34 ~~~~~c~~c~~~   45 (152)
                      .....||.||..
T Consensus        15 ~~~~~CP~Cg~~   26 (33)
T cd00350          15 EAPWVCPVCGAP   26 (33)
T ss_pred             cCCCcCcCCCCc
Confidence            367889998864


No 146
>PF12171 zf-C2H2_jaz:  Zinc-finger double-stranded RNA-binding;  InterPro: IPR022755  This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation.   This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=59.59  E-value=6  Score=19.86  Aligned_cols=13  Identities=23%  Similarity=0.815  Sum_probs=9.7

Q ss_pred             ccccCCCCccccc
Q psy838           36 KYTCSFCGKEYKK   48 (152)
Q Consensus        36 ~~~c~~c~~~~~~   48 (152)
                      .|.|+.|++.+.+
T Consensus         1 q~~C~~C~k~f~~   13 (27)
T PF12171_consen    1 QFYCDACDKYFSS   13 (27)
T ss_dssp             -CBBTTTTBBBSS
T ss_pred             CCCcccCCCCcCC
Confidence            4789999988654


No 147
>PF14353 CpXC:  CpXC protein
Probab=59.50  E-value=6  Score=27.69  Aligned_cols=20  Identities=25%  Similarity=0.504  Sum_probs=15.7

Q ss_pred             HHhhhhhcccccccCCCCcc
Q psy838           26 VKKMEITQHAKYTCSFCGKE   45 (152)
Q Consensus        26 ~~~~~~~~~~~~~c~~c~~~   45 (152)
                      ..+|-..+...++||.||+.
T Consensus        28 ~e~il~g~l~~~~CP~Cg~~   47 (128)
T PF14353_consen   28 KEKILDGSLFSFTCPSCGHK   47 (128)
T ss_pred             HHHHHcCCcCEEECCCCCCc
Confidence            34555667779999999998


No 148
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=59.14  E-value=4.1  Score=33.67  Aligned_cols=17  Identities=35%  Similarity=0.860  Sum_probs=13.7

Q ss_pred             cccccCCCCcccccCCC
Q psy838           35 AKYTCSFCGKEYKKSTD   51 (152)
Q Consensus        35 ~~~~c~~c~~~~~~~~~   51 (152)
                      .+|.|++|+|.|.-+..
T Consensus       397 KPYrCevC~KRYKNlNG  413 (423)
T COG5189         397 KPYRCEVCDKRYKNLNG  413 (423)
T ss_pred             CceeccccchhhccCcc
Confidence            78999999998665543


No 149
>PF09986 DUF2225:  Uncharacterized protein conserved in bacteria (DUF2225);  InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=58.61  E-value=9.4  Score=29.43  Aligned_cols=62  Identities=21%  Similarity=0.143  Sum_probs=40.3

Q ss_pred             cceeeeccccccccchhHhHhHHhhhhhcccccccCCCCcccc--cCCCCCHHHHHHHHHHHHH
Q psy838            5 TKKVGITGKYGTRYGASLRKMVKKMEITQHAKYTCSFCGKEYK--KSTDLGAETKKWVWELEEK   66 (152)
Q Consensus         5 ~~~~~~~~~~g~ryg~~~rk~~~~~~~~~~~~~~c~~c~~~~~--~~~~l~~~d~~~i~~l~~~   66 (152)
                      |++|.+.+-=..+.-.=++++++.+.---..-.+||.||--..  ....+++.+.+.+.+....
T Consensus        17 ~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~~~F~~l~~~~~~~i~~~i~~   80 (214)
T PF09986_consen   17 TKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFEEDFEKLSPEQKEKIKENISS   80 (214)
T ss_pred             eeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCcccccccccCCHHHHHHHHHHHHh
Confidence            3444443332335556666777777777778889999997732  2467788888877775543


No 150
>PF04423 Rad50_zn_hook:  Rad50 zinc hook motif;  InterPro: IPR007517 The Mre11 complex (Mre11 Rad50 Nbs1) is central to chromosomal maintenance and functions in homologous recombination, telomere maintenance and sister chromatid association. The Rad50 coiled-coil region contains a dimer interface at the apex of the coiled coils in which pairs of conserved Cys-X-X-Cys motifs form interlocking hooks that bind one Zn ion. This alignment includes the zinc hook motif and a short stretch of coiled-coil on either side.; GO: 0004518 nuclease activity, 0005524 ATP binding, 0008270 zinc ion binding, 0006281 DNA repair; PDB: 1L8D_B.
Probab=58.57  E-value=27  Score=20.62  Aligned_cols=34  Identities=24%  Similarity=0.527  Sum_probs=16.4

Q ss_pred             HhHhHHhhhhhcccccccCCCCcccccCCCCCHHHHHHHHHHH
Q psy838           22 LRKMVKKMEITQHAKYTCSFCGKEYKKSTDLGAETKKWVWELE   64 (152)
Q Consensus        22 ~rk~~~~~~~~~~~~~~c~~c~~~~~~~~~l~~~d~~~i~~l~   64 (152)
                      +.+.+.++...   +-.||.||++      ++++....+..=.
T Consensus         9 ~~k~i~~l~~~---~~~CPlC~r~------l~~e~~~~li~~~   42 (54)
T PF04423_consen    9 LKKYIEELKEA---KGCCPLCGRP------LDEEHRQELIKKY   42 (54)
T ss_dssp             HHHHHHHHTT----SEE-TTT--E------E-HHHHHHHHHHH
T ss_pred             HHHHHHHHhcC---CCcCCCCCCC------CCHHHHHHHHHHH
Confidence            44555555543   3399999754      5555555555433


No 151
>PF08271 TF_Zn_Ribbon:  TFIIB zinc-binding;  InterPro: IPR013137 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents a zinc finger motif found in transcription factor IIB (TFIIB). In eukaryotes the initiation of transcription of protein encoding genes by the polymerase II complexe (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least seven different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, and -IIH [].  TFIIB and TFIID are responsible for promoter recognition and interaction with pol II; together with Pol II, they form a minimal initiation complex capable of transcription under certain conditions. The TATA box of a Pol II promoter is bound in the initiation complex by the TBP subunit of TFIID, which bends the DNA around the C-terminal domain of TFIIB whereas the N-terminal zinc finger of TFIIB interacts with Pol II [, ]. The TFIIB zinc finger adopts a zinc ribbon fold characterised by two beta-hairpins forming two structurally similar zinc-binding sub-sites []. The zinc finger contacts the rbp1 subunit of Pol II through its dock domain, a conserved region of about 70 amino acids located close to the polymerase active site []. In the Pol II complex this surface is located near the RNA exit groove. Interestingly this sequence is best conserved in the three polymerases that utilise a TFIIB-like general transcription factor (Pol II, Pol III, and archaeal RNA polymerase) but not in Pol I [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1VD4_A 1PFT_A 3K1F_M 3K7A_M 1RO4_A 1RLY_A 1DL6_A.
Probab=58.45  E-value=4.9  Score=22.74  Aligned_cols=9  Identities=33%  Similarity=1.040  Sum_probs=6.1

Q ss_pred             cccCCCCcc
Q psy838           37 YTCSFCGKE   45 (152)
Q Consensus        37 ~~c~~c~~~   45 (152)
                      |.||.||..
T Consensus         1 m~Cp~Cg~~    9 (43)
T PF08271_consen    1 MKCPNCGSK    9 (43)
T ss_dssp             ESBTTTSSS
T ss_pred             CCCcCCcCC
Confidence            467777765


No 152
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=58.15  E-value=4.8  Score=24.87  Aligned_cols=12  Identities=50%  Similarity=0.980  Sum_probs=9.2

Q ss_pred             cccccccCCCCc
Q psy838           33 QHAKYTCSFCGK   44 (152)
Q Consensus        33 ~~~~~~c~~c~~   44 (152)
                      |-..|+||.||-
T Consensus        45 ~~~~Y~CP~CGF   56 (59)
T PRK14890         45 QSNPYTCPKCGF   56 (59)
T ss_pred             cCCceECCCCCC
Confidence            446899999984


No 153
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=57.92  E-value=4.8  Score=24.94  Aligned_cols=12  Identities=33%  Similarity=0.753  Sum_probs=9.2

Q ss_pred             cccccccCCCCc
Q psy838           33 QHAKYTCSFCGK   44 (152)
Q Consensus        33 ~~~~~~c~~c~~   44 (152)
                      +-..|+||.||-
T Consensus        47 ~g~~Y~Cp~CGF   58 (61)
T COG2888          47 LGNPYRCPKCGF   58 (61)
T ss_pred             cCCceECCCcCc
Confidence            456889999984


No 154
>PF09889 DUF2116:  Uncharacterized protein containing a Zn-ribbon (DUF2116);  InterPro: IPR019216 This entry contains various hypothetical prokaryotic proteins whose functions are unknown. They contain a conserved zinc ribbon motif in the N-terminal part and a predicted transmembrane segment in the C-terminal part.
Probab=57.79  E-value=17  Score=22.44  Aligned_cols=24  Identities=17%  Similarity=0.418  Sum_probs=15.2

Q ss_pred             ccCCCCcccccCCCCCHHHHHHHH
Q psy838           38 TCSFCGKEYKKSTDLGAETKKWVW   61 (152)
Q Consensus        38 ~c~~c~~~~~~~~~l~~~d~~~i~   61 (152)
                      .||.||+++.+.+..-.++=...+
T Consensus         5 HC~~CG~~Ip~~~~fCS~~C~~~~   28 (59)
T PF09889_consen    5 HCPVCGKPIPPDESFCSPKCREEY   28 (59)
T ss_pred             cCCcCCCcCCcchhhhCHHHHHHH
Confidence            599999987765555444433333


No 155
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=57.71  E-value=6.2  Score=25.36  Aligned_cols=19  Identities=16%  Similarity=0.543  Sum_probs=13.4

Q ss_pred             cccCCCCcc--cccCCCCCHH
Q psy838           37 YTCSFCGKE--YKKSTDLGAE   55 (152)
Q Consensus        37 ~~c~~c~~~--~~~~~~l~~~   55 (152)
                      ..||+||..  ++++..+++.
T Consensus         2 m~CP~Cg~~a~irtSr~~s~~   22 (72)
T PRK09678          2 FHCPLCQHAAHARTSRYITDT   22 (72)
T ss_pred             ccCCCCCCccEEEEChhcChh
Confidence            479999988  5666655543


No 156
>PF12294 DUF3626:  Protein of unknown function (DUF3626);  InterPro: IPR022074  This family of proteins is found in bacteria. Proteins in this family are typically between 294 and 374 amino acids in length. 
Probab=57.56  E-value=12  Score=30.38  Aligned_cols=22  Identities=9%  Similarity=-0.008  Sum_probs=18.1

Q ss_pred             EeeeeeccccccCCHHHHHHHH
Q psy838          125 CYELQLEKQVQRKGLGKFMMQA  146 (152)
Q Consensus       125 I~~i~V~p~~qgkGiG~~Lm~~  146 (152)
                      |..|.++|.|||.-+++.+-+.
T Consensus       191 VeaLVlDPsyrgT~ve~~~~~l  212 (297)
T PF12294_consen  191 VEALVLDPSYRGTEVEAAARAL  212 (297)
T ss_pred             hHHHhcCccccCChHHHHHHHH
Confidence            5568899999999999987543


No 157
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=57.51  E-value=3.4  Score=23.74  Aligned_cols=10  Identities=30%  Similarity=1.032  Sum_probs=8.4

Q ss_pred             ccccCCCCcc
Q psy838           36 KYTCSFCGKE   45 (152)
Q Consensus        36 ~~~c~~c~~~   45 (152)
                      ..+||+||..
T Consensus        21 ~~~Cp~CG~~   30 (46)
T PRK00398         21 GVRCPYCGYR   30 (46)
T ss_pred             ceECCCCCCe
Confidence            6789999887


No 158
>TIGR03019 pepcterm_femAB FemAB-related protein, PEP-CTERM system-associated. Members of this protein family are found always as part of extended exopolysaccharide biosynthesis loci in bacteria. In nearly every case, these loci contain determinants for the processing of the PEP-CTERM proposed C-terminal protein sorting signal. This family shows remote, local sequence similarity to the FemAB protein family (see pfam02388), whose members
Probab=57.12  E-value=1e+02  Score=24.84  Aligned_cols=112  Identities=11%  Similarity=0.035  Sum_probs=64.2

Q ss_pred             cccchhHhHhHHhhhhhcccccccCCCCcccccCCCCCHHHHHHHHHHHHHhHHHHHHhCCC-CCChhhhhhhh--cCCC
Q psy838           16 TRYGASLRKMVKKMEITQHAKYTCSFCGKEYKKSTDLGAETKKWVWELEEKNMKHSYEVCDI-GWDPQGKHSEM--FDDR   92 (152)
Q Consensus        16 ~ryg~~~rk~~~~~~~~~~~~~~c~~c~~~~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~-~w~~~~~~~el--~~~~   92 (152)
                      .++.++.|+++++.+..   -.++       ..     .+|++..+++..++|...    +. ..+.+.+..-+  ..+.
T Consensus       134 ~~~~~k~R~~IRka~k~---Gv~v-------~~-----~~~l~~F~~l~~~t~~r~----g~p~~~~~~f~~l~~~~~~~  194 (330)
T TIGR03019       134 LAIPRKQRAMVRKGIKA---GLTV-------TV-----DGDLDRFYDVYAENMRDL----GTPVFSRRYFRLLKDVFGED  194 (330)
T ss_pred             HhcCHHHHHHHHHHHHC---CeEE-------EE-----CCcHHHHHHHHHHHHhcC----CCCCCCHHHHHHHHHhcccC
Confidence            34566778888876543   2221       11     134888888888877542    22 34454443322  1234


Q ss_pred             ceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838           93 ACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus        93 ~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      ..++++... +|++|+.+.+... .  . .++.+-..-.++++..+.+..|+..+++.|
T Consensus       195 ~~l~~a~~~-~g~~va~~l~~~~-~--~-~~~~~~~g~~~~~~~~~~~~lL~w~~i~~a  248 (330)
T TIGR03019       195 CEVLTVRLG-DGVVASAVLSFYF-R--D-EVLPYYAGGLREARDVAANDLMYWELMRRA  248 (330)
T ss_pred             EEEEEEEeC-CCCEEEEEEEEEe-C--C-EEEEEeccChHHHHhhChHHHHHHHHHHHH
Confidence            445555522 3787776554221 1  1 122223456789999999999999888765


No 159
>smart00355 ZnF_C2H2 zinc finger.
Probab=57.12  E-value=5.5  Score=18.76  Aligned_cols=12  Identities=50%  Similarity=1.204  Sum_probs=9.1

Q ss_pred             cccCCCCccccc
Q psy838           37 YTCSFCGKEYKK   48 (152)
Q Consensus        37 ~~c~~c~~~~~~   48 (152)
                      |.|+.|++.+..
T Consensus         1 ~~C~~C~~~f~~   12 (26)
T smart00355        1 YRCPECGKVFKS   12 (26)
T ss_pred             CCCCCCcchhCC
Confidence            579999877554


No 160
>PF03119 DNA_ligase_ZBD:  NAD-dependent DNA ligase C4 zinc finger domain;  InterPro: IPR004149 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents the zinc finger domain found in NAD-dependent DNA ligases. DNA ligases catalyse the crucial step of joining the breaks in duplex DNA during DNA replication, repair and recombination, utilizing either ATP or NAD(+) as a cofactor []. This domain is a small zinc binding motif that is presumably DNA binding. It is found only in NAD-dependent DNA ligases. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003911 DNA ligase (NAD+) activity, 0006260 DNA replication, 0006281 DNA repair; PDB: 1DGS_A 1V9P_B 2OWO_A.
Probab=57.03  E-value=5.6  Score=20.65  Aligned_cols=10  Identities=40%  Similarity=1.072  Sum_probs=5.3

Q ss_pred             ccCCCCcccc
Q psy838           38 TCSFCGKEYK   47 (152)
Q Consensus        38 ~c~~c~~~~~   47 (152)
                      +||.||....
T Consensus         1 ~CP~C~s~l~   10 (28)
T PF03119_consen    1 TCPVCGSKLV   10 (28)
T ss_dssp             B-TTT--BEE
T ss_pred             CcCCCCCEeE
Confidence            5999998844


No 161
>PRK14892 putative transcription elongation factor Elf1; Provisional
Probab=55.97  E-value=3.6  Score=28.07  Aligned_cols=14  Identities=29%  Similarity=0.667  Sum_probs=9.1

Q ss_pred             hcccccccCCCCcc
Q psy838           32 TQHAKYTCSFCGKE   45 (152)
Q Consensus        32 ~~~~~~~c~~c~~~   45 (152)
                      .....++||+||..
T Consensus        17 klpt~f~CP~Cge~   30 (99)
T PRK14892         17 KLPKIFECPRCGKV   30 (99)
T ss_pred             CCCcEeECCCCCCe
Confidence            33456778888843


No 162
>PF09862 DUF2089:  Protein of unknown function (DUF2089);  InterPro: IPR018658  This family consists of various hypothetical prokaryotic proteins. 
Probab=55.72  E-value=14  Score=25.85  Aligned_cols=31  Identities=19%  Similarity=0.391  Sum_probs=25.2

Q ss_pred             ccccCCCCcc------cccCCCCCHHHHHHHHHHHHH
Q psy838           36 KYTCSFCGKE------YKKSTDLGAETKKWVWELEEK   66 (152)
Q Consensus        36 ~~~c~~c~~~------~~~~~~l~~~d~~~i~~l~~~   66 (152)
                      .+.||.|+..      ..+...++++|++.+..+...
T Consensus        12 ~l~C~~C~t~i~G~F~l~~~~~L~~E~~~Fi~~Fi~~   48 (113)
T PF09862_consen   12 RLKCPSCGTEIEGEFELPWFARLSPEQLEFIKLFIKN   48 (113)
T ss_pred             EEEcCCCCCEEEeeeccchhhcCCHHHHHHHHHHHHh
Confidence            6778888887      677799999999988876554


No 163
>PRK15312 antimicrobial resistance protein Mig-14; Provisional
Probab=55.41  E-value=35  Score=27.86  Aligned_cols=116  Identities=10%  Similarity=0.157  Sum_probs=63.5

Q ss_pred             eeeeccc-cccccchhHhHhHHhhhhhcccccccCCCCcccccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCCh----
Q psy838            7 KVGITGK-YGTRYGASLRKMVKKMEITQHAKYTCSFCGKEYKKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDP----   81 (152)
Q Consensus         7 ~~~~~~~-~g~ryg~~~rk~~~~~~~~~~~~~~c~~c~~~~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~----   81 (152)
                      ..+.+.- |-..+-.+-|+-+++.+..          |-.+++.++.+++++.+|+.-       +|+. .|+-..    
T Consensus       127 ~~claK~~fSkKt~~~rrrEl~kF~~~----------GG~v~~is~fS~~Ela~iY~~-------Lf~~-Rwg~~~~~~~  188 (298)
T PRK15312        127 QNCLVKETFSSKFEKTRRNEYQRFLRN----------GGSVKSVADCSSDELTHIFIE-------LFRS-RFGNTLSCYP  188 (298)
T ss_pred             hhhhehhhhhhHhHHHHHHHHHHHHHc----------CCEEEEhHHCCHHHHHHHHHH-------HHHH-HhCCCCCccc
Confidence            3444444 5555555555555554432          455889999999999988842       3333 333222    


Q ss_pred             hhhhhhhcCC--C--ceEEEEEeCCCCCeEEEEEEEeeccCCcce-EEEeeeeeccccccCCHHHHHH
Q psy838           82 QGKHSEMFDD--R--ACYLVAKNGSSSTPVAFSHFRFDVDFGEPV-LYCYELQLEKQVQRKGLGKFMM  144 (152)
Q Consensus        82 ~~~~~el~~~--~--~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~-~~I~~i~V~p~~qgkGiG~~Lm  144 (152)
                      .....++.+.  +  .-.++ +-+  |++++|=.+... +....+ +..-+.-++|++..--+|+-||
T Consensus       189 ~~~l~e~f~~Lr~l~fG~VL-fl~--~~PcA~qlv~k~-eSp~wi~~D~iNgG~Dpe~~~~spGSIL~  252 (298)
T PRK15312        189 ADNLANFFSQLRHLLFGHIL-YIE--GIPCAFDIVLKS-ESQMNVYFDVPNGAVKNECMPLSPGSILM  252 (298)
T ss_pred             HHHHHHHHHHhHHhheeeEE-EEC--CcceEEEEEEEe-cCCCcEEEecccCccCcccccCCCccEEE
Confidence            2222333321  1  11222 233  899998655432 222221 1133456999999999998876


No 164
>COG1675 TFA1 Transcription initiation factor IIE, alpha subunit [Transcription]
Probab=54.99  E-value=9.7  Score=28.71  Aligned_cols=13  Identities=31%  Similarity=0.905  Sum_probs=8.4

Q ss_pred             cccccccCCCCcc
Q psy838           33 QHAKYTCSFCGKE   45 (152)
Q Consensus        33 ~~~~~~c~~c~~~   45 (152)
                      ....++||.||.+
T Consensus       129 ~~~~F~Cp~Cg~~  141 (176)
T COG1675         129 MELGFTCPKCGED  141 (176)
T ss_pred             HHhCCCCCCCCch
Confidence            3445777777766


No 165
>COG3916 LasI N-acyl-L-homoserine lactone synthetase [Signal transduction mechanisms / Secondary metabolites biosynthesis, transport, and catabolism]
Probab=54.63  E-value=1e+02  Score=23.97  Aligned_cols=96  Identities=13%  Similarity=0.151  Sum_probs=58.2

Q ss_pred             CCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhh----hhhcCCCceEEEEEeCCCCCeEEEEEEEeecc--------
Q psy838           50 TDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKH----SEMFDDRACYLVAKNGSSSTPVAFSHFRFDVD--------  117 (152)
Q Consensus        50 ~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~----~el~~~~~~~~v~~~~~~~~ivGf~~~~~~~~--------  117 (152)
                      ....++-+++++.|-.+    .|.. ..+|+....-    ++.-+.+..|+++... +|+++|.+.+-+...        
T Consensus        10 ~~~~~~~l~em~rlR~~----vF~e-rL~W~v~~~~g~E~DqyD~~~t~Yll~~~~-~g~I~G~~RlLptt~P~mL~~vF   83 (209)
T COG3916          10 RELFPKALEEMHRLRYQ----VFKE-RLGWDVVCIDGFEIDQYDNLDTVYLLALTS-DGRIVGCVRLLPTTGPYMLTDVF   83 (209)
T ss_pred             chhcHHHHHHHHHHHHH----HHHH-hcCCceeccCCccccccCCCCceEEEEEcC-CCcEEEEEEeccCCCcchhhhhh
Confidence            44566778888888776    4444 4567654321    2223456778887544 699999999865321        


Q ss_pred             ----------CCcceEEEeeeeecc--ccccC---C-HHHHHHHHHHhhc
Q psy838          118 ----------FGEPVLYCYELQLEK--QVQRK---G-LGKFMMQAFYTYI  151 (152)
Q Consensus       118 ----------~~~~~~~I~~i~V~p--~~qgk---G-iG~~Lm~~l~~~a  151 (152)
                                ....+++..++.|++  .-+..   . ++..||..+.++|
T Consensus        84 ~~Ll~~~~~P~~p~vwEsSRF~vd~~~a~~~~g~~~~a~~el~~g~ie~a  133 (209)
T COG3916          84 PALLEGGPPPSSPGVWESSRFAVDKPSARRAAGGVSPAAYELFAGMIEYA  133 (209)
T ss_pred             HHHhcCCCCCCCCCeEEEeeeeeccccchhhcCCccHHHHHHHHHHHHHH
Confidence                      112467799998886  22222   2 3556666665543


No 166
>PF13913 zf-C2HC_2:  zinc-finger of a C2HC-type
Probab=53.96  E-value=5.9  Score=19.87  Aligned_cols=11  Identities=27%  Similarity=1.204  Sum_probs=8.6

Q ss_pred             cccCCCCcccc
Q psy838           37 YTCSFCGKEYK   47 (152)
Q Consensus        37 ~~c~~c~~~~~   47 (152)
                      ..||.||+.+.
T Consensus         3 ~~C~~CgR~F~   13 (25)
T PF13913_consen    3 VPCPICGRKFN   13 (25)
T ss_pred             CcCCCCCCEEC
Confidence            46999998864


No 167
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and  believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=53.40  E-value=5.6  Score=21.55  Aligned_cols=12  Identities=33%  Similarity=0.825  Sum_probs=7.9

Q ss_pred             ccccccCCCCcc
Q psy838           34 HAKYTCSFCGKE   45 (152)
Q Consensus        34 ~~~~~c~~c~~~   45 (152)
                      .++..||.||..
T Consensus        16 ~~p~~CP~Cg~~   27 (34)
T cd00729          16 EAPEKCPICGAP   27 (34)
T ss_pred             cCCCcCcCCCCc
Confidence            355678888754


No 168
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=52.55  E-value=11  Score=27.28  Aligned_cols=15  Identities=33%  Similarity=0.857  Sum_probs=11.7

Q ss_pred             cccccccCCCCcccc
Q psy838           33 QHAKYTCSFCGKEYK   47 (152)
Q Consensus        33 ~~~~~~c~~c~~~~~   47 (152)
                      ...-|.||.||..+.
T Consensus        96 ~~~~Y~Cp~C~~~y~  110 (147)
T smart00531       96 NNAYYKCPNCQSKYT  110 (147)
T ss_pred             CCcEEECcCCCCEee
Confidence            567899999986654


No 169
>COG2075 RPL24A Ribosomal protein L24E [Translation, ribosomal structure and biogenesis]
Probab=52.23  E-value=7  Score=24.67  Aligned_cols=14  Identities=43%  Similarity=0.952  Sum_probs=10.1

Q ss_pred             ccccCCCCcccccC
Q psy838           36 KYTCSFCGKEYKKS   49 (152)
Q Consensus        36 ~~~c~~c~~~~~~~   49 (152)
                      -.+|.|||+++-+.
T Consensus         3 ~~~CsFcG~~I~PG   16 (66)
T COG2075           3 VRVCSFCGKKIEPG   16 (66)
T ss_pred             eeEecCcCCccCCC
Confidence            36899999984443


No 170
>COG2260 Predicted Zn-ribbon RNA-binding protein [Translation, ribosomal structure and biogenesis]
Probab=52.19  E-value=7.8  Score=23.88  Aligned_cols=20  Identities=20%  Similarity=0.576  Sum_probs=13.3

Q ss_pred             cccCCCCcc--cccCCCCCHHH
Q psy838           37 YTCSFCGKE--YKKSTDLGAET   56 (152)
Q Consensus        37 ~~c~~c~~~--~~~~~~l~~~d   56 (152)
                      -+||.||.+  ..--...+++|
T Consensus        18 e~Cp~CG~~t~~~~PprFSPeD   39 (59)
T COG2260          18 EKCPVCGGDTKVPHPPRFSPED   39 (59)
T ss_pred             ccCCCCCCccccCCCCCCCccc
Confidence            579999988  33334556666


No 171
>PF13240 zinc_ribbon_2:  zinc-ribbon domain
Probab=51.96  E-value=9.4  Score=18.82  Aligned_cols=6  Identities=50%  Similarity=1.497  Sum_probs=3.2

Q ss_pred             cCCCCc
Q psy838           39 CSFCGK   44 (152)
Q Consensus        39 c~~c~~   44 (152)
                      ||.||+
T Consensus        16 C~~CG~   21 (23)
T PF13240_consen   16 CPNCGT   21 (23)
T ss_pred             hhhhCC
Confidence            555554


No 172
>PRK13130 H/ACA RNA-protein complex component Nop10p; Reviewed
Probab=50.83  E-value=10  Score=23.15  Aligned_cols=21  Identities=29%  Similarity=0.608  Sum_probs=13.7

Q ss_pred             ccccCCCCcccccCC--CCCHHH
Q psy838           36 KYTCSFCGKEYKKST--DLGAET   56 (152)
Q Consensus        36 ~~~c~~c~~~~~~~~--~l~~~d   56 (152)
                      +-.||.||.....++  ..+++|
T Consensus        17 k~~CP~CG~~t~~~~P~rfSp~D   39 (56)
T PRK13130         17 KEICPVCGGKTKNPHPPRFSPED   39 (56)
T ss_pred             cccCcCCCCCCCCCCCCCCCCCC
Confidence            567999998855443  345555


No 173
>COG3677 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Probab=50.76  E-value=22  Score=25.31  Aligned_cols=21  Identities=29%  Similarity=0.773  Sum_probs=16.7

Q ss_pred             ccccccCCCCcc--cccCCCCCH
Q psy838           34 HAKYTCSFCGKE--YKKSTDLGA   54 (152)
Q Consensus        34 ~~~~~c~~c~~~--~~~~~~l~~   54 (152)
                      ++.|.|+.|++.  .....++..
T Consensus        51 ~qRyrC~~C~~tf~~~~~~~~~~   73 (129)
T COG3677          51 HQRYKCKSCGSTFTVETGSPLSK   73 (129)
T ss_pred             ccccccCCcCcceeeeccCcccc
Confidence            889999999999  555566665


No 174
>PF04377 ATE_C:  Arginine-tRNA-protein transferase, C terminus;  InterPro: IPR007472 Arginine-tRNA-protein transferase catalyses the post-translational conjugation of arginine to the N terminus of a protein. In eukaryotes, this functions as part of the N terminus rule pathway of protein degradation by conjugating a destabilising amino acid to the N-terminal aspartate or glutamate of a protein, targeting the protein for ubiquitin-dependent proteolysis. N-terminal cysteine is sometimes modified []. In Saccharomyces cerevisiae, Cys20, 23, 94 and/or 95 are thought to be important for activity []. Of these, only Cys 94 appears to be completely conserved in this family.  This entry represents the C-terminal region of the enzyme arginine-tRNA-protein transferase, found in both eukaryotic and prokaryotic enzymes.; GO: 0004057 arginyltransferase activity, 0016598 protein arginylation
Probab=50.73  E-value=88  Score=22.19  Aligned_cols=45  Identities=11%  Similarity=0.195  Sum_probs=31.4

Q ss_pred             CCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          103 SSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       103 ~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      +|++||-+.+-.. +++-..  |+.+ -+|++...++|+..+-.-+++|
T Consensus        47 ~~kLiav~v~D~l-~~glSa--VY~f-yDPd~~~~SlG~~~iL~eI~~a   91 (128)
T PF04377_consen   47 DGKLIAVAVVDIL-PDGLSA--VYTF-YDPDYSKRSLGTYSILREIELA   91 (128)
T ss_pred             CCeEEEEEEeecc-cchhhh--eeee-eCCCccccCcHHHHHHHHHHHH
Confidence            4999998776443 333222  3322 5899999999999887777665


No 175
>PF05741 zf-nanos:  Nanos RNA binding domain;  InterPro: IPR024161 Nanos is a highly conserved RNA-binding protein in higher eukaryotes and functions as a key regulatory protein in translational control using a 3' untranslated region during the development and maintenance of germ cells. Nanos comprises a non-conserved amino-terminus and highly conserved carboxy- terminal regions. The C-terminal region has two conserved Cys-Cys-His-Cys (CCHC)-type zinc-finger motifs that are indispensable for nanos function [, , ]. The structure of the nanos-type zinc finger is composed of two independent zinc-finger (ZF) lobes, the N-terminal ZF1 and the C-terminal ZF2, which are connected by a linker helix []. These lobes create a large cleft. Zinc ions in ZF1 and ZF2 are bound to the CCHC motif by tetrahedral coordination.; PDB: 3ALR_B.
Probab=50.56  E-value=4.9  Score=24.47  Aligned_cols=23  Identities=22%  Similarity=0.544  Sum_probs=8.2

Q ss_pred             hHhHHhhhhhcccccccCCCCcc
Q psy838           23 RKMVKKMEITQHAKYTCSFCGKE   45 (152)
Q Consensus        23 rk~~~~~~~~~~~~~~c~~c~~~   45 (152)
                      |+.-.++.=+...+|+||.||.-
T Consensus        20 k~~~G~v~CPvLr~y~Cp~CgAt   42 (55)
T PF05741_consen   20 KDPDGRVTCPVLRKYVCPICGAT   42 (55)
T ss_dssp             B-TTS-B--TTGGG---TTT---
T ss_pred             ECCCCCEeCHHHhcCcCCCCcCc
Confidence            33333444455678999999964


No 176
>PF13824 zf-Mss51:  Zinc-finger of mitochondrial splicing suppressor 51
Probab=50.24  E-value=6.7  Score=23.87  Aligned_cols=25  Identities=20%  Similarity=0.430  Sum_probs=16.8

Q ss_pred             hcccccccCCCCcccccCCCCCHHH
Q psy838           32 TQHAKYTCSFCGKEYKKSTDLGAET   56 (152)
Q Consensus        32 ~~~~~~~c~~c~~~~~~~~~l~~~d   56 (152)
                      ..+.+|+||.||-+..-+...=.+|
T Consensus        10 ~~~v~~~Cp~cGipthcS~ehw~~D   34 (55)
T PF13824_consen   10 PAHVNFECPDCGIPTHCSEEHWEDD   34 (55)
T ss_pred             ccccCCcCCCCCCcCccCHHHHHHh
Confidence            4567899999999854444443444


No 177
>PF05129 Elf1:  Transcription elongation factor Elf1 like;  InterPro: IPR007808 This family of uncharacterised, mostly short, proteins contain a putative zinc binding domain with four conserved cysteines.; PDB: 1WII_A.
Probab=50.13  E-value=6.6  Score=25.70  Aligned_cols=14  Identities=21%  Similarity=0.695  Sum_probs=4.2

Q ss_pred             hhcccccccCCCCc
Q psy838           31 ITQHAKYTCSFCGK   44 (152)
Q Consensus        31 ~~~~~~~~c~~c~~   44 (152)
                      ......+.||||+.
T Consensus        17 ~~l~~~F~CPfC~~   30 (81)
T PF05129_consen   17 PKLPKVFDCPFCNH   30 (81)
T ss_dssp             ---SS----TTT--
T ss_pred             CCCCceEcCCcCCC
Confidence            34456899999993


No 178
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=49.66  E-value=10  Score=28.27  Aligned_cols=25  Identities=20%  Similarity=0.599  Sum_probs=17.6

Q ss_pred             hhHhHhHHhhhhhcccccccCCCCccc
Q psy838           20 ASLRKMVKKMEITQHAKYTCSFCGKEY   46 (152)
Q Consensus        20 ~~~rk~~~~~~~~~~~~~~c~~c~~~~   46 (152)
                      ..+|+..+.++..+  .|.||.||-.+
T Consensus       120 ~~~~~~Le~~~~~~--~~vC~vCGy~~  144 (166)
T COG1592         120 EMFRGLLERLEEGK--VWVCPVCGYTH  144 (166)
T ss_pred             HHHHHHHHhhhcCC--EEEcCCCCCcc
Confidence            44566666666655  89999999653


No 179
>PF10058 DUF2296:  Predicted integral membrane metal-binding protein (DUF2296);  InterPro: IPR019273  This domain, found mainly in the eukaryotic lunapark proteins, has no known function []. 
Probab=49.54  E-value=7.3  Score=23.51  Aligned_cols=15  Identities=33%  Similarity=0.972  Sum_probs=11.2

Q ss_pred             hhhcccccccCCCCc
Q psy838           30 EITQHAKYTCSFCGK   44 (152)
Q Consensus        30 ~~~~~~~~~c~~c~~   44 (152)
                      +....-.|.||.||.
T Consensus        38 ~~~~~i~y~C~~Cg~   52 (54)
T PF10058_consen   38 EEFEEIQYRCPYCGA   52 (54)
T ss_pred             ccCCceEEEcCCCCC
Confidence            444555999999985


No 180
>PRK10144 formate-dependent nitrite reductase complex subunit NrfF; Provisional
Probab=48.86  E-value=21  Score=25.44  Aligned_cols=54  Identities=11%  Similarity=0.184  Sum_probs=30.0

Q ss_pred             cccccCCCCcccccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEE
Q psy838           35 AKYTCSFCGKEYKKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLV   97 (152)
Q Consensus        35 ~~~~c~~c~~~~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v   97 (152)
                      ....||.|...     .+..++.+-.-++-..    .++.-..|.+++++.+-|.+....+++
T Consensus        39 ~~LRC~vCqnq-----siadSna~iA~dmR~~----Vr~~i~~G~sd~eI~~~~v~RYG~~Vl   92 (126)
T PRK10144         39 SQLRCPQCQNQ-----NLLESNAPVAVSMRHQ----VYSMVAEGKSEVEIIGWMTERYGDFVR   92 (126)
T ss_pred             HcCCCCCCCCC-----ChhhcCCHHHHHHHHH----HHHHHHcCCCHHHHHHHHHHhcCCeEE
Confidence            35789999755     3333333333333222    222224578999988888765444433


No 181
>PF07395 Mig-14:  Mig-14;  InterPro: IPR009977 This family contains a number of bacterial mig-14 proteins (approximately 270 residues long). In Salmonella, mig-14 contributes to resistance to antimicrobial peptides, although the mechanism is not fully understood [].
Probab=48.50  E-value=57  Score=26.23  Aligned_cols=115  Identities=10%  Similarity=0.168  Sum_probs=65.1

Q ss_pred             eeeecc---ccccccchhHhHhHHhhhhhcccccccCCCCcccccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCC--Ch
Q psy838            7 KVGITG---KYGTRYGASLRKMVKKMEITQHAKYTCSFCGKEYKKSTDLGAETKKWVWELEEKNMKHSYEVCDIGW--DP   81 (152)
Q Consensus         7 ~~~~~~---~~g~ryg~~~rk~~~~~~~~~~~~~~c~~c~~~~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w--~~   81 (152)
                      ..+++.   .|-..+-.+-|+.+++.+..          |-.+++.++++++++.+|+.-       +|+. -|+.  ..
T Consensus        97 ~iclaK~~e~fSkKt~~~rrrElrkF~~~----------GG~v~~v~~~S~~Ela~iY~~-------Lf~~-Rwg~~~~~  158 (264)
T PF07395_consen   97 QICLAKGPESFSKKTRKNRRRELRKFIEA----------GGSVRPVSEFSPEELADIYID-------LFQK-RWGFRCYG  158 (264)
T ss_pred             ceeeEcCchhhchHHHHHHHHHHHHHHHc----------CCEEEEHHHCCHHHHHHHHHH-------HHHH-HhCCCCCc
Confidence            444444   56655556666666665543          445888999999999998853       3332 3331  11


Q ss_pred             hhhhhhhcCCC----ceEEEEEeCCCCCeEEEEEEEeeccCCcceEEE--eeeeeccccccCCHHHHHH
Q psy838           82 QGKHSEMFDDR----ACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYC--YELQLEKQVQRKGLGKFMM  144 (152)
Q Consensus        82 ~~~~~el~~~~----~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I--~~i~V~p~~qgkGiG~~Lm  144 (152)
                      .....++.+.-    .-.++. -+  |+++++=.+.-. +... .+++  -+.-++|++..--+|+.||
T Consensus       159 ~~~l~e~f~~Lr~~~fG~vL~-l~--~~P~Aiqlv~k~-es~~-wv~~D~iNgG~Dp~~~~~SpGSiL~  222 (264)
T PF07395_consen  159 KEHLAEFFSELRHMIFGSVLF-LN--GQPCAIQLVYKV-ESPK-WVYFDYINGGYDPECRDFSPGSILM  222 (264)
T ss_pred             HHHHHHHHHHhHHhheeeEEE-EC--CcceEEEEEEEe-cCCC-eEEEecccCccCcccccCCCccEEE
Confidence            12233333211    111222 33  899998655432 2222 2332  3445899999999998876


No 182
>PF09390 DUF1999:  Protein of unknown function (DUF1999);  InterPro: IPR018987  This family contains a putative Fe-S binding reductase (Q72J89 from SWISSPROT) whose structure adopts an alpha and beta fold. ; PDB: 2D4O_A 2D4P_A.
Probab=47.83  E-value=65  Score=23.70  Aligned_cols=56  Identities=11%  Similarity=0.073  Sum_probs=31.4

Q ss_pred             CceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCHHHHHHHHHHh
Q psy838           92 RACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGLGKFMMQAFYT  149 (152)
Q Consensus        92 ~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~  149 (152)
                      ..|-+++.++ ++++.||+.....-.-..|.+.+..|.+.|. +....-.-||.++.+
T Consensus        54 sgHSFvA~~e-~~~~~GfvLAQaVWQGdrptVlV~ri~~~~~-~~~~~~~GLLrAvvK  109 (161)
T PF09390_consen   54 SGHSFVAEDE-GGELQGFVLAQAVWQGDRPTVLVRRILLAPG-EPEEVYEGLLRAVVK  109 (161)
T ss_dssp             CS--EEEE-E-TTEEEEEEEEEEEE-SSSEEEEEEEE---EE-SSHHHHHHHHHHHHH
T ss_pred             cCCcEEEEcc-CCceeeeeehhHHhcCCCceEEEEEeecCCC-CcHHHHHHHHHHHHH
Confidence            3445666633 4899999987654333468888999987765 233455555555543


No 183
>PF03604 DNA_RNApol_7kD:  DNA directed RNA polymerase, 7 kDa subunit;  InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=47.69  E-value=7.2  Score=20.98  Aligned_cols=17  Identities=24%  Similarity=0.624  Sum_probs=10.2

Q ss_pred             hhhhcccccccCCCCcc
Q psy838           29 MEITQHAKYTCSFCGKE   45 (152)
Q Consensus        29 ~~~~~~~~~~c~~c~~~   45 (152)
                      +++.....-.||.||-.
T Consensus        10 ~~~~~~~~irC~~CG~R   26 (32)
T PF03604_consen   10 VELKPGDPIRCPECGHR   26 (32)
T ss_dssp             E-BSTSSTSSBSSSS-S
T ss_pred             eEcCCCCcEECCcCCCe
Confidence            34555666788888865


No 184
>PF09943 DUF2175:  Uncharacterized protein conserved in archaea (DUF2175);  InterPro: IPR018686  This family of various hypothetical archaeal proteins has no known function. 
Probab=47.48  E-value=5.4  Score=27.33  Aligned_cols=13  Identities=38%  Similarity=0.912  Sum_probs=10.3

Q ss_pred             cccccCCCCcccc
Q psy838           35 AKYTCSFCGKEYK   47 (152)
Q Consensus        35 ~~~~c~~c~~~~~   47 (152)
                      .+++|-+||+++.
T Consensus         1 ~kWkC~iCg~~I~   13 (101)
T PF09943_consen    1 KKWKCYICGKPIY   13 (101)
T ss_pred             CceEEEecCCeee
Confidence            3689999999933


No 185
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=47.22  E-value=5.9  Score=24.30  Aligned_cols=14  Identities=21%  Similarity=0.721  Sum_probs=10.1

Q ss_pred             ccccCCCCcccccC
Q psy838           36 KYTCSFCGKEYKKS   49 (152)
Q Consensus        36 ~~~c~~c~~~~~~~   49 (152)
                      -..||.||+.++..
T Consensus        17 ~lrCPRC~~~FR~~   30 (65)
T COG4049          17 FLRCPRCGMVFRRR   30 (65)
T ss_pred             eeeCCchhHHHHHh
Confidence            46799999775543


No 186
>PF10571 UPF0547:  Uncharacterised protein family UPF0547;  InterPro: IPR018886  This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases. 
Probab=46.36  E-value=11  Score=19.21  Aligned_cols=8  Identities=38%  Similarity=0.978  Sum_probs=4.1

Q ss_pred             cccCCCCc
Q psy838           37 YTCSFCGK   44 (152)
Q Consensus        37 ~~c~~c~~   44 (152)
                      -.||.||-
T Consensus        15 ~~Cp~CG~   22 (26)
T PF10571_consen   15 KFCPHCGY   22 (26)
T ss_pred             CcCCCCCC
Confidence            34555553


No 187
>smart00504 Ubox Modified RING finger domain. Modified RING finger domain, without the full complement of Zn2+-binding ligands. Probable involvement in E2-dependent ubiquitination.
Probab=46.33  E-value=29  Score=20.53  Aligned_cols=25  Identities=20%  Similarity=0.303  Sum_probs=15.1

Q ss_pred             ccccc-hhHhHhHHhhhhhcccccccCCCCccc
Q psy838           15 GTRYG-ASLRKMVKKMEITQHAKYTCSFCGKEY   46 (152)
Q Consensus        15 g~ryg-~~~rk~~~~~~~~~~~~~~c~~c~~~~   46 (152)
                      |-.|. ..+.+.+++       .-+||.|++.+
T Consensus        20 G~v~~~~~i~~~~~~-------~~~cP~~~~~~   45 (63)
T smart00504       20 GQTYERRAIEKWLLS-------HGTDPVTGQPL   45 (63)
T ss_pred             CCEEeHHHHHHHHHH-------CCCCCCCcCCC
Confidence            45565 334444433       45899998765


No 188
>TIGR03147 cyt_nit_nrfF cytochrome c nitrite reductase, accessory protein NrfF.
Probab=46.25  E-value=24  Score=25.18  Aligned_cols=54  Identities=15%  Similarity=0.280  Sum_probs=29.8

Q ss_pred             cccccCCCCcccccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcCCCceEEE
Q psy838           35 AKYTCSFCGKEYKKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFDDRACYLV   97 (152)
Q Consensus        35 ~~~~c~~c~~~~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~~~v   97 (152)
                      +...||.|...     .+..++.+-.-++-..    .++.-..|.+++++.+-|.+....+++
T Consensus        39 ~~LRC~vCqnq-----siadS~a~iA~dmR~~----Vr~~i~~G~Sd~eI~~~~v~RYG~~Vl   92 (126)
T TIGR03147        39 KSLRCPQCQNQ-----NLVESNSPIAYDLRHE----VYSMVNEGKSNQQIIDFMTARFGDFVL   92 (126)
T ss_pred             HhCCCCCCCCC-----ChhhcCCHHHHHHHHH----HHHHHHcCCCHHHHHHHHHHhcCCeEE
Confidence            35789999755     3333332222233221    222224578999998888776544443


No 189
>PF03884 DUF329:  Domain of unknown function (DUF329);  InterPro: IPR005584 The biological function of these short proteins is unknown, but they contain four conserved cysteines, suggesting that they all bind zinc. YacG (Q5X8H6 from SWISSPROT) from Escherichia coli has been shown to bind zinc and contains the structural motifs typical of zinc-binding proteins []. The conserved four cysteine motif in these proteins (-C-X(2)-C-X(15)-C-X(3)-C-) is not found in other zinc-binding proteins with known structures.; GO: 0008270 zinc ion binding; PDB: 1LV3_A.
Probab=46.18  E-value=8  Score=23.71  Aligned_cols=13  Identities=31%  Similarity=0.754  Sum_probs=6.7

Q ss_pred             ccccCCCCccccc
Q psy838           36 KYTCSFCGKEYKK   48 (152)
Q Consensus        36 ~~~c~~c~~~~~~   48 (152)
                      +..||.|||...+
T Consensus         2 ~v~CP~C~k~~~~   14 (57)
T PF03884_consen    2 TVKCPICGKPVEW   14 (57)
T ss_dssp             EEE-TTT--EEE-
T ss_pred             cccCCCCCCeecc
Confidence            4689999998544


No 190
>PHA00626 hypothetical protein
Probab=45.71  E-value=9.9  Score=23.29  Aligned_cols=15  Identities=27%  Similarity=0.561  Sum_probs=11.6

Q ss_pred             hcccccccCCCCccc
Q psy838           32 TQHAKYTCSFCGKEY   46 (152)
Q Consensus        32 ~~~~~~~c~~c~~~~   46 (152)
                      .+...|.||.||-.+
T Consensus        19 ~~snrYkCkdCGY~f   33 (59)
T PHA00626         19 GWSDDYVCCDCGYND   33 (59)
T ss_pred             ccCcceEcCCCCCee
Confidence            346789999999653


No 191
>PF06559 DCD:  2'-deoxycytidine 5'-triphosphate deaminase (DCD);  InterPro: IPR010550 This family consists of several bacterial 2'-deoxycytidine 5'-triphosphate deaminase proteins (3.5.4.13 from EC).; GO: 0008829 dCTP deaminase activity; PDB: 2R9Q_C.
Probab=45.67  E-value=18  Score=30.16  Aligned_cols=34  Identities=18%  Similarity=0.121  Sum_probs=13.4

Q ss_pred             CCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCH
Q psy838          103 SSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGL  139 (152)
Q Consensus       103 ~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGi  139 (152)
                      +|++||-+.|....+..+   .+|.-.+...|||||+
T Consensus       323 hGQ~vgrLvyE~m~~~P~---~lYG~~~gSnYq~QgL  356 (364)
T PF06559_consen  323 HGQIVGRLVYERMAERPE---RLYGAGIGSNYQGQGL  356 (364)
T ss_dssp             TT-EEEEEEEEEBSS-------TTSS-----------
T ss_pred             CCcEEEEEEehhhccCcc---ccccccccccchhhhh
Confidence            399999999987644321   1455668999999997


No 192
>PF02891 zf-MIZ:  MIZ/SP-RING zinc finger;  InterPro: IPR004181 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents MIZ-type zinc finger domains. Miz1 (Msx-interacting-zinc finger) is a zinc finger-containing protein with homology to the yeast protein, Nfi-1. Miz1 is a sequence specific DNA binding protein that can function as a positive-acting transcription factor. Miz1 binds to the homeobox protein Msx2, enhancing the specific DNA-binding ability of Msx2 []. Other proteins containing this domain include the human pias family (protein inhibitor of activated STAT protein). More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 3I2D_A.
Probab=45.06  E-value=9  Score=22.57  Aligned_cols=11  Identities=36%  Similarity=1.065  Sum_probs=3.9

Q ss_pred             cccccCCCCcc
Q psy838           35 AKYTCSFCGKE   45 (152)
Q Consensus        35 ~~~~c~~c~~~   45 (152)
                      ..+.||.|+++
T Consensus        40 ~~W~CPiC~~~   50 (50)
T PF02891_consen   40 PKWKCPICNKP   50 (50)
T ss_dssp             ---B-TTT---
T ss_pred             CCeECcCCcCc
Confidence            45899999874


No 193
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=44.88  E-value=10  Score=27.88  Aligned_cols=7  Identities=57%  Similarity=1.777  Sum_probs=3.6

Q ss_pred             ccCCCCc
Q psy838           38 TCSFCGK   44 (152)
Q Consensus        38 ~c~~c~~   44 (152)
                      .||+||.
T Consensus         2 ~cp~c~~    8 (154)
T PRK00464          2 RCPFCGH    8 (154)
T ss_pred             cCCCCCC
Confidence            3555554


No 194
>PF13530 SCP2_2:  Sterol carrier protein domain; PDB: 3SXN_C 3N7Z_A 3RYO_B 3R1K_A 3UY5_A 2HV2_F 2I00_D 2OZG_A.
Probab=44.62  E-value=1.2e+02  Score=22.96  Aligned_cols=52  Identities=19%  Similarity=0.266  Sum_probs=31.5

Q ss_pred             CceEEEEEeCCCCCeEEEEEEEeecc-CCcceEEEeee-eeccccccCCHHHHHHHHHHh
Q psy838           92 RACYLVAKNGSSSTPVAFSHFRFDVD-FGEPVLYCYEL-QLEKQVQRKGLGKFMMQAFYT  149 (152)
Q Consensus        92 ~~~~~v~~~~~~~~ivGf~~~~~~~~-~~~~~~~I~~i-~V~p~~qgkGiG~~Lm~~l~~  149 (152)
                      +..++++.++ +|++.||+.|.+... .....+.|.++ +.+|+.+     +.|++.+-.
T Consensus        23 ~~~~~~~~~~-~g~~~GY~~y~~~~~~~~~~~l~V~El~~~~~~A~-----~aLl~fl~~   76 (218)
T PF13530_consen   23 DRGYAVYYDE-DGEPDGYVIYRFKDDWEPGGTLEVRELVALDPEAY-----RALLAFLAS   76 (218)
T ss_dssp             GSEEEEEEEC-TSEEEEEEEEEEET-SSSTTEEEEEEEEESSHHHH-----HHHHHHHHT
T ss_pred             CceEEEEECC-CCCeeEEEEEEEcccCCCCceEEEEEEEeCCHHHH-----HHHHHHHHh
Confidence            3444555554 599999999998642 22346778885 5666643     345554443


No 195
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=43.87  E-value=7.9  Score=22.32  Aligned_cols=17  Identities=29%  Similarity=0.526  Sum_probs=12.0

Q ss_pred             hhhhcccccccCCCCcc
Q psy838           29 MEITQHAKYTCSFCGKE   45 (152)
Q Consensus        29 ~~~~~~~~~~c~~c~~~   45 (152)
                      +++.....-.||.||..
T Consensus        12 ~~~~~~~~irC~~CG~r   28 (44)
T smart00659       12 NEIKSKDVVRCRECGYR   28 (44)
T ss_pred             eecCCCCceECCCCCce
Confidence            44456677788888876


No 196
>COG3024 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=43.72  E-value=14  Score=23.14  Aligned_cols=16  Identities=25%  Similarity=0.449  Sum_probs=11.6

Q ss_pred             cccccccCCCCccccc
Q psy838           33 QHAKYTCSFCGKEYKK   48 (152)
Q Consensus        33 ~~~~~~c~~c~~~~~~   48 (152)
                      ....-.||.|||+..+
T Consensus         4 ~~~~v~CP~Cgkpv~w   19 (65)
T COG3024           4 LRITVPCPTCGKPVVW   19 (65)
T ss_pred             ccccccCCCCCCcccc
Confidence            3456689999998444


No 197
>PF04959 ARS2:  Arsenite-resistance protein 2;  InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=42.33  E-value=4.7  Score=31.35  Aligned_cols=15  Identities=40%  Similarity=1.083  Sum_probs=9.8

Q ss_pred             cccccccCCCCcccc
Q psy838           33 QHAKYTCSFCGKEYK   47 (152)
Q Consensus        33 ~~~~~~c~~c~~~~~   47 (152)
                      ...||.||.|+|.++
T Consensus        74 ~~~K~~C~lc~KlFk   88 (214)
T PF04959_consen   74 DEDKWRCPLCGKLFK   88 (214)
T ss_dssp             SSEEEEE-SSS-EES
T ss_pred             cCCEECCCCCCcccC
Confidence            345899999999843


No 198
>PF10013 DUF2256:  Uncharacterized protein conserved in bacteria (DUF2256);  InterPro: IPR017136 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=41.80  E-value=12  Score=21.54  Aligned_cols=15  Identities=20%  Similarity=0.696  Sum_probs=11.2

Q ss_pred             ccccCCCCcccccCC
Q psy838           36 KYTCSFCGKEYKKST   50 (152)
Q Consensus        36 ~~~c~~c~~~~~~~~   50 (152)
                      .=.||.||+++.+-.
T Consensus         8 ~K~C~~C~rpf~WRK   22 (42)
T PF10013_consen    8 SKICPVCGRPFTWRK   22 (42)
T ss_pred             CCcCcccCCcchHHH
Confidence            347999999876544


No 199
>PF09855 DUF2082:  Nucleic-acid-binding protein containing Zn-ribbon domain (DUF2082);  InterPro: IPR018652  This family of proteins contains various hypothetical prokaryotic proteins as well as some Zn-ribbon nucleic-acid-binding proteins.
Probab=41.54  E-value=12  Score=23.44  Aligned_cols=9  Identities=56%  Similarity=1.442  Sum_probs=7.0

Q ss_pred             cccCCCCcc
Q psy838           37 YTCSFCGKE   45 (152)
Q Consensus        37 ~~c~~c~~~   45 (152)
                      |.||.||..
T Consensus         1 y~C~KCg~~    9 (64)
T PF09855_consen    1 YKCPKCGNE    9 (64)
T ss_pred             CCCCCCCCc
Confidence            689999853


No 200
>COG4391 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=41.16  E-value=12  Score=23.33  Aligned_cols=15  Identities=20%  Similarity=0.928  Sum_probs=11.0

Q ss_pred             cccccccCCCCcccc
Q psy838           33 QHAKYTCSFCGKEYK   47 (152)
Q Consensus        33 ~~~~~~c~~c~~~~~   47 (152)
                      ....-.||.||+.++
T Consensus        45 ~~gev~CPYC~t~y~   59 (62)
T COG4391          45 DEGEVVCPYCSTRYR   59 (62)
T ss_pred             CCCcEecCccccEEE
Confidence            344568999998754


No 201
>PRK00114 hslO Hsp33-like chaperonin; Reviewed
Probab=41.11  E-value=19  Score=29.10  Aligned_cols=26  Identities=27%  Similarity=0.590  Sum_probs=17.7

Q ss_pred             cccccccCCCCcccccCCCCCHHHHHHHHH
Q psy838           33 QHAKYTCSFCGKEYKKSTDLGAETKKWVWE   62 (152)
Q Consensus        33 ~~~~~~c~~c~~~~~~~~~l~~~d~~~i~~   62 (152)
                      ....-+|.||++.|+    .+++|++.+++
T Consensus       265 ~~iev~C~FC~~~Y~----f~~~dl~~l~~  290 (293)
T PRK00114        265 GGAEMVCQFCGNKYL----FDEEDLEELIA  290 (293)
T ss_pred             CCEEEEEeCCCCEEE----eCHHHHHHHHh
Confidence            345778999988753    56667766653


No 202
>PF13248 zf-ribbon_3:  zinc-ribbon domain
Probab=41.00  E-value=13  Score=18.74  Aligned_cols=10  Identities=40%  Similarity=0.899  Sum_probs=7.4

Q ss_pred             cccCCCCccc
Q psy838           37 YTCSFCGKEY   46 (152)
Q Consensus        37 ~~c~~c~~~~   46 (152)
                      ..||.||+..
T Consensus         3 ~~Cp~Cg~~~   12 (26)
T PF13248_consen    3 MFCPNCGAEI   12 (26)
T ss_pred             CCCcccCCcC
Confidence            4799998753


No 203
>PF10588 NADH-G_4Fe-4S_3:  NADH-ubiquinone oxidoreductase-G iron-sulfur binding region;  InterPro: IPR019574  NADH:ubiquinone oxidoreductase (complex I) (1.6.5.3 from EC) is a respiratory-chain enzyme that catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane (NADH + ubiquinone = NAD+ + ubiquinol) []. Complex I is a major source of reactive oxygen species (ROS) that are predominantly formed by electron transfer from FMNH(2). Complex I is found in bacteria, cyanobacteria (as a NADH-plastoquinone oxidoreductase), archaea [], mitochondira, and in the hydrogenosome, a mitochondria-derived organelle. In general, the bacterial complex consists of 14 different subunits, while the mitochondrial complex contains homologues to these subunits in addition to approximately 31 additional proteins []. Mitochondrial complex I, which is located in the inner mitochondrial membrane, is the largest multimeric respiratory enzyme in the mitochondria, consisting of more than 40 subunits, one FMN co-factor and eight FeS clusters []. The assembly of mitochondrial complex I is an intricate process that requires the cooperation of the nuclear and mitochondrial genomes [, ]. Mitochondrial complex I can cycle between active and deactive forms that can be distinguished by the reactivity towards divalent cations and thiol-reactive agents. All redox prosthetic groups reside in the peripheral arm of the L-shaped structure. The NADH oxidation domain harbouring the FMN cofactor is connected via a chain of iron-sulphur clusters to the ubiquinone reduction site that is located in a large pocket formed by the PSST and 49kDa subunits of complex I []. This entry describes the G subunit (one of 14 subunits, A to N) of the NADH-quinone oxidoreductase complex I which generally couples NADH and ubiquinone oxidation/reduction in bacteria and mammalian mitochondria while translocating protons, but may act on NADPH and/or plastoquinone in cyanobacteria and plant chloroplasts. This family does not contain related subunits from formate dehydrogenase complexes.  This entry represents the iron-sulphur binding domain of the G subunit.; GO: 0016491 oxidoreductase activity, 0055114 oxidation-reduction process; PDB: 3M9S_C 2FUG_L 3IAS_L 2YBB_3 3IAM_3 3I9V_3.
Probab=40.82  E-value=21  Score=20.04  Aligned_cols=22  Identities=27%  Similarity=0.670  Sum_probs=11.3

Q ss_pred             hHhHHhhhhhcccccccCCCCcc
Q psy838           23 RKMVKKMEITQHAKYTCSFCGKE   45 (152)
Q Consensus        23 rk~~~~~~~~~~~~~~c~~c~~~   45 (152)
                      ||.+-+.-.+.| +..|+.|.+.
T Consensus         1 Rr~~lelll~~H-~~dC~~C~~~   22 (41)
T PF10588_consen    1 RRTVLELLLANH-PLDCPTCDKN   22 (41)
T ss_dssp             -HHHHHHHHTT-----TTT-TTG
T ss_pred             CHHHHHHHHhCC-CCcCcCCCCC
Confidence            556666666666 7889999865


No 204
>PF14255 Cys_rich_CPXG:  Cysteine-rich CPXCG
Probab=40.51  E-value=6.7  Score=23.55  Aligned_cols=9  Identities=33%  Similarity=1.298  Sum_probs=7.1

Q ss_pred             cccCCCCcc
Q psy838           37 YTCSFCGKE   45 (152)
Q Consensus        37 ~~c~~c~~~   45 (152)
                      ..||.||..
T Consensus         1 i~CPyCge~    9 (52)
T PF14255_consen    1 IQCPYCGEP    9 (52)
T ss_pred             CCCCCCCCe
Confidence            368999877


No 205
>KOG2907|consensus
Probab=40.49  E-value=8.9  Score=26.81  Aligned_cols=11  Identities=45%  Similarity=0.945  Sum_probs=9.4

Q ss_pred             cccccCCCCcc
Q psy838           35 AKYTCSFCGKE   45 (152)
Q Consensus        35 ~~~~c~~c~~~   45 (152)
                      -++.||.||..
T Consensus        73 I~~kCpkCghe   83 (116)
T KOG2907|consen   73 IKHKCPKCGHE   83 (116)
T ss_pred             hhccCcccCCc
Confidence            37999999977


No 206
>PF13909 zf-H2C2_5:  C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=40.44  E-value=13  Score=17.92  Aligned_cols=7  Identities=43%  Similarity=1.502  Sum_probs=4.9

Q ss_pred             cccCCCC
Q psy838           37 YTCSFCG   43 (152)
Q Consensus        37 ~~c~~c~   43 (152)
                      |.|+.|.
T Consensus         1 y~C~~C~    7 (24)
T PF13909_consen    1 YKCPHCS    7 (24)
T ss_dssp             EE-SSSS
T ss_pred             CCCCCCC
Confidence            6799997


No 207
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=40.33  E-value=21  Score=26.76  Aligned_cols=13  Identities=15%  Similarity=0.462  Sum_probs=8.5

Q ss_pred             cccccccCCCCcc
Q psy838           33 QHAKYTCSFCGKE   45 (152)
Q Consensus        33 ~~~~~~c~~c~~~   45 (152)
                      ...-|.||.|++.
T Consensus       114 ~~~~Y~Cp~C~~r  126 (178)
T PRK06266        114 NNMFFFCPNCHIR  126 (178)
T ss_pred             CCCEEECCCCCcE
Confidence            3456777777765


No 208
>PF13945 NST1:  Salt tolerance down-regulator
Probab=40.25  E-value=26  Score=26.71  Aligned_cols=34  Identities=18%  Similarity=0.419  Sum_probs=21.4

Q ss_pred             hcccccccCCCCcccccCCCCCHHHHHHHHHHHHHhHH
Q psy838           32 TQHAKYTCSFCGKEYKKSTDLGAETKKWVWELEEKNMK   69 (152)
Q Consensus        32 ~~~~~~~c~~c~~~~~~~~~l~~~d~~~i~~l~~~~m~   69 (152)
                      .|+-.-.|++||+.    ..+=.++++.|++...+.+.
T Consensus       137 qq~h~C~C~vCgr~----~~~ie~ele~ly~~~y~~l~  170 (190)
T PF13945_consen  137 QQKHSCSCSVCGRK----RTAIEEELERLYDAYYEELE  170 (190)
T ss_pred             HhccCcccHHHhch----hhHHHHHHHHHHHHHHHHHH
Confidence            45445668999976    23445668888776555443


No 209
>KOG2535|consensus
Probab=40.13  E-value=16  Score=30.83  Aligned_cols=21  Identities=19%  Similarity=0.407  Sum_probs=18.5

Q ss_pred             cccccCCHHHHHHHHHHhhcC
Q psy838          132 KQVQRKGLGKFMMQAFYTYID  152 (152)
Q Consensus       132 p~~qgkGiG~~Lm~~l~~~a~  152 (152)
                      ..||.||.|..||+..+.+|+
T Consensus       497 ~KfQHQG~GtLLmeEAERIAr  517 (554)
T KOG2535|consen  497 TKFQHQGFGTLLMEEAERIAR  517 (554)
T ss_pred             hhhhhcchhhHHHHHHHHHHH
Confidence            369999999999999998873


No 210
>PF08772 NOB1_Zn_bind:  Nin one binding (NOB1) Zn-ribbon like;  InterPro: IPR014881 This entry corresponds to a zinc ribbon and is found on the RNA binding protein NOB1. ; PDB: 2CON_A.
Probab=39.80  E-value=13  Score=23.98  Aligned_cols=13  Identities=31%  Similarity=0.692  Sum_probs=5.9

Q ss_pred             cccccccCCCCcc
Q psy838           33 QHAKYTCSFCGKE   45 (152)
Q Consensus        33 ~~~~~~c~~c~~~   45 (152)
                      .-.+..||.||-.
T Consensus        21 ~~~k~FCp~CGn~   33 (73)
T PF08772_consen   21 DMTKQFCPKCGNA   33 (73)
T ss_dssp             -SS--S-SSS--S
T ss_pred             CCCceeCcccCCC
Confidence            3468899999988


No 211
>PF10122 Mu-like_Com:  Mu-like prophage protein Com;  InterPro: IPR019294  Members of this entry belong to the Com family of proteins that act as translational regulators of mom [, ]. 
Probab=39.40  E-value=18  Score=21.66  Aligned_cols=22  Identities=18%  Similarity=0.317  Sum_probs=14.8

Q ss_pred             HhHHhhhhhcccccccCCCCcc
Q psy838           24 KMVKKMEITQHAKYTCSFCGKE   45 (152)
Q Consensus        24 k~~~~~~~~~~~~~~c~~c~~~   45 (152)
                      |.+.+.....+-.-.||.||+.
T Consensus        12 klLa~~g~~~~leIKCpRC~ti   33 (51)
T PF10122_consen   12 KLLAKAGEVIELEIKCPRCKTI   33 (51)
T ss_pred             HHHhhhcCccEEEEECCCCCcc
Confidence            4444544455566789999988


No 212
>KOG4317|consensus
Probab=38.80  E-value=14  Score=30.52  Aligned_cols=14  Identities=36%  Similarity=0.712  Sum_probs=11.9

Q ss_pred             hcccccccCCCCcc
Q psy838           32 TQHAKYTCSFCGKE   45 (152)
Q Consensus        32 ~~~~~~~c~~c~~~   45 (152)
                      .|.+.|+||.|.-.
T Consensus        15 vq~~~YtCPRCn~~   28 (383)
T KOG4317|consen   15 VQKREYTCPRCNLL   28 (383)
T ss_pred             cccccccCCCCCcc
Confidence            46677999999988


No 213
>KOG3965|consensus
Probab=38.78  E-value=12  Score=27.58  Aligned_cols=26  Identities=27%  Similarity=0.509  Sum_probs=22.5

Q ss_pred             eeeccccccccchhHhHhHHhhhhhc
Q psy838            8 VGITGKYGTRYGASLRKMVKKMEITQ   33 (152)
Q Consensus         8 ~~~~~~~g~ryg~~~rk~~~~~~~~~   33 (152)
                      -+|.+.|+.|||..+-.+.+.+|...
T Consensus       117 d~ISP~y~E~Fg~~l~q~L~~lek~~  142 (160)
T KOG3965|consen  117 DDISPLYRERFGNKLEQRLKRLEKQR  142 (160)
T ss_pred             hhcChHHHHHHHHHHHHHHHHHHHHh
Confidence            46899999999999999999988653


No 214
>PF11432 DUF3197:  Protein of unknown function (DUF3197);  InterPro: IPR024443 This domain, whose function is unknown, is found in bacterial proteins.; PDB: 1WN9_A 1WNA_A.
Probab=38.52  E-value=19  Score=24.97  Aligned_cols=49  Identities=12%  Similarity=0.231  Sum_probs=27.5

Q ss_pred             eccccccccchhHhHhHHhhh---hhcccccccCCCCcccccCCCCCHHHHHHHHHHHHHhH
Q psy838           10 ITGKYGTRYGASLRKMVKKME---ITQHAKYTCSFCGKEYKKSTDLGAETKKWVWELEEKNM   68 (152)
Q Consensus        10 ~~~~~g~ryg~~~rk~~~~~~---~~~~~~~~c~~c~~~~~~~~~l~~~d~~~i~~l~~~~m   68 (152)
                      +..-||||||..=....+++-   ..+-.++.          -..++++++..++++..+..
T Consensus        46 t~dAFGPafG~~G~~ALaELv~wl~~~G~~f~----------EaVl~p~e~~~ll~~p~~~~   97 (113)
T PF11432_consen   46 TPDAFGPAFGPEGERALAELVRWLQERGARFY----------EAVLSPSEFAALLELPPEEE   97 (113)
T ss_dssp             EEEEESTTS-TTHHHHHHHHHHHHHHTT-EEE----------EEEE-GGGHHHHHTS-HHHH
T ss_pred             cccccCcccCccHHHHHHHHHHHHHHcCCchh----------heecCHHHHHHHHcCCcHHH
Confidence            456799999854444444332   22222222          45678999999998877653


No 215
>PF05191 ADK_lid:  Adenylate kinase, active site lid;  InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=38.38  E-value=12  Score=20.54  Aligned_cols=12  Identities=33%  Similarity=0.849  Sum_probs=9.0

Q ss_pred             cccCCCCccccc
Q psy838           37 YTCSFCGKEYKK   48 (152)
Q Consensus        37 ~~c~~c~~~~~~   48 (152)
                      ..||.||+.|+.
T Consensus         2 r~C~~Cg~~Yh~   13 (36)
T PF05191_consen    2 RICPKCGRIYHI   13 (36)
T ss_dssp             EEETTTTEEEET
T ss_pred             cCcCCCCCcccc
Confidence            479999988553


No 216
>COG1998 RPS31 Ribosomal protein S27AE [Translation, ribosomal structure and biogenesis]
Probab=37.90  E-value=16  Score=21.83  Aligned_cols=12  Identities=25%  Similarity=0.686  Sum_probs=9.2

Q ss_pred             cccccccCCCCc
Q psy838           33 QHAKYTCSFCGK   44 (152)
Q Consensus        33 ~~~~~~c~~c~~   44 (152)
                      -+.+-.||.||-
T Consensus        16 ~rk~~~CPrCG~   27 (51)
T COG1998          16 KRKNRFCPRCGP   27 (51)
T ss_pred             EEccccCCCCCC
Confidence            456778999994


No 217
>cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer.
Probab=37.64  E-value=17  Score=21.53  Aligned_cols=13  Identities=23%  Similarity=0.782  Sum_probs=9.8

Q ss_pred             cccccccCCCCcc
Q psy838           33 QHAKYTCSFCGKE   45 (152)
Q Consensus        33 ~~~~~~c~~c~~~   45 (152)
                      .-..+.||.||..
T Consensus        31 Lp~~w~CP~C~a~   43 (50)
T cd00730          31 LPDDWVCPVCGAG   43 (50)
T ss_pred             CCCCCCCCCCCCc
Confidence            3457899999865


No 218
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=37.22  E-value=15  Score=19.96  Aligned_cols=10  Identities=40%  Similarity=1.152  Sum_probs=8.1

Q ss_pred             ccccCCCCcc
Q psy838           36 KYTCSFCGKE   45 (152)
Q Consensus        36 ~~~c~~c~~~   45 (152)
                      .|.|+.||+.
T Consensus         5 ~y~C~~Cg~~   14 (41)
T smart00834        5 EYRCEDCGHT   14 (41)
T ss_pred             EEEcCCCCCE
Confidence            4789999886


No 219
>cd00246 RabGEF Nucleotide exchange factor for Rab-like small GTPases (RabGEF), Mss4 type; RabGEF positely regulates the function of  Rab GTPase by promoting exchange of GDP for GTP; members of the Rab subfamily of Ras GTPases are important in vesicular transport;
Probab=37.19  E-value=18  Score=24.90  Aligned_cols=13  Identities=23%  Similarity=0.544  Sum_probs=10.4

Q ss_pred             ccccCCCCccccc
Q psy838           36 KYTCSFCGKEYKK   48 (152)
Q Consensus        36 ~~~c~~c~~~~~~   48 (152)
                      +..|++|+..+++
T Consensus         2 ~v~C~~C~S~VL~   14 (103)
T cd00246           2 AVLCQRCGSRVLT   14 (103)
T ss_pred             ceECCCCCCEEEe
Confidence            4679999999663


No 220
>PRK02983 lysS lysyl-tRNA synthetase; Provisional
Probab=37.15  E-value=4e+02  Score=25.89  Aligned_cols=114  Identities=18%  Similarity=0.173  Sum_probs=60.5

Q ss_pred             hhHhHhHHhhhhhcccccccCCCCcccccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCC--hhhhhhhhcCCCceEEE
Q psy838           20 ASLRKMVKKMEITQHAKYTCSFCGKEYKKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWD--PQGKHSEMFDDRACYLV   97 (152)
Q Consensus        20 ~~~rk~~~~~~~~~~~~~~c~~c~~~~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~--~~~~~~el~~~~~~~~v   97 (152)
                      ..+|..+++.+..   -|+..     +.+..+++++.++.+-++.++=....-   .-+.+  .-.+ .+..+....++.
T Consensus       357 ~~lR~a~nra~r~---G~t~~-----i~~~~~~~~~~~~~L~~isd~Wl~~~~---EkGFSm~LGr~-~~~~~~~~~i~~  424 (1094)
T PRK02983        357 RPVRQAVTRVRRA---GYTVR-----IRRHRDLPAEEMAQVIARADAWRDTET---ERGFSMALGRL-GDPADGDCLLVE  424 (1094)
T ss_pred             HHHHHHHHHHHhC---CCEEE-----EeeCCCCCHHHHHHHHHHHHHHhcCCC---CCceeeecCcc-cchhcCceEEEE
Confidence            6788888877765   33311     334556777777777776654211100   01111  0000 111222333233


Q ss_pred             EEeCCCCCeEEEEEEEeeccCCcceEEEeeee-eccccccCCHHHHHHHHHHhhc
Q psy838           98 AKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQ-LEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus        98 ~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~-V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      +.++ +|+++||+.+.+...++   + -.|+. -+|+. =.|+=-.|+..++..+
T Consensus       425 a~d~-~G~i~af~s~~p~~~~g---~-slDLMRr~pda-pnGvmE~L~~~l~~~~  473 (1094)
T PRK02983        425 AHDA-DGQVVALLSFVPWGRRG---L-SLDLMRRSPDA-PNGVIELMVAELALEA  473 (1094)
T ss_pred             EECC-CCeEEEEEEEeeeCCCC---E-EEEecccCCCC-CCCHHHHHHHHHHHHH
Confidence            3333 58999999999854332   2 33444 34554 6888888888777654


No 221
>PF10892 DUF2688:  Protein of unknown function (DUF2688);  InterPro: IPR024392 Members of this protein family are annotated as KleB, and may play a role in the regulation of transcription in plasmids.
Probab=36.22  E-value=5.7  Score=24.27  Aligned_cols=30  Identities=30%  Similarity=0.569  Sum_probs=18.2

Q ss_pred             HhhhhhcccccccCCCCcccccC--CCCCHHHHHH
Q psy838           27 KKMEITQHAKYTCSFCGKEYKKS--TDLGAETKKW   59 (152)
Q Consensus        27 ~~~~~~~~~~~~c~~c~~~~~~~--~~l~~~d~~~   59 (152)
                      ++||+.   .-.|..|||.+++.  +.+..+|+..
T Consensus         4 ~kieIV---~t~CrRCGk~i~tl~~SL~Gad~lr~   35 (60)
T PF10892_consen    4 GKIEIV---ETPCRRCGKSIRTLSRSLIGADDLRV   35 (60)
T ss_pred             CceEee---eehhhhhCccHHHHHHHhhChHHHHH
Confidence            355555   56799999995544  3444555444


No 222
>PF13878 zf-C2H2_3:  zinc-finger of acetyl-transferase ESCO
Probab=36.11  E-value=20  Score=20.16  Aligned_cols=16  Identities=31%  Similarity=0.673  Sum_probs=11.9

Q ss_pred             ccccCCCCcccccCCC
Q psy838           36 KYTCSFCGKEYKKSTD   51 (152)
Q Consensus        36 ~~~c~~c~~~~~~~~~   51 (152)
                      .-+||.||-.|....+
T Consensus        13 ~~~C~~CgM~Y~~~~~   28 (41)
T PF13878_consen   13 ATTCPTCGMLYSPGSP   28 (41)
T ss_pred             CcCCCCCCCEECCCCH
Confidence            5689999987766543


No 223
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=35.79  E-value=16  Score=26.74  Aligned_cols=12  Identities=8%  Similarity=0.177  Sum_probs=6.2

Q ss_pred             ccccccCCCCcc
Q psy838           34 HAKYTCSFCGKE   45 (152)
Q Consensus        34 ~~~~~c~~c~~~   45 (152)
                      ..-|.||.|++.
T Consensus       107 ~~~Y~Cp~c~~r  118 (158)
T TIGR00373       107 NMFFICPNMCVR  118 (158)
T ss_pred             CCeEECCCCCcE
Confidence            344556666543


No 224
>cd03173 DUF619-like DUF619 domain of various N-acetylglutamate Kinases and N-acetylglutamate Synthases. DUF619-like: This family includes the DUF619 domain of various N-acetylglutamate synthases (NAGS) of the urea cycle found in humans and fish, the DUF619 domain of the NAGS of the fungal arginine-biosynthetic pathway (FABP), as well as the DUF619 domain present C-terminal of a NAG kinase-like domain in a limited number of predicted NAGSs found in bacteria and Dictyostelium. Ureogenic NAGS is a mitochondrial enzyme catalyzing the formation of NAG from acetylcoenzyme A and L-glutamate. NAGS is an essential allosteric activator of carbamylphosphate synthase I, the first and rate limiting enzyme of the urea cycle. Domain architecture of ureogenic and fungal NAGS consists of an N-terminal NAG kinase-like domain and a C-terminal DUF619 domain. This subgroup also includes the DUF619 domain of the FABP N-acetylglutamate kinase (NAGK), the enzyme that catalyzes the second reaction of arginine 
Probab=35.76  E-value=1.1e+02  Score=20.77  Aligned_cols=30  Identities=3%  Similarity=0.129  Sum_probs=26.5

Q ss_pred             cceEEEeeeeeccccccCCHHHHHHHHHHh
Q psy838          120 EPVLYCYELQLEKQVQRKGLGKFMMQAFYT  149 (152)
Q Consensus       120 ~~~~~I~~i~V~p~~qgkGiG~~Lm~~l~~  149 (152)
                      ..+-||..+.|.+.-++.|++..+.+.+.+
T Consensus        31 ~~v~~LdkFav~~~~~~~gv~D~vf~~i~~   60 (98)
T cd03173          31 NSIPYLDKFAVSDHLWLNNVTDNIFNLIRK   60 (98)
T ss_pred             CCCEEEEEEEEcccccccCHHHHHHHHHHh
Confidence            346689999999999999999999998874


No 225
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=35.76  E-value=17  Score=19.64  Aligned_cols=10  Identities=20%  Similarity=0.717  Sum_probs=7.5

Q ss_pred             ccccCCCCcc
Q psy838           36 KYTCSFCGKE   45 (152)
Q Consensus        36 ~~~c~~c~~~   45 (152)
                      +..||.||+.
T Consensus         2 ~~~CP~C~~~   11 (38)
T TIGR02098         2 RIQCPNCKTS   11 (38)
T ss_pred             EEECCCCCCE
Confidence            3578888876


No 226
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=35.61  E-value=18  Score=21.83  Aligned_cols=10  Identities=40%  Similarity=1.172  Sum_probs=7.4

Q ss_pred             ccccCCCCcc
Q psy838           36 KYTCSFCGKE   45 (152)
Q Consensus        36 ~~~c~~c~~~   45 (152)
                      .++||.||..
T Consensus         2 ~~~CP~CG~~   11 (54)
T TIGR01206         2 QFECPDCGAE   11 (54)
T ss_pred             ccCCCCCCCE
Confidence            4578999865


No 227
>PF12874 zf-met:  Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=35.48  E-value=16  Score=17.58  Aligned_cols=12  Identities=25%  Similarity=1.021  Sum_probs=8.7

Q ss_pred             cccCCCCccccc
Q psy838           37 YTCSFCGKEYKK   48 (152)
Q Consensus        37 ~~c~~c~~~~~~   48 (152)
                      |.|+.|++.+.+
T Consensus         1 ~~C~~C~~~f~s   12 (25)
T PF12874_consen    1 FYCDICNKSFSS   12 (25)
T ss_dssp             EEETTTTEEESS
T ss_pred             CCCCCCCCCcCC
Confidence            578999877544


No 228
>COG4311 SoxD Sarcosine oxidase delta subunit [Amino acid transport and metabolism]
Probab=35.29  E-value=24  Score=23.90  Aligned_cols=9  Identities=33%  Similarity=1.335  Sum_probs=7.3

Q ss_pred             cccCCCCcc
Q psy838           37 YTCSFCGKE   45 (152)
Q Consensus        37 ~~c~~c~~~   45 (152)
                      -.||+||-.
T Consensus         4 I~CP~Cg~R   12 (97)
T COG4311           4 IPCPYCGER   12 (97)
T ss_pred             ecCCCCCCC
Confidence            479999975


No 229
>PF12760 Zn_Tnp_IS1595:  Transposase zinc-ribbon domain;  InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=35.16  E-value=19  Score=20.64  Aligned_cols=9  Identities=33%  Similarity=1.294  Sum_probs=7.7

Q ss_pred             cccCCCCcc
Q psy838           37 YTCSFCGKE   45 (152)
Q Consensus        37 ~~c~~c~~~   45 (152)
                      +.||.||..
T Consensus        19 ~~CP~Cg~~   27 (46)
T PF12760_consen   19 FVCPHCGST   27 (46)
T ss_pred             CCCCCCCCe
Confidence            679999975


No 230
>PF08792 A2L_zn_ribbon:  A2L zinc ribbon domain;  InterPro: IPR014900 This zinc ribbon protein is found associated with some viral A2L transcription factors []. 
Probab=34.91  E-value=18  Score=19.53  Aligned_cols=14  Identities=29%  Similarity=0.767  Sum_probs=10.8

Q ss_pred             hcccccccCCCCcc
Q psy838           32 TQHAKYTCSFCGKE   45 (152)
Q Consensus        32 ~~~~~~~c~~c~~~   45 (152)
                      +...-+.|++||..
T Consensus        17 ~~~~~~~C~~Cg~~   30 (33)
T PF08792_consen   17 KEDDYEVCIFCGSS   30 (33)
T ss_pred             ecCCeEEcccCCcE
Confidence            55677899999864


No 231
>PF02591 DUF164:  Putative zinc ribbon domain;  InterPro: IPR003743 This entry describes proteins of unknown function.
Probab=34.77  E-value=25  Score=20.88  Aligned_cols=12  Identities=25%  Similarity=0.609  Sum_probs=9.8

Q ss_pred             ccccccCCCCcc
Q psy838           34 HAKYTCSFCGKE   45 (152)
Q Consensus        34 ~~~~~c~~c~~~   45 (152)
                      ..-.+||.||+.
T Consensus        44 ~~i~~Cp~CgRi   55 (56)
T PF02591_consen   44 DEIVFCPNCGRI   55 (56)
T ss_pred             CCeEECcCCCcc
Confidence            567899999974


No 232
>PRK12380 hydrogenase nickel incorporation protein HybF; Provisional
Probab=34.73  E-value=18  Score=25.00  Aligned_cols=11  Identities=27%  Similarity=0.664  Sum_probs=8.1

Q ss_pred             cccccCCCCcc
Q psy838           35 AKYTCSFCGKE   45 (152)
Q Consensus        35 ~~~~c~~c~~~   45 (152)
                      ..+.||.||..
T Consensus        85 ~~~~CP~Cgs~   95 (113)
T PRK12380         85 HDAQCPHCHGE   95 (113)
T ss_pred             cCccCcCCCCC
Confidence            35569999975


No 233
>PF14206 Cys_rich_CPCC:  Cysteine-rich CPCC
Probab=34.73  E-value=18  Score=23.62  Aligned_cols=10  Identities=50%  Similarity=1.182  Sum_probs=8.0

Q ss_pred             ccccCCCCcc
Q psy838           36 KYTCSFCGKE   45 (152)
Q Consensus        36 ~~~c~~c~~~   45 (152)
                      ||.||-||..
T Consensus         1 K~~CPCCg~~   10 (78)
T PF14206_consen    1 KYPCPCCGYY   10 (78)
T ss_pred             CccCCCCCcE
Confidence            6789999865


No 234
>COG1656 Uncharacterized conserved protein [Function unknown]
Probab=34.12  E-value=18  Score=27.04  Aligned_cols=30  Identities=27%  Similarity=0.370  Sum_probs=18.7

Q ss_pred             hcccccccCCCCcccccCCCCCHHHHHHHHHHHHH
Q psy838           32 TQHAKYTCSFCGKEYKKSTDLGAETKKWVWELEEK   66 (152)
Q Consensus        32 ~~~~~~~c~~c~~~~~~~~~l~~~d~~~i~~l~~~   66 (152)
                      ..+.-+.||.||+.+=.     -+...++.+...+
T Consensus       126 ~~~~f~~C~~CgkiYW~-----GsHw~~m~~~~~~  155 (165)
T COG1656         126 NYEEFYRCPKCGKIYWK-----GSHWRRMVERIVE  155 (165)
T ss_pred             cccceeECCCCcccccC-----chHHHHHHHHHHH
Confidence            34455679999987533     3456666664444


No 235
>COG5595 Zn-ribbon-containing, possibly nucleic-acid-binding protein [General function prediction only]
Probab=33.56  E-value=28  Score=26.96  Aligned_cols=29  Identities=31%  Similarity=0.505  Sum_probs=19.0

Q ss_pred             ccchhHhHhHHhhhh---------------hcccccccCCCCcc
Q psy838           17 RYGASLRKMVKKMEI---------------TQHAKYTCSFCGKE   45 (152)
Q Consensus        17 ryg~~~rk~~~~~~~---------------~~~~~~~c~~c~~~   45 (152)
                      +-|+.++|++.+.--               .+..-..||.||++
T Consensus       184 k~Gr~l~~rIek~t~iptyYylyrVgGnSlAqek~r~CPsC~k~  227 (256)
T COG5595         184 KHGRYLSKRIEKCTGIPTYYYLYRVGGNSLAQEKYRCCPSCGKD  227 (256)
T ss_pred             HHHHHHHHHHHHHhCCCceEEEEEecCchhhhhccCCCCccccc
Confidence            446777777766532               23444569999987


No 236
>COG1693 Repressor of nif and glnA expression [Transcription]
Probab=33.46  E-value=49  Score=27.09  Aligned_cols=60  Identities=23%  Similarity=0.245  Sum_probs=43.5

Q ss_pred             cceeeeccccccccchhHhHh----------HHhhhhhcccccccCCC-----CcccccCCCCCHHHHHHHHHHHHHh
Q psy838            5 TKKVGITGKYGTRYGASLRKM----------VKKMEITQHAKYTCSFC-----GKEYKKSTDLGAETKKWVWELEEKN   67 (152)
Q Consensus         5 ~~~~~~~~~~g~ryg~~~rk~----------~~~~~~~~~~~~~c~~c-----~~~~~~~~~l~~~d~~~i~~l~~~~   67 (152)
                      |+|||.+||==|--|.+--|+          .+.||..   .|.|.|-     |+...-.+.++++|++.++++.+++
T Consensus        56 t~kvgyagr~iTe~Gl~el~~~~~~~R~g~~~SrIeem---~y~t~fdp~t~~G~Vv~N~s~i~k~dfedvl~iv~~~  130 (325)
T COG1693          56 TRKVGYAGREITEKGLKELKRALISERLGFISSRIEEM---AYRTNFDPETLKGKVVVNISVIDKSDFEDVLDIVKEV  130 (325)
T ss_pred             hhhccccceeehHhhHHHHhhhhhHHHHHHHHHHHHHH---HHhccCCchhcCceEEEEEEEecHHHHHHHHHHHHHH
Confidence            678888887666666443333          3455555   6777776     6667777889999999999998864


No 237
>PF10071 DUF2310:  Zn-ribbon-containing, possibly nucleic-acid-binding protein (DUF2310);  InterPro: IPR016908 This group represents uncharacterised conserved proteins.
Probab=33.43  E-value=31  Score=27.65  Aligned_cols=29  Identities=24%  Similarity=0.607  Sum_probs=19.3

Q ss_pred             cchhHhHhHHhhh---------------hhcccccccCCCCccc
Q psy838           18 YGASLRKMVKKME---------------ITQHAKYTCSFCGKEY   46 (152)
Q Consensus        18 yg~~~rk~~~~~~---------------~~~~~~~~c~~c~~~~   46 (152)
                      =|..|++++.++-               .....+..||.||++=
T Consensus       187 ~G~~l~~~Ie~~T~iPtYYyLyrv~g~~~~~e~~r~CP~Cg~~W  230 (258)
T PF10071_consen  187 RGRDLCKRIEKLTGIPTYYYLYRVGGESLASEQARKCPSCGGDW  230 (258)
T ss_pred             HHHHHHHHHHHHhCCCeEEEEEEccCcchHHhhCCCCCCCCCcc
Confidence            4666666666542               2345678899999883


No 238
>PRK01343 zinc-binding protein; Provisional
Probab=33.10  E-value=21  Score=21.90  Aligned_cols=12  Identities=33%  Similarity=0.711  Sum_probs=9.5

Q ss_pred             cccccCCCCccc
Q psy838           35 AKYTCSFCGKEY   46 (152)
Q Consensus        35 ~~~~c~~c~~~~   46 (152)
                      ...+||.|||..
T Consensus         8 p~~~CP~C~k~~   19 (57)
T PRK01343          8 PTRPCPECGKPS   19 (57)
T ss_pred             CCCcCCCCCCcC
Confidence            356799999984


No 239
>KOG2462|consensus
Probab=32.98  E-value=22  Score=28.67  Aligned_cols=24  Identities=38%  Similarity=0.924  Sum_probs=18.8

Q ss_pred             hcccccccCCCCcccccCCCCCHH
Q psy838           32 TQHAKYTCSFCGKEYKKSTDLGAE   55 (152)
Q Consensus        32 ~~~~~~~c~~c~~~~~~~~~l~~~   55 (152)
                      ..+-.|.|+.|||.+.+...|+..
T Consensus       126 ~~~~r~~c~eCgk~ysT~snLsrH  149 (279)
T KOG2462|consen  126 AKHPRYKCPECGKSYSTSSNLSRH  149 (279)
T ss_pred             ccCCceeccccccccccccccchh
Confidence            356789999999998888765544


No 240
>PF14690 zf-ISL3:  zinc-finger of transposase IS204/IS1001/IS1096/IS1165
Probab=32.89  E-value=22  Score=20.02  Aligned_cols=10  Identities=30%  Similarity=0.853  Sum_probs=6.9

Q ss_pred             ccccCCCCcc
Q psy838           36 KYTCSFCGKE   45 (152)
Q Consensus        36 ~~~c~~c~~~   45 (152)
                      +..||.||..
T Consensus         2 ~~~Cp~Cg~~   11 (47)
T PF14690_consen    2 PPRCPHCGSP   11 (47)
T ss_pred             CccCCCcCCC
Confidence            4578888844


No 241
>TIGR00100 hypA hydrogenase nickel insertion protein HypA. In Hpylori, hypA mutant abolished hydrogenase activity and decrease in urease activity. Nickel supplementation in media restored urease activity and partial hydrogenase activity. HypA probably involved in inserting Ni in enzymes.
Probab=32.85  E-value=20  Score=24.85  Aligned_cols=10  Identities=30%  Similarity=0.680  Sum_probs=7.8

Q ss_pred             ccccCCCCcc
Q psy838           36 KYTCSFCGKE   45 (152)
Q Consensus        36 ~~~c~~c~~~   45 (152)
                      .+.||.||..
T Consensus        86 ~~~CP~Cgs~   95 (115)
T TIGR00100        86 LYRCPKCHGI   95 (115)
T ss_pred             CccCcCCcCC
Confidence            4669999976


No 242
>PF00571 CBS:  CBS domain CBS domain web page. Mutations in the CBS domain of Swiss:P35520 lead to homocystinuria.;  InterPro: IPR000644 CBS (cystathionine-beta-synthase) domains are small intracellular modules, mostly found in two or four copies within a protein, that occur in a variety of proteins in bacteria, archaea, and eukaryotes [, ]. Tandem pairs of CBS domains can act as binding domains for adenosine derivatives and may regulate the activity of attached enzymatic or other domains []. In some cases, CBS domains may act as sensors of cellular energy status by being activated by AMP and inhibited by ATP []. In chloride ion channels, the CBS domains have been implicated in intracellular targeting and trafficking, as well as in protein-protein interactions, but results vary with different channels: in the CLC-5 channel, the CBS domain was shown to be required for trafficking [], while in the CLC-1 channel, the CBS domain was shown to be critical for channel function, but not necessary for trafficking []. Recent experiments revealing that CBS domains can bind adenosine-containing ligands such ATP, AMP, or S-adenosylmethionine have led to the hypothesis that CBS domains function as sensors of intracellular metabolites [, ]. Crystallographic studies of CBS domains have shown that pairs of CBS sequences form a globular domain where each CBS unit adopts a beta-alpha-beta-beta-alpha pattern []. Crystal structure of the CBS domains of the AMP-activated protein kinase in complexes with AMP and ATP shows that the phosphate groups of AMP/ATP lie in a surface pocket at the interface of two CBS domains, which is lined with basic residues, many of which are associated with disease-causing mutations [].  In humans, mutations in conserved residues within CBS domains cause a variety of human hereditary diseases, including (with the gene mutated in parentheses): homocystinuria (cystathionine beta-synthase); Wolff-Parkinson-White syndrome (gamma 2 subunit of AMP-activated protein kinase); retinitis pigmentosa (IMP dehydrogenase-1); congenital myotonia, idiopathic generalized epilepsy, hypercalciuric nephrolithiasis, and classic Bartter syndrome (CLC chloride channel family members).; GO: 0005515 protein binding; PDB: 3JTF_A 3TE5_C 3TDH_C 3T4N_C 2QLV_C 3OI8_A 3LV9_A 2QH1_B 1PVM_B 3LQN_A ....
Probab=32.81  E-value=39  Score=19.21  Aligned_cols=32  Identities=9%  Similarity=-0.049  Sum_probs=18.7

Q ss_pred             ChhhhhhhhcCCCceEEEEEeCCCCCeEEEEEE
Q psy838           80 DPQGKHSEMFDDRACYLVAKNGSSSTPVAFSHF  112 (152)
Q Consensus        80 ~~~~~~~el~~~~~~~~v~~~~~~~~ivGf~~~  112 (152)
                      +-......|.+.+...+.+.++ +++++|++..
T Consensus        17 ~l~~~~~~~~~~~~~~~~V~d~-~~~~~G~is~   48 (57)
T PF00571_consen   17 SLEEALEIMRKNGISRLPVVDE-DGKLVGIISR   48 (57)
T ss_dssp             BHHHHHHHHHHHTSSEEEEEST-TSBEEEEEEH
T ss_pred             cHHHHHHHHHHcCCcEEEEEec-CCEEEEEEEH
Confidence            3333444444444455555544 5999999874


No 243
>PRK00418 DNA gyrase inhibitor; Reviewed
Probab=32.70  E-value=23  Score=22.08  Aligned_cols=13  Identities=31%  Similarity=0.751  Sum_probs=10.1

Q ss_pred             cccccCCCCcccc
Q psy838           35 AKYTCSFCGKEYK   47 (152)
Q Consensus        35 ~~~~c~~c~~~~~   47 (152)
                      ....||.|||...
T Consensus         5 ~~v~CP~C~k~~~   17 (62)
T PRK00418          5 ITVNCPTCGKPVE   17 (62)
T ss_pred             ccccCCCCCCccc
Confidence            3578999999844


No 244
>COG3877 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=32.58  E-value=39  Score=23.50  Aligned_cols=16  Identities=19%  Similarity=0.125  Sum_probs=10.0

Q ss_pred             cccCCCCCHHHHHHHH
Q psy838           46 YKKSTDLGAETKKWVW   61 (152)
Q Consensus        46 ~~~~~~l~~~d~~~i~   61 (152)
                      .....-+++++++.+-
T Consensus        36 ~s~F~~Lt~d~LeFv~   51 (122)
T COG3877          36 MSKFEYLTSDQLEFVE   51 (122)
T ss_pred             cccccccCHhHhHHHH
Confidence            3445667777776654


No 245
>TIGR00244 transcriptional regulator NrdR. Members of this almost entirely bacterial family contain an ATP cone domain (PFAM:PF03477). There is never more than one member per genome. Common gene symbols given include nrdR, ybaD, ribX and ytcG. The member from Streptomyces coelicolor is found upstream in the operon of the class II oxygen-independent ribonucleotide reductase gene nrdJ and was shown to repress nrdJ expression. Many members of this family are found near genes for riboflavin biosynthesis in Gram-negative bacteria, suggesting a role in that pathway. However, a phylogenetic profiling study associates members of this family with the presence of a palindromic signal with consensus acaCwAtATaTwGtgt, termed the NrdR-box, an upstream element for most operons for ribonucleotide reductase of all three classes in bacterial genomes.
Probab=32.47  E-value=21  Score=26.17  Aligned_cols=7  Identities=29%  Similarity=0.819  Sum_probs=3.2

Q ss_pred             cCCCCcc
Q psy838           39 CSFCGKE   45 (152)
Q Consensus        39 c~~c~~~   45 (152)
                      |+.||+-
T Consensus        31 C~~C~~R   37 (147)
T TIGR00244        31 CLECHER   37 (147)
T ss_pred             CCccCCc
Confidence            4444444


No 246
>PF08209 Sgf11:  Sgf11 (transcriptional regulation protein);  InterPro: IPR013246 The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae (Baker's yeast). The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation [].; PDB: 3M99_B 2LO2_A 3MHH_C 3MHS_C.
Probab=32.43  E-value=20  Score=19.39  Aligned_cols=12  Identities=25%  Similarity=0.675  Sum_probs=8.5

Q ss_pred             ccccCCCCcccc
Q psy838           36 KYTCSFCGKEYK   47 (152)
Q Consensus        36 ~~~c~~c~~~~~   47 (152)
                      .+.||.|++.+.
T Consensus         4 ~~~C~nC~R~v~   15 (33)
T PF08209_consen    4 YVECPNCGRPVA   15 (33)
T ss_dssp             EEE-TTTSSEEE
T ss_pred             eEECCCCcCCcc
Confidence            578999998743


No 247
>COG4469 CoiA Competence protein CoiA-like family, contains a predicted nuclease    domain [General function prediction only]
Probab=32.27  E-value=20  Score=29.69  Aligned_cols=10  Identities=30%  Similarity=1.168  Sum_probs=9.4

Q ss_pred             ccccCCCCcc
Q psy838           36 KYTCSFCGKE   45 (152)
Q Consensus        36 ~~~c~~c~~~   45 (152)
                      +|.||.||+.
T Consensus        25 ~ffCPaC~~~   34 (342)
T COG4469          25 RFFCPACGSQ   34 (342)
T ss_pred             ccccCCCCCe
Confidence            6999999998


No 248
>PRK03681 hypA hydrogenase nickel incorporation protein; Validated
Probab=32.15  E-value=19  Score=24.98  Aligned_cols=10  Identities=20%  Similarity=0.577  Sum_probs=7.5

Q ss_pred             ccccCCCCcc
Q psy838           36 KYTCSFCGKE   45 (152)
Q Consensus        36 ~~~c~~c~~~   45 (152)
                      .+.||.||..
T Consensus        87 ~~~CP~Cgs~   96 (114)
T PRK03681         87 VRRCPQCHGD   96 (114)
T ss_pred             CCcCcCcCCC
Confidence            3569999976


No 249
>smart00440 ZnF_C2C2 C2C2 Zinc finger. Nucleic-acid-binding motif in transcriptional elongation factor TFIIS and RNA polymerases.
Probab=32.06  E-value=23  Score=19.82  Aligned_cols=9  Identities=33%  Similarity=0.955  Sum_probs=6.4

Q ss_pred             cccCCCCcc
Q psy838           37 YTCSFCGKE   45 (152)
Q Consensus        37 ~~c~~c~~~   45 (152)
                      +.||.||..
T Consensus         1 ~~Cp~C~~~    9 (40)
T smart00440        1 APCPKCGNR    9 (40)
T ss_pred             CcCCCCCCC
Confidence            368888766


No 250
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=31.83  E-value=22  Score=18.47  Aligned_cols=16  Identities=13%  Similarity=0.659  Sum_probs=11.6

Q ss_pred             cccccCCCCcccccCC
Q psy838           35 AKYTCSFCGKEYKKST   50 (152)
Q Consensus        35 ~~~~c~~c~~~~~~~~   50 (152)
                      .+|.|+.|++.+....
T Consensus         2 ~~~~C~~C~~~~~~~~   17 (35)
T smart00451        2 GGFYCKLCNVTFTDEI   17 (35)
T ss_pred             cCeEccccCCccCCHH
Confidence            3688999998766433


No 251
>PF00301 Rubredoxin:  Rubredoxin;  InterPro: IPR004039 Rubredoxin is a low molecular weight iron-containing bacterial protein involved in electron transfer [, ], sometimes replacing ferredoxin as an electron carrier []. The 3-D structures of a number of rubredoxins have been solved [, ]. The fold belongs to the alpha+beta class, with 2 alpha-helices and 2-3 beta-strands. Its active site contains an iron ion which is co-ordinated by the sulphurs of four conserved cysteine residues forming an almost regular tetrahedron. The conserved cysteines reside on two loops, which are the most conserved regions of the protein. In addition, a ring of acidic residues in the proximity of the [Fe(Cys)4] centre is also well-conserved []. ; GO: 0009055 electron carrier activity, 0046872 metal ion binding; PDB: 2RDV_C 1RDV_A 1S24_A 1T9O_B 1B2J_A 1SMW_A 2PVE_B 1BFY_A 1T9P_C 1C09_C ....
Probab=31.73  E-value=13  Score=21.75  Aligned_cols=16  Identities=38%  Similarity=0.941  Sum_probs=10.6

Q ss_pred             ccccCCCCcccccCCC
Q psy838           36 KYTCSFCGKEYKKSTD   51 (152)
Q Consensus        36 ~~~c~~c~~~~~~~~~   51 (152)
                      +|.|+.||-.+-....
T Consensus         1 ky~C~~CgyvYd~~~G   16 (47)
T PF00301_consen    1 KYQCPVCGYVYDPEKG   16 (47)
T ss_dssp             EEEETTTSBEEETTTB
T ss_pred             CcCCCCCCEEEcCCcC
Confidence            5788888866555443


No 252
>PHA00732 hypothetical protein
Probab=31.71  E-value=23  Score=22.94  Aligned_cols=14  Identities=36%  Similarity=0.963  Sum_probs=10.8

Q ss_pred             ccccccCCCCcccc
Q psy838           34 HAKYTCSFCGKEYK   47 (152)
Q Consensus        34 ~~~~~c~~c~~~~~   47 (152)
                      |..+.|+.||+.+.
T Consensus        25 H~~~~C~~CgKsF~   38 (79)
T PHA00732         25 HTLTKCPVCNKSYR   38 (79)
T ss_pred             cCCCccCCCCCEeC
Confidence            44678999998864


No 253
>PF06221 zf-C2HC5:  Putative zinc finger motif, C2HC5-type;  InterPro: IPR009349 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This zinc finger appears to be common in activating signal cointegrator 1/thyroid receptor interacting protein 4. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=31.66  E-value=51  Score=20.14  Aligned_cols=11  Identities=36%  Similarity=0.926  Sum_probs=7.6

Q ss_pred             ccccCCCCccc
Q psy838           36 KYTCSFCGKEY   46 (152)
Q Consensus        36 ~~~c~~c~~~~   46 (152)
                      .-.|+|||.++
T Consensus        35 ~~pC~fCg~~l   45 (57)
T PF06221_consen   35 LGPCPFCGTPL   45 (57)
T ss_pred             cCcCCCCCCcc
Confidence            46788888553


No 254
>COG4847 Uncharacterized protein conserved in archaea [Function unknown]
Probab=30.86  E-value=15  Score=24.98  Aligned_cols=17  Identities=18%  Similarity=0.540  Sum_probs=12.7

Q ss_pred             cccccccCCCCcccccC
Q psy838           33 QHAKYTCSFCGKEYKKS   49 (152)
Q Consensus        33 ~~~~~~c~~c~~~~~~~   49 (152)
                      +-+.|.|..||..+++.
T Consensus         3 ~lkewkC~VCg~~iieG   19 (103)
T COG4847           3 GLKEWKCYVCGGTIIEG   19 (103)
T ss_pred             ccceeeEeeeCCEeeec
Confidence            45789999999984443


No 255
>PF01155 HypA:  Hydrogenase expression/synthesis hypA family;  InterPro: IPR000688 Bacterial membrane-bound nickel-dependent hydrogenases requires a number of accessory proteins which are involved in their maturation. The exact role of these proteins is not yet clear, but some seem to be required for the incorporation of the nickel ions []. One of these proteins is generally known as hypA. It is a protein of about 12 to 14 kDa that contains, in its C-terminal region, four conserved cysteines that form a zinc-finger like motif. Escherichia coli has two proteins that belong to this family, hypA and hybF. A homologue, MJ0214, has also been found in a number of archaeal species, including the genome of Methanocaldococcus jannaschii (Methanococcus jannaschii).; GO: 0016151 nickel ion binding, 0006464 protein modification process; PDB: 2KDX_A 3A44_D 3A43_B.
Probab=30.80  E-value=19  Score=24.86  Aligned_cols=9  Identities=33%  Similarity=1.220  Sum_probs=6.3

Q ss_pred             cccCCCCcc
Q psy838           37 YTCSFCGKE   45 (152)
Q Consensus        37 ~~c~~c~~~   45 (152)
                      +.||.||..
T Consensus        87 ~~CP~Cgs~   95 (113)
T PF01155_consen   87 FSCPRCGSP   95 (113)
T ss_dssp             HH-SSSSSS
T ss_pred             CCCcCCcCC
Confidence            668888876


No 256
>COG4338 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=30.54  E-value=15  Score=21.82  Aligned_cols=16  Identities=13%  Similarity=0.513  Sum_probs=11.6

Q ss_pred             ccccCCCCcccccCCC
Q psy838           36 KYTCSFCGKEYKKSTD   51 (152)
Q Consensus        36 ~~~c~~c~~~~~~~~~   51 (152)
                      .-.||.|++++.+-..
T Consensus        12 ~KICpvCqRPFsWRkK   27 (54)
T COG4338          12 DKICPVCQRPFSWRKK   27 (54)
T ss_pred             hhhhhhhcCchHHHHH
Confidence            3459999999776543


No 257
>TIGR01374 soxD sarcosine oxidase, delta subunit family, heterotetrameric form. Sarcosine oxidase catalyzes the oxidative demethylation of sarcosine to glycine. The reaction converts tetrahydrofolate to 5,10-methylene-tetrahydrofolate. The enzyme is known in monomeric and heterotetrameric (alpha,beta,gamma,delta) form
Probab=30.47  E-value=22  Score=23.49  Aligned_cols=8  Identities=38%  Similarity=1.456  Sum_probs=6.7

Q ss_pred             ccCCCCcc
Q psy838           38 TCSFCGKE   45 (152)
Q Consensus        38 ~c~~c~~~   45 (152)
                      .||.||.-
T Consensus         3 ~CP~CG~R   10 (84)
T TIGR01374         3 PCPYCGPR   10 (84)
T ss_pred             cCCCCCCc
Confidence            69999955


No 258
>KOG2879|consensus
Probab=30.47  E-value=34  Score=27.76  Aligned_cols=13  Identities=23%  Similarity=0.746  Sum_probs=10.2

Q ss_pred             cccccccCCCCcc
Q psy838           33 QHAKYTCSFCGKE   45 (152)
Q Consensus        33 ~~~~~~c~~c~~~   45 (152)
                      ....-+||+||.+
T Consensus       236 ~t~~~~C~~Cg~~  248 (298)
T KOG2879|consen  236 GTSDTECPVCGEP  248 (298)
T ss_pred             ccCCceeeccCCC
Confidence            4456789999987


No 259
>PF14952 zf-tcix:  Putative treble-clef, zinc-finger, Zn-binding
Probab=30.44  E-value=24  Score=20.43  Aligned_cols=11  Identities=27%  Similarity=0.631  Sum_probs=8.6

Q ss_pred             cccccCCCCcc
Q psy838           35 AKYTCSFCGKE   45 (152)
Q Consensus        35 ~~~~c~~c~~~   45 (152)
                      ---.||.||..
T Consensus        10 GirkCp~CGt~   20 (44)
T PF14952_consen   10 GIRKCPKCGTY   20 (44)
T ss_pred             ccccCCcCcCc
Confidence            34579999987


No 260
>PRK00807 50S ribosomal protein L24e; Validated
Probab=29.87  E-value=29  Score=20.64  Aligned_cols=15  Identities=60%  Similarity=1.092  Sum_probs=10.6

Q ss_pred             cccCCCCcccccCCC
Q psy838           37 YTCSFCGKEYKKSTD   51 (152)
Q Consensus        37 ~~c~~c~~~~~~~~~   51 (152)
                      +.|.+||..+.+.+.
T Consensus         2 ~~C~fcG~~I~pg~G   16 (52)
T PRK00807          2 RTCSFCGKEIEPGTG   16 (52)
T ss_pred             cccCCCCCeEcCCCC
Confidence            579999988665443


No 261
>PF04810 zf-Sec23_Sec24:  Sec23/Sec24 zinc finger;  InterPro: IPR006895 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  COPII (coat protein complex II)-coated vesicles carry proteins from the endoplasmic reticulum (ER) to the Golgi complex []. COPII-coated vesicles form on the ER by the stepwise recruitment of three cytosolic components: Sar1-GTP to initiate coat formation, Sec23/24 heterodimer to select SNARE and cargo molecules, and Sec13/31 to induce coat polymerisation and membrane deformation [].  Sec23 p and Sec24p are structurally related, folding into five distinct domains: a beta-barrel, a zinc-finger, an alpha/beta trunk domain (IPR006896 from INTERPRO), an all-helical region (IPR006900 from INTERPRO), and a C-terminal gelsolin-like domain (IPR007123 from INTERPRO). This entry describes an approximately 55-residue Sec23/24 zinc-binding domain, which lies against the beta-barrel at the periphery of the complex. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006886 intracellular protein transport, 0006888 ER to Golgi vesicle-mediated transport, 0030127 COPII vesicle coat; PDB: 3EFO_B 3EG9_B 3EGD_A 2YRC_A 2NUP_A 2YRD_A 3EGX_A 2NUT_A 3EH1_A 1PD0_A ....
Probab=29.76  E-value=21  Score=19.91  Aligned_cols=12  Identities=33%  Similarity=1.126  Sum_probs=7.9

Q ss_pred             ccccccCCCCcc
Q psy838           34 HAKYTCSFCGKE   45 (152)
Q Consensus        34 ~~~~~c~~c~~~   45 (152)
                      ...++|++|+..
T Consensus        22 ~~~w~C~~C~~~   33 (40)
T PF04810_consen   22 GKTWICNFCGTK   33 (40)
T ss_dssp             TTEEEETTT--E
T ss_pred             CCEEECcCCCCc
Confidence            357899999864


No 262
>COG4416 Com Mu-like prophage protein Com [General function prediction only]
Probab=29.37  E-value=38  Score=20.61  Aligned_cols=25  Identities=16%  Similarity=0.267  Sum_probs=20.0

Q ss_pred             hHhHhHHhhhhhcccccccCCCCcc
Q psy838           21 SLRKMVKKMEITQHAKYTCSFCGKE   45 (152)
Q Consensus        21 ~~rk~~~~~~~~~~~~~~c~~c~~~   45 (152)
                      ..-|.+.+.+-+-+.+-.||.|...
T Consensus         9 ~CnKlLa~a~~~~yle~KCPrCK~v   33 (60)
T COG4416           9 KCNKLLAEAEGQAYLEKKCPRCKEV   33 (60)
T ss_pred             HHhHHHHhcccceeeeecCCcccee
Confidence            4457788888888999999999654


No 263
>PF04135 Nop10p:  Nucleolar RNA-binding protein, Nop10p family;  InterPro: IPR007264 H/ACA ribonucleoprotein particles (RNPs) are a family of RNA pseudouridine synthases that specify modification sites through guide RNAs. More than 100 mammalian H/ACA RNAs form an equal number of ribonucleoproteins (RNPs) by associating with the same four core proteins: Cbf5, Gar1, Nhp2 and Nop10. The function of these H/ACA RNPs is essential for biogenesis of the ribosome, splicing of precursor mRNAs (pre-mRNAs), maintenance of telomeres and probably for additional cellular processes []. Recent crystal structures of archaeal H/ACA protein complexes show how the same four proteins accommodate >100 distinct but related H/ACA RNAs []. The complex contains a stable core composed of Cbf5 and Nop10, to which Gar1 and Nhp2 subsequently bind, the complex interacts with snoRNAs []. In eukaryotes Nop10 is a nucleolar protein that is specifically associated with H/ACA snoRNAs. It is essential for normal 18S rRNA production and rRNA pseudouridylation by the ribonucleoprotein particles containing H/ACA snoRNAs (H/ACA snoRNPs). Nop10 is probably necessary for the stability of these RNPs [].; PDB: 2RFK_B 3LWR_B 2HVY_C 3HAX_C 3MQK_B 3LWO_B 3LWV_B 3HAY_C 3HJY_B 2EY4_E ....
Probab=29.03  E-value=38  Score=20.38  Aligned_cols=20  Identities=15%  Similarity=0.454  Sum_probs=12.3

Q ss_pred             cccCCCCcccccCC--CCCHHH
Q psy838           37 YTCSFCGKEYKKST--DLGAET   56 (152)
Q Consensus        37 ~~c~~c~~~~~~~~--~l~~~d   56 (152)
                      -+||.||..-.+++  ..+++|
T Consensus        18 ~~cp~cG~~T~~ahPaRFSPdD   39 (53)
T PF04135_consen   18 DKCPPCGGPTESAHPARFSPDD   39 (53)
T ss_dssp             SBBTTTSSBSEESSSSSS-TTT
T ss_pred             CccCCCCCCCcCCcCCCCCCCC
Confidence            47999999944443  344444


No 264
>TIGR02666 moaA molybdenum cofactor biosynthesis protein A, bacterial. The model for this family describes molybdenum cofactor biosynthesis protein A, or MoaA, as found in bacteria. It does not include the family of probable functional equivalent proteins from the archaea. MoaA works together with MoaC to synthesize precursor Z from guanine.
Probab=28.91  E-value=46  Score=26.93  Aligned_cols=46  Identities=20%  Similarity=0.334  Sum_probs=32.8

Q ss_pred             ccchhHhHhHHhhhhhcccccccCCCCccc------ccCCCCCHHHHHHHHHHH
Q psy838           17 RYGASLRKMVKKMEITQHAKYTCSFCGKEY------KKSTDLGAETKKWVWELE   64 (152)
Q Consensus        17 ryg~~~rk~~~~~~~~~~~~~~c~~c~~~~------~~~~~l~~~d~~~i~~l~   64 (152)
                      +||+.++  +-.++++..=++.|.+|-...      .....++.+++..+++..
T Consensus         4 ~~gr~~~--~l~i~vT~~CNl~C~yC~~~~~~~~~~~~~~~ls~eei~~~i~~~   55 (334)
T TIGR02666         4 RFGRRID--YLRISVTDRCNLRCVYCMPEGGGLDFLPKEELLTFEEIERLVRAF   55 (334)
T ss_pred             CCCCccC--eEEEEecCccCcCCCCCCCCcCCCCcCCccCCCCHHHHHHHHHHH
Confidence            5777766  455888889999999997643      123567787777766653


No 265
>PRK00564 hypA hydrogenase nickel incorporation protein; Provisional
Probab=28.90  E-value=22  Score=24.72  Aligned_cols=8  Identities=25%  Similarity=0.825  Sum_probs=6.6

Q ss_pred             ccCCCCcc
Q psy838           38 TCSFCGKE   45 (152)
Q Consensus        38 ~c~~c~~~   45 (152)
                      .||.||..
T Consensus        90 ~CP~Cgs~   97 (117)
T PRK00564         90 VCEKCHSK   97 (117)
T ss_pred             cCcCCCCC
Confidence            49999976


No 266
>PF04475 DUF555:  Protein of unknown function (DUF555);  InterPro: IPR007564 This is a family of uncharacterised, hypothetical archaeal proteins.
Probab=28.78  E-value=26  Score=23.95  Aligned_cols=10  Identities=40%  Similarity=0.983  Sum_probs=8.4

Q ss_pred             ccccCCCCcc
Q psy838           36 KYTCSFCGKE   45 (152)
Q Consensus        36 ~~~c~~c~~~   45 (152)
                      ...||.||..
T Consensus        47 ~~~cP~Cge~   56 (102)
T PF04475_consen   47 DTICPKCGEE   56 (102)
T ss_pred             cccCCCCCCc
Confidence            5679999987


No 267
>PF10276 zf-CHCC:  Zinc-finger domain;  InterPro: IPR019401 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.   This entry represents a short conserved zinc-finger domain. It contains the sequence motif Cx8Hx14Cx2C. ; PDB: 2JVM_A 2JRR_A 2JZ8_A.
Probab=28.63  E-value=22  Score=20.10  Aligned_cols=11  Identities=36%  Similarity=1.262  Sum_probs=8.7

Q ss_pred             ccccCCCCccc
Q psy838           36 KYTCSFCGKEY   46 (152)
Q Consensus        36 ~~~c~~c~~~~   46 (152)
                      .-.||.||..+
T Consensus        29 ~~~CpYCg~~y   39 (40)
T PF10276_consen   29 PVVCPYCGTRY   39 (40)
T ss_dssp             EEEETTTTEEE
T ss_pred             eEECCCCCCEE
Confidence            47899999754


No 268
>COG2761 FrnE Predicted dithiol-disulfide isomerase involved in polyketide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=28.62  E-value=12  Score=29.26  Aligned_cols=17  Identities=29%  Similarity=0.706  Sum_probs=14.8

Q ss_pred             HHhhhhhcccccccCCC
Q psy838           26 VKKMEITQHAKYTCSFC   42 (152)
Q Consensus        26 ~~~~~~~~~~~~~c~~c   42 (152)
                      ..+|++...+.+.||+|
T Consensus         3 ~~~i~I~v~sD~vCPwC   19 (225)
T COG2761           3 PMKIEIDVFSDVVCPWC   19 (225)
T ss_pred             CceEEEEEEeCCcCchh
Confidence            45788888999999999


No 269
>PRK03922 hypothetical protein; Provisional
Probab=28.58  E-value=26  Score=24.40  Aligned_cols=10  Identities=30%  Similarity=0.760  Sum_probs=8.4

Q ss_pred             ccccCCCCcc
Q psy838           36 KYTCSFCGKE   45 (152)
Q Consensus        36 ~~~c~~c~~~   45 (152)
                      ...||.||.+
T Consensus        49 ~~~cP~cge~   58 (113)
T PRK03922         49 LTICPKCGEP   58 (113)
T ss_pred             cccCCCCCCc
Confidence            4679999988


No 270
>PF01096 TFIIS_C:  Transcription factor S-II (TFIIS);  InterPro: IPR001222 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents a zinc finger motif found in transcription factor IIs (TFIIS). In eukaryotes the initiation of transcription of protein encoding genes by polymerase II (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least eight different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, -IIH and -IIS []. During mRNA elongation, Pol II can encounter DNA sequences that cause reverse movement of the enzyme. Such backtracking involves extrusion of the RNA 3'-end into the pore, and can lead to transcriptional arrest. Escape from arrest requires cleavage of the extruded RNA with the help of TFIIS, which induces mRNA cleavage by enhancing the intrinsic nuclease activity of RNA polymerase (Pol) II, past template-encoded pause sites []. TFIIS extends from the polymerase surface via a pore to the internal active site. Two essential and invariant acidic residues in a TFIIS loop complement the Pol II active site and could position a metal ion and a water molecule for hydrolytic RNA cleavage. TFIIS also induces extensive structural changes in Pol II that would realign nucleic acids in the active centre.  TFIIS is a protein of about 300 amino acids. It contains three regions: a variable N-terminal domain not required for TFIIS activity; a conserved central domain required for Pol II binding; and a conserved C-terminal C4-type zinc finger essential for RNA cleavage. The zinc finger folds in a conformation termed a zinc ribbon [] characterised by a three-stranded antiparallel beta-sheet and two beta-hairpins. A backbone model for Pol II-TFIIS complex was obtained from X-ray analysis. It shows that a beta hairpin protrudes from the zinc finger and complements the pol II active site [].  Some viral proteins also contain the TFIIS zinc ribbon C-terminal domain. The Vaccinia virus protein, unlike its eukaryotic homologue, is an integral RNA polymerase subunit rather than a readily separable transcription factor []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding, 0006351 transcription, DNA-dependent; PDB: 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I 3I4M_I ....
Probab=28.07  E-value=27  Score=19.36  Aligned_cols=8  Identities=38%  Similarity=1.178  Sum_probs=4.0

Q ss_pred             ccCCCCcc
Q psy838           38 TCSFCGKE   45 (152)
Q Consensus        38 ~c~~c~~~   45 (152)
                      .||.||..
T Consensus         2 ~Cp~Cg~~    9 (39)
T PF01096_consen    2 KCPKCGHN    9 (39)
T ss_dssp             --SSS-SS
T ss_pred             CCcCCCCC
Confidence            58888877


No 271
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=27.90  E-value=99  Score=25.72  Aligned_cols=37  Identities=19%  Similarity=0.156  Sum_probs=25.6

Q ss_pred             ceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeecccccc
Q psy838           93 ACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQR  136 (152)
Q Consensus        93 ~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qg  136 (152)
                      .-..++--. +|.+|+++.|.-.      +-+|++++|.|..|.
T Consensus       282 CGv~vidl~-tG~vv~~l~feg~------v~EifdV~vLPg~r~  318 (335)
T TIGR03032       282 CGVAVIDLN-SGDVVHWLRFEGV------IEEIYDVAVLPGVRR  318 (335)
T ss_pred             ccEEEEECC-CCCEEEEEEeCCc------eeEEEEEEEecCCCC
Confidence            334444323 5999999987432      345999999999875


No 272
>PRK00420 hypothetical protein; Validated
Probab=27.88  E-value=73  Score=22.21  Aligned_cols=9  Identities=33%  Similarity=0.929  Sum_probs=4.4

Q ss_pred             cccCCCCcc
Q psy838           37 YTCSFCGKE   45 (152)
Q Consensus        37 ~~c~~c~~~   45 (152)
                      -.||.||.+
T Consensus        24 ~~CP~Cg~p   32 (112)
T PRK00420         24 KHCPVCGLP   32 (112)
T ss_pred             CCCCCCCCc
Confidence            345555544


No 273
>smart00507 HNHc HNH nucleases.
Probab=27.63  E-value=34  Score=18.81  Aligned_cols=14  Identities=29%  Similarity=0.807  Sum_probs=10.3

Q ss_pred             cccccccCCCCcccc
Q psy838           33 QHAKYTCSFCGKEYK   47 (152)
Q Consensus        33 ~~~~~~c~~c~~~~~   47 (152)
                      .+. +.|++||....
T Consensus         8 ~r~-~~C~~C~~~~~   21 (52)
T smart00507        8 HRD-GVCAYCGKPAS   21 (52)
T ss_pred             HHC-CCCcCCcCCCC
Confidence            344 89999998753


No 274
>PF14205 Cys_rich_KTR:  Cysteine-rich KTR
Probab=27.60  E-value=30  Score=20.99  Aligned_cols=9  Identities=33%  Similarity=1.147  Sum_probs=7.0

Q ss_pred             cccCCCCcc
Q psy838           37 YTCSFCGKE   45 (152)
Q Consensus        37 ~~c~~c~~~   45 (152)
                      ..||.||..
T Consensus         5 i~CP~CgnK   13 (55)
T PF14205_consen    5 ILCPICGNK   13 (55)
T ss_pred             EECCCCCCc
Confidence            369999965


No 275
>PF14446 Prok-RING_1:  Prokaryotic RING finger family 1
Probab=27.56  E-value=27  Score=21.12  Aligned_cols=12  Identities=33%  Similarity=0.844  Sum_probs=7.4

Q ss_pred             ccccCCCCcccc
Q psy838           36 KYTCSFCGKEYK   47 (152)
Q Consensus        36 ~~~c~~c~~~~~   47 (152)
                      --.||.||.+++
T Consensus        21 iVvCp~CgapyH   32 (54)
T PF14446_consen   21 IVVCPECGAPYH   32 (54)
T ss_pred             EEECCCCCCccc
Confidence            345777777643


No 276
>COG3478 Predicted nucleic-acid-binding protein containing a Zn-ribbon domain [General function prediction only]
Probab=27.56  E-value=29  Score=21.85  Aligned_cols=10  Identities=30%  Similarity=1.072  Sum_probs=7.0

Q ss_pred             ccccCCCCcc
Q psy838           36 KYTCSFCGKE   45 (152)
Q Consensus        36 ~~~c~~c~~~   45 (152)
                      ...||.||-.
T Consensus         4 ~~kCpKCgn~   13 (68)
T COG3478           4 AFKCPKCGNT   13 (68)
T ss_pred             cccCCCcCCc
Confidence            4559999843


No 277
>smart00116 CBS Domain in cystathionine beta-synthase and other proteins. Domain present in all 3 forms of cellular life. Present in two copies in inosine monophosphate dehydrogenase, of which one is disordered in the crystal structure [3]. A number of disease states are associated with CBS-containing proteins including homocystinuria, Becker's and Thomsen disease.
Probab=27.45  E-value=64  Score=16.48  Aligned_cols=27  Identities=7%  Similarity=-0.085  Sum_probs=13.8

Q ss_pred             hhhhcCCCceEEEEEeCCCCCeEEEEEE
Q psy838           85 HSEMFDDRACYLVAKNGSSSTPVAFSHF  112 (152)
Q Consensus        85 ~~el~~~~~~~~v~~~~~~~~ivGf~~~  112 (152)
                      ...+.+.....+++.++ +++++|++..
T Consensus        15 ~~~~~~~~~~~~~v~~~-~~~~~g~i~~   41 (49)
T smart00116       15 LELLREHGIRRLPVVDE-EGRLVGIVTR   41 (49)
T ss_pred             HHHHHHhCCCcccEECC-CCeEEEEEEH
Confidence            33444333333334333 3788888763


No 278
>KOG3993|consensus
Probab=27.44  E-value=26  Score=30.15  Aligned_cols=13  Identities=38%  Similarity=0.879  Sum_probs=11.2

Q ss_pred             cccccccCCCCcc
Q psy838           33 QHAKYTCSFCGKE   45 (152)
Q Consensus        33 ~~~~~~c~~c~~~   45 (152)
                      -|-.|.||.|+|.
T Consensus       292 V~vEYrCPEC~KV  304 (500)
T KOG3993|consen  292 VHVEYRCPECDKV  304 (500)
T ss_pred             EEeeecCCccccc
Confidence            4568999999998


No 279
>COG0419 SbcC ATPase involved in DNA repair [DNA replication, recombination, and repair]
Probab=27.30  E-value=54  Score=30.66  Aligned_cols=15  Identities=27%  Similarity=0.711  Sum_probs=10.6

Q ss_pred             ccccCCCCcccccCC
Q psy838           36 KYTCSFCGKEYKKST   50 (152)
Q Consensus        36 ~~~c~~c~~~~~~~~   50 (152)
                      .-.||.||+++....
T Consensus       457 ~~~CPvCg~~l~~~~  471 (908)
T COG0419         457 GEKCPVCGQELPEEH  471 (908)
T ss_pred             CCCCCCCCCCCCcHH
Confidence            578999997644444


No 280
>PF11039 DUF2824:  Protein of unknown function (DUF2824);  InterPro: IPR022568  This family of proteins has no known function. Members of the family are found in P22-like viruses and bacteria. Some of the phage members have been annotated as head assembly proteins, but this has not been confirmed.
Probab=27.30  E-value=2.4e+02  Score=20.50  Aligned_cols=41  Identities=12%  Similarity=0.088  Sum_probs=27.3

Q ss_pred             CCCceEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeecccccc
Q psy838           90 DDRACYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQR  136 (152)
Q Consensus        90 ~~~~~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qg  136 (152)
                      .++..|+=+...  +.++|+..++..-.   .++.+|-+| +|++||
T Consensus        35 K~~~~Y~gVyeg--~~l~Gi~~v~~i~~---~~vecHa~y-~P~fRG   75 (151)
T PF11039_consen   35 KPDQLYLGVYEG--GQLGGIVYVEEIQP---SVVECHAMY-DPGFRG   75 (151)
T ss_pred             cCccEEEEEEec--eEEEEEEEEEEEee---eeEEEEeee-ccccch
Confidence            356667767654  88999998865321   134456554 799998


No 281
>PRK00762 hypA hydrogenase nickel incorporation protein; Provisional
Probab=27.29  E-value=28  Score=24.47  Aligned_cols=9  Identities=33%  Similarity=1.158  Sum_probs=7.5

Q ss_pred             cccCCCCcc
Q psy838           37 YTCSFCGKE   45 (152)
Q Consensus        37 ~~c~~c~~~   45 (152)
                      +.||.||..
T Consensus        93 ~~CP~Cgs~  101 (124)
T PRK00762         93 IECPVCGNK  101 (124)
T ss_pred             CcCcCCCCC
Confidence            579999965


No 282
>PF01246 Ribosomal_L24e:  Ribosomal protein L24e;  InterPro: IPR000988 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits.  Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. A number of eukaryotic and archaeabacterial ribosomal proteins can be grouped on the basis of sequence similarities. One of these families [] consists of mammalian ribosomal protein L24; yeast ribosomal protein L30A/B (Rp29) (YL21); Kluyveromyces lactis ribosomal protein L30; Arabidopsis thaliana ribosomal protein L24 homolog; Haloarcula marismortui ribosomal protein HL21/HL22; and Methanocaldococcus jannaschii (Methanococcus jannaschii) MJ1201. These proteins have 60 to 160 amino-acid residues. This entry represents proteins related to the L24e ribosomal proteins.; PDB: 2ZKR_u 1VQ9_U 1VQL_U 1KD1_V 1VQP_U 3CCM_U 3CD6_U 3CCL_U 3CCR_U 1Q86_V ....
Probab=27.27  E-value=31  Score=22.07  Aligned_cols=14  Identities=29%  Similarity=0.501  Sum_probs=8.0

Q ss_pred             ccccCCCCcccccC
Q psy838           36 KYTCSFCGKEYKKS   49 (152)
Q Consensus        36 ~~~c~~c~~~~~~~   49 (152)
                      ..+|.|||..+.+.
T Consensus         3 ~~~C~Fsg~~I~PG   16 (71)
T PF01246_consen    3 TEKCSFSGYKIYPG   16 (71)
T ss_dssp             SEE-TTT-SEE-SS
T ss_pred             eEEecccCCccCCC
Confidence            36799999985544


No 283
>COG1327 Predicted transcriptional regulator, consists of a Zn-ribbon and ATP-cone domains [Transcription]
Probab=27.16  E-value=29  Score=25.60  Aligned_cols=8  Identities=63%  Similarity=1.717  Sum_probs=6.3

Q ss_pred             ccCCCCcc
Q psy838           38 TCSFCGKE   45 (152)
Q Consensus        38 ~c~~c~~~   45 (152)
                      .||||+..
T Consensus         2 ~CPfC~~~    9 (156)
T COG1327           2 KCPFCGHE    9 (156)
T ss_pred             CCCCCCCC
Confidence            48888876


No 284
>cd00085 HNHc HNH nucleases; HNH endonuclease signature which is found in viral, prokaryotic, and eukaryotic proteins. The alignment includes members of the large group of homing endonucleases, yeast intron 1 protein, MutS, as well as bacterial colicins, pyocins, and anaredoxins.
Probab=27.11  E-value=46  Score=18.62  Aligned_cols=10  Identities=40%  Similarity=0.949  Sum_probs=8.4

Q ss_pred             cccCCCCccc
Q psy838           37 YTCSFCGKEY   46 (152)
Q Consensus        37 ~~c~~c~~~~   46 (152)
                      +.|+.||..+
T Consensus        12 ~~C~~c~~~~   21 (57)
T cd00085          12 GLCPYCGKPG   21 (57)
T ss_pred             CcCccCCCcC
Confidence            8999999753


No 285
>PF04267 SoxD:  Sarcosine oxidase, delta subunit family ;  InterPro: IPR006279 These sequences represent the delta subunit of a family of known and putative heterotetrameric sarcosine oxidases. Five operons of such oxidases are found in Rhizobium loti (Mesorhizobium loti) and three in Agrobacterium tumefaciens, a high enough copy number to suggest that not all members share the same function. Sarcosine oxidase catalyzes the oxidative demethylation of sarcosine to glycine. The reaction converts tetrahydrofolate to 5,10-methylene-tetrahydrofolate [].  Bacterial sarcosine oxidases have been isolated from over a dozen different organisms and fall into two major classes (1) monomeric form that contains only covalent flavin and (2) heterotetrameric (alpha, beta, gamma, delta) form that contain a covalent and noncovalent flavin, this entry represents the heterotetrameric form.; GO: 0008115 sarcosine oxidase activity, 0046653 tetrahydrofolate metabolic process; PDB: 3AD7_D 1X31_D 1VRQ_D 3AD8_D 3ADA_D 3AD9_D 2GAG_D 2GAH_D.
Probab=27.02  E-value=20  Score=23.74  Aligned_cols=8  Identities=38%  Similarity=1.240  Sum_probs=7.0

Q ss_pred             ccCCCCcc
Q psy838           38 TCSFCGKE   45 (152)
Q Consensus        38 ~c~~c~~~   45 (152)
                      .||+||.-
T Consensus         3 ~CP~CG~R   10 (84)
T PF04267_consen    3 PCPHCGPR   10 (84)
T ss_dssp             EETTTEEE
T ss_pred             cCCCCCcc
Confidence            69999986


No 286
>TIGR03826 YvyF flagellar operon protein TIGR03826. This gene is found in flagellar operons of Bacillus-related organisms. Its function has not been determined and an official gene symbol has not been assigned, although the gene is designated yvyF in B. subtilus. A tentative assignment as a regulator is suggested in the NCBI record GI:16080597.
Probab=26.94  E-value=33  Score=24.73  Aligned_cols=11  Identities=27%  Similarity=0.857  Sum_probs=7.9

Q ss_pred             cccCCCCcccc
Q psy838           37 YTCSFCGKEYK   47 (152)
Q Consensus        37 ~~c~~c~~~~~   47 (152)
                      -+||.||+.+.
T Consensus         4 ~nC~~CgklF~   14 (137)
T TIGR03826         4 ANCPKCGRLFV   14 (137)
T ss_pred             ccccccchhhh
Confidence            36888888843


No 287
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=26.86  E-value=29  Score=24.75  Aligned_cols=12  Identities=42%  Similarity=0.899  Sum_probs=9.1

Q ss_pred             ccccccCCCCcc
Q psy838           34 HAKYTCSFCGKE   45 (152)
Q Consensus        34 ~~~~~c~~c~~~   45 (152)
                      ...+.||.||..
T Consensus       105 ~~~~~CP~Cgs~  116 (135)
T PRK03824        105 HAFLKCPKCGSR  116 (135)
T ss_pred             ccCcCCcCCCCC
Confidence            345779999975


No 288
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=26.79  E-value=30  Score=17.47  Aligned_cols=9  Identities=33%  Similarity=0.870  Sum_probs=7.1

Q ss_pred             ccCCCCccc
Q psy838           38 TCSFCGKEY   46 (152)
Q Consensus        38 ~c~~c~~~~   46 (152)
                      .||.|++..
T Consensus         3 ~CPiC~~~v   11 (26)
T smart00734        3 QCPVCFREV   11 (26)
T ss_pred             cCCCCcCcc
Confidence            599998765


No 289
>KOG2779|consensus
Probab=26.50  E-value=2.2e+02  Score=24.13  Aligned_cols=65  Identities=12%  Similarity=0.173  Sum_probs=39.7

Q ss_pred             cCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcC-CCce-EEEEEeCCCCCeEEEEEEEee
Q psy838           48 KSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFD-DRAC-YLVAKNGSSSTPVAFSHFRFD  115 (152)
Q Consensus        48 ~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~-~~~~-~~v~~~~~~~~ivGf~~~~~~  115 (152)
                      -..++++.|++.+++|..+.+.+ |.- ..-.+++.+..-+.- ++-. .+++.. .+|.|-+|.+|...
T Consensus       262 G~R~me~kDvp~V~~Ll~~yl~q-f~l-a~~f~~eev~Hwf~p~e~VV~syVves-p~g~ITDF~SFy~l  328 (421)
T KOG2779|consen  262 GLREMEEKDVPAVFRLLRNYLKQ-FEL-APVFDEEEVEHWFLPRENVVYSYVVES-PNGKITDFCSFYSL  328 (421)
T ss_pred             CcccccccchHHHHHHHHHHHHh-eec-ccccCHHHhHhhcccccceEEEEEEEC-CCCcccceeeEEec
Confidence            35788999999999999876644 321 123455544444432 2222 233333 36899999999754


No 290
>PF06827 zf-FPG_IleRS:  Zinc finger found in FPG and IleRS;  InterPro: IPR010663 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents a zinc finger domain found at the C-terminal in both DNA glycosylase/AP lyase enzymes and in isoleucyl tRNA synthetase. In these two types of enzymes, the C-terminal domain forms a zinc finger. Some related proteins may not bind zinc.  DNA glycosylase/AP lyase enzymes are involved in base excision repair of DNA damaged by oxidation or by mutagenic agents. These enzymes have both DNA glycosylase activity (3.2.2 from EC) and AP lyase activity (4.2.99.18 from EC) []. Examples include formamidopyrimidine-DNA glycosylases (Fpg; MutM) and endonuclease VIII (Nei). Formamidopyrimidine-DNA glycosylases (Fpg, MutM) is a trifunctional DNA base excision repair enzyme that removes a wide range of oxidation-damaged bases (N-glycosylase activity; 3.2.2.23 from EC) and cleaves both the 3'- and 5'-phosphodiester bonds of the resulting apurinic/apyrimidinic site (AP lyase activity; 4.2.99.18 from EC). Fpg has a preference for oxidised purines, excising oxidized purine bases such as 7,8-dihydro-8-oxoguanine (8-oxoG). ITs AP (apurinic/apyrimidinic) lyase activity introduces nicks in the DNA strand, cleaving the DNA backbone by beta-delta elimination to generate a single-strand break at the site of the removed base with both 3'- and 5'-phosphates. Fpg is a monomer composed of 2 domains connected by a flexible hinge []. The two DNA-binding motifs (a zinc finger and the helix-two-turns-helix motifs) suggest that the oxidized base is flipped out from double-stranded DNA in the binding mode and excised by a catalytic mechanism similar to that of bifunctional base excision repair enzymes []. Fpg binds one ion of zinc at the C terminus, which contains four conserved and essential cysteines []. Endonuclease VIII (Nei) has the same enzyme activities as Fpg above, but with a preference for oxidized pyrimidines, such as thymine glycol, 5,6-dihydrouracil and 5,6-dihydrothymine [, ].  An Fpg-type zinc finger is also found at the C terminus of isoleucyl tRNA synthetase (6.1.1.5 from EC) [, ]. This enzyme catalyses the attachment of isoleucine to tRNA(Ile). As IleRS can inadvertently accommodate and process structurally similar amino acids such as valine, to avoid such errors it has two additional distinct tRNA(Ile)-dependent editing activities. One activity is designated as 'pre-transfer' editing and involves the hydrolysis of activated Val-AMP. The other activity is designated 'post-transfer' editing and involves deacylation of mischarged Val-tRNA(Ile) [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003824 catalytic activity; PDB: 1K82_C 1Q39_A 2OQ4_B 2OPF_A 1K3X_A 1K3W_A 1Q3B_A 2EA0_A 1Q3C_A 2XZF_A ....
Probab=26.38  E-value=13  Score=19.13  Aligned_cols=19  Identities=26%  Similarity=0.714  Sum_probs=13.3

Q ss_pred             HHhhhhhcccccccCCCCc
Q psy838           26 VKKMEITQHAKYTCSFCGK   44 (152)
Q Consensus        26 ~~~~~~~~~~~~~c~~c~~   44 (152)
                      +.++...++..+.||.|-+
T Consensus        11 ~~~~~~~~r~~~~C~rCq~   29 (30)
T PF06827_consen   11 IEDIGINGRSTYLCPRCQK   29 (30)
T ss_dssp             BEEEEETTEEEEE-TTTCC
T ss_pred             ceEeEecCCCCeECcCCcC
Confidence            4456667788999999954


No 291
>PF01927 Mut7-C:  Mut7-C RNAse domain;  InterPro: IPR002782 This prokaryotic family of proteins have no known function. The proteins contain four conserved cysteines that may be involved in metal binding or disulphide bridges.
Probab=26.18  E-value=29  Score=24.92  Aligned_cols=11  Identities=45%  Similarity=1.020  Sum_probs=8.2

Q ss_pred             ccccCCCCccc
Q psy838           36 KYTCSFCGKEY   46 (152)
Q Consensus        36 ~~~c~~c~~~~   46 (152)
                      =+.||.|||.|
T Consensus       124 f~~C~~C~kiy  134 (147)
T PF01927_consen  124 FWRCPGCGKIY  134 (147)
T ss_pred             EEECCCCCCEe
Confidence            46688888874


No 292
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=26.07  E-value=27  Score=20.72  Aligned_cols=12  Identities=25%  Similarity=0.908  Sum_probs=7.5

Q ss_pred             ccccccCCCCcc
Q psy838           34 HAKYTCSFCGKE   45 (152)
Q Consensus        34 ~~~~~c~~c~~~   45 (152)
                      .....||.||..
T Consensus        22 ~~~irCp~Cg~r   33 (49)
T COG1996          22 TRGIRCPYCGSR   33 (49)
T ss_pred             cCceeCCCCCcE
Confidence            345567777765


No 293
>PF12387 Peptidase_C74:  Pestivirus NS2 peptidase;  InterPro: IPR022120  The pestivirus NS2 peptidase is responsible for single cleavage between NS2 and NS3 of the Bovine viral diarrhea virus 1 polyprotein, a cleavage that is correlated with cytopathogenicity []. The peptidase is activated by its interaction with 'J-domain protein interacting with viral protein' - Jiv. ; GO: 0003968 RNA-directed RNA polymerase activity, 0004197 cysteine-type endopeptidase activity, 0004252 serine-type endopeptidase activity, 0016817 hydrolase activity, acting on acid anhydrides, 0017111 nucleoside-triphosphatase activity, 0070008 serine-type exopeptidase activity
Probab=25.99  E-value=29  Score=26.22  Aligned_cols=17  Identities=24%  Similarity=0.466  Sum_probs=13.2

Q ss_pred             hhhhcccccccCCCCcc
Q psy838           29 MEITQHAKYTCSFCGKE   45 (152)
Q Consensus        29 ~~~~~~~~~~c~~c~~~   45 (152)
                      +|..+...-+||.||+.
T Consensus       168 Ce~r~w~g~~CPKCGr~  184 (200)
T PF12387_consen  168 CEGREWKGGNCPKCGRH  184 (200)
T ss_pred             eecCccCCCCCCcccCC
Confidence            36666677789999986


No 294
>PF03811 Zn_Tnp_IS1:  InsA N-terminal domain;  InterPro: IPR003220 Insertion elements are mobile elements in DNA, usually encoding proteins required for transposition, for example transposases. Protein InsA is absolutely required for transposition of insertion element 1. This entry represents a short zinc binding domain found in IS1 InsA family protein. It is found at the N terminus of the protein and may be a DNA-binding domain.; GO: 0006313 transposition, DNA-mediated
Probab=25.67  E-value=34  Score=18.82  Aligned_cols=9  Identities=22%  Similarity=0.659  Sum_probs=6.9

Q ss_pred             cccCCCCcc
Q psy838           37 YTCSFCGKE   45 (152)
Q Consensus        37 ~~c~~c~~~   45 (152)
                      -.||+|+..
T Consensus         6 v~CP~C~s~   14 (36)
T PF03811_consen    6 VHCPRCQST   14 (36)
T ss_pred             eeCCCCCCC
Confidence            469999865


No 295
>TIGR03830 CxxCG_CxxCG_HTH putative zinc finger/helix-turn-helix protein, YgiT family. This model describes a family of predicted regulatory proteins with a conserved zinc finger/HTH architecture. The amino-terminal region contains a novel domain, featuring two CXXC motifs and occuring in a number of small bacterial proteins as well as in the present family. The carboxyl-terminal region consists of a helix-turn-helix domain, modeled by pfam01381. The predicted function is DNA binding and transcriptional regulation.
Probab=25.60  E-value=1.3e+02  Score=20.39  Aligned_cols=14  Identities=29%  Similarity=0.769  Sum_probs=9.7

Q ss_pred             cccccCCCCccccc
Q psy838           35 AKYTCSFCGKEYKK   48 (152)
Q Consensus        35 ~~~~c~~c~~~~~~   48 (152)
                      .-+.||.||-.+.+
T Consensus        30 ~~~~C~~CGe~~~~   43 (127)
T TIGR03830        30 PGWYCPACGEELLD   43 (127)
T ss_pred             eeeECCCCCCEEEc
Confidence            45689999965443


No 296
>smart00714 LITAF Possible membrane-associated motif in LPS-induced tumor necrosis factor alpha factor (LITAF), also known as PIG7, and other animal proteins.
Probab=25.58  E-value=34  Score=21.08  Aligned_cols=15  Identities=20%  Similarity=0.386  Sum_probs=11.6

Q ss_pred             hhcccccccCCCCcc
Q psy838           31 ITQHAKYTCSFCGKE   45 (152)
Q Consensus        31 ~~~~~~~~c~~c~~~   45 (152)
                      .-+...|.||.||+.
T Consensus        47 ~~kd~~H~Cp~C~~~   61 (67)
T smart00714       47 SFKDVNHYCPNCGAF   61 (67)
T ss_pred             cccCccEECCCCCCE
Confidence            345668999999976


No 297
>COG1885 Uncharacterized protein conserved in archaea [Function unknown]
Probab=25.53  E-value=33  Score=23.72  Aligned_cols=10  Identities=30%  Similarity=0.903  Sum_probs=8.2

Q ss_pred             ccccCCCCcc
Q psy838           36 KYTCSFCGKE   45 (152)
Q Consensus        36 ~~~c~~c~~~   45 (152)
                      -+.||.||.+
T Consensus        49 ~t~CP~Cg~~   58 (115)
T COG1885          49 STSCPKCGEP   58 (115)
T ss_pred             cccCCCCCCc
Confidence            4679999987


No 298
>PRK12496 hypothetical protein; Provisional
Probab=25.29  E-value=31  Score=25.46  Aligned_cols=11  Identities=36%  Similarity=0.899  Sum_probs=8.1

Q ss_pred             cccCCCCcccc
Q psy838           37 YTCSFCGKEYK   47 (152)
Q Consensus        37 ~~c~~c~~~~~   47 (152)
                      -.||.||.+..
T Consensus       144 ~~C~~CG~~~~  154 (164)
T PRK12496        144 DVCEICGSPVK  154 (164)
T ss_pred             CcCCCCCChhh
Confidence            45999998743


No 299
>COG1645 Uncharacterized Zn-finger containing protein [General function prediction only]
Probab=25.19  E-value=30  Score=24.82  Aligned_cols=10  Identities=30%  Similarity=0.783  Sum_probs=7.5

Q ss_pred             ccccCCCCcc
Q psy838           36 KYTCSFCGKE   45 (152)
Q Consensus        36 ~~~c~~c~~~   45 (152)
                      .+.||.||.+
T Consensus        28 ~~hCp~Cg~P   37 (131)
T COG1645          28 AKHCPKCGTP   37 (131)
T ss_pred             HhhCcccCCc
Confidence            4668888877


No 300
>PRK13361 molybdenum cofactor biosynthesis protein A; Provisional
Probab=25.15  E-value=58  Score=26.44  Aligned_cols=45  Identities=18%  Similarity=0.525  Sum_probs=31.0

Q ss_pred             ccchhHhHhHHhhhhhcccccccCCCCccc----ccCCCCCHHHHHHHHHH
Q psy838           17 RYGASLRKMVKKMEITQHAKYTCSFCGKEY----KKSTDLGAETKKWVWEL   63 (152)
Q Consensus        17 ryg~~~rk~~~~~~~~~~~~~~c~~c~~~~----~~~~~l~~~d~~~i~~l   63 (152)
                      +||+.+.  +-.++++..=++.|++|-..-    .....++.+++..+.+.
T Consensus         8 ~~gr~i~--~l~i~iT~~CNl~C~yC~~~~~~~~~~~~~ls~eei~~li~~   56 (329)
T PRK13361          8 SFGRTVT--YLRLSVTDRCDFRCVYCMSEDPCFLPRDQVLSLEELAWLAQA   56 (329)
T ss_pred             CCCCccC--eEEEEecCCccccCCCCCCCCCCcCCccCCCCHHHHHHHHHH
Confidence            5666544  556888999999999996541    22346777777776654


No 301
>PF07315 DUF1462:  Protein of unknown function (DUF1462);  InterPro: IPR009190 There are currently no experimental data for members of this group of bacterial proteins or their homologues. A crystal structure of Q7A6J8 from SWISSPROT revealed a thioredoxin-like fold, its core consisting of three layers alpha/beta/alpha.; PDB: 1XG8_A.
Probab=25.01  E-value=94  Score=20.92  Aligned_cols=62  Identities=18%  Similarity=0.249  Sum_probs=31.4

Q ss_pred             ccccCCCCcccccCCCCCHHHHHHHHHHHHHhHHHHHHhCCC---------CCCh---hhhhhhhcCCCceEEEEEeCCC
Q psy838           36 KYTCSFCGKEYKKSTDLGAETKKWVWELEEKNMKHSYEVCDI---------GWDP---QGKHSEMFDDRACYLVAKNGSS  103 (152)
Q Consensus        36 ~~~c~~c~~~~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~---------~w~~---~~~~~el~~~~~~~~v~~~~~~  103 (152)
                      .-.|+.|=     ..+-+.+..+|+-.+...    -|....+         +-.+   ..+.+.+.+++..|=++.-+  
T Consensus         6 e~~CASCV-----n~PsSkeTyeWL~aal~R----Kyp~~~f~~~YiDi~~p~~~~~~~~~a~~I~ede~fYPlV~i~--   74 (93)
T PF07315_consen    6 EVICASCV-----NAPSSKETYEWLEAALKR----KYPDQPFEFTYIDIENPPENDHDQQFAERILEDELFYPLVVIN--   74 (93)
T ss_dssp             SS--GGGS-----SS--HHHHHHHHHHHHHH----H-TTS-EEEEEEETTT----HHHHHHHHHHHTTSS-SSEEEET--
T ss_pred             cccchhhc-----CCCCchhHHHHHHHHHhC----cCCCCceEEEEEecCCCCccHHHHHHHHHHHhcccccceEEEC--
Confidence            44688883     455667779998876554    4654322         1111   13334456666666666655  


Q ss_pred             CCeEE
Q psy838          104 STPVA  108 (152)
Q Consensus       104 ~~ivG  108 (152)
                      |++||
T Consensus        75 ~eiV~   79 (93)
T PF07315_consen   75 DEIVA   79 (93)
T ss_dssp             TEEEE
T ss_pred             CEEEe
Confidence            88886


No 302
>PF14803 Nudix_N_2:  Nudix N-terminal; PDB: 3CNG_C.
Probab=24.98  E-value=34  Score=18.58  Aligned_cols=8  Identities=38%  Similarity=0.983  Sum_probs=3.8

Q ss_pred             ccCCCCcc
Q psy838           38 TCSFCGKE   45 (152)
Q Consensus        38 ~c~~c~~~   45 (152)
                      .||.||..
T Consensus         2 fC~~CG~~    9 (34)
T PF14803_consen    2 FCPQCGGP    9 (34)
T ss_dssp             B-TTT--B
T ss_pred             ccccccCh
Confidence            48888877


No 303
>COG2093 DNA-directed RNA polymerase, subunit E'' [Transcription]
Probab=24.97  E-value=35  Score=21.35  Aligned_cols=9  Identities=33%  Similarity=0.992  Sum_probs=7.0

Q ss_pred             cccCCCCcc
Q psy838           37 YTCSFCGKE   45 (152)
Q Consensus        37 ~~c~~c~~~   45 (152)
                      -.||.||..
T Consensus        19 e~CP~Cgs~   27 (64)
T COG2093          19 EICPVCGST   27 (64)
T ss_pred             ccCCCCCCc
Confidence            359999965


No 304
>PF10609 ParA:  ParA/MinD ATPase like;  InterPro: IPR019591  This entry represents ATPases involved in plasmid partitioning []. It also contains cytosolic Fe-S cluster assembling factors, NBP35 and CFD1 which are required for biogenesis and export of both ribosomal subunits probably through assembling the ISCs in RLI1, a protein which performs rRNA processing and ribosome export [, , ].; PDB: 2PH1_A 3KB1_B.
Probab=24.95  E-value=28  Score=22.78  Aligned_cols=13  Identities=23%  Similarity=0.897  Sum_probs=7.1

Q ss_pred             cccccccCCCCcc
Q psy838           33 QHAKYTCSFCGKE   45 (152)
Q Consensus        33 ~~~~~~c~~c~~~   45 (152)
                      .-+-|.||.||+.
T Consensus        62 NMs~~~Cp~Cg~~   74 (81)
T PF10609_consen   62 NMSYFVCPHCGER   74 (81)
T ss_dssp             CT-EEE-TTT--E
T ss_pred             CCCccCCCCCCCe
Confidence            3467899999987


No 305
>PF10915 DUF2709:  Protein of unknown function (DUF2709);  InterPro: IPR024484 Members of this family appear restricted to Chlamydiales. Their function is unknown.
Probab=24.90  E-value=64  Score=24.89  Aligned_cols=32  Identities=25%  Similarity=0.534  Sum_probs=23.8

Q ss_pred             hhcccccccCCCCcccccCCCCCHHH--HHHHHH
Q psy838           31 ITQHAKYTCSFCGKEYKKSTDLGAET--KKWVWE   62 (152)
Q Consensus        31 ~~~~~~~~c~~c~~~~~~~~~l~~~d--~~~i~~   62 (152)
                      +..+.-|-|||-||.+-...-..++|  .+|+-.
T Consensus        82 E~TkkIYICPFTGKVF~DNt~~nPQDAIYDWvSk  115 (238)
T PF10915_consen   82 EQTKKIYICPFTGKVFGDNTHPNPQDAIYDWVSK  115 (238)
T ss_pred             cccceEEEcCCcCccccCCCCCChHHHHHHHHhh
Confidence            44566799999999988887777777  344443


No 306
>COG1029 FwdB Formylmethanofuran dehydrogenase subunit B [Energy production and conversion]
Probab=24.78  E-value=31  Score=29.21  Aligned_cols=11  Identities=36%  Similarity=1.017  Sum_probs=9.0

Q ss_pred             cccccCCCCcc
Q psy838           35 AKYTCSFCGKE   45 (152)
Q Consensus        35 ~~~~c~~c~~~   45 (152)
                      .+..||+||..
T Consensus         4 k~~vCp~CG~l   14 (429)
T COG1029           4 KNVVCPFCGTL   14 (429)
T ss_pred             eeEEcCccccc
Confidence            46789999987


No 307
>KOG1491|consensus
Probab=24.59  E-value=64  Score=27.23  Aligned_cols=24  Identities=17%  Similarity=0.439  Sum_probs=19.1

Q ss_pred             Eeeee--eccccccCCHHHHHHHHHH
Q psy838          125 CYELQ--LEKQVQRKGLGKFMMQAFY  148 (152)
Q Consensus       125 I~~i~--V~p~~qgkGiG~~Lm~~l~  148 (152)
                      |++|+  |.-+.+|+|+|...++.+-
T Consensus        88 v~DIAGLvkGAs~G~GLGN~FLs~iR  113 (391)
T KOG1491|consen   88 VYDIAGLVKGASAGEGLGNKFLSHIR  113 (391)
T ss_pred             EEeecccccCcccCcCchHHHHHhhh
Confidence            45555  7788999999999998764


No 308
>PF06397 Desulfoferrod_N:  Desulfoferrodoxin, N-terminal domain;  InterPro: IPR004462 This domain is found as essentially the full length of desulforedoxin, a 37-residue homodimeric non-haem iron protein. It is also found as the N-terminal domain of desulfoferrodoxin (rbo), a homodimeric non-haem iron protein with 2 Fe atoms per monomer in different oxidation states. This domain binds the ferric rather than the ferrous Fe of desulfoferrodoxin. Neelaredoxin, a monomeric blue non-haem iron protein, lacks this domain.; GO: 0005506 iron ion binding; PDB: 1DFX_A 1VZI_B 2JI2_D 1VZH_B 2JI3_C 2JI1_C 1VZG_A 1CFW_A 2LK5_B 1DHG_B ....
Probab=24.58  E-value=30  Score=19.12  Aligned_cols=12  Identities=33%  Similarity=0.861  Sum_probs=7.1

Q ss_pred             ccccccCCCCcc
Q psy838           34 HAKYTCSFCGKE   45 (152)
Q Consensus        34 ~~~~~c~~c~~~   45 (152)
                      ..-|.|..||..
T Consensus         4 ~~~YkC~~CGni   15 (36)
T PF06397_consen    4 GEFYKCEHCGNI   15 (36)
T ss_dssp             TEEEE-TTT--E
T ss_pred             ccEEEccCCCCE
Confidence            456899999976


No 309
>PF04564 U-box:  U-box domain;  InterPro: IPR003613 Quality control of intracellular proteins is essential for cellular homeostasis. Molecular chaperones recognise and contribute to the refolding of misfolded or unfolded proteins, whereas the ubiquitin-proteasome system mediates the degradation of such abnormal proteins. Ubiquitin-protein ligases (E3s) determine the substrate specificity for ubiquitylation and have been classified into HECT and RING-finger families. More recently, however, U-box proteins, which contain a domain (the U box) of about 70 amino acids that is conserved from yeast to humans, have been identified as a new type of E3 []. Members of the U-box family of proteins constitute a class of ubiquitin-protein ligases (E3s) distinct from the HECT-type and RING finger-containing E3 families []. Using yeast two-hybrid technology, all mammalian U-box proteins have been reported to interact with molecular chaperones or co-chaperones, including Hsp90, Hsp70, DnaJc7, EKN1, CRN, and VCP. This suggests that the function of U box-type E3s is to mediate the degradation of unfolded or misfolded proteins in conjunction with molecular chaperones as receptors that recognise such abnormal proteins [, ]. Unlike the RING finger domain, IPR001841 from INTERPRO, that is stabilised by Zn2+ ions coordinated by the cysteines and a histidine, the U-box scaffold is probably stabilised by a system of salt-bridges and hydrogen bonds. The charged and polar residues that participate in this network of bonds are more strongly conserved in the U-box proteins than in classic RING fingers, which supports their role in maintaining the stability of the U box. Thus, the U box appears to have evolved from a RING finger domain by appropriation of a new set of residues required to stabilise its structure, concomitant with the loss of the original, metal-chelating residues [].; GO: 0004842 ubiquitin-protein ligase activity, 0016567 protein ubiquitination, 0000151 ubiquitin ligase complex; PDB: 1T1H_A 2C2L_D 2C2V_V 1WGM_A 2KR4_A 3L1Z_B 3L1X_A 2KRE_A 3M63_A 2QIZ_A ....
Probab=24.45  E-value=99  Score=19.27  Aligned_cols=27  Identities=15%  Similarity=-0.018  Sum_probs=15.3

Q ss_pred             cccccCCCCcccccCCCCCHHHHHHHH
Q psy838           35 AKYTCSFCGKEYKKSTDLGAETKKWVW   61 (152)
Q Consensus        35 ~~~~c~~c~~~~~~~~~l~~~d~~~i~   61 (152)
                      ...+||+|+..+.....+....+....
T Consensus        38 ~~~~~P~t~~~l~~~~l~pn~~Lk~~I   64 (73)
T PF04564_consen   38 NGGTDPFTRQPLSESDLIPNRALKSAI   64 (73)
T ss_dssp             TSSB-TTT-SB-SGGGSEE-HHHHHHH
T ss_pred             CCCCCCCCCCcCCcccceECHHHHHHH
Confidence            478999998877666555555555444


No 310
>PF09723 Zn-ribbon_8:  Zinc ribbon domain;  InterPro: IPR013429  This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=24.43  E-value=26  Score=19.72  Aligned_cols=10  Identities=50%  Similarity=1.295  Sum_probs=7.1

Q ss_pred             ccccCCCCcc
Q psy838           36 KYTCSFCGKE   45 (152)
Q Consensus        36 ~~~c~~c~~~   45 (152)
                      .|.|+.||..
T Consensus         5 ey~C~~Cg~~   14 (42)
T PF09723_consen    5 EYRCEECGHE   14 (42)
T ss_pred             EEEeCCCCCE
Confidence            4678888766


No 311
>PRK14852 hypothetical protein; Provisional
Probab=24.32  E-value=6.5e+02  Score=24.30  Aligned_cols=88  Identities=13%  Similarity=0.088  Sum_probs=53.8

Q ss_pred             CCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcC-----CCceEEEEEeCCCCCeEEEEEEEeecc---------
Q psy838           52 LGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFD-----DRACYLVAKNGSSSTPVAFSHFRFDVD---------  117 (152)
Q Consensus        52 l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~-----~~~~~~v~~~~~~~~ivGf~~~~~~~~---------  117 (152)
                      -++++.+.++.|...    .|-.+  |+..+.......+     |.+..+++...  +.++|-+.+.++..         
T Consensus        35 et~~e~~~~~~L~~~----~Y~~~--Gy~~~~ps~~~~~~~~~lp~t~~~i~k~~--~~~l~T~t~~~ds~~~Gl~~D~l  106 (989)
T PRK14852         35 ETPDEYTRAFRLVYE----EYIRS--GYLKPHPSRMYYNVWSILPATSVFIFKSY--HDVLCTLTHIPDSGLFGLPMDTL  106 (989)
T ss_pred             CCHHHHHHHHHHHHH----HHHHc--CCCCcCcccccCCccccCCcceEEEeccC--CcEEEEEEEecCCcccCcCHHHH
Confidence            367889999999765    56543  3443333333322     34444554433  66777776655422         


Q ss_pred             ----------CCcceEEEeeeeeccccccCCHHHHHHHHH
Q psy838          118 ----------FGEPVLYCYELQLEKQVQRKGLGKFMMQAF  147 (152)
Q Consensus       118 ----------~~~~~~~I~~i~V~p~~qgkGiG~~Lm~~l  147 (152)
                                .|..++.+..|+++|+.+.+-+=-.|++.+
T Consensus       107 f~~eLd~lr~~Gr~v~EvtrLa~d~~~~~~~l~~~l~~~~  146 (989)
T PRK14852        107 YKPEVDALRAQGRNVVEVGALATQYSRRWTNLMVFLAKAM  146 (989)
T ss_pred             HHHHHHHHHHcCCeEEeeehheechhhcccchhHHHHHHH
Confidence                      256789999999998888766544555554


No 312
>PF12756 zf-C2H2_2:  C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=24.07  E-value=38  Score=21.63  Aligned_cols=13  Identities=23%  Similarity=1.147  Sum_probs=9.8

Q ss_pred             cccccCCCCcccc
Q psy838           35 AKYTCSFCGKEYK   47 (152)
Q Consensus        35 ~~~~c~~c~~~~~   47 (152)
                      ..+.|+.|++.+.
T Consensus        49 ~~~~C~~C~~~f~   61 (100)
T PF12756_consen   49 ESFRCPYCNKTFR   61 (100)
T ss_dssp             SSEEBSSSS-EES
T ss_pred             CCCCCCccCCCCc
Confidence            3799999998855


No 313
>COG5319 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=24.07  E-value=34  Score=24.56  Aligned_cols=13  Identities=31%  Similarity=0.825  Sum_probs=10.4

Q ss_pred             cccccccCCCCcc
Q psy838           33 QHAKYTCSFCGKE   45 (152)
Q Consensus        33 ~~~~~~c~~c~~~   45 (152)
                      .+...+||.||..
T Consensus        29 ~rgLv~CPvCgs~   41 (142)
T COG5319          29 ERGLVTCPVCGST   41 (142)
T ss_pred             HcCceeCCCCCcH
Confidence            3456789999988


No 314
>PF08348 PAS_6:  YheO-like PAS domain;  InterPro: IPR013559 This domain is found in various hypothetical bacterial proteins that are similar to the Escherichia coli protein YheO (P64624 from SWISSPROT). Their function is unknown, but a few members are annotated as being HTH-containing proteins and putative DNA-binding proteins. 
Probab=23.87  E-value=1.3e+02  Score=20.81  Aligned_cols=23  Identities=9%  Similarity=0.251  Sum_probs=16.6

Q ss_pred             eEEEEEeCCCCCeEEEEEEEeecc
Q psy838           94 CYLVAKNGSSSTPVAFSHFRFDVD  117 (152)
Q Consensus        94 ~~~v~~~~~~~~ivGf~~~~~~~~  117 (152)
                      ..+++.++ +|+++|.++..++.+
T Consensus        85 sT~~Ird~-~g~~iG~LCIN~D~s  107 (118)
T PF08348_consen   85 STFFIRDE-NGKLIGALCINFDIS  107 (118)
T ss_pred             EEEEEECC-CCCEEEEEEEEeccH
Confidence            34555554 689999999998644


No 315
>PF09706 Cas_CXXC_CXXC:  CRISPR-associated protein (Cas_CXXC_CXXC);  InterPro: IPR019121 Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are a family of DNA direct repeats separated by regularly sized non-repetitive spacer sequences that are found in most bacterial and archaeal genomes []. CRISPRs appear to provide acquired resistance against bacteriophages, possibly acting with an RNA interference-like mechanism to inhibit gene functions of invasive DNA elements [, ]. Differences in the number and type of spacers between CRISPR repeats correlate with phage sensitivity. It is thought that following phage infection, bacteria integrate new spacers derived from phage genomic sequences, and that the removal or addition of particular spacers modifies the phage-resistance phenotype of the cell. Therefore, the specificity of CRISPRs may be determined by spacer-phage sequence similarity. In addition, there are many protein families known as CRISPR-associated sequences (Cas), which are encoded in the vicinity of CRISPR loci []. CRISPR/cas gene regions can be quite large, with up to 20 different, tandem-arranged cas genes next to a CRISPR cluster or filling the region between two repeat clusters. Cas genes and CRISPRs are found on mobile genetic elements such as plasmids, and have undergone extensive horizontal transfer. Cas proteins are thought to be involved in the propagation and functioning of CRISPRs. Some Cas proteins show similarity to helicases and repair proteins, although the functions of most are unknown. Cas families can be divided into subtypes according to operon organisation and phylogeny.  This entry represents a conserved domain of about 65 amino acids found in otherwise highly divergent proteins encoded in CRISPR-associated regions. This domain features two CXXC motifs. 
Probab=23.86  E-value=38  Score=21.33  Aligned_cols=13  Identities=46%  Similarity=1.237  Sum_probs=10.7

Q ss_pred             cccccccCCCCcc
Q psy838           33 QHAKYTCSFCGKE   45 (152)
Q Consensus        33 ~~~~~~c~~c~~~   45 (152)
                      +..++.|-+||..
T Consensus         2 ~k~~~~C~~Cg~r   14 (69)
T PF09706_consen    2 SKKKYNCIFCGER   14 (69)
T ss_pred             CCCCCcCcCCCCc
Confidence            5679999999954


No 316
>COG4694 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=23.85  E-value=31  Score=30.91  Aligned_cols=10  Identities=60%  Similarity=1.408  Sum_probs=8.1

Q ss_pred             ccccCCCCcc
Q psy838           36 KYTCSFCGKE   45 (152)
Q Consensus        36 ~~~c~~c~~~   45 (152)
                      .--|||||+.
T Consensus       278 ~q~CpFCg~e  287 (758)
T COG4694         278 NQICPFCGKE  287 (758)
T ss_pred             CCCCCccchH
Confidence            5679999965


No 317
>COG0375 HybF Zn finger protein HypA/HybF (possibly regulating hydrogenase expression) [General function prediction only]
Probab=23.80  E-value=39  Score=23.72  Aligned_cols=10  Identities=40%  Similarity=1.072  Sum_probs=7.8

Q ss_pred             ccccCCCCcc
Q psy838           36 KYTCSFCGKE   45 (152)
Q Consensus        36 ~~~c~~c~~~   45 (152)
                      .|.||.||..
T Consensus        86 ~~~CP~C~s~   95 (115)
T COG0375          86 DYRCPKCGSI   95 (115)
T ss_pred             eeECCCCCCC
Confidence            4559999976


No 318
>COG5270 PUA domain (predicted RNA-binding domain) [Translation, ribosomal structure and biogenesis]
Probab=23.67  E-value=1e+02  Score=23.66  Aligned_cols=33  Identities=9%  Similarity=0.136  Sum_probs=19.2

Q ss_pred             eEEEEEeCCCCCeEEEEEEEeeccCCcceEEEeeeeeccccccCCH
Q psy838           94 CYLVAKNGSSSTPVAFSHFRFDVDFGEPVLYCYELQLEKQVQRKGL  139 (152)
Q Consensus        94 ~~~v~~~~~~~~ivGf~~~~~~~~~~~~~~~I~~i~V~p~~qgkGi  139 (152)
                      +++++..  ||+++|+....-+.+          ..+.|+ ||+|+
T Consensus       164 ~viVv~~--ng~~vGVg~a~~~~~----------~~in~~-rG~~v  196 (202)
T COG5270         164 EVIVVSE--NGRVVGVGIAKKSYE----------ELINPE-RGTGV  196 (202)
T ss_pred             eEEEEec--CCEEEEEEEEecCHH----------HhcCcc-cCccc
Confidence            3444444  488888877654432          335666 66653


No 319
>PF03982 DAGAT:  Diacylglycerol acyltransferase ;  InterPro: IPR007130 The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT) [, ].; GO: 0016747 transferase activity, transferring acyl groups other than amino-acyl groups
Probab=23.67  E-value=1.1e+02  Score=24.84  Aligned_cols=35  Identities=11%  Similarity=0.096  Sum_probs=27.6

Q ss_pred             CCcc--cccCCCCCHHHHHHHHHHHHHhHHHHHHhCC
Q psy838           42 CGKE--YKKSTDLGAETKKWVWELEEKNMKHSYEVCD   76 (152)
Q Consensus        42 c~~~--~~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~   76 (152)
                      .|++  +.....-+++|++.+.+...+.++++|+++.
T Consensus       247 VG~PI~v~~~~~Pt~e~Vd~~H~~Y~~~L~~LFd~~K  283 (297)
T PF03982_consen  247 VGKPIPVPKIENPTQEDVDKLHARYIEALRELFDKHK  283 (297)
T ss_pred             eeceecccCCCCcCHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4444  4445566899999999999999999998864


No 320
>PRK14891 50S ribosomal protein L24e/unknown domain fusion protein; Provisional
Probab=23.63  E-value=41  Score=24.08  Aligned_cols=14  Identities=29%  Similarity=0.693  Sum_probs=10.6

Q ss_pred             ccccCCCCcccccC
Q psy838           36 KYTCSFCGKEYKKS   49 (152)
Q Consensus        36 ~~~c~~c~~~~~~~   49 (152)
                      ..+|.|||.++.+.
T Consensus         4 ~e~CsFcG~kIyPG   17 (131)
T PRK14891          4 TRTCDYTGEEIEPG   17 (131)
T ss_pred             eeeecCcCCcccCC
Confidence            45799999985554


No 321
>PF13719 zinc_ribbon_5:  zinc-ribbon domain
Probab=23.62  E-value=38  Score=18.48  Aligned_cols=9  Identities=33%  Similarity=0.800  Sum_probs=6.9

Q ss_pred             cccCCCCcc
Q psy838           37 YTCSFCGKE   45 (152)
Q Consensus        37 ~~c~~c~~~   45 (152)
                      -+||.|++.
T Consensus         3 i~CP~C~~~   11 (37)
T PF13719_consen    3 ITCPNCQTR   11 (37)
T ss_pred             EECCCCCce
Confidence            368999876


No 322
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=23.48  E-value=31  Score=27.29  Aligned_cols=39  Identities=21%  Similarity=0.416  Sum_probs=22.6

Q ss_pred             eeeeccccccccc----hhHhHhHHhhhhhcccccccCCCCccc
Q psy838            7 KVGITGKYGTRYG----ASLRKMVKKMEITQHAKYTCSFCGKEY   46 (152)
Q Consensus         7 ~~~~~~~~g~ryg----~~~rk~~~~~~~~~~~~~~c~~c~~~~   46 (152)
                      .+|++.==|-+=|    .--+...++|-. .-.--+||.||+..
T Consensus       189 g~gvvpl~g~~C~GC~m~l~~~~~~~V~~-~d~iv~CP~CgRIL  231 (239)
T COG1579         189 GVGVVPLEGRVCGGCHMKLPSQTLSKVRK-KDEIVFCPYCGRIL  231 (239)
T ss_pred             CceEEeecCCcccCCeeeecHHHHHHHhc-CCCCccCCccchHH
Confidence            4555544443333    223444555544 66677899999873


No 323
>KOG4602|consensus
Probab=23.30  E-value=33  Score=27.59  Aligned_cols=13  Identities=31%  Similarity=0.958  Sum_probs=10.1

Q ss_pred             cccccccCCCCcc
Q psy838           33 QHAKYTCSFCGKE   45 (152)
Q Consensus        33 ~~~~~~c~~c~~~   45 (152)
                      -...|+||.||.-
T Consensus       265 kLR~YVCPiCGAT  277 (318)
T KOG4602|consen  265 KLRSYVCPICGAT  277 (318)
T ss_pred             hHhhhcCcccccc
Confidence            3447999999965


No 324
>PF01396 zf-C4_Topoisom:  Topoisomerase DNA binding C4 zinc finger;  InterPro: IPR013498 DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks []. These enzymes have several functions: to remove DNA supercoils during transcription and DNA replication; for strand breakage during recombination; for chromosome condensation; and to disentangle intertwined DNA during mitosis [, ]. DNA topoisomerases are divided into two classes: type I enzymes (5.99.1.2 from EC; topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC; topoisomerases II, IV and VI) break double-strand DNA []. Type I topoisomerases are ATP-independent enzymes (except for reverse gyrase), and can be subdivided according to their structure and reaction mechanisms: type IA (bacterial and archaeal topoisomerase I, topoisomerase III and reverse gyrase) and type IB (eukaryotic topoisomerase I and topoisomerase V). These enzymes are primarily responsible for relaxing positively and/or negatively supercoiled DNA, except for reverse gyrase, which can introduce positive supercoils into DNA.  This entry represents the zinc-finger domain found in type IA topoisomerases, including bacterial and archaeal topoisomerase I and III enzymes, and in eukaryotic topoisomerase III enzymes. Escherichia coli topoisomerase I proteins contain five copies of a zinc-ribbon-like domain at their C terminus, two of which have lost their cysteine residues and are therefore probably not able to bind zinc []. This domain is still considered to be a member of the zinc-ribbon superfamily despite not being able to bind zinc. More information about this protein can be found at Protein of the Month: DNA Topoisomerase [].; GO: 0003677 DNA binding, 0003916 DNA topoisomerase activity, 0006265 DNA topological change, 0005694 chromosome
Probab=23.26  E-value=39  Score=18.71  Aligned_cols=8  Identities=38%  Similarity=1.049  Sum_probs=6.4

Q ss_pred             ccCCCCcc
Q psy838           38 TCSFCGKE   45 (152)
Q Consensus        38 ~c~~c~~~   45 (152)
                      .||.||..
T Consensus         3 ~CP~Cg~~   10 (39)
T PF01396_consen    3 KCPKCGGP   10 (39)
T ss_pred             CCCCCCce
Confidence            58999866


No 325
>PF11023 DUF2614:  Protein of unknown function (DUF2614);  InterPro: IPR020912 This entry describes proteins of unknown function, which are thought to be membrane proteins.; GO: 0005887 integral to plasma membrane
Probab=23.05  E-value=49  Score=23.14  Aligned_cols=9  Identities=44%  Similarity=1.132  Sum_probs=5.7

Q ss_pred             cccCCCCcc
Q psy838           37 YTCSFCGKE   45 (152)
Q Consensus        37 ~~c~~c~~~   45 (152)
                      ..||.|||.
T Consensus        70 V~CP~C~K~   78 (114)
T PF11023_consen   70 VECPNCGKQ   78 (114)
T ss_pred             eECCCCCCh
Confidence            347777666


No 326
>PF12677 DUF3797:  Domain of unknown function (DUF3797);  InterPro: IPR024256 This presumed domain is functionally uncharacterised. This domain family is found in bacteria and viruses, and is approximately 50 amino acids in length. There is a conserved CGN sequence motif.
Probab=23.05  E-value=39  Score=20.00  Aligned_cols=9  Identities=33%  Similarity=1.043  Sum_probs=7.4

Q ss_pred             ccCCCCccc
Q psy838           38 TCSFCGKEY   46 (152)
Q Consensus        38 ~c~~c~~~~   46 (152)
                      .||.||.+.
T Consensus        15 ~Cp~CGN~~   23 (49)
T PF12677_consen   15 KCPKCGNDK   23 (49)
T ss_pred             cCcccCCcE
Confidence            599999873


No 327
>PF06170 DUF983:  Protein of unknown function (DUF983);  InterPro: IPR009325 This family consists of several bacterial proteins of unknown function.
Probab=23.05  E-value=47  Score=21.92  Aligned_cols=23  Identities=22%  Similarity=0.559  Sum_probs=14.4

Q ss_pred             ccccCCCCcccccCCCCCHHHHHHHH
Q psy838           36 KYTCSFCGKEYKKSTDLGAETKKWVW   61 (152)
Q Consensus        36 ~~~c~~c~~~~~~~~~l~~~d~~~i~   61 (152)
                      .-.|+.||-++....   .+|-+..+
T Consensus         8 ~~~C~~CG~d~~~~~---adDgPA~f   30 (86)
T PF06170_consen    8 APRCPHCGLDYSHAR---ADDGPAYF   30 (86)
T ss_pred             CCcccccCCccccCC---cCccchhH
Confidence            457999998855443   34444444


No 328
>PLN02294 cytochrome c oxidase subunit Vb
Probab=23.00  E-value=38  Score=25.46  Aligned_cols=13  Identities=15%  Similarity=0.825  Sum_probs=10.2

Q ss_pred             cccccCCCCcccc
Q psy838           35 AKYTCSFCGKEYK   47 (152)
Q Consensus        35 ~~~~c~~c~~~~~   47 (152)
                      ..+.||.||..+.
T Consensus       140 kp~RCpeCG~~fk  152 (174)
T PLN02294        140 KSFECPVCTQYFE  152 (174)
T ss_pred             CceeCCCCCCEEE
Confidence            4778999998843


No 329
>PF04376 ATE_N:  Arginine-tRNA-protein transferase, N terminus;  InterPro: IPR007471 Arginine-tRNA-protein transferase catalyses the post-translational conjugation of arginine to the N terminus of a protein. In eukaryotes, this functions as part of the N terminus rule pathway of protein degradation by conjugating a destabilising amino acid to the N-terminal aspartate or glutamate of a protein, targeting the protein for ubiquitin-dependent proteolysis. N-terminal cysteine is sometimes modified []. In Saccharomyces cerevisiae, Cys20, 23, 94 and/or 95 are thought to be important for activity []. Of these, only Cys 94 appears to be completely conserved in this family.  This entry represents the N-terminal region of the enzyme arginine-tRNA-protein transferase, found in both eukaryotic and prokaryotic enzymes.; GO: 0004057 arginyltransferase activity, 0016598 protein arginylation
Probab=22.94  E-value=1.1e+02  Score=19.58  Aligned_cols=29  Identities=17%  Similarity=0.363  Sum_probs=20.1

Q ss_pred             ccccccCCCCcc------cc--cCCCCCHHHHHHHHH
Q psy838           34 HAKYTCSFCGKE------YK--KSTDLGAETKKWVWE   62 (152)
Q Consensus        34 ~~~~~c~~c~~~------~~--~~~~l~~~d~~~i~~   62 (152)
                      ..++.|+.|...      ..  +...++.++.+.+++
T Consensus         5 ~~~~~C~Y~~~~~~~~~~~~~~~~~~~~~~~y~~Ll~   41 (80)
T PF04376_consen    5 TQPSSCGYCPGRSARSSYTYGAPSESLSPEDYQQLLD   41 (80)
T ss_pred             CCCCcCCCCCCCccCEEEEeccccccCCHHHHHHHHH
Confidence            467899999522      22  577888877777664


No 330
>PF10815 ComZ:  ComZ;  InterPro: IPR024558 ComZ, which contains a leucine zipper motif, negatively regulates transcription of the ComG operon [].
Probab=22.80  E-value=1.3e+02  Score=18.22  Aligned_cols=26  Identities=27%  Similarity=0.360  Sum_probs=22.6

Q ss_pred             CCCCCHHHHHHHHHHHHHhHHHHHHh
Q psy838           49 STDLGAETKKWVWELEEKNMKHSYEV   74 (152)
Q Consensus        49 ~~~l~~~d~~~i~~l~~~~m~~~y~~   74 (152)
                      .-+++.+.+.-+++|....|.+.|+-
T Consensus        25 GIeLsme~~qP~m~L~~~VM~eAYEl   50 (56)
T PF10815_consen   25 GIELSMEMLQPLMQLLTKVMNEAYEL   50 (56)
T ss_pred             CccCCHHHHHHHHHHHHHHHHHHHHh
Confidence            46788999999999999999999953


No 331
>TIGR02668 moaA_archaeal probable molybdenum cofactor biosynthesis protein A, archaeal. This model describes an archaeal family related, and predicted to be functionally equivalent, to molybdenum cofactor biosynthesis protein A (MoaA) of bacteria (see TIGR02666).
Probab=22.63  E-value=71  Score=25.38  Aligned_cols=46  Identities=17%  Similarity=0.486  Sum_probs=31.7

Q ss_pred             ccchhHhHhHHhhhhhcccccccCCCCccccc---CCCCCHHHHHHHHHHH
Q psy838           17 RYGASLRKMVKKMEITQHAKYTCSFCGKEYKK---STDLGAETKKWVWELE   64 (152)
Q Consensus        17 ryg~~~rk~~~~~~~~~~~~~~c~~c~~~~~~---~~~l~~~d~~~i~~l~   64 (152)
                      +||+.+..  -.|+.+..=++.|.+|-.....   ...++.+++..+.+..
T Consensus         4 ~~gr~~~~--l~i~vT~~CNl~C~yC~~~~~~~~~~~~ls~eei~~~i~~~   52 (302)
T TIGR02668         4 RFGRPVTS--LRISVTDRCNLSCFYCHMEGEDRSGGNELSPEEIERIVRVA   52 (302)
T ss_pred             CCCCccCe--EEEEEcccccCCCCCCCccccCCCccCcCCHHHHHHHHHHH
Confidence            46666654  3678888889999999765322   3467888877766543


No 332
>PF14824 Sirohm_synth_M:  Sirohaem biosynthesis protein central; PDB: 1KYQ_B.
Probab=22.56  E-value=52  Score=17.45  Aligned_cols=19  Identities=21%  Similarity=0.293  Sum_probs=15.1

Q ss_pred             eccccccccchhHhHhHHh
Q psy838           10 ITGKYGTRYGASLRKMVKK   28 (152)
Q Consensus        10 ~~~~~g~ryg~~~rk~~~~   28 (152)
                      +++.=+||+.+.+|+.+..
T Consensus        10 STnG~sP~la~~iR~~ie~   28 (30)
T PF14824_consen   10 STNGKSPRLARLIRKEIER   28 (30)
T ss_dssp             EESSS-HHHHHHHHHHHHH
T ss_pred             ECCCCChHHHHHHHHHHHH
Confidence            5778899999999998764


No 333
>PF06677 Auto_anti-p27:  Sjogren's syndrome/scleroderma autoantigen 1 (Autoantigen p27);  InterPro: IPR009563 The proteins in this entry are functionally uncharacterised and include several proteins that characterise Sjogren's syndrome/scleroderma autoantigen 1 (Autoantigen p27). It is thought that the potential association of anti-p27 with anti-centromere antibodies suggests that autoantigen p27 might play a role in mitosis [].
Probab=22.08  E-value=46  Score=18.84  Aligned_cols=13  Identities=23%  Similarity=0.613  Sum_probs=9.5

Q ss_pred             ccccCCCCccccc
Q psy838           36 KYTCSFCGKEYKK   48 (152)
Q Consensus        36 ~~~c~~c~~~~~~   48 (152)
                      .-.||.||.+..+
T Consensus        17 ~~~Cp~C~~PL~~   29 (41)
T PF06677_consen   17 DEHCPDCGTPLMR   29 (41)
T ss_pred             cCccCCCCCeeEE
Confidence            3469999988544


No 334
>PF10601 zf-LITAF-like:  LITAF-like zinc ribbon domain;  InterPro: IPR006629 Members of this family display a conserved zinc ribbon structure [] with the motif C-XX-C- separated from the more C-terminal HX-C(P)X-C-X4-G-R motif by a variable region of usually 25-30 (hydrophobic) residues. Although it belongs to one of the zinc finger's fold groups (zinc ribbon), this particular domain was first identified in LPS-induced tumour necrosis alpha factor (LITAF) which is produced in mammalian cells after being challenged with lipopolysaccharide (LPS). The hydrophobic region probably inserts into the membrane rather than traversing it. Such an insertion brings together the N- and C-terminal C-XX-C motifs to form a compact Zn2+-binding structure []. 
Probab=22.03  E-value=43  Score=20.97  Aligned_cols=14  Identities=21%  Similarity=0.427  Sum_probs=11.1

Q ss_pred             hcccccccCCCCcc
Q psy838           32 TQHAKYTCSFCGKE   45 (152)
Q Consensus        32 ~~~~~~~c~~c~~~   45 (152)
                      -+...|.||.|++.
T Consensus        54 ~kd~~H~Cp~C~~~   67 (73)
T PF10601_consen   54 CKDVYHYCPNCGAF   67 (73)
T ss_pred             ccCceEECCCCCCE
Confidence            35668999999976


No 335
>PF02799 NMT_C:  Myristoyl-CoA:protein N-myristoyltransferase, C-terminal domain;  InterPro: IPR022677 Myristoyl-CoA:protein N-myristoyltransferase (2.3.1.97 from EC) (Nmt) [] is the enzyme responsible for transferring a myristate group on the N-terminal glycine of a number of cellular eukaryotics and viral proteins. Nmt is a monomeric protein of about 50 to 60kDa whose sequence appears to be well conserved.  The N and C-terminal domains of NMT are structurally similar, each adopting an acyl-CoA N-acyltransferase-like fold. This entry represents the C-terminal region.; GO: 0004379 glycylpeptide N-tetradecanoyltransferase activity; PDB: 2P6G_B 2P6F_F 2P6E_A 1IIC_A 1IID_A 2NMT_A 2WUU_A 1IYL_B 1NMT_B 1IYK_A ....
Probab=21.87  E-value=1.2e+02  Score=23.07  Aligned_cols=66  Identities=11%  Similarity=0.076  Sum_probs=38.5

Q ss_pred             ccCCCCCHHHHHHHHHHHHHhHHHHHHhCCCCCChhhhhhhhcC-CC-ceEEEEEeCCCCCeEEEEEEEee
Q psy838           47 KKSTDLGAETKKWVWELEEKNMKHSYEVCDIGWDPQGKHSEMFD-DR-ACYLVAKNGSSSTPVAFSHFRFD  115 (152)
Q Consensus        47 ~~~~~l~~~d~~~i~~l~~~~m~~~y~~~~~~w~~~~~~~el~~-~~-~~~~v~~~~~~~~ivGf~~~~~~  115 (152)
                      ....+++++|++.+..|..+.+.. |+- ..-.+++...--+.. ++ -..+|+.++ +++|-.|++|...
T Consensus        29 ~glR~m~~~Dv~~v~~Ll~~yl~~-f~l-~~~fs~eev~Hw~lp~~~Vv~syVve~~-~~~ITDf~SFY~L   96 (190)
T PF02799_consen   29 PGLRPMEEKDVPQVTKLLNKYLKK-FDL-APVFSEEEVKHWFLPRKNVVYSYVVEDP-DGKITDFFSFYSL   96 (190)
T ss_dssp             TTEEE--GGGHHHHHHHHHHHHTT-SSE-EEE--HHHHHHHHS-BTTTEEEEEEEET-TSEEEEEEEEEEE
T ss_pred             CccccCchhhHHHHHHHHHHHHHh-ccc-ccccCHHHHHhhcccCCCeEEEEEEecC-CCceeeEEEEeec
Confidence            336889999999999998875533 321 112455555444443 22 334455555 4799999999764


No 336
>smart00746 TRASH metallochaperone-like domain.
Probab=21.73  E-value=44  Score=16.33  Aligned_cols=7  Identities=86%  Similarity=2.124  Sum_probs=3.5

Q ss_pred             cCCCCcc
Q psy838           39 CSFCGKE   45 (152)
Q Consensus        39 c~~c~~~   45 (152)
                      ||.||..
T Consensus         1 c~~C~~~    7 (39)
T smart00746        1 CSFCGKD    7 (39)
T ss_pred             CCCCCCC
Confidence            4555544


No 337
>PF03376 Adeno_E3B:  Adenovirus E3B protein;  InterPro: IPR005041 Adenoviruses are medium-sized, non-enveloped viruses containing double-stranded DNA. They can cause a variety of diseases including pneumonia, cystitis, conjunctivitis and diarrhoea, all of which can be fatal to patients who are immunocompromised []. These viruses have many mechanisms to evade the host immune response, including several proteins which are expressed as part of the early transcription unit 3 (E3) []. One of the regions of E3, known as the E3B region, encodes three proteins known as 10.4K, 14.5K and 14.7K. Two of these proteins, 10.4K and 14.5K, form the RID complex (receptor internalisation and degradation) which protects the infected cell from host-induced lysis by clearing the the TNF and Fas receptors from the cell surface []. Other receptors, such as the epidermal growth factor receptor, are also known to be cleared by RID [].  This entry represents the E3B region 10.4K protein, also known as the RID alpha subunit.; GO: 0016020 membrane
Probab=21.73  E-value=40  Score=21.20  Aligned_cols=14  Identities=7%  Similarity=0.228  Sum_probs=10.6

Q ss_pred             eccccccCCHHHHH
Q psy838          130 LEKQVQRKGLGKFM  143 (152)
Q Consensus       130 V~p~~qgkGiG~~L  143 (152)
                      =+|+||++-|++-|
T Consensus        52 HhPqYrn~~iA~LL   65 (67)
T PF03376_consen   52 HHPQYRNQQIAALL   65 (67)
T ss_pred             cCchhcCHHHHHHh
Confidence            47888888887654


No 338
>PRK00758 GMP synthase subunit A; Validated
Probab=21.67  E-value=1.1e+02  Score=22.43  Aligned_cols=27  Identities=19%  Similarity=0.388  Sum_probs=23.5

Q ss_pred             EeeeeeccccccCCHHHHHHHHHHhhc
Q psy838          125 CYELQLEKQVQRKGLGKFMMQAFYTYI  151 (152)
Q Consensus       125 I~~i~V~p~~qgkGiG~~Lm~~l~~~a  151 (152)
                      ++.+|-|||+...--|.+|++.+.+.|
T Consensus       155 ~~g~QfHPE~~~~~~g~~l~~~f~~~~  181 (184)
T PRK00758        155 IYGVQFHPEVAHTEYGEEIFKNFLEIC  181 (184)
T ss_pred             EEEEEcCCccCCCchHHHHHHHHHHHH
Confidence            688999999987779999999988765


No 339
>TIGR03823 FliZ flagellar regulatory protein FliZ. FliZ is involved in the regulation of flagellar assembly and possibly also the down-regulation of the motile phenotype. FliZ interacts with the flagellar translational activator FlhCD complex.
Probab=21.66  E-value=60  Score=24.09  Aligned_cols=35  Identities=20%  Similarity=0.285  Sum_probs=28.8

Q ss_pred             hcccccccCCCCcc------cccCCCCCHHHHHHHHHHHHH
Q psy838           32 TQHAKYTCSFCGKE------YKKSTDLGAETKKWVWELEEK   66 (152)
Q Consensus        32 ~~~~~~~c~~c~~~------~~~~~~l~~~d~~~i~~l~~~   66 (152)
                      -.|+..-|..|+|.      +.....+..+.+..+=.++++
T Consensus        18 fKhsQthCshC~K~LDRisLV~~gqiinK~~Ia~md~liDd   58 (168)
T TIGR03823        18 FKHSQTHCSHCHKLLDRISLVFRGQIINKESISRMDTLIDD   58 (168)
T ss_pred             hccccchhhhhcchhhheeeeecceeecHHHHHHHhhhccH
Confidence            34567789999999      778899999999888887765


No 340
>PRK05978 hypothetical protein; Provisional
Probab=21.59  E-value=41  Score=24.59  Aligned_cols=11  Identities=27%  Similarity=0.736  Sum_probs=8.9

Q ss_pred             cccccCCCCcc
Q psy838           35 AKYTCSFCGKE   45 (152)
Q Consensus        35 ~~~~c~~c~~~   45 (152)
                      ..-+||.||+-
T Consensus        32 l~grCP~CG~G   42 (148)
T PRK05978         32 FRGRCPACGEG   42 (148)
T ss_pred             HcCcCCCCCCC
Confidence            46789999987


No 341
>cd00472 Ribosomal_L24e_L24 Ribosomal protein L24e/L24 is a ribosomal protein found in eukaryotes (L24) and in archaea (L24e, distinct from archaeal L24). L24e/L24 is located on the surface of the large subunit, adjacent to proteins L14 and L3, and near the translation factor binding site.  L24e/L24 appears to play a role in the kinetics of peptide synthesis, and may be involved in interactions between the large and small subunits, either directly or through other factors. In mouse, a deletion mutation in L24 has been identified as the cause for the belly spot and tail (Bst) mutation that results in disrupted pigmentation, somitogenesis and retinal cell fate determination.  L24 may be an important protein in eukaryotic reproduction:  in shrimp, L24 expression is elevated in the ovary, suggesting a role in oogenesis, and in Arabidopsis, L24 has been proposed to have a specific function in gynoecium development. No protein with sequence or structural homology to L24e/L24 has been identifi
Probab=21.54  E-value=46  Score=20.05  Aligned_cols=14  Identities=36%  Similarity=0.771  Sum_probs=9.5

Q ss_pred             ccccCCCCcccccC
Q psy838           36 KYTCSFCGKEYKKS   49 (152)
Q Consensus        36 ~~~c~~c~~~~~~~   49 (152)
                      ...|.|||..+.+.
T Consensus         3 ~~~C~f~g~~I~PG   16 (54)
T cd00472           3 TEKCSFCGYKIYPG   16 (54)
T ss_pred             EEEecCcCCeecCC
Confidence            35788888875444


No 342
>TIGR03831 YgiT_finger YgiT-type zinc finger domain. This domain model describes a small domain with two copies of a putative zinc-binding motif CXXC (usually CXXCG). Most member proteins consist largely of this domain or else carry an additional C-terminal helix-turn-helix domain, resembling that of the phage protein Cro and modeled by pfam01381.
Probab=21.35  E-value=50  Score=18.16  Aligned_cols=18  Identities=33%  Similarity=0.757  Sum_probs=11.6

Q ss_pred             hhcccccccCCCCccccc
Q psy838           31 ITQHAKYTCSFCGKEYKK   48 (152)
Q Consensus        31 ~~~~~~~~c~~c~~~~~~   48 (152)
                      +..-.-+.|+.||..+.+
T Consensus        27 i~~vp~~~C~~CGE~~~~   44 (46)
T TIGR03831        27 IENVPALVCPQCGEEYLD   44 (46)
T ss_pred             EeCCCccccccCCCEeeC
Confidence            334445679999976544


No 343
>PF06054 CoiA:  Competence protein CoiA-like family;  InterPro: IPR010330 Competence is the ability of a cell to take up exogenous DNA from its environment, resulting in transformation. It is widespread among bacteria and is probably an important mechanism for the horizontal transfer of genes. Cells that take up DNA inevitably acquire the nucleotides the DNA consists of, and, because nucleotides are needed for DNA and RNA synthesis and are expensive to synthesise, these may make a significant contribution to the cell's energy budget []. The lateral gene transfer caused by competence also contributes to the genetic diversity that makes evolution possible.  DNA usually becomes available by the death and lysis of other cells. Competent bacteria use components of extracellular filaments called type 4 pili to create pores in their membranes and pull DNA through the pores into the cytoplasm. This process, including the development of competence and the expression of the uptake machinery, is regulated in response to cell-cell signalling and/or nutritional conditions []. Many of the members of this family are described as transcription factors. CoiA falls within a competence-specific operon in Streptococcus. CoiA is an uncharacterised protein.
Probab=21.07  E-value=45  Score=27.93  Aligned_cols=12  Identities=42%  Similarity=0.982  Sum_probs=11.0

Q ss_pred             ccccccCCCCcc
Q psy838           34 HAKYTCSFCGKE   45 (152)
Q Consensus        34 ~~~~~c~~c~~~   45 (152)
                      ...|.||.||.+
T Consensus        28 ~~~~~CP~C~~~   39 (375)
T PF06054_consen   28 KGKYFCPGCGEP   39 (375)
T ss_pred             CCcEECCCCCCe
Confidence            779999999988


No 344
>smart00661 RPOL9 RNA polymerase subunit 9.
Probab=21.03  E-value=44  Score=19.12  Aligned_cols=11  Identities=36%  Similarity=1.053  Sum_probs=9.2

Q ss_pred             cccccCCCCcc
Q psy838           35 AKYTCSFCGKE   45 (152)
Q Consensus        35 ~~~~c~~c~~~   45 (152)
                      ..+.||.||..
T Consensus        19 ~~~vC~~Cg~~   29 (52)
T smart00661       19 RRFVCRKCGYE   29 (52)
T ss_pred             CEEECCcCCCe
Confidence            47899999976


No 345
>PF10083 DUF2321:  Uncharacterized protein conserved in bacteria (DUF2321);  InterPro: IPR016891 This entry is represented by Bacteriophage 'Lactobacillus prophage Lj928', Orf-Ljo1454. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=20.99  E-value=72  Score=23.60  Aligned_cols=29  Identities=31%  Similarity=0.350  Sum_probs=19.1

Q ss_pred             cccccccCCCCcccccCCCCCHHHHHHHHHHHH
Q psy838           33 QHAKYTCSFCGKEYKKSTDLGAETKKWVWELEE   65 (152)
Q Consensus        33 ~~~~~~c~~c~~~~~~~~~l~~~d~~~i~~l~~   65 (152)
                      .+++..|..||+++.|    +..-++..-+|.+
T Consensus        65 ~~~PsYC~~CGkpyPW----t~~~L~aa~el~e   93 (158)
T PF10083_consen   65 YEAPSYCHNCGKPYPW----TENALEAANELIE   93 (158)
T ss_pred             CCCChhHHhCCCCCch----HHHHHHHHHHHHH
Confidence            3478889999998755    4445555555544


No 346
>PF05265 DUF723:  Protein of unknown function (DUF723);  InterPro: IPR007929 This family contains several uncharacterised proteins from Neisseria meningitidis. These proteins may have a role in DNA binding.
Probab=20.74  E-value=46  Score=20.60  Aligned_cols=9  Identities=44%  Similarity=1.003  Sum_probs=7.0

Q ss_pred             cccccCCCC
Q psy838           35 AKYTCSFCG   43 (152)
Q Consensus        35 ~~~~c~~c~   43 (152)
                      +++=||.||
T Consensus        52 sk~GCP~Cg   60 (60)
T PF05265_consen   52 SKHGCPECG   60 (60)
T ss_pred             hccCCCCCC
Confidence            456799997


No 347
>TIGR02491 NrdG anaerobic ribonucleoside-triphosphate reductase activating protein. This enzyme is a member of the radical-SAM family (pfam04055) and utilizes S-adenosyl methionine, an iron-sulfur cluster and a reductant (dihydroflavodoxin ) to produce a glycine-centered radical in the class III (anaerobic) ribonucleotide triphosphate reductase (NrdD, TIGR02487). The two components form an alpha-2/beta-2 heterodimer.
Probab=20.69  E-value=1.3e+02  Score=21.49  Aligned_cols=33  Identities=12%  Similarity=0.104  Sum_probs=22.6

Q ss_pred             ccccccCCCCcccc----cCCCCCHHHHHHHHHHHHH
Q psy838           34 HAKYTCSFCGKEYK----KSTDLGAETKKWVWELEEK   66 (152)
Q Consensus        34 ~~~~~c~~c~~~~~----~~~~l~~~d~~~i~~l~~~   66 (152)
                      -=++.|+.|-.+..    ....++.++.+.+++....
T Consensus        24 gCnl~C~~C~n~~~~~~~~g~~~~~~~~~~i~~~l~~   60 (154)
T TIGR02491        24 GCKHHCEGCFNKETWNFNGGKEFTEALEKEIIRDLND   60 (154)
T ss_pred             CCCCCCcCCCcccccCCCCCCcCCHHHHHHHHHHHHh
Confidence            34678999987632    3467887778887765543


No 348
>PHA00733 hypothetical protein
Probab=20.65  E-value=58  Score=22.99  Aligned_cols=22  Identities=23%  Similarity=0.538  Sum_probs=15.6

Q ss_pred             hhcccccccCCCCcccccCCCC
Q psy838           31 ITQHAKYTCSFCGKEYKKSTDL   52 (152)
Q Consensus        31 ~~~~~~~~c~~c~~~~~~~~~l   52 (152)
                      .....+|.|+.||+.+.+...+
T Consensus        68 ~~~~kPy~C~~Cgk~Fss~s~L   89 (128)
T PHA00733         68 SKAVSPYVCPLCLMPFSSSVSL   89 (128)
T ss_pred             cCCCCCccCCCCCCcCCCHHHH
Confidence            3345689999999986655444


No 349
>COG3058 FdhE Uncharacterized protein involved in formate dehydrogenase formation [Posttranslational modification, protein turnover, chaperones]
Probab=20.65  E-value=48  Score=26.97  Aligned_cols=18  Identities=17%  Similarity=0.429  Sum_probs=12.2

Q ss_pred             hhhhhccccc----ccCCCCcc
Q psy838           28 KMEITQHAKY----TCSFCGKE   45 (152)
Q Consensus        28 ~~~~~~~~~~----~c~~c~~~   45 (152)
                      .|+.+.+.++    .||.||..
T Consensus       173 ~i~~~~~~e~e~~~~CPvCGS~  194 (308)
T COG3058         173 GIPGKARVENESRQYCPVCGSM  194 (308)
T ss_pred             cCCccccccccccccCCCcCCC
Confidence            3444444444    99999988


No 350
>PRK00432 30S ribosomal protein S27ae; Validated
Probab=20.60  E-value=46  Score=19.62  Aligned_cols=9  Identities=33%  Similarity=0.866  Sum_probs=7.3

Q ss_pred             ccccCCCCc
Q psy838           36 KYTCSFCGK   44 (152)
Q Consensus        36 ~~~c~~c~~   44 (152)
                      .-.||.||.
T Consensus        20 ~~fCP~Cg~   28 (50)
T PRK00432         20 NKFCPRCGS   28 (50)
T ss_pred             cCcCcCCCc
Confidence            448999997


No 351
>PF08273 Prim_Zn_Ribbon:  Zinc-binding domain of primase-helicase;  InterPro: IPR013237 This entry is represented by bacteriophage T7 Gp4. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This entry represents a zinc binding domain found in the N-terminal region of the bacteriophage T7 Gp4 and P4 alpha protein. P4 is a multifunctional protein with origin recognition, helicase and primase activities [, , ].; GO: 0003896 DNA primase activity, 0004386 helicase activity, 0008270 zinc ion binding; PDB: 1NUI_B.
Probab=20.55  E-value=44  Score=18.86  Aligned_cols=9  Identities=33%  Similarity=1.006  Sum_probs=4.4

Q ss_pred             cccCCCCcc
Q psy838           37 YTCSFCGKE   45 (152)
Q Consensus        37 ~~c~~c~~~   45 (152)
                      -.||.||-.
T Consensus         4 ~pCP~CGG~   12 (40)
T PF08273_consen    4 GPCPICGGK   12 (40)
T ss_dssp             E--TTTT-T
T ss_pred             CCCCCCcCc
Confidence            358888865


No 352
>PF07282 OrfB_Zn_ribbon:  Putative transposase DNA-binding domain;  InterPro: IPR010095 This entry represents a region of a sequence similarity between a family of putative transposases of Thermoanaerobacter tengcongensis, smaller related proteins from Bacillus anthracis, putative transposes described by IPR001959 from INTERPRO, and other proteins. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=20.42  E-value=45  Score=20.41  Aligned_cols=15  Identities=33%  Similarity=0.638  Sum_probs=11.2

Q ss_pred             hhcccccccCCCCcc
Q psy838           31 ITQHAKYTCSFCGKE   45 (152)
Q Consensus        31 ~~~~~~~~c~~c~~~   45 (152)
                      ......+.||.||..
T Consensus        41 ~~~~r~~~C~~Cg~~   55 (69)
T PF07282_consen   41 RRSGRVFTCPNCGFE   55 (69)
T ss_pred             ccccceEEcCCCCCE
Confidence            345668999999854


No 353
>cd04641 CBS_pair_28 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria.  The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair.  The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here.  It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener
Probab=20.33  E-value=1.6e+02  Score=19.13  Aligned_cols=27  Identities=11%  Similarity=0.100  Sum_probs=16.9

Q ss_pred             hhhhcCCCceEEEEEeCCCCCeEEEEEE
Q psy838           85 HSEMFDDRACYLVAKNGSSSTPVAFSHF  112 (152)
Q Consensus        85 ~~el~~~~~~~~v~~~~~~~~ivGf~~~  112 (152)
                      ...|.+....++.+.++ +++++|++..
T Consensus        88 ~~~m~~~~~~~l~Vvd~-~~~~~Givt~  114 (120)
T cd04641          88 FDLIVKARVHRLVVVDE-NKRVEGIISL  114 (120)
T ss_pred             HHHHHhcCccEEEEECC-CCCEEEEEEH
Confidence            44455555555555544 4789999874


No 354
>KOG3029|consensus
Probab=20.21  E-value=21  Score=29.18  Aligned_cols=16  Identities=31%  Similarity=0.603  Sum_probs=11.2

Q ss_pred             hhhcccccccCCCCcc
Q psy838           30 EITQHAKYTCSFCGKE   45 (152)
Q Consensus        30 ~~~~~~~~~c~~c~~~   45 (152)
                      ++...+--+||||-|.
T Consensus        90 ~l~LyQyetCPFCcKV  105 (370)
T KOG3029|consen   90 DLVLYQYETCPFCCKV  105 (370)
T ss_pred             eEEEEeeccCchHHHH
Confidence            4444555689999987


No 355
>PF02150 RNA_POL_M_15KD:  RNA polymerases M/15 Kd subunit;  InterPro: IPR001529 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length []. The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise:  RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors.  RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs.   Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kDa, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. In archaebacteria, there is generally a single form of RNA polymerase which also consist of an oligomeric assemblage of 10 to 13 polypeptides. It has recently been shown [], [] that small subunits of about 15 kDa, found in polymerase types I and II, are highly conserved. These proteins contain a probable zinc finger in their N-terminal region and a C-terminal zinc ribbon domain (see IPR001222 from INTERPRO).; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 3H0G_I 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I ....
Probab=20.04  E-value=46  Score=18.01  Aligned_cols=8  Identities=38%  Similarity=0.912  Sum_probs=6.2

Q ss_pred             ccCCCCcc
Q psy838           38 TCSFCGKE   45 (152)
Q Consensus        38 ~c~~c~~~   45 (152)
                      .||.||..
T Consensus         3 FCp~C~nl   10 (35)
T PF02150_consen    3 FCPECGNL   10 (35)
T ss_dssp             BETTTTSB
T ss_pred             eCCCCCcc
Confidence            68888876


No 356
>PHA01733 hypothetical protein
Probab=20.03  E-value=3.7e+02  Score=19.82  Aligned_cols=59  Identities=12%  Similarity=0.076  Sum_probs=33.4

Q ss_pred             CCCCHHHHHHHHH-HHHHhHHHHHHhCCCCCChhhhhhhhcCCCce-EEEEEeCCCCCeEEEEEEEe
Q psy838           50 TDLGAETKKWVWE-LEEKNMKHSYEVCDIGWDPQGKHSEMFDDRAC-YLVAKNGSSSTPVAFSHFRF  114 (152)
Q Consensus        50 ~~l~~~d~~~i~~-l~~~~m~~~y~~~~~~w~~~~~~~el~~~~~~-~~v~~~~~~~~ivGf~~~~~  114 (152)
                      .+.|.+|+..+++ +..+...++.   ..+-++..+..-+...... +.++ -+  |++++......
T Consensus         6 rpaT~~d~~~l~~n~r~~Dr~E~e---alg~~p~~l~~~~~~s~~~v~~~~-~n--G~l~aI~Gv~~   66 (153)
T PHA01733          6 RPATQADATEVAQNLRQEDREEIE---GLGHSPLALHLSLDVSENVVAFVA-PD--GSLAGVAGLVE   66 (153)
T ss_pred             ccccHHHHHHHHccCCHHHHHHHH---HhCCCcccchhhhhccccceEEEe-cC--CcEEEEecccc
Confidence            4678888887777 6655554543   2222554333333333333 3333 33  89999888765


No 357
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=20.02  E-value=46  Score=19.17  Aligned_cols=10  Identities=40%  Similarity=1.285  Sum_probs=7.4

Q ss_pred             ccccCCCCcc
Q psy838           36 KYTCSFCGKE   45 (152)
Q Consensus        36 ~~~c~~c~~~   45 (152)
                      .|.|+.||..
T Consensus         5 ey~C~~Cg~~   14 (52)
T TIGR02605         5 EYRCTACGHR   14 (52)
T ss_pred             EEEeCCCCCE
Confidence            4778888875


Done!