Query         psy3109
Match_columns 58
No_of_seqs    117 out of 723
Neff          5.9 
Searched_HMMs 46136
Date          Fri Aug 16 16:41:11 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy3109.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3109hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG2275|consensus               99.3 3.7E-12   8E-17   91.1   5.8   49    1-50    130-178 (420)
  2 PRK12891 allantoate amidohydro  98.9 3.6E-09 7.7E-14   73.5   4.8   49    1-51     96-144 (414)
  3 PF01546 Peptidase_M20:  Peptid  98.9 1.6E-08 3.4E-13   62.1   7.0   44    1-45     37-80  (189)
  4 TIGR03176 AllC allantoate amid  98.8   4E-09 8.6E-14   73.7   4.4   49    2-52     90-138 (406)
  5 PRK07205 hypothetical protein;  98.8   8E-09 1.7E-13   72.2   5.8   45    1-46    117-161 (444)
  6 PRK07338 hypothetical protein;  98.8 1.7E-08 3.6E-13   69.3   6.1   46    1-47    130-175 (402)
  7 PRK07906 hypothetical protein;  98.8 1.7E-08 3.7E-13   69.9   5.9   46    1-47    106-151 (426)
  8 PRK13983 diaminopimelate amino  98.8 1.4E-08 3.1E-13   68.9   5.4   47    1-48    118-164 (400)
  9 PRK13013 succinyl-diaminopimel  98.8 1.5E-08 3.3E-13   69.8   5.4   46    1-47    124-169 (427)
 10 PRK07318 dipeptidase PepV; Rev  98.7 2.5E-08 5.4E-13   70.3   5.9   43    1-45    119-161 (466)
 11 PRK08588 succinyl-diaminopimel  98.7 3.2E-08 6.9E-13   67.3   5.4   39    1-39    101-139 (377)
 12 PRK08262 hypothetical protein;  98.7   5E-08 1.1E-12   68.9   6.3   39    1-39    155-193 (486)
 13 TIGR01880 Ac-peptdase-euk N-ac  98.7 4.9E-08 1.1E-12   67.0   6.1   45    1-45    113-157 (400)
 14 PRK12890 allantoate amidohydro  98.7 3.5E-08 7.5E-13   68.3   4.9   48    1-50     94-141 (414)
 15 PRK13009 succinyl-diaminopimel  98.7 7.9E-08 1.7E-12   65.1   6.2   45    1-46    100-144 (375)
 16 PRK09104 hypothetical protein;  98.7 6.6E-08 1.4E-12   67.9   5.9   47    1-48    129-175 (464)
 17 TIGR01246 dapE_proteo succinyl  98.6 8.3E-08 1.8E-12   65.1   6.0   47    1-48     97-143 (370)
 18 TIGR01887 dipeptidaselike dipe  98.6 7.7E-08 1.7E-12   68.2   5.9   45    1-46    107-151 (447)
 19 PRK07473 carboxypeptidase; Pro  98.6   1E-07 2.2E-12   65.7   6.2   45    1-46    113-157 (376)
 20 PRK13590 putative bifunctional  98.6 5.1E-08 1.1E-12   71.3   4.7   49    1-51    271-319 (591)
 21 PRK09133 hypothetical protein;  98.6 8.4E-08 1.8E-12   67.5   5.6   45    1-46    142-186 (472)
 22 PRK08596 acetylornithine deace  98.6 9.9E-08 2.1E-12   66.4   5.5   39    1-39    119-157 (421)
 23 PRK06156 hypothetical protein;  98.6 1.3E-07 2.9E-12   67.9   6.2   45    1-46    155-199 (520)
 24 COG0624 ArgE Acetylornithine d  98.6   1E-07 2.2E-12   65.7   5.4   48    1-49    117-164 (409)
 25 PRK08201 hypothetical protein;  98.6 1.3E-07 2.8E-12   66.2   5.8   46    1-47    121-166 (456)
 26 TIGR01910 DapE-ArgE acetylorni  98.6 1.3E-07 2.9E-12   64.3   5.6   45    1-46    106-150 (375)
 27 PRK09290 allantoate amidohydro  98.6 1.2E-07 2.5E-12   65.8   5.1   37    1-37     93-129 (413)
 28 PRK12893 allantoate amidohydro  98.6 1.1E-07 2.4E-12   65.5   4.8   37    1-37     96-132 (412)
 29 TIGR01886 dipeptidase dipeptid  98.5 2.2E-07 4.9E-12   65.9   5.6   39    1-39    118-156 (466)
 30 PRK06915 acetylornithine deace  98.5 2.5E-07 5.4E-12   63.9   5.7   39    1-39    135-173 (422)
 31 PRK06446 hypothetical protein;  98.5 2.7E-07 5.8E-12   64.5   5.9   45    1-47    104-148 (436)
 32 PRK06133 glutamate carboxypept  98.5 3.2E-07 6.9E-12   63.8   6.1   39    1-39    137-175 (410)
 33 PRK07079 hypothetical protein;  98.5 2.8E-07   6E-12   65.0   5.8   39    1-39    128-167 (469)
 34 PRK04443 acetyl-lysine deacety  98.5 1.9E-07 4.1E-12   63.4   4.8   44    1-48     92-135 (348)
 35 TIGR01883 PepT-like peptidase   98.5 2.4E-07 5.2E-12   62.6   5.2   38    1-39    101-138 (361)
 36 PRK13799 unknown domain/N-carb  98.5 1.6E-07 3.6E-12   68.7   4.6   48    3-52    273-320 (591)
 37 PRK06837 acetylornithine deace  98.5 3.5E-07 7.7E-12   63.7   5.6   39    1-39    139-177 (427)
 38 TIGR01879 hydantase amidase, h  98.4 3.2E-07   7E-12   63.4   4.6   48    2-51     88-135 (401)
 39 PRK05469 peptidase T; Provisio  98.4 4.3E-07 9.3E-12   62.8   4.9   37    1-37    141-177 (408)
 40 TIGR01882 peptidase-T peptidas  98.4 4.9E-07 1.1E-11   62.9   5.1   37    1-37    143-179 (410)
 41 PRK08554 peptidase; Reviewed    98.4 6.6E-07 1.4E-11   63.2   5.6   41    1-44    104-144 (438)
 42 PRK13004 peptidase; Reviewed    98.4 9.2E-07   2E-11   61.0   6.0   38    1-38    111-148 (399)
 43 PRK12892 allantoate amidohydro  98.4 5.9E-07 1.3E-11   61.8   4.9   37    1-37     94-130 (412)
 44 PRK08652 acetylornithine deace  98.4 7.4E-07 1.6E-11   59.6   5.1   37    1-39     88-124 (347)
 45 TIGR01900 dapE-gram_pos succin  98.4 8.2E-07 1.8E-11   61.1   5.3   45    1-45    104-150 (373)
 46 PRK08651 succinyl-diaminopimel  98.3 1.1E-06 2.5E-11   59.9   5.1   36    1-39    115-150 (394)
 47 PRK13381 peptidase T; Provisio  98.3 1.3E-06 2.9E-11   60.3   5.4   38    1-39    139-176 (404)
 48 PRK07522 acetylornithine deace  98.3 1.3E-06 2.9E-11   59.3   5.3   43    1-46    105-147 (385)
 49 TIGR01892 AcOrn-deacetyl acety  98.3 1.5E-06 3.3E-11   58.4   5.3   37    1-39     99-135 (364)
 50 TIGR03106 trio_M42_hydro hydro  98.3 1.5E-06 3.2E-11   60.6   5.2   38    1-38    184-221 (343)
 51 PRK07907 hypothetical protein;  98.2 2.5E-06 5.4E-11   59.8   5.5   42    1-46    125-166 (449)
 52 PRK05111 acetylornithine deace  98.2 2.3E-06 4.9E-11   58.2   5.1   37    1-39    112-148 (383)
 53 PRK13007 succinyl-diaminopimel  98.2 2.7E-06 5.8E-11   57.2   4.9   41    1-45     96-136 (352)
 54 PRK00466 acetyl-lysine deacety  98.2 3.2E-06 6.8E-11   57.3   5.1   35    1-39     93-127 (346)
 55 TIGR03320 ygeY M20/DapE family  98.2 2.9E-06 6.3E-11   58.3   4.6   38    1-38    109-146 (395)
 56 PRK10199 alkaline phosphatase   98.2 5.4E-06 1.2E-10   58.4   5.7   37    1-39    144-180 (346)
 57 TIGR01902 dapE-lys-deAc N-acet  98.1 4.8E-06   1E-10   56.1   4.8   38    1-44     83-120 (336)
 58 TIGR03526 selenium_YgeY putati  98.0 1.8E-05 3.9E-10   54.5   5.6   37    1-37    109-145 (395)
 59 PF04389 Peptidase_M28:  Peptid  98.0 1.9E-05 4.2E-10   48.8   5.1   43    3-46     29-71  (179)
 60 PLN02693 IAA-amino acid hydrol  97.9 3.3E-05 7.2E-10   54.8   5.3   36    1-36    137-172 (437)
 61 PRK08737 acetylornithine deace  97.9 2.6E-05 5.7E-10   53.8   4.5   36    1-44    103-138 (364)
 62 TIGR01891 amidohydrolases amid  97.7 7.1E-05 1.5E-09   51.0   5.1   36    2-37     92-127 (363)
 63 TIGR01893 aa-his-dipept aminoa  97.7 5.2E-05 1.1E-09   53.8   4.1   35    1-39    110-144 (477)
 64 PRK15026 aminoacyl-histidine d  97.6 0.00013 2.7E-09   52.7   4.8   35    1-39    116-150 (485)
 65 PLN02280 IAA-amino acid hydrol  97.5 0.00022 4.7E-09   51.5   4.8   37    1-37    187-223 (478)
 66 PRK09961 exoaminopeptidase; Pr  96.9  0.0025 5.5E-08   44.3   5.4   43    1-47    167-209 (344)
 67 KOG2195|consensus               96.6  0.0078 1.7E-07   46.0   6.1   39    8-47    381-419 (702)
 68 TIGR03107 glu_aminopep glutamy  96.6  0.0058 1.3E-07   42.9   5.0   37    1-39    179-215 (350)
 69 PF05343 Peptidase_M42:  M42 gl  96.4   0.015 3.2E-07   39.9   6.2   41    2-46    136-176 (292)
 70 COG1363 FrvX Cellulase M and r  96.2  0.0076 1.6E-07   42.8   4.0   37    1-39    181-217 (355)
 71 COG2234 Iap Predicted aminopep  96.1   0.012 2.6E-07   40.9   4.6   35    3-39    231-265 (435)
 72 PRK09864 putative peptidase; P  96.1   0.013 2.8E-07   41.4   4.8   35    1-39    176-210 (356)
 73 COG2195 PepD Di- and tripeptid  95.7   0.014 3.1E-07   42.1   3.7   46    1-48    145-191 (414)
 74 COG4187 RocB Arginine degradat  95.7   0.026 5.6E-07   42.0   4.9   38    1-39    141-178 (553)
 75 KOG2276|consensus               93.7    0.09   2E-06   38.7   3.5   39    1-39    133-171 (473)
 76 PRK02256 putative aminopeptida  93.6    0.14 3.1E-06   37.4   4.5   34    2-39    262-295 (462)
 77 COG1473 AbgB Metal-dependent a  90.3    0.39 8.4E-06   34.4   3.4   46    5-52    108-153 (392)
 78 PF05450 Nicastrin:  Nicastrin;  89.7    0.87 1.9E-05   30.5   4.5   34    3-36     27-63  (234)
 79 PF02127 Peptidase_M18:  Aminop  89.0    0.68 1.5E-05   33.6   3.8   35    5-39    237-275 (432)
 80 PRK02813 putative aminopeptida  88.8    0.58 1.3E-05   33.8   3.4   33    2-39    236-268 (428)
 81 KOG2194|consensus               85.5     1.1 2.5E-05   35.2   3.5   44    3-47    166-209 (834)
 82 PTZ00371 aspartyl aminopeptida  85.1     1.2 2.6E-05   32.5   3.3   38    2-39    253-291 (465)
 83 PF06309 Torsin:  Torsin;  Inte  72.6      20 0.00042   22.3   5.5   49    7-57     36-84  (127)
 84 COG1362 LAP4 Aspartyl aminopep  72.0     5.1 0.00011   29.6   3.1   33    6-39    244-276 (437)
 85 PF06877 RraB:  Regulator of ri  61.8      23  0.0005   20.1   4.0   29    7-35      5-35  (104)
 86 COG4882 Predicted aminopeptida  61.6     9.1  0.0002   28.4   2.7   45    4-52    211-257 (486)
 87 PF12000 Glyco_trans_4_3:  Gkyc  56.7      17 0.00037   23.3   3.1   22    6-29     51-72  (171)
 88 PF14136 DUF4303:  Domain of un  50.5      47   0.001   20.0   4.3   20    4-23    104-123 (155)
 89 PF11181 YflT:  Heat induced st  50.4      28  0.0006   20.0   3.1   26    7-33     11-36  (103)
 90 KOG0615|consensus               48.8      37  0.0008   25.5   4.1   33    2-35    279-314 (475)
 91 PF13812 PPR_3:  Pentatricopept  47.3      26 0.00057   15.1   2.2   16    7-22     19-34  (34)
 92 COG4821 Uncharacterized protei  43.6      39 0.00084   23.2   3.3   25    7-33    199-223 (243)
 93 PF04055 Radical_SAM:  Radical   43.3      52  0.0011   18.6   3.5   30    5-35    126-155 (166)
 94 PF04114 Gaa1:  Gaa1-like, GPI   42.4      33 0.00072   25.4   3.1   33    3-36     37-69  (504)
 95 PF14681 UPRTase:  Uracil phosp  39.8      19  0.0004   23.3   1.4   30    6-35    135-164 (207)
 96 PF05237 MoeZ_MoeB:  MoeZ/MoeB   39.5      46   0.001   18.4   2.8   21    4-24     34-54  (84)
 97 PF08415 NRPS:  Nonribosomal pe  39.0      48   0.001   17.1   2.7   27    6-32      4-30  (58)
 98 COG0035 Upp Uracil phosphoribo  38.4      42 0.00091   22.6   2.9   30    5-35    137-166 (210)
 99 cd06287 PBP1_LacI_like_8 Ligan  38.0      69  0.0015   20.6   3.8   30    6-35    188-217 (269)
100 COG4703 Uncharacterized protei  37.8      80  0.0017   18.0   4.0   35    2-36      1-45  (74)
101 PF13377 Peripla_BP_3:  Peripla  37.3      64  0.0014   18.6   3.3   28    6-33     78-105 (160)
102 PF04028 DUF374:  Domain of unk  36.3      78  0.0017   17.6   3.4   25    2-33     43-67  (74)
103 cd06301 PBP1_rhizopine_binding  35.7      66  0.0014   20.1   3.4   27    7-34    194-220 (272)
104 PRK05943 50S ribosomal protein  35.2      60  0.0013   18.5   2.9   21   11-35     16-36  (94)
105 COG2875 CobM Precorrin-4 methy  34.9      79  0.0017   22.0   3.8   45    8-54    171-218 (254)
106 TIGR01481 ccpA catabolite cont  34.7      87  0.0019   20.4   3.9   30    6-35    246-275 (329)
107 PRK11191 RNase E inhibitor pro  34.1 1.2E+02  0.0026   19.0   6.1   31    5-35     11-43  (138)
108 cd06276 PBP1_FucR_like Ligand-  33.5 1.1E+02  0.0023   19.5   4.1   31    6-36    172-202 (247)
109 KOG2170|consensus               33.3 1.6E+02  0.0034   21.4   5.2   48    8-57     94-141 (344)
110 cd06272 PBP1_hexuronate_repres  32.5      93   0.002   19.3   3.7   28    6-33    182-209 (261)
111 PF07708 Tash_PEST:  Tash prote  30.8      48   0.001   14.3   1.5   12   24-35      6-17  (19)
112 PF04023 FeoA:  FeoA domain;  I  30.8      87  0.0019   16.3   3.0   21   10-30     23-43  (74)
113 cd01541 PBP1_AraR Ligand-bindi  30.4 1.2E+02  0.0026   19.0   4.0   29    7-35    194-222 (273)
114 cd01542 PBP1_TreR_like Ligand-  30.1 1.2E+02  0.0026   18.7   3.8   30    6-35    183-212 (259)
115 TIGR03642 cas_csx13 CRISPR-ass  29.2      93   0.002   19.1   3.1   28    8-36      7-34  (124)
116 PF08967 DUF1884:  Domain of un  29.0      71  0.0015   18.7   2.5   15    9-23     15-29  (85)
117 PF02780 Transketolase_C:  Tran  28.2      55  0.0012   18.9   2.0   32    4-35     20-52  (124)
118 PF02070 NMU:  Neuromedin U;  I  28.1      20 0.00044   16.5   0.1   13   30-43      1-13  (25)
119 cd06298 PBP1_CcpA_like Ligand-  28.0 1.1E+02  0.0024   18.9   3.4   29    7-35    188-216 (268)
120 KOG1067|consensus               27.8      51  0.0011   25.9   2.1   45    5-51    253-297 (760)
121 PF13738 Pyr_redox_3:  Pyridine  27.7 1.3E+02  0.0028   18.1   3.6   31    6-40      7-37  (203)
122 cd06285 PBP1_LacI_like_7 Ligan  27.7 1.3E+02  0.0028   18.8   3.7   27    7-33    186-212 (265)
123 PRK11303 DNA-binding transcrip  26.9 1.3E+02  0.0029   19.5   3.8   29    7-35    249-277 (328)
124 cd03036 ArsC_like Arsenate Red  26.7      77  0.0017   18.3   2.4   19   10-28     79-97  (111)
125 PRK10727 DNA-binding transcrip  26.6 1.3E+02  0.0029   19.8   3.8   29    6-34    247-275 (343)
126 cd00495 Ribosomal_L25_TL5_CTC   26.4 1.1E+02  0.0023   17.2   2.9   21   11-35     15-35  (91)
127 cd06280 PBP1_LacI_like_4 Ligan  26.1 1.7E+02  0.0037   18.2   4.1   28    6-33    182-209 (263)
128 COG1057 NadD Nicotinic acid mo  25.9      82  0.0018   20.6   2.6   26    8-34     86-111 (197)
129 PF11051 Mannosyl_trans3:  Mann  25.6 1.4E+02   0.003   20.0   3.8   29    6-36     14-42  (271)
130 PRK09526 lacI lac repressor; R  24.4 1.9E+02  0.0041   18.9   4.2   28    6-33    250-277 (342)
131 PRK14987 gluconate operon tran  23.9 1.7E+02  0.0037   19.1   3.9   30    6-35    249-278 (331)
132 PF00743 FMO-like:  Flavin-bind  23.4      89  0.0019   23.2   2.7   32    4-40      9-40  (531)
133 PF13454 NAD_binding_9:  FAD-NA  23.2 1.4E+02  0.0031   17.9   3.2   33    4-37      5-37  (156)
134 smart00493 TOPRIM topoisomeras  22.8 1.3E+02  0.0027   15.5   3.2   22   12-35     38-59  (76)
135 PRK10745 trkD potassium transp  22.8 1.7E+02  0.0038   22.7   4.2   34    3-38    303-336 (622)
136 PF13592 HTH_33:  Winged helix-  22.7      31 0.00067   18.0   0.1   24   12-35     28-51  (60)
137 TIGR00731 ctc_TL5 ribosomal pr  22.6 1.2E+02  0.0026   19.3   2.9   23   10-36     12-34  (176)
138 KOG1164|consensus               22.5   1E+02  0.0022   20.7   2.6   19    4-22    127-145 (322)
139 PF10850 DUF2653:  Protein of u  22.3 1.5E+02  0.0033   17.5   3.1   29    6-35     57-88  (91)
140 cd01544 PBP1_GalR Ligand-bindi  22.1 2.2E+02  0.0047   17.9   4.1   28    6-33    189-216 (270)
141 cd01543 PBP1_XylR Ligand-bindi  21.5 2.2E+02  0.0047   17.8   4.0   30    6-35    181-210 (265)
142 cd01575 PBP1_GntR Ligand-bindi  21.2   2E+02  0.0044   17.7   3.7   25    7-31    188-212 (268)
143 PF13662 Toprim_4:  Toprim doma  21.2      81  0.0018   16.9   1.7   29   23-54     46-74  (81)
144 PF10587 EF-1_beta_acid:  Eukar  21.0      32 0.00069   16.2  -0.0    7   29-35      1-7   (28)
145 PF14247 DUF4344:  Domain of un  20.9   2E+02  0.0044   19.2   3.8   31    8-38     21-51  (220)
146 TIGR00756 PPR pentatricopeptid  20.9      88  0.0019   13.0   2.3   17    7-23     18-34  (35)
147 cd01540 PBP1_arabinose_binding  20.8 2.1E+02  0.0045   18.1   3.7   23    7-30    205-227 (289)
148 PRK06973 nicotinic acid mononu  20.5 1.3E+02  0.0027   20.2   2.8   24    9-33    108-131 (243)
149 PRK00016 metal-binding heat sh  20.5   1E+02  0.0022   19.3   2.2   20   17-36     31-50  (159)
150 PF14433 SUKH-3:  SUKH-3 immuni  20.4      89  0.0019   18.7   1.9   21   11-31      4-24  (142)
151 PF02705 K_trans:  K+ potassium  20.0 2.5E+02  0.0054   21.4   4.4   34    3-38    292-325 (534)

No 1  
>KOG2275|consensus
Probab=99.31  E-value=3.7e-12  Score=91.06  Aligned_cols=49  Identities=47%  Similarity=0.694  Sum_probs=44.9

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANSVL   50 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s~~   50 (58)
                      |||.++|+++|+|.|+..|++|+|||.++|++|||+||. .|+.-+++.+
T Consensus       130 ~K~~~va~leAir~L~~~g~kp~Rti~lsfvpDEEi~G~-~Gm~~fa~~~  178 (420)
T KOG2275|consen  130 MKCVGVAYLEAIRNLKASGFKPKRTIHLSFVPDEEIGGH-IGMKEFAKTE  178 (420)
T ss_pred             hHhHHHHHHHHHHHHHhcCCCcCceEEEEecCchhccCc-chHHHHhhhh
Confidence            899999999999999999999999999999999999997 5777777644


No 2  
>PRK12891 allantoate amidohydrolase; Reviewed
Probab=98.86  E-value=3.6e-09  Score=73.51  Aligned_cols=49  Identities=14%  Similarity=0.085  Sum_probs=43.2

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHHHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANSVLY   51 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s~~~   51 (58)
                      ||+++++.++|+++|++.|.+|+++|.+++++|||.|+  +|..+.||+.+
T Consensus        96 ~k~Gv~a~l~a~~~l~~~~~~~~~~i~v~~~~dEE~~~--f~~~~~Gs~~~  144 (414)
T PRK12891         96 GIYGVLGGLEVVRALNDAGIETERPVDVVIWTNEEGSR--FAPSMVGSGVF  144 (414)
T ss_pred             chhhHHHHHHHHHHHHHcCCCCCCCeEEEEecccccCc--CCcccccHHHH
Confidence            89999999999999999999999999999999999653  57777777654


No 3  
>PF01546 Peptidase_M20:  Peptidase family M20/M25/M40 This family only corresponds to M20 family;  InterPro: IPR002933 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of proteins contains the metallopeptidases and non-peptidase homologues (amidohydrolases) that belong to the MEROPS peptidase family M20 (clan MH) []. The peptidases of this clan have two catalytic zinc ions at the active site, bound by His/Asp, Asp, Glu, Asp/Glu and His. The catalysed reaction involves the release of an N-terminal amino acid, usually neutral or hydrophobic, from a polypeptide []. The peptidase M20 family has four sub-families:  M20A - type example, glutamate carboxypeptidase from Pseudomonas sp. RS16 (P06621 from SWISSPROT) M20B - type example, peptidase T from Escherichia coli (P29745 from SWISSPROT) M20C - type example, X-His dipeptidase from E. coli (P15288 from SWISSPROT) M20D - type example, carboxypeptidase Ss1 from Sulfolobus solfataricus (P80092 from SWISSPROT)  ; GO: 0016787 hydrolase activity, 0008152 metabolic process; PDB: 3T68_A 3T6M_A 2F8H_A 3GB0_A 3IO1_B 2ZOF_A 2ZOG_B 3MRU_B 3N5F_A 1Z2L_B ....
Probab=98.85  E-value=1.6e-08  Score=62.08  Aligned_cols=44  Identities=18%  Similarity=0.180  Sum_probs=39.6

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAA   45 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~   45 (58)
                      ||+++++.+.|++.|++.+..++++|.++|++|||+|+. .|...
T Consensus        37 ~k~~~~~~l~a~~~l~~~~~~~~~~i~~~~~~~EE~g~~-~g~~~   80 (189)
T PF01546_consen   37 MKGGIAAMLAALKALKESGDDLPGNIIFLFTPDEEIGSI-GGAKH   80 (189)
T ss_dssp             THHHHHHHHHHHHHHHHTTTTCSSEEEEEEESTCCGTST-THHHH
T ss_pred             CcccHHHHHHHHHHHHhccccccccccccccccccCCCc-chhhh
Confidence            799999999999999999999999999999999999986 35443


No 4  
>TIGR03176 AllC allantoate amidohydrolase. This enzyme catalyzes the breakdown of allantoate, first to ureidoglycine by hydrolysis and then decarboxylation of one of the two equivalent ureido groups. Ureidoglycine then spontaneously exchanges ammonia for water resulting in ureidoglycolate. This enzyme is an alternative to allantoicase (3.5.3.4) which releases urea.
Probab=98.84  E-value=4e-09  Score=73.65  Aligned_cols=49  Identities=8%  Similarity=-0.110  Sum_probs=45.5

Q ss_pred             hHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHHHHHH
Q psy3109           2 KCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANSVLYL   52 (58)
Q Consensus         2 K~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s~~~~   52 (58)
                      +.++++.++|++.|+++|.+|+++|.++++.+||  |.+||.++++|+.+.
T Consensus        90 ~~Gv~~~le~~~~l~~~~~~~~~~i~vi~~~~EE--g~rf~~~~~Gs~~~~  138 (406)
T TIGR03176        90 QFGALAAWLAVDYLKEKYGAPLRTVEVLSMAEEE--GSRFPYVFWGSKNIF  138 (406)
T ss_pred             hhhHHHHHHHHHHHHHcCCCCCCCeEEEEecccc--CccCCcccccHHHHh
Confidence            5789999999999999999999999999999999  556999999999986


No 5  
>PRK07205 hypothetical protein; Provisional
Probab=98.83  E-value=8e-09  Score=72.20  Aligned_cols=45  Identities=18%  Similarity=0.203  Sum_probs=40.1

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAA   46 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~   46 (58)
                      ||+++++.+.|++.|++.|..|+++|.++|++|||+|+. +...+.
T Consensus       117 mKgglaa~l~Al~~l~~~~~~~~~~i~l~~~~dEE~g~~-g~~~~~  161 (444)
T PRK07205        117 DKGPSMAALYAVKALLDAGVQFNKRIRFIFGTDEETLWR-CMNRYN  161 (444)
T ss_pred             CcHHHHHHHHHHHHHHHcCCCCCCcEEEEEECCcccCcc-cHHHHH
Confidence            899999999999999999998999999999999999885 444443


No 6  
>PRK07338 hypothetical protein; Provisional
Probab=98.79  E-value=1.7e-08  Score=69.29  Aligned_cols=46  Identities=13%  Similarity=-0.010  Sum_probs=40.0

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAAN   47 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~   47 (58)
                      ||+++++.++|++.|++.+..++++|.++|++|||+|+. ++..++.
T Consensus       130 mKgg~aa~l~a~~~l~~~~~~~~~~i~~~~~~dEE~g~~-g~~~~~~  175 (402)
T PRK07338        130 MKGGIVVMLAALLAFERSPLADKLGYDVLINPDEEIGSP-ASAPLLA  175 (402)
T ss_pred             hhHHHHHHHHHHHHHHhcCCCCCCCEEEEEECCcccCCh-hhHHHHH
Confidence            899999999999999998888889999999999999885 4544443


No 7  
>PRK07906 hypothetical protein; Provisional
Probab=98.78  E-value=1.7e-08  Score=69.91  Aligned_cols=46  Identities=20%  Similarity=0.252  Sum_probs=41.1

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAAN   47 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~   47 (58)
                      ||+++++.++|++.|++.+..|+++|.++|++|||+||. .|..++.
T Consensus       106 ~Kg~~a~~l~a~~~l~~~~~~~~~~i~~~~~~dEE~g~~-~g~~~l~  151 (426)
T PRK07906        106 MKDMDAMMLAVVRHLARTGRRPPRDLVFAFVADEEAGGT-YGAHWLV  151 (426)
T ss_pred             cchHHHHHHHHHHHHHHcCCCCCccEEEEEecCcccchh-hhHHHHH
Confidence            899999999999999999999999999999999999886 4655553


No 8  
>PRK13983 diaminopimelate aminotransferase; Provisional
Probab=98.77  E-value=1.4e-08  Score=68.92  Aligned_cols=47  Identities=15%  Similarity=0.130  Sum_probs=41.6

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANS   48 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s   48 (58)
                      ||+++++.++|++.|++.+..|+++|.++|++|||+|+. +|..++..
T Consensus       118 ~K~g~~a~l~a~~~l~~~~~~~~~~v~~~~~~dEE~g~~-~g~~~~~~  164 (400)
T PRK13983        118 NGQGIVSSLLALKALMDLGIRPKYNLGLAFVSDEETGSK-YGIQYLLK  164 (400)
T ss_pred             ccchHHHHHHHHHHHHHhCCCCCCcEEEEEEeccccCCc-ccHHHHHh
Confidence            799999999999999999988999999999999999886 56665554


No 9  
>PRK13013 succinyl-diaminopimelate desuccinylase; Reviewed
Probab=98.77  E-value=1.5e-08  Score=69.78  Aligned_cols=46  Identities=17%  Similarity=0.079  Sum_probs=40.7

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAAN   47 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~   47 (58)
                      ||+++++.+.|++.|++.+..++++|.++|++|||+||. +|..++.
T Consensus       124 ~Kg~~aa~l~a~~~l~~~~~~~~~~v~~~~~~dEE~g~~-~g~~~l~  169 (427)
T PRK13013        124 MKGGLAASIIAAEAFLAVYPDFAGSIEISGTADEESGGF-GGVAYLA  169 (427)
T ss_pred             cchHHHHHHHHHHHHHHhCCCCCccEEEEEEeccccCCh-hHHHHHH
Confidence            899999999999999999888889999999999999886 5655544


No 10 
>PRK07318 dipeptidase PepV; Reviewed
Probab=98.74  E-value=2.5e-08  Score=70.30  Aligned_cols=43  Identities=5%  Similarity=-0.055  Sum_probs=38.9

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAA   45 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~   45 (58)
                      ||+++++.+.|++.|++.|+.|+++|.++|++|||+|+.  |..+
T Consensus       119 mKgg~aa~l~Al~~l~~~g~~~~~~i~l~~~~DEE~g~~--G~~~  161 (466)
T PRK07318        119 DKGPTMAAYYALKIIKELGLPLSKKVRFIVGTDEESGWK--CMDY  161 (466)
T ss_pred             CcHHHHHHHHHHHHHHHcCCCCCccEEEEEEcccccCch--hHHH
Confidence            899999999999999999988889999999999999874  5444


No 11 
>PRK08588 succinyl-diaminopimelate desuccinylase; Reviewed
Probab=98.71  E-value=3.2e-08  Score=67.28  Aligned_cols=39  Identities=15%  Similarity=0.110  Sum_probs=36.7

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      ||+++++.+.|++.|.+.+..|+++|.++|++|||+|+.
T Consensus       101 ~Kgg~aa~l~a~~~l~~~~~~~~~~i~l~~~~dEE~g~~  139 (377)
T PRK08588        101 MKSGLAALVIAMIELKEQGQLLNGTIRLLATAGEEVGEL  139 (377)
T ss_pred             ccchHHHHHHHHHHHHHcCCCCCCcEEEEEEcccccCch
Confidence            899999999999999999988999999999999998864


No 12 
>PRK08262 hypothetical protein; Provisional
Probab=98.70  E-value=5e-08  Score=68.89  Aligned_cols=39  Identities=23%  Similarity=0.186  Sum_probs=37.1

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      ||+++++.+.|++.|++.+..|+++|.++|.+|||+||.
T Consensus       155 ~Kg~~aa~L~A~~~l~~~~~~l~~~I~llf~~dEE~g~~  193 (486)
T PRK08262        155 DKGSLVAILEAAEALLAQGFQPRRTIYLAFGHDEEVGGL  193 (486)
T ss_pred             cchhHHHHHHHHHHHHHcCCCCCCeEEEEEecccccCCc
Confidence            899999999999999999988889999999999999885


No 13 
>TIGR01880 Ac-peptdase-euk N-acyl-L-amino-acid amidohydrolase. This model represents a family of eukaryotic N-acyl-L-amino-acid amidohydrolases active on fatty acid and acetyl amides of L-amino acids.
Probab=98.70  E-value=4.9e-08  Score=67.05  Aligned_cols=45  Identities=51%  Similarity=0.759  Sum_probs=39.2

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAA   45 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~   45 (58)
                      ||+++++.+.|++.|++.+..|+++|.++|++|||+||..+...+
T Consensus       113 ~K~~~aa~l~a~~~l~~~~~~~~~~v~l~~~~dEE~g~~~G~~~~  157 (400)
T TIGR01880       113 MKCVGVQYLEAVRNLKASGFKFKRTIHISFVPDEEIGGHDGMEKF  157 (400)
T ss_pred             ccHHHHHHHHHHHHHHHcCCCCCceEEEEEeCCcccCcHhHHHHH
Confidence            899999999999999999988899999999999999875333333


No 14 
>PRK12890 allantoate amidohydrolase; Reviewed
Probab=98.67  E-value=3.5e-08  Score=68.27  Aligned_cols=48  Identities=13%  Similarity=-0.022  Sum_probs=40.2

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANSVL   50 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s~~   50 (58)
                      ||+++++.++|+++|++.+..|+++|.+++++|||.|+  +|..+.+++.
T Consensus        94 ~~~g~aa~l~a~~~l~~~~~~~~~~i~~~~~~dEE~~~--~~~~~~G~~~  141 (414)
T PRK12890         94 GILGVLAGLEVVAALREAGIRPPHPLEVIAFTNEEGVR--FGPSMIGSRA  141 (414)
T ss_pred             CHHHHHHHHHHHHHHHHcCCCCCCCeEEEEEecccccc--cCCccccHHH
Confidence            79999999999999999998899999999999999755  3444455543


No 15 
>PRK13009 succinyl-diaminopimelate desuccinylase; Reviewed
Probab=98.66  E-value=7.9e-08  Score=65.07  Aligned_cols=45  Identities=16%  Similarity=0.158  Sum_probs=38.8

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAA   46 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~   46 (58)
                      ||+++++.+.|+++|++.+..++++|+++|++|||.|+. .|...+
T Consensus       100 ~Kgg~aa~l~a~~~l~~~~~~~~~~i~~~~~~~EE~~~~-~G~~~~  144 (375)
T PRK13009        100 MKGSLAAFVVAAERFVAAHPDHKGSIAFLITSDEEGPAI-NGTVKV  144 (375)
T ss_pred             ChHHHHHHHHHHHHHHHhcCCCCceEEEEEEeecccccc-cCHHHH
Confidence            899999999999999999888889999999999998764 344433


No 16 
>PRK09104 hypothetical protein; Validated
Probab=98.65  E-value=6.6e-08  Score=67.93  Aligned_cols=47  Identities=11%  Similarity=0.072  Sum_probs=41.1

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANS   48 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s   48 (58)
                      ||+++++.++|+++|++.+..++.+|.++|++|||+||. ++..++.+
T Consensus       129 ~Kg~laa~l~a~~~l~~~~~~~~~~i~~~~~~dEE~g~~-g~~~~l~~  175 (464)
T PRK09104        129 DKGQLMTFVEACRAWKAVTGSLPVRVTILFEGEEESGSP-SLVPFLEA  175 (464)
T ss_pred             CcHHHHHHHHHHHHHHHhcCCCCCcEEEEEECccccCCc-cHHHHHHh
Confidence            899999999999999998766777899999999999986 66666654


No 17 
>TIGR01246 dapE_proteo succinyl-diaminopimelate desuccinylase, proteobacterial clade. This model describes a proteobacterial subset of succinyl-diaminopimelate desuccinylases. An experimentally confirmed Gram-positive lineage succinyl-diaminopimelate desuccinylase has been described for Corynebacterium glutamicum, and a neighbor-joining tree shows the seed members, SP:Q59284, and putative archaeal members such as TrEMBL:O58003 in a single clade. However, the archaeal members differ substantially, share a number of motifs with acetylornithine deacetylases rather than succinyl-diaminopimelate desuccinylases, and are not taken as trusted examples of succinyl-diaminopimelate desuccinylases. This model is limited to proteobacterial members for this reason.
Probab=98.65  E-value=8.3e-08  Score=65.11  Aligned_cols=47  Identities=13%  Similarity=0.150  Sum_probs=39.6

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANS   48 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s   48 (58)
                      ||+++++++.|++.|++.+..++.+|.++|++|||+|+. .|...+..
T Consensus        97 ~Kgg~a~~l~a~~~l~~~~~~~~~~v~~~~~~dEE~~~~-~G~~~~~~  143 (370)
T TIGR01246        97 MKGSLAAFIVAAERFVKKNPDHKGSISLLITSDEEGTAI-DGTKKVVE  143 (370)
T ss_pred             chHHHHHHHHHHHHHHHhcCCCCCcEEEEEEeccccCCC-cCHHHHHH
Confidence            899999999999999999877788999999999998765 35544433


No 18 
>TIGR01887 dipeptidaselike dipeptidase, putative. This model represents a clade of probable zinc dipeptidases, closely related to the characterized non-specific dipeptidase, PepV. Many enzymes in this clade have been given names including the terms "Xaa-His" and "carnosinase" due to the early mis-characterization of the Lactobacillus delbrueckii PepV enzyme. These names are likely too specific.
Probab=98.64  E-value=7.7e-08  Score=68.19  Aligned_cols=45  Identities=22%  Similarity=0.084  Sum_probs=39.8

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAA   46 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~   46 (58)
                      ||+++++.+.|++.|++.+..|+++|.++|+.|||+|+. +...++
T Consensus       107 ~KG~laa~l~a~~~l~~~~~~~~~~i~~~~~~dEE~g~~-g~~~~l  151 (447)
T TIGR01887       107 DKGPTIAALYAMKILKELGLKLKKKIRFIFGTDEETGWA-CIDYYF  151 (447)
T ss_pred             CcHHHHHHHHHHHHHHHcCCCCCCcEEEEEECCcccCcH-hHHHHH
Confidence            899999999999999999998899999999999999875 444444


No 19 
>PRK07473 carboxypeptidase; Provisional
Probab=98.63  E-value=1e-07  Score=65.73  Aligned_cols=45  Identities=24%  Similarity=0.191  Sum_probs=38.8

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAA   46 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~   46 (58)
                      ||+++++.+.|++.|++.+..++.+|.++|++|||+|+. +...++
T Consensus       113 ~Kgglaa~l~A~~~l~~~~~~~~~~v~~~~~~dEE~g~~-g~~~~~  157 (376)
T PRK07473        113 MKGGNYLALEAIRQLARAGITTPLPITVLFTPDEEVGTP-STRDLI  157 (376)
T ss_pred             chHHHHHHHHHHHHHHHcCCCCCCCEEEEEeCCcccCCc-cHHHHH
Confidence            899999999999999999877778899999999999885 444443


No 20 
>PRK13590 putative bifunctional OHCU decarboxylase/allantoate amidohydrolase; Provisional
Probab=98.62  E-value=5.1e-08  Score=71.31  Aligned_cols=49  Identities=8%  Similarity=-0.020  Sum_probs=44.4

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHHHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANSVLY   51 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s~~~   51 (58)
                      +|.++++++++++.|++.|.+|+++|.++++.|||  |.+|+.++.||+.+
T Consensus       271 G~~Gv~a~lea~~~l~~~~~~~~~~i~vv~~~~EE--g~rF~~~~~GS~~~  319 (591)
T PRK13590        271 GRLGIFVPMACVRELHRQGRRLPFGLEVVGFAEEE--GQRYKATFLGSGAL  319 (591)
T ss_pred             cHHHHHHHHHHHHHHHHcCCCCCCCeEEEEecCCc--cccCCccccchHHH
Confidence            47899999999999999999999999999999999  55688889999864


No 21 
>PRK09133 hypothetical protein; Provisional
Probab=98.62  E-value=8.4e-08  Score=67.53  Aligned_cols=45  Identities=24%  Similarity=0.177  Sum_probs=38.5

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAA   46 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~   46 (58)
                      ||+++++.++|++.|++.+..|+++|.++|++|||+++. .|...+
T Consensus       142 ~Kg~~aa~l~a~~~l~~~~~~~~~~i~~~~~~dEE~~g~-~G~~~l  186 (472)
T PRK09133        142 DKADAAIWVATLIRLKREGFKPKRDIILALTGDEEGTPM-NGVAWL  186 (472)
T ss_pred             chHHHHHHHHHHHHHHhcCCCCCCCEEEEEECccccCcc-chHHHH
Confidence            899999999999999999988999999999999995443 354444


No 22 
>PRK08596 acetylornithine deacetylase; Validated
Probab=98.60  E-value=9.9e-08  Score=66.36  Aligned_cols=39  Identities=18%  Similarity=0.043  Sum_probs=36.9

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      ||+++++.+.|+++|++.+..|+++|.++|++|||+|+.
T Consensus       119 ~Kgg~a~~l~a~~~l~~~~~~~~~~v~~~~~~dEE~g~~  157 (421)
T PRK08596        119 MKGGLAGALFAIQLLHEAGIELPGDLIFQSVIGEEVGEA  157 (421)
T ss_pred             cchHHHHHHHHHHHHHHcCCCCCCcEEEEEEeccccCCc
Confidence            899999999999999999988889999999999999874


No 23 
>PRK06156 hypothetical protein; Provisional
Probab=98.59  E-value=1.3e-07  Score=67.88  Aligned_cols=45  Identities=13%  Similarity=-0.003  Sum_probs=39.6

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAA   46 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~   46 (58)
                      ||+++++.++|++.|++.+.+|+++|.++|++|||+|+. +...+.
T Consensus       155 ~Kgg~a~~l~a~~~l~~~~~~~~~~i~~~~~~dEE~g~~-G~~~~~  199 (520)
T PRK06156        155 DKGAIVTALYAMKAIKDSGLPLARRIELLVYTTEETDGD-PLKYYL  199 (520)
T ss_pred             chHHHHHHHHHHHHHHHcCCCCCceEEEEEecccccCch-hHHHHH
Confidence            899999999999999999998889999999999999885 444333


No 24 
>COG0624 ArgE Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases [Amino acid transport and metabolism]
Probab=98.59  E-value=1e-07  Score=65.66  Aligned_cols=48  Identities=17%  Similarity=0.103  Sum_probs=42.8

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANSV   49 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s~   49 (58)
                      ||+++++.+.|++.+.+.|..++++|.++|++|||+||. ++..++...
T Consensus       117 ~KG~~~a~l~A~~~l~~~~~~~~~~v~~~~~~dEE~g~~-~~~~~~~~~  164 (409)
T COG0624         117 MKGGLAAALYALSALKAAGGELPGDVRLLFTADEESGGA-GGKAYLEEG  164 (409)
T ss_pred             cchHHHHHHHHHHHHHHhCCCCCeEEEEEEEeccccCCc-chHHHHHhc
Confidence            899999999999999999888999999999999999996 666665544


No 25 
>PRK08201 hypothetical protein; Provisional
Probab=98.59  E-value=1.3e-07  Score=66.24  Aligned_cols=46  Identities=4%  Similarity=-0.105  Sum_probs=39.3

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAAN   47 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~   47 (58)
                      ||+++++.++|+++|++.+..++++|.++|++|||+|+. +...++.
T Consensus       121 mKgglaa~l~a~~~l~~~~~~~~~~i~~~~~~dEE~g~~-g~~~~l~  166 (456)
T PRK08201        121 DKGQVFMHLKAVEALLKVEGTLPVNVKFCIEGEEEIGSP-NLDSFVE  166 (456)
T ss_pred             CcHHHHHHHHHHHHHHHhcCCCCCCEEEEEEcccccCCc-cHHHHHH
Confidence            899999999999999887666778999999999999986 5555543


No 26 
>TIGR01910 DapE-ArgE acetylornithine deacetylase or succinyl-diaminopimelate desuccinylase. This group of sequences contains annotations for both acetylornithine deacetylase and succinyl-diaminopimelate desuccinylase, but does not contain any members with experimental characterization. Bacillus, Staphylococcus and Sulfolobus species contain multiple hits to this subfamily and each may have a separate activity. Determining which is which must await further laboratory research.
Probab=98.58  E-value=1.3e-07  Score=64.31  Aligned_cols=45  Identities=18%  Similarity=0.051  Sum_probs=39.1

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAA   46 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~   46 (58)
                      ||++++++++|++.|++.+..++++|.++|++|||+|+. +...++
T Consensus       106 ~k~~~a~~l~a~~~l~~~~~~~~~~i~~~~~~~EE~g~~-G~~~~~  150 (375)
T TIGR01910       106 MKGGLVALLYALKAIREAGIKPNGNIILQSVVDEESGEA-GTLYLL  150 (375)
T ss_pred             cchHHHHHHHHHHHHHHcCCCCCccEEEEEEcCcccCch-hHHHHH
Confidence            799999999999999999888889999999999999875 444443


No 27 
>PRK09290 allantoate amidohydrolase; Reviewed
Probab=98.57  E-value=1.2e-07  Score=65.76  Aligned_cols=37  Identities=19%  Similarity=0.115  Sum_probs=35.1

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCC
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQ   37 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~G   37 (58)
                      ||+++++.++|+++|++.+..|+++|.+++++|||+|
T Consensus        93 ~k~g~aa~l~a~~~l~~~~~~~~~~i~~~~~~dEE~g  129 (413)
T PRK09290         93 GPLGVLAGLEAVRTLNERGIRPRRPIEVVAFTNEEGS  129 (413)
T ss_pred             CHHHHHHHHHHHHHHHHcCCCCCCCeEEEEEcCCccc
Confidence            6999999999999999999888999999999999986


No 28 
>PRK12893 allantoate amidohydrolase; Reviewed
Probab=98.56  E-value=1.1e-07  Score=65.53  Aligned_cols=37  Identities=19%  Similarity=0.103  Sum_probs=34.9

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCC
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQ   37 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~G   37 (58)
                      +|+++++.+.|++.|++.+..++++|.++|++|||+|
T Consensus        96 gk~gvaa~l~a~~~l~~~~~~~~~~v~~~~~~dEE~g  132 (412)
T PRK12893         96 GALGVLAALEVVRTLNDAGIRTRRPIEVVSWTNEEGA  132 (412)
T ss_pred             chhhHHHHHHHHHHHHHcCCCCCCCeEEEEEcccccc
Confidence            5899999999999999999888999999999999975


No 29 
>TIGR01886 dipeptidase dipeptidase PepV. This model represents a small clade of dipeptidase enzymes which are members of the larger M25 subfamily of metalloproteases. Two characterized enzymes are included in the seed. One, from Lactococcus lactis has been shown to act on a wide range of dipeptides, but not larger peptides. The enzyme from Lactobacillus delbrueckii was originally characterized as a Xaa-His dipeptidase, specifically a carnosinase (beta-Ala-His) by complementation of an E. coli mutant. Further study, including the crystallization of the enzyme, has shown it to also be a non-specific dipeptidase. This group also includes enzymes from Streptococcus and Enterococcus.
Probab=98.52  E-value=2.2e-07  Score=65.87  Aligned_cols=39  Identities=10%  Similarity=-0.077  Sum_probs=37.2

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      ||+++++.+.|+++|++.+..|+++|.++|+.|||+|+.
T Consensus       118 ~Kg~~~a~l~a~~~l~~~~~~~~~~i~~~~~~dEE~g~~  156 (466)
T TIGR01886       118 DKGPSLAAYYAMKILKELGLPPSKKIRFVVGTNEETGWV  156 (466)
T ss_pred             cchHHHHHHHHHHHHHHhCCCCCCCEEEEEECccccCcc
Confidence            899999999999999999998999999999999999875


No 30 
>PRK06915 acetylornithine deacetylase; Validated
Probab=98.52  E-value=2.5e-07  Score=63.95  Aligned_cols=39  Identities=18%  Similarity=0.178  Sum_probs=36.4

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      ||+++++.+.|++.|++.+..++.+|.++|++|||+|+.
T Consensus       135 ~Kgg~aa~l~a~~~l~~~~~~~~~~v~~~~~~dEE~g~~  173 (422)
T PRK06915        135 MKGGNVALLLAMEALIESGIELKGDVIFQSVIEEESGGA  173 (422)
T ss_pred             chHHHHHHHHHHHHHHHcCCCCCCcEEEEEecccccCCc
Confidence            899999999999999999887888999999999999875


No 31 
>PRK06446 hypothetical protein; Provisional
Probab=98.52  E-value=2.7e-07  Score=64.54  Aligned_cols=45  Identities=11%  Similarity=-0.009  Sum_probs=38.0

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAAN   47 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~   47 (58)
                      ||+++++.+.|+++|++.+ .|+.+|.++|++|||+|+. +...++.
T Consensus       104 mKgglaa~l~A~~~l~~~~-~~~~~i~~~~~~dEE~g~~-g~~~~l~  148 (436)
T PRK06446        104 NKGTLMARLFAIKHLIDKH-KLNVNVKFLYEGEEEIGSP-NLEDFIE  148 (436)
T ss_pred             CcHHHHHHHHHHHHHHHcC-CCCCCEEEEEEcccccCCH-hHHHHHH
Confidence            8999999999999998776 5778999999999999986 5444443


No 32 
>PRK06133 glutamate carboxypeptidase; Reviewed
Probab=98.51  E-value=3.2e-07  Score=63.75  Aligned_cols=39  Identities=23%  Similarity=0.079  Sum_probs=36.4

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      ||+++++.+.|++.|++.+..|+.+|.++|++|||+|+.
T Consensus       137 ~kgg~a~~l~a~~~l~~~~~~~~~~i~~~~~~dEE~g~~  175 (410)
T PRK06133        137 DKGGVAVILHALKILQQLGFKDYGTLTVLFNPDEETGSP  175 (410)
T ss_pred             chHHHHHHHHHHHHHHHcCCCCCCCEEEEEECCcccCCc
Confidence            899999999999999999888888999999999999874


No 33 
>PRK07079 hypothetical protein; Provisional
Probab=98.51  E-value=2.8e-07  Score=64.99  Aligned_cols=39  Identities=13%  Similarity=0.131  Sum_probs=35.5

Q ss_pred             ChHHHHHHHHHHHHHHHc-CCCCCceEEEEeeCCCCCCCh
Q psy3109           1 MKCVGIQYLETIRRYIAE-GKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~-g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      ||+++++.+.|++.|++. +.+++++|.++|++|||+||.
T Consensus       128 mKgg~aa~l~A~~~l~~~~~~~~~~~i~~~~~~dEE~g~~  167 (469)
T PRK07079        128 NKGQHTINLAALEQVLAARGGRLGFNVKLLIEMGEEIGSP  167 (469)
T ss_pred             CcHHHHHHHHHHHHHHHhcCCCCCCCEEEEEECccccCCc
Confidence            899999999999999764 578889999999999999885


No 34 
>PRK04443 acetyl-lysine deacetylase; Provisional
Probab=98.51  E-value=1.9e-07  Score=63.36  Aligned_cols=44  Identities=7%  Similarity=-0.034  Sum_probs=37.3

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANS   48 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s   48 (58)
                      ||+++++.++|++.|   +..++++|.+++++|||+|+. ++..++.+
T Consensus        92 ~Kg~~aa~l~A~~~l---~~~~~~~i~~~~~~dEE~g~~-~~~~~l~~  135 (348)
T PRK04443         92 AKGPLAAFAAAAARL---EALVRARVSFVGAVEEEAPSS-GGARLVAD  135 (348)
T ss_pred             cccHHHHHHHHHHHh---cccCCCCEEEEEEcccccCCh-hHHHHHHh
Confidence            899999999999998   456888999999999999986 56555543


No 35 
>TIGR01883 PepT-like peptidase T-like protein. This model represents a clade of enzymes closely related to Peptidase T, an aminotripeptidase found in bacteria. This clade consists of gram positive bacteria of which several additionally contain a Peptidase T gene.
Probab=98.51  E-value=2.4e-07  Score=62.56  Aligned_cols=38  Identities=18%  Similarity=-0.018  Sum_probs=34.6

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      ||+++++++++++.|++.+ .|+++|.++|++|||.|+.
T Consensus       101 ~k~g~a~~l~~~~~l~~~~-~~~~~v~~~~~~~EE~g~~  138 (361)
T TIGR01883       101 DKAGVAAMLEAMDVLSTEE-TPHGTIEFIFTVKEELGLI  138 (361)
T ss_pred             ccHHHHHHHHHHHHHHhcC-CCCCCEEEEEEcccccCch
Confidence            8999999999999999876 5778999999999998764


No 36 
>PRK13799 unknown domain/N-carbamoyl-L-amino acid hydrolase fusion protein; Provisional
Probab=98.50  E-value=1.6e-07  Score=68.74  Aligned_cols=48  Identities=6%  Similarity=-0.015  Sum_probs=44.7

Q ss_pred             HHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHHHHHH
Q psy3109           3 CVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANSVLYL   52 (58)
Q Consensus         3 ~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s~~~~   52 (58)
                      .++++.+++++.|++.|.+|+++|.++.+.|||  |.+|+.++.||+.+.
T Consensus       273 ~Gv~a~l~~~~~l~~~~~~~~~~i~vi~~~~EE--g~rF~~~~~GS~~~~  320 (591)
T PRK13799        273 EGIFLAIACVKELHEQGERLPFHFEVIAFAEEE--GQRFKATFLGSGALI  320 (591)
T ss_pred             HHHHHHHHHHHHHHHcCCCCCCCeEEEEecCCC--ccCCCccccchHHHh
Confidence            478999999999999999999999999999999  667999999999985


No 37 
>PRK06837 acetylornithine deacetylase; Provisional
Probab=98.48  E-value=3.5e-07  Score=63.70  Aligned_cols=39  Identities=18%  Similarity=0.069  Sum_probs=36.8

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      ||+++++.+.|++.|++.|..|+++|.++|++|||.|+.
T Consensus       139 ~Kgg~~a~l~a~~~l~~~~~~~~~~i~~~~~~dEE~~g~  177 (427)
T PRK06837        139 MKAGLAAMLFALDALRAAGLAPAARVHFQSVIEEESTGN  177 (427)
T ss_pred             chHHHHHHHHHHHHHHHcCCCCCCcEEEEEEeccccCCH
Confidence            899999999999999999988899999999999998875


No 38 
>TIGR01879 hydantase amidase, hydantoinase/carbamoylase family. Enzymes in this subfamily hydrolize the amide bonds of compounds containing carbamoyl groups or hydantoin rings. These enzymes are members of the broader family of amidases represented by pfam01546.
Probab=98.44  E-value=3.2e-07  Score=63.45  Aligned_cols=48  Identities=8%  Similarity=-0.153  Sum_probs=38.4

Q ss_pred             hHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHHHHH
Q psy3109           2 KCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANSVLY   51 (58)
Q Consensus         2 K~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s~~~   51 (58)
                      |.++++.++|+++|++.|.+|+++|.+++++|||.+  +|+.++.+++.+
T Consensus        88 ~~gvaa~l~a~~~l~~~g~~~~~~i~~~~~~dEE~~--~f~~~~~Gs~~~  135 (401)
T TIGR01879        88 QLGVLAGIEVVDALKEAYVVPLHPIEVVAFTEEEGS--RFPYGMWGSRNM  135 (401)
T ss_pred             HHHHHHHHHHHHHHHHcCCCCCCCeEEEEEeCCcCc--CcccccccHHHH
Confidence            567899999999999999999999999999999942  244444444443


No 39 
>PRK05469 peptidase T; Provisional
Probab=98.42  E-value=4.3e-07  Score=62.76  Aligned_cols=37  Identities=19%  Similarity=0.166  Sum_probs=34.4

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCC
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQ   37 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~G   37 (58)
                      ||+++++.+.|++.|++.+..|+.+|.++|++|||+|
T Consensus       141 ~Kgglaa~l~a~~~l~~~~~~~~g~v~~~f~~dEE~g  177 (408)
T PRK05469        141 DKAGIAEIMTALEYLIAHPEIKHGDIRVAFTPDEEIG  177 (408)
T ss_pred             chHHHHHHHHHHHHHHhCCCCCCCCEEEEEecccccC
Confidence            8999999999999999887677789999999999986


No 40 
>TIGR01882 peptidase-T peptidase T. This model represents a tripeptide aminopeptidase known as Peptidase T, which has a substrate preference for hydrophobic peptides.
Probab=98.41  E-value=4.9e-07  Score=62.89  Aligned_cols=37  Identities=19%  Similarity=0.125  Sum_probs=33.0

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCC
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQ   37 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~G   37 (58)
                      ||++++++++|++.|++.+..++.+|.++|++|||+|
T Consensus       143 ~KgglAa~l~A~~~L~e~~~~~~g~I~~~ft~dEE~g  179 (410)
T TIGR01882       143 DKAGIAEIMTAADYLINHPEIKHGTIRVAFTPDEEIG  179 (410)
T ss_pred             CHHHHHHHHHHHHHHHhCCCCCCCCEEEEEECcccCC
Confidence            8999999999999999874446779999999999985


No 41 
>PRK08554 peptidase; Reviewed
Probab=98.41  E-value=6.6e-07  Score=63.23  Aligned_cols=41  Identities=7%  Similarity=0.041  Sum_probs=35.2

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLA   44 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~   44 (58)
                      ||+++++.+.|++.|++.+  ++++|.++|++|||+||. ++..
T Consensus       104 mKgg~aa~l~A~~~l~~~~--~~~~i~l~~~~dEE~g~~-~~~~  144 (438)
T PRK08554        104 DKGNVASVMLALKELSKEP--LNGKVIFAFTGDEEIGGA-MAMH  144 (438)
T ss_pred             chHHHHHHHHHHHHHHhcC--CCCCEEEEEEcccccCcc-ccHH
Confidence            8999999999999998754  667899999999999885 4543


No 42 
>PRK13004 peptidase; Reviewed
Probab=98.39  E-value=9.2e-07  Score=60.97  Aligned_cols=38  Identities=11%  Similarity=-0.144  Sum_probs=35.4

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCC
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQN   38 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg   38 (58)
                      ||+++++.+.|++.|++.+..++++|.++|++|||.++
T Consensus       111 ~Kg~~aa~l~a~~~l~~~~~~~~~~i~~~~~~~EE~~~  148 (399)
T PRK13004        111 QKGGMASMVYAAKIIKDLGLDDEYTLYVTGTVQEEDCD  148 (399)
T ss_pred             cchHHHHHHHHHHHHHhcCCCCCCeEEEEEEcccccCc
Confidence            89999999999999999998888999999999999754


No 43 
>PRK12892 allantoate amidohydrolase; Reviewed
Probab=98.39  E-value=5.9e-07  Score=61.81  Aligned_cols=37  Identities=14%  Similarity=-0.036  Sum_probs=34.5

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCC
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQ   37 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~G   37 (58)
                      +|++++++++|+++|++.|.+|+++|.+++++|||.|
T Consensus        94 g~~Gvaa~l~a~~~l~~~~~~~~~~i~~~~~~dEE~~  130 (412)
T PRK12892         94 GALGVVAGLEAARALNEHGIATRHPLDVVAWCDEEGS  130 (412)
T ss_pred             chHHHHHHHHHHHHHHHcCCCCCCCeEEEEecCcccc
Confidence            4788999999999999999999999999999999975


No 44 
>PRK08652 acetylornithine deacetylase; Provisional
Probab=98.38  E-value=7.4e-07  Score=59.57  Aligned_cols=37  Identities=14%  Similarity=0.003  Sum_probs=32.8

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      ||++++++++|++.|.+..  ++++|.++|++|||+|+.
T Consensus        88 ~Kg~~a~~l~a~~~l~~~~--~~~~v~~~~~~dEE~g~~  124 (347)
T PRK08652         88 AKGGVAAILLALEELGKEF--EDLNVGIAFVSDEEEGGR  124 (347)
T ss_pred             hhHHHHHHHHHHHHHhhcc--cCCCEEEEEecCcccCCh
Confidence            8999999999999998653  456999999999999875


No 45 
>TIGR01900 dapE-gram_pos succinyl-diaminopimelate desuccinylase. This enzyme is involved in the biosynthesis of lysine, and is related to the enzyme acetylornithine deacetylase and other amidases and peptidases found within pfam01546.
Probab=98.37  E-value=8.2e-07  Score=61.15  Aligned_cols=45  Identities=16%  Similarity=0.091  Sum_probs=36.0

Q ss_pred             ChHHHHHHHHHHHHHHH--cCCCCCceEEEEeeCCCCCCChhhhHHH
Q psy3109           1 MKCVGIQYLETIRRYIA--EGKKFKRTIHVSFVPGPVTQNETCGLAA   45 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~--~g~~p~r~I~l~f~~DEE~Gg~~~g~~~   45 (58)
                      ||+++++.+.|++.|.+  .+..++.+|.++|++|||+|+...|...
T Consensus       104 mKgg~aa~l~a~~~l~~~~~~~~~~~~i~~~~~~dEE~~~~~~G~~~  150 (373)
T TIGR01900       104 MKAGDAVMLHLAATLDGRAPETELKHDLTLIAYDCEEVAAEKNGLGH  150 (373)
T ss_pred             hhHHHHHHHHHHHHHhhhccccCCCCCEEEEEEecccccCCCCCHHH
Confidence            89999999999999954  4557788999999999999753224433


No 46 
>PRK08651 succinyl-diaminopimelate desuccinylase; Reviewed
Probab=98.33  E-value=1.1e-06  Score=59.91  Aligned_cols=36  Identities=25%  Similarity=0.273  Sum_probs=33.0

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      ||+++++.++|++.|++.+   +++|.++|++|||+|+.
T Consensus       115 ~k~~~~~~l~a~~~l~~~~---~~~v~~~~~~~EE~g~~  150 (394)
T PRK08651        115 MKGGIAALLAAFERLDPAG---DGNIELAIVPDEETGGT  150 (394)
T ss_pred             cchHHHHHHHHHHHHHhcC---CCCEEEEEecCccccch
Confidence            7999999999999998876   68999999999999873


No 47 
>PRK13381 peptidase T; Provisional
Probab=98.32  E-value=1.3e-06  Score=60.27  Aligned_cols=38  Identities=16%  Similarity=0.091  Sum_probs=34.3

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      ||+++++.+.|++.|.+.+ .++.+|.++|++|||+|+.
T Consensus       139 mKgg~aa~l~a~~~l~~~~-~~~g~i~~~~~~dEE~g~~  176 (404)
T PRK13381        139 NKAAIAVVMTLLENLTENE-VEHGDIVVAFVPDEEIGLR  176 (404)
T ss_pred             cHHHHHHHHHHHHHHHhcC-CCCCCEEEEEEcccccccc
Confidence            8999999999999999885 4677999999999999864


No 48 
>PRK07522 acetylornithine deacetylase; Provisional
Probab=98.31  E-value=1.3e-06  Score=59.27  Aligned_cols=43  Identities=16%  Similarity=0.105  Sum_probs=35.6

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAA   46 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~   46 (58)
                      ||+++++.+.|++.|++.+  ++++|.++|++|||+|+. +...++
T Consensus       105 ~Kg~~a~~l~a~~~l~~~~--~~~~i~~~~~~dEE~g~~-G~~~l~  147 (385)
T PRK07522        105 MKGFIAAALAAVPELAAAP--LRRPLHLAFSYDEEVGCL-GVPSMI  147 (385)
T ss_pred             cchHHHHHHHHHHHHHhCC--CCCCEEEEEEeccccCCc-cHHHHH
Confidence            7999999999999998874  568999999999999874 433433


No 49 
>TIGR01892 AcOrn-deacetyl acetylornithine deacetylase (ArgE). This model represents a clade of acetylornithine deacetylases from proteobacteria. This enzyme is the final step of the "acetylated" ornithine biosynthesis pathway. The enzyme is closely related to dapE, succinyl-diaminopimelate desuccinylase, and outside of this clade annotation is very inaccurate as to which function should be ascribed to genes.
Probab=98.30  E-value=1.5e-06  Score=58.37  Aligned_cols=37  Identities=19%  Similarity=0.177  Sum_probs=33.1

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      ||+++++.+.|++.|++.+  .+++|.++|++|||+|+.
T Consensus        99 ~Kg~~a~~l~a~~~l~~~~--~~~~v~~~~~~~EE~g~~  135 (364)
T TIGR01892        99 MKGFLACALAAAPDLAAEQ--LKKPLHLALTADEEVGCT  135 (364)
T ss_pred             cchHHHHHHHHHHHHHhcC--cCCCEEEEEEeccccCCc
Confidence            8999999999999999865  467899999999999874


No 50 
>TIGR03106 trio_M42_hydro hydrolase, peptidase M42 family. This model describes a subfamily of MEROPS peptidase family M42, a glutamyl aminopeptidase family that also includes the cellulase CelM from Clostridium thermocellum and deblocking aminopeptidases that can remove acylated amino acids. Members of this family occur in a three gene cassette with an amidotransferase (TIGR03104)in the asparagine synthase (glutamine-hydrolyzing) family, and a probable acetyltransferase (TIGR03103) in the GNAT family.
Probab=98.30  E-value=1.5e-06  Score=60.56  Aligned_cols=38  Identities=11%  Similarity=0.067  Sum_probs=35.9

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCC
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQN   38 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg   38 (58)
                      ||++++++++++++|++.+..++++|+++|+.|||+|.
T Consensus       184 ~K~G~a~~l~~~~~l~~~~~~~~~~v~~~~t~qEEvG~  221 (343)
T TIGR03106       184 DKAGVAALLAALKAIVEHKVPLPVDVHPLFTITEEVGS  221 (343)
T ss_pred             cHHhHHHHHHHHHHHHhcCCCCCceEEEEEECCcccCc
Confidence            89999999999999999888888999999999999983


No 51 
>PRK07907 hypothetical protein; Provisional
Probab=98.24  E-value=2.5e-06  Score=59.80  Aligned_cols=42  Identities=12%  Similarity=-0.095  Sum_probs=34.8

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAA   46 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~   46 (58)
                      ||+++++.+.|+++|   +..++++|.+++++|||+|+. +...++
T Consensus       125 ~Kg~~aa~l~a~~~l---~~~~~~~i~~~~~~dEE~g~~-g~~~~l  166 (449)
T PRK07907        125 DKGGIAMHLAALRAL---GGDLPVGVTVFVEGEEEMGSP-SLERLL  166 (449)
T ss_pred             CcHHHHHHHHHHHHh---ccCCCCcEEEEEEcCcccCCc-cHHHHH
Confidence            899999999999999   445678999999999999985 443433


No 52 
>PRK05111 acetylornithine deacetylase; Provisional
Probab=98.24  E-value=2.3e-06  Score=58.24  Aligned_cols=37  Identities=19%  Similarity=0.098  Sum_probs=32.7

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      ||+++++++.|+++|++.+  ++.+|.++|++|||+|+.
T Consensus       112 ~Kg~~a~~l~a~~~l~~~~--~~~~i~~~~~~~EE~g~~  148 (383)
T PRK05111        112 MKGFFAFILEALRDIDLTK--LKKPLYILATADEETSMA  148 (383)
T ss_pred             ccHHHHHHHHHHHHHhhcC--CCCCeEEEEEeccccCcc
Confidence            8999999999999998764  346899999999999874


No 53 
>PRK13007 succinyl-diaminopimelate desuccinylase; Reviewed
Probab=98.21  E-value=2.7e-06  Score=57.20  Aligned_cols=41  Identities=17%  Similarity=0.117  Sum_probs=33.5

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAA   45 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~   45 (58)
                      ||+++++.+.|++.|.    .|+++|.++|++|||+|+...|...
T Consensus        96 ~Kg~~a~~l~a~~~l~----~~~~~i~~~~~~~EE~~~~~~G~~~  136 (352)
T PRK13007         96 MKSGLAVMLHLAATLA----EPAHDLTLVFYDCEEVEAEANGLGR  136 (352)
T ss_pred             ccHHHHHHHHHHHHhh----ccCCCeEEEEEecccccCCcccHHH
Confidence            8999999999999993    3678999999999999764234433


No 54 
>PRK00466 acetyl-lysine deacetylase; Validated
Probab=98.20  E-value=3.2e-06  Score=57.27  Aligned_cols=35  Identities=14%  Similarity=-0.006  Sum_probs=31.4

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      ||+++++.++|++.|++.|.    +|.++|++|||+|+.
T Consensus        93 mKgg~aa~l~a~~~l~~~~~----~i~~~~~~dEE~g~~  127 (346)
T PRK00466         93 AKGPLISMIIAAWLLNEKGI----KVMVSGLADEESTSI  127 (346)
T ss_pred             cchHHHHHHHHHHHHHHcCC----CEEEEEEcCcccCCc
Confidence            89999999999999999873    489999999998764


No 55 
>TIGR03320 ygeY M20/DapE family protein YgeY. Members of this protein family, including the YgeY protein of Escherichia coli, typically are found in extended genomic regions associated with purine catabolism. Homologs include peptidases and deacylases of the M20/M25 /M40 and DapE/ArgE families. The function is unknown.
Probab=98.18  E-value=2.9e-06  Score=58.34  Aligned_cols=38  Identities=8%  Similarity=-0.229  Sum_probs=35.2

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCC
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQN   38 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg   38 (58)
                      ||+++++.+.|++.|++.|..++.+|.+.+++|||.++
T Consensus       109 ~Kg~~aa~l~A~~~l~~~g~~~~~~i~~~~~~dEE~~~  146 (395)
T TIGR03320       109 QEGGIASMVYAGKIIKDLGLLDDYTLLVTGTVQEEDCD  146 (395)
T ss_pred             ccchHHHHHHHHHHHHHcCCCCCceEEEEecccccccC
Confidence            89999999999999999998888899999999999754


No 56 
>PRK10199 alkaline phosphatase isozyme conversion aminopeptidase; Provisional
Probab=98.16  E-value=5.4e-06  Score=58.40  Aligned_cols=37  Identities=14%  Similarity=-0.099  Sum_probs=33.5

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      ||+++++++++++.|++.  .|+++|.+++++|||.|..
T Consensus       144 nasGvA~lLe~ar~l~~~--~~~~~I~fv~~~~EE~Gl~  180 (346)
T PRK10199        144 NAAGLGVMLELAERLKNV--PTEYGIRFVATSGEEEGKL  180 (346)
T ss_pred             cHHHHHHHHHHHHHHhhC--CCCCcEEEEEECCcccCcH
Confidence            799999999999999865  4788999999999999875


No 57 
>TIGR01902 dapE-lys-deAc N-acetyl-ornithine/N-acetyl-lysine deacetylase. This clade of mainly archaeal and related bacterial species contains two characterized enzymes, an deacetylase with specificity for both N-acetyl-ornithine and N-acetyl-lysine from Thermus which is found within a lysine biosynthesis operon, and a fusion protein with acetyl-glutamate kinase (an enzyme of ornithine biosynthesis) from Lactobacillus. It is possible that all of the sequences within this clade have dual specificity, or that a mix of specificities have evolved within this clade.
Probab=98.13  E-value=4.8e-06  Score=56.09  Aligned_cols=38  Identities=16%  Similarity=0.031  Sum_probs=32.7

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLA   44 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~   44 (58)
                      ||+++++.++|+++|++.+    .+|.+++++|||+|+.  |..
T Consensus        83 ~Kg~~aa~l~a~~~l~~~~----~~i~~~~~~dEE~g~~--G~~  120 (336)
T TIGR01902        83 AKGPLIAMIFATWLLNEKG----IKVIVSGLVDEESSSK--GAR  120 (336)
T ss_pred             CCcHHHHHHHHHHHHHhCC----CcEEEEEEeCcccCCc--cHH
Confidence            8999999999999998764    4799999999999774  544


No 58 
>TIGR03526 selenium_YgeY putative selenium metabolism hydrolase. SelD, selenophosphate synthase, is the selenium donor protein for both selenocysteine and selenouridine biosynthesis systems, but it occurs also in a few prokaryotes that have neither of those pathways. The method of partial phylogenetic profiling, starting from such orphan-selD genomes, identifies this protein as one of those most strongly correlated to SelD occurrence. Its distribution is also well correlated with that of family TIGR03309, a putative accessory protein of labile selenium (non-selenocysteine) enzyme maturation. This family includes the uncharacterized YgeY of Escherichia coli, and belongs to a larger family of metalloenzymes in which some are known peptidases, others enzymes of different types.
Probab=97.99  E-value=1.8e-05  Score=54.52  Aligned_cols=37  Identities=8%  Similarity=-0.255  Sum_probs=34.1

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCC
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQ   37 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~G   37 (58)
                      ||+++++.+.|++.|.+.+..+++++.+++++|||++
T Consensus       109 ~Kg~~aa~l~a~~~l~~~~~~~~~~v~~~~~~dEE~~  145 (395)
T TIGR03526       109 QEGGIASMVYAGKIIKDLGLLDDYTLLVTGTVQEEDC  145 (395)
T ss_pred             cchhHHHHHHHHHHHHHcCCCCCceEEEEEecccccC
Confidence            8999999999999999998878889999999999953


No 59 
>PF04389 Peptidase_M28:  Peptidase family M28;  InterPro: IPR007484 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This domain is found in metallopeptidases belonging to the MEROPS peptidase family M28 (aminopeptidase Y, clan MH) []. They also contain a transferrin receptor-like dimerisation domain (IPR007365 from INTERPRO) and a protease-associated PA domain (IPR003137 from INTERPRO).; GO: 0008233 peptidase activity, 0006508 proteolysis; PDB: 3BXM_A 2C6P_A 1Z8L_C 3SJF_A 3BHX_A 2C6G_A 3D7F_A 2XEG_A 3BI1_A 2C6C_A ....
Probab=97.98  E-value=1.9e-05  Score=48.78  Aligned_cols=43  Identities=23%  Similarity=0.050  Sum_probs=36.2

Q ss_pred             HHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHH
Q psy3109           3 CVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAA   46 (58)
Q Consensus         3 ~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~   46 (58)
                      ++++.+||.+|.|.+.+.+|+|+|.++|+..||.|-. +...++
T Consensus        29 sGva~lLelAr~l~~~~~~~~~~i~fv~~~~EE~gl~-GS~~~~   71 (179)
T PF04389_consen   29 SGVAALLELARVLKELKPQPKRTIRFVFFDGEEQGLL-GSRAFV   71 (179)
T ss_dssp             HHHHHHHHHHHHHHHSTHSSSEEEEEEEESSGGGTSH-HHHHHH
T ss_pred             HHHHHHHHHHHHHHHhhcccCccEEEEEecccccCcc-chHHHH
Confidence            5789999999999998889999999999999998764 333333


No 60 
>PLN02693 IAA-amino acid hydrolase
Probab=97.87  E-value=3.3e-05  Score=54.80  Aligned_cols=36  Identities=11%  Similarity=0.079  Sum_probs=32.6

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCC
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVT   36 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~   36 (58)
                      ||+.+++.+.|++.|++.+..++.+|.++|++|||.
T Consensus       137 hkg~~A~~l~Aa~~L~~~~~~~~g~V~~if~pdEE~  172 (437)
T PLN02693        137 HDGHVAMLLGAAKILQEHRHHLQGTVVLIFQPAEEG  172 (437)
T ss_pred             chHHHHHHHHHHHHHHhCcccCCceEEEEEEEcccc
Confidence            799999999999999998766678999999999993


No 61 
>PRK08737 acetylornithine deacetylase; Provisional
Probab=97.86  E-value=2.6e-05  Score=53.82  Aligned_cols=36  Identities=11%  Similarity=0.045  Sum_probs=29.8

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLA   44 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~   44 (58)
                      ||+++++++.|++.       +..+|.++|++|||+|+. .|..
T Consensus       103 mKg~~aa~l~a~~~-------~~~~v~~~~~~dEE~g~~-~g~~  138 (364)
T PRK08737        103 IKGAAAALLAAANA-------GDGDAAFLFSSDEEANDP-RCVA  138 (364)
T ss_pred             chHHHHHHHHHHHc-------cCCCEEEEEEcccccCch-hhHH
Confidence            89999999999874       346899999999999885 3543


No 62 
>TIGR01891 amidohydrolases amidohydrolase. This model represents a subfamily of amidohydrolases which are a subset of those sequences detected by pfam01546. Included within this group are hydrolases of hippurate (N-benzylglycine), indoleacetic acid (IAA) N-conjugates of amino acids, N-acetyl-L-amino acids and aminobenzoylglutamate. These hydrolases are of the carboxypeptidase-type, most likely utilizing a zinc ion in the active site.
Probab=97.74  E-value=7.1e-05  Score=51.00  Aligned_cols=36  Identities=14%  Similarity=0.092  Sum_probs=31.8

Q ss_pred             hHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCC
Q psy3109           2 KCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQ   37 (58)
Q Consensus         2 K~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~G   37 (58)
                      |+++++++.|++.|++.+.+++++|.++|++|||+|
T Consensus        92 ~~~~a~~l~a~~~l~~~~~~~~~~i~~~~~~dEE~~  127 (363)
T TIGR01891        92 DLHTAILLGTAKLLKKLADLLEGTVRLIFQPAEEGG  127 (363)
T ss_pred             HHHHHHHHHHHHHHHhchhhCCceEEEEEeecCcCc
Confidence            577889999999999887777889999999999984


No 63 
>TIGR01893 aa-his-dipept aminoacyl-histidine dipeptidase.
Probab=97.70  E-value=5.2e-05  Score=53.82  Aligned_cols=35  Identities=9%  Similarity=-0.064  Sum_probs=28.8

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      ||+++++++++++.   .+ .+.++|.++|++|||+|+.
T Consensus       110 ~k~gva~~l~~~~~---~~-~~~~~i~~~~~~dEE~g~~  144 (477)
T TIGR01893       110 NGIGVAMGLAILED---NN-LKHPPLELLFTVDEETGMD  144 (477)
T ss_pred             ccHHHHHHHHHHhc---CC-CCCCCEEEEEEeccccCch
Confidence            79999999998775   33 3567999999999998764


No 64 
>PRK15026 aminoacyl-histidine dipeptidase; Provisional
Probab=97.59  E-value=0.00013  Score=52.72  Aligned_cols=35  Identities=9%  Similarity=-0.050  Sum_probs=28.3

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      ||++++++++++   .+.++ +..+|.++|++|||+|+.
T Consensus       116 ~k~gva~~l~~l---~~~~~-~~~~i~~l~t~dEE~G~~  150 (485)
T PRK15026        116 NGIGMASALAVL---ADENV-VHGPLEVLLTMTEEAGMD  150 (485)
T ss_pred             cHHHHHHHHHHH---HhCCC-CCCCEEEEEEcccccCcH
Confidence            799999988776   44554 467899999999999875


No 65 
>PLN02280 IAA-amino acid hydrolase
Probab=97.48  E-value=0.00022  Score=51.48  Aligned_cols=37  Identities=14%  Similarity=0.104  Sum_probs=33.3

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCC
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQ   37 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~G   37 (58)
                      ||+.+++++.|++.|.+.+..++.+|.++|.+|||+|
T Consensus       187 hd~~~A~~l~a~~~L~~~~~~~~g~V~~if~pdEE~g  223 (478)
T PLN02280        187 HDAHVAMLLGAAKILKSREHLLKGTVVLLFQPAEEAG  223 (478)
T ss_pred             CcHHHHHHHHHHHHHHhccccCCceEEEEeccccccc
Confidence            6889999999999999887777889999999999985


No 66 
>PRK09961 exoaminopeptidase; Provisional
Probab=96.93  E-value=0.0025  Score=44.34  Aligned_cols=43  Identities=7%  Similarity=-0.010  Sum_probs=34.8

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAAN   47 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~   47 (58)
                      +|.+++.+++++++|++.  .+..+++++|+.-||+|..  |...++
T Consensus       167 nR~g~~~lle~l~~l~~~--~~~~~v~~~~tvqEEvG~r--Ga~~aa  209 (344)
T PRK09961        167 DRLGCYLLVTLLRELHDA--ELPAEVWLVASSSEEVGLR--GGQTAT  209 (344)
T ss_pred             hhHhHHHHHHHHHHhhhc--CCCceEEEEEEcccccchH--HHHHHH
Confidence            478999999999999765  4678999999999999875  444433


No 67 
>KOG2195|consensus
Probab=96.58  E-value=0.0078  Score=45.98  Aligned_cols=39  Identities=13%  Similarity=-0.020  Sum_probs=30.4

Q ss_pred             HHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHH
Q psy3109           8 YLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAAN   47 (58)
Q Consensus         8 ~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~   47 (58)
                      +-.++..+++.|++|+|||+++.|..||-|.. |..-++-
T Consensus       381 i~~~~~~~~k~gwrP~RtI~F~sWdAeEfGli-GStE~~E  419 (702)
T KOG2195|consen  381 IARALSKLKKRGWRPRRTILFASWDAEEFGLL-GSTEWAE  419 (702)
T ss_pred             HHHHHHHHHHcCCCccceEEEEEccchhcccc-ccHHHHH
Confidence            44566677899999999999999999998775 4444443


No 68 
>TIGR03107 glu_aminopep glutamyl aminopeptidase. This model represents the M42.001 clade within MEROPS family M42. M42 includes glutamyl aminopeptidase as in the present model, deblocking aminopeptidases as from Pyrococcus horikoshii and related species, and endo-1,4-beta-glucanase (cellulase M) as from Clostridium thermocellum. The current family includes
Probab=96.56  E-value=0.0058  Score=42.92  Aligned_cols=37  Identities=8%  Similarity=-0.031  Sum_probs=32.5

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      +|.+++.+++++++|++.  .++.+++++|+.-||+|..
T Consensus       179 dR~g~a~l~e~l~~l~~~--~~~~~l~~~~tvqEEvG~r  215 (350)
T TIGR03107       179 NRYGVLMILELLESLKDQ--ELPNTLIAGANVQEEVGLR  215 (350)
T ss_pred             cHHHHHHHHHHHHHhhhc--CCCceEEEEEEChhhcCch
Confidence            478999999999999875  4778999999999999874


No 69 
>PF05343 Peptidase_M42:  M42 glutamyl aminopeptidase;  InterPro: IPR008007 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M42 (glutamyl aminopeptidase family, clan MH). For members of this family and family M28 the predicted metal ligands occur in the same order in the sequence: H, D, E, D/E, H; and the active site residues occur in the motifs HXD and EE. ; PDB: 2WYR_C 2CF4_A 1VHO_A 3ISX_A 3KL9_G 1YLO_F 3CPX_C 1VHE_A 2GRE_F 1XFO_A ....
Probab=96.42  E-value=0.015  Score=39.88  Aligned_cols=41  Identities=15%  Similarity=0.089  Sum_probs=33.4

Q ss_pred             hHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHH
Q psy3109           2 KCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAA   46 (58)
Q Consensus         2 K~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~   46 (58)
                      +.+++.++++++.|++.+.  +.+++++|+.-||+|..  |...+
T Consensus       136 R~g~~~lle~l~~l~~~~~--~~~v~~v~tvqEEvG~r--GA~~a  176 (292)
T PF05343_consen  136 RAGCAVLLELLRELKEKEL--DVDVYFVFTVQEEVGLR--GAKTA  176 (292)
T ss_dssp             HHHHHHHHHHHHHHTTSS---SSEEEEEEESSCTTTSH--HHHHH
T ss_pred             hhHHHHHHHHHHHHhhcCC--CceEEEEEEeeeeecCc--ceeec
Confidence            6789999999999998753  38999999999999874  44433


No 70 
>COG1363 FrvX Cellulase M and related proteins [Carbohydrate transport and metabolism]
Probab=96.20  E-value=0.0076  Score=42.81  Aligned_cols=37  Identities=8%  Similarity=-0.008  Sum_probs=33.4

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      +|.+++++++++++| + +..++.+++++|+.-||+|-.
T Consensus       181 dR~gva~lle~lk~l-~-~~~~~~~vy~v~tvqEEVGlr  217 (355)
T COG1363         181 DRAGVAALLELLKEL-K-GIELPADVYFVASVQEEVGLR  217 (355)
T ss_pred             chHhHHHHHHHHHHh-c-cCCCCceEEEEEecchhhccc
Confidence            488999999999999 5 778899999999999999864


No 71 
>COG2234 Iap Predicted aminopeptidases [General function prediction only]
Probab=96.12  E-value=0.012  Score=40.93  Aligned_cols=35  Identities=17%  Similarity=-0.028  Sum_probs=31.1

Q ss_pred             HHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109           3 CVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         3 ~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      ++++..+|..|.|+...  |+|+|.++++.-||.|..
T Consensus       231 sGva~llEiAr~l~~~~--p~~~v~f~~~~aEE~Gl~  265 (435)
T COG2234         231 SGVAALLELARVLKGNP--PKRTVRFVAFGAEESGLL  265 (435)
T ss_pred             HHHHHHHHHHHHHhcCC--CCceEEEEEecchhhccc
Confidence            57889999999998875  999999999999998775


No 72 
>PRK09864 putative peptidase; Provisional
Probab=96.10  E-value=0.013  Score=41.40  Aligned_cols=35  Identities=6%  Similarity=-0.262  Sum_probs=31.0

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      +|.+++.++++++.|++    |..+++++|+.-||+|-.
T Consensus       176 nR~g~~~lle~l~~l~~----~~~~vy~v~TvQEEvGlr  210 (356)
T PRK09864        176 NRIGCAMMAELLQTVNN----PEITLYGVGSVEEEVGLR  210 (356)
T ss_pred             cHHHHHHHHHHHHHhhc----CCCeEEEEEEcchhcchH
Confidence            47899999999999965    778999999999999864


No 73 
>COG2195 PepD Di- and tripeptidases [Amino acid transport and metabolism]
Probab=95.74  E-value=0.014  Score=42.12  Aligned_cols=46  Identities=13%  Similarity=0.065  Sum_probs=36.4

Q ss_pred             ChHHHHHHHHHHHHHHHcC-CCCCceEEEEeeCCCCCCChhhhHHHHHH
Q psy3109           1 MKCVGIQYLETIRRYIAEG-KKFKRTIHVSFVPGPVTQNETCGLAAANS   48 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g-~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s   48 (58)
                      .|++++.++.|++.+.+.- -.|-.+|.+.|++|||.|+.  |...+..
T Consensus       145 ~kAGia~i~~al~~~~~~~~~i~h~~i~~g~s~~Ee~g~r--g~~~~~~  191 (414)
T COG2195         145 DKAGIAEIMTALSVLREKHPEIPHGGIRGGFSPDEEIGGR--GAANKDV  191 (414)
T ss_pred             chhHHHHHHHHHHHHhhcCccccccCeEEEecchHHhhhh--hhhhccH
Confidence            4899999999999999663 34668999999999999875  4444433


No 74 
>COG4187 RocB Arginine degradation protein (predicted deacylase) [Amino acid transport and metabolism]
Probab=95.70  E-value=0.026  Score=42.04  Aligned_cols=38  Identities=16%  Similarity=0.141  Sum_probs=33.4

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      ||++++..+..++++.+.+ ..+-+|-++-+||||+.+.
T Consensus       141 MKsGlav~la~L~~fa~~~-~~~GNlLf~a~pdEE~~s~  178 (553)
T COG4187         141 MKSGLAVHLACLEEFAART-DRQGNLLFMAVPDEEVESR  178 (553)
T ss_pred             hhhhhHHHHHHHHHHhhCC-CCCCcEEEEeccchhhhcc
Confidence            9999999999999999984 4556899999999998764


No 75 
>KOG2276|consensus
Probab=93.67  E-value=0.09  Score=38.74  Aligned_cols=39  Identities=10%  Similarity=0.105  Sum_probs=36.1

Q ss_pred             ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109           1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      +|+=+++.++|++++++.|...+-+|.++|-.-||.||.
T Consensus       133 dkGPv~~wi~av~a~~~~g~~lpvnv~f~~EgmEEsgS~  171 (473)
T KOG2276|consen  133 DKGPVLSWIHAVKALQQLGIDLPVNVVFVFEGMEESGSE  171 (473)
T ss_pred             CCccchHHHHHHHHHHHhCccccceEEEEEEechhccCc
Confidence            477789999999999999998889999999999999997


No 76 
>PRK02256 putative aminopeptidase 1; Provisional
Probab=93.64  E-value=0.14  Score=37.39  Aligned_cols=34  Identities=18%  Similarity=0.130  Sum_probs=25.6

Q ss_pred             hHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109           2 KCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         2 K~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      +.++.++++|++.+.    .+..+++++++.-||+|+.
T Consensus       262 r~~~~~~leal~~~~----~~~~~~~~~~~dqEEVGs~  295 (462)
T PRK02256        262 RVCAYTSLEALLELE----NPEKTAVVLLVDKEEIGSE  295 (462)
T ss_pred             HHHHHHHHHHHHhcc----cCCCeEEEEEEcccccCCc
Confidence            456777777776553    3556888999999999986


No 77 
>COG1473 AbgB Metal-dependent amidase/aminoacylase/carboxypeptidase [General function prediction only]
Probab=90.27  E-value=0.39  Score=34.44  Aligned_cols=46  Identities=11%  Similarity=0.079  Sum_probs=34.1

Q ss_pred             HHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHHHHHH
Q psy3109           5 GIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANSVLYL   52 (58)
Q Consensus         5 ~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s~~~~   52 (58)
                      .++.+-|.+.|.+....++-+|.++|-|-||.++  ++..++..-++.
T Consensus       108 ta~lLgaA~~L~~~~~~~~Gtv~~ifQPAEE~~~--Ga~~mi~~G~~~  153 (392)
T COG1473         108 TAILLGAALALAEHKDNLPGTVRLIFQPAEEGGG--GAKAMIEDGVFD  153 (392)
T ss_pred             HHHHHHHHHHHHhhhhhCCcEEEEEecccccccc--cHHHHHhcCCcc
Confidence            4667888888888755678899999999999874  455665554443


No 78 
>PF05450 Nicastrin:  Nicastrin;  InterPro: IPR008710 Nicastrin and presenilin are two major components of the gamma-secretase complex, which executes the intramembrane proteolysis of type I integral membrane proteins such as the amyloid precursor protein (APP) and Notch. Nicastrin is synthesised in fibroblasts and neurons as an endoglycosidase-H-sensitive glycosylated precursor protein (immature nicastrin) and is then modified by complex glycosylation in the Golgi apparatus and by sialylation in the trans-Golgi network (mature nicastrin) [].; GO: 0016485 protein processing, 0016021 integral to membrane
Probab=89.69  E-value=0.87  Score=30.50  Aligned_cols=34  Identities=9%  Similarity=0.127  Sum_probs=27.0

Q ss_pred             HHHHHHHHHHHHHHHc---CCCCCceEEEEeeCCCCC
Q psy3109           3 CVGIQYLETIRRYIAE---GKKFKRTIHVSFVPGPVT   36 (58)
Q Consensus         3 ~~~~~~leAl~~L~~~---g~~p~r~I~l~f~~DEE~   36 (58)
                      ++++..|+|++.|.+.   --.++|+|.++|+..|-.
T Consensus        27 sglvaLLaaA~aL~~~~~~~~~~~knV~F~~F~GEs~   63 (234)
T PF05450_consen   27 SGLVALLAAAEALSKLLPDSSNLNKNVLFAFFNGESF   63 (234)
T ss_pred             HHHHHHHHHHHHHHHhhhccccccCcEEEEEecCccc
Confidence            4778888888888765   235789999999999974


No 79 
>PF02127 Peptidase_M18:  Aminopeptidase I zinc metalloprotease (M18);  InterPro: IPR001948 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M18, (clan MH). The proteins have two catalytic zinc ions at the active site, bound by His/Asp, Asp, Glu, Asp/Glu and His. The catalysed reaction involves the release of an N-terminal aminoacid, usually neutral or hydrophobic, from a polypeptide []. The type example is aminopeptidase I from Saccharomyces cerevisiae (Baker's yeast), the sequence of which has been deduced, and the mature protein shown to consist of 469 amino acids []. A 45-residue presequence contains both positively- and negatively-charged and hydrophobic residues, which could be arranged in an N-terminal amphiphilic alpha-helix []. The presequence differs from signal sequences that direct proteins across bacterial plasma membranes and endoplasmic reticulum or into mitochondria. It is unclear how this unique presequence targets aminopeptidase I to yeast vacuoles, and how this sorting utilises classical protein secretory pathways [].; GO: 0004177 aminopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 1Y7E_A 2GLJ_R 4DYO_A 2IJZ_B 3VAT_A 3VAR_A 2GLF_B.
Probab=88.96  E-value=0.68  Score=33.64  Aligned_cols=35  Identities=11%  Similarity=0.002  Sum_probs=23.0

Q ss_pred             HHHHHHHHHHHHHcC---CCCC-ceEEEEeeCCCCCCCh
Q psy3109           5 GIQYLETIRRYIAEG---KKFK-RTIHVSFVPGPVTQNE   39 (58)
Q Consensus         5 ~~~~leAl~~L~~~g---~~p~-r~I~l~f~~DEE~Gg~   39 (58)
                      +++.+.++++|.+..   ..|. .+-.++++-.||+||.
T Consensus       237 l~s~~a~l~Al~~~~~~~~~~~~~~~v~~~fD~EEiGS~  275 (432)
T PF02127_consen  237 LASCYAALEALIDSSNDSLEPEDGTNVVVLFDNEEIGSE  275 (432)
T ss_dssp             HHHHHHHHHHHHHHTSSCCCCSSSEEEEEEESSGGGTST
T ss_pred             HHHHHHHHHHHHhhhcccccccCceEEEEEEcccccCCC
Confidence            456677777777663   2466 4445556656999996


No 80 
>PRK02813 putative aminopeptidase 2; Provisional
Probab=88.77  E-value=0.58  Score=33.81  Aligned_cols=33  Identities=9%  Similarity=-0.112  Sum_probs=23.7

Q ss_pred             hHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109           2 KCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         2 K~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      +.++...++|++.+..     ..+++++++--||+|+.
T Consensus       236 r~~~~~~l~al~~~~~-----~~~~~~~~~d~EEVGs~  268 (428)
T PRK02813        236 LSSCHAGLEALLAAAS-----DATNVLAAFDHEEVGSA  268 (428)
T ss_pred             HHHHHHHHHHHHhcCC-----CCeEEEEEEecCccCCC
Confidence            3456666666665432     56888999989999986


No 81 
>KOG2194|consensus
Probab=85.47  E-value=1.1  Score=35.24  Aligned_cols=44  Identities=14%  Similarity=0.012  Sum_probs=34.2

Q ss_pred             HHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHH
Q psy3109           3 CVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAAN   47 (58)
Q Consensus         3 ~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~   47 (58)
                      .+++.+||++|.+.+.....+++|.++|+.-||.+-. +..+++.
T Consensus       166 ~~va~mLe~lRv~s~~~~~l~~~vVFLfNgaEE~~L~-gsH~FIt  209 (834)
T KOG2194|consen  166 SGVASMLEALRVLSKSDKLLTHSVVFLFNGAEESGLL-GSHAFIT  209 (834)
T ss_pred             hHHHHHHHHHHHhhcCCCcccccEEEEecCcccchhh-hccccee
Confidence            4688999999999988666689999999999996543 3444443


No 82 
>PTZ00371 aspartyl aminopeptidase; Provisional
Probab=85.08  E-value=1.2  Score=32.54  Aligned_cols=38  Identities=3%  Similarity=-0.061  Sum_probs=21.7

Q ss_pred             hHHHHHHHHHHHHHHHcCCCCCceEEEEe-eCCCCCCCh
Q psy3109           2 KCVGIQYLETIRRYIAEGKKFKRTIHVSF-VPGPVTQNE   39 (58)
Q Consensus         2 K~~~~~~leAl~~L~~~g~~p~r~I~l~f-~~DEE~Gg~   39 (58)
                      +.++...++|+..+...-......+.+++ +--||+||.
T Consensus       253 r~~~~~~l~al~~~~~~~~~~~~~~~v~~~~d~EEVGs~  291 (465)
T PTZ00371        253 LGSSFCAFKALTEAVESLGENSSNIRMVCLFDHEEVGSS  291 (465)
T ss_pred             HHHHHHHHHHHHhccccccCCCCceEEEEEECCcCCCCC
Confidence            45666677777655421001134455555 666999986


No 83 
>PF06309 Torsin:  Torsin;  InterPro: IPR010448 This family consists of several eukaryotic torsin proteins. Torsion dystonia is an autosomal dominant movement disorder characterised by involuntary, repetitive muscle contractions and twisted postures. The most severe early-onset form of dystonia has been linked to mutations in the human DYT1 (TOR1A) gene encoding a protein termed torsinA. While causative genetic alterations have been identified, the function of torsin proteins and the molecular mechanism underlying dystonia remain unknown. Phylogenetic analysis of the torsin protein family indicates these proteins share distant sequence similarity with the large and diverse family of AAA ATPase, central region containing proteins () proteins. It has been suggested that torsins play a role in effectively managing protein folding and that possible breakdown in a neuroprotective mechanism that is, in part, mediated by torsins may be responsible for the neuronal dysfunction associated with dystonia [].; GO: 0005524 ATP binding, 0051085 chaperone mediated protein folding requiring cofactor
Probab=72.58  E-value=20  Score=22.27  Aligned_cols=49  Identities=18%  Similarity=0.154  Sum_probs=36.6

Q ss_pred             HHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHHHHHHhhhcc
Q psy3109           7 QYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANSVLYLLGLMS   57 (58)
Q Consensus         7 ~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s~~~~~~~~~   57 (58)
                      .++.|++.-.+.. .|++++.++|-..-=+|=. +-..+++..+|..|+-|
T Consensus        36 ~v~~ai~~~l~~~-~p~KpLVlSfHG~tGtGKn-~v~~liA~~ly~~G~~S   84 (127)
T PF06309_consen   36 VVVNAIKGHLANP-NPRKPLVLSFHGWTGTGKN-FVSRLIAEHLYKSGMKS   84 (127)
T ss_pred             HHHHHHHHHHcCC-CCCCCEEEEeecCCCCcHH-HHHHHHHHHHHhcccCC
Confidence            4567777777664 7999999999877666543 56778888888887654


No 84 
>COG1362 LAP4 Aspartyl aminopeptidase [Amino acid transport and metabolism]
Probab=71.98  E-value=5.1  Score=29.62  Aligned_cols=33  Identities=9%  Similarity=0.031  Sum_probs=22.9

Q ss_pred             HHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109           6 IQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE   39 (58)
Q Consensus         6 ~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~   39 (58)
                      ++.+.++++|... ..+..+..++++--||+||.
T Consensus       244 ~~~~a~m~AL~~~-~~~~~~~v~~~fD~EEIGS~  276 (437)
T COG1362         244 ICCHAGMEALLAA-ANSDKTCVLALFDHEEIGSL  276 (437)
T ss_pred             HHHHHHHHHHHhc-cCCCCceEEEEechhhcccc
Confidence            4556667777665 34556667777777999996


No 85 
>PF06877 RraB:  Regulator of ribonuclease activity B;  InterPro: IPR009671 This entry occurs in several hypothetical bacterial proteins of around 120 residues in length. The function of these proteins is unknown. The protein structure has been determined for one member of this group, the hypothetical protein VCO424 from Vibrio cholerae; it has an alpha+beta sandwich fold.; PDB: 1NXI_A.
Probab=61.78  E-value=23  Score=20.06  Aligned_cols=29  Identities=17%  Similarity=0.185  Sum_probs=18.5

Q ss_pred             HHHHHHHHHHHcCCCC--CceEEEEeeCCCC
Q psy3109           7 QYLETIRRYIAEGKKF--KRTIHVSFVPGPV   35 (58)
Q Consensus         7 ~~leAl~~L~~~g~~p--~r~I~l~f~~DEE   35 (58)
                      +--..++.|.+.|..|  +|+|...|..+.|
T Consensus         5 ~n~~vl~~L~~~Gddl~~~r~ieh~~~f~~~   35 (104)
T PF06877_consen    5 ENREVLEALEEDGDDLSKPRPIEHWFYFEDE   35 (104)
T ss_dssp             HHHHHHHHHHHHT--TTS-EEEEEEEEES-H
T ss_pred             HHHHHHHHHHhcCCCCCCCeEEEEEEEeCCH
Confidence            3446788999999766  4888877665544


No 86 
>COG4882 Predicted aminopeptidase, Iap family [General function prediction only]
Probab=61.59  E-value=9.1  Score=28.43  Aligned_cols=45  Identities=18%  Similarity=-0.034  Sum_probs=31.3

Q ss_pred             HHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHH--HHHHHHH
Q psy3109           4 VGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAA--ANSVLYL   52 (58)
Q Consensus         4 ~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~--~~s~~~~   52 (58)
                      ++.|+++++++|...+.    .+.++.+.-||.|++....-+  .||+.|+
T Consensus       211 g~~~AV~~~~~lr~~~~----~~~lv~FtAEE~g~p~~~sfyWa~GSr~~l  257 (486)
T COG4882         211 GVAQAVETAGRLRGRGL----AAGLVVFTAEEHGMPGMASFYWAAGSRGLL  257 (486)
T ss_pred             hHHHHHHHHHHHhhcCc----ceeEEEEeccccCCCCCcceeecccchHHH
Confidence            67889999999988874    357777788998887433222  3455443


No 87 
>PF12000 Glyco_trans_4_3:  Gkycosyl transferase family 4 group;  InterPro: IPR022623  This presumed domain is functionally uncharacterised and found in bacteria. This region is about 170 amino acids in length and is found N-terminal to PF00534 from PFAM. There is a single completely conserved residue G that may be functionally important. 
Probab=56.75  E-value=17  Score=23.34  Aligned_cols=22  Identities=14%  Similarity=0.102  Sum_probs=16.5

Q ss_pred             HHHHHHHHHHHHcCCCCCceEEEE
Q psy3109           6 IQYLETIRRYIAEGKKFKRTIHVS   29 (58)
Q Consensus         6 ~~~leAl~~L~~~g~~p~r~I~l~   29 (58)
                      -+...|++.|+++|+.|  +|++.
T Consensus        51 ~av~~a~~~L~~~Gf~P--DvI~~   72 (171)
T PF12000_consen   51 QAVARAARQLRAQGFVP--DVIIA   72 (171)
T ss_pred             HHHHHHHHHHHHcCCCC--CEEEE
Confidence            35678899999999999  44443


No 88 
>PF14136 DUF4303:  Domain of unknown function (DUF4303)
Probab=50.53  E-value=47  Score=20.02  Aligned_cols=20  Identities=15%  Similarity=0.044  Sum_probs=15.4

Q ss_pred             HHHHHHHHHHHHHHcCCCCC
Q psy3109           4 VGIQYLETIRRYIAEGKKFK   23 (58)
Q Consensus         4 ~~~~~leAl~~L~~~g~~p~   23 (58)
                      ..-+++.|++.|.+.|+.-.
T Consensus       104 ~~~~~~~aL~~l~~eg~f~~  123 (155)
T PF14136_consen  104 LLEAMVEALKQLDQEGFFGN  123 (155)
T ss_pred             HHHHHHHHHHHHHHhcccCC
Confidence            45678899999999986433


No 89 
>PF11181 YflT:  Heat induced stress protein YflT
Probab=50.38  E-value=28  Score=20.04  Aligned_cols=26  Identities=23%  Similarity=0.129  Sum_probs=19.9

Q ss_pred             HHHHHHHHHHHcCCCCCceEEEEeeCC
Q psy3109           7 QYLETIRRYIAEGKKFKRTIHVSFVPG   33 (58)
Q Consensus         7 ~~leAl~~L~~~g~~p~r~I~l~f~~D   33 (58)
                      .++.+++.|++.|+.. .+|+++--.+
T Consensus        11 E~~~~I~~L~~~Gy~~-ddI~Vva~d~   36 (103)
T PF11181_consen   11 EALSAIEELKAQGYSE-DDIYVVAKDK   36 (103)
T ss_pred             HHHHHHHHHHHcCCCc-ccEEEEEcCc
Confidence            5788999999999854 6787766433


No 90 
>KOG0615|consensus
Probab=48.79  E-value=37  Score=25.49  Aligned_cols=33  Identities=18%  Similarity=0.202  Sum_probs=25.7

Q ss_pred             hHHHHHHHHHHHHHHHcCCCCCceE---EEEeeCCCC
Q psy3109           2 KCVGIQYLETIRRYIAEGKKFKRTI---HVSFVPGPV   35 (58)
Q Consensus         2 K~~~~~~leAl~~L~~~g~~p~r~I---~l~f~~DEE   35 (58)
                      |...-|++.|++.|++.|+.. |+|   .+++..|+|
T Consensus       279 K~~f~Qll~avkYLH~~GI~H-RDiKPeNILl~~~~e  314 (475)
T KOG0615|consen  279 KLLFKQLLTAVKYLHSQGIIH-RDIKPENILLSNDAE  314 (475)
T ss_pred             HHHHHHHHHHHHHHHHcCccc-ccCCcceEEeccCCc
Confidence            666789999999999999755 344   478888854


No 91 
>PF13812 PPR_3:  Pentatricopeptide repeat domain
Probab=47.29  E-value=26  Score=15.12  Aligned_cols=16  Identities=19%  Similarity=0.144  Sum_probs=11.7

Q ss_pred             HHHHHHHHHHHcCCCC
Q psy3109           7 QYLETIRRYIAEGKKF   22 (58)
Q Consensus         7 ~~leAl~~L~~~g~~p   22 (58)
                      .++..++.+.+.|.+|
T Consensus        19 ~a~~~~~~M~~~gv~P   34 (34)
T PF13812_consen   19 AALQLFDEMKEQGVKP   34 (34)
T ss_pred             HHHHHHHHHHHhCCCC
Confidence            3567777788888776


No 92 
>COG4821 Uncharacterized protein containing SIS (Sugar ISomerase) phosphosugar binding domain [General function prediction only]
Probab=43.64  E-value=39  Score=23.22  Aligned_cols=25  Identities=16%  Similarity=0.206  Sum_probs=18.9

Q ss_pred             HHHHHHHHHHHcCCCCCceEEEEeeCC
Q psy3109           7 QYLETIRRYIAEGKKFKRTIHVSFVPG   33 (58)
Q Consensus         7 ~~leAl~~L~~~g~~p~r~I~l~f~~D   33 (58)
                      ...||++++.+.|+.|  ||+++-+-|
T Consensus       199 ~faeai~~mv~~g~~p--Pvf~S~Nid  223 (243)
T COG4821         199 TFAEAIELMVEKGYTP--PVFLSANID  223 (243)
T ss_pred             HHHHHHHHHHhCCCCC--CeeeecCCC
Confidence            4578999999999999  466655443


No 93 
>PF04055 Radical_SAM:  Radical SAM superfamily;  InterPro: IPR007197 Radical SAM proteins catalyze diverse reactions, including unusual methylations, isomerization, sulphur insertion, ring formation, anaerobic oxidation and protein radical formation. Evidence exists that these proteins generate a radical species by reductive cleavage of S:-adenosylmethionine (SAM) through an unusual Fe-S centre [, ].; GO: 0003824 catalytic activity, 0051536 iron-sulfur cluster binding; PDB: 2A5H_D 3T7V_A 3C8F_A 3CB8_A 2FB2_A 2FB3_A 3CIX_A 3IIX_A 3IIZ_A 3CIW_A ....
Probab=43.31  E-value=52  Score=18.62  Aligned_cols=30  Identities=23%  Similarity=0.209  Sum_probs=21.2

Q ss_pred             HHHHHHHHHHHHHcCCCCCceEEEEeeCCCC
Q psy3109           5 GIQYLETIRRYIAEGKKFKRTIHVSFVPGPV   35 (58)
Q Consensus         5 ~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE   35 (58)
                      .-..+++++.+.+.|+.+ ....+...++|.
T Consensus       126 ~~~~~~~l~~l~~~g~~~-~~~~i~~~~~~~  155 (166)
T PF04055_consen  126 FERVLEALERLKEAGIPR-VIIFIVGLPGEN  155 (166)
T ss_dssp             HHHHHHHHHHHHHTTSET-EEEEEEEBTTTS
T ss_pred             HHHHHHHHHHHHHcCCCc-EEEEEEEeCCCC
Confidence            345689999999999755 555566666554


No 94 
>PF04114 Gaa1:  Gaa1-like, GPI transamidase component ;  InterPro: IPR007246 GPI (glycosyl phosphatidyl inositol) transamidase is a multiprotein complex required for a terminal step of adding the glycosylphosphatidylinositol (GPI) anchor attachment onto proteins. Gpi16, Gpi8 and Gaa1 form a sub-complex of the GPI transamidase.; GO: 0016021 integral to membrane, 0042765 GPI-anchor transamidase complex
Probab=42.38  E-value=33  Score=25.43  Aligned_cols=33  Identities=12%  Similarity=0.014  Sum_probs=24.4

Q ss_pred             HHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCC
Q psy3109           3 CVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVT   36 (58)
Q Consensus         3 ~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~   36 (58)
                      .+++..+.-++.+++..+. .+||.++|+.||..
T Consensus        37 ~~v~l~lal~~~~~~~~~w-sKDii~l~~~~~~~   69 (504)
T PF04114_consen   37 GGVALALALARYFRRQSYW-SKDIIFLFTDDELA   69 (504)
T ss_pred             hhHHHHHHHHHHhhhchhh-hccEEEEecCCcch
Confidence            4566777777777777654 78999999876653


No 95 
>PF14681 UPRTase:  Uracil phosphoribosyltransferase; PDB: 1V9S_B 1UPF_A 1UPU_D 1JLR_B 1BD4_A 1BD3_C 1JLS_D 1XTV_C 1XTU_H 3G6W_C ....
Probab=39.82  E-value=19  Score=23.34  Aligned_cols=30  Identities=20%  Similarity=0.174  Sum_probs=20.6

Q ss_pred             HHHHHHHHHHHHcCCCCCceEEEEeeCCCC
Q psy3109           6 IQYLETIRRYIAEGKKFKRTIHVSFVPGPV   35 (58)
Q Consensus         6 ~~~leAl~~L~~~g~~p~r~I~l~f~~DEE   35 (58)
                      -.++.|++.|++.|..+++-+.+.+.+-+|
T Consensus       135 ~s~~~ai~~L~~~G~~~~~I~~v~~ias~~  164 (207)
T PF14681_consen  135 GSAIAAIEILKEHGVPEENIIIVSVIASPE  164 (207)
T ss_dssp             HHHHHHHHHHHHTTG-GGEEEEEEEEEEHH
T ss_pred             hhHHHHHHHHHHcCCCcceEEEEEEEecHH
Confidence            356788999999998777666666665444


No 96 
>PF05237 MoeZ_MoeB:  MoeZ/MoeB domain;  InterPro: IPR007901 This putative domain is found in the MoeZ protein and the MoeB protein. The domain has two CXXC motifs that are only partly conserved. MoeZ is necessary for the synthesis of pyridine-2,6-bis(thiocarboxylic acid), a small secreted metabolite that has a high affinity for transition metals, increases iron uptake efficiency by 20% in Pseudomonas stutzeri, has the ability to reduce both soluble and mineral forms of iron, and has antimicrobial activity towards several species of bacteria. MoeB is the molybdopterin synthase activating enzyme in the molybdopterin cofactor biosynthesis pathway. Both these enzymes are members of a superfamily consisting of related but structurally distinct proteins that are members of pathways involved in the transfer of sulphur-containing moieties to metabolites [] and both also contain the UBA/THIF-type NAD/FAD binding fold (IPR000594 from INTERPRO). ; PDB: 1JWA_B 1JW9_B 1JWB_B 1ZKM_D 1ZUD_3 1ZFN_D.
Probab=39.51  E-value=46  Score=18.43  Aligned_cols=21  Identities=19%  Similarity=0.240  Sum_probs=10.2

Q ss_pred             HHHHHHHHHHHHHHcCCCCCc
Q psy3109           4 VGIQYLETIRRYIAEGKKFKR   24 (58)
Q Consensus         4 ~~~~~leAl~~L~~~g~~p~r   24 (58)
                      +.+|+.||++.|...+..+..
T Consensus        34 gslqA~eaik~l~g~~~~l~~   54 (84)
T PF05237_consen   34 GSLQANEAIKLLLGIGEPLSG   54 (84)
T ss_dssp             HHHHHHHHHHHHCT-S---BT
T ss_pred             HHHHHHHHHHHHHhcCCchhh
Confidence            345666777766655433333


No 97 
>PF08415 NRPS:  Nonribosomal peptide synthase;  InterPro: IPR013624 This domain is found in bacterial non-ribosomal peptide synthetases (NRPS). NRPS are megaenzymes organised as iterative modules, one for each amino acid to be built into the peptide product []. NRPS modules are involved in epothilone biosynthesis (EpoB), myxothiazol biosynthesis (MtaC and MtaD), and other functions []. The NRPS domain tends to be found together with the condensation domain (IPR001242 from INTERPRO) and the phosphopantetheine binding domain (IPR006163 from INTERPRO). 
Probab=38.96  E-value=48  Score=17.15  Aligned_cols=27  Identities=19%  Similarity=0.260  Sum_probs=16.0

Q ss_pred             HHHHHHHHHHHHcCCCCCceEEEEeeC
Q psy3109           6 IQYLETIRRYIAEGKKFKRTIHVSFVP   32 (58)
Q Consensus         6 ~~~leAl~~L~~~g~~p~r~I~l~f~~   32 (58)
                      +..++.+|.|.+.+.......-++|++
T Consensus         4 ~sGv~vlRel~r~~~~~~~~~PVVFTS   30 (58)
T PF08415_consen    4 FSGVEVLRELARRGGGRAAVMPVVFTS   30 (58)
T ss_pred             ccHHHHHHHHHHhcCCCCCcCCEEEeC
Confidence            346788899988733333334455554


No 98 
>COG0035 Upp Uracil phosphoribosyltransferase [Nucleotide transport and metabolism]
Probab=38.39  E-value=42  Score=22.57  Aligned_cols=30  Identities=27%  Similarity=0.234  Sum_probs=23.9

Q ss_pred             HHHHHHHHHHHHHcCCCCCceEEEEeeCCCC
Q psy3109           5 GIQYLETIRRYIAEGKKFKRTIHVSFVPGPV   35 (58)
Q Consensus         5 ~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE   35 (58)
                      ...++.|++.|++.| .|++-..+.+.+-+|
T Consensus       137 G~s~i~ai~~L~~~G-~~~~I~~v~~vAape  166 (210)
T COG0035         137 GGSAIAAIDLLKKRG-GPKNIKVVSLVAAPE  166 (210)
T ss_pred             cHhHHHHHHHHHHhC-CCceEEEEEEEecHH
Confidence            456788999999998 787777777777766


No 99 
>cd06287 PBP1_LacI_like_8 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold.  As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor.
Probab=38.00  E-value=69  Score=20.57  Aligned_cols=30  Identities=10%  Similarity=0.052  Sum_probs=22.3

Q ss_pred             HHHHHHHHHHHHcCCCCCceEEEEeeCCCC
Q psy3109           6 IQYLETIRRYIAEGKKFKRTIHVSFVPGPV   35 (58)
Q Consensus         6 ~~~leAl~~L~~~g~~p~r~I~l~f~~DEE   35 (58)
                      ..++.++++|++.|.+.+.+|.++-+.|.+
T Consensus       188 ~~A~gvl~al~~~gl~vP~dvsvig~~d~~  217 (269)
T cd06287         188 AFAVGAVRAATELGRAVPDQLRVVTRYDGL  217 (269)
T ss_pred             HHHHHHHHHHHHcCCCCCCceEEEeccCch
Confidence            356678888999999888889887534443


No 100
>COG4703 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=37.76  E-value=80  Score=18.01  Aligned_cols=35  Identities=14%  Similarity=0.160  Sum_probs=24.7

Q ss_pred             hHHHHHHHHHHHHHHHcCCCCCce----------EEEEeeCCCCC
Q psy3109           2 KCVGIQYLETIRRYIAEGKKFKRT----------IHVSFVPGPVT   36 (58)
Q Consensus         2 K~~~~~~leAl~~L~~~g~~p~r~----------I~l~f~~DEE~   36 (58)
                      ++.++.++.-++++.+.|-.++++          ..+.|++|+|+
T Consensus         1 ~ShLl~IinRL~~m~edggn~k~~rnFEr~G~vv~eV~ys~~~e~   45 (74)
T COG4703           1 MSHLLGIINRLEAMQEDGGNEKRQRNFERNGEVVCEVKYSEDNET   45 (74)
T ss_pred             CcHHHHHHHHHHHHHhccCCchhhhhhhcCCEEEEEEEecCCCce
Confidence            356778888888888866666651          23778888874


No 101
>PF13377 Peripla_BP_3:  Periplasmic binding protein-like domain; PDB: 3K9C_B 3BIL_B 3JVD_B 1ZAY_A 1VPW_A 1DBQ_A 2PUA_A 1QQA_A 1PNR_A 1JHZ_A ....
Probab=37.28  E-value=64  Score=18.59  Aligned_cols=28  Identities=18%  Similarity=0.119  Sum_probs=21.1

Q ss_pred             HHHHHHHHHHHHcCCCCCceEEEEeeCC
Q psy3109           6 IQYLETIRRYIAEGKKFKRTIHVSFVPG   33 (58)
Q Consensus         6 ~~~leAl~~L~~~g~~p~r~I~l~f~~D   33 (58)
                      ..++.+++.|.+.|...++++.++...|
T Consensus        78 ~~a~~~~~~l~~~g~~vP~di~vv~~~~  105 (160)
T PF13377_consen   78 RLALGVLRALRELGIRVPQDISVVSFDD  105 (160)
T ss_dssp             HHHHHHHHHHHHTTSCTTTTSEEEEESS
T ss_pred             HHHHHHHHHHHHcCCcccccccEEEecC
Confidence            3566778889999998777777766655


No 102
>PF04028 DUF374:  Domain of unknown function (DUF374);  InterPro: IPR007172 This is a bacterial domain of unknown function.
Probab=36.33  E-value=78  Score=17.63  Aligned_cols=25  Identities=20%  Similarity=0.309  Sum_probs=17.3

Q ss_pred             hHHHHHHHHHHHHHHHcCCCCCceEEEEeeCC
Q psy3109           2 KCVGIQYLETIRRYIAEGKKFKRTIHVSFVPG   33 (58)
Q Consensus         2 K~~~~~~leAl~~L~~~g~~p~r~I~l~f~~D   33 (58)
                      |++..+..+.++.|+ +|      ..+++++|
T Consensus        43 rgg~~Alr~~~~~lk-~G------~~~~itpD   67 (74)
T PF04028_consen   43 RGGARALREMLRALK-EG------YSIAITPD   67 (74)
T ss_pred             CcHHHHHHHHHHHHH-CC------CeEEEeCC
Confidence            667777777777776 44      25677777


No 103
>cd06301 PBP1_rhizopine_binding_like Periplasmic binding proteins specific to rhizopines. Periplasmic binding proteins specific to rhizopines, which are simple sugar-like compounds produced in the nodules induced by the symbiotic root nodule bacteria, such as Rhizobium and Sinorhizobium. Rhizopine-binding-like proteins from other bacteria are also included. Two inositol based rhizopine compounds are known to date: L-3-O-methly-scyllo-inosamine (3-O-MSI) and scyllo-inosamine. Bacterial strains that can metabolize rhizopine have a greater competitive advantage in nodulation and rhizopine synthesis is regulated by NifA/NtrA regulatory transcription activators which are maximally expressed at the onset of nitrogen fixation in bacteroids. The members of this group belong to the pentose/hexose sugar-binding protein family of the type I periplasmic binding protein superfamily.
Probab=35.71  E-value=66  Score=20.10  Aligned_cols=27  Identities=26%  Similarity=0.262  Sum_probs=18.6

Q ss_pred             HHHHHHHHHHHcCCCCCceEEEEeeCCC
Q psy3109           7 QYLETIRRYIAEGKKFKRTIHVSFVPGP   34 (58)
Q Consensus         7 ~~leAl~~L~~~g~~p~r~I~l~f~~DE   34 (58)
                      .++.+++.+++.|..| +++.++-+-|.
T Consensus       194 ~a~~~~~~l~~~g~~~-~di~ivg~d~~  220 (272)
T cd06301         194 MALGAIMALKAAGKSD-KDVPVAGIDGT  220 (272)
T ss_pred             hHHHHHHHHHHcCCCC-CCcEEEeeCCC
Confidence            4567888888999876 56665555443


No 104
>PRK05943 50S ribosomal protein L25; Reviewed
Probab=35.16  E-value=60  Score=18.54  Aligned_cols=21  Identities=19%  Similarity=0.139  Sum_probs=17.9

Q ss_pred             HHHHHHHcCCCCCceEEEEeeCCCC
Q psy3109          11 TIRRYIAEGKKFKRTIHVSFVPGPV   35 (58)
Q Consensus        11 Al~~L~~~g~~p~r~I~l~f~~DEE   35 (58)
                      +.++|.++|+.|    -++++.++|
T Consensus        16 ~~r~lR~~G~vP----aViYG~~~~   36 (94)
T PRK05943         16 ASRRLRRAGKFP----AIIYGGNEA   36 (94)
T ss_pred             HHHHHHHCCCCC----EEEECCCCC
Confidence            568999999999    788888877


No 105
>COG2875 CobM Precorrin-4 methylase [Coenzyme metabolism]
Probab=34.89  E-value=79  Score=21.97  Aligned_cols=45  Identities=11%  Similarity=0.094  Sum_probs=28.3

Q ss_pred             HHHHHHHHHHcCCCCCceEEEEe---eCCCCCCChhhhHHHHHHHHHHhh
Q psy3109           8 YLETIRRYIAEGKKFKRTIHVSF---VPGPVTQNETCGLAAANSVLYLLG   54 (58)
Q Consensus         8 ~leAl~~L~~~g~~p~r~I~l~f---~~DEE~Gg~~~g~~~~~s~~~~~~   54 (58)
                      +=..++.|..-|+.|.-|+.+++   |+||++  -++-..=++.+.-.-|
T Consensus       171 I~~vv~~L~~g~y~~dtPVaVV~rAsWpDe~i--i~GTL~dIa~kv~~~~  218 (254)
T COG2875         171 IDKVVEELLEGGYPPDTPVAVVYRASWPDEKI--IRGTLEDIAEKVKEAG  218 (254)
T ss_pred             HHHHHHHHhcCCCCCCCCEEEEEecCCCcccE--EEeeHHHHHHHHHhcC
Confidence            34567788887787777776665   899985  2334444445544443


No 106
>TIGR01481 ccpA catabolite control protein A. Catabolite control protein A is a LacI family global transcriptional regulator found in Gram-positive bacteria. CcpA is involved in repressing carbohydrate utilization genes [ex: alpha-amylase (amyE), acetyl-coenzyme A synthase (acsA)] and in activating genes involved in transporting excess carbon from the cell [ex: acetate kinase (ackA), alpha-acetolactate synthase (alsS)]. Additionally, disruption of CcpA in Bacillus megaterium, Staphylococcus xylosus, Lactobacillus casei and Lactocacillus pentosus also decreases growth rate, which suggests CcpA is involved in the regulation of other metabolic pathways.
Probab=34.65  E-value=87  Score=20.39  Aligned_cols=30  Identities=10%  Similarity=0.011  Sum_probs=23.0

Q ss_pred             HHHHHHHHHHHHcCCCCCceEEEEeeCCCC
Q psy3109           6 IQYLETIRRYIAEGKKFKRTIHVSFVPGPV   35 (58)
Q Consensus         6 ~~~leAl~~L~~~g~~p~r~I~l~f~~DEE   35 (58)
                      .+++.+++.|.+.|.+.+.+|.++-+-|-+
T Consensus       246 ~~A~g~~~al~~~g~~vP~dvsvvgfd~~~  275 (329)
T TIGR01481       246 EMAAGILNAAMDAGIKVPEDLEVITSNNTR  275 (329)
T ss_pred             HHHHHHHHHHHHcCCCCCCceEEEeeCCch
Confidence            356678899999999877888877665554


No 107
>PRK11191 RNase E inhibitor protein; Provisional
Probab=34.14  E-value=1.2e+02  Score=18.96  Aligned_cols=31  Identities=19%  Similarity=0.215  Sum_probs=24.2

Q ss_pred             HHHHHHHHHHHHHcCCCCC--ceEEEEeeCCCC
Q psy3109           5 GIQYLETIRRYIAEGKKFK--RTIHVSFVPGPV   35 (58)
Q Consensus         5 ~~~~leAl~~L~~~g~~p~--r~I~l~f~~DEE   35 (58)
                      ...--+.++.|++.|..|.  |+|...|.+..+
T Consensus        11 ~~~~~eVi~~L~edGsd~~~~~~IEH~~~f~d~   43 (138)
T PRK11191         11 REETREIIEELLEDGSDPDALYTIEHHFSADDF   43 (138)
T ss_pred             HHHHHHHHHHHHHcCCCcCCCEEEEEEEecCCH
Confidence            4555678899999998775  899988876655


No 108
>cd06276 PBP1_FucR_like Ligand-binding domain of a transcription repressor, FucR, which functions as a molecular sensor of L-fucose availability. Ligand-binding domain of a transcription repressor, FucR, which functions as a molecular sensor of L-fucose availability. FcuR acts as an inducer of fucRRIAK and as a corepressor of another locus that regulates production of fucosylated glycans. FcuR and its close homologs in this group are a member of the LacI-GalR family repressors that are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes t
Probab=33.49  E-value=1.1e+02  Score=19.47  Aligned_cols=31  Identities=10%  Similarity=0.046  Sum_probs=25.4

Q ss_pred             HHHHHHHHHHHHcCCCCCceEEEEeeCCCCC
Q psy3109           6 IQYLETIRRYIAEGKKFKRTIHVSFVPGPVT   36 (58)
Q Consensus         6 ~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~   36 (58)
                      ..++.+++.+.+.|.+.+.+|.++-+.|.+.
T Consensus       172 ~~A~g~~~~l~~~g~~iP~disvigfd~~~~  202 (247)
T cd06276         172 TDLVFLIKKARESGLLLGKDIGIISYNDTPL  202 (247)
T ss_pred             HHHHHHHHHHHHcCCcCCceeEEEEecCchh
Confidence            4567888999999998888999888777664


No 109
>KOG2170|consensus
Probab=33.27  E-value=1.6e+02  Score=21.42  Aligned_cols=48  Identities=19%  Similarity=0.136  Sum_probs=34.7

Q ss_pred             HHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHHHHHHhhhcc
Q psy3109           8 YLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANSVLYLLGLMS   57 (58)
Q Consensus         8 ~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s~~~~~~~~~   57 (58)
                      ++.|++....+. .|++++.++|-.--=||-. +=..+++...|..|++|
T Consensus        94 Vv~alk~~~~n~-~p~KPLvLSfHG~tGTGKN-~Va~iiA~n~~~~Gl~S  141 (344)
T KOG2170|consen   94 VVNALKSHWANP-NPRKPLVLSFHGWTGTGKN-YVAEIIAENLYRGGLRS  141 (344)
T ss_pred             HHHHHHHHhcCC-CCCCCeEEEecCCCCCchh-HHHHHHHHHHHhccccc
Confidence            455666666665 6999999999755544432 45777888888888887


No 110
>cd06272 PBP1_hexuronate_repressor_like Ligand-binding domain of DNA transcription repressor for the hexuronate utilization operon from Bacillus species and its close homologs from other bacteria, all of which are a member of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressor for the hexuronate utilization operon from Bacillus species and its close homologs from other bacteria, all of which are a member of the LacI-GalR family of bacterial transcription regulators. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor 
Probab=32.50  E-value=93  Score=19.35  Aligned_cols=28  Identities=11%  Similarity=0.035  Sum_probs=20.2

Q ss_pred             HHHHHHHHHHHHcCCCCCceEEEEeeCC
Q psy3109           6 IQYLETIRRYIAEGKKFKRTIHVSFVPG   33 (58)
Q Consensus         6 ~~~leAl~~L~~~g~~p~r~I~l~f~~D   33 (58)
                      .+++.+++.+++.|.+.+.+|.++-+.|
T Consensus       182 ~~a~~~~~~l~~~g~~vp~dv~vvg~d~  209 (261)
T cd06272         182 DIALGVLSALNKQGISIPEDIEIISYDN  209 (261)
T ss_pred             HHHHHHHHHHHHhCCCCCCceEEEeeCC
Confidence            3456788889999987667777665554


No 111
>PF07708 Tash_PEST:  Tash protein PEST motif;  InterPro: IPR011695 The PEST motif is found in one or more copies in Tash AT-hook proteins from Theileria annulata. Tash proteins are transported to the host nucleus and are thought to be involved in pathogenesis []. The PEST motif is often found in conjunction with the (IPR007480 from INTERPRO), whose function is unknown. These repeats may be part of the PEST motif (a signal for rapid proteolytic degradation) [], though this is not proven. This motif is also found in other T. annulata proteins, which have no other known domains.
Probab=30.81  E-value=48  Score=14.30  Aligned_cols=12  Identities=25%  Similarity=0.147  Sum_probs=10.2

Q ss_pred             ceEEEEeeCCCC
Q psy3109          24 RTIHVSFVPGPV   35 (58)
Q Consensus        24 r~I~l~f~~DEE   35 (58)
                      -+|.+=+.+|||
T Consensus         6 Eti~vEi~SDee   17 (19)
T PF07708_consen    6 ETIPVEIGSDEE   17 (19)
T ss_pred             ceEEEEeccccc
Confidence            578899999987


No 112
>PF04023 FeoA:  FeoA domain;  InterPro: IPR007167 This entry represents the core domain of the ferrous iron (Fe2+) transport protein FeoA found in bacteria. This domain also occurs at the C terminus in related proteins. The transporter Feo is composed of three proteins: FeoA a small, soluble SH3-domain protein probably located in the cytosol; FeoB, a large protein with a cytosolic N-terminal G-protein domain and a C-terminal integral inner-membrane domain containing two 'Gate' motifs which likely functions as the Fe2+ permease; and FeoC, a small protein apparently functioning as an [Fe-S]-dependent transcriptional repressor [, ]. Feo allows the bacterial cell to acquire iron from its environment. ; GO: 0005506 iron ion binding; PDB: 3HRT_B 3HRS_A 3HRU_B 1G3T_A 1C0W_B 1BI3_A 1QVP_A 1BI1_A 1BYM_A 2QQB_A ....
Probab=30.76  E-value=87  Score=16.32  Aligned_cols=21  Identities=29%  Similarity=0.194  Sum_probs=16.8

Q ss_pred             HHHHHHHHcCCCCCceEEEEe
Q psy3109          10 ETIRRYIAEGKKFKRTIHVSF   30 (58)
Q Consensus        10 eAl~~L~~~g~~p~r~I~l~f   30 (58)
                      +.+++|.+.|+.|-..|.+.-
T Consensus        23 ~~~~~L~~lGl~~G~~i~v~~   43 (74)
T PF04023_consen   23 ELLRRLADLGLTPGSEITVIR   43 (74)
T ss_dssp             HHHHHHHHCT-STTEEEEEEE
T ss_pred             HHHHHHHHCCCCCCCEEEEEE
Confidence            467899999999998888873


No 113
>cd01541 PBP1_AraR Ligand-binding domain of DNA transcription repressor specific for arabinose (AraR) which is a member of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressor specific for arabinose (AraR) which is a member of the LacI-GalR family of bacterial transcription regulators. The ligand-binding domain of AraR is structurally homologous to the periplasmic sugar-binding domain of ABC-type transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which i
Probab=30.43  E-value=1.2e+02  Score=18.96  Aligned_cols=29  Identities=21%  Similarity=0.062  Sum_probs=19.6

Q ss_pred             HHHHHHHHHHHcCCCCCceEEEEeeCCCC
Q psy3109           7 QYLETIRRYIAEGKKFKRTIHVSFVPGPV   35 (58)
Q Consensus         7 ~~leAl~~L~~~g~~p~r~I~l~f~~DEE   35 (58)
                      ..+-+++.+++.|.+.+.++.++-..+.+
T Consensus       194 ~a~g~~~al~~~g~~~p~dv~vvg~d~~~  222 (273)
T cd01541         194 IALRVIDLLKELGLKIPEDISVVGFDDSY  222 (273)
T ss_pred             HHHHHHHHHHHcCCCCCCcEEEEEcCCcc
Confidence            45667888889998766677665554443


No 114
>cd01542 PBP1_TreR_like Ligand-binding domain of DNA transcription repressor specific for trehalose (TreR) which is a member of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressor specific for trehalose (TreR) which is a member of the LacI-GalR family of bacterial transcription regulators. The ligand-binding domain of TreR is structurally homologous to the periplasmic sugar-binding domain of ABC-type transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding wh
Probab=30.05  E-value=1.2e+02  Score=18.73  Aligned_cols=30  Identities=17%  Similarity=0.155  Sum_probs=21.3

Q ss_pred             HHHHHHHHHHHHcCCCCCceEEEEeeCCCC
Q psy3109           6 IQYLETIRRYIAEGKKFKRTIHVSFVPGPV   35 (58)
Q Consensus         6 ~~~leAl~~L~~~g~~p~r~I~l~f~~DEE   35 (58)
                      .+++.+++.|.+.|.+.+++|.++-+.|-+
T Consensus       183 ~~a~g~~~~l~~~g~~vp~di~v~g~d~~~  212 (259)
T cd01542         183 TIALGAMKYLQELGRRIPEDISVAGFGGYE  212 (259)
T ss_pred             HHHHHHHHHHHHcCCCCCCceEEEecCCch
Confidence            346678888889998877788766554433


No 115
>TIGR03642 cas_csx13 CRISPR-associated protein, Csx13 family. This model describes a protein N-terminal protein sequence domain strictly associated with CRISPR and CRISPR-associated protein systems. This model and TIGR02584 identify two separate clades from a larger homology domain family, both CRISPR-associated, while other homologs are found that may not be. Members are found in bacteria that include Pelotomaculum thermopropionicum SI, Thermoanaerobacter tengcongensis MB4, and Roseiflexus sp. RS-1, and in archaea that include Thermoplasma volcanium, Picrophilus torridus, and Methanospirillum hungatei. The molecular function is unknown.
Probab=29.16  E-value=93  Score=19.12  Aligned_cols=28  Identities=14%  Similarity=0.198  Sum_probs=22.4

Q ss_pred             HHHHHHHHHHcCCCCCceEEEEeeCCCCC
Q psy3109           8 YLETIRRYIAEGKKFKRTIHVSFVPGPVT   36 (58)
Q Consensus         8 ~leAl~~L~~~g~~p~r~I~l~f~~DEE~   36 (58)
                      +-|++.+|.+.|. |...|+++-+-|+++
T Consensus         7 vTEtl~aL~~~g~-~i~ev~lI~T~~~~v   34 (124)
T TIGR03642         7 ITEAIDYLKKKGE-PISDVILIYTKDPYV   34 (124)
T ss_pred             HHHHHHHHHhcCC-CCCeEEEEEcCCHHH
Confidence            4588899999986 567899998888775


No 116
>PF08967 DUF1884:  Domain of unknown function (DUF1884);  InterPro: IPR014418 This group represents an uncharacterised conserved protein.; PDB: 2PK8_A.
Probab=28.96  E-value=71  Score=18.73  Aligned_cols=15  Identities=20%  Similarity=0.166  Sum_probs=8.7

Q ss_pred             HHHHHHHHHcCCCCC
Q psy3109           9 LETIRRYIAEGKKFK   23 (58)
Q Consensus         9 leAl~~L~~~g~~p~   23 (58)
                      =++++.|+..|+.|.
T Consensus        15 e~~inELk~dG~ePD   29 (85)
T PF08967_consen   15 EEKINELKEDGFEPD   29 (85)
T ss_dssp             HHHHHHHHHTT----
T ss_pred             HHHHHHHHhcCCCCC
Confidence            356778999999993


No 117
>PF02780 Transketolase_C:  Transketolase, C-terminal domain;  InterPro: IPR005476 Transketolase 2.2.1.1 from EC (TK) catalyzes the reversible transfer of a two-carbon ketol unit from xylulose 5-phosphate to an aldose receptor, such as ribose 5-phosphate, to form sedoheptulose 7-phosphate and glyceraldehyde 3- phosphate. This enzyme, together with transaldolase, provides a link between the glycolytic and pentose-phosphate pathways. TK requires thiamine pyrophosphate as a cofactor. In most sources where TK has been purified, it is a homodimer of approximately 70 Kd subunits. TK sequences from a variety of eukaryotic and prokaryotic sources [, ] show that the enzyme has been evolutionarily conserved. In the peroxisomes of methylotrophic yeast Pichia angusta (Yeast) (Hansenula polymorpha), there is a highly related enzyme, dihydroxy-acetone synthase (DHAS) 2.2.1.3 from EC (also known as formaldehyde transketolase), which exhibits a very unusual specificity by including formaldehyde amongst its substrates.  1-deoxyxylulose-5-phosphate synthase (DXP synthase) [] is an enzyme so far found in bacteria (gene dxs) and plants (gene CLA1) which catalyzes the thiamine pyrophosphoate-dependent acyloin condensation reaction between carbon atoms 2 and 3 of pyruvate and glyceraldehyde 3-phosphate to yield 1-deoxy-D- xylulose-5-phosphate (dxp), a precursor in the biosynthetic pathway to isoprenoids, thiamine (vitamin B1), and pyridoxol (vitamin B6). DXP synthase is evolutionary related to TK. The N-terminal section, contains a histidine residue which appears to function in proton transfer during catalysis []. In the central section there are conserved acidic residues that are part of the active cleft and may participate in substrate-binding []. This family includes transketolase enzymes 2.2.1.1 from EC and also partially matches to 2-oxoisovalerate dehydrogenase beta subunit P37941 from SWISSPROT 1.2.4.4 from EC. Both these enzymes utilise thiamine pyrophosphate as a cofactor, suggesting there may be common aspects in their mechanism of catalysis.; GO: 0003824 catalytic activity, 0008152 metabolic process; PDB: 1NGS_B 1TKA_A 1TRK_B 1TKB_A 1GPU_B 1AY0_B 1TKC_B 2E6K_A 3JU3_A 2R8P_B ....
Probab=28.21  E-value=55  Score=18.94  Aligned_cols=32  Identities=16%  Similarity=-0.034  Sum_probs=23.4

Q ss_pred             HHHHHHHHHHHHHHcCCCCC-ceEEEEeeCCCC
Q psy3109           4 VGIQYLETIRRYIAEGKKFK-RTIHVSFVPGPV   35 (58)
Q Consensus         4 ~~~~~leAl~~L~~~g~~p~-r~I~l~f~~DEE   35 (58)
                      ++-.+++|.+.|.++|+... -+++.+--.|+|
T Consensus        20 ~~~~al~A~~~L~~~Gi~~~vi~~~~i~P~d~~   52 (124)
T PF02780_consen   20 MVEEALEAAEELEEEGIKAGVIDLRTIKPFDEE   52 (124)
T ss_dssp             HHHHHHHHHHHHHHTTCEEEEEEEEEEESSBHH
T ss_pred             HHHHHHHHHHHHHHcCCceeEEeeEEEeccccc
Confidence            45678999999999997654 455666666765


No 118
>PF02070 NMU:  Neuromedin U;  InterPro: IPR008199 Neuromedin U (NmU) [, ] is a vertebrate peptide which stimulates uterine smooth muscle contraction and causes selective vasoconstriction. Like most other active peptides, it is proteolytically processed from a larger precursor protein. The mature peptides are 8 (NmU-8) to 25 (NmU-25) residues long and C-terminally amidated. The sequence of the C-terminal extremity of NmU is extremely well conserved.; GO: 0006940 regulation of smooth muscle contraction
Probab=28.07  E-value=20  Score=16.48  Aligned_cols=13  Identities=23%  Similarity=0.233  Sum_probs=8.8

Q ss_pred             eeCCCCCCChhhhH
Q psy3109          30 FVPGPVTQNETCGL   43 (58)
Q Consensus        30 f~~DEE~Gg~~~g~   43 (58)
                      |-.|||..|+ +++
T Consensus         1 fk~dee~QgP-~~~   13 (25)
T PF02070_consen    1 FKVDEEFQGP-GGI   13 (25)
T ss_pred             CccchhccCC-ccc
Confidence            3469998887 443


No 119
>cd06298 PBP1_CcpA_like Ligand-binding domain of the catabolite control protein A (CcpA), which functions as the major transcriptional regulator of carbon catabolite repression/regulation. Ligand-binding domain of the catabolite control protein A (CcpA), which functions as the major transcriptional regulator of carbon catabolite repression/regulation (CCR), a process in which enzymes necessary for the metabolism of alternative sugars are inhibited in the presence of glucose. In gram-positive bacteria, CCR is controlled by HPr, a phosphoenolpyruvate:sugar phsophotrasnferase system (PTS) and a transcriptional regulator CcpA. Moreover, CcpA can regulate sporulation and antibiotic resistance as well as play a role in virulence development of certain pathogens such as the group A streptococcus. The ligand binding domain of CcpA is a member of the LacI-GalR family of bacterial transcription regulators.
Probab=28.00  E-value=1.1e+02  Score=18.94  Aligned_cols=29  Identities=7%  Similarity=-0.011  Sum_probs=21.0

Q ss_pred             HHHHHHHHHHHcCCCCCceEEEEeeCCCC
Q psy3109           7 QYLETIRRYIAEGKKFKRTIHVSFVPGPV   35 (58)
Q Consensus         7 ~~leAl~~L~~~g~~p~r~I~l~f~~DEE   35 (58)
                      .++.+++.|++.|...+++|.++-.-|.+
T Consensus       188 ~a~~~~~~l~~~g~~vp~di~vvg~d~~~  216 (268)
T cd06298         188 LAIGILNAAQDAGLKVPEDFEIIGFNNTK  216 (268)
T ss_pred             HHHHHHHHHHHcCCCCccceEEEeeccHH
Confidence            35677888889998777788766665544


No 120
>KOG1067|consensus
Probab=27.79  E-value=51  Score=25.91  Aligned_cols=45  Identities=27%  Similarity=0.262  Sum_probs=33.6

Q ss_pred             HHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHHHHH
Q psy3109           5 GIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANSVLY   51 (58)
Q Consensus         5 ~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s~~~   51 (58)
                      ..+++.++++|.+.--+++|+...+|.+|.|..-.  -..++..+++
T Consensus       253 ~q~~i~~i~~L~k~~Gk~Kr~~~~~~~p~pel~K~--v~~la~erl~  297 (760)
T KOG1067|consen  253 AQQIIQGIERLAKKYGKQKREPQKLFLPDPELVKH--VHKLAMERLY  297 (760)
T ss_pred             HHHHHHHHHHHHHHhCccccchhhhcCCCHHHHHH--HHHHHHHHHH
Confidence            35778899999877668999999999999997532  3445555444


No 121
>PF13738 Pyr_redox_3:  Pyridine nucleotide-disulphide oxidoreductase; PDB: 3D1C_A 4A9W_B 2YLX_A 2YM2_A 2YLW_A 2YLR_A 2YM1_A 2YLS_A 1W4X_A 2YLT_A ....
Probab=27.71  E-value=1.3e+02  Score=18.12  Aligned_cols=31  Identities=10%  Similarity=-0.075  Sum_probs=20.4

Q ss_pred             HHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChh
Q psy3109           6 IQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNET   40 (58)
Q Consensus         6 ~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~   40 (58)
                      .+.+.+...|++.|..+    .++|-..++.|+..
T Consensus         7 ~aGl~~a~~l~~~g~~~----v~v~e~~~~~Gg~w   37 (203)
T PF13738_consen    7 PAGLAAAAHLLERGIDP----VVVLERNDRPGGVW   37 (203)
T ss_dssp             HHHHHHHHHHHHTT-------EEEEESSSSSTTHH
T ss_pred             HHHHHHHHHHHhCCCCc----EEEEeCCCCCCCee
Confidence            45677778889988744    56677777787753


No 122
>cd06285 PBP1_LacI_like_7 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold.  As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor.
Probab=27.69  E-value=1.3e+02  Score=18.76  Aligned_cols=27  Identities=4%  Similarity=-0.017  Sum_probs=19.5

Q ss_pred             HHHHHHHHHHHcCCCCCceEEEEeeCC
Q psy3109           7 QYLETIRRYIAEGKKFKRTIHVSFVPG   33 (58)
Q Consensus         7 ~~leAl~~L~~~g~~p~r~I~l~f~~D   33 (58)
                      +++.+++.+++.|.+.+.++.++-+-+
T Consensus       186 ~a~g~~~~l~~~g~~~p~di~iig~d~  212 (265)
T cd06285         186 AAIGVMGAARDRGLRVPDDVALVGYND  212 (265)
T ss_pred             HHHHHHHHHHHcCCCCCcceEEEeecC
Confidence            456788889999987667776555444


No 123
>PRK11303 DNA-binding transcriptional regulator FruR; Provisional
Probab=26.86  E-value=1.3e+02  Score=19.50  Aligned_cols=29  Identities=0%  Similarity=-0.007  Sum_probs=21.2

Q ss_pred             HHHHHHHHHHHcCCCCCceEEEEeeCCCC
Q psy3109           7 QYLETIRRYIAEGKKFKRTIHVSFVPGPV   35 (58)
Q Consensus         7 ~~leAl~~L~~~g~~p~r~I~l~f~~DEE   35 (58)
                      +++.+++.|.+.|.+.+.+|-++-+.|.+
T Consensus       249 ~A~g~~~al~~~g~~vP~disv~gfd~~~  277 (328)
T PRK11303        249 LLQGVLDVLLERPGELPSDLAIATFGDNE  277 (328)
T ss_pred             HHHHHHHHHHHcCCCCCCceEEEEeCChH
Confidence            45678889999998766788776555544


No 124
>cd03036 ArsC_like Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC. Proteins containing a redox active CXXC motif like TRX and glutaredoxin (GRX) function as protein disulfide oxidoreductases, altering the redox state of target proteins via the reversible oxidation of the active site dithiol. ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from glutathione via GRX, through a single catalytic cysteine.
Probab=26.73  E-value=77  Score=18.32  Aligned_cols=19  Identities=26%  Similarity=0.382  Sum_probs=14.6

Q ss_pred             HHHHHHHHcCCCCCceEEE
Q psy3109          10 ETIRRYIAEGKKFKRTIHV   28 (58)
Q Consensus        10 eAl~~L~~~g~~p~r~I~l   28 (58)
                      ++++.|.++...-+|||.+
T Consensus        79 e~~~~l~~~p~LikRPIi~   97 (111)
T cd03036          79 EALELLSSDGMLIKRPFVV   97 (111)
T ss_pred             HHHHHHHhCcCeeeCCEEE
Confidence            5677788887777888875


No 125
>PRK10727 DNA-binding transcriptional regulator GalR; Provisional
Probab=26.62  E-value=1.3e+02  Score=19.78  Aligned_cols=29  Identities=7%  Similarity=-0.105  Sum_probs=21.9

Q ss_pred             HHHHHHHHHHHHcCCCCCceEEEEeeCCC
Q psy3109           6 IQYLETIRRYIAEGKKFKRTIHVSFVPGP   34 (58)
Q Consensus         6 ~~~leAl~~L~~~g~~p~r~I~l~f~~DE   34 (58)
                      .+++.++++|.+.|++.+.+|-++-+-|-
T Consensus       247 ~~A~g~~~al~~~G~~vP~disVigfD~~  275 (343)
T PRK10727        247 SMAAGAMGVLNDNGIDVPGEISLIGFDDV  275 (343)
T ss_pred             HHHHHHHHHHHHcCCCCCcceeEEeecCc
Confidence            45677889999999987778887755454


No 126
>cd00495 Ribosomal_L25_TL5_CTC Ribosomal_L25_TL5_CTC: Ribosomal L25/TL5/CTC N-terminal 5S rRNA binding domain. L25 is a single-domain protein, homologous to the N-terminal domain of TL5 and CTC, which each contain two domains. CTC is a known stress protein, and proteins of this family are believed to have two functions, acting as both ribosomal and stress proteins. In Escherichia coli, cells deleted for L25 were found to be viable; however, these cells grew slowly and had impaired protein synthesis capability. In Bacillus subtilis, CTC is induced under stress conditions and located in the ribosome; it has been proposed that CTC may be necessary for accurate translation under stress conditions. Ribosomal_L25_TL5_CTC is found only in bacteria and some plastids. Due to its limited taxonomic diversity and the viability of cells deleted for L25, this protein is not believed to be necessary for ribosomal assembly. Eukaryotes contain a protein called L25, which is not homologous to bacterial L
Probab=26.38  E-value=1.1e+02  Score=17.17  Aligned_cols=21  Identities=29%  Similarity=0.249  Sum_probs=17.5

Q ss_pred             HHHHHHHcCCCCCceEEEEeeCCCC
Q psy3109          11 TIRRYIAEGKKFKRTIHVSFVPGPV   35 (58)
Q Consensus        11 Al~~L~~~g~~p~r~I~l~f~~DEE   35 (58)
                      +.++|.++|+.|    -++++.++|
T Consensus        15 ~~r~lR~~G~iP----avvYG~~~~   35 (91)
T cd00495          15 ASRRLRRAGKVP----AVIYGKGKE   35 (91)
T ss_pred             HHHHHHHCCCCC----EEEECCCCC
Confidence            578899999999    678888776


No 127
>cd06280 PBP1_LacI_like_4 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold.  As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor.
Probab=26.06  E-value=1.7e+02  Score=18.21  Aligned_cols=28  Identities=14%  Similarity=0.233  Sum_probs=19.3

Q ss_pred             HHHHHHHHHHHHcCCCCCceEEEEeeCC
Q psy3109           6 IQYLETIRRYIAEGKKFKRTIHVSFVPG   33 (58)
Q Consensus         6 ~~~leAl~~L~~~g~~p~r~I~l~f~~D   33 (58)
                      .+++.+++.+++.|.+++.+|.++-+-|
T Consensus       182 ~~a~g~~~~l~~~g~~~p~di~iig~d~  209 (263)
T cd06280         182 LLLLGALRAVRAAGLRIPQDLALAGFDN  209 (263)
T ss_pred             HHHHHHHHHHHHcCCCCCCcEEEEEeCC
Confidence            3456778888888887767776654433


No 128
>COG1057 NadD Nicotinic acid mononucleotide adenylyltransferase [Coenzyme metabolism]
Probab=25.93  E-value=82  Score=20.58  Aligned_cols=26  Identities=8%  Similarity=-0.018  Sum_probs=19.7

Q ss_pred             HHHHHHHHHHcCCCCCceEEEEeeCCC
Q psy3109           8 YLETIRRYIAEGKKFKRTIHVSFVPGP   34 (58)
Q Consensus         8 ~leAl~~L~~~g~~p~r~I~l~f~~DE   34 (58)
                      .+.+++.+++. ..|...++++.++|=
T Consensus        86 T~dTl~~~~~~-~~p~~~~~fIiGaD~  111 (197)
T COG1057          86 TIDTLEHLRQE-YGPDVELYFIIGADN  111 (197)
T ss_pred             hHHHHHHHHHH-hCCCCcEEEEEehHH
Confidence            36778888844 347778999999994


No 129
>PF11051 Mannosyl_trans3:  Mannosyltransferase putative;  InterPro: IPR022751 Alpha-mannosyltransferase is responsible for the addition of residues to the outer chain of core N-linked polysaccharides and to O-linked mannotriose. It is implicated in late Golgi modifications [][][]. The proteins matching this entry are conserved in fungi and also found in some phototrophic organisms.; GO: 0006486 protein glycosylation
Probab=25.58  E-value=1.4e+02  Score=19.98  Aligned_cols=29  Identities=17%  Similarity=0.076  Sum_probs=21.8

Q ss_pred             HHHHHHHHHHHHcCCCCCceEEEEeeCCCCC
Q psy3109           6 IQYLETIRRYIAEGKKFKRTIHVSFVPGPVT   36 (58)
Q Consensus         6 ~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~   36 (58)
                      .+++..++.|.+.|-  +-||.+++.+|+|.
T Consensus        14 ~~a~~lI~~LR~~g~--~LPIEI~~~~~~dl   42 (271)
T PF11051_consen   14 WLALRLIRVLRRLGN--TLPIEIIYPGDDDL   42 (271)
T ss_pred             HHHHHHHHHHHHhCC--CCCEEEEeCCcccc
Confidence            455677888877764  45799999998886


No 130
>PRK09526 lacI lac repressor; Reviewed
Probab=24.37  E-value=1.9e+02  Score=18.89  Aligned_cols=28  Identities=18%  Similarity=0.088  Sum_probs=20.4

Q ss_pred             HHHHHHHHHHHHcCCCCCceEEEEeeCC
Q psy3109           6 IQYLETIRRYIAEGKKFKRTIHVSFVPG   33 (58)
Q Consensus         6 ~~~leAl~~L~~~g~~p~r~I~l~f~~D   33 (58)
                      ..++.+++.|.+.|.+.+.+|.++-+-|
T Consensus       250 ~~A~g~~~al~~~g~~vP~disvig~d~  277 (342)
T PRK09526        250 QMALGVLRALHESGLRVPGQISVIGYDD  277 (342)
T ss_pred             HHHHHHHHHHHHcCCCCCCceEEEeeCC
Confidence            4567788899999988777777654444


No 131
>PRK14987 gluconate operon transcriptional regulator; Provisional
Probab=23.89  E-value=1.7e+02  Score=19.13  Aligned_cols=30  Identities=10%  Similarity=-0.066  Sum_probs=21.7

Q ss_pred             HHHHHHHHHHHHcCCCCCceEEEEeeCCCC
Q psy3109           6 IQYLETIRRYIAEGKKFKRTIHVSFVPGPV   35 (58)
Q Consensus         6 ~~~leAl~~L~~~g~~p~r~I~l~f~~DEE   35 (58)
                      .+++.++++|++.|.+.+.+|.++-+-|.+
T Consensus       249 ~~A~g~~~al~~~g~~vP~disvigfD~~~  278 (331)
T PRK14987        249 DLAVGAAFECQRLGLKVPDDMAIAGFHGHD  278 (331)
T ss_pred             HHHHHHHHHHHHcCCCCCCccEEEeeCCch
Confidence            356677888889998877888876555443


No 132
>PF00743 FMO-like:  Flavin-binding monooxygenase-like;  InterPro: IPR020946 Flavin-containing monooxygenases (FMOs) constitute a family of xenobiotic-metabolising enzymes []. Using an NADPH cofactor and FAD prosthetic group, these microsomal proteins catalyse the oxygenation of nucleophilic nitrogen, sulphur, phosphorous and selenium atoms in a range of structurally diverse compounds. FMOs have been implicated in the metabolism of a number of pharmaceuticals, pesticides and toxicants. In man, lack of hepatic FMO-catalysed trimethylamine metabolism results in trimethylaminuria (fish odour syndrome). Five mammalian forms of FMO are now known and have been designated FMO1-FMO5 [, , , , ]. This is a recent nomenclature based on comparison of amino acid sequences, and has been introduced in an attempt to eliminate confusion inherent in multiple, laboratory-specific designations and tissue-based classifications []. Following the determination of the complete nucleotide sequence of Saccharomyces cerevisiae (Baker's yeast) [], a novel gene was found to encode a protein with similarity to mammalian monooygenases.; GO: 0004499 flavin-containing monooxygenase activity, 0050660 flavin adenine dinucleotide binding, 0050661 NADP binding, 0055114 oxidation-reduction process; PDB: 2GVC_E 1VQW_B 2GV8_B 2XVI_B 2XVH_B 2XLS_A 2XLR_A 2XLU_D 2XLP_B 2XVE_A ....
Probab=23.38  E-value=89  Score=23.19  Aligned_cols=32  Identities=16%  Similarity=0.140  Sum_probs=24.2

Q ss_pred             HHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChh
Q psy3109           4 VGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNET   40 (58)
Q Consensus         4 ~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~   40 (58)
                      ...+.|.|++.|+++|+.+     ++|=-.+.+||..
T Consensus         9 aG~sGL~a~k~l~e~g~~~-----~~fE~~~~iGG~W   40 (531)
T PF00743_consen    9 AGPSGLAAAKNLLEEGLEV-----TCFEKSDDIGGLW   40 (531)
T ss_dssp             -SHHHHHHHHHHHHTT-EE-----EEEESSSSSSGGG
T ss_pred             ccHHHHHHHHHHHHCCCCC-----eEEecCCCCCccC
Confidence            4567899999999999733     5777888888874


No 133
>PF13454 NAD_binding_9:  FAD-NAD(P)-binding
Probab=23.16  E-value=1.4e+02  Score=17.89  Aligned_cols=33  Identities=12%  Similarity=0.137  Sum_probs=21.7

Q ss_pred             HHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCC
Q psy3109           4 VGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQ   37 (58)
Q Consensus         4 ~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~G   37 (58)
                      +....+.++++|.+.. .++.++.+.++-..+.|
T Consensus         5 ~G~~G~~~l~~L~~~~-~~~~~~~I~vfd~~~~G   37 (156)
T PF13454_consen    5 GGPSGLAVLERLLRQA-DPKPPLEITVFDPSPFG   37 (156)
T ss_pred             cCHHHHHHHHHHHHhc-CCCCCCEEEEEcCCCcc
Confidence            3456677888888875 55566666666556664


No 134
>smart00493 TOPRIM topoisomerases, DnaG-type primases, OLD family nucleases and RecR proteins.
Probab=22.83  E-value=1.3e+02  Score=15.52  Aligned_cols=22  Identities=14%  Similarity=0.087  Sum_probs=14.7

Q ss_pred             HHHHHHcCCCCCceEEEEeeCCCC
Q psy3109          12 IRRYIAEGKKFKRTIHVSFVPGPV   35 (58)
Q Consensus        12 l~~L~~~g~~p~r~I~l~f~~DEE   35 (58)
                      ++.+++..  +.+.|++++-+|+|
T Consensus        38 ~~~l~~~~--~~~~Iii~~D~D~~   59 (76)
T smart00493       38 IKLLKRLA--KKKEVILATDPDRE   59 (76)
T ss_pred             HHHHHHHh--cCCEEEEEcCCChh
Confidence            44444432  23679999999998


No 135
>PRK10745 trkD potassium transport protein Kup; Provisional
Probab=22.81  E-value=1.7e+02  Score=22.73  Aligned_cols=34  Identities=9%  Similarity=-0.014  Sum_probs=26.9

Q ss_pred             HHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCC
Q psy3109           3 CVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQN   38 (58)
Q Consensus         3 ~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg   38 (58)
                      +.+.+.+.-.++..+.|+-|+  +.+..++++|.|-
T Consensus       303 AvISGaFSl~~QAi~Lg~~Pr--l~I~hTS~~~~GQ  336 (622)
T PRK10745        303 AVISGVFSLTRQAVRLGYLPP--MRIIHTSEMESGQ  336 (622)
T ss_pred             HHHHhHHHHHHHHHHhCCCCC--ceEEeCCCccCCc
Confidence            456667778889999999996  6788999998654


No 136
>PF13592 HTH_33:  Winged helix-turn helix
Probab=22.65  E-value=31  Score=18.01  Aligned_cols=24  Identities=8%  Similarity=-0.046  Sum_probs=18.4

Q ss_pred             HHHHHHcCCCCCceEEEEeeCCCC
Q psy3109          12 IRRYIAEGKKFKRTIHVSFVPGPV   35 (58)
Q Consensus        12 l~~L~~~g~~p~r~I~l~f~~DEE   35 (58)
                      -+.|++.|+.++++-...--.|||
T Consensus        28 ~~lL~r~G~s~~kp~~~~~k~d~~   51 (60)
T PF13592_consen   28 YRLLKRLGFSYQKPRPRPPKADEE   51 (60)
T ss_pred             HHHHHHcCCccccCCCCcccCCHH
Confidence            355778899888887777777776


No 137
>TIGR00731 ctc_TL5 ribosomal protein L25, Ctc-form. The C-terminal domain of sll1824, an apparent L25 of Synechocystis PCC6803, matches the N-terminal domain of this family. Examples of L25 and Ctc are not separated by a UPGMA tree built on the region of shared homology.
Probab=22.56  E-value=1.2e+02  Score=19.35  Aligned_cols=23  Identities=26%  Similarity=0.296  Sum_probs=19.0

Q ss_pred             HHHHHHHHcCCCCCceEEEEeeCCCCC
Q psy3109          10 ETIRRYIAEGKKFKRTIHVSFVPGPVT   36 (58)
Q Consensus        10 eAl~~L~~~g~~p~r~I~l~f~~DEE~   36 (58)
                      .+.++|.++|+.|    -++++.++|.
T Consensus        12 ~~~r~LR~~G~VP----aVvYG~~~~~   34 (176)
T TIGR00731        12 SAARRIRKEGRIP----AVVYGKGKEN   34 (176)
T ss_pred             hHHHHHHHCCCcc----EEEECCCCCC
Confidence            3678999999999    7888888773


No 138
>KOG1164|consensus
Probab=22.49  E-value=1e+02  Score=20.68  Aligned_cols=19  Identities=21%  Similarity=0.326  Sum_probs=15.9

Q ss_pred             HHHHHHHHHHHHHHcCCCC
Q psy3109           4 VGIQYLETIRRYIAEGKKF   22 (58)
Q Consensus         4 ~~~~~leAl~~L~~~g~~p   22 (58)
                      ..+|.+.|++.+++.|+..
T Consensus       127 ia~q~l~~l~~lH~~G~iH  145 (322)
T KOG1164|consen  127 IAIQNLNALEDLHSKGFIH  145 (322)
T ss_pred             HHHHHHHHHHHHHhcCccc
Confidence            3568999999999999755


No 139
>PF10850 DUF2653:  Protein of unknown function (DUF2653);  InterPro: IPR020516 This entry contains proteins with no known function.
Probab=22.33  E-value=1.5e+02  Score=17.49  Aligned_cols=29  Identities=24%  Similarity=0.377  Sum_probs=19.8

Q ss_pred             HHHHHHHHHHHHc--CCCCCc-eEEEEeeCCCC
Q psy3109           6 IQYLETIRRYIAE--GKKFKR-TIHVSFVPGPV   35 (58)
Q Consensus         6 ~~~leAl~~L~~~--g~~p~r-~I~l~f~~DEE   35 (58)
                      +.+++|++.+.+.  +..|.+ .|.+-+ -|||
T Consensus        57 ~nlieAIr~~l~~~~~~~p~~~~i~L~l-ddee   88 (91)
T PF10850_consen   57 ANLIEAIRQYLEEEYNMDPFRAGIELEL-DDEE   88 (91)
T ss_pred             hhHHHHHHHHHHHHhCCCcchhheEEEE-cccc
Confidence            4678999998755  678887 555544 3555


No 140
>cd01544 PBP1_GalR Ligand-binding domain of DNA transcription repressor GalR which is one of two regulatory proteins involved in galactose transport and metabolism. Ligand-binding domain of DNA transcription repressor GalR which is one of two regulatory proteins involved in galactose transport and metabolism. Transcription of the galactose regulon genes is regulated by Gal iso-repressor (GalS) and Gal repressor (GalR) in different ways, but both repressors recognize the same DNA binding site in the absence of D-galactose. GalR is a dimeric protein like GalS and is exclusively involved in the regulation of galactose permease, the low-affinity galactose transporter. GalS is involved in regulating expression of the high-affinity galactose transporter encoded by the mgl operon. GalS and GalR are members of the LacI-GalR family of transcription regulators and both contain the type I periplasmic binding protein-like fold.  Hence, they are structurally homologous to the periplasmic sugar bindi
Probab=22.09  E-value=2.2e+02  Score=17.92  Aligned_cols=28  Identities=14%  Similarity=0.126  Sum_probs=18.7

Q ss_pred             HHHHHHHHHHHHcCCCCCceEEEEeeCC
Q psy3109           6 IQYLETIRRYIAEGKKFKRTIHVSFVPG   33 (58)
Q Consensus         6 ~~~leAl~~L~~~g~~p~r~I~l~f~~D   33 (58)
                      .+++.+++.+++.|.+.+++|.++-+-+
T Consensus       189 ~~a~g~~~~l~~~g~~vp~di~v~g~d~  216 (270)
T cd01544         189 PMAIGALRALQEAGIKVPEDVSVISFND  216 (270)
T ss_pred             HHHHHHHHHHHHcCCCCCCceEEEEECC
Confidence            3456677888888887666777555533


No 141
>cd01543 PBP1_XylR Ligand-binding domain of DNA transcription repressor specific for xylose (XylR). Ligand-binding domain of DNA transcription repressor specific for xylose (XylR), a member of the LacI-GalR family of bacterial transcription regulators. The ligand-binding domain of XylR is structurally homologous to the periplasmic sugar-binding domain of ABC-type transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor.
Probab=21.53  E-value=2.2e+02  Score=17.78  Aligned_cols=30  Identities=10%  Similarity=-0.013  Sum_probs=19.4

Q ss_pred             HHHHHHHHHHHHcCCCCCceEEEEeeCCCC
Q psy3109           6 IQYLETIRRYIAEGKKFKRTIHVSFVPGPV   35 (58)
Q Consensus         6 ~~~leAl~~L~~~g~~p~r~I~l~f~~DEE   35 (58)
                      ..++.+++.+++.|.+.+.+|.++-+-|.+
T Consensus       181 ~~a~g~~~~l~~~g~~vp~di~vigfd~~~  210 (265)
T cd01543         181 ARARQLLEACRRAGIAVPEEVAVLGVDNDE  210 (265)
T ss_pred             HHHHHHHHHHHHhCCCCCCceEEEeeCCch
Confidence            345667788888887655666655554543


No 142
>cd01575 PBP1_GntR Ligand-binding domain of DNA transcription repressor GntR specific for gluconate, a member of the LacI-GalR family of bacterial transcription regulators. This group represents the ligand-binding domain of DNA transcription repressor GntR specific for gluconate, a member of the LacI-GalR family of bacterial transcription regulators. The ligand-binding domain of GntR is structurally homologous to the periplasmic sugar-binding domain of ABC-type transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding, 
Probab=21.24  E-value=2e+02  Score=17.66  Aligned_cols=25  Identities=12%  Similarity=0.058  Sum_probs=17.4

Q ss_pred             HHHHHHHHHHHcCCCCCceEEEEee
Q psy3109           7 QYLETIRRYIAEGKKFKRTIHVSFV   31 (58)
Q Consensus         7 ~~leAl~~L~~~g~~p~r~I~l~f~   31 (58)
                      .++.+++.+++.|.+++.+|.++-.
T Consensus       188 ~a~~~~~~l~~~g~~~p~di~vig~  212 (268)
T cd01575         188 LALGALFECQRRGISVPEDIAIAGF  212 (268)
T ss_pred             HHHHHHHHHHHhCCCCCcceEEEec
Confidence            4567888899999876666654433


No 143
>PF13662 Toprim_4:  Toprim domain; PDB: 1EQN_E 1DD9_A 3B39_B 1DDE_A.
Probab=21.21  E-value=81  Score=16.94  Aligned_cols=29  Identities=17%  Similarity=0.212  Sum_probs=17.0

Q ss_pred             CceEEEEeeCCCCCCChhhhHHHHHHHHHHhh
Q psy3109          23 KRTIHVSFVPGPVTQNETCGLAAANSVLYLLG   54 (58)
Q Consensus        23 ~r~I~l~f~~DEE~Gg~~~g~~~~~s~~~~~~   54 (58)
                      .+.|.+++-+|++  |. .....+...+...|
T Consensus        46 ~~~Vii~~D~D~~--G~-~~a~~i~~~l~~~g   74 (81)
T PF13662_consen   46 VKEVIIAFDNDKA--GE-KAAQKIAKKLLPLG   74 (81)
T ss_dssp             -SEEEEEEESSHH--HH-HHHHHHHHHHG---
T ss_pred             CceEEEEeCcCHH--HH-HHHHHHHHHHHhhc
Confidence            4789999999997  65 34444444443333


No 144
>PF10587 EF-1_beta_acid:  Eukaryotic elongation factor 1 beta central acidic region;  InterPro: IPR018940 Translation elongation factors are responsible for two main processes during protein synthesis on the ribosome [, , ]. EF1A (or EF-Tu) is responsible for the selection and binding of the cognate aminoacyl-tRNA to the A-site (acceptor site) of the ribosome. EF2 (or EF-G) is responsible for the translocation of the peptidyl-tRNA from the A-site to the P-site (peptidyl-tRNA site) of the ribosome, thereby freeing the A-site for the next aminoacyl-tRNA to bind. Elongation factors are responsible for achieving accuracy of translation and both EF1A and EF2 are remarkably conserved throughout evolution. Elongation factor EF1B (also known as EF-Ts or EF-1beta/gamma/delta) is a nucleotide exchange factor that is required to regenerate EF1A from its inactive form (EF1A-GDP) to its active form (EF1A-GTP). EF1A is then ready to interact with a new aminoacyl-tRNA to begin the cycle again. EF1B is more complex in eukaryotes than in bacteria, and can consist of three subunits: EF1B-alpha (or EF-1beta), EF1B-gamma (or EF-1gamma) and EF1B-beta (or EF-1delta) []. This region is found in the centre of the beta subunits of Elongation factor-1. More information about these proteins can be found at Protein of the Month: Elongation Factors [].
Probab=20.97  E-value=32  Score=16.25  Aligned_cols=7  Identities=14%  Similarity=-0.021  Sum_probs=4.8

Q ss_pred             EeeCCCC
Q psy3109          29 SFVPGPV   35 (58)
Q Consensus        29 ~f~~DEE   35 (58)
                      +|++|+|
T Consensus         1 LFGSddE    7 (28)
T PF10587_consen    1 LFGSDDE    7 (28)
T ss_pred             CCCCccc
Confidence            4778866


No 145
>PF14247 DUF4344:  Domain of unknown function (DUF4344)
Probab=20.90  E-value=2e+02  Score=19.22  Aligned_cols=31  Identities=16%  Similarity=0.219  Sum_probs=23.6

Q ss_pred             HHHHHHHHHHcCCCCCceEEEEeeCCCCCCC
Q psy3109           8 YLETIRRYIAEGKKFKRTIHVSFVPGPVTQN   38 (58)
Q Consensus         8 ~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg   38 (58)
                      .++.+..+....+..+++|.+.+..+-|.|.
T Consensus        21 vlE~~~~~in~~f~LP~~l~i~~~~CGe~nA   51 (220)
T PF14247_consen   21 VLEDVADLINDYFPLPRDLTIRFAECGEDNA   51 (220)
T ss_pred             hHHHHHHHHhhcCCCCCCeEEEEeecCCCCC
Confidence            4677777777889999999999966555443


No 146
>TIGR00756 PPR pentatricopeptide repeat domain (PPR motif). This family has a similar consensus to the TPR domain (tetratricopeptide), pfam pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Probab=20.90  E-value=88  Score=12.98  Aligned_cols=17  Identities=18%  Similarity=0.302  Sum_probs=11.9

Q ss_pred             HHHHHHHHHHHcCCCCC
Q psy3109           7 QYLETIRRYIAEGKKFK   23 (58)
Q Consensus         7 ~~leAl~~L~~~g~~p~   23 (58)
                      .+.+.++.+.+.|.+|.
T Consensus        18 ~a~~~~~~M~~~g~~p~   34 (35)
T TIGR00756        18 EALELFKEMLERGIEPD   34 (35)
T ss_pred             HHHHHHHHHHHcCCCCC
Confidence            35666777788887774


No 147
>cd01540 PBP1_arabinose_binding Periplasmic L-arabinose-binding protein (ABP), a member of a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily. Periplasmic L-arabinose-binding protein (ABP), a member of a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily. ABP is only involved in transport contrary to other related sugar-binding proteins such as the glucose/galactose-binding protein (GGBP) and the ribose-binding protein (RBP), both of which are involved in chemotaxis as well as transport. The periplasmic ABP consists of two alpha/beta globular domains connected by a three-stranded hinge, a Venus flytrap-like domain, which undergoes a transition from an open to a closed conformational state upon ligand binding. Moreover, ABP is homologous to the ligand-binding domain of eukaryotic receptors such as metabotropic glutamate receptor (mGluR) and DNA-binding transcriptional repressors such a
Probab=20.77  E-value=2.1e+02  Score=18.07  Aligned_cols=23  Identities=13%  Similarity=-0.028  Sum_probs=16.7

Q ss_pred             HHHHHHHHHHHcCCCCCceEEEEe
Q psy3109           7 QYLETIRRYIAEGKKFKRTIHVSF   30 (58)
Q Consensus         7 ~~leAl~~L~~~g~~p~r~I~l~f   30 (58)
                      +++.+++.+.+.|..| ++|.++-
T Consensus       205 ~a~g~~~al~~~g~~~-~di~vig  227 (289)
T cd01540         205 TVLGAVRATEQSGIAA-ADVIGVG  227 (289)
T ss_pred             HHHHHHHHHHHcCCCC-cceEEEe
Confidence            4667888888999886 5666543


No 148
>PRK06973 nicotinic acid mononucleotide adenylyltransferase; Provisional
Probab=20.50  E-value=1.3e+02  Score=20.24  Aligned_cols=24  Identities=8%  Similarity=0.144  Sum_probs=18.2

Q ss_pred             HHHHHHHHHcCCCCCceEEEEeeCC
Q psy3109           9 LETIRRYIAEGKKFKRTIHVSFVPG   33 (58)
Q Consensus         9 leAl~~L~~~g~~p~r~I~l~f~~D   33 (58)
                      +..++.|++. +.|...++++.++|
T Consensus       108 idTL~~l~~~-~~p~~~~~fiiG~D  131 (243)
T PRK06973        108 VDTLARWRER-IGPDASLALLIGAD  131 (243)
T ss_pred             HHHHHHHHHH-cCCCCCEEEEEchh
Confidence            4677888776 34667899999888


No 149
>PRK00016 metal-binding heat shock protein; Provisional
Probab=20.46  E-value=1e+02  Score=19.35  Aligned_cols=20  Identities=10%  Similarity=0.127  Sum_probs=14.6

Q ss_pred             HcCCCCCceEEEEeeCCCCC
Q psy3109          17 AEGKKFKRTIHVSFVPGPVT   36 (58)
Q Consensus        17 ~~g~~p~r~I~l~f~~DEE~   36 (58)
                      ..+..+...|.+.|+.|+|+
T Consensus        31 ~~~~~~~~ev~i~~v~~~~i   50 (159)
T PRK00016         31 KLGVQPEAELSIRLVDNEEI   50 (159)
T ss_pred             HcCCCCCeeEEEEEECcHHH
Confidence            34555557889999988875


No 150
>PF14433 SUKH-3:  SUKH-3 immunity protein
Probab=20.36  E-value=89  Score=18.69  Aligned_cols=21  Identities=29%  Similarity=0.194  Sum_probs=15.7

Q ss_pred             HHHHHHHcCCCCCceEEEEee
Q psy3109          11 TIRRYIAEGKKFKRTIHVSFV   31 (58)
Q Consensus        11 Al~~L~~~g~~p~r~I~l~f~   31 (58)
                      +.+.|.+.|+.|.|.|.+...
T Consensus         4 v~~~L~~aGW~~~R~idi~~~   24 (142)
T PF14433_consen    4 VIELLRAAGWYEGRKIDISLW   24 (142)
T ss_pred             HHHHHHHcCCCCCcccCHHHH
Confidence            567788999999888765443


No 151
>PF02705 K_trans:  K+ potassium transporter;  InterPro: IPR003855 This is a family of K+ potassium transporters that are conserved across phyla, having both bacterial (KUP) [], yeast (HAK) [], and plant (AtKT) [] sequences as members.; GO: 0015079 potassium ion transmembrane transporter activity, 0071805 potassium ion transmembrane transport, 0016020 membrane
Probab=20.03  E-value=2.5e+02  Score=21.41  Aligned_cols=34  Identities=12%  Similarity=0.021  Sum_probs=27.4

Q ss_pred             HHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCC
Q psy3109           3 CVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQN   38 (58)
Q Consensus         3 ~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg   38 (58)
                      +.+.+...-+++..+.|+-|+  +.+..++++|.|-
T Consensus       292 A~Isg~FSl~~Qai~Lg~~Pr--~~I~hTS~~~~GQ  325 (534)
T PF02705_consen  292 AVISGAFSLTRQAIQLGYFPR--LKIVHTSEKEEGQ  325 (534)
T ss_pred             HHHHHHHHHHHHHHHhCCCCc--eEEEECCcccCCc
Confidence            456677778888999999886  7888999988654


Done!