Query psy3109
Match_columns 58
No_of_seqs 117 out of 723
Neff 5.9
Searched_HMMs 46136
Date Fri Aug 16 16:41:11 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy3109.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3109hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2275|consensus 99.3 3.7E-12 8E-17 91.1 5.8 49 1-50 130-178 (420)
2 PRK12891 allantoate amidohydro 98.9 3.6E-09 7.7E-14 73.5 4.8 49 1-51 96-144 (414)
3 PF01546 Peptidase_M20: Peptid 98.9 1.6E-08 3.4E-13 62.1 7.0 44 1-45 37-80 (189)
4 TIGR03176 AllC allantoate amid 98.8 4E-09 8.6E-14 73.7 4.4 49 2-52 90-138 (406)
5 PRK07205 hypothetical protein; 98.8 8E-09 1.7E-13 72.2 5.8 45 1-46 117-161 (444)
6 PRK07338 hypothetical protein; 98.8 1.7E-08 3.6E-13 69.3 6.1 46 1-47 130-175 (402)
7 PRK07906 hypothetical protein; 98.8 1.7E-08 3.7E-13 69.9 5.9 46 1-47 106-151 (426)
8 PRK13983 diaminopimelate amino 98.8 1.4E-08 3.1E-13 68.9 5.4 47 1-48 118-164 (400)
9 PRK13013 succinyl-diaminopimel 98.8 1.5E-08 3.3E-13 69.8 5.4 46 1-47 124-169 (427)
10 PRK07318 dipeptidase PepV; Rev 98.7 2.5E-08 5.4E-13 70.3 5.9 43 1-45 119-161 (466)
11 PRK08588 succinyl-diaminopimel 98.7 3.2E-08 6.9E-13 67.3 5.4 39 1-39 101-139 (377)
12 PRK08262 hypothetical protein; 98.7 5E-08 1.1E-12 68.9 6.3 39 1-39 155-193 (486)
13 TIGR01880 Ac-peptdase-euk N-ac 98.7 4.9E-08 1.1E-12 67.0 6.1 45 1-45 113-157 (400)
14 PRK12890 allantoate amidohydro 98.7 3.5E-08 7.5E-13 68.3 4.9 48 1-50 94-141 (414)
15 PRK13009 succinyl-diaminopimel 98.7 7.9E-08 1.7E-12 65.1 6.2 45 1-46 100-144 (375)
16 PRK09104 hypothetical protein; 98.7 6.6E-08 1.4E-12 67.9 5.9 47 1-48 129-175 (464)
17 TIGR01246 dapE_proteo succinyl 98.6 8.3E-08 1.8E-12 65.1 6.0 47 1-48 97-143 (370)
18 TIGR01887 dipeptidaselike dipe 98.6 7.7E-08 1.7E-12 68.2 5.9 45 1-46 107-151 (447)
19 PRK07473 carboxypeptidase; Pro 98.6 1E-07 2.2E-12 65.7 6.2 45 1-46 113-157 (376)
20 PRK13590 putative bifunctional 98.6 5.1E-08 1.1E-12 71.3 4.7 49 1-51 271-319 (591)
21 PRK09133 hypothetical protein; 98.6 8.4E-08 1.8E-12 67.5 5.6 45 1-46 142-186 (472)
22 PRK08596 acetylornithine deace 98.6 9.9E-08 2.1E-12 66.4 5.5 39 1-39 119-157 (421)
23 PRK06156 hypothetical protein; 98.6 1.3E-07 2.9E-12 67.9 6.2 45 1-46 155-199 (520)
24 COG0624 ArgE Acetylornithine d 98.6 1E-07 2.2E-12 65.7 5.4 48 1-49 117-164 (409)
25 PRK08201 hypothetical protein; 98.6 1.3E-07 2.8E-12 66.2 5.8 46 1-47 121-166 (456)
26 TIGR01910 DapE-ArgE acetylorni 98.6 1.3E-07 2.9E-12 64.3 5.6 45 1-46 106-150 (375)
27 PRK09290 allantoate amidohydro 98.6 1.2E-07 2.5E-12 65.8 5.1 37 1-37 93-129 (413)
28 PRK12893 allantoate amidohydro 98.6 1.1E-07 2.4E-12 65.5 4.8 37 1-37 96-132 (412)
29 TIGR01886 dipeptidase dipeptid 98.5 2.2E-07 4.9E-12 65.9 5.6 39 1-39 118-156 (466)
30 PRK06915 acetylornithine deace 98.5 2.5E-07 5.4E-12 63.9 5.7 39 1-39 135-173 (422)
31 PRK06446 hypothetical protein; 98.5 2.7E-07 5.8E-12 64.5 5.9 45 1-47 104-148 (436)
32 PRK06133 glutamate carboxypept 98.5 3.2E-07 6.9E-12 63.8 6.1 39 1-39 137-175 (410)
33 PRK07079 hypothetical protein; 98.5 2.8E-07 6E-12 65.0 5.8 39 1-39 128-167 (469)
34 PRK04443 acetyl-lysine deacety 98.5 1.9E-07 4.1E-12 63.4 4.8 44 1-48 92-135 (348)
35 TIGR01883 PepT-like peptidase 98.5 2.4E-07 5.2E-12 62.6 5.2 38 1-39 101-138 (361)
36 PRK13799 unknown domain/N-carb 98.5 1.6E-07 3.6E-12 68.7 4.6 48 3-52 273-320 (591)
37 PRK06837 acetylornithine deace 98.5 3.5E-07 7.7E-12 63.7 5.6 39 1-39 139-177 (427)
38 TIGR01879 hydantase amidase, h 98.4 3.2E-07 7E-12 63.4 4.6 48 2-51 88-135 (401)
39 PRK05469 peptidase T; Provisio 98.4 4.3E-07 9.3E-12 62.8 4.9 37 1-37 141-177 (408)
40 TIGR01882 peptidase-T peptidas 98.4 4.9E-07 1.1E-11 62.9 5.1 37 1-37 143-179 (410)
41 PRK08554 peptidase; Reviewed 98.4 6.6E-07 1.4E-11 63.2 5.6 41 1-44 104-144 (438)
42 PRK13004 peptidase; Reviewed 98.4 9.2E-07 2E-11 61.0 6.0 38 1-38 111-148 (399)
43 PRK12892 allantoate amidohydro 98.4 5.9E-07 1.3E-11 61.8 4.9 37 1-37 94-130 (412)
44 PRK08652 acetylornithine deace 98.4 7.4E-07 1.6E-11 59.6 5.1 37 1-39 88-124 (347)
45 TIGR01900 dapE-gram_pos succin 98.4 8.2E-07 1.8E-11 61.1 5.3 45 1-45 104-150 (373)
46 PRK08651 succinyl-diaminopimel 98.3 1.1E-06 2.5E-11 59.9 5.1 36 1-39 115-150 (394)
47 PRK13381 peptidase T; Provisio 98.3 1.3E-06 2.9E-11 60.3 5.4 38 1-39 139-176 (404)
48 PRK07522 acetylornithine deace 98.3 1.3E-06 2.9E-11 59.3 5.3 43 1-46 105-147 (385)
49 TIGR01892 AcOrn-deacetyl acety 98.3 1.5E-06 3.3E-11 58.4 5.3 37 1-39 99-135 (364)
50 TIGR03106 trio_M42_hydro hydro 98.3 1.5E-06 3.2E-11 60.6 5.2 38 1-38 184-221 (343)
51 PRK07907 hypothetical protein; 98.2 2.5E-06 5.4E-11 59.8 5.5 42 1-46 125-166 (449)
52 PRK05111 acetylornithine deace 98.2 2.3E-06 4.9E-11 58.2 5.1 37 1-39 112-148 (383)
53 PRK13007 succinyl-diaminopimel 98.2 2.7E-06 5.8E-11 57.2 4.9 41 1-45 96-136 (352)
54 PRK00466 acetyl-lysine deacety 98.2 3.2E-06 6.8E-11 57.3 5.1 35 1-39 93-127 (346)
55 TIGR03320 ygeY M20/DapE family 98.2 2.9E-06 6.3E-11 58.3 4.6 38 1-38 109-146 (395)
56 PRK10199 alkaline phosphatase 98.2 5.4E-06 1.2E-10 58.4 5.7 37 1-39 144-180 (346)
57 TIGR01902 dapE-lys-deAc N-acet 98.1 4.8E-06 1E-10 56.1 4.8 38 1-44 83-120 (336)
58 TIGR03526 selenium_YgeY putati 98.0 1.8E-05 3.9E-10 54.5 5.6 37 1-37 109-145 (395)
59 PF04389 Peptidase_M28: Peptid 98.0 1.9E-05 4.2E-10 48.8 5.1 43 3-46 29-71 (179)
60 PLN02693 IAA-amino acid hydrol 97.9 3.3E-05 7.2E-10 54.8 5.3 36 1-36 137-172 (437)
61 PRK08737 acetylornithine deace 97.9 2.6E-05 5.7E-10 53.8 4.5 36 1-44 103-138 (364)
62 TIGR01891 amidohydrolases amid 97.7 7.1E-05 1.5E-09 51.0 5.1 36 2-37 92-127 (363)
63 TIGR01893 aa-his-dipept aminoa 97.7 5.2E-05 1.1E-09 53.8 4.1 35 1-39 110-144 (477)
64 PRK15026 aminoacyl-histidine d 97.6 0.00013 2.7E-09 52.7 4.8 35 1-39 116-150 (485)
65 PLN02280 IAA-amino acid hydrol 97.5 0.00022 4.7E-09 51.5 4.8 37 1-37 187-223 (478)
66 PRK09961 exoaminopeptidase; Pr 96.9 0.0025 5.5E-08 44.3 5.4 43 1-47 167-209 (344)
67 KOG2195|consensus 96.6 0.0078 1.7E-07 46.0 6.1 39 8-47 381-419 (702)
68 TIGR03107 glu_aminopep glutamy 96.6 0.0058 1.3E-07 42.9 5.0 37 1-39 179-215 (350)
69 PF05343 Peptidase_M42: M42 gl 96.4 0.015 3.2E-07 39.9 6.2 41 2-46 136-176 (292)
70 COG1363 FrvX Cellulase M and r 96.2 0.0076 1.6E-07 42.8 4.0 37 1-39 181-217 (355)
71 COG2234 Iap Predicted aminopep 96.1 0.012 2.6E-07 40.9 4.6 35 3-39 231-265 (435)
72 PRK09864 putative peptidase; P 96.1 0.013 2.8E-07 41.4 4.8 35 1-39 176-210 (356)
73 COG2195 PepD Di- and tripeptid 95.7 0.014 3.1E-07 42.1 3.7 46 1-48 145-191 (414)
74 COG4187 RocB Arginine degradat 95.7 0.026 5.6E-07 42.0 4.9 38 1-39 141-178 (553)
75 KOG2276|consensus 93.7 0.09 2E-06 38.7 3.5 39 1-39 133-171 (473)
76 PRK02256 putative aminopeptida 93.6 0.14 3.1E-06 37.4 4.5 34 2-39 262-295 (462)
77 COG1473 AbgB Metal-dependent a 90.3 0.39 8.4E-06 34.4 3.4 46 5-52 108-153 (392)
78 PF05450 Nicastrin: Nicastrin; 89.7 0.87 1.9E-05 30.5 4.5 34 3-36 27-63 (234)
79 PF02127 Peptidase_M18: Aminop 89.0 0.68 1.5E-05 33.6 3.8 35 5-39 237-275 (432)
80 PRK02813 putative aminopeptida 88.8 0.58 1.3E-05 33.8 3.4 33 2-39 236-268 (428)
81 KOG2194|consensus 85.5 1.1 2.5E-05 35.2 3.5 44 3-47 166-209 (834)
82 PTZ00371 aspartyl aminopeptida 85.1 1.2 2.6E-05 32.5 3.3 38 2-39 253-291 (465)
83 PF06309 Torsin: Torsin; Inte 72.6 20 0.00042 22.3 5.5 49 7-57 36-84 (127)
84 COG1362 LAP4 Aspartyl aminopep 72.0 5.1 0.00011 29.6 3.1 33 6-39 244-276 (437)
85 PF06877 RraB: Regulator of ri 61.8 23 0.0005 20.1 4.0 29 7-35 5-35 (104)
86 COG4882 Predicted aminopeptida 61.6 9.1 0.0002 28.4 2.7 45 4-52 211-257 (486)
87 PF12000 Glyco_trans_4_3: Gkyc 56.7 17 0.00037 23.3 3.1 22 6-29 51-72 (171)
88 PF14136 DUF4303: Domain of un 50.5 47 0.001 20.0 4.3 20 4-23 104-123 (155)
89 PF11181 YflT: Heat induced st 50.4 28 0.0006 20.0 3.1 26 7-33 11-36 (103)
90 KOG0615|consensus 48.8 37 0.0008 25.5 4.1 33 2-35 279-314 (475)
91 PF13812 PPR_3: Pentatricopept 47.3 26 0.00057 15.1 2.2 16 7-22 19-34 (34)
92 COG4821 Uncharacterized protei 43.6 39 0.00084 23.2 3.3 25 7-33 199-223 (243)
93 PF04055 Radical_SAM: Radical 43.3 52 0.0011 18.6 3.5 30 5-35 126-155 (166)
94 PF04114 Gaa1: Gaa1-like, GPI 42.4 33 0.00072 25.4 3.1 33 3-36 37-69 (504)
95 PF14681 UPRTase: Uracil phosp 39.8 19 0.0004 23.3 1.4 30 6-35 135-164 (207)
96 PF05237 MoeZ_MoeB: MoeZ/MoeB 39.5 46 0.001 18.4 2.8 21 4-24 34-54 (84)
97 PF08415 NRPS: Nonribosomal pe 39.0 48 0.001 17.1 2.7 27 6-32 4-30 (58)
98 COG0035 Upp Uracil phosphoribo 38.4 42 0.00091 22.6 2.9 30 5-35 137-166 (210)
99 cd06287 PBP1_LacI_like_8 Ligan 38.0 69 0.0015 20.6 3.8 30 6-35 188-217 (269)
100 COG4703 Uncharacterized protei 37.8 80 0.0017 18.0 4.0 35 2-36 1-45 (74)
101 PF13377 Peripla_BP_3: Peripla 37.3 64 0.0014 18.6 3.3 28 6-33 78-105 (160)
102 PF04028 DUF374: Domain of unk 36.3 78 0.0017 17.6 3.4 25 2-33 43-67 (74)
103 cd06301 PBP1_rhizopine_binding 35.7 66 0.0014 20.1 3.4 27 7-34 194-220 (272)
104 PRK05943 50S ribosomal protein 35.2 60 0.0013 18.5 2.9 21 11-35 16-36 (94)
105 COG2875 CobM Precorrin-4 methy 34.9 79 0.0017 22.0 3.8 45 8-54 171-218 (254)
106 TIGR01481 ccpA catabolite cont 34.7 87 0.0019 20.4 3.9 30 6-35 246-275 (329)
107 PRK11191 RNase E inhibitor pro 34.1 1.2E+02 0.0026 19.0 6.1 31 5-35 11-43 (138)
108 cd06276 PBP1_FucR_like Ligand- 33.5 1.1E+02 0.0023 19.5 4.1 31 6-36 172-202 (247)
109 KOG2170|consensus 33.3 1.6E+02 0.0034 21.4 5.2 48 8-57 94-141 (344)
110 cd06272 PBP1_hexuronate_repres 32.5 93 0.002 19.3 3.7 28 6-33 182-209 (261)
111 PF07708 Tash_PEST: Tash prote 30.8 48 0.001 14.3 1.5 12 24-35 6-17 (19)
112 PF04023 FeoA: FeoA domain; I 30.8 87 0.0019 16.3 3.0 21 10-30 23-43 (74)
113 cd01541 PBP1_AraR Ligand-bindi 30.4 1.2E+02 0.0026 19.0 4.0 29 7-35 194-222 (273)
114 cd01542 PBP1_TreR_like Ligand- 30.1 1.2E+02 0.0026 18.7 3.8 30 6-35 183-212 (259)
115 TIGR03642 cas_csx13 CRISPR-ass 29.2 93 0.002 19.1 3.1 28 8-36 7-34 (124)
116 PF08967 DUF1884: Domain of un 29.0 71 0.0015 18.7 2.5 15 9-23 15-29 (85)
117 PF02780 Transketolase_C: Tran 28.2 55 0.0012 18.9 2.0 32 4-35 20-52 (124)
118 PF02070 NMU: Neuromedin U; I 28.1 20 0.00044 16.5 0.1 13 30-43 1-13 (25)
119 cd06298 PBP1_CcpA_like Ligand- 28.0 1.1E+02 0.0024 18.9 3.4 29 7-35 188-216 (268)
120 KOG1067|consensus 27.8 51 0.0011 25.9 2.1 45 5-51 253-297 (760)
121 PF13738 Pyr_redox_3: Pyridine 27.7 1.3E+02 0.0028 18.1 3.6 31 6-40 7-37 (203)
122 cd06285 PBP1_LacI_like_7 Ligan 27.7 1.3E+02 0.0028 18.8 3.7 27 7-33 186-212 (265)
123 PRK11303 DNA-binding transcrip 26.9 1.3E+02 0.0029 19.5 3.8 29 7-35 249-277 (328)
124 cd03036 ArsC_like Arsenate Red 26.7 77 0.0017 18.3 2.4 19 10-28 79-97 (111)
125 PRK10727 DNA-binding transcrip 26.6 1.3E+02 0.0029 19.8 3.8 29 6-34 247-275 (343)
126 cd00495 Ribosomal_L25_TL5_CTC 26.4 1.1E+02 0.0023 17.2 2.9 21 11-35 15-35 (91)
127 cd06280 PBP1_LacI_like_4 Ligan 26.1 1.7E+02 0.0037 18.2 4.1 28 6-33 182-209 (263)
128 COG1057 NadD Nicotinic acid mo 25.9 82 0.0018 20.6 2.6 26 8-34 86-111 (197)
129 PF11051 Mannosyl_trans3: Mann 25.6 1.4E+02 0.003 20.0 3.8 29 6-36 14-42 (271)
130 PRK09526 lacI lac repressor; R 24.4 1.9E+02 0.0041 18.9 4.2 28 6-33 250-277 (342)
131 PRK14987 gluconate operon tran 23.9 1.7E+02 0.0037 19.1 3.9 30 6-35 249-278 (331)
132 PF00743 FMO-like: Flavin-bind 23.4 89 0.0019 23.2 2.7 32 4-40 9-40 (531)
133 PF13454 NAD_binding_9: FAD-NA 23.2 1.4E+02 0.0031 17.9 3.2 33 4-37 5-37 (156)
134 smart00493 TOPRIM topoisomeras 22.8 1.3E+02 0.0027 15.5 3.2 22 12-35 38-59 (76)
135 PRK10745 trkD potassium transp 22.8 1.7E+02 0.0038 22.7 4.2 34 3-38 303-336 (622)
136 PF13592 HTH_33: Winged helix- 22.7 31 0.00067 18.0 0.1 24 12-35 28-51 (60)
137 TIGR00731 ctc_TL5 ribosomal pr 22.6 1.2E+02 0.0026 19.3 2.9 23 10-36 12-34 (176)
138 KOG1164|consensus 22.5 1E+02 0.0022 20.7 2.6 19 4-22 127-145 (322)
139 PF10850 DUF2653: Protein of u 22.3 1.5E+02 0.0033 17.5 3.1 29 6-35 57-88 (91)
140 cd01544 PBP1_GalR Ligand-bindi 22.1 2.2E+02 0.0047 17.9 4.1 28 6-33 189-216 (270)
141 cd01543 PBP1_XylR Ligand-bindi 21.5 2.2E+02 0.0047 17.8 4.0 30 6-35 181-210 (265)
142 cd01575 PBP1_GntR Ligand-bindi 21.2 2E+02 0.0044 17.7 3.7 25 7-31 188-212 (268)
143 PF13662 Toprim_4: Toprim doma 21.2 81 0.0018 16.9 1.7 29 23-54 46-74 (81)
144 PF10587 EF-1_beta_acid: Eukar 21.0 32 0.00069 16.2 -0.0 7 29-35 1-7 (28)
145 PF14247 DUF4344: Domain of un 20.9 2E+02 0.0044 19.2 3.8 31 8-38 21-51 (220)
146 TIGR00756 PPR pentatricopeptid 20.9 88 0.0019 13.0 2.3 17 7-23 18-34 (35)
147 cd01540 PBP1_arabinose_binding 20.8 2.1E+02 0.0045 18.1 3.7 23 7-30 205-227 (289)
148 PRK06973 nicotinic acid mononu 20.5 1.3E+02 0.0027 20.2 2.8 24 9-33 108-131 (243)
149 PRK00016 metal-binding heat sh 20.5 1E+02 0.0022 19.3 2.2 20 17-36 31-50 (159)
150 PF14433 SUKH-3: SUKH-3 immuni 20.4 89 0.0019 18.7 1.9 21 11-31 4-24 (142)
151 PF02705 K_trans: K+ potassium 20.0 2.5E+02 0.0054 21.4 4.4 34 3-38 292-325 (534)
No 1
>KOG2275|consensus
Probab=99.31 E-value=3.7e-12 Score=91.06 Aligned_cols=49 Identities=47% Similarity=0.694 Sum_probs=44.9
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANSVL 50 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s~~ 50 (58)
|||.++|+++|+|.|+..|++|+|||.++|++|||+||. .|+.-+++.+
T Consensus 130 ~K~~~va~leAir~L~~~g~kp~Rti~lsfvpDEEi~G~-~Gm~~fa~~~ 178 (420)
T KOG2275|consen 130 MKCVGVAYLEAIRNLKASGFKPKRTIHLSFVPDEEIGGH-IGMKEFAKTE 178 (420)
T ss_pred hHhHHHHHHHHHHHHHhcCCCcCceEEEEecCchhccCc-chHHHHhhhh
Confidence 899999999999999999999999999999999999997 5777777644
No 2
>PRK12891 allantoate amidohydrolase; Reviewed
Probab=98.86 E-value=3.6e-09 Score=73.51 Aligned_cols=49 Identities=14% Similarity=0.085 Sum_probs=43.2
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHHHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANSVLY 51 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s~~~ 51 (58)
||+++++.++|+++|++.|.+|+++|.+++++|||.|+ +|..+.||+.+
T Consensus 96 ~k~Gv~a~l~a~~~l~~~~~~~~~~i~v~~~~dEE~~~--f~~~~~Gs~~~ 144 (414)
T PRK12891 96 GIYGVLGGLEVVRALNDAGIETERPVDVVIWTNEEGSR--FAPSMVGSGVF 144 (414)
T ss_pred chhhHHHHHHHHHHHHHcCCCCCCCeEEEEecccccCc--CCcccccHHHH
Confidence 89999999999999999999999999999999999653 57777777654
No 3
>PF01546 Peptidase_M20: Peptidase family M20/M25/M40 This family only corresponds to M20 family; InterPro: IPR002933 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of proteins contains the metallopeptidases and non-peptidase homologues (amidohydrolases) that belong to the MEROPS peptidase family M20 (clan MH) []. The peptidases of this clan have two catalytic zinc ions at the active site, bound by His/Asp, Asp, Glu, Asp/Glu and His. The catalysed reaction involves the release of an N-terminal amino acid, usually neutral or hydrophobic, from a polypeptide []. The peptidase M20 family has four sub-families: M20A - type example, glutamate carboxypeptidase from Pseudomonas sp. RS16 (P06621 from SWISSPROT) M20B - type example, peptidase T from Escherichia coli (P29745 from SWISSPROT) M20C - type example, X-His dipeptidase from E. coli (P15288 from SWISSPROT) M20D - type example, carboxypeptidase Ss1 from Sulfolobus solfataricus (P80092 from SWISSPROT) ; GO: 0016787 hydrolase activity, 0008152 metabolic process; PDB: 3T68_A 3T6M_A 2F8H_A 3GB0_A 3IO1_B 2ZOF_A 2ZOG_B 3MRU_B 3N5F_A 1Z2L_B ....
Probab=98.85 E-value=1.6e-08 Score=62.08 Aligned_cols=44 Identities=18% Similarity=0.180 Sum_probs=39.6
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAA 45 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~ 45 (58)
||+++++.+.|++.|++.+..++++|.++|++|||+|+. .|...
T Consensus 37 ~k~~~~~~l~a~~~l~~~~~~~~~~i~~~~~~~EE~g~~-~g~~~ 80 (189)
T PF01546_consen 37 MKGGIAAMLAALKALKESGDDLPGNIIFLFTPDEEIGSI-GGAKH 80 (189)
T ss_dssp THHHHHHHHHHHHHHHHTTTTCSSEEEEEEESTCCGTST-THHHH
T ss_pred CcccHHHHHHHHHHHHhccccccccccccccccccCCCc-chhhh
Confidence 799999999999999999999999999999999999986 35443
No 4
>TIGR03176 AllC allantoate amidohydrolase. This enzyme catalyzes the breakdown of allantoate, first to ureidoglycine by hydrolysis and then decarboxylation of one of the two equivalent ureido groups. Ureidoglycine then spontaneously exchanges ammonia for water resulting in ureidoglycolate. This enzyme is an alternative to allantoicase (3.5.3.4) which releases urea.
Probab=98.84 E-value=4e-09 Score=73.65 Aligned_cols=49 Identities=8% Similarity=-0.110 Sum_probs=45.5
Q ss_pred hHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHHHHHH
Q psy3109 2 KCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANSVLYL 52 (58)
Q Consensus 2 K~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s~~~~ 52 (58)
+.++++.++|++.|+++|.+|+++|.++++.+|| |.+||.++++|+.+.
T Consensus 90 ~~Gv~~~le~~~~l~~~~~~~~~~i~vi~~~~EE--g~rf~~~~~Gs~~~~ 138 (406)
T TIGR03176 90 QFGALAAWLAVDYLKEKYGAPLRTVEVLSMAEEE--GSRFPYVFWGSKNIF 138 (406)
T ss_pred hhhHHHHHHHHHHHHHcCCCCCCCeEEEEecccc--CccCCcccccHHHHh
Confidence 5789999999999999999999999999999999 556999999999986
No 5
>PRK07205 hypothetical protein; Provisional
Probab=98.83 E-value=8e-09 Score=72.20 Aligned_cols=45 Identities=18% Similarity=0.203 Sum_probs=40.1
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAA 46 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~ 46 (58)
||+++++.+.|++.|++.|..|+++|.++|++|||+|+. +...+.
T Consensus 117 mKgglaa~l~Al~~l~~~~~~~~~~i~l~~~~dEE~g~~-g~~~~~ 161 (444)
T PRK07205 117 DKGPSMAALYAVKALLDAGVQFNKRIRFIFGTDEETLWR-CMNRYN 161 (444)
T ss_pred CcHHHHHHHHHHHHHHHcCCCCCCcEEEEEECCcccCcc-cHHHHH
Confidence 899999999999999999998999999999999999885 444443
No 6
>PRK07338 hypothetical protein; Provisional
Probab=98.79 E-value=1.7e-08 Score=69.29 Aligned_cols=46 Identities=13% Similarity=-0.010 Sum_probs=40.0
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAAN 47 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~ 47 (58)
||+++++.++|++.|++.+..++++|.++|++|||+|+. ++..++.
T Consensus 130 mKgg~aa~l~a~~~l~~~~~~~~~~i~~~~~~dEE~g~~-g~~~~~~ 175 (402)
T PRK07338 130 MKGGIVVMLAALLAFERSPLADKLGYDVLINPDEEIGSP-ASAPLLA 175 (402)
T ss_pred hhHHHHHHHHHHHHHHhcCCCCCCCEEEEEECCcccCCh-hhHHHHH
Confidence 899999999999999998888889999999999999885 4544443
No 7
>PRK07906 hypothetical protein; Provisional
Probab=98.78 E-value=1.7e-08 Score=69.91 Aligned_cols=46 Identities=20% Similarity=0.252 Sum_probs=41.1
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAAN 47 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~ 47 (58)
||+++++.++|++.|++.+..|+++|.++|++|||+||. .|..++.
T Consensus 106 ~Kg~~a~~l~a~~~l~~~~~~~~~~i~~~~~~dEE~g~~-~g~~~l~ 151 (426)
T PRK07906 106 MKDMDAMMLAVVRHLARTGRRPPRDLVFAFVADEEAGGT-YGAHWLV 151 (426)
T ss_pred cchHHHHHHHHHHHHHHcCCCCCccEEEEEecCcccchh-hhHHHHH
Confidence 899999999999999999999999999999999999886 4655553
No 8
>PRK13983 diaminopimelate aminotransferase; Provisional
Probab=98.77 E-value=1.4e-08 Score=68.92 Aligned_cols=47 Identities=15% Similarity=0.130 Sum_probs=41.6
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANS 48 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s 48 (58)
||+++++.++|++.|++.+..|+++|.++|++|||+|+. +|..++..
T Consensus 118 ~K~g~~a~l~a~~~l~~~~~~~~~~v~~~~~~dEE~g~~-~g~~~~~~ 164 (400)
T PRK13983 118 NGQGIVSSLLALKALMDLGIRPKYNLGLAFVSDEETGSK-YGIQYLLK 164 (400)
T ss_pred ccchHHHHHHHHHHHHHhCCCCCCcEEEEEEeccccCCc-ccHHHHHh
Confidence 799999999999999999988999999999999999886 56665554
No 9
>PRK13013 succinyl-diaminopimelate desuccinylase; Reviewed
Probab=98.77 E-value=1.5e-08 Score=69.78 Aligned_cols=46 Identities=17% Similarity=0.079 Sum_probs=40.7
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAAN 47 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~ 47 (58)
||+++++.+.|++.|++.+..++++|.++|++|||+||. +|..++.
T Consensus 124 ~Kg~~aa~l~a~~~l~~~~~~~~~~v~~~~~~dEE~g~~-~g~~~l~ 169 (427)
T PRK13013 124 MKGGLAASIIAAEAFLAVYPDFAGSIEISGTADEESGGF-GGVAYLA 169 (427)
T ss_pred cchHHHHHHHHHHHHHHhCCCCCccEEEEEEeccccCCh-hHHHHHH
Confidence 899999999999999999888889999999999999886 5655544
No 10
>PRK07318 dipeptidase PepV; Reviewed
Probab=98.74 E-value=2.5e-08 Score=70.30 Aligned_cols=43 Identities=5% Similarity=-0.055 Sum_probs=38.9
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAA 45 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~ 45 (58)
||+++++.+.|++.|++.|+.|+++|.++|++|||+|+. |..+
T Consensus 119 mKgg~aa~l~Al~~l~~~g~~~~~~i~l~~~~DEE~g~~--G~~~ 161 (466)
T PRK07318 119 DKGPTMAAYYALKIIKELGLPLSKKVRFIVGTDEESGWK--CMDY 161 (466)
T ss_pred CcHHHHHHHHHHHHHHHcCCCCCccEEEEEEcccccCch--hHHH
Confidence 899999999999999999988889999999999999874 5444
No 11
>PRK08588 succinyl-diaminopimelate desuccinylase; Reviewed
Probab=98.71 E-value=3.2e-08 Score=67.28 Aligned_cols=39 Identities=15% Similarity=0.110 Sum_probs=36.7
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
||+++++.+.|++.|.+.+..|+++|.++|++|||+|+.
T Consensus 101 ~Kgg~aa~l~a~~~l~~~~~~~~~~i~l~~~~dEE~g~~ 139 (377)
T PRK08588 101 MKSGLAALVIAMIELKEQGQLLNGTIRLLATAGEEVGEL 139 (377)
T ss_pred ccchHHHHHHHHHHHHHcCCCCCCcEEEEEEcccccCch
Confidence 899999999999999999988999999999999998864
No 12
>PRK08262 hypothetical protein; Provisional
Probab=98.70 E-value=5e-08 Score=68.89 Aligned_cols=39 Identities=23% Similarity=0.186 Sum_probs=37.1
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
||+++++.+.|++.|++.+..|+++|.++|.+|||+||.
T Consensus 155 ~Kg~~aa~L~A~~~l~~~~~~l~~~I~llf~~dEE~g~~ 193 (486)
T PRK08262 155 DKGSLVAILEAAEALLAQGFQPRRTIYLAFGHDEEVGGL 193 (486)
T ss_pred cchhHHHHHHHHHHHHHcCCCCCCeEEEEEecccccCCc
Confidence 899999999999999999988889999999999999885
No 13
>TIGR01880 Ac-peptdase-euk N-acyl-L-amino-acid amidohydrolase. This model represents a family of eukaryotic N-acyl-L-amino-acid amidohydrolases active on fatty acid and acetyl amides of L-amino acids.
Probab=98.70 E-value=4.9e-08 Score=67.05 Aligned_cols=45 Identities=51% Similarity=0.759 Sum_probs=39.2
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAA 45 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~ 45 (58)
||+++++.+.|++.|++.+..|+++|.++|++|||+||..+...+
T Consensus 113 ~K~~~aa~l~a~~~l~~~~~~~~~~v~l~~~~dEE~g~~~G~~~~ 157 (400)
T TIGR01880 113 MKCVGVQYLEAVRNLKASGFKFKRTIHISFVPDEEIGGHDGMEKF 157 (400)
T ss_pred ccHHHHHHHHHHHHHHHcCCCCCceEEEEEeCCcccCcHhHHHHH
Confidence 899999999999999999988899999999999999875333333
No 14
>PRK12890 allantoate amidohydrolase; Reviewed
Probab=98.67 E-value=3.5e-08 Score=68.27 Aligned_cols=48 Identities=13% Similarity=-0.022 Sum_probs=40.2
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANSVL 50 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s~~ 50 (58)
||+++++.++|+++|++.+..|+++|.+++++|||.|+ +|..+.+++.
T Consensus 94 ~~~g~aa~l~a~~~l~~~~~~~~~~i~~~~~~dEE~~~--~~~~~~G~~~ 141 (414)
T PRK12890 94 GILGVLAGLEVVAALREAGIRPPHPLEVIAFTNEEGVR--FGPSMIGSRA 141 (414)
T ss_pred CHHHHHHHHHHHHHHHHcCCCCCCCeEEEEEecccccc--cCCccccHHH
Confidence 79999999999999999998899999999999999755 3444455543
No 15
>PRK13009 succinyl-diaminopimelate desuccinylase; Reviewed
Probab=98.66 E-value=7.9e-08 Score=65.07 Aligned_cols=45 Identities=16% Similarity=0.158 Sum_probs=38.8
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAA 46 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~ 46 (58)
||+++++.+.|+++|++.+..++++|+++|++|||.|+. .|...+
T Consensus 100 ~Kgg~aa~l~a~~~l~~~~~~~~~~i~~~~~~~EE~~~~-~G~~~~ 144 (375)
T PRK13009 100 MKGSLAAFVVAAERFVAAHPDHKGSIAFLITSDEEGPAI-NGTVKV 144 (375)
T ss_pred ChHHHHHHHHHHHHHHHhcCCCCceEEEEEEeecccccc-cCHHHH
Confidence 899999999999999999888889999999999998764 344433
No 16
>PRK09104 hypothetical protein; Validated
Probab=98.65 E-value=6.6e-08 Score=67.93 Aligned_cols=47 Identities=11% Similarity=0.072 Sum_probs=41.1
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANS 48 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s 48 (58)
||+++++.++|+++|++.+..++.+|.++|++|||+||. ++..++.+
T Consensus 129 ~Kg~laa~l~a~~~l~~~~~~~~~~i~~~~~~dEE~g~~-g~~~~l~~ 175 (464)
T PRK09104 129 DKGQLMTFVEACRAWKAVTGSLPVRVTILFEGEEESGSP-SLVPFLEA 175 (464)
T ss_pred CcHHHHHHHHHHHHHHHhcCCCCCcEEEEEECccccCCc-cHHHHHHh
Confidence 899999999999999998766777899999999999986 66666654
No 17
>TIGR01246 dapE_proteo succinyl-diaminopimelate desuccinylase, proteobacterial clade. This model describes a proteobacterial subset of succinyl-diaminopimelate desuccinylases. An experimentally confirmed Gram-positive lineage succinyl-diaminopimelate desuccinylase has been described for Corynebacterium glutamicum, and a neighbor-joining tree shows the seed members, SP:Q59284, and putative archaeal members such as TrEMBL:O58003 in a single clade. However, the archaeal members differ substantially, share a number of motifs with acetylornithine deacetylases rather than succinyl-diaminopimelate desuccinylases, and are not taken as trusted examples of succinyl-diaminopimelate desuccinylases. This model is limited to proteobacterial members for this reason.
Probab=98.65 E-value=8.3e-08 Score=65.11 Aligned_cols=47 Identities=13% Similarity=0.150 Sum_probs=39.6
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANS 48 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s 48 (58)
||+++++++.|++.|++.+..++.+|.++|++|||+|+. .|...+..
T Consensus 97 ~Kgg~a~~l~a~~~l~~~~~~~~~~v~~~~~~dEE~~~~-~G~~~~~~ 143 (370)
T TIGR01246 97 MKGSLAAFIVAAERFVKKNPDHKGSISLLITSDEEGTAI-DGTKKVVE 143 (370)
T ss_pred chHHHHHHHHHHHHHHHhcCCCCCcEEEEEEeccccCCC-cCHHHHHH
Confidence 899999999999999999877788999999999998765 35544433
No 18
>TIGR01887 dipeptidaselike dipeptidase, putative. This model represents a clade of probable zinc dipeptidases, closely related to the characterized non-specific dipeptidase, PepV. Many enzymes in this clade have been given names including the terms "Xaa-His" and "carnosinase" due to the early mis-characterization of the Lactobacillus delbrueckii PepV enzyme. These names are likely too specific.
Probab=98.64 E-value=7.7e-08 Score=68.19 Aligned_cols=45 Identities=22% Similarity=0.084 Sum_probs=39.8
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAA 46 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~ 46 (58)
||+++++.+.|++.|++.+..|+++|.++|+.|||+|+. +...++
T Consensus 107 ~KG~laa~l~a~~~l~~~~~~~~~~i~~~~~~dEE~g~~-g~~~~l 151 (447)
T TIGR01887 107 DKGPTIAALYAMKILKELGLKLKKKIRFIFGTDEETGWA-CIDYYF 151 (447)
T ss_pred CcHHHHHHHHHHHHHHHcCCCCCCcEEEEEECCcccCcH-hHHHHH
Confidence 899999999999999999998899999999999999875 444444
No 19
>PRK07473 carboxypeptidase; Provisional
Probab=98.63 E-value=1e-07 Score=65.73 Aligned_cols=45 Identities=24% Similarity=0.191 Sum_probs=38.8
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAA 46 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~ 46 (58)
||+++++.+.|++.|++.+..++.+|.++|++|||+|+. +...++
T Consensus 113 ~Kgglaa~l~A~~~l~~~~~~~~~~v~~~~~~dEE~g~~-g~~~~~ 157 (376)
T PRK07473 113 MKGGNYLALEAIRQLARAGITTPLPITVLFTPDEEVGTP-STRDLI 157 (376)
T ss_pred chHHHHHHHHHHHHHHHcCCCCCCCEEEEEeCCcccCCc-cHHHHH
Confidence 899999999999999999877778899999999999885 444443
No 20
>PRK13590 putative bifunctional OHCU decarboxylase/allantoate amidohydrolase; Provisional
Probab=98.62 E-value=5.1e-08 Score=71.31 Aligned_cols=49 Identities=8% Similarity=-0.020 Sum_probs=44.4
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHHHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANSVLY 51 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s~~~ 51 (58)
+|.++++++++++.|++.|.+|+++|.++++.||| |.+|+.++.||+.+
T Consensus 271 G~~Gv~a~lea~~~l~~~~~~~~~~i~vv~~~~EE--g~rF~~~~~GS~~~ 319 (591)
T PRK13590 271 GRLGIFVPMACVRELHRQGRRLPFGLEVVGFAEEE--GQRYKATFLGSGAL 319 (591)
T ss_pred cHHHHHHHHHHHHHHHHcCCCCCCCeEEEEecCCc--cccCCccccchHHH
Confidence 47899999999999999999999999999999999 55688889999864
No 21
>PRK09133 hypothetical protein; Provisional
Probab=98.62 E-value=8.4e-08 Score=67.53 Aligned_cols=45 Identities=24% Similarity=0.177 Sum_probs=38.5
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAA 46 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~ 46 (58)
||+++++.++|++.|++.+..|+++|.++|++|||+++. .|...+
T Consensus 142 ~Kg~~aa~l~a~~~l~~~~~~~~~~i~~~~~~dEE~~g~-~G~~~l 186 (472)
T PRK09133 142 DKADAAIWVATLIRLKREGFKPKRDIILALTGDEEGTPM-NGVAWL 186 (472)
T ss_pred chHHHHHHHHHHHHHHhcCCCCCCCEEEEEECccccCcc-chHHHH
Confidence 899999999999999999988999999999999995443 354444
No 22
>PRK08596 acetylornithine deacetylase; Validated
Probab=98.60 E-value=9.9e-08 Score=66.36 Aligned_cols=39 Identities=18% Similarity=0.043 Sum_probs=36.9
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
||+++++.+.|+++|++.+..|+++|.++|++|||+|+.
T Consensus 119 ~Kgg~a~~l~a~~~l~~~~~~~~~~v~~~~~~dEE~g~~ 157 (421)
T PRK08596 119 MKGGLAGALFAIQLLHEAGIELPGDLIFQSVIGEEVGEA 157 (421)
T ss_pred cchHHHHHHHHHHHHHHcCCCCCCcEEEEEEeccccCCc
Confidence 899999999999999999988889999999999999874
No 23
>PRK06156 hypothetical protein; Provisional
Probab=98.59 E-value=1.3e-07 Score=67.88 Aligned_cols=45 Identities=13% Similarity=-0.003 Sum_probs=39.6
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAA 46 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~ 46 (58)
||+++++.++|++.|++.+.+|+++|.++|++|||+|+. +...+.
T Consensus 155 ~Kgg~a~~l~a~~~l~~~~~~~~~~i~~~~~~dEE~g~~-G~~~~~ 199 (520)
T PRK06156 155 DKGAIVTALYAMKAIKDSGLPLARRIELLVYTTEETDGD-PLKYYL 199 (520)
T ss_pred chHHHHHHHHHHHHHHHcCCCCCceEEEEEecccccCch-hHHHHH
Confidence 899999999999999999998889999999999999885 444333
No 24
>COG0624 ArgE Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases [Amino acid transport and metabolism]
Probab=98.59 E-value=1e-07 Score=65.66 Aligned_cols=48 Identities=17% Similarity=0.103 Sum_probs=42.8
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANSV 49 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s~ 49 (58)
||+++++.+.|++.+.+.|..++++|.++|++|||+||. ++..++...
T Consensus 117 ~KG~~~a~l~A~~~l~~~~~~~~~~v~~~~~~dEE~g~~-~~~~~~~~~ 164 (409)
T COG0624 117 MKGGLAAALYALSALKAAGGELPGDVRLLFTADEESGGA-GGKAYLEEG 164 (409)
T ss_pred cchHHHHHHHHHHHHHHhCCCCCeEEEEEEEeccccCCc-chHHHHHhc
Confidence 899999999999999999888999999999999999996 666665544
No 25
>PRK08201 hypothetical protein; Provisional
Probab=98.59 E-value=1.3e-07 Score=66.24 Aligned_cols=46 Identities=4% Similarity=-0.105 Sum_probs=39.3
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAAN 47 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~ 47 (58)
||+++++.++|+++|++.+..++++|.++|++|||+|+. +...++.
T Consensus 121 mKgglaa~l~a~~~l~~~~~~~~~~i~~~~~~dEE~g~~-g~~~~l~ 166 (456)
T PRK08201 121 DKGQVFMHLKAVEALLKVEGTLPVNVKFCIEGEEEIGSP-NLDSFVE 166 (456)
T ss_pred CcHHHHHHHHHHHHHHHhcCCCCCCEEEEEEcccccCCc-cHHHHHH
Confidence 899999999999999887666778999999999999986 5555543
No 26
>TIGR01910 DapE-ArgE acetylornithine deacetylase or succinyl-diaminopimelate desuccinylase. This group of sequences contains annotations for both acetylornithine deacetylase and succinyl-diaminopimelate desuccinylase, but does not contain any members with experimental characterization. Bacillus, Staphylococcus and Sulfolobus species contain multiple hits to this subfamily and each may have a separate activity. Determining which is which must await further laboratory research.
Probab=98.58 E-value=1.3e-07 Score=64.31 Aligned_cols=45 Identities=18% Similarity=0.051 Sum_probs=39.1
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAA 46 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~ 46 (58)
||++++++++|++.|++.+..++++|.++|++|||+|+. +...++
T Consensus 106 ~k~~~a~~l~a~~~l~~~~~~~~~~i~~~~~~~EE~g~~-G~~~~~ 150 (375)
T TIGR01910 106 MKGGLVALLYALKAIREAGIKPNGNIILQSVVDEESGEA-GTLYLL 150 (375)
T ss_pred cchHHHHHHHHHHHHHHcCCCCCccEEEEEEcCcccCch-hHHHHH
Confidence 799999999999999999888889999999999999875 444443
No 27
>PRK09290 allantoate amidohydrolase; Reviewed
Probab=98.57 E-value=1.2e-07 Score=65.76 Aligned_cols=37 Identities=19% Similarity=0.115 Sum_probs=35.1
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCC
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQ 37 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~G 37 (58)
||+++++.++|+++|++.+..|+++|.+++++|||+|
T Consensus 93 ~k~g~aa~l~a~~~l~~~~~~~~~~i~~~~~~dEE~g 129 (413)
T PRK09290 93 GPLGVLAGLEAVRTLNERGIRPRRPIEVVAFTNEEGS 129 (413)
T ss_pred CHHHHHHHHHHHHHHHHcCCCCCCCeEEEEEcCCccc
Confidence 6999999999999999999888999999999999986
No 28
>PRK12893 allantoate amidohydrolase; Reviewed
Probab=98.56 E-value=1.1e-07 Score=65.53 Aligned_cols=37 Identities=19% Similarity=0.103 Sum_probs=34.9
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCC
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQ 37 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~G 37 (58)
+|+++++.+.|++.|++.+..++++|.++|++|||+|
T Consensus 96 gk~gvaa~l~a~~~l~~~~~~~~~~v~~~~~~dEE~g 132 (412)
T PRK12893 96 GALGVLAALEVVRTLNDAGIRTRRPIEVVSWTNEEGA 132 (412)
T ss_pred chhhHHHHHHHHHHHHHcCCCCCCCeEEEEEcccccc
Confidence 5899999999999999999888999999999999975
No 29
>TIGR01886 dipeptidase dipeptidase PepV. This model represents a small clade of dipeptidase enzymes which are members of the larger M25 subfamily of metalloproteases. Two characterized enzymes are included in the seed. One, from Lactococcus lactis has been shown to act on a wide range of dipeptides, but not larger peptides. The enzyme from Lactobacillus delbrueckii was originally characterized as a Xaa-His dipeptidase, specifically a carnosinase (beta-Ala-His) by complementation of an E. coli mutant. Further study, including the crystallization of the enzyme, has shown it to also be a non-specific dipeptidase. This group also includes enzymes from Streptococcus and Enterococcus.
Probab=98.52 E-value=2.2e-07 Score=65.87 Aligned_cols=39 Identities=10% Similarity=-0.077 Sum_probs=37.2
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
||+++++.+.|+++|++.+..|+++|.++|+.|||+|+.
T Consensus 118 ~Kg~~~a~l~a~~~l~~~~~~~~~~i~~~~~~dEE~g~~ 156 (466)
T TIGR01886 118 DKGPSLAAYYAMKILKELGLPPSKKIRFVVGTNEETGWV 156 (466)
T ss_pred cchHHHHHHHHHHHHHHhCCCCCCCEEEEEECccccCcc
Confidence 899999999999999999998999999999999999875
No 30
>PRK06915 acetylornithine deacetylase; Validated
Probab=98.52 E-value=2.5e-07 Score=63.95 Aligned_cols=39 Identities=18% Similarity=0.178 Sum_probs=36.4
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
||+++++.+.|++.|++.+..++.+|.++|++|||+|+.
T Consensus 135 ~Kgg~aa~l~a~~~l~~~~~~~~~~v~~~~~~dEE~g~~ 173 (422)
T PRK06915 135 MKGGNVALLLAMEALIESGIELKGDVIFQSVIEEESGGA 173 (422)
T ss_pred chHHHHHHHHHHHHHHHcCCCCCCcEEEEEecccccCCc
Confidence 899999999999999999887888999999999999875
No 31
>PRK06446 hypothetical protein; Provisional
Probab=98.52 E-value=2.7e-07 Score=64.54 Aligned_cols=45 Identities=11% Similarity=-0.009 Sum_probs=38.0
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAAN 47 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~ 47 (58)
||+++++.+.|+++|++.+ .|+.+|.++|++|||+|+. +...++.
T Consensus 104 mKgglaa~l~A~~~l~~~~-~~~~~i~~~~~~dEE~g~~-g~~~~l~ 148 (436)
T PRK06446 104 NKGTLMARLFAIKHLIDKH-KLNVNVKFLYEGEEEIGSP-NLEDFIE 148 (436)
T ss_pred CcHHHHHHHHHHHHHHHcC-CCCCCEEEEEEcccccCCH-hHHHHHH
Confidence 8999999999999998776 5778999999999999986 5444443
No 32
>PRK06133 glutamate carboxypeptidase; Reviewed
Probab=98.51 E-value=3.2e-07 Score=63.75 Aligned_cols=39 Identities=23% Similarity=0.079 Sum_probs=36.4
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
||+++++.+.|++.|++.+..|+.+|.++|++|||+|+.
T Consensus 137 ~kgg~a~~l~a~~~l~~~~~~~~~~i~~~~~~dEE~g~~ 175 (410)
T PRK06133 137 DKGGVAVILHALKILQQLGFKDYGTLTVLFNPDEETGSP 175 (410)
T ss_pred chHHHHHHHHHHHHHHHcCCCCCCCEEEEEECCcccCCc
Confidence 899999999999999999888888999999999999874
No 33
>PRK07079 hypothetical protein; Provisional
Probab=98.51 E-value=2.8e-07 Score=64.99 Aligned_cols=39 Identities=13% Similarity=0.131 Sum_probs=35.5
Q ss_pred ChHHHHHHHHHHHHHHHc-CCCCCceEEEEeeCCCCCCCh
Q psy3109 1 MKCVGIQYLETIRRYIAE-GKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~-g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
||+++++.+.|++.|++. +.+++++|.++|++|||+||.
T Consensus 128 mKgg~aa~l~A~~~l~~~~~~~~~~~i~~~~~~dEE~g~~ 167 (469)
T PRK07079 128 NKGQHTINLAALEQVLAARGGRLGFNVKLLIEMGEEIGSP 167 (469)
T ss_pred CcHHHHHHHHHHHHHHHhcCCCCCCCEEEEEECccccCCc
Confidence 899999999999999764 578889999999999999885
No 34
>PRK04443 acetyl-lysine deacetylase; Provisional
Probab=98.51 E-value=1.9e-07 Score=63.36 Aligned_cols=44 Identities=7% Similarity=-0.034 Sum_probs=37.3
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANS 48 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s 48 (58)
||+++++.++|++.| +..++++|.+++++|||+|+. ++..++.+
T Consensus 92 ~Kg~~aa~l~A~~~l---~~~~~~~i~~~~~~dEE~g~~-~~~~~l~~ 135 (348)
T PRK04443 92 AKGPLAAFAAAAARL---EALVRARVSFVGAVEEEAPSS-GGARLVAD 135 (348)
T ss_pred cccHHHHHHHHHHHh---cccCCCCEEEEEEcccccCCh-hHHHHHHh
Confidence 899999999999998 456888999999999999986 56555543
No 35
>TIGR01883 PepT-like peptidase T-like protein. This model represents a clade of enzymes closely related to Peptidase T, an aminotripeptidase found in bacteria. This clade consists of gram positive bacteria of which several additionally contain a Peptidase T gene.
Probab=98.51 E-value=2.4e-07 Score=62.56 Aligned_cols=38 Identities=18% Similarity=-0.018 Sum_probs=34.6
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
||+++++++++++.|++.+ .|+++|.++|++|||.|+.
T Consensus 101 ~k~g~a~~l~~~~~l~~~~-~~~~~v~~~~~~~EE~g~~ 138 (361)
T TIGR01883 101 DKAGVAAMLEAMDVLSTEE-TPHGTIEFIFTVKEELGLI 138 (361)
T ss_pred ccHHHHHHHHHHHHHHhcC-CCCCCEEEEEEcccccCch
Confidence 8999999999999999876 5778999999999998764
No 36
>PRK13799 unknown domain/N-carbamoyl-L-amino acid hydrolase fusion protein; Provisional
Probab=98.50 E-value=1.6e-07 Score=68.74 Aligned_cols=48 Identities=6% Similarity=-0.015 Sum_probs=44.7
Q ss_pred HHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHHHHHH
Q psy3109 3 CVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANSVLYL 52 (58)
Q Consensus 3 ~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s~~~~ 52 (58)
.++++.+++++.|++.|.+|+++|.++.+.||| |.+|+.++.||+.+.
T Consensus 273 ~Gv~a~l~~~~~l~~~~~~~~~~i~vi~~~~EE--g~rF~~~~~GS~~~~ 320 (591)
T PRK13799 273 EGIFLAIACVKELHEQGERLPFHFEVIAFAEEE--GQRFKATFLGSGALI 320 (591)
T ss_pred HHHHHHHHHHHHHHHcCCCCCCCeEEEEecCCC--ccCCCccccchHHHh
Confidence 478999999999999999999999999999999 667999999999985
No 37
>PRK06837 acetylornithine deacetylase; Provisional
Probab=98.48 E-value=3.5e-07 Score=63.70 Aligned_cols=39 Identities=18% Similarity=0.069 Sum_probs=36.8
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
||+++++.+.|++.|++.|..|+++|.++|++|||.|+.
T Consensus 139 ~Kgg~~a~l~a~~~l~~~~~~~~~~i~~~~~~dEE~~g~ 177 (427)
T PRK06837 139 MKAGLAAMLFALDALRAAGLAPAARVHFQSVIEEESTGN 177 (427)
T ss_pred chHHHHHHHHHHHHHHHcCCCCCCcEEEEEEeccccCCH
Confidence 899999999999999999988899999999999998875
No 38
>TIGR01879 hydantase amidase, hydantoinase/carbamoylase family. Enzymes in this subfamily hydrolize the amide bonds of compounds containing carbamoyl groups or hydantoin rings. These enzymes are members of the broader family of amidases represented by pfam01546.
Probab=98.44 E-value=3.2e-07 Score=63.45 Aligned_cols=48 Identities=8% Similarity=-0.153 Sum_probs=38.4
Q ss_pred hHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHHHHH
Q psy3109 2 KCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANSVLY 51 (58)
Q Consensus 2 K~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s~~~ 51 (58)
|.++++.++|+++|++.|.+|+++|.+++++|||.+ +|+.++.+++.+
T Consensus 88 ~~gvaa~l~a~~~l~~~g~~~~~~i~~~~~~dEE~~--~f~~~~~Gs~~~ 135 (401)
T TIGR01879 88 QLGVLAGIEVVDALKEAYVVPLHPIEVVAFTEEEGS--RFPYGMWGSRNM 135 (401)
T ss_pred HHHHHHHHHHHHHHHHcCCCCCCCeEEEEEeCCcCc--CcccccccHHHH
Confidence 567899999999999999999999999999999942 244444444443
No 39
>PRK05469 peptidase T; Provisional
Probab=98.42 E-value=4.3e-07 Score=62.76 Aligned_cols=37 Identities=19% Similarity=0.166 Sum_probs=34.4
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCC
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQ 37 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~G 37 (58)
||+++++.+.|++.|++.+..|+.+|.++|++|||+|
T Consensus 141 ~Kgglaa~l~a~~~l~~~~~~~~g~v~~~f~~dEE~g 177 (408)
T PRK05469 141 DKAGIAEIMTALEYLIAHPEIKHGDIRVAFTPDEEIG 177 (408)
T ss_pred chHHHHHHHHHHHHHHhCCCCCCCCEEEEEecccccC
Confidence 8999999999999999887677789999999999986
No 40
>TIGR01882 peptidase-T peptidase T. This model represents a tripeptide aminopeptidase known as Peptidase T, which has a substrate preference for hydrophobic peptides.
Probab=98.41 E-value=4.9e-07 Score=62.89 Aligned_cols=37 Identities=19% Similarity=0.125 Sum_probs=33.0
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCC
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQ 37 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~G 37 (58)
||++++++++|++.|++.+..++.+|.++|++|||+|
T Consensus 143 ~KgglAa~l~A~~~L~e~~~~~~g~I~~~ft~dEE~g 179 (410)
T TIGR01882 143 DKAGIAEIMTAADYLINHPEIKHGTIRVAFTPDEEIG 179 (410)
T ss_pred CHHHHHHHHHHHHHHHhCCCCCCCCEEEEEECcccCC
Confidence 8999999999999999874446779999999999985
No 41
>PRK08554 peptidase; Reviewed
Probab=98.41 E-value=6.6e-07 Score=63.23 Aligned_cols=41 Identities=7% Similarity=0.041 Sum_probs=35.2
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLA 44 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~ 44 (58)
||+++++.+.|++.|++.+ ++++|.++|++|||+||. ++..
T Consensus 104 mKgg~aa~l~A~~~l~~~~--~~~~i~l~~~~dEE~g~~-~~~~ 144 (438)
T PRK08554 104 DKGNVASVMLALKELSKEP--LNGKVIFAFTGDEEIGGA-MAMH 144 (438)
T ss_pred chHHHHHHHHHHHHHHhcC--CCCCEEEEEEcccccCcc-ccHH
Confidence 8999999999999998754 667899999999999885 4543
No 42
>PRK13004 peptidase; Reviewed
Probab=98.39 E-value=9.2e-07 Score=60.97 Aligned_cols=38 Identities=11% Similarity=-0.144 Sum_probs=35.4
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCC
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQN 38 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg 38 (58)
||+++++.+.|++.|++.+..++++|.++|++|||.++
T Consensus 111 ~Kg~~aa~l~a~~~l~~~~~~~~~~i~~~~~~~EE~~~ 148 (399)
T PRK13004 111 QKGGMASMVYAAKIIKDLGLDDEYTLYVTGTVQEEDCD 148 (399)
T ss_pred cchHHHHHHHHHHHHHhcCCCCCCeEEEEEEcccccCc
Confidence 89999999999999999998888999999999999754
No 43
>PRK12892 allantoate amidohydrolase; Reviewed
Probab=98.39 E-value=5.9e-07 Score=61.81 Aligned_cols=37 Identities=14% Similarity=-0.036 Sum_probs=34.5
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCC
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQ 37 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~G 37 (58)
+|++++++++|+++|++.|.+|+++|.+++++|||.|
T Consensus 94 g~~Gvaa~l~a~~~l~~~~~~~~~~i~~~~~~dEE~~ 130 (412)
T PRK12892 94 GALGVVAGLEAARALNEHGIATRHPLDVVAWCDEEGS 130 (412)
T ss_pred chHHHHHHHHHHHHHHHcCCCCCCCeEEEEecCcccc
Confidence 4788999999999999999999999999999999975
No 44
>PRK08652 acetylornithine deacetylase; Provisional
Probab=98.38 E-value=7.4e-07 Score=59.57 Aligned_cols=37 Identities=14% Similarity=0.003 Sum_probs=32.8
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
||++++++++|++.|.+.. ++++|.++|++|||+|+.
T Consensus 88 ~Kg~~a~~l~a~~~l~~~~--~~~~v~~~~~~dEE~g~~ 124 (347)
T PRK08652 88 AKGGVAAILLALEELGKEF--EDLNVGIAFVSDEEEGGR 124 (347)
T ss_pred hhHHHHHHHHHHHHHhhcc--cCCCEEEEEecCcccCCh
Confidence 8999999999999998653 456999999999999875
No 45
>TIGR01900 dapE-gram_pos succinyl-diaminopimelate desuccinylase. This enzyme is involved in the biosynthesis of lysine, and is related to the enzyme acetylornithine deacetylase and other amidases and peptidases found within pfam01546.
Probab=98.37 E-value=8.2e-07 Score=61.15 Aligned_cols=45 Identities=16% Similarity=0.091 Sum_probs=36.0
Q ss_pred ChHHHHHHHHHHHHHHH--cCCCCCceEEEEeeCCCCCCChhhhHHH
Q psy3109 1 MKCVGIQYLETIRRYIA--EGKKFKRTIHVSFVPGPVTQNETCGLAA 45 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~--~g~~p~r~I~l~f~~DEE~Gg~~~g~~~ 45 (58)
||+++++.+.|++.|.+ .+..++.+|.++|++|||+|+...|...
T Consensus 104 mKgg~aa~l~a~~~l~~~~~~~~~~~~i~~~~~~dEE~~~~~~G~~~ 150 (373)
T TIGR01900 104 MKAGDAVMLHLAATLDGRAPETELKHDLTLIAYDCEEVAAEKNGLGH 150 (373)
T ss_pred hhHHHHHHHHHHHHHhhhccccCCCCCEEEEEEecccccCCCCCHHH
Confidence 89999999999999954 4557788999999999999753224433
No 46
>PRK08651 succinyl-diaminopimelate desuccinylase; Reviewed
Probab=98.33 E-value=1.1e-06 Score=59.91 Aligned_cols=36 Identities=25% Similarity=0.273 Sum_probs=33.0
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
||+++++.++|++.|++.+ +++|.++|++|||+|+.
T Consensus 115 ~k~~~~~~l~a~~~l~~~~---~~~v~~~~~~~EE~g~~ 150 (394)
T PRK08651 115 MKGGIAALLAAFERLDPAG---DGNIELAIVPDEETGGT 150 (394)
T ss_pred cchHHHHHHHHHHHHHhcC---CCCEEEEEecCccccch
Confidence 7999999999999998876 68999999999999873
No 47
>PRK13381 peptidase T; Provisional
Probab=98.32 E-value=1.3e-06 Score=60.27 Aligned_cols=38 Identities=16% Similarity=0.091 Sum_probs=34.3
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
||+++++.+.|++.|.+.+ .++.+|.++|++|||+|+.
T Consensus 139 mKgg~aa~l~a~~~l~~~~-~~~g~i~~~~~~dEE~g~~ 176 (404)
T PRK13381 139 NKAAIAVVMTLLENLTENE-VEHGDIVVAFVPDEEIGLR 176 (404)
T ss_pred cHHHHHHHHHHHHHHHhcC-CCCCCEEEEEEcccccccc
Confidence 8999999999999999885 4677999999999999864
No 48
>PRK07522 acetylornithine deacetylase; Provisional
Probab=98.31 E-value=1.3e-06 Score=59.27 Aligned_cols=43 Identities=16% Similarity=0.105 Sum_probs=35.6
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAA 46 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~ 46 (58)
||+++++.+.|++.|++.+ ++++|.++|++|||+|+. +...++
T Consensus 105 ~Kg~~a~~l~a~~~l~~~~--~~~~i~~~~~~dEE~g~~-G~~~l~ 147 (385)
T PRK07522 105 MKGFIAAALAAVPELAAAP--LRRPLHLAFSYDEEVGCL-GVPSMI 147 (385)
T ss_pred cchHHHHHHHHHHHHHhCC--CCCCEEEEEEeccccCCc-cHHHHH
Confidence 7999999999999998874 568999999999999874 433433
No 49
>TIGR01892 AcOrn-deacetyl acetylornithine deacetylase (ArgE). This model represents a clade of acetylornithine deacetylases from proteobacteria. This enzyme is the final step of the "acetylated" ornithine biosynthesis pathway. The enzyme is closely related to dapE, succinyl-diaminopimelate desuccinylase, and outside of this clade annotation is very inaccurate as to which function should be ascribed to genes.
Probab=98.30 E-value=1.5e-06 Score=58.37 Aligned_cols=37 Identities=19% Similarity=0.177 Sum_probs=33.1
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
||+++++.+.|++.|++.+ .+++|.++|++|||+|+.
T Consensus 99 ~Kg~~a~~l~a~~~l~~~~--~~~~v~~~~~~~EE~g~~ 135 (364)
T TIGR01892 99 MKGFLACALAAAPDLAAEQ--LKKPLHLALTADEEVGCT 135 (364)
T ss_pred cchHHHHHHHHHHHHHhcC--cCCCEEEEEEeccccCCc
Confidence 8999999999999999865 467899999999999874
No 50
>TIGR03106 trio_M42_hydro hydrolase, peptidase M42 family. This model describes a subfamily of MEROPS peptidase family M42, a glutamyl aminopeptidase family that also includes the cellulase CelM from Clostridium thermocellum and deblocking aminopeptidases that can remove acylated amino acids. Members of this family occur in a three gene cassette with an amidotransferase (TIGR03104)in the asparagine synthase (glutamine-hydrolyzing) family, and a probable acetyltransferase (TIGR03103) in the GNAT family.
Probab=98.30 E-value=1.5e-06 Score=60.56 Aligned_cols=38 Identities=11% Similarity=0.067 Sum_probs=35.9
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCC
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQN 38 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg 38 (58)
||++++++++++++|++.+..++++|+++|+.|||+|.
T Consensus 184 ~K~G~a~~l~~~~~l~~~~~~~~~~v~~~~t~qEEvG~ 221 (343)
T TIGR03106 184 DKAGVAALLAALKAIVEHKVPLPVDVHPLFTITEEVGS 221 (343)
T ss_pred cHHhHHHHHHHHHHHHhcCCCCCceEEEEEECCcccCc
Confidence 89999999999999999888888999999999999983
No 51
>PRK07907 hypothetical protein; Provisional
Probab=98.24 E-value=2.5e-06 Score=59.80 Aligned_cols=42 Identities=12% Similarity=-0.095 Sum_probs=34.8
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAA 46 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~ 46 (58)
||+++++.+.|+++| +..++++|.+++++|||+|+. +...++
T Consensus 125 ~Kg~~aa~l~a~~~l---~~~~~~~i~~~~~~dEE~g~~-g~~~~l 166 (449)
T PRK07907 125 DKGGIAMHLAALRAL---GGDLPVGVTVFVEGEEEMGSP-SLERLL 166 (449)
T ss_pred CcHHHHHHHHHHHHh---ccCCCCcEEEEEEcCcccCCc-cHHHHH
Confidence 899999999999999 445678999999999999985 443433
No 52
>PRK05111 acetylornithine deacetylase; Provisional
Probab=98.24 E-value=2.3e-06 Score=58.24 Aligned_cols=37 Identities=19% Similarity=0.098 Sum_probs=32.7
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
||+++++++.|+++|++.+ ++.+|.++|++|||+|+.
T Consensus 112 ~Kg~~a~~l~a~~~l~~~~--~~~~i~~~~~~~EE~g~~ 148 (383)
T PRK05111 112 MKGFFAFILEALRDIDLTK--LKKPLYILATADEETSMA 148 (383)
T ss_pred ccHHHHHHHHHHHHHhhcC--CCCCeEEEEEeccccCcc
Confidence 8999999999999998764 346899999999999874
No 53
>PRK13007 succinyl-diaminopimelate desuccinylase; Reviewed
Probab=98.21 E-value=2.7e-06 Score=57.20 Aligned_cols=41 Identities=17% Similarity=0.117 Sum_probs=33.5
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAA 45 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~ 45 (58)
||+++++.+.|++.|. .|+++|.++|++|||+|+...|...
T Consensus 96 ~Kg~~a~~l~a~~~l~----~~~~~i~~~~~~~EE~~~~~~G~~~ 136 (352)
T PRK13007 96 MKSGLAVMLHLAATLA----EPAHDLTLVFYDCEEVEAEANGLGR 136 (352)
T ss_pred ccHHHHHHHHHHHHhh----ccCCCeEEEEEecccccCCcccHHH
Confidence 8999999999999993 3678999999999999764234433
No 54
>PRK00466 acetyl-lysine deacetylase; Validated
Probab=98.20 E-value=3.2e-06 Score=57.27 Aligned_cols=35 Identities=14% Similarity=-0.006 Sum_probs=31.4
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
||+++++.++|++.|++.|. +|.++|++|||+|+.
T Consensus 93 mKgg~aa~l~a~~~l~~~~~----~i~~~~~~dEE~g~~ 127 (346)
T PRK00466 93 AKGPLISMIIAAWLLNEKGI----KVMVSGLADEESTSI 127 (346)
T ss_pred cchHHHHHHHHHHHHHHcCC----CEEEEEEcCcccCCc
Confidence 89999999999999999873 489999999998764
No 55
>TIGR03320 ygeY M20/DapE family protein YgeY. Members of this protein family, including the YgeY protein of Escherichia coli, typically are found in extended genomic regions associated with purine catabolism. Homologs include peptidases and deacylases of the M20/M25 /M40 and DapE/ArgE families. The function is unknown.
Probab=98.18 E-value=2.9e-06 Score=58.34 Aligned_cols=38 Identities=8% Similarity=-0.229 Sum_probs=35.2
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCC
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQN 38 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg 38 (58)
||+++++.+.|++.|++.|..++.+|.+.+++|||.++
T Consensus 109 ~Kg~~aa~l~A~~~l~~~g~~~~~~i~~~~~~dEE~~~ 146 (395)
T TIGR03320 109 QEGGIASMVYAGKIIKDLGLLDDYTLLVTGTVQEEDCD 146 (395)
T ss_pred ccchHHHHHHHHHHHHHcCCCCCceEEEEecccccccC
Confidence 89999999999999999998888899999999999754
No 56
>PRK10199 alkaline phosphatase isozyme conversion aminopeptidase; Provisional
Probab=98.16 E-value=5.4e-06 Score=58.40 Aligned_cols=37 Identities=14% Similarity=-0.099 Sum_probs=33.5
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
||+++++++++++.|++. .|+++|.+++++|||.|..
T Consensus 144 nasGvA~lLe~ar~l~~~--~~~~~I~fv~~~~EE~Gl~ 180 (346)
T PRK10199 144 NAAGLGVMLELAERLKNV--PTEYGIRFVATSGEEEGKL 180 (346)
T ss_pred cHHHHHHHHHHHHHHhhC--CCCCcEEEEEECCcccCcH
Confidence 799999999999999865 4788999999999999875
No 57
>TIGR01902 dapE-lys-deAc N-acetyl-ornithine/N-acetyl-lysine deacetylase. This clade of mainly archaeal and related bacterial species contains two characterized enzymes, an deacetylase with specificity for both N-acetyl-ornithine and N-acetyl-lysine from Thermus which is found within a lysine biosynthesis operon, and a fusion protein with acetyl-glutamate kinase (an enzyme of ornithine biosynthesis) from Lactobacillus. It is possible that all of the sequences within this clade have dual specificity, or that a mix of specificities have evolved within this clade.
Probab=98.13 E-value=4.8e-06 Score=56.09 Aligned_cols=38 Identities=16% Similarity=0.031 Sum_probs=32.7
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLA 44 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~ 44 (58)
||+++++.++|+++|++.+ .+|.+++++|||+|+. |..
T Consensus 83 ~Kg~~aa~l~a~~~l~~~~----~~i~~~~~~dEE~g~~--G~~ 120 (336)
T TIGR01902 83 AKGPLIAMIFATWLLNEKG----IKVIVSGLVDEESSSK--GAR 120 (336)
T ss_pred CCcHHHHHHHHHHHHHhCC----CcEEEEEEeCcccCCc--cHH
Confidence 8999999999999998764 4799999999999774 544
No 58
>TIGR03526 selenium_YgeY putative selenium metabolism hydrolase. SelD, selenophosphate synthase, is the selenium donor protein for both selenocysteine and selenouridine biosynthesis systems, but it occurs also in a few prokaryotes that have neither of those pathways. The method of partial phylogenetic profiling, starting from such orphan-selD genomes, identifies this protein as one of those most strongly correlated to SelD occurrence. Its distribution is also well correlated with that of family TIGR03309, a putative accessory protein of labile selenium (non-selenocysteine) enzyme maturation. This family includes the uncharacterized YgeY of Escherichia coli, and belongs to a larger family of metalloenzymes in which some are known peptidases, others enzymes of different types.
Probab=97.99 E-value=1.8e-05 Score=54.52 Aligned_cols=37 Identities=8% Similarity=-0.255 Sum_probs=34.1
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCC
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQ 37 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~G 37 (58)
||+++++.+.|++.|.+.+..+++++.+++++|||++
T Consensus 109 ~Kg~~aa~l~a~~~l~~~~~~~~~~v~~~~~~dEE~~ 145 (395)
T TIGR03526 109 QEGGIASMVYAGKIIKDLGLLDDYTLLVTGTVQEEDC 145 (395)
T ss_pred cchhHHHHHHHHHHHHHcCCCCCceEEEEEecccccC
Confidence 8999999999999999998878889999999999953
No 59
>PF04389 Peptidase_M28: Peptidase family M28; InterPro: IPR007484 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This domain is found in metallopeptidases belonging to the MEROPS peptidase family M28 (aminopeptidase Y, clan MH) []. They also contain a transferrin receptor-like dimerisation domain (IPR007365 from INTERPRO) and a protease-associated PA domain (IPR003137 from INTERPRO).; GO: 0008233 peptidase activity, 0006508 proteolysis; PDB: 3BXM_A 2C6P_A 1Z8L_C 3SJF_A 3BHX_A 2C6G_A 3D7F_A 2XEG_A 3BI1_A 2C6C_A ....
Probab=97.98 E-value=1.9e-05 Score=48.78 Aligned_cols=43 Identities=23% Similarity=0.050 Sum_probs=36.2
Q ss_pred HHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHH
Q psy3109 3 CVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAA 46 (58)
Q Consensus 3 ~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~ 46 (58)
++++.+||.+|.|.+.+.+|+|+|.++|+..||.|-. +...++
T Consensus 29 sGva~lLelAr~l~~~~~~~~~~i~fv~~~~EE~gl~-GS~~~~ 71 (179)
T PF04389_consen 29 SGVAALLELARVLKELKPQPKRTIRFVFFDGEEQGLL-GSRAFV 71 (179)
T ss_dssp HHHHHHHHHHHHHHHSTHSSSEEEEEEEESSGGGTSH-HHHHHH
T ss_pred HHHHHHHHHHHHHHHhhcccCccEEEEEecccccCcc-chHHHH
Confidence 5789999999999998889999999999999998764 333333
No 60
>PLN02693 IAA-amino acid hydrolase
Probab=97.87 E-value=3.3e-05 Score=54.80 Aligned_cols=36 Identities=11% Similarity=0.079 Sum_probs=32.6
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCC
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVT 36 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~ 36 (58)
||+.+++.+.|++.|++.+..++.+|.++|++|||.
T Consensus 137 hkg~~A~~l~Aa~~L~~~~~~~~g~V~~if~pdEE~ 172 (437)
T PLN02693 137 HDGHVAMLLGAAKILQEHRHHLQGTVVLIFQPAEEG 172 (437)
T ss_pred chHHHHHHHHHHHHHHhCcccCCceEEEEEEEcccc
Confidence 799999999999999998766678999999999993
No 61
>PRK08737 acetylornithine deacetylase; Provisional
Probab=97.86 E-value=2.6e-05 Score=53.82 Aligned_cols=36 Identities=11% Similarity=0.045 Sum_probs=29.8
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLA 44 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~ 44 (58)
||+++++++.|++. +..+|.++|++|||+|+. .|..
T Consensus 103 mKg~~aa~l~a~~~-------~~~~v~~~~~~dEE~g~~-~g~~ 138 (364)
T PRK08737 103 IKGAAAALLAAANA-------GDGDAAFLFSSDEEANDP-RCVA 138 (364)
T ss_pred chHHHHHHHHHHHc-------cCCCEEEEEEcccccCch-hhHH
Confidence 89999999999874 346899999999999885 3543
No 62
>TIGR01891 amidohydrolases amidohydrolase. This model represents a subfamily of amidohydrolases which are a subset of those sequences detected by pfam01546. Included within this group are hydrolases of hippurate (N-benzylglycine), indoleacetic acid (IAA) N-conjugates of amino acids, N-acetyl-L-amino acids and aminobenzoylglutamate. These hydrolases are of the carboxypeptidase-type, most likely utilizing a zinc ion in the active site.
Probab=97.74 E-value=7.1e-05 Score=51.00 Aligned_cols=36 Identities=14% Similarity=0.092 Sum_probs=31.8
Q ss_pred hHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCC
Q psy3109 2 KCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQ 37 (58)
Q Consensus 2 K~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~G 37 (58)
|+++++++.|++.|++.+.+++++|.++|++|||+|
T Consensus 92 ~~~~a~~l~a~~~l~~~~~~~~~~i~~~~~~dEE~~ 127 (363)
T TIGR01891 92 DLHTAILLGTAKLLKKLADLLEGTVRLIFQPAEEGG 127 (363)
T ss_pred HHHHHHHHHHHHHHHhchhhCCceEEEEEeecCcCc
Confidence 577889999999999887777889999999999984
No 63
>TIGR01893 aa-his-dipept aminoacyl-histidine dipeptidase.
Probab=97.70 E-value=5.2e-05 Score=53.82 Aligned_cols=35 Identities=9% Similarity=-0.064 Sum_probs=28.8
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
||+++++++++++. .+ .+.++|.++|++|||+|+.
T Consensus 110 ~k~gva~~l~~~~~---~~-~~~~~i~~~~~~dEE~g~~ 144 (477)
T TIGR01893 110 NGIGVAMGLAILED---NN-LKHPPLELLFTVDEETGMD 144 (477)
T ss_pred ccHHHHHHHHHHhc---CC-CCCCCEEEEEEeccccCch
Confidence 79999999998775 33 3567999999999998764
No 64
>PRK15026 aminoacyl-histidine dipeptidase; Provisional
Probab=97.59 E-value=0.00013 Score=52.72 Aligned_cols=35 Identities=9% Similarity=-0.050 Sum_probs=28.3
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
||++++++++++ .+.++ +..+|.++|++|||+|+.
T Consensus 116 ~k~gva~~l~~l---~~~~~-~~~~i~~l~t~dEE~G~~ 150 (485)
T PRK15026 116 NGIGMASALAVL---ADENV-VHGPLEVLLTMTEEAGMD 150 (485)
T ss_pred cHHHHHHHHHHH---HhCCC-CCCCEEEEEEcccccCcH
Confidence 799999988776 44554 467899999999999875
No 65
>PLN02280 IAA-amino acid hydrolase
Probab=97.48 E-value=0.00022 Score=51.48 Aligned_cols=37 Identities=14% Similarity=0.104 Sum_probs=33.3
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCC
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQ 37 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~G 37 (58)
||+.+++++.|++.|.+.+..++.+|.++|.+|||+|
T Consensus 187 hd~~~A~~l~a~~~L~~~~~~~~g~V~~if~pdEE~g 223 (478)
T PLN02280 187 HDAHVAMLLGAAKILKSREHLLKGTVVLLFQPAEEAG 223 (478)
T ss_pred CcHHHHHHHHHHHHHHhccccCCceEEEEeccccccc
Confidence 6889999999999999887777889999999999985
No 66
>PRK09961 exoaminopeptidase; Provisional
Probab=96.93 E-value=0.0025 Score=44.34 Aligned_cols=43 Identities=7% Similarity=-0.010 Sum_probs=34.8
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAAN 47 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~ 47 (58)
+|.+++.+++++++|++. .+..+++++|+.-||+|.. |...++
T Consensus 167 nR~g~~~lle~l~~l~~~--~~~~~v~~~~tvqEEvG~r--Ga~~aa 209 (344)
T PRK09961 167 DRLGCYLLVTLLRELHDA--ELPAEVWLVASSSEEVGLR--GGQTAT 209 (344)
T ss_pred hhHhHHHHHHHHHHhhhc--CCCceEEEEEEcccccchH--HHHHHH
Confidence 478999999999999765 4678999999999999875 444433
No 67
>KOG2195|consensus
Probab=96.58 E-value=0.0078 Score=45.98 Aligned_cols=39 Identities=13% Similarity=-0.020 Sum_probs=30.4
Q ss_pred HHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHH
Q psy3109 8 YLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAAN 47 (58)
Q Consensus 8 ~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~ 47 (58)
+-.++..+++.|++|+|||+++.|..||-|.. |..-++-
T Consensus 381 i~~~~~~~~k~gwrP~RtI~F~sWdAeEfGli-GStE~~E 419 (702)
T KOG2195|consen 381 IARALSKLKKRGWRPRRTILFASWDAEEFGLL-GSTEWAE 419 (702)
T ss_pred HHHHHHHHHHcCCCccceEEEEEccchhcccc-ccHHHHH
Confidence 44566677899999999999999999998775 4444443
No 68
>TIGR03107 glu_aminopep glutamyl aminopeptidase. This model represents the M42.001 clade within MEROPS family M42. M42 includes glutamyl aminopeptidase as in the present model, deblocking aminopeptidases as from Pyrococcus horikoshii and related species, and endo-1,4-beta-glucanase (cellulase M) as from Clostridium thermocellum. The current family includes
Probab=96.56 E-value=0.0058 Score=42.92 Aligned_cols=37 Identities=8% Similarity=-0.031 Sum_probs=32.5
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
+|.+++.+++++++|++. .++.+++++|+.-||+|..
T Consensus 179 dR~g~a~l~e~l~~l~~~--~~~~~l~~~~tvqEEvG~r 215 (350)
T TIGR03107 179 NRYGVLMILELLESLKDQ--ELPNTLIAGANVQEEVGLR 215 (350)
T ss_pred cHHHHHHHHHHHHHhhhc--CCCceEEEEEEChhhcCch
Confidence 478999999999999875 4778999999999999874
No 69
>PF05343 Peptidase_M42: M42 glutamyl aminopeptidase; InterPro: IPR008007 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M42 (glutamyl aminopeptidase family, clan MH). For members of this family and family M28 the predicted metal ligands occur in the same order in the sequence: H, D, E, D/E, H; and the active site residues occur in the motifs HXD and EE. ; PDB: 2WYR_C 2CF4_A 1VHO_A 3ISX_A 3KL9_G 1YLO_F 3CPX_C 1VHE_A 2GRE_F 1XFO_A ....
Probab=96.42 E-value=0.015 Score=39.88 Aligned_cols=41 Identities=15% Similarity=0.089 Sum_probs=33.4
Q ss_pred hHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHH
Q psy3109 2 KCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAA 46 (58)
Q Consensus 2 K~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~ 46 (58)
+.+++.++++++.|++.+. +.+++++|+.-||+|.. |...+
T Consensus 136 R~g~~~lle~l~~l~~~~~--~~~v~~v~tvqEEvG~r--GA~~a 176 (292)
T PF05343_consen 136 RAGCAVLLELLRELKEKEL--DVDVYFVFTVQEEVGLR--GAKTA 176 (292)
T ss_dssp HHHHHHHHHHHHHHTTSS---SSEEEEEEESSCTTTSH--HHHHH
T ss_pred hhHHHHHHHHHHHHhhcCC--CceEEEEEEeeeeecCc--ceeec
Confidence 6789999999999998753 38999999999999874 44433
No 70
>COG1363 FrvX Cellulase M and related proteins [Carbohydrate transport and metabolism]
Probab=96.20 E-value=0.0076 Score=42.81 Aligned_cols=37 Identities=8% Similarity=-0.008 Sum_probs=33.4
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
+|.+++++++++++| + +..++.+++++|+.-||+|-.
T Consensus 181 dR~gva~lle~lk~l-~-~~~~~~~vy~v~tvqEEVGlr 217 (355)
T COG1363 181 DRAGVAALLELLKEL-K-GIELPADVYFVASVQEEVGLR 217 (355)
T ss_pred chHhHHHHHHHHHHh-c-cCCCCceEEEEEecchhhccc
Confidence 488999999999999 5 778899999999999999864
No 71
>COG2234 Iap Predicted aminopeptidases [General function prediction only]
Probab=96.12 E-value=0.012 Score=40.93 Aligned_cols=35 Identities=17% Similarity=-0.028 Sum_probs=31.1
Q ss_pred HHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109 3 CVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 3 ~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
++++..+|..|.|+... |+|+|.++++.-||.|..
T Consensus 231 sGva~llEiAr~l~~~~--p~~~v~f~~~~aEE~Gl~ 265 (435)
T COG2234 231 SGVAALLELARVLKGNP--PKRTVRFVAFGAEESGLL 265 (435)
T ss_pred HHHHHHHHHHHHHhcCC--CCceEEEEEecchhhccc
Confidence 57889999999998875 999999999999998775
No 72
>PRK09864 putative peptidase; Provisional
Probab=96.10 E-value=0.013 Score=41.40 Aligned_cols=35 Identities=6% Similarity=-0.262 Sum_probs=31.0
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
+|.+++.++++++.|++ |..+++++|+.-||+|-.
T Consensus 176 nR~g~~~lle~l~~l~~----~~~~vy~v~TvQEEvGlr 210 (356)
T PRK09864 176 NRIGCAMMAELLQTVNN----PEITLYGVGSVEEEVGLR 210 (356)
T ss_pred cHHHHHHHHHHHHHhhc----CCCeEEEEEEcchhcchH
Confidence 47899999999999965 778999999999999864
No 73
>COG2195 PepD Di- and tripeptidases [Amino acid transport and metabolism]
Probab=95.74 E-value=0.014 Score=42.12 Aligned_cols=46 Identities=13% Similarity=0.065 Sum_probs=36.4
Q ss_pred ChHHHHHHHHHHHHHHHcC-CCCCceEEEEeeCCCCCCChhhhHHHHHH
Q psy3109 1 MKCVGIQYLETIRRYIAEG-KKFKRTIHVSFVPGPVTQNETCGLAAANS 48 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g-~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s 48 (58)
.|++++.++.|++.+.+.- -.|-.+|.+.|++|||.|+. |...+..
T Consensus 145 ~kAGia~i~~al~~~~~~~~~i~h~~i~~g~s~~Ee~g~r--g~~~~~~ 191 (414)
T COG2195 145 DKAGIAEIMTALSVLREKHPEIPHGGIRGGFSPDEEIGGR--GAANKDV 191 (414)
T ss_pred chhHHHHHHHHHHHHhhcCccccccCeEEEecchHHhhhh--hhhhccH
Confidence 4899999999999999663 34668999999999999875 4444433
No 74
>COG4187 RocB Arginine degradation protein (predicted deacylase) [Amino acid transport and metabolism]
Probab=95.70 E-value=0.026 Score=42.04 Aligned_cols=38 Identities=16% Similarity=0.141 Sum_probs=33.4
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
||++++..+..++++.+.+ ..+-+|-++-+||||+.+.
T Consensus 141 MKsGlav~la~L~~fa~~~-~~~GNlLf~a~pdEE~~s~ 178 (553)
T COG4187 141 MKSGLAVHLACLEEFAART-DRQGNLLFMAVPDEEVESR 178 (553)
T ss_pred hhhhhHHHHHHHHHHhhCC-CCCCcEEEEeccchhhhcc
Confidence 9999999999999999984 4556899999999998764
No 75
>KOG2276|consensus
Probab=93.67 E-value=0.09 Score=38.74 Aligned_cols=39 Identities=10% Similarity=0.105 Sum_probs=36.1
Q ss_pred ChHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109 1 MKCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 1 MK~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
+|+=+++.++|++++++.|...+-+|.++|-.-||.||.
T Consensus 133 dkGPv~~wi~av~a~~~~g~~lpvnv~f~~EgmEEsgS~ 171 (473)
T KOG2276|consen 133 DKGPVLSWIHAVKALQQLGIDLPVNVVFVFEGMEESGSE 171 (473)
T ss_pred CCccchHHHHHHHHHHHhCccccceEEEEEEechhccCc
Confidence 477789999999999999998889999999999999997
No 76
>PRK02256 putative aminopeptidase 1; Provisional
Probab=93.64 E-value=0.14 Score=37.39 Aligned_cols=34 Identities=18% Similarity=0.130 Sum_probs=25.6
Q ss_pred hHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109 2 KCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 2 K~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
+.++.++++|++.+. .+..+++++++.-||+|+.
T Consensus 262 r~~~~~~leal~~~~----~~~~~~~~~~~dqEEVGs~ 295 (462)
T PRK02256 262 RVCAYTSLEALLELE----NPEKTAVVLLVDKEEIGSE 295 (462)
T ss_pred HHHHHHHHHHHHhcc----cCCCeEEEEEEcccccCCc
Confidence 456777777776553 3556888999999999986
No 77
>COG1473 AbgB Metal-dependent amidase/aminoacylase/carboxypeptidase [General function prediction only]
Probab=90.27 E-value=0.39 Score=34.44 Aligned_cols=46 Identities=11% Similarity=0.079 Sum_probs=34.1
Q ss_pred HHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHHHHHH
Q psy3109 5 GIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANSVLYL 52 (58)
Q Consensus 5 ~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s~~~~ 52 (58)
.++.+-|.+.|.+....++-+|.++|-|-||.++ ++..++..-++.
T Consensus 108 ta~lLgaA~~L~~~~~~~~Gtv~~ifQPAEE~~~--Ga~~mi~~G~~~ 153 (392)
T COG1473 108 TAILLGAALALAEHKDNLPGTVRLIFQPAEEGGG--GAKAMIEDGVFD 153 (392)
T ss_pred HHHHHHHHHHHHhhhhhCCcEEEEEecccccccc--cHHHHHhcCCcc
Confidence 4667888888888755678899999999999874 455665554443
No 78
>PF05450 Nicastrin: Nicastrin; InterPro: IPR008710 Nicastrin and presenilin are two major components of the gamma-secretase complex, which executes the intramembrane proteolysis of type I integral membrane proteins such as the amyloid precursor protein (APP) and Notch. Nicastrin is synthesised in fibroblasts and neurons as an endoglycosidase-H-sensitive glycosylated precursor protein (immature nicastrin) and is then modified by complex glycosylation in the Golgi apparatus and by sialylation in the trans-Golgi network (mature nicastrin) [].; GO: 0016485 protein processing, 0016021 integral to membrane
Probab=89.69 E-value=0.87 Score=30.50 Aligned_cols=34 Identities=9% Similarity=0.127 Sum_probs=27.0
Q ss_pred HHHHHHHHHHHHHHHc---CCCCCceEEEEeeCCCCC
Q psy3109 3 CVGIQYLETIRRYIAE---GKKFKRTIHVSFVPGPVT 36 (58)
Q Consensus 3 ~~~~~~leAl~~L~~~---g~~p~r~I~l~f~~DEE~ 36 (58)
++++..|+|++.|.+. --.++|+|.++|+..|-.
T Consensus 27 sglvaLLaaA~aL~~~~~~~~~~~knV~F~~F~GEs~ 63 (234)
T PF05450_consen 27 SGLVALLAAAEALSKLLPDSSNLNKNVLFAFFNGESF 63 (234)
T ss_pred HHHHHHHHHHHHHHHhhhccccccCcEEEEEecCccc
Confidence 4778888888888765 235789999999999974
No 79
>PF02127 Peptidase_M18: Aminopeptidase I zinc metalloprotease (M18); InterPro: IPR001948 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M18, (clan MH). The proteins have two catalytic zinc ions at the active site, bound by His/Asp, Asp, Glu, Asp/Glu and His. The catalysed reaction involves the release of an N-terminal aminoacid, usually neutral or hydrophobic, from a polypeptide []. The type example is aminopeptidase I from Saccharomyces cerevisiae (Baker's yeast), the sequence of which has been deduced, and the mature protein shown to consist of 469 amino acids []. A 45-residue presequence contains both positively- and negatively-charged and hydrophobic residues, which could be arranged in an N-terminal amphiphilic alpha-helix []. The presequence differs from signal sequences that direct proteins across bacterial plasma membranes and endoplasmic reticulum or into mitochondria. It is unclear how this unique presequence targets aminopeptidase I to yeast vacuoles, and how this sorting utilises classical protein secretory pathways [].; GO: 0004177 aminopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 1Y7E_A 2GLJ_R 4DYO_A 2IJZ_B 3VAT_A 3VAR_A 2GLF_B.
Probab=88.96 E-value=0.68 Score=33.64 Aligned_cols=35 Identities=11% Similarity=0.002 Sum_probs=23.0
Q ss_pred HHHHHHHHHHHHHcC---CCCC-ceEEEEeeCCCCCCCh
Q psy3109 5 GIQYLETIRRYIAEG---KKFK-RTIHVSFVPGPVTQNE 39 (58)
Q Consensus 5 ~~~~leAl~~L~~~g---~~p~-r~I~l~f~~DEE~Gg~ 39 (58)
+++.+.++++|.+.. ..|. .+-.++++-.||+||.
T Consensus 237 l~s~~a~l~Al~~~~~~~~~~~~~~~v~~~fD~EEiGS~ 275 (432)
T PF02127_consen 237 LASCYAALEALIDSSNDSLEPEDGTNVVVLFDNEEIGSE 275 (432)
T ss_dssp HHHHHHHHHHHHHHTSSCCCCSSSEEEEEEESSGGGTST
T ss_pred HHHHHHHHHHHHhhhcccccccCceEEEEEEcccccCCC
Confidence 456677777777663 2466 4445556656999996
No 80
>PRK02813 putative aminopeptidase 2; Provisional
Probab=88.77 E-value=0.58 Score=33.81 Aligned_cols=33 Identities=9% Similarity=-0.112 Sum_probs=23.7
Q ss_pred hHHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109 2 KCVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 2 K~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
+.++...++|++.+.. ..+++++++--||+|+.
T Consensus 236 r~~~~~~l~al~~~~~-----~~~~~~~~~d~EEVGs~ 268 (428)
T PRK02813 236 LSSCHAGLEALLAAAS-----DATNVLAAFDHEEVGSA 268 (428)
T ss_pred HHHHHHHHHHHHhcCC-----CCeEEEEEEecCccCCC
Confidence 3456666666665432 56888999989999986
No 81
>KOG2194|consensus
Probab=85.47 E-value=1.1 Score=35.24 Aligned_cols=44 Identities=14% Similarity=0.012 Sum_probs=34.2
Q ss_pred HHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHH
Q psy3109 3 CVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAAN 47 (58)
Q Consensus 3 ~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~ 47 (58)
.+++.+||++|.+.+.....+++|.++|+.-||.+-. +..+++.
T Consensus 166 ~~va~mLe~lRv~s~~~~~l~~~vVFLfNgaEE~~L~-gsH~FIt 209 (834)
T KOG2194|consen 166 SGVASMLEALRVLSKSDKLLTHSVVFLFNGAEESGLL-GSHAFIT 209 (834)
T ss_pred hHHHHHHHHHHHhhcCCCcccccEEEEecCcccchhh-hccccee
Confidence 4688999999999988666689999999999996543 3444443
No 82
>PTZ00371 aspartyl aminopeptidase; Provisional
Probab=85.08 E-value=1.2 Score=32.54 Aligned_cols=38 Identities=3% Similarity=-0.061 Sum_probs=21.7
Q ss_pred hHHHHHHHHHHHHHHHcCCCCCceEEEEe-eCCCCCCCh
Q psy3109 2 KCVGIQYLETIRRYIAEGKKFKRTIHVSF-VPGPVTQNE 39 (58)
Q Consensus 2 K~~~~~~leAl~~L~~~g~~p~r~I~l~f-~~DEE~Gg~ 39 (58)
+.++...++|+..+...-......+.+++ +--||+||.
T Consensus 253 r~~~~~~l~al~~~~~~~~~~~~~~~v~~~~d~EEVGs~ 291 (465)
T PTZ00371 253 LGSSFCAFKALTEAVESLGENSSNIRMVCLFDHEEVGSS 291 (465)
T ss_pred HHHHHHHHHHHHhccccccCCCCceEEEEEECCcCCCCC
Confidence 45666677777655421001134455555 666999986
No 83
>PF06309 Torsin: Torsin; InterPro: IPR010448 This family consists of several eukaryotic torsin proteins. Torsion dystonia is an autosomal dominant movement disorder characterised by involuntary, repetitive muscle contractions and twisted postures. The most severe early-onset form of dystonia has been linked to mutations in the human DYT1 (TOR1A) gene encoding a protein termed torsinA. While causative genetic alterations have been identified, the function of torsin proteins and the molecular mechanism underlying dystonia remain unknown. Phylogenetic analysis of the torsin protein family indicates these proteins share distant sequence similarity with the large and diverse family of AAA ATPase, central region containing proteins () proteins. It has been suggested that torsins play a role in effectively managing protein folding and that possible breakdown in a neuroprotective mechanism that is, in part, mediated by torsins may be responsible for the neuronal dysfunction associated with dystonia [].; GO: 0005524 ATP binding, 0051085 chaperone mediated protein folding requiring cofactor
Probab=72.58 E-value=20 Score=22.27 Aligned_cols=49 Identities=18% Similarity=0.154 Sum_probs=36.6
Q ss_pred HHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHHHHHHhhhcc
Q psy3109 7 QYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANSVLYLLGLMS 57 (58)
Q Consensus 7 ~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s~~~~~~~~~ 57 (58)
.++.|++.-.+.. .|++++.++|-..-=+|=. +-..+++..+|..|+-|
T Consensus 36 ~v~~ai~~~l~~~-~p~KpLVlSfHG~tGtGKn-~v~~liA~~ly~~G~~S 84 (127)
T PF06309_consen 36 VVVNAIKGHLANP-NPRKPLVLSFHGWTGTGKN-FVSRLIAEHLYKSGMKS 84 (127)
T ss_pred HHHHHHHHHHcCC-CCCCCEEEEeecCCCCcHH-HHHHHHHHHHHhcccCC
Confidence 4567777777664 7999999999877666543 56778888888887654
No 84
>COG1362 LAP4 Aspartyl aminopeptidase [Amino acid transport and metabolism]
Probab=71.98 E-value=5.1 Score=29.62 Aligned_cols=33 Identities=9% Similarity=0.031 Sum_probs=22.9
Q ss_pred HHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCCh
Q psy3109 6 IQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNE 39 (58)
Q Consensus 6 ~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~ 39 (58)
++.+.++++|... ..+..+..++++--||+||.
T Consensus 244 ~~~~a~m~AL~~~-~~~~~~~v~~~fD~EEIGS~ 276 (437)
T COG1362 244 ICCHAGMEALLAA-ANSDKTCVLALFDHEEIGSL 276 (437)
T ss_pred HHHHHHHHHHHhc-cCCCCceEEEEechhhcccc
Confidence 4556667777665 34556667777777999996
No 85
>PF06877 RraB: Regulator of ribonuclease activity B; InterPro: IPR009671 This entry occurs in several hypothetical bacterial proteins of around 120 residues in length. The function of these proteins is unknown. The protein structure has been determined for one member of this group, the hypothetical protein VCO424 from Vibrio cholerae; it has an alpha+beta sandwich fold.; PDB: 1NXI_A.
Probab=61.78 E-value=23 Score=20.06 Aligned_cols=29 Identities=17% Similarity=0.185 Sum_probs=18.5
Q ss_pred HHHHHHHHHHHcCCCC--CceEEEEeeCCCC
Q psy3109 7 QYLETIRRYIAEGKKF--KRTIHVSFVPGPV 35 (58)
Q Consensus 7 ~~leAl~~L~~~g~~p--~r~I~l~f~~DEE 35 (58)
+--..++.|.+.|..| +|+|...|..+.|
T Consensus 5 ~n~~vl~~L~~~Gddl~~~r~ieh~~~f~~~ 35 (104)
T PF06877_consen 5 ENREVLEALEEDGDDLSKPRPIEHWFYFEDE 35 (104)
T ss_dssp HHHHHHHHHHHHT--TTS-EEEEEEEEES-H
T ss_pred HHHHHHHHHHhcCCCCCCCeEEEEEEEeCCH
Confidence 3446788999999766 4888877665544
No 86
>COG4882 Predicted aminopeptidase, Iap family [General function prediction only]
Probab=61.59 E-value=9.1 Score=28.43 Aligned_cols=45 Identities=18% Similarity=-0.034 Sum_probs=31.3
Q ss_pred HHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHH--HHHHHHH
Q psy3109 4 VGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAA--ANSVLYL 52 (58)
Q Consensus 4 ~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~--~~s~~~~ 52 (58)
++.|+++++++|...+. .+.++.+.-||.|++....-+ .||+.|+
T Consensus 211 g~~~AV~~~~~lr~~~~----~~~lv~FtAEE~g~p~~~sfyWa~GSr~~l 257 (486)
T COG4882 211 GVAQAVETAGRLRGRGL----AAGLVVFTAEEHGMPGMASFYWAAGSRGLL 257 (486)
T ss_pred hHHHHHHHHHHHhhcCc----ceeEEEEeccccCCCCCcceeecccchHHH
Confidence 67889999999988874 357777788998887433222 3455443
No 87
>PF12000 Glyco_trans_4_3: Gkycosyl transferase family 4 group; InterPro: IPR022623 This presumed domain is functionally uncharacterised and found in bacteria. This region is about 170 amino acids in length and is found N-terminal to PF00534 from PFAM. There is a single completely conserved residue G that may be functionally important.
Probab=56.75 E-value=17 Score=23.34 Aligned_cols=22 Identities=14% Similarity=0.102 Sum_probs=16.5
Q ss_pred HHHHHHHHHHHHcCCCCCceEEEE
Q psy3109 6 IQYLETIRRYIAEGKKFKRTIHVS 29 (58)
Q Consensus 6 ~~~leAl~~L~~~g~~p~r~I~l~ 29 (58)
-+...|++.|+++|+.| +|++.
T Consensus 51 ~av~~a~~~L~~~Gf~P--DvI~~ 72 (171)
T PF12000_consen 51 QAVARAARQLRAQGFVP--DVIIA 72 (171)
T ss_pred HHHHHHHHHHHHcCCCC--CEEEE
Confidence 35678899999999999 44443
No 88
>PF14136 DUF4303: Domain of unknown function (DUF4303)
Probab=50.53 E-value=47 Score=20.02 Aligned_cols=20 Identities=15% Similarity=0.044 Sum_probs=15.4
Q ss_pred HHHHHHHHHHHHHHcCCCCC
Q psy3109 4 VGIQYLETIRRYIAEGKKFK 23 (58)
Q Consensus 4 ~~~~~leAl~~L~~~g~~p~ 23 (58)
..-+++.|++.|.+.|+.-.
T Consensus 104 ~~~~~~~aL~~l~~eg~f~~ 123 (155)
T PF14136_consen 104 LLEAMVEALKQLDQEGFFGN 123 (155)
T ss_pred HHHHHHHHHHHHHHhcccCC
Confidence 45678899999999986433
No 89
>PF11181 YflT: Heat induced stress protein YflT
Probab=50.38 E-value=28 Score=20.04 Aligned_cols=26 Identities=23% Similarity=0.129 Sum_probs=19.9
Q ss_pred HHHHHHHHHHHcCCCCCceEEEEeeCC
Q psy3109 7 QYLETIRRYIAEGKKFKRTIHVSFVPG 33 (58)
Q Consensus 7 ~~leAl~~L~~~g~~p~r~I~l~f~~D 33 (58)
.++.+++.|++.|+.. .+|+++--.+
T Consensus 11 E~~~~I~~L~~~Gy~~-ddI~Vva~d~ 36 (103)
T PF11181_consen 11 EALSAIEELKAQGYSE-DDIYVVAKDK 36 (103)
T ss_pred HHHHHHHHHHHcCCCc-ccEEEEEcCc
Confidence 5788999999999854 6787766433
No 90
>KOG0615|consensus
Probab=48.79 E-value=37 Score=25.49 Aligned_cols=33 Identities=18% Similarity=0.202 Sum_probs=25.7
Q ss_pred hHHHHHHHHHHHHHHHcCCCCCceE---EEEeeCCCC
Q psy3109 2 KCVGIQYLETIRRYIAEGKKFKRTI---HVSFVPGPV 35 (58)
Q Consensus 2 K~~~~~~leAl~~L~~~g~~p~r~I---~l~f~~DEE 35 (58)
|...-|++.|++.|++.|+.. |+| .+++..|+|
T Consensus 279 K~~f~Qll~avkYLH~~GI~H-RDiKPeNILl~~~~e 314 (475)
T KOG0615|consen 279 KLLFKQLLTAVKYLHSQGIIH-RDIKPENILLSNDAE 314 (475)
T ss_pred HHHHHHHHHHHHHHHHcCccc-ccCCcceEEeccCCc
Confidence 666789999999999999755 344 478888854
No 91
>PF13812 PPR_3: Pentatricopeptide repeat domain
Probab=47.29 E-value=26 Score=15.12 Aligned_cols=16 Identities=19% Similarity=0.144 Sum_probs=11.7
Q ss_pred HHHHHHHHHHHcCCCC
Q psy3109 7 QYLETIRRYIAEGKKF 22 (58)
Q Consensus 7 ~~leAl~~L~~~g~~p 22 (58)
.++..++.+.+.|.+|
T Consensus 19 ~a~~~~~~M~~~gv~P 34 (34)
T PF13812_consen 19 AALQLFDEMKEQGVKP 34 (34)
T ss_pred HHHHHHHHHHHhCCCC
Confidence 3567777788888776
No 92
>COG4821 Uncharacterized protein containing SIS (Sugar ISomerase) phosphosugar binding domain [General function prediction only]
Probab=43.64 E-value=39 Score=23.22 Aligned_cols=25 Identities=16% Similarity=0.206 Sum_probs=18.9
Q ss_pred HHHHHHHHHHHcCCCCCceEEEEeeCC
Q psy3109 7 QYLETIRRYIAEGKKFKRTIHVSFVPG 33 (58)
Q Consensus 7 ~~leAl~~L~~~g~~p~r~I~l~f~~D 33 (58)
...||++++.+.|+.| ||+++-+-|
T Consensus 199 ~faeai~~mv~~g~~p--Pvf~S~Nid 223 (243)
T COG4821 199 TFAEAIELMVEKGYTP--PVFLSANID 223 (243)
T ss_pred HHHHHHHHHHhCCCCC--CeeeecCCC
Confidence 4578999999999999 466655443
No 93
>PF04055 Radical_SAM: Radical SAM superfamily; InterPro: IPR007197 Radical SAM proteins catalyze diverse reactions, including unusual methylations, isomerization, sulphur insertion, ring formation, anaerobic oxidation and protein radical formation. Evidence exists that these proteins generate a radical species by reductive cleavage of S:-adenosylmethionine (SAM) through an unusual Fe-S centre [, ].; GO: 0003824 catalytic activity, 0051536 iron-sulfur cluster binding; PDB: 2A5H_D 3T7V_A 3C8F_A 3CB8_A 2FB2_A 2FB3_A 3CIX_A 3IIX_A 3IIZ_A 3CIW_A ....
Probab=43.31 E-value=52 Score=18.62 Aligned_cols=30 Identities=23% Similarity=0.209 Sum_probs=21.2
Q ss_pred HHHHHHHHHHHHHcCCCCCceEEEEeeCCCC
Q psy3109 5 GIQYLETIRRYIAEGKKFKRTIHVSFVPGPV 35 (58)
Q Consensus 5 ~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE 35 (58)
.-..+++++.+.+.|+.+ ....+...++|.
T Consensus 126 ~~~~~~~l~~l~~~g~~~-~~~~i~~~~~~~ 155 (166)
T PF04055_consen 126 FERVLEALERLKEAGIPR-VIIFIVGLPGEN 155 (166)
T ss_dssp HHHHHHHHHHHHHTTSET-EEEEEEEBTTTS
T ss_pred HHHHHHHHHHHHHcCCCc-EEEEEEEeCCCC
Confidence 345689999999999755 555566666554
No 94
>PF04114 Gaa1: Gaa1-like, GPI transamidase component ; InterPro: IPR007246 GPI (glycosyl phosphatidyl inositol) transamidase is a multiprotein complex required for a terminal step of adding the glycosylphosphatidylinositol (GPI) anchor attachment onto proteins. Gpi16, Gpi8 and Gaa1 form a sub-complex of the GPI transamidase.; GO: 0016021 integral to membrane, 0042765 GPI-anchor transamidase complex
Probab=42.38 E-value=33 Score=25.43 Aligned_cols=33 Identities=12% Similarity=0.014 Sum_probs=24.4
Q ss_pred HHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCC
Q psy3109 3 CVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVT 36 (58)
Q Consensus 3 ~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~ 36 (58)
.+++..+.-++.+++..+. .+||.++|+.||..
T Consensus 37 ~~v~l~lal~~~~~~~~~w-sKDii~l~~~~~~~ 69 (504)
T PF04114_consen 37 GGVALALALARYFRRQSYW-SKDIIFLFTDDELA 69 (504)
T ss_pred hhHHHHHHHHHHhhhchhh-hccEEEEecCCcch
Confidence 4566777777777777654 78999999876653
No 95
>PF14681 UPRTase: Uracil phosphoribosyltransferase; PDB: 1V9S_B 1UPF_A 1UPU_D 1JLR_B 1BD4_A 1BD3_C 1JLS_D 1XTV_C 1XTU_H 3G6W_C ....
Probab=39.82 E-value=19 Score=23.34 Aligned_cols=30 Identities=20% Similarity=0.174 Sum_probs=20.6
Q ss_pred HHHHHHHHHHHHcCCCCCceEEEEeeCCCC
Q psy3109 6 IQYLETIRRYIAEGKKFKRTIHVSFVPGPV 35 (58)
Q Consensus 6 ~~~leAl~~L~~~g~~p~r~I~l~f~~DEE 35 (58)
-.++.|++.|++.|..+++-+.+.+.+-+|
T Consensus 135 ~s~~~ai~~L~~~G~~~~~I~~v~~ias~~ 164 (207)
T PF14681_consen 135 GSAIAAIEILKEHGVPEENIIIVSVIASPE 164 (207)
T ss_dssp HHHHHHHHHHHHTTG-GGEEEEEEEEEEHH
T ss_pred hhHHHHHHHHHHcCCCcceEEEEEEEecHH
Confidence 356788999999998777666666665444
No 96
>PF05237 MoeZ_MoeB: MoeZ/MoeB domain; InterPro: IPR007901 This putative domain is found in the MoeZ protein and the MoeB protein. The domain has two CXXC motifs that are only partly conserved. MoeZ is necessary for the synthesis of pyridine-2,6-bis(thiocarboxylic acid), a small secreted metabolite that has a high affinity for transition metals, increases iron uptake efficiency by 20% in Pseudomonas stutzeri, has the ability to reduce both soluble and mineral forms of iron, and has antimicrobial activity towards several species of bacteria. MoeB is the molybdopterin synthase activating enzyme in the molybdopterin cofactor biosynthesis pathway. Both these enzymes are members of a superfamily consisting of related but structurally distinct proteins that are members of pathways involved in the transfer of sulphur-containing moieties to metabolites [] and both also contain the UBA/THIF-type NAD/FAD binding fold (IPR000594 from INTERPRO). ; PDB: 1JWA_B 1JW9_B 1JWB_B 1ZKM_D 1ZUD_3 1ZFN_D.
Probab=39.51 E-value=46 Score=18.43 Aligned_cols=21 Identities=19% Similarity=0.240 Sum_probs=10.2
Q ss_pred HHHHHHHHHHHHHHcCCCCCc
Q psy3109 4 VGIQYLETIRRYIAEGKKFKR 24 (58)
Q Consensus 4 ~~~~~leAl~~L~~~g~~p~r 24 (58)
+.+|+.||++.|...+..+..
T Consensus 34 gslqA~eaik~l~g~~~~l~~ 54 (84)
T PF05237_consen 34 GSLQANEAIKLLLGIGEPLSG 54 (84)
T ss_dssp HHHHHHHHHHHHCT-S---BT
T ss_pred HHHHHHHHHHHHHhcCCchhh
Confidence 345666777766655433333
No 97
>PF08415 NRPS: Nonribosomal peptide synthase; InterPro: IPR013624 This domain is found in bacterial non-ribosomal peptide synthetases (NRPS). NRPS are megaenzymes organised as iterative modules, one for each amino acid to be built into the peptide product []. NRPS modules are involved in epothilone biosynthesis (EpoB), myxothiazol biosynthesis (MtaC and MtaD), and other functions []. The NRPS domain tends to be found together with the condensation domain (IPR001242 from INTERPRO) and the phosphopantetheine binding domain (IPR006163 from INTERPRO).
Probab=38.96 E-value=48 Score=17.15 Aligned_cols=27 Identities=19% Similarity=0.260 Sum_probs=16.0
Q ss_pred HHHHHHHHHHHHcCCCCCceEEEEeeC
Q psy3109 6 IQYLETIRRYIAEGKKFKRTIHVSFVP 32 (58)
Q Consensus 6 ~~~leAl~~L~~~g~~p~r~I~l~f~~ 32 (58)
+..++.+|.|.+.+.......-++|++
T Consensus 4 ~sGv~vlRel~r~~~~~~~~~PVVFTS 30 (58)
T PF08415_consen 4 FSGVEVLRELARRGGGRAAVMPVVFTS 30 (58)
T ss_pred ccHHHHHHHHHHhcCCCCCcCCEEEeC
Confidence 346788899988733333334455554
No 98
>COG0035 Upp Uracil phosphoribosyltransferase [Nucleotide transport and metabolism]
Probab=38.39 E-value=42 Score=22.57 Aligned_cols=30 Identities=27% Similarity=0.234 Sum_probs=23.9
Q ss_pred HHHHHHHHHHHHHcCCCCCceEEEEeeCCCC
Q psy3109 5 GIQYLETIRRYIAEGKKFKRTIHVSFVPGPV 35 (58)
Q Consensus 5 ~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE 35 (58)
...++.|++.|++.| .|++-..+.+.+-+|
T Consensus 137 G~s~i~ai~~L~~~G-~~~~I~~v~~vAape 166 (210)
T COG0035 137 GGSAIAAIDLLKKRG-GPKNIKVVSLVAAPE 166 (210)
T ss_pred cHhHHHHHHHHHHhC-CCceEEEEEEEecHH
Confidence 456788999999998 787777777777766
No 99
>cd06287 PBP1_LacI_like_8 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor.
Probab=38.00 E-value=69 Score=20.57 Aligned_cols=30 Identities=10% Similarity=0.052 Sum_probs=22.3
Q ss_pred HHHHHHHHHHHHcCCCCCceEEEEeeCCCC
Q psy3109 6 IQYLETIRRYIAEGKKFKRTIHVSFVPGPV 35 (58)
Q Consensus 6 ~~~leAl~~L~~~g~~p~r~I~l~f~~DEE 35 (58)
..++.++++|++.|.+.+.+|.++-+.|.+
T Consensus 188 ~~A~gvl~al~~~gl~vP~dvsvig~~d~~ 217 (269)
T cd06287 188 AFAVGAVRAATELGRAVPDQLRVVTRYDGL 217 (269)
T ss_pred HHHHHHHHHHHHcCCCCCCceEEEeccCch
Confidence 356678888999999888889887534443
No 100
>COG4703 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=37.76 E-value=80 Score=18.01 Aligned_cols=35 Identities=14% Similarity=0.160 Sum_probs=24.7
Q ss_pred hHHHHHHHHHHHHHHHcCCCCCce----------EEEEeeCCCCC
Q psy3109 2 KCVGIQYLETIRRYIAEGKKFKRT----------IHVSFVPGPVT 36 (58)
Q Consensus 2 K~~~~~~leAl~~L~~~g~~p~r~----------I~l~f~~DEE~ 36 (58)
++.++.++.-++++.+.|-.++++ ..+.|++|+|+
T Consensus 1 ~ShLl~IinRL~~m~edggn~k~~rnFEr~G~vv~eV~ys~~~e~ 45 (74)
T COG4703 1 MSHLLGIINRLEAMQEDGGNEKRQRNFERNGEVVCEVKYSEDNET 45 (74)
T ss_pred CcHHHHHHHHHHHHHhccCCchhhhhhhcCCEEEEEEEecCCCce
Confidence 356778888888888866666651 23778888874
No 101
>PF13377 Peripla_BP_3: Periplasmic binding protein-like domain; PDB: 3K9C_B 3BIL_B 3JVD_B 1ZAY_A 1VPW_A 1DBQ_A 2PUA_A 1QQA_A 1PNR_A 1JHZ_A ....
Probab=37.28 E-value=64 Score=18.59 Aligned_cols=28 Identities=18% Similarity=0.119 Sum_probs=21.1
Q ss_pred HHHHHHHHHHHHcCCCCCceEEEEeeCC
Q psy3109 6 IQYLETIRRYIAEGKKFKRTIHVSFVPG 33 (58)
Q Consensus 6 ~~~leAl~~L~~~g~~p~r~I~l~f~~D 33 (58)
..++.+++.|.+.|...++++.++...|
T Consensus 78 ~~a~~~~~~l~~~g~~vP~di~vv~~~~ 105 (160)
T PF13377_consen 78 RLALGVLRALRELGIRVPQDISVVSFDD 105 (160)
T ss_dssp HHHHHHHHHHHHTTSCTTTTSEEEEESS
T ss_pred HHHHHHHHHHHHcCCcccccccEEEecC
Confidence 3566778889999998777777766655
No 102
>PF04028 DUF374: Domain of unknown function (DUF374); InterPro: IPR007172 This is a bacterial domain of unknown function.
Probab=36.33 E-value=78 Score=17.63 Aligned_cols=25 Identities=20% Similarity=0.309 Sum_probs=17.3
Q ss_pred hHHHHHHHHHHHHHHHcCCCCCceEEEEeeCC
Q psy3109 2 KCVGIQYLETIRRYIAEGKKFKRTIHVSFVPG 33 (58)
Q Consensus 2 K~~~~~~leAl~~L~~~g~~p~r~I~l~f~~D 33 (58)
|++..+..+.++.|+ +| ..+++++|
T Consensus 43 rgg~~Alr~~~~~lk-~G------~~~~itpD 67 (74)
T PF04028_consen 43 RGGARALREMLRALK-EG------YSIAITPD 67 (74)
T ss_pred CcHHHHHHHHHHHHH-CC------CeEEEeCC
Confidence 667777777777776 44 25677777
No 103
>cd06301 PBP1_rhizopine_binding_like Periplasmic binding proteins specific to rhizopines. Periplasmic binding proteins specific to rhizopines, which are simple sugar-like compounds produced in the nodules induced by the symbiotic root nodule bacteria, such as Rhizobium and Sinorhizobium. Rhizopine-binding-like proteins from other bacteria are also included. Two inositol based rhizopine compounds are known to date: L-3-O-methly-scyllo-inosamine (3-O-MSI) and scyllo-inosamine. Bacterial strains that can metabolize rhizopine have a greater competitive advantage in nodulation and rhizopine synthesis is regulated by NifA/NtrA regulatory transcription activators which are maximally expressed at the onset of nitrogen fixation in bacteroids. The members of this group belong to the pentose/hexose sugar-binding protein family of the type I periplasmic binding protein superfamily.
Probab=35.71 E-value=66 Score=20.10 Aligned_cols=27 Identities=26% Similarity=0.262 Sum_probs=18.6
Q ss_pred HHHHHHHHHHHcCCCCCceEEEEeeCCC
Q psy3109 7 QYLETIRRYIAEGKKFKRTIHVSFVPGP 34 (58)
Q Consensus 7 ~~leAl~~L~~~g~~p~r~I~l~f~~DE 34 (58)
.++.+++.+++.|..| +++.++-+-|.
T Consensus 194 ~a~~~~~~l~~~g~~~-~di~ivg~d~~ 220 (272)
T cd06301 194 MALGAIMALKAAGKSD-KDVPVAGIDGT 220 (272)
T ss_pred hHHHHHHHHHHcCCCC-CCcEEEeeCCC
Confidence 4567888888999876 56665555443
No 104
>PRK05943 50S ribosomal protein L25; Reviewed
Probab=35.16 E-value=60 Score=18.54 Aligned_cols=21 Identities=19% Similarity=0.139 Sum_probs=17.9
Q ss_pred HHHHHHHcCCCCCceEEEEeeCCCC
Q psy3109 11 TIRRYIAEGKKFKRTIHVSFVPGPV 35 (58)
Q Consensus 11 Al~~L~~~g~~p~r~I~l~f~~DEE 35 (58)
+.++|.++|+.| -++++.++|
T Consensus 16 ~~r~lR~~G~vP----aViYG~~~~ 36 (94)
T PRK05943 16 ASRRLRRAGKFP----AIIYGGNEA 36 (94)
T ss_pred HHHHHHHCCCCC----EEEECCCCC
Confidence 568999999999 788888877
No 105
>COG2875 CobM Precorrin-4 methylase [Coenzyme metabolism]
Probab=34.89 E-value=79 Score=21.97 Aligned_cols=45 Identities=11% Similarity=0.094 Sum_probs=28.3
Q ss_pred HHHHHHHHHHcCCCCCceEEEEe---eCCCCCCChhhhHHHHHHHHHHhh
Q psy3109 8 YLETIRRYIAEGKKFKRTIHVSF---VPGPVTQNETCGLAAANSVLYLLG 54 (58)
Q Consensus 8 ~leAl~~L~~~g~~p~r~I~l~f---~~DEE~Gg~~~g~~~~~s~~~~~~ 54 (58)
+=..++.|..-|+.|.-|+.+++ |+||++ -++-..=++.+.-.-|
T Consensus 171 I~~vv~~L~~g~y~~dtPVaVV~rAsWpDe~i--i~GTL~dIa~kv~~~~ 218 (254)
T COG2875 171 IDKVVEELLEGGYPPDTPVAVVYRASWPDEKI--IRGTLEDIAEKVKEAG 218 (254)
T ss_pred HHHHHHHHhcCCCCCCCCEEEEEecCCCcccE--EEeeHHHHHHHHHhcC
Confidence 34567788887787777776665 899985 2334444445544443
No 106
>TIGR01481 ccpA catabolite control protein A. Catabolite control protein A is a LacI family global transcriptional regulator found in Gram-positive bacteria. CcpA is involved in repressing carbohydrate utilization genes [ex: alpha-amylase (amyE), acetyl-coenzyme A synthase (acsA)] and in activating genes involved in transporting excess carbon from the cell [ex: acetate kinase (ackA), alpha-acetolactate synthase (alsS)]. Additionally, disruption of CcpA in Bacillus megaterium, Staphylococcus xylosus, Lactobacillus casei and Lactocacillus pentosus also decreases growth rate, which suggests CcpA is involved in the regulation of other metabolic pathways.
Probab=34.65 E-value=87 Score=20.39 Aligned_cols=30 Identities=10% Similarity=0.011 Sum_probs=23.0
Q ss_pred HHHHHHHHHHHHcCCCCCceEEEEeeCCCC
Q psy3109 6 IQYLETIRRYIAEGKKFKRTIHVSFVPGPV 35 (58)
Q Consensus 6 ~~~leAl~~L~~~g~~p~r~I~l~f~~DEE 35 (58)
.+++.+++.|.+.|.+.+.+|.++-+-|-+
T Consensus 246 ~~A~g~~~al~~~g~~vP~dvsvvgfd~~~ 275 (329)
T TIGR01481 246 EMAAGILNAAMDAGIKVPEDLEVITSNNTR 275 (329)
T ss_pred HHHHHHHHHHHHcCCCCCCceEEEeeCCch
Confidence 356678899999999877888877665554
No 107
>PRK11191 RNase E inhibitor protein; Provisional
Probab=34.14 E-value=1.2e+02 Score=18.96 Aligned_cols=31 Identities=19% Similarity=0.215 Sum_probs=24.2
Q ss_pred HHHHHHHHHHHHHcCCCCC--ceEEEEeeCCCC
Q psy3109 5 GIQYLETIRRYIAEGKKFK--RTIHVSFVPGPV 35 (58)
Q Consensus 5 ~~~~leAl~~L~~~g~~p~--r~I~l~f~~DEE 35 (58)
...--+.++.|++.|..|. |+|...|.+..+
T Consensus 11 ~~~~~eVi~~L~edGsd~~~~~~IEH~~~f~d~ 43 (138)
T PRK11191 11 REETREIIEELLEDGSDPDALYTIEHHFSADDF 43 (138)
T ss_pred HHHHHHHHHHHHHcCCCcCCCEEEEEEEecCCH
Confidence 4555678899999998775 899988876655
No 108
>cd06276 PBP1_FucR_like Ligand-binding domain of a transcription repressor, FucR, which functions as a molecular sensor of L-fucose availability. Ligand-binding domain of a transcription repressor, FucR, which functions as a molecular sensor of L-fucose availability. FcuR acts as an inducer of fucRRIAK and as a corepressor of another locus that regulates production of fucosylated glycans. FcuR and its close homologs in this group are a member of the LacI-GalR family repressors that are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes t
Probab=33.49 E-value=1.1e+02 Score=19.47 Aligned_cols=31 Identities=10% Similarity=0.046 Sum_probs=25.4
Q ss_pred HHHHHHHHHHHHcCCCCCceEEEEeeCCCCC
Q psy3109 6 IQYLETIRRYIAEGKKFKRTIHVSFVPGPVT 36 (58)
Q Consensus 6 ~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~ 36 (58)
..++.+++.+.+.|.+.+.+|.++-+.|.+.
T Consensus 172 ~~A~g~~~~l~~~g~~iP~disvigfd~~~~ 202 (247)
T cd06276 172 TDLVFLIKKARESGLLLGKDIGIISYNDTPL 202 (247)
T ss_pred HHHHHHHHHHHHcCCcCCceeEEEEecCchh
Confidence 4567888999999998888999888777664
No 109
>KOG2170|consensus
Probab=33.27 E-value=1.6e+02 Score=21.42 Aligned_cols=48 Identities=19% Similarity=0.136 Sum_probs=34.7
Q ss_pred HHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHHHHHHhhhcc
Q psy3109 8 YLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANSVLYLLGLMS 57 (58)
Q Consensus 8 ~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s~~~~~~~~~ 57 (58)
++.|++....+. .|++++.++|-.--=||-. +=..+++...|..|++|
T Consensus 94 Vv~alk~~~~n~-~p~KPLvLSfHG~tGTGKN-~Va~iiA~n~~~~Gl~S 141 (344)
T KOG2170|consen 94 VVNALKSHWANP-NPRKPLVLSFHGWTGTGKN-YVAEIIAENLYRGGLRS 141 (344)
T ss_pred HHHHHHHHhcCC-CCCCCeEEEecCCCCCchh-HHHHHHHHHHHhccccc
Confidence 455666666665 6999999999755544432 45777888888888887
No 110
>cd06272 PBP1_hexuronate_repressor_like Ligand-binding domain of DNA transcription repressor for the hexuronate utilization operon from Bacillus species and its close homologs from other bacteria, all of which are a member of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressor for the hexuronate utilization operon from Bacillus species and its close homologs from other bacteria, all of which are a member of the LacI-GalR family of bacterial transcription regulators. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor
Probab=32.50 E-value=93 Score=19.35 Aligned_cols=28 Identities=11% Similarity=0.035 Sum_probs=20.2
Q ss_pred HHHHHHHHHHHHcCCCCCceEEEEeeCC
Q psy3109 6 IQYLETIRRYIAEGKKFKRTIHVSFVPG 33 (58)
Q Consensus 6 ~~~leAl~~L~~~g~~p~r~I~l~f~~D 33 (58)
.+++.+++.+++.|.+.+.+|.++-+.|
T Consensus 182 ~~a~~~~~~l~~~g~~vp~dv~vvg~d~ 209 (261)
T cd06272 182 DIALGVLSALNKQGISIPEDIEIISYDN 209 (261)
T ss_pred HHHHHHHHHHHHhCCCCCCceEEEeeCC
Confidence 3456788889999987667777665554
No 111
>PF07708 Tash_PEST: Tash protein PEST motif; InterPro: IPR011695 The PEST motif is found in one or more copies in Tash AT-hook proteins from Theileria annulata. Tash proteins are transported to the host nucleus and are thought to be involved in pathogenesis []. The PEST motif is often found in conjunction with the (IPR007480 from INTERPRO), whose function is unknown. These repeats may be part of the PEST motif (a signal for rapid proteolytic degradation) [], though this is not proven. This motif is also found in other T. annulata proteins, which have no other known domains.
Probab=30.81 E-value=48 Score=14.30 Aligned_cols=12 Identities=25% Similarity=0.147 Sum_probs=10.2
Q ss_pred ceEEEEeeCCCC
Q psy3109 24 RTIHVSFVPGPV 35 (58)
Q Consensus 24 r~I~l~f~~DEE 35 (58)
-+|.+=+.+|||
T Consensus 6 Eti~vEi~SDee 17 (19)
T PF07708_consen 6 ETIPVEIGSDEE 17 (19)
T ss_pred ceEEEEeccccc
Confidence 578899999987
No 112
>PF04023 FeoA: FeoA domain; InterPro: IPR007167 This entry represents the core domain of the ferrous iron (Fe2+) transport protein FeoA found in bacteria. This domain also occurs at the C terminus in related proteins. The transporter Feo is composed of three proteins: FeoA a small, soluble SH3-domain protein probably located in the cytosol; FeoB, a large protein with a cytosolic N-terminal G-protein domain and a C-terminal integral inner-membrane domain containing two 'Gate' motifs which likely functions as the Fe2+ permease; and FeoC, a small protein apparently functioning as an [Fe-S]-dependent transcriptional repressor [, ]. Feo allows the bacterial cell to acquire iron from its environment. ; GO: 0005506 iron ion binding; PDB: 3HRT_B 3HRS_A 3HRU_B 1G3T_A 1C0W_B 1BI3_A 1QVP_A 1BI1_A 1BYM_A 2QQB_A ....
Probab=30.76 E-value=87 Score=16.32 Aligned_cols=21 Identities=29% Similarity=0.194 Sum_probs=16.8
Q ss_pred HHHHHHHHcCCCCCceEEEEe
Q psy3109 10 ETIRRYIAEGKKFKRTIHVSF 30 (58)
Q Consensus 10 eAl~~L~~~g~~p~r~I~l~f 30 (58)
+.+++|.+.|+.|-..|.+.-
T Consensus 23 ~~~~~L~~lGl~~G~~i~v~~ 43 (74)
T PF04023_consen 23 ELLRRLADLGLTPGSEITVIR 43 (74)
T ss_dssp HHHHHHHHCT-STTEEEEEEE
T ss_pred HHHHHHHHCCCCCCCEEEEEE
Confidence 467899999999998888873
No 113
>cd01541 PBP1_AraR Ligand-binding domain of DNA transcription repressor specific for arabinose (AraR) which is a member of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressor specific for arabinose (AraR) which is a member of the LacI-GalR family of bacterial transcription regulators. The ligand-binding domain of AraR is structurally homologous to the periplasmic sugar-binding domain of ABC-type transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which i
Probab=30.43 E-value=1.2e+02 Score=18.96 Aligned_cols=29 Identities=21% Similarity=0.062 Sum_probs=19.6
Q ss_pred HHHHHHHHHHHcCCCCCceEEEEeeCCCC
Q psy3109 7 QYLETIRRYIAEGKKFKRTIHVSFVPGPV 35 (58)
Q Consensus 7 ~~leAl~~L~~~g~~p~r~I~l~f~~DEE 35 (58)
..+-+++.+++.|.+.+.++.++-..+.+
T Consensus 194 ~a~g~~~al~~~g~~~p~dv~vvg~d~~~ 222 (273)
T cd01541 194 IALRVIDLLKELGLKIPEDISVVGFDDSY 222 (273)
T ss_pred HHHHHHHHHHHcCCCCCCcEEEEEcCCcc
Confidence 45667888889998766677665554443
No 114
>cd01542 PBP1_TreR_like Ligand-binding domain of DNA transcription repressor specific for trehalose (TreR) which is a member of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressor specific for trehalose (TreR) which is a member of the LacI-GalR family of bacterial transcription regulators. The ligand-binding domain of TreR is structurally homologous to the periplasmic sugar-binding domain of ABC-type transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding wh
Probab=30.05 E-value=1.2e+02 Score=18.73 Aligned_cols=30 Identities=17% Similarity=0.155 Sum_probs=21.3
Q ss_pred HHHHHHHHHHHHcCCCCCceEEEEeeCCCC
Q psy3109 6 IQYLETIRRYIAEGKKFKRTIHVSFVPGPV 35 (58)
Q Consensus 6 ~~~leAl~~L~~~g~~p~r~I~l~f~~DEE 35 (58)
.+++.+++.|.+.|.+.+++|.++-+.|-+
T Consensus 183 ~~a~g~~~~l~~~g~~vp~di~v~g~d~~~ 212 (259)
T cd01542 183 TIALGAMKYLQELGRRIPEDISVAGFGGYE 212 (259)
T ss_pred HHHHHHHHHHHHcCCCCCCceEEEecCCch
Confidence 346678888889998877788766554433
No 115
>TIGR03642 cas_csx13 CRISPR-associated protein, Csx13 family. This model describes a protein N-terminal protein sequence domain strictly associated with CRISPR and CRISPR-associated protein systems. This model and TIGR02584 identify two separate clades from a larger homology domain family, both CRISPR-associated, while other homologs are found that may not be. Members are found in bacteria that include Pelotomaculum thermopropionicum SI, Thermoanaerobacter tengcongensis MB4, and Roseiflexus sp. RS-1, and in archaea that include Thermoplasma volcanium, Picrophilus torridus, and Methanospirillum hungatei. The molecular function is unknown.
Probab=29.16 E-value=93 Score=19.12 Aligned_cols=28 Identities=14% Similarity=0.198 Sum_probs=22.4
Q ss_pred HHHHHHHHHHcCCCCCceEEEEeeCCCCC
Q psy3109 8 YLETIRRYIAEGKKFKRTIHVSFVPGPVT 36 (58)
Q Consensus 8 ~leAl~~L~~~g~~p~r~I~l~f~~DEE~ 36 (58)
+-|++.+|.+.|. |...|+++-+-|+++
T Consensus 7 vTEtl~aL~~~g~-~i~ev~lI~T~~~~v 34 (124)
T TIGR03642 7 ITEAIDYLKKKGE-PISDVILIYTKDPYV 34 (124)
T ss_pred HHHHHHHHHhcCC-CCCeEEEEEcCCHHH
Confidence 4588899999986 567899998888775
No 116
>PF08967 DUF1884: Domain of unknown function (DUF1884); InterPro: IPR014418 This group represents an uncharacterised conserved protein.; PDB: 2PK8_A.
Probab=28.96 E-value=71 Score=18.73 Aligned_cols=15 Identities=20% Similarity=0.166 Sum_probs=8.7
Q ss_pred HHHHHHHHHcCCCCC
Q psy3109 9 LETIRRYIAEGKKFK 23 (58)
Q Consensus 9 leAl~~L~~~g~~p~ 23 (58)
=++++.|+..|+.|.
T Consensus 15 e~~inELk~dG~ePD 29 (85)
T PF08967_consen 15 EEKINELKEDGFEPD 29 (85)
T ss_dssp HHHHHHHHHTT----
T ss_pred HHHHHHHHhcCCCCC
Confidence 356778999999993
No 117
>PF02780 Transketolase_C: Transketolase, C-terminal domain; InterPro: IPR005476 Transketolase 2.2.1.1 from EC (TK) catalyzes the reversible transfer of a two-carbon ketol unit from xylulose 5-phosphate to an aldose receptor, such as ribose 5-phosphate, to form sedoheptulose 7-phosphate and glyceraldehyde 3- phosphate. This enzyme, together with transaldolase, provides a link between the glycolytic and pentose-phosphate pathways. TK requires thiamine pyrophosphate as a cofactor. In most sources where TK has been purified, it is a homodimer of approximately 70 Kd subunits. TK sequences from a variety of eukaryotic and prokaryotic sources [, ] show that the enzyme has been evolutionarily conserved. In the peroxisomes of methylotrophic yeast Pichia angusta (Yeast) (Hansenula polymorpha), there is a highly related enzyme, dihydroxy-acetone synthase (DHAS) 2.2.1.3 from EC (also known as formaldehyde transketolase), which exhibits a very unusual specificity by including formaldehyde amongst its substrates. 1-deoxyxylulose-5-phosphate synthase (DXP synthase) [] is an enzyme so far found in bacteria (gene dxs) and plants (gene CLA1) which catalyzes the thiamine pyrophosphoate-dependent acyloin condensation reaction between carbon atoms 2 and 3 of pyruvate and glyceraldehyde 3-phosphate to yield 1-deoxy-D- xylulose-5-phosphate (dxp), a precursor in the biosynthetic pathway to isoprenoids, thiamine (vitamin B1), and pyridoxol (vitamin B6). DXP synthase is evolutionary related to TK. The N-terminal section, contains a histidine residue which appears to function in proton transfer during catalysis []. In the central section there are conserved acidic residues that are part of the active cleft and may participate in substrate-binding []. This family includes transketolase enzymes 2.2.1.1 from EC and also partially matches to 2-oxoisovalerate dehydrogenase beta subunit P37941 from SWISSPROT 1.2.4.4 from EC. Both these enzymes utilise thiamine pyrophosphate as a cofactor, suggesting there may be common aspects in their mechanism of catalysis.; GO: 0003824 catalytic activity, 0008152 metabolic process; PDB: 1NGS_B 1TKA_A 1TRK_B 1TKB_A 1GPU_B 1AY0_B 1TKC_B 2E6K_A 3JU3_A 2R8P_B ....
Probab=28.21 E-value=55 Score=18.94 Aligned_cols=32 Identities=16% Similarity=-0.034 Sum_probs=23.4
Q ss_pred HHHHHHHHHHHHHHcCCCCC-ceEEEEeeCCCC
Q psy3109 4 VGIQYLETIRRYIAEGKKFK-RTIHVSFVPGPV 35 (58)
Q Consensus 4 ~~~~~leAl~~L~~~g~~p~-r~I~l~f~~DEE 35 (58)
++-.+++|.+.|.++|+... -+++.+--.|+|
T Consensus 20 ~~~~al~A~~~L~~~Gi~~~vi~~~~i~P~d~~ 52 (124)
T PF02780_consen 20 MVEEALEAAEELEEEGIKAGVIDLRTIKPFDEE 52 (124)
T ss_dssp HHHHHHHHHHHHHHTTCEEEEEEEEEEESSBHH
T ss_pred HHHHHHHHHHHHHHcCCceeEEeeEEEeccccc
Confidence 45678999999999997654 455666666765
No 118
>PF02070 NMU: Neuromedin U; InterPro: IPR008199 Neuromedin U (NmU) [, ] is a vertebrate peptide which stimulates uterine smooth muscle contraction and causes selective vasoconstriction. Like most other active peptides, it is proteolytically processed from a larger precursor protein. The mature peptides are 8 (NmU-8) to 25 (NmU-25) residues long and C-terminally amidated. The sequence of the C-terminal extremity of NmU is extremely well conserved.; GO: 0006940 regulation of smooth muscle contraction
Probab=28.07 E-value=20 Score=16.48 Aligned_cols=13 Identities=23% Similarity=0.233 Sum_probs=8.8
Q ss_pred eeCCCCCCChhhhH
Q psy3109 30 FVPGPVTQNETCGL 43 (58)
Q Consensus 30 f~~DEE~Gg~~~g~ 43 (58)
|-.|||..|+ +++
T Consensus 1 fk~dee~QgP-~~~ 13 (25)
T PF02070_consen 1 FKVDEEFQGP-GGI 13 (25)
T ss_pred CccchhccCC-ccc
Confidence 3469998887 443
No 119
>cd06298 PBP1_CcpA_like Ligand-binding domain of the catabolite control protein A (CcpA), which functions as the major transcriptional regulator of carbon catabolite repression/regulation. Ligand-binding domain of the catabolite control protein A (CcpA), which functions as the major transcriptional regulator of carbon catabolite repression/regulation (CCR), a process in which enzymes necessary for the metabolism of alternative sugars are inhibited in the presence of glucose. In gram-positive bacteria, CCR is controlled by HPr, a phosphoenolpyruvate:sugar phsophotrasnferase system (PTS) and a transcriptional regulator CcpA. Moreover, CcpA can regulate sporulation and antibiotic resistance as well as play a role in virulence development of certain pathogens such as the group A streptococcus. The ligand binding domain of CcpA is a member of the LacI-GalR family of bacterial transcription regulators.
Probab=28.00 E-value=1.1e+02 Score=18.94 Aligned_cols=29 Identities=7% Similarity=-0.011 Sum_probs=21.0
Q ss_pred HHHHHHHHHHHcCCCCCceEEEEeeCCCC
Q psy3109 7 QYLETIRRYIAEGKKFKRTIHVSFVPGPV 35 (58)
Q Consensus 7 ~~leAl~~L~~~g~~p~r~I~l~f~~DEE 35 (58)
.++.+++.|++.|...+++|.++-.-|.+
T Consensus 188 ~a~~~~~~l~~~g~~vp~di~vvg~d~~~ 216 (268)
T cd06298 188 LAIGILNAAQDAGLKVPEDFEIIGFNNTK 216 (268)
T ss_pred HHHHHHHHHHHcCCCCccceEEEeeccHH
Confidence 35677888889998777788766665544
No 120
>KOG1067|consensus
Probab=27.79 E-value=51 Score=25.91 Aligned_cols=45 Identities=27% Similarity=0.262 Sum_probs=33.6
Q ss_pred HHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChhhhHHHHHHHHH
Q psy3109 5 GIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNETCGLAAANSVLY 51 (58)
Q Consensus 5 ~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~~g~~~~~s~~~ 51 (58)
..+++.++++|.+.--+++|+...+|.+|.|..-. -..++..+++
T Consensus 253 ~q~~i~~i~~L~k~~Gk~Kr~~~~~~~p~pel~K~--v~~la~erl~ 297 (760)
T KOG1067|consen 253 AQQIIQGIERLAKKYGKQKREPQKLFLPDPELVKH--VHKLAMERLY 297 (760)
T ss_pred HHHHHHHHHHHHHHhCccccchhhhcCCCHHHHHH--HHHHHHHHHH
Confidence 35778899999877668999999999999997532 3445555444
No 121
>PF13738 Pyr_redox_3: Pyridine nucleotide-disulphide oxidoreductase; PDB: 3D1C_A 4A9W_B 2YLX_A 2YM2_A 2YLW_A 2YLR_A 2YM1_A 2YLS_A 1W4X_A 2YLT_A ....
Probab=27.71 E-value=1.3e+02 Score=18.12 Aligned_cols=31 Identities=10% Similarity=-0.075 Sum_probs=20.4
Q ss_pred HHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChh
Q psy3109 6 IQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNET 40 (58)
Q Consensus 6 ~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~ 40 (58)
.+.+.+...|++.|..+ .++|-..++.|+..
T Consensus 7 ~aGl~~a~~l~~~g~~~----v~v~e~~~~~Gg~w 37 (203)
T PF13738_consen 7 PAGLAAAAHLLERGIDP----VVVLERNDRPGGVW 37 (203)
T ss_dssp HHHHHHHHHHHHTT-------EEEEESSSSSTTHH
T ss_pred HHHHHHHHHHHhCCCCc----EEEEeCCCCCCCee
Confidence 45677778889988744 56677777787753
No 122
>cd06285 PBP1_LacI_like_7 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor.
Probab=27.69 E-value=1.3e+02 Score=18.76 Aligned_cols=27 Identities=4% Similarity=-0.017 Sum_probs=19.5
Q ss_pred HHHHHHHHHHHcCCCCCceEEEEeeCC
Q psy3109 7 QYLETIRRYIAEGKKFKRTIHVSFVPG 33 (58)
Q Consensus 7 ~~leAl~~L~~~g~~p~r~I~l~f~~D 33 (58)
+++.+++.+++.|.+.+.++.++-+-+
T Consensus 186 ~a~g~~~~l~~~g~~~p~di~iig~d~ 212 (265)
T cd06285 186 AAIGVMGAARDRGLRVPDDVALVGYND 212 (265)
T ss_pred HHHHHHHHHHHcCCCCCcceEEEeecC
Confidence 456788889999987667776555444
No 123
>PRK11303 DNA-binding transcriptional regulator FruR; Provisional
Probab=26.86 E-value=1.3e+02 Score=19.50 Aligned_cols=29 Identities=0% Similarity=-0.007 Sum_probs=21.2
Q ss_pred HHHHHHHHHHHcCCCCCceEEEEeeCCCC
Q psy3109 7 QYLETIRRYIAEGKKFKRTIHVSFVPGPV 35 (58)
Q Consensus 7 ~~leAl~~L~~~g~~p~r~I~l~f~~DEE 35 (58)
+++.+++.|.+.|.+.+.+|-++-+.|.+
T Consensus 249 ~A~g~~~al~~~g~~vP~disv~gfd~~~ 277 (328)
T PRK11303 249 LLQGVLDVLLERPGELPSDLAIATFGDNE 277 (328)
T ss_pred HHHHHHHHHHHcCCCCCCceEEEEeCChH
Confidence 45678889999998766788776555544
No 124
>cd03036 ArsC_like Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC. Proteins containing a redox active CXXC motif like TRX and glutaredoxin (GRX) function as protein disulfide oxidoreductases, altering the redox state of target proteins via the reversible oxidation of the active site dithiol. ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from glutathione via GRX, through a single catalytic cysteine.
Probab=26.73 E-value=77 Score=18.32 Aligned_cols=19 Identities=26% Similarity=0.382 Sum_probs=14.6
Q ss_pred HHHHHHHHcCCCCCceEEE
Q psy3109 10 ETIRRYIAEGKKFKRTIHV 28 (58)
Q Consensus 10 eAl~~L~~~g~~p~r~I~l 28 (58)
++++.|.++...-+|||.+
T Consensus 79 e~~~~l~~~p~LikRPIi~ 97 (111)
T cd03036 79 EALELLSSDGMLIKRPFVV 97 (111)
T ss_pred HHHHHHHhCcCeeeCCEEE
Confidence 5677788887777888875
No 125
>PRK10727 DNA-binding transcriptional regulator GalR; Provisional
Probab=26.62 E-value=1.3e+02 Score=19.78 Aligned_cols=29 Identities=7% Similarity=-0.105 Sum_probs=21.9
Q ss_pred HHHHHHHHHHHHcCCCCCceEEEEeeCCC
Q psy3109 6 IQYLETIRRYIAEGKKFKRTIHVSFVPGP 34 (58)
Q Consensus 6 ~~~leAl~~L~~~g~~p~r~I~l~f~~DE 34 (58)
.+++.++++|.+.|++.+.+|-++-+-|-
T Consensus 247 ~~A~g~~~al~~~G~~vP~disVigfD~~ 275 (343)
T PRK10727 247 SMAAGAMGVLNDNGIDVPGEISLIGFDDV 275 (343)
T ss_pred HHHHHHHHHHHHcCCCCCcceeEEeecCc
Confidence 45677889999999987778887755454
No 126
>cd00495 Ribosomal_L25_TL5_CTC Ribosomal_L25_TL5_CTC: Ribosomal L25/TL5/CTC N-terminal 5S rRNA binding domain. L25 is a single-domain protein, homologous to the N-terminal domain of TL5 and CTC, which each contain two domains. CTC is a known stress protein, and proteins of this family are believed to have two functions, acting as both ribosomal and stress proteins. In Escherichia coli, cells deleted for L25 were found to be viable; however, these cells grew slowly and had impaired protein synthesis capability. In Bacillus subtilis, CTC is induced under stress conditions and located in the ribosome; it has been proposed that CTC may be necessary for accurate translation under stress conditions. Ribosomal_L25_TL5_CTC is found only in bacteria and some plastids. Due to its limited taxonomic diversity and the viability of cells deleted for L25, this protein is not believed to be necessary for ribosomal assembly. Eukaryotes contain a protein called L25, which is not homologous to bacterial L
Probab=26.38 E-value=1.1e+02 Score=17.17 Aligned_cols=21 Identities=29% Similarity=0.249 Sum_probs=17.5
Q ss_pred HHHHHHHcCCCCCceEEEEeeCCCC
Q psy3109 11 TIRRYIAEGKKFKRTIHVSFVPGPV 35 (58)
Q Consensus 11 Al~~L~~~g~~p~r~I~l~f~~DEE 35 (58)
+.++|.++|+.| -++++.++|
T Consensus 15 ~~r~lR~~G~iP----avvYG~~~~ 35 (91)
T cd00495 15 ASRRLRRAGKVP----AVIYGKGKE 35 (91)
T ss_pred HHHHHHHCCCCC----EEEECCCCC
Confidence 578899999999 678888776
No 127
>cd06280 PBP1_LacI_like_4 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor.
Probab=26.06 E-value=1.7e+02 Score=18.21 Aligned_cols=28 Identities=14% Similarity=0.233 Sum_probs=19.3
Q ss_pred HHHHHHHHHHHHcCCCCCceEEEEeeCC
Q psy3109 6 IQYLETIRRYIAEGKKFKRTIHVSFVPG 33 (58)
Q Consensus 6 ~~~leAl~~L~~~g~~p~r~I~l~f~~D 33 (58)
.+++.+++.+++.|.+++.+|.++-+-|
T Consensus 182 ~~a~g~~~~l~~~g~~~p~di~iig~d~ 209 (263)
T cd06280 182 LLLLGALRAVRAAGLRIPQDLALAGFDN 209 (263)
T ss_pred HHHHHHHHHHHHcCCCCCCcEEEEEeCC
Confidence 3456778888888887767776654433
No 128
>COG1057 NadD Nicotinic acid mononucleotide adenylyltransferase [Coenzyme metabolism]
Probab=25.93 E-value=82 Score=20.58 Aligned_cols=26 Identities=8% Similarity=-0.018 Sum_probs=19.7
Q ss_pred HHHHHHHHHHcCCCCCceEEEEeeCCC
Q psy3109 8 YLETIRRYIAEGKKFKRTIHVSFVPGP 34 (58)
Q Consensus 8 ~leAl~~L~~~g~~p~r~I~l~f~~DE 34 (58)
.+.+++.+++. ..|...++++.++|=
T Consensus 86 T~dTl~~~~~~-~~p~~~~~fIiGaD~ 111 (197)
T COG1057 86 TIDTLEHLRQE-YGPDVELYFIIGADN 111 (197)
T ss_pred hHHHHHHHHHH-hCCCCcEEEEEehHH
Confidence 36778888844 347778999999994
No 129
>PF11051 Mannosyl_trans3: Mannosyltransferase putative; InterPro: IPR022751 Alpha-mannosyltransferase is responsible for the addition of residues to the outer chain of core N-linked polysaccharides and to O-linked mannotriose. It is implicated in late Golgi modifications [][][]. The proteins matching this entry are conserved in fungi and also found in some phototrophic organisms.; GO: 0006486 protein glycosylation
Probab=25.58 E-value=1.4e+02 Score=19.98 Aligned_cols=29 Identities=17% Similarity=0.076 Sum_probs=21.8
Q ss_pred HHHHHHHHHHHHcCCCCCceEEEEeeCCCCC
Q psy3109 6 IQYLETIRRYIAEGKKFKRTIHVSFVPGPVT 36 (58)
Q Consensus 6 ~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~ 36 (58)
.+++..++.|.+.|- +-||.+++.+|+|.
T Consensus 14 ~~a~~lI~~LR~~g~--~LPIEI~~~~~~dl 42 (271)
T PF11051_consen 14 WLALRLIRVLRRLGN--TLPIEIIYPGDDDL 42 (271)
T ss_pred HHHHHHHHHHHHhCC--CCCEEEEeCCcccc
Confidence 455677888877764 45799999998886
No 130
>PRK09526 lacI lac repressor; Reviewed
Probab=24.37 E-value=1.9e+02 Score=18.89 Aligned_cols=28 Identities=18% Similarity=0.088 Sum_probs=20.4
Q ss_pred HHHHHHHHHHHHcCCCCCceEEEEeeCC
Q psy3109 6 IQYLETIRRYIAEGKKFKRTIHVSFVPG 33 (58)
Q Consensus 6 ~~~leAl~~L~~~g~~p~r~I~l~f~~D 33 (58)
..++.+++.|.+.|.+.+.+|.++-+-|
T Consensus 250 ~~A~g~~~al~~~g~~vP~disvig~d~ 277 (342)
T PRK09526 250 QMALGVLRALHESGLRVPGQISVIGYDD 277 (342)
T ss_pred HHHHHHHHHHHHcCCCCCCceEEEeeCC
Confidence 4567788899999988777777654444
No 131
>PRK14987 gluconate operon transcriptional regulator; Provisional
Probab=23.89 E-value=1.7e+02 Score=19.13 Aligned_cols=30 Identities=10% Similarity=-0.066 Sum_probs=21.7
Q ss_pred HHHHHHHHHHHHcCCCCCceEEEEeeCCCC
Q psy3109 6 IQYLETIRRYIAEGKKFKRTIHVSFVPGPV 35 (58)
Q Consensus 6 ~~~leAl~~L~~~g~~p~r~I~l~f~~DEE 35 (58)
.+++.++++|++.|.+.+.+|.++-+-|.+
T Consensus 249 ~~A~g~~~al~~~g~~vP~disvigfD~~~ 278 (331)
T PRK14987 249 DLAVGAAFECQRLGLKVPDDMAIAGFHGHD 278 (331)
T ss_pred HHHHHHHHHHHHcCCCCCCccEEEeeCCch
Confidence 356677888889998877888876555443
No 132
>PF00743 FMO-like: Flavin-binding monooxygenase-like; InterPro: IPR020946 Flavin-containing monooxygenases (FMOs) constitute a family of xenobiotic-metabolising enzymes []. Using an NADPH cofactor and FAD prosthetic group, these microsomal proteins catalyse the oxygenation of nucleophilic nitrogen, sulphur, phosphorous and selenium atoms in a range of structurally diverse compounds. FMOs have been implicated in the metabolism of a number of pharmaceuticals, pesticides and toxicants. In man, lack of hepatic FMO-catalysed trimethylamine metabolism results in trimethylaminuria (fish odour syndrome). Five mammalian forms of FMO are now known and have been designated FMO1-FMO5 [, , , , ]. This is a recent nomenclature based on comparison of amino acid sequences, and has been introduced in an attempt to eliminate confusion inherent in multiple, laboratory-specific designations and tissue-based classifications []. Following the determination of the complete nucleotide sequence of Saccharomyces cerevisiae (Baker's yeast) [], a novel gene was found to encode a protein with similarity to mammalian monooygenases.; GO: 0004499 flavin-containing monooxygenase activity, 0050660 flavin adenine dinucleotide binding, 0050661 NADP binding, 0055114 oxidation-reduction process; PDB: 2GVC_E 1VQW_B 2GV8_B 2XVI_B 2XVH_B 2XLS_A 2XLR_A 2XLU_D 2XLP_B 2XVE_A ....
Probab=23.38 E-value=89 Score=23.19 Aligned_cols=32 Identities=16% Similarity=0.140 Sum_probs=24.2
Q ss_pred HHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCChh
Q psy3109 4 VGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQNET 40 (58)
Q Consensus 4 ~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg~~ 40 (58)
...+.|.|++.|+++|+.+ ++|=-.+.+||..
T Consensus 9 aG~sGL~a~k~l~e~g~~~-----~~fE~~~~iGG~W 40 (531)
T PF00743_consen 9 AGPSGLAAAKNLLEEGLEV-----TCFEKSDDIGGLW 40 (531)
T ss_dssp -SHHHHHHHHHHHHTT-EE-----EEEESSSSSSGGG
T ss_pred ccHHHHHHHHHHHHCCCCC-----eEEecCCCCCccC
Confidence 4567899999999999733 5777888888874
No 133
>PF13454 NAD_binding_9: FAD-NAD(P)-binding
Probab=23.16 E-value=1.4e+02 Score=17.89 Aligned_cols=33 Identities=12% Similarity=0.137 Sum_probs=21.7
Q ss_pred HHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCC
Q psy3109 4 VGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQ 37 (58)
Q Consensus 4 ~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~G 37 (58)
+....+.++++|.+.. .++.++.+.++-..+.|
T Consensus 5 ~G~~G~~~l~~L~~~~-~~~~~~~I~vfd~~~~G 37 (156)
T PF13454_consen 5 GGPSGLAVLERLLRQA-DPKPPLEITVFDPSPFG 37 (156)
T ss_pred cCHHHHHHHHHHHHhc-CCCCCCEEEEEcCCCcc
Confidence 3456677888888875 55566666666556664
No 134
>smart00493 TOPRIM topoisomerases, DnaG-type primases, OLD family nucleases and RecR proteins.
Probab=22.83 E-value=1.3e+02 Score=15.52 Aligned_cols=22 Identities=14% Similarity=0.087 Sum_probs=14.7
Q ss_pred HHHHHHcCCCCCceEEEEeeCCCC
Q psy3109 12 IRRYIAEGKKFKRTIHVSFVPGPV 35 (58)
Q Consensus 12 l~~L~~~g~~p~r~I~l~f~~DEE 35 (58)
++.+++.. +.+.|++++-+|+|
T Consensus 38 ~~~l~~~~--~~~~Iii~~D~D~~ 59 (76)
T smart00493 38 IKLLKRLA--KKKEVILATDPDRE 59 (76)
T ss_pred HHHHHHHh--cCCEEEEEcCCChh
Confidence 44444432 23679999999998
No 135
>PRK10745 trkD potassium transport protein Kup; Provisional
Probab=22.81 E-value=1.7e+02 Score=22.73 Aligned_cols=34 Identities=9% Similarity=-0.014 Sum_probs=26.9
Q ss_pred HHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCC
Q psy3109 3 CVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQN 38 (58)
Q Consensus 3 ~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg 38 (58)
+.+.+.+.-.++..+.|+-|+ +.+..++++|.|-
T Consensus 303 AvISGaFSl~~QAi~Lg~~Pr--l~I~hTS~~~~GQ 336 (622)
T PRK10745 303 AVISGVFSLTRQAVRLGYLPP--MRIIHTSEMESGQ 336 (622)
T ss_pred HHHHhHHHHHHHHHHhCCCCC--ceEEeCCCccCCc
Confidence 456667778889999999996 6788999998654
No 136
>PF13592 HTH_33: Winged helix-turn helix
Probab=22.65 E-value=31 Score=18.01 Aligned_cols=24 Identities=8% Similarity=-0.046 Sum_probs=18.4
Q ss_pred HHHHHHcCCCCCceEEEEeeCCCC
Q psy3109 12 IRRYIAEGKKFKRTIHVSFVPGPV 35 (58)
Q Consensus 12 l~~L~~~g~~p~r~I~l~f~~DEE 35 (58)
-+.|++.|+.++++-...--.|||
T Consensus 28 ~~lL~r~G~s~~kp~~~~~k~d~~ 51 (60)
T PF13592_consen 28 YRLLKRLGFSYQKPRPRPPKADEE 51 (60)
T ss_pred HHHHHHcCCccccCCCCcccCCHH
Confidence 355778899888887777777776
No 137
>TIGR00731 ctc_TL5 ribosomal protein L25, Ctc-form. The C-terminal domain of sll1824, an apparent L25 of Synechocystis PCC6803, matches the N-terminal domain of this family. Examples of L25 and Ctc are not separated by a UPGMA tree built on the region of shared homology.
Probab=22.56 E-value=1.2e+02 Score=19.35 Aligned_cols=23 Identities=26% Similarity=0.296 Sum_probs=19.0
Q ss_pred HHHHHHHHcCCCCCceEEEEeeCCCCC
Q psy3109 10 ETIRRYIAEGKKFKRTIHVSFVPGPVT 36 (58)
Q Consensus 10 eAl~~L~~~g~~p~r~I~l~f~~DEE~ 36 (58)
.+.++|.++|+.| -++++.++|.
T Consensus 12 ~~~r~LR~~G~VP----aVvYG~~~~~ 34 (176)
T TIGR00731 12 SAARRIRKEGRIP----AVVYGKGKEN 34 (176)
T ss_pred hHHHHHHHCCCcc----EEEECCCCCC
Confidence 3678999999999 7888888773
No 138
>KOG1164|consensus
Probab=22.49 E-value=1e+02 Score=20.68 Aligned_cols=19 Identities=21% Similarity=0.326 Sum_probs=15.9
Q ss_pred HHHHHHHHHHHHHHcCCCC
Q psy3109 4 VGIQYLETIRRYIAEGKKF 22 (58)
Q Consensus 4 ~~~~~leAl~~L~~~g~~p 22 (58)
..+|.+.|++.+++.|+..
T Consensus 127 ia~q~l~~l~~lH~~G~iH 145 (322)
T KOG1164|consen 127 IAIQNLNALEDLHSKGFIH 145 (322)
T ss_pred HHHHHHHHHHHHHhcCccc
Confidence 3568999999999999755
No 139
>PF10850 DUF2653: Protein of unknown function (DUF2653); InterPro: IPR020516 This entry contains proteins with no known function.
Probab=22.33 E-value=1.5e+02 Score=17.49 Aligned_cols=29 Identities=24% Similarity=0.377 Sum_probs=19.8
Q ss_pred HHHHHHHHHHHHc--CCCCCc-eEEEEeeCCCC
Q psy3109 6 IQYLETIRRYIAE--GKKFKR-TIHVSFVPGPV 35 (58)
Q Consensus 6 ~~~leAl~~L~~~--g~~p~r-~I~l~f~~DEE 35 (58)
+.+++|++.+.+. +..|.+ .|.+-+ -|||
T Consensus 57 ~nlieAIr~~l~~~~~~~p~~~~i~L~l-ddee 88 (91)
T PF10850_consen 57 ANLIEAIRQYLEEEYNMDPFRAGIELEL-DDEE 88 (91)
T ss_pred hhHHHHHHHHHHHHhCCCcchhheEEEE-cccc
Confidence 4678999998755 678887 555544 3555
No 140
>cd01544 PBP1_GalR Ligand-binding domain of DNA transcription repressor GalR which is one of two regulatory proteins involved in galactose transport and metabolism. Ligand-binding domain of DNA transcription repressor GalR which is one of two regulatory proteins involved in galactose transport and metabolism. Transcription of the galactose regulon genes is regulated by Gal iso-repressor (GalS) and Gal repressor (GalR) in different ways, but both repressors recognize the same DNA binding site in the absence of D-galactose. GalR is a dimeric protein like GalS and is exclusively involved in the regulation of galactose permease, the low-affinity galactose transporter. GalS is involved in regulating expression of the high-affinity galactose transporter encoded by the mgl operon. GalS and GalR are members of the LacI-GalR family of transcription regulators and both contain the type I periplasmic binding protein-like fold. Hence, they are structurally homologous to the periplasmic sugar bindi
Probab=22.09 E-value=2.2e+02 Score=17.92 Aligned_cols=28 Identities=14% Similarity=0.126 Sum_probs=18.7
Q ss_pred HHHHHHHHHHHHcCCCCCceEEEEeeCC
Q psy3109 6 IQYLETIRRYIAEGKKFKRTIHVSFVPG 33 (58)
Q Consensus 6 ~~~leAl~~L~~~g~~p~r~I~l~f~~D 33 (58)
.+++.+++.+++.|.+.+++|.++-+-+
T Consensus 189 ~~a~g~~~~l~~~g~~vp~di~v~g~d~ 216 (270)
T cd01544 189 PMAIGALRALQEAGIKVPEDVSVISFND 216 (270)
T ss_pred HHHHHHHHHHHHcCCCCCCceEEEEECC
Confidence 3456677888888887666777555533
No 141
>cd01543 PBP1_XylR Ligand-binding domain of DNA transcription repressor specific for xylose (XylR). Ligand-binding domain of DNA transcription repressor specific for xylose (XylR), a member of the LacI-GalR family of bacterial transcription regulators. The ligand-binding domain of XylR is structurally homologous to the periplasmic sugar-binding domain of ABC-type transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor.
Probab=21.53 E-value=2.2e+02 Score=17.78 Aligned_cols=30 Identities=10% Similarity=-0.013 Sum_probs=19.4
Q ss_pred HHHHHHHHHHHHcCCCCCceEEEEeeCCCC
Q psy3109 6 IQYLETIRRYIAEGKKFKRTIHVSFVPGPV 35 (58)
Q Consensus 6 ~~~leAl~~L~~~g~~p~r~I~l~f~~DEE 35 (58)
..++.+++.+++.|.+.+.+|.++-+-|.+
T Consensus 181 ~~a~g~~~~l~~~g~~vp~di~vigfd~~~ 210 (265)
T cd01543 181 ARARQLLEACRRAGIAVPEEVAVLGVDNDE 210 (265)
T ss_pred HHHHHHHHHHHHhCCCCCCceEEEeeCCch
Confidence 345667788888887655666655554543
No 142
>cd01575 PBP1_GntR Ligand-binding domain of DNA transcription repressor GntR specific for gluconate, a member of the LacI-GalR family of bacterial transcription regulators. This group represents the ligand-binding domain of DNA transcription repressor GntR specific for gluconate, a member of the LacI-GalR family of bacterial transcription regulators. The ligand-binding domain of GntR is structurally homologous to the periplasmic sugar-binding domain of ABC-type transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding,
Probab=21.24 E-value=2e+02 Score=17.66 Aligned_cols=25 Identities=12% Similarity=0.058 Sum_probs=17.4
Q ss_pred HHHHHHHHHHHcCCCCCceEEEEee
Q psy3109 7 QYLETIRRYIAEGKKFKRTIHVSFV 31 (58)
Q Consensus 7 ~~leAl~~L~~~g~~p~r~I~l~f~ 31 (58)
.++.+++.+++.|.+++.+|.++-.
T Consensus 188 ~a~~~~~~l~~~g~~~p~di~vig~ 212 (268)
T cd01575 188 LALGALFECQRRGISVPEDIAIAGF 212 (268)
T ss_pred HHHHHHHHHHHhCCCCCcceEEEec
Confidence 4567888899999876666654433
No 143
>PF13662 Toprim_4: Toprim domain; PDB: 1EQN_E 1DD9_A 3B39_B 1DDE_A.
Probab=21.21 E-value=81 Score=16.94 Aligned_cols=29 Identities=17% Similarity=0.212 Sum_probs=17.0
Q ss_pred CceEEEEeeCCCCCCChhhhHHHHHHHHHHhh
Q psy3109 23 KRTIHVSFVPGPVTQNETCGLAAANSVLYLLG 54 (58)
Q Consensus 23 ~r~I~l~f~~DEE~Gg~~~g~~~~~s~~~~~~ 54 (58)
.+.|.+++-+|++ |. .....+...+...|
T Consensus 46 ~~~Vii~~D~D~~--G~-~~a~~i~~~l~~~g 74 (81)
T PF13662_consen 46 VKEVIIAFDNDKA--GE-KAAQKIAKKLLPLG 74 (81)
T ss_dssp -SEEEEEEESSHH--HH-HHHHHHHHHHG---
T ss_pred CceEEEEeCcCHH--HH-HHHHHHHHHHHhhc
Confidence 4789999999997 65 34444444443333
No 144
>PF10587 EF-1_beta_acid: Eukaryotic elongation factor 1 beta central acidic region; InterPro: IPR018940 Translation elongation factors are responsible for two main processes during protein synthesis on the ribosome [, , ]. EF1A (or EF-Tu) is responsible for the selection and binding of the cognate aminoacyl-tRNA to the A-site (acceptor site) of the ribosome. EF2 (or EF-G) is responsible for the translocation of the peptidyl-tRNA from the A-site to the P-site (peptidyl-tRNA site) of the ribosome, thereby freeing the A-site for the next aminoacyl-tRNA to bind. Elongation factors are responsible for achieving accuracy of translation and both EF1A and EF2 are remarkably conserved throughout evolution. Elongation factor EF1B (also known as EF-Ts or EF-1beta/gamma/delta) is a nucleotide exchange factor that is required to regenerate EF1A from its inactive form (EF1A-GDP) to its active form (EF1A-GTP). EF1A is then ready to interact with a new aminoacyl-tRNA to begin the cycle again. EF1B is more complex in eukaryotes than in bacteria, and can consist of three subunits: EF1B-alpha (or EF-1beta), EF1B-gamma (or EF-1gamma) and EF1B-beta (or EF-1delta) []. This region is found in the centre of the beta subunits of Elongation factor-1. More information about these proteins can be found at Protein of the Month: Elongation Factors [].
Probab=20.97 E-value=32 Score=16.25 Aligned_cols=7 Identities=14% Similarity=-0.021 Sum_probs=4.8
Q ss_pred EeeCCCC
Q psy3109 29 SFVPGPV 35 (58)
Q Consensus 29 ~f~~DEE 35 (58)
+|++|+|
T Consensus 1 LFGSddE 7 (28)
T PF10587_consen 1 LFGSDDE 7 (28)
T ss_pred CCCCccc
Confidence 4778866
No 145
>PF14247 DUF4344: Domain of unknown function (DUF4344)
Probab=20.90 E-value=2e+02 Score=19.22 Aligned_cols=31 Identities=16% Similarity=0.219 Sum_probs=23.6
Q ss_pred HHHHHHHHHHcCCCCCceEEEEeeCCCCCCC
Q psy3109 8 YLETIRRYIAEGKKFKRTIHVSFVPGPVTQN 38 (58)
Q Consensus 8 ~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg 38 (58)
.++.+..+....+..+++|.+.+..+-|.|.
T Consensus 21 vlE~~~~~in~~f~LP~~l~i~~~~CGe~nA 51 (220)
T PF14247_consen 21 VLEDVADLINDYFPLPRDLTIRFAECGEDNA 51 (220)
T ss_pred hHHHHHHHHhhcCCCCCCeEEEEeecCCCCC
Confidence 4677777777889999999999966555443
No 146
>TIGR00756 PPR pentatricopeptide repeat domain (PPR motif). This family has a similar consensus to the TPR domain (tetratricopeptide), pfam pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Probab=20.90 E-value=88 Score=12.98 Aligned_cols=17 Identities=18% Similarity=0.302 Sum_probs=11.9
Q ss_pred HHHHHHHHHHHcCCCCC
Q psy3109 7 QYLETIRRYIAEGKKFK 23 (58)
Q Consensus 7 ~~leAl~~L~~~g~~p~ 23 (58)
.+.+.++.+.+.|.+|.
T Consensus 18 ~a~~~~~~M~~~g~~p~ 34 (35)
T TIGR00756 18 EALELFKEMLERGIEPD 34 (35)
T ss_pred HHHHHHHHHHHcCCCCC
Confidence 35666777788887774
No 147
>cd01540 PBP1_arabinose_binding Periplasmic L-arabinose-binding protein (ABP), a member of a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily. Periplasmic L-arabinose-binding protein (ABP), a member of a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily. ABP is only involved in transport contrary to other related sugar-binding proteins such as the glucose/galactose-binding protein (GGBP) and the ribose-binding protein (RBP), both of which are involved in chemotaxis as well as transport. The periplasmic ABP consists of two alpha/beta globular domains connected by a three-stranded hinge, a Venus flytrap-like domain, which undergoes a transition from an open to a closed conformational state upon ligand binding. Moreover, ABP is homologous to the ligand-binding domain of eukaryotic receptors such as metabotropic glutamate receptor (mGluR) and DNA-binding transcriptional repressors such a
Probab=20.77 E-value=2.1e+02 Score=18.07 Aligned_cols=23 Identities=13% Similarity=-0.028 Sum_probs=16.7
Q ss_pred HHHHHHHHHHHcCCCCCceEEEEe
Q psy3109 7 QYLETIRRYIAEGKKFKRTIHVSF 30 (58)
Q Consensus 7 ~~leAl~~L~~~g~~p~r~I~l~f 30 (58)
+++.+++.+.+.|..| ++|.++-
T Consensus 205 ~a~g~~~al~~~g~~~-~di~vig 227 (289)
T cd01540 205 TVLGAVRATEQSGIAA-ADVIGVG 227 (289)
T ss_pred HHHHHHHHHHHcCCCC-cceEEEe
Confidence 4667888888999886 5666543
No 148
>PRK06973 nicotinic acid mononucleotide adenylyltransferase; Provisional
Probab=20.50 E-value=1.3e+02 Score=20.24 Aligned_cols=24 Identities=8% Similarity=0.144 Sum_probs=18.2
Q ss_pred HHHHHHHHHcCCCCCceEEEEeeCC
Q psy3109 9 LETIRRYIAEGKKFKRTIHVSFVPG 33 (58)
Q Consensus 9 leAl~~L~~~g~~p~r~I~l~f~~D 33 (58)
+..++.|++. +.|...++++.++|
T Consensus 108 idTL~~l~~~-~~p~~~~~fiiG~D 131 (243)
T PRK06973 108 VDTLARWRER-IGPDASLALLIGAD 131 (243)
T ss_pred HHHHHHHHHH-cCCCCCEEEEEchh
Confidence 4677888776 34667899999888
No 149
>PRK00016 metal-binding heat shock protein; Provisional
Probab=20.46 E-value=1e+02 Score=19.35 Aligned_cols=20 Identities=10% Similarity=0.127 Sum_probs=14.6
Q ss_pred HcCCCCCceEEEEeeCCCCC
Q psy3109 17 AEGKKFKRTIHVSFVPGPVT 36 (58)
Q Consensus 17 ~~g~~p~r~I~l~f~~DEE~ 36 (58)
..+..+...|.+.|+.|+|+
T Consensus 31 ~~~~~~~~ev~i~~v~~~~i 50 (159)
T PRK00016 31 KLGVQPEAELSIRLVDNEEI 50 (159)
T ss_pred HcCCCCCeeEEEEEECcHHH
Confidence 34555557889999988875
No 150
>PF14433 SUKH-3: SUKH-3 immunity protein
Probab=20.36 E-value=89 Score=18.69 Aligned_cols=21 Identities=29% Similarity=0.194 Sum_probs=15.7
Q ss_pred HHHHHHHcCCCCCceEEEEee
Q psy3109 11 TIRRYIAEGKKFKRTIHVSFV 31 (58)
Q Consensus 11 Al~~L~~~g~~p~r~I~l~f~ 31 (58)
+.+.|.+.|+.|.|.|.+...
T Consensus 4 v~~~L~~aGW~~~R~idi~~~ 24 (142)
T PF14433_consen 4 VIELLRAAGWYEGRKIDISLW 24 (142)
T ss_pred HHHHHHHcCCCCCcccCHHHH
Confidence 567788999999888765443
No 151
>PF02705 K_trans: K+ potassium transporter; InterPro: IPR003855 This is a family of K+ potassium transporters that are conserved across phyla, having both bacterial (KUP) [], yeast (HAK) [], and plant (AtKT) [] sequences as members.; GO: 0015079 potassium ion transmembrane transporter activity, 0071805 potassium ion transmembrane transport, 0016020 membrane
Probab=20.03 E-value=2.5e+02 Score=21.41 Aligned_cols=34 Identities=12% Similarity=0.021 Sum_probs=27.4
Q ss_pred HHHHHHHHHHHHHHHcCCCCCceEEEEeeCCCCCCC
Q psy3109 3 CVGIQYLETIRRYIAEGKKFKRTIHVSFVPGPVTQN 38 (58)
Q Consensus 3 ~~~~~~leAl~~L~~~g~~p~r~I~l~f~~DEE~Gg 38 (58)
+.+.+...-+++..+.|+-|+ +.+..++++|.|-
T Consensus 292 A~Isg~FSl~~Qai~Lg~~Pr--~~I~hTS~~~~GQ 325 (534)
T PF02705_consen 292 AVISGAFSLTRQAIQLGYFPR--LKIVHTSEKEEGQ 325 (534)
T ss_pred HHHHHHHHHHHHHHHhCCCCc--eEEEECCcccCCc
Confidence 456677778888999999886 7888999988654
Done!